diff options
Diffstat (limited to 'README')
-rw-r--r-- | README | 474 |
1 files changed, 474 insertions, 0 deletions
@@ -0,0 +1,474 @@ +Haveged, an entropy source + +INTRODUCTION + +Complete documentation on haveged can be found at http://www.issihosts.com/haveged/ + +Linux provides device interfaces (/dev/random and /dev/urandom) to a pool of +random numbers collected from system interrupt service routines. On some +systems, especially on those systems with high needs or limited user +interaction, the standard collection mechanism cannot meet demand. In those +cases, an adequate supply of random numbers can be maintained by feeding +additional entropy into /dev/random pool via a file system interface. The +haveged daemon was created to fulfill this function using random data generated +by the HAVEGE algorithm. + +The HAVEGE algorithm is based upon the indirect effects of unrelated hardware +events on the instruction timing of a calculation that is sensitive to processor +features such as branch predictors and instruction/data access mechanisms. +Samples from a high-resolution timer are input into the algorithm to produce a +stream of random data in a collection buffer. The algorithm requires no special +privilege other than access to a high resolution timer, as provided by hardware +instruction or a system call. + +The HAVEGE mechanism is implemented in C using in-line assembly only where +direct hardware access is needed. On modern compilers, compiler intrinsics are +used to replace much if not all in-line assembly. The haveged implementation of +HAVEGE adds two crucial features: automated tuning of the algorithm to an +environment and a run time facility to verify the data generated in the +collection buffer. + +The haveged collection loop is tuned at run-time to match the size of the hosts +L1 data and instruction caches. The size determination is made on the basis of +the best fit to the following (low to high): + +1. as a compiled default +2. as determined by cpuid, if available +3. as determined by the /sys file system, if available +4. as specified by initialization parameters. + +Run-time verification of collection buffer contents is based upon the +methodology from the proposed AIS-31 standard from BSI, the German Federal +Office for Information Security (Bundesamt für Sicherheit in der +Informationstechnik). In the suggested configuration, haveged passes most of the +requirements for a NTG.1 class device described in version two of the AIS-31 +specification. + +The haveged implementation also provides an experimental feature to multiplex +havege collections over multiple cpu cores. Interprocess coordination in this +configuration sacrifices a significant fraction of haveged throughput in +exchange for a cpu load spread over a number of processors. + +All of the above features are contained within the haveged-devel sub package of +haveged. The haveged package can be built with libtool to expose the devel sub +package or without libtool if the devel library is not needed. In either case, +the executable built contains a file system interface to haveged-devel features +and an optional facility to feed havege output into the random device. The build +system also provides "check" targets to test the output of the executable's +random number generator through the file system interface. + +TESTING haveged + +Testing via build check targets has been part of the haveged distribution since +the project moved to automake. Original tests were limited to a quick evaluation +implemented as an adaptation of the open source ent tool from Fourmilab and a +more thorough evaluation based upon the NIST SP800 test suite. Run time testing +was added to haveged version 1.5 as part of an effort to loosely follow the RNG +testing AIS-31 framework of the German Common Criteria organization, BSI. + +The significant features of the AIS-31 framework are: + +Run-time tests are broken into 2 groups, Procedure A containing 5 tests, and +Procedure B containing 3 tests. Procedure A consists of performing a 48-bit +disjointedness test on 64K sequences, followed by 257 repetitions of the four +FIPS-140-1 tests and an auto-correlation test on successive 2000 bit sequences. +Procedure B performs distribution tests for 10,000 occurrences of 1, 2, 3, 4 bit +runs in successive samples, followed by a entropy estimate based upon on another +256000+2560 bit sample. A sample must pass all individual tests to pass the +procedure. An ideal RNG is expected to pass Procedure A with a probability of +0.9987 and pass Procedure B with with a probability of 0.9998. One retry will be +attempted to recover from the the failure of a single test in either procedure. +The probability an ideal generator would fail the retry is nill. + +Special features of the haveged implementation: + +1) A failure of either procedure is a fatal error. In the case of haveged, the + instance will terminate. +2) The tests operate directly on the collection buffer after a fill. Performance + costs are acceptable except for the auto-correlation test, test5. An option + to skip test5 on some repetitions mitigates this problem. +3) Tests can take place at start up (a "tot" test) or continuously as specified + by a haveged parameter. In the continuous case, there is no implied alignment + between the collection buffer and the testing context. Collection buffer + contents will be released for consumption provided the buffer does not contain + a failed individual test. +4) The size of the input required to complete procedure B depends on the content. + This means there is no fixed alignment of the test input in the collection + buffer. +5) Procedure retries are logged. Extended information is available with -v3. Retries + are expected (see failure rates above) but normally only seen with output + ranges north of a few GB. + +More detailed information on the adaptation of the BIS framework can be found +at http://www.issihosts.com/haveged/ais31.html + +BUILDING haveged + +This package originated on "Enterprise Linux 5" systems (RHEL 5 / CentOS 5 / SL +5), but every effort has been made to retain and broaden the hardware support of +the original HAVEGE implementation. The package uses the automake build system. +By default, the build uses a libtool build to expose haveged-devel. The +contrib/build directory contains a build.sh script to toggle the libtool +requirement and recover from some autotools failures. Change directory to the +build directory and type ./build.sh for options. + +The configure process uses hardware detection via config.sub or the configure +"-host" command line argument. The configure "host" variable is used to select +in-line assembly or compiler intrinsics appropriate to the build target. + +Currently supported hosts are: + +1. x86 (x86_64 and i686) +2. aarch64 +3. armv7hl +4. ppc64le +5. ia64 +6. powerpc +7. s390 + + +The generic host type is provided for those systems without user level access to +a high-resolution system timer. In this case, the --enable-clock_gettime option +is set to 'yes'. + +The following build options are available to "./configure": + +1. --enable-clock_gettime (default 'no' for recognized hosts) +2. --enable-daemon (default 'yes' if Linux) +3. --enable-diagnostic (default 'no') +4. --enable-init (type, default 'no') +5. --enable-initdir (default '' unless enable--init="service.*") +6. --enable-nistest (default 'no' but recommended) +7. --enable-olt (default 'yes') +8. --enable-threads (experimental) +9. --enable-tune (default 'yes') + +Detailed option information is available by typing "./configure --help". For +options xxx that take "yes/no" arguments, --disable-xxx may be used as the +inverse of --enable-xxx. + +If --enable-clock_gettime() is 'yes', the clock_gettime(CLOCK_MONOTONIC) system +call will be used as a timer source. This option defaults to 'yes' for generic +host builds and 'no' otherwise. This option may proved useful if access to time +hardware is privileged. Due to variability of clock_gettime() implementations, +the adequacy of the clock_gettime() resolution cannot be known until run time. +It is strongly advised to use run-time testing if this option is used. + +If --enable-daemon is 'yes', ioctl access required to the random device and +read-write access to the /proc virtual file system is required. The daemon may +be run in the foreground or fork into the background based upon a command line +argument. The daemon interface targets the 2.6 and later kernel and may not work +on 2.4 kernels due to difference in the random interface between those two +kernel versions. The change in the proc file system from pool size expressed in +bytes to pool size expressed in bits has been taken into account, other changes +may be required. This option is 'no' when diagnostic modes are enabled. If +the option is no the executable is installed in the user bin directory instead +of the sbin directory. + +If --enable-diagnostic is 'yes', the capture and inject diagnostic interfaces +are enabled. The capture or inject diagnostic may be enabled singly by setting +the option to 'capture' or 'inject'. A setting for any value other than 'no' +for this option forces --enable-daemon=no. See DIAGNOSTICS below for details. + +The --enable-init option is active only when --enable-daemon is 'yes'. This +value can specify a template to be used in the installation of an init method +by the build's install target. The default value, 'no', disables the feature. +Other values can be used to install a traditional systemv init script or +systemd unit definition. See INSTALLATION for details. + +The --enable-initdir is active only when --enable-init='service.*', i.e. a +systemd install. See INSTALLATION for details. + +The --enable-nistest option enables more thorough testing for the check target. +See CHECKING for details. + +The --enable-olt option is provided to suppress the entire online test facility. +This option is provided for systems with a very limited resource budget and the +ability to thoroughly test the RNG output by other means. When enabled, the online +test system tests the output of the haveged random number generator using AIS-31 +test procedures A and B. Either or both tests may be run as a total failure check +(a "tot" test) at initialization and/or continuously during all subsequent haveged +operation - See the man page and the description at +http://www.issihosts.com/haveged/ais31.html for further information. + +The --enable-threads option is an experimental prototype for running multiple +collection threads in a single haveged instance. The goal is to create a +multi-core haveged that would spread collection overhead more evenly over the +available cpu resources. + +The --enable-tune option allows the on-line tuning facility to be suppressed. +This is intended for systems with special needs and or a limited resource budget. +Setting the option to 'yes' enables both the cpuid and virtual file system methods, +a value of 'no' suppresses both methods. Individual tuning methods can be selected +by setting the option to either 'cpuid' or 'vfs'. Note that the 'cpuid' method +is always conditional on host type and will not be present if the hardware +architecture does not support the instruction. + +The haveged build does not modify CFLAGS and DFLAGS, so for example: + +CFLAGS="-fpic -DGENERIC_DCACHE=32" LDFLAGS="-z now" ./configure --disable-tune + +would configure the build for a manually-tuned, hardened daemon with a default +data cache size of 32kb. + + +CHECKING haveged + +The build check target provides two test procedures for the build. + +1. A "quick" check based upon and adaptation of the public domain ENT program. + The "entest" program uses the ENT sources to subject a sample to the following: + + a) The Chi-Square result must fall within acceptable bounds (>1% and <99 %) + b) The entropy/character must exceed a minimum (7.5) + c) The arithmetic mean must exceed a minimum (127.0) + d) The monte-carlo approximation of PI must lie within error bounds (.5%) + e) The Sequential Correlation Coefficient must be below a minimum (.8) + + The program provides a pass-fail indication and an optional display of the + results to stdout. The entest checks are quite weak and intended only as a + quick sanity check. + +2. An adaptation of the NIST Statistical Test Suite as adapted by Oliver + Rochecouste of irisa.fr as part of the original havege project. More that 400 + tests are performed in a typical run. The program provides as pass-fail + indication with detailed results reported in the nist.out file in the + nist directory. You will need sit down with SP800-*.pdf available from the + NIST to review the detailed results. AIS31 provides recommendations for the + NIST test suite as 'additional tests'. See testing documentation at + http://www.issihosts.com/haveged/ais31.html for further information. + +The "quick" test is always part of the check target. The NIST suite is run only +when --enable-nistest is 'yes'. + +Both checks function the same way, haveged is run to collect a sample file in +the test directory which is then analyzed by the test program. A pass-fail return +is given in both cases, additional information is written to stdout. The input +samples and the nist.out report are deleted by the clean make target. + +Both test mechanisms are statistical and even a fully functional random number +generator will experience occasional failures. It is not uncommon to see one or +two failures in the NIST suite and the entest will occasionally fail with a small +sample size (usually the Chi-Square test barks). Early haveged releases used a +entest sample size of 1MB, this has been increased to 16MB because failures with +that sample size were all too common. A 16MB sample will also deplete and refill +the haveged collection area to exercise all buffer logic. + +Users are encouraged to run their own external tests. The --number==0 option is +a convenient means to pipe haveged output into external suites such as Dieharder, +the TESTU01 batteries, or PractRand. + + +RUNNING haveged + +The following invocation arguments are always available: + + --buffer , -b [] Buffer size [KW] - default : 128 + --data , -d [] Data cache size [KB], with fallback to 16 + --inst , -i [] Instruction cache size [KB], with fallback to 16 + --file , -f [] Sample output file - default: 'sample', '-' for stdout + --number , -n [] Output size in [k|m|g|t]. 0 = unlimited to stdout + --verbose , -v [] Verbose mask 0=none,1=summary,2=retries,4=timing,8=loop,16=code,32=test + --help , -h This help + +The "-b", "-d", "-i" options are needed only in special cases. Generator output +should be validated after changes to these values. + +If haveged is built with online testing enabled, the following is present + + --onlinetest , -o [] [t<x>][c<x>] x=[a[n][w]][b[w]] 't'ot, 'c'ontinuous, default: ta8b" + +The default configuration executes the "tot" test using AIS procedure B. At the completion +of the tot test, the buffer is reloaded and any continuous tests will be run before +data becomes available. + +If haveged is built with threads support, the following is present + + --threads , -t [] Number of threads + +If daemon interface is enabled, the following options are available: + + --Foreground , -F Run daemon in foreground, do not fork and detach, + --pid , -p [] The location of the daemon pid file, default: /var/run/haveged.pid + --run , -r [] 0=daemon,1=config info,>1=Write <r>KB sample file + --write , -w [] Set write_wakeup_threshold [bits] + +If haveged is running detached, diagnostic output is written to syslog, otherwise +output is written to stderr. + +If the daemon interface is enabled, non-zero "-r" options are used to test the +haveged random number generator; the random number generator will be configured, +the initial data collection pass will be executed, configuration details will be +written to stdout, and a "-r" KB sample of output will be written to the sample +output file for all "-r" > 1. The "-n" option provides a more friendly version +of r > 1. + +If the daemon interface and --run==1 is specified, or if --run is not available +and --number is not specified a summary of build, tuning, and execution +is displayed and haveged exits without generating data: + +<prog>: ver: <ver>; arch: <arch>; vend: <vend>, build: (<opts>); collect: <collect> +<prog>: cpu: <cpu> (<tune>);data: <data> (<tune>); inst: <inst> (<tune>); idx: <idx>; sz: <sz> +<prog>: tot tests(<spec>): <score>; continuous tests(<spec>):<score> last entropy estimate <ent> +<prog>: fills: <fills>, generated: <total> + +where + <prog> build: program name - normally haveged + <ver> build: package version + <arch> build: architecture: ia64, generic, ppc, s390, sparc, sparclite, x86 + <vend> build: vendor string of host from cpuid if available else 'generic' + <opts> build: version of gcc used and build option flags - see below. + <collect> tuning: collection buffer size + <cpu> tuning: (sources list for cpu info, see below) + <data> tuning: size, (sources list for L1 data cache) + <inst> tuning: size, (sources list for L1 instruction cache) + <idx> tuning: collector loops used/collector loops available + <sz> tuning: collector size used/collector size available. + <tune> tuning: tuning sources - see below + <spec> exec: tests to be executed in --onlinetest format + <ent> exec: last entropy estimate from procedure B. + <score> exec: pass/fail counts for AIS test procedures + <fills> exec: number of times buffer was filled + <total> exec: number of bytes output + +build option flags represent the ./configure options as: + C=clock_gettime, D=diagnostic I=tune with cpuid, M=multi-core, T=online tests, V=tune with vfs + +tuning sources are: + D=default value, P=instance parameter, C=cpuid present, + H=hyperthreading, A=AMD cpuid, A5=AMD fn5, A6=AMD fn6, A8=AMD fn8 + L2=Intel has leaf2, L4=Intel has leaf4, B=Intel leaf b, + 4=intel leaf4, V=virtual file system available + VS=/sys/devices/system/cpu/cpu%d/cache/index<n>/level, + VO=/sys/devices/system/cpu/online, VI=/proc/cpuinfo + VC=/sys/devices/system/cpu + +Sources displayed in parenthesis are white space separated lists of the above +tokens. The "collector * used/collector *available" values indicate the fit of +the haveged collection loop to the L1 instruction cache. + +Items marked 'exec:' above have meaning only when data is generated. The display +of test results is customized to match the test options specified. The last few +lines of the build/tuning summary (those items marked 'exec:' above) also +appear by themselves as the 'exec summary' in other circumstances. + +In other circumstances, the completion of initialization is announced by a banner +written to the log: + +"<prog>: starting up" +"Writing <n> byte output to <file>" + +The first line above is used when running as a daemon, the second otherwise. If +one or more online tests are running, test failures are logged as: + +<prog>: AIS-31 <stage> procedure <proc>: <action> <bytes> bytes fill <fill> + +where + <stage> is either 'tot' or 'continuous' + <proc> is either 'A' or 'B' + <action> is either 'retry' or 'fail' + <bytes> is number of bytes processed in procedure before failure + <fill> is the number of times the buffer was filled + +The exec summary is logged upon error or signal terminations. Other log output is +controlled by --verbose: + +A --verbose bit mask to obtain additional diagnostic information: + +0x01 Show exec summary on termination, retry summary +0x02 Show online test retry details +0x04 Show timing for collections +0x08 Show collection loop characteristics +0x10 Show code offsets +0x20 Show all online test completion details + +The "--write" option will set proc/sys/kernel/random/write_wakeup_threshold to +the given value. This is useful because this threshold is very small on some +systems. A minimum of 1024 is recommended. + +The file system interface supports file creation of up data setups up to 16tb or +can be part of a piped command set. See the man(8) page for examples. + +DIAGNOSTIC haveged build + +The diagnostic build is a special version of the non-daemon configuration with a +specialized --run option. The --run levels 0 and 1 correspond duplicate the +equivalent application functions. One or more of the addition commands may be +included depending on the --enable-diagnostic value chosen. + +capture (--run 2) + Run the RNG but collect the timer inputs in a separate buffer. The normal + RNG output is discarded and replaced by the timer inputs. Requires additional + buffering, performance is poor because 8 times as many fills are required to + generate same amount of output. + +inject tics (--run 4) + Replace timer source with file input. Requires additional buffering, source + must account for generator warmup. + +inject data (--run 8) + Bypass the normal RNG completely and read input directly into the output + buffer. Generator warmup is skipped. This option is useful for validating + the online test suite. + +Knowledge of haveged internals is needed to use the special features effectively +and interpret the results. The diagnostic build identifies itself as +'havege_diagnostic' in -help display and other outputs. + + +INSTALLATION + +If the daemon interface is not enabled, the install places the executable in +automake's bin_PROGRAMS directory and provides a man(8) page. A man(3) page +is provided for the libtool build. If the daemon interface is enabled, the +executable is installed in automake's sbin_PROGRAMS directory. + +If the daemon interface is enabled, the --enable-init setting provides a simple +template system to setup the init method. If --enable-init is set to none +no action is taken. Otherwise, the template must reside in the init.d build +directory and is selected by the setting. Template names "service.*" indicate +that a systemd style init, while template names "sysv.*" are used for sysv +style init scripts. + +Sample sysv style templates are provided for linux standard base, sysv.lsb, +and redhat systems, sysv.redhat , such as centos which have not moved to +systemd style inits. + +For systemd style installs, --enable-initdir specifies the systemd unit +directory. If the setting is not specified (or is ''), the default value is +obtained from hosts pkg-config query for systemdsystemunitdir. Sample systemd +templates are provided for forking, service.forking, and non-forking, +service.fedora, configurations. The non-forking configuration is recommended to +avoid the overhead of PID file and minimize start-up cost. + +Examples: + +./configure --enable-init=service.redhat +./configure --enable-init=sysv.lsb + +Custom init scripts can be added as necessary by adding templates to the +init.d directory. + +A sample file, haveged.spec, is provided in the build root as a guide for +those who want to build a rpm. As with init scripts, the sample may need +customization before use. Other SPEC file examples can be found in the +contrib directory (see EXTRAS for details). + + +EXTRAS + +The contrib directory contains bits and pieces that are not integrated into the +distribution. Currently this directory contains build related utilities in +the build directory and an unorganized collection of some of the tools used +to analyze haveged in the diags directory. + +The script contrib/build/build.sh organizes the build utilities. The script +provides methods to toggle the use of libtool in the build in particularly +problematic environments, a bootstrap to recover from automake error states +and the option to build and run a devel sample program after the devel +sub package has been built. + +Several sample package spec files are provided as contrib/build/*.spec. The +havege_sample.c source demonstrates usage of the havege.h API. + + |