1esl-selectn(1) Easel Manual esl-selectn(1)
2
3
4
6 esl-selectn - select random subset of lines from file
7
8
10 esl-selectn [options] nlines filename
11
12
13
15 esl-selectn selects nlines lines at random from file filename and out‐
16 puts them on stdout.
17
18
19 If filename is - (a single dash), input is read from stdin.
20
21
22
23 Uses an efficient reservoir sampling algorithm that only requires only
24 a single pass through filename, and memory storage proportional to
25 nlines (and importantly, not to the size of the file filename itself).
26 esl-selectn can therefore be used to create large scale statistical
27 sampling experiments, especially in combination with other Easel
28 miniapplications.
29
30
31
33 -h Print brief help; includes version number and summary of all op‐
34 tions, including expert options.
35
36
37
38 --seed <d>
39 Set the random number seed to <d>, an integer >= 0. The default
40 is 0, which means to use a randomly selected seed. A seed > 0
41 results in reproducible identical samples from different runs of
42 the same command.
43
44
45
47 http://bioeasel.org/
48
49
51 Copyright (C) 2020 Howard Hughes Medical Institute.
52 Freely distributed under the BSD open source license.
53
54
56 http://eddylab.org
57
58
59
60Easel 0.48 Nov 2020 esl-selectn(1)