1esl-seqrange(1) Easel Manual esl-seqrange(1)
2
3
4
6 esl-seqrange - determine a range of sequences for one of many parallel
7 processes
8
9
11 esl-sfetch [options] seqfile procidx nproc
12
13
15 esl-seqrange reads an SSI-indexed seqfile and determines the range of
16 sequence indices in that file that process number procidx out of nproc
17 total processes should operate on during a parallel processing of seq‐
18 file.
19
20
21 The seqfile must be indexed first using esl-sfetch --index seqfile.
22 This creates an SSI index file seqfile.ssi. An SSI file is required in
23 order for esl-seqrange to work.
24
25
26 Sequence index ranges are calculated using a simple rule: the number of
27 sequences for each process should be identical, or as close as possible
28 to identical, across all processes. The lengths of the sequences are
29 not considered (even though they probably should be).
30
31
33 -h Print brief help; includes version number and summary of all op‐
34 tions, including expert options.
35
36
37 --informat <s>
38 Assert that input seqfile is in format <s>, bypassing format au‐
39 todetection. Common choices for <s> include: fasta, embl, gen‐
40 bank. Alignment formats also work; common choices include:
41 stockholm, a2m, afa, psiblast, clustal, phylip. For more infor‐
42 mation, and for codes for some less common formats, see main
43 documentation. The string <s> is case-insensitive (fasta or
44 FASTA both work).
45
46
47
48
50 http://bioeasel.org/
51
52
54 Copyright (C) 2020 Howard Hughes Medical Institute.
55 Freely distributed under the BSD open source license.
56
57
59 http://eddylab.org
60
61
62
63
64Easel 0.48 Nov 2020 esl-seqrange(1)