esl-seqstat(1)                    Easel Manual                    esl-seqstat(1)

NAME
       esl-seqstat - summarize contents of a sequence file

SYNOPSIS
       esl-seqstat [options] seqfile

DESCRIPTION
       esl-seqstat summarizes the contents of the seqfile. It prints the
       format, alphabet type, number of sequences, total number of residues,
       and the mean, smallest, and largest sequence length.

       If seqfile is - (a single dash), sequence input is read from stdin.
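
As an illustration of the stdin form, the sketch below pools several files into one summary by concatenating them and passing - as the seqfile. The function name seqstat_all and the ESL_SEQSTAT variable are hypothetical helpers, not part of Easel, and the sketch assumes that all input files are in the same format.

```shell
# Hypothetical override so the command can be swapped out; defaults to esl-seqstat.
ESL_SEQSTAT=${ESL_SEQSTAT:-esl-seqstat}

# Concatenate the named files and summarize them as a single input on stdin.
# Assumes all files share one format (for example, all FASTA).
seqstat_all() {
    cat "$@" | "$ESL_SEQSTAT" -
}
```

With hypothetical file names, a call would look like: seqstat_all a.fa b.fa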

OPTIONS
       -h     Print brief help; includes version number and summary of all
              options, including expert options.

       -a     Additionally print one line per sequence showing its name,
              length, and description. Each of these lines is prefixed with
              an = character so that they can easily be grepped out of the
              output.

       -c     Additionally print the residue composition of the sequence file.
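
The = prefix on the -a lines can be used roughly as follows. This is a minimal sketch, assuming the fields on each = line are whitespace-separated in the order "= name length description"; that layout is an assumption, so check it against real output. The function name seq_lengths and the file name seqs.fa are hypothetical.

```shell
# Keep only the per-sequence lines (those starting with "=") and print
# the name and length as two tab-separated columns.
# Assumption: fields are whitespace-separated as "= name length description".
seq_lengths() {
    grep '^=' | awk '{ print $2 "\t" $3 }'
}

# Run against a hypothetical file, only if esl-seqstat is installed.
if command -v esl-seqstat >/dev/null 2>&1 && [ -f seqs.fa ]; then
    esl-seqstat -a seqs.fa | seq_lengths
fi
```

Only the name and length are extracted, because a description may itself contain spaces.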

EXPERT OPTIONS
       --informat <s>
              Assert that input seqfile is in format <s>, bypassing format
              autodetection. Common choices for <s> include: fasta, embl,
              genbank. Alignment formats also work; common choices include:
              stockholm, a2m, afa, psiblast, clustal, phylip. For more
              information, and for codes for some less common formats, see
              the main documentation. The string <s> is case-insensitive
              (fasta or FASTA both work).

       --amino
              Assert that the seqfile contains protein sequences.

       --dna  Assert that the seqfile contains DNA sequences.

       --rna  Assert that the seqfile contains RNA sequences.
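
The format and alphabet assertions above can be combined on one command line. The wrapper below is a sketch, not part of Easel: the function name seqstat_as, the ESL_SEQSTAT variable, and the file name aln.sto are hypothetical. It passes exactly one alphabet option and rejects anything other than amino, dna, or rna.

```shell
# Hypothetical override so the command can be swapped out; defaults to esl-seqstat.
ESL_SEQSTAT=${ESL_SEQSTAT:-esl-seqstat}

# Usage: seqstat_as <format> <amino|dna|rna> <seqfile>
# Asserts both the input format and the alphabet, skipping autodetection.
seqstat_as() {
    fmt=$1
    alpha=$2
    file=$3
    case "$alpha" in
        amino|dna|rna) ;;
        *) echo "unknown alphabet: $alpha" >&2; return 1 ;;
    esac
    "$ESL_SEQSTAT" --informat "$fmt" "--$alpha" "$file"
}
```

For example, seqstat_as stockholm amino aln.sto would run esl-seqstat --informat stockholm --amino aln.sto.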

SEE ALSO
       http://bioeasel.org/

COPYRIGHT
       Copyright (C) 2020 Howard Hughes Medical Institute.
       Freely distributed under the BSD open source license.

AUTHOR
       http://eddylab.org

Easel 0.48                          Nov 2020                    esl-seqstat(1)