1hmmemit(1) HMMER Manual hmmemit(1)
2
3
4
6 hmmemit - sample sequences from a profile
7
8
10 hmmemit [options] hmmfile
11
12
13
15 The hmmemit program samples (emits) sequences from the profile HMM(s)
16 in hmmfile, and writes them to output. Sampling sequences may be use‐
17 ful for a variety of purposes, including creating synthetic true posi‐
18 tives for benchmarks or tests.
19
20
21 The default is to sample one unaligned sequence from the core probabil‐
22 ity model, which means that each sequence consists of one full-length
23 domain. Alternatively, with the -c option, you can emit a simple ma‐
24 jority-rule consensus sequence; or with the -a option, you can emit an
25 alignment (in which case, you probably also want to set -N to something
26 other than its default of 1 sequence per model).
27
28
29 As another option, with the -p option you can sample a sequence from a
30 fully configured HMMER search profile. This means sampling a `homolo‐
31 gous sequence' by HMMER's definition, including nonhomologous flanking
32 sequences, local alignments, and multiple domains per sequence, depend‐
33 ing on the length model and alignment mode chosen for the profile.
34
35
36 The hmmfile may contain a library of HMMs, in which case each HMM will
37 be used in turn.
38
39
40 hmmfile may be '-' (dash), which means reading this input from stdin
41 rather than a file.
42
43
44
46 -h Help; print a brief reminder of command line usage and all
47 available options.
48
49
50
51 -o <f> Direct the output sequences to file <f>, rather than to stdout.
52
53
54 -N <n> Sample <n> sequences per model, rather than just one.
55
56
57
58
60 The default is to sample N sequences from the core model. Alterna‐
61 tively, you may choose one (and only one) of the following alterna‐
62 tives.
63
64
65
66 -a Emit an alignment for each HMM in the hmmfile rather than sam‐
67 pling unaligned sequences one at a time.
68
69
70 -c Emit a plurality-rule consensus sequence, instead of sampling a
71 sequence from the profile HMM's probability distribution. The
72 consensus sequence is formed by selecting the maximum probabil‐
73 ity residue at each match state.
74
75
76 -C Emit a fancier plurality-rule consensus sequence than the -c op‐
77 tion. If the maximum probability residue has p < minl show it as
78 a lower case 'any' residue (n or x); if p >= minl and < minu
79 show it as a lower case residue; and if p >= minu show it as an
80 upper case residue. The default settings of minu and minl are
81 both 0.0, which means -C gives the same output as -c unless you
82 also set minu and minl to what you want.
83
84
85 -p Sample unaligned sequences from the implicit search profile, not
86 from the core model. The core model consists only of the homol‐
87 ogous states (between the begin and end states of a HMMER Plan7
88 model). The profile includes the nonhomologous N, C, and J
89 states, local/glocal and uni/multihit algorithm configuration,
90 and the target length model. Therefore sequences sampled from a
91 profile may include nonhomologous as well as homologous se‐
92 quences, and may contain more than one homologous sequence seg‐
93 ment. By default, the profile is in multihit local mode, and the
94 target sequence length is configured for L=400.
95
96
97
98
99
101 These options require that you have set the -p option.
102
103
104 -L <n> Configure the profile's target sequence length model to generate
105 a mean length of approximately <n> rather than the default of
106 400.
107
108
109 --local
110 Configure the profile for multihit local alignment.
111
112
113 --unilocal
114 Configure the profile for unihit local alignment (Smith/Water‐
115 man).
116
117
118 --glocal
119 Configure the profile for multihit glocal alignment.
120
121
122 --uniglocal
123 Configure the profile for unihit glocal alignment.
124
125
126
128 These options require that you have set the -C option.
129
130
131 --minl <x>
132 Sets the minl threshold for showing weakly conserved residues as
133 lower case. (0 <= x <= 1)
134
135
136 --minu <x>
137 Sets the minu threshold for showing strongly conserved residues
138 as upper case. (0 <= x <= 1)
139
140
141
142
144 --seed <n>
145 Seed the random number generator with <n>, an integer >= 0. If
146 <n> is nonzero, any stochastic simulations will be reproducible;
147 the same command will give the same results. If <n> is 0, the
148 random number generator is seeded arbitrarily, and stochastic
149 simulations will vary from run to run of the same command. The
150 default is 0: use an arbitrary seed, so different hmmemit runs
151 will generate different samples.
152
153
154
155
156
157
159 See hmmer(1) for a master man page with a list of all the individual
160 man pages for programs in the HMMER package.
161
162
163 For complete documentation, see the user guide that came with your HM‐
164 MER distribution (Userguide.pdf); or see the HMMER web page (http://hm‐
165 mer.org/).
166
167
168
169
171 Copyright (C) 2020 Howard Hughes Medical Institute.
172 Freely distributed under the BSD open source license.
173
174 For additional information on copyright and licensing, see the file
175 called COPYRIGHT in your HMMER source distribution, or see the HMMER
176 web page (http://hmmer.org/).
177
178
179
181 http://eddylab.org
182
183
184
185
186
187
188HMMER 3.3.2 Nov 2020 hmmemit(1)