1hmmemit(1)                       HMMER Manual                       hmmemit(1)
2
3
4

NAME

6       hmmemit - sample sequences from a profile
7
8

SYNOPSIS

10       hmmemit [options] hmmfile
11
12
13

DESCRIPTION

15       The  hmmemit  program samples (emits) sequences from the profile HMM(s)
16       in hmmfile, and writes them to output.  Sampling sequences may be  use‐
17       ful  for a variety of purposes, including creating synthetic true posi‐
18       tives for benchmarks or tests.
19
20
21       The default is to sample one unaligned sequence from the core probabil‐
22       ity  model,  which means that each sequence consists of one full-length
23       domain.  Alternatively, with the -c option, you can emit a  simple  ma‐
24       jority-rule  consensus sequence; or with the -a option, you can emit an
25       alignment (in which case, you probably also want to set -N to something
26       other than its default of 1 sequence per model).
27
28
29       As  another option, with the -p option you can sample a sequence from a
30       fully configured HMMER search profile. This means sampling  a  `homolo‐
31       gous  sequence' by HMMER's definition, including nonhomologous flanking
32       sequences, local alignments, and multiple domains per sequence, depend‐
33       ing on the length model and alignment mode chosen for the profile.
34
35
36       The  hmmfile may contain a library of HMMs, in which case each HMM will
37       be used in turn.
38
39
40       hmmfile may be '-' (dash), which means reading this  input  from  stdin
41       rather than a file.
42
43
44

COMMON OPTIONS

46       -h     Help;  print  a  brief  reminder  of  command line usage and all
47              available options.
48
49
50
51       -o <f> Direct the output sequences to file <f>, rather than to stdout.
52
53
54       -N <n> Sample <n> sequences per model, rather than just one.
55
56
57
58

OPTIONS CONTROLLING WHAT TO EMIT

60       The default is to sample N sequences  from  the  core  model.  Alterna‐
61       tively,  you  may  choose  one (and only one) of the following alterna‐
62       tives.
63
64
65
66       -a     Emit an alignment for each HMM in the hmmfile rather  than  sam‐
67              pling unaligned sequences one at a time.
68
69
70       -c     Emit  a plurality-rule consensus sequence, instead of sampling a
71              sequence from the profile HMM's  probability  distribution.  The
72              consensus  sequence is formed by selecting the maximum probabil‐
73              ity residue at each match state.
74
75
76       -C     Emit a fancier plurality-rule consensus sequence than the -c op‐
77              tion. If the maximum probability residue has p < minl show it as
78              a lower case 'any' residue (n or x); if p >=  minl  and  <  minu
79              show  it as a lower case residue; and if p >= minu show it as an
80              upper case residue.  The default settings of minu and  minl  are
81              both  0.0, which means -C gives the same output as -c unless you
82              also set minu and minl to what you want.
83
84
85       -p     Sample unaligned sequences from the implicit search profile, not
86              from the core model.  The core model consists only of the homol‐
87              ogous states (between the begin and end states of a HMMER  Plan7
88              model).  The  profile  includes  the  nonhomologous  N, C, and J
89              states, local/glocal and uni/multihit  algorithm  configuration,
90              and the target length model.  Therefore sequences sampled from a
91              profile may include nonhomologous  as  well  as  homologous  se‐
92              quences,  and may contain more than one homologous sequence seg‐
93              ment. By default, the profile is in multihit local mode, and the
94              target sequence length is configured for L=400.
95
96
97
98
99

OPTIONS CONTROLLING EMISSION FROM PROFILES

101       These options require that you have set the -p option.
102
103
104       -L <n> Configure the profile's target sequence length model to generate
105              a mean length of approximately <n> rather than  the  default  of
106              400.
107
108
109       --local
110              Configure the profile for multihit local alignment.
111
112
113       --unilocal
114              Configure  the  profile for unihit local alignment (Smith/Water‐
115              man).
116
117
118       --glocal
119              Configure the profile for multihit glocal alignment.
120
121
122       --uniglocal
123              Configure the profile for unihit glocal alignment.
124
125
126

OPTIONS CONTROLLING FANCY CONSENSUS EMISSION

128       These options require that you have set the -C option.
129
130
131       --minl <x>
132              Sets the minl threshold for showing weakly conserved residues as
133              lower case.  (0 <= x <= 1)
134
135
136       --minu <x>
137              Sets  the minu threshold for showing strongly conserved residues
138              as upper case.  (0 <= x <= 1)
139
140
141
142

OTHER OPTIONS

144       --seed <n>
145              Seed the random number generator with <n>, an integer >= 0.   If
146              <n> is nonzero, any stochastic simulations will be reproducible;
147              the same command will give the same results.  If <n> is  0,  the
148              random  number  generator  is seeded arbitrarily, and stochastic
149              simulations will vary from run to run of the same command.   The
150              default  is  0: use an arbitrary seed, so different hmmemit runs
151              will generate different samples.
152
153
154
155
156
157

SEE ALSO

159       See hmmer(1) for a master man page with a list of  all  the  individual
160       man pages for programs in the HMMER package.
161
162
163       For  complete documentation, see the user guide that came with your HM‐
164       MER distribution (Userguide.pdf); or see the HMMER web page (http://hm
165       mer.org/).
166
167
168
169
171       Copyright (C) 2020 Howard Hughes Medical Institute.
172       Freely distributed under the BSD open source license.
173
174       For  additional  information  on  copyright and licensing, see the file
175       called COPYRIGHT in your HMMER source distribution, or  see  the  HMMER
176       web page (http://hmmer.org/).
177
178
179

AUTHOR

181       http://eddylab.org
182
183
184
185
186
187
188HMMER 3.3.2                        Nov 2020                         hmmemit(1)
Impressum