SPHINX_FE(1) General Commands Manual SPHINX_FE(1)

sphinx_fe - Convert audio files to acoustic feature files

sphinx_fe [ options ]...

This program converts audio files (in Microsoft WAV, NIST Sphere, or
raw format) to acoustic feature files for input to batch-mode speech
recognition. The resulting files can also be used for other purposes.
A list of options follows:

-alpha Preemphasis parameter

-argfile
File (e.g. feat.params from an acoustic model) to read parameters
from. This will override anything set in other command line
arguments.

-blocksize
Number of samples to read at a time

-build_outdirs
Create missing subdirectories in output directory

-c Control file for batch processing

-cep2spec
Input is cepstral files, output is log spectral files

-di Input directory; input file names are relative to this, if defined

-dither
Add 1/2-bit noise

-do Output directory; output files are relative to this

-doublebw
Use double bandwidth filters (same center frequency)

-ei Extension to be applied to all input files

-eo Extension to be applied to all output files

-example
Shows an example of how to use the tool

-frate Frame rate

-help Shows the usage of the tool

-i Audio input file

-input_endian
Endianness of input data, big or little; ignored for NIST or MS Wav
input

-lifter
Length of sin-curve for liftering, or 0 for no liftering

-logspec
Write out logspectral files instead of cepstra

-lowerf
Lower edge of filters

-mach_endian
Endianness of machine, big or little

-mswav Defines input format as Microsoft Wav (RIFF)

-ncep Number of cepstral coefficients

-nchans
Number of channels of data (interleaved samples assumed)

-nfft Size of FFT

-nfilt Number of filter banks

-nist Defines input format as NIST sphere

-npart Number of parts to run in (supersedes -nskip and -runlen if
non-zero)

-nskip If a control file was specified, the number of utterances to
skip at the head of the file

-o Cepstral output file

-ofmt Format of output files: one of sphinx, htk, text

-part Index of the part to run (supersedes -nskip and -runlen if
non-zero)

-raw Defines input format as raw binary data

-remove_dc
Remove DC offset from each frame

-round_filters
Round mel filter frequencies to DFT points

-runlen
If a control file was specified, the number of utterances to
process, or -1 for all

-samprate
Sampling rate

-seed Seed for random number generator; if less than zero, a seed is
chosen automatically

-smoothspec
Write out cepstral-smoothed logspectral files

-sndfile
Use libsndfile to read input data

-spec2cep
Input is log spectral files, output is cepstral files

-sph2pipe
Input is NIST sphere (possibly with Shorten), use sph2pipe to
convert

-transform
Which type of transform to use to calculate cepstra (legacy,
dct, or htk)

-unit_area
Normalize mel filters to unit area

-upperf
Upper edge of filters

-verbose
Show input filenames

-warp_params
Parameters defining the warping function

-warp_type
Warping function type (or shape)

-whichchan
Channel to process (numbered from 1), or 0 to mix all channels

-wlen Hamming window length
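As a rough illustration of what -alpha, -samprate, -frate and -wlen
control, the following Python sketch applies the textbook forms of
preemphasis, framing and Hamming windowing. It is not the sphinx_fe
implementation. The function names are hypothetical, the units (-wlen
in seconds, -frate in frames per second) are assumptions not stated in
the option list above, and the numeric values are illustrative only.

```python
import math


def preemphasize(samples, alpha):
    """Textbook preemphasis: y[n] = x[n] - alpha * x[n-1].

    The first sample is passed through unchanged (x[-1] is taken as 0).
    """
    out = []
    prev = 0.0
    for x in samples:
        out.append(x - alpha * prev)
        prev = x
    return out


def hamming(n):
    """Standard Hamming window of n points."""
    if n == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1))
            for i in range(n)]


def frame_signal(samples, samprate, frate, wlen):
    """Cut samples into overlapping, Hamming-windowed frames.

    Assumes frate is in frames per second and wlen is in seconds.
    Trailing samples that do not fill a whole window are dropped.
    """
    shift = round(samprate / frate)   # samples between frame starts
    size = round(wlen * samprate)     # samples per analysis window
    window = hamming(size)
    frames = []
    start = 0
    while start + size <= len(samples):
        chunk = samples[start:start + size]
        frames.append([s * w for s, w in zip(chunk, window)])
        start += shift
    return frames


if __name__ == "__main__":
    # One second of a 440 Hz tone at an illustrative 16 kHz sampling rate.
    tone = [math.sin(2.0 * math.pi * 440.0 * t / 16000) for t in range(16000)]
    frames = frame_signal(preemphasize(tone, 0.97), 16000, 100, 0.025)
    print(len(frames), "frames of", len(frames[0]), "samples")
```

How the real tool handles the final partial window and rounding of the
window size may differ from this sketch.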
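The batch options above (-c, -nskip, -runlen, -di, -do, -ei, -eo) can
be pictured with the following Python sketch. It only mirrors the
option descriptions; it is not taken from the sphinx_fe source. The
helper names are hypothetical, and two details are assumptions: that
the control file lists one utterance per line, and that an extension
is joined to the name with a dot.

```python
import posixpath


def select_utterances(entries, nskip=0, runlen=-1):
    """Skip nskip entries at the head, then keep runlen entries.

    A negative runlen keeps everything that remains, following the
    "-1 for all" wording of the -runlen description.
    """
    rest = entries[nskip:]
    if runlen < 0:
        return rest
    return rest[:runlen]


def io_paths(utt, di, do, ei, eo):
    """Build (input, output) paths for one control-file entry.

    Assumes names are relative to the -di / -do directories and that
    the -ei / -eo extensions are appended after a dot.
    """
    return (posixpath.join(di, utt + "." + ei),
            posixpath.join(do, utt + "." + eo))


if __name__ == "__main__":
    ctl = ["utt001", "utt002", "utt003", "utt004"]
    for utt in select_utterances(ctl, nskip=1, runlen=2):
        print(io_paths(utt, "wav", "feat", "wav", "mfc"))
```

The -npart and -part options are left out on purpose, since the option
list does not say how the parts are divided.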
Currently, the only features supported are MFCCs (mel-frequency
cepstral coefficients). Numerous options control the properties of
the output features. It is VERY important that you document the
specific set of flags used to create any given set of feature files,
since this information is NOT recorded in the files themselves, and
any mismatch between the parameters used to extract features for
recognition and those used to extract features for training will
cause recognition to fail.
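One way to follow this advice is to keep the flags in a single place
and derive both the command line and a written record from it. The
Python sketch below does that. The helper names and the flag values
are hypothetical, and the one "-name value" pair per line layout of
the record is an assumption about what -argfile accepts, so check it
against a real feat.params before relying on it.

```python
def render_params(flags):
    """Render flags as text, one "-name value" pair per line.

    Names are sorted so that two records of the same flags compare
    equal line by line.
    """
    return "".join("-%s %s\n" % (name, flags[name]) for name in sorted(flags))


def build_command(flags, infile, outfile):
    """Build an argument list for one input and one output file."""
    argv = ["sphinx_fe", "-i", infile, "-o", outfile]
    for name in sorted(flags):
        argv += ["-" + name, str(flags[name])]
    return argv


if __name__ == "__main__":
    # Illustrative values only, not recommended settings.
    flags = {"samprate": 16000, "nfilt": 40, "ncep": 13}
    print(render_params(flags), end="")
    print(" ".join(build_command(flags, "utt001.wav", "utt001.mfc")))
```

Storing the rendered text next to the feature files lets the training
and recognition runs be checked against the same record.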
Written by numerous people at CMU from 1994 onwards. This manual page
by David Huggins-Daines <dhuggins@cs.cmu.edu>

Copyright © 1994-2007 Carnegie Mellon University. See the file COPYING
included with this package for more information.

2007-08-27 SPHINX_FE(1)