adintool(1)

1    adintool
2
3ADINTOOL(1)                 General Commands Manual                ADINTOOL(1)
4
5
6

NAME

8           adintool
9          - a tool to record / split / send / receive audio streams
10

SYNOPSIS

12       adintool {-in inputdev} {-out outputdev} [options...]
13

DESCRIPTION

15       adintool analyzes speech input, finds speech segments skipping silence,
16       and records the detected segments in various ways. It performs speech
17       detection based on zerocross number and power (level), and records the
18       detected parts to files or other output devices sucessively.
19
20       adintool is a upper version of adinrec with various functions.
21       Supported input device are: microphone input, a speech file, standard
22       tty input, and network socket (called adin-net server mode). Julius
23       plugin can be also used. Detected speech segments will be saved to
24       output devices: speech files, standard tty output, and network socket
25       (called adin-net client mode). For example, you can split the incoming
26       speech to segments and send them to Julius to be recognized.
27
28       Output format is WAV, 16bit (signed short), monoral. If the file
29       already exist, it will be overridden.
30

OPTIONS

32       All Julius options can be set. Only audio input related options are
33       treated and others are silently skipped. Below is a list of options.
34
35   adintool specific options
36        -freq  Hz
37           Set sampling rate in Hz. (default: 16,000)
38
39        -in  inputdev
40           Audio input device. "mic" to capture via microphone input, "file"
41           for audio file input, and "stdin" to read raw data from
42           standard-input. For file input, file name prompt will appear after
43           startup. Use "adinnet" to make adintool as "adinnet server",
44           receiving data from client via network socket. Default port number
45           is 5530, which can be altered by option "-inport".
46
47           Alternatively, input device can be set by "-input" option, in which
48           case you can use plugin input.
49
50        -out  outputdev
51           Audio output device store the data. Specify "file" to save to file,
52           in which the output filename should be given by "-filename". Use
53           "stdout" to standard out. "adinnet" will make adintool to be an
54           adinnet client, sending speech data to a server via tcp/ip socket.
55           "vecnet" will make adintool to be a vecnet client, sending feature
56           vectors extracted from input to a server via tcp/ip socket. When
57           using "adinnet" and "vecnet" output, the server name to send data
58           should be specified by "-server". The default port number is 5530,
59           which can be changed by "-port" option.
60
61        -inport  num
62           When adintool becomes adinnet server to receive data (-in adinnet),
63           set the port number to listen. (default: 5530)
64
65        -server  [host] [,host...]
66           When output to adinnet server (-out adinnet), set the hostname. You
67           can send to multiple hosts by specifying their hostnames as
68           comma-delimited list like "host1,host2,host3".
69
70        -port  [num] [,num...]
71           When adintool send a data to adinnet server (-out adinnet), set the
72           port number to connect. (default: 5530) For multiple servers,
73           specify port numbers for all servers like "5530,5530,5531".
74
75        -filename  file
76           When output to file (-out file), set the output filename. The
77           actual file name will be as "file.0000.wav" , "file.0001.wav" and
78           so on, where the four digit number increases as speech segment
79           detected. The initial number will be set to 0 by default, which can
80           be changed by "-startid" option. When using "-oneshot" option to
81           save only the first segment, the input will be saved as "file".
82
83        -startid  number
84           At file output, set the initial file number. (default: 0)
85
86        -oneshot
87           Exit after the end of first speech segment.
88
89        -nosegment
90           Do not perform speech detection for input, just treat all the input
91           as a single valid segment.
92
93        -raw
94           Output as RAW file (no header).
95
96        -autopause
97           When output to adinnet server, adintool enter pause state at every
98           end of speech segment. It will restart when the destination adinnet
99           server sends it a resume signal.
100
101        -loosesync
102           When output to multiple adinnet server, not to do strict
103           synchronization for restart. By default, when adintool has entered
104           pause state, it will not restart until resume commands are received
105           from all servers. This option will allow restart at least one
106           restart command has arrived.
107
108        -rewind  msec
109           When input is a live microphone device, and there has been some
110           continuing input at the moment adintool resumes, it start recording
111           backtracking by the specified milliseconds.
112
113        -paramtype  parameter_type
114           When output is a vecnet (-out vecnet), specify parameter type in
115           HTK format like "MFCC_E_D_N_Z".
116
117        -veclen  vector_length
118           When output is a vecnet (-out vecnet), specify vector length
119           (dim.).
120
121   Concerning Julius options
122        -input  {mic|rawfile|adinnet|stdin|netaudio|esd|alsa|oss}
123           Choose speech input source. Specify 'file' or 'rawfile' for
124           waveform file. On file input, users will be prompted to enter the
125           file name from stdin.
126
127           'mic' is to get audio input from a default live microphone device,
128           and 'adinnet' means receiving waveform data via tcpip network from
129           an adinnet client. 'netaudio' is from DatLink/NetAudio input, and
130           'stdin' means data input from standard input.
131
132           At Linux, you can choose API at run time by specifying alsa, oss
133           and esd.
134
135        -lv  thres
136           Level threshold for speech input detection. Values should be in
137           range from 0 to 32767. (default: 2000)
138
139        -zc  thres
140           Zero crossing threshold per second. Only input that goes over the
141           level threshold (-lv) will be counted. (default: 60)
142
143        -headmargin  msec
144           Silence margin at the start of speech segment in milliseconds.
145           (default: 300)
146
147        -tailmargin  msec
148           Silence margin at the end of speech segment in milliseconds.
149           (default: 400)
150
151        -zmean
152           This option enables DC offset removal.
153
154        -smpFreq  Hz
155           Set sampling rate in Hz. (default: 16,000)
156
157        -48
158           Record input with 48kHz sampling, and down-sample it to 16kHz
159           on-the-fly. This option is valid for 16kHz model only. The
160           down-sampling routine was ported from sptk. (Rev. 4.0)
161
162        -NA  devicename
163           Host name for DatLink server input (-input netaudio).
164
165        -adport  port_number
166           With -input adinnet, specify adinnet port number to listen.
167           (default: 5530)
168
169        -nostrip
170           Julius by default removes successive zero samples in input speech
171           data. This option stop it.
172
173        -C  jconffile
174           Load a jconf file at here. The content of the jconffile will be
175           expanded at this point.
176
177        -plugindir  dirlist
178           Specify which directories to load plugin. If several direcotries
179           exist, specify them by colon-separated list.
180

ENVIRONMENT VARIABLES

182        ALSADEV
183           (using mic input with alsa device) specify a capture device name.
184           If not specified, "default" will be used.
185
186        AUDIODEV
187           (using mic input with oss device) specify a capture device path. If
188           not specified, "/dev/dsp" will be used.
189
190        PORTAUDIO_DEV
191           (portaudio V19) specify the name of capture device to use. See the
192           instruction output of log at start up how to specify it.
193
194        LATENCY_MSEC
195           Try to set input latency of microphone input in milliseconds.
196           Smaller value will shorten latency but sometimes make process
197           unstable. Default value will depend on the running OS.
198

EXAMPLES

200       Record microphone input to files: "data.0000.wav", "data.0001.wav" and
201       so on:
202       Split a long speech file "foobar.raw" into "foobar.1500.wav",
203       "foobar.1501.wav" ...:
204       Copy an entire audio file via network socket.
205       Detect speech segment, send to Julius via network and recognize it:
206

COPYRIGHT

212       Copyright (c) 1991-2013 Kawahara Lab., Kyoto University
213
214       Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan
215
216       Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and
217       Technology
218
219       Copyright (c) 2005-2013 Julius project team, Nagoya Institute of
220       Technology
221

LICENSE

223       The same as Julius.
224
225
226
227                                  12/19/2013                       ADINTOOL(1)