hunspell(1)

1hunspell(1)                 General Commands Manual                hunspell(1)
2
3
4

NAME

6       hunspell - spell checker, stemmer and morphological analyzer
7

SYNOPSYS

9       hunspell [-1 -a -G -h -H -l -L -m -n -t] [-d dic] [-p ] [files(s)]
10

DESCRIPTION

12       Hunspell  is fashioned after the Ispell program.  The most common usage
13       is "hunspell filename".  In this case, hunspell will display each  word
14       which  does  not  appear in the dictionary at the top of the screen and
15       allow you to change it.  If there are "near misses" in  the  dictionary
16       (words which differ by only a single letter, a missing or extra letter,
17       a pair of transposed letters, a missing space or hyphen, or  a  special
18       string pair), then they are also displayed on following lines.  As well
19       as "near misses", ispell may display other guesses at ways to make  the
20       word  from  a  known  root, with each guess preceded by question marks.
21       Finally, the line containing the word and the previous line are printed
22       at  the  bottom of the screen.  If your terminal can display in reverse
23       video, the  word  itself  is  highlighted.   You  have  the  option  of
24       replacing  the word completely, or choosing one of the suggested words.
25       Commands are single characters as follows (case is ignored):
26
27              R      Replace the misspelled word completely.
28
29              Space  Accept the word this time only.
30
31              A      Accept the word for the rest of this hunspell session.
32
33              I      Accept the word, capitalized as it is in  the  file,  and
34                     update private dictionary.
35
36              U      Accept  the word, and add an uncapitalized (actually, all
37                     lower-case) version to the private dictionary.
38
39              0-n    Replace with one of the suggested words.
40
41              L      Look up words in system  dictionary  (controlled  by  the
42                     WORDS compilation option).
43
44              X      Write  the  rest of this file, ignoring misspellings, and
45                     start next file.
46
47              Q      Exit immediately and leave the file unchanged.
48
49              !      Shell escape (not implemented).
50
51              ^L     Redraw screen.
52
53              ^Z     Suspend ispell.
54
55              ?      Give help screen.
56

OPTIONS

58       -t     The input file is in TeX or LaTeX format.
59
60       -n     The input file is in nroff/troff format.
61
62       -H     The input file is in SGML/HTML format.
63
64
65       The -l or "list" option to hunspell  is  used  to  produce  a  list  of
66       misspelled words from the standard input.
67
68       The  -a  option  is  intended  to be used from other programs through a
69       pipe.  In this mode, hunspell prints a one-line version  identification
70       message,  and then begins reading lines of input.  For each input line,
71       a single line is written to the standard output for each  word  checked
72       for  spelling  on  the  line.   If  the  word  was  found  in  the main
73       dictionary, or your personal dictionary, then the line contains only  a
74       '*'.   If  the  word  was  found  through  affix removal, then the line
75       contains a '+', a space, and the root word.   If  the  word  was  found
76       through  compound  formation (concatenation of two words, then the line
77       contains only a '-'.
78
79       If the word is not in the dictionary, but there are near  misses,  then
80       the  line  contains  an '&', a space, the misspelled word, a space, the
81       number of near misses, the number of characters between  the  beginning
82       of  the line and the beginning of the misspelled word, a colon, another
83       space, and a list of the near misses separated by  commas  and  spaces.
84       Following  the  near  misses  (and identified only by the count of near
85       misses), if the word could be formed by adding (illegal) affixes  to  a
86       known  root,  is  a  list  of suggested derivations, again separated by
87       commas and spaces.
88
89       Also, each near miss or guess is capitalized the same as the input word
90       unless  such  capitalization  is  illegal; in the latter case each near
91       miss is capitalized correctly according to the dictionary.
92
93       Finally, if the word does not appear in the dictionary, and  there  are
94       no  near  misses, then the line contains a '#', a space, the misspelled
95       word, a space, and the character offset from the beginning of the line.
96       Each  sentence  of  text  input  is terminated with an additional blank
97       line, indicating that hunspell has completed processing the input line.
98
99       These output lines can be summarized as follows:
100
101              OK:    *
102
103              Root:  + <root>
104
105              Compound:
106                     -
107
108              Miss:  &  <original>  <count>  <offset>:  <miss>,  <miss>,  ...,
109                     <guess>, ...
110
111              Guess: ? <original> 0 <offset>: <guess>, <guess>, ...
112
113              None:  # <original> <offset>
114
115       For  example,  a  dummy dictionary containing the words "fray", "Frey",
116       "fry", and "refried"  might  produce  the  following  response  to  the
117       command "echo 'frqy refries | hunspell -a -m -d ./test.hash":
118              (#) Hunspell 0.4.1 (beta), 2005-05-26
119              & frqy 3 0: fray, Frey, fry
120              & refries 1 5: refried, re+fry-y+ies
121
122       This  mode is also suitable for interactive use when you want to figure
123       out the spelling of a single word.
124
125       When in the -a mode, hunspell will also accept lines  of  single  words
126       prefixed  with any of '*', '&', '@', '+', '-', '~', '#', '!', '%', '`',
127       or '^'.  A line starting with '*' tells hunspell  to  insert  the  word
128       into the user's dictionary (similar to the I command).  A line starting
129       with '&' tells hunspell to insert an all-lowercase version of the  word
130       into the user's dictionary (similar to the U command).  A line starting
131       with '@' causes hunspell to accept this word in the future (similar  to
132       the  A command).  A line starting with '+', followed immediately by tex
133       or nroff will cause hunspell to parse future input according the syntax
134       of  that  formatter.   A  line  consisting  solely  of a '+' will place
135       hunspell in TeX/LaTeX mode (similar to the -t option) and  '-'  returns
136       hunspell  to  nroff/troff  mode  (but  these  commands  are  obsolete).
137       However, the string character type is not changed; the '~' command must
138       be  used  to  do this.  A line starting with '~' causes hunspell to set
139       internal parameters (in particular, the default string character  type)
140       based on the filename given in the rest of the line.  (A file suffix is
141       sufficient, but the period must be included.  Instead of a file name or
142       suffix,  a  unique  name,  as listed in the language affix file, may be
143       specified.)  However, the formatter parsing is not  changed;   the  '+'
144       command must be used to change the formatter.  A line prefixed with '#'
145       will cause the personal dictionary to be saved.  A line  prefixed  with
146       '!'  will  turn on terse mode (see below), and a line prefixed with '%'
147       will return hunspell to normal (non-terse) mode.  A line prefixed  with
148       '`'  will  turn  on  verbose-correction mode (see below); this mode can
149       only be disabled by turning on terse mode with '%'.
150
151       Any input following the prefix characters '+', '-', '#', '!',  '%',  or
152       '`'  is  ignored, as is any input following the filename on a '~' line.
153       To allow spell-checking of lines beginning  with  these  characters,  a
154       line  starting  with '^' has that character removed before it is passed
155       to the  spell-checking  code.   It  is  recommended  that  programmatic
156       interfaces prefix every data line with an uparrow to protect themselves
157       against future changes in hunspell.
158
159       To summarize these:
160
161              *      Add to personal dictionary
162
163              @      Accept word, but leave out of dictionary
164
165              #      Save current personal dictionary
166
167              ~      Set parameters based on filename
168
169              +      Enter TeX mode
170
171              -      Exit TeX mode
172
173              !      Enter terse mode
174
175              %      Exit terse mode
176
177              `      Enter verbose-correction mode
178
179              ^      Spell-check rest of line
180
181       In terse mode, hunspell will not print lines beginning with  '*',  '+',
182       or  '-',  all  of  which  indicate  correct  words.  This significantly
183       improves running speed when the driving  program  is  going  to  ignore
184       correct words anyway.
185
186       In   verbose-correction  mode,  hunspell  includes  the  original  word
187       immediately after the indicator character  in  output  lines  beginning
188       with '*', '+', and '-', which simplifies interaction for some programs.
189
190
191       -d  dic
192              Path of affix and dic files, without file extension.
193
194       -G
195
196       -h, --help
197              Short help.
198
199       -p  custom_dictionary
200              Set     path     of     custom     dictionary.     Default    is
201              $HOME/.hunspell_default.   Setting   -d   or    the   DICTIONARY
202              environmental    variable,    custom    dictionary    will    be
203              $HOME/.hunspell_szótárnév
204
205       -u     Print typical mistakes with suggestions.
206
207       -u2    List typical mistakes with suggestions to a sed files.  Using of
208              this file:
209
210              sed -f correct.sed original.txt >corrected.txt
211
212

EXAMPLES

214       hunspell -d en_US english.html
215
216       hunspell -d /opt/OpenOffice.org1.0.1/share/dict/ooo/de_DE
217
218       hunspell *.html
219
220       hunspell -l text.html
221
222       hunspell -r *.html
223

ENVIRONMENT

225       DICTIONARY
226              Similar to -d.
227
228       WORDLIST
229              Equivalent to -p.
230

FILES

232       /usr/share/myspell/default.aff   Path   of   default  affix  file.  See
233       hunspell(4).
234
235       /usr/share/myspell/default.dic Path of default  dictiorary  file.   See
236       hunspell(4).  $HOME/.hunspell_default.Seehunspell(1).
237

AUTHOR

242       Hunspell  based  on  Ispell's  algorithms  and OpenOffice.org's Myspell
243       source code.
244
245       Author of International Ispell is Geoff Kuenning.
246
247       Author of MySpell is Kevin Hendricks.
248
249       Author of Hunspell is László Németh.
250
251       This manual based on Ispell's manual (See ispell(1)).
252

BUGS

254       Unicode tokenization haven't implemented yet.
255
256       Tokenize e-mail address, URL-s, paths. Put these character strings into
257       var element in HTML, and \url{pathl} in LaTeX.
258
259                                  2005-07-15                       hunspell(1)