1GENCTD(1)                       ICU 3.6 Manual                       GENCTD(1)
2
3
4

NAME

6       genctd - Compiles word list into ICU compact trie dictionary
7

SYNOPSIS

9       genctd  [  -h, -?, --help ] [ -V, --version ] [ -c, --copyright ] [ -v,
10       --verbose ] [ -d, --destdir destination ] [ -i, --icudatadir  directory
11       ] -o, --out output-file  dictionary-file
12

DESCRIPTION

14       genctd  reads  the word list from dictionary-file and creates a compact
15       trie dictionary file. Normally this data file has the .ctd extension.
16
17       Words begin at the beginning of a line and are terminated by the  first
18       whitespace.  Lines that begin with whitespace are ignored.
19

OPTIONS

21       -h, -?, --help
22              Print help about usage and exit.
23
24       -V, --version
25              Print the version of genctd and exit.
26
27       -c, --copyright
28              Embeds the standard ICU copyright into the output-file.
29
30       -v, --verbose
31              Display extra informative messages during execution.
32
33       -d, --destdir destination
34              Set the destination directory of the output-file to destination.
35
36       -i, --icudatadir directory
37              Look  for  any necessary ICU data files in directory.  For exam‐
38              ple, the file pnames.icu must be located when ICU's data is  not
39              built  as  a  shared library.  The default ICU data directory is
40              specified by the environment variable ICU_DATA.  Most configura‐
41              tions of ICU do not require this argument.
42
43        dictionary-file
44              The source file to read.
45
46       -o, --out output-file
47              The output data file to write.
48

CAVEATS

50       When the dictionary-file contains a byte order mark (BOM) at the begin‐
51       ning of the file, which is the Unicode character U+FEFF, then the  dic‐
52       tionary-file  is  interpreted  as Unicode. Without the BOM, the file is
53       interpreted in the current operating system default codepage.  In order
54       to  eliminate  any  ambiguity of the encoding for how the rule-file was
55       written, it is recommended that you write this file in UTF-8  with  the
56       BOM.
57

ENVIRONMENT

59       ICU_DATA  Specifies  the  directory  containing  ICU  data. Defaults to
60                 /usr/share/icu/3.6/.  Some tools in ICU depend on  the  pres‐
61                 ence of the trailing slash. It is thus important to make sure
62                 that it is present if ICU_DATA is set.
63

AUTHORS

65       Deborah Goldsmith
66

VERSION

68       1.0
69
71       Copyright (C) 2006 IBM, Inc. and others.
72

SEE ALSO

74       http://icu.sourceforge.net/userguide/boundaryAnalysis.html
75
76
77
78
79ICU MANPAGE                      8 March 2006                        GENCTD(1)
Impressum