1NGRAMSYMBOLS(1)                  User Commands                 NGRAMSYMBOLS(1)
2
3
4

NAME

6       ngramsymbols - manual page for ngramsymbols 1.3.4
7

DESCRIPTION

9       Derives a symbol table from a corpus.
10
11              Usage: ngramsymbols [--options] [in.txt [out.txt]]
12
13       PROGRAM FLAGS:
14
15       --OOV_symbol: type = string, default = "<unk>"
16
17              Class label for OOV symbols --epsilon_symbol: type = string, de‐
18              fault = "<epsilon>" Label for epsilon
19
20       LIBRARY FLAGS:
21
22       Flags from: flags.cc
23
24       --help: type = bool, default = false
25
26              show usage information --helpshort: type = bool, default = false
27              show  brief usage information --tmpdir: type = string, default =
28              "/tmp" temporary directory --v: type = int32, default =  0  ver‐
29              bosity level
30
31       Flags from: fst.cc
32
33       --fst_align: type = bool, default = false
34
35              Write FST data aligned where appropriate --fst_default_cache_gc:
36              type = bool, default = true Enable garbage collection  of  cache
37              --fst_default_cache_gc_limit:  type  =  int64, default = 1048576
38              Cache   byte   size    that    triggers    garbage    collection
39              --fst_read_mode:  type  =  string, default = "read" Default file
40              reading mode for mappable files --fst_verify_properties: type  =
41              bool, default = false Verify FST properties queried by TestProp‐
42              erties --save_relabel_ipairs: type = string, default =  ""  Save
43              input  relabel  pairs  to  file  --save_relabel_opairs:  type  =
44              string, default = "" Save output relabel pairs to file
45
46       Flags from: ngram-output.cc
47
48       --end_symbol: type = string, default = "</s>"
49
50              Class label for sentence end --start_symbol: type = string,  de‐
51              fault = "<s>" Class label for sentence start
52
53       Flags from: symbol-table.cc
54
55       --fst_compat_symbols: type = bool, default = true
56
57              Require symbol tables to match when appropriate --fst_field_sep‐
58              arator: type = string, default = "      " Set of characters used
59              as a separator between printed fields
60
61       Flags from: util.cc
62
63       --fst_error_fatal: type = bool, default = true
64
65              FST  errors are fatal; o.w. return objects flagged as bad: e.g.,
66              FSTs:  kError  property  set,  FST  weights:  not   a   Member()
67              --ngram_error_fatal:  type  =  bool, default = true NGram errors
68              are fatal if true; otherwise returns  objects  flagged  as  bad:
69              e.g., NGramModel::Error() is true
70
71       Flags from: weight.cc
72
73       --fst_weight_parentheses: type = string, default = ""
74
75              Characters  enclosing  the  first  weight of a printed composite
76              weight (e.g., pair weight, tuple weight and derived classes)  to
77              ensure  proper I/O of nested composite weights; must have size 0
78              (none) or 2 (open and close parenthesis) --fst_weight_separator:
79              type = string, default = "," Character separator between printed
80              composite weights; must be a single character
81
82
83
84ngramsymbols 1.3.4               January 2018                  NGRAMSYMBOLS(1)
Impressum