NGRAMMAKE(1)                     User Commands                    NGRAMMAKE(1)

NAME

       ngrammake - manual page for ngrammake 1.3.14

DESCRIPTION

       Make an n-gram model from an input count file.

              Usage: ngrammake [--options] [in.fst [out.fst]]

       PROGRAM FLAGS:

       --backoff: type = bool, default = false
              Use backoff smoothing (default: method dependent)

       --backoff_label: type = int64_t, default = 0
              Backoff label

       --bins: type = int64_t, default = -1
              Number of bins for katz or absolute discounting

       --check_consistency: type = bool, default = false
              Check model consistency

       --count_of_counts: type = std::string, default = ""
              Read count-of-counts from file

       --discount_D: type = double, default = -1
              Absolute discount value D to use

       --interpolate: type = bool, default = false
              Use interpolated smoothing (default: method dependent)

       --method: type = std::string, default = "katz"
              One of: "absolute", "katz", "kneser_ney", "presmoothed",
              "unsmoothed", "katz_frac", "witten_bell"

       --norm_eps: type = double, default = 0.001
              Normalization check epsilon

       --witten_bell_k: type = double, default = 1
              Witten-Bell hyperparameter K

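       The program flags above can be combined into a single command line.
       The following is a minimal sketch, not part of ngrammake itself: the
       helper name ngrammake_argv and the file names counts.fst and model.fst
       are hypothetical, and the --flag=value spelling is assumed to be
       accepted for every flag, including bool flags. The helper only builds
       the argument list; it does not run the program.

```python
# Hypothetical helper (not shipped with ngrammake): assembles an argument
# list from the flags documented under PROGRAM FLAGS.

# Values listed for --method in this manual page.
METHODS = ("absolute", "katz", "kneser_ney", "presmoothed",
           "unsmoothed", "katz_frac", "witten_bell")


def ngrammake_argv(in_fst, out_fst, method="katz", **flags):
    """Return an argv list such as ['ngrammake', '--method=katz', ...]."""
    if method not in METHODS:
        raise ValueError(f"unknown --method value: {method!r}")
    argv = ["ngrammake", f"--method={method}"]
    # Sort flag names so the resulting command line is deterministic.
    for name, value in sorted(flags.items()):
        if isinstance(value, bool):
            # Assumption: bool flags accept the explicit true/false form.
            value = "true" if value else "false"
        argv.append(f"--{name}={value}")
    # Positional arguments follow the usage line: [in.fst [out.fst]].
    argv += [in_fst, out_fst]
    return argv


if __name__ == "__main__":
    # counts.fst and model.fst are placeholder file names.
    print(" ".join(ngrammake_argv("counts.fst", "model.fst",
                                  method="absolute", discount_D=0.75)))
```

       The resulting list can be passed to a process launcher such as
       subprocess.run once the input count file exists.
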
       LIBRARY FLAGS:

       Flags from: flags.cc

       --help: type = bool, default = false
              show usage information

       --helpshort: type = bool, default = false
              show brief usage information

       --tmpdir: type = std::string, default = "/tmp"
              temporary directory

       --v: type = int32_t, default = 0
              verbosity level

       Flags from: fst.cc

       --fst_align: type = bool, default = false
              Write FST data aligned where appropriate

       --fst_default_cache_gc: type = bool, default = true
              Enable garbage collection of cache

       --fst_default_cache_gc_limit: type = int64_t, default = 1048576
              Cache byte size that triggers garbage collection

       --fst_read_mode: type = std::string, default = "read"
              Default file reading mode for mappable files

       --fst_verify_properties: type = bool, default = false
              Verify FST properties queried by TestProperties

       --save_relabel_ipairs: type = std::string, default = ""
              Save input relabel pairs to file

       --save_relabel_opairs: type = std::string, default = ""
              Save output relabel pairs to file

       Flags from: ngram-output.cc

       --end_symbol: type = std::string, default = "</s>"
              Class label for sentence end

       --start_symbol: type = std::string, default = "<s>"
              Class label for sentence start

       Flags from: symbol-table.cc

       --fst_compat_symbols: type = bool, default = true
              Require symbol tables to match when appropriate

       --fst_field_separator: type = std::string, default = "         "
              Set of characters used as a separator between printed fields

       Flags from: util.cc

       --fst_error_fatal: type = bool, default = true
              FST errors are fatal if true; otherwise return objects flagged
              as bad: e.g., FSTs: kError property set, FST weights: not a
              Member()

       --ngram_error_fatal: type = bool, default = true
              NGram errors are fatal if true; otherwise return objects flagged
              as bad: e.g., NGramModel::Error() is true

       Flags from: weight.cc

       --fst_weight_parentheses: type = std::string, default = ""
              Characters enclosing the first weight of a printed composite
              weight (e.g., pair weight, tuple weight and derived classes) to
              ensure proper I/O of nested composite weights; must have size 0
              (none) or 2 (open and close parenthesis)

       --fst_weight_separator: type = std::string, default = ","
              Character separator between printed composite weights; must be
              a single character

ngrammake 1.3.14                 February 2022                    NGRAMMAKE(1)