1NGRAMMAKE(1) User Commands NGRAMMAKE(1)
2
3
4
6 ngrammake - manual page for ngrammake 1.3.14
7
9 Make n-gram model from input count file.
10
11 Usage: ngrammake [--options] [in.fst [out.fst]]
12
13 PROGRAM FLAGS:
14
15 --backoff: type = bool, default = false
16
17 Use backoff smoothing (default: method dependent) --backoff_la‐
18 bel: type = int64_t, default = 0 Backoff label --bins: type =
19 int64_t, default = -1 Number of bins for katz or absolute dis‐
20 counting --check_consistency: type = bool, default = false Check
21 model consistency --count_of_counts: type = std::string, default
22 = "" Read count-of-counts from file --discount_D: type = double,
23 default = -1 Absolute discount value D to use --interpolate:
24 type = bool, default = false Use interpolated smoothing (de‐
25 fault: method dependent) --method: type = std::string, default =
26 "katz" One of: "absolute", "katz", "kneser_ney", "presmoothed",
27 "unsmoothed", "katz_frac", "witten_bell" --norm_eps: type = dou‐
28 ble, default = 0.001 Normalization check epsilon --wit‐
29 ten_bell_k: type = double, default = 1 Witten-Bell hyperparame‐
30 ter K
31
32 LIBRARY FLAGS:
33
34 Flags from: flags.cc
35
36 --help: type = bool, default = false
37
38 show usage information --helpshort: type = bool, default = false
39 show brief usage information --tmpdir: type = std::string, de‐
40 fault = "/tmp" temporary directory --v: type = int32_t, default
41 = 0 verbosity level
42
43 Flags from: fst.cc
44
45 --fst_align: type = bool, default = false
46
47 Write FST data aligned where appropriate --fst_default_cache_gc:
48 type = bool, default = true Enable garbage collection of cache
49 --fst_default_cache_gc_limit: type = int64_t, default = 1048576
50 Cache byte size that triggers garbage collection
51 --fst_read_mode: type = std::string, default = "read" Default
52 file reading mode for mappable files --fst_verify_properties:
53 type = bool, default = false Verify FST properties queried by
54 TestProperties --save_relabel_ipairs: type = std::string, de‐
55 fault = "" Save input relabel pairs to file --save_rela‐
56 bel_opairs: type = std::string, default = "" Save output relabel
57 pairs to file
58
59 Flags from: ngram-output.cc
60
61 --end_symbol: type = std::string, default = "</s>"
62
63 Class label for sentence end --start_symbol: type = std::string,
64 default = "<s>" Class label for sentence start
65
66 Flags from: symbol-table.cc
67
68 --fst_compat_symbols: type = bool, default = true
69
70 Require symbol tables to match when appropriate --fst_field_sep‐
71 arator: type = std::string, default = " " Set of charac‐
72 ters used as a separator between printed fields
73
74 Flags from: util.cc
75
76 --fst_error_fatal: type = bool, default = true
77
78 FST errors are fatal; o.w. return objects flagged as bad: e.g.,
79 FSTs: kError property set, FST weights: not a Member()
80 --ngram_error_fatal: type = bool, default = true NGram errors
81 are fatal if true; otherwise returns objects flagged as bad:
82 e.g., NGramModel::Error() is true
83
84 Flags from: weight.cc
85
86 --fst_weight_parentheses: type = std::string, default = ""
87
88 Characters enclosing the first weight of a printed composite
89 weight (e.g., pair weight, tuple weight and derived classes) to
90 ensure proper I/O of nested composite weights; must have size 0
91 (none) or 2 (open and close parenthesis) --fst_weight_separator:
92 type = std::string, default = "," Character separator between
93 printed composite weights; must be a single character
94
95
96
97ngrammake 1.3.14 February 2022 NGRAMMAKE(1)