1NGRAMREAD(1) User Commands NGRAMREAD(1)
2
3
4
6 ngramread - manual page for ngramread 1.3.14
7
9 Transform text formats to FST.
10
11 Usage: ngramread [--options] [in.txt [out.fst]]
12
13 PROGRAM FLAGS:
14
15 --ARPA: type = bool, default = false
16
17 Read model from ARPA format --OOV_symbol: type = std::string,
18 default = "<UNK>" Class label for OOV symbols --epsilon_symbol:
19 type = std::string, default = "<epsilon>" Label for epsilon
20 transitions --renormalize_arpa: type = bool, default = false If
21 true, attempts to renormalize an unnormalized ARPA format model
22 by normalizing the unigram state and recomputing the backoff
23 weights. Only used if --ARPA=true. --symbols: type =
24 std::string, default = "" Label symbol table
25
26 LIBRARY FLAGS:
27
28 Flags from: flags.cc
29
30 --help: type = bool, default = false
31
32 show usage information --helpshort: type = bool, default = false
33 show brief usage information --tmpdir: type = std::string, de‐
34 fault = "/tmp" temporary directory --v: type = int32_t, default
35 = 0 verbosity level
36
37 Flags from: fst.cc
38
39 --fst_align: type = bool, default = false
40
41 Write FST data aligned where appropriate --fst_default_cache_gc:
42 type = bool, default = true Enable garbage collection of cache
43 --fst_default_cache_gc_limit: type = int64_t, default = 1048576
44 Cache byte size that triggers garbage collection
45 --fst_read_mode: type = std::string, default = "read" Default
46 file reading mode for mappable files --fst_verify_properties:
47 type = bool, default = false Verify FST properties queried by
48 TestProperties --save_relabel_ipairs: type = std::string, de‐
49 fault = "" Save input relabel pairs to file --save_rela‐
50 bel_opairs: type = std::string, default = "" Save output relabel
51 pairs to file
52
53 Flags from: ngram-output.cc
54
55 --end_symbol: type = std::string, default = "</s>"
56
57 Class label for sentence end --start_symbol: type = std::string,
58 default = "<s>" Class label for sentence start
59
60 Flags from: symbol-table.cc
61
62 --fst_compat_symbols: type = bool, default = true
63
64 Require symbol tables to match when appropriate --fst_field_sep‐
65 arator: type = std::string, default = " " Set of charac‐
66 ters used as a separator between printed fields
67
68 Flags from: util.cc
69
70 --fst_error_fatal: type = bool, default = true
71
72 FST errors are fatal; o.w. return objects flagged as bad: e.g.,
73 FSTs: kError property set, FST weights: not a Member()
74 --ngram_error_fatal: type = bool, default = true NGram errors
75 are fatal if true; otherwise returns objects flagged as bad:
76 e.g., NGramModel::Error() is true
77
78 Flags from: weight.cc
79
80 --fst_weight_parentheses: type = std::string, default = ""
81
82 Characters enclosing the first weight of a printed composite
83 weight (e.g., pair weight, tuple weight and derived classes) to
84 ensure proper I/O of nested composite weights; must have size 0
85 (none) or 2 (open and close parenthesis) --fst_weight_separator:
86 type = std::string, default = "," Character separator between
87 printed composite weights; must be a single character
88
89
90
91ngramread 1.3.14 February 2022 NGRAMREAD(1)