NGRAMREAD(1)                     User Commands                    NGRAMREAD(1)

NAME

       ngramread - manual page for ngramread 1.3.14


DESCRIPTION

       Transform text formats to FST.

              Usage: ngramread [--options] [in.txt [out.fst]]

       PROGRAM FLAGS:

       --ARPA: type = bool, default = false
              Read model from ARPA format

       --OOV_symbol: type = std::string, default = "<UNK>"
              Class label for OOV symbols

       --epsilon_symbol: type = std::string, default = "<epsilon>"
              Label for epsilon transitions

       --renormalize_arpa: type = bool, default = false
              If true, attempts to renormalize an unnormalized ARPA format
              model by normalizing the unigram state and recomputing the
              backoff weights. Only used if --ARPA=true.

       --symbols: type = std::string, default = ""
              Label symbol table

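       As an illustration of the --ARPA flag above, the following sketch
       writes a tiny hand-made ARPA file and converts it if ngramread is
       installed. The file names toy.arpa and toy.fst and all
       log-probability values are assumptions made up for this example,
       not the output of a trained model. The values are not guaranteed
       to be normalized, so --renormalize_arpa may be needed in practice.

```shell
#!/bin/sh
# Sketch only: toy.arpa and toy.fst are hypothetical file names, and the
# log10 probabilities below are made-up illustrative values.
cat > toy.arpa <<'EOF'
\data\
ngram 1=4
ngram 2=3

\1-grams:
-0.60206 </s>
-99 <s> -0.30103
-0.60206 a -0.30103
-0.60206 b -0.30103

\2-grams:
-0.30103 <s> a
-0.30103 a b
-0.30103 b </s>

\end\
EOF

# Convert only if the tool is on PATH; otherwise just report what happened.
if command -v ngramread >/dev/null 2>&1; then
  ngramread --ARPA toy.arpa toy.fst || echo "ngramread could not convert toy.arpa"
else
  echo "ngramread not found; wrote toy.arpa only"
fi
```

       The here-document is quoted ('EOF') so that the backslashes in the
       ARPA section markers are written literally.
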
       LIBRARY FLAGS:

       Flags from: flags.cc

       --help: type = bool, default = false
              show usage information

       --helpshort: type = bool, default = false
              show brief usage information

       --tmpdir: type = std::string, default = "/tmp"
              temporary directory

       --v: type = int32_t, default = 0
              verbosity level

       Flags from: fst.cc

       --fst_align: type = bool, default = false
              Write FST data aligned where appropriate

       --fst_default_cache_gc: type = bool, default = true
              Enable garbage collection of cache

       --fst_default_cache_gc_limit: type = int64_t, default = 1048576
              Cache byte size that triggers garbage collection

       --fst_read_mode: type = std::string, default = "read"
              Default file reading mode for mappable files

       --fst_verify_properties: type = bool, default = false
              Verify FST properties queried by TestProperties

       --save_relabel_ipairs: type = std::string, default = ""
              Save input relabel pairs to file

       --save_relabel_opairs: type = std::string, default = ""
              Save output relabel pairs to file

       Flags from: ngram-output.cc

       --end_symbol: type = std::string, default = "</s>"
              Class label for sentence end

       --start_symbol: type = std::string, default = "<s>"
              Class label for sentence start

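       A minimal sketch of overriding the sentence-boundary labels,
       assuming an input file whose boundary tokens differ from the
       defaults. The file names model.arpa and model.fst and the tokens
       <S> and </S> are hypothetical; how ngramread uses these labels
       beyond the one-line descriptions above is not stated in this page.

```shell
#!/bin/sh
# Sketch only: model.arpa, model.fst and the <S> / </S> tokens are
# hypothetical; substitute the boundary tokens your file actually uses.
START='<S>'
END='</S>'

# Build the argument list once so it can be inspected before running.
set -- ngramread --ARPA \
  "--start_symbol=$START" "--end_symbol=$END" \
  model.arpa model.fst
CMD="$*"
echo "$CMD"

# Run only when both the tool and the input file exist.
if command -v ngramread >/dev/null 2>&1 && [ -f model.arpa ]; then
  "$@"
fi
```

       Quoting each flag keeps the angle brackets from being treated as
       shell redirections.
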
       Flags from: symbol-table.cc

       --fst_compat_symbols: type = bool, default = true
              Require symbol tables to match when appropriate

       --fst_field_separator: type = std::string, default = "\t " (tab and space)
              Set of characters used as a separator between printed fields

       Flags from: util.cc

       --fst_error_fatal: type = bool, default = true
              FST errors are fatal if true; otherwise functions return
              objects flagged as bad, e.g., FSTs with the kError property
              set, or FST weights that are not a Member()

       --ngram_error_fatal: type = bool, default = true
              NGram errors are fatal if true; otherwise functions return
              objects flagged as bad, e.g., NGramModel::Error() is true

       Flags from: weight.cc

       --fst_weight_parentheses: type = std::string, default = ""
              Characters enclosing the first weight of a printed composite
              weight (e.g., pair weight, tuple weight and derived classes)
              to ensure proper I/O of nested composite weights; must have
              size 0 (none) or 2 (open and close parenthesis)

       --fst_weight_separator: type = std::string, default = ","
              Character separator between printed composite weights; must
              be a single character



ngramread 1.3.14                 February 2022                    NGRAMREAD(1)