NGRAMCOUNT(1)                    User Commands                   NGRAMCOUNT(1)

NAME

       ngramcount - manual page for ngramcount 1.3.4

DESCRIPTION

       Count n-grams from an input file.

              Usage: ngramcount [--options] [in.far [out.fst]]

       PROGRAM FLAGS:

       --alpha: type = double, default = 1
              Weight for first FST

       --backoff_label: type = int64, default = 0
              Backoff label

       --beta: type = double, default = 1
              Weight for second (and subsequent) FST(s)

       --check_consistency: type = bool, default = false
              Check model consistency

       --context_pattern: type = string, default = ""
              Pattern of contexts to count

       --epsilon_as_backoff: type = bool, default = false
              Treat epsilon in the input Fsts as backoff

       --method: type = string, default = "counts"
              One of: "counts", "histograms", "count_of_counts",
              "count_of_histograms"

       --norm_eps: type = double, default = 0.001
              Normalization check epsilon

       --normalize: type = bool, default = false
              Normalize resulting model

       --order: type = int64, default = 3
              Set maximal order of ngrams to be counted

       --output_fst: type = bool, default = true
              Output counts as fst (otherwise strings)

       --require_symbols: type = bool, default = true
              Require symbol tables? (default: yes)

       --round_to_int: type = bool, default = false
              Round all counts to integers

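       As a rough illustration of how the usage line and the program flags
       above fit together, the sketch below assembles one possible command
       line in the --flag=value form. The file names corpus.far and
       corpus.cnts are hypothetical, and corpus.far is assumed to be an
       existing FAR archive of sentences; the script only runs ngramcount
       when both the tool and that input file are present.

```shell
#!/bin/sh
# Sketch only: corpus.far and corpus.cnts are hypothetical file names.
# corpus.far is assumed to be an existing FAR archive of sentences.
IN_FAR=corpus.far
OUT_FST=corpus.cnts

# Flags taken from the PROGRAM FLAGS list, written as --flag=value,
# followed by the two positional arguments from the usage line.
set -- --order=5 --method=counts "$IN_FAR" "$OUT_FST"

# Show the command line that would be run.
CMD="ngramcount $*"
echo "$CMD"

# Run it only when the tool and the input archive are actually present.
if command -v ngramcount >/dev/null 2>&1 && [ -f "$IN_FAR" ]; then
  ngramcount "$@"
else
  echo "skipped: ngramcount or $IN_FAR not found" >&2
fi
```

       Here --order=5 overrides the default maximal order of 3, while
       --method=counts simply restates the default for clarity.
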
       LIBRARY FLAGS:

       Flags from: flags.cc

       --help: type = bool, default = false
              show usage information

       --helpshort: type = bool, default = false
              show brief usage information

       --tmpdir: type = string, default = "/tmp"
              temporary directory

       --v: type = int32, default = 0
              verbosity level

       Flags from: fst.cc

       --fst_align: type = bool, default = false
              Write FST data aligned where appropriate

       --fst_default_cache_gc: type = bool, default = true
              Enable garbage collection of cache

       --fst_default_cache_gc_limit: type = int64, default = 1048576
              Cache byte size that triggers garbage collection

       --fst_read_mode: type = string, default = "read"
              Default file reading mode for mappable files

       --fst_verify_properties: type = bool, default = false
              Verify FST properties queried by TestProperties

       --save_relabel_ipairs: type = string, default = ""
              Save input relabel pairs to file

       --save_relabel_opairs: type = string, default = ""
              Save output relabel pairs to file

       Flags from: ngram-output.cc

       --end_symbol: type = string, default = "</s>"
              Class label for sentence end

       --start_symbol: type = string, default = "<s>"
              Class label for sentence start

       Flags from: symbol-table.cc

       --fst_compat_symbols: type = bool, default = true
              Require symbol tables to match when appropriate

       --fst_field_separator: type = string, default = "      "
              Set of characters used as a separator between printed fields

       Flags from: util.cc

       --fst_error_fatal: type = bool, default = true
              FST errors are fatal if true; otherwise returns objects flagged
              as bad: e.g., FSTs: kError property set, FST weights: not a
              Member()

       --ngram_error_fatal: type = bool, default = true
              NGram errors are fatal if true; otherwise returns objects
              flagged as bad: e.g., NGramModel::Error() is true

       Flags from: weight.cc

       --fst_weight_parentheses: type = string, default = ""
              Characters enclosing the first weight of a printed composite
              weight (e.g., pair weight, tuple weight and derived classes) to
              ensure proper I/O of nested composite weights; must have size 0
              (none) or 2 (open and close parenthesis)

       --fst_weight_separator: type = string, default = ","
              Character separator between printed composite weights; must be
              a single character

ngramcount 1.3.4                 January 2018                    NGRAMCOUNT(1)