DAWG2WORDLIST(1)                                              DAWG2WORDLIST(1)

NAME

       dawg2wordlist - convert a Tesseract DAWG to a wordlist

SYNOPSIS

       dawg2wordlist UNICHARSET DAWG WORDLIST

DESCRIPTION

       dawg2wordlist(1) converts a Tesseract Directed Acyclic Word Graph
       (DAWG) to a list of words, using a unicharset as the key.

OPTIONS

       UNICHARSET The unicharset of the language. This is the unicharset
       generated by mftraining(1).

       DAWG The input DAWG, created by wordlist2dawg(1).

       WORDLIST The output file: plain text in UTF-8, one word per line.
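
EXAMPLES

       The argument order in SYNOPSIS and the output format described under
       OPTIONS can be illustrated with a small Python wrapper. This is a
       minimal sketch, not part of Tesseract: the helper names
       build_command, read_wordlist and convert are hypothetical, and it
       assumes the dawg2wordlist binary is on PATH.

```python
import subprocess
from pathlib import Path


def build_command(unicharset, dawg, wordlist):
    """Return the argument vector in the order shown in SYNOPSIS."""
    return ["dawg2wordlist", str(unicharset), str(dawg), str(wordlist)]


def read_wordlist(path):
    """Read a UTF-8 wordlist (one word per line), skipping blank lines."""
    with open(path, encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle if line.strip()]


def convert(unicharset, dawg, wordlist):
    """Run dawg2wordlist and return the words it wrote to `wordlist`.

    Assumes the dawg2wordlist executable is installed and on PATH;
    raises subprocess.CalledProcessError if it exits with an error.
    """
    subprocess.run(build_command(unicharset, dawg, wordlist), check=True)
    return read_wordlist(wordlist)
```

       A call such as convert("eng.unicharset", "eng.word-dawg",
       "eng.wordlist.txt") would then return the extracted words as a list.
       The file names are placeholders; substitute the files for your own
       language.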

SEE ALSO

       tesseract(1), mftraining(1), wordlist2dawg(1), unicharset(5),
       combine_tessdata(1)

       https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.html

COPYING

       Copyright (C) 2012 Google, Inc. Licensed under the Apache License,
       Version 2.0.

AUTHOR

       The Tesseract OCR engine was written by Ray Smith and his research
       groups at Hewlett Packard (1985-1995) and Google (2006-present).

                                  09/23/2022                  DAWG2WORDLIST(1)