1PYTHON3-HTML2TEXT(1) User Commands PYTHON3-HTML2TEXT(1)
2
3
4
6 python3-html2text - manual page for python3-html2text 2020.1.16
7
9 usage: python3-html2text [-h] [--default-image-alt DEFAULT_IMAGE_ALT]
10
11 [--pad-tables] [--no-wrap-links] [--wrap-list-items]
12 [--ignore-emphasis] [--reference-links] [--ignore-links] [--pro‐
13 tect-links] [--ignore-images] [--images-as-html] [--im‐
14 ages-to-alt] [--images-with-size] [-g] [-d] [-e] [-b BODY_WIDTH]
15 [-i LIST_INDENT] [-s] [--escape-all] [--bypass-tables] [--ig‐
16 nore-tables] [--single-line-break] [--unicode-snob] [--no-auto‐
17 matic-links] [--no-skip-internal-links] [--links-after-para]
18 [--mark-code] [--decode-errors DECODE_ERRORS] [--open-quote
19 OPEN_QUOTE] [--close-quote CLOSE_QUOTE] [--version] [filename]
20 [encoding]
21
22 positional arguments:
23 filename encoding
24
25 options:
26 -h, --help
27 show this help message and exit
28
29 --default-image-alt DEFAULT_IMAGE_ALT
30 The default alt string for images with missing ones
31
32 --pad-tables
33 pad the cells to equal column width in tables
34
35 --no-wrap-links
36 don't wrap links during conversion
37
38 --wrap-list-items
39 wrap list items during conversion
40
41 --ignore-emphasis
42 don't include any formatting for emphasis
43
44 --reference-links
45 use reference style links instead of inline links
46
47 --ignore-links
48 don't include any formatting for links
49
50 --protect-links
51 protect links from line breaks surrounding them with angle
52 brackets
53
54 --ignore-images
55 don't include any formatting for images
56
57 --images-as-html
58 Always write image tags as raw html; preserves `height`, `width`
59 and `alt` if possible.
60
61 --images-to-alt
62 Discard image data, only keep alt text
63
64 --images-with-size
65 Write image tags with height and width attrs as raw html to re‐
66 tain dimensions
67
68 -g, --google-doc
69 convert an html-exported Google Document
70
71 -d, --dash-unordered-list
72 use a dash rather than a star for unordered list items
73
74 -e, --asterisk-emphasis
75 use an asterisk rather than an underscore for emphasized text
76
77 -b BODY_WIDTH, --body-width BODY_WIDTH
78 number of characters per output line, 0 for no wrap
79
80 -i LIST_INDENT, --google-list-indent LIST_INDENT
81 number of pixels Google indents nested lists
82
83 -s, --hide-strikethrough
84 hide strike-through text. only relevant when -g is specified as
85 well
86
87 --escape-all
88 Escape all special characters. Output is less readable, but
89 avoids corner case formatting issues.
90
91 --bypass-tables
92 Format tables in HTML rather than Markdown syntax.
93
94 --ignore-tables
95 Ignore table-related tags (table, th, td, tr) while keeping
96 rows.
97
98 --single-line-break
99 Use a single line break after a block element rather than two
100 line breaks. NOTE: Requires --body-width=0
101
102 --unicode-snob
103 Use unicode throughout document
104
105 --no-automatic-links
106 Do not use automatic links wherever applicable
107
108 --no-skip-internal-links
109 Do not skip internal links
110
111 --links-after-para
112 Put links after each paragraph instead of document
113
114 --mark-code
115 Mark program code blocks with [code]...[/code]
116
117 --decode-errors DECODE_ERRORS
118 What to do in case of decode errors.'ignore', 'strict' and 're‐
119 place' are acceptable values
120
121 --open-quote OPEN_QUOTE
122 The character used to open quotes
123
124 --close-quote CLOSE_QUOTE
125 The character used to close quotes
126
127 --version
128 show program's version number and exit
129
130
131
132python3-html2text 2020.1.16 January 2023 PYTHON3-HTML2TEXT(1)