1PDFTOHTML(1) General Commands Manual PDFTOHTML(1)
2
3
4
6 pdftohtml - program to convert pdf files into html, xml and png images
7
9 pdftohtml [options] <PDF-file> [<html-file> <xml-file>]
10
12 This manual page documents briefly the pdftohtml command. This manual
13 page was written for the Debian GNU/Linux distribution because the
14 original program does not have a manual page.
15
16 pdftohtml is a program that converts pdf documents into html. It gener‐
17 ates its output in the current working directory.
18
20 A summary of options are included below.
21
22 -h, -help
23 Show summary of options.
24
25 -f <int>
26 first page to print
27
28 -l <int>
29 last page to print
30
31 -q dont print any messages or errors
32
33 -v print copyright and version info
34
35 -p exchange .pdf links with .html
36
37 -c generate complex output
38
39 -i ignore images
40
41 -noframes
42 generate no frames. Not supported in complex output mode.
43
44 -stdout
45 use standard output
46
47 -zoom <fp>
48 zoom the pdf document (default 1.5)
49
50 -xml output for XML post-processing
51
52 -enc <string>
53 output text encoding name
54
55 -opw <string>
56 owner password (for encrypted files)
57
58 -upw <string>
59 user password (for encrypted files)
60
61 -hidden
62 force hidden text extraction
63
64 -dev output device name for Ghostscript (png16m, jpeg etc)
65
66 -nomerge
67 do not merge paragraphs
68
69 -nodrm override document DRM settings
70
71
73 Pdftohtml was developed by Gueorgui Ovtcharov and Rainer Dorsch. It is
74 based and benefits a lot from Derek Noonburg's xpdf package.
75
76 This manual page was written by Søren Boll Overgaard <boll@debian.org>,
77 for the Debian GNU/Linux system (but may be used by others).
78
79
80
81 PDFTOHTML(1)