1DUMPPDF(1) User Commands DUMPPDF(1)
2
3
4
6 dumppdf – extract pdf structure in XML format
7
9 dumppdf [-h] [--version] [--debug] [--extract-toc | --extract-embed‐
10 ded EXTRACT_EMBEDDED] [--page-numbers PAGE_NUMBERS [PAGE_NUMBERS ...]]
11 [--pagenos PAGENOS] [--objects OBJECTS] [--all] [--password PASSWORD]
12 [--outfile OUTFILE] [--raw-stream | --binary-stream | --text-stream]
13 files [files ...]
14
16 POSITIONAL ARGUMENTS
17 files One or more paths to PDF files.
18
19 OPTIONAL ARGUMENTS
20 -h, --help
21 Show a help message and exit.
22
23 --version, -v
24 Show program’s version number and exit.
25
26 --debug, -d
27 Use debug logging level.
28
29 --extract-toc, -T
30 Extract structure of outline
31
32 --extract-embedded EXTRACT_EMBEDDED, -E EXTRACT_EMBEDDED
33 Extract embedded files
34
35 PARSER
36 Used during PDF parsing
37
38 --page-numbers PAGE_NUMBERS [PAGE_NUMBERS ...]
39 A space-seperated list of page numbers to parse.
40
41 --pagenos PAGENOS, -p PAGENOS
42 A comma-separated list of page numbers to parse. Included for
43 legacy applications; use --page-numbers for more idiomatic argu‐
44 ment entry.
45
46 --objects OBJECTS, -i OBJECTS
47 Comma separated list of object numbers to extract
48
49 --all, -a
50 If the structure of all objects should be extracted
51
52 --password PASSWORD, -P PASSWORD
53 The password to use for decrypting PDF file.
54
55 OUTPUT
56 Used during output generation.
57
58 --outfile OUTFILE, -o OUTFILE
59 Path to file where output is written. Or “-” (default) to write
60 to stdout.
61
62 --raw-stream, -r
63 Write stream objects without encoding
64
65 --binary-stream, -b
66 Write stream objects with binary encoding
67
68 --text-stream, -t
69 Write stream objects as plain text
70
72 pdf2txt(1)
73
74
75
76 October 2021 DUMPPDF(1)