DUMPPDF(1)                       User Commands                      DUMPPDF(1)

NAME

       dumppdf - extract PDF structure in XML format

SYNOPSIS

       dumppdf [-h] [--version] [--debug]
               [--extract-toc | --extract-embedded EXTRACT_EMBEDDED]
               [--page-numbers PAGE_NUMBERS [PAGE_NUMBERS ...]]
               [--pagenos PAGENOS] [--objects OBJECTS] [--all]
               [--password PASSWORD] [--outfile OUTFILE]
               [--raw-stream | --binary-stream | --text-stream]
               files [files ...]

OPTIONS

   POSITIONAL ARGUMENTS
       files  One or more paths to PDF files.

   OPTIONAL ARGUMENTS
       -h, --help
              Show a help message and exit.

       --version, -v
              Show program’s version number and exit.

       --debug, -d
              Use debug logging level.

       --extract-toc, -T
              Extract the structure of the document outline.

       --extract-embedded EXTRACT_EMBEDDED, -E EXTRACT_EMBEDDED
              Extract embedded files.

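       A minimal sketch of outline extraction, using --extract-toc together
       with --outfile from the OUTPUT group. The names sample.pdf and
       toc.xml are placeholders chosen for illustration, not files that
       dumppdf provides. The guard prints the command instead of running it
       when dumppdf or the input file is missing.

```shell
pdf="sample.pdf"   # placeholder input file (assumption)
out="toc.xml"      # placeholder output file (assumption)

# --extract-toc dumps the outline; --outfile sends the XML to a file.
set -- --extract-toc --outfile "$out" "$pdf"

# Run only when dumppdf is installed and the input exists; otherwise show it.
if command -v dumppdf >/dev/null 2>&1 && [ -f "$pdf" ]; then
    dumppdf "$@"
else
    echo "would run: dumppdf $*"
fi
```

       The synopsis lists --extract-toc and --extract-embedded as
       alternatives, so only one of the two should appear per invocation.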
   PARSER
       Options used during PDF parsing.

       --page-numbers PAGE_NUMBERS [PAGE_NUMBERS ...]
              A space-separated list of page numbers to parse.

       --pagenos PAGENOS, -p PAGENOS
              A comma-separated list of page numbers to parse. Included for
              legacy applications; use --page-numbers for more idiomatic
              argument entry.

       --objects OBJECTS, -i OBJECTS
              A comma-separated list of object numbers to extract.

       --all, -a
              Extract the structure of all objects.

       --password PASSWORD, -P PASSWORD
              The password to use for decrypting the PDF file.

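       The two page-selection forms can be sketched as follows. The file
       name sample.pdf and the page numbers 1 and 3 are illustrative
       assumptions. The input file is placed before --page-numbers on the
       assumption that, since the option accepts several values, a trailing
       file name could otherwise be read as one more page number.

```shell
pdf="sample.pdf"   # placeholder input file (assumption)

# Space-separated form: select pages 1 and 3 (illustrative values).
# The file comes first so it is not taken as a PAGE_NUMBERS value.
set -- "$pdf" --page-numbers 1 3

# Run only when dumppdf is installed and the input exists; otherwise show it.
if command -v dumppdf >/dev/null 2>&1 && [ -f "$pdf" ]; then
    dumppdf "$@"
else
    echo "would run: dumppdf $*"
fi

# Legacy comma-separated form of the same selection:
#   dumppdf --pagenos 1,3 sample.pdf
```

       For an encrypted file, --password PASSWORD can be added to either
       form.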
   OUTPUT
       Options used during output generation.

       --outfile OUTFILE, -o OUTFILE
              Path to the file where output is written, or “-” (default) to
              write to stdout.

       --raw-stream, -r
              Write stream objects without encoding.

       --binary-stream, -b
              Write stream objects with binary encoding.

       --text-stream, -t
              Write stream objects as plain text.
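
       As a sketch of the output options, the following dumps every object
       and writes stream contents as plain text to a file. The names
       sample.pdf and objects.xml are placeholders chosen for illustration.

```shell
pdf="sample.pdf"    # placeholder input file (assumption)
out="objects.xml"   # placeholder output file (assumption)

# --all selects every object; --text-stream writes streams as plain text;
# --outfile redirects the result from stdout to $out.
set -- --all --text-stream --outfile "$out" "$pdf"

# Run only when dumppdf is installed and the input exists; otherwise show it.
if command -v dumppdf >/dev/null 2>&1 && [ -f "$pdf" ]; then
    dumppdf "$@"
else
    echo "would run: dumppdf $*"
fi
```

       The synopsis lists --raw-stream, --binary-stream and --text-stream as
       alternatives, so at most one of them should be given. Omitting
       --outfile leaves the output on stdout, where it can be piped to
       another program.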

SEE ALSO

       pdf2txt(1)

                                 October 2021                       DUMPPDF(1)