HXEXTRACT(1)                   HTML-XML-utils                   HXEXTRACT(1)

NAME
       hxextract - extract selected elements from an HTML or XML file

SYNOPSIS
       hxextract [ -h | -? ] [ -x ] [ -s text ] [ -e text ] [ -b base ]
       element-or-class [ -c configfile | file-or-URL ]

DESCRIPTION
       hxextract outputs all elements with a certain name and/or class.

       Input must be well-formed, since no HTML heuristics are applied.
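
       As an illustration of what "well-formed" means here, the sketch
       below uses Python's standard XML parser as a stand-in. This is an
       assumption made for the example only: hxextract has its own parser,
       and the function name is_well_formed is hypothetical. The point is
       that an element left unclosed, which a browser would silently
       repair, counts as an error.

```python
import xml.etree.ElementTree as ET


def is_well_formed(markup: str) -> bool:
    """Return True if markup parses as well-formed XML (stand-in check)."""
    try:
        ET.fromstring(markup)
    except ET.ParseError:
        return False
    return True


# Every start tag has a matching end tag.
print(is_well_formed('<div><p class="example">text</p></div>'))  # True

# The <p> element is never closed; no heuristic repairs it.
print(is_well_formed('<div><p class="example">text</div>'))  # False
```

       A strict XML parser may reject some input that hxextract accepts
       without -x, so treat this only as a rough pre-check.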

OPTIONS
       The following options are supported:

       -x        Use XML format conventions.

       -s text   Insert text at the start of the output.

       -e text   Insert text at the end of the output.

       -b base   URL base.

       -c configfile
                 Read @chapter lines from configfile (lines must be of the
                 form "@chapter filename") and extract elements from each of
                 those files.

       -h, -?    Print command usage.
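
       The line format expected by -c can be pictured with a short Python
       sketch. It is not hxextract's own code: the function name
       chapter_files and the sample file names are hypothetical, and
       skipping lines that do not start with "@chapter" is an assumption,
       since this page does not say how such lines are treated.

```python
from typing import List


def chapter_files(config_text: str) -> List[str]:
    """Collect the filename from each '@chapter filename' line."""
    files = []
    for line in config_text.splitlines():
        parts = line.split(None, 1)  # keyword, then the rest of the line
        if len(parts) == 2 and parts[0] == "@chapter":
            files.append(parts[1].strip())
        # Assumption: any other line is skipped.
    return files


sample = "@chapter intro.html\n@chapter usage.html\n"  # hypothetical file names
print(chapter_files(sample))  # ['intro.html', 'usage.html']
```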

OPERANDS
       The following operands are supported:

       element-or-class
                 The name of an element to extract (e.g., "H2"), or the name
                 of a class preceded by "." (e.g., ".example") or a
                 combination of both (e.g., "H2.example").

       file-or-URL
                 A file name or a URL. To read from standard input, use "-".
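
       The three forms of element-or-class can be told apart as in the
       Python sketch below. The function name split_operand is hypothetical
       and splitting at the first "." is an assumption; the sketch only
       mirrors the three examples given above and is not taken from
       hxextract itself.

```python
from typing import Optional, Tuple


def split_operand(operand: str) -> Tuple[Optional[str], Optional[str]]:
    """Split 'H2', '.example' or 'H2.example' into (element, class)."""
    name, dot, cls = operand.partition(".")
    # An empty part means that half of the operand was not given.
    return (name or None, cls if dot and cls else None)


print(split_operand("H2"))          # ('H2', None)
print(split_operand(".example"))    # (None, 'example')
print(split_operand("H2.example"))  # ('H2', 'example')
```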

SEE ALSO
       hxselect(1)

5.x                              21 Nov 2008                    HXEXTRACT(1)