1man(1)                      General Commands Manual                     man(1)
2
3
4

NAME

6       normalize - pretty-print an HTML file
7

SYNOPSIS

9       normalize  [ -x ] [ -e ] [ -d ] [ -i indent ] [ -l line-length ] [ file
10       ]
11

DESCRIPTION

13       The normalize command pretty-prints an HTML file, and also tries to fix
14       small  errors.  The  output  is  the same HTML, but with a maximum line
15       length and with optional indentation to indicate the nesting  level  of
16       each line.
17

OPTIONS

19       The following options are supported:
20
21       -x        Use  XML conventions: empty elements are written with a slash
22                 at the end: <IMG />. Implies -e.
23
24       -e        Always insert endtags, even if HTML does  not  require  them.
25                 (For example, </p> and </li>.)
26
27       -d        Omit the DOCTYPE from the output.
28
29       -i indent Set  the  number  of  spaces  to  indent  each nesting level.
30                 Default is 2.  Not all elements cause an indent. In  general,
31                 elements that can occur in a block environment are started on
32                 a new line and cause an indent, but inline elements, such  as
33                 EM and SPAN do not cause an indent.
34
35       -l line-length
36                 Sets  the maximum length of lines.  normalize will wrap lines
37                 so that all lines are as long as possible, but no longer than
38                 this  length.  Default  is 72. Words that are longer than the
39                 line length will not be broken, and  will  extend  past  this
40                 length. A
41
42                 content  of  the  STYLE,  SCRIPT and PRE elements will not be
43                 line-wrapped.
44

OPERANDS

46       The following operand is supported:
47
48       file      The name of an HTML file. If absent, standard input  is  read
49                 instead.
50

EXIT STATUS

52       The following exit values are returned:
53
54       0         Successful completion.
55
56       >0        An error occurred in the parsing of the HTML file.  normalize
57                 will try to correct the error and produce output anyway.
58

SEE ALSO

60       xml2asc(1), UTF-8 (RFC 2279)
61

BUGS

63       The error recovery for incorrect HTML is primitive.
64
65       normalize will not omit an end tag if the white space  after  it  could
66       possibly  be  significant. E.g., it will not remove the first </p> from
67       "<div><p>text</p> <p>text</p></div>".
68
69
70
71                                  2 Dec 1998                            man(1)
Impressum