1HTML::Tree(3)         User Contributed Perl Documentation        HTML::Tree(3)
2
3
4

NAME

6       HTML::Tree - build and scan parse-trees of HTML
7

VERSION

9       4.1
10

SYNOPSIS

12           use HTML::TreeBuilder;
13           my $tree = HTML::TreeBuilder->new();
14           $tree->parse_file($filename);
15
16               # Then do something with the tree, using HTML::Element
17               # methods -- for example:
18
19           $tree->dump
20
21               # Finally:
22
23           $tree->delete;
24

DESCRIPTION

26       HTML-Tree is a suite of Perl modules for making parse trees out of HTML
27       source.  It consists of mainly two modules, whose documentation you
28       should refer to: HTML::TreeBuilder and HTML::Element.
29
30       HTML::TreeBuilder is the module that builds the parse trees.  (It uses
31       HTML::Parser to do the work of breaking the HTML up into tokens.)
32
33       The tree that TreeBuilder builds for you is made up of objects of the
34       class HTML::Element.
35
36       If you find that you do not properly understand the documentation for
37       HTML::TreeBuilder and HTML::Element, it may be because you are
38       unfamiliar with tree-shaped data structures, or with object-oriented
39       modules in general. Sean Burke has written some articles for The Perl
40       Journal ("www.tpj.com") that seek to provide that background.  The full
41       text of those articles is contained in this distribution, as:
42
43       HTML::Tree::AboutObjects
44           "User's View of Object-Oriented Modules" from TPJ17.
45
46       HTML::Tree::AboutTrees
47           "Trees" from TPJ18
48
49       HTML::Tree::Scanning
50           "Scanning HTML" from TPJ19
51
52       Readers already familiar with object-oriented modules and tree-shaped
53       data structures should read just the last article.  Readers without
54       that background should read the first, then the second, and then the
55       third.
56
57   new
58       Redirects to HTML::TreeBuilder::new
59
60   new_from_file
61       Redirects to HTML::TreeBuilder::new_from_file
62
63   new_from_content
64       Redirects to HTML::TreeBuilder::new_from_content
65

SUPPORT

67       You can find documentation for this module with the perldoc command.
68
69           perldoc HTML::Tree
70
71           You can also look for information at:
72
73       ·   AnnoCPAN: Annotated CPAN documentation
74
75           http://annocpan.org/dist/HTML-Tree <http://annocpan.org/dist/HTML-
76           Tree>
77
78       ·   CPAN Ratings
79
80           http://cpanratings.perl.org/d/HTML-Tree
81           <http://cpanratings.perl.org/d/HTML-Tree>
82
83       ·   RT: CPAN's request tracker
84
85           http://rt.cpan.org/NoAuth/Bugs.html?Dist=HTML-Tree
86           <http://rt.cpan.org/NoAuth/Bugs.html?Dist=HTML-Tree>
87
88       ·   Search CPAN
89
90           http://search.cpan.org/dist/HTML-Tree
91           <http://search.cpan.org/dist/HTML-Tree>
92

SEE ALSO

94       HTML::TreeBuilder, HTML::Element, HTML::Tagset, HTML::Parser,
95       HTML::DOMbo
96
97       The book Perl & LWP by Sean M. Burke published by O'Reilly and
98       Associates, 2002.  ISBN: 0-596-00178-9
99
100       It has several chapters to do with HTML processing in general, and
101       HTML-Tree specifically.  There's more info at:
102
103           http://www.oreilly.com/catalog/perllwp/
104
105           http://www.amazon.com/exec/obidos/ASIN/0596001789
106

SOURCE REPOSITORY

108       HTML::Tree is maintained in Subversion hosted at perl.org.
109
110           http://svn.perl.org/modules/HTML-Tree
111
112       The latest development work is always at:
113
114           http://svn.perl.org/modules/HTML-Tree/trunk
115
116       Any patches sent should be diffed against this repository.
117

ACKNOWLEDGEMENTS

119       Thanks to Gisle Aas, Sean Burke and Andy Lester for their original
120       work.
121
122       Thanks to Chicago Perl Mongers (http://chicago.pm.org) for their
123       patches submitted to HTML::Tree as part of the Phalanx project
124       (http://qa.perl.org/phalanx).
125
126       Thanks to the following people for additional patches and
127       documentation: Terrence Brannon, Gordon Lack, Chris Madsen and Ricardo
128       Signes.
129

AUTHOR

131       Original HTML-Tree author Gisle Aas.  Handed off to Sean M. Burke.  and
132       Andy Lester.  Currently maintained by Pete Krawczyk "<petek@cpan.org>".
133
135       Copyright 1995-1998 Gisle Aas; 1999-2004 Sean M. Burke; 2005 Andy
136       Lester; 2006 Pete Krawczyk.  (Except the articles contained in
137       HTML::Tree::AboutObjects, HTML::Tree::AboutTrees, and
138       HTML::Tree::Scanning, which are all copyright 2000 The Perl Journal.)
139
140       Except for those three TPJ articles, the whole HTML-Tree distribution,
141       of which this file is a part, is free software; you can redistribute it
142       and/or modify it under the same terms as Perl itself.
143
144       Those three TPJ articles may be distributed under the same terms as
145       Perl itself.
146
147       The programs in this library are distributed in the hope that they will
148       be useful, but without any warranty; without even the implied warranty
149       of merchantability or fitness for a particular purpose.
150
151
152
153perl v5.12.2                      2010-12-20                     HTML::Tree(3)
Impressum