Automatically generated by Pod::Man 2.28 (Pod::Simple 3.28) Standard preamble: ========================================================================
NAMEHTML::Tree - build and scan parse-trees of HTML
VERSIONThis document describes version 5.03 of HTML::Tree, released September 22, 2012 as part of HTML-Tree.
use HTML::TreeBuilder; my $tree = HTML::TreeBuilder->new(); $tree->parse_file($filename); # Then do something with the tree, using HTML::Element # methods -- for example: $tree->dump # Finally: $tree->delete;
DESCRIPTIONHTML-Tree is a suite of Perl modules for making parse trees out of
HTML::TreeBuilder is the module that builds the parse trees. (It uses HTML::Parser to do the work of breaking the
The tree that TreeBuilder builds for you is made up of objects of the class HTML::Element.
If you find that you do not properly understand the documentation for HTML::TreeBuilder and HTML::Element, it may be because you are unfamiliar with tree-shaped data structures, or with object-oriented modules in general. Sean Burke has written some articles for The Perl Journal ("www.tpj.com") that seek to provide that background. The full text of those articles is contained in this distribution, as:
``User's View of Object-Oriented Modules'' from TPJ17.
``Trees'' from TPJ18
Readers already familiar with object-oriented modules and tree-shaped data structures should read just the last article. Readers without that background should read the first, then the second, and then the third.
METHODSAll these methods simply redirect to the corresponding method in HTML::TreeBuilder. It's more efficient to use HTML::TreeBuilder directly, and skip loading HTML::Tree at all.
newRedirects to ``new'' in HTML::TreeBuilder.
new_from_fileRedirects to ``new_from_file'' in HTML::TreeBuilder.
new_from_contentRedirects to ``new_from_content'' in HTML::TreeBuilder.
new_from_urlRedirects to ``new_from_url'' in HTML::TreeBuilder.
SUPPORTYou can find documentation for this module with the perldoc command.
perldoc HTML::Tree You can also look for information at:
AnnoCPAN: Annotated CPANdocumentation
RT: CPAN's request tracker
If you have a question about how to use HTML-Tree, Stack Overflow is the place to ask it. Make sure you tag it both "perl" and "html-tree".
SEE ALSOHTML::TreeBuilder, HTML::Element, HTML::Tagset, HTML::Parser, HTML::DOMbo
The book Perl &
It has several chapters to do with
SOURCE REPOSITORYHTML-Tree is now maintained using Git. The main public repository is <github.com/madsen/HTML-Tree>.
The best way to send a patch is to make a pull request there.
ACKNOWLEDGEMENTSThanks to Gisle Aas, Sean Burke and Andy Lester for their original work.
Thanks to the following people for additional patches and documentation: Terrence Brannon, Gordon Lack, Chris Madsen and Ricardo Signes.
- Christopher J. Madsen "<perl AT cjmweb.net>"
- Jeff Fearn "<jfearn AT cpan.org>"
Original HTML-Tree author:
- Gisle Aas
- Sean M. Burke
- Andy Lester
- Pete Krawczyk "<petek AT cpan.org>"
You can follow or contribute to HTML-Tree's development at <github.com/madsen/HTML-Tree>.
COPYRIGHT AND LICENSECopyright 1995-1998 Gisle Aas, 1999-2004 Sean M. Burke, 2005 Andy Lester, 2006 Pete Krawczyk, 2010 Jeff Fearn, 2012 Christopher J. Madsen. (Except the articles contained in HTML::Tree::AboutObjects, HTML::Tree::AboutTrees, and HTML::Tree::Scanning, which are all copyright 2000 The Perl Journal.)
Except for those three
The programs in this library are distributed in the hope that they will be useful, but without any warranty; without even the implied warranty of merchantability or fitness for a particular purpose.