1XML::Fast(3) User Contributed Perl Documentation XML::Fast(3)
2
3
4
6 XML::Fast - Simple and very fast XML - hash conversion
7
9 use XML::Fast;
10
11 my $hash = xml2hash $xml;
12 my $hash2 = xml2hash $xml, attr => '.', text => '~';
13
15 This module implements simple, state machine based, XML parser written
16 in C.
17
18 It could parse and recover some kind of broken XML's. If you need XML
19 validator, use XML::LibXML
20
22 Another similar module is XML::Bare. I've used it for some time, but it
23 have some failures:
24
25 • If your XML have node with TextNode, then CDATANode, then again
26 TextNode, you'll got broken value
27
28 • It doesn't support charsets
29
30 • It doesn't support any kind of entities.
31
32 So, after count of tries to fix XML::Bare I've decided to write parser
33 from scratch.
34
35 Here is some features and principles:
36
37 • It uses minimal count of memory allocations.
38
39 • All XML is parsed in 1 scan.
40
41 • All values are copied from source XML only once (to destination
42 keys/values)
43
44 • If some types of nodes (for ex comments) are ignored, there are no
45 memory allocations/copy for them.
46
47 I've removed benchmark results, since they are very different for
48 different xml's. Sometimes XML::Bare is faster, sometimes not. So,
49 XML::Fast mainly should be considered not "faster-than-bare", but
50 "format-other-than-bare"
51
53 xml2hash $xml, [ %options ]
54 hash2xml $hash, [ %options ]
56 order [ = 0 ]
57 Not implemented yet. Strictly keep the output order. When enabled,
58 structures become more complex, but xml could be completely
59 reverted.
60
61 attr [ = '-' ]
62 Attribute prefix
63
64 <node attr="test" /> => { node => { -attr => "test" } }
65
66 text [ = '#text' ]
67 Key name for storing text
68
69 When undef, text nodes will be ignored
70
71 <node>text<sub /></node> => { node => { sub => '', '#text' => "test" } }
72
73 join [ = '' ]
74 Join separator for text nodes, splitted by subnodes
75
76 Ignored when "order" in effect
77
78 # default:
79 xml2hash( '<item>Test1<sub />Test2</item>' )
80 : { item => { sub => '', '~' => 'Test1Test2' } };
81
82 xml2hash( '<item>Test1<sub />Test2</item>', join => '+' )
83 : { item => { sub => '', '~' => 'Test1+Test2' } };
84
85 trim [ = 1 ]
86 Trim leading and trailing whitespace from text nodes
87
88 cdata [ = undef ]
89 When defined, CDATA sections will be stored under this key
90
91 # cdata = undef
92 <node><![CDATA[ test ]]></node> => { node => 'test' }
93
94 # cdata = '#'
95 <node><![CDATA[ test ]]></node> => { node => { '#' => 'test' } }
96
97 comm [ = undef ]
98 When defined, comments sections will be stored under this key
99
100 When undef, comments will be ignored
101
102 # comm = undef
103 <node><!-- comm --><sub/></node> => { node => { sub => '' } }
104
105 # comm = '/'
106 <node><!-- comm --><sub/></node> => { node => { sub => '', '/' => 'comm' } }
107
108 array => 1
109 Force all nodes to be kept as arrays.
110
111 # no array
112 <node><sub/></node> => { node => { sub => '' } }
113
114 # array = 1
115 <node><sub/></node> => { node => [ { sub => [ '' ] } ] }
116
117 array => [ 'node', 'names']
118 Force nodes with names to be stored as arrays
119
120 # no array
121 <node><sub/></node> => { node => { sub => '' } }
122
123 # array => ['sub']
124 <node><sub/></node> => { node => { sub => [ '' ] } }
125
126 utf8decode => 1
127 Force decoding of utf8 sequences, instead of just upgrading them
128 (may be useful for broken xml)
129
131 • XML::Bare
132
133 Another fast parser
134
135 • XML::LibXML
136
137 The most powerful XML parser for perl. If you don't need to parse
138 gigabytes of XML ;)
139
140 • XML::Hash::LX
141
142 XML parser, that uses XML::LibXML for parsing and then constructs
143 hash structure, identical to one, generated by this module. (At
144 least, it should ;)). But of course it is much more slower, than
145 XML::Fast
146
148 • Does not support wide charsets (UTF-16/32) (see RT71534
149 <https://rt.cpan.org/Ticket/Display.html?id=71534>)
150
152 • Ordered mode (as implemented in XML::Hash::LX)
153
154 • Create hash2xml, identical to one in XML::Hash::LX
155
156 • Partial content event-based parsing (I need this for reading XML
157 streams)
158
159 Patches, propositions and bug reports are welcome ;)
160
162 Mons Anderson, <mons@cpan.org>
163
165 Copyright (C) 2010 Mons Anderson
166
167 This library is free software; you can redistribute it and/or modify it
168 under the same terms as Perl itself.
169
170
171
172perl v5.34.0 2022-01-21 XML::Fast(3)