1PERLUNIPROPS(1)        Perl Programmers Reference Guide        PERLUNIPROPS(1)
2
3
4

NAME

6       perluniprops - Index of Unicode Version 5.2.0 properties in Perl
7

DESCRIPTION

9       There are many properties in Unicode, and Perl provides access to
10       almost all of them, as well as some additional extensions and short-cut
11       synonyms.
12
13       And just about all of the few that aren't accessible through the Perl
14       core are accessible through the modules: Unicode::Normalize and
15       Unicode::UCD, and for Unihan properties, via the CPAN module
16       Unicode::Unihan.
17
18       This document merely lists all available properties and does not
19       attempt to explain what each property really means.  There is a brief
20       description of each Perl extension.  There is some detail about Blocks,
21       Scripts, General_Category, and Bidi_Class in perlunicode, but to find
22       out about the intricacies of the Unicode properties, refer to the
23       Unicode standard.  A good starting place is
24       <http://www.unicode.org/reports/tr44/>.  More information on the Perl
25       extensions is in perlrecharclass.
26
27       Note that you can define your own properties; see "User-Defined
28       Character Properties" in perlunicode.
29

Properties accessible through \p{} and \P{}

31       The Perl regular expression \p{} and \P{} constructs give access to
32       most of the Unicode character properties.  The table below shows all
33       these constructs, both single and compound forms.
34
35       Compound forms consist of two components, separated by an equals sign
36       or a colon.  The first component is the property name, and the second
37       component is the particular value of the property to match against, for
38       example, '\p{Script: Greek}' or '\p{Script=Greek}' both mean to match
39       characters whose Script property is Greek.
40
41       Single forms, like '\p{Greek}', are mostly Perl-defined shortcuts for
42       their equivalent compound forms.  The table shows these equivalences.
43       (In our example, '\p{Greek}' is a just a shortcut for
44       '\p{Script=Greek}'.)  There are also a few Perl-defined single forms
45       that are not shortcuts for a compound form.  One such is \p{Word}.
46       These are also listed in the table.
47
48       In parsing these constructs, Perl always ignores Upper/lower case
49       differences everywhere within the {braces}.  Thus '\p{Greek}' means the
50       same thing as '\p{greek}'.  But note that changing the case of the 'p'
51       or 'P' before the left brace completely changes the meaning of the
52       construct, from "match" (for '\p{}') to "doesn't match" (for '\P{}').
53       Casing in this document is for improved legibility.
54
55       Also, white space, hyphens, and underscores are also normally ignored
56       everywhere between the {braces}, and hence can be freely added or
57       removed even if the "/x" modifier hasn't been specified on the regular
58       expression.  But a 'T' at the beginning of an entry in the table below
59       means that tighter (stricter) rules are used for that entry:
60
61       Single form (\p{name}) tighter rules:
62           White space, hyphens, and underscores ARE significant except for:
63
64           ·   white space adjacent to a non-word character
65
66           ·   underscores separating digits in numbers
67
68           That means, for example, that you can freely add or remove white
69           space adjacent to (but within) the braces without affecting the
70           meaning.
71
72       Compound form (\p{name=value} or \p{name:value}) tighter rules:
73           The tighter rules given above for the single form apply to
74           everything to the right of the colon or equals; the looser rules
75           still apply to everything to the left.
76
77           That means, for example, that you can freely add or remove white
78           space adjacent to (but within) the braces and the colon or equal
79           sign.
80
81       Some properties are considered obsolete, but still available.  There
82       are several varieties of obsolesence:
83
84       Obsolete
85           Properties marked with an 'O' in the table are considered obsolete.
86           At the time of this writing (Unicode version 5.2) there is no
87           information in the Unicode standard about the implications of a
88           property being obsolete.
89
90       Stabilized
91           Obsolete properties may be stabilized.  This means that they are
92           not actively maintained by Unicode, and will not be extended as new
93           characters are added to the standard.  Such properties are marked
94           with an 'S' in the table.  At the time of this writing (Unicode
95           version 5.2) there is no further information in the Unicode
96           standard about the implications of a property being stabilized.
97
98       Deprecated
99           Obsolete properties may be deprecated.  This means that their use
100           is strongly discouraged, so much so that a warning will be issued
101           if used, unless the regular expression is in the scope of a
102           "no warnings 'deprecated'" statement.  A 'D' flags each such entry
103           in the table, and the entry there for the longest, most descriptive
104           version of the property will give the reason it is deprecated, and
105           perhaps advice.  Perl may issue such a warning, even for properties
106           that aren't officially deprecated by Unicode, when there used to be
107           characters or code points that were matched by them, but no longer.
108           This is to warn you that your program may not work like it did on
109           earlier Unicode releases.
110
111           A deprecated property may be made unavailable in a future Perl
112           version, so it is best to move away from them.
113
114       Some Perl extensions are present for backwards compatibility and are
115       discouraged from being used, but not obsolete.  An 'X' flags each such
116       entry in the table.
117
118       Matches in the Block property have shortcuts that begin with 'In_'.
119       For example, \p{Block=Latin1} can be written as \p{In_Latin1}.  For
120       backward compatibility, if there is no conflict with another shortcut,
121       these may also be written as \p{Latin1} or \p{Is_Latin1}.  But, N.B.,
122       there are numerous such conflicting shortcuts.  Use of these forms for
123       Block is discouraged, and are flagged as such, not only because of the
124       potential confusion as to what is meant, but also because a later
125       release of Unicode may preempt the shortcut, and your program would no
126       longer be correct.  Use the 'In_' form instead to avoid this, or even
127       more clearly, use the compound form, e.g., \p{blk:latin1}.  See
128       "Blocks" in perlunicode for more information about this.
129
130       The table below has two columns.  The left column contains the \p{}
131       constructs to look up, possibly preceeded by the flags mentioned above;
132       and the right column contains information about them, like a
133       description, or synonyms.  It shows both the single and compound forms
134       for each property that has them.  If the left column is a short name
135       for a property, the right column will give its longer, more descriptive
136       name; and if the left column is the longest name, the right column will
137       show any equivalent shortest name, in both single and compound forms if
138       applicable.
139
140       The right column will also caution you if a property means something
141       different than what might normally be expected.
142
143       All single forms are Perl extensions; a few compound forms are as well,
144       and are noted as such.
145
146       Numbers in (parentheses) indicate the total number of code points
147       matched by the property.  For emphasis, those properties that match no
148       code points at all are listed as well in a separate section following
149       the table.
150
151       There is no description given for most non-Perl defined properties (See
152       http://www.unicode.org/reports/tr44/ for that).
153
154       For compactness, '*' is used as a wildcard instead of showing all
155       possible combinations.  For example, entries like:
156
157        \p{Gc: *}                                  \p{General_Category: *}
158
159       mean that 'Gc' is a synonym for 'General_Category', and anything that
160       is valid for the latter is also valid for the former.  Similarly,
161
162        \p{Is_*}                                   \p{*}
163
164       means that if and only if, for example, \p{Foo} exists, then \p{Is_Foo}
165       and \p{IsFoo} are also valid and all mean the same thing.  And
166       similarly, \p{Foo=Bar} means the same as \p{Is_Foo=Bar} and
167       \p{IsFoo=Bar}.  '*' here is restricted to something not beginning with
168       an underscore.
169
170       Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
171       'Y'.  And 'No', 'F', and 'False' are all synonyms for 'N'.  The table
172       shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
173       for the other possibilities.  Note that not all properties which have
174       values 'Yes' and 'No' are binary, and they have all their values
175       spelled out without using this wild card, and a "NOT" clause in their
176       description that highlights their not being binary.  These also require
177       the compound form to match them, whereas true binary properties have
178       both single and compound forms available.
179
180       Note that all non-essential underscores are removed in the display of
181       the short names below.
182
183       Summary legend:
184
185       * is a wild-card
186       (\d+) in the info column gives the number of code points matched by
187       this property.
188       D means this is deprecated.
189       O means this is obsolete.
190       S means this is stabilized.
191       T means tighter (stricter) name matching applies.
192       X means use of this form is discouraged.
193
194              NAME                           INFO
195
196        X \p{Aegean_Numbers}      \p{Block=Aegean_Numbers} (64)
197        T \p{Age: 1.1}            Code point's usage introduced in version
198                                    1.1 (33_979)
199        T \p{Age: 2.0}            Code point's usage was introduced in
200                                    version 2.0; See also Property
201                                    'Present_In' (144_521)
202        T \p{Age: 2.1}            Code point's usage was introduced in
203                                    version 2.1; See also Property
204                                    'Present_In' (2)
205        T \p{Age: 3.0}            Code point's usage was introduced in
206                                    version 3.0; See also Property
207                                    'Present_In' (10_307)
208        T \p{Age: 3.1}            Code point's usage was introduced in
209                                    version 3.1; See also Property
210                                    'Present_In' (44_978)
211        T \p{Age: 3.2}            Code point's usage was introduced in
212                                    version 3.2; See also Property
213                                    'Present_In' (1016)
214        T \p{Age: 4.0}            Code point's usage was introduced in
215                                    version 4.0; See also Property
216                                    'Present_In' (1226)
217        T \p{Age: 4.1}            Code point's usage was introduced in
218                                    version 4.1; See also Property
219                                    'Present_In' (1273)
220        T \p{Age: 5.0}            Code point's usage was introduced in
221                                    version 5.0; See also Property
222                                    'Present_In' (1369)
223        T \p{Age: 5.1}            Code point's usage was introduced in
224                                    version 5.1; See also Property
225                                    'Present_In' (1624)
226        T \p{Age: 5.2}            Code point's usage was introduced in
227                                    version 5.2; See also Property
228                                    'Present_In' (6648)
229          \p{Age: Unassigned}     Code point's usage has not been assigned
230                                    in any Unicode release thus far.
231                                    (867_169)
232          \p{AHex}                \p{ASCII_Hex_Digit} (= \p{ASCII_Hex_Digit=
233                                    Y}) (22)
234          \p{AHex: *}             \p{ASCII_Hex_Digit: *}
235          \p{All}                 \p{Any} (1_114_112)
236          \p{Alnum}               Alphabetic and (Decimal) Numeric (100_931)
237          \p{Alpha}               \p{Alphabetic=Y} (100_520)
238          \p{Alpha: *}            \p{Alphabetic: *}
239          \p{Alphabetic}          \p{Alpha} (= \p{Alphabetic=Y}) (100_520)
240          \p{Alphabetic: N*}      (Short: \p{Alpha=N}, \P{Alpha}) (1_013_592)
241          \p{Alphabetic: Y*}      (Short: \p{Alpha=Y}, \p{Alpha}) (100_520)
242        X \p{Alphabetic_Presentation_Forms} \p{Block=
243                                    Alphabetic_Presentation_Forms} (80)
244        X \p{Ancient_Greek_Musical_Notation} \p{Block=
245                                    Ancient_Greek_Musical_Notation} (80)
246        X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
247        X \p{Ancient_Symbols}     \p{Block=Ancient_Symbols} (64)
248          \p{Any}                 [\x{0000}-\x{10FFFF}] (1_114_112)
249          \p{Arab}                \p{Arabic} (= \p{Script=Arabic}) (NOT
250                                    \p{Block=Arabic}) (1030)
251          \p{Arabic}              \p{Script=Arabic} (Short: \p{Arab}; NOT
252                                    \p{Block=Arabic}) (1030)
253        X \p{Arabic_Presentation_Forms_A} \p{Block=
254                                    Arabic_Presentation_Forms_A} (688)
255        X \p{Arabic_Presentation_Forms_B} \p{Block=
256                                    Arabic_Presentation_Forms_B} (144)
257        X \p{Arabic_Supplement}   \p{Block=Arabic_Supplement} (48)
258          \p{Armenian}            \p{Script=Armenian} (Short: \p{Armn}; NOT
259                                    \p{Block=Armenian}) (90)
260          \p{Armi}                \p{Imperial_Aramaic} (= \p{Script=
261                                    Imperial_Aramaic}) (NOT \p{Block=
262                                    Imperial_Aramaic}) (31)
263          \p{Armn}                \p{Armenian} (= \p{Script=Armenian}) (NOT
264                                    \p{Block=Armenian}) (90)
265        X \p{Arrows}              \p{Block=Arrows} (112)
266          \p{ASCII}               \p{Block=Basic_Latin} [[:ASCII:]] (128)
267          \p{ASCII_Hex_Digit}     \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
268                                    (22)
269          \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090)
270          \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22)
271          \p{Assigned}            All assigned code points (246_877)
272          \p{Avestan}             \p{Script=Avestan} (Short: \p{Avst}; NOT
273                                    \p{Block=Avestan}) (61)
274          \p{Avst}                \p{Avestan} (= \p{Script=Avestan}) (NOT
275                                    \p{Block=Avestan}) (61)
276          \p{Bali}                \p{Balinese} (= \p{Script=Balinese}) (NOT
277                                    \p{Block=Balinese}) (121)
278          \p{Balinese}            \p{Script=Balinese} (Short: \p{Bali}; NOT
279                                    \p{Block=Balinese}) (121)
280          \p{Bamu}                \p{Bamum} (= \p{Script=Bamum}) (NOT
281                                    \p{Block=Bamum}) (88)
282          \p{Bamum}               \p{Script=Bamum} (Short: \p{Bamu}; NOT
283                                    \p{Block=Bamum}) (88)
284        X \p{Basic_Latin}         \p{ASCII} (= \p{Block=Basic_Latin}) (128)
285          \p{Bc: *}               \p{Bidi_Class: *}
286          \p{Beng}                \p{Bengali} (= \p{Script=Bengali}) (NOT
287                                    \p{Block=Bengali}) (92)
288          \p{Bengali}             \p{Script=Bengali} (Short: \p{Beng}; NOT
289                                    \p{Block=Bengali}) (92)
290          \p{Bidi_C}              \p{Bidi_Control} (= \p{Bidi_Control=Y}) (7)
291          \p{Bidi_C: *}           \p{Bidi_Control: *}
292          \p{Bidi_Class: AL}      \p{Bidi_Class=Arabic_Letter} (1116)
293          \p{Bidi_Class: AN}      \p{Bidi_Class=Arabic_Number} (48)
294          \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1116)
295          \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (48)
296          \p{Bidi_Class: B}       \p{Bidi_Class=Paragraph_Separator} (7)
297          \p{Bidi_Class: BN}      \p{Bidi_Class=Boundary_Neutral} (4016)
298          \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016)
299          \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15)
300          \p{Bidi_Class: CS}      \p{Bidi_Class=Common_Separator} (15)
301          \p{Bidi_Class: EN}      \p{Bidi_Class=European_Number} (131)
302          \p{Bidi_Class: ES}      \p{Bidi_Class=European_Separator} (12)
303          \p{Bidi_Class: ET}      \p{Bidi_Class=European_Terminator} (63)
304          \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (131)
305          \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12)
306          \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (63)
307          \p{Bidi_Class: L}       \p{Bidi_Class=Left_To_Right} (1_099_541)
308          \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_099_541)
309          \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1)
310          \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1)
311          \p{Bidi_Class: LRE}     \p{Bidi_Class=Left_To_Right_Embedding} (1)
312          \p{Bidi_Class: LRO}     \p{Bidi_Class=Left_To_Right_Override} (1)
313          \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1173)
314          \p{Bidi_Class: NSM}     \p{Bidi_Class=Nonspacing_Mark} (1173)
315          \p{Bidi_Class: ON}      \p{Bidi_Class=Other_Neutral} (3523)
316          \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (3523)
317          \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7)
318          \p{Bidi_Class: PDF}     \p{Bidi_Class=Pop_Directional_Format} (1)
319          \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1)
320          \p{Bidi_Class: R}       \p{Bidi_Class=Right_To_Left} (4441)
321          \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (4441)
322          \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1)
323          \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1)
324          \p{Bidi_Class: RLE}     \p{Bidi_Class=Right_To_Left_Embedding} (1)
325          \p{Bidi_Class: RLO}     \p{Bidi_Class=Right_To_Left_Override} (1)
326          \p{Bidi_Class: S}       \p{Bidi_Class=Segment_Separator} (3)
327          \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3)
328          \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (18)
329          \p{Bidi_Class: WS}      \p{Bidi_Class=White_Space} (18)
330          \p{Bidi_Control}        \p{Bidi_Control=Y} (Short: \p{BidiC}) (7)
331          \p{Bidi_Control: N*}    (Short: \p{BidiC=N}, \P{BidiC}) (1_114_105)
332          \p{Bidi_Control: Y*}    (Short: \p{BidiC=Y}, \p{BidiC}) (7)
333          \p{Bidi_M}              \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
334                                    (543)
335          \p{Bidi_M: *}           \p{Bidi_Mirrored: *}
336          \p{Bidi_Mirrored}       \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
337                                    (543)
338          \p{Bidi_Mirrored: N*}   (Short: \p{BidiM=N}, \P{BidiM}) (1_113_569)
339          \p{Bidi_Mirrored: Y*}   (Short: \p{BidiM=Y}, \p{BidiM}) (543)
340          \p{Blank}               \h, Horizontal white space (19)
341          \p{Blk: *}              \p{Block: *}
342          \p{Block: Aegean_Numbers} (Single: \p{InAegeanNumbers}) (64)
343          \p{Block: Alphabetic_Presentation_Forms} (Single:
344                                    \p{InAlphabeticPresentationForms}) (80)
345          \p{Block: Ancient_Greek_Musical_Notation} (Single:
346                                    \p{InAncientGreekMusicalNotation}) (80)
347          \p{Block: Ancient_Greek_Numbers} (Single:
348                                    \p{InAncientGreekNumbers}) (80)
349          \p{Block: Ancient_Symbols} (Single: \p{InAncientSymbols}) (64)
350          \p{Block: Arabic}       (Single: \p{InArabic}; NOT \p{Arabic} NOR
351                                    \p{Is_Arabic}) (256)
352          \p{Block: Arabic_Presentation_Forms_A} (Single:
353                                    \p{InArabicPresentationFormsA}) (688)
354          \p{Block: Arabic_Presentation_Forms_B} (Single:
355                                    \p{InArabicPresentationFormsB}) (144)
356          \p{Block: Arabic_Supplement} (Single: \p{InArabicSupplement}) (48)
357          \p{Block: Armenian}     (Single: \p{InArmenian}; NOT \p{Armenian}
358                                    NOR \p{Is_Armenian}) (96)
359          \p{Block: Arrows}       (Single: \p{InArrows}) (112)
360          \p{Block: ASCII}        \p{Block=Basic_Latin} (128)
361          \p{Block: Avestan}      (Single: \p{InAvestan}; NOT \p{Avestan}
362                                    NOR \p{Is_Avestan}) (64)
363          \p{Block: Balinese}     (Single: \p{InBalinese}; NOT \p{Balinese}
364                                    NOR \p{Is_Balinese}) (128)
365          \p{Block: Bamum}        (Single: \p{InBamum}; NOT \p{Bamum} NOR
366                                    \p{Is_Bamum}) (96)
367          \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}, \p{ASCII}) (128)
368          \p{Block: Bengali}      (Single: \p{InBengali}; NOT \p{Bengali}
369                                    NOR \p{Is_Bengali}) (128)
370          \p{Block: Block_Elements} (Single: \p{InBlockElements}) (32)
371          \p{Block: Bopomofo}     (Single: \p{InBopomofo}; NOT \p{Bopomofo}
372                                    NOR \p{Is_Bopomofo}) (48)
373          \p{Block: Bopomofo_Extended} (Single: \p{InBopomofoExtended}) (32)
374          \p{Block: Box_Drawing}  (Single: \p{InBoxDrawing}) (128)
375          \p{Block: Braille_Patterns} (Single: \p{InBraillePatterns}) (256)
376          \p{Block: Buginese}     (Single: \p{InBuginese}; NOT \p{Buginese}
377                                    NOR \p{Is_Buginese}) (32)
378          \p{Block: Buhid}        (Single: \p{InBuhid}; NOT \p{Buhid} NOR
379                                    \p{Is_Buhid}) (32)
380          \p{Block: Byzantine_Musical_Symbols} (Single:
381                                    \p{InByzantineMusicalSymbols}) (256)
382          \p{Block: Canadian_Syllabics} \p{Block=
383                                    Unified_Canadian_Aboriginal_Syllabics}
384                                    (640)
385          \p{Block: Carian}       (Single: \p{InCarian}; NOT \p{Carian} NOR
386                                    \p{Is_Carian}) (64)
387          \p{Block: Cham}         (Single: \p{InCham}; NOT \p{Cham} NOR
388                                    \p{Is_Cham}) (96)
389          \p{Block: Cherokee}     (Single: \p{InCherokee}; NOT \p{Cherokee}
390                                    NOR \p{Is_Cherokee}) (96)
391          \p{Block: CJK_Compatibility} (Single: \p{InCJKCompatibility}) (256)
392          \p{Block: CJK_Compatibility_Forms} (Single:
393                                    \p{InCJKCompatibilityForms}) (32)
394          \p{Block: CJK_Compatibility_Ideographs} (Single:
395                                    \p{InCJKCompatibilityIdeographs}) (512)
396          \p{Block: CJK_Compatibility_Ideographs_Supplement} (Single:
397                                    \p{InCJKCompatibilityIdeographs-
398                                    Supplement}) (544)
399          \p{Block: CJK_Radicals_Supplement} (Single:
400                                    \p{InCJKRadicalsSupplement}) (128)
401          \p{Block: CJK_Strokes}  (Single: \p{InCJKStrokes}) (48)
402          \p{Block: CJK_Symbols_And_Punctuation} (Single:
403                                    \p{InCJKSymbolsAndPunctuation}) (64)
404          \p{Block: CJK_Unified_Ideographs} (Single:
405                                    \p{InCJKUnifiedIdeographs}) (20_992)
406          \p{Block: CJK_Unified_Ideographs_Extension_A} (Single:
407                                    \p{InCJKUnifiedIdeographsExtensionA})
408                                    (6592)
409          \p{Block: CJK_Unified_Ideographs_Extension_B} (Single:
410                                    \p{InCJKUnifiedIdeographsExtensionB})
411                                    (42_720)
412          \p{Block: CJK_Unified_Ideographs_Extension_C} (Single:
413                                    \p{InCJKUnifiedIdeographsExtensionC})
414                                    (4160)
415          \p{Block: Combining_Diacritical_Marks} (Single:
416                                    \p{InCombiningDiacriticalMarks}) (112)
417          \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
418                                    CombiningMarksForSymbols},
419                                    \p{InCombiningMarksForSymbols}) (48)
420          \p{Block: Combining_Diacritical_Marks_Supplement} (Single:
421                                    \p{InCombiningDiacriticalMarks-
422                                    Supplement}) (64)
423          \p{Block: Combining_Half_Marks} (Single: \p{InCombiningHalfMarks})
424                                    (16)
425          \p{Block: Combining_Marks_For_Symbols} \p{Block=
426                                    Combining_Diacritical_Marks_For_Symbols}
427                                    (48)
428          \p{Block: Common_Indic_Number_Forms} (Single:
429                                    \p{InCommonIndicNumberForms}) (16)
430          \p{Block: Control_Pictures} (Single: \p{InControlPictures}) (64)
431          \p{Block: Coptic}       (Single: \p{InCoptic}; NOT \p{Coptic} NOR
432                                    \p{Is_Coptic}) (128)
433          \p{Block: Counting_Rod_Numerals} (Single:
434                                    \p{InCountingRodNumerals}) (32)
435          \p{Block: Cuneiform}    (Single: \p{InCuneiform}; NOT
436                                    \p{Cuneiform} NOR \p{Is_Cuneiform})
437                                    (1024)
438          \p{Block: Cuneiform_Numbers_And_Punctuation} (Single:
439                                    \p{InCuneiformNumbersAndPunctuation})
440                                    (128)
441          \p{Block: Currency_Symbols} (Single: \p{InCurrencySymbols}) (48)
442          \p{Block: Cypriot_Syllabary} (Single: \p{InCypriotSyllabary}) (64)
443          \p{Block: Cyrillic}     (Single: \p{InCyrillic}; NOT \p{Cyrillic}
444                                    NOR \p{Is_Cyrillic}) (256)
445          \p{Block: Cyrillic_Extended_A} (Single: \p{InCyrillicExtendedA})
446                                    (32)
447          \p{Block: Cyrillic_Extended_B} (Single: \p{InCyrillicExtendedB})
448                                    (96)
449          \p{Block: Cyrillic_Supplement} (Single: \p{InCyrillicSupplement})
450                                    (48)
451          \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
452                                    (48)
453          \p{Block: Deseret}      (Single: \p{InDeseret}) (80)
454          \p{Block: Devanagari}   (Single: \p{InDevanagari}; NOT
455                                    \p{Devanagari} NOR \p{Is_Devanagari})
456                                    (128)
457          \p{Block: Devanagari_Extended} (Single: \p{InDevanagariExtended})
458                                    (32)
459          \p{Block: Dingbats}     (Single: \p{InDingbats}) (192)
460          \p{Block: Domino_Tiles} (Single: \p{InDominoTiles}) (112)
461          \p{Block: Egyptian_Hieroglyphs} (Single:
462                                    \p{InEgyptianHieroglyphs}; NOT
463                                    \p{Egyptian_Hieroglyphs} NOR
464                                    \p{Is_Egyptian_Hieroglyphs}) (1072)
465          \p{Block: Enclosed_Alphanumeric_Supplement} (Single:
466                                    \p{InEnclosedAlphanumericSupplement})
467                                    (256)
468          \p{Block: Enclosed_Alphanumerics} (Single:
469                                    \p{InEnclosedAlphanumerics}) (160)
470          \p{Block: Enclosed_CJK_Letters_And_Months} (Single:
471                                    \p{InEnclosedCJKLettersAndMonths}) (256)
472          \p{Block: Enclosed_Ideographic_Supplement} (Single:
473                                    \p{InEnclosedIdeographicSupplement})
474                                    (256)
475          \p{Block: Ethiopic}     (Single: \p{InEthiopic}; NOT \p{Ethiopic}
476                                    NOR \p{Is_Ethiopic}) (384)
477          \p{Block: Ethiopic_Extended} (Single: \p{InEthiopicExtended}) (96)
478          \p{Block: Ethiopic_Supplement} (Single: \p{InEthiopicSupplement})
479                                    (32)
480          \p{Block: General_Punctuation} (Single: \p{InGeneralPunctuation})
481                                    (112)
482          \p{Block: Geometric_Shapes} (Single: \p{InGeometricShapes}) (96)
483          \p{Block: Georgian}     (Single: \p{InGeorgian}; NOT \p{Georgian}
484                                    NOR \p{Is_Georgian}) (96)
485          \p{Block: Georgian_Supplement} (Single: \p{InGeorgianSupplement})
486                                    (48)
487          \p{Block: Glagolitic}   (Single: \p{InGlagolitic}; NOT
488                                    \p{Glagolitic} NOR \p{Is_Glagolitic})
489                                    (96)
490          \p{Block: Gothic}       (Single: \p{InGothic}; NOT \p{Gothic} NOR
491                                    \p{Is_Gothic}) (32)
492          \p{Block: Greek}        \p{Block=Greek_And_Coptic} (NOT \p{Greek}
493                                    NOR \p{Is_Greek}) (144)
494          \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}, \p{InGreek};
495                                    NOT \p{Greek} NOR \p{Is_Greek}) (144)
496          \p{Block: Greek_Extended} (Single: \p{InGreekExtended}) (256)
497          \p{Block: Gujarati}     (Single: \p{InGujarati}; NOT \p{Gujarati}
498                                    NOR \p{Is_Gujarati}) (128)
499          \p{Block: Gurmukhi}     (Single: \p{InGurmukhi}; NOT \p{Gurmukhi}
500                                    NOR \p{Is_Gurmukhi}) (128)
501          \p{Block: Halfwidth_And_Fullwidth_Forms} (Single:
502                                    \p{InHalfwidthAndFullwidthForms}) (240)
503          \p{Block: Hangul_Compatibility_Jamo} (Single:
504                                    \p{InHangulCompatibilityJamo}) (96)
505          \p{Block: Hangul_Jamo}  (Single: \p{InHangulJamo}) (256)
506          \p{Block: Hangul_Jamo_Extended_A} (Single:
507                                    \p{InHangulJamoExtendedA}) (32)
508          \p{Block: Hangul_Jamo_Extended_B} (Single:
509                                    \p{InHangulJamoExtendedB}) (80)
510          \p{Block: Hangul_Syllables} (Single: \p{InHangulSyllables})
511                                    (11_184)
512          \p{Block: Hanunoo}      (Single: \p{InHanunoo}; NOT \p{Hanunoo}
513                                    NOR \p{Is_Hanunoo}) (32)
514          \p{Block: Hebrew}       (Single: \p{InHebrew}; NOT \p{Hebrew} NOR
515                                    \p{Is_Hebrew}) (112)
516          \p{Block: High_Private_Use_Surrogates} (Single:
517                                    \p{InHighPrivateUseSurrogates}) (128)
518          \p{Block: High_Surrogates} (Single: \p{InHighSurrogates}) (896)
519          \p{Block: Hiragana}     (Single: \p{InHiragana}; NOT \p{Hiragana}
520                                    NOR \p{Is_Hiragana}) (96)
521          \p{Block: Ideographic_Description_Characters} (Single:
522                                    \p{InIdeographicDescriptionCharacters})
523                                    (16)
524          \p{Block: Imperial_Aramaic} (Single: \p{InImperialAramaic}; NOT
525                                    \p{Imperial_Aramaic} NOR
526                                    \p{Is_Imperial_Aramaic}) (32)
527          \p{Block: Inscriptional_Pahlavi} (Single:
528                                    \p{InInscriptionalPahlavi}; NOT
529                                    \p{Inscriptional_Pahlavi} NOR
530                                    \p{Is_Inscriptional_Pahlavi}) (32)
531          \p{Block: Inscriptional_Parthian} (Single:
532                                    \p{InInscriptionalParthian}; NOT
533                                    \p{Inscriptional_Parthian} NOR
534                                    \p{Is_Inscriptional_Parthian}) (32)
535          \p{Block: IPA_Extensions} (Single: \p{InIPAExtensions}) (96)
536          \p{Block: Javanese}     (Single: \p{InJavanese}; NOT \p{Javanese}
537                                    NOR \p{Is_Javanese}) (96)
538          \p{Block: Kaithi}       (Single: \p{InKaithi}; NOT \p{Kaithi} NOR
539                                    \p{Is_Kaithi}) (80)
540          \p{Block: Kanbun}       (Single: \p{InKanbun}) (16)
541          \p{Block: Kangxi_Radicals} (Single: \p{InKangxiRadicals}) (224)
542          \p{Block: Kannada}      (Single: \p{InKannada}; NOT \p{Kannada}
543                                    NOR \p{Is_Kannada}) (128)
544          \p{Block: Katakana}     (Single: \p{InKatakana}; NOT \p{Katakana}
545                                    NOR \p{Is_Katakana}) (96)
546          \p{Block: Katakana_Phonetic_Extensions} (Single:
547                                    \p{InKatakanaPhoneticExtensions}) (16)
548          \p{Block: Kayah_Li}     (Single: \p{InKayahLi}) (48)
549          \p{Block: Kharoshthi}   (Single: \p{InKharoshthi}; NOT
550                                    \p{Kharoshthi} NOR \p{Is_Kharoshthi})
551                                    (96)
552          \p{Block: Khmer}        (Single: \p{InKhmer}; NOT \p{Khmer} NOR
553                                    \p{Is_Khmer}) (128)
554          \p{Block: Khmer_Symbols} (Single: \p{InKhmerSymbols}) (32)
555          \p{Block: Lao}          (Single: \p{InLao}; NOT \p{Lao} NOR
556                                    \p{Is_Lao}) (128)
557          \p{Block: Latin_1}      \p{Block=Latin_1_Supplement} (128)
558          \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1},
559                                    \p{InLatin1}) (128)
560          \p{Block: Latin_Extended_A} (Single: \p{InLatinExtendedA}) (128)
561          \p{Block: Latin_Extended_Additional} (Single:
562                                    \p{InLatinExtendedAdditional}) (256)
563          \p{Block: Latin_Extended_B} (Single: \p{InLatinExtendedB}) (208)
564          \p{Block: Latin_Extended_C} (Single: \p{InLatinExtendedC}) (32)
565          \p{Block: Latin_Extended_D} (Single: \p{InLatinExtendedD}) (224)
566          \p{Block: Lepcha}       (Single: \p{InLepcha}; NOT \p{Lepcha} NOR
567                                    \p{Is_Lepcha}) (80)
568          \p{Block: Letterlike_Symbols} (Single: \p{InLetterlikeSymbols})
569                                    (80)
570          \p{Block: Limbu}        (Single: \p{InLimbu}; NOT \p{Limbu} NOR
571                                    \p{Is_Limbu}) (80)
572          \p{Block: Linear_B_Ideograms} (Single: \p{InLinearBIdeograms})
573                                    (128)
574          \p{Block: Linear_B_Syllabary} (Single: \p{InLinearBSyllabary})
575                                    (128)
576          \p{Block: Lisu}         (Single: \p{InLisu}) (48)
577          \p{Block: Low_Surrogates} (Single: \p{InLowSurrogates}) (1024)
578          \p{Block: Lycian}       (Single: \p{InLycian}; NOT \p{Lycian} NOR
579                                    \p{Is_Lycian}) (32)
580          \p{Block: Lydian}       (Single: \p{InLydian}; NOT \p{Lydian} NOR
581                                    \p{Is_Lydian}) (32)
582          \p{Block: Mahjong_Tiles} (Single: \p{InMahjongTiles}) (48)
583          \p{Block: Malayalam}    (Single: \p{InMalayalam}; NOT
584                                    \p{Malayalam} NOR \p{Is_Malayalam}) (128)
585          \p{Block: Mathematical_Alphanumeric_Symbols} (Single:
586                                    \p{InMathematicalAlphanumericSymbols})
587                                    (1024)
588          \p{Block: Mathematical_Operators} (Single:
589                                    \p{InMathematicalOperators}) (256)
590          \p{Block: Meetei_Mayek} (Single: \p{InMeeteiMayek}; NOT
591                                    \p{Meetei_Mayek} NOR
592                                    \p{Is_Meetei_Mayek}) (64)
593          \p{Block: Miscellaneous_Mathematical_Symbols_A} (Single:
594                                    \p{InMiscellaneousMathematicalSymbolsA})
595                                    (48)
596          \p{Block: Miscellaneous_Mathematical_Symbols_B} (Single:
597                                    \p{InMiscellaneousMathematicalSymbolsB})
598                                    (128)
599          \p{Block: Miscellaneous_Symbols} (Single:
600                                    \p{InMiscellaneousSymbols}) (256)
601          \p{Block: Miscellaneous_Symbols_And_Arrows} (Single:
602                                    \p{InMiscellaneousSymbolsAndArrows})
603                                    (256)
604          \p{Block: Miscellaneous_Technical} (Single:
605                                    \p{InMiscellaneousTechnical}) (256)
606          \p{Block: Modifier_Tone_Letters} (Single:
607                                    \p{InModifierToneLetters}) (32)
608          \p{Block: Mongolian}    (Single: \p{InMongolian}; NOT
609                                    \p{Mongolian} NOR \p{Is_Mongolian}) (176)
610          \p{Block: Musical_Symbols} (Single: \p{InMusicalSymbols}) (256)
611          \p{Block: Myanmar}      (Single: \p{InMyanmar}; NOT \p{Myanmar}
612                                    NOR \p{Is_Myanmar}) (160)
613          \p{Block: Myanmar_Extended_A} (Single: \p{InMyanmarExtendedA}) (32)
614          \p{Block: New_Tai_Lue}  (Single: \p{InNewTaiLue}; NOT
615                                    \p{New_Tai_Lue} NOR \p{Is_New_Tai_Lue})
616                                    (96)
617          \p{Block: NKo}          (Single: \p{InNKo}; NOT \p{Nko} NOR
618                                    \p{Is_NKo}) (64)
619          \p{Block: No_Block}     (Single: \p{InNoBlock}) (864_192)
620          \p{Block: Number_Forms} (Single: \p{InNumberForms}) (64)
621          \p{Block: Ogham}        (Single: \p{InOgham}; NOT \p{Ogham} NOR
622                                    \p{Is_Ogham}) (32)
623          \p{Block: Ol_Chiki}     (Single: \p{InOlChiki}) (48)
624          \p{Block: Old_Italic}   (Single: \p{InOldItalic}; NOT
625                                    \p{Old_Italic} NOR \p{Is_Old_Italic})
626                                    (48)
627          \p{Block: Old_Persian}  (Single: \p{InOldPersian}; NOT
628                                    \p{Old_Persian} NOR \p{Is_Old_Persian})
629                                    (64)
630          \p{Block: Old_South_Arabian} (Single: \p{InOldSouthArabian}) (32)
631          \p{Block: Old_Turkic}   (Single: \p{InOldTurkic}; NOT
632                                    \p{Old_Turkic} NOR \p{Is_Old_Turkic})
633                                    (80)
634          \p{Block: Optical_Character_Recognition} (Single:
635                                    \p{InOpticalCharacterRecognition}) (32)
636          \p{Block: Oriya}        (Single: \p{InOriya}; NOT \p{Oriya} NOR
637                                    \p{Is_Oriya}) (128)
638          \p{Block: Osmanya}      (Single: \p{InOsmanya}; NOT \p{Osmanya}
639                                    NOR \p{Is_Osmanya}) (48)
640          \p{Block: Phags_Pa}     (Single: \p{InPhagsPa}; NOT \p{Phags_Pa}
641                                    NOR \p{Is_Phags_Pa}) (64)
642          \p{Block: Phaistos_Disc} (Single: \p{InPhaistosDisc}) (48)
643          \p{Block: Phoenician}   (Single: \p{InPhoenician}; NOT
644                                    \p{Phoenician} NOR \p{Is_Phoenician})
645                                    (32)
646          \p{Block: Phonetic_Extensions} (Single: \p{InPhoneticExtensions})
647                                    (128)
648          \p{Block: Phonetic_Extensions_Supplement} (Single:
649                                    \p{InPhoneticExtensionsSupplement}) (64)
650          \p{Block: Private_Use}  \p{Block=Private_Use_Area} (NOT
651                                    \p{Private_Use} NOR \p{Is_Private_Use})
652                                    (6400)
653          \p{Block: Private_Use_Area} (Short: \p{Blk=PrivateUse},
654                                    \p{InPrivateUse}; NOT \p{Private_Use}
655                                    NOR \p{Is_Private_Use}) (6400)
656          \p{Block: Rejang}       (Single: \p{InRejang}; NOT \p{Rejang} NOR
657                                    \p{Is_Rejang}) (48)
658          \p{Block: Rumi_Numeral_Symbols} (Single: \p{InRumiNumeralSymbols})
659                                    (32)
660          \p{Block: Runic}        (Single: \p{InRunic}; NOT \p{Runic} NOR
661                                    \p{Is_Runic}) (96)
662          \p{Block: Samaritan}    (Single: \p{InSamaritan}; NOT
663                                    \p{Samaritan} NOR \p{Is_Samaritan}) (64)
664          \p{Block: Saurashtra}   (Single: \p{InSaurashtra}; NOT
665                                    \p{Saurashtra} NOR \p{Is_Saurashtra})
666                                    (96)
667          \p{Block: Shavian}      (Single: \p{InShavian}) (48)
668          \p{Block: Sinhala}      (Single: \p{InSinhala}; NOT \p{Sinhala}
669                                    NOR \p{Is_Sinhala}) (128)
670          \p{Block: Small_Form_Variants} (Single: \p{InSmallFormVariants})
671                                    (32)
672          \p{Block: Spacing_Modifier_Letters} (Single:
673                                    \p{InSpacingModifierLetters}) (80)
674          \p{Block: Specials}     (Single: \p{InSpecials}) (16)
675          \p{Block: Sundanese}    (Single: \p{InSundanese}; NOT
676                                    \p{Sundanese} NOR \p{Is_Sundanese}) (64)
677          \p{Block: Superscripts_And_Subscripts} (Single:
678                                    \p{InSuperscriptsAndSubscripts}) (48)
679          \p{Block: Supplemental_Arrows_A} (Single:
680                                    \p{InSupplementalArrowsA}) (16)
681          \p{Block: Supplemental_Arrows_B} (Single:
682                                    \p{InSupplementalArrowsB}) (128)
683          \p{Block: Supplemental_Mathematical_Operators} (Single:
684                                    \p{InSupplementalMathematicalOperators})
685                                    (256)
686          \p{Block: Supplemental_Punctuation} (Single:
687                                    \p{InSupplementalPunctuation}) (128)
688          \p{Block: Supplementary_Private_Use_Area_A} (Single:
689                                    \p{InSupplementaryPrivateUseAreaA})
690                                    (65_536)
691          \p{Block: Supplementary_Private_Use_Area_B} (Single:
692                                    \p{InSupplementaryPrivateUseAreaB})
693                                    (65_536)
694          \p{Block: Syloti_Nagri} (Single: \p{InSylotiNagri}; NOT
695                                    \p{Syloti_Nagri} NOR
696                                    \p{Is_Syloti_Nagri}) (48)
697          \p{Block: Syriac}       (Single: \p{InSyriac}; NOT \p{Syriac} NOR
698                                    \p{Is_Syriac}) (80)
699          \p{Block: Tagalog}      (Single: \p{InTagalog}; NOT \p{Tagalog}
700                                    NOR \p{Is_Tagalog}) (32)
701          \p{Block: Tagbanwa}     (Single: \p{InTagbanwa}; NOT \p{Tagbanwa}
702                                    NOR \p{Is_Tagbanwa}) (32)
703          \p{Block: Tags}         (Single: \p{InTags}) (128)
704          \p{Block: Tai_Le}       (Single: \p{InTaiLe}; NOT \p{Tai_Le} NOR
705                                    \p{Is_Tai_Le}) (48)
706          \p{Block: Tai_Tham}     (Single: \p{InTaiTham}; NOT \p{Tai_Tham}
707                                    NOR \p{Is_Tai_Tham}) (144)
708          \p{Block: Tai_Viet}     (Single: \p{InTaiViet}; NOT \p{Tai_Viet}
709                                    NOR \p{Is_Tai_Viet}) (96)
710          \p{Block: Tai_Xuan_Jing_Symbols} (Single:
711                                    \p{InTaiXuanJingSymbols}) (96)
712          \p{Block: Tamil}        (Single: \p{InTamil}; NOT \p{Tamil} NOR
713                                    \p{Is_Tamil}) (128)
714          \p{Block: Telugu}       (Single: \p{InTelugu}; NOT \p{Telugu} NOR
715                                    \p{Is_Telugu}) (128)
716          \p{Block: Thaana}       (Single: \p{InThaana}; NOT \p{Thaana} NOR
717                                    \p{Is_Thaana}) (64)
718          \p{Block: Thai}         (Single: \p{InThai}; NOT \p{Thai} NOR
719                                    \p{Is_Thai}) (128)
720          \p{Block: Tibetan}      (Single: \p{InTibetan}; NOT \p{Tibetan}
721                                    NOR \p{Is_Tibetan}) (256)
722          \p{Block: Tifinagh}     (Single: \p{InTifinagh}; NOT \p{Tifinagh}
723                                    NOR \p{Is_Tifinagh}) (80)
724          \p{Block: Ugaritic}     (Single: \p{InUgaritic}; NOT \p{Ugaritic}
725                                    NOR \p{Is_Ugaritic}) (32)
726          \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
727                                    CanadianSyllabics},
728                                    \p{InCanadianSyllabics}) (640)
729          \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Single:
730                                    \p{InUnifiedCanadianAboriginalSyllabics-
731                                    Extended}) (80)
732          \p{Block: Vai}          (Single: \p{InVai}; NOT \p{Vai} NOR
733                                    \p{Is_Vai}) (320)
734          \p{Block: Variation_Selectors} (Single: \p{InVariationSelectors})
735                                    (16)
736          \p{Block: Variation_Selectors_Supplement} (Single:
737                                    \p{InVariationSelectorsSupplement}) (240)
738          \p{Block: Vedic_Extensions} (Single: \p{InVedicExtensions}) (48)
739          \p{Block: Vertical_Forms} (Single: \p{InVerticalForms}) (16)
740          \p{Block: Yi_Radicals}  (Single: \p{InYiRadicals}) (64)
741          \p{Block: Yi_Syllables} (Single: \p{InYiSyllables}) (1168)
742          \p{Block: Yijing_Hexagram_Symbols} (Single:
743                                    \p{InYijingHexagramSymbols}) (64)
744        X \p{Block_Elements}      \p{Block=Block_Elements} (32)
745          \p{Bopo}                \p{Bopomofo} (= \p{Script=Bopomofo}) (NOT
746                                    \p{Block=Bopomofo}) (65)
747          \p{Bopomofo}            \p{Script=Bopomofo} (Short: \p{Bopo}; NOT
748                                    \p{Block=Bopomofo}) (65)
749        X \p{Bopomofo_Extended}   \p{Block=Bopomofo_Extended} (32)
750        X \p{Box_Drawing}         \p{Block=Box_Drawing} (128)
751          \p{Brai}                \p{Braille} (= \p{Script=Braille}) (256)
752          \p{Braille}             \p{Script=Braille} (Short: \p{Brai}) (256)
753        X \p{Braille_Patterns}    \p{Block=Braille_Patterns} (256)
754          \p{Bugi}                \p{Buginese} (= \p{Script=Buginese}) (NOT
755                                    \p{Block=Buginese}) (30)
756          \p{Buginese}            \p{Script=Buginese} (Short: \p{Bugi}; NOT
757                                    \p{Block=Buginese}) (30)
758          \p{Buhd}                \p{Buhid} (= \p{Script=Buhid}) (NOT
759                                    \p{Block=Buhid}) (20)
760          \p{Buhid}               \p{Script=Buhid} (Short: \p{Buhd}; NOT
761                                    \p{Block=Buhid}) (20)
762        X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
763                                    (256)
764          \p{C}                   \p{Other} (= \p{General_Category=Other})
765                                    (1_006_956)
766          \p{Canadian_Aboriginal} \p{Script=Canadian_Aboriginal} (Short:
767                                    \p{Cans}) (710)
768        X \p{Canadian_Syllabics}  \p{Unified_Canadian_Aboriginal_Syllabics}
769                                    (= \p{Block=
770                                    Unified_Canadian_Aboriginal_Syllabics})
771                                    (640)
772        T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
773                                    Not_Reordered} (1_113_518)
774        T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
775                                    Overlay} (26)
776        T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
777                                    Nukta} (11)
778        T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
779                                    Kana_Voicing} (2)
780        T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
781                                    Virama} (27)
782        T \p{Canonical_Combining_Class: 10} (Short: \p{Ccc=10}) (1)
783        T \p{Canonical_Combining_Class: 11} (Short: \p{Ccc=11}) (1)
784        T \p{Canonical_Combining_Class: 12} (Short: \p{Ccc=12}) (1)
785        T \p{Canonical_Combining_Class: 13} (Short: \p{Ccc=13}) (1)
786        T \p{Canonical_Combining_Class: 14} (Short: \p{Ccc=14}) (1)
787        T \p{Canonical_Combining_Class: 15} (Short: \p{Ccc=15}) (1)
788        T \p{Canonical_Combining_Class: 16} (Short: \p{Ccc=16}) (1)
789        T \p{Canonical_Combining_Class: 17} (Short: \p{Ccc=17}) (1)
790        T \p{Canonical_Combining_Class: 18} (Short: \p{Ccc=18}) (2)
791        T \p{Canonical_Combining_Class: 19} (Short: \p{Ccc=19}) (2)
792        T \p{Canonical_Combining_Class: 20} (Short: \p{Ccc=20}) (1)
793        T \p{Canonical_Combining_Class: 21} (Short: \p{Ccc=21}) (1)
794        T \p{Canonical_Combining_Class: 22} (Short: \p{Ccc=22}) (1)
795        T \p{Canonical_Combining_Class: 23} (Short: \p{Ccc=23}) (1)
796        T \p{Canonical_Combining_Class: 24} (Short: \p{Ccc=24}) (1)
797        T \p{Canonical_Combining_Class: 25} (Short: \p{Ccc=25}) (1)
798        T \p{Canonical_Combining_Class: 26} (Short: \p{Ccc=26}) (1)
799        T \p{Canonical_Combining_Class: 27} (Short: \p{Ccc=27}) (1)
800        T \p{Canonical_Combining_Class: 28} (Short: \p{Ccc=28}) (1)
801        T \p{Canonical_Combining_Class: 29} (Short: \p{Ccc=29}) (1)
802        T \p{Canonical_Combining_Class: 30} (Short: \p{Ccc=30}) (2)
803        T \p{Canonical_Combining_Class: 31} (Short: \p{Ccc=31}) (2)
804        T \p{Canonical_Combining_Class: 32} (Short: \p{Ccc=32}) (2)
805        T \p{Canonical_Combining_Class: 33} (Short: \p{Ccc=33}) (1)
806        T \p{Canonical_Combining_Class: 34} (Short: \p{Ccc=34}) (1)
807        T \p{Canonical_Combining_Class: 35} (Short: \p{Ccc=35}) (1)
808        T \p{Canonical_Combining_Class: 36} (Short: \p{Ccc=36}) (1)
809        T \p{Canonical_Combining_Class: 84} (Short: \p{Ccc=84}) (1)
810        T \p{Canonical_Combining_Class: 91} (Short: \p{Ccc=91}) (1)
811        T \p{Canonical_Combining_Class: 103} (Short: \p{Ccc=103}) (2)
812        T \p{Canonical_Combining_Class: 107} (Short: \p{Ccc=107}) (4)
813        T \p{Canonical_Combining_Class: 118} (Short: \p{Ccc=118}) (2)
814        T \p{Canonical_Combining_Class: 122} (Short: \p{Ccc=122}) (4)
815        T \p{Canonical_Combining_Class: 129} (Short: \p{Ccc=129}) (1)
816        T \p{Canonical_Combining_Class: 130} (Short: \p{Ccc=130}) (6)
817        T \p{Canonical_Combining_Class: 132} (Short: \p{Ccc=132}) (1)
818        T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
819                                    Attached_Below_Left} (0)
820        T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
821                                    Attached_Below} (5)
822        T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
823                                    Attached_Above} (1)
824        T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
825                                    Attached_Above_Right} (9)
826        T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
827                                    Below_Left} (1)
828        T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
829                                    Below} (117)
830        T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
831                                    Below_Right} (4)
832        T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
833                                    Left} (2)
834        T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
835                                    Right} (1)
836        T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
837                                    Above_Left} (3)
838        T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
839                                    Above} (318)
840        T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
841                                    Above_Right} (4)
842        T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
843                                    Double_Below} (3)
844        T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
845                                    Double_Above} (5)
846        T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
847                                    Iota_Subscript} (1)
848          \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
849                                    Above} (318)
850          \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (318)
851          \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
852          \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
853          \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
854                                    Above_Left} (3)
855          \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
856                                    Above_Right} (4)
857          \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
858                                    Attached_Above} (1)
859          \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
860                                    Attached_Above_Right} (9)
861          \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
862                                    Attached_Below} (5)
863          \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
864                                    Attached_Below_Left} (0)
865          \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
866                                    (1)
867          \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
868                                    \p{Ccc=ATAR}) (9)
869          \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
870                                    (5)
871          \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
872                                    ATBL}) (0)
873          \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
874                                    Below} (117)
875          \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (117)
876          \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
877          \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
878          \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
879                                    Below_Left} (1)
880          \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
881                                    Below_Right} (4)
882          \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
883                                    Double_Above} (5)
884          \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
885                                    Double_Below} (3)
886          \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
887          \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (3)
888          \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
889                                    (1)
890          \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
891                                    Iota_Subscript} (1)
892          \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
893          \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
894                                    Kana_Voicing} (2)
895          \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
896                                    Left} (2)
897          \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
898          \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
899                                    Nukta} (11)
900          \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
901                                    (1_113_518)
902          \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
903                                    Not_Reordered} (1_113_518)
904          \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (11)
905          \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
906                                    Overlay} (26)
907          \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (26)
908          \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
909                                    Right} (1)
910          \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
911          \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (27)
912          \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
913                                    Virama} (27)
914          \p{Cans}                \p{Canadian_Aboriginal} (= \p{Script=
915                                    Canadian_Aboriginal}) (710)
916          \p{Cari}                \p{Carian} (= \p{Script=Carian}) (NOT
917                                    \p{Block=Carian}) (49)
918          \p{Carian}              \p{Script=Carian} (Short: \p{Cari}; NOT
919                                    \p{Block=Carian}) (49)
920          \p{Case_Ignorable}      \p{Case_Ignorable=Y} (Short: \p{CI}) (1632)
921          \p{Case_Ignorable: N*}  (Short: \p{CI=N}, \P{CI}) (1_112_480)
922          \p{Case_Ignorable: Y*}  (Short: \p{CI=Y}, \p{CI}) (1632)
923          \p{Cased}               \p{Cased=Y} (3408)
924          \p{Cased: N*}           (Single: \P{Cased}) (1_110_704)
925          \p{Cased: Y*}           (Single: \p{Cased}) (3408)
926          \p{Cased_Letter}        \p{General_Category=Cased_Letter} (Short:
927                                    \p{LC}) (3207)
928          \p{Category: *}         \p{General_Category: *}
929          \p{Cc}                  \p{Cntrl} (= \p{General_Category=Control})
930                                    (65)
931          \p{Ccc: *}              \p{Canonical_Combining_Class: *}
932          \p{CE}                  \p{Composition_Exclusion} (=
933                                    \p{Composition_Exclusion=Y}) (81)
934          \p{CE: *}               \p{Composition_Exclusion: *}
935          \p{Cf}                  \p{Format} (= \p{General_Category=Format})
936                                    (140)
937          \p{Cham}                \p{Script=Cham} (NOT \p{Block=Cham}) (83)
938          \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
939                                    \p{CWCF}) (1093)
940          \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
941                                    (1_113_019)
942          \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
943                                    (1093)
944          \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
945                                    \p{CWCM}) (2110)
946          \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
947                                    (1_112_002)
948          \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
949                                    (2110)
950          \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
951                                    \p{CWL}) (1029)
952          \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
953                                    (1_113_083)
954          \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1029)
955          \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
956                                    Y} (Short: \p{CWKCF}) (9740)
957          \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
958                                    \P{CWKCF}) (1_104_372)
959          \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
960                                    \p{CWKCF}) (9740)
961          \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
962                                    \p{CWT}) (1085)
963          \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
964                                    (1_113_027)
965          \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1085)
966          \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
967                                    \p{CWU}) (1112)
968          \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
969                                    (1_113_000)
970          \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1112)
971          \p{Cher}                \p{Cherokee} (= \p{Script=Cherokee}) (NOT
972                                    \p{Block=Cherokee}) (85)
973          \p{Cherokee}            \p{Script=Cherokee} (Short: \p{Cher}; NOT
974                                    \p{Block=Cherokee}) (85)
975          \p{CI}                  \p{Case_Ignorable} (= \p{Case_Ignorable=
976                                    Y}) (1632)
977          \p{CI: *}               \p{Case_Ignorable: *}
978        X \p{CJK_Compatibility}   \p{Block=CJK_Compatibility} (256)
979        X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms} (32)
980        X \p{CJK_Compatibility_Ideographs} \p{Block=
981                                    CJK_Compatibility_Ideographs} (512)
982        X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
983                                    CJK_Compatibility_Ideographs_Supplement}
984                                    (544)
985        X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement} (128)
986        X \p{CJK_Strokes}         \p{Block=CJK_Strokes} (48)
987        X \p{CJK_Symbols_And_Punctuation} \p{Block=
988                                    CJK_Symbols_And_Punctuation} (64)
989        X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
990                                    (20_992)
991        X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
992                                    CJK_Unified_Ideographs_Extension_A}
993                                    (6592)
994        X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
995                                    CJK_Unified_Ideographs_Extension_B}
996                                    (42_720)
997        X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
998                                    CJK_Unified_Ideographs_Extension_C}
999                                    (4160)
1000          \p{Close_Punctuation}   \p{General_Category=Close_Punctuation}
1001                                    (Short: \p{Pe}) (71)
1002          \p{Cn}                  \p{Unassigned} (= \p{General_Category=
1003                                    Unassigned}) (867_235)
1004          \p{Cntrl}               \p{General_Category=Control} Control
1005                                    characters (Short: \p{Cc}) (65)
1006          \p{Co}                  \p{Private_Use} (= \p{General_Category=
1007                                    Private_Use}) (NOT \p{Private_Use_Area})
1008                                    (137_468)
1009        X \p{Combining_Diacritical_Marks} \p{Block=
1010                                    Combining_Diacritical_Marks} (112)
1011        X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
1012                                    Combining_Diacritical_Marks_For_Symbols}
1013                                    (Short: \p{InCombiningMarksForSymbols})
1014                                    (48)
1015        X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
1016                                    Combining_Diacritical_Marks_Supplement}
1017                                    (64)
1018        X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (16)
1019        X \p{Combining_Marks_For_Symbols}
1020                                    \p{Combining_Diacritical_Marks_For_-
1021                                    Symbols} (= \p{Block=
1022                                    Combining_Diacritical_Marks_For_-
1023                                    Symbols}) (48)
1024          \p{Common}              \p{Script=Common} (Short: \p{Zyyy}) (5395)
1025        X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1026                                    (16)
1027          \p{Comp_Ex}             \p{Full_Composition_Exclusion} (=
1028                                    \p{Full_Composition_Exclusion=Y}) (1118)
1029          \p{Comp_Ex: *}          \p{Full_Composition_Exclusion: *}
1030          \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
1031                                    \p{CE}) (81)
1032          \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031)
1033          \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
1034          \p{Connector_Punctuation} \p{General_Category=
1035                                    Connector_Punctuation} (Short: \p{Pc})
1036                                    (10)
1037          \p{Control}             \p{Cntrl} (= \p{General_Category=Control})
1038                                    (65)
1039        X \p{Control_Pictures}    \p{Block=Control_Pictures} (64)
1040          \p{Copt}                \p{Coptic} (= \p{Script=Coptic}) (NOT
1041                                    \p{Block=Coptic}) (135)
1042          \p{Coptic}              \p{Script=Coptic} (Short: \p{Copt}; NOT
1043                                    \p{Block=Coptic}) (135)
1044        X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (32)
1045          \p{Cprt}                \p{Cypriot} (= \p{Script=Cypriot}) (55)
1046          \p{Cs}                  \p{Surrogate} (= \p{General_Category=
1047                                    Surrogate}) (2048)
1048          \p{Cuneiform}           \p{Script=Cuneiform} (Short: \p{Xsux}; NOT
1049                                    \p{Block=Cuneiform}) (982)
1050        X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
1051                                    Cuneiform_Numbers_And_Punctuation} (128)
1052          \p{Currency_Symbol}     \p{General_Category=Currency_Symbol}
1053                                    (Short: \p{Sc}) (46)
1054        X \p{Currency_Symbols}    \p{Block=Currency_Symbols} (48)
1055          \p{CWCF}                \p{Changes_When_Casefolded} (=
1056                                    \p{Changes_When_Casefolded=Y}) (1093)
1057          \p{CWCF: *}             \p{Changes_When_Casefolded: *}
1058          \p{CWCM}                \p{Changes_When_Casemapped} (=
1059                                    \p{Changes_When_Casemapped=Y}) (2110)
1060          \p{CWCM: *}             \p{Changes_When_Casemapped: *}
1061          \p{CWKCF}               \p{Changes_When_NFKC_Casefolded} (=
1062                                    \p{Changes_When_NFKC_Casefolded=Y})
1063                                    (9740)
1064          \p{CWKCF: *}            \p{Changes_When_NFKC_Casefolded: *}
1065          \p{CWL}                 \p{Changes_When_Lowercased} (=
1066                                    \p{Changes_When_Lowercased=Y}) (1029)
1067          \p{CWL: *}              \p{Changes_When_Lowercased: *}
1068          \p{CWT}                 \p{Changes_When_Titlecased} (=
1069                                    \p{Changes_When_Titlecased=Y}) (1085)
1070          \p{CWT: *}              \p{Changes_When_Titlecased: *}
1071          \p{CWU}                 \p{Changes_When_Uppercased} (=
1072                                    \p{Changes_When_Uppercased=Y}) (1112)
1073          \p{CWU: *}              \p{Changes_When_Uppercased: *}
1074          \p{Cypriot}             \p{Script=Cypriot} (Short: \p{Cprt}) (55)
1075        X \p{Cypriot_Syllabary}   \p{Block=Cypriot_Syllabary} (64)
1076          \p{Cyrillic}            \p{Script=Cyrillic} (Short: \p{Cyrl}; NOT
1077                                    \p{Block=Cyrillic}) (404)
1078        X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (32)
1079        X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (96)
1080        X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (48)
1081        X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
1082                                    Cyrillic_Supplement}) (48)
1083          \p{Cyrl}                \p{Cyrillic} (= \p{Script=Cyrillic}) (NOT
1084                                    \p{Block=Cyrillic}) (404)
1085          \p{Dash}                \p{Dash=Y} (25)
1086          \p{Dash: N*}            (Single: \P{Dash}) (1_114_087)
1087          \p{Dash: Y*}            (Single: \p{Dash}) (25)
1088          \p{Dash_Punctuation}    \p{General_Category=Dash_Punctuation}
1089                                    (Short: \p{Pd}) (21)
1090          \p{Decimal_Number}      \p{Digit} (= \p{General_Category=
1091                                    Decimal_Number}) (411)
1092          \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
1093                                    (13_221)
1094          \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_221)
1095          \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (238)
1096          \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
1097          \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
1098          \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (238)
1099          \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
1100          \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
1101          \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1043)
1102          \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
1103          \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
1104          \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
1105          \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
1106          \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
1107          \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
1108          \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
1109          \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
1110          \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
1111          \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
1112          \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
1113          \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
1114          \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
1115                                    Non_Canonical} (Perl extension) (3467)
1116          \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
1117                                    decompositions (Short: \p{Dt=NonCanon})
1118                                    (Perl extension) (3467)
1119          \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_424)
1120          \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
1121          \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
1122          \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (251)
1123          \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (251)
1124          \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (30)
1125          \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (142)
1126          \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (142)
1127          \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
1128          \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
1129          \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
1130          \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
1131                                    Y} (Short: \p{DI}) (4167)
1132          \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
1133                                    (1_109_945)
1134          \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
1135                                    (4167)
1136          \p{Dep}                 \p{Deprecated} (= \p{Deprecated=Y}) (110)
1137          \p{Dep: *}              \p{Deprecated: *}
1138          \p{Deprecated}          \p{Deprecated=Y} (Short: \p{Dep}) (110)
1139          \p{Deprecated: N*}      (Short: \p{Dep=N}, \P{Dep}) (1_114_002)
1140          \p{Deprecated: Y*}      (Short: \p{Dep=Y}, \p{Dep}) (110)
1141          \p{Deseret}             \p{Script=Deseret} (Short: \p{Dsrt}) (80)
1142          \p{Deva}                \p{Devanagari} (= \p{Script=Devanagari})
1143                                    (NOT \p{Block=Devanagari}) (140)
1144          \p{Devanagari}          \p{Script=Devanagari} (Short: \p{Deva};
1145                                    NOT \p{Block=Devanagari}) (140)
1146        X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (32)
1147          \p{DI}                  \p{Default_Ignorable_Code_Point} (=
1148                                    \p{Default_Ignorable_Code_Point=Y})
1149                                    (4167)
1150          \p{DI: *}               \p{Default_Ignorable_Code_Point: *}
1151          \p{Dia}                 \p{Diacritic} (= \p{Diacritic=Y}) (639)
1152          \p{Dia: *}              \p{Diacritic: *}
1153          \p{Diacritic}           \p{Diacritic=Y} (Short: \p{Dia}) (639)
1154          \p{Diacritic: N*}       (Short: \p{Dia=N}, \P{Dia}) (1_113_473)
1155          \p{Diacritic: Y*}       (Short: \p{Dia=Y}, \p{Dia}) (639)
1156          \p{Digit}               \p{General_Category=Decimal_Number} \d,
1157                                    extended beyond just [0-9] (Short:
1158                                    \p{Nd}) (411)
1159        X \p{Dingbats}            \p{Block=Dingbats} (192)
1160        X \p{Domino_Tiles}        \p{Block=Domino_Tiles} (112)
1161          \p{Dsrt}                \p{Deseret} (= \p{Script=Deseret}) (80)
1162          \p{Dt: *}               \p{Decomposition_Type: *}
1163          \p{Ea: *}               \p{East_Asian_Width: *}
1164          \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_666)
1165          \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_666)
1166          \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
1167          \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
1168          \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
1169          \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
1170          \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (801_909)
1171          \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
1172          \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
1173          \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (801_909)
1174          \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (173_199)
1175          \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (173_199)
1176          \p{Egyp}                \p{Egyptian_Hieroglyphs} (= \p{Script=
1177                                    Egyptian_Hieroglyphs}) (NOT \p{Block=
1178                                    Egyptian_Hieroglyphs}) (1071)
1179          \p{Egyptian_Hieroglyphs} \p{Script=Egyptian_Hieroglyphs} (Short:
1180                                    \p{Egyp}; NOT \p{Block=
1181                                    Egyptian_Hieroglyphs}) (1071)
1182        X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
1183                                    Enclosed_Alphanumeric_Supplement} (256)
1184        X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics} (160)
1185        X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
1186                                    Enclosed_CJK_Letters_And_Months} (256)
1187        X \p{Enclosed_Ideographic_Supplement} \p{Block=
1188                                    Enclosed_Ideographic_Supplement} (256)
1189          \p{Enclosing_Mark}      \p{General_Category=Enclosing_Mark}
1190                                    (Short: \p{Me}) (13)
1191          \p{Ethi}                \p{Ethiopic} (= \p{Script=Ethiopic}) (NOT
1192                                    \p{Block=Ethiopic}) (461)
1193          \p{Ethiopic}            \p{Script=Ethiopic} (Short: \p{Ethi}; NOT
1194                                    \p{Block=Ethiopic}) (461)
1195        X \p{Ethiopic_Extended}   \p{Block=Ethiopic_Extended} (96)
1196        X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (32)
1197          \p{Ext}                 \p{Extender} (= \p{Extender=Y}) (28)
1198          \p{Ext: *}              \p{Extender: *}
1199          \p{Extender}            \p{Extender=Y} (Short: \p{Ext}) (28)
1200          \p{Extender: N*}        (Short: \p{Ext=N}, \P{Ext}) (1_114_084)
1201          \p{Extender: Y*}        (Short: \p{Ext=Y}, \p{Ext}) (28)
1202          \p{Final_Punctuation}   \p{General_Category=Final_Punctuation}
1203                                    (Short: \p{Pf}) (10)
1204          \p{Format}              \p{General_Category=Format} (Short:
1205                                    \p{Cf}) (140)
1206          \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
1207                                    (Short: \p{CompEx}) (1118)
1208          \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
1209                                    \P{CompEx}) (1_112_994)
1210          \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
1211                                    \p{CompEx}) (1118)
1212          \p{Gc: *}               \p{General_Category: *}
1213          \p{GCB: *}              \p{Grapheme_Cluster_Break: *}
1214          \p{General_Category: C} \p{General_Category=Other} (1_006_956)
1215          \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
1216                                    \p{Gc=LC}, \p{LC}) (3207)
1217          \p{General_Category: Cc} \p{General_Category=Control} (65)
1218          \p{General_Category: Cf} \p{General_Category=Format} (140)
1219          \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
1220                                    (71)
1221          \p{General_Category: Cn} \p{General_Category=Unassigned} (867_235)
1222          \p{General_Category: Cntrl} \p{General_Category=Control} (65)
1223          \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
1224          \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
1225                                    \p{Pc}) (10)
1226          \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
1227          \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
1228          \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
1229                                    (46)
1230          \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
1231                                    (21)
1232          \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
1233                                    (411)
1234          \p{General_Category: Digit} \p{General_Category=Decimal_Number}
1235                                    (411)
1236          \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
1237                                    (13)
1238          \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
1239                                    (10)
1240          \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (140)
1241          \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
1242                                    \p{Pi}) (12)
1243          \p{General_Category: L} \p{General_Category=Letter} (99_537)
1244        X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3207)
1245        X \p{General_Category: L_} \p{General_Category=Cased_Letter} (3207)
1246          \p{General_Category: LC} \p{General_Category=Cased_Letter} (3207)
1247          \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (99_537)
1248          \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
1249                                    (224)
1250          \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
1251          \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
1252                                    (1749)
1253          \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (202)
1254          \p{General_Category: Lo} \p{General_Category=Other_Letter} (96_128)
1255          \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll})
1256                                    (1749)
1257          \p{General_Category: Lt} \p{General_Category=Titlecase_Letter} (31)
1258          \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
1259                                    (1427)
1260          \p{General_Category: M} \p{General_Category=Mark} (1451)
1261          \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (1451)
1262          \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (945)
1263          \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (276)
1264          \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
1265          \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
1266                                    (1162)
1267          \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
1268                                    (202)
1269          \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
1270                                    (99)
1271          \p{General_Category: N} \p{General_Category=Number} (1064)
1272          \p{General_Category: Nd} \p{General_Category=Decimal_Number} (411)
1273          \p{General_Category: Nl} \p{General_Category=Letter_Number} (224)
1274          \p{General_Category: No} \p{General_Category=Other_Number} (429)
1275          \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
1276                                    (1162)
1277          \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1064)
1278          \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
1279                                    (72)
1280          \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (1_006_956)
1281          \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
1282                                    (96_128)
1283          \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (429)
1284          \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
1285                                    (389)
1286          \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
1287                                    (3409)
1288          \p{General_Category: P} \p{General_Category=Punctuation} (585)
1289          \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
1290                                    \p{Zp}) (1)
1291          \p{General_Category: Pc} \p{General_Category=
1292                                    Connector_Punctuation} (10)
1293          \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (21)
1294          \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
1295                                    (71)
1296          \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
1297                                    (10)
1298          \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
1299                                    (12)
1300          \p{General_Category: Po} \p{General_Category=Other_Punctuation}
1301                                    (389)
1302          \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
1303                                    (137_468)
1304          \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (72)
1305          \p{General_Category: Punct} \p{General_Category=Punctuation} (585)
1306          \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (585)
1307          \p{General_Category: S} \p{General_Category=Symbol} (4499)
1308          \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (46)
1309          \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (20)
1310          \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (99)
1311          \p{General_Category: Sm} \p{General_Category=Math_Symbol} (945)
1312          \p{General_Category: So} \p{General_Category=Other_Symbol} (3409)
1313          \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
1314                                    (18)
1315          \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (276)
1316          \p{General_Category: Surrogate} Mostly not usable in Perl. (Short:
1317                                    \p{Gc=Cs}, \p{Cs}) (2048)
1318          \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (4499)
1319          \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt})
1320                                    (31)
1321          \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
1322                                    (867_235)
1323          \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu})
1324                                    (1427)
1325          \p{General_Category: Z} \p{General_Category=Separator} (20)
1326          \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
1327          \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
1328                                    (1)
1329          \p{General_Category: Zs} \p{General_Category=Space_Separator} (18)
1330        X \p{General_Punctuation} \p{Block=General_Punctuation} (112)
1331        X \p{Geometric_Shapes}    \p{Block=Geometric_Shapes} (96)
1332          \p{Geor}                \p{Georgian} (= \p{Script=Georgian}) (NOT
1333                                    \p{Block=Georgian}) (120)
1334          \p{Georgian}            \p{Script=Georgian} (Short: \p{Geor}; NOT
1335                                    \p{Block=Georgian}) (120)
1336        X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (48)
1337          \p{Glag}                \p{Glagolitic} (= \p{Script=Glagolitic})
1338                                    (NOT \p{Block=Glagolitic}) (94)
1339          \p{Glagolitic}          \p{Script=Glagolitic} (Short: \p{Glag};
1340                                    NOT \p{Block=Glagolitic}) (94)
1341          \p{Goth}                \p{Gothic} (= \p{Script=Gothic}) (NOT
1342                                    \p{Block=Gothic}) (27)
1343          \p{Gothic}              \p{Script=Gothic} (Short: \p{Goth}; NOT
1344                                    \p{Block=Gothic}) (27)
1345          \p{Gr_Base}             \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
1346                                    (105_958)
1347          \p{Gr_Base: *}          \p{Grapheme_Base: *}
1348          \p{Gr_Ext}              \p{Grapheme_Extend} (= \p{Grapheme_Extend=
1349                                    Y}) (1198)
1350          \p{Gr_Ext: *}           \p{Grapheme_Extend: *}
1351          \p{Graph}               Characters that are graphical (244_744)
1352          \p{Grapheme_Base}       \p{Grapheme_Base=Y} (Short: \p{GrBase})
1353                                    (105_958)
1354          \p{Grapheme_Base: N*}   (Short: \p{GrBase=N}, \P{GrBase})
1355                                    (1_008_154)
1356          \p{Grapheme_Base: Y*}   (Short: \p{GrBase=Y}, \p{GrBase}) (105_958)
1357          \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
1358                                    (203)
1359          \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (203)
1360          \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
1361          \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
1362                                    (1205)
1363          \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1205)
1364          \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
1365          \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
1366          \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
1367          \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
1368          \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_100_901)
1369          \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
1370                                    (15)
1371          \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (15)
1372          \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
1373                                    SpacingMark} (257)
1374          \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (257)
1375          \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
1376          \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
1377          \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
1378                                    (1_100_901)
1379          \p{Grapheme_Extend}     \p{Grapheme_Extend=Y} (Short: \p{GrExt})
1380                                    (1198)
1381          \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_914)
1382          \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1198)
1383          \p{Greek}               \p{Script=Greek} (Short: \p{Grek}; NOT
1384                                    \p{Greek_And_Coptic}) (511)
1385        X \p{Greek_And_Coptic}    \p{Block=Greek_And_Coptic} (Short:
1386                                    \p{InGreek}) (144)
1387        X \p{Greek_Extended}      \p{Block=Greek_Extended} (256)
1388          \p{Grek}                \p{Greek} (= \p{Script=Greek}) (NOT
1389                                    \p{Greek_And_Coptic}) (511)
1390          \p{Gujarati}            \p{Script=Gujarati} (Short: \p{Gujr}; NOT
1391                                    \p{Block=Gujarati}) (83)
1392          \p{Gujr}                \p{Gujarati} (= \p{Script=Gujarati}) (NOT
1393                                    \p{Block=Gujarati}) (83)
1394          \p{Gurmukhi}            \p{Script=Gurmukhi} (Short: \p{Guru}; NOT
1395                                    \p{Block=Gurmukhi}) (79)
1396          \p{Guru}                \p{Gurmukhi} (= \p{Script=Gurmukhi}) (NOT
1397                                    \p{Block=Gurmukhi}) (79)
1398        X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
1399                                    Halfwidth_And_Fullwidth_Forms} (240)
1400          \p{Han}                 \p{Script=Han} (75_738)
1401          \p{Hang}                \p{Hangul} (= \p{Script=Hangul}) (11_737)
1402          \p{Hangul}              \p{Script=Hangul} (Short: \p{Hang})
1403                                    (11_737)
1404        X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
1405                                    (96)
1406        X \p{Hangul_Jamo}         \p{Block=Hangul_Jamo} (256)
1407        X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A} (32)
1408        X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B} (80)
1409          \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
1410                                    (125)
1411          \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125)
1412          \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
1413                                    (399)
1414          \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399)
1415          \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
1416                                    LVT_Syllable} (10_773)
1417          \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
1418                                    (10_773)
1419          \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
1420                                    Not_Applicable} (1_102_583)
1421          \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
1422                                    (1_102_583)
1423          \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
1424                                    (137)
1425          \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137)
1426          \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
1427                                    (95)
1428          \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95)
1429        X \p{Hangul_Syllables}    \p{Block=Hangul_Syllables} (11_184)
1430          \p{Hani}                \p{Han} (= \p{Script=Han}) (75_738)
1431          \p{Hano}                \p{Hanunoo} (= \p{Script=Hanunoo}) (NOT
1432                                    \p{Block=Hanunoo}) (21)
1433          \p{Hanunoo}             \p{Script=Hanunoo} (Short: \p{Hano}; NOT
1434                                    \p{Block=Hanunoo}) (21)
1435          \p{Hebr}                \p{Hebrew} (= \p{Script=Hebrew}) (NOT
1436                                    \p{Block=Hebrew}) (133)
1437          \p{Hebrew}              \p{Script=Hebrew} (Short: \p{Hebr}; NOT
1438                                    \p{Block=Hebrew}) (133)
1439          \p{Hex}                 \p{XDigit} (= \p{Hex_Digit=Y}) (44)
1440          \p{Hex: *}              \p{Hex_Digit: *}
1441          \p{Hex_Digit}           \p{XDigit} (= \p{Hex_Digit=Y}) (44)
1442          \p{Hex_Digit: N*}       (Short: \p{Hex=N}, \P{Hex}) (1_114_068)
1443          \p{Hex_Digit: Y*}       (Short: \p{Hex=Y}, \p{Hex}) (44)
1444        X \p{High_Private_Use_Surrogates} \p{Block=
1445                                    High_Private_Use_Surrogates} (128)
1446        X \p{High_Surrogates}     \p{Block=High_Surrogates} (896)
1447          \p{Hira}                \p{Hiragana} (= \p{Script=Hiragana}) (NOT
1448                                    \p{Block=Hiragana}) (90)
1449          \p{Hiragana}            \p{Script=Hiragana} (Short: \p{Hira}; NOT
1450                                    \p{Block=Hiragana}) (90)
1451          \p{HorizSpace}          \p{Blank} (19)
1452          \p{Hst: *}              \p{Hangul_Syllable_Type: *}
1453        S \p{Hyphen}              \p{Hyphen=Y} (11)
1454        S \p{Hyphen: N*}          Use the Line_Break property instead; see
1455                                    www.unicode.org/reports/tr14 (Single:
1456                                    \P{Hyphen}) (1_114_101)
1457        S \p{Hyphen: Y*}          Use the Line_Break property instead; see
1458                                    www.unicode.org/reports/tr14 (Single:
1459                                    \p{Hyphen}) (11)
1460          \p{ID_Continue}         \p{ID_Continue=Y} (Short: \p{IDC})
1461                                    (101_634)
1462          \p{ID_Continue: N*}     (Short: \p{IDC=N}, \P{IDC}) (1_012_478)
1463          \p{ID_Continue: Y*}     (Short: \p{IDC=Y}, \p{IDC}) (101_634)
1464          \p{ID_Start}            \p{ID_Start=Y} (Short: \p{IDS}) (99_764)
1465          \p{ID_Start: N*}        (Short: \p{IDS=N}, \P{IDS}) (1_014_348)
1466          \p{ID_Start: Y*}        (Short: \p{IDS=Y}, \p{IDS}) (99_764)
1467          \p{IDC}                 \p{ID_Continue} (= \p{ID_Continue=Y})
1468                                    (101_634)
1469          \p{IDC: *}              \p{ID_Continue: *}
1470          \p{Ideo}                \p{Ideographic} (= \p{Ideographic=Y})
1471                                    (75_408)
1472          \p{Ideo: *}             \p{Ideographic: *}
1473          \p{Ideographic}         \p{Ideographic=Y} (Short: \p{Ideo})
1474                                    (75_408)
1475          \p{Ideographic: N*}     (Short: \p{Ideo=N}, \P{Ideo}) (1_038_704)
1476          \p{Ideographic: Y*}     (Short: \p{Ideo=Y}, \p{Ideo}) (75_408)
1477        X \p{Ideographic_Description_Characters} \p{Block=
1478                                    Ideographic_Description_Characters} (16)
1479          \p{IDS}                 \p{ID_Start} (= \p{ID_Start=Y}) (99_764)
1480          \p{IDS: *}              \p{ID_Start: *}
1481          \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
1482                                    \p{IDSB}) (10)
1483          \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
1484                                    (1_114_102)
1485          \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10)
1486          \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
1487                                    \p{IDST}) (2)
1488          \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
1489                                    (1_114_110)
1490          \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2)
1491          \p{IDSB}                \p{IDS_Binary_Operator} (=
1492                                    \p{IDS_Binary_Operator=Y}) (10)
1493          \p{IDSB: *}             \p{IDS_Binary_Operator: *}
1494          \p{IDST}                \p{IDS_Trinary_Operator} (=
1495                                    \p{IDS_Trinary_Operator=Y}) (2)
1496          \p{IDST: *}             \p{IDS_Trinary_Operator: *}
1497          \p{Imperial_Aramaic}    \p{Script=Imperial_Aramaic} (Short:
1498                                    \p{Armi}; NOT \p{Block=
1499                                    Imperial_Aramaic}) (31)
1500          \p{In: *}               \p{Present_In: *} (Perl extension)
1501          \p{In_*}                \p{Block: *}
1502          \p{Inherited}           \p{Script=Inherited} (Short: \p{Zinh})
1503                                    (523)
1504          \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
1505                                    (Short: \p{Pi}) (12)
1506          \p{Inscriptional_Pahlavi} \p{Script=Inscriptional_Pahlavi} (Short:
1507                                    \p{Phli}; NOT \p{Block=
1508                                    Inscriptional_Pahlavi}) (27)
1509          \p{Inscriptional_Parthian} \p{Script=Inscriptional_Parthian}
1510                                    (Short: \p{Prti}; NOT \p{Block=
1511                                    Inscriptional_Parthian}) (30)
1512        X \p{IPA_Extensions}      \p{Block=IPA_Extensions} (96)
1513          \p{Is_*}                \p{*} (Any exceptions are individually
1514                                    noted beginning with the word NOT.) If
1515                                    an entry has flag(s) at its beginning,
1516                                    like 'D', the 'Is_' form has the same
1517                                    flag(s)
1518          \p{Ital}                \p{Old_Italic} (= \p{Script=Old_Italic})
1519                                    (NOT \p{Block=Old_Italic}) (35)
1520          \p{Java}                \p{Javanese} (= \p{Script=Javanese}) (NOT
1521                                    \p{Block=Javanese}) (91)
1522          \p{Javanese}            \p{Script=Javanese} (Short: \p{Java}; NOT
1523                                    \p{Block=Javanese}) (91)
1524          \p{Jg: *}               \p{Joining_Group: *}
1525          \p{Join_C}              \p{Join_Control} (= \p{Join_Control=Y}) (2)
1526          \p{Join_C: *}           \p{Join_Control: *}
1527          \p{Join_Control}        \p{Join_Control=Y} (Short: \p{JoinC}) (2)
1528          \p{Join_Control: N*}    (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110)
1529          \p{Join_Control: Y*}    (Short: \p{JoinC=Y}, \p{JoinC}) (2)
1530          \p{Joining_Group: Ain}  (Short: \p{Jg=Ain}) (7)
1531          \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1)
1532          \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10)
1533          \p{Joining_Group: Beh}  (Short: \p{Jg=Beh}) (19)
1534          \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2)
1535          \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
1536                                    BurushaskiYehBarree}) (2)
1537          \p{Joining_Group: Dal}  (Short: \p{Jg=Dal}) (14)
1538          \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4)
1539          \p{Joining_Group: E}    (Short: \p{Jg=E}) (1)
1540          \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7)
1541          \p{Joining_Group: Fe}   (Short: \p{Jg=Fe}) (1)
1542          \p{Joining_Group: Feh}  (Short: \p{Jg=Feh}) (9)
1543          \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1)
1544          \p{Joining_Group: Gaf}  (Short: \p{Jg=Gaf}) (13)
1545          \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3)
1546          \p{Joining_Group: Hah}  (Short: \p{Jg=Hah}) (17)
1547          \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
1548                                    HamzaOnHehGoal}) (1)
1549          \p{Joining_Group: He}   (Short: \p{Jg=He}) (1)
1550          \p{Joining_Group: Heh}  (Short: \p{Jg=Heh}) (1)
1551          \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2)
1552          \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1)
1553          \p{Joining_Group: Kaf}  (Short: \p{Jg=Kaf}) (5)
1554          \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1)
1555          \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1)
1556          \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2)
1557          \p{Joining_Group: Lam}  (Short: \p{Jg=Lam}) (6)
1558          \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1)
1559          \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (3)
1560          \p{Joining_Group: Mim}  (Short: \p{Jg=Mim}) (1)
1561          \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
1562                                    (1_113_883)
1563          \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8)
1564          \p{Joining_Group: Nun}  (Short: \p{Jg=Nun}) (1)
1565          \p{Joining_Group: Nya}  (Short: \p{Jg=Nya}) (1)
1566          \p{Joining_Group: Pe}   (Short: \p{Jg=Pe}) (1)
1567          \p{Joining_Group: Qaf}  (Short: \p{Jg=Qaf}) (4)
1568          \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1)
1569          \p{Joining_Group: Reh}  (Short: \p{Jg=Reh}) (16)
1570          \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1)
1571          \p{Joining_Group: Sad}  (Short: \p{Jg=Sad}) (5)
1572          \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1)
1573          \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11)
1574          \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1)
1575          \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1)
1576          \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1)
1577          \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1)
1578          \p{Joining_Group: Tah}  (Short: \p{Jg=Tah}) (3)
1579          \p{Joining_Group: Taw}  (Short: \p{Jg=Taw}) (1)
1580          \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3)
1581          \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2)
1582          \p{Joining_Group: Waw}  (Short: \p{Jg=Waw}) (15)
1583          \p{Joining_Group: Yeh}  (Short: \p{Jg=Yeh}) (7)
1584          \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2)
1585          \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1)
1586          \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1)
1587          \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1)
1588          \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1)
1589          \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1)
1590          \p{Joining_Type: C}     \p{Joining_Type=Join_Causing} (3)
1591          \p{Joining_Type: D}     \p{Joining_Type=Dual_Joining} (188)
1592          \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (188)
1593          \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (3)
1594          \p{Joining_Type: L}     \p{Joining_Type=Left_Joining} (0)
1595          \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (0)
1596          \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_112_539)
1597          \p{Joining_Type: R}     \p{Joining_Type=Right_Joining} (74)
1598          \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (74)
1599          \p{Joining_Type: T}     \p{Joining_Type=Transparent} (1308)
1600          \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1308)
1601          \p{Joining_Type: U}     \p{Joining_Type=Non_Joining} (1_112_539)
1602          \p{Jt: *}               \p{Joining_Type: *}
1603          \p{Kaithi}              \p{Script=Kaithi} (Short: \p{Kthi}; NOT
1604                                    \p{Block=Kaithi}) (66)
1605          \p{Kali}                \p{Kayah_Li} (= \p{Script=Kayah_Li}) (48)
1606          \p{Kana}                \p{Katakana} (= \p{Script=Katakana}) (NOT
1607                                    \p{Block=Katakana}) (299)
1608        X \p{Kanbun}              \p{Block=Kanbun} (16)
1609        X \p{Kangxi_Radicals}     \p{Block=Kangxi_Radicals} (224)
1610          \p{Kannada}             \p{Script=Kannada} (Short: \p{Knda}; NOT
1611                                    \p{Block=Kannada}) (84)
1612          \p{Katakana}            \p{Script=Katakana} (Short: \p{Kana}; NOT
1613                                    \p{Block=Katakana}) (299)
1614        X \p{Katakana_Phonetic_Extensions} \p{Block=
1615                                    Katakana_Phonetic_Extensions} (16)
1616          \p{Kayah_Li}            \p{Script=Kayah_Li} (Short: \p{Kali}) (48)
1617          \p{Khar}                \p{Kharoshthi} (= \p{Script=Kharoshthi})
1618                                    (NOT \p{Block=Kharoshthi}) (65)
1619          \p{Kharoshthi}          \p{Script=Kharoshthi} (Short: \p{Khar};
1620                                    NOT \p{Block=Kharoshthi}) (65)
1621          \p{Khmer}               \p{Script=Khmer} (Short: \p{Khmr}; NOT
1622                                    \p{Block=Khmer}) (146)
1623        X \p{Khmer_Symbols}       \p{Block=Khmer_Symbols} (32)
1624          \p{Khmr}                \p{Khmer} (= \p{Script=Khmer}) (NOT
1625                                    \p{Block=Khmer}) (146)
1626          \p{Knda}                \p{Kannada} (= \p{Script=Kannada}) (NOT
1627                                    \p{Block=Kannada}) (84)
1628          \p{Kthi}                \p{Kaithi} (= \p{Script=Kaithi}) (NOT
1629                                    \p{Block=Kaithi}) (66)
1630          \p{L}                   \p{Letter} (= \p{General_Category=Letter})
1631                                    (99_537)
1632          \p{L&}                  \p{Cased_Letter} (= \p{General_Category=
1633                                    Cased_Letter}) (3207)
1634          \p{L_}                  \p{Cased_Letter} (= \p{General_Category=
1635                                    Cased_Letter}) (3207)
1636          \p{Lana}                \p{Tai_Tham} (= \p{Script=Tai_Tham}) (NOT
1637                                    \p{Block=Tai_Tham}) (127)
1638          \p{Lao}                 \p{Script=Lao} (NOT \p{Block=Lao}) (65)
1639          \p{Laoo}                \p{Lao} (= \p{Script=Lao}) (NOT \p{Block=
1640                                    Lao}) (65)
1641          \p{Latin}               \p{Script=Latin} (Short: \p{Latn}) (1244)
1642        X \p{Latin_1}             \p{Latin_1_Supplement} (= \p{Block=
1643                                    Latin_1_Supplement}) (128)
1644        X \p{Latin_1_Supplement}  \p{Block=Latin_1_Supplement} (Short:
1645                                    \p{InLatin1}) (128)
1646        X \p{Latin_Extended_A}    \p{Block=Latin_Extended_A} (128)
1647        X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
1648                                    (256)
1649        X \p{Latin_Extended_B}    \p{Block=Latin_Extended_B} (208)
1650        X \p{Latin_Extended_C}    \p{Block=Latin_Extended_C} (32)
1651        X \p{Latin_Extended_D}    \p{Block=Latin_Extended_D} (224)
1652          \p{Latn}                \p{Latin} (= \p{Script=Latin}) (1244)
1653          \p{Lb: *}               \p{Line_Break: *}
1654          \p{LC}                  \p{Cased_Letter} (= \p{General_Category=
1655                                    Cased_Letter}) (3207)
1656          \p{Lepc}                \p{Lepcha} (= \p{Script=Lepcha}) (NOT
1657                                    \p{Block=Lepcha}) (74)
1658          \p{Lepcha}              \p{Script=Lepcha} (Short: \p{Lepc}; NOT
1659                                    \p{Block=Lepcha}) (74)
1660          \p{Letter}              \p{General_Category=Letter} (Short: \p{L})
1661                                    (99_537)
1662          \p{Letter_Number}       \p{General_Category=Letter_Number} (Short:
1663                                    \p{Nl}) (224)
1664        X \p{Letterlike_Symbols}  \p{Block=Letterlike_Symbols} (80)
1665          \p{Limb}                \p{Limbu} (= \p{Script=Limbu}) (NOT
1666                                    \p{Block=Limbu}) (66)
1667          \p{Limbu}               \p{Script=Limbu} (Short: \p{Limb}; NOT
1668                                    \p{Block=Limbu}) (66)
1669          \p{Linb}                \p{Linear_B} (= \p{Script=Linear_B}) (211)
1670          \p{Line_Break: AI}      \p{Line_Break=Ambiguous} (644)
1671          \p{Line_Break: AL}      \p{Line_Break=Alphabetic} (14_092)
1672          \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (14_092)
1673          \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (644)
1674          \p{Line_Break: B2}      \p{Line_Break=Break_Both} (1)
1675          \p{Line_Break: BA}      \p{Line_Break=Break_After} (137)
1676          \p{Line_Break: BB}      \p{Line_Break=Break_Before} (19)
1677          \p{Line_Break: BK}      \p{Line_Break=Mandatory_Break} (4)
1678          \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (137)
1679          \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (19)
1680          \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (1)
1681          \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1)
1682          \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1)
1683          \p{Line_Break: CB}      \p{Line_Break=Contingent_Break} (1)
1684          \p{Line_Break: CL}      \p{Line_Break=Close_Punctuation} (87)
1685          \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2)
1686          \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (87)
1687          \p{Line_Break: CM}      \p{Line_Break=Combining_Mark} (1436)
1688          \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (1436)
1689          \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (662)
1690          \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1)
1691          \p{Line_Break: CP}      \p{Line_Break=Close_Parenthesis} (2)
1692          \p{Line_Break: CR}      \p{Line_Break=Carriage_Return} (1)
1693          \p{Line_Break: EX}      \p{Line_Break=Exclamation} (34)
1694          \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (34)
1695          \p{Line_Break: GL}      \p{Line_Break=Glue} (16)
1696          \p{Line_Break: Glue}    (Short: \p{Lb=GL}) (16)
1697          \p{Line_Break: H2}      (Short: \p{Lb=H2}) (399)
1698          \p{Line_Break: H3}      (Short: \p{Lb=H3}) (10_773)
1699          \p{Line_Break: HY}      \p{Line_Break=Hyphen} (1)
1700          \p{Line_Break: Hyphen}  (Short: \p{Lb=HY}) (1)
1701          \p{Line_Break: ID}      \p{Line_Break=Ideographic} (161_775)
1702          \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (161_775)
1703          \p{Line_Break: IN}      \p{Line_Break=Inseparable} (4)
1704          \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13)
1705          \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (4)
1706          \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (4)
1707          \p{Line_Break: IS}      \p{Line_Break=Infix_Numeric} (13)
1708          \p{Line_Break: JL}      (Short: \p{Lb=JL}) (125)
1709          \p{Line_Break: JT}      (Short: \p{Lb=JT}) (137)
1710          \p{Line_Break: JV}      (Short: \p{Lb=JV}) (95)
1711          \p{Line_Break: LF}      \p{Line_Break=Line_Feed} (1)
1712          \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1)
1713          \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4)
1714          \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1)
1715          \p{Line_Break: NL}      \p{Line_Break=Next_Line} (1)
1716          \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (77)
1717          \p{Line_Break: NS}      \p{Line_Break=Nonstarter} (77)
1718          \p{Line_Break: NU}      \p{Line_Break=Numeric} (403)
1719          \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (403)
1720          \p{Line_Break: OP}      \p{Line_Break=Open_Punctuation} (81)
1721          \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (81)
1722          \p{Line_Break: PO}      \p{Line_Break=Postfix_Numeric} (28)
1723          \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (28)
1724          \p{Line_Break: PR}      \p{Line_Break=Prefix_Numeric} (43)
1725          \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (43)
1726          \p{Line_Break: QU}      \p{Line_Break=Quotation} (34)
1727          \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (34)
1728          \p{Line_Break: SA}      \p{Line_Break=Complex_Context} (662)
1729        D \p{Line_Break: SG}      \p{Line_Break=Surrogate} (2048)
1730          \p{Line_Break: SP}      \p{Line_Break=Space} (1)
1731          \p{Line_Break: Space}   (Short: \p{Lb=SP}) (1)
1732        D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
1733                                    should never appear in well-formed text,
1734                                    and therefore shouldn't be the basis for
1735                                    line breaking (Short: \p{Lb=SG}) (2048)
1736          \p{Line_Break: SY}      \p{Line_Break=Break_Symbols} (1)
1737          \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (920_933)
1738          \p{Line_Break: WJ}      \p{Line_Break=Word_Joiner} (2)
1739          \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2)
1740          \p{Line_Break: XX}      \p{Line_Break=Unknown} (920_933)
1741          \p{Line_Break: ZW}      \p{Line_Break=ZWSpace} (1)
1742          \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1)
1743          \p{Line_Separator}      \p{General_Category=Line_Separator}
1744                                    (Short: \p{Zl}) (1)
1745          \p{Linear_B}            \p{Script=Linear_B} (Short: \p{Linb}) (211)
1746        X \p{Linear_B_Ideograms}  \p{Block=Linear_B_Ideograms} (128)
1747        X \p{Linear_B_Syllabary}  \p{Block=Linear_B_Syllabary} (128)
1748          \p{Lisu}                \p{Script=Lisu} (48)
1749          \p{Ll}                  \p{Lowercase_Letter} (=
1750                                    \p{General_Category=Lowercase_Letter})
1751                                    (1749)
1752          \p{Lm}                  \p{Modifier_Letter} (=
1753                                    \p{General_Category=Modifier_Letter})
1754                                    (202)
1755          \p{Lo}                  \p{Other_Letter} (= \p{General_Category=
1756                                    Other_Letter}) (96_128)
1757          \p{LOE}                 \p{Logical_Order_Exception} (=
1758                                    \p{Logical_Order_Exception=Y}) (15)
1759          \p{LOE: *}              \p{Logical_Order_Exception: *}
1760          \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
1761                                    \p{LOE}) (15)
1762          \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
1763                                    (1_114_097)
1764          \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (15)
1765        X \p{Low_Surrogates}      \p{Block=Low_Surrogates} (1024)
1766          \p{Lower}               \p{Lowercase=Y} (1908)
1767          \p{Lower: *}            \p{Lowercase: *}
1768          \p{Lowercase}           \p{Lower} (= \p{Lowercase=Y}) (1908)
1769          \p{Lowercase: N*}       (Short: \p{Lower=N}, \P{Lower}) (1_112_204)
1770          \p{Lowercase: Y*}       (Short: \p{Lower=Y}, \p{Lower}) (1908)
1771          \p{Lowercase_Letter}    \p{General_Category=Lowercase_Letter}
1772                                    (Short: \p{Ll}) (1749)
1773          \p{Lt}                  \p{Title} (= \p{General_Category=
1774                                    Titlecase_Letter}) (31)
1775          \p{Lu}                  \p{Uppercase_Letter} (=
1776                                    \p{General_Category=Uppercase_Letter})
1777                                    (1427)
1778          \p{Lyci}                \p{Lycian} (= \p{Script=Lycian}) (NOT
1779                                    \p{Block=Lycian}) (29)
1780          \p{Lycian}              \p{Script=Lycian} (Short: \p{Lyci}; NOT
1781                                    \p{Block=Lycian}) (29)
1782          \p{Lydi}                \p{Lydian} (= \p{Script=Lydian}) (NOT
1783                                    \p{Block=Lydian}) (27)
1784          \p{Lydian}              \p{Script=Lydian} (Short: \p{Lydi}; NOT
1785                                    \p{Block=Lydian}) (27)
1786          \p{M}                   \p{Mark} (= \p{General_Category=Mark})
1787                                    (1451)
1788        X \p{Mahjong_Tiles}       \p{Block=Mahjong_Tiles} (48)
1789          \p{Malayalam}           \p{Script=Malayalam} (Short: \p{Mlym}; NOT
1790                                    \p{Block=Malayalam}) (95)
1791          \p{Mark}                \p{General_Category=Mark} (Short: \p{M})
1792                                    (1451)
1793          \p{Math}                \p{Math=Y} (2161)
1794          \p{Math: N*}            (Single: \P{Math}) (1_111_951)
1795          \p{Math: Y*}            (Single: \p{Math}) (2161)
1796          \p{Math_Symbol}         \p{General_Category=Math_Symbol} (Short:
1797                                    \p{Sm}) (945)
1798        X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
1799                                    Mathematical_Alphanumeric_Symbols} (1024)
1800        X \p{Mathematical_Operators} \p{Block=Mathematical_Operators} (256)
1801          \p{Mc}                  \p{Spacing_Mark} (= \p{General_Category=
1802                                    Spacing_Mark}) (276)
1803          \p{Me}                  \p{Enclosing_Mark} (= \p{General_Category=
1804                                    Enclosing_Mark}) (13)
1805          \p{Meetei_Mayek}        \p{Script=Meetei_Mayek} (Short: \p{Mtei};
1806                                    NOT \p{Block=Meetei_Mayek}) (56)
1807        X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
1808                                    Miscellaneous_Mathematical_Symbols_A}
1809                                    (48)
1810        X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
1811                                    Miscellaneous_Mathematical_Symbols_B}
1812                                    (128)
1813        X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (256)
1814        X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
1815                                    Miscellaneous_Symbols_And_Arrows} (256)
1816        X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical} (256)
1817          \p{Mlym}                \p{Malayalam} (= \p{Script=Malayalam})
1818                                    (NOT \p{Block=Malayalam}) (95)
1819          \p{Mn}                  \p{Nonspacing_Mark} (=
1820                                    \p{General_Category=Nonspacing_Mark})
1821                                    (1162)
1822          \p{Modifier_Letter}     \p{General_Category=Modifier_Letter}
1823                                    (Short: \p{Lm}) (202)
1824          \p{Modifier_Symbol}     \p{General_Category=Modifier_Symbol}
1825                                    (Short: \p{Sk}) (99)
1826        X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
1827          \p{Mong}                \p{Mongolian} (= \p{Script=Mongolian})
1828                                    (NOT \p{Block=Mongolian}) (153)
1829          \p{Mongolian}           \p{Script=Mongolian} (Short: \p{Mong}; NOT
1830                                    \p{Block=Mongolian}) (153)
1831          \p{Mtei}                \p{Meetei_Mayek} (= \p{Script=
1832                                    Meetei_Mayek}) (NOT \p{Block=
1833                                    Meetei_Mayek}) (56)
1834        X \p{Musical_Symbols}     \p{Block=Musical_Symbols} (256)
1835          \p{Myanmar}             \p{Script=Myanmar} (Short: \p{Mymr}; NOT
1836                                    \p{Block=Myanmar}) (188)
1837        X \p{Myanmar_Extended_A}  \p{Block=Myanmar_Extended_A} (32)
1838          \p{Mymr}                \p{Myanmar} (= \p{Script=Myanmar}) (NOT
1839                                    \p{Block=Myanmar}) (188)
1840          \p{N}                   \p{Number} (= \p{General_Category=Number})
1841                                    (1064)
1842          \p{NChar}               \p{Noncharacter_Code_Point} (=
1843                                    \p{Noncharacter_Code_Point=Y}) (66)
1844          \p{NChar: *}            \p{Noncharacter_Code_Point: *}
1845          \p{Nd}                  \p{Digit} (= \p{General_Category=
1846                                    Decimal_Number}) (411)
1847          \p{New_Tai_Lue}         \p{Script=New_Tai_Lue} (Short: \p{Talu};
1848                                    NOT \p{Block=New_Tai_Lue}) (83)
1849          \p{NFC_QC: *}           \p{NFC_Quick_Check: *}
1850          \p{NFC_Quick_Check: M}  \p{NFC_Quick_Check=Maybe} (103)
1851          \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (103)
1852          \p{NFC_Quick_Check: N}  \p{NFC_Quick_Check=No} (NOT
1853                                    \P{NFC_Quick_Check} NOR \P{NFC_QC} NOR
1854                                    \P{Is_NFC_Quick_Check} NOR
1855                                    \P{Is_NFC_QC}) (1118)
1856          \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
1857                                    \P{NFC_Quick_Check} NOR \P{NFC_QC} NOR
1858                                    \P{Is_NFC_Quick_Check} NOR
1859                                    \P{Is_NFC_QC}) (1118)
1860          \p{NFC_Quick_Check: Y}  \p{NFC_Quick_Check=Yes} (NOT
1861                                    \p{NFC_Quick_Check} NOR \p{NFC_QC} NOR
1862                                    \p{Is_NFC_Quick_Check} NOR
1863                                    \p{Is_NFC_QC}) (1_112_891)
1864          \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
1865                                    \p{NFC_Quick_Check} NOR \p{NFC_QC} NOR
1866                                    \p{Is_NFC_Quick_Check} NOR
1867                                    \p{Is_NFC_QC}) (1_112_891)
1868          \p{NFD_QC: *}           \p{NFD_Quick_Check: *}
1869          \p{NFD_Quick_Check: N}  \p{NFD_Quick_Check=No} (NOT
1870                                    \P{NFD_Quick_Check} NOR \P{NFD_QC} NOR
1871                                    \P{Is_NFD_Quick_Check} NOR
1872                                    \P{Is_NFD_QC}) (13_221)
1873          \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
1874                                    \P{NFD_Quick_Check} NOR \P{NFD_QC} NOR
1875                                    \P{Is_NFD_Quick_Check} NOR
1876                                    \P{Is_NFD_QC}) (13_221)
1877          \p{NFD_Quick_Check: Y}  \p{NFD_Quick_Check=Yes} (NOT
1878                                    \p{NFD_Quick_Check} NOR \p{NFD_QC} NOR
1879                                    \p{Is_NFD_Quick_Check} NOR
1880                                    \p{Is_NFD_QC}) (1_100_891)
1881          \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
1882                                    \p{NFD_Quick_Check} NOR \p{NFD_QC} NOR
1883                                    \p{Is_NFD_Quick_Check} NOR
1884                                    \p{Is_NFD_QC}) (1_100_891)
1885          \p{NFKC_QC: *}          \p{NFKC_Quick_Check: *}
1886          \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (103)
1887          \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (103)
1888          \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
1889                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC} NOR
1890                                    \P{Is_NFKC_Quick_Check} NOR
1891                                    \P{Is_NFKC_QC}) (4597)
1892          \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
1893                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC} NOR
1894                                    \P{Is_NFKC_Quick_Check} NOR
1895                                    \P{Is_NFKC_QC}) (4597)
1896          \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
1897                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC} NOR
1898                                    \p{Is_NFKC_Quick_Check} NOR
1899                                    \p{Is_NFKC_QC}) (1_109_412)
1900          \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
1901                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC} NOR
1902                                    \p{Is_NFKC_Quick_Check} NOR
1903                                    \p{Is_NFKC_QC}) (1_109_412)
1904          \p{NFKD_QC: *}          \p{NFKD_Quick_Check: *}
1905          \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
1906                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC} NOR
1907                                    \P{Is_NFKD_Quick_Check} NOR
1908                                    \P{Is_NFKD_QC}) (16_688)
1909          \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
1910                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC} NOR
1911                                    \P{Is_NFKD_Quick_Check} NOR
1912                                    \P{Is_NFKD_QC}) (16_688)
1913          \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
1914                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC} NOR
1915                                    \p{Is_NFKD_Quick_Check} NOR
1916                                    \p{Is_NFKD_QC}) (1_097_424)
1917          \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
1918                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC} NOR
1919                                    \p{Is_NFKD_Quick_Check} NOR
1920                                    \p{Is_NFKD_QC}) (1_097_424)
1921          \p{Nko}                 \p{Script=Nko} (NOT \p{NKo}) (59)
1922          \p{Nkoo}                \p{Nko} (= \p{Script=Nko}) (NOT \p{NKo})
1923                                    (59)
1924          \p{Nl}                  \p{Letter_Number} (= \p{General_Category=
1925                                    Letter_Number}) (224)
1926          \p{No}                  \p{Other_Number} (= \p{General_Category=
1927                                    Other_Number}) (429)
1928        X \p{No_Block}            \p{Block=No_Block} (864_192)
1929          \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
1930                                    \p{NChar}) (66)
1931          \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
1932                                    (1_114_046)
1933          \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
1934                                    (66)
1935          \p{Nonspacing_Mark}     \p{General_Category=Nonspacing_Mark}
1936                                    (Short: \p{Mn}) (1162)
1937          \p{Nt: *}               \p{Numeric_Type: *}
1938          \p{Number}              \p{General_Category=Number} (Short: \p{N})
1939                                    (1064)
1940        X \p{Number_Forms}        \p{Block=Number_Forms} (64)
1941          \p{Numeric_Type: De}    \p{Numeric_Type=Decimal} (411)
1942          \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (411)
1943          \p{Numeric_Type: Di}    \p{Numeric_Type=Digit} (118)
1944          \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (118)
1945          \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_971)
1946          \p{Numeric_Type: Nu}    \p{Numeric_Type=Numeric} (612)
1947          \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (612)
1948        T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1)
1949        T \p{Numeric_Value: 0}    (Short: \p{Nv=0}) (55)
1950        T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (2)
1951        T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (1)
1952        T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1)
1953        T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (4)
1954        T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1)
1955        T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (2)
1956        T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (2)
1957        T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (1)
1958        T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (8)
1959        T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (4)
1960        T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1)
1961        T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1)
1962        T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (9)
1963        T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1)
1964        T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1)
1965        T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (5)
1966        T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (5)
1967        T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1)
1968        T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (2)
1969        T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1)
1970        T \p{Numeric_Value: 1}    (Short: \p{Nv=1}) (91)
1971        T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1)
1972        T \p{Numeric_Value: 2}    (Short: \p{Nv=2}) (94)
1973        T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1)
1974        T \p{Numeric_Value: 3}    (Short: \p{Nv=3}) (96)
1975        T \p{Numeric_Value: 7/2}  (Short: \p{Nv=7/2}) (1)
1976        T \p{Numeric_Value: 4}    (Short: \p{Nv=4}) (87)
1977        T \p{Numeric_Value: 9/2}  (Short: \p{Nv=9/2}) (1)
1978        T \p{Numeric_Value: 5}    (Short: \p{Nv=5}) (84)
1979        T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
1980        T \p{Numeric_Value: 6}    (Short: \p{Nv=6}) (76)
1981        T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
1982        T \p{Numeric_Value: 7}    (Short: \p{Nv=7}) (75)
1983        T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
1984        T \p{Numeric_Value: 8}    (Short: \p{Nv=8}) (71)
1985        T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
1986        T \p{Numeric_Value: 9}    (Short: \p{Nv=9}) (75)
1987        T \p{Numeric_Value: 10}   (Short: \p{Nv=10}) (38)
1988        T \p{Numeric_Value: 11}   (Short: \p{Nv=11}) (6)
1989        T \p{Numeric_Value: 12}   (Short: \p{Nv=12}) (6)
1990        T \p{Numeric_Value: 13}   (Short: \p{Nv=13}) (4)
1991        T \p{Numeric_Value: 14}   (Short: \p{Nv=14}) (4)
1992        T \p{Numeric_Value: 15}   (Short: \p{Nv=15}) (4)
1993        T \p{Numeric_Value: 16}   (Short: \p{Nv=16}) (5)
1994        T \p{Numeric_Value: 17}   (Short: \p{Nv=17}) (5)
1995        T \p{Numeric_Value: 18}   (Short: \p{Nv=18}) (5)
1996        T \p{Numeric_Value: 19}   (Short: \p{Nv=19}) (5)
1997        T \p{Numeric_Value: 20}   (Short: \p{Nv=20}) (17)
1998        T \p{Numeric_Value: 21}   (Short: \p{Nv=21}) (1)
1999        T \p{Numeric_Value: 22}   (Short: \p{Nv=22}) (1)
2000        T \p{Numeric_Value: 23}   (Short: \p{Nv=23}) (1)
2001        T \p{Numeric_Value: 24}   (Short: \p{Nv=24}) (1)
2002        T \p{Numeric_Value: 25}   (Short: \p{Nv=25}) (1)
2003        T \p{Numeric_Value: 26}   (Short: \p{Nv=26}) (1)
2004        T \p{Numeric_Value: 27}   (Short: \p{Nv=27}) (1)
2005        T \p{Numeric_Value: 28}   (Short: \p{Nv=28}) (1)
2006        T \p{Numeric_Value: 29}   (Short: \p{Nv=29}) (1)
2007        T \p{Numeric_Value: 30}   (Short: \p{Nv=30}) (9)
2008        T \p{Numeric_Value: 31}   (Short: \p{Nv=31}) (1)
2009        T \p{Numeric_Value: 32}   (Short: \p{Nv=32}) (1)
2010        T \p{Numeric_Value: 33}   (Short: \p{Nv=33}) (1)
2011        T \p{Numeric_Value: 34}   (Short: \p{Nv=34}) (1)
2012        T \p{Numeric_Value: 35}   (Short: \p{Nv=35}) (1)
2013        T \p{Numeric_Value: 36}   (Short: \p{Nv=36}) (1)
2014        T \p{Numeric_Value: 37}   (Short: \p{Nv=37}) (1)
2015        T \p{Numeric_Value: 38}   (Short: \p{Nv=38}) (1)
2016        T \p{Numeric_Value: 39}   (Short: \p{Nv=39}) (1)
2017        T \p{Numeric_Value: 40}   (Short: \p{Nv=40}) (8)
2018        T \p{Numeric_Value: 41}   (Short: \p{Nv=41}) (1)
2019        T \p{Numeric_Value: 42}   (Short: \p{Nv=42}) (1)
2020        T \p{Numeric_Value: 43}   (Short: \p{Nv=43}) (1)
2021        T \p{Numeric_Value: 44}   (Short: \p{Nv=44}) (1)
2022        T \p{Numeric_Value: 45}   (Short: \p{Nv=45}) (1)
2023        T \p{Numeric_Value: 46}   (Short: \p{Nv=46}) (1)
2024        T \p{Numeric_Value: 47}   (Short: \p{Nv=47}) (1)
2025        T \p{Numeric_Value: 48}   (Short: \p{Nv=48}) (1)
2026        T \p{Numeric_Value: 49}   (Short: \p{Nv=49}) (1)
2027        T \p{Numeric_Value: 50}   (Short: \p{Nv=50}) (18)
2028        T \p{Numeric_Value: 60}   (Short: \p{Nv=60}) (4)
2029        T \p{Numeric_Value: 70}   (Short: \p{Nv=70}) (4)
2030        T \p{Numeric_Value: 80}   (Short: \p{Nv=80}) (4)
2031        T \p{Numeric_Value: 90}   (Short: \p{Nv=90}) (5)
2032        T \p{Numeric_Value: 100}  (Short: \p{Nv=100}) (19)
2033        T \p{Numeric_Value: 200}  (Short: \p{Nv=200}) (2)
2034        T \p{Numeric_Value: 300}  (Short: \p{Nv=300}) (3)
2035        T \p{Numeric_Value: 400}  (Short: \p{Nv=400}) (2)
2036        T \p{Numeric_Value: 500}  (Short: \p{Nv=500}) (12)
2037        T \p{Numeric_Value: 600}  (Short: \p{Nv=600}) (2)
2038        T \p{Numeric_Value: 700}  (Short: \p{Nv=700}) (2)
2039        T \p{Numeric_Value: 800}  (Short: \p{Nv=800}) (2)
2040        T \p{Numeric_Value: 900}  (Short: \p{Nv=900}) (3)
2041        T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (16)
2042        T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (1)
2043        T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (1)
2044        T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (1)
2045        T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (5)
2046        T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (1)
2047        T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (1)
2048        T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (1)
2049        T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (1)
2050        T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (7)
2051        T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (1)
2052        T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (1)
2053        T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (1)
2054        T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (4)
2055        T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (1)
2056        T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (1)
2057        T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (1)
2058        T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (1)
2059        T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (1)
2060        T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
2061                                    (2)
2062        T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
2063                                    1000000000000}) (1)
2064          \p{Numeric_Value: NaN}  (Short: \p{Nv=NaN}) (1_112_971)
2065          \p{Nv: *}               \p{Numeric_Value: *}
2066        D \p{OAlpha}              \p{Other_Alphabetic} (=
2067                                    \p{Other_Alphabetic=Y}) (759)
2068        D \p{OAlpha: *}           \p{Other_Alphabetic: *}
2069        D \p{ODI}                 \p{Other_Default_Ignorable_Code_Point} (=
2070                                    \p{Other_Default_Ignorable_Code_Point=
2071                                    Y}) (3778)
2072        D \p{ODI: *}              \p{Other_Default_Ignorable_Code_Point: *}
2073          \p{Ogam}                \p{Ogham} (= \p{Script=Ogham}) (NOT
2074                                    \p{Block=Ogham}) (29)
2075          \p{Ogham}               \p{Script=Ogham} (Short: \p{Ogam}; NOT
2076                                    \p{Block=Ogham}) (29)
2077        D \p{OGr_Ext}             \p{Other_Grapheme_Extend} (=
2078                                    \p{Other_Grapheme_Extend=Y}) (23)
2079        D \p{OGr_Ext: *}          \p{Other_Grapheme_Extend: *}
2080        D \p{OIDC}                \p{Other_ID_Continue} (=
2081                                    \p{Other_ID_Continue=Y}) (11)
2082        D \p{OIDC: *}             \p{Other_ID_Continue: *}
2083        D \p{OIDS}                \p{Other_ID_Start} (= \p{Other_ID_Start=
2084                                    Y}) (4)
2085        D \p{OIDS: *}             \p{Other_ID_Start: *}
2086          \p{Ol_Chiki}            \p{Script=Ol_Chiki} (Short: \p{Olck}) (48)
2087          \p{Olck}                \p{Ol_Chiki} (= \p{Script=Ol_Chiki}) (48)
2088          \p{Old_Italic}          \p{Script=Old_Italic} (Short: \p{Ital};
2089                                    NOT \p{Block=Old_Italic}) (35)
2090          \p{Old_Persian}         \p{Script=Old_Persian} (Short: \p{Xpeo};
2091                                    NOT \p{Block=Old_Persian}) (50)
2092          \p{Old_South_Arabian}   \p{Script=Old_South_Arabian} (Short:
2093                                    \p{Sarb}) (32)
2094          \p{Old_Turkic}          \p{Script=Old_Turkic} (Short: \p{Orkh};
2095                                    NOT \p{Block=Old_Turkic}) (73)
2096        D \p{OLower}              \p{Other_Lowercase} (= \p{Other_Lowercase=
2097                                    Y}) (159)
2098        D \p{OLower: *}           \p{Other_Lowercase: *}
2099        D \p{OMath}               \p{Other_Math} (= \p{Other_Math=Y}) (1216)
2100        D \p{OMath: *}            \p{Other_Math: *}
2101          \p{Open_Punctuation}    \p{General_Category=Open_Punctuation}
2102                                    (Short: \p{Ps}) (72)
2103        X \p{Optical_Character_Recognition} \p{Block=
2104                                    Optical_Character_Recognition} (32)
2105          \p{Oriya}               \p{Script=Oriya} (Short: \p{Orya}; NOT
2106                                    \p{Block=Oriya}) (84)
2107          \p{Orkh}                \p{Old_Turkic} (= \p{Script=Old_Turkic})
2108                                    (NOT \p{Block=Old_Turkic}) (73)
2109          \p{Orya}                \p{Oriya} (= \p{Script=Oriya}) (NOT
2110                                    \p{Block=Oriya}) (84)
2111          \p{Osma}                \p{Osmanya} (= \p{Script=Osmanya}) (NOT
2112                                    \p{Block=Osmanya}) (40)
2113          \p{Osmanya}             \p{Script=Osmanya} (Short: \p{Osma}; NOT
2114                                    \p{Block=Osmanya}) (40)
2115          \p{Other}               \p{General_Category=Other} (Short: \p{C})
2116                                    (1_006_956)
2117        D \p{Other_Alphabetic}    \p{Other_Alphabetic=Y} (Short: \p{OAlpha})
2118                                    (759)
2119        D \p{Other_Alphabetic: N*} Used by Unicode internally for generating
2120                                    the Alphabetic property (which should be
2121                                    used instead) and not intended to be
2122                                    used stand-alone (Short: \p{OAlpha=N},
2123                                    \P{OAlpha}) (1_113_353)
2124        D \p{Other_Alphabetic: Y*} Used by Unicode internally for generating
2125                                    the Alphabetic property (which should be
2126                                    used instead) and not intended to be
2127                                    used stand-alone (Short: \p{OAlpha=Y},
2128                                    \p{OAlpha}) (759)
2129        D \p{Other_Default_Ignorable_Code_Point}
2130                                    \p{Other_Default_Ignorable_Code_Point=Y}
2131                                    (Short: \p{ODI}) (3778)
2132        D \p{Other_Default_Ignorable_Code_Point: N*} Used by Unicode
2133                                    internally for generating the
2134                                    Default_Ignorable_Code_Point property
2135                                    (which should be used instead) and not
2136                                    intended to be used stand-alone (Short:
2137                                    \p{ODI=N}, \P{ODI}) (1_110_334)
2138        D \p{Other_Default_Ignorable_Code_Point: Y*} Used by Unicode
2139                                    internally for generating the
2140                                    Default_Ignorable_Code_Point property
2141                                    (which should be used instead) and not
2142                                    intended to be used stand-alone (Short:
2143                                    \p{ODI=Y}, \p{ODI}) (3778)
2144        D \p{Other_Grapheme_Extend} \p{Other_Grapheme_Extend=Y} (Short:
2145                                    \p{OGrExt}) (23)
2146        D \p{Other_Grapheme_Extend: N*} Used by Unicode internally for
2147                                    generating the Grapheme_Extend property
2148                                    (which should be used instead) and not
2149                                    intended to be used stand-alone (Short:
2150                                    \p{OGrExt=N}, \P{OGrExt}) (1_114_089)
2151        D \p{Other_Grapheme_Extend: Y*} Used by Unicode internally for
2152                                    generating the Grapheme_Extend property
2153                                    (which should be used instead) and not
2154                                    intended to be used stand-alone (Short:
2155                                    \p{OGrExt=Y}, \p{OGrExt}) (23)
2156        D \p{Other_ID_Continue}   \p{Other_ID_Continue=Y} (Short: \p{OIDC})
2157                                    (11)
2158        D \p{Other_ID_Continue: N*} Used by Unicode internally for
2159                                    generating the ID_Continue property
2160                                    (which should be used instead) and not
2161                                    intended to be used stand-alone (Short:
2162                                    \p{OIDC=N}, \P{OIDC}) (1_114_101)
2163        D \p{Other_ID_Continue: Y*} Used by Unicode internally for
2164                                    generating the ID_Continue property
2165                                    (which should be used instead) and not
2166                                    intended to be used stand-alone (Short:
2167                                    \p{OIDC=Y}, \p{OIDC}) (11)
2168        D \p{Other_ID_Start}      \p{Other_ID_Start=Y} (Short: \p{OIDS}) (4)
2169        D \p{Other_ID_Start: N*}  Used by Unicode internally for generating
2170                                    the ID_Start property (which should be
2171                                    used instead) and not intended to be
2172                                    used stand-alone (Short: \p{OIDS=N},
2173                                    \P{OIDS}) (1_114_108)
2174        D \p{Other_ID_Start: Y*}  Used by Unicode internally for generating
2175                                    the ID_Start property (which should be
2176                                    used instead) and not intended to be
2177                                    used stand-alone (Short: \p{OIDS=Y},
2178                                    \p{OIDS}) (4)
2179          \p{Other_Letter}        \p{General_Category=Other_Letter} (Short:
2180                                    \p{Lo}) (96_128)
2181        D \p{Other_Lowercase}     \p{Other_Lowercase=Y} (Short: \p{OLower})
2182                                    (159)
2183        D \p{Other_Lowercase: N*} Used by Unicode internally for generating
2184                                    the Lowercase property (which should be
2185                                    used instead) and not intended to be
2186                                    used stand-alone (Short: \p{OLower=N},
2187                                    \P{OLower}) (1_113_953)
2188        D \p{Other_Lowercase: Y*} Used by Unicode internally for generating
2189                                    the Lowercase property (which should be
2190                                    used instead) and not intended to be
2191                                    used stand-alone (Short: \p{OLower=Y},
2192                                    \p{OLower}) (159)
2193        D \p{Other_Math}          \p{Other_Math=Y} (Short: \p{OMath}) (1216)
2194        D \p{Other_Math: N*}      Used by Unicode internally for generating
2195                                    the Math property (which should be used
2196                                    instead) and not intended to be used
2197                                    stand-alone (Short: \p{OMath=N},
2198                                    \P{OMath}) (1_112_896)
2199        D \p{Other_Math: Y*}      Used by Unicode internally for generating
2200                                    the Math property (which should be used
2201                                    instead) and not intended to be used
2202                                    stand-alone (Short: \p{OMath=Y},
2203                                    \p{OMath}) (1216)
2204          \p{Other_Number}        \p{General_Category=Other_Number} (Short:
2205                                    \p{No}) (429)
2206          \p{Other_Punctuation}   \p{General_Category=Other_Punctuation}
2207                                    (Short: \p{Po}) (389)
2208          \p{Other_Symbol}        \p{General_Category=Other_Symbol} (Short:
2209                                    \p{So}) (3409)
2210        D \p{Other_Uppercase}     \p{Other_Uppercase=Y} (Short: \p{OUpper})
2211                                    (42)
2212        D \p{Other_Uppercase: N*} Used by Unicode internally for generating
2213                                    the Uppercase property (which should be
2214                                    used instead) and not intended to be
2215                                    used stand-alone (Short: \p{OUpper=N},
2216                                    \P{OUpper}) (1_114_070)
2217        D \p{Other_Uppercase: Y*} Used by Unicode internally for generating
2218                                    the Uppercase property (which should be
2219                                    used instead) and not intended to be
2220                                    used stand-alone (Short: \p{OUpper=Y},
2221                                    \p{OUpper}) (42)
2222        D \p{OUpper}              \p{Other_Uppercase} (= \p{Other_Uppercase=
2223                                    Y}) (42)
2224        D \p{OUpper: *}           \p{Other_Uppercase: *}
2225          \p{P}                   \p{Punct} (= \p{General_Category=
2226                                    Punctuation}) (585)
2227          \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
2228                                    (Short: \p{Zp}) (1)
2229          \p{Pat_Syn}             \p{Pattern_Syntax} (= \p{Pattern_Syntax=
2230                                    Y}) (2760)
2231          \p{Pat_Syn: *}          \p{Pattern_Syntax: *}
2232          \p{Pat_WS}              \p{Pattern_White_Space} (=
2233                                    \p{Pattern_White_Space=Y}) (11)
2234          \p{Pat_WS: *}           \p{Pattern_White_Space: *}
2235          \p{Pattern_Syntax}      \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
2236                                    (2760)
2237          \p{Pattern_Syntax: N*}  (Short: \p{PatSyn=N}, \P{PatSyn})
2238                                    (1_111_352)
2239          \p{Pattern_Syntax: Y*}  (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
2240          \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
2241                                    \p{PatWS}) (11)
2242          \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
2243                                    (1_114_101)
2244          \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
2245          \p{Pc}                  \p{Connector_Punctuation} (=
2246                                    \p{General_Category=
2247                                    Connector_Punctuation}) (10)
2248          \p{Pd}                  \p{Dash_Punctuation} (=
2249                                    \p{General_Category=Dash_Punctuation})
2250                                    (21)
2251          \p{Pe}                  \p{Close_Punctuation} (=
2252                                    \p{General_Category=Close_Punctuation})
2253                                    (71)
2254          \p{PerlSpace}           \s, restricted to ASCII (5)
2255          \p{PerlWord}            \w, restricted to ASCII = [A-Za-z0-9_] (63)
2256          \p{Pf}                  \p{Final_Punctuation} (=
2257                                    \p{General_Category=Final_Punctuation})
2258                                    (10)
2259          \p{Phag}                \p{Phags_Pa} (= \p{Script=Phags_Pa}) (NOT
2260                                    \p{Block=Phags_Pa}) (56)
2261          \p{Phags_Pa}            \p{Script=Phags_Pa} (Short: \p{Phag}; NOT
2262                                    \p{Block=Phags_Pa}) (56)
2263        X \p{Phaistos_Disc}       \p{Block=Phaistos_Disc} (48)
2264          \p{Phli}                \p{Inscriptional_Pahlavi} (= \p{Script=
2265                                    Inscriptional_Pahlavi}) (NOT \p{Block=
2266                                    Inscriptional_Pahlavi}) (27)
2267          \p{Phnx}                \p{Phoenician} (= \p{Script=Phoenician})
2268                                    (NOT \p{Block=Phoenician}) (29)
2269          \p{Phoenician}          \p{Script=Phoenician} (Short: \p{Phnx};
2270                                    NOT \p{Block=Phoenician}) (29)
2271        X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (128)
2272        X \p{Phonetic_Extensions_Supplement} \p{Block=
2273                                    Phonetic_Extensions_Supplement} (64)
2274          \p{Pi}                  \p{Initial_Punctuation} (=
2275                                    \p{General_Category=
2276                                    Initial_Punctuation}) (12)
2277          \p{Po}                  \p{Other_Punctuation} (=
2278                                    \p{General_Category=Other_Punctuation})
2279                                    (389)
2280          \p{PosixAlnum}          [A-Za-z0-9] (62)
2281          \p{PosixAlpha}          [A-Za-z] (52)
2282          \p{PosixBlank}          \t and ' ' (2)
2283          \p{PosixCntrl}          [\x00-\x1F] (33)
2284          \p{PosixDigit}          [0-9] (10)
2285          \p{PosixGraph}          [\x21-\x7E] (94)
2286          \p{PosixLower}          [a-z] (26)
2287          \p{PosixPrint}          [\x20-\x7E] (95)
2288          \p{PosixPunct}          Graphical characters that aren't Word
2289                                    characters = [\x21-\x2F\x3A-\x40\x5B-
2290                                    \x60\x7B-\x7E] (32)
2291          \p{PosixSpace}          \t \n, \x0B, \f, \r, and ' ' (6)
2292          \p{PosixUpper}          [A-Z] (26)
2293        T \p{Present_In: 1.1}     \p{Age=1.1} (Short: \p{In=1.1}) (Perl
2294                                    extension) (33_979)
2295        T \p{Present_In: 2.0}     Code point's usage introduced in version
2296                                    2.0 or earlier (Short: \p{In=2.0}) (Perl
2297                                    extension) (178_500)
2298        T \p{Present_In: 2.1}     Code point's usage introduced in version
2299                                    2.1 or earlier (Short: \p{In=2.1}) (Perl
2300                                    extension) (178_502)
2301        T \p{Present_In: 3.0}     Code point's usage introduced in version
2302                                    3.0 or earlier (Short: \p{In=3.0}) (Perl
2303                                    extension) (188_809)
2304        T \p{Present_In: 3.1}     Code point's usage introduced in version
2305                                    3.1 or earlier (Short: \p{In=3.1}) (Perl
2306                                    extension) (233_787)
2307        T \p{Present_In: 3.2}     Code point's usage introduced in version
2308                                    3.2 or earlier (Short: \p{In=3.2}) (Perl
2309                                    extension) (234_803)
2310        T \p{Present_In: 4.0}     Code point's usage introduced in version
2311                                    4.0 or earlier (Short: \p{In=4.0}) (Perl
2312                                    extension) (236_029)
2313        T \p{Present_In: 4.1}     Code point's usage introduced in version
2314                                    4.1 or earlier (Short: \p{In=4.1}) (Perl
2315                                    extension) (237_302)
2316        T \p{Present_In: 5.0}     Code point's usage introduced in version
2317                                    5.0 or earlier (Short: \p{In=5.0}) (Perl
2318                                    extension) (238_671)
2319        T \p{Present_In: 5.1}     Code point's usage introduced in version
2320                                    5.1 or earlier (Short: \p{In=5.1}) (Perl
2321                                    extension) (240_295)
2322        T \p{Present_In: 5.2}     Code point's usage introduced in version
2323                                    5.2 or earlier (Short: \p{In=5.2}) (Perl
2324                                    extension) (246_943)
2325          \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
2326                                    Unassigned}) (Perl extension) (867_169)
2327          \p{Print}               Characters that are graphical plus space
2328                                    characters (but no controls) (244_762)
2329          \p{Private_Use}         \p{General_Category=Private_Use} (Short:
2330                                    \p{Co}; NOT \p{Private_Use_Area})
2331                                    (137_468)
2332        X \p{Private_Use_Area}    \p{Block=Private_Use_Area} (Short:
2333                                    \p{InPrivateUse}) (6400)
2334          \p{Prti}                \p{Inscriptional_Parthian} (= \p{Script=
2335                                    Inscriptional_Parthian}) (NOT \p{Block=
2336                                    Inscriptional_Parthian}) (30)
2337          \p{Ps}                  \p{Open_Punctuation} (=
2338                                    \p{General_Category=Open_Punctuation})
2339                                    (72)
2340          \p{Punct}               \p{General_Category=Punctuation} (Short:
2341                                    \p{P}) (585)
2342          \p{Punctuation}         \p{Punct} (= \p{General_Category=
2343                                    Punctuation}) (585)
2344          \p{Qaac}                \p{Coptic} (= \p{Script=Coptic}) (NOT
2345                                    \p{Block=Coptic}) (135)
2346          \p{Qaai}                \p{Inherited} (= \p{Script=Inherited})
2347                                    (523)
2348          \p{QMark}               \p{Quotation_Mark} (= \p{Quotation_Mark=
2349                                    Y}) (29)
2350          \p{QMark: *}            \p{Quotation_Mark: *}
2351          \p{Quotation_Mark}      \p{Quotation_Mark=Y} (Short: \p{QMark})
2352                                    (29)
2353          \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_083)
2354          \p{Quotation_Mark: Y*}  (Short: \p{QMark=Y}, \p{QMark}) (29)
2355          \p{Radical}             \p{Radical=Y} (329)
2356          \p{Radical: N*}         (Single: \P{Radical}) (1_113_783)
2357          \p{Radical: Y*}         (Single: \p{Radical}) (329)
2358          \p{Rejang}              \p{Script=Rejang} (Short: \p{Rjng}; NOT
2359                                    \p{Block=Rejang}) (37)
2360          \p{Rjng}                \p{Rejang} (= \p{Script=Rejang}) (NOT
2361                                    \p{Block=Rejang}) (37)
2362        X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (32)
2363          \p{Runic}               \p{Script=Runic} (Short: \p{Runr}; NOT
2364                                    \p{Block=Runic}) (78)
2365          \p{Runr}                \p{Runic} (= \p{Script=Runic}) (NOT
2366                                    \p{Block=Runic}) (78)
2367          \p{S}                   \p{Symbol} (= \p{General_Category=Symbol})
2368                                    (4499)
2369          \p{Samaritan}           \p{Script=Samaritan} (Short: \p{Samr}; NOT
2370                                    \p{Block=Samaritan}) (61)
2371          \p{Samr}                \p{Samaritan} (= \p{Script=Samaritan})
2372                                    (NOT \p{Block=Samaritan}) (61)
2373          \p{Sarb}                \p{Old_South_Arabian} (= \p{Script=
2374                                    Old_South_Arabian}) (32)
2375          \p{Saur}                \p{Saurashtra} (= \p{Script=Saurashtra})
2376                                    (NOT \p{Block=Saurashtra}) (81)
2377          \p{Saurashtra}          \p{Script=Saurashtra} (Short: \p{Saur};
2378                                    NOT \p{Block=Saurashtra}) (81)
2379          \p{SB: *}               \p{Sentence_Break: *}
2380          \p{Sc}                  \p{Currency_Symbol} (=
2381                                    \p{General_Category=Currency_Symbol})
2382                                    (46)
2383          \p{Sc: *}               \p{Script: *}
2384          \p{Script: Arab}        \p{Script=Arabic} (1030)
2385          \p{Script: Arabic}      (Short: \p{Sc=Arab}, \p{Arab}) (1030)
2386          \p{Script: Armenian}    (Short: \p{Sc=Armn}, \p{Armn}) (90)
2387          \p{Script: Armi}        \p{Script=Imperial_Aramaic} (31)
2388          \p{Script: Armn}        \p{Script=Armenian} (90)
2389          \p{Script: Avestan}     (Short: \p{Sc=Avst}, \p{Avst}) (61)
2390          \p{Script: Avst}        \p{Script=Avestan} (61)
2391          \p{Script: Bali}        \p{Script=Balinese} (121)
2392          \p{Script: Balinese}    (Short: \p{Sc=Bali}, \p{Bali}) (121)
2393          \p{Script: Bamu}        \p{Script=Bamum} (88)
2394          \p{Script: Bamum}       (Short: \p{Sc=Bamu}, \p{Bamu}) (88)
2395          \p{Script: Beng}        \p{Script=Bengali} (92)
2396          \p{Script: Bengali}     (Short: \p{Sc=Beng}, \p{Beng}) (92)
2397          \p{Script: Bopo}        \p{Script=Bopomofo} (65)
2398          \p{Script: Bopomofo}    (Short: \p{Sc=Bopo}, \p{Bopo}) (65)
2399          \p{Script: Brai}        \p{Script=Braille} (256)
2400          \p{Script: Braille}     (Short: \p{Sc=Brai}, \p{Brai}) (256)
2401          \p{Script: Bugi}        \p{Script=Buginese} (30)
2402          \p{Script: Buginese}    (Short: \p{Sc=Bugi}, \p{Bugi}) (30)
2403          \p{Script: Buhd}        \p{Script=Buhid} (20)
2404          \p{Script: Buhid}       (Short: \p{Sc=Buhd}, \p{Buhd}) (20)
2405          \p{Script: Canadian_Aboriginal} (Short: \p{Sc=Cans}, \p{Cans})
2406                                    (710)
2407          \p{Script: Cans}        \p{Script=Canadian_Aboriginal} (710)
2408          \p{Script: Cari}        \p{Script=Carian} (49)
2409          \p{Script: Carian}      (Short: \p{Sc=Cari}, \p{Cari}) (49)
2410          \p{Script: Cham}        (Short: \p{Sc=Cham}, \p{Cham}) (83)
2411          \p{Script: Cher}        \p{Script=Cherokee} (85)
2412          \p{Script: Cherokee}    (Short: \p{Sc=Cher}, \p{Cher}) (85)
2413          \p{Script: Common}      (Short: \p{Sc=Zyyy}, \p{Zyyy}) (5395)
2414          \p{Script: Copt}        \p{Script=Coptic} (135)
2415          \p{Script: Coptic}      (Short: \p{Sc=Copt}, \p{Copt}) (135)
2416          \p{Script: Cprt}        \p{Script=Cypriot} (55)
2417          \p{Script: Cuneiform}   (Short: \p{Sc=Xsux}, \p{Xsux}) (982)
2418          \p{Script: Cypriot}     (Short: \p{Sc=Cprt}, \p{Cprt}) (55)
2419          \p{Script: Cyrillic}    (Short: \p{Sc=Cyrl}, \p{Cyrl}) (404)
2420          \p{Script: Cyrl}        \p{Script=Cyrillic} (404)
2421          \p{Script: Deseret}     (Short: \p{Sc=Dsrt}, \p{Dsrt}) (80)
2422          \p{Script: Deva}        \p{Script=Devanagari} (140)
2423          \p{Script: Devanagari}  (Short: \p{Sc=Deva}, \p{Deva}) (140)
2424          \p{Script: Dsrt}        \p{Script=Deseret} (80)
2425          \p{Script: Egyp}        \p{Script=Egyptian_Hieroglyphs} (1071)
2426          \p{Script: Egyptian_Hieroglyphs} (Short: \p{Sc=Egyp}, \p{Egyp})
2427                                    (1071)
2428          \p{Script: Ethi}        \p{Script=Ethiopic} (461)
2429          \p{Script: Ethiopic}    (Short: \p{Sc=Ethi}, \p{Ethi}) (461)
2430          \p{Script: Geor}        \p{Script=Georgian} (120)
2431          \p{Script: Georgian}    (Short: \p{Sc=Geor}, \p{Geor}) (120)
2432          \p{Script: Glag}        \p{Script=Glagolitic} (94)
2433          \p{Script: Glagolitic}  (Short: \p{Sc=Glag}, \p{Glag}) (94)
2434          \p{Script: Goth}        \p{Script=Gothic} (27)
2435          \p{Script: Gothic}      (Short: \p{Sc=Goth}, \p{Goth}) (27)
2436          \p{Script: Greek}       (Short: \p{Sc=Grek}, \p{Grek}) (511)
2437          \p{Script: Grek}        \p{Script=Greek} (511)
2438          \p{Script: Gujarati}    (Short: \p{Sc=Gujr}, \p{Gujr}) (83)
2439          \p{Script: Gujr}        \p{Script=Gujarati} (83)
2440          \p{Script: Gurmukhi}    (Short: \p{Sc=Guru}, \p{Guru}) (79)
2441          \p{Script: Guru}        \p{Script=Gurmukhi} (79)
2442          \p{Script: Han}         (Short: \p{Sc=Han}, \p{Han}) (75_738)
2443          \p{Script: Hang}        \p{Script=Hangul} (11_737)
2444          \p{Script: Hangul}      (Short: \p{Sc=Hang}, \p{Hang}) (11_737)
2445          \p{Script: Hani}        \p{Script=Han} (75_738)
2446          \p{Script: Hano}        \p{Script=Hanunoo} (21)
2447          \p{Script: Hanunoo}     (Short: \p{Sc=Hano}, \p{Hano}) (21)
2448          \p{Script: Hebr}        \p{Script=Hebrew} (133)
2449          \p{Script: Hebrew}      (Short: \p{Sc=Hebr}, \p{Hebr}) (133)
2450          \p{Script: Hira}        \p{Script=Hiragana} (90)
2451          \p{Script: Hiragana}    (Short: \p{Sc=Hira}, \p{Hira}) (90)
2452          \p{Script: Imperial_Aramaic} (Short: \p{Sc=Armi}, \p{Armi}) (31)
2453          \p{Script: Inherited}   (Short: \p{Sc=Zinh}, \p{Zinh}) (523)
2454          \p{Script: Inscriptional_Pahlavi} (Short: \p{Sc=Phli}, \p{Phli})
2455                                    (27)
2456          \p{Script: Inscriptional_Parthian} (Short: \p{Sc=Prti}, \p{Prti})
2457                                    (30)
2458          \p{Script: Ital}        \p{Script=Old_Italic} (35)
2459          \p{Script: Java}        \p{Script=Javanese} (91)
2460          \p{Script: Javanese}    (Short: \p{Sc=Java}, \p{Java}) (91)
2461          \p{Script: Kaithi}      (Short: \p{Sc=Kthi}, \p{Kthi}) (66)
2462          \p{Script: Kali}        \p{Script=Kayah_Li} (48)
2463          \p{Script: Kana}        \p{Script=Katakana} (299)
2464          \p{Script: Kannada}     (Short: \p{Sc=Knda}, \p{Knda}) (84)
2465          \p{Script: Katakana}    (Short: \p{Sc=Kana}, \p{Kana}) (299)
2466          \p{Script: Kayah_Li}    (Short: \p{Sc=Kali}, \p{Kali}) (48)
2467          \p{Script: Khar}        \p{Script=Kharoshthi} (65)
2468          \p{Script: Kharoshthi}  (Short: \p{Sc=Khar}, \p{Khar}) (65)
2469          \p{Script: Khmer}       (Short: \p{Sc=Khmr}, \p{Khmr}) (146)
2470          \p{Script: Khmr}        \p{Script=Khmer} (146)
2471          \p{Script: Knda}        \p{Script=Kannada} (84)
2472          \p{Script: Kthi}        \p{Script=Kaithi} (66)
2473          \p{Script: Lana}        \p{Script=Tai_Tham} (127)
2474          \p{Script: Lao}         (Short: \p{Sc=Lao}, \p{Lao}) (65)
2475          \p{Script: Laoo}        \p{Script=Lao} (65)
2476          \p{Script: Latin}       (Short: \p{Sc=Latn}, \p{Latn}) (1244)
2477          \p{Script: Latn}        \p{Script=Latin} (1244)
2478          \p{Script: Lepc}        \p{Script=Lepcha} (74)
2479          \p{Script: Lepcha}      (Short: \p{Sc=Lepc}, \p{Lepc}) (74)
2480          \p{Script: Limb}        \p{Script=Limbu} (66)
2481          \p{Script: Limbu}       (Short: \p{Sc=Limb}, \p{Limb}) (66)
2482          \p{Script: Linb}        \p{Script=Linear_B} (211)
2483          \p{Script: Linear_B}    (Short: \p{Sc=Linb}, \p{Linb}) (211)
2484          \p{Script: Lisu}        (Short: \p{Sc=Lisu}, \p{Lisu}) (48)
2485          \p{Script: Lyci}        \p{Script=Lycian} (29)
2486          \p{Script: Lycian}      (Short: \p{Sc=Lyci}, \p{Lyci}) (29)
2487          \p{Script: Lydi}        \p{Script=Lydian} (27)
2488          \p{Script: Lydian}      (Short: \p{Sc=Lydi}, \p{Lydi}) (27)
2489          \p{Script: Malayalam}   (Short: \p{Sc=Mlym}, \p{Mlym}) (95)
2490          \p{Script: Meetei_Mayek} (Short: \p{Sc=Mtei}, \p{Mtei}) (56)
2491          \p{Script: Mlym}        \p{Script=Malayalam} (95)
2492          \p{Script: Mong}        \p{Script=Mongolian} (153)
2493          \p{Script: Mongolian}   (Short: \p{Sc=Mong}, \p{Mong}) (153)
2494          \p{Script: Mtei}        \p{Script=Meetei_Mayek} (56)
2495          \p{Script: Myanmar}     (Short: \p{Sc=Mymr}, \p{Mymr}) (188)
2496          \p{Script: Mymr}        \p{Script=Myanmar} (188)
2497          \p{Script: New_Tai_Lue} (Short: \p{Sc=Talu}, \p{Talu}) (83)
2498          \p{Script: Nko}         (Short: \p{Sc=Nko}, \p{Nko}) (59)
2499          \p{Script: Nkoo}        \p{Script=Nko} (59)
2500          \p{Script: Ogam}        \p{Script=Ogham} (29)
2501          \p{Script: Ogham}       (Short: \p{Sc=Ogam}, \p{Ogam}) (29)
2502          \p{Script: Ol_Chiki}    (Short: \p{Sc=Olck}, \p{Olck}) (48)
2503          \p{Script: Olck}        \p{Script=Ol_Chiki} (48)
2504          \p{Script: Old_Italic}  (Short: \p{Sc=Ital}, \p{Ital}) (35)
2505          \p{Script: Old_Persian} (Short: \p{Sc=Xpeo}, \p{Xpeo}) (50)
2506          \p{Script: Old_South_Arabian} (Short: \p{Sc=Sarb}, \p{Sarb}) (32)
2507          \p{Script: Old_Turkic}  (Short: \p{Sc=Orkh}, \p{Orkh}) (73)
2508          \p{Script: Oriya}       (Short: \p{Sc=Orya}, \p{Orya}) (84)
2509          \p{Script: Orkh}        \p{Script=Old_Turkic} (73)
2510          \p{Script: Orya}        \p{Script=Oriya} (84)
2511          \p{Script: Osma}        \p{Script=Osmanya} (40)
2512          \p{Script: Osmanya}     (Short: \p{Sc=Osma}, \p{Osma}) (40)
2513          \p{Script: Phag}        \p{Script=Phags_Pa} (56)
2514          \p{Script: Phags_Pa}    (Short: \p{Sc=Phag}, \p{Phag}) (56)
2515          \p{Script: Phli}        \p{Script=Inscriptional_Pahlavi} (27)
2516          \p{Script: Phnx}        \p{Script=Phoenician} (29)
2517          \p{Script: Phoenician}  (Short: \p{Sc=Phnx}, \p{Phnx}) (29)
2518          \p{Script: Prti}        \p{Script=Inscriptional_Parthian} (30)
2519          \p{Script: Qaac}        \p{Script=Coptic} (135)
2520          \p{Script: Qaai}        \p{Script=Inherited} (523)
2521          \p{Script: Rejang}      (Short: \p{Sc=Rjng}, \p{Rjng}) (37)
2522          \p{Script: Rjng}        \p{Script=Rejang} (37)
2523          \p{Script: Runic}       (Short: \p{Sc=Runr}, \p{Runr}) (78)
2524          \p{Script: Runr}        \p{Script=Runic} (78)
2525          \p{Script: Samaritan}   (Short: \p{Sc=Samr}, \p{Samr}) (61)
2526          \p{Script: Samr}        \p{Script=Samaritan} (61)
2527          \p{Script: Sarb}        \p{Script=Old_South_Arabian} (32)
2528          \p{Script: Saur}        \p{Script=Saurashtra} (81)
2529          \p{Script: Saurashtra}  (Short: \p{Sc=Saur}, \p{Saur}) (81)
2530          \p{Script: Shavian}     (Short: \p{Sc=Shaw}, \p{Shaw}) (48)
2531          \p{Script: Shaw}        \p{Script=Shavian} (48)
2532          \p{Script: Sinh}        \p{Script=Sinhala} (80)
2533          \p{Script: Sinhala}     (Short: \p{Sc=Sinh}, \p{Sinh}) (80)
2534          \p{Script: Sund}        \p{Script=Sundanese} (55)
2535          \p{Script: Sundanese}   (Short: \p{Sc=Sund}, \p{Sund}) (55)
2536          \p{Script: Sylo}        \p{Script=Syloti_Nagri} (44)
2537          \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}, \p{Sylo}) (44)
2538          \p{Script: Syrc}        \p{Script=Syriac} (77)
2539          \p{Script: Syriac}      (Short: \p{Sc=Syrc}, \p{Syrc}) (77)
2540          \p{Script: Tagalog}     (Short: \p{Sc=Tglg}, \p{Tglg}) (20)
2541          \p{Script: Tagb}        \p{Script=Tagbanwa} (18)
2542          \p{Script: Tagbanwa}    (Short: \p{Sc=Tagb}, \p{Tagb}) (18)
2543          \p{Script: Tai_Le}      (Short: \p{Sc=Tale}, \p{Tale}) (35)
2544          \p{Script: Tai_Tham}    (Short: \p{Sc=Lana}, \p{Lana}) (127)
2545          \p{Script: Tai_Viet}    (Short: \p{Sc=Tavt}, \p{Tavt}) (72)
2546          \p{Script: Tale}        \p{Script=Tai_Le} (35)
2547          \p{Script: Talu}        \p{Script=New_Tai_Lue} (83)
2548          \p{Script: Tamil}       (Short: \p{Sc=Taml}, \p{Taml}) (72)
2549          \p{Script: Taml}        \p{Script=Tamil} (72)
2550          \p{Script: Tavt}        \p{Script=Tai_Viet} (72)
2551          \p{Script: Telu}        \p{Script=Telugu} (93)
2552          \p{Script: Telugu}      (Short: \p{Sc=Telu}, \p{Telu}) (93)
2553          \p{Script: Tfng}        \p{Script=Tifinagh} (55)
2554          \p{Script: Tglg}        \p{Script=Tagalog} (20)
2555          \p{Script: Thaa}        \p{Script=Thaana} (50)
2556          \p{Script: Thaana}      (Short: \p{Sc=Thaa}, \p{Thaa}) (50)
2557          \p{Script: Thai}        (Short: \p{Sc=Thai}, \p{Thai}) (86)
2558          \p{Script: Tibetan}     (Short: \p{Sc=Tibt}, \p{Tibt}) (201)
2559          \p{Script: Tibt}        \p{Script=Tibetan} (201)
2560          \p{Script: Tifinagh}    (Short: \p{Sc=Tfng}, \p{Tfng}) (55)
2561          \p{Script: Ugar}        \p{Script=Ugaritic} (31)
2562          \p{Script: Ugaritic}    (Short: \p{Sc=Ugar}, \p{Ugar}) (31)
2563          \p{Script: Unknown}     (Short: \p{Sc=Zzzz}, \p{Zzzz}) (1_006_751)
2564          \p{Script: Vai}         (Short: \p{Sc=Vai}, \p{Vai}) (300)
2565          \p{Script: Vaii}        \p{Script=Vai} (300)
2566          \p{Script: Xpeo}        \p{Script=Old_Persian} (50)
2567          \p{Script: Xsux}        \p{Script=Cuneiform} (982)
2568          \p{Script: Yi}          (Short: \p{Sc=Yi}, \p{Yi}) (1220)
2569          \p{Script: Yiii}        \p{Script=Yi} (1220)
2570          \p{Script: Zinh}        \p{Script=Inherited} (523)
2571          \p{Script: Zyyy}        \p{Script=Common} (5395)
2572          \p{Script: Zzzz}        \p{Script=Unknown} (1_006_751)
2573          \p{SD}                  \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
2574          \p{SD: *}               \p{Soft_Dotted: *}
2575          \p{Sentence_Break: AT}  \p{Sentence_Break=ATerm} (4)
2576          \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
2577          \p{Sentence_Break: CL}  \p{Sentence_Break=Close} (177)
2578          \p{Sentence_Break: Close} (Short: \p{SB=CL}) (177)
2579          \p{Sentence_Break: CR}  (Short: \p{SB=CR}) (1)
2580          \p{Sentence_Break: EX}  \p{Sentence_Break=Extend} (1455)
2581          \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (1455)
2582          \p{Sentence_Break: FO}  \p{Sentence_Break=Format} (138)
2583          \p{Sentence_Break: Format} (Short: \p{SB=FO}) (138)
2584          \p{Sentence_Break: LE}  \p{Sentence_Break=OLetter} (96_405)
2585          \p{Sentence_Break: LF}  (Short: \p{SB=LF}) (1)
2586          \p{Sentence_Break: LO}  \p{Sentence_Break=Lower} (1907)
2587          \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (1907)
2588          \p{Sentence_Break: NU}  \p{Sentence_Break=Numeric} (403)
2589          \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (403)
2590          \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (96_405)
2591          \p{Sentence_Break: Other} (Short: \p{SB=XX}) (1_012_008)
2592          \p{Sentence_Break: SC}  \p{Sentence_Break=SContinue} (26)
2593          \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
2594          \p{Sentence_Break: SE}  \p{Sentence_Break=Sep} (3)
2595          \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
2596          \p{Sentence_Break: Sp}  (Short: \p{SB=Sp}) (21)
2597          \p{Sentence_Break: ST}  \p{Sentence_Break=STerm} (63)
2598          \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (63)
2599          \p{Sentence_Break: UP}  \p{Sentence_Break=Upper} (1500)
2600          \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1500)
2601          \p{Sentence_Break: XX}  \p{Sentence_Break=Other} (1_012_008)
2602          \p{Separator}           \p{General_Category=Separator} (Short:
2603                                    \p{Z}) (20)
2604          \p{Shavian}             \p{Script=Shavian} (Short: \p{Shaw}) (48)
2605          \p{Shaw}                \p{Shavian} (= \p{Script=Shavian}) (48)
2606          \p{Sinh}                \p{Sinhala} (= \p{Script=Sinhala}) (NOT
2607                                    \p{Block=Sinhala}) (80)
2608          \p{Sinhala}             \p{Script=Sinhala} (Short: \p{Sinh}; NOT
2609                                    \p{Block=Sinhala}) (80)
2610          \p{Sk}                  \p{Modifier_Symbol} (=
2611                                    \p{General_Category=Modifier_Symbol})
2612                                    (99)
2613          \p{Sm}                  \p{Math_Symbol} (= \p{General_Category=
2614                                    Math_Symbol}) (945)
2615        X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (32)
2616          \p{So}                  \p{Other_Symbol} (= \p{General_Category=
2617                                    Other_Symbol}) (3409)
2618          \p{Soft_Dotted}         \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
2619          \p{Soft_Dotted: N*}     (Short: \p{SD=N}, \P{SD}) (1_114_066)
2620          \p{Soft_Dotted: Y*}     (Short: \p{SD=Y}, \p{SD}) (46)
2621          \p{Space}               \p{White_Space=Y} \s including beyond
2622                                    ASCII plus vertical tab (26)
2623          \p{Space: *}            \p{White_Space: *}
2624          \p{Space_Separator}     \p{General_Category=Space_Separator}
2625                                    (Short: \p{Zs}) (18)
2626          \p{SpacePerl}           \s, including beyond ASCII (25)
2627          \p{Spacing_Mark}        \p{General_Category=Spacing_Mark} (Short:
2628                                    \p{Mc}) (276)
2629        X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
2630                                    (80)
2631        X \p{Specials}            \p{Block=Specials} (16)
2632          \p{STerm}               \p{STerm=Y} (66)
2633          \p{STerm: N*}           (Single: \P{STerm}) (1_114_046)
2634          \p{STerm: Y*}           (Single: \p{STerm}) (66)
2635          \p{Sund}                \p{Sundanese} (= \p{Script=Sundanese})
2636                                    (NOT \p{Block=Sundanese}) (55)
2637          \p{Sundanese}           \p{Script=Sundanese} (Short: \p{Sund}; NOT
2638                                    \p{Block=Sundanese}) (55)
2639        X \p{Superscripts_And_Subscripts} \p{Block=
2640                                    Superscripts_And_Subscripts} (48)
2641        X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
2642        X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
2643        X \p{Supplemental_Mathematical_Operators} \p{Block=
2644                                    Supplemental_Mathematical_Operators}
2645                                    (256)
2646        X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
2647                                    (128)
2648        X \p{Supplementary_Private_Use_Area_A} \p{Block=
2649                                    Supplementary_Private_Use_Area_A}
2650                                    (65_536)
2651        X \p{Supplementary_Private_Use_Area_B} \p{Block=
2652                                    Supplementary_Private_Use_Area_B}
2653                                    (65_536)
2654          \p{Surrogate}           \p{General_Category=Surrogate} (Short:
2655                                    \p{Cs}) (2048)
2656          \p{Sylo}                \p{Syloti_Nagri} (= \p{Script=
2657                                    Syloti_Nagri}) (NOT \p{Block=
2658                                    Syloti_Nagri}) (44)
2659          \p{Syloti_Nagri}        \p{Script=Syloti_Nagri} (Short: \p{Sylo};
2660                                    NOT \p{Block=Syloti_Nagri}) (44)
2661          \p{Symbol}              \p{General_Category=Symbol} (Short: \p{S})
2662                                    (4499)
2663          \p{Syrc}                \p{Syriac} (= \p{Script=Syriac}) (NOT
2664                                    \p{Block=Syriac}) (77)
2665          \p{Syriac}              \p{Script=Syriac} (Short: \p{Syrc}; NOT
2666                                    \p{Block=Syriac}) (77)
2667          \p{Tagalog}             \p{Script=Tagalog} (Short: \p{Tglg}; NOT
2668                                    \p{Block=Tagalog}) (20)
2669          \p{Tagb}                \p{Tagbanwa} (= \p{Script=Tagbanwa}) (NOT
2670                                    \p{Block=Tagbanwa}) (18)
2671          \p{Tagbanwa}            \p{Script=Tagbanwa} (Short: \p{Tagb}; NOT
2672                                    \p{Block=Tagbanwa}) (18)
2673        X \p{Tags}                \p{Block=Tags} (128)
2674          \p{Tai_Le}              \p{Script=Tai_Le} (Short: \p{Tale}; NOT
2675                                    \p{Block=Tai_Le}) (35)
2676          \p{Tai_Tham}            \p{Script=Tai_Tham} (Short: \p{Lana}; NOT
2677                                    \p{Block=Tai_Tham}) (127)
2678          \p{Tai_Viet}            \p{Script=Tai_Viet} (Short: \p{Tavt}; NOT
2679                                    \p{Block=Tai_Viet}) (72)
2680        X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (96)
2681          \p{Tale}                \p{Tai_Le} (= \p{Script=Tai_Le}) (NOT
2682                                    \p{Block=Tai_Le}) (35)
2683          \p{Talu}                \p{New_Tai_Lue} (= \p{Script=New_Tai_Lue})
2684                                    (NOT \p{Block=New_Tai_Lue}) (83)
2685          \p{Tamil}               \p{Script=Tamil} (Short: \p{Taml}; NOT
2686                                    \p{Block=Tamil}) (72)
2687          \p{Taml}                \p{Tamil} (= \p{Script=Tamil}) (NOT
2688                                    \p{Block=Tamil}) (72)
2689          \p{Tavt}                \p{Tai_Viet} (= \p{Script=Tai_Viet}) (NOT
2690                                    \p{Block=Tai_Viet}) (72)
2691          \p{Telu}                \p{Telugu} (= \p{Script=Telugu}) (NOT
2692                                    \p{Block=Telugu}) (93)
2693          \p{Telugu}              \p{Script=Telugu} (Short: \p{Telu}; NOT
2694                                    \p{Block=Telugu}) (93)
2695          \p{Term}                \p{Terminal_Punctuation} (=
2696                                    \p{Terminal_Punctuation=Y}) (161)
2697          \p{Term: *}             \p{Terminal_Punctuation: *}
2698          \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
2699                                    \p{Term}) (161)
2700          \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
2701                                    (1_113_951)
2702          \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (161)
2703          \p{Tfng}                \p{Tifinagh} (= \p{Script=Tifinagh}) (NOT
2704                                    \p{Block=Tifinagh}) (55)
2705          \p{Tglg}                \p{Tagalog} (= \p{Script=Tagalog}) (NOT
2706                                    \p{Block=Tagalog}) (20)
2707          \p{Thaa}                \p{Thaana} (= \p{Script=Thaana}) (NOT
2708                                    \p{Block=Thaana}) (50)
2709          \p{Thaana}              \p{Script=Thaana} (Short: \p{Thaa}; NOT
2710                                    \p{Block=Thaana}) (50)
2711          \p{Thai}                \p{Script=Thai} (NOT \p{Block=Thai}) (86)
2712          \p{Tibetan}             \p{Script=Tibetan} (Short: \p{Tibt}; NOT
2713                                    \p{Block=Tibetan}) (201)
2714          \p{Tibt}                \p{Tibetan} (= \p{Script=Tibetan}) (NOT
2715                                    \p{Block=Tibetan}) (201)
2716          \p{Tifinagh}            \p{Script=Tifinagh} (Short: \p{Tfng}; NOT
2717                                    \p{Block=Tifinagh}) (55)
2718          \p{Title}               \p{General_Category=Titlecase_Letter}
2719                                    (Short: \p{Lt}) (31)
2720          \p{Titlecase_Letter}    \p{Title} (= \p{General_Category=
2721                                    Titlecase_Letter}) (31)
2722          \p{Ugar}                \p{Ugaritic} (= \p{Script=Ugaritic}) (NOT
2723                                    \p{Block=Ugaritic}) (31)
2724          \p{Ugaritic}            \p{Script=Ugaritic} (Short: \p{Ugar}; NOT
2725                                    \p{Block=Ugaritic}) (31)
2726          \p{UIdeo}               \p{Unified_Ideograph} (=
2727                                    \p{Unified_Ideograph=Y}) (74_394)
2728          \p{UIdeo: *}            \p{Unified_Ideograph: *}
2729          \p{Unassigned}          \p{General_Category=Unassigned} (Short:
2730                                    \p{Cn}) (867_235)
2731        X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
2732                                    Unified_Canadian_Aboriginal_Syllabics}
2733                                    (Short: \p{InCanadianSyllabics}) (640)
2734        X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
2735                                    Unified_Canadian_Aboriginal_Syllabics_-
2736                                    Extended} (80)
2737          \p{Unified_Ideograph}   \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
2738                                    (74_394)
2739          \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
2740                                    (1_039_718)
2741          \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (74_394)
2742          \p{Unknown}             \p{Script=Unknown} (Short: \p{Zzzz})
2743                                    (1_006_751)
2744          \p{Upper}               \p{Uppercase=Y} (1469)
2745          \p{Upper: *}            \p{Uppercase: *}
2746          \p{Uppercase}           \p{Upper} (= \p{Uppercase=Y}) (1469)
2747          \p{Uppercase: N*}       (Short: \p{Upper=N}, \P{Upper}) (1_112_643)
2748          \p{Uppercase: Y*}       (Short: \p{Upper=Y}, \p{Upper}) (1469)
2749          \p{Uppercase_Letter}    \p{General_Category=Uppercase_Letter}
2750                                    (Short: \p{Lu}) (1427)
2751          \p{Vai}                 \p{Script=Vai} (NOT \p{Block=Vai}) (300)
2752          \p{Vaii}                \p{Vai} (= \p{Script=Vai}) (NOT \p{Block=
2753                                    Vai}) (300)
2754          \p{Variation_Selector}  \p{Variation_Selector=Y} (Short: \p{VS})
2755                                    (259)
2756          \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853)
2757          \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
2758        X \p{Variation_Selectors} \p{Block=Variation_Selectors} (16)
2759        X \p{Variation_Selectors_Supplement} \p{Block=
2760                                    Variation_Selectors_Supplement} (240)
2761        X \p{Vedic_Extensions}    \p{Block=Vedic_Extensions} (48)
2762        X \p{Vertical_Forms}      \p{Block=Vertical_Forms} (16)
2763          \p{VertSpace}           \v (7)
2764          \p{VS}                  \p{Variation_Selector} (=
2765                                    \p{Variation_Selector=Y}) (259)
2766          \p{VS: *}               \p{Variation_Selector: *}
2767          \p{WB: *}               \p{Word_Break: *}
2768          \p{White_Space}         \p{White_Space=Y} (Short: \p{WSpace}) (26)
2769          \p{White_Space: N*}     (Short: \p{Space=N}, \P{WSpace})
2770                                    (1_114_086)
2771          \p{White_Space: Y*}     (Short: \p{Space=Y}, \p{WSpace}) (26)
2772          \p{Word}                \w, including beyond ASCII (101_685)
2773          \p{Word_Break: ALetter} (Short: \p{WB=LE}) (23_694)
2774          \p{Word_Break: CR}      (Short: \p{WB=CR}) (1)
2775          \p{Word_Break: EX}      \p{Word_Break=ExtendNumLet} (10)
2776          \p{Word_Break: Extend}  (Short: \p{WB=Extend}) (1455)
2777          \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (10)
2778          \p{Word_Break: FO}      \p{Word_Break=Format} (137)
2779          \p{Word_Break: Format}  (Short: \p{WB=FO}) (137)
2780          \p{Word_Break: KA}      \p{Word_Break=Katakana} (309)
2781          \p{Word_Break: Katakana} (Short: \p{WB=KA}) (309)
2782          \p{Word_Break: LE}      \p{Word_Break=ALetter} (23_694)
2783          \p{Word_Break: LF}      (Short: \p{WB=LF}) (1)
2784          \p{Word_Break: MB}      \p{Word_Break=MidNumLet} (8)
2785          \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8)
2786          \p{Word_Break: MidNum}  (Short: \p{WB=MN}) (15)
2787          \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (8)
2788          \p{Word_Break: ML}      \p{Word_Break=MidLetter} (8)
2789          \p{Word_Break: MN}      \p{Word_Break=MidNum} (15)
2790          \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
2791          \p{Word_Break: NL}      \p{Word_Break=Newline} (5)
2792          \p{Word_Break: NU}      \p{Word_Break=Numeric} (402)
2793          \p{Word_Break: Numeric} (Short: \p{WB=NU}) (402)
2794          \p{Word_Break: Other}   (Short: \p{WB=XX}) (1_088_067)
2795          \p{Word_Break: XX}      \p{Word_Break=Other} (1_088_067)
2796          \p{WSpace}              \p{White_Space} (= \p{White_Space=Y}) (26)
2797          \p{WSpace: *}           \p{White_Space: *}
2798          \p{XDigit}              \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
2799          \p{XID_Continue}        \p{XID_Continue=Y} (Short: \p{XIDC})
2800                                    (101_615)
2801          \p{XID_Continue: N*}    (Short: \p{XIDC=N}, \P{XIDC}) (1_012_497)
2802          \p{XID_Continue: Y*}    (Short: \p{XIDC=Y}, \p{XIDC}) (101_615)
2803          \p{XID_Start}           \p{XID_Start=Y} (Short: \p{XIDS}) (99_741)
2804          \p{XID_Start: N*}       (Short: \p{XIDS=N}, \P{XIDS}) (1_014_371)
2805          \p{XID_Start: Y*}       (Short: \p{XIDS=Y}, \p{XIDS}) (99_741)
2806          \p{XIDC}                \p{XID_Continue} (= \p{XID_Continue=Y})
2807                                    (101_615)
2808          \p{XIDC: *}             \p{XID_Continue: *}
2809          \p{XIDS}                \p{XID_Start} (= \p{XID_Start=Y}) (99_741)
2810          \p{XIDS: *}             \p{XID_Start: *}
2811          \p{Xpeo}                \p{Old_Persian} (= \p{Script=Old_Persian})
2812                                    (NOT \p{Block=Old_Persian}) (50)
2813          \p{Xsux}                \p{Cuneiform} (= \p{Script=Cuneiform})
2814                                    (NOT \p{Block=Cuneiform}) (982)
2815          \p{Yi}                  \p{Script=Yi} (1220)
2816        X \p{Yi_Radicals}         \p{Block=Yi_Radicals} (64)
2817        X \p{Yi_Syllables}        \p{Block=Yi_Syllables} (1168)
2818          \p{Yiii}                \p{Yi} (= \p{Script=Yi}) (1220)
2819        X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols} (64)
2820          \p{Z}                   \p{Separator} (= \p{General_Category=
2821                                    Separator}) (20)
2822          \p{Zinh}                \p{Inherited} (= \p{Script=Inherited})
2823                                    (523)
2824          \p{Zl}                  \p{Line_Separator} (= \p{General_Category=
2825                                    Line_Separator}) (1)
2826          \p{Zp}                  \p{Paragraph_Separator} (=
2827                                    \p{General_Category=
2828                                    Paragraph_Separator}) (1)
2829          \p{Zs}                  \p{Space_Separator} (=
2830                                    \p{General_Category=Space_Separator})
2831                                    (18)
2832          \p{Zyyy}                \p{Common} (= \p{Script=Common}) (5395)
2833          \p{Zzzz}                \p{Unknown} (= \p{Script=Unknown})
2834                                    (1_006_751)
2835        T \p{_CanonDCIJ}          (For internal use by Perl, not necessarily
2836                                    stable) (= \p{Soft_Dotted=Y}) (46)
2837        T \p{_Case_Ignorable}     (For internal use by Perl, not necessarily
2838                                    stable) (= \p{Case_Ignorable=Y}) (1632)
2839        T \p{_CombAbove}          (For internal use by Perl, not necessarily
2840                                    stable) (= \p{Canonical_Combining_Class=
2841                                    Above}) (318)
2842        T \p{_X_Begin}            (For internal use by Perl, not necessarily
2843                                    stable) (1_113_907)
2844        T \p{_X_Extend}           (For internal use by Perl, not necessarily
2845                                    stable) (1462)
2846        T \p{_X_LV_LVT_V}         (For internal use by Perl, not necessarily
2847                                    stable) (11_267)
2848
2849   Legal \p{} and \P{} constructs that match no characters
2850       Unicode has some property-value pairs that currently don't match
2851       anything.  This happens generally either because they are obsolete, or
2852       for symmetry with other forms, but no language has yet been encoded
2853       that uses them.  In this version of Unicode, the following match zero
2854       code points:
2855
2856       \p{Canonical_Combining_Class=Attached_Below_Left}
2857       \p{Joining_Type=Left_Joining}
2858

Properties not accessible through \p{} and \P{}

2860       A few properties are accessible in Perl via various function calls
2861       only.  These are:
2862        Lowercase_Mapping          lc() and lcfirst()
2863        Titlecase_Mapping          ucfirst()
2864        Uppercase_Mapping          uc()
2865
2866       Case_Folding is accessible through the /i modifier in regular
2867       expressions.
2868
2869       The Name property is accessible through the \N{} interpolation in
2870       double-quoted strings and regular expressions, but both usages require
2871       a "use charnames;" to be specified, which also contains related
2872       functions viacode() and vianame().
2873

Unicode regular expression properties that are NOT accepted by Perl

2875       Perl will generate an error for a few character properties in Unicode
2876       when used in a regular expression.  The non-Unihan ones are listed
2877       below, with the reasons they are not accepted, perhaps with work-
2878       arounds.  The short names for the properties are listed enclosed in
2879       (parentheses).
2880
2881       Expands_On_NFC (XO_NFC)
2882       Expands_On_NFD (XO_NFD)
2883       Expands_On_NFKC (XO_NFKC)
2884       Expands_On_NFKD (XO_NFKD)
2885           Easily computed, and yet doesn't cover the common encoding forms
2886           (UTF-16/8)
2887
2888       Grapheme_Link (Gr_Link)
2889           Deprecated by Unicode.  Use ccc=vr
2890           (Canonical_Combining_Class=Virama) instead
2891
2892       Jamo_Short_Name (JSN)
2893           Used by Unicode internally for generating other properties and not
2894           intended to be used stand-alone
2895
2896       Script=Katakana_Or_Hiragana (sc=Hrkt)
2897           Obsolete.  All code points previously matched by this have been
2898           moved to "Script=Common"
2899
2900       An installation can choose to allow any of these to be matched by
2901       changing the controlling lists contained in the program
2902       $Config{privlib}/unicore/lib/unicore/mktables and then re-running
2903       lib/unicore/mktables.  (%Config is available from the Config module).
2904

Files in the To directory (for serious hackers only)

2906       All Unicode properties are really mappings (in the mathematical sense)
2907       from code points to their respective values.  As part of its build
2908       process, Perl constructs tables containing these mappings for all
2909       properties that it deals with.  But only a few of these are written out
2910       into files.  Those written out are in the directory
2911       $Config{privlib}/unicore/To/ (%Config is available from the Config
2912       module).
2913
2914       Those ones written are ones needed by Perl internally during execution,
2915       or for which there is some demand, and those for which there is no
2916       access through the Perl core.  Generally, properties that can be used
2917       in regular expression matching do not have their map tables written,
2918       like Script.  Nor are the simplistic properties that have a better,
2919       more complete version, such as Simple_Uppercase_Mapping
2920       (Uppercase_Mapping is written instead).
2921
2922       None of the properties in the To directory are currently directly
2923       accessible through the Perl core, although some may be accessed
2924       indirectly.  For example, the uc() function implements the
2925       Uppercase_Mapping property and uses the Upper.pl file found in this
2926       directory.
2927
2928       The available files with their properties (short names in parentheses),
2929       and any flags or comments about them, are:
2930
2931          Bmg.pl           Bidi_Mirroring_Glyph (bmg)
2932          Digit.pl         Perl_Decimal_Digit
2933          Fold.pl          Case_Folding (cf)
2934          Lower.pl         Lowercase_Mapping (lc)
2935          NFKCCF.pl        NFKC_Casefold (NFKC_CF)
2936          Title.pl         Titlecase_Mapping (tc)
2937          Upper.pl         Uppercase_Mapping (uc)
2938
2939       An installation can choose to change which files are generated by
2940       changing the controlling lists contained in the program
2941       $Config{privlib}/unicore/lib/unicore/mktables and then re-running
2942       lib/unicore/mktables.
2943
2944       Each of these files defines two hash entries to help reading programs
2945       decipher it.  One of them looks like this:
2946
2947           $utf8::SwashInfo{'ToNAME'}{'format'} = 's';
2948
2949       where 'NAME' is a name to indicate the property.  For backwards
2950       compatibility, this is not necessarily the property's official Unicode
2951       name.  (The 'To' is also for backwards compatibility.)  The hash entry
2952       gives the format of the mapping fields of the table, currently one of
2953       the following:
2954
2955         b    binary
2956         d    single decimal digit
2957         f    floating point number
2958         i    integer
2959         r    rational: an integer or a fraction
2960         s    arbitrary string
2961         x    positive hex whole number; a code point
2962
2963       This format applies only to the entries in the main body of the table.
2964       Entries defined in hashes or ones that are missing from the list can
2965       have a different format.
2966
2967       The value that the missing entries have is given by the other SwashInfo
2968       hash entry line; it looks like this:
2969
2970           $utf8::SwashInfo{'ToNAME'}{'missing'} = 'NaN';
2971
2972       This example line says that any Unicode code points not explicitly
2973       listed in the file have the value 'NaN' under the property indicated by
2974       NAME.  If the value is the special string "<code point>", it means that
2975       the value for any missing code point is the code point itself.  This
2976       happens, for example, in the file for Uppercase_Mapping (To/Upper.pl),
2977       in which code points like the character 'A', are missing because the
2978       uppercase of 'A' is itself.
2979

SEE ALSO

2981       <http://www.unicode.org/reports/tr44/>
2982
2983       perlrecharclass
2984
2985       perlunicode
2986
2987
2988
2989perl v5.12.4                      2011-11-04                   PERLUNIPROPS(1)
Impressum