PERLUNIPROPS(1)        Perl Programmers Reference Guide        PERLUNIPROPS(1)

NAME

       perluniprops - Index of Unicode Version 6.1.0 character properties in
       Perl

DESCRIPTION

       This document provides information about the portion of the Unicode
       database that deals with character properties, that is, the portion
       that is defined on single code points.  ("Other information in the
       Unicode data base" below briefly mentions other data that Unicode
       provides.)

       Perl can provide access to all non-provisional Unicode character
       properties, though not all are enabled by default.  The omitted ones
       are the Unihan properties (accessible via the CPAN module
       Unicode::Unihan) and certain deprecated or Unicode-internal properties.
       (An installation may choose to recompile Perl's tables to change this.
       See "Unicode character properties that are NOT accepted by Perl".)

       For most purposes, access to Unicode properties from the Perl core is
       through regular expression matches, as described in the next section.
       For some special purposes, and to access the properties that are not
       suitable for regular expression matching, all the Unicode character
       properties that Perl handles are accessible via the standard
       Unicode::UCD module, as described in the section "Properties accessible
       through Unicode::UCD".
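
       As a rough illustration of the Unicode::UCD route mentioned above,
       the sketch below looks up a few properties of U+0041 with the
       module's charinfo() and charblock() functions.  The choice of code
       point and the variable names are only for illustration.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo charblock);

# charinfo() returns a hash reference describing one code point.
my $info = charinfo(0x41);

my $name     = $info->{name};       # character name
my $category = $info->{category};   # General_Category short name
my $block    = charblock(0x41);     # block containing the code point

print "name:     $name\n";
print "category: $category\n";
print "block:    $block\n";
```

       For U+0041 this reports the name LATIN CAPITAL LETTER A, the
       category Lu, and the block Basic Latin.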

       Perl also provides some additional extensions and short-cut synonyms
       for Unicode properties.

       This document merely lists all available properties and does not
       attempt to explain what each property really means.  There is a brief
       description of each Perl extension; see "Other Properties" in
       perlunicode for more information on these.  There is some detail about
       Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
       to find out about the intricacies of the official Unicode properties,
       refer to the Unicode standard.  A good starting place is
       <http://www.unicode.org/reports/tr44/>.

       Note that you can define your own properties; see "User-Defined
       Character Properties" in perlunicode.

Properties accessible through "\p{}" and "\P{}"

       The Perl regular expression "\p{}" and "\P{}" constructs give access to
       most of the Unicode character properties.  The table below shows all
       these constructs, both single and compound forms.

       Compound forms consist of two components, separated by an equals sign
       or a colon.  The first component is the property name, and the second
       component is the particular value of the property to match against.
       For example, "\p{Script: Greek}" and "\p{Script=Greek}" both match
       characters whose Script property is Greek.

       Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
       their equivalent compound forms.  The table shows these equivalences.
       (In our example, "\p{Greek}" is just a shortcut for
       "\p{Script=Greek}".)  There are also a few Perl-defined single forms
       that are not shortcuts for a compound form.  One such is "\p{Word}".
       These are also listed in the table.
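
       A minimal sketch of the compound and single forms described above,
       using GREEK SMALL LETTER ALPHA (U+03B1) as an arbitrary test
       character.  The variable names are illustrative only.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Compound forms: property name, then '=' or ':', then the value.
my $with_equals = $alpha =~ /\p{Script=Greek}/  ? 1 : 0;
my $with_colon  = $alpha =~ /\p{Script: Greek}/ ? 1 : 0;

# Single form: a Perl-defined shortcut.
my $single = $alpha =~ /\p{Greek}/ ? 1 : 0;

# \p{Word} is a single form that is not a shortcut for a compound form.
my $word = "_" =~ /\p{Word}/ ? 1 : 0;

print "equals=$with_equals colon=$with_colon single=$single word=$word\n";
```

       All four tests succeed, so the script prints a 1 for each.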

       In parsing these constructs, Perl always ignores upper/lower case
       differences everywhere within the {braces}.  Thus "\p{Greek}" means the
       same thing as "\p{greek}".  But note that changing the case of the "p"
       or "P" before the left brace completely changes the meaning of the
       construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
       Casing in this document is for improved legibility.
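
       The following sketch illustrates the point about case: the spelling
       inside the braces does not matter, but "\p" versus "\P" does.  The
       test characters are arbitrary.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Case inside the braces is ignored.
my $any_case = (   $alpha =~ /\p{Greek}/
                && $alpha =~ /\p{greek}/
                && $alpha =~ /\p{GREEK}/ ) ? 1 : 0;

# Case of the letter before the brace is significant: \P negates.
my $alpha_not_greek = $alpha =~ /\P{Greek}/ ? 1 : 0;
my $latin_not_greek = "a"    =~ /\P{Greek}/ ? 1 : 0;

print "any_case=$any_case alpha_not_greek=$alpha_not_greek ",
      "latin_not_greek=$latin_not_greek\n";
```

       Here the Greek letter matches every spelling of "\p{Greek}" and
       fails "\P{Greek}", while the Latin letter "a" matches "\P{Greek}".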

       White space, hyphens, and underscores are also normally ignored
       everywhere between the {braces}, and hence can be freely added or
       removed even if the "/x" modifier hasn't been specified on the regular
       expression.  But a 'T' at the beginning of an entry in the table below
       means that tighter (stricter) rules are used for that entry:

       Single form ("\p{name}") tighter rules:
           White space, hyphens, and underscores ARE significant except for:

           ·   white space adjacent to a non-word character

           ·   underscores separating digits in numbers

           That means, for example, that you can freely add or remove white
           space adjacent to (but within) the braces without affecting the
           meaning.

       Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
           The tighter rules given above for the single form apply to
           everything to the right of the colon or equals sign; the looser
           rules still apply to everything to the left.

           That means, for example, that you can freely add or remove white
           space adjacent to (but within) the braces and the colon or equals
           sign.
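
       As a sketch of the loose rules, the patterns below spell the same
       Bidi_Class test several ways and should all behave alike.  The two
       Age patterns are the spellings of a 'T' entry that appear in the
       table.  The letter "A" is an arbitrary test character.

```perl
use strict;
use warnings;

# Loose matching: case, white space, hyphens, and underscores are ignored.
my @loose = (
    qr/\p{Bidi_Class: Left_To_Right}/,
    qr/\p{bidiclass:lefttoright}/,
    qr/\p{ Bidi Class = Left To Right }/,
    qr/\p{Bidi-Class=Left-To-Right}/,
);
my $loose_hits = grep { "A" =~ $_ } @loose;

# Tighter matching applies to Age values; these two spellings are the
# ones listed in the table.
my $age_ok = ( "A" =~ /\p{Age: 1.1}/ && "A" =~ /\p{Age=V1_1}/ ) ? 1 : 0;

print "loose_hits=$loose_hits age_ok=$age_ok\n";
```

       All four loose spellings match, as do both Age spellings.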

       Some properties are considered obsolete by Unicode, but are still
       available.  There are several varieties of obsolescence:

       Stabilized
           A property may be stabilized.  Such a determination does not
           indicate that the property should or should not be used; instead it
           is a declaration that the property will be neither maintained nor
           extended for newly encoded characters.  Such properties are marked
           with an 'S' in the table.

       Deprecated
           A property may be deprecated, perhaps because its original intent
           has been replaced by another property, or because its specification
           was somehow defective.  This means that its use is strongly
           discouraged, so much so that a warning will be issued if it is used,
           unless the regular expression is in the scope of a
           "no warnings 'deprecated'" statement.  A 'D' flags each such entry
           in the table, and the entry there for the longest, most descriptive
           version of the property will give the reason it is deprecated, and
           perhaps advice.  Perl may issue such a warning, even for properties
           that aren't officially deprecated by Unicode, when there used to be
           characters or code points that were matched by them, but no longer.
           This is to warn you that your program may not work like it did on
           earlier Unicode releases.

           A deprecated property may be made unavailable in a future Perl
           version, so it is best to move away from it.

           A deprecated property may also be stabilized, but this fact is not
           shown.

       Obsolete
           Properties marked with an 'O' in the table are considered (plain)
           obsolete.  Generally this designation is given to properties that
           Unicode once used for internal purposes (but not any longer).

       Some Perl extensions are present for backward compatibility; their use
       is discouraged, but they are not obsolete.  An 'X' flags each such
       entry in the table.  Future Unicode versions may force some of these
       extensions to be removed without warning, replaced by another property
       with the same name that means something different.  Use the equivalent
       shown instead.

       Matches in the Block property have shortcuts that begin with "In_".
       For example, "\p{Block=Latin1}" can be written as "\p{In_Latin1}".  For
       backward compatibility, if there is no conflict with another shortcut,
       these may also be written as "\p{Latin1}" or "\p{Is_Latin1}".  But,
       N.B., there are numerous such conflicting shortcuts.  Use of these
       forms for Block is discouraged, and they are flagged as such, not only
       because of the potential confusion as to what is meant, but also
       because a later release of Unicode may preempt the shortcut, and your
       program would no longer be correct.  Use the "In_" form instead to
       avoid this, or even more clearly, use the compound form, e.g.,
       "\p{blk:latin1}".  See "Blocks" in perlunicode for more information
       about this.
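
       The sketch below shows the three recommended spellings of a Block
       test, and why Block and Script should not be confused.  It assumes
       that ARABIC COMMA (U+060C) lies in the Arabic block while its
       Script property is Common rather than Arabic.

```perl
use strict;
use warnings;

my $e_acute = "\x{E9}";     # LATIN SMALL LETTER E WITH ACUTE
my $comma   = "\x{60C}";    # ARABIC COMMA

# Three equivalent ways to test the Block property.
my $in_latin1 = (   $e_acute =~ /\p{Block=Latin1}/
                 && $e_acute =~ /\p{In_Latin1}/
                 && $e_acute =~ /\p{blk:latin1}/ ) ? 1 : 0;

# Block membership and Script membership are different questions.
my $in_arabic_block  = $comma =~ /\p{Block: Arabic}/ ? 1 : 0;
my $in_arabic_script = $comma =~ /\p{Script=Arabic}/ ? 1 : 0;

print "in_latin1=$in_latin1 block=$in_arabic_block ",
      "script=$in_arabic_script\n";
```

       Spelling out "Block" or "Script" in the compound form avoids any
       dependence on what a bare name such as "\p{Arabic}" resolves to.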

       The table below has two columns.  The left column contains the "\p{}"
       constructs to look up, possibly preceded by the flags mentioned above;
       and the right column contains information about them, like a
       description, or synonyms.  It shows both the single and compound forms
       for each property that has them.  If the left column is a short name
       for a property, the right column will give its longer, more descriptive
       name; and if the left column is the longest name, the right column will
       show any equivalent shortest name, in both single and compound forms if
       applicable.

       The right column will also caution you if a property means something
       different than what might normally be expected.

       All single forms are Perl extensions; a few compound forms are as well,
       and are noted as such.

       Numbers in (parentheses) indicate the total number of code points
       matched by the property.  For emphasis, those properties that match no
       code points at all are listed as well in a separate section following
       the table.
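
       One way to check a parenthesized count is to test every code point
       in turn.  The sketch below does this with a hypothetical helper,
       count_matches(), for two properties whose counts do not depend on
       the Unicode version: "\p{AHex}" (22) and "\p{ASCII}" (128).

```perl
use strict;
use warnings;
no warnings 'utf8';    # surrogates and noncharacters are visited too

# Hypothetical helper: count the code points that match a pattern.
sub count_matches {
    my ($re) = @_;
    my $count = 0;
    for my $cp (0 .. 0x10FFFF) {
        $count++ if chr($cp) =~ $re;
    }
    return $count;
}

my $ahex  = count_matches(qr/\p{AHex}/);
my $ascii = count_matches(qr/\p{ASCII}/);

print "AHex: $ahex, ASCII: $ascii\n";
```

       The loop visits all 1_114_112 code points, so it takes a moment to
       run.  Counts for most other properties vary between Unicode
       releases and may not agree with the figures in this document.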

       Most properties match the same code points regardless of whether "/i"
       case-insensitive matching is specified or not.  But a few properties
       are affected.  These are shown with the notation

        (/i= other_property)

       in the second column.  Under case-insensitive matching they match the
       same code points as the property "other_property".
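
       As an illustration, the sketch below assumes that "\p{Lu}" is one of
       the affected properties, matching all cased letters under "/i".
       That entry is not shown in this part of the table, so treat the
       choice of property as an assumption.

```perl
use strict;
use warnings;

# Without /i, a lowercase letter is not an Uppercase_Letter.
my $plain = "a" =~ /\p{Lu}/ ? 1 : 0;

# With /i, \p{Lu} is assumed to match any cased letter.
my $folded = "a" =~ /\p{Lu}/i ? 1 : 0;

print "plain=$plain folded=$folded\n";
```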

       There is no description given for most non-Perl defined properties (see
       <http://www.unicode.org/reports/tr44/> for that).

       For compactness, '*' is used as a wildcard instead of showing all
       possible combinations.  For example, entries like:

        \p{Gc: *}                                  \p{General_Category: *}

       mean that 'Gc' is a synonym for 'General_Category', and anything that
       is valid for the latter is also valid for the former.  Similarly,

        \p{Is_*}                                   \p{*}

       means that if and only if, for example, "\p{Foo}" exists, then
       "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
       thing.  And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
       and "\p{IsFoo=Bar}".  "*" here is restricted to something not beginning
       with an underscore.
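
       A short sketch of the two wildcard entries above, using "A" as an
       arbitrary test character: 'Gc' stands in for 'General_Category',
       and an "Is" prefix can be added to a single form.

```perl
use strict;
use warnings;

# 'Gc' is a synonym for 'General_Category'.
my $gc_synonym = (   "A" =~ /\p{Gc: Lu}/
                  && "A" =~ /\p{General_Category: Lu}/ ) ? 1 : 0;

# \p{Alpha} exists, so \p{Is_Alpha} and \p{IsAlpha} are valid too.
my $is_prefix = (   "A" =~ /\p{Alpha}/
                 && "A" =~ /\p{Is_Alpha}/
                 && "A" =~ /\p{IsAlpha}/ ) ? 1 : 0;

print "gc_synonym=$gc_synonym is_prefix=$is_prefix\n";
```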

       Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
       'Y'.  And 'No', 'F', and 'False' are all synonyms for 'N'.  The table
       shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
       for the other possibilities.  Note that not all properties which have
       values 'Yes' and 'No' are binary.  The non-binary ones have all their
       values spelled out without using this wild card, and a "NOT" clause in
       their description highlights their not being binary.  They also require
       the compound form to match them, whereas true binary properties have
       both single and compound forms available.
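
       The sketch below exercises the value synonyms for one binary
       property, Alphabetic.  The test characters "A" (alphabetic) and "1"
       (not alphabetic) are arbitrary.

```perl
use strict;
use warnings;

# Every pattern here means "Alphabetic is true".
my @yes = (
    qr/\p{Alphabetic}/,
    qr/\p{Alphabetic=Y}/,
    qr/\p{Alpha=Yes}/,
    qr/\p{Alpha: T}/,
    qr/\p{Alpha=True}/,
);

# Every pattern here means "Alphabetic is false".
my @no = (
    qr/\P{Alpha}/,
    qr/\p{Alphabetic=N}/,
    qr/\p{Alpha=No}/,
    qr/\p{Alpha: F}/,
    qr/\p{Alpha=False}/,
);

my $yes_hits = grep { "A" =~ $_ } @yes;
my $no_hits  = grep { "1" =~ $_ } @no;

print "yes_hits=$yes_hits no_hits=$no_hits\n";
```

       Each list has five patterns, and all five match in both cases.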

       Note that all non-essential underscores are removed in the display of
       the short names below.

       Legend summary:

       * is a wild-card
       (\d+) in the info column gives the number of code points matched by
       this property.
       D means this is deprecated.
       O means this is obsolete.
       S means this is stabilized.
       T means tighter (stricter) name matching applies.
       X means use of this form is discouraged, and may not be stable.

              NAME                           INFO

        X \p{Aegean_Numbers}      \p{Block=Aegean_Numbers} (64)
        T \p{Age: 1.1}            \p{Age=V1_1} (33_979)
        T \p{Age: 2.0}            \p{Age=V2_0} (144_521)
        T \p{Age: 2.1}            \p{Age=V2_1} (2)
        T \p{Age: 3.0}            \p{Age=V3_0} (10_307)
        T \p{Age: 3.1}            \p{Age=V3_1} (44_978)
        T \p{Age: 3.2}            \p{Age=V3_2} (1016)
        T \p{Age: 4.0}            \p{Age=V4_0} (1226)
        T \p{Age: 4.1}            \p{Age=V4_1} (1273)
        T \p{Age: 5.0}            \p{Age=V5_0} (1369)
        T \p{Age: 5.1}            \p{Age=V5_1} (1624)
        T \p{Age: 5.2}            \p{Age=V5_2} (6648)
        T \p{Age: 6.0}            \p{Age=V6_0} (2088)
        T \p{Age: 6.1}            \p{Age=V6_1} (732)
          \p{Age: NA}             \p{Age=Unassigned} (864_349)
          \p{Age: Unassigned}     Code point's usage has not been assigned
                                    in any Unicode release thus far. (Short:
                                    \p{Age=NA}) (864_349)
          \p{Age: V1_1}           Code point's usage introduced in version
                                    1.1 (33_979)
          \p{Age: V2_0}           Code point's usage was introduced in
                                    version 2.0; See also Property
                                    'Present_In' (144_521)
          \p{Age: V2_1}           Code point's usage was introduced in
                                    version 2.1; See also Property
                                    'Present_In' (2)
          \p{Age: V3_0}           Code point's usage was introduced in
                                    version 3.0; See also Property
                                    'Present_In' (10_307)
          \p{Age: V3_1}           Code point's usage was introduced in
                                    version 3.1; See also Property
                                    'Present_In' (44_978)
          \p{Age: V3_2}           Code point's usage was introduced in
                                    version 3.2; See also Property
                                    'Present_In' (1016)
          \p{Age: V4_0}           Code point's usage was introduced in
                                    version 4.0; See also Property
                                    'Present_In' (1226)
          \p{Age: V4_1}           Code point's usage was introduced in
                                    version 4.1; See also Property
                                    'Present_In' (1273)
          \p{Age: V5_0}           Code point's usage was introduced in
                                    version 5.0; See also Property
                                    'Present_In' (1369)
          \p{Age: V5_1}           Code point's usage was introduced in
                                    version 5.1; See also Property
                                    'Present_In' (1624)
          \p{Age: V5_2}           Code point's usage was introduced in
                                    version 5.2; See also Property
                                    'Present_In' (6648)
          \p{Age: V6_0}           Code point's usage was introduced in
                                    version 6.0; See also Property
                                    'Present_In' (2088)
          \p{Age: V6_1}           Code point's usage was introduced in
                                    version 6.1; See also Property
                                    'Present_In' (732)
          \p{AHex}                \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
                                    (22)
          \p{AHex: *}             \p{ASCII_Hex_Digit: *}
        X \p{Alchemical}          \p{Alchemical_Symbols} (= \p{Block=
                                    Alchemical_Symbols}) (128)
        X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short:
                                    \p{InAlchemical}) (128)
          \p{All}                 \p{Any} (1_114_112)
          \p{Alnum}               Alphabetic and (decimal) Numeric (102_619)
          \p{Alpha}               \p{Alphabetic=Y} (102_159)
          \p{Alpha: *}            \p{Alphabetic: *}
          \p{Alphabetic}          \p{Alpha} (= \p{Alphabetic=Y}) (102_159)
          \p{Alphabetic: N*}      (Short: \p{Alpha=N}, \P{Alpha}) (1_011_953)
          \p{Alphabetic: Y*}      (Short: \p{Alpha=Y}, \p{Alpha}) (102_159)
        X \p{Alphabetic_PF}       \p{Alphabetic_Presentation_Forms} (=
                                    \p{Block=Alphabetic_Presentation_Forms})
                                    (80)
        X \p{Alphabetic_Presentation_Forms} \p{Block=
                                    Alphabetic_Presentation_Forms} (Short:
                                    \p{InAlphabeticPF}) (80)
        X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
                                    \p{Block=
                                    Ancient_Greek_Musical_Notation}) (80)
        X \p{Ancient_Greek_Musical_Notation} \p{Block=
                                    Ancient_Greek_Musical_Notation} (Short:
                                    \p{InAncientGreekMusic}) (80)
        X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
        X \p{Ancient_Symbols}     \p{Block=Ancient_Symbols} (64)
          \p{Any}                 [\x{0000}-\x{10FFFF}] (1_114_112)
          \p{Arab}                \p{Arabic} (= \p{Script=Arabic}) (NOT
                                    \p{Block=Arabic}) (1234)
          \p{Arabic}              \p{Script=Arabic} (Short: \p{Arab}; NOT
                                    \p{Block=Arabic}) (1234)
        X \p{Arabic_Ext_A}        \p{Arabic_Extended_A} (= \p{Block=
                                    Arabic_Extended_A}) (96)
        X \p{Arabic_Extended_A}   \p{Block=Arabic_Extended_A} (Short:
                                    \p{InArabicExtA}) (96)
        X \p{Arabic_Math}         \p{Arabic_Mathematical_Alphabetic_Symbols}
                                    (= \p{Block=
                                    Arabic_Mathematical_Alphabetic_Symbols})
                                    (256)
        X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
                                    Arabic_Mathematical_Alphabetic_Symbols}
                                    (Short: \p{InArabicMath}) (256)
        X \p{Arabic_PF_A}         \p{Arabic_Presentation_Forms_A} (=
                                    \p{Block=Arabic_Presentation_Forms_A})
                                    (688)
        X \p{Arabic_PF_B}         \p{Arabic_Presentation_Forms_B} (=
                                    \p{Block=Arabic_Presentation_Forms_B})
                                    (144)
        X \p{Arabic_Presentation_Forms_A} \p{Block=
                                    Arabic_Presentation_Forms_A} (Short:
                                    \p{InArabicPFA}) (688)
        X \p{Arabic_Presentation_Forms_B} \p{Block=
                                    Arabic_Presentation_Forms_B} (Short:
                                    \p{InArabicPFB}) (144)
        X \p{Arabic_Sup}          \p{Arabic_Supplement} (= \p{Block=
                                    Arabic_Supplement}) (48)
        X \p{Arabic_Supplement}   \p{Block=Arabic_Supplement} (Short:
                                    \p{InArabicSup}) (48)
          \p{Armenian}            \p{Script=Armenian} (Short: \p{Armn}; NOT
                                    \p{Block=Armenian}) (91)
          \p{Armi}                \p{Imperial_Aramaic} (= \p{Script=
                                    Imperial_Aramaic}) (NOT \p{Block=
                                    Imperial_Aramaic}) (31)
          \p{Armn}                \p{Armenian} (= \p{Script=Armenian}) (NOT
                                    \p{Block=Armenian}) (91)
        X \p{Arrows}              \p{Block=Arrows} (112)
          \p{ASCII}               \p{Block=Basic_Latin} [[:ASCII:]] (128)
          \p{ASCII_Hex_Digit}     \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
                                    (22)
          \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090)
          \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22)
          \p{Assigned}            All assigned code points (249_697)
          \p{Avestan}             \p{Script=Avestan} (Short: \p{Avst}; NOT
                                    \p{Block=Avestan}) (61)
          \p{Avst}                \p{Avestan} (= \p{Script=Avestan}) (NOT
                                    \p{Block=Avestan}) (61)
          \p{Bali}                \p{Balinese} (= \p{Script=Balinese}) (NOT
                                    \p{Block=Balinese}) (121)
          \p{Balinese}            \p{Script=Balinese} (Short: \p{Bali}; NOT
                                    \p{Block=Balinese}) (121)
          \p{Bamu}                \p{Bamum} (= \p{Script=Bamum}) (NOT
                                    \p{Block=Bamum}) (657)
          \p{Bamum}               \p{Script=Bamum} (Short: \p{Bamu}; NOT
                                    \p{Block=Bamum}) (657)
        X \p{Bamum_Sup}           \p{Bamum_Supplement} (= \p{Block=
                                    Bamum_Supplement}) (576)
        X \p{Bamum_Supplement}    \p{Block=Bamum_Supplement} (Short:
                                    \p{InBamumSup}) (576)
        X \p{Basic_Latin}         \p{ASCII} (= \p{Block=Basic_Latin}) (128)
          \p{Batak}               \p{Script=Batak} (Short: \p{Batk}; NOT
                                    \p{Block=Batak}) (56)
          \p{Batk}                \p{Batak} (= \p{Script=Batak}) (NOT
                                    \p{Block=Batak}) (56)
          \p{Bc: *}               \p{Bidi_Class: *}
          \p{Beng}                \p{Bengali} (= \p{Script=Bengali}) (NOT
                                    \p{Block=Bengali}) (92)
          \p{Bengali}             \p{Script=Bengali} (Short: \p{Beng}; NOT
                                    \p{Block=Bengali}) (92)
          \p{Bidi_C}              \p{Bidi_Control} (= \p{Bidi_Control=Y}) (7)
          \p{Bidi_C: *}           \p{Bidi_Control: *}
          \p{Bidi_Class: AL}      \p{Bidi_Class=Arabic_Letter} (1438)
          \p{Bidi_Class: AN}      \p{Bidi_Class=Arabic_Number} (49)
          \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1438)
          \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (49)
          \p{Bidi_Class: B}       \p{Bidi_Class=Paragraph_Separator} (7)
          \p{Bidi_Class: BN}      \p{Bidi_Class=Boundary_Neutral} (4015)
          \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4015)
          \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15)
          \p{Bidi_Class: CS}      \p{Bidi_Class=Common_Separator} (15)
          \p{Bidi_Class: EN}      \p{Bidi_Class=European_Number} (131)
          \p{Bidi_Class: ES}      \p{Bidi_Class=European_Separator} (12)
          \p{Bidi_Class: ET}      \p{Bidi_Class=European_Terminator} (65)
          \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (131)
          \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12)
          \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (65)
          \p{Bidi_Class: L}       \p{Bidi_Class=Left_To_Right} (1_098_531)
          \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_098_531)
          \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1)
          \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1)
          \p{Bidi_Class: LRE}     \p{Bidi_Class=Left_To_Right_Embedding} (1)
          \p{Bidi_Class: LRO}     \p{Bidi_Class=Left_To_Right_Override} (1)
          \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1290)
          \p{Bidi_Class: NSM}     \p{Bidi_Class=Nonspacing_Mark} (1290)
          \p{Bidi_Class: ON}      \p{Bidi_Class=Other_Neutral} (4447)
          \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (4447)
          \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7)
          \p{Bidi_Class: PDF}     \p{Bidi_Class=Pop_Directional_Format} (1)
          \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1)
          \p{Bidi_Class: R}       \p{Bidi_Class=Right_To_Left} (4086)
          \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (4086)
          \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1)
          \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1)
          \p{Bidi_Class: RLE}     \p{Bidi_Class=Right_To_Left_Embedding} (1)
          \p{Bidi_Class: RLO}     \p{Bidi_Class=Right_To_Left_Override} (1)
          \p{Bidi_Class: S}       \p{Bidi_Class=Segment_Separator} (3)
          \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3)
          \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (18)
          \p{Bidi_Class: WS}      \p{Bidi_Class=White_Space} (18)
          \p{Bidi_Control}        \p{Bidi_Control=Y} (Short: \p{BidiC}) (7)
          \p{Bidi_Control: N*}    (Short: \p{BidiC=N}, \P{BidiC}) (1_114_105)
          \p{Bidi_Control: Y*}    (Short: \p{BidiC=Y}, \p{BidiC}) (7)
          \p{Bidi_M}              \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
                                    (545)
          \p{Bidi_M: *}           \p{Bidi_Mirrored: *}
          \p{Bidi_Mirrored}       \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
                                    (545)
          \p{Bidi_Mirrored: N*}   (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567)
          \p{Bidi_Mirrored: Y*}   (Short: \p{BidiM=Y}, \p{BidiM}) (545)
433          \p{Blank}               \h, Horizontal white space (19)
434          \p{Blk: *}              \p{Block: *}
435          \p{Block: Aegean_Numbers} (Single: \p{InAegeanNumbers}) (64)
436          \p{Block: Alchemical}   \p{Block=Alchemical_Symbols} (128)
437          \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical},
438                                    \p{InAlchemical}) (128)
439          \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
440                                    (80)
441          \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
442                                    AlphabeticPF}, \p{InAlphabeticPF}) (80)
443          \p{Block: Ancient_Greek_Music} \p{Block=
444                                    Ancient_Greek_Musical_Notation} (80)
445          \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
446                                    AncientGreekMusic},
447                                    \p{InAncientGreekMusic}) (80)
448          \p{Block: Ancient_Greek_Numbers} (Single:
449                                    \p{InAncientGreekNumbers}) (80)
450          \p{Block: Ancient_Symbols} (Single: \p{InAncientSymbols}) (64)
451          \p{Block: Arabic}       (Single: \p{InArabic}; NOT \p{Arabic} NOR
452                                    \p{Is_Arabic}) (256)
453          \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
454          \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA},
455                                    \p{InArabicExtA}) (96)
456          \p{Block: Arabic_Math}  \p{Block=
457                                    Arabic_Mathematical_Alphabetic_Symbols}
458                                    (256)
459          \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
460                                    ArabicMath}, \p{InArabicMath}) (256)
461          \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
462          \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
463          \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA},
464                                    \p{InArabicPFA}) (688)
465          \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB},
466                                    \p{InArabicPFB}) (144)
467          \p{Block: Arabic_Sup}   \p{Block=Arabic_Supplement} (48)
468          \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup},
469                                    \p{InArabicSup}) (48)
470          \p{Block: Armenian}     (Single: \p{InArmenian}; NOT \p{Armenian}
471                                    NOR \p{Is_Armenian}) (96)
472          \p{Block: Arrows}       (Single: \p{InArrows}) (112)
473          \p{Block: ASCII}        \p{Block=Basic_Latin} (128)
474          \p{Block: Avestan}      (Single: \p{InAvestan}; NOT \p{Avestan}
475                                    NOR \p{Is_Avestan}) (64)
476          \p{Block: Balinese}     (Single: \p{InBalinese}; NOT \p{Balinese}
477                                    NOR \p{Is_Balinese}) (128)
478          \p{Block: Bamum}        (Single: \p{InBamum}; NOT \p{Bamum} NOR
479                                    \p{Is_Bamum}) (96)
480          \p{Block: Bamum_Sup}    \p{Block=Bamum_Supplement} (576)
481          \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup},
482                                    \p{InBamumSup}) (576)
483          \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}, \p{ASCII}) (128)
484          \p{Block: Batak}        (Single: \p{InBatak}; NOT \p{Batak} NOR
485                                    \p{Is_Batak}) (64)
          \p{Block: Bengali}      (Single: \p{InBengali}; NOT \p{Bengali}
                                    NOR \p{Is_Bengali}) (128)
          \p{Block: Block_Elements} (Single: \p{InBlockElements}) (32)
          \p{Block: Bopomofo}     (Single: \p{InBopomofo}; NOT \p{Bopomofo}
                                    NOR \p{Is_Bopomofo}) (48)
          \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
          \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt},
                                    \p{InBopomofoExt}) (32)
          \p{Block: Box_Drawing}  (Single: \p{InBoxDrawing}) (128)
          \p{Block: Brahmi}       (Single: \p{InBrahmi}; NOT \p{Brahmi} NOR
                                    \p{Is_Brahmi}) (128)
          \p{Block: Braille}      \p{Block=Braille_Patterns} (256)
          \p{Block: Braille_Patterns} (Short: \p{Blk=Braille},
                                    \p{InBraille}) (256)
          \p{Block: Buginese}     (Single: \p{InBuginese}; NOT \p{Buginese}
                                    NOR \p{Is_Buginese}) (32)
          \p{Block: Buhid}        (Single: \p{InBuhid}; NOT \p{Buhid} NOR
                                    \p{Is_Buhid}) (32)
          \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
                                    (256)
          \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
                                    ByzantineMusic}, \p{InByzantineMusic})
                                    (256)
          \p{Block: Canadian_Syllabics} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics}
                                    (640)
          \p{Block: Carian}       (Single: \p{InCarian}; NOT \p{Carian} NOR
                                    \p{Is_Carian}) (64)
          \p{Block: Chakma}       (Single: \p{InChakma}; NOT \p{Chakma} NOR
                                    \p{Is_Chakma}) (80)
          \p{Block: Cham}         (Single: \p{InCham}; NOT \p{Cham} NOR
                                    \p{Is_Cham}) (96)
          \p{Block: Cherokee}     (Single: \p{InCherokee}; NOT \p{Cherokee}
                                    NOR \p{Is_Cherokee}) (96)
          \p{Block: CJK}          \p{Block=CJK_Unified_Ideographs} (20_992)
          \p{Block: CJK_Compat}   \p{Block=CJK_Compatibility} (256)
          \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
          \p{Block: CJK_Compat_Ideographs} \p{Block=
                                    CJK_Compatibility_Ideographs} (512)
          \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
                                    CJK_Compatibility_Ideographs_Supplement}
                                    (544)
          \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat},
                                    \p{InCJKCompat}) (256)
          \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms},
                                    \p{InCJKCompatForms}) (32)
          \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
                                    CJKCompatIdeographs},
                                    \p{InCJKCompatIdeographs}) (512)
          \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
                                    CJKCompatIdeographsSup},
                                    \p{InCJKCompatIdeographsSup}) (544)
          \p{Block: CJK_Ext_A}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_A}
                                    (6592)
          \p{Block: CJK_Ext_B}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_B}
                                    (42_720)
          \p{Block: CJK_Ext_C}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_C}
                                    (4160)
          \p{Block: CJK_Ext_D}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_D} (224)
          \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
          \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup},
                                    \p{InCJKRadicalsSup}) (128)
          \p{Block: CJK_Strokes}  (Single: \p{InCJKStrokes}) (48)
          \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
          \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols},
                                    \p{InCJKSymbols}) (64)
          \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}, \p{InCJK})
                                    (20_992)
          \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
                                    CJKExtA}, \p{InCJKExtA}) (6592)
          \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
                                    CJKExtB}, \p{InCJKExtB}) (42_720)
          \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
                                    CJKExtC}, \p{InCJKExtC}) (4160)
          \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
                                    CJKExtD}, \p{InCJKExtD}) (224)
          \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
                                    Diacriticals}, \p{InDiacriticals}) (112)
          \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
                                    DiacriticalsForSymbols},
                                    \p{InDiacriticalsForSymbols}) (48)
          \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
                                    DiacriticalsSup}, \p{InDiacriticalsSup})
                                    (64)
          \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks},
                                    \p{InHalfMarks}) (16)
          \p{Block: Combining_Marks_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (48)
          \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
                                    IndicNumberForms},
                                    \p{InIndicNumberForms}) (16)
          \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
          \p{Block: Control_Pictures} (Single: \p{InControlPictures}) (64)
          \p{Block: Coptic}       (Single: \p{InCoptic}; NOT \p{Coptic} NOR
                                    \p{Is_Coptic}) (128)
          \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
          \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod},
                                    \p{InCountingRod}) (32)
          \p{Block: Cuneiform}    (Single: \p{InCuneiform}; NOT
                                    \p{Cuneiform} NOR \p{Is_Cuneiform})
                                    (1024)
          \p{Block: Cuneiform_Numbers} \p{Block=
                                    Cuneiform_Numbers_And_Punctuation} (128)
          \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
                                    CuneiformNumbers},
                                    \p{InCuneiformNumbers}) (128)
          \p{Block: Currency_Symbols} (Single: \p{InCurrencySymbols}) (48)
          \p{Block: Cypriot_Syllabary} (Single: \p{InCypriotSyllabary}) (64)
          \p{Block: Cyrillic}     (Single: \p{InCyrillic}; NOT \p{Cyrillic}
                                    NOR \p{Is_Cyrillic}) (256)
          \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
          \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
          \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA},
                                    \p{InCyrillicExtA}) (32)
          \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB},
                                    \p{InCyrillicExtB}) (96)
          \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
          \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup},
                                    \p{InCyrillicSup}) (48)
          \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
                                    (48)
          \p{Block: Deseret}      (Single: \p{InDeseret}) (80)
          \p{Block: Devanagari}   (Single: \p{InDevanagari}; NOT
                                    \p{Devanagari} NOR \p{Is_Devanagari})
                                    (128)
          \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
          \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt},
                                    \p{InDevanagariExt}) (32)
          \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
          \p{Block: Diacriticals_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (48)
          \p{Block: Diacriticals_Sup} \p{Block=
                                    Combining_Diacritical_Marks_Supplement}
                                    (64)
          \p{Block: Dingbats}     (Single: \p{InDingbats}) (192)
          \p{Block: Domino}       \p{Block=Domino_Tiles} (112)
          \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}, \p{InDomino}) (112)
          \p{Block: Egyptian_Hieroglyphs} (Single:
                                    \p{InEgyptianHieroglyphs}; NOT
                                    \p{Egyptian_Hieroglyphs} NOR
                                    \p{Is_Egyptian_Hieroglyphs}) (1072)
          \p{Block: Emoticons}    (Single: \p{InEmoticons}) (80)
          \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
          \p{Block: Enclosed_Alphanum_Sup} \p{Block=
                                    Enclosed_Alphanumeric_Supplement} (256)
          \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
                                    EnclosedAlphanumSup},
                                    \p{InEnclosedAlphanumSup}) (256)
          \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
                                    EnclosedAlphanum},
                                    \p{InEnclosedAlphanum}) (160)
          \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
                                    (256)
          \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
                                    EnclosedCJK}, \p{InEnclosedCJK}) (256)
          \p{Block: Enclosed_Ideographic_Sup} \p{Block=
                                    Enclosed_Ideographic_Supplement} (256)
          \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
                                    EnclosedIdeographicSup},
                                    \p{InEnclosedIdeographicSup}) (256)
          \p{Block: Ethiopic}     (Single: \p{InEthiopic}; NOT \p{Ethiopic}
                                    NOR \p{Is_Ethiopic}) (384)
          \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
          \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
          \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt},
                                    \p{InEthiopicExt}) (96)
          \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA},
                                    \p{InEthiopicExtA}) (48)
          \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
          \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup},
                                    \p{InEthiopicSup}) (32)
          \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation},
                                    \p{InPunctuation}; NOT \p{Punct} NOR
                                    \p{Is_Punctuation}) (112)
          \p{Block: Geometric_Shapes} (Single: \p{InGeometricShapes}) (96)
          \p{Block: Georgian}     (Single: \p{InGeorgian}; NOT \p{Georgian}
                                    NOR \p{Is_Georgian}) (96)
          \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
          \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup},
                                    \p{InGeorgianSup}) (48)
          \p{Block: Glagolitic}   (Single: \p{InGlagolitic}; NOT
                                    \p{Glagolitic} NOR \p{Is_Glagolitic})
                                    (96)
          \p{Block: Gothic}       (Single: \p{InGothic}; NOT \p{Gothic} NOR
                                    \p{Is_Gothic}) (32)
          \p{Block: Greek}        \p{Block=Greek_And_Coptic} (NOT \p{Greek}
                                    NOR \p{Is_Greek}) (144)
          \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}, \p{InGreek};
                                    NOT \p{Greek} NOR \p{Is_Greek}) (144)
          \p{Block: Greek_Ext}    \p{Block=Greek_Extended} (256)
          \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt},
                                    \p{InGreekExt}) (256)
          \p{Block: Gujarati}     (Single: \p{InGujarati}; NOT \p{Gujarati}
                                    NOR \p{Is_Gujarati}) (128)
          \p{Block: Gurmukhi}     (Single: \p{InGurmukhi}; NOT \p{Gurmukhi}
                                    NOR \p{Is_Gurmukhi}) (128)
          \p{Block: Half_And_Full_Forms} \p{Block=
                                    Halfwidth_And_Fullwidth_Forms} (240)
          \p{Block: Half_Marks}   \p{Block=Combining_Half_Marks} (16)
          \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
                                    HalfAndFullForms},
                                    \p{InHalfAndFullForms}) (240)
          \p{Block: Hangul}       \p{Block=Hangul_Syllables} (NOT \p{Hangul}
                                    NOR \p{Is_Hangul}) (11_184)
          \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo},
                                    \p{InCompatJamo}) (96)
          \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}, \p{InJamo}) (256)
          \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA},
                                    \p{InJamoExtA}) (32)
          \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB},
                                    \p{InJamoExtB}) (80)
          \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}, \p{InHangul};
                                    NOT \p{Hangul} NOR \p{Is_Hangul})
                                    (11_184)
          \p{Block: Hanunoo}      (Single: \p{InHanunoo}; NOT \p{Hanunoo}
                                    NOR \p{Is_Hanunoo}) (32)
          \p{Block: Hebrew}       (Single: \p{InHebrew}; NOT \p{Hebrew} NOR
                                    \p{Is_Hebrew}) (112)
          \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
                                    HighPUSurrogates},
                                    \p{InHighPUSurrogates}) (128)
          \p{Block: High_PU_Surrogates} \p{Block=
                                    High_Private_Use_Surrogates} (128)
          \p{Block: High_Surrogates} (Single: \p{InHighSurrogates}) (896)
          \p{Block: Hiragana}     (Single: \p{InHiragana}; NOT \p{Hiragana}
                                    NOR \p{Is_Hiragana}) (96)
          \p{Block: IDC}          \p{Block=
                                    Ideographic_Description_Characters} (NOT
                                    \p{ID_Continue} NOR \p{Is_IDC}) (16)
          \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC},
                                    \p{InIDC}; NOT \p{ID_Continue} NOR
                                    \p{Is_IDC}) (16)
          \p{Block: Imperial_Aramaic} (Single: \p{InImperialAramaic}; NOT
                                    \p{Imperial_Aramaic} NOR
                                    \p{Is_Imperial_Aramaic}) (32)
          \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
                                    (16)
          \p{Block: Inscriptional_Pahlavi} (Single:
                                    \p{InInscriptionalPahlavi}; NOT
                                    \p{Inscriptional_Pahlavi} NOR
                                    \p{Is_Inscriptional_Pahlavi}) (32)
          \p{Block: Inscriptional_Parthian} (Single:
                                    \p{InInscriptionalParthian}; NOT
                                    \p{Inscriptional_Parthian} NOR
                                    \p{Is_Inscriptional_Parthian}) (32)
          \p{Block: IPA_Ext}      \p{Block=IPA_Extensions} (96)
          \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}, \p{InIPAExt})
                                    (96)
          \p{Block: Jamo}         \p{Block=Hangul_Jamo} (256)
          \p{Block: Jamo_Ext_A}   \p{Block=Hangul_Jamo_Extended_A} (32)
          \p{Block: Jamo_Ext_B}   \p{Block=Hangul_Jamo_Extended_B} (80)
          \p{Block: Javanese}     (Single: \p{InJavanese}; NOT \p{Javanese}
                                    NOR \p{Is_Javanese}) (96)
          \p{Block: Kaithi}       (Single: \p{InKaithi}; NOT \p{Kaithi} NOR
                                    \p{Is_Kaithi}) (80)
          \p{Block: Kana_Sup}     \p{Block=Kana_Supplement} (256)
          \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}, \p{InKanaSup})
                                    (256)
          \p{Block: Kanbun}       (Single: \p{InKanbun}) (16)
          \p{Block: Kangxi}       \p{Block=Kangxi_Radicals} (224)
          \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}, \p{InKangxi})
                                    (224)
          \p{Block: Kannada}      (Single: \p{InKannada}; NOT \p{Kannada}
                                    NOR \p{Is_Kannada}) (128)
          \p{Block: Katakana}     (Single: \p{InKatakana}; NOT \p{Katakana}
                                    NOR \p{Is_Katakana}) (96)
          \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
          \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
                                    KatakanaExt}, \p{InKatakanaExt}) (16)
          \p{Block: Kayah_Li}     (Single: \p{InKayahLi}) (48)
          \p{Block: Kharoshthi}   (Single: \p{InKharoshthi}; NOT
                                    \p{Kharoshthi} NOR \p{Is_Kharoshthi})
                                    (96)
          \p{Block: Khmer}        (Single: \p{InKhmer}; NOT \p{Khmer} NOR
                                    \p{Is_Khmer}) (128)
          \p{Block: Khmer_Symbols} (Single: \p{InKhmerSymbols}) (32)
          \p{Block: Lao}          (Single: \p{InLao}; NOT \p{Lao} NOR
                                    \p{Is_Lao}) (128)
          \p{Block: Latin_1}      \p{Block=Latin_1_Supplement} (128)
          \p{Block: Latin_1_Sup}  \p{Block=Latin_1_Supplement} (128)
          \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1},
                                    \p{InLatin1}) (128)
          \p{Block: Latin_Ext_A}  \p{Block=Latin_Extended_A} (128)
          \p{Block: Latin_Ext_Additional} \p{Block=
                                    Latin_Extended_Additional} (256)
          \p{Block: Latin_Ext_B}  \p{Block=Latin_Extended_B} (208)
          \p{Block: Latin_Ext_C}  \p{Block=Latin_Extended_C} (32)
          \p{Block: Latin_Ext_D}  \p{Block=Latin_Extended_D} (224)
          \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA},
                                    \p{InLatinExtA}) (128)
          \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
                                    LatinExtAdditional},
                                    \p{InLatinExtAdditional}) (256)
          \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB},
                                    \p{InLatinExtB}) (208)
          \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC},
                                    \p{InLatinExtC}) (32)
          \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD},
                                    \p{InLatinExtD}) (224)
          \p{Block: Lepcha}       (Single: \p{InLepcha}; NOT \p{Lepcha} NOR
                                    \p{Is_Lepcha}) (80)
          \p{Block: Letterlike_Symbols} (Single: \p{InLetterlikeSymbols})
                                    (80)
          \p{Block: Limbu}        (Single: \p{InLimbu}; NOT \p{Limbu} NOR
                                    \p{Is_Limbu}) (80)
          \p{Block: Linear_B_Ideograms} (Single: \p{InLinearBIdeograms})
                                    (128)
          \p{Block: Linear_B_Syllabary} (Single: \p{InLinearBSyllabary})
                                    (128)
          \p{Block: Lisu}         (Single: \p{InLisu}) (48)
          \p{Block: Low_Surrogates} (Single: \p{InLowSurrogates}) (1024)
          \p{Block: Lycian}       (Single: \p{InLycian}; NOT \p{Lycian} NOR
                                    \p{Is_Lycian}) (32)
          \p{Block: Lydian}       (Single: \p{InLydian}; NOT \p{Lydian} NOR
                                    \p{Is_Lydian}) (32)
          \p{Block: Mahjong}      \p{Block=Mahjong_Tiles} (48)
          \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}, \p{InMahjong})
                                    (48)
          \p{Block: Malayalam}    (Single: \p{InMalayalam}; NOT
                                    \p{Malayalam} NOR \p{Is_Malayalam}) (128)
          \p{Block: Mandaic}      (Single: \p{InMandaic}; NOT \p{Mandaic}
                                    NOR \p{Is_Mandaic}) (32)
          \p{Block: Math_Alphanum} \p{Block=
                                    Mathematical_Alphanumeric_Symbols} (1024)
          \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
          \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
                                    MathAlphanum}, \p{InMathAlphanum}) (1024)
          \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators},
                                    \p{InMathOperators}) (256)
          \p{Block: Meetei_Mayek} (Single: \p{InMeeteiMayek}; NOT
                                    \p{Meetei_Mayek} NOR
                                    \p{Is_Meetei_Mayek}) (64)
          \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
          \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt},
                                    \p{InMeeteiMayekExt}) (32)
          \p{Block: Meroitic_Cursive} (Single: \p{InMeroiticCursive}; NOT
                                    \p{Meroitic_Cursive} NOR
                                    \p{Is_Meroitic_Cursive}) (96)
          \p{Block: Meroitic_Hieroglyphs} (Single:
                                    \p{InMeroiticHieroglyphs}) (32)
          \p{Block: Miao}         (Single: \p{InMiao}; NOT \p{Miao} NOR
                                    \p{Is_Miao}) (160)
          \p{Block: Misc_Arrows}  \p{Block=Miscellaneous_Symbols_And_Arrows}
                                    (256)
          \p{Block: Misc_Math_Symbols_A} \p{Block=
                                    Miscellaneous_Mathematical_Symbols_A}
                                    (48)
          \p{Block: Misc_Math_Symbols_B} \p{Block=
                                    Miscellaneous_Mathematical_Symbols_B}
                                    (128)
          \p{Block: Misc_Pictographs} \p{Block=
                                    Miscellaneous_Symbols_And_Pictographs}
                                    (768)
          \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
          \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
          \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
                                    MiscMathSymbolsA},
                                    \p{InMiscMathSymbolsA}) (48)
          \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
                                    MiscMathSymbolsB},
                                    \p{InMiscMathSymbolsB}) (128)
          \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols},
                                    \p{InMiscSymbols}) (256)
          \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
                                    MiscArrows}, \p{InMiscArrows}) (256)
          \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
                                    MiscPictographs}, \p{InMiscPictographs})
                                    (768)
          \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical},
                                    \p{InMiscTechnical}) (256)
          \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
          \p{Block: Modifier_Tone_Letters} (Single:
                                    \p{InModifierToneLetters}) (32)
          \p{Block: Mongolian}    (Single: \p{InMongolian}; NOT
                                    \p{Mongolian} NOR \p{Is_Mongolian}) (176)
          \p{Block: Music}        \p{Block=Musical_Symbols} (256)
          \p{Block: Musical_Symbols} (Short: \p{Blk=Music}, \p{InMusic})
                                    (256)
          \p{Block: Myanmar}      (Single: \p{InMyanmar}; NOT \p{Myanmar}
                                    NOR \p{Is_Myanmar}) (160)
          \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
          \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA},
                                    \p{InMyanmarExtA}) (32)
          \p{Block: NB}           \p{Block=No_Block} (860_672)
          \p{Block: New_Tai_Lue}  (Single: \p{InNewTaiLue}; NOT
                                    \p{New_Tai_Lue} NOR \p{Is_New_Tai_Lue})
                                    (96)
          \p{Block: NKo}          (Single: \p{InNKo}; NOT \p{Nko} NOR
                                    \p{Is_NKo}) (64)
          \p{Block: No_Block}     (Short: \p{Blk=NB}, \p{InNB}) (860_672)
          \p{Block: Number_Forms} (Single: \p{InNumberForms}) (64)
          \p{Block: OCR}          \p{Block=Optical_Character_Recognition}
                                    (32)
          \p{Block: Ogham}        (Single: \p{InOgham}; NOT \p{Ogham} NOR
                                    \p{Is_Ogham}) (32)
          \p{Block: Ol_Chiki}     (Single: \p{InOlChiki}) (48)
          \p{Block: Old_Italic}   (Single: \p{InOldItalic}; NOT
                                    \p{Old_Italic} NOR \p{Is_Old_Italic})
                                    (48)
          \p{Block: Old_Persian}  (Single: \p{InOldPersian}; NOT
                                    \p{Old_Persian} NOR \p{Is_Old_Persian})
                                    (64)
          \p{Block: Old_South_Arabian} (Single: \p{InOldSouthArabian}) (32)
          \p{Block: Old_Turkic}   (Single: \p{InOldTurkic}; NOT
                                    \p{Old_Turkic} NOR \p{Is_Old_Turkic})
                                    (80)
          \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR},
                                    \p{InOCR}) (32)
          \p{Block: Oriya}        (Single: \p{InOriya}; NOT \p{Oriya} NOR
                                    \p{Is_Oriya}) (128)
          \p{Block: Osmanya}      (Single: \p{InOsmanya}; NOT \p{Osmanya}
                                    NOR \p{Is_Osmanya}) (48)
          \p{Block: Phags_Pa}     (Single: \p{InPhagsPa}; NOT \p{Phags_Pa}
                                    NOR \p{Is_Phags_Pa}) (64)
          \p{Block: Phaistos}     \p{Block=Phaistos_Disc} (48)
          \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}, \p{InPhaistos})
                                    (48)
          \p{Block: Phoenician}   (Single: \p{InPhoenician}; NOT
                                    \p{Phoenician} NOR \p{Is_Phoenician})
                                    (32)
          \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
          \p{Block: Phonetic_Ext_Sup} \p{Block=
                                    Phonetic_Extensions_Supplement} (64)
          \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt},
                                    \p{InPhoneticExt}) (128)
          \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
                                    PhoneticExtSup}, \p{InPhoneticExtSup})
                                    (64)
          \p{Block: Playing_Cards} (Single: \p{InPlayingCards}) (96)
          \p{Block: Private_Use}  \p{Block=Private_Use_Area} (NOT
                                    \p{Private_Use} NOR \p{Is_Private_Use})
                                    (6400)
          \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}, \p{InPUA}; NOT
                                    \p{Private_Use} NOR \p{Is_Private_Use})
                                    (6400)
          \p{Block: PUA}          \p{Block=Private_Use_Area} (NOT
                                    \p{Private_Use} NOR \p{Is_Private_Use})
                                    (6400)
          \p{Block: Punctuation}  \p{Block=General_Punctuation} (NOT
                                    \p{Punct} NOR \p{Is_Punctuation}) (112)
          \p{Block: Rejang}       (Single: \p{InRejang}; NOT \p{Rejang} NOR
                                    \p{Is_Rejang}) (48)
          \p{Block: Rumi}         \p{Block=Rumi_Numeral_Symbols} (32)
          \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}, \p{InRumi})
                                    (32)
          \p{Block: Runic}        (Single: \p{InRunic}; NOT \p{Runic} NOR
                                    \p{Is_Runic}) (96)
          \p{Block: Samaritan}    (Single: \p{InSamaritan}; NOT
                                    \p{Samaritan} NOR \p{Is_Samaritan}) (64)
          \p{Block: Saurashtra}   (Single: \p{InSaurashtra}; NOT
                                    \p{Saurashtra} NOR \p{Is_Saurashtra})
                                    (96)
          \p{Block: Sharada}      (Single: \p{InSharada}; NOT \p{Sharada}
                                    NOR \p{Is_Sharada}) (96)
          \p{Block: Shavian}      (Single: \p{InShavian}) (48)
          \p{Block: Sinhala}      (Single: \p{InSinhala}; NOT \p{Sinhala}
                                    NOR \p{Is_Sinhala}) (128)
          \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms},
                                    \p{InSmallForms}) (32)
          \p{Block: Small_Forms}  \p{Block=Small_Form_Variants} (32)
          \p{Block: Sora_Sompeng} (Single: \p{InSoraSompeng}; NOT
                                    \p{Sora_Sompeng} NOR
                                    \p{Is_Sora_Sompeng}) (48)
          \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
                                    ModifierLetters}, \p{InModifierLetters})
                                    (80)
          \p{Block: Specials}     (Single: \p{InSpecials}) (16)
          \p{Block: Sundanese}    (Single: \p{InSundanese}; NOT
                                    \p{Sundanese} NOR \p{Is_Sundanese}) (64)
          \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
          \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup},
                                    \p{InSundaneseSup}) (16)
          \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
          \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
          \p{Block: Sup_Math_Operators} \p{Block=
                                    Supplemental_Mathematical_Operators}
                                    (256)
969          \p{Block: Sup_PUA_A}    \p{Block=Supplementary_Private_Use_Area_A}
970                                    (65_536)
971          \p{Block: Sup_PUA_B}    \p{Block=Supplementary_Private_Use_Area_B}
972                                    (65_536)
973          \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
974          \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
975          \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
976                                    SuperAndSub}, \p{InSuperAndSub}) (48)
977          \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA},
978                                    \p{InSupArrowsA}) (16)
979          \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB},
980                                    \p{InSupArrowsB}) (128)
981          \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
982                                    SupMathOperators},
983                                    \p{InSupMathOperators}) (256)
984          \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
985                                    SupPunctuation}, \p{InSupPunctuation})
986                                    (128)
987          \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
988                                    SupPUAA}, \p{InSupPUAA}) (65_536)
989          \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
990                                    SupPUAB}, \p{InSupPUAB}) (65_536)
991          \p{Block: Syloti_Nagri} (Single: \p{InSylotiNagri}; NOT
992                                    \p{Syloti_Nagri} NOR
993                                    \p{Is_Syloti_Nagri}) (48)
994          \p{Block: Syriac}       (Single: \p{InSyriac}; NOT \p{Syriac} NOR
995                                    \p{Is_Syriac}) (80)
996          \p{Block: Tagalog}      (Single: \p{InTagalog}; NOT \p{Tagalog}
997                                    NOR \p{Is_Tagalog}) (32)
998          \p{Block: Tagbanwa}     (Single: \p{InTagbanwa}; NOT \p{Tagbanwa}
999                                    NOR \p{Is_Tagbanwa}) (32)
1000          \p{Block: Tags}         (Single: \p{InTags}) (128)
          \p{Block: Tai_Le}       (Single: \p{InTaiLe}; NOT \p{Tai_Le} NOR
                                    \p{Is_Tai_Le}) (48)
          \p{Block: Tai_Tham}     (Single: \p{InTaiTham}; NOT \p{Tai_Tham}
                                    NOR \p{Is_Tai_Tham}) (144)
          \p{Block: Tai_Viet}     (Single: \p{InTaiViet}; NOT \p{Tai_Viet}
                                    NOR \p{Is_Tai_Viet}) (96)
          \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
          \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing},
                                    \p{InTaiXuanJing}) (96)
          \p{Block: Takri}        (Single: \p{InTakri}; NOT \p{Takri} NOR
                                    \p{Is_Takri}) (80)
          \p{Block: Tamil}        (Single: \p{InTamil}; NOT \p{Tamil} NOR
                                    \p{Is_Tamil}) (128)
          \p{Block: Telugu}       (Single: \p{InTelugu}; NOT \p{Telugu} NOR
                                    \p{Is_Telugu}) (128)
          \p{Block: Thaana}       (Single: \p{InThaana}; NOT \p{Thaana} NOR
                                    \p{Is_Thaana}) (64)
          \p{Block: Thai}         (Single: \p{InThai}; NOT \p{Thai} NOR
                                    \p{Is_Thai}) (128)
          \p{Block: Tibetan}      (Single: \p{InTibetan}; NOT \p{Tibetan}
                                    NOR \p{Is_Tibetan}) (256)
          \p{Block: Tifinagh}     (Single: \p{InTifinagh}; NOT \p{Tifinagh}
                                    NOR \p{Is_Tifinagh}) (80)
          \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
                                    (128)
          \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
                                    TransportAndMap}, \p{InTransportAndMap})
                                    (128)
          \p{Block: UCAS}         \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics}
                                    (640)
          \p{Block: UCAS_Ext}     \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended} (80)
          \p{Block: Ugaritic}     (Single: \p{InUgaritic}; NOT \p{Ugaritic}
                                    NOR \p{Is_Ugaritic}) (32)
          \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
                                    UCAS}, \p{InUCAS}) (640)
          \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
                                    \p{Blk=UCASExt}, \p{InUCASExt}) (80)
          \p{Block: Vai}          (Single: \p{InVai}; NOT \p{Vai} NOR
                                    \p{Is_Vai}) (320)
          \p{Block: Variation_Selectors} (Short: \p{Blk=VS}, \p{InVS}; NOT
                                    \p{Variation_Selector} NOR \p{Is_VS})
                                    (16)
          \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup},
                                    \p{InVSSup}) (240)
          \p{Block: Vedic_Ext}    \p{Block=Vedic_Extensions} (48)
          \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt},
                                    \p{InVedicExt}) (48)
          \p{Block: Vertical_Forms} (Single: \p{InVerticalForms}) (16)
          \p{Block: VS}           \p{Block=Variation_Selectors} (NOT
                                    \p{Variation_Selector} NOR \p{Is_VS})
                                    (16)
          \p{Block: VS_Sup}       \p{Block=Variation_Selectors_Supplement}
                                    (240)
          \p{Block: Yi_Radicals}  (Single: \p{InYiRadicals}) (64)
          \p{Block: Yi_Syllables} (Single: \p{InYiSyllables}) (1168)
          \p{Block: Yijing}       \p{Block=Yijing_Hexagram_Symbols} (64)
          \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing},
                                    \p{InYijing}) (64)
        X \p{Block_Elements}      \p{Block=Block_Elements} (32)
          \p{Bopo}                \p{Bopomofo} (= \p{Script=Bopomofo}) (NOT
                                    \p{Block=Bopomofo}) (70)
          \p{Bopomofo}            \p{Script=Bopomofo} (Short: \p{Bopo}; NOT
                                    \p{Block=Bopomofo}) (70)
        X \p{Bopomofo_Ext}        \p{Bopomofo_Extended} (= \p{Block=
                                    Bopomofo_Extended}) (32)
        X \p{Bopomofo_Extended}   \p{Block=Bopomofo_Extended} (Short:
                                    \p{InBopomofoExt}) (32)
        X \p{Box_Drawing}         \p{Block=Box_Drawing} (128)
          \p{Brah}                \p{Brahmi} (= \p{Script=Brahmi}) (NOT
                                    \p{Block=Brahmi}) (108)
          \p{Brahmi}              \p{Script=Brahmi} (Short: \p{Brah}; NOT
                                    \p{Block=Brahmi}) (108)
          \p{Brai}                \p{Braille} (= \p{Script=Braille}) (256)
          \p{Braille}             \p{Script=Braille} (Short: \p{Brai}) (256)
        X \p{Braille_Patterns}    \p{Block=Braille_Patterns} (Short:
                                    \p{InBraille}) (256)
          \p{Bugi}                \p{Buginese} (= \p{Script=Buginese}) (NOT
                                    \p{Block=Buginese}) (30)
          \p{Buginese}            \p{Script=Buginese} (Short: \p{Bugi}; NOT
                                    \p{Block=Buginese}) (30)
          \p{Buhd}                \p{Buhid} (= \p{Script=Buhid}) (NOT
                                    \p{Block=Buhid}) (20)
          \p{Buhid}               \p{Script=Buhid} (Short: \p{Buhd}; NOT
                                    \p{Block=Buhid}) (20)
        X \p{Byzantine_Music}     \p{Byzantine_Musical_Symbols} (= \p{Block=
                                    Byzantine_Musical_Symbols}) (256)
        X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
                                    (Short: \p{InByzantineMusic}) (256)
          \p{C}                   \p{Other} (= \p{General_Category=Other})
                                    (1_004_135)
          \p{Cakm}                \p{Chakma} (= \p{Script=Chakma}) (NOT
                                    \p{Block=Chakma}) (67)
          \p{Canadian_Aboriginal} \p{Script=Canadian_Aboriginal} (Short:
                                    \p{Cans}) (710)
        X \p{Canadian_Syllabics}  \p{Unified_Canadian_Aboriginal_Syllabics}
                                    (= \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics})
                                    (640)
        T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
                                    Not_Reordered} (1_113_459)
        T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
                                    Overlay} (26)
        T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
                                    Nukta} (13)
        T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
                                    Kana_Voicing} (2)
        T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
                                    Virama} (37)
        T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
                                    CCC10} (1)
        T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
                                    CCC11} (1)
        T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
                                    CCC12} (1)
        T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
                                    CCC13} (1)
        T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
                                    CCC14} (1)
        T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
                                    CCC15} (1)
        T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
                                    CCC16} (1)
        T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
                                    CCC17} (1)
        T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
                                    CCC18} (2)
        T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
                                    CCC19} (2)
        T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
                                    CCC20} (1)
        T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
                                    CCC21} (1)
        T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
                                    CCC22} (1)
        T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
                                    CCC23} (1)
        T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
                                    CCC24} (1)
        T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
                                    CCC25} (1)
        T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
                                    CCC26} (1)
        T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
                                    CCC27} (2)
        T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
                                    CCC28} (2)
        T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
                                    CCC29} (2)
        T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
                                    CCC30} (2)
        T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
                                    CCC31} (2)
        T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
                                    CCC32} (2)
        T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
                                    CCC33} (1)
        T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
                                    CCC34} (1)
        T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
                                    CCC35} (1)
        T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
                                    CCC36} (1)
        T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
                                    CCC84} (1)
        T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
                                    CCC91} (1)
        T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
                                    CCC103} (2)
        T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
                                    CCC107} (4)
        T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
                                    CCC118} (2)
        T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
                                    CCC122} (4)
        T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
                                    CCC129} (1)
        T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
                                    CCC130} (6)
        T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
                                    CCC132} (1)
        T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
                                    Attached_Below_Left} (0)
        T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
                                    Attached_Below} (5)
        T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
                                    Attached_Above} (1)
        T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
                                    Attached_Above_Right} (9)
        T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
                                    Below_Left} (1)
        T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
                                    Below} (129)
        T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
                                    Below_Right} (4)
        T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
                                    Left} (2)
        T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
                                    Right} (1)
        T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
                                    Above_Left} (3)
        T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
                                    Above} (349)
        T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
                                    Above_Right} (4)
        T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
                                    Double_Below} (4)
        T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
                                    Double_Above} (5)
        T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
                                    Iota_Subscript} (1)
          \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
                                    Above} (349)
          \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (349)
          \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
          \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
          \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
                                    Above_Left} (3)
          \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
                                    Above_Right} (4)
          \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
                                    Attached_Above} (1)
          \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
                                    Attached_Above_Right} (9)
          \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
                                    Attached_Below} (5)
          \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
                                    Attached_Below_Left} (0)
          \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
                                    (1)
          \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
                                    \p{Ccc=ATAR}) (9)
          \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
                                    (5)
          \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
                                    ATBL}) (0)
          \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
                                    Below} (129)
          \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (129)
          \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
          \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
          \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
                                    Below_Left} (1)
          \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
                                    Below_Right} (4)
          \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1)
          \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2)
          \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4)
          \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1)
          \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2)
          \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1)
          \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4)
          \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1)
          \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1)
          \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6)
          \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1)
          \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1)
          \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1)
          \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1)
          \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1)
          \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2)
          \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2)
          \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1)
          \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1)
          \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1)
          \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1)
          \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1)
          \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1)
          \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1)
          \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2)
          \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2)
          \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2)
          \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2)
          \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2)
          \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2)
          \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1)
          \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1)
          \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1)
          \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1)
          \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1)
          \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1)
          \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
                                    Double_Above} (5)
          \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
                                    Double_Below} (4)
          \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
          \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (4)
          \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
                                    (1)
          \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
                                    Iota_Subscript} (1)
          \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
          \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
                                    Kana_Voicing} (2)
          \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
                                    Left} (2)
          \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
          \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
                                    Nukta} (13)
          \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
                                    (1_113_459)
          \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
                                    Not_Reordered} (1_113_459)
          \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (13)
          \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
                                    Overlay} (26)
          \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (26)
          \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
                                    Right} (1)
          \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
          \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (37)
          \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
                                    Virama} (37)
          \p{Cans}                \p{Canadian_Aboriginal} (= \p{Script=
                                    Canadian_Aboriginal}) (710)
          \p{Cari}                \p{Carian} (= \p{Script=Carian}) (NOT
                                    \p{Block=Carian}) (49)
          \p{Carian}              \p{Script=Carian} (Short: \p{Cari}; NOT
                                    \p{Block=Carian}) (49)
          \p{Case_Ignorable}      \p{Case_Ignorable=Y} (Short: \p{CI}) (1799)
          \p{Case_Ignorable: N*}  (Short: \p{CI=N}, \P{CI}) (1_112_313)
          \p{Case_Ignorable: Y*}  (Short: \p{CI=Y}, \p{CI}) (1799)
          \p{Cased}               \p{Cased=Y} (3448)
          \p{Cased: N*}           (Single: \P{Cased}) (1_110_664)
          \p{Cased: Y*}           (Single: \p{Cased}) (3448)
          \p{Cased_Letter}        \p{General_Category=Cased_Letter} (Short:
                                    \p{LC}) (3223)
          \p{Category: *}         \p{General_Category: *}
          \p{Cc}                  \p{Cntrl} (= \p{General_Category=Control})
                                    (65)
          \p{Ccc: *}              \p{Canonical_Combining_Class: *}
          \p{CE}                  \p{Composition_Exclusion} (=
                                    \p{Composition_Exclusion=Y}) (81)
          \p{CE: *}               \p{Composition_Exclusion: *}
          \p{Cf}                  \p{Format} (= \p{General_Category=Format})
                                    (139)
          \p{Chakma}              \p{Script=Chakma} (Short: \p{Cakm}; NOT
                                    \p{Block=Chakma}) (67)
          \p{Cham}                \p{Script=Cham} (NOT \p{Block=Cham}) (83)
          \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
                                    \p{CWCF}) (1107)
          \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
                                    (1_113_005)
          \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
                                    (1107)
          \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
                                    \p{CWCM}) (2138)
          \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
                                    (1_111_974)
          \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
                                    (2138)
          \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
                                    \p{CWL}) (1043)
          \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
                                    (1_113_069)
          \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1043)
          \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
                                    Y} (Short: \p{CWKCF}) (9944)
          \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
                                    \P{CWKCF}) (1_104_168)
          \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
                                    \p{CWKCF}) (9944)
          \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
                                    \p{CWT}) (1099)
          \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
                                    (1_113_013)
          \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1099)
          \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
                                    \p{CWU}) (1126)
          \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
                                    (1_112_986)
          \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1126)
          \p{Cher}                \p{Cherokee} (= \p{Script=Cherokee}) (NOT
                                    \p{Block=Cherokee}) (85)
          \p{Cherokee}            \p{Script=Cherokee} (Short: \p{Cher}; NOT
                                    \p{Block=Cherokee}) (85)
          \p{CI}                  \p{Case_Ignorable} (= \p{Case_Ignorable=
                                    Y}) (1799)
          \p{CI: *}               \p{Case_Ignorable: *}
        X \p{CJK}                 \p{CJK_Unified_Ideographs} (= \p{Block=
                                    CJK_Unified_Ideographs}) (20_992)
        X \p{CJK_Compat}          \p{CJK_Compatibility} (= \p{Block=
                                    CJK_Compatibility}) (256)
        X \p{CJK_Compat_Forms}    \p{CJK_Compatibility_Forms} (= \p{Block=
                                    CJK_Compatibility_Forms}) (32)
        X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
                                    \p{Block=CJK_Compatibility_Ideographs})
                                    (512)
        X \p{CJK_Compat_Ideographs_Sup}
                                    \p{CJK_Compatibility_Ideographs_-
                                    Supplement} (= \p{Block=
                                    CJK_Compatibility_Ideographs_-
                                    Supplement}) (544)
        X \p{CJK_Compatibility}   \p{Block=CJK_Compatibility} (Short:
                                    \p{InCJKCompat}) (256)
        X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
                                    (Short: \p{InCJKCompatForms}) (32)
        X \p{CJK_Compatibility_Ideographs} \p{Block=
                                    CJK_Compatibility_Ideographs} (Short:
                                    \p{InCJKCompatIdeographs}) (512)
        X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
                                    CJK_Compatibility_Ideographs_Supplement}
                                    (Short: \p{InCJKCompatIdeographsSup})
                                    (544)
        X \p{CJK_Ext_A}           \p{CJK_Unified_Ideographs_Extension_A} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_A})
                                    (6592)
        X \p{CJK_Ext_B}           \p{CJK_Unified_Ideographs_Extension_B} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_B})
                                    (42_720)
        X \p{CJK_Ext_C}           \p{CJK_Unified_Ideographs_Extension_C} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_C})
                                    (4160)
        X \p{CJK_Ext_D}           \p{CJK_Unified_Ideographs_Extension_D} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_D})
                                    (224)
        X \p{CJK_Radicals_Sup}    \p{CJK_Radicals_Supplement} (= \p{Block=
                                    CJK_Radicals_Supplement}) (128)
        X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
                                    (Short: \p{InCJKRadicalsSup}) (128)
        X \p{CJK_Strokes}         \p{Block=CJK_Strokes} (48)
        X \p{CJK_Symbols}         \p{CJK_Symbols_And_Punctuation} (=
                                    \p{Block=CJK_Symbols_And_Punctuation})
                                    (64)
        X \p{CJK_Symbols_And_Punctuation} \p{Block=
                                    CJK_Symbols_And_Punctuation} (Short:
                                    \p{InCJKSymbols}) (64)
        X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
                                    (Short: \p{InCJK}) (20_992)
        X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
                                    CJK_Unified_Ideographs_Extension_A}
                                    (Short: \p{InCJKExtA}) (6592)
        X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
                                    CJK_Unified_Ideographs_Extension_B}
                                    (Short: \p{InCJKExtB}) (42_720)
        X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
                                    CJK_Unified_Ideographs_Extension_C}
                                    (Short: \p{InCJKExtC}) (4160)
        X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
                                    CJK_Unified_Ideographs_Extension_D}
                                    (Short: \p{InCJKExtD}) (224)
1448          \p{Close_Punctuation}   \p{General_Category=Close_Punctuation}
1449                                    (Short: \p{Pe}) (71)
1450          \p{Cn}                  \p{Unassigned} (= \p{General_Category=
1451                                    Unassigned}) (864_415)
1452          \p{Cntrl}               \p{General_Category=Control} Control
1453                                    characters (Short: \p{Cc}) (65)
1454          \p{Co}                  \p{Private_Use} (= \p{General_Category=
1455                                    Private_Use}) (NOT \p{Private_Use_Area})
1456                                    (137_468)
1457        X \p{Combining_Diacritical_Marks} \p{Block=
1458                                    Combining_Diacritical_Marks} (Short:
1459                                    \p{InDiacriticals}) (112)
1460        X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
1461                                    Combining_Diacritical_Marks_For_Symbols}
1462                                    (Short: \p{InDiacriticalsForSymbols})
1463                                    (48)
1464        X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
1465                                    Combining_Diacritical_Marks_Supplement}
1466                                    (Short: \p{InDiacriticalsSup}) (64)
1467        X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
1468                                    \p{InHalfMarks}) (16)
1469          \p{Combining_Mark}      \p{Mark} (= \p{General_Category=Mark})
1470                                    (1645)
        X \p{Combining_Marks_For_Symbols}
                                    \p{Combining_Diacritical_Marks_For_Symbols}
                                    (= \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols})
                                    (48)
          \p{Common}              \p{Script=Common} (Short: \p{Zyyy}) (6412)
        X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
                                    (Short: \p{InIndicNumberForms}) (16)
          \p{Comp_Ex}             \p{Full_Composition_Exclusion} (=
                                    \p{Full_Composition_Exclusion=Y}) (1120)
          \p{Comp_Ex: *}          \p{Full_Composition_Exclusion: *}
        X \p{Compat_Jamo}         \p{Hangul_Compatibility_Jamo} (= \p{Block=
                                    Hangul_Compatibility_Jamo}) (96)
          \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
                                    \p{CE}) (81)
          \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031)
          \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
          \p{Connector_Punctuation} \p{General_Category=
                                    Connector_Punctuation} (Short: \p{Pc})
                                    (10)
          \p{Control}             \p{Cntrl} (= \p{General_Category=Control})
                                    (65)
        X \p{Control_Pictures}    \p{Block=Control_Pictures} (64)
          \p{Copt}                \p{Coptic} (= \p{Script=Coptic}) (NOT
                                    \p{Block=Coptic}) (137)
          \p{Coptic}              \p{Script=Coptic} (Short: \p{Copt}; NOT
                                    \p{Block=Coptic}) (137)
        X \p{Counting_Rod}        \p{Counting_Rod_Numerals} (= \p{Block=
                                    Counting_Rod_Numerals}) (32)
        X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
                                    \p{InCountingRod}) (32)
          \p{Cprt}                \p{Cypriot} (= \p{Script=Cypriot}) (55)
          \p{Cs}                  \p{Surrogate} (= \p{General_Category=
                                    Surrogate}) (2048)
          \p{Cuneiform}           \p{Script=Cuneiform} (Short: \p{Xsux}; NOT
                                    \p{Block=Cuneiform}) (982)
        X \p{Cuneiform_Numbers}   \p{Cuneiform_Numbers_And_Punctuation} (=
                                    \p{Block=
                                    Cuneiform_Numbers_And_Punctuation}) (128)
        X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
                                    Cuneiform_Numbers_And_Punctuation}
                                    (Short: \p{InCuneiformNumbers}) (128)
          \p{Currency_Symbol}     \p{General_Category=Currency_Symbol}
                                    (Short: \p{Sc}) (48)
        X \p{Currency_Symbols}    \p{Block=Currency_Symbols} (48)
          \p{CWCF}                \p{Changes_When_Casefolded} (=
                                    \p{Changes_When_Casefolded=Y}) (1107)
          \p{CWCF: *}             \p{Changes_When_Casefolded: *}
          \p{CWCM}                \p{Changes_When_Casemapped} (=
                                    \p{Changes_When_Casemapped=Y}) (2138)
          \p{CWCM: *}             \p{Changes_When_Casemapped: *}
          \p{CWKCF}               \p{Changes_When_NFKC_Casefolded} (=
                                    \p{Changes_When_NFKC_Casefolded=Y})
                                    (9944)
          \p{CWKCF: *}            \p{Changes_When_NFKC_Casefolded: *}
          \p{CWL}                 \p{Changes_When_Lowercased} (=
                                    \p{Changes_When_Lowercased=Y}) (1043)
          \p{CWL: *}              \p{Changes_When_Lowercased: *}
          \p{CWT}                 \p{Changes_When_Titlecased} (=
                                    \p{Changes_When_Titlecased=Y}) (1099)
          \p{CWT: *}              \p{Changes_When_Titlecased: *}
          \p{CWU}                 \p{Changes_When_Uppercased} (=
                                    \p{Changes_When_Uppercased=Y}) (1126)
          \p{CWU: *}              \p{Changes_When_Uppercased: *}
          \p{Cypriot}             \p{Script=Cypriot} (Short: \p{Cprt}) (55)
        X \p{Cypriot_Syllabary}   \p{Block=Cypriot_Syllabary} (64)
          \p{Cyrillic}            \p{Script=Cyrillic} (Short: \p{Cyrl}; NOT
                                    \p{Block=Cyrillic}) (417)
        X \p{Cyrillic_Ext_A}      \p{Cyrillic_Extended_A} (= \p{Block=
                                    Cyrillic_Extended_A}) (32)
        X \p{Cyrillic_Ext_B}      \p{Cyrillic_Extended_B} (= \p{Block=
                                    Cyrillic_Extended_B}) (96)
        X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
                                    \p{InCyrillicExtA}) (32)
        X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
                                    \p{InCyrillicExtB}) (96)
        X \p{Cyrillic_Sup}        \p{Cyrillic_Supplement} (= \p{Block=
                                    Cyrillic_Supplement}) (48)
        X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
                                    \p{InCyrillicSup}) (48)
        X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
                                    Cyrillic_Supplement}) (48)
          \p{Cyrl}                \p{Cyrillic} (= \p{Script=Cyrillic}) (NOT
                                    \p{Block=Cyrillic}) (417)
          \p{Dash}                \p{Dash=Y} (27)
          \p{Dash: N*}            (Single: \P{Dash}) (1_114_085)
          \p{Dash: Y*}            (Single: \p{Dash}) (27)
          \p{Dash_Punctuation}    \p{General_Category=Dash_Punctuation}
                                    (Short: \p{Pd}) (23)
          \p{Decimal_Number}      \p{Digit} (= \p{General_Category=
                                    Decimal_Number}) (460)
          \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
                                    (13_225)
          \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_225)
          \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240)
          \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
          \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
          \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
          \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
          \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
          \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184)
          \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
          \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
          \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
          \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
          \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
          \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
          \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
          \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
          \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
          \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
          \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
          \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
          \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
                                    Non_Canonical} (Perl extension) (3655)
          \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
                                    decompositions (Short: \p{Dt=NonCanon})
                                    (Perl extension) (3655)
          \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_232)
          \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
          \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
          \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (284)
          \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (284)
          \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38)
          \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (146)
          \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (146)
          \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
          \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
          \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
          \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
                                    Y} (Short: \p{DI}) (4167)
          \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
                                    (1_109_945)
          \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
                                    (4167)
          \p{Dep}                 \p{Deprecated} (= \p{Deprecated=Y}) (111)
          \p{Dep: *}              \p{Deprecated: *}
          \p{Deprecated}          \p{Deprecated=Y} (Short: \p{Dep}) (111)
          \p{Deprecated: N*}      (Short: \p{Dep=N}, \P{Dep}) (1_114_001)
          \p{Deprecated: Y*}      (Short: \p{Dep=Y}, \p{Dep}) (111)
          \p{Deseret}             \p{Script=Deseret} (Short: \p{Dsrt}) (80)
          \p{Deva}                \p{Devanagari} (= \p{Script=Devanagari})
                                    (NOT \p{Block=Devanagari}) (151)
          \p{Devanagari}          \p{Script=Devanagari} (Short: \p{Deva};
                                    NOT \p{Block=Devanagari}) (151)
        X \p{Devanagari_Ext}      \p{Devanagari_Extended} (= \p{Block=
                                    Devanagari_Extended}) (32)
        X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
                                    \p{InDevanagariExt}) (32)
          \p{DI}                  \p{Default_Ignorable_Code_Point} (=
                                    \p{Default_Ignorable_Code_Point=Y})
                                    (4167)
          \p{DI: *}               \p{Default_Ignorable_Code_Point: *}
          \p{Dia}                 \p{Diacritic} (= \p{Diacritic=Y}) (693)
          \p{Dia: *}              \p{Diacritic: *}
          \p{Diacritic}           \p{Diacritic=Y} (Short: \p{Dia}) (693)
          \p{Diacritic: N*}       (Short: \p{Dia=N}, \P{Dia}) (1_113_419)
          \p{Diacritic: Y*}       (Short: \p{Dia=Y}, \p{Dia}) (693)
        X \p{Diacriticals}        \p{Combining_Diacritical_Marks} (=
                                    \p{Block=Combining_Diacritical_Marks})
                                    (112)
        X \p{Diacriticals_For_Symbols}
                                    \p{Combining_Diacritical_Marks_For_Symbols}
                                    (= \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols})
                                    (48)
        X \p{Diacriticals_Sup}    \p{Combining_Diacritical_Marks_Supplement}
                                    (= \p{Block=
                                    Combining_Diacritical_Marks_Supplement})
                                    (64)
          \p{Digit}               \p{General_Category=Decimal_Number} [0-9]
                                    + all other decimal digits (Short:
                                    \p{Nd}) (460)
        X \p{Dingbats}            \p{Block=Dingbats} (192)
        X \p{Domino}              \p{Domino_Tiles} (= \p{Block=
                                    Domino_Tiles}) (112)
        X \p{Domino_Tiles}        \p{Block=Domino_Tiles} (Short:
                                    \p{InDomino}) (112)
          \p{Dsrt}                \p{Deseret} (= \p{Script=Deseret}) (80)
          \p{Dt: *}               \p{Decomposition_Type: *}
          \p{Ea: *}               \p{East_Asian_Width: *}
          \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_746)
          \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_746)
          \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
          \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
          \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
          \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
          \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (801_811)
          \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
          \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
          \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (801_811)
          \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (173_217)
          \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (173_217)
          \p{Egyp}                \p{Egyptian_Hieroglyphs} (= \p{Script=
                                    Egyptian_Hieroglyphs}) (NOT \p{Block=
                                    Egyptian_Hieroglyphs}) (1071)
          \p{Egyptian_Hieroglyphs} \p{Script=Egyptian_Hieroglyphs} (Short:
                                    \p{Egyp}; NOT \p{Block=
                                    Egyptian_Hieroglyphs}) (1071)
        X \p{Emoticons}           \p{Block=Emoticons} (80)
        X \p{Enclosed_Alphanum}   \p{Enclosed_Alphanumerics} (= \p{Block=
                                    Enclosed_Alphanumerics}) (160)
        X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
                                    \p{Block=
                                    Enclosed_Alphanumeric_Supplement}) (256)
        X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
                                    Enclosed_Alphanumeric_Supplement}
                                    (Short: \p{InEnclosedAlphanumSup}) (256)
        X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
                                    (Short: \p{InEnclosedAlphanum}) (160)
        X \p{Enclosed_CJK}        \p{Enclosed_CJK_Letters_And_Months} (=
                                    \p{Block=
                                    Enclosed_CJK_Letters_And_Months}) (256)
        X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
                                    Enclosed_CJK_Letters_And_Months} (Short:
                                    \p{InEnclosedCJK}) (256)
        X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
                                    (= \p{Block=
                                    Enclosed_Ideographic_Supplement}) (256)
        X \p{Enclosed_Ideographic_Supplement} \p{Block=
                                    Enclosed_Ideographic_Supplement} (Short:
                                    \p{InEnclosedIdeographicSup}) (256)
          \p{Enclosing_Mark}      \p{General_Category=Enclosing_Mark}
                                    (Short: \p{Me}) (12)
          \p{Ethi}                \p{Ethiopic} (= \p{Script=Ethiopic}) (NOT
                                    \p{Block=Ethiopic}) (495)
          \p{Ethiopic}            \p{Script=Ethiopic} (Short: \p{Ethi}; NOT
                                    \p{Block=Ethiopic}) (495)
        X \p{Ethiopic_Ext}        \p{Ethiopic_Extended} (= \p{Block=
                                    Ethiopic_Extended}) (96)
        X \p{Ethiopic_Ext_A}      \p{Ethiopic_Extended_A} (= \p{Block=
                                    Ethiopic_Extended_A}) (48)
        X \p{Ethiopic_Extended}   \p{Block=Ethiopic_Extended} (Short:
                                    \p{InEthiopicExt}) (96)
        X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
                                    \p{InEthiopicExtA}) (48)
        X \p{Ethiopic_Sup}        \p{Ethiopic_Supplement} (= \p{Block=
                                    Ethiopic_Supplement}) (32)
        X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
                                    \p{InEthiopicSup}) (32)
          \p{Ext}                 \p{Extender} (= \p{Extender=Y}) (31)
          \p{Ext: *}              \p{Extender: *}
          \p{Extender}            \p{Extender=Y} (Short: \p{Ext}) (31)
          \p{Extender: N*}        (Short: \p{Ext=N}, \P{Ext}) (1_114_081)
          \p{Extender: Y*}        (Short: \p{Ext=Y}, \p{Ext}) (31)
          \p{Final_Punctuation}   \p{General_Category=Final_Punctuation}
                                    (Short: \p{Pf}) (10)
          \p{Format}              \p{General_Category=Format} (Short:
                                    \p{Cf}) (139)
          \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
                                    (Short: \p{CompEx}) (1120)
          \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
                                    \P{CompEx}) (1_112_992)
          \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
                                    \p{CompEx}) (1120)
          \p{Gc: *}               \p{General_Category: *}
          \p{GCB: *}              \p{Grapheme_Cluster_Break: *}
          \p{General_Category: C} \p{General_Category=Other} (1_004_135)
          \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
                                    \p{Gc=LC}, \p{LC}) (3223)
          \p{General_Category: Cc} \p{General_Category=Control} (65)
          \p{General_Category: Cf} \p{General_Category=Format} (139)
          \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
                                    (71)
          \p{General_Category: Cn} \p{General_Category=Unassigned} (864_415)
          \p{General_Category: Cntrl} \p{General_Category=Control} (65)
          \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
          \p{General_Category: Combining_Mark} \p{General_Category=Mark}
                                    (1645)
          \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
                                    \p{Pc}) (10)
          \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
          \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
          \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
                                    (48)
          \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
                                    (23)
          \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
                                    (460)
          \p{General_Category: Digit} \p{General_Category=Decimal_Number}
                                    (460)
          \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
                                    (12)
          \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
                                    (10)
          \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (139)
          \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
                                    \p{Pi}) (12)
          \p{General_Category: L} \p{General_Category=Letter} (101_013)
        X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3223)
        X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
                                    the trailing '_' matters in spite of
                                    loose matching rules. (3223)
          \p{General_Category: LC} \p{General_Category=Cased_Letter} (3223)
          \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (101_013)
          \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
                                    (224)
          \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
          \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
                                    (/i= General_Category=Cased_Letter)
                                    (1751)
          \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (237)
          \p{General_Category: Lo} \p{General_Category=Other_Letter} (97_553)
          \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
                                    /i= General_Category=Cased_Letter) (1751)
          \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
                                    (/i= General_Category=Cased_Letter) (31)
          \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
                                    (/i= General_Category=Cased_Letter)
                                    (1441)
          \p{General_Category: M} \p{General_Category=Mark} (1645)
          \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (1645)
          \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (952)
          \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (353)
          \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (12)
          \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
                                    (1280)
          \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
                                    (237)
          \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
                                    (115)
          \p{General_Category: N} \p{General_Category=Number} (1148)
          \p{General_Category: Nd} \p{General_Category=Decimal_Number} (460)
          \p{General_Category: Nl} \p{General_Category=Letter_Number} (224)
          \p{General_Category: No} \p{General_Category=Other_Number} (464)
          \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
                                    (1280)
          \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1148)
          \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
                                    (72)
          \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (1_004_135)
          \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
                                    (97_553)
          \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (464)
          \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
                                    (434)
          \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
                                    (4404)
          \p{General_Category: P} \p{General_Category=Punctuation} (632)
          \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
                                    \p{Zp}) (1)
          \p{General_Category: Pc} \p{General_Category=
                                    Connector_Punctuation} (10)
          \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (23)
          \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
                                    (71)
          \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
                                    (10)
          \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
                                    (12)
          \p{General_Category: Po} \p{General_Category=Other_Punctuation}
                                    (434)
          \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
                                    (137_468)
          \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (72)
          \p{General_Category: Punct} \p{General_Category=Punctuation} (632)
          \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (632)
          \p{General_Category: S} \p{General_Category=Symbol} (5519)
          \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (48)
          \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (20)
          \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (115)
          \p{General_Category: Sm} \p{General_Category=Math_Symbol} (952)
          \p{General_Category: So} \p{General_Category=Other_Symbol} (4404)
          \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
                                    (18)
          \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (353)
          \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048)
          \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (5519)
          \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
                                    /i= General_Category=Cased_Letter) (31)
          \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
                                    (864_415)
          \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
                                    /i= General_Category=Cased_Letter) (1441)
          \p{General_Category: Z} \p{General_Category=Separator} (20)
          \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
          \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
                                    (1)
          \p{General_Category: Zs} \p{General_Category=Space_Separator} (18)
        X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
                                    \p{InPunctuation}) (112)
        X \p{Geometric_Shapes}    \p{Block=Geometric_Shapes} (96)
          \p{Geor}                \p{Georgian} (= \p{Script=Georgian}) (NOT
                                    \p{Block=Georgian}) (127)
          \p{Georgian}            \p{Script=Georgian} (Short: \p{Geor}; NOT
                                    \p{Block=Georgian}) (127)
        X \p{Georgian_Sup}        \p{Georgian_Supplement} (= \p{Block=
                                    Georgian_Supplement}) (48)
        X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
                                    \p{InGeorgianSup}) (48)
          \p{Glag}                \p{Glagolitic} (= \p{Script=Glagolitic})
                                    (NOT \p{Block=Glagolitic}) (94)
          \p{Glagolitic}          \p{Script=Glagolitic} (Short: \p{Glag};
                                    NOT \p{Block=Glagolitic}) (94)
          \p{Goth}                \p{Gothic} (= \p{Script=Gothic}) (NOT
                                    \p{Block=Gothic}) (27)
          \p{Gothic}              \p{Script=Gothic} (Short: \p{Goth}; NOT
                                    \p{Block=Gothic}) (27)
          \p{Gr_Base}             \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
                                    (108_660)
          \p{Gr_Base: *}          \p{Grapheme_Base: *}
          \p{Gr_Ext}              \p{Grapheme_Extend} (= \p{Grapheme_Extend=
                                    Y}) (1317)
          \p{Gr_Ext: *}           \p{Grapheme_Extend: *}
          \p{Graph}               Characters that are graphical (247_564)
          \p{Grapheme_Base}       \p{Grapheme_Base=Y} (Short: \p{GrBase})
                                    (108_660)
          \p{Grapheme_Base: N*}   (Short: \p{GrBase=N}, \P{GrBase})
                                    (1_005_452)
          \p{Grapheme_Base: Y*}   (Short: \p{GrBase=Y}, \p{GrBase}) (108_660)
          \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
                                    (6023)
          \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (6023)
          \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
          \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
                                    (1317)
          \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1317)
          \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
          \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
          \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
          \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
          \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_950)
          \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
                                    (0)
          \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (0)
          \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
                                    SpacingMark} (291)
          \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (291)
          \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
          \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
          \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
                                    (1_094_950)
          \p{Grapheme_Extend}     \p{Grapheme_Extend=Y} (Short: \p{GrExt})
                                    (1317)
          \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_795)
          \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1317)
          \p{Greek}               \p{Script=Greek} (Short: \p{Grek}; NOT
                                    \p{Greek_And_Coptic}) (511)
        X \p{Greek_And_Coptic}    \p{Block=Greek_And_Coptic} (Short:
                                    \p{InGreek}) (144)
        X \p{Greek_Ext}           \p{Greek_Extended} (= \p{Block=
                                    Greek_Extended}) (256)
        X \p{Greek_Extended}      \p{Block=Greek_Extended} (Short:
                                    \p{InGreekExt}) (256)
          \p{Grek}                \p{Greek} (= \p{Script=Greek}) (NOT
                                    \p{Greek_And_Coptic}) (511)
          \p{Gujarati}            \p{Script=Gujarati} (Short: \p{Gujr}; NOT
                                    \p{Block=Gujarati}) (84)
          \p{Gujr}                \p{Gujarati} (= \p{Script=Gujarati}) (NOT
                                    \p{Block=Gujarati}) (84)
          \p{Gurmukhi}            \p{Script=Gurmukhi} (Short: \p{Guru}; NOT
                                    \p{Block=Gurmukhi}) (79)
          \p{Guru}                \p{Gurmukhi} (= \p{Script=Gurmukhi}) (NOT
                                    \p{Block=Gurmukhi}) (79)
        X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
                                    \p{Block=Halfwidth_And_Fullwidth_Forms})
                                    (240)
        X \p{Half_Marks}          \p{Combining_Half_Marks} (= \p{Block=
                                    Combining_Half_Marks}) (16)
        X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
                                    Halfwidth_And_Fullwidth_Forms} (Short:
                                    \p{InHalfAndFullForms}) (240)
          \p{Han}                 \p{Script=Han} (75_963)
          \p{Hang}                \p{Hangul} (= \p{Script=Hangul}) (NOT
                                    \p{Hangul_Syllables}) (11_739)
          \p{Hangul}              \p{Script=Hangul} (Short: \p{Hang}; NOT
                                    \p{Hangul_Syllables}) (11_739)
        X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
                                    (Short: \p{InCompatJamo}) (96)
1940        X \p{Hangul_Jamo}         \p{Block=Hangul_Jamo} (Short: \p{InJamo})
1941                                    (256)
1942        X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
1943                                    (Short: \p{InJamoExtA}) (32)
1944        X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
1945                                    (Short: \p{InJamoExtB}) (80)
1946          \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
1947                                    (125)
1948          \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125)
1949          \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
1950                                    (399)
1951          \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399)
1952          \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
1953                                    LVT_Syllable} (10_773)
1954          \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
1955                                    (10_773)
1956          \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
1957                                    Not_Applicable} (1_102_583)
1958          \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
1959                                    (1_102_583)
1960          \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
1961                                    (137)
1962          \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137)
1963          \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
1964                                    (95)
1965          \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95)
1966        X \p{Hangul_Syllables}    \p{Block=Hangul_Syllables} (Short:
1967                                    \p{InHangul}) (11_184)
1968          \p{Hani}                \p{Han} (= \p{Script=Han}) (75_963)
1969          \p{Hano}                \p{Hanunoo} (= \p{Script=Hanunoo}) (NOT
1970                                    \p{Block=Hanunoo}) (21)
1971          \p{Hanunoo}             \p{Script=Hanunoo} (Short: \p{Hano}; NOT
1972                                    \p{Block=Hanunoo}) (21)
1973          \p{Hebr}                \p{Hebrew} (= \p{Script=Hebrew}) (NOT
1974                                    \p{Block=Hebrew}) (133)
1975          \p{Hebrew}              \p{Script=Hebrew} (Short: \p{Hebr}; NOT
1976                                    \p{Block=Hebrew}) (133)
1977          \p{Hex}                 \p{XDigit} (= \p{Hex_Digit=Y}) (44)
1978          \p{Hex: *}              \p{Hex_Digit: *}
1979          \p{Hex_Digit}           \p{XDigit} (= \p{Hex_Digit=Y}) (44)
1980          \p{Hex_Digit: N*}       (Short: \p{Hex=N}, \P{Hex}) (1_114_068)
1981          \p{Hex_Digit: Y*}       (Short: \p{Hex=Y}, \p{Hex}) (44)
1982        X \p{High_Private_Use_Surrogates} \p{Block=
1983                                    High_Private_Use_Surrogates} (Short:
1984                                    \p{InHighPUSurrogates}) (128)
1985        X \p{High_PU_Surrogates}  \p{High_Private_Use_Surrogates} (=
1986                                    \p{Block=High_Private_Use_Surrogates})
1987                                    (128)
1988        X \p{High_Surrogates}     \p{Block=High_Surrogates} (896)
1989          \p{Hira}                \p{Hiragana} (= \p{Script=Hiragana}) (NOT
1990                                    \p{Block=Hiragana}) (91)
1991          \p{Hiragana}            \p{Script=Hiragana} (Short: \p{Hira}; NOT
1992                                    \p{Block=Hiragana}) (91)
1993          \p{HorizSpace}          \p{Blank} (19)
1994          \p{Hst: *}              \p{Hangul_Syllable_Type: *}
1995        D \p{Hyphen}              \p{Hyphen=Y} (11)
1996        D \p{Hyphen: N*}          Supplanted by Line_Break property values;
1997                                    see www.unicode.org/reports/tr14
1998                                    (Single: \P{Hyphen}) (1_114_101)
1999        D \p{Hyphen: Y*}          Supplanted by Line_Break property values;
2000                                    see www.unicode.org/reports/tr14
2001                                    (Single: \p{Hyphen}) (11)
2002          \p{ID_Continue}         \p{ID_Continue=Y} (Short: \p{IDC}; NOT
2003                                    \p{Ideographic_Description_Characters})
2004                                    (103_355)
2005          \p{ID_Continue: N*}     (Short: \p{IDC=N}, \P{IDC}) (1_010_757)
2006          \p{ID_Continue: Y*}     (Short: \p{IDC=Y}, \p{IDC}) (103_355)
2007          \p{ID_Start}            \p{ID_Start=Y} (Short: \p{IDS}) (101_240)
2008          \p{ID_Start: N*}        (Short: \p{IDS=N}, \P{IDS}) (1_012_872)
2009          \p{ID_Start: Y*}        (Short: \p{IDS=Y}, \p{IDS}) (101_240)
2010          \p{IDC}                 \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
2011                                    \p{Ideographic_Description_Characters})
2012                                    (103_355)
2013          \p{IDC: *}              \p{ID_Continue: *}
2014          \p{Ideo}                \p{Ideographic} (= \p{Ideographic=Y})
2015                                    (75_633)
2016          \p{Ideo: *}             \p{Ideographic: *}
2017          \p{Ideographic}         \p{Ideographic=Y} (Short: \p{Ideo})
2018                                    (75_633)
2019          \p{Ideographic: N*}     (Short: \p{Ideo=N}, \P{Ideo}) (1_038_479)
2020          \p{Ideographic: Y*}     (Short: \p{Ideo=Y}, \p{Ideo}) (75_633)
2021        X \p{Ideographic_Description_Characters} \p{Block=
2022                                    Ideographic_Description_Characters}
2023                                    (Short: \p{InIDC}) (16)
2024          \p{IDS}                 \p{ID_Start} (= \p{ID_Start=Y}) (101_240)
2025          \p{IDS: *}              \p{ID_Start: *}
2026          \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
2027                                    \p{IDSB}) (10)
2028          \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
2029                                    (1_114_102)
2030          \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10)
2031          \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
2032                                    \p{IDST}) (2)
2033          \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
2034                                    (1_114_110)
2035          \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2)
2036          \p{IDSB}                \p{IDS_Binary_Operator} (=
2037                                    \p{IDS_Binary_Operator=Y}) (10)
2038          \p{IDSB: *}             \p{IDS_Binary_Operator: *}
2039          \p{IDST}                \p{IDS_Trinary_Operator} (=
2040                                    \p{IDS_Trinary_Operator=Y}) (2)
2041          \p{IDST: *}             \p{IDS_Trinary_Operator: *}
2042          \p{Imperial_Aramaic}    \p{Script=Imperial_Aramaic} (Short:
2043                                    \p{Armi}; NOT \p{Block=
2044                                    Imperial_Aramaic}) (31)
2045          \p{In: *}               \p{Present_In: *} (Perl extension)
2046          \p{In_*}                \p{Block: *}
2047        X \p{Indic_Number_Forms}  \p{Common_Indic_Number_Forms} (= \p{Block=
2048                                    Common_Indic_Number_Forms}) (16)
2049          \p{Inherited}           \p{Script=Inherited} (Short: \p{Zinh})
2050                                    (524)
2051          \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
2052                                    (Short: \p{Pi}) (12)
2053          \p{Inscriptional_Pahlavi} \p{Script=Inscriptional_Pahlavi} (Short:
2054                                    \p{Phli}; NOT \p{Block=
2055                                    Inscriptional_Pahlavi}) (27)
2056          \p{Inscriptional_Parthian} \p{Script=Inscriptional_Parthian}
2057                                    (Short: \p{Prti}; NOT \p{Block=
2058                                    Inscriptional_Parthian}) (30)
2059        X \p{IPA_Ext}             \p{IPA_Extensions} (= \p{Block=
2060                                    IPA_Extensions}) (96)
2061        X \p{IPA_Extensions}      \p{Block=IPA_Extensions} (Short:
2062                                    \p{InIPAExt}) (96)
2063          \p{Is_*}                \p{*} (Any exceptions are individually
2064                                    noted beginning with the word NOT.) If
2065                                    an entry has flag(s) at its beginning,
2066                                    like "D", the "Is_" form has the same
2067                                    flag(s).
2068          \p{Ital}                \p{Old_Italic} (= \p{Script=Old_Italic})
2069                                    (NOT \p{Block=Old_Italic}) (35)
2070        X \p{Jamo}                \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
2071                                    (256)
2072        X \p{Jamo_Ext_A}          \p{Hangul_Jamo_Extended_A} (= \p{Block=
2073                                    Hangul_Jamo_Extended_A}) (32)
2074        X \p{Jamo_Ext_B}          \p{Hangul_Jamo_Extended_B} (= \p{Block=
2075                                    Hangul_Jamo_Extended_B}) (80)
2076          \p{Java}                \p{Javanese} (= \p{Script=Javanese}) (NOT
2077                                    \p{Block=Javanese}) (91)
2078          \p{Javanese}            \p{Script=Javanese} (Short: \p{Java}; NOT
2079                                    \p{Block=Javanese}) (91)
2080          \p{Jg: *}               \p{Joining_Group: *}
2081          \p{Join_C}              \p{Join_Control} (= \p{Join_Control=Y}) (2)
2082          \p{Join_C: *}           \p{Join_Control: *}
2083          \p{Join_Control}        \p{Join_Control=Y} (Short: \p{JoinC}) (2)
2084          \p{Join_Control: N*}    (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110)
2085          \p{Join_Control: Y*}    (Short: \p{JoinC=Y}, \p{JoinC}) (2)
2086          \p{Joining_Group: Ain}  (Short: \p{Jg=Ain}) (7)
2087          \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1)
2088          \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10)
2089          \p{Joining_Group: Beh}  (Short: \p{Jg=Beh}) (20)
2090          \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2)
2091          \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
2092                                    BurushaskiYehBarree}) (2)
2093          \p{Joining_Group: Dal}  (Short: \p{Jg=Dal}) (14)
2094          \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4)
2095          \p{Joining_Group: E}    (Short: \p{Jg=E}) (1)
2096          \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7)
2097          \p{Joining_Group: Fe}   (Short: \p{Jg=Fe}) (1)
2098          \p{Joining_Group: Feh}  (Short: \p{Jg=Feh}) (10)
2099          \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1)
2100          \p{Joining_Group: Gaf}  (Short: \p{Jg=Gaf}) (13)
2101          \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3)
2102          \p{Joining_Group: Hah}  (Short: \p{Jg=Hah}) (18)
2103          \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
2104                                    HamzaOnHehGoal}) (1)
2105          \p{Joining_Group: He}   (Short: \p{Jg=He}) (1)
2106          \p{Joining_Group: Heh}  (Short: \p{Jg=Heh}) (1)
2107          \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2)
2108          \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1)
2109          \p{Joining_Group: Kaf}  (Short: \p{Jg=Kaf}) (5)
2110          \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1)
2111          \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1)
2112          \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2)
2113          \p{Joining_Group: Lam}  (Short: \p{Jg=Lam}) (7)
2114          \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1)
2115          \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4)
2116          \p{Joining_Group: Mim}  (Short: \p{Jg=Mim}) (1)
2117          \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
2118                                    (1_113_870)
2119          \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8)
2120          \p{Joining_Group: Nun}  (Short: \p{Jg=Nun}) (1)
2121          \p{Joining_Group: Nya}  (Short: \p{Jg=Nya}) (1)
2122          \p{Joining_Group: Pe}   (Short: \p{Jg=Pe}) (1)
2123          \p{Joining_Group: Qaf}  (Short: \p{Jg=Qaf}) (5)
2124          \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1)
2125          \p{Joining_Group: Reh}  (Short: \p{Jg=Reh}) (17)
2126          \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1)
2127          \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1)
2128          \p{Joining_Group: Sad}  (Short: \p{Jg=Sad}) (5)
2129          \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1)
2130          \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11)
2131          \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1)
2132          \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1)
2133          \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1)
2134          \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1)
2135          \p{Joining_Group: Tah}  (Short: \p{Jg=Tah}) (4)
2136          \p{Joining_Group: Taw}  (Short: \p{Jg=Taw}) (1)
2137          \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3)
2138          \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
2139                                    Hamza_On_Heh_Goal} (1)
2140          \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2)
2141          \p{Joining_Group: Waw}  (Short: \p{Jg=Waw}) (16)
2142          \p{Joining_Group: Yeh}  (Short: \p{Jg=Yeh}) (10)
2143          \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2)
2144          \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1)
2145          \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1)
2146          \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1)
2147          \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1)
2148          \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1)
2149          \p{Joining_Type: C}     \p{Joining_Type=Join_Causing} (3)
2150          \p{Joining_Type: D}     \p{Joining_Type=Dual_Joining} (215)
2151          \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (215)
2152          \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (3)
2153          \p{Joining_Type: L}     \p{Joining_Type=Left_Joining} (0)
2154          \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (0)
2155          \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_112_389)
2156          \p{Joining_Type: R}     \p{Joining_Type=Right_Joining} (82)
2157          \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (82)
2158          \p{Joining_Type: T}     \p{Joining_Type=Transparent} (1423)
2159          \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1423)
2160          \p{Joining_Type: U}     \p{Joining_Type=Non_Joining} (1_112_389)
2161          \p{Jt: *}               \p{Joining_Type: *}
2162          \p{Kaithi}              \p{Script=Kaithi} (Short: \p{Kthi}; NOT
2163                                    \p{Block=Kaithi}) (66)
2164          \p{Kali}                \p{Kayah_Li} (= \p{Script=Kayah_Li}) (48)
2165          \p{Kana}                \p{Katakana} (= \p{Script=Katakana}) (NOT
2166                                    \p{Block=Katakana}) (300)
2167        X \p{Kana_Sup}            \p{Kana_Supplement} (= \p{Block=
2168                                    Kana_Supplement}) (256)
2169        X \p{Kana_Supplement}     \p{Block=Kana_Supplement} (Short:
2170                                    \p{InKanaSup}) (256)
2171        X \p{Kanbun}              \p{Block=Kanbun} (16)
2172        X \p{Kangxi}              \p{Kangxi_Radicals} (= \p{Block=
2173                                    Kangxi_Radicals}) (224)
2174        X \p{Kangxi_Radicals}     \p{Block=Kangxi_Radicals} (Short:
2175                                    \p{InKangxi}) (224)
2176          \p{Kannada}             \p{Script=Kannada} (Short: \p{Knda}; NOT
2177                                    \p{Block=Kannada}) (86)
2178          \p{Katakana}            \p{Script=Katakana} (Short: \p{Kana}; NOT
2179                                    \p{Block=Katakana}) (300)
2180        X \p{Katakana_Ext}        \p{Katakana_Phonetic_Extensions} (=
2181                                    \p{Block=Katakana_Phonetic_Extensions})
2182                                    (16)
2183        X \p{Katakana_Phonetic_Extensions} \p{Block=
2184                                    Katakana_Phonetic_Extensions} (Short:
2185                                    \p{InKatakanaExt}) (16)
2186          \p{Kayah_Li}            \p{Script=Kayah_Li} (Short: \p{Kali}) (48)
2187          \p{Khar}                \p{Kharoshthi} (= \p{Script=Kharoshthi})
2188                                    (NOT \p{Block=Kharoshthi}) (65)
2189          \p{Kharoshthi}          \p{Script=Kharoshthi} (Short: \p{Khar};
2190                                    NOT \p{Block=Kharoshthi}) (65)
2191          \p{Khmer}               \p{Script=Khmer} (Short: \p{Khmr}; NOT
2192                                    \p{Block=Khmer}) (146)
2193        X \p{Khmer_Symbols}       \p{Block=Khmer_Symbols} (32)
2194          \p{Khmr}                \p{Khmer} (= \p{Script=Khmer}) (NOT
2195                                    \p{Block=Khmer}) (146)
2196          \p{Knda}                \p{Kannada} (= \p{Script=Kannada}) (NOT
2197                                    \p{Block=Kannada}) (86)
2198          \p{Kthi}                \p{Kaithi} (= \p{Script=Kaithi}) (NOT
2199                                    \p{Block=Kaithi}) (66)
2200          \p{L}                   \p{Letter} (= \p{General_Category=Letter})
2201                                    (101_013)
2202        X \p{L&}                  \p{Cased_Letter} (= \p{General_Category=
2203                                    Cased_Letter}) (3223)
2204        X \p{L_}                  \p{Cased_Letter} (= \p{General_Category=
2205                                    Cased_Letter}) Note the trailing '_'
2206                                    matters in spite of loose matching
2207                                    rules. (3223)
2208          \p{Lana}                \p{Tai_Tham} (= \p{Script=Tai_Tham}) (NOT
2209                                    \p{Block=Tai_Tham}) (127)
2210          \p{Lao}                 \p{Script=Lao} (NOT \p{Block=Lao}) (67)
2211          \p{Laoo}                \p{Lao} (= \p{Script=Lao}) (NOT \p{Block=
2212                                    Lao}) (67)
2213          \p{Latin}               \p{Script=Latin} (Short: \p{Latn}) (1272)
2214        X \p{Latin_1}             \p{Latin_1_Supplement} (= \p{Block=
2215                                    Latin_1_Supplement}) (128)
2216        X \p{Latin_1_Sup}         \p{Latin_1_Supplement} (= \p{Block=
2217                                    Latin_1_Supplement}) (128)
2218        X \p{Latin_1_Supplement}  \p{Block=Latin_1_Supplement} (Short:
2219                                    \p{InLatin1}) (128)
2220        X \p{Latin_Ext_A}         \p{Latin_Extended_A} (= \p{Block=
2221                                    Latin_Extended_A}) (128)
2222        X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
2223                                    \p{Block=Latin_Extended_Additional})
2224                                    (256)
2225        X \p{Latin_Ext_B}         \p{Latin_Extended_B} (= \p{Block=
2226                                    Latin_Extended_B}) (208)
2227        X \p{Latin_Ext_C}         \p{Latin_Extended_C} (= \p{Block=
2228                                    Latin_Extended_C}) (32)
2229        X \p{Latin_Ext_D}         \p{Latin_Extended_D} (= \p{Block=
2230                                    Latin_Extended_D}) (224)
2231        X \p{Latin_Extended_A}    \p{Block=Latin_Extended_A} (Short:
2232                                    \p{InLatinExtA}) (128)
2233        X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
2234                                    (Short: \p{InLatinExtAdditional}) (256)
2235        X \p{Latin_Extended_B}    \p{Block=Latin_Extended_B} (Short:
2236                                    \p{InLatinExtB}) (208)
2237        X \p{Latin_Extended_C}    \p{Block=Latin_Extended_C} (Short:
2238                                    \p{InLatinExtC}) (32)
2239        X \p{Latin_Extended_D}    \p{Block=Latin_Extended_D} (Short:
2240                                    \p{InLatinExtD}) (224)
2241          \p{Latn}                \p{Latin} (= \p{Script=Latin}) (1272)
2242          \p{Lb: *}               \p{Line_Break: *}
2243          \p{LC}                  \p{Cased_Letter} (= \p{General_Category=
2244                                    Cased_Letter}) (3223)
2245          \p{Lepc}                \p{Lepcha} (= \p{Script=Lepcha}) (NOT
2246                                    \p{Block=Lepcha}) (74)
2247          \p{Lepcha}              \p{Script=Lepcha} (Short: \p{Lepc}; NOT
2248                                    \p{Block=Lepcha}) (74)
2249          \p{Letter}              \p{General_Category=Letter} (Short: \p{L})
2250                                    (101_013)
2251          \p{Letter_Number}       \p{General_Category=Letter_Number} (Short:
2252                                    \p{Nl}) (224)
2253        X \p{Letterlike_Symbols}  \p{Block=Letterlike_Symbols} (80)
2254          \p{Limb}                \p{Limbu} (= \p{Script=Limbu}) (NOT
2255                                    \p{Block=Limbu}) (66)
2256          \p{Limbu}               \p{Script=Limbu} (Short: \p{Limb}; NOT
2257                                    \p{Block=Limbu}) (66)
2258          \p{Linb}                \p{Linear_B} (= \p{Script=Linear_B}) (211)
2259          \p{Line_Break: AI}      \p{Line_Break=Ambiguous} (724)
2260          \p{Line_Break: AL}      \p{Line_Break=Alphabetic} (16_251)
2261          \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (16_251)
2262          \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (724)
2263          \p{Line_Break: B2}      \p{Line_Break=Break_Both} (3)
2264          \p{Line_Break: BA}      \p{Line_Break=Break_After} (151)
2265          \p{Line_Break: BB}      \p{Line_Break=Break_Before} (19)
2266          \p{Line_Break: BK}      \p{Line_Break=Mandatory_Break} (4)
2267          \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (151)
2268          \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (19)
2269          \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3)
2270          \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1)
2271          \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1)
2272          \p{Line_Break: CB}      \p{Line_Break=Contingent_Break} (1)
2273          \p{Line_Break: CJ}      \p{Line_Break=
2274                                    Conditional_Japanese_Starter} (51)
2275          \p{Line_Break: CL}      \p{Line_Break=Close_Punctuation} (87)
2276          \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2)
2277          \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (87)
2278          \p{Line_Break: CM}      \p{Line_Break=Combining_Mark} (1628)
2279          \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (1628)
2280          \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (665)
2281          \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
2282                                    (51)
2283          \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1)
2284          \p{Line_Break: CP}      \p{Line_Break=Close_Parenthesis} (2)
2285          \p{Line_Break: CR}      \p{Line_Break=Carriage_Return} (1)
2286          \p{Line_Break: EX}      \p{Line_Break=Exclamation} (34)
2287          \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (34)
2288          \p{Line_Break: GL}      \p{Line_Break=Glue} (18)
2289          \p{Line_Break: Glue}    (Short: \p{Lb=GL}) (18)
2290          \p{Line_Break: H2}      (Short: \p{Lb=H2}) (399)
2291          \p{Line_Break: H3}      (Short: \p{Lb=H3}) (10_773)
2292          \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (74)
2293          \p{Line_Break: HL}      \p{Line_Break=Hebrew_Letter} (74)
2294          \p{Line_Break: HY}      \p{Line_Break=Hyphen} (1)
2295          \p{Line_Break: Hyphen}  (Short: \p{Lb=HY}) (1)
2296          \p{Line_Break: ID}      \p{Line_Break=Ideographic} (161_793)
2297          \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (161_793)
2298          \p{Line_Break: IN}      \p{Line_Break=Inseparable} (4)
2299          \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13)
2300          \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (4)
2301          \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (4)
2302          \p{Line_Break: IS}      \p{Line_Break=Infix_Numeric} (13)
2303          \p{Line_Break: JL}      (Short: \p{Lb=JL}) (125)
2304          \p{Line_Break: JT}      (Short: \p{Lb=JT}) (137)
2305          \p{Line_Break: JV}      (Short: \p{Lb=JV}) (95)
2306          \p{Line_Break: LF}      \p{Line_Break=Line_Feed} (1)
2307          \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1)
2308          \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4)
2309          \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1)
2310          \p{Line_Break: NL}      \p{Line_Break=Next_Line} (1)
2311          \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (26)
2312          \p{Line_Break: NS}      \p{Line_Break=Nonstarter} (26)
2313          \p{Line_Break: NU}      \p{Line_Break=Numeric} (452)
2314          \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (452)
2315          \p{Line_Break: OP}      \p{Line_Break=Open_Punctuation} (81)
2316          \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (81)
2317          \p{Line_Break: PO}      \p{Line_Break=Postfix_Numeric} (28)
2318          \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (28)
2319          \p{Line_Break: PR}      \p{Line_Break=Prefix_Numeric} (45)
2320          \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (45)
2321          \p{Line_Break: QU}      \p{Line_Break=Quotation} (34)
2322          \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (34)
2323          \p{Line_Break: SA}      \p{Line_Break=Complex_Context} (665)
2324        D \p{Line_Break: SG}      \p{Line_Break=Surrogate} (2048)
2325          \p{Line_Break: SP}      \p{Line_Break=Space} (1)
2326          \p{Line_Break: Space}   (Short: \p{Lb=SP}) (1)
2327        D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
2328                                    should never appear in well-formed text,
2329                                    and therefore shouldn't be the basis for
2330                                    line breaking (Short: \p{Lb=SG}) (2048)
2331          \p{Line_Break: SY}      \p{Line_Break=Break_Symbols} (1)
2332          \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (918_338)
2333          \p{Line_Break: WJ}      \p{Line_Break=Word_Joiner} (2)
2334          \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2)
2335          \p{Line_Break: XX}      \p{Line_Break=Unknown} (918_338)
2336          \p{Line_Break: ZW}      \p{Line_Break=ZWSpace} (1)
2337          \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1)
2338          \p{Line_Separator}      \p{General_Category=Line_Separator}
2339                                    (Short: \p{Zl}) (1)
2340          \p{Linear_B}            \p{Script=Linear_B} (Short: \p{Linb}) (211)
2341        X \p{Linear_B_Ideograms}  \p{Block=Linear_B_Ideograms} (128)
2342        X \p{Linear_B_Syllabary}  \p{Block=Linear_B_Syllabary} (128)
2343          \p{Lisu}                \p{Script=Lisu} (48)
2344          \p{Ll}                  \p{Lowercase_Letter} (=
2345                                    \p{General_Category=Lowercase_Letter})
2346                                    (/i= General_Category=Cased_Letter)
2347                                    (1751)
2348          \p{Lm}                  \p{Modifier_Letter} (=
2349                                    \p{General_Category=Modifier_Letter})
2350                                    (237)
2351          \p{Lo}                  \p{Other_Letter} (= \p{General_Category=
2352                                    Other_Letter}) (97_553)
2353          \p{LOE}                 \p{Logical_Order_Exception} (=
2354                                    \p{Logical_Order_Exception=Y}) (15)
2355          \p{LOE: *}              \p{Logical_Order_Exception: *}
2356          \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
2357                                    \p{LOE}) (15)
2358          \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
2359                                    (1_114_097)
2360          \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (15)
2361        X \p{Low_Surrogates}      \p{Block=Low_Surrogates} (1024)
2362          \p{Lower}               \p{Lowercase=Y} (/i= Cased=Yes) (1934)
2363          \p{Lower: *}            \p{Lowercase: *}
2364          \p{Lowercase}           \p{Lower} (= \p{Lowercase=Y}) (/i= Cased=
2365                                    Yes) (1934)
2366          \p{Lowercase: N*}       (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
2367                                    No) (1_112_178)
2368          \p{Lowercase: Y*}       (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
2369                                    Yes) (1934)
2370          \p{Lowercase_Letter}    \p{General_Category=Lowercase_Letter}
2371                                    (Short: \p{Ll}; /i= General_Category=
2372                                    Cased_Letter) (1751)
2373          \p{Lt}                  \p{Titlecase_Letter} (=
2374                                    \p{General_Category=Titlecase_Letter})
2375                                    (/i= General_Category=Cased_Letter) (31)
2376          \p{Lu}                  \p{Uppercase_Letter} (=
2377                                    \p{General_Category=Uppercase_Letter})
2378                                    (/i= General_Category=Cased_Letter)
2379                                    (1441)
2380          \p{Lyci}                \p{Lycian} (= \p{Script=Lycian}) (NOT
2381                                    \p{Block=Lycian}) (29)
2382          \p{Lycian}              \p{Script=Lycian} (Short: \p{Lyci}; NOT
2383                                    \p{Block=Lycian}) (29)
2384          \p{Lydi}                \p{Lydian} (= \p{Script=Lydian}) (NOT
2385                                    \p{Block=Lydian}) (27)
2386          \p{Lydian}              \p{Script=Lydian} (Short: \p{Lydi}; NOT
2387                                    \p{Block=Lydian}) (27)
2388          \p{M}                   \p{Mark} (= \p{General_Category=Mark})
2389                                    (1645)
2390        X \p{Mahjong}             \p{Mahjong_Tiles} (= \p{Block=
2391                                    Mahjong_Tiles}) (48)
2392        X \p{Mahjong_Tiles}       \p{Block=Mahjong_Tiles} (Short:
2393                                    \p{InMahjong}) (48)
2394          \p{Malayalam}           \p{Script=Malayalam} (Short: \p{Mlym}; NOT
2395                                    \p{Block=Malayalam}) (98)
2396          \p{Mand}                \p{Mandaic} (= \p{Script=Mandaic}) (NOT
2397                                    \p{Block=Mandaic}) (29)
2398          \p{Mandaic}             \p{Script=Mandaic} (Short: \p{Mand}; NOT
2399                                    \p{Block=Mandaic}) (29)
2400          \p{Mark}                \p{General_Category=Mark} (Short: \p{M})
2401                                    (1645)
2402          \p{Math}                \p{Math=Y} (2310)
2403          \p{Math: N*}            (Single: \P{Math}) (1_111_802)
2404          \p{Math: Y*}            (Single: \p{Math}) (2310)
2405        X \p{Math_Alphanum}       \p{Mathematical_Alphanumeric_Symbols} (=
2406                                    \p{Block=
2407                                    Mathematical_Alphanumeric_Symbols})
2408                                    (1024)
2409        X \p{Math_Operators}      \p{Mathematical_Operators} (= \p{Block=
2410                                    Mathematical_Operators}) (256)
2411          \p{Math_Symbol}         \p{General_Category=Math_Symbol} (Short:
2412                                    \p{Sm}) (952)
2413        X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
2414                                    Mathematical_Alphanumeric_Symbols}
2415                                    (Short: \p{InMathAlphanum}) (1024)
2416        X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
2417                                    (Short: \p{InMathOperators}) (256)
2418          \p{Mc}                  \p{Spacing_Mark} (= \p{General_Category=
2419                                    Spacing_Mark}) (353)
2420          \p{Me}                  \p{Enclosing_Mark} (= \p{General_Category=
2421                                    Enclosing_Mark}) (12)
2422          \p{Meetei_Mayek}        \p{Script=Meetei_Mayek} (Short: \p{Mtei};
2423                                    NOT \p{Block=Meetei_Mayek}) (79)
2424        X \p{Meetei_Mayek_Ext}    \p{Meetei_Mayek_Extensions} (= \p{Block=
2425                                    Meetei_Mayek_Extensions}) (32)
2426        X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
2427                                    (Short: \p{InMeeteiMayekExt}) (32)
2428          \p{Merc}                \p{Meroitic_Cursive} (= \p{Script=
2429                                    Meroitic_Cursive}) (NOT \p{Block=
2430                                    Meroitic_Cursive}) (26)
2431          \p{Mero}                \p{Meroitic_Hieroglyphs} (= \p{Script=
2432                                    Meroitic_Hieroglyphs}) (32)
2433          \p{Meroitic_Cursive}    \p{Script=Meroitic_Cursive} (Short:
2434                                    \p{Merc}; NOT \p{Block=
2435                                    Meroitic_Cursive}) (26)
2436          \p{Meroitic_Hieroglyphs} \p{Script=Meroitic_Hieroglyphs} (Short:
2437                                    \p{Mero}) (32)
2438          \p{Miao}                \p{Script=Miao} (NOT \p{Block=Miao}) (133)
2439        X \p{Misc_Arrows}         \p{Miscellaneous_Symbols_And_Arrows} (=
2440                                    \p{Block=
2441                                    Miscellaneous_Symbols_And_Arrows}) (256)
2442        X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
2443                                    (= \p{Block=
2444                                    Miscellaneous_Mathematical_Symbols_A})
2445                                    (48)
2446        X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
2447                                    (= \p{Block=
2448                                    Miscellaneous_Mathematical_Symbols_B})
2449                                    (128)
2450        X \p{Misc_Pictographs}    \p{Miscellaneous_Symbols_And_Pictographs}
2451                                    (= \p{Block=
2452                                    Miscellaneous_Symbols_And_Pictographs})
2453                                    (768)
2454        X \p{Misc_Symbols}        \p{Miscellaneous_Symbols} (= \p{Block=
2455                                    Miscellaneous_Symbols}) (256)
2456        X \p{Misc_Technical}      \p{Miscellaneous_Technical} (= \p{Block=
2457                                    Miscellaneous_Technical}) (256)
2458        X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
2459                                    Miscellaneous_Mathematical_Symbols_A}
2460                                    (Short: \p{InMiscMathSymbolsA}) (48)
2461        X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
2462                                    Miscellaneous_Mathematical_Symbols_B}
2463                                    (Short: \p{InMiscMathSymbolsB}) (128)
2464        X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
2465                                    \p{InMiscSymbols}) (256)
2466        X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
2467                                    Miscellaneous_Symbols_And_Arrows}
2468                                    (Short: \p{InMiscArrows}) (256)
2469        X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
2470                                    Miscellaneous_Symbols_And_Pictographs}
2471                                    (Short: \p{InMiscPictographs}) (768)
2472        X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
2473                                    (Short: \p{InMiscTechnical}) (256)
2474          \p{Mlym}                \p{Malayalam} (= \p{Script=Malayalam})
2475                                    (NOT \p{Block=Malayalam}) (98)
2476          \p{Mn}                  \p{Nonspacing_Mark} (=
2477                                    \p{General_Category=Nonspacing_Mark})
2478                                    (1280)
2479          \p{Modifier_Letter}     \p{General_Category=Modifier_Letter}
2480                                    (Short: \p{Lm}) (237)
2481        X \p{Modifier_Letters}    \p{Spacing_Modifier_Letters} (= \p{Block=
2482                                    Spacing_Modifier_Letters}) (80)
2483          \p{Modifier_Symbol}     \p{General_Category=Modifier_Symbol}
2484                                    (Short: \p{Sk}) (115)
2485        X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
2486          \p{Mong}                \p{Mongolian} (= \p{Script=Mongolian})
2487                                    (NOT \p{Block=Mongolian}) (153)
2488          \p{Mongolian}           \p{Script=Mongolian} (Short: \p{Mong}; NOT
2489                                    \p{Block=Mongolian}) (153)
2490          \p{Mtei}                \p{Meetei_Mayek} (= \p{Script=
2491                                    Meetei_Mayek}) (NOT \p{Block=
2492                                    Meetei_Mayek}) (79)
2493        X \p{Music}               \p{Musical_Symbols} (= \p{Block=
2494                                    Musical_Symbols}) (256)
2495        X \p{Musical_Symbols}     \p{Block=Musical_Symbols} (Short:
2496                                    \p{InMusic}) (256)
2497          \p{Myanmar}             \p{Script=Myanmar} (Short: \p{Mymr}; NOT
2498                                    \p{Block=Myanmar}) (188)
2499        X \p{Myanmar_Ext_A}       \p{Myanmar_Extended_A} (= \p{Block=
2500                                    Myanmar_Extended_A}) (32)
2501        X \p{Myanmar_Extended_A}  \p{Block=Myanmar_Extended_A} (Short:
2502                                    \p{InMyanmarExtA}) (32)
2503          \p{Mymr}                \p{Myanmar} (= \p{Script=Myanmar}) (NOT
2504                                    \p{Block=Myanmar}) (188)
2505          \p{N}                   \p{Number} (= \p{General_Category=Number})
2506                                    (1148)
2507        X \p{NB}                  \p{No_Block} (= \p{Block=No_Block})
2508                                    (860_672)
2509          \p{NChar}               \p{Noncharacter_Code_Point} (=
2510                                    \p{Noncharacter_Code_Point=Y}) (66)
2511          \p{NChar: *}            \p{Noncharacter_Code_Point: *}
2512          \p{Nd}                  \p{Digit} (= \p{General_Category=
2513                                    Decimal_Number}) (460)
2514          \p{New_Tai_Lue}         \p{Script=New_Tai_Lue} (Short: \p{Talu};
2515                                    NOT \p{Block=New_Tai_Lue}) (83)
2516          \p{NFC_QC: *}           \p{NFC_Quick_Check: *}
2517          \p{NFC_Quick_Check: M}  \p{NFC_Quick_Check=Maybe} (104)
2518          \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (104)
2519          \p{NFC_Quick_Check: N}  \p{NFC_Quick_Check=No} (NOT
2520                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
2521                                    (1120)
2522          \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
2523                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
2524                                    (1120)
2525          \p{NFC_Quick_Check: Y}  \p{NFC_Quick_Check=Yes} (NOT
2526                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
2527                                    (1_112_888)
2528          \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
2529                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
2530                                    (1_112_888)
2531          \p{NFD_QC: *}           \p{NFD_Quick_Check: *}
2532          \p{NFD_Quick_Check: N}  \p{NFD_Quick_Check=No} (NOT
2533                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
2534                                    (13_225)
2535          \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
2536                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
2537                                    (13_225)
2538          \p{NFD_Quick_Check: Y}  \p{NFD_Quick_Check=Yes} (NOT
2539                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
2540                                    (1_100_887)
2541          \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
2542                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
2543                                    (1_100_887)
2544          \p{NFKC_QC: *}          \p{NFKC_Quick_Check: *}
2545          \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (104)
2546          \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (104)
2547          \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
2548                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
2549                                    (4787)
2550          \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
2551                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
2552                                    (4787)
2553          \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
2554                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
2555                                    (1_109_221)
2556          \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
2557                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
2558                                    (1_109_221)
2559          \p{NFKD_QC: *}          \p{NFKD_Quick_Check: *}
2560          \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
2561                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
2562                                    (16_880)
2563          \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
2564                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
2565                                    (16_880)
2566          \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
2567                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
2568                                    (1_097_232)
2569          \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
2570                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
2571                                    (1_097_232)
2572          \p{Nko}                 \p{Script=Nko} (NOT \p{Block=NKo}) (59)
2573          \p{Nkoo}                \p{Nko} (= \p{Script=Nko}) (NOT \p{Block=
2574                                    NKo}) (59)
2575          \p{Nl}                  \p{Letter_Number} (= \p{General_Category=
2576                                    Letter_Number}) (224)
2577          \p{No}                  \p{Other_Number} (= \p{General_Category=
2578                                    Other_Number}) (464)
2579        X \p{No_Block}            \p{Block=No_Block} (Short: \p{InNB})
2580                                    (860_672)
2581          \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
2582                                    \p{NChar}) (66)
2583          \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
2584                                    (1_114_046)
2585          \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
2586                                    (66)
2587          \p{Nonspacing_Mark}     \p{General_Category=Nonspacing_Mark}
2588                                    (Short: \p{Mn}) (1280)
2589          \p{Nt: *}               \p{Numeric_Type: *}
2590          \p{Number}              \p{General_Category=Number} (Short: \p{N})
2591                                    (1148)
2592        X \p{Number_Forms}        \p{Block=Number_Forms} (64)
2593          \p{Numeric_Type: De}    \p{Numeric_Type=Decimal} (460)
2594          \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (460)
2595          \p{Numeric_Type: Di}    \p{Numeric_Type=Digit} (128)
2596          \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128)
2597          \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_887)
2598          \p{Numeric_Type: Nu}    \p{Numeric_Type=Numeric} (637)
2599          \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (637)
2600        T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1)
2601        T \p{Numeric_Value: 0}    (Short: \p{Nv=0}) (60)
2602        T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (3)
2603        T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (1)
2604        T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1)
2605        T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (5)
2606        T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1)
2607        T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (2)
2608        T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (3)
2609        T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (1)
2610        T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (9)
2611        T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (4)
2612        T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1)
2613        T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1)
2614        T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (10)
2615        T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1)
2616        T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1)
2617        T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (5)
2618        T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (6)
2619        T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1)
2620        T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (2)
2621        T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1)
2622        T \p{Numeric_Value: 1}    (Short: \p{Nv=1}) (97)
2623        T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1)
2624        T \p{Numeric_Value: 2}    (Short: \p{Nv=2}) (100)
2625        T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1)
2626        T \p{Numeric_Value: 3}    (Short: \p{Nv=3}) (102)
2627        T \p{Numeric_Value: 7/2}  (Short: \p{Nv=7/2}) (1)
2628        T \p{Numeric_Value: 4}    (Short: \p{Nv=4}) (93)
2629        T \p{Numeric_Value: 9/2}  (Short: \p{Nv=9/2}) (1)
2630        T \p{Numeric_Value: 5}    (Short: \p{Nv=5}) (90)
2631        T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
2632        T \p{Numeric_Value: 6}    (Short: \p{Nv=6}) (82)
2633        T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
2634        T \p{Numeric_Value: 7}    (Short: \p{Nv=7}) (81)
2635        T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
2636        T \p{Numeric_Value: 8}    (Short: \p{Nv=8}) (77)
2637        T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
2638        T \p{Numeric_Value: 9}    (Short: \p{Nv=9}) (81)
2639        T \p{Numeric_Value: 10}   (Short: \p{Nv=10}) (40)
2640        T \p{Numeric_Value: 11}   (Short: \p{Nv=11}) (6)
2641        T \p{Numeric_Value: 12}   (Short: \p{Nv=12}) (6)
2642        T \p{Numeric_Value: 13}   (Short: \p{Nv=13}) (4)
2643        T \p{Numeric_Value: 14}   (Short: \p{Nv=14}) (4)
2644        T \p{Numeric_Value: 15}   (Short: \p{Nv=15}) (4)
2645        T \p{Numeric_Value: 16}   (Short: \p{Nv=16}) (5)
2646        T \p{Numeric_Value: 17}   (Short: \p{Nv=17}) (5)
2647        T \p{Numeric_Value: 18}   (Short: \p{Nv=18}) (5)
2648        T \p{Numeric_Value: 19}   (Short: \p{Nv=19}) (5)
2649        T \p{Numeric_Value: 20}   (Short: \p{Nv=20}) (19)
2650        T \p{Numeric_Value: 21}   (Short: \p{Nv=21}) (1)
2651        T \p{Numeric_Value: 22}   (Short: \p{Nv=22}) (1)
2652        T \p{Numeric_Value: 23}   (Short: \p{Nv=23}) (1)
2653        T \p{Numeric_Value: 24}   (Short: \p{Nv=24}) (1)
2654        T \p{Numeric_Value: 25}   (Short: \p{Nv=25}) (1)
2655        T \p{Numeric_Value: 26}   (Short: \p{Nv=26}) (1)
2656        T \p{Numeric_Value: 27}   (Short: \p{Nv=27}) (1)
2657        T \p{Numeric_Value: 28}   (Short: \p{Nv=28}) (1)
2658        T \p{Numeric_Value: 29}   (Short: \p{Nv=29}) (1)
2659        T \p{Numeric_Value: 30}   (Short: \p{Nv=30}) (11)
2660        T \p{Numeric_Value: 31}   (Short: \p{Nv=31}) (1)
2661        T \p{Numeric_Value: 32}   (Short: \p{Nv=32}) (1)
2662        T \p{Numeric_Value: 33}   (Short: \p{Nv=33}) (1)
2663        T \p{Numeric_Value: 34}   (Short: \p{Nv=34}) (1)
2664        T \p{Numeric_Value: 35}   (Short: \p{Nv=35}) (1)
2665        T \p{Numeric_Value: 36}   (Short: \p{Nv=36}) (1)
2666        T \p{Numeric_Value: 37}   (Short: \p{Nv=37}) (1)
2667        T \p{Numeric_Value: 38}   (Short: \p{Nv=38}) (1)
2668        T \p{Numeric_Value: 39}   (Short: \p{Nv=39}) (1)
2669        T \p{Numeric_Value: 40}   (Short: \p{Nv=40}) (10)
2670        T \p{Numeric_Value: 41}   (Short: \p{Nv=41}) (1)
2671        T \p{Numeric_Value: 42}   (Short: \p{Nv=42}) (1)
2672        T \p{Numeric_Value: 43}   (Short: \p{Nv=43}) (1)
2673        T \p{Numeric_Value: 44}   (Short: \p{Nv=44}) (1)
2674        T \p{Numeric_Value: 45}   (Short: \p{Nv=45}) (1)
2675        T \p{Numeric_Value: 46}   (Short: \p{Nv=46}) (1)
2676        T \p{Numeric_Value: 47}   (Short: \p{Nv=47}) (1)
2677        T \p{Numeric_Value: 48}   (Short: \p{Nv=48}) (1)
2678        T \p{Numeric_Value: 49}   (Short: \p{Nv=49}) (1)
2679        T \p{Numeric_Value: 50}   (Short: \p{Nv=50}) (20)
2680        T \p{Numeric_Value: 60}   (Short: \p{Nv=60}) (6)
2681        T \p{Numeric_Value: 70}   (Short: \p{Nv=70}) (6)
2682        T \p{Numeric_Value: 80}   (Short: \p{Nv=80}) (6)
2683        T \p{Numeric_Value: 90}   (Short: \p{Nv=90}) (6)
2684        T \p{Numeric_Value: 100}  (Short: \p{Nv=100}) (20)
2685        T \p{Numeric_Value: 200}  (Short: \p{Nv=200}) (2)
2686        T \p{Numeric_Value: 300}  (Short: \p{Nv=300}) (3)
2687        T \p{Numeric_Value: 400}  (Short: \p{Nv=400}) (2)
2688        T \p{Numeric_Value: 500}  (Short: \p{Nv=500}) (12)
2689        T \p{Numeric_Value: 600}  (Short: \p{Nv=600}) (2)
2690        T \p{Numeric_Value: 700}  (Short: \p{Nv=700}) (2)
2691        T \p{Numeric_Value: 800}  (Short: \p{Nv=800}) (2)
2692        T \p{Numeric_Value: 900}  (Short: \p{Nv=900}) (3)
2693        T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (17)
2694        T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (1)
2695        T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (1)
2696        T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (1)
2697        T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (5)
2698        T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (1)
2699        T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (1)
2700        T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (1)
2701        T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (1)
2702        T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (7)
2703        T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (1)
2704        T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (1)
2705        T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (1)
2706        T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (4)
2707        T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (1)
2708        T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (1)
2709        T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (1)
2710        T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (1)
2711        T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (1)
2712        T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
2713                                    (2)
2714        T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
2715                                    1000000000000}) (1)
2716          \p{Numeric_Value: NaN}  (Short: \p{Nv=NaN}) (1_112_887)
2717          \p{Nv: *}               \p{Numeric_Value: *}
2718        X \p{OCR}                 \p{Optical_Character_Recognition} (=
2719                                    \p{Block=Optical_Character_Recognition})
2720                                    (32)
2721          \p{Ogam}                \p{Ogham} (= \p{Script=Ogham}) (NOT
2722                                    \p{Block=Ogham}) (29)
2723          \p{Ogham}               \p{Script=Ogham} (Short: \p{Ogam}; NOT
2724                                    \p{Block=Ogham}) (29)
2725          \p{Ol_Chiki}            \p{Script=Ol_Chiki} (Short: \p{Olck}) (48)
2726          \p{Olck}                \p{Ol_Chiki} (= \p{Script=Ol_Chiki}) (48)
2727          \p{Old_Italic}          \p{Script=Old_Italic} (Short: \p{Ital};
2728                                    NOT \p{Block=Old_Italic}) (35)
2729          \p{Old_Persian}         \p{Script=Old_Persian} (Short: \p{Xpeo};
2730                                    NOT \p{Block=Old_Persian}) (50)
2731          \p{Old_South_Arabian}   \p{Script=Old_South_Arabian} (Short:
2732                                    \p{Sarb}) (32)
2733          \p{Old_Turkic}          \p{Script=Old_Turkic} (Short: \p{Orkh};
2734                                    NOT \p{Block=Old_Turkic}) (73)
2735          \p{Open_Punctuation}    \p{General_Category=Open_Punctuation}
2736                                    (Short: \p{Ps}) (72)
2737        X \p{Optical_Character_Recognition} \p{Block=
2738                                    Optical_Character_Recognition} (Short:
2739                                    \p{InOCR}) (32)
2740          \p{Oriya}               \p{Script=Oriya} (Short: \p{Orya}; NOT
2741                                    \p{Block=Oriya}) (90)
2742          \p{Orkh}                \p{Old_Turkic} (= \p{Script=Old_Turkic})
2743                                    (NOT \p{Block=Old_Turkic}) (73)
2744          \p{Orya}                \p{Oriya} (= \p{Script=Oriya}) (NOT
2745                                    \p{Block=Oriya}) (90)
2746          \p{Osma}                \p{Osmanya} (= \p{Script=Osmanya}) (NOT
2747                                    \p{Block=Osmanya}) (40)
2748          \p{Osmanya}             \p{Script=Osmanya} (Short: \p{Osma}; NOT
2749                                    \p{Block=Osmanya}) (40)
2750          \p{Other}               \p{General_Category=Other} (Short: \p{C})
2751                                    (1_004_135)
2752          \p{Other_Letter}        \p{General_Category=Other_Letter} (Short:
2753                                    \p{Lo}) (97_553)
2754          \p{Other_Number}        \p{General_Category=Other_Number} (Short:
2755                                    \p{No}) (464)
2756          \p{Other_Punctuation}   \p{General_Category=Other_Punctuation}
2757                                    (Short: \p{Po}) (434)
2758          \p{Other_Symbol}        \p{General_Category=Other_Symbol} (Short:
2759                                    \p{So}) (4404)
2760          \p{P}                   \p{Punct} (= \p{General_Category=
2761                                    Punctuation}) (NOT
2762                                    \p{General_Punctuation}) (632)
2763          \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
2764                                    (Short: \p{Zp}) (1)
2765          \p{Pat_Syn}             \p{Pattern_Syntax} (= \p{Pattern_Syntax=
2766                                    Y}) (2760)
2767          \p{Pat_Syn: *}          \p{Pattern_Syntax: *}
2768          \p{Pat_WS}              \p{Pattern_White_Space} (=
2769                                    \p{Pattern_White_Space=Y}) (11)
2770          \p{Pat_WS: *}           \p{Pattern_White_Space: *}
2771          \p{Pattern_Syntax}      \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
2772                                    (2760)
2773          \p{Pattern_Syntax: N*}  (Short: \p{PatSyn=N}, \P{PatSyn})
2774                                    (1_111_352)
2775          \p{Pattern_Syntax: Y*}  (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
2776          \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
2777                                    \p{PatWS}) (11)
2778          \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
2779                                    (1_114_101)
2780          \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
2781          \p{Pc}                  \p{Connector_Punctuation} (=
2782                                    \p{General_Category=
2783                                    Connector_Punctuation}) (10)
2784          \p{Pd}                  \p{Dash_Punctuation} (=
2785                                    \p{General_Category=Dash_Punctuation})
2786                                    (23)
2787          \p{Pe}                  \p{Close_Punctuation} (=
2788                                    \p{General_Category=Close_Punctuation})
2789                                    (71)
2790          \p{PerlSpace}           \s, restricted to ASCII = [ \f\n\r\t] (5)
2791          \p{PerlWord}            \w, restricted to ASCII = [A-Za-z0-9_] (63)
2792          \p{Pf}                  \p{Final_Punctuation} (=
2793                                    \p{General_Category=Final_Punctuation})
2794                                    (10)
2795          \p{Phag}                \p{Phags_Pa} (= \p{Script=Phags_Pa}) (NOT
2796                                    \p{Block=Phags_Pa}) (56)
2797          \p{Phags_Pa}            \p{Script=Phags_Pa} (Short: \p{Phag}; NOT
2798                                    \p{Block=Phags_Pa}) (56)
2799        X \p{Phaistos}            \p{Phaistos_Disc} (= \p{Block=
2800                                    Phaistos_Disc}) (48)
2801        X \p{Phaistos_Disc}       \p{Block=Phaistos_Disc} (Short:
2802                                    \p{InPhaistos}) (48)
2803          \p{Phli}                \p{Inscriptional_Pahlavi} (= \p{Script=
2804                                    Inscriptional_Pahlavi}) (NOT \p{Block=
2805                                    Inscriptional_Pahlavi}) (27)
2806          \p{Phnx}                \p{Phoenician} (= \p{Script=Phoenician})
2807                                    (NOT \p{Block=Phoenician}) (29)
2808          \p{Phoenician}          \p{Script=Phoenician} (Short: \p{Phnx};
2809                                    NOT \p{Block=Phoenician}) (29)
2810        X \p{Phonetic_Ext}        \p{Phonetic_Extensions} (= \p{Block=
2811                                    Phonetic_Extensions}) (128)
2812        X \p{Phonetic_Ext_Sup}    \p{Phonetic_Extensions_Supplement} (=
2813                                    \p{Block=
2814                                    Phonetic_Extensions_Supplement}) (64)
2815        X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
2816                                    \p{InPhoneticExt}) (128)
2817        X \p{Phonetic_Extensions_Supplement} \p{Block=
2818                                    Phonetic_Extensions_Supplement} (Short:
2819                                    \p{InPhoneticExtSup}) (64)
2820          \p{Pi}                  \p{Initial_Punctuation} (=
2821                                    \p{General_Category=
2822                                    Initial_Punctuation}) (12)
2823        X \p{Playing_Cards}       \p{Block=Playing_Cards} (96)
2824          \p{Plrd}                \p{Miao} (= \p{Script=Miao}) (NOT
2825                                    \p{Block=Miao}) (133)
2826          \p{Po}                  \p{Other_Punctuation} (=
2827                                    \p{General_Category=Other_Punctuation})
2828                                    (434)
2829          \p{PosixAlnum}          [A-Za-z0-9] (62)
2830          \p{PosixAlpha}          [A-Za-z] (52)
2831          \p{PosixBlank}          \t and ' ' (2)
2832          \p{PosixCntrl}          ASCII control characters: NUL, SOH, STX,
2833                                    ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT,
2834                                    FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4,
2835                                    NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS,
2836                                    GS, RS, US, and DEL (33)
2837          \p{PosixDigit}          [0-9] (10)
2838          \p{PosixGraph}          [-!"#$%&'()*+,./:;<=>?@[\\\]^_`{|}~0-9A-Za-
2839                                    z] (94)
2840          \p{PosixLower}          [a-z] (/i= PosixAlpha) (26)
2841          \p{PosixPrint}          [- 0-9A-Za-
2842                                    z!"#$%&'()*+,./:;<=>?@[\\\]^_`{|}~] (95)
2843          \p{PosixPunct}          [-!"#$%&'()*+,./:;<=>?@[\\\]^_`{|}~] (32)
2844          \p{PosixSpace}          \t, \n, \cK, \f, \r, and ' '.  (\cK is
2845                                    vertical tab) (6)
2846          \p{PosixUpper}          [A-Z] (/i= PosixAlpha) (26)
2847          \p{PosixWord}           \p{PerlWord} (63)
2848          \p{PosixXDigit}         \p{ASCII_Hex_Digit=Y} [0-9A-Fa-f] (Short:
2849                                    \p{AHex}) (22)
2850        T \p{Present_In: 1.1}     \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
2851                                    extension) (33_979)
2852        T \p{Present_In: 2.0}     Code point's usage introduced in version
2853                                    2.0 or earlier (Short: \p{In=2.0}) (Perl
2854                                    extension) (178_500)
2855        T \p{Present_In: 2.1}     Code point's usage introduced in version
2856                                    2.1 or earlier (Short: \p{In=2.1}) (Perl
2857                                    extension) (178_502)
2858        T \p{Present_In: 3.0}     Code point's usage introduced in version
2859                                    3.0 or earlier (Short: \p{In=3.0}) (Perl
2860                                    extension) (188_809)
2861        T \p{Present_In: 3.1}     Code point's usage introduced in version
2862                                    3.1 or earlier (Short: \p{In=3.1}) (Perl
2863                                    extension) (233_787)
2864        T \p{Present_In: 3.2}     Code point's usage introduced in version
2865                                    3.2 or earlier (Short: \p{In=3.2}) (Perl
2866                                    extension) (234_803)
2867        T \p{Present_In: 4.0}     Code point's usage introduced in version
2868                                    4.0 or earlier (Short: \p{In=4.0}) (Perl
2869                                    extension) (236_029)
2870        T \p{Present_In: 4.1}     Code point's usage introduced in version
2871                                    4.1 or earlier (Short: \p{In=4.1}) (Perl
2872                                    extension) (237_302)
2873        T \p{Present_In: 5.0}     Code point's usage introduced in version
2874                                    5.0 or earlier (Short: \p{In=5.0}) (Perl
2875                                    extension) (238_671)
2876        T \p{Present_In: 5.1}     Code point's usage introduced in version
2877                                    5.1 or earlier (Short: \p{In=5.1}) (Perl
2878                                    extension) (240_295)
2879        T \p{Present_In: 5.2}     Code point's usage introduced in version
2880                                    5.2 or earlier (Short: \p{In=5.2}) (Perl
2881                                    extension) (246_943)
2882        T \p{Present_In: 6.0}     Code point's usage introduced in version
2883                                    6.0 or earlier (Short: \p{In=6.0}) (Perl
2884                                    extension) (249_031)
2885        T \p{Present_In: 6.1}     Code point's usage introduced in version
2886                                    6.1 or earlier (Short: \p{In=6.1}) (Perl
2887                                    extension) (249_763)
2888          \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
2889                                    Unassigned}) (Perl extension) (864_349)
2890          \p{Print}               Characters that are graphical plus space
2891                                    characters (but no controls) (247_582)
2892          \p{Private_Use}         \p{General_Category=Private_Use} (Short:
2893                                    \p{Co}; NOT \p{Private_Use_Area})
2894                                    (137_468)
2895        X \p{Private_Use_Area}    \p{Block=Private_Use_Area} (Short:
2896                                    \p{InPUA}) (6400)
2897          \p{Prti}                \p{Inscriptional_Parthian} (= \p{Script=
2898                                    Inscriptional_Parthian}) (NOT \p{Block=
2899                                    Inscriptional_Parthian}) (30)
2900          \p{Ps}                  \p{Open_Punctuation} (=
2901                                    \p{General_Category=Open_Punctuation})
2902                                    (72)
2903        X \p{PUA}                 \p{Private_Use_Area} (= \p{Block=
2904                                    Private_Use_Area}) (6400)
2905          \p{Punct}               \p{General_Category=Punctuation} (Short:
2906                                    \p{P}; NOT \p{General_Punctuation}) (632)
2907          \p{Punctuation}         \p{Punct} (= \p{General_Category=
2908                                    Punctuation}) (NOT
2909                                    \p{General_Punctuation}) (632)
2910          \p{Qaac}                \p{Coptic} (= \p{Script=Coptic}) (NOT
2911                                    \p{Block=Coptic}) (137)
2912          \p{Qaai}                \p{Inherited} (= \p{Script=Inherited})
2913                                    (524)
2914          \p{QMark}               \p{Quotation_Mark} (= \p{Quotation_Mark=
2915                                    Y}) (29)
2916          \p{QMark: *}            \p{Quotation_Mark: *}
2917          \p{Quotation_Mark}      \p{Quotation_Mark=Y} (Short: \p{QMark})
2918                                    (29)
2919          \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_083)
2920          \p{Quotation_Mark: Y*}  (Short: \p{QMark=Y}, \p{QMark}) (29)
2921          \p{Radical}             \p{Radical=Y} (329)
2922          \p{Radical: N*}         (Single: \P{Radical}) (1_113_783)
2923          \p{Radical: Y*}         (Single: \p{Radical}) (329)
2924          \p{Rejang}              \p{Script=Rejang} (Short: \p{Rjng}; NOT
2925                                    \p{Block=Rejang}) (37)
2926          \p{Rjng}                \p{Rejang} (= \p{Script=Rejang}) (NOT
2927                                    \p{Block=Rejang}) (37)
2928        X \p{Rumi}                \p{Rumi_Numeral_Symbols} (= \p{Block=
2929                                    Rumi_Numeral_Symbols}) (32)
2930        X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
2931                                    \p{InRumi}) (32)
2932          \p{Runic}               \p{Script=Runic} (Short: \p{Runr}; NOT
2933                                    \p{Block=Runic}) (78)
2934          \p{Runr}                \p{Runic} (= \p{Script=Runic}) (NOT
2935                                    \p{Block=Runic}) (78)
2936          \p{S}                   \p{Symbol} (= \p{General_Category=Symbol})
2937                                    (5519)
2938          \p{Samaritan}           \p{Script=Samaritan} (Short: \p{Samr}; NOT
2939                                    \p{Block=Samaritan}) (61)
2940          \p{Samr}                \p{Samaritan} (= \p{Script=Samaritan})
2941                                    (NOT \p{Block=Samaritan}) (61)
2942          \p{Sarb}                \p{Old_South_Arabian} (= \p{Script=
2943                                    Old_South_Arabian}) (32)
2944          \p{Saur}                \p{Saurashtra} (= \p{Script=Saurashtra})
2945                                    (NOT \p{Block=Saurashtra}) (81)
2946          \p{Saurashtra}          \p{Script=Saurashtra} (Short: \p{Saur};
2947                                    NOT \p{Block=Saurashtra}) (81)
2948          \p{SB: *}               \p{Sentence_Break: *}
2949          \p{Sc}                  \p{Currency_Symbol} (=
2950                                    \p{General_Category=Currency_Symbol})
2951                                    (48)
2952          \p{Sc: *}               \p{Script: *}
2953          \p{Script: Arab}        \p{Script=Arabic} (1234)
2954          \p{Script: Arabic}      (Short: \p{Sc=Arab}, \p{Arab}) (1234)
2955          \p{Script: Armenian}    (Short: \p{Sc=Armn}, \p{Armn}) (91)
2956          \p{Script: Armi}        \p{Script=Imperial_Aramaic} (31)
2957          \p{Script: Armn}        \p{Script=Armenian} (91)
2958          \p{Script: Avestan}     (Short: \p{Sc=Avst}, \p{Avst}) (61)
2959          \p{Script: Avst}        \p{Script=Avestan} (61)
2960          \p{Script: Bali}        \p{Script=Balinese} (121)
2961          \p{Script: Balinese}    (Short: \p{Sc=Bali}, \p{Bali}) (121)
2962          \p{Script: Bamu}        \p{Script=Bamum} (657)
2963          \p{Script: Bamum}       (Short: \p{Sc=Bamu}, \p{Bamu}) (657)
2964          \p{Script: Batak}       (Short: \p{Sc=Batk}, \p{Batk}) (56)
2965          \p{Script: Batk}        \p{Script=Batak} (56)
2966          \p{Script: Beng}        \p{Script=Bengali} (92)
2967          \p{Script: Bengali}     (Short: \p{Sc=Beng}, \p{Beng}) (92)
2968          \p{Script: Bopo}        \p{Script=Bopomofo} (70)
2969          \p{Script: Bopomofo}    (Short: \p{Sc=Bopo}, \p{Bopo}) (70)
2970          \p{Script: Brah}        \p{Script=Brahmi} (108)
2971          \p{Script: Brahmi}      (Short: \p{Sc=Brah}, \p{Brah}) (108)
2972          \p{Script: Brai}        \p{Script=Braille} (256)
2973          \p{Script: Braille}     (Short: \p{Sc=Brai}, \p{Brai}) (256)
2974          \p{Script: Bugi}        \p{Script=Buginese} (30)
2975          \p{Script: Buginese}    (Short: \p{Sc=Bugi}, \p{Bugi}) (30)
2976          \p{Script: Buhd}        \p{Script=Buhid} (20)
2977          \p{Script: Buhid}       (Short: \p{Sc=Buhd}, \p{Buhd}) (20)
2978          \p{Script: Cakm}        \p{Script=Chakma} (67)
2979          \p{Script: Canadian_Aboriginal} (Short: \p{Sc=Cans}, \p{Cans})
2980                                    (710)
2981          \p{Script: Cans}        \p{Script=Canadian_Aboriginal} (710)
2982          \p{Script: Cari}        \p{Script=Carian} (49)
2983          \p{Script: Carian}      (Short: \p{Sc=Cari}, \p{Cari}) (49)
2984          \p{Script: Chakma}      (Short: \p{Sc=Cakm}, \p{Cakm}) (67)
2985          \p{Script: Cham}        (Short: \p{Sc=Cham}, \p{Cham}) (83)
2986          \p{Script: Cher}        \p{Script=Cherokee} (85)
2987          \p{Script: Cherokee}    (Short: \p{Sc=Cher}, \p{Cher}) (85)
2988          \p{Script: Common}      (Short: \p{Sc=Zyyy}, \p{Zyyy}) (6412)
2989          \p{Script: Copt}        \p{Script=Coptic} (137)
2990          \p{Script: Coptic}      (Short: \p{Sc=Copt}, \p{Copt}) (137)
2991          \p{Script: Cprt}        \p{Script=Cypriot} (55)
2992          \p{Script: Cuneiform}   (Short: \p{Sc=Xsux}, \p{Xsux}) (982)
2993          \p{Script: Cypriot}     (Short: \p{Sc=Cprt}, \p{Cprt}) (55)
2994          \p{Script: Cyrillic}    (Short: \p{Sc=Cyrl}, \p{Cyrl}) (417)
2995          \p{Script: Cyrl}        \p{Script=Cyrillic} (417)
2996          \p{Script: Deseret}     (Short: \p{Sc=Dsrt}, \p{Dsrt}) (80)
2997          \p{Script: Deva}        \p{Script=Devanagari} (151)
2998          \p{Script: Devanagari}  (Short: \p{Sc=Deva}, \p{Deva}) (151)
2999          \p{Script: Dsrt}        \p{Script=Deseret} (80)
3000          \p{Script: Egyp}        \p{Script=Egyptian_Hieroglyphs} (1071)
3001          \p{Script: Egyptian_Hieroglyphs} (Short: \p{Sc=Egyp}, \p{Egyp})
3002                                    (1071)
3003          \p{Script: Ethi}        \p{Script=Ethiopic} (495)
3004          \p{Script: Ethiopic}    (Short: \p{Sc=Ethi}, \p{Ethi}) (495)
3005          \p{Script: Geor}        \p{Script=Georgian} (127)
3006          \p{Script: Georgian}    (Short: \p{Sc=Geor}, \p{Geor}) (127)
3007          \p{Script: Glag}        \p{Script=Glagolitic} (94)
3008          \p{Script: Glagolitic}  (Short: \p{Sc=Glag}, \p{Glag}) (94)
3009          \p{Script: Goth}        \p{Script=Gothic} (27)
3010          \p{Script: Gothic}      (Short: \p{Sc=Goth}, \p{Goth}) (27)
3011          \p{Script: Greek}       (Short: \p{Sc=Grek}, \p{Grek}) (511)
3012          \p{Script: Grek}        \p{Script=Greek} (511)
3013          \p{Script: Gujarati}    (Short: \p{Sc=Gujr}, \p{Gujr}) (84)
3014          \p{Script: Gujr}        \p{Script=Gujarati} (84)
3015          \p{Script: Gurmukhi}    (Short: \p{Sc=Guru}, \p{Guru}) (79)
3016          \p{Script: Guru}        \p{Script=Gurmukhi} (79)
3017          \p{Script: Han}         (Short: \p{Sc=Han}, \p{Han}) (75_963)
3018          \p{Script: Hang}        \p{Script=Hangul} (11_739)
3019          \p{Script: Hangul}      (Short: \p{Sc=Hang}, \p{Hang}) (11_739)
3020          \p{Script: Hani}        \p{Script=Han} (75_963)
3021          \p{Script: Hano}        \p{Script=Hanunoo} (21)
3022          \p{Script: Hanunoo}     (Short: \p{Sc=Hano}, \p{Hano}) (21)
3023          \p{Script: Hebr}        \p{Script=Hebrew} (133)
3024          \p{Script: Hebrew}      (Short: \p{Sc=Hebr}, \p{Hebr}) (133)
3025          \p{Script: Hira}        \p{Script=Hiragana} (91)
3026          \p{Script: Hiragana}    (Short: \p{Sc=Hira}, \p{Hira}) (91)
3027          \p{Script: Imperial_Aramaic} (Short: \p{Sc=Armi}, \p{Armi}) (31)
3028          \p{Script: Inherited}   (Short: \p{Sc=Zinh}, \p{Zinh}) (524)
3029          \p{Script: Inscriptional_Pahlavi} (Short: \p{Sc=Phli}, \p{Phli})
3030                                    (27)
3031          \p{Script: Inscriptional_Parthian} (Short: \p{Sc=Prti}, \p{Prti})
3032                                    (30)
3033          \p{Script: Ital}        \p{Script=Old_Italic} (35)
3034          \p{Script: Java}        \p{Script=Javanese} (91)
3035          \p{Script: Javanese}    (Short: \p{Sc=Java}, \p{Java}) (91)
3036          \p{Script: Kaithi}      (Short: \p{Sc=Kthi}, \p{Kthi}) (66)
3037          \p{Script: Kali}        \p{Script=Kayah_Li} (48)
3038          \p{Script: Kana}        \p{Script=Katakana} (300)
3039          \p{Script: Kannada}     (Short: \p{Sc=Knda}, \p{Knda}) (86)
3040          \p{Script: Katakana}    (Short: \p{Sc=Kana}, \p{Kana}) (300)
3041          \p{Script: Kayah_Li}    (Short: \p{Sc=Kali}, \p{Kali}) (48)
3042          \p{Script: Khar}        \p{Script=Kharoshthi} (65)
3043          \p{Script: Kharoshthi}  (Short: \p{Sc=Khar}, \p{Khar}) (65)
3044          \p{Script: Khmer}       (Short: \p{Sc=Khmr}, \p{Khmr}) (146)
3045          \p{Script: Khmr}        \p{Script=Khmer} (146)
3046          \p{Script: Knda}        \p{Script=Kannada} (86)
3047          \p{Script: Kthi}        \p{Script=Kaithi} (66)
3048          \p{Script: Lana}        \p{Script=Tai_Tham} (127)
3049          \p{Script: Lao}         (Short: \p{Sc=Lao}, \p{Lao}) (67)
3050          \p{Script: Laoo}        \p{Script=Lao} (67)
3051          \p{Script: Latin}       (Short: \p{Sc=Latn}, \p{Latn}) (1272)
3052          \p{Script: Latn}        \p{Script=Latin} (1272)
3053          \p{Script: Lepc}        \p{Script=Lepcha} (74)
3054          \p{Script: Lepcha}      (Short: \p{Sc=Lepc}, \p{Lepc}) (74)
3055          \p{Script: Limb}        \p{Script=Limbu} (66)
3056          \p{Script: Limbu}       (Short: \p{Sc=Limb}, \p{Limb}) (66)
3057          \p{Script: Linb}        \p{Script=Linear_B} (211)
3058          \p{Script: Linear_B}    (Short: \p{Sc=Linb}, \p{Linb}) (211)
3059          \p{Script: Lisu}        (Short: \p{Sc=Lisu}, \p{Lisu}) (48)
3060          \p{Script: Lyci}        \p{Script=Lycian} (29)
3061          \p{Script: Lycian}      (Short: \p{Sc=Lyci}, \p{Lyci}) (29)
3062          \p{Script: Lydi}        \p{Script=Lydian} (27)
3063          \p{Script: Lydian}      (Short: \p{Sc=Lydi}, \p{Lydi}) (27)
3064          \p{Script: Malayalam}   (Short: \p{Sc=Mlym}, \p{Mlym}) (98)
3065          \p{Script: Mand}        \p{Script=Mandaic} (29)
3066          \p{Script: Mandaic}     (Short: \p{Sc=Mand}, \p{Mand}) (29)
3067          \p{Script: Meetei_Mayek} (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
3068          \p{Script: Merc}        \p{Script=Meroitic_Cursive} (26)
3069          \p{Script: Mero}        \p{Script=Meroitic_Hieroglyphs} (32)
3070          \p{Script: Meroitic_Cursive} (Short: \p{Sc=Merc}, \p{Merc}) (26)
3071          \p{Script: Meroitic_Hieroglyphs} (Short: \p{Sc=Mero}, \p{Mero})
3072                                    (32)
3073          \p{Script: Miao}        (Short: \p{Sc=Miao}, \p{Miao}) (133)
3074          \p{Script: Mlym}        \p{Script=Malayalam} (98)
3075          \p{Script: Mong}        \p{Script=Mongolian} (153)
3076          \p{Script: Mongolian}   (Short: \p{Sc=Mong}, \p{Mong}) (153)
3077          \p{Script: Mtei}        \p{Script=Meetei_Mayek} (79)
3078          \p{Script: Myanmar}     (Short: \p{Sc=Mymr}, \p{Mymr}) (188)
3079          \p{Script: Mymr}        \p{Script=Myanmar} (188)
3080          \p{Script: New_Tai_Lue} (Short: \p{Sc=Talu}, \p{Talu}) (83)
3081          \p{Script: Nko}         (Short: \p{Sc=Nko}, \p{Nko}) (59)
3082          \p{Script: Nkoo}        \p{Script=Nko} (59)
3083          \p{Script: Ogam}        \p{Script=Ogham} (29)
3084          \p{Script: Ogham}       (Short: \p{Sc=Ogam}, \p{Ogam}) (29)
3085          \p{Script: Ol_Chiki}    (Short: \p{Sc=Olck}, \p{Olck}) (48)
3086          \p{Script: Olck}        \p{Script=Ol_Chiki} (48)
3087          \p{Script: Old_Italic}  (Short: \p{Sc=Ital}, \p{Ital}) (35)
3088          \p{Script: Old_Persian} (Short: \p{Sc=Xpeo}, \p{Xpeo}) (50)
3089          \p{Script: Old_South_Arabian} (Short: \p{Sc=Sarb}, \p{Sarb}) (32)
3090          \p{Script: Old_Turkic}  (Short: \p{Sc=Orkh}, \p{Orkh}) (73)
3091          \p{Script: Oriya}       (Short: \p{Sc=Orya}, \p{Orya}) (90)
3092          \p{Script: Orkh}        \p{Script=Old_Turkic} (73)
3093          \p{Script: Orya}        \p{Script=Oriya} (90)
3094          \p{Script: Osma}        \p{Script=Osmanya} (40)
3095          \p{Script: Osmanya}     (Short: \p{Sc=Osma}, \p{Osma}) (40)
3096          \p{Script: Phag}        \p{Script=Phags_Pa} (56)
3097          \p{Script: Phags_Pa}    (Short: \p{Sc=Phag}, \p{Phag}) (56)
3098          \p{Script: Phli}        \p{Script=Inscriptional_Pahlavi} (27)
3099          \p{Script: Phnx}        \p{Script=Phoenician} (29)
3100          \p{Script: Phoenician}  (Short: \p{Sc=Phnx}, \p{Phnx}) (29)
3101          \p{Script: Plrd}        \p{Script=Miao} (133)
3102          \p{Script: Prti}        \p{Script=Inscriptional_Parthian} (30)
3103          \p{Script: Qaac}        \p{Script=Coptic} (137)
3104          \p{Script: Qaai}        \p{Script=Inherited} (524)
3105          \p{Script: Rejang}      (Short: \p{Sc=Rjng}, \p{Rjng}) (37)
3106          \p{Script: Rjng}        \p{Script=Rejang} (37)
3107          \p{Script: Runic}       (Short: \p{Sc=Runr}, \p{Runr}) (78)
3108          \p{Script: Runr}        \p{Script=Runic} (78)
3109          \p{Script: Samaritan}   (Short: \p{Sc=Samr}, \p{Samr}) (61)
3110          \p{Script: Samr}        \p{Script=Samaritan} (61)
3111          \p{Script: Sarb}        \p{Script=Old_South_Arabian} (32)
3112          \p{Script: Saur}        \p{Script=Saurashtra} (81)
3113          \p{Script: Saurashtra}  (Short: \p{Sc=Saur}, \p{Saur}) (81)
3114          \p{Script: Sharada}     (Short: \p{Sc=Shrd}, \p{Shrd}) (83)
3115          \p{Script: Shavian}     (Short: \p{Sc=Shaw}, \p{Shaw}) (48)
3116          \p{Script: Shaw}        \p{Script=Shavian} (48)
3117          \p{Script: Shrd}        \p{Script=Sharada} (83)
3118          \p{Script: Sinh}        \p{Script=Sinhala} (80)
3119          \p{Script: Sinhala}     (Short: \p{Sc=Sinh}, \p{Sinh}) (80)
3120          \p{Script: Sora}        \p{Script=Sora_Sompeng} (35)
3121          \p{Script: Sora_Sompeng} (Short: \p{Sc=Sora}, \p{Sora}) (35)
3122          \p{Script: Sund}        \p{Script=Sundanese} (72)
3123          \p{Script: Sundanese}   (Short: \p{Sc=Sund}, \p{Sund}) (72)
3124          \p{Script: Sylo}        \p{Script=Syloti_Nagri} (44)
3125          \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}, \p{Sylo}) (44)
3126          \p{Script: Syrc}        \p{Script=Syriac} (77)
3127          \p{Script: Syriac}      (Short: \p{Sc=Syrc}, \p{Syrc}) (77)
3128          \p{Script: Tagalog}     (Short: \p{Sc=Tglg}, \p{Tglg}) (20)
3129          \p{Script: Tagb}        \p{Script=Tagbanwa} (18)
3130          \p{Script: Tagbanwa}    (Short: \p{Sc=Tagb}, \p{Tagb}) (18)
3131          \p{Script: Tai_Le}      (Short: \p{Sc=Tale}, \p{Tale}) (35)
3132          \p{Script: Tai_Tham}    (Short: \p{Sc=Lana}, \p{Lana}) (127)
3133          \p{Script: Tai_Viet}    (Short: \p{Sc=Tavt}, \p{Tavt}) (72)
3134          \p{Script: Takr}        \p{Script=Takri} (66)
3135          \p{Script: Takri}       (Short: \p{Sc=Takr}, \p{Takr}) (66)
3136          \p{Script: Tale}        \p{Script=Tai_Le} (35)
3137          \p{Script: Talu}        \p{Script=New_Tai_Lue} (83)
3138          \p{Script: Tamil}       (Short: \p{Sc=Taml}, \p{Taml}) (72)
3139          \p{Script: Taml}        \p{Script=Tamil} (72)
3140          \p{Script: Tavt}        \p{Script=Tai_Viet} (72)
3141          \p{Script: Telu}        \p{Script=Telugu} (93)
3142          \p{Script: Telugu}      (Short: \p{Sc=Telu}, \p{Telu}) (93)
3143          \p{Script: Tfng}        \p{Script=Tifinagh} (59)
3144          \p{Script: Tglg}        \p{Script=Tagalog} (20)
3145          \p{Script: Thaa}        \p{Script=Thaana} (50)
3146          \p{Script: Thaana}      (Short: \p{Sc=Thaa}, \p{Thaa}) (50)
3147          \p{Script: Thai}        (Short: \p{Sc=Thai}, \p{Thai}) (86)
3148          \p{Script: Tibetan}     (Short: \p{Sc=Tibt}, \p{Tibt}) (207)
3149          \p{Script: Tibt}        \p{Script=Tibetan} (207)
3150          \p{Script: Tifinagh}    (Short: \p{Sc=Tfng}, \p{Tfng}) (59)
3151          \p{Script: Ugar}        \p{Script=Ugaritic} (31)
3152          \p{Script: Ugaritic}    (Short: \p{Sc=Ugar}, \p{Ugar}) (31)
3153          \p{Script: Unknown}     (Short: \p{Sc=Zzzz}, \p{Zzzz}) (1_003_931)
3154          \p{Script: Vai}         (Short: \p{Sc=Vai}, \p{Vai}) (300)
3155          \p{Script: Vaii}        \p{Script=Vai} (300)
3156          \p{Script: Xpeo}        \p{Script=Old_Persian} (50)
3157          \p{Script: Xsux}        \p{Script=Cuneiform} (982)
3158          \p{Script: Yi}          (Short: \p{Sc=Yi}, \p{Yi}) (1220)
3159          \p{Script: Yiii}        \p{Script=Yi} (1220)
3160          \p{Script: Zinh}        \p{Script=Inherited} (524)
3161          \p{Script: Zyyy}        \p{Script=Common} (6412)
3162          \p{Script: Zzzz}        \p{Script=Unknown} (1_003_931)
3163          \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1261)
3164          \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}) (1261)
3165          \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}) (92)
3166          \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
3167                                    (31)
3168          \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (92)
3169          \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}) (61)
3170          \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
3171          \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
3172          \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}) (121)
3173          \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
3174          \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}) (657)
3175          \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}) (56)
3176          \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
3177          \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (94)
3178          \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}) (94)
3179          \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (306)
3180          \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}) (306)
3181          \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (108)
3182          \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}) (108)
3183          \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
3184          \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}) (256)
3185          \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (30)
3186          \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}) (30)
3187          \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
3188          \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}) (22)
3189          \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (67)
3190          \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans})
3191                                    (710)
3192          \p{Script_Extensions: Cans} \p{Script_Extensions=
3193                                    Canadian_Aboriginal} (710)
3194          \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
3195          \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}) (49)
3196          \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}) (67)
3197          \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}) (83)
3198          \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (85)
3199          \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}) (85)
3200          \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}) (6059)
3201          \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (137)
3202          \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}) (137)
3203          \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
3204          \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}) (982)
3205          \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}) (112)
3206          \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}) (417)
3207          \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (417)
3208          \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}) (80)
3209          \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (163)
3210          \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}) (163)
3211          \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
3212          \p{Script_Extensions: Egyp} \p{Script_Extensions=
3213                                    Egyptian_Hieroglyphs} (1071)
3214          \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp})
3215                                    (1071)
3216          \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
3217          \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}) (495)
3218          \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (128)
3219          \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}) (128)
3220          \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (94)
3221          \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}) (94)
3222          \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
3223          \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}) (27)
3224          \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}) (511)
3225          \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (511)
3226          \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}) (94)
3227          \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (94)
3228          \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}) (91)
3229          \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (91)
3230          \p{Script_Extensions: Han} (Short: \p{Scx=Han}) (76_218)
3231          \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_971)
3232          \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}) (11_971)
3233          \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (76_218)
3234          \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
3235          \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}) (23)
3236          \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (133)
3237          \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}) (133)
3238          \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (356)
3239          \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}) (356)
3240          \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi}) (31)
3241          \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}) (506)
3242          \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli})
3243                                    (27)
3244          \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
3245                                    Prti}) (30)
3246          \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (35)
3247          \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
3248          \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}) (91)
3249          \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}) (76)
3250          \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
3251          \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (565)
3252          \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}) (86)
3253          \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}) (565)
3254          \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}) (48)
3255          \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (65)
3256          \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}) (65)
3257          \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}) (146)
3258          \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
3259          \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (86)
3260          \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (76)
3261          \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
3262          \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}) (67)
3263          \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (67)
3264          \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}) (1272)
3265          \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1272)
3266          \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
3267          \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}) (74)
3268          \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (66)
3269          \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}) (66)
3270          \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
3271          \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}) (268)
3272          \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}) (48)
3273          \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
3274          \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}) (29)
3275          \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
3276          \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}) (27)
3277          \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}) (98)
3278          \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
3279          \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}) (30)
3280          \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei}) (79)
3281          \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
3282                                    (26)
3283          \p{Script_Extensions: Mero} \p{Script_Extensions=
3284                                    Meroitic_Hieroglyphs} (32)
3285          \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc}) (26)
3286          \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero})
3287                                    (32)
3288          \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}) (133)
3289          \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (98)
3290          \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (156)
3291          \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}) (156)
3292          \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
3293          \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}) (188)
3294          \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (188)
3295          \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}) (83)
3296          \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}) (59)
3297          \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (59)
3298          \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
3299          \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}) (29)
3300          \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}) (48)
3301          \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
3302          \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}) (35)
3303          \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}) (50)
3304          \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb}) (32)
3305          \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}) (73)
3306          \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}) (92)
3307          \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
3308          \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (92)
3309          \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
3310          \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}) (40)
3311          \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
3312          \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}) (59)
3313          \p{Script_Extensions: Phli} \p{Script_Extensions=
3314                                    Inscriptional_Pahlavi} (27)
3315          \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
3316          \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}) (29)
3317          \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (133)
3318          \p{Script_Extensions: Prti} \p{Script_Extensions=
3319                                    Inscriptional_Parthian} (30)
3320          \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (137)
3321          \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (506)
3322          \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}) (37)
3323          \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
3324          \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}) (78)
3325          \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (78)
3326          \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}) (61)
3327          \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
3328          \p{Script_Extensions: Sarb} \p{Script_Extensions=
3329                                    Old_South_Arabian} (32)
3330          \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (81)
3331          \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}) (81)
3332          \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}) (83)
3333          \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}) (48)
3334          \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
3335          \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (83)
3336          \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (80)
3337          \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}) (80)
3338          \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
3339          \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora}) (35)
3340          \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
3341          \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}) (72)
3342          \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (44)
3343          \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo}) (44)
3344          \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (93)
3345          \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}) (93)
3346          \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}) (22)
3347          \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
3348          \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}) (20)
3349          \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}) (35)
3350          \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}) (127)
3351          \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}) (72)
3352          \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (78)
3353          \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}) (78)
3354          \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (35)
3355          \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
3356          \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}) (72)
3357          \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (72)
3358          \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
3359          \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (93)
3360          \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}) (93)
3361          \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
3362          \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
3363          \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (65)
3364          \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}) (65)
3365          \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}) (86)
3366          \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}) (207)
3367          \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
3368          \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}) (59)
3369          \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
3370          \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}) (31)
3371          \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}) (1_003_931)
3372          \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}) (300)
3373          \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
3374          \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
3375          \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (982)
3376          \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}) (1246)
3377          \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
3378          \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (506)
3379          \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (6059)
3380          \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown}
3381                                    (1_003_931)
3382          \p{Scx: *}              \p{Script_Extensions: *}
3383          \p{SD}                  \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
3384          \p{SD: *}               \p{Soft_Dotted: *}
3385          \p{Sentence_Break: AT}  \p{Sentence_Break=ATerm} (4)
3386          \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
3387          \p{Sentence_Break: CL}  \p{Sentence_Break=Close} (177)
3388          \p{Sentence_Break: Close} (Short: \p{SB=CL}) (177)
3389          \p{Sentence_Break: CR}  (Short: \p{SB=CR}) (1)
3390          \p{Sentence_Break: EX}  \p{Sentence_Break=Extend} (1649)
3391          \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (1649)
3392          \p{Sentence_Break: FO}  \p{Sentence_Break=Format} (137)
3393          \p{Sentence_Break: Format} (Short: \p{SB=FO}) (137)
3394          \p{Sentence_Break: LE}  \p{Sentence_Break=OLetter} (97_841)
3395          \p{Sentence_Break: LF}  (Short: \p{SB=LF}) (1)
3396          \p{Sentence_Break: LO}  \p{Sentence_Break=Lower} (1933)
3397          \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (1933)
3398          \p{Sentence_Break: NU}  \p{Sentence_Break=Numeric} (452)
3399          \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (452)
3400          \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (97_841)
3401          \p{Sentence_Break: Other} (Short: \p{SB=XX}) (1_010_273)
3402          \p{Sentence_Break: SC}  \p{Sentence_Break=SContinue} (26)
3403          \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
3404          \p{Sentence_Break: SE}  \p{Sentence_Break=Sep} (3)
3405          \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
3406          \p{Sentence_Break: Sp}  (Short: \p{SB=Sp}) (21)
3407          \p{Sentence_Break: ST}  \p{Sentence_Break=STerm} (80)
3408          \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (80)
3409          \p{Sentence_Break: UP}  \p{Sentence_Break=Upper} (1514)
3410          \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1514)
3411          \p{Sentence_Break: XX}  \p{Sentence_Break=Other} (1_010_273)
3412          \p{Separator}           \p{General_Category=Separator} (Short:
3413                                    \p{Z}) (20)
3414          \p{Sharada}             \p{Script=Sharada} (Short: \p{Shrd}; NOT
3415                                    \p{Block=Sharada}) (83)
3416          \p{Shavian}             \p{Script=Shavian} (Short: \p{Shaw}) (48)
3417          \p{Shaw}                \p{Shavian} (= \p{Script=Shavian}) (48)
3418          \p{Shrd}                \p{Sharada} (= \p{Script=Sharada}) (NOT
3419                                    \p{Block=Sharada}) (83)
3420          \p{Sinh}                \p{Sinhala} (= \p{Script=Sinhala}) (NOT
3421                                    \p{Block=Sinhala}) (80)
3422          \p{Sinhala}             \p{Script=Sinhala} (Short: \p{Sinh}; NOT
3423                                    \p{Block=Sinhala}) (80)
3424          \p{Sk}                  \p{Modifier_Symbol} (=
3425                                    \p{General_Category=Modifier_Symbol})
3426                                    (115)
3427          \p{Sm}                  \p{Math_Symbol} (= \p{General_Category=
3428                                    Math_Symbol}) (952)
3429        X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
3430                                    \p{InSmallForms}) (32)
3431        X \p{Small_Forms}         \p{Small_Form_Variants} (= \p{Block=
3432                                    Small_Form_Variants}) (32)
3433          \p{So}                  \p{Other_Symbol} (= \p{General_Category=
3434                                    Other_Symbol}) (4404)
3435          \p{Soft_Dotted}         \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
3436          \p{Soft_Dotted: N*}     (Short: \p{SD=N}, \P{SD}) (1_114_066)
3437          \p{Soft_Dotted: Y*}     (Short: \p{SD=Y}, \p{SD}) (46)
3438          \p{Sora}                \p{Sora_Sompeng} (= \p{Script=
3439                                    Sora_Sompeng}) (NOT \p{Block=
3440                                    Sora_Sompeng}) (35)
3441          \p{Sora_Sompeng}        \p{Script=Sora_Sompeng} (Short: \p{Sora};
3442                                    NOT \p{Block=Sora_Sompeng}) (35)
3443          \p{Space}               \p{White_Space=Y} \s including beyond
3444                                    ASCII plus vertical tab (26)
3445          \p{Space: *}            \p{White_Space: *}
3446          \p{Space_Separator}     \p{General_Category=Space_Separator}
3447                                    (Short: \p{Zs}) (18)
3448          \p{SpacePerl}           \p{XPerlSpace} (25)
3449          \p{Spacing_Mark}        \p{General_Category=Spacing_Mark} (Short:
3450                                    \p{Mc}) (353)
3451        X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
3452                                    (Short: \p{InModifierLetters}) (80)
3453        X \p{Specials}            \p{Block=Specials} (16)
3454          \p{STerm}               \p{STerm=Y} (83)
3455          \p{STerm: N*}           (Single: \P{STerm}) (1_114_029)
3456          \p{STerm: Y*}           (Single: \p{STerm}) (83)
3457          \p{Sund}                \p{Sundanese} (= \p{Script=Sundanese})
3458                                    (NOT \p{Block=Sundanese}) (72)
3459          \p{Sundanese}           \p{Script=Sundanese} (Short: \p{Sund}; NOT
3460                                    \p{Block=Sundanese}) (72)
3461        X \p{Sundanese_Sup}       \p{Sundanese_Supplement} (= \p{Block=
3462                                    Sundanese_Supplement}) (16)
3463        X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
3464                                    \p{InSundaneseSup}) (16)
3465        X \p{Sup_Arrows_A}        \p{Supplemental_Arrows_A} (= \p{Block=
3466                                    Supplemental_Arrows_A}) (16)
3467        X \p{Sup_Arrows_B}        \p{Supplemental_Arrows_B} (= \p{Block=
3468                                    Supplemental_Arrows_B}) (128)
3469        X \p{Sup_Math_Operators}  \p{Supplemental_Mathematical_Operators} (=
3470                                    \p{Block=
3471                                    Supplemental_Mathematical_Operators})
3472                                    (256)
3473        X \p{Sup_PUA_A}           \p{Supplementary_Private_Use_Area_A} (=
3474                                    \p{Block=
3475                                    Supplementary_Private_Use_Area_A})
3476                                    (65_536)
3477        X \p{Sup_PUA_B}           \p{Supplementary_Private_Use_Area_B} (=
3478                                    \p{Block=
3479                                    Supplementary_Private_Use_Area_B})
3480                                    (65_536)
3481        X \p{Sup_Punctuation}     \p{Supplemental_Punctuation} (= \p{Block=
3482                                    Supplemental_Punctuation}) (128)
3483        X \p{Super_And_Sub}       \p{Superscripts_And_Subscripts} (=
3484                                    \p{Block=Superscripts_And_Subscripts})
3485                                    (48)
3486        X \p{Superscripts_And_Subscripts} \p{Block=
3487                                    Superscripts_And_Subscripts} (Short:
3488                                    \p{InSuperAndSub}) (48)
3489        X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
3490                                    \p{InSupArrowsA}) (16)
3491        X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
3492                                    \p{InSupArrowsB}) (128)
3493        X \p{Supplemental_Mathematical_Operators} \p{Block=
3494                                    Supplemental_Mathematical_Operators}
3495                                    (Short: \p{InSupMathOperators}) (256)
3496        X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
3497                                    (Short: \p{InSupPunctuation}) (128)
3498        X \p{Supplementary_Private_Use_Area_A} \p{Block=
3499                                    Supplementary_Private_Use_Area_A}
3500                                    (Short: \p{InSupPUAA}) (65_536)
3501        X \p{Supplementary_Private_Use_Area_B} \p{Block=
3502                                    Supplementary_Private_Use_Area_B}
3503                                    (Short: \p{InSupPUAB}) (65_536)
3504          \p{Surrogate}           \p{General_Category=Surrogate} (Short:
3505                                    \p{Cs}) (2048)
3506          \p{Sylo}                \p{Syloti_Nagri} (= \p{Script=
3507                                    Syloti_Nagri}) (NOT \p{Block=
3508                                    Syloti_Nagri}) (44)
3509          \p{Syloti_Nagri}        \p{Script=Syloti_Nagri} (Short: \p{Sylo};
3510                                    NOT \p{Block=Syloti_Nagri}) (44)
3511          \p{Symbol}              \p{General_Category=Symbol} (Short: \p{S})
3512                                    (5519)
3513          \p{Syrc}                \p{Syriac} (= \p{Script=Syriac}) (NOT
3514                                    \p{Block=Syriac}) (77)
3515          \p{Syriac}              \p{Script=Syriac} (Short: \p{Syrc}; NOT
3516                                    \p{Block=Syriac}) (77)
3517          \p{Tagalog}             \p{Script=Tagalog} (Short: \p{Tglg}; NOT
3518                                    \p{Block=Tagalog}) (20)
3519          \p{Tagb}                \p{Tagbanwa} (= \p{Script=Tagbanwa}) (NOT
3520                                    \p{Block=Tagbanwa}) (18)
3521          \p{Tagbanwa}            \p{Script=Tagbanwa} (Short: \p{Tagb}; NOT
3522                                    \p{Block=Tagbanwa}) (18)
3523        X \p{Tags}                \p{Block=Tags} (128)
3524          \p{Tai_Le}              \p{Script=Tai_Le} (Short: \p{Tale}; NOT
3525                                    \p{Block=Tai_Le}) (35)
3526          \p{Tai_Tham}            \p{Script=Tai_Tham} (Short: \p{Lana}; NOT
3527                                    \p{Block=Tai_Tham}) (127)
3528          \p{Tai_Viet}            \p{Script=Tai_Viet} (Short: \p{Tavt}; NOT
3529                                    \p{Block=Tai_Viet}) (72)
3530        X \p{Tai_Xuan_Jing}       \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
3531                                    Tai_Xuan_Jing_Symbols}) (96)
3532        X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
3533                                    \p{InTaiXuanJing}) (96)
3534          \p{Takr}                \p{Takri} (= \p{Script=Takri}) (NOT
3535                                    \p{Block=Takri}) (66)
3536          \p{Takri}               \p{Script=Takri} (Short: \p{Takr}; NOT
3537                                    \p{Block=Takri}) (66)
3538          \p{Tale}                \p{Tai_Le} (= \p{Script=Tai_Le}) (NOT
3539                                    \p{Block=Tai_Le}) (35)
3540          \p{Talu}                \p{New_Tai_Lue} (= \p{Script=New_Tai_Lue})
3541                                    (NOT \p{Block=New_Tai_Lue}) (83)
3542          \p{Tamil}               \p{Script=Tamil} (Short: \p{Taml}; NOT
3543                                    \p{Block=Tamil}) (72)
3544          \p{Taml}                \p{Tamil} (= \p{Script=Tamil}) (NOT
3545                                    \p{Block=Tamil}) (72)
3546          \p{Tavt}                \p{Tai_Viet} (= \p{Script=Tai_Viet}) (NOT
3547                                    \p{Block=Tai_Viet}) (72)
3548          \p{Telu}                \p{Telugu} (= \p{Script=Telugu}) (NOT
3549                                    \p{Block=Telugu}) (93)
3550          \p{Telugu}              \p{Script=Telugu} (Short: \p{Telu}; NOT
3551                                    \p{Block=Telugu}) (93)
3552          \p{Term}                \p{Terminal_Punctuation} (=
3553                                    \p{Terminal_Punctuation=Y}) (176)
3554          \p{Term: *}             \p{Terminal_Punctuation: *}
3555          \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
3556                                    \p{Term}) (176)
3557          \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
3558                                    (1_113_936)
3559          \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (176)
3560          \p{Tfng}                \p{Tifinagh} (= \p{Script=Tifinagh}) (NOT
3561                                    \p{Block=Tifinagh}) (59)
3562          \p{Tglg}                \p{Tagalog} (= \p{Script=Tagalog}) (NOT
3563                                    \p{Block=Tagalog}) (20)
3564          \p{Thaa}                \p{Thaana} (= \p{Script=Thaana}) (NOT
3565                                    \p{Block=Thaana}) (50)
3566          \p{Thaana}              \p{Script=Thaana} (Short: \p{Thaa}; NOT
3567                                    \p{Block=Thaana}) (50)
3568          \p{Thai}                \p{Script=Thai} (NOT \p{Block=Thai}) (86)
3569          \p{Tibetan}             \p{Script=Tibetan} (Short: \p{Tibt}; NOT
3570                                    \p{Block=Tibetan}) (207)
3571          \p{Tibt}                \p{Tibetan} (= \p{Script=Tibetan}) (NOT
3572                                    \p{Block=Tibetan}) (207)
3573          \p{Tifinagh}            \p{Script=Tifinagh} (Short: \p{Tfng}; NOT
3574                                    \p{Block=Tifinagh}) (59)
3575          \p{Title}               \p{Titlecase} (/i= Cased=Yes) (31)
3576          \p{Titlecase}           (= \p{Gc=Lt}) (Short: \p{Title}; /i=
3577                                    Cased=Yes) (31)
3578          \p{Titlecase_Letter}    \p{General_Category=Titlecase_Letter}
3579                                    (Short: \p{Lt}; /i= General_Category=
3580                                    Cased_Letter) (31)
3581        X \p{Transport_And_Map}   \p{Transport_And_Map_Symbols} (= \p{Block=
3582                                    Transport_And_Map_Symbols}) (128)
3583        X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
3584                                    (Short: \p{InTransportAndMap}) (128)
3585        X \p{UCAS}                \p{Unified_Canadian_Aboriginal_Syllabics}
3586                                    (= \p{Block=
3587                                    Unified_Canadian_Aboriginal_Syllabics})
3588                                    (640)
3589        X \p{UCAS_Ext}            \p{Unified_Canadian_Aboriginal_Syllabics_-
3590                                    Extended} (= \p{Block=
3591                                    Unified_Canadian_Aboriginal_Syllabics_-
3592                                    Extended}) (80)
3593          \p{Ugar}                \p{Ugaritic} (= \p{Script=Ugaritic}) (NOT
3594                                    \p{Block=Ugaritic}) (31)
3595          \p{Ugaritic}            \p{Script=Ugaritic} (Short: \p{Ugar}; NOT
3596                                    \p{Block=Ugaritic}) (31)
3597          \p{UIdeo}               \p{Unified_Ideograph} (=
3598                                    \p{Unified_Ideograph=Y}) (74_617)
3599          \p{UIdeo: *}            \p{Unified_Ideograph: *}
3600          \p{Unassigned}          \p{General_Category=Unassigned} (Short:
3601                                    \p{Cn}) (864_415)
3602        X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
3603                                    Unified_Canadian_Aboriginal_Syllabics}
3604                                    (Short: \p{InUCAS}) (640)
3605        X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
3606                                    Unified_Canadian_Aboriginal_Syllabics_-
3607                                    Extended} (Short: \p{InUCASExt}) (80)
3608          \p{Unified_Ideograph}   \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
3609                                    (74_617)
3610          \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
3611                                    (1_039_495)
3612          \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (74_617)
3613          \p{Unknown}             \p{Script=Unknown} (Short: \p{Zzzz})
3614                                    (1_003_931)
3615          \p{Upper}               \p{Uppercase=Y} (/i= Cased=Yes) (1483)
3616          \p{Upper: *}            \p{Uppercase: *}
3617          \p{Uppercase}           \p{Upper} (= \p{Uppercase=Y}) (/i= Cased=
3618                                    Yes) (1483)
3619          \p{Uppercase: N*}       (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
3620                                    No) (1_112_629)
3621          \p{Uppercase: Y*}       (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
3622                                    Yes) (1483)
3623          \p{Uppercase_Letter}    \p{General_Category=Uppercase_Letter}
3624                                    (Short: \p{Lu}; /i= General_Category=
3625                                    Cased_Letter) (1441)
3626          \p{Vai}                 \p{Script=Vai} (NOT \p{Block=Vai}) (300)
3627          \p{Vaii}                \p{Vai} (= \p{Script=Vai}) (NOT \p{Block=
3628                                    Vai}) (300)
3629          \p{Variation_Selector}  \p{Variation_Selector=Y} (Short: \p{VS};
3630                                    NOT \p{Variation_Selectors}) (259)
3631          \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853)
3632          \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
3633        X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
3634                                    \p{InVS}) (16)
3635        X \p{Variation_Selectors_Supplement} \p{Block=
3636                                    Variation_Selectors_Supplement} (Short:
3637                                    \p{InVSSup}) (240)
3638        X \p{Vedic_Ext}           \p{Vedic_Extensions} (= \p{Block=
3639                                    Vedic_Extensions}) (48)
3640        X \p{Vedic_Extensions}    \p{Block=Vedic_Extensions} (Short:
3641                                    \p{InVedicExt}) (48)
3642        X \p{Vertical_Forms}      \p{Block=Vertical_Forms} (16)
3643          \p{VertSpace}           \v (7)
3644          \p{VS}                  \p{Variation_Selector} (=
3645                                    \p{Variation_Selector=Y}) (NOT
3646                                    \p{Variation_Selectors}) (259)
3647          \p{VS: *}               \p{Variation_Selector: *}
3648        X \p{VS_Sup}              \p{Variation_Selectors_Supplement} (=
3649                                    \p{Block=
3650                                    Variation_Selectors_Supplement}) (240)
3651          \p{WB: *}               \p{Word_Break: *}
3652          \p{White_Space}         \p{White_Space=Y} (Short: \p{WSpace}) (26)
3653          \p{White_Space: N*}     (Short: \p{Space=N}, \P{WSpace})
3654                                    (1_114_086)
3655          \p{White_Space: Y*}     (Short: \p{Space=Y}, \p{WSpace}) (26)
3656          \p{Word}                \w, including beyond ASCII; = \p{Alnum} +
3657                                    \pM + \p{Pc} (103_404)
3658          \p{Word_Break: ALetter} (Short: \p{WB=LE}) (24_941)
3659          \p{Word_Break: CR}      (Short: \p{WB=CR}) (1)
3660          \p{Word_Break: EX}      \p{Word_Break=ExtendNumLet} (10)
3661          \p{Word_Break: Extend}  (Short: \p{WB=Extend}) (1649)
3662          \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (10)
3663          \p{Word_Break: FO}      \p{Word_Break=Format} (136)
3664          \p{Word_Break: Format}  (Short: \p{WB=FO}) (136)
3665          \p{Word_Break: KA}      \p{Word_Break=Katakana} (310)
3666          \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310)
3667          \p{Word_Break: LE}      \p{Word_Break=ALetter} (24_941)
3668          \p{Word_Break: LF}      (Short: \p{WB=LF}) (1)
3669          \p{Word_Break: MB}      \p{Word_Break=MidNumLet} (8)
3670          \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8)
3671          \p{Word_Break: MidNum}  (Short: \p{WB=MN}) (15)
3672          \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (8)
3673          \p{Word_Break: ML}      \p{Word_Break=MidLetter} (8)
3674          \p{Word_Break: MN}      \p{Word_Break=MidNum} (15)
3675          \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
3676          \p{Word_Break: NL}      \p{Word_Break=Newline} (5)
3677          \p{Word_Break: NU}      \p{Word_Break=Numeric} (451)
3678          \p{Word_Break: Numeric} (Short: \p{WB=NU}) (451)
3679          \p{Word_Break: Other}   (Short: \p{WB=XX}) (1_086_577)
3680          \p{Word_Break: XX}      \p{Word_Break=Other} (1_086_577)
3681          \p{WSpace}              \p{White_Space} (= \p{White_Space=Y}) (26)
3682          \p{WSpace: *}           \p{White_Space: *}
3683          \p{XDigit}              \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
3684          \p{XID_Continue}        \p{XID_Continue=Y} (Short: \p{XIDC})
3685                                    (103_336)
3686          \p{XID_Continue: N*}    (Short: \p{XIDC=N}, \P{XIDC}) (1_010_776)
3687          \p{XID_Continue: Y*}    (Short: \p{XIDC=Y}, \p{XIDC}) (103_336)
3688          \p{XID_Start}           \p{XID_Start=Y} (Short: \p{XIDS}) (101_217)
3689          \p{XID_Start: N*}       (Short: \p{XIDS=N}, \P{XIDS}) (1_012_895)
3690          \p{XID_Start: Y*}       (Short: \p{XIDS=Y}, \p{XIDS}) (101_217)
3691          \p{XIDC}                \p{XID_Continue} (= \p{XID_Continue=Y})
3692                                    (103_336)
3693          \p{XIDC: *}             \p{XID_Continue: *}
3694          \p{XIDS}                \p{XID_Start} (= \p{XID_Start=Y}) (101_217)
3695          \p{XIDS: *}             \p{XID_Start: *}
3696          \p{Xpeo}                \p{Old_Persian} (= \p{Script=Old_Persian})
3697                                    (NOT \p{Block=Old_Persian}) (50)
3698          \p{XPerlSpace}          \s, including beyond ASCII (Short:
3699                                    \p{SpacePerl}) (25)
3700          \p{XPosixAlnum}         \p{Alnum} (102_619)
3701          \p{XPosixAlpha}         \p{Alpha} (= \p{Alphabetic=Y}) (102_159)
3702          \p{XPosixBlank}         \p{Blank} (19)
3703          \p{XPosixCntrl}         \p{Cntrl} (= \p{General_Category=Control})
3704                                    (65)
3705          \p{XPosixDigit}         \p{Digit} (= \p{General_Category=
3706                                    Decimal_Number}) (460)
3707          \p{XPosixGraph}         \p{Graph} (247_564)
3708          \p{XPosixLower}         \p{Lower} (= \p{Lowercase=Y}) (/i= Cased=
3709                                    Yes) (1934)
3710          \p{XPosixPrint}         \p{Print} (247_582)
3711          \p{XPosixPunct}         \p{Punct} + ASCII-range \p{Symbol} (641)
3712          \p{XPosixSpace}         \p{Space} (= \p{White_Space=Y}) (26)
3713          \p{XPosixUpper}         \p{Upper} (= \p{Uppercase=Y}) (/i= Cased=
3714                                    Yes) (1483)
3715          \p{XPosixWord}          \p{Word} (103_404)
3716          \p{XPosixXDigit}        \p{XDigit} (= \p{Hex_Digit=Y}) (44)
3717          \p{Xsux}                \p{Cuneiform} (= \p{Script=Cuneiform})
3718                                    (NOT \p{Block=Cuneiform}) (982)
3719          \p{Yi}                  \p{Script=Yi} (1220)
3720        X \p{Yi_Radicals}         \p{Block=Yi_Radicals} (64)
3721        X \p{Yi_Syllables}        \p{Block=Yi_Syllables} (1168)
3722          \p{Yiii}                \p{Yi} (= \p{Script=Yi}) (1220)
3723        X \p{Yijing}              \p{Yijing_Hexagram_Symbols} (= \p{Block=
3724                                    Yijing_Hexagram_Symbols}) (64)
3725        X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
3726                                    (Short: \p{InYijing}) (64)
3727          \p{Z}                   \p{Separator} (= \p{General_Category=
3728                                    Separator}) (20)
3729          \p{Zinh}                \p{Inherited} (= \p{Script=Inherited})
3730                                    (524)
3731          \p{Zl}                  \p{Line_Separator} (= \p{General_Category=
3732                                    Line_Separator}) (1)
3733          \p{Zp}                  \p{Paragraph_Separator} (=
3734                                    \p{General_Category=
3735                                    Paragraph_Separator}) (1)
3736          \p{Zs}                  \p{Space_Separator} (=
3737                                    \p{General_Category=Space_Separator})
3738                                    (18)
3739          \p{Zyyy}                \p{Common} (= \p{Script=Common}) (6412)
3740          \p{Zzzz}                \p{Unknown} (= \p{Script=Unknown})
3741                                    (1_003_931)
3742        TX\p{_CanonDCIJ}          (For internal use by Perl, not necessarily
3743                                    stable) (= \p{Soft_Dotted=Y}) (46)
3744        TX\p{_Case_Ignorable}     (For internal use by Perl, not necessarily
3745                                    stable) (= \p{Case_Ignorable=Y}) (1799)
3746        TX\p{_CombAbove}          (For internal use by Perl, not necessarily
3747                                    stable) (= \p{Canonical_Combining_Class=
3748                                    Above}) (349)
3749
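       As a rough illustration of how to read the annotations in the table
       above, the sketch below tries a few of the listed forms in regular
       expression matches.  The sample characters and the %match hash are
       chosen for this example only and are not part of the table.  It
       assumes a Perl of at least 5.14, where the "/i=" behavior applies.

```perl
use strict;
use warnings;

my $ko_kai = "\x{0E01}";   # THAI CHARACTER KO KAI
my $baht   = "\x{0E3F}";   # THAI CURRENCY SYMBOL BAHT (its Script is Common)

my %match = (
    # Script and Block are different properties ("NOT \p{Block=Thai}").
    ko_kai_script => ($ko_kai =~ /\p{Script=Thai}/ ? 1 : 0),
    ko_kai_block  => ($ko_kai =~ /\p{Block=Thai}/  ? 1 : 0),
    baht_script   => ($baht   =~ /\p{Script=Thai}/ ? 1 : 0),
    baht_block    => ($baht   =~ /\p{Block=Thai}/  ? 1 : 0),

    # Short and long forms name the same property; \P{} negates it.
    i_soft_dotted => ("i" =~ /\p{Soft_Dotted}/ ? 1 : 0),
    i_sd          => ("i" =~ /\p{SD}/          ? 1 : 0),
    i_not_sd      => ("i" =~ /\P{SD}/          ? 1 : 0),

    # "/i= Cased=Yes": under /i, \p{Uppercase} matches any cased character.
    a_upper       => ("a" =~ /\p{Uppercase}/   ? 1 : 0),
    a_upper_i     => ("a" =~ /\p{Uppercase}/i  ? 1 : 0),
);

printf "%-14s %d\n", $_, $match{$_} for sort keys %match;
```

       The baht sign lies in the Thai block but does not have the Thai
       script, which is the kind of difference the "NOT \p{Block=...}"
       notes warn about.
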
3750   Legal "\p{}" and "\P{}" constructs that match no characters
3751       Unicode has some property-value pairs that currently don't match
3752       anything.  This generally happens either because they are obsolete, or
3753       because they exist for symmetry with other forms but no language that
3754       uses them has yet been encoded.  In this version of Unicode, the
3755       following match zero code points:
3756
3757       \p{Canonical_Combining_Class=Attached_Below_Left}
3758       \p{Grapheme_Cluster_Break=Prepend}
3759       \p{Joining_Type=Left_Joining}
3760
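       "Legal" here means that Perl accepts the construct even though it
       matches nothing.  The following is a minimal sketch of that
       distinction, assuming that an unknown property name raises a
       catchable error when the pattern is used.  The helper
       is_legal_property() and the made-up name No_Such_Property_Xyz are
       hypothetical and exist only for this example.

```perl
use strict;
use warnings;

# Returns 1 if Perl accepts \p{$property}, 0 if it rejects the name.
# The pattern is built at run time so that a bad name can be trapped
# by eval instead of aborting compilation of the whole script.
sub is_legal_property {
    my ($property) = @_;
    my $pattern = "\\p{$property}";
    return eval { my $unused = "a" =~ /$pattern/; 1 } ? 1 : 0;
}

for my $property ('Canonical_Combining_Class=Attached_Below_Left',
                  'No_Such_Property_Xyz')      # hypothetical, not a property
{
    printf "%-50s %s\n", $property,
        is_legal_property($property) ? "accepted" : "rejected";
}
```
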

Properties accessible through Unicode::UCD

3762       All the Unicode character properties mentioned above (except for those
3763       marked as for internal use by Perl) are also accessible by
3764       "prop_invlist()" in Unicode::UCD.
3765
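       The parenthesized numbers in the table above are counts of matching
       code points, and an inversion list from "prop_invlist()" can be used
       to recompute them.  The sketch below assumes a Unicode::UCD that
       provides "prop_invlist()" (Perl 5.16 or later); count_code_points()
       is a helper invented for this example, and it treats 0x10FFFF as
       the highest code point.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Count the code points matched by a property.  An inversion list
# alternates between the first code point of a matching range and the
# first code point after it; an odd-length list means the final range
# runs to the end of the code space.  An unknown property yields an
# empty list here, so it is reported as 0, the same as an empty property.
sub count_code_points {
    my ($property) = @_;
    my @invlist = prop_invlist($property);
    my $count = 0;
    for (my $i = 0; $i < @invlist; $i += 2) {
        my $end = $i + 1 < @invlist ? $invlist[$i + 1] - 1 : 0x10FFFF;
        $count += $end - $invlist[$i] + 1;
    }
    return $count;
}

for my $property ('Script=Shavian',
                  'General_Category=Line_Separator',
                  'General_Category=Surrogate')
{
    printf "%-35s %d\n", $property, count_code_points($property);
}
```

       Counts for properties that gain characters in later Unicode
       releases will differ from the ones listed here when a newer Perl
       is used.
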
3766       Due to their nature, not all Unicode character properties are suitable
3767       for regular expression matches or for "prop_invlist()".  The remaining
3768       non-provisional, non-internal ones are accessible via "prop_invmap()"
3769       in Unicode::UCD (except for those that this Perl installation hasn't
3770       included; see below for which those are).
3771
3772       For compatibility with other parts of Perl, all the single forms given
3773       in the table in the section above are recognized.  However, some Perl
3774       extensions are ambiguous with official Unicode properties, and every
3775       such ambiguity is silently resolved in favor of the official Unicode
3776       property.  To avoid surprises, use "prop_invmap()" only for the forms
3777       listed in the table below, which omits the non-recommended ones.  The
3778       affected forms are the Perl single-form equivalents of Unicode
3779       properties.  For example, "\p{sc}" in a regular expression is the
3780       single-form equivalent of "\p{gc=sc}", but "prop_invmap()" treats "sc"
3781       as the short name of the "Script" property.  The table indicates the
3782       current ambiguities in the INFO column, beginning with the word "NOT".
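
       The "sc" ambiguity can be observed directly.  The sketch below is an
       illustration only: the "lookup" helper is hypothetical and uses a
       simple linear scan rather than the binary search a real program would
       probably prefer.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invmap);

# Hypothetical helper: find the value an inversion map assigns to a code
# point.  $list holds the range starts in ascending order (beginning with
# 0) and $map holds the value for each range.
sub lookup {
    my ($list, $map, $cp) = @_;
    my $i = 0;
    $i++ while $i + 1 < @$list && $list->[$i + 1] <= $cp;
    return $map->[$i];
}

# prop_invmap() resolves "sc" as the Script property ...
my ($sc_list,     $sc_map)     = prop_invmap('sc');
my ($script_list, $script_map) = prop_invmap('Script');

my $dollar = ord '$';
print "prop_invmap('sc'):     ", lookup($sc_list, $sc_map, $dollar), "\n";
print "prop_invmap('Script'): ",
      lookup($script_list, $script_map, $dollar), "\n";

# ... whereas in a regular expression \p{Sc} is the single form of
# General_Category=Currency_Symbol, which DOLLAR SIGN belongs to.
print "'\$' matches \\p{Sc}:   ", ('$' =~ /\p{Sc}/ ? "yes" : "no"), "\n";
```

       The first two lines of output should name the same script, showing
       that "prop_invmap()" did not interpret "sc" as a General_Category
       value.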
3783
3784       The standard Unicode properties listed below are documented in
3785       <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
3786       documented in "prop_invmap()" in Unicode::UCD.  The other Perl
3787       extensions are in "Other Properties" in perlunicode.
3788
3789       The first column in the table is a name for the property; the second
3790       column is an alternative name, if any, plus possibly some annotations.
3791       The alternative name is the property's full name, unless that would
3792       simply repeat the first column, in which case the second column
3793       indicates the property's short name (if different).  The annotations
3794       are given only in the entry for the full name.  If a property is
3795       obsolete, etc., the entry will be flagged with the same characters used
3796       in the table in the section above, like D or S.
3797
3798          NAME                      INFO
3799
3800          Age
3801          AHex                    ASCII_Hex_Digit
3802          All                     Any.  (Perl extension)
3803          Alnum                   (Perl extension).  Alphabetic and
3804                                  (decimal) Numeric
3805          Alpha                   Alphabetic
3806          Alphabetic              (Short: Alpha)
3807          Any                     (Perl extension).  [\x{0000}-\x{10FFFF}]
3808          ASCII                   Block=ASCII.  (Perl extension).
3809                                  [[:ASCII:]]
3810          ASCII_Hex_Digit         (Short: AHex)
3811          Assigned                (Perl extension).  All assigned code points
3812          Bc                      Bidi_Class
3813          Bidi_C                  Bidi_Control
3814          Bidi_Class              (Short: bc)
3815          Bidi_Control            (Short: Bidi_C)
3816          Bidi_M                  Bidi_Mirrored
3817          Bidi_Mirrored           (Short: Bidi_M)
3818          Bidi_Mirroring_Glyph    (Short: bmg)
3819          Blank                   (Perl extension).  \h, Horizontal white
3820                                  space
3821          Blk                     Block
3822          Block                   (Short: blk)
3823          Bmg                     Bidi_Mirroring_Glyph
3824          Canonical_Combining_Class (Short: ccc)
3825          Case_Folding            (Short: cf)
3826          Case_Ignorable          (Short: CI)
3827          Cased
3828          Category                General_Category
3829          Ccc                     Canonical_Combining_Class
3830          CE                      Composition_Exclusion
3831          Cf                      Case_Folding; NOT 'cf' meaning
3832                                  'General_Category=Format'
3833          Changes_When_Casefolded (Short: CWCF)
3834          Changes_When_Casemapped (Short: CWCM)
3835          Changes_When_Lowercased (Short: CWL)
3836          Changes_When_NFKC_Casefolded (Short: CWKCF)
3837          Changes_When_Titlecased (Short: CWT)
3838          Changes_When_Uppercased (Short: CWU)
3839          CI                      Case_Ignorable
3840          Cntrl                   General_Category=Cntrl.  (Perl extension).
3841                                  Control characters
3842          Comp_Ex                 Full_Composition_Exclusion
3843          Composition_Exclusion   (Short: CE)
3844          CWCF                    Changes_When_Casefolded
3845          CWCM                    Changes_When_Casemapped
3846          CWKCF                   Changes_When_NFKC_Casefolded
3847          CWL                     Changes_When_Lowercased
3848          CWT                     Changes_When_Titlecased
3849          CWU                     Changes_When_Uppercased
3850          Dash
3851          Decomposition_Mapping   (Short: dm)
3852          Decomposition_Type      (Short: dt)
3853          Default_Ignorable_Code_Point (Short: DI)
3854          Dep                     Deprecated
3855          Deprecated              (Short: Dep)
3856          DI                      Default_Ignorable_Code_Point
3857          Dia                     Diacritic
3858          Diacritic               (Short: Dia)
3859          Digit                   General_Category=Digit.  (Perl extension).
3860                                  [0-9] + all other decimal digits
3861          Dm                      Decomposition_Mapping
3862          Dt                      Decomposition_Type
3863          Ea                      East_Asian_Width
3864          East_Asian_Width        (Short: ea)
3865          Ext                     Extender
3866          Extender                (Short: Ext)
3867          Full_Composition_Exclusion (Short: Comp_Ex)
3868          Gc                      General_Category
3869          GCB                     Grapheme_Cluster_Break
3870          General_Category        (Short: gc)
3871          Gr_Base                 Grapheme_Base
3872          Gr_Ext                  Grapheme_Extend
3873          Graph                   (Perl extension).  Characters that are
3874                                  graphical
3875          Grapheme_Base           (Short: Gr_Base)
3876          Grapheme_Cluster_Break  (Short: GCB)
3877          Grapheme_Extend         (Short: Gr_Ext)
3878          Hangul_Syllable_Type    (Short: hst)
3879          Hex                     Hex_Digit
3880          Hex_Digit               (Short: Hex)
3881          HorizSpace              Blank.  (Perl extension)
3882          Hst                     Hangul_Syllable_Type
3883        D Hyphen                  Supplanted by Line_Break property values;
3884                                  see www.unicode.org/reports/tr14
3885          ID_Continue             (Short: IDC)
3886          ID_Start                (Short: IDS)
3887          IDC                     ID_Continue
3888          Ideo                    Ideographic
3889          Ideographic             (Short: Ideo)
3890          IDS                     ID_Start
3891          IDS_Binary_Operator     (Short: IDSB)
3892          IDS_Trinary_Operator    (Short: IDST)
3893          IDSB                    IDS_Binary_Operator
3894          IDST                    IDS_Trinary_Operator
3895          In                      Present_In.  (Perl extension)
3896          Isc                     ISO_Comment; NOT 'isc' meaning
3897                                  'General_Category=Other'
3898          ISO_Comment             (Short: isc)
3899          Jg                      Joining_Group
3900          Join_C                  Join_Control
3901          Join_Control            (Short: Join_C)
3902          Joining_Group           (Short: jg)
3903          Joining_Type            (Short: jt)
3904          Jt                      Joining_Type
3905          Lb                      Line_Break
3906          Lc                      Lowercase_Mapping; NOT 'lc' meaning
3907                                  'General_Category=Cased_Letter'
3908          Line_Break              (Short: lb)
3909          LOE                     Logical_Order_Exception
3910          Logical_Order_Exception (Short: LOE)
3911          Lower                   Lowercase
3912          Lowercase               (Short: Lower)
3913          Lowercase_Mapping       (Short: lc)
3914          Math
3915          Na                      Name
3916          Na1                     Unicode_1_Name
3917          Name                    (Short: na)
3918          Name_Alias
3919          NChar                   Noncharacter_Code_Point
3920          NFC_QC                  NFC_Quick_Check
3921          NFC_Quick_Check         (Short: NFC_QC)
3922          NFD_QC                  NFD_Quick_Check
3923          NFD_Quick_Check         (Short: NFD_QC)
3924          NFKC_Casefold           (Short: NFKC_CF)
3925          NFKC_CF                 NFKC_Casefold
3926          NFKC_QC                 NFKC_Quick_Check
3927          NFKC_Quick_Check        (Short: NFKC_QC)
3928          NFKD_QC                 NFKD_Quick_Check
3929          NFKD_Quick_Check        (Short: NFKD_QC)
3930          Noncharacter_Code_Point (Short: NChar)
3931          Nt                      Numeric_Type
3932          Numeric_Type            (Short: nt)
3933          Numeric_Value           (Short: nv)
3934          Nv                      Numeric_Value
3935          Pat_Syn                 Pattern_Syntax
3936          Pat_WS                  Pattern_White_Space
3937          Pattern_Syntax          (Short: Pat_Syn)
3938          Pattern_White_Space     (Short: Pat_WS)
3939          Perl_Decimal_Digit      (Perl extension)
3940          PerlSpace               (Perl extension).  \s, restricted to ASCII
3941                                  = [ \f\n\r\t]
3942          PerlWord                (Perl extension).  \w, restricted to ASCII
3943                                  = [A-Za-z0-9_]
3944          PosixAlnum              (Perl extension).  [A-Za-z0-9]
3945          PosixAlpha              (Perl extension).  [A-Za-z]
3946          PosixBlank              (Perl extension).  \t and ' '
3947          PosixCntrl              (Perl extension).  ASCII control
3948                                  characters: NUL, SOH, STX, ETX, EOT, ENQ,
3949                                  ACK, BEL, BS, HT, LF, VT, FF, CR, SO, SI,
3950                                  DLE, DC1, DC2, DC3, DC4, NAK, SYN, ETB,
3951                                  CAN, EOM, SUB, ESC, FS, GS, RS, US, and DEL
3952          PosixDigit              (Perl extension).  [0-9]
3953          PosixGraph              (Perl extension).  [-
3954                                  !"#$%&'()*+,./:;<=>?@[\\]^_`{|}~0-9A-Za-z]
3955          PosixLower              (Perl extension).  [a-z]
3956          PosixPrint              (Perl extension).  [- 0-9A-Za-
3957                                  z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
3958          PosixPunct              (Perl extension).  [-
3959                                  !"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
3960          PosixSpace              (Perl extension).  \t, \n, \cK, \f, \r,
3961                                  and ' '.  (\cK is vertical tab)
3962          PosixUpper              (Perl extension).  [A-Z]
3963          PosixWord               PerlWord.  (Perl extension)
3964          PosixXDigit             (Perl extension).  [0-9A-Fa-f]
3965          Present_In              (Short: In).  (Perl extension)
3966          Print                   (Perl extension).  Characters that are
3967                                  graphical plus space characters (but no
3968                                  controls)
3969          Punct                   General_Category=Punct.  (Perl extension)
3970          QMark                   Quotation_Mark
3971          Quotation_Mark          (Short: QMark)
3972          Radical
3973          SB                      Sentence_Break
3974          Sc                      Script; NOT 'sc' meaning
3975                                  'General_Category=Currency_Symbol'
3976          Scf                     Simple_Case_Folding
3977          Script                  (Short: sc)
3978          Script_Extensions       (Short: scx)
3979          Scx                     Script_Extensions
3980          SD                      Soft_Dotted
3981          Sentence_Break          (Short: SB)
3982          Sfc                     Simple_Case_Folding
3983          Simple_Case_Folding     (Short: scf)
3984          Simple_Lowercase_Mapping (Short: slc)
3985          Simple_Titlecase_Mapping (Short: stc)
3986          Simple_Uppercase_Mapping (Short: suc)
3987          Slc                     Simple_Lowercase_Mapping
3988          Soft_Dotted             (Short: SD)
3989          Space                   White_Space
3990          SpacePerl               XPerlSpace.  (Perl extension)
3991          Stc                     Simple_Titlecase_Mapping
3992          STerm
3993          Suc                     Simple_Uppercase_Mapping
3994          Tc                      Titlecase_Mapping
3995          Term                    Terminal_Punctuation
3996          Terminal_Punctuation    (Short: Term)
3997          Title                   Titlecase.  (Perl extension)
3998          Titlecase               (Short: Title).  (Perl extension).  (=
3999                                  \p{Gc=Lt})
4000          Titlecase_Mapping       (Short: tc)
4001          Uc                      Uppercase_Mapping
4002          UIdeo                   Unified_Ideograph
4003          Unicode_1_Name          (Short: na1)
4004          Unified_Ideograph       (Short: UIdeo)
4005          Upper                   Uppercase
4006          Uppercase               (Short: Upper)
4007          Uppercase_Mapping       (Short: uc)
4008          Variation_Selector      (Short: VS)
4009          VertSpace               (Perl extension).  \v
4010          VS                      Variation_Selector
4011          WB                      Word_Break
4012          White_Space             (Short: WSpace)
4013          Word                    (Perl extension).  \w, including beyond
4014                                  ASCII; = \p{Alnum} + \pM + \p{Pc}
4015          Word_Break              (Short: WB)
4016          WSpace                  White_Space
4017          XDigit                  (Perl extension)
4018          XID_Continue            (Short: XIDC)
4019          XID_Start               (Short: XIDS)
4020          XIDC                    XID_Continue
4021          XIDS                    XID_Start
4022          XPerlSpace              (Perl extension).  \s, including beyond
4023                                  ASCII
4024          XPosixAlnum             Alnum.  (Perl extension)
4025          XPosixAlpha             Alpha.  (Perl extension)
4026          XPosixBlank             Blank.  (Perl extension)
4027          XPosixCntrl             General_Category=Cntrl.  (Perl extension)
4028          XPosixDigit             General_Category=Digit.  (Perl extension)
4029          XPosixGraph             Graph.  (Perl extension)
4030          XPosixLower             Lower.  (Perl extension)
4031          XPosixPrint             Print.  (Perl extension)
4032          XPosixPunct             (Perl extension).  \p{Punct} + ASCII-range
4033                                  \p{Symbol}
4034          XPosixSpace             Space.  (Perl extension)
4035          XPosixUpper             Upper.  (Perl extension)
4036          XPosixWord              Word.  (Perl extension)
4037          XPosixXDigit            XDigit.  (Perl extension)
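
       The pairing of short and full names shown in the table can also be
       queried at run time.  The following sketch assumes the
       "prop_aliases()" function of Unicode::UCD, which returns the short
       name, then the full name, then any other aliases; the choice of
       sample names is arbitrary.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_aliases);

for my $name (qw(Alpha White_Space ccc)) {
    # An unknown name yields an empty list, leaving $short undefined.
    my ($short, $full, @others) = prop_aliases($name);

    if (!defined $short) {
        print "$name: not a known property name\n";
        next;
    }
    print "$name: short=$short full=$full";
    print " others=@others" if @others;
    print "\n";
}
```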
4038

Properties accessible through other means

4040       Certain properties are also accessible via core function calls.  These
4041       are:
4042
4043        Lowercase_Mapping          lc() and lcfirst()
4044        Titlecase_Mapping          ucfirst()
4045        Uppercase_Mapping          uc()
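
       A small sketch of the three mappings follows.  It uses the "DZ with
       caron" digraph (U+01C4 to U+01C6) because its uppercase, titlecase,
       and lowercase forms are three distinct code points; the variable
       names are illustrative only.

```perl
use strict;
use warnings;

# U+01C6 is the lowercase form of the digraph.
my $lower = "\x{1C6}";

my $upper = uc $lower;         # Uppercase_Mapping   -> U+01C4
my $title = ucfirst $lower;    # Titlecase_Mapping   -> U+01C5
my $back  = lc $upper;         # Lowercase_Mapping   -> U+01C6

printf "uc      gives U+%04X\n", ord $upper;
printf "ucfirst gives U+%04X\n", ord $title;
printf "lc      gives U+%04X\n", ord $back;
```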
4046
4047       Also, Case_Folding is accessible through the "/i" modifier in regular
4048       expressions, the "\F" double-quotish escape, and the "fc" operator.
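
       The three routes to Case_Folding can be compared as in the sketch
       below.  It assumes Perl 5.16 or later, where "fc" is enabled by "use
       feature 'fc'", and uses the three forms of the Greek letter sigma,
       which all share one case fold.

```perl
use strict;
use warnings;
use feature 'fc';    # assumption: Perl 5.16 or later

my $capital_sigma = "\x{3A3}";    # GREEK CAPITAL LETTER SIGMA
my $final_sigma   = "\x{3C2}";    # GREEK SMALL LETTER FINAL SIGMA

# Compare the folded forms using the fc operator.
my $via_fc = fc($capital_sigma) eq fc($final_sigma);

# The same comparison using the \F escape in double-quoted strings.
my $via_escape = "\F$capital_sigma\E" eq "\F$final_sigma\E";

# Case-insensitive matching applies the fold implicitly.
my $via_regex = $capital_sigma =~ /\A$final_sigma\z/i;

printf "fc: %s, \\F: %s, /i: %s\n",
    map { $_ ? "equal" : "different" } $via_fc, $via_escape, $via_regex;
```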
4049
4050       The Name and Name_Alias properties are accessible through the "\N{}"
4051       interpolation in double-quoted strings and regular expressions, and
4052       through the functions "charnames::viacode()", "charnames::vianame()",
4053       and "charnames::string_vianame()" (which require a "use charnames ();"
4054       to be specified).
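
       A brief sketch of these interfaces follows.  It loads charnames with
       the ":full" option rather than an empty import list, on the assumption
       that this both makes the functions available and enables "\N{}" on
       Perls that do not load charnames automatically.

```perl
use strict;
use warnings;
use charnames ':full';

# Code point to name.
my $name = charnames::viacode(0x41);

# Name to code point (a number) and name to string.
my $ordinal = charnames::vianame('LATIN SMALL LETTER A');
my $string  = charnames::string_vianame('LATIN SMALL LETTER A');

# Compile-time interpolation of the same name.
my $literal = "\N{LATIN SMALL LETTER A}";

printf "viacode(0x41)   = %s\n",     $name;
printf "vianame         = U+%04X\n", $ordinal;
printf "string_vianame  = %s\n",     $string;
printf "\\N{} literal    = %s\n",    $literal;
```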
4055
4056       Finally, most properties related to decomposition are accessible via
4057       Unicode::Normalize.
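
       For instance, the canonical decomposition of a character can be
       obtained with the "NFD" function of Unicode::Normalize, as in this
       minimal sketch (the helper "code_points" is hypothetical and exists
       only to print the result):

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD NFC);

# Hypothetical helper: render a string as a list of U+XXXX code points.
sub code_points {
    my ($string) = @_;
    return join ' ', map { sprintf 'U+%04X', ord } split //, $string;
}

my $composed   = "\x{E9}";           # LATIN SMALL LETTER E WITH ACUTE
my $decomposed = NFD($composed);     # base letter plus combining acute
my $recomposed = NFC($decomposed);   # back to the single code point

print "NFD: ", code_points($decomposed), "\n";
print "NFC: ", code_points($recomposed), "\n";
```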
4058

Unicode character properties that are NOT accepted by Perl

4060       Perl will generate an error for a few character properties in Unicode
4061       when used in a regular expression.  The non-Unihan ones are listed
4062       below, with the reasons they are not accepted, perhaps with work-
4063       arounds.  The short names for the properties are listed enclosed in
4064       (parentheses).  As described after the list, an installation can change
4065       the defaults and choose to accept any of these.  The list is machine
4066       generated based on the choices made for the installation that generated
4067       this document.
4068
4069       Expands_On_NFC (XO_NFC)
4070       Expands_On_NFD (XO_NFD)
4071       Expands_On_NFKC (XO_NFKC)
4072       Expands_On_NFKD (XO_NFKD)
4073           Deprecated by Unicode.  These are characters that expand to more
4074           than one character in the specified normalization form, but whether
4075           they actually take up more bytes or not depends on the encoding
4076           being used.  For example, a UTF-8 encoded character may expand to a
4077           different number of bytes than a UTF-32 encoded character.
4078
4079       Grapheme_Link (Gr_Link)
4080           Deprecated by Unicode:  Duplicates ccc=vr
4081           (Canonical_Combining_Class=Virama)
4082
4083       Indic_Matra_Category (InMC)
4084       Indic_Syllabic_Category (InSC)
4085           Provisional
4086
4087       Jamo_Short_Name (JSN)
4088       Other_Alphabetic (OAlpha)
4089       Other_Default_Ignorable_Code_Point (ODI)
4090       Other_Grapheme_Extend (OGr_Ext)
4091       Other_ID_Continue (OIDC)
4092       Other_ID_Start (OIDS)
4093       Other_Lowercase (OLower)
4094       Other_Math (OMath)
4095       Other_Uppercase (OUpper)
4096           Used by Unicode internally for generating other properties and not
4097           intended to be used stand-alone
4098
4099       Script=Katakana_Or_Hiragana (sc=Hrkt)
4100           Obsolete.  All code points previously matched by this have been
4101           moved to "Script=Common".  Consider instead using
4102           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
4103           both).
4104
4105       Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
4106           All code points that would be matched by this are matched by either
4107           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
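
       Because the list above depends on how a given Perl was built, a
       program may prefer to probe for a property instead of assuming it is
       present.  The following sketch is one way to do that; the helper name
       "property_is_accepted" is hypothetical.

```perl
use strict;
use warnings;

# Hypothetical helper: true if this Perl accepts \p{NAME} in a pattern.
sub property_is_accepted {
    my ($name) = @_;
    my $pattern = '\p{' . $name . '}';

    # Depending on the Perl version, an unknown property is rejected
    # either when the pattern is compiled or when it is first matched,
    # so both steps happen inside the eval.
    my $ok = eval { my $matched = "\x{100}" =~ /$pattern/; 1 };
    return $ok ? 1 : 0;
}

for my $name (qw(Alphabetic Expands_On_NFC Grapheme_Link)) {
    printf "%-16s %s\n", $name,
        property_is_accepted($name) ? "accepted" : "not accepted";
}
```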
4108
4109       An installation can choose to allow any of these to be matched by
4110       downloading the Unicode database from <http://www.unicode.org/Public/>
4111       to $Config{privlib}/unicore/ in the Perl source tree, changing the
4112       controlling lists contained in the program
4113       $Config{privlib}/unicore/mktables and then re-compiling and installing.
4114       (%Config is available from the Config module).
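
       To see where these paths point on a particular installation, %Config
       can be consulted as in the sketch below.  Note that an installed Perl
       (as opposed to a source tree) may not include the "mktables" program
       at all, so the existence check is only informational.

```perl
use strict;
use warnings;
use Config;
use File::Spec;

# Build the directory and program paths named in the text above.
my $unicore  = File::Spec->catdir($Config{privlib}, 'unicore');
my $mktables = File::Spec->catfile($unicore, 'mktables');

print "unicore directory: $unicore\n";
print "mktables is ",
      (-e $mktables ? "present" : "not present in this installation"), "\n";
```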
4115

Other information in the Unicode data base

4117       The Unicode data base is delivered in two different formats.  The XML
4118       version is valid for more modern Unicode releases.  The other version
4119       is a collection of files.  The two are intended to give equivalent
4120       information.  Perl uses the older form; this allows you to recompile
4121       Perl to use early Unicode releases.
4122
4123       The only non-character property that Perl currently supports is Named
4124       Sequences, in which a sequence of code points is given a name and
4125       generally treated as a single entity.  (Perl supports these via the
4126       "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
4127       in charnames, and "namedseq()" in Unicode::UCD.)
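
       As a sketch of the run-time interfaces, the following looks up one
       named sequence both ways.  The sequence name used is an assumption
       about the contents of NamedSequences.txt in the Unicode version at
       hand; if it is absent, both lookups return undef.

```perl
use strict;
use warnings;
use charnames ();
use Unicode::UCD qw(namedseq);

my $name = 'LATIN CAPITAL LETTER A WITH MACRON AND GRAVE';

# Both calls return the string that the named sequence stands for.
my $from_charnames = charnames::string_vianame($name);
my $from_ucd       = namedseq($name);

for my $pair (['charnames::string_vianame', $from_charnames],
              ['Unicode::UCD::namedseq',    $from_ucd]) {
    my ($source, $string) = @$pair;
    my $shown = defined $string
        ? join(' ', map { sprintf 'U+%04X', ord } split //, $string)
        : '(not found)';
    printf "%-26s %s\n", $source, $shown;
}
```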
4128
4129       Below is a list of the files in the Unicode data base that Perl doesn't
4130       currently use, along with very brief descriptions of their purposes.
4131       Some of the names of the files have been shortened from those that
4132       Unicode uses, in order to allow them to be distinguishable from
4133       similarly named files on file systems for which only the first 8
4134       characters of a name are significant.
4135
4136       auxiliary/GraphemeBreakTest.html
4137       auxiliary/LineBreakTest.html
4138       auxiliary/SentenceBreakTest.html
4139       auxiliary/WordBreakTest.html
4140           Documentation of validation tests
4141
4142       auxiliary/LBTest.txt
4143       auxiliary/SBTest.txt
4144       auxiliary/WBTest.txt
4145       BidiTest.txt
4146       NormalizationTest.txt
4147           Validation Tests
4148
4149       CJKRadicals.txt
4150           Maps the kRSUnicode property values to corresponding code points
4151
4152       EmojiSources.txt
4153           Maps certain Unicode code points to their legacy Japanese cell-
4154           phone values
4155
4156       Index.txt
4157           Alphabetical index of Unicode characters
4158
4159       IndicMatraCategory.txt
4160       IndicSyllabicCategory.txt
4161           Provisional; for the analysis and processing of Indic scripts
4162
4163       NamedSqProv.txt
4164           Named sequences proposed for inclusion in a later version of the
4165           Unicode Standard; if you need them now, you can append this file to
4166           NamedSequences.txt and recompile perl
4167
4168       NamesList.txt
4169           Annotated list of characters
4170
4171       NormalizationCorrections.txt
4172           Documentation of corrections already incorporated into the Unicode
4173           data base
4174
4175       Props.txt
4176           Only in very early releases; is a subset of PropList.txt (which is
4177           used instead)
4178
4179       ReadMe.txt
4180           Documentation
4181
4182       StandardizedVariants.txt
4183           Certain glyph variations for character display are standardized.
4184           This lists the non-Unihan ones; the Unihan ones are also not used
4185           by Perl, and are in a separate Unicode data base
4186           <http://www.unicode.org/ivd>
4187

SEE ALSO

4189       <http://www.unicode.org/reports/tr44/>
4190
4191       perlrecharclass
4192
4193       perlunicode
4194
4195
4196
4197perl v5.16.3                      2019-01-21                   PERLUNIPROPS(1)