PERLUNIPROPS(1)        Perl Programmers Reference Guide        PERLUNIPROPS(1)

NAME

6       perluniprops - Index of Unicode Version 9.0.0 character properties in
7       Perl
8

DESCRIPTION

10       This document provides information about the portion of the Unicode
11       database that deals with character properties, that is the portion that
12       is defined on single code points.  ("Other information in the Unicode
13       data base" below briefly mentions other data that Unicode provides.)
14
15       Perl can provide access to all non-provisional Unicode character
16       properties, though not all are enabled by default.  The omitted ones
17       are the Unihan properties (accessible via the CPAN module
18       Unicode::Unihan) and certain deprecated or Unicode-internal properties.
19       (An installation may choose to recompile Perl's tables to change this.
20       See "Unicode character properties that are NOT accepted by Perl".)
21
22       For most purposes, access to Unicode properties from the Perl core is
23       through regular expression matches, as described in the next section.
24       For some special purposes, and to access the properties that are not
25       suitable for regular expression matching, all the Unicode character
26       properties that Perl handles are accessible via the standard
27       Unicode::UCD module, as described in the section "Properties accessible
28       through Unicode::UCD".
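       As a rough illustration of the Unicode::UCD route, the sketch below
       looks up a few properties of one code point with charinfo().  The
       helper name describe_code_point() is an assumption made for this
       example and is not part of the module.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo);

# Collect a few properties of one code point from Perl's Unicode tables.
# describe_code_point() is a hypothetical helper, not part of Unicode::UCD.
sub describe_code_point {
    my ($code_point) = @_;
    my $info = charinfo($code_point);
    return unless $info && %$info;    # nothing known about this code point
    return {
        name     => $info->{name},
        category => $info->{category},    # short General_Category value
        script   => $info->{script},
        block    => $info->{block},
    };
}

my $alpha = describe_code_point(0x03B1);
printf "U+03B1: %s (gc=%s, script=%s, block=%s)\n",
    @{$alpha}{qw(name category script block)};
```

       The values come back as plain strings, so they can be compared or
       stored without going through a regular expression.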
29
30       Perl also provides some additional extensions and short-cut synonyms
31       for Unicode properties.
32
33       This document merely lists all available properties and does not
34       attempt to explain what each property really means.  There is a brief
35       description of each Perl extension; see "Other Properties" in
36       perlunicode for more information on these.  There is some detail about
37       Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
38       to find out about the intricacies of the official Unicode properties,
39       refer to the Unicode standard.  A good starting place is
40       <http://www.unicode.org/reports/tr44/>.
41
42       Note that you can define your own properties; see "User-Defined
43       Character Properties" in perlunicode.
44

Properties accessible through "\p{}" and "\P{}"

46       The Perl regular expression "\p{}" and "\P{}" constructs give access to
47       most of the Unicode character properties.  The table below shows all
48       these constructs, both single and compound forms.
49
50       Compound forms consist of two components, separated by an equals sign
51       or a colon.  The first component is the property name, and the second
52       component is the particular value of the property to match against, for
53       example, "\p{Script: Greek}" and "\p{Script=Greek}" both mean to match
54       characters whose Script property value is Greek.
55
       Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
       their equivalent compound forms.  The table shows these equivalences.
       (For example, "\p{Greek}" is just a shortcut for
       "\p{Script_Extensions=Greek}".)  There are also a few Perl-defined
       single forms that are not shortcuts for a compound form.  One such is
       "\p{Word}".  These are also listed in the table.
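       A minimal sketch of the compound and single forms side by side.
       U+03B1 was picked because its Script and Script_Extensions values are
       both Greek, so the result should not depend on which of the two the
       single form resolves to.

```perl
use strict;
use warnings;

my $alpha = "\x{03B1}";    # GREEK SMALL LETTER ALPHA
my $latin = "a";           # LATIN SMALL LETTER A

# Two spellings of the compound form, then the single form.
my @forms = (
    [ '\p{Script=Greek}'  => qr/\p{Script=Greek}/  ],
    [ '\p{Script: Greek}' => qr/\p{Script: Greek}/ ],
    [ '\p{Greek}'         => qr/\p{Greek}/         ],
);

for my $form (@forms) {
    my ($label, $re) = @$form;
    printf "%-18s alpha:%-3s latin:%s\n", $label,
        ($alpha =~ $re ? 'yes' : 'no'),
        ($latin =~ $re ? 'yes' : 'no');
}

# \p{Word} is a Perl-defined single form with no compound equivalent.
print "underscore is a Word character\n" if "_" =~ /\p{Word}/;
```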
62
       In parsing these constructs, Perl always ignores upper/lower case
       differences everywhere within the {braces}.  Thus "\p{Greek}" means the
       same thing as "\p{greek}".  But note that changing the case of the "p"
       or "P" before the left brace completely changes the meaning of the
       construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
       Casing in this document is for improved legibility.
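       The following sketch contrasts the two kinds of case difference: case
       inside the braces, which is ignored, and the case of the letter before
       the brace, which flips the sense of the match.

```perl
use strict;
use warnings;

my $alpha = "\x{03B1}";    # GREEK SMALL LETTER ALPHA

# Case inside the braces does not matter ...
my $mixed = $alpha =~ /\p{Greek}/ ? 1 : 0;
my $lower = $alpha =~ /\p{greek}/ ? 1 : 0;
my $upper = $alpha =~ /\p{GREEK}/ ? 1 : 0;

# ... but the case of the letter before the brace does: \P negates.
my $negated = $alpha =~ /\P{Greek}/ ? 1 : 0;

print "p{Greek}=$mixed p{greek}=$lower p{GREEK}=$upper P{Greek}=$negated\n";
```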
69
70       Also, white space, hyphens, and underscores are normally ignored
71       everywhere between the {braces}, and hence can be freely added or
72       removed even if the "/x" modifier hasn't been specified on the regular
73       expression.  But in the table below a 'T' at the beginning of an entry
74       means that tighter (stricter) rules are used for that entry:
75
76           Single form ("\p{name}") tighter rules:
77               White space, hyphens, and underscores ARE significant except
78               for:
79
80               ·   white space adjacent to a non-word character
81
82               ·   underscores separating digits in numbers
83
84               That means, for example, that you can freely add or remove
85               white space adjacent to (but within) the braces without
86               affecting the meaning.
87
88           Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
89               The tighter rules given above for the single form apply to
90               everything to the right of the colon or equals; the looser
91               rules still apply to everything to the left.
92
93               That means, for example, that you can freely add or remove
94               white space adjacent to (but within) the braces and the colon
95               or equal sign.
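       The loose and tighter rules can be probed with a small sketch like the
       one below.  The helper accepts() is hypothetical; it builds the
       pattern at run time and wraps the match in eval so that a spelling
       Perl rejects is reported rather than ending the program.

```perl
use strict;
use warnings;

# Try one property spelling against one character.
# Returns 1 or 0 for match or no match, undef if Perl rejects the spelling.
# (accepts() is a hypothetical helper written for this sketch.)
sub accepts {
    my ($spec, $char) = @_;
    my $pattern = '\p{' . $spec . '}';
    my $matched = eval { $char =~ /$pattern/ ? 1 : 0 };
    return $matched;    # undef when compiling or matching died
}

# Loose rules: case, white space, and underscores may vary.
for my $spec ('Uppercase_Letter', 'UppercaseLetter', 'uppercase_letter',
              ' General Category : Lu ') {
    my $result = accepts($spec, 'A');
    printf "%-28s %s\n", "\\p{$spec}", defined $result ? $result : 'rejected';
}

# 'Age: 1.1' is a 'T' entry in the table; 'Age: V1_1' is its listed
# equivalent.
for my $spec ('Age=1.1', 'Age: V1_1') {
    my $result = accepts($spec, 'A');
    printf "%-28s %s\n", "\\p{$spec}", defined $result ? $result : 'rejected';
}
```

       Spellings that break the tighter rules can be passed to the same
       helper to see whether a given Perl accepts them.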
96
       Some properties are considered obsolete by Unicode, but are still
       available.  There are several varieties of obsolescence:
99
100           Stabilized
101               A property may be stabilized.  Such a determination does not
102               indicate that the property should or should not be used;
103               instead it is a declaration that the property will not be
104               maintained nor extended for newly encoded characters.  Such
105               properties are marked with an 'S' in the table.
106
107           Deprecated
108               A property may be deprecated, perhaps because its original
109               intent has been replaced by another property, or because its
110               specification was somehow defective.  This means that its use
111               is strongly discouraged, so much so that a warning will be
112               issued if used, unless the regular expression is in the scope
113               of a "no warnings 'deprecated'" statement.  A 'D' flags each
114               such entry in the table, and the entry there for the longest,
115               most descriptive version of the property will give the reason
116               it is deprecated, and perhaps advice.  Perl may issue such a
117               warning, even for properties that aren't officially deprecated
118               by Unicode, when there used to be characters or code points
119               that were matched by them, but no longer.  This is to warn you
120               that your program may not work like it did on earlier Unicode
121               releases.
122
               A deprecated property may be made unavailable in a future Perl
               version, so it is best to move away from it.
125
126               A deprecated property may also be stabilized, but this fact is
127               not shown.
128
129           Obsolete
130               Properties marked with an 'O' in the table are considered
131               (plain) obsolete.  Generally this designation is given to
132               properties that Unicode once used for internal purposes (but
133               not any longer).
134
135           Discouraged
136               This is not actually a Unicode-specified obsolescence, but
137               applies to certain Perl extensions that are present for
138               backwards compatibility, but are discouraged from being used.
139               These are not obsolete, but their meanings are not stable.
140               Future Unicode versions could force any of these extensions to
141               be removed without warning, replaced by another property with
142               the same name that means something different.  An 'X' flags
143               each such entry in the table.  Use the equivalent shown
144               instead.
145
               In particular, matches in the Block property have single forms
               defined by Perl that begin with "In_", "Is_", or even with no
               prefix at all.  Like all DISCOURAGED forms, these are not
               stable.  For example, "\p{Block=Deseret}" can currently be
               written as "\p{In_Deseret}", "\p{Is_Deseret}", or
               "\p{Deseret}".  But a new Unicode version may come along that
               would force Perl to change the meaning of one or more of these,
               and your program would no longer be correct.  Currently there
               are no such conflicts with the form that begins "In_", but
               there are many with the other two shortcuts, and Unicode
               continues to define new properties that begin with "In", so
               it's quite possible that a conflict will occur in the future.
               The compound form is guaranteed to not become obsolete, and its
               meaning is clearer anyway.  See "Blocks" in perlunicode for
               more information about this.
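       To make the difference between block-based and script-based matching
       concrete, here is a small sketch.  It assumes U+0750, a letter of the
       Arabic script that is encoded in the Arabic Supplement block, and
       U+10400 from the Deseret block.

```perl
use strict;
use warnings;

my $beh     = "\x{0750}";     # Arabic-script letter, Arabic Supplement block
my $deseret = "\x{10400}";    # DESERET CAPITAL LETTER LONG I

# The script-based single form and the Block compound form disagree here.
my %result = (
    'Arabic'                  => ($beh =~ /\p{Arabic}/                  ? 1 : 0),
    'Block=Arabic'            => ($beh =~ /\p{Block=Arabic}/            ? 1 : 0),
    'Block=Arabic_Supplement' => ($beh =~ /\p{Block=Arabic_Supplement}/ ? 1 : 0),
);
print "U+0750  $_: $result{$_}\n" for sort keys %result;

# The compound form and the "In_" shortcut currently name the same block.
my $compound = $deseret =~ /\p{Block=Deseret}/ ? 1 : 0;
my $shortcut = $deseret =~ /\p{In_Deseret}/    ? 1 : 0;
print "U+10400 Block=Deseret: $compound, In_Deseret: $shortcut\n";
```

       Writing "\p{Block=...}" in new code keeps the intent visible and
       avoids depending on the discouraged shortcuts.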
161
162       The table below has two columns.  The left column contains the "\p{}"
163       constructs to look up, possibly preceded by the flags mentioned above;
164       and the right column contains information about them, like a
165       description, or synonyms.  The table shows both the single and compound
166       forms for each property that has them.  If the left column is a short
167       name for a property, the right column will give its longer, more
168       descriptive name; and if the left column is the longest name, the right
169       column will show any equivalent shortest name, in both single and
170       compound forms if applicable.
171
172       If braces are not needed to specify a property (e.g., "\pL"), the left
173       column contains both forms, with and without braces.
174
175       The right column will also caution you if a property means something
176       different than what might normally be expected.
177
178       All single forms are Perl extensions; a few compound forms are as well,
179       and are noted as such.
180
181       Numbers in (parentheses) indicate the total number of Unicode code
182       points matched by the property.  For emphasis, those properties that
183       match no code points at all are listed as well in a separate section
184       following the table.
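       One way to check a count from the table is sketched below.  It assumes
       Unicode::UCD's prop_invlist(), which is only available in reasonably
       recent versions of the module; count_code_points() is a hypothetical
       helper.  Counts for many properties differ between Unicode versions,
       so the two properties used here were chosen for being stable.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Count the Unicode code points (U+0000..U+10FFFF) matched by a property.
# count_code_points() is a hypothetical helper built on prop_invlist().
sub count_code_points {
    my ($property) = @_;
    my @invlist = prop_invlist($property);
    my $count   = 0;

    # In an inversion list, elements 0, 2, 4, ... start a matching range
    # and elements 1, 3, 5, ... start the following non-matching range.
    for (my $i = 0; $i + 1 < @invlist; $i += 2) {
        $count += $invlist[ $i + 1 ] - $invlist[$i];
    }

    # An odd number of elements means the last range is open-ended;
    # count only the part of it that lies within Unicode.
    if (@invlist % 2 && $invlist[-1] < 0x110000) {
        $count += 0x110000 - $invlist[-1];
    }
    return $count;
}

printf "%-16s %d\n", $_, count_code_points($_)
    for qw(ASCII_Hex_Digit Block=Arabic);
```

       The "plus all above-Unicode code points" remark in some table entries
       corresponds to the open-ended final range that the helper truncates.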
185
       Most properties match the same code points regardless of whether "/i"
       case-insensitive matching is specified or not.  But a few properties
       are affected.  These are shown with the notation "(/i= other_property)"
       in the second column.  Under case-insensitive matching they match the
       same code points as the property other_property.
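       As an illustration, the sketch below uses "\p{Lu}", whose entry later
       in the full table is assumed to carry the notation "(/i=
       General_Category=Cased_Letter)".  Under that assumption a lowercase
       letter fails the plain match but passes with "/i".

```perl
use strict;
use warnings;

# Report whether $char matches \p{Lu} without and with /i.
# lu_match() is a hypothetical helper written for this sketch.
sub lu_match {
    my ($char) = @_;
    return (
        plain  => ($char =~ /\p{Lu}/  ? 1 : 0),
        folded => ($char =~ /\p{Lu}/i ? 1 : 0),
    );
}

for my $char ('A', 'a', '1') {
    my %m = lu_match($char);
    print "'$char': plain=$m{plain} with /i=$m{folded}\n";
}
```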
191
       There is no description given for most non-Perl defined properties (see
       <http://www.unicode.org/reports/tr44/> for that).
194
195       For compactness, '*' is used as a wildcard instead of showing all
196       possible combinations.  For example, entries like:
197
198        \p{Gc: *}                                  \p{General_Category: *}
199
200       mean that 'Gc' is a synonym for 'General_Category', and anything that
201       is valid for the latter is also valid for the former.  Similarly,
202
203        \p{Is_*}                                   \p{*}
204
205       means that if and only if, for example, "\p{Foo}" exists, then
206       "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
207       thing.  And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
208       and "\p{IsFoo=Bar}".  "*" here is restricted to something not beginning
209       with an underscore.
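       The two wildcard entries can be exercised with a short sketch.  It
       checks one synonym pair from the "\p{Gc: *}" entry and the optional
       "Is" prefix from the "\p{Is_*}" entry, using U+03B1 as the test
       character.

```perl
use strict;
use warnings;

my $alpha = "\x{03B1}";    # GREEK SMALL LETTER ALPHA

# "Gc" stands in for "General_Category" (the \p{Gc: *} entry).
my $short = $alpha =~ /\p{Gc=Ll}/               ? 1 : 0;
my $long  = $alpha =~ /\p{General_Category=Ll}/ ? 1 : 0;

# The optional "Is" prefix (the \p{Is_*} entry).
my @prefixed = map { $alpha =~ $_ ? 1 : 0 }
    qr/\p{Greek}/, qr/\p{Is_Greek}/, qr/\p{IsGreek}/;

print "Gc=$short General_Category=$long Greek/Is_Greek/IsGreek=@prefixed\n";
```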
210
       Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
       'Y'.  And 'No', 'F', and 'False' are all synonyms for 'N'.  The table
       shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
       for the other possibilities.  Note that not all properties which have
       values 'Yes' and 'No' are binary.  The non-binary ones have all their
       values spelled out without using this wild card, and a "NOT" clause in
       their description that highlights their not being binary.  These also
       require the compound form to match them, whereas true binary properties
       have both single and compound forms available.
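       A minimal sketch of the binary-property synonyms, using the Alphabetic
       property from the table.  The helper tally() is hypothetical; it
       returns one digit per pattern, 1 for a match and 0 otherwise.

```perl
use strict;
use warnings;

# Each list holds spellings that should be interchangeable.
my @yes = (qr/\p{Alphabetic=Y}/, qr/\p{Alpha=Yes}/, qr/\p{Alpha: T}/,
           qr/\p{Alpha=True}/,   qr/\p{Alpha}/);
my @no  = (qr/\p{Alphabetic=N}/, qr/\p{Alpha=No}/,  qr/\p{Alpha: F}/,
           qr/\p{Alpha=False}/,  qr/\P{Alpha}/);

# One digit per pattern: 1 if $char matches it, 0 if not.
# tally() is a hypothetical helper written for this sketch.
sub tally {
    my ($char, @patterns) = @_;
    return join '', map { $char =~ $_ ? 1 : 0 } @patterns;
}

for my $char ('a', '1') {
    printf "'%s': yes-forms=%s no-forms=%s\n",
        $char, tally($char, @yes), tally($char, @no);
}
```

       The single forms "\p{Alpha}" and "\P{Alpha}" sit in the same lists as
       the compound spellings because, for a true binary property, they are
       listed in the table as short forms of 'Y*' and 'N*'.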
220
221       Note that all non-essential underscores are removed in the display of
222       the short names below.
223
224       Legend summary:
225
226       * is a wild-card
227       (\d+) in the info column gives the number of Unicode code points
228       matched by this property.
229       D means this is deprecated.
230       O means this is obsolete.
231       S means this is stabilized.
232       T means tighter (stricter) name matching applies.
233       X means use of this form is discouraged, and may not be stable.
234
235              NAME                           INFO
236
237          \p{Adlam}               \p{Script_Extensions=Adlam} (Short:
238                                    \p{Adlm}; NOT \p{Block=Adlam}) (88)
239          \p{Adlm}                \p{Adlam} (= \p{Script_Extensions=Adlam})
240                                    (NOT \p{Block=Adlam}) (88)
241        X \p{Aegean_Numbers}      \p{Block=Aegean_Numbers} (64)
242        T \p{Age: 1.1}            \p{Age=V1_1} (33_979)
243        T \p{Age: 2.0}            \p{Age=V2_0} (144_521)
244        T \p{Age: 2.1}            \p{Age=V2_1} (2)
245        T \p{Age: 3.0}            \p{Age=V3_0} (10_307)
246        T \p{Age: 3.1}            \p{Age=V3_1} (44_978)
247        T \p{Age: 3.2}            \p{Age=V3_2} (1016)
248        T \p{Age: 4.0}            \p{Age=V4_0} (1226)
249        T \p{Age: 4.1}            \p{Age=V4_1} (1273)
250        T \p{Age: 5.0}            \p{Age=V5_0} (1369)
251        T \p{Age: 5.1}            \p{Age=V5_1} (1624)
252        T \p{Age: 5.2}            \p{Age=V5_2} (6648)
253        T \p{Age: 6.0}            \p{Age=V6_0} (2088)
254        T \p{Age: 6.1}            \p{Age=V6_1} (732)
255        T \p{Age: 6.2}            \p{Age=V6_2} (1)
256        T \p{Age: 6.3}            \p{Age=V6_3} (5)
257        T \p{Age: 7.0}            \p{Age=V7_0} (2834)
258        T \p{Age: 8.0}            \p{Age=V8_0} (7716)
259        T \p{Age: 9.0}            \p{Age=V9_0} (7500)
260          \p{Age: NA}             \p{Age=Unassigned} (846_293 plus all
261                                    above-Unicode code points)
262          \p{Age: Unassigned}     Code point's usage has not been assigned
263                                    in any Unicode release thus far. (Short:
264                                    \p{Age=NA}) (846_293 plus all above-
265                                    Unicode code points)
266          \p{Age: V1_1}           Code point's usage introduced in version
267                                    1.1 (33_979)
268          \p{Age: V2_0}           Code point's usage was introduced in
269                                    version 2.0; See also Property
270                                    'Present_In' (144_521)
271          \p{Age: V2_1}           Code point's usage was introduced in
272                                    version 2.1; See also Property
273                                    'Present_In' (2)
274          \p{Age: V3_0}           Code point's usage was introduced in
275                                    version 3.0; See also Property
276                                    'Present_In' (10_307)
277          \p{Age: V3_1}           Code point's usage was introduced in
278                                    version 3.1; See also Property
279                                    'Present_In' (44_978)
280          \p{Age: V3_2}           Code point's usage was introduced in
281                                    version 3.2; See also Property
282                                    'Present_In' (1016)
283          \p{Age: V4_0}           Code point's usage was introduced in
284                                    version 4.0; See also Property
285                                    'Present_In' (1226)
286          \p{Age: V4_1}           Code point's usage was introduced in
287                                    version 4.1; See also Property
288                                    'Present_In' (1273)
289          \p{Age: V5_0}           Code point's usage was introduced in
290                                    version 5.0; See also Property
291                                    'Present_In' (1369)
292          \p{Age: V5_1}           Code point's usage was introduced in
293                                    version 5.1; See also Property
294                                    'Present_In' (1624)
295          \p{Age: V5_2}           Code point's usage was introduced in
296                                    version 5.2; See also Property
297                                    'Present_In' (6648)
298          \p{Age: V6_0}           Code point's usage was introduced in
299                                    version 6.0; See also Property
300                                    'Present_In' (2088)
301          \p{Age: V6_1}           Code point's usage was introduced in
302                                    version 6.1; See also Property
303                                    'Present_In' (732)
304          \p{Age: V6_2}           Code point's usage was introduced in
305                                    version 6.2; See also Property
306                                    'Present_In' (1)
307          \p{Age: V6_3}           Code point's usage was introduced in
308                                    version 6.3; See also Property
309                                    'Present_In' (5)
310          \p{Age: V7_0}           Code point's usage was introduced in
311                                    version 7.0; See also Property
312                                    'Present_In' (2834)
313          \p{Age: V8_0}           Code point's usage was introduced in
314                                    version 8.0; See also Property
315                                    'Present_In' (7716)
316          \p{Age: V9_0}           Code point's usage was introduced in
317                                    version 9.0; See also Property
318                                    'Present_In' (7500)
319          \p{Aghb}                \p{Caucasian_Albanian} (=
320                                    \p{Script_Extensions=
321                                    Caucasian_Albanian}) (NOT \p{Block=
322                                    Caucasian_Albanian}) (53)
323          \p{AHex}                \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
324                                    (22)
325          \p{AHex: *}             \p{ASCII_Hex_Digit: *}
326          \p{Ahom}                \p{Script_Extensions=Ahom} (NOT \p{Block=
327                                    Ahom}) (57)
328        X \p{Alchemical}          \p{Alchemical_Symbols} (= \p{Block=
329                                    Alchemical_Symbols}) (128)
330        X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short:
331                                    \p{InAlchemical}) (128)
332          \p{All}                 All code points, including those above
333                                    Unicode.  Same as qr/./s (1_114_112 plus
334                                    all above-Unicode code points)
335          \p{Alnum}               \p{XPosixAlnum} (118_820)
336          \p{Alpha}               \p{XPosixAlpha} (= \p{Alphabetic=Y})
337                                    (118_240)
338          \p{Alpha: *}            \p{Alphabetic: *}
339          \p{Alphabetic}          \p{XPosixAlpha} (= \p{Alphabetic=Y})
340                                    (118_240)
341          \p{Alphabetic: N*}      (Short: \p{Alpha=N}, \P{Alpha}) (995_872
342                                    plus all above-Unicode code points)
343          \p{Alphabetic: Y*}      (Short: \p{Alpha=Y}, \p{Alpha}) (118_240)
344        X \p{Alphabetic_PF}       \p{Alphabetic_Presentation_Forms} (=
345                                    \p{Block=Alphabetic_Presentation_Forms})
346                                    (80)
347        X \p{Alphabetic_Presentation_Forms} \p{Block=
348                                    Alphabetic_Presentation_Forms} (Short:
349                                    \p{InAlphabeticPF}) (80)
350          \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
351                                    Anatolian_Hieroglyphs} (Short: \p{Hluw};
352                                    NOT \p{Block=Anatolian_Hieroglyphs})
353                                    (583)
354        X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
355                                    \p{Block=
356                                    Ancient_Greek_Musical_Notation}) (80)
357        X \p{Ancient_Greek_Musical_Notation} \p{Block=
358                                    Ancient_Greek_Musical_Notation} (Short:
359                                    \p{InAncientGreekMusic}) (80)
360        X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
361        X \p{Ancient_Symbols}     \p{Block=Ancient_Symbols} (64)
362          \p{Any}                 All Unicode code points: [\x{0000}-
363                                    \x{10FFFF}] (1_114_112)
364          \p{Arab}                \p{Arabic} (= \p{Script_Extensions=
365                                    Arabic}) (NOT \p{Block=Arabic}) (1323)
366          \p{Arabic}              \p{Script_Extensions=Arabic} (Short:
367                                    \p{Arab}; NOT \p{Block=Arabic}) (1323)
368        X \p{Arabic_Ext_A}        \p{Arabic_Extended_A} (= \p{Block=
369                                    Arabic_Extended_A}) (96)
370        X \p{Arabic_Extended_A}   \p{Block=Arabic_Extended_A} (Short:
371                                    \p{InArabicExtA}) (96)
372        X \p{Arabic_Math}         \p{Arabic_Mathematical_Alphabetic_Symbols}
373                                    (= \p{Block=
374                                    Arabic_Mathematical_Alphabetic_Symbols})
375                                    (256)
376        X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
377                                    Arabic_Mathematical_Alphabetic_Symbols}
378                                    (Short: \p{InArabicMath}) (256)
379        X \p{Arabic_PF_A}         \p{Arabic_Presentation_Forms_A} (=
380                                    \p{Block=Arabic_Presentation_Forms_A})
381                                    (688)
382        X \p{Arabic_PF_B}         \p{Arabic_Presentation_Forms_B} (=
383                                    \p{Block=Arabic_Presentation_Forms_B})
384                                    (144)
385        X \p{Arabic_Presentation_Forms_A} \p{Block=
386                                    Arabic_Presentation_Forms_A} (Short:
387                                    \p{InArabicPFA}) (688)
388        X \p{Arabic_Presentation_Forms_B} \p{Block=
389                                    Arabic_Presentation_Forms_B} (Short:
390                                    \p{InArabicPFB}) (144)
391        X \p{Arabic_Sup}          \p{Arabic_Supplement} (= \p{Block=
392                                    Arabic_Supplement}) (48)
393        X \p{Arabic_Supplement}   \p{Block=Arabic_Supplement} (Short:
394                                    \p{InArabicSup}) (48)
395          \p{Armenian}            \p{Script_Extensions=Armenian} (Short:
396                                    \p{Armn}; NOT \p{Block=Armenian}) (94)
397          \p{Armi}                \p{Imperial_Aramaic} (=
398                                    \p{Script_Extensions=Imperial_Aramaic})
399                                    (NOT \p{Block=Imperial_Aramaic}) (31)
400          \p{Armn}                \p{Armenian} (= \p{Script_Extensions=
401                                    Armenian}) (NOT \p{Block=Armenian}) (94)
402        X \p{Arrows}              \p{Block=Arrows} (112)
403          \p{ASCII}               \p{Block=Basic_Latin} [[:ASCII:]] (128)
404          \p{ASCII_Hex_Digit}     \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
405                                    (22)
406          \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
407                                    plus all above-Unicode code points)
408          \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22)
409          \p{Assigned}            All assigned code points (267_753)
410          \p{Avestan}             \p{Script_Extensions=Avestan} (Short:
411                                    \p{Avst}; NOT \p{Block=Avestan}) (61)
412          \p{Avst}                \p{Avestan} (= \p{Script_Extensions=
413                                    Avestan}) (NOT \p{Block=Avestan}) (61)
414          \p{Bali}                \p{Balinese} (= \p{Script_Extensions=
415                                    Balinese}) (NOT \p{Block=Balinese}) (121)
416          \p{Balinese}            \p{Script_Extensions=Balinese} (Short:
417                                    \p{Bali}; NOT \p{Block=Balinese}) (121)
418          \p{Bamu}                \p{Bamum} (= \p{Script_Extensions=Bamum})
419                                    (NOT \p{Block=Bamum}) (657)
420          \p{Bamum}               \p{Script_Extensions=Bamum} (Short:
421                                    \p{Bamu}; NOT \p{Block=Bamum}) (657)
422        X \p{Bamum_Sup}           \p{Bamum_Supplement} (= \p{Block=
423                                    Bamum_Supplement}) (576)
424        X \p{Bamum_Supplement}    \p{Block=Bamum_Supplement} (Short:
425                                    \p{InBamumSup}) (576)
426        X \p{Basic_Latin}         \p{ASCII} (= \p{Block=Basic_Latin}) (128)
427          \p{Bass}                \p{Bassa_Vah} (= \p{Script_Extensions=
428                                    Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
429                                    (36)
430          \p{Bassa_Vah}           \p{Script_Extensions=Bassa_Vah} (Short:
431                                    \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
432          \p{Batak}               \p{Script_Extensions=Batak} (Short:
433                                    \p{Batk}; NOT \p{Block=Batak}) (56)
434          \p{Batk}                \p{Batak} (= \p{Script_Extensions=Batak})
435                                    (NOT \p{Block=Batak}) (56)
436          \p{Bc: *}               \p{Bidi_Class: *}
437          \p{Beng}                \p{Bengali} (= \p{Script_Extensions=
438                                    Bengali}) (NOT \p{Block=Bengali}) (98)
439          \p{Bengali}             \p{Script_Extensions=Bengali} (Short:
440                                    \p{Beng}; NOT \p{Block=Bengali}) (98)
441          \p{Bhaiksuki}           \p{Script_Extensions=Bhaiksuki} (Short:
442                                    \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
443          \p{Bhks}                \p{Bhaiksuki} (= \p{Script_Extensions=
444                                    Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
445                                    (97)
446          \p{Bidi_C}              \p{Bidi_Control} (= \p{Bidi_Control=Y})
447                                    (12)
448          \p{Bidi_C: *}           \p{Bidi_Control: *}
449          \p{Bidi_Class: AL}      \p{Bidi_Class=Arabic_Letter} (1420)
450          \p{Bidi_Class: AN}      \p{Bidi_Class=Arabic_Number} (51)
451          \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1420)
452          \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (51)
453          \p{Bidi_Class: B}       \p{Bidi_Class=Paragraph_Separator} (7)
454          \p{Bidi_Class: BN}      \p{Bidi_Class=Boundary_Neutral} (4016)
455          \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016)
456          \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15)
457          \p{Bidi_Class: CS}      \p{Bidi_Class=Common_Separator} (15)
458          \p{Bidi_Class: EN}      \p{Bidi_Class=European_Number} (158)
459          \p{Bidi_Class: ES}      \p{Bidi_Class=European_Separator} (12)
460          \p{Bidi_Class: ET}      \p{Bidi_Class=European_Terminator} (87)
461          \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (158)
462          \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12)
463          \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (87)
464          \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1)
465          \p{Bidi_Class: FSI}     \p{Bidi_Class=First_Strong_Isolate} (1)
466          \p{Bidi_Class: L}       \p{Bidi_Class=Left_To_Right} (1_097_280
467                                    plus all above-Unicode code points)
468          \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_097_280 plus
469                                    all above-Unicode code points)
470          \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1)
471          \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1)
472          \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1)
473          \p{Bidi_Class: LRE}     \p{Bidi_Class=Left_To_Right_Embedding} (1)
474          \p{Bidi_Class: LRI}     \p{Bidi_Class=Left_To_Right_Isolate} (1)
475          \p{Bidi_Class: LRO}     \p{Bidi_Class=Left_To_Right_Override} (1)
476          \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1700)
477          \p{Bidi_Class: NSM}     \p{Bidi_Class=Nonspacing_Mark} (1700)
478          \p{Bidi_Class: ON}      \p{Bidi_Class=Other_Neutral} (5267)
479          \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5267)
480          \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7)
481          \p{Bidi_Class: PDF}     \p{Bidi_Class=Pop_Directional_Format} (1)
          \p{Bidi_Class: PDI}     \p{Bidi_Class=Pop_Directional_Isolate} (1)
          \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1)
          \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1)
          \p{Bidi_Class: R}       \p{Bidi_Class=Right_To_Left} (4070)
          \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (4070)
          \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1)
          \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1)
          \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1)
          \p{Bidi_Class: RLE}     \p{Bidi_Class=Right_To_Left_Embedding} (1)
          \p{Bidi_Class: RLI}     \p{Bidi_Class=Right_To_Left_Isolate} (1)
          \p{Bidi_Class: RLO}     \p{Bidi_Class=Right_To_Left_Override} (1)
          \p{Bidi_Class: S}       \p{Bidi_Class=Segment_Separator} (3)
          \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3)
          \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17)
          \p{Bidi_Class: WS}      \p{Bidi_Class=White_Space} (17)
          \p{Bidi_Control}        \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
          \p{Bidi_Control: N*}    (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
                                    plus all above-Unicode code points)
          \p{Bidi_Control: Y*}    (Short: \p{BidiC=Y}, \p{BidiC}) (12)
          \p{Bidi_M}              \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
                                    (545)
          \p{Bidi_M: *}           \p{Bidi_Mirrored: *}
          \p{Bidi_Mirrored}       \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
                                    (545)
          \p{Bidi_Mirrored: N*}   (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
                                    plus all above-Unicode code points)
          \p{Bidi_Mirrored: Y*}   (Short: \p{BidiM=Y}, \p{BidiM}) (545)
          \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
                                    (60)
          \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60)
          \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
                                    (1_113_992 plus all above-Unicode code
                                    points)
          \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
                                    plus all above-Unicode code points)
          \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
                                    (60)
          \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60)
          \p{Blank}               \p{XPosixBlank} (18)
          \p{Blk: *}              \p{Block: *}
          \p{Block: Adlam}        (NOT \p{Adlam} NOR \p{Is_Adlam}) (96)
          \p{Block: Aegean_Numbers} (64)
          \p{Block: Ahom}         (NOT \p{Ahom} NOR \p{Is_Ahom}) (64)
          \p{Block: Alchemical}   \p{Block=Alchemical_Symbols} (128)
          \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128)
          \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
                                    (80)
          \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
                                    AlphabeticPF}) (80)
          \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
                                    NOR \p{Is_Anatolian_Hieroglyphs}) (640)
          \p{Block: Ancient_Greek_Music} \p{Block=
                                    Ancient_Greek_Musical_Notation} (80)
          \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
                                    AncientGreekMusic}) (80)
          \p{Block: Ancient_Greek_Numbers} (80)
          \p{Block: Ancient_Symbols} (64)
          \p{Block: Arabic}       (NOT \p{Arabic} NOR \p{Is_Arabic}) (256)
          \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
          \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96)
          \p{Block: Arabic_Math}  \p{Block=
                                    Arabic_Mathematical_Alphabetic_Symbols}
                                    (256)
          \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
                                    ArabicMath}) (256)
          \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
          \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
          \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
                                    (688)
          \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
                                    (144)
          \p{Block: Arabic_Sup}   \p{Block=Arabic_Supplement} (48)
          \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48)
          \p{Block: Armenian}     (NOT \p{Armenian} NOR \p{Is_Armenian}) (96)
          \p{Block: Arrows}       (112)
          \p{Block: ASCII}        \p{Block=Basic_Latin} (128)
          \p{Block: Avestan}      (NOT \p{Avestan} NOR \p{Is_Avestan}) (64)
          \p{Block: Balinese}     (NOT \p{Balinese} NOR \p{Is_Balinese})
                                    (128)
          \p{Block: Bamum}        (NOT \p{Bamum} NOR \p{Is_Bamum}) (96)
          \p{Block: Bamum_Sup}    \p{Block=Bamum_Supplement} (576)
          \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576)
          \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}) (128)
          \p{Block: Bassa_Vah}    (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
                                    (48)
          \p{Block: Batak}        (NOT \p{Batak} NOR \p{Is_Batak}) (64)
          \p{Block: Bengali}      (NOT \p{Bengali} NOR \p{Is_Bengali}) (128)
          \p{Block: Bhaiksuki}    (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
                                    (112)
          \p{Block: Block_Elements} (32)
          \p{Block: Bopomofo}     (NOT \p{Bopomofo} NOR \p{Is_Bopomofo}) (48)
          \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
          \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32)
          \p{Block: Box_Drawing}  (128)
          \p{Block: Brahmi}       (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128)
          \p{Block: Braille}      \p{Block=Braille_Patterns} (256)
          \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256)
          \p{Block: Buginese}     (NOT \p{Buginese} NOR \p{Is_Buginese}) (32)
          \p{Block: Buhid}        (NOT \p{Buhid} NOR \p{Is_Buhid}) (32)
          \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
                                    (256)
          \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
                                    ByzantineMusic}) (256)
          \p{Block: Canadian_Syllabics} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics}
                                    (640)
          \p{Block: Carian}       (NOT \p{Carian} NOR \p{Is_Carian}) (64)
          \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
                                    \p{Is_Caucasian_Albanian}) (64)
          \p{Block: Chakma}       (NOT \p{Chakma} NOR \p{Is_Chakma}) (80)
          \p{Block: Cham}         (NOT \p{Cham} NOR \p{Is_Cham}) (96)
          \p{Block: Cherokee}     (NOT \p{Cherokee} NOR \p{Is_Cherokee}) (96)
          \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
          \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80)
          \p{Block: CJK}          \p{Block=CJK_Unified_Ideographs} (20_992)
          \p{Block: CJK_Compat}   \p{Block=CJK_Compatibility} (256)
          \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
          \p{Block: CJK_Compat_Ideographs} \p{Block=
                                    CJK_Compatibility_Ideographs} (512)
          \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
                                    CJK_Compatibility_Ideographs_Supplement}
                                    (544)
          \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256)
          \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
                                    (32)
          \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
                                    CJKCompatIdeographs}) (512)
          \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
                                    CJKCompatIdeographsSup}) (544)
          \p{Block: CJK_Ext_A}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_A}
                                    (6592)
          \p{Block: CJK_Ext_B}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_B}
                                    (42_720)
          \p{Block: CJK_Ext_C}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_C}
                                    (4160)
          \p{Block: CJK_Ext_D}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_D} (224)
          \p{Block: CJK_Ext_E}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_E}
                                    (5776)
          \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
          \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
                                    (128)
          \p{Block: CJK_Strokes}  (48)
          \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
          \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
                                    (64)
          \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992)
          \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
                                    CJKExtA}) (6592)
          \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
                                    CJKExtB}) (42_720)
          \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
                                    CJKExtC}) (4160)
          \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
                                    CJKExtD}) (224)
          \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
                                    CJKExtE}) (5776)
          \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
                                    Diacriticals}) (112)
          \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
                                    DiacriticalsExt}) (80)
          \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
                                    DiacriticalsForSymbols}) (48)
          \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
                                    DiacriticalsSup}) (64)
          \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16)
          \p{Block: Combining_Marks_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (48)
          \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
                                    IndicNumberForms}) (16)
          \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
          \p{Block: Control_Pictures} (64)
          \p{Block: Coptic}       (NOT \p{Coptic} NOR \p{Is_Coptic}) (128)
          \p{Block: Coptic_Epact_Numbers} (32)
          \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
          \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32)
          \p{Block: Cuneiform}    (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
                                    (1024)
          \p{Block: Cuneiform_Numbers} \p{Block=
                                    Cuneiform_Numbers_And_Punctuation} (128)
          \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
                                    CuneiformNumbers}) (128)
          \p{Block: Currency_Symbols} (48)
          \p{Block: Cypriot_Syllabary} (64)
          \p{Block: Cyrillic}     (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
                                    (256)
          \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
          \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
          \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
          \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32)
          \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96)
          \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16)
          \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
          \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48)
          \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
                                    (48)
          \p{Block: Deseret}      (80)
          \p{Block: Devanagari}   (NOT \p{Devanagari} NOR \p{Is_Devanagari})
                                    (128)
          \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
          \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32)
          \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
          \p{Block: Diacriticals_Ext} \p{Block=
                                    Combining_Diacritical_Marks_Extended}
                                    (80)
          \p{Block: Diacriticals_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (48)
          \p{Block: Diacriticals_Sup} \p{Block=
                                    Combining_Diacritical_Marks_Supplement}
                                    (64)
          \p{Block: Dingbats}     (192)
          \p{Block: Domino}       \p{Block=Domino_Tiles} (112)
          \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112)
          \p{Block: Duployan}     (NOT \p{Duployan} NOR \p{Is_Duployan})
                                    (160)
          \p{Block: Early_Dynastic_Cuneiform} (208)
          \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
                                    \p{Is_Egyptian_Hieroglyphs}) (1072)
          \p{Block: Elbasan}      (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48)
          \p{Block: Emoticons}    (80)
          \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
          \p{Block: Enclosed_Alphanum_Sup} \p{Block=
                                    Enclosed_Alphanumeric_Supplement} (256)
          \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
                                    EnclosedAlphanumSup}) (256)
          \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
                                    EnclosedAlphanum}) (160)
          \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
                                    (256)
          \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
                                    EnclosedCJK}) (256)
          \p{Block: Enclosed_Ideographic_Sup} \p{Block=
                                    Enclosed_Ideographic_Supplement} (256)
          \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
                                    EnclosedIdeographicSup}) (256)
          \p{Block: Ethiopic}     (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
                                    (384)
          \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
          \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
          \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96)
          \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48)
          \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
          \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32)
          \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
                                    \p{Punct} NOR \p{Is_Punctuation}) (112)
          \p{Block: Geometric_Shapes} (96)
          \p{Block: Geometric_Shapes_Ext} \p{Block=
                                    Geometric_Shapes_Extended} (128)
          \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
                                    GeometricShapesExt}) (128)
          \p{Block: Georgian}     (NOT \p{Georgian} NOR \p{Is_Georgian}) (96)
          \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
          \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48)
          \p{Block: Glagolitic}   (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
                                    (96)
          \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
          \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
                                    (48)
          \p{Block: Gothic}       (NOT \p{Gothic} NOR \p{Is_Gothic}) (32)
          \p{Block: Grantha}      (NOT \p{Grantha} NOR \p{Is_Grantha}) (128)
          \p{Block: Greek}        \p{Block=Greek_And_Coptic} (NOT \p{Greek}
                                    NOR \p{Is_Greek}) (144)
          \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
                                    NOR \p{Is_Greek}) (144)
          \p{Block: Greek_Ext}    \p{Block=Greek_Extended} (256)
          \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256)
          \p{Block: Gujarati}     (NOT \p{Gujarati} NOR \p{Is_Gujarati})
                                    (128)
          \p{Block: Gurmukhi}     (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
                                    (128)
          \p{Block: Half_And_Full_Forms} \p{Block=
                                    Halfwidth_And_Fullwidth_Forms} (240)
          \p{Block: Half_Marks}   \p{Block=Combining_Half_Marks} (16)
          \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
                                    HalfAndFullForms}) (240)
          \p{Block: Hangul}       \p{Block=Hangul_Syllables} (NOT \p{Hangul}
                                    NOR \p{Is_Hangul}) (11_184)
          \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
                                    (96)
          \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}) (256)
          \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32)
          \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80)
          \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
                                    NOR \p{Is_Hangul}) (11_184)
          \p{Block: Hanunoo}      (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32)
          \p{Block: Hatran}       (NOT \p{Hatran} NOR \p{Is_Hatran}) (32)
          \p{Block: Hebrew}       (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112)
          \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
                                    HighPUSurrogates}) (128)
          \p{Block: High_PU_Surrogates} \p{Block=
                                    High_Private_Use_Surrogates} (128)
          \p{Block: High_Surrogates} (896)
          \p{Block: Hiragana}     (NOT \p{Hiragana} NOR \p{Is_Hiragana}) (96)
          \p{Block: IDC}          \p{Block=
                                    Ideographic_Description_Characters} (NOT
                                    \p{ID_Continue} NOR \p{Is_IDC}) (16)
          \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
                                    NOT \p{ID_Continue} NOR \p{Is_IDC}) (16)
          \p{Block: Ideographic_Symbols} \p{Block=
                                    Ideographic_Symbols_And_Punctuation} (32)
          \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
                                    IdeographicSymbols}) (32)
          \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
                                    \p{Is_Imperial_Aramaic}) (32)
          \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
                                    (16)
          \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
                                    NOR \p{Is_Inscriptional_Pahlavi}) (32)
          \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
                                    NOR \p{Is_Inscriptional_Parthian}) (32)
          \p{Block: IPA_Ext}      \p{Block=IPA_Extensions} (96)
          \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96)
          \p{Block: Jamo}         \p{Block=Hangul_Jamo} (256)
          \p{Block: Jamo_Ext_A}   \p{Block=Hangul_Jamo_Extended_A} (32)
          \p{Block: Jamo_Ext_B}   \p{Block=Hangul_Jamo_Extended_B} (80)
          \p{Block: Javanese}     (NOT \p{Javanese} NOR \p{Is_Javanese}) (96)
          \p{Block: Kaithi}       (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80)
          \p{Block: Kana_Sup}     \p{Block=Kana_Supplement} (256)
          \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256)
          \p{Block: Kanbun}       (16)
          \p{Block: Kangxi}       \p{Block=Kangxi_Radicals} (224)
          \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224)
          \p{Block: Kannada}      (NOT \p{Kannada} NOR \p{Is_Kannada}) (128)
          \p{Block: Katakana}     (NOT \p{Katakana} NOR \p{Is_Katakana}) (96)
          \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
          \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
                                    KatakanaExt}) (16)
          \p{Block: Kayah_Li}     (48)
          \p{Block: Kharoshthi}   (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
                                    (96)
          \p{Block: Khmer}        (NOT \p{Khmer} NOR \p{Is_Khmer}) (128)
          \p{Block: Khmer_Symbols} (32)
          \p{Block: Khojki}       (NOT \p{Khojki} NOR \p{Is_Khojki}) (80)
          \p{Block: Khudawadi}    (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
                                    (80)
          \p{Block: Lao}          (NOT \p{Lao} NOR \p{Is_Lao}) (128)
          \p{Block: Latin_1}      \p{Block=Latin_1_Supplement} (128)
          \p{Block: Latin_1_Sup}  \p{Block=Latin_1_Supplement} (128)
          \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128)
          \p{Block: Latin_Ext_A}  \p{Block=Latin_Extended_A} (128)
          \p{Block: Latin_Ext_Additional} \p{Block=
                                    Latin_Extended_Additional} (256)
          \p{Block: Latin_Ext_B}  \p{Block=Latin_Extended_B} (208)
          \p{Block: Latin_Ext_C}  \p{Block=Latin_Extended_C} (32)
          \p{Block: Latin_Ext_D}  \p{Block=Latin_Extended_D} (224)
          \p{Block: Latin_Ext_E}  \p{Block=Latin_Extended_E} (64)
          \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128)
          \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
                                    LatinExtAdditional}) (256)
          \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208)
          \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32)
          \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224)
          \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64)
          \p{Block: Lepcha}       (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80)
          \p{Block: Letterlike_Symbols} (80)
          \p{Block: Limbu}        (NOT \p{Limbu} NOR \p{Is_Limbu}) (80)
          \p{Block: Linear_A}     (NOT \p{Linear_A} NOR \p{Is_Linear_A})
                                    (384)
          \p{Block: Linear_B_Ideograms} (128)
          \p{Block: Linear_B_Syllabary} (128)
          \p{Block: Lisu}         (48)
          \p{Block: Low_Surrogates} (1024)
          \p{Block: Lycian}       (NOT \p{Lycian} NOR \p{Is_Lycian}) (32)
          \p{Block: Lydian}       (NOT \p{Lydian} NOR \p{Is_Lydian}) (32)
          \p{Block: Mahajani}     (NOT \p{Mahajani} NOR \p{Is_Mahajani}) (48)
          \p{Block: Mahjong}      \p{Block=Mahjong_Tiles} (48)
          \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48)
          \p{Block: Malayalam}    (NOT \p{Malayalam} NOR \p{Is_Malayalam})
                                    (128)
          \p{Block: Mandaic}      (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32)
          \p{Block: Manichaean}   (NOT \p{Manichaean} NOR \p{Is_Manichaean})
                                    (64)
          \p{Block: Marchen}      (NOT \p{Marchen} NOR \p{Is_Marchen}) (80)
          \p{Block: Math_Alphanum} \p{Block=
                                    Mathematical_Alphanumeric_Symbols} (1024)
          \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
          \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
                                    MathAlphanum}) (1024)
          \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
                                    (256)
          \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
                                    \p{Is_Meetei_Mayek}) (64)
          \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
          \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
                                    (32)
          \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
                                    \p{Is_Mende_Kikakui}) (224)
          \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
                                    \p{Is_Meroitic_Cursive}) (96)
          \p{Block: Meroitic_Hieroglyphs} (32)
          \p{Block: Miao}         (NOT \p{Miao} NOR \p{Is_Miao}) (160)
          \p{Block: Misc_Arrows}  \p{Block=Miscellaneous_Symbols_And_Arrows}
                                    (256)
          \p{Block: Misc_Math_Symbols_A} \p{Block=
                                    Miscellaneous_Mathematical_Symbols_A}
                                    (48)
          \p{Block: Misc_Math_Symbols_B} \p{Block=
                                    Miscellaneous_Mathematical_Symbols_B}
                                    (128)
          \p{Block: Misc_Pictographs} \p{Block=
                                    Miscellaneous_Symbols_And_Pictographs}
                                    (768)
          \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
          \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
          \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
                                    MiscMathSymbolsA}) (48)
          \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
                                    MiscMathSymbolsB}) (128)
          \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols}) (256)
          \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
                                    MiscArrows}) (256)
          \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
                                    MiscPictographs}) (768)
          \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
                                    (256)
          \p{Block: Modi}         (NOT \p{Modi} NOR \p{Is_Modi}) (96)
          \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
          \p{Block: Modifier_Tone_Letters} (32)
          \p{Block: Mongolian}    (NOT \p{Mongolian} NOR \p{Is_Mongolian})
                                    (176)
          \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
          \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32)
          \p{Block: Mro}          (NOT \p{Mro} NOR \p{Is_Mro}) (48)
          \p{Block: Multani}      (NOT \p{Multani} NOR \p{Is_Multani}) (48)
          \p{Block: Music}        \p{Block=Musical_Symbols} (256)
          \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256)
          \p{Block: Myanmar}      (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160)
          \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
          \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
          \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32)
          \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32)
          \p{Block: Nabataean}    (NOT \p{Nabataean} NOR \p{Is_Nabataean})
                                    (48)
          \p{Block: NB}           \p{Block=No_Block} (842_320 plus all
                                    above-Unicode code points)
          \p{Block: New_Tai_Lue}  (NOT \p{New_Tai_Lue} NOR
                                    \p{Is_New_Tai_Lue}) (96)
          \p{Block: Newa}         (NOT \p{Newa} NOR \p{Is_Newa}) (128)
          \p{Block: NKo}          (NOT \p{Nko} NOR \p{Is_NKo}) (64)
          \p{Block: No_Block}     (Short: \p{Blk=NB}) (842_320 plus all
                                    above-Unicode code points)
929          \p{Block: Number_Forms} (64)
930          \p{Block: OCR}          \p{Block=Optical_Character_Recognition}
931                                    (32)
932          \p{Block: Ogham}        (NOT \p{Ogham} NOR \p{Is_Ogham}) (32)
933          \p{Block: Ol_Chiki}     (48)
934          \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
935                                    \p{Is_Old_Hungarian}) (128)
936          \p{Block: Old_Italic}   (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
937                                    (48)
938          \p{Block: Old_North_Arabian} (32)
939          \p{Block: Old_Permic}   (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
940                                    (48)
941          \p{Block: Old_Persian}  (NOT \p{Old_Persian} NOR
942                                    \p{Is_Old_Persian}) (64)
943          \p{Block: Old_South_Arabian} (32)
944          \p{Block: Old_Turkic}   (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
945                                    (80)
946          \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32)
947          \p{Block: Oriya}        (NOT \p{Oriya} NOR \p{Is_Oriya}) (128)
948          \p{Block: Ornamental_Dingbats} (48)
949          \p{Block: Osage}        (NOT \p{Osage} NOR \p{Is_Osage}) (80)
950          \p{Block: Osmanya}      (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48)
951          \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
952                                    \p{Is_Pahawh_Hmong}) (144)
953          \p{Block: Palmyrene}    (32)
954          \p{Block: Pau_Cin_Hau}  (NOT \p{Pau_Cin_Hau} NOR
955                                    \p{Is_Pau_Cin_Hau}) (64)
956          \p{Block: Phags_Pa}     (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa}) (64)
957          \p{Block: Phaistos}     \p{Block=Phaistos_Disc} (48)
958          \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48)
          \p{Block: Phoenician}   (NOT \p{Phoenician} NOR \p{Is_Phoenician})
                                    (32)
          \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
          \p{Block: Phonetic_Ext_Sup} \p{Block=
                                    Phonetic_Extensions_Supplement} (64)
          \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128)
          \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
                                    PhoneticExtSup}) (64)
          \p{Block: Playing_Cards} (96)
          \p{Block: Private_Use}  \p{Block=Private_Use_Area} (NOT
                                    \p{Private_Use} NOR \p{Is_Private_Use})
                                    (6400)
          \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
                                    \p{Private_Use} NOR \p{Is_Private_Use})
                                    (6400)
          \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
                                    \p{Is_Psalter_Pahlavi}) (48)
          \p{Block: PUA}          \p{Block=Private_Use_Area} (NOT
                                    \p{Private_Use} NOR \p{Is_Private_Use})
                                    (6400)
          \p{Block: Punctuation}  \p{Block=General_Punctuation} (NOT
                                    \p{Punct} NOR \p{Is_Punctuation}) (112)
          \p{Block: Rejang}       (NOT \p{Rejang} NOR \p{Is_Rejang}) (48)
          \p{Block: Rumi}         \p{Block=Rumi_Numeral_Symbols} (32)
          \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32)
          \p{Block: Runic}        (NOT \p{Runic} NOR \p{Is_Runic}) (96)
          \p{Block: Samaritan}    (NOT \p{Samaritan} NOR \p{Is_Samaritan})
                                    (64)
          \p{Block: Saurashtra}   (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
                                    (96)
          \p{Block: Sharada}      (NOT \p{Sharada} NOR \p{Is_Sharada}) (96)
          \p{Block: Shavian}      (48)
          \p{Block: Shorthand_Format_Controls} (16)
          \p{Block: Siddham}      (NOT \p{Siddham} NOR \p{Is_Siddham}) (128)
          \p{Block: Sinhala}      (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128)
          \p{Block: Sinhala_Archaic_Numbers} (32)
          \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32)
          \p{Block: Small_Forms}  \p{Block=Small_Form_Variants} (32)
          \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
                                    \p{Is_Sora_Sompeng}) (48)
          \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
                                    ModifierLetters}) (80)
          \p{Block: Specials}     (16)
          \p{Block: Sundanese}    (NOT \p{Sundanese} NOR \p{Is_Sundanese})
                                    (64)
          \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
          \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16)
          \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
          \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
          \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
          \p{Block: Sup_Math_Operators} \p{Block=
                                    Supplemental_Mathematical_Operators}
                                    (256)
          \p{Block: Sup_PUA_A}    \p{Block=Supplementary_Private_Use_Area_A}
                                    (65_536)
          \p{Block: Sup_PUA_B}    \p{Block=Supplementary_Private_Use_Area_B}
                                    (65_536)
          \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
          \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
                                    Supplemental_Symbols_And_Pictographs}
                                    (256)
          \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
          \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
                                    SuperAndSub}) (48)
          \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16)
          \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128)
          \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256)
          \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
                                    SupMathOperators}) (256)
          \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
                                    SupPunctuation}) (128)
          \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
                                    SupSymbolsAndPictographs}) (256)
          \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
                                    SupPUAA}) (65_536)
          \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
                                    SupPUAB}) (65_536)
          \p{Block: Sutton_SignWriting} (688)
          \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
                                    \p{Is_Syloti_Nagri}) (48)
          \p{Block: Syriac}       (NOT \p{Syriac} NOR \p{Is_Syriac}) (80)
          \p{Block: Tagalog}      (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32)
          \p{Block: Tagbanwa}     (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa}) (32)
          \p{Block: Tags}         (128)
          \p{Block: Tai_Le}       (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48)
          \p{Block: Tai_Tham}     (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
                                    (144)
          \p{Block: Tai_Viet}     (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet}) (96)
          \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
          \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96)
          \p{Block: Takri}        (NOT \p{Takri} NOR \p{Is_Takri}) (80)
          \p{Block: Tamil}        (NOT \p{Tamil} NOR \p{Is_Tamil}) (128)
          \p{Block: Tangut}       (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144)
          \p{Block: Tangut_Components} (768)
          \p{Block: Telugu}       (NOT \p{Telugu} NOR \p{Is_Telugu}) (128)
          \p{Block: Thaana}       (NOT \p{Thaana} NOR \p{Is_Thaana}) (64)
          \p{Block: Thai}         (NOT \p{Thai} NOR \p{Is_Thai}) (128)
          \p{Block: Tibetan}      (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256)
          \p{Block: Tifinagh}     (NOT \p{Tifinagh} NOR \p{Is_Tifinagh}) (80)
          \p{Block: Tirhuta}      (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96)
          \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
                                    (128)
          \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
                                    TransportAndMap}) (128)
          \p{Block: UCAS}         \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics}
                                    (640)
          \p{Block: UCAS_Ext}     \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended} (80)
          \p{Block: Ugaritic}     (NOT \p{Ugaritic} NOR \p{Is_Ugaritic}) (32)
          \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
                                    UCAS}) (640)
          \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
                                    \p{Blk=UCASExt}) (80)
          \p{Block: Vai}          (NOT \p{Vai} NOR \p{Is_Vai}) (320)
          \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
                                    \p{Variation_Selector} NOR \p{Is_VS})
                                    (16)
          \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
                                    (240)
          \p{Block: Vedic_Ext}    \p{Block=Vedic_Extensions} (48)
          \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48)
          \p{Block: Vertical_Forms} (16)
          \p{Block: VS}           \p{Block=Variation_Selectors} (NOT
                                    \p{Variation_Selector} NOR \p{Is_VS})
                                    (16)
          \p{Block: VS_Sup}       \p{Block=Variation_Selectors_Supplement}
                                    (240)
          \p{Block: Warang_Citi}  (NOT \p{Warang_Citi} NOR
                                    \p{Is_Warang_Citi}) (96)
          \p{Block: Yi_Radicals}  (64)
          \p{Block: Yi_Syllables} (1168)
          \p{Block: Yijing}       \p{Block=Yijing_Hexagram_Symbols} (64)
          \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64)
        X \p{Block_Elements}      \p{Block=Block_Elements} (32)
          \p{Bopo}                \p{Bopomofo} (= \p{Script_Extensions=
                                    Bopomofo}) (NOT \p{Block=Bopomofo}) (110)
          \p{Bopomofo}            \p{Script_Extensions=Bopomofo} (Short:
                                    \p{Bopo}; NOT \p{Block=Bopomofo}) (110)
        X \p{Bopomofo_Ext}        \p{Bopomofo_Extended} (= \p{Block=
                                    Bopomofo_Extended}) (32)
        X \p{Bopomofo_Extended}   \p{Block=Bopomofo_Extended} (Short:
                                    \p{InBopomofoExt}) (32)
        X \p{Box_Drawing}         \p{Block=Box_Drawing} (128)
          \p{Bpt: *}              \p{Bidi_Paired_Bracket_Type: *}
          \p{Brah}                \p{Brahmi} (= \p{Script_Extensions=
                                    Brahmi}) (NOT \p{Block=Brahmi}) (109)
          \p{Brahmi}              \p{Script_Extensions=Brahmi} (Short:
                                    \p{Brah}; NOT \p{Block=Brahmi}) (109)
          \p{Brai}                \p{Braille} (= \p{Script_Extensions=
                                    Braille}) (256)
          \p{Braille}             \p{Script_Extensions=Braille} (Short:
                                    \p{Brai}) (256)
        X \p{Braille_Patterns}    \p{Block=Braille_Patterns} (Short:
                                    \p{InBraille}) (256)
          \p{Bugi}                \p{Buginese} (= \p{Script_Extensions=
                                    Buginese}) (NOT \p{Block=Buginese}) (31)
          \p{Buginese}            \p{Script_Extensions=Buginese} (Short:
                                    \p{Bugi}; NOT \p{Block=Buginese}) (31)
          \p{Buhd}                \p{Buhid} (= \p{Script_Extensions=Buhid})
                                    (NOT \p{Block=Buhid}) (22)
          \p{Buhid}               \p{Script_Extensions=Buhid} (Short:
                                    \p{Buhd}; NOT \p{Block=Buhid}) (22)
        X \p{Byzantine_Music}     \p{Byzantine_Musical_Symbols} (= \p{Block=
                                    Byzantine_Musical_Symbols}) (256)
        X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
                                    (Short: \p{InByzantineMusic}) (256)
          \p{C} \pC               \p{Other} (= \p{General_Category=Other})
                                    (986_091 plus all above-Unicode code
                                    points)
          \p{Cakm}                \p{Chakma} (= \p{Script_Extensions=
                                    Chakma}) (NOT \p{Block=Chakma}) (87)
          \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
                                    (Short: \p{Cans}) (710)
        X \p{Canadian_Syllabics}  \p{Unified_Canadian_Aboriginal_Syllabics}
                                    (= \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics})
                                    (640)
        T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
                                    Not_Reordered} (1_113_298 plus all
                                    above-Unicode code points)
        T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
                                    Overlay} (32)
        T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
                                    Nukta} (22)
        T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
                                    Kana_Voicing} (2)
        T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
                                    Virama} (47)
        T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
                                    CCC10} (1)
        T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
                                    CCC11} (1)
        T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
                                    CCC12} (1)
        T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
                                    CCC13} (1)
        T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
                                    CCC14} (1)
        T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
                                    CCC15} (1)
        T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
                                    CCC16} (1)
        T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
                                    CCC17} (1)
        T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
                                    CCC18} (2)
        T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
                                    CCC19} (2)
        T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
                                    CCC20} (1)
        T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
                                    CCC21} (1)
        T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
                                    CCC22} (1)
        T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
                                    CCC23} (1)
        T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
                                    CCC24} (1)
        T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
                                    CCC25} (1)
        T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
                                    CCC26} (1)
        T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
                                    CCC27} (2)
        T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
                                    CCC28} (2)
        T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
                                    CCC29} (2)
        T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
                                    CCC30} (2)
        T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
                                    CCC31} (2)
        T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
                                    CCC32} (2)
        T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
                                    CCC33} (1)
        T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
                                    CCC34} (1)
        T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
                                    CCC35} (1)
        T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
                                    CCC36} (1)
        T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
                                    CCC84} (1)
        T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
                                    CCC91} (1)
        T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
                                    CCC103} (2)
        T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
                                    CCC107} (4)
        T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
                                    CCC118} (2)
        T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
                                    CCC122} (4)
        T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
                                    CCC129} (1)
        T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
                                    CCC130} (6)
        T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
                                    CCC132} (1)
        T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
                                    CCC133} (0)
        T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
                                    Attached_Below_Left} (0)
        T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
                                    Attached_Below} (5)
        T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
                                    Attached_Above} (1)
        T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
                                    Attached_Above_Right} (9)
        T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
                                    Below_Left} (1)
        T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
                                    Below} (153)
        T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
                                    Below_Right} (4)
        T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
                                    Left} (2)
        T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
                                    Right} (1)
        T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
                                    Above_Left} (3)
        T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
                                    Above} (461)
        T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
                                    Above_Right} (4)
        T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
                                    Double_Below} (4)
        T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
                                    Double_Above} (5)
        T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
                                    Iota_Subscript} (1)
          \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
                                    Above} (461)
          \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (461)
          \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
          \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
          \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
                                    Above_Left} (3)
          \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
                                    Above_Right} (4)
          \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
                                    Attached_Above} (1)
          \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
                                    Attached_Above_Right} (9)
          \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
                                    Attached_Below} (5)
          \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
                                    Attached_Below_Left} (0)
          \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
                                    (1)
          \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
                                    \p{Ccc=ATAR}) (9)
          \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
                                    (5)
          \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
                                    ATBL}) (0)
          \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
                                    Below} (153)
          \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (153)
          \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
          \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
          \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
                                    Below_Left} (1)
          \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
                                    Below_Right} (4)
          \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1)
          \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2)
          \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4)
          \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1)
          \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2)
          \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1)
          \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4)
          \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1)
          \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1)
          \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6)
          \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1)
          \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
          \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1)
          \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1)
          \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1)
          \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1)
          \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2)
          \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2)
          \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1)
          \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1)
          \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1)
          \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1)
          \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1)
          \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1)
          \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1)
          \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2)
          \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2)
          \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2)
          \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2)
          \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2)
          \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2)
          \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1)
          \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1)
          \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1)
          \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1)
          \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1)
          \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1)
          \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
                                    Double_Above} (5)
          \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
                                    Double_Below} (4)
          \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
          \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (4)
          \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
                                    (1)
          \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
                                    Iota_Subscript} (1)
          \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
          \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
                                    Kana_Voicing} (2)
          \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
                                    Left} (2)
          \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
          \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
                                    Nukta} (22)
          \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
                                    (1_113_298 plus all above-Unicode code
                                    points)
          \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
                                    Not_Reordered} (1_113_298 plus all
                                    above-Unicode code points)
          \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (22)
          \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
                                    Overlay} (32)
          \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32)
          \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
                                    Right} (1)
          \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
          \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (47)
          \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
                                    Virama} (47)
          \p{Cans}                \p{Canadian_Aboriginal} (=
                                    \p{Script_Extensions=
                                    Canadian_Aboriginal}) (710)
          \p{Cari}                \p{Carian} (= \p{Script_Extensions=
                                    Carian}) (NOT \p{Block=Carian}) (49)
          \p{Carian}              \p{Script_Extensions=Carian} (Short:
                                    \p{Cari}; NOT \p{Block=Carian}) (49)
          \p{Case_Ignorable}      \p{Case_Ignorable=Y} (Short: \p{CI}) (2240)
          \p{Case_Ignorable: N*}  (Short: \p{CI=N}, \P{CI}) (1_111_872 plus
                                    all above-Unicode code points)
          \p{Case_Ignorable: Y*}  (Short: \p{CI=Y}, \p{CI}) (2240)
          \p{Cased}               \p{Cased=Y} (4105)
          \p{Cased: N*}           (Single: \P{Cased}) (1_110_007 plus all
                                    above-Unicode code points)
          \p{Cased: Y*}           (Single: \p{Cased}) (4105)
          \p{Cased_Letter}        \p{General_Category=Cased_Letter} (Short:
                                    \p{LC}) (3796)
          \p{Category: *}         \p{General_Category: *}
          \p{Caucasian_Albanian}  \p{Script_Extensions=Caucasian_Albanian}
                                    (Short: \p{Aghb}; NOT \p{Block=
                                    Caucasian_Albanian}) (53)
          \p{Cc}                  \p{XPosixCntrl} (= \p{General_Category=
                                    Control}) (65)
          \p{Ccc: *}              \p{Canonical_Combining_Class: *}
          \p{CE}                  \p{Composition_Exclusion} (=
                                    \p{Composition_Exclusion=Y}) (81)
          \p{CE: *}               \p{Composition_Exclusion: *}
          \p{Cf}                  \p{Format} (= \p{General_Category=Format})
                                    (151)
          \p{Chakma}              \p{Script_Extensions=Chakma} (Short:
                                    \p{Cakm}; NOT \p{Block=Chakma}) (87)
          \p{Cham}                \p{Script_Extensions=Cham} (NOT \p{Block=
                                    Cham}) (83)
          \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
                                    \p{CWCF}) (1377)
          \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
                                    (1_112_735 plus all above-Unicode code
                                    points)
          \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
                                    (1377)
          \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
                                    \p{CWCM}) (2669)
          \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
                                    (1_111_443 plus all above-Unicode code
                                    points)
          \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
                                    (2669)
          \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
                                    \p{CWL}) (1304)
          \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
                                    (1_112_808 plus all above-Unicode code
                                    points)
          \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1304)
          \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
                                    Y} (Short: \p{CWKCF}) (10_227)
          \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
                                    \P{CWKCF}) (1_103_885 plus all above-
                                    Unicode code points)
          \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
                                    \p{CWKCF}) (10_227)
          \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
                                    \p{CWT}) (1369)
          \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
                                    (1_112_743 plus all above-Unicode code
                                    points)
          \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1369)
          \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
                                    \p{CWU}) (1396)
          \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
                                    (1_112_716 plus all above-Unicode code
                                    points)
          \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1396)
1430          \p{Cher}                \p{Cherokee} (= \p{Script_Extensions=
1431                                    Cherokee}) (NOT \p{Block=Cherokee}) (172)
1432          \p{Cherokee}            \p{Script_Extensions=Cherokee} (Short:
1433                                    \p{Cher}; NOT \p{Block=Cherokee}) (172)
1434        X \p{Cherokee_Sup}        \p{Cherokee_Supplement} (= \p{Block=
1435                                    Cherokee_Supplement}) (80)
1436        X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
1437                                    \p{InCherokeeSup}) (80)
1438          \p{CI}                  \p{Case_Ignorable} (= \p{Case_Ignorable=
1439                                    Y}) (2240)
1440          \p{CI: *}               \p{Case_Ignorable: *}
1441        X \p{CJK}                 \p{CJK_Unified_Ideographs} (= \p{Block=
1442                                    CJK_Unified_Ideographs}) (20_992)
1443        X \p{CJK_Compat}          \p{CJK_Compatibility} (= \p{Block=
1444                                    CJK_Compatibility}) (256)
1445        X \p{CJK_Compat_Forms}    \p{CJK_Compatibility_Forms} (= \p{Block=
1446                                    CJK_Compatibility_Forms}) (32)
1447        X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
1448                                    \p{Block=CJK_Compatibility_Ideographs})
1449                                    (512)
1450        X \p{CJK_Compat_Ideographs_Sup}
1451                                    \p{CJK_Compatibility_Ideographs_-
1452                                    Supplement} (= \p{Block=
1453                                    CJK_Compatibility_Ideographs_-
1454                                    Supplement}) (544)
1455        X \p{CJK_Compatibility}   \p{Block=CJK_Compatibility} (Short:
1456                                    \p{InCJKCompat}) (256)
1457        X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
1458                                    (Short: \p{InCJKCompatForms}) (32)
1459        X \p{CJK_Compatibility_Ideographs} \p{Block=
1460                                    CJK_Compatibility_Ideographs} (Short:
1461                                    \p{InCJKCompatIdeographs}) (512)
1462        X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
1463                                    CJK_Compatibility_Ideographs_Supplement}
1464                                    (Short: \p{InCJKCompatIdeographsSup})
1465                                    (544)
        X \p{CJK_Ext_A}           \p{CJK_Unified_Ideographs_Extension_A} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_A})
                                    (6592)
        X \p{CJK_Ext_B}           \p{CJK_Unified_Ideographs_Extension_B} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_B})
                                    (42_720)
        X \p{CJK_Ext_C}           \p{CJK_Unified_Ideographs_Extension_C} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_C})
                                    (4160)
        X \p{CJK_Ext_D}           \p{CJK_Unified_Ideographs_Extension_D} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_D})
                                    (224)
        X \p{CJK_Ext_E}           \p{CJK_Unified_Ideographs_Extension_E} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_E})
                                    (5776)
        X \p{CJK_Radicals_Sup}    \p{CJK_Radicals_Supplement} (= \p{Block=
                                    CJK_Radicals_Supplement}) (128)
        X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
                                    (Short: \p{InCJKRadicalsSup}) (128)
        X \p{CJK_Strokes}         \p{Block=CJK_Strokes} (48)
        X \p{CJK_Symbols}         \p{CJK_Symbols_And_Punctuation} (=
                                    \p{Block=CJK_Symbols_And_Punctuation})
                                    (64)
        X \p{CJK_Symbols_And_Punctuation} \p{Block=
                                    CJK_Symbols_And_Punctuation} (Short:
                                    \p{InCJKSymbols}) (64)
        X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
                                    (Short: \p{InCJK}) (20_992)
        X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
                                    CJK_Unified_Ideographs_Extension_A}
                                    (Short: \p{InCJKExtA}) (6592)
        X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
                                    CJK_Unified_Ideographs_Extension_B}
                                    (Short: \p{InCJKExtB}) (42_720)
        X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
                                    CJK_Unified_Ideographs_Extension_C}
                                    (Short: \p{InCJKExtC}) (4160)
        X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
                                    CJK_Unified_Ideographs_Extension_D}
                                    (Short: \p{InCJKExtD}) (224)
        X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
                                    CJK_Unified_Ideographs_Extension_E}
                                    (Short: \p{InCJKExtE}) (5776)
          \p{Close_Punctuation}   \p{General_Category=Close_Punctuation}
                                    (Short: \p{Pe}) (73)
          \p{Cn}                  \p{Unassigned} (= \p{General_Category=
                                    Unassigned}) (846_359 plus all above-
                                    Unicode code points)
          \p{Cntrl}               \p{XPosixCntrl} (= \p{General_Category=
                                    Control}) (65)
          \p{Co}                  \p{Private_Use} (= \p{General_Category=
                                    Private_Use}) (NOT \p{Private_Use_Area})
                                    (137_468)
        X \p{Combining_Diacritical_Marks} \p{Block=
                                    Combining_Diacritical_Marks} (Short:
                                    \p{InDiacriticals}) (112)
        X \p{Combining_Diacritical_Marks_Extended} \p{Block=
                                    Combining_Diacritical_Marks_Extended}
                                    (Short: \p{InDiacriticalsExt}) (80)
        X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (Short: \p{InDiacriticalsForSymbols})
                                    (48)
        X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
                                    Combining_Diacritical_Marks_Supplement}
                                    (Short: \p{InDiacriticalsSup}) (64)
        X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
                                    \p{InHalfMarks}) (16)
          \p{Combining_Mark}      \p{Mark} (= \p{General_Category=Mark})
                                    (2097)
        X \p{Combining_Marks_For_Symbols}
                                    \p{Combining_Diacritical_Marks_For_-
                                    Symbols} (= \p{Block=
                                    Combining_Diacritical_Marks_For_-
                                    Symbols}) (48)
          \p{Common}              \p{Script_Extensions=Common} (Short:
                                    \p{Zyyy}) (6864)
        X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
                                    (Short: \p{InIndicNumberForms}) (16)
          \p{Comp_Ex}             \p{Full_Composition_Exclusion} (=
                                    \p{Full_Composition_Exclusion=Y}) (1120)
          \p{Comp_Ex: *}          \p{Full_Composition_Exclusion: *}
        X \p{Compat_Jamo}         \p{Hangul_Compatibility_Jamo} (= \p{Block=
                                    Hangul_Compatibility_Jamo}) (96)
          \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
                                    \p{CE}) (81)
          \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
                                    plus all above-Unicode code points)
          \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
          \p{Connector_Punctuation} \p{General_Category=
                                    Connector_Punctuation} (Short: \p{Pc})
                                    (10)
          \p{Control}             \p{XPosixCntrl} (= \p{General_Category=
                                    Control}) (65)
        X \p{Control_Pictures}    \p{Block=Control_Pictures} (64)
          \p{Copt}                \p{Coptic} (= \p{Script_Extensions=
                                    Coptic}) (NOT \p{Block=Coptic}) (165)
          \p{Coptic}              \p{Script_Extensions=Coptic} (Short:
                                    \p{Copt}; NOT \p{Block=Coptic}) (165)
        X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
        X \p{Counting_Rod}        \p{Counting_Rod_Numerals} (= \p{Block=
                                    Counting_Rod_Numerals}) (32)
        X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
                                    \p{InCountingRod}) (32)
          \p{Cprt}                \p{Cypriot} (= \p{Script_Extensions=
                                    Cypriot}) (112)
          \p{Cs}                  \p{Surrogate} (= \p{General_Category=
                                    Surrogate}) (2048)
          \p{Cuneiform}           \p{Script_Extensions=Cuneiform} (Short:
                                    \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
        X \p{Cuneiform_Numbers}   \p{Cuneiform_Numbers_And_Punctuation} (=
                                    \p{Block=
                                    Cuneiform_Numbers_And_Punctuation}) (128)
        X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
                                    Cuneiform_Numbers_And_Punctuation}
                                    (Short: \p{InCuneiformNumbers}) (128)
          \p{Currency_Symbol}     \p{General_Category=Currency_Symbol}
                                    (Short: \p{Sc}) (53)
        X \p{Currency_Symbols}    \p{Block=Currency_Symbols} (48)
          \p{CWCF}                \p{Changes_When_Casefolded} (=
                                    \p{Changes_When_Casefolded=Y}) (1377)
          \p{CWCF: *}             \p{Changes_When_Casefolded: *}
          \p{CWCM}                \p{Changes_When_Casemapped} (=
                                    \p{Changes_When_Casemapped=Y}) (2669)
          \p{CWCM: *}             \p{Changes_When_Casemapped: *}
          \p{CWKCF}               \p{Changes_When_NFKC_Casefolded} (=
                                    \p{Changes_When_NFKC_Casefolded=Y})
                                    (10_227)
          \p{CWKCF: *}            \p{Changes_When_NFKC_Casefolded: *}
          \p{CWL}                 \p{Changes_When_Lowercased} (=
                                    \p{Changes_When_Lowercased=Y}) (1304)
          \p{CWL: *}              \p{Changes_When_Lowercased: *}
          \p{CWT}                 \p{Changes_When_Titlecased} (=
                                    \p{Changes_When_Titlecased=Y}) (1369)
          \p{CWT: *}              \p{Changes_When_Titlecased: *}
          \p{CWU}                 \p{Changes_When_Uppercased} (=
                                    \p{Changes_When_Uppercased=Y}) (1396)
          \p{CWU: *}              \p{Changes_When_Uppercased: *}
          \p{Cypriot}             \p{Script_Extensions=Cypriot} (Short:
                                    \p{Cprt}) (112)
        X \p{Cypriot_Syllabary}   \p{Block=Cypriot_Syllabary} (64)
          \p{Cyrillic}            \p{Script_Extensions=Cyrillic} (Short:
                                    \p{Cyrl}; NOT \p{Block=Cyrillic}) (446)
        X \p{Cyrillic_Ext_A}      \p{Cyrillic_Extended_A} (= \p{Block=
                                    Cyrillic_Extended_A}) (32)
        X \p{Cyrillic_Ext_B}      \p{Cyrillic_Extended_B} (= \p{Block=
                                    Cyrillic_Extended_B}) (96)
        X \p{Cyrillic_Ext_C}      \p{Cyrillic_Extended_C} (= \p{Block=
                                    Cyrillic_Extended_C}) (16)
        X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
                                    \p{InCyrillicExtA}) (32)
        X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
                                    \p{InCyrillicExtB}) (96)
        X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
                                    \p{InCyrillicExtC}) (16)
        X \p{Cyrillic_Sup}        \p{Cyrillic_Supplement} (= \p{Block=
                                    Cyrillic_Supplement}) (48)
        X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
                                    \p{InCyrillicSup}) (48)
        X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
                                    Cyrillic_Supplement}) (48)
          \p{Cyrl}                \p{Cyrillic} (= \p{Script_Extensions=
                                    Cyrillic}) (NOT \p{Block=Cyrillic}) (446)
          \p{Dash}                \p{Dash=Y} (28)
          \p{Dash: N*}            (Single: \P{Dash}) (1_114_084 plus all
                                    above-Unicode code points)
          \p{Dash: Y*}            (Single: \p{Dash}) (28)
          \p{Dash_Punctuation}    \p{General_Category=Dash_Punctuation}
                                    (Short: \p{Pd}) (24)
          \p{Decimal_Number}      \p{XPosixDigit} (= \p{General_Category=
                                    Decimal_Number}) (580)
          \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
                                    (13_232)
          \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232)
          \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240)
          \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
          \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
          \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
          \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
          \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
          \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184)
          \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
          \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
          \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
          \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
          \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
          \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
          \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
          \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
          \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
          \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
          \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
          \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
          \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
                                    Non_Canonical} (Perl extension) (3662)
          \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
                                    decompositions (Short: \p{Dt=NonCanon})
                                    (Perl extension) (3662)
          \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_218 plus
                                    all above-Unicode code points)
          \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
          \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
          \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (285)
          \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (285)
          \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38)
          \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (152)
          \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (152)
          \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
          \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
          \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
          \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
                                    Y} (Short: \p{DI}) (4173)
          \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
                                    (1_109_939 plus all above-Unicode code
                                    points)
          \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
                                    (4173)
          \p{Dep}                 \p{Deprecated} (= \p{Deprecated=Y}) (15)
          \p{Dep: *}              \p{Deprecated: *}
          \p{Deprecated}          \p{Deprecated=Y} (Short: \p{Dep}) (15)
          \p{Deprecated: N*}      (Short: \p{Dep=N}, \P{Dep}) (1_114_097
                                    plus all above-Unicode code points)
          \p{Deprecated: Y*}      (Short: \p{Dep=Y}, \p{Dep}) (15)
          \p{Deseret}             \p{Script_Extensions=Deseret} (Short:
                                    \p{Dsrt}) (80)
          \p{Deva}                \p{Devanagari} (= \p{Script_Extensions=
                                    Devanagari}) (NOT \p{Block=Devanagari})
                                    (210)
          \p{Devanagari}          \p{Script_Extensions=Devanagari} (Short:
                                    \p{Deva}; NOT \p{Block=Devanagari}) (210)
        X \p{Devanagari_Ext}      \p{Devanagari_Extended} (= \p{Block=
                                    Devanagari_Extended}) (32)
        X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
                                    \p{InDevanagariExt}) (32)
          \p{DI}                  \p{Default_Ignorable_Code_Point} (=
                                    \p{Default_Ignorable_Code_Point=Y})
                                    (4173)
          \p{DI: *}               \p{Default_Ignorable_Code_Point: *}
          \p{Dia}                 \p{Diacritic} (= \p{Diacritic=Y}) (782)
          \p{Dia: *}              \p{Diacritic: *}
          \p{Diacritic}           \p{Diacritic=Y} (Short: \p{Dia}) (782)
          \p{Diacritic: N*}       (Short: \p{Dia=N}, \P{Dia}) (1_113_330
                                    plus all above-Unicode code points)
          \p{Diacritic: Y*}       (Short: \p{Dia=Y}, \p{Dia}) (782)
        X \p{Diacriticals}        \p{Combining_Diacritical_Marks} (=
                                    \p{Block=Combining_Diacritical_Marks})
                                    (112)
        X \p{Diacriticals_Ext}    \p{Combining_Diacritical_Marks_Extended}
                                    (= \p{Block=
                                    Combining_Diacritical_Marks_Extended})
                                    (80)
        X \p{Diacriticals_For_Symbols}
                                    \p{Combining_Diacritical_Marks_For_-
                                    Symbols} (= \p{Block=
                                    Combining_Diacritical_Marks_For_-
                                    Symbols}) (48)
        X \p{Diacriticals_Sup}    \p{Combining_Diacritical_Marks_Supplement}
                                    (= \p{Block=
                                    Combining_Diacritical_Marks_Supplement})
                                    (64)
          \p{Digit}               \p{XPosixDigit} (= \p{General_Category=
                                    Decimal_Number}) (580)
        X \p{Dingbats}            \p{Block=Dingbats} (192)
        X \p{Domino}              \p{Domino_Tiles} (= \p{Block=
                                    Domino_Tiles}) (112)
        X \p{Domino_Tiles}        \p{Block=Domino_Tiles} (Short:
                                    \p{InDomino}) (112)
          \p{Dsrt}                \p{Deseret} (= \p{Script_Extensions=
                                    Deseret}) (80)
          \p{Dt: *}               \p{Decomposition_Type: *}
          \p{Dupl}                \p{Duployan} (= \p{Script_Extensions=
                                    Duployan}) (NOT \p{Block=Duployan}) (147)
          \p{Duployan}            \p{Script_Extensions=Duployan} (Short:
                                    \p{Dupl}; NOT \p{Block=Duployan}) (147)
          \p{Ea: *}               \p{East_Asian_Width: *}
        X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
                                    (208)
          \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
          \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739)
          \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
          \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
          \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
          \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
          \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (794_146 plus
                                    all above-Unicode code points)
          \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
          \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
          \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (794_146 plus all
                                    above-Unicode code points)
          \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (180_889)
          \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (180_889)
          \p{Egyp}                \p{Egyptian_Hieroglyphs} (=
                                    \p{Script_Extensions=
                                    Egyptian_Hieroglyphs}) (NOT \p{Block=
                                    Egyptian_Hieroglyphs}) (1071)
          \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
                                    Egyptian_Hieroglyphs} (Short: \p{Egyp};
                                    NOT \p{Block=Egyptian_Hieroglyphs})
                                    (1071)
          \p{Elba}                \p{Elbasan} (= \p{Script_Extensions=
                                    Elbasan}) (NOT \p{Block=Elbasan}) (40)
          \p{Elbasan}             \p{Script_Extensions=Elbasan} (Short:
                                    \p{Elba}; NOT \p{Block=Elbasan}) (40)
        X \p{Emoticons}           \p{Block=Emoticons} (80)
        X \p{Enclosed_Alphanum}   \p{Enclosed_Alphanumerics} (= \p{Block=
                                    Enclosed_Alphanumerics}) (160)
        X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
                                    \p{Block=
                                    Enclosed_Alphanumeric_Supplement}) (256)
        X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
                                    Enclosed_Alphanumeric_Supplement}
                                    (Short: \p{InEnclosedAlphanumSup}) (256)
        X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
                                    (Short: \p{InEnclosedAlphanum}) (160)
        X \p{Enclosed_CJK}        \p{Enclosed_CJK_Letters_And_Months} (=
                                    \p{Block=
                                    Enclosed_CJK_Letters_And_Months}) (256)
        X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
                                    Enclosed_CJK_Letters_And_Months} (Short:
                                    \p{InEnclosedCJK}) (256)
        X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
                                    (= \p{Block=
                                    Enclosed_Ideographic_Supplement}) (256)
        X \p{Enclosed_Ideographic_Supplement} \p{Block=
                                    Enclosed_Ideographic_Supplement} (Short:
                                    \p{InEnclosedIdeographicSup}) (256)
          \p{Enclosing_Mark}      \p{General_Category=Enclosing_Mark}
                                    (Short: \p{Me}) (13)
          \p{Ethi}                \p{Ethiopic} (= \p{Script_Extensions=
                                    Ethiopic}) (NOT \p{Block=Ethiopic}) (495)
          \p{Ethiopic}            \p{Script_Extensions=Ethiopic} (Short:
                                    \p{Ethi}; NOT \p{Block=Ethiopic}) (495)
        X \p{Ethiopic_Ext}        \p{Ethiopic_Extended} (= \p{Block=
                                    Ethiopic_Extended}) (96)
        X \p{Ethiopic_Ext_A}      \p{Ethiopic_Extended_A} (= \p{Block=
                                    Ethiopic_Extended_A}) (48)
        X \p{Ethiopic_Extended}   \p{Block=Ethiopic_Extended} (Short:
                                    \p{InEthiopicExt}) (96)
        X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
                                    \p{InEthiopicExtA}) (48)
        X \p{Ethiopic_Sup}        \p{Ethiopic_Supplement} (= \p{Block=
                                    Ethiopic_Supplement}) (32)
        X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
                                    \p{InEthiopicSup}) (32)
          \p{Ext}                 \p{Extender} (= \p{Extender=Y}) (42)
          \p{Ext: *}              \p{Extender: *}
          \p{Extender}            \p{Extender=Y} (Short: \p{Ext}) (42)
          \p{Extender: N*}        (Short: \p{Ext=N}, \P{Ext}) (1_114_070
                                    plus all above-Unicode code points)
          \p{Extender: Y*}        (Short: \p{Ext=Y}, \p{Ext}) (42)
          \p{Final_Punctuation}   \p{General_Category=Final_Punctuation}
                                    (Short: \p{Pf}) (10)
          \p{Format}              \p{General_Category=Format} (Short:
                                    \p{Cf}) (151)
          \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
                                    (Short: \p{CompEx}) (1120)
          \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
                                    \P{CompEx}) (1_112_992 plus all above-
                                    Unicode code points)
          \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
                                    \p{CompEx}) (1120)
          \p{Gc: *}               \p{General_Category: *}
          \p{GCB: *}              \p{Grapheme_Cluster_Break: *}
          \p{General_Category: C} \p{General_Category=Other} (986_091 plus
                                    all above-Unicode code points)
          \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
                                    \p{Gc=LC}, \p{LC}) (3796)
          \p{General_Category: Cc} \p{General_Category=Control} (65)
          \p{General_Category: Cf} \p{General_Category=Format} (151)
          \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
                                    (73)
          \p{General_Category: Cn} \p{General_Category=Unassigned} (846_359
                                    plus all above-Unicode code points)
          \p{General_Category: Cntrl} \p{General_Category=Control} (65)
          \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
          \p{General_Category: Combining_Mark} \p{General_Category=Mark}
                                    (2097)
          \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
                                    \p{Pc}) (10)
          \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
          \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
          \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
                                    (53)
          \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
                                    (24)
          \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
                                    (580)
          \p{General_Category: Digit} \p{General_Category=Decimal_Number}
                                    (580)
          \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
                                    (13)
          \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
                                    (10)
          \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (151)
          \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
                                    \p{Pi}) (12)
          \p{General_Category: L} \p{General_Category=Letter} (116_766)
        X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3796)
        X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
                                    the trailing '_' matters in spite of
                                    loose matching rules. (3796)
          \p{General_Category: LC} \p{General_Category=Cased_Letter} (3796)
          \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (116_766)
          \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
                                    (236)
          \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
          \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
                                    (/i= General_Category=Cased_Letter)
                                    (2063)
          \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (249)
          \p{General_Category: Lo} \p{General_Category=Other_Letter}
                                    (112_721)
          \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
                                    /i= General_Category=Cased_Letter) (2063)
          \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
                                    (/i= General_Category=Cased_Letter) (31)
          \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
                                    (/i= General_Category=Cased_Letter)
                                    (1702)
          \p{General_Category: M} \p{General_Category=Mark} (2097)
          \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2097)
          \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948)
          \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (394)
          \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
          \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
                                    (1690)
          \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
                                    (249)
          \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
                                    (121)
          \p{General_Category: N} \p{General_Category=Number} (1492)
          \p{General_Category: Nd} \p{General_Category=Decimal_Number} (580)
          \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
          \p{General_Category: No} \p{General_Category=Other_Number} (676)
          \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
                                    (1690)
          \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1492)
          \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
                                    (75)
          \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (986_091 plus
                                    all above-Unicode code points)
          \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
                                    (112_721)
          \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (676)
          \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
                                    (544)
          \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
                                    (5777)
          \p{General_Category: P} \p{General_Category=Punctuation} (748)
          \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
                                    \p{Zp}) (1)
          \p{General_Category: Pc} \p{General_Category=
                                    Connector_Punctuation} (10)
          \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
          \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
                                    (73)
          \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
                                    (10)
          \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
                                    (12)
          \p{General_Category: Po} \p{General_Category=Other_Punctuation}
                                    (544)
          \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
                                    (137_468)
          \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
          \p{General_Category: Punct} \p{General_Category=Punctuation} (748)
          \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (748)
          \p{General_Category: S} \p{General_Category=Symbol} (6899)
          \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (53)
          \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19)
1941          \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121)
1942          \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
1943          \p{General_Category: So} \p{General_Category=Other_Symbol} (5777)
1944          \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
1945                                    (17)
1946          \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (394)
1947          \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048)
1948          \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (6899)
1949          \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
1950                                    /i= General_Category=Cased_Letter) (31)
1951          \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
1952                                    (846_359 plus all above-Unicode code
1953                                    points)
1954          \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
1955                                    /i= General_Category=Cased_Letter) (1702)
1956          \p{General_Category: Z} \p{General_Category=Separator} (19)
1957          \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
1958          \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
1959                                    (1)
1960          \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
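
       The "/i=" notes in the General_Category entries above mean that
       under case-insensitive matching \p{Lu}, \p{Ll} and \p{Lt} each
       match any cased letter.  The following is a minimal sketch of
       that behavior, not part of the generated table; the helper names
       is_upper_strict and is_upper_fold are hypothetical.

```perl
use strict;
use warnings;

# Hypothetical helpers: test a one-character string against \p{Lu},
# first without and then with the /i modifier.
sub is_upper_strict { return $_[0] =~ /\A\p{Lu}\z/  ? 1 : 0 }
sub is_upper_fold   { return $_[0] =~ /\A\p{Lu}\z/i ? 1 : 0 }

# The long and short spellings name the same property value.
sub is_upper_long {
    return $_[0] =~ /\A\p{General_Category: Uppercase_Letter}\z/ ? 1 : 0;
}

for my $char (qw(A a 1)) {
    printf "%s: strict=%d fold=%d long=%d\n",
        $char, is_upper_strict($char), is_upper_fold($char),
        is_upper_long($char);
}
```

       A digit such as "1" is not a cased letter, so it matches in
       neither mode.
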
        X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
                                    \p{InPunctuation}) (112)
        X \p{Geometric_Shapes}    \p{Block=Geometric_Shapes} (96)
        X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
                                    \p{Block=Geometric_Shapes_Extended})
                                    (128)
        X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
                                    (Short: \p{InGeometricShapesExt}) (128)
          \p{Geor}                \p{Georgian} (= \p{Script_Extensions=
                                    Georgian}) (NOT \p{Block=Georgian}) (129)
          \p{Georgian}            \p{Script_Extensions=Georgian} (Short:
                                    \p{Geor}; NOT \p{Block=Georgian}) (129)
        X \p{Georgian_Sup}        \p{Georgian_Supplement} (= \p{Block=
                                    Georgian_Supplement}) (48)
        X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
                                    \p{InGeorgianSup}) (48)
          \p{Glag}                \p{Glagolitic} (= \p{Script_Extensions=
                                    Glagolitic}) (NOT \p{Block=Glagolitic})
                                    (136)
          \p{Glagolitic}          \p{Script_Extensions=Glagolitic} (Short:
                                    \p{Glag}; NOT \p{Block=Glagolitic}) (136)
        X \p{Glagolitic_Sup}      \p{Glagolitic_Supplement} (= \p{Block=
                                    Glagolitic_Supplement}) (48)
        X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
                                    \p{InGlagoliticSup}) (48)
          \p{Goth}                \p{Gothic} (= \p{Script_Extensions=
                                    Gothic}) (NOT \p{Block=Gothic}) (27)
          \p{Gothic}              \p{Script_Extensions=Gothic} (Short:
                                    \p{Goth}; NOT \p{Block=Gothic}) (27)
          \p{Gr_Base}             \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
                                    (126_288)
          \p{Gr_Base: *}          \p{Grapheme_Base: *}
          \p{Gr_Ext}              \p{Grapheme_Extend} (= \p{Grapheme_Extend=
                                    Y}) (1828)
          \p{Gr_Ext: *}           \p{Grapheme_Extend: *}
          \p{Gran}                \p{Grantha} (= \p{Script_Extensions=
                                    Grantha}) (NOT \p{Block=Grantha}) (113)
          \p{Grantha}             \p{Script_Extensions=Grantha} (Short:
                                    \p{Gran}; NOT \p{Block=Grantha}) (113)
          \p{Graph}               \p{XPosixGraph} (265_621)
          \p{Grapheme_Base}       \p{Grapheme_Base=Y} (Short: \p{GrBase})
                                    (126_288)
          \p{Grapheme_Base: N*}   (Short: \p{GrBase=N}, \P{GrBase}) (987_824
                                    plus all above-Unicode code points)
          \p{Grapheme_Base: Y*}   (Short: \p{GrBase=Y}, \p{GrBase}) (126_288)
          \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
                                    (5925)
          \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (5925)
          \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
          \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (79)
          \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (4)
          \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (5)
          \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
                                    (79)
          \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
                                    E_Base_GAZ} (4)
          \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
                                    E_Modifier} (5)
          \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
                                    (1828)
          \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1828)
          \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
                                    Glue_After_Zwj} (3)
          \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (3)
          \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
          \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
          \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
          \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
          \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_356
                                    plus all above-Unicode code points)
          \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
                                    (13)
          \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (13)
          \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
                                    (26)
          \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
                                    Regional_Indicator} (26)
          \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
                                    SpacingMark} (341)
          \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (341)
          \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
          \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
          \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
                                    (1_094_356 plus all above-Unicode code
                                    points)
          \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1)
          \p{Grapheme_Extend}     \p{Grapheme_Extend=Y} (Short: \p{GrExt})
                                    (1828)
          \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_284
                                    plus all above-Unicode code points)
          \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1828)
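
       The Grapheme_Cluster_Break values listed above can be probed one
       character at a time.  The sketch below is illustrative only: the
       table @GCB_PROBES and the helpers gcb_of and is_single_cluster
       are hypothetical names, only a handful of values are probed, and
       the ZWJ value assumes a Perl built with the Unicode version this
       document describes.

```perl
use strict;
use warnings;

# Hypothetical probe table: a label and a precompiled pattern for a
# few Grapheme_Cluster_Break values (not the complete set).
my @GCB_PROBES = (
    [ CR      => qr/\A\p{GCB=CR}\z/ ],
    [ LF      => qr/\A\p{GCB=LF}\z/ ],
    [ Control => qr/\A\p{GCB=Control}\z/ ],
    [ Extend  => qr/\A\p{GCB=Extend}\z/ ],
    [ ZWJ     => qr/\A\p{GCB=ZWJ}\z/ ],
    [ L       => qr/\A\p{GCB=L}\z/ ],
    [ V       => qr/\A\p{GCB=V}\z/ ],
    [ T       => qr/\A\p{GCB=T}\z/ ],
    [ LV      => qr/\A\p{GCB=LV}\z/ ],
    [ LVT     => qr/\A\p{GCB=LVT}\z/ ],
);

# Return the label of the first probe that matches the character, or
# 'unprobed' when none of the values in the table applies.
sub gcb_of {
    my ($char) = @_;
    for my $probe (@GCB_PROBES) {
        my ($label, $pattern) = @$probe;
        return $label if $char =~ $pattern;
    }
    return 'unprobed';
}

# \X matches one extended grapheme cluster, so a base letter followed
# by a character with GCB=Extend is consumed as a single unit.
sub is_single_cluster { return $_[0] =~ /\A\X\z/ ? 1 : 0 }

for my $code (0x0D, 0x0301, 0x1100, 0xAC00, 0xAC01, 0x41) {
    printf "U+%04X => %s\n", $code, gcb_of(chr $code);
}
printf "e + U+0301 is one cluster: %d\n", is_single_cluster("e\x{0301}");
```
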
          \p{Greek}               \p{Script_Extensions=Greek} (Short:
                                    \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
        X \p{Greek_And_Coptic}    \p{Block=Greek_And_Coptic} (Short:
                                    \p{InGreek}) (144)
        X \p{Greek_Ext}           \p{Greek_Extended} (= \p{Block=
                                    Greek_Extended}) (256)
        X \p{Greek_Extended}      \p{Block=Greek_Extended} (Short:
                                    \p{InGreekExt}) (256)
          \p{Grek}                \p{Greek} (= \p{Script_Extensions=Greek})
                                    (NOT \p{Greek_And_Coptic}) (522)
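
       The "NOT" annotations above flag names that are easy to confuse:
       \p{Greek} is a script match, while \p{Greek_And_Coptic} is a
       block match.  A minimal sketch of the difference follows; the
       helper names are hypothetical and the three sample code points
       were picked for illustration.

```perl
use strict;
use warnings;

# Hypothetical helpers: script membership versus block membership.
sub in_greek_script { return $_[0] =~ /\A\p{Greek}\z/ ? 1 : 0 }
sub in_greek_block {
    return $_[0] =~ /\A\p{Block=Greek_And_Coptic}\z/ ? 1 : 0;
}

# U+03B1 GREEK SMALL LETTER ALPHA: Greek script, Greek and Coptic block
# U+03E2 COPTIC CAPITAL LETTER SHEI: same block, but Coptic script
# U+1F00 GREEK SMALL LETTER ALPHA WITH PSILI: Greek script, but in
#        the Greek Extended block
for my $code (0x03B1, 0x03E2, 0x1F00) {
    my $char = chr $code;
    printf "U+%04X script=%d block=%d\n",
        $code, in_greek_script($char), in_greek_block($char);
}
```
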
          \p{Gujarati}            \p{Script_Extensions=Gujarati} (Short:
                                    \p{Gujr}; NOT \p{Block=Gujarati}) (99)
          \p{Gujr}                \p{Gujarati} (= \p{Script_Extensions=
                                    Gujarati}) (NOT \p{Block=Gujarati}) (99)
          \p{Gurmukhi}            \p{Script_Extensions=Gurmukhi} (Short:
                                    \p{Guru}; NOT \p{Block=Gurmukhi}) (93)
          \p{Guru}                \p{Gurmukhi} (= \p{Script_Extensions=
                                    Gurmukhi}) (NOT \p{Block=Gurmukhi}) (93)
        X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
                                    \p{Block=Halfwidth_And_Fullwidth_Forms})
                                    (240)
        X \p{Half_Marks}          \p{Combining_Half_Marks} (= \p{Block=
                                    Combining_Half_Marks}) (16)
        X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
                                    Halfwidth_And_Fullwidth_Forms} (Short:
                                    \p{InHalfAndFullForms}) (240)
          \p{Han}                 \p{Script_Extensions=Han} (82_013)
          \p{Hang}                \p{Hangul} (= \p{Script_Extensions=
                                    Hangul}) (NOT \p{Hangul_Syllables})
                                    (11_775)
          \p{Hangul}              \p{Script_Extensions=Hangul} (Short:
                                    \p{Hang}; NOT \p{Hangul_Syllables})
                                    (11_775)
        X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
                                    (Short: \p{InCompatJamo}) (96)
        X \p{Hangul_Jamo}         \p{Block=Hangul_Jamo} (Short: \p{InJamo})
                                    (256)
        X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
                                    (Short: \p{InJamoExtA}) (32)
        X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
                                    (Short: \p{InJamoExtB}) (80)
          \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
                                    (125)
          \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125)
          \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
                                    (399)
          \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399)
          \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
                                    LVT_Syllable} (10_773)
          \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
                                    (10_773)
          \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
                                    Not_Applicable} (1_102_583 plus all
                                    above-Unicode code points)
          \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
                                    (1_102_583 plus all above-Unicode code
                                    points)
          \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
                                    (137)
          \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137)
          \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
                                    (95)
          \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95)
        X \p{Hangul_Syllables}    \p{Block=Hangul_Syllables} (Short:
                                    \p{InHangul}) (11_184)
          \p{Hani}                \p{Han} (= \p{Script_Extensions=Han})
                                    (82_013)
          \p{Hano}                \p{Hanunoo} (= \p{Script_Extensions=
                                    Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
          \p{Hanunoo}             \p{Script_Extensions=Hanunoo} (Short:
                                    \p{Hano}; NOT \p{Block=Hanunoo}) (23)
          \p{Hatr}                \p{Hatran} (= \p{Script_Extensions=
                                    Hatran}) (NOT \p{Block=Hatran}) (26)
          \p{Hatran}              \p{Script_Extensions=Hatran} (Short:
                                    \p{Hatr}; NOT \p{Block=Hatran}) (26)
          \p{Hebr}                \p{Hebrew} (= \p{Script_Extensions=
                                    Hebrew}) (NOT \p{Block=Hebrew}) (133)
          \p{Hebrew}              \p{Script_Extensions=Hebrew} (Short:
                                    \p{Hebr}; NOT \p{Block=Hebrew}) (133)
          \p{Hex}                 \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
          \p{Hex: *}              \p{Hex_Digit: *}
          \p{Hex_Digit}           \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
          \p{Hex_Digit: N*}       (Short: \p{Hex=N}, \P{Hex}) (1_114_068
                                    plus all above-Unicode code points)
          \p{Hex_Digit: Y*}       (Short: \p{Hex=Y}, \p{Hex}) (44)
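
       The count of 44 for \p{Hex_Digit} covers more than the 22 ASCII
       hexadecimal digits: the fullwidth forms match as well.  The
       sketch below contrasts it with \p{AHex} (ASCII_Hex_Digit); the
       helper names is_hex and is_ascii_hex are hypothetical.

```perl
use strict;
use warnings;

# Hypothetical helpers: every character must be a hex digit.
sub is_hex       { return $_[0] =~ /\A\p{Hex_Digit}+\z/ ? 1 : 0 }
sub is_ascii_hex { return $_[0] =~ /\A\p{AHex}+\z/      ? 1 : 0 }

my %samples = (
    'ASCII "1aF"'             => "1aF",
    # U+FF21 FULLWIDTH LATIN CAPITAL LETTER A, U+FF11 FULLWIDTH DIGIT ONE
    'fullwidth U+FF21 U+FF11' => "\x{FF21}\x{FF11}",
    'ASCII "g"'               => "g",
);

for my $label (sort keys %samples) {
    printf "%-24s Hex_Digit=%d AHex=%d\n", $label,
        is_hex($samples{$label}), is_ascii_hex($samples{$label});
}
```

       Input validation that must accept only ASCII digits is therefore
       probably better served by \p{AHex} than by \p{Hex}.
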
        X \p{High_Private_Use_Surrogates} \p{Block=
                                    High_Private_Use_Surrogates} (Short:
                                    \p{InHighPUSurrogates}) (128)
        X \p{High_PU_Surrogates}  \p{High_Private_Use_Surrogates} (=
                                    \p{Block=High_Private_Use_Surrogates})
                                    (128)
        X \p{High_Surrogates}     \p{Block=High_Surrogates} (896)
          \p{Hira}                \p{Hiragana} (= \p{Script_Extensions=
                                    Hiragana}) (NOT \p{Block=Hiragana}) (143)
          \p{Hiragana}            \p{Script_Extensions=Hiragana} (Short:
                                    \p{Hira}; NOT \p{Block=Hiragana}) (143)
          \p{Hluw}                \p{Anatolian_Hieroglyphs} (=
                                    \p{Script_Extensions=
                                    Anatolian_Hieroglyphs}) (NOT \p{Block=
                                    Anatolian_Hieroglyphs}) (583)
          \p{Hmng}                \p{Pahawh_Hmong} (= \p{Script_Extensions=
                                    Pahawh_Hmong}) (NOT \p{Block=
                                    Pahawh_Hmong}) (127)
          \p{HorizSpace}          \p{XPosixBlank} (18)
          \p{Hst: *}              \p{Hangul_Syllable_Type: *}
          \p{Hung}                \p{Old_Hungarian} (= \p{Script_Extensions=
                                    Old_Hungarian}) (NOT \p{Block=
                                    Old_Hungarian}) (108)
        D \p{Hyphen}              \p{Hyphen=Y} (11)
        D \p{Hyphen: N*}          Supplanted by Line_Break property values;
                                    see www.unicode.org/reports/tr14
                                    (Single: \P{Hyphen}) (1_114_101 plus all
                                    above-Unicode code points)
        D \p{Hyphen: Y*}          Supplanted by Line_Break property values;
                                    see www.unicode.org/reports/tr14
                                    (Single: \p{Hyphen}) (11)
          \p{ID_Continue}         \p{ID_Continue=Y} (Short: \p{IDC}; NOT
                                    \p{Ideographic_Description_Characters})
                                    (119_691)
          \p{ID_Continue: N*}     (Short: \p{IDC=N}, \P{IDC}) (994_421 plus
                                    all above-Unicode code points)
          \p{ID_Continue: Y*}     (Short: \p{IDC=Y}, \p{IDC}) (119_691)
          \p{ID_Start}            \p{ID_Start=Y} (Short: \p{IDS}) (117_007)
          \p{ID_Start: N*}        (Short: \p{IDS=N}, \P{IDS}) (997_105 plus
                                    all above-Unicode code points)
          \p{ID_Start: Y*}        (Short: \p{IDS=Y}, \p{IDS}) (117_007)
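
       \p{ID_Start} and \p{ID_Continue} are commonly combined to
       recognize identifiers.  The following is a minimal sketch under
       that assumption; the helper name is_identifier is hypothetical,
       and real languages usually add their own rules (for example,
       allowing a leading underscore).

```perl
use strict;
use warnings;

# Hypothetical helper: one ID_Start character followed by any number
# of ID_Continue characters.
sub is_identifier {
    return $_[0] =~ /\A\p{ID_Start}\p{ID_Continue}*\z/ ? 1 : 0;
}

my @candidates = (
    [ 'x_2'          => "x_2" ],
    [ 'caf + U+00E9' => "caf\x{E9}" ],
    [ '_x'           => "_x" ],   # "_" is ID_Continue but not ID_Start
    [ '2x'           => "2x" ],   # digits are ID_Continue only
    [ 'a-b'          => "a-b" ],  # "-" is in neither property
);

for my $candidate (@candidates) {
    my ($label, $string) = @$candidate;
    printf "%-14s %d\n", $label, is_identifier($string);
}
```
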
          \p{IDC}                 \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
                                    \p{Ideographic_Description_Characters})
                                    (119_691)
          \p{IDC: *}              \p{ID_Continue: *}
          \p{Ideo}                \p{Ideographic} (= \p{Ideographic=Y})
                                    (88_284)
          \p{Ideo: *}             \p{Ideographic: *}
          \p{Ideographic}         \p{Ideographic=Y} (Short: \p{Ideo})
                                    (88_284)
          \p{Ideographic: N*}     (Short: \p{Ideo=N}, \P{Ideo}) (1_025_828
                                    plus all above-Unicode code points)
          \p{Ideographic: Y*}     (Short: \p{Ideo=Y}, \p{Ideo}) (88_284)
        X \p{Ideographic_Description_Characters} \p{Block=
                                    Ideographic_Description_Characters}
                                    (Short: \p{InIDC}) (16)
        X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
                                    \p{Block=
                                    Ideographic_Symbols_And_Punctuation})
                                    (32)
        X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
                                    Ideographic_Symbols_And_Punctuation}
                                    (Short: \p{InIdeographicSymbols}) (32)
          \p{IDS}                 \p{ID_Start} (= \p{ID_Start=Y}) (117_007)
          \p{IDS: *}              \p{ID_Start: *}
          \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
                                    \p{IDSB}) (10)
          \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
                                    (1_114_102 plus all above-Unicode code
                                    points)
          \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10)
          \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
                                    \p{IDST}) (2)
          \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
                                    (1_114_110 plus all above-Unicode code
                                    points)
          \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2)
          \p{IDSB}                \p{IDS_Binary_Operator} (=
                                    \p{IDS_Binary_Operator=Y}) (10)
          \p{IDSB: *}             \p{IDS_Binary_Operator: *}
          \p{IDST}                \p{IDS_Trinary_Operator} (=
                                    \p{IDS_Trinary_Operator=Y}) (2)
          \p{IDST: *}             \p{IDS_Trinary_Operator: *}
          \p{Imperial_Aramaic}    \p{Script_Extensions=Imperial_Aramaic}
                                    (Short: \p{Armi}; NOT \p{Block=
                                    Imperial_Aramaic}) (31)
          \p{In: *}               \p{Present_In: *} (Perl extension)
        X \p{In_*}                \p{Block: *}
        X \p{Indic_Number_Forms}  \p{Common_Indic_Number_Forms} (= \p{Block=
                                    Common_Indic_Number_Forms}) (16)
          \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
                                    (300)
          \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
                                    BottomAndRight}) (2)
          \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (57)
          \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
                                    LeftAndRight}) (21)
          \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_113_069
                                    plus all above-Unicode code points)
          \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
                                    Overstruck}) (10)
          \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (258)
          \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (342)
          \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
                                    TopAndBottom}) (10)
          \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
                                    \p{InPC=TopAndBottomAndRight}) (1)
          \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
                                    TopAndLeft}) (6)
          \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
                                    \p{InPC=TopAndLeftAndRight}) (4)
          \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
                                    TopAndRight}) (13)
          \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
                                    VisualOrderLeft}) (19)
          \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
                                    (15)
          \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (67)
          \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
                                    \p{InSC=BrahmiJoiningNumber}) (20)
          \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
                                    CantillationMark}) (53)
          \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
                                    (1907)
          \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
                                    ConsonantDead}) (10)
          \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
                                    ConsonantFinal}) (62)
          \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
                                    \p{InSC=ConsonantHeadLetter}) (5)
          \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
                                    ConsonantKiller}) (2)
          \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
                                    ConsonantMedial}) (22)
          \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
                                    \p{InSC=ConsonantPlaceholder}) (16)
          \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
                                    \p{InSC=ConsonantPrecedingRepha}) (1)
          \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
                                    ConsonantPrefixed}) (2)
          \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
                                    ConsonantSubjoined}) (90)
          \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
                                    \p{InSC=ConsonantSucceedingRepha}) (4)
          \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
                                    \p{InSC=ConsonantWithStacker}) (4)
          \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
                                    GeminationMark}) (2)
          \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
                                    InvisibleStacker}) (7)
          \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1)
          \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
                                    ModifyingLetter}) (1)
          \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
                                    NonJoiner}) (1)
          \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (24)
          \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (459)
          \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
                                    NumberJoiner}) (1)
          \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
                                    (1_110_129 plus all above-Unicode code
                                    points)
          \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
                                    PureKiller}) (16)
          \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
                                    RegisterShifter}) (2)
          \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
                                    SyllableModifier}) (22)
          \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
                                    ToneLetter}) (7)
          \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
                                    (42)
          \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (24)
          \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga}) (31)
          \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30)
          \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
                                    VowelDependent}) (602)
          \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
                                    VowelIndependent}) (431)
          \p{Inherited}           \p{Script_Extensions=Inherited} (Short:
                                    \p{Zinh}) (496)
          \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
                                    (Short: \p{Pi}) (12)
          \p{InPC: *}             \p{Indic_Positional_Category: *}
          \p{InSC: *}             \p{Indic_Syllabic_Category: *}
          \p{Inscriptional_Pahlavi} \p{Script_Extensions=
                                    Inscriptional_Pahlavi} (Short: \p{Phli};
                                    NOT \p{Block=Inscriptional_Pahlavi}) (27)
          \p{Inscriptional_Parthian} \p{Script_Extensions=
                                    Inscriptional_Parthian} (Short:
                                    \p{Prti}; NOT \p{Block=
                                    Inscriptional_Parthian}) (30)
        X \p{IPA_Ext}             \p{IPA_Extensions} (= \p{Block=
                                    IPA_Extensions}) (96)
        X \p{IPA_Extensions}      \p{Block=IPA_Extensions} (Short:
                                    \p{InIPAExt}) (96)
          \p{Is_*}                \p{*} (Any exceptions are individually
                                    noted beginning with the word NOT.) If
                                    an entry has flag(s) at its beginning,
                                    like "D", the "Is_" form has the same
                                    flag(s)
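
       The \p{Is_*} and \p{In_*} prefix forms listed above can be
       compared side by side.  This is an illustrative sketch only: the
       helper names are hypothetical, and Hebrew was chosen because its
       script and its block differ in coverage.

```perl
use strict;
use warnings;

# Hypothetical helpers: "Is_" is a synonym for the plain name (here a
# script match), while "In_" selects the block of the same name.
sub is_hebrew_plain  { return $_[0] =~ /\A\p{Hebrew}\z/        ? 1 : 0 }
sub is_hebrew_prefix { return $_[0] =~ /\A\p{Is_Hebrew}\z/     ? 1 : 0 }
sub in_hebrew_prefix { return $_[0] =~ /\A\p{In_Hebrew}\z/     ? 1 : 0 }
sub in_hebrew_block  { return $_[0] =~ /\A\p{Block: Hebrew}\z/ ? 1 : 0 }

# U+05D0 HEBREW LETTER ALEF: Hebrew script and Hebrew block
# U+FB1D HEBREW LETTER YOD WITH HIRIQ: Hebrew script, but located in
#        the Alphabetic Presentation Forms block
for my $code (0x05D0, 0xFB1D) {
    my $char = chr $code;
    printf "U+%04X Hebrew=%d Is_Hebrew=%d In_Hebrew=%d Block=%d\n",
        $code, is_hebrew_plain($char), is_hebrew_prefix($char),
        in_hebrew_prefix($char), in_hebrew_block($char);
}
```

       Because the In_ form carries the "X" flag in the table, the
       explicit \p{Block: ...} spelling is the less ambiguous choice in
       new code.
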
2338          \p{Ital}                \p{Old_Italic} (= \p{Script_Extensions=
2339                                    Old_Italic}) (NOT \p{Block=Old_Italic})
2340                                    (36)
2341        X \p{Jamo}                \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
2342                                    (256)
2343        X \p{Jamo_Ext_A}          \p{Hangul_Jamo_Extended_A} (= \p{Block=
2344                                    Hangul_Jamo_Extended_A}) (32)
2345        X \p{Jamo_Ext_B}          \p{Hangul_Jamo_Extended_B} (= \p{Block=
2346                                    Hangul_Jamo_Extended_B}) (80)
2347          \p{Java}                \p{Javanese} (= \p{Script_Extensions=
2348                                    Javanese}) (NOT \p{Block=Javanese}) (91)
2349          \p{Javanese}            \p{Script_Extensions=Javanese} (Short:
2350                                    \p{Java}; NOT \p{Block=Javanese}) (91)
2351          \p{Jg: *}               \p{Joining_Group: *}
2352          \p{Join_C}              \p{Join_Control} (= \p{Join_Control=Y}) (2)
2353          \p{Join_C: *}           \p{Join_Control: *}
2354          \p{Join_Control}        \p{Join_Control=Y} (Short: \p{JoinC}) (2)
2355          \p{Join_Control: N*}    (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
2356                                    plus all above-Unicode code points)
2357          \p{Join_Control: Y*}    (Short: \p{JoinC=Y}, \p{JoinC}) (2)
          \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1)
          \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1)
          \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (1)
          \p{Joining_Group: Ain}  (Short: \p{Jg=Ain}) (8)
          \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1)
          \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10)
          \p{Joining_Group: Beh}  (Short: \p{Jg=Beh}) (24)
          \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2)
          \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
                                    BurushaskiYehBarree}) (2)
          \p{Joining_Group: Dal}  (Short: \p{Jg=Dal}) (15)
          \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4)
          \p{Joining_Group: E}    (Short: \p{Jg=E}) (1)
          \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7)
          \p{Joining_Group: Fe}   (Short: \p{Jg=Fe}) (1)
          \p{Joining_Group: Feh}  (Short: \p{Jg=Feh}) (10)
          \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1)
          \p{Joining_Group: Gaf}  (Short: \p{Jg=Gaf}) (14)
          \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3)
          \p{Joining_Group: Hah}  (Short: \p{Jg=Hah}) (18)
          \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
                                    HamzaOnHehGoal}) (1)
          \p{Joining_Group: He}   (Short: \p{Jg=He}) (1)
          \p{Joining_Group: Heh}  (Short: \p{Jg=Heh}) (1)
          \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2)
          \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1)
          \p{Joining_Group: Kaf}  (Short: \p{Jg=Kaf}) (6)
          \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1)
          \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1)
          \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2)
          \p{Joining_Group: Lam}  (Short: \p{Jg=Lam}) (7)
          \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1)
          \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
                                    ManichaeanAleph}) (1)
          \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
                                    (2)
          \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
                                    (2)
          \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
                                    ManichaeanDaleth}) (1)
          \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
                                    ManichaeanDhamedh}) (1)
          \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
                                    (1)
          \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
                                    ManichaeanGimel}) (2)
          \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
                                    (1)
          \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
                                    ManichaeanHundred}) (1)
          \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
                                    (3)
          \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
                                    ManichaeanLamedh}) (1)
          \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem}) (1)
          \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun}) (1)
          \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne}) (1)
          \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2)
          \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
                                    (3)
          \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
                                    (1)
          \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
                                    ManichaeanSadhe}) (1)
          \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
                                    ManichaeanSamekh}) (1)
          \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw}) (1)
          \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen}) (1)
          \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
                                    (1)
          \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
                                    ManichaeanThamedh}) (1)
          \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
                                    ManichaeanTwenty}) (1)
          \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw}) (1)
          \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
                                    (1)
          \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
                                    ManichaeanZayin}) (2)
          \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4)
          \p{Joining_Group: Mim}  (Short: \p{Jg=Mim}) (1)
          \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
                                    (1_113_818 plus all above-Unicode code
                                    points)
          \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8)
          \p{Joining_Group: Nun}  (Short: \p{Jg=Nun}) (1)
          \p{Joining_Group: Nya}  (Short: \p{Jg=Nya}) (1)
          \p{Joining_Group: Pe}   (Short: \p{Jg=Pe}) (1)
          \p{Joining_Group: Qaf}  (Short: \p{Jg=Qaf}) (5)
          \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1)
          \p{Joining_Group: Reh}  (Short: \p{Jg=Reh}) (19)
          \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1)
          \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1)
          \p{Joining_Group: Sad}  (Short: \p{Jg=Sad}) (6)
          \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1)
          \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11)
          \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1)
          \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1)
          \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1)
          \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1)
          \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1)
          \p{Joining_Group: Tah}  (Short: \p{Jg=Tah}) (4)
          \p{Joining_Group: Taw}  (Short: \p{Jg=Taw}) (1)
          \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3)
          \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
                                    Hamza_On_Heh_Goal} (1)
          \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2)
          \p{Joining_Group: Waw}  (Short: \p{Jg=Waw}) (16)
          \p{Joining_Group: Yeh}  (Short: \p{Jg=Yeh}) (11)
          \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2)
          \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1)
          \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1)
          \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1)
          \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1)
          \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1)
          \p{Joining_Type: C}     \p{Joining_Type=Join_Causing} (4)
          \p{Joining_Type: D}     \p{Joining_Type=Dual_Joining} (501)
          \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (501)
          \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4)
          \p{Joining_Type: L}     \p{Joining_Type=Left_Joining} (3)
          \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (3)
          \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_653 plus
                                    all above-Unicode code points)
          \p{Joining_Type: R}     \p{Joining_Type=Right_Joining} (112)
          \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (112)
          \p{Joining_Type: T}     \p{Joining_Type=Transparent} (1839)
          \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1839)
          \p{Joining_Type: U}     \p{Joining_Type=Non_Joining} (1_111_653
                                    plus all above-Unicode code points)
          \p{Jt: *}               \p{Joining_Type: *}
          \p{Kaithi}              \p{Script_Extensions=Kaithi} (Short:
                                    \p{Kthi}; NOT \p{Block=Kaithi}) (86)
          \p{Kali}                \p{Kayah_Li} (= \p{Script_Extensions=
                                    Kayah_Li}) (48)
          \p{Kana}                \p{Katakana} (= \p{Script_Extensions=
                                    Katakana}) (NOT \p{Block=Katakana}) (352)
        X \p{Kana_Sup}            \p{Kana_Supplement} (= \p{Block=
                                    Kana_Supplement}) (256)
        X \p{Kana_Supplement}     \p{Block=Kana_Supplement} (Short:
                                    \p{InKanaSup}) (256)
        X \p{Kanbun}              \p{Block=Kanbun} (16)
        X \p{Kangxi}              \p{Kangxi_Radicals} (= \p{Block=
                                    Kangxi_Radicals}) (224)
        X \p{Kangxi_Radicals}     \p{Block=Kangxi_Radicals} (Short:
                                    \p{InKangxi}) (224)
          \p{Kannada}             \p{Script_Extensions=Kannada} (Short:
                                    \p{Knda}; NOT \p{Block=Kannada}) (100)
          \p{Katakana}            \p{Script_Extensions=Katakana} (Short:
                                    \p{Kana}; NOT \p{Block=Katakana}) (352)
        X \p{Katakana_Ext}        \p{Katakana_Phonetic_Extensions} (=
                                    \p{Block=Katakana_Phonetic_Extensions})
                                    (16)
        X \p{Katakana_Phonetic_Extensions} \p{Block=
                                    Katakana_Phonetic_Extensions} (Short:
                                    \p{InKatakanaExt}) (16)
          \p{Kayah_Li}            \p{Script_Extensions=Kayah_Li} (Short:
                                    \p{Kali}) (48)
          \p{Khar}                \p{Kharoshthi} (= \p{Script_Extensions=
                                    Kharoshthi}) (NOT \p{Block=Kharoshthi})
                                    (65)
          \p{Kharoshthi}          \p{Script_Extensions=Kharoshthi} (Short:
                                    \p{Khar}; NOT \p{Block=Kharoshthi}) (65)
          \p{Khmer}               \p{Script_Extensions=Khmer} (Short:
                                    \p{Khmr}; NOT \p{Block=Khmer}) (146)
        X \p{Khmer_Symbols}       \p{Block=Khmer_Symbols} (32)
          \p{Khmr}                \p{Khmer} (= \p{Script_Extensions=Khmer})
                                    (NOT \p{Block=Khmer}) (146)
          \p{Khoj}                \p{Khojki} (= \p{Script_Extensions=
                                    Khojki}) (NOT \p{Block=Khojki}) (72)
          \p{Khojki}              \p{Script_Extensions=Khojki} (Short:
                                    \p{Khoj}; NOT \p{Block=Khojki}) (72)
          \p{Khudawadi}           \p{Script_Extensions=Khudawadi} (Short:
                                    \p{Sind}; NOT \p{Block=Khudawadi}) (81)
          \p{Knda}                \p{Kannada} (= \p{Script_Extensions=
                                    Kannada}) (NOT \p{Block=Kannada}) (100)
          \p{Kthi}                \p{Kaithi} (= \p{Script_Extensions=
                                    Kaithi}) (NOT \p{Block=Kaithi}) (86)
          \p{L} \pL               \p{Letter} (= \p{General_Category=Letter})
                                    (116_766)
        X \p{L&}                  \p{Cased_Letter} (= \p{General_Category=
                                    Cased_Letter}) (3796)
        X \p{L_}                  \p{Cased_Letter} (= \p{General_Category=
                                    Cased_Letter}) Note the trailing '_'
                                    matters in spite of loose matching
                                    rules. (3796)
          \p{Lana}                \p{Tai_Tham} (= \p{Script_Extensions=
                                    Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
          \p{Lao}                 \p{Script_Extensions=Lao} (NOT \p{Block=
                                    Lao}) (67)
          \p{Laoo}                \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
                                    \p{Block=Lao}) (67)
          \p{Latin}               \p{Script_Extensions=Latin} (Short:
                                    \p{Latn}) (1370)
        X \p{Latin_1}             \p{Latin_1_Supplement} (= \p{Block=
                                    Latin_1_Supplement}) (128)
        X \p{Latin_1_Sup}         \p{Latin_1_Supplement} (= \p{Block=
                                    Latin_1_Supplement}) (128)
        X \p{Latin_1_Supplement}  \p{Block=Latin_1_Supplement} (Short:
                                    \p{InLatin1}) (128)
        X \p{Latin_Ext_A}         \p{Latin_Extended_A} (= \p{Block=
                                    Latin_Extended_A}) (128)
        X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
                                    \p{Block=Latin_Extended_Additional})
                                    (256)
        X \p{Latin_Ext_B}         \p{Latin_Extended_B} (= \p{Block=
                                    Latin_Extended_B}) (208)
        X \p{Latin_Ext_C}         \p{Latin_Extended_C} (= \p{Block=
                                    Latin_Extended_C}) (32)
        X \p{Latin_Ext_D}         \p{Latin_Extended_D} (= \p{Block=
                                    Latin_Extended_D}) (224)
        X \p{Latin_Ext_E}         \p{Latin_Extended_E} (= \p{Block=
                                    Latin_Extended_E}) (64)
        X \p{Latin_Extended_A}    \p{Block=Latin_Extended_A} (Short:
                                    \p{InLatinExtA}) (128)
        X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
                                    (Short: \p{InLatinExtAdditional}) (256)
        X \p{Latin_Extended_B}    \p{Block=Latin_Extended_B} (Short:
                                    \p{InLatinExtB}) (208)
        X \p{Latin_Extended_C}    \p{Block=Latin_Extended_C} (Short:
                                    \p{InLatinExtC}) (32)
        X \p{Latin_Extended_D}    \p{Block=Latin_Extended_D} (Short:
                                    \p{InLatinExtD}) (224)
        X \p{Latin_Extended_E}    \p{Block=Latin_Extended_E} (Short:
                                    \p{InLatinExtE}) (64)
          \p{Latn}                \p{Latin} (= \p{Script_Extensions=Latin})
                                    (1370)
          \p{Lb: *}               \p{Line_Break: *}
          \p{LC}                  \p{Cased_Letter} (= \p{General_Category=
                                    Cased_Letter}) (3796)
          \p{Lepc}                \p{Lepcha} (= \p{Script_Extensions=
                                    Lepcha}) (NOT \p{Block=Lepcha}) (74)
          \p{Lepcha}              \p{Script_Extensions=Lepcha} (Short:
                                    \p{Lepc}; NOT \p{Block=Lepcha}) (74)
          \p{Letter}              \p{General_Category=Letter} (Short: \p{L})
                                    (116_766)
          \p{Letter_Number}       \p{General_Category=Letter_Number} (Short:
                                    \p{Nl}) (236)
        X \p{Letterlike_Symbols}  \p{Block=Letterlike_Symbols} (80)
          \p{Limb}                \p{Limbu} (= \p{Script_Extensions=Limbu})
                                    (NOT \p{Block=Limbu}) (69)
          \p{Limbu}               \p{Script_Extensions=Limbu} (Short:
                                    \p{Limb}; NOT \p{Block=Limbu}) (69)
          \p{Lina}                \p{Linear_A} (= \p{Script_Extensions=
                                    Linear_A}) (NOT \p{Block=Linear_A}) (386)
          \p{Linb}                \p{Linear_B} (= \p{Script_Extensions=
                                    Linear_B}) (268)
          \p{Line_Break: AI}      \p{Line_Break=Ambiguous} (707)
          \p{Line_Break: AL}      \p{Line_Break=Alphabetic} (19_523)
          \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (19_523)
          \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707)
          \p{Line_Break: B2}      \p{Line_Break=Break_Both} (3)
          \p{Line_Break: BA}      \p{Line_Break=Break_After} (218)
          \p{Line_Break: BB}      \p{Line_Break=Break_Before} (37)
          \p{Line_Break: BK}      \p{Line_Break=Mandatory_Break} (4)
          \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (218)
          \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (37)
          \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3)
          \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1)
          \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1)
          \p{Line_Break: CB}      \p{Line_Break=Contingent_Break} (1)
          \p{Line_Break: CJ}      \p{Line_Break=
                                    Conditional_Japanese_Starter} (51)
          \p{Line_Break: CL}      \p{Line_Break=Close_Punctuation} (90)
          \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2)
          \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (90)
          \p{Line_Break: CM}      \p{Line_Break=Combining_Mark} (2090)
          \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2090)
          \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (734)
          \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
                                    (51)
          \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1)
          \p{Line_Break: CP}      \p{Line_Break=Close_Parenthesis} (2)
          \p{Line_Break: CR}      \p{Line_Break=Carriage_Return} (1)
          \p{Line_Break: E_Base}  (Short: \p{Lb=EB}) (83)
          \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5)
          \p{Line_Break: EB}      \p{Line_Break=E_Base} (83)
          \p{Line_Break: EM}      \p{Line_Break=E_Modifier} (5)
          \p{Line_Break: EX}      \p{Line_Break=Exclamation} (37)
          \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37)
          \p{Line_Break: GL}      \p{Line_Break=Glue} (18)
          \p{Line_Break: Glue}    (Short: \p{Lb=GL}) (18)
          \p{Line_Break: H2}      (Short: \p{Lb=H2}) (399)
          \p{Line_Break: H3}      (Short: \p{Lb=H3}) (10_773)
          \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (74)
          \p{Line_Break: HL}      \p{Line_Break=Hebrew_Letter} (74)
          \p{Line_Break: HY}      \p{Line_Break=Hyphen} (1)
          \p{Line_Break: Hyphen}  (Short: \p{Lb=HY}) (1)
          \p{Line_Break: ID}      \p{Line_Break=Ideographic} (172_133)
          \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_133)
          \p{Line_Break: IN}      \p{Line_Break=Inseparable} (6)
          \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13)
          \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6)
          \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
          \p{Line_Break: IS}      \p{Line_Break=Infix_Numeric} (13)
          \p{Line_Break: JL}      (Short: \p{Lb=JL}) (125)
          \p{Line_Break: JT}      (Short: \p{Lb=JT}) (137)
          \p{Line_Break: JV}      (Short: \p{Lb=JV}) (95)
          \p{Line_Break: LF}      \p{Line_Break=Line_Feed} (1)
          \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1)
          \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4)
          \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1)
          \p{Line_Break: NL}      \p{Line_Break=Next_Line} (1)
          \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (30)
          \p{Line_Break: NS}      \p{Line_Break=Nonstarter} (30)
          \p{Line_Break: NU}      \p{Line_Break=Numeric} (572)
          \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (572)
          \p{Line_Break: OP}      \p{Line_Break=Open_Punctuation} (87)
          \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (87)
          \p{Line_Break: PO}      \p{Line_Break=Postfix_Numeric} (30)
          \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (30)
          \p{Line_Break: PR}      \p{Line_Break=Prefix_Numeric} (65)
          \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (65)
          \p{Line_Break: QU}      \p{Line_Break=Quotation} (39)
          \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39)
          \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26)
          \p{Line_Break: RI}      \p{Line_Break=Regional_Indicator} (26)
          \p{Line_Break: SA}      \p{Line_Break=Complex_Context} (734)
        D \p{Line_Break: SG}      \p{Line_Break=Surrogate} (2048)
          \p{Line_Break: SP}      \p{Line_Break=Space} (1)
          \p{Line_Break: Space}   (Short: \p{Lb=SP}) (1)
        D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
                                    should never appear in well-formed text,
                                    and therefore shouldn't be the basis for
                                    line breaking (Short: \p{Lb=SG}) (2048)
          \p{Line_Break: SY}      \p{Line_Break=Break_Symbols} (1)
          \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (903_847 plus all
                                    above-Unicode code points)
          \p{Line_Break: WJ}      \p{Line_Break=Word_Joiner} (2)
          \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2)
          \p{Line_Break: XX}      \p{Line_Break=Unknown} (903_847 plus all
                                    above-Unicode code points)
          \p{Line_Break: ZW}      \p{Line_Break=ZWSpace} (1)
          \p{Line_Break: ZWJ}     (Short: \p{Lb=ZWJ}) (1)
          \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1)
          \p{Line_Separator}      \p{General_Category=Line_Separator}
                                    (Short: \p{Zl}) (1)
          \p{Linear_A}            \p{Script_Extensions=Linear_A} (Short:
                                    \p{Lina}; NOT \p{Block=Linear_A}) (386)
          \p{Linear_B}            \p{Script_Extensions=Linear_B} (Short:
                                    \p{Linb}) (268)
        X \p{Linear_B_Ideograms}  \p{Block=Linear_B_Ideograms} (128)
        X \p{Linear_B_Syllabary}  \p{Block=Linear_B_Syllabary} (128)
          \p{Lisu}                \p{Script_Extensions=Lisu} (48)
          \p{Ll}                  \p{Lowercase_Letter} (=
                                    \p{General_Category=Lowercase_Letter})
                                    (/i= General_Category=Cased_Letter)
                                    (2063)
          \p{Lm}                  \p{Modifier_Letter} (=
                                    \p{General_Category=Modifier_Letter})
                                    (249)
          \p{Lo}                  \p{Other_Letter} (= \p{General_Category=
                                    Other_Letter}) (112_721)
          \p{LOE}                 \p{Logical_Order_Exception} (=
                                    \p{Logical_Order_Exception=Y}) (19)
          \p{LOE: *}              \p{Logical_Order_Exception: *}
          \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
                                    \p{LOE}) (19)
          \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
                                    (1_114_093 plus all above-Unicode code
                                    points)
          \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19)
        X \p{Low_Surrogates}      \p{Block=Low_Surrogates} (1024)
          \p{Lower}               \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
                                    Cased=Yes) (2252)
          \p{Lower: *}            \p{Lowercase: *}
          \p{Lowercase}           \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
                                    Cased=Yes) (2252)
          \p{Lowercase: N*}       (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
                                    No) (1_111_860 plus all above-Unicode
                                    code points)
          \p{Lowercase: Y*}       (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
                                    Yes) (2252)
          \p{Lowercase_Letter}    \p{General_Category=Lowercase_Letter}
                                    (Short: \p{Ll}; /i= General_Category=
                                    Cased_Letter) (2063)
          \p{Lt}                  \p{Titlecase_Letter} (=
                                    \p{General_Category=Titlecase_Letter})
                                    (/i= General_Category=Cased_Letter) (31)
          \p{Lu}                  \p{Uppercase_Letter} (=
                                    \p{General_Category=Uppercase_Letter})
                                    (/i= General_Category=Cased_Letter)
                                    (1702)
          \p{Lyci}                \p{Lycian} (= \p{Script_Extensions=
                                    Lycian}) (NOT \p{Block=Lycian}) (29)
          \p{Lycian}              \p{Script_Extensions=Lycian} (Short:
                                    \p{Lyci}; NOT \p{Block=Lycian}) (29)
          \p{Lydi}                \p{Lydian} (= \p{Script_Extensions=
                                    Lydian}) (NOT \p{Block=Lydian}) (27)
          \p{Lydian}              \p{Script_Extensions=Lydian} (Short:
                                    \p{Lydi}; NOT \p{Block=Lydian}) (27)
          \p{M} \pM               \p{Mark} (= \p{General_Category=Mark})
                                    (2097)
          \p{Mahajani}            \p{Script_Extensions=Mahajani} (Short:
                                    \p{Mahj}; NOT \p{Block=Mahajani}) (61)
          \p{Mahj}                \p{Mahajani} (= \p{Script_Extensions=
                                    Mahajani}) (NOT \p{Block=Mahajani}) (61)
        X \p{Mahjong}             \p{Mahjong_Tiles} (= \p{Block=
                                    Mahjong_Tiles}) (48)
        X \p{Mahjong_Tiles}       \p{Block=Mahjong_Tiles} (Short:
                                    \p{InMahjong}) (48)
          \p{Malayalam}           \p{Script_Extensions=Malayalam} (Short:
                                    \p{Mlym}; NOT \p{Block=Malayalam}) (119)
          \p{Mand}                \p{Mandaic} (= \p{Script_Extensions=
                                    Mandaic}) (NOT \p{Block=Mandaic}) (30)
          \p{Mandaic}             \p{Script_Extensions=Mandaic} (Short:
                                    \p{Mand}; NOT \p{Block=Mandaic}) (30)
          \p{Mani}                \p{Manichaean} (= \p{Script_Extensions=
                                    Manichaean}) (NOT \p{Block=Manichaean})
                                    (52)
          \p{Manichaean}          \p{Script_Extensions=Manichaean} (Short:
                                    \p{Mani}; NOT \p{Block=Manichaean}) (52)
          \p{Marc}                \p{Marchen} (= \p{Script_Extensions=
                                    Marchen}) (NOT \p{Block=Marchen}) (68)
          \p{Marchen}             \p{Script_Extensions=Marchen} (Short:
                                    \p{Marc}; NOT \p{Block=Marchen}) (68)
          \p{Mark}                \p{General_Category=Mark} (Short: \p{M})
                                    (2097)
          \p{Math}                \p{Math=Y} (2310)
          \p{Math: N*}            (Single: \P{Math}) (1_111_802 plus all
                                    above-Unicode code points)
          \p{Math: Y*}            (Single: \p{Math}) (2310)
        X \p{Math_Alphanum}       \p{Mathematical_Alphanumeric_Symbols} (=
                                    \p{Block=
                                    Mathematical_Alphanumeric_Symbols})
                                    (1024)
        X \p{Math_Operators}      \p{Mathematical_Operators} (= \p{Block=
                                    Mathematical_Operators}) (256)
          \p{Math_Symbol}         \p{General_Category=Math_Symbol} (Short:
                                    \p{Sm}) (948)
        X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
                                    Mathematical_Alphanumeric_Symbols}
                                    (Short: \p{InMathAlphanum}) (1024)
        X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
                                    (Short: \p{InMathOperators}) (256)
          \p{Mc}                  \p{Spacing_Mark} (= \p{General_Category=
                                    Spacing_Mark}) (394)
          \p{Me}                  \p{Enclosing_Mark} (= \p{General_Category=
                                    Enclosing_Mark}) (13)
          \p{Meetei_Mayek}        \p{Script_Extensions=Meetei_Mayek} (Short:
                                    \p{Mtei}; NOT \p{Block=Meetei_Mayek})
                                    (79)
        X \p{Meetei_Mayek_Ext}    \p{Meetei_Mayek_Extensions} (= \p{Block=
                                    Meetei_Mayek_Extensions}) (32)
        X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
                                    (Short: \p{InMeeteiMayekExt}) (32)
          \p{Mend}                \p{Mende_Kikakui} (= \p{Script_Extensions=
                                    Mende_Kikakui}) (NOT \p{Block=
                                    Mende_Kikakui}) (213)
          \p{Mende_Kikakui}       \p{Script_Extensions=Mende_Kikakui}
                                    (Short: \p{Mend}; NOT \p{Block=
                                    Mende_Kikakui}) (213)
          \p{Merc}                \p{Meroitic_Cursive} (=
                                    \p{Script_Extensions=Meroitic_Cursive})
                                    (NOT \p{Block=Meroitic_Cursive}) (90)
          \p{Mero}                \p{Meroitic_Hieroglyphs} (=
                                    \p{Script_Extensions=
                                    Meroitic_Hieroglyphs}) (32)
          \p{Meroitic_Cursive}    \p{Script_Extensions=Meroitic_Cursive}
                                    (Short: \p{Merc}; NOT \p{Block=
                                    Meroitic_Cursive}) (90)
          \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
                                    Meroitic_Hieroglyphs} (Short: \p{Mero})
                                    (32)
          \p{Miao}                \p{Script_Extensions=Miao} (NOT \p{Block=
                                    Miao}) (133)
        X \p{Misc_Arrows}         \p{Miscellaneous_Symbols_And_Arrows} (=
                                    \p{Block=
                                    Miscellaneous_Symbols_And_Arrows}) (256)
        X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
                                    (= \p{Block=
                                    Miscellaneous_Mathematical_Symbols_A})
                                    (48)
        X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
                                    (= \p{Block=
                                    Miscellaneous_Mathematical_Symbols_B})
                                    (128)
        X \p{Misc_Pictographs}    \p{Miscellaneous_Symbols_And_Pictographs}
                                    (= \p{Block=
                                    Miscellaneous_Symbols_And_Pictographs})
                                    (768)
        X \p{Misc_Symbols}        \p{Miscellaneous_Symbols} (= \p{Block=
                                    Miscellaneous_Symbols}) (256)
        X \p{Misc_Technical}      \p{Miscellaneous_Technical} (= \p{Block=
                                    Miscellaneous_Technical}) (256)
        X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
                                    Miscellaneous_Mathematical_Symbols_A}
                                    (Short: \p{InMiscMathSymbolsA}) (48)
        X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
                                    Miscellaneous_Mathematical_Symbols_B}
                                    (Short: \p{InMiscMathSymbolsB}) (128)
        X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
                                    \p{InMiscSymbols}) (256)
        X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
                                    Miscellaneous_Symbols_And_Arrows}
                                    (Short: \p{InMiscArrows}) (256)
        X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
                                    Miscellaneous_Symbols_And_Pictographs}
                                    (Short: \p{InMiscPictographs}) (768)
        X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
                                    (Short: \p{InMiscTechnical}) (256)
          \p{Mlym}                \p{Malayalam} (= \p{Script_Extensions=
2859                                    Malayalam}) (NOT \p{Block=Malayalam})
2860                                    (119)
2861          \p{Mn}                  \p{Nonspacing_Mark} (=
2862                                    \p{General_Category=Nonspacing_Mark})
2863                                    (1690)
2864          \p{Modi}                \p{Script_Extensions=Modi} (NOT \p{Block=
2865                                    Modi}) (89)
2866          \p{Modifier_Letter}     \p{General_Category=Modifier_Letter}
2867                                    (Short: \p{Lm}) (249)
2868        X \p{Modifier_Letters}    \p{Spacing_Modifier_Letters} (= \p{Block=
2869                                    Spacing_Modifier_Letters}) (80)
2870          \p{Modifier_Symbol}     \p{General_Category=Modifier_Symbol}
2871                                    (Short: \p{Sk}) (121)
2872        X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
2873          \p{Mong}                \p{Mongolian} (= \p{Script_Extensions=
2874                                    Mongolian}) (NOT \p{Block=Mongolian})
2875                                    (169)
2876          \p{Mongolian}           \p{Script_Extensions=Mongolian} (Short:
2877                                    \p{Mong}; NOT \p{Block=Mongolian}) (169)
2878        X \p{Mongolian_Sup}       \p{Mongolian_Supplement} (= \p{Block=
2879                                    Mongolian_Supplement}) (32)
2880        X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
2881                                    \p{InMongolianSup}) (32)
2882          \p{Mro}                 \p{Script_Extensions=Mro} (NOT \p{Block=
2883                                    Mro}) (43)
2884          \p{Mroo}                \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
2885                                    \p{Block=Mro}) (43)
2886          \p{Mtei}                \p{Meetei_Mayek} (= \p{Script_Extensions=
2887                                    Meetei_Mayek}) (NOT \p{Block=
2888                                    Meetei_Mayek}) (79)
2889          \p{Mult}                \p{Multani} (= \p{Script_Extensions=
2890                                    Multani}) (NOT \p{Block=Multani}) (48)
2891          \p{Multani}             \p{Script_Extensions=Multani} (Short:
2892                                    \p{Mult}; NOT \p{Block=Multani}) (48)
2893        X \p{Music}               \p{Musical_Symbols} (= \p{Block=
2894                                    Musical_Symbols}) (256)
2895        X \p{Musical_Symbols}     \p{Block=Musical_Symbols} (Short:
2896                                    \p{InMusic}) (256)
2897          \p{Myanmar}             \p{Script_Extensions=Myanmar} (Short:
2898                                    \p{Mymr}; NOT \p{Block=Myanmar}) (224)
2899        X \p{Myanmar_Ext_A}       \p{Myanmar_Extended_A} (= \p{Block=
2900                                    Myanmar_Extended_A}) (32)
2901        X \p{Myanmar_Ext_B}       \p{Myanmar_Extended_B} (= \p{Block=
2902                                    Myanmar_Extended_B}) (32)
2903        X \p{Myanmar_Extended_A}  \p{Block=Myanmar_Extended_A} (Short:
2904                                    \p{InMyanmarExtA}) (32)
2905        X \p{Myanmar_Extended_B}  \p{Block=Myanmar_Extended_B} (Short:
2906                                    \p{InMyanmarExtB}) (32)
2907          \p{Mymr}                \p{Myanmar} (= \p{Script_Extensions=
2908                                    Myanmar}) (NOT \p{Block=Myanmar}) (224)
2909          \p{N} \pN               \p{Number} (= \p{General_Category=Number})
2910                                    (1492)
2911          \p{Nabataean}           \p{Script_Extensions=Nabataean} (Short:
2912                                    \p{Nbat}; NOT \p{Block=Nabataean}) (40)
2913          \p{Narb}                \p{Old_North_Arabian} (=
2914                                    \p{Script_Extensions=Old_North_Arabian})
2915                                    (32)
2916        X \p{NB}                  \p{No_Block} (= \p{Block=No_Block})
2917                                    (842_320 plus all above-Unicode code
2918                                    points)
2919          \p{Nbat}                \p{Nabataean} (= \p{Script_Extensions=
2920                                    Nabataean}) (NOT \p{Block=Nabataean})
2921                                    (40)
2922          \p{NChar}               \p{Noncharacter_Code_Point} (=
2923                                    \p{Noncharacter_Code_Point=Y}) (66)
2924          \p{NChar: *}            \p{Noncharacter_Code_Point: *}
2925          \p{Nd}                  \p{XPosixDigit} (= \p{General_Category=
2926                                    Decimal_Number}) (580)
2927          \p{New_Tai_Lue}         \p{Script_Extensions=New_Tai_Lue} (Short:
2928                                    \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
2929          \p{Newa}                \p{Script_Extensions=Newa} (NOT \p{Block=
2930                                    Newa}) (92)
2931          \p{NFC_QC: *}           \p{NFC_Quick_Check: *}
2932          \p{NFC_Quick_Check: M}  \p{NFC_Quick_Check=Maybe} (110)
2933          \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110)
2934          \p{NFC_Quick_Check: N}  \p{NFC_Quick_Check=No} (NOT
2935                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
2936                                    (1120)
2937          \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
2938                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
2939                                    (1120)
2940          \p{NFC_Quick_Check: Y}  \p{NFC_Quick_Check=Yes} (NOT
2941                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
2942                                    (1_112_882 plus all above-Unicode code
2943                                    points)
2944          \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
2945                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
2946                                    (1_112_882 plus all above-Unicode code
2947                                    points)
2948          \p{NFD_QC: *}           \p{NFD_Quick_Check: *}
2949          \p{NFD_Quick_Check: N}  \p{NFD_Quick_Check=No} (NOT
2950                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
2951                                    (13_232)
2952          \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
2953                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
2954                                    (13_232)
2955          \p{NFD_Quick_Check: Y}  \p{NFD_Quick_Check=Yes} (NOT
2956                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
2957                                    (1_100_880 plus all above-Unicode code
2958                                    points)
2959          \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
2960                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
2961                                    (1_100_880 plus all above-Unicode code
2962                                    points)
2963          \p{NFKC_QC: *}          \p{NFKC_Quick_Check: *}
2964          \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
2965          \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110)
2966          \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
2967                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
2968                                    (4794)
2969          \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
2970                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
2971                                    (4794)
2972          \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
2973                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
2974                                    (1_109_208 plus all above-Unicode code
2975                                    points)
2976          \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
2977                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
2978                                    (1_109_208 plus all above-Unicode code
2979                                    points)
2980          \p{NFKD_QC: *}          \p{NFKD_Quick_Check: *}
2981          \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
2982                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
2983                                    (16_894)
2984          \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
2985                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
2986                                    (16_894)
2987          \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
2988                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
2989                                    (1_097_218 plus all above-Unicode code
2990                                    points)
2991          \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
2992                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
2993                                    (1_097_218 plus all above-Unicode code
2994                                    points)
2995          \p{Nko}                 \p{Script_Extensions=Nko} (NOT \p{NKo})
2996                                    (59)
2997          \p{Nkoo}                \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
2998                                    \p{NKo}) (59)
2999          \p{Nl}                  \p{Letter_Number} (= \p{General_Category=
3000                                    Letter_Number}) (236)
3001          \p{No}                  \p{Other_Number} (= \p{General_Category=
3002                                    Other_Number}) (676)
3003        X \p{No_Block}            \p{Block=No_Block} (Short: \p{InNB})
3004                                    (842_320 plus all above-Unicode code
3005                                    points)
3006          \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
3007                                    \p{NChar}) (66)
3008          \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
3009                                    (1_114_046 plus all above-Unicode code
3010                                    points)
3011          \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
3012                                    (66)
3013          \p{Nonspacing_Mark}     \p{General_Category=Nonspacing_Mark}
3014                                    (Short: \p{Mn}) (1690)
3015          \p{Nt: *}               \p{Numeric_Type: *}
3016          \p{Number}              \p{General_Category=Number} (Short: \p{N})
3017                                    (1492)
3018        X \p{Number_Forms}        \p{Block=Number_Forms} (64)
3019          \p{Numeric_Type: De}    \p{Numeric_Type=Decimal} (580)
3020          \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (580)
3021          \p{Numeric_Type: Di}    \p{Numeric_Type=Digit} (128)
3022          \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128)
3023          \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_539 plus all
3024                                    above-Unicode code points)
3025          \p{Numeric_Type: Nu}    \p{Numeric_Type=Numeric} (865)
3026          \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (865)
3027        T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1)
3028        T \p{Numeric_Value: 0}    (Short: \p{Nv=0}) (74)
3029        T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (1)
3030        T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (1)
3031        T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (1)
3032        T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (1)
3033        T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (4)
3034        T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1)
3035        T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (2)
3036        T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1)
3037        T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (6)
3038        T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1)
3039        T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (1)
3040        T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (3)
3041        T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (4)
3042        T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (2)
3043        T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (12)
3044        T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (6)
3045        T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1)
3046        T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1)
3047        T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1)
3048        T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (13)
3049        T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1)
3050        T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1)
3051        T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1)
3052        T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (7)
3053        T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (7)
3054        T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1)
3055        T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (3)
3056        T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1)
3057        T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1)
3058        T \p{Numeric_Value: 1}    (Short: \p{Nv=1}) (121)
3059        T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1)
3060        T \p{Numeric_Value: 2}    (Short: \p{Nv=2}) (121)
3061        T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1)
3062        T \p{Numeric_Value: 3}    (Short: \p{Nv=3}) (123)
3063        T \p{Numeric_Value: 7/2}  (Short: \p{Nv=7/2}) (1)
3064        T \p{Numeric_Value: 4}    (Short: \p{Nv=4}) (115)
3065        T \p{Numeric_Value: 9/2}  (Short: \p{Nv=9/2}) (1)
3066        T \p{Numeric_Value: 5}    (Short: \p{Nv=5}) (113)
3067        T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
3068        T \p{Numeric_Value: 6}    (Short: \p{Nv=6}) (100)
3069        T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
3070        T \p{Numeric_Value: 7}    (Short: \p{Nv=7}) (99)
3071        T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
3072        T \p{Numeric_Value: 8}    (Short: \p{Nv=8}) (95)
3073        T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
3074        T \p{Numeric_Value: 9}    (Short: \p{Nv=9}) (99)
3075        T \p{Numeric_Value: 10}   (Short: \p{Nv=10}) (54)
3076        T \p{Numeric_Value: 11}   (Short: \p{Nv=11}) (6)
3077        T \p{Numeric_Value: 12}   (Short: \p{Nv=12}) (6)
3078        T \p{Numeric_Value: 13}   (Short: \p{Nv=13}) (4)
3079        T \p{Numeric_Value: 14}   (Short: \p{Nv=14}) (4)
3080        T \p{Numeric_Value: 15}   (Short: \p{Nv=15}) (4)
3081        T \p{Numeric_Value: 16}   (Short: \p{Nv=16}) (5)
3082        T \p{Numeric_Value: 17}   (Short: \p{Nv=17}) (5)
3083        T \p{Numeric_Value: 18}   (Short: \p{Nv=18}) (5)
3084        T \p{Numeric_Value: 19}   (Short: \p{Nv=19}) (5)
3085        T \p{Numeric_Value: 20}   (Short: \p{Nv=20}) (31)
3086        T \p{Numeric_Value: 21}   (Short: \p{Nv=21}) (1)
3087        T \p{Numeric_Value: 22}   (Short: \p{Nv=22}) (1)
3088        T \p{Numeric_Value: 23}   (Short: \p{Nv=23}) (1)
3089        T \p{Numeric_Value: 24}   (Short: \p{Nv=24}) (1)
3090        T \p{Numeric_Value: 25}   (Short: \p{Nv=25}) (1)
3091        T \p{Numeric_Value: 26}   (Short: \p{Nv=26}) (1)
3092        T \p{Numeric_Value: 27}   (Short: \p{Nv=27}) (1)
3093        T \p{Numeric_Value: 28}   (Short: \p{Nv=28}) (1)
3094        T \p{Numeric_Value: 29}   (Short: \p{Nv=29}) (1)
3095        T \p{Numeric_Value: 30}   (Short: \p{Nv=30}) (16)
3096        T \p{Numeric_Value: 31}   (Short: \p{Nv=31}) (1)
3097        T \p{Numeric_Value: 32}   (Short: \p{Nv=32}) (1)
3098        T \p{Numeric_Value: 33}   (Short: \p{Nv=33}) (1)
3099        T \p{Numeric_Value: 34}   (Short: \p{Nv=34}) (1)
3100        T \p{Numeric_Value: 35}   (Short: \p{Nv=35}) (1)
3101        T \p{Numeric_Value: 36}   (Short: \p{Nv=36}) (1)
3102        T \p{Numeric_Value: 37}   (Short: \p{Nv=37}) (1)
3103        T \p{Numeric_Value: 38}   (Short: \p{Nv=38}) (1)
3104        T \p{Numeric_Value: 39}   (Short: \p{Nv=39}) (1)
3105        T \p{Numeric_Value: 40}   (Short: \p{Nv=40}) (16)
3106        T \p{Numeric_Value: 41}   (Short: \p{Nv=41}) (1)
3107        T \p{Numeric_Value: 42}   (Short: \p{Nv=42}) (1)
3108        T \p{Numeric_Value: 43}   (Short: \p{Nv=43}) (1)
3109        T \p{Numeric_Value: 44}   (Short: \p{Nv=44}) (1)
3110        T \p{Numeric_Value: 45}   (Short: \p{Nv=45}) (1)
3111        T \p{Numeric_Value: 46}   (Short: \p{Nv=46}) (1)
3112        T \p{Numeric_Value: 47}   (Short: \p{Nv=47}) (1)
3113        T \p{Numeric_Value: 48}   (Short: \p{Nv=48}) (1)
3114        T \p{Numeric_Value: 49}   (Short: \p{Nv=49}) (1)
3115        T \p{Numeric_Value: 50}   (Short: \p{Nv=50}) (27)
3116        T \p{Numeric_Value: 60}   (Short: \p{Nv=60}) (11)
3117        T \p{Numeric_Value: 70}   (Short: \p{Nv=70}) (11)
3118        T \p{Numeric_Value: 80}   (Short: \p{Nv=80}) (10)
3119        T \p{Numeric_Value: 90}   (Short: \p{Nv=90}) (10)
3120        T \p{Numeric_Value: 100}  (Short: \p{Nv=100}) (30)
3121        T \p{Numeric_Value: 200}  (Short: \p{Nv=200}) (4)
3122        T \p{Numeric_Value: 300}  (Short: \p{Nv=300}) (5)
3123        T \p{Numeric_Value: 400}  (Short: \p{Nv=400}) (4)
3124        T \p{Numeric_Value: 500}  (Short: \p{Nv=500}) (14)
3125        T \p{Numeric_Value: 600}  (Short: \p{Nv=600}) (4)
3126        T \p{Numeric_Value: 700}  (Short: \p{Nv=700}) (4)
3127        T \p{Numeric_Value: 800}  (Short: \p{Nv=800}) (4)
3128        T \p{Numeric_Value: 900}  (Short: \p{Nv=900}) (5)
3129        T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (20)
3130        T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (2)
3131        T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (2)
3132        T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (2)
3133        T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (6)
3134        T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (2)
3135        T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (2)
3136        T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (2)
3137        T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (2)
3138        T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (9)
3139        T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (2)
3140        T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (2)
3141        T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (2)
3142        T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (5)
3143        T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (2)
3144        T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (2)
3145        T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (2)
3146        T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (2)
3147        T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (2)
3148        T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (1)
3149        T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1)
3150        T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1)
3151        T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1)
3152        T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1)
3153        T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1)
3154        T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1)
3155        T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1)
3156        T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1)
3157        T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1)
3158        T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1)
3159        T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
3160                                    (3)
3161        T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
3162                                    10000000000}) (1)
3163        T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
3164                                    1000000000000}) (2)
3165          \p{Numeric_Value: NaN}  (Short: \p{Nv=NaN}) (1_112_539 plus all
3166                                    above-Unicode code points)
3167          \p{Nv: *}               \p{Numeric_Value: *}
3168        X \p{OCR}                 \p{Optical_Character_Recognition} (=
3169                                    \p{Block=Optical_Character_Recognition})
3170                                    (32)
3171          \p{Ogam}                \p{Ogham} (= \p{Script_Extensions=Ogham})
3172                                    (NOT \p{Block=Ogham}) (29)
3173          \p{Ogham}               \p{Script_Extensions=Ogham} (Short:
3174                                    \p{Ogam}; NOT \p{Block=Ogham}) (29)
3175          \p{Ol_Chiki}            \p{Script_Extensions=Ol_Chiki} (Short:
3176                                    \p{Olck}) (48)
3177          \p{Olck}                \p{Ol_Chiki} (= \p{Script_Extensions=
3178                                    Ol_Chiki}) (48)
3179          \p{Old_Hungarian}       \p{Script_Extensions=Old_Hungarian}
3180                                    (Short: \p{Hung}; NOT \p{Block=
3181                                    Old_Hungarian}) (108)
3182          \p{Old_Italic}          \p{Script_Extensions=Old_Italic} (Short:
3183                                    \p{Ital}; NOT \p{Block=Old_Italic}) (36)
3184          \p{Old_North_Arabian}   \p{Script_Extensions=Old_North_Arabian}
3185                                    (Short: \p{Narb}) (32)
3186          \p{Old_Permic}          \p{Script_Extensions=Old_Permic} (Short:
3187                                    \p{Perm}; NOT \p{Block=Old_Permic}) (44)
3188          \p{Old_Persian}         \p{Script_Extensions=Old_Persian} (Short:
3189                                    \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
3190          \p{Old_South_Arabian}   \p{Script_Extensions=Old_South_Arabian}
3191                                    (Short: \p{Sarb}) (32)
3192          \p{Old_Turkic}          \p{Script_Extensions=Old_Turkic} (Short:
3193                                    \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
3194          \p{Open_Punctuation}    \p{General_Category=Open_Punctuation}
3195                                    (Short: \p{Ps}) (75)
3196        X \p{Optical_Character_Recognition} \p{Block=
3197                                    Optical_Character_Recognition} (Short:
3198                                    \p{InOCR}) (32)
3199          \p{Oriya}               \p{Script_Extensions=Oriya} (Short:
3200                                    \p{Orya}; NOT \p{Block=Oriya}) (94)
3201          \p{Orkh}                \p{Old_Turkic} (= \p{Script_Extensions=
3202                                    Old_Turkic}) (NOT \p{Block=Old_Turkic})
3203                                    (73)
3204        X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
3205          \p{Orya}                \p{Oriya} (= \p{Script_Extensions=Oriya})
3206                                    (NOT \p{Block=Oriya}) (94)
3207          \p{Osage}               \p{Script_Extensions=Osage} (Short:
3208                                    \p{Osge}; NOT \p{Block=Osage}) (72)
3209          \p{Osge}                \p{Osage} (= \p{Script_Extensions=Osage})
3210                                    (NOT \p{Block=Osage}) (72)
3211          \p{Osma}                \p{Osmanya} (= \p{Script_Extensions=
3212                                    Osmanya}) (NOT \p{Block=Osmanya}) (40)
3213          \p{Osmanya}             \p{Script_Extensions=Osmanya} (Short:
3214                                    \p{Osma}; NOT \p{Block=Osmanya}) (40)
3215          \p{Other}               \p{General_Category=Other} (Short: \p{C})
3216                                    (986_091 plus all above-Unicode code
3217                                    points)
3218          \p{Other_Letter}        \p{General_Category=Other_Letter} (Short:
3219                                    \p{Lo}) (112_721)
3220          \p{Other_Number}        \p{General_Category=Other_Number} (Short:
3221                                    \p{No}) (676)
3222          \p{Other_Punctuation}   \p{General_Category=Other_Punctuation}
3223                                    (Short: \p{Po}) (544)
3224          \p{Other_Symbol}        \p{General_Category=Other_Symbol} (Short:
3225                                    \p{So}) (5777)
3226          \p{P} \pP               \p{Punct} (= \p{General_Category=
3227                                    Punctuation}) (NOT
3228                                    \p{General_Punctuation}) (748)
3229          \p{Pahawh_Hmong}        \p{Script_Extensions=Pahawh_Hmong} (Short:
3230                                    \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
3231                                    (127)
3232          \p{Palm}                \p{Palmyrene} (= \p{Script_Extensions=
3233                                    Palmyrene}) (32)
3234          \p{Palmyrene}           \p{Script_Extensions=Palmyrene} (Short:
3235                                    \p{Palm}) (32)
3236          \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
3237                                    (Short: \p{Zp}) (1)
3238          \p{Pat_Syn}             \p{Pattern_Syntax} (= \p{Pattern_Syntax=
3239                                    Y}) (2760)
3240          \p{Pat_Syn: *}          \p{Pattern_Syntax: *}
3241          \p{Pat_WS}              \p{Pattern_White_Space} (=
3242                                    \p{Pattern_White_Space=Y}) (11)
3243          \p{Pat_WS: *}           \p{Pattern_White_Space: *}
3244          \p{Pattern_Syntax}      \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
3245                                    (2760)
3246          \p{Pattern_Syntax: N*}  (Short: \p{PatSyn=N}, \P{PatSyn})
3247                                    (1_111_352 plus all above-Unicode code
3248                                    points)
3249          \p{Pattern_Syntax: Y*}  (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
3250          \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
3251                                    \p{PatWS}) (11)
3252          \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
3253                                    (1_114_101 plus all above-Unicode code
3254                                    points)
3255          \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
3256          \p{Pau_Cin_Hau}         \p{Script_Extensions=Pau_Cin_Hau} (Short:
3257                                    \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
3258          \p{Pauc}                \p{Pau_Cin_Hau} (= \p{Script_Extensions=
3259                                    Pau_Cin_Hau}) (NOT \p{Block=
3260                                    Pau_Cin_Hau}) (57)
3261          \p{Pc}                  \p{Connector_Punctuation} (=
3262                                    \p{General_Category=
3263                                    Connector_Punctuation}) (10)
3264          \p{PCM}                 \p{Prepended_Concatenation_Mark} (=
3265                                    \p{Prepended_Concatenation_Mark=Y}) (10)
3266          \p{PCM: *}              \p{Prepended_Concatenation_Mark: *}
3267          \p{Pd}                  \p{Dash_Punctuation} (=
3268                                    \p{General_Category=Dash_Punctuation})
3269                                    (24)
3270          \p{Pe}                  \p{Close_Punctuation} (=
3271                                    \p{General_Category=Close_Punctuation})
3272                                    (73)
3273          \p{PerlSpace}           \p{PosixSpace} (6)
3274          \p{PerlWord}            \p{PosixWord} (63)
3275          \p{Perm}                \p{Old_Permic} (= \p{Script_Extensions=
3276                                    Old_Permic}) (NOT \p{Block=Old_Permic})
3277                                    (44)
3278          \p{Pf}                  \p{Final_Punctuation} (=
3279                                    \p{General_Category=Final_Punctuation})
3280                                    (10)
3281          \p{Phag}                \p{Phags_Pa} (= \p{Script_Extensions=
3282                                    Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
3283          \p{Phags_Pa}            \p{Script_Extensions=Phags_Pa} (Short:
3284                                    \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
3285        X \p{Phaistos}            \p{Phaistos_Disc} (= \p{Block=
3286                                    Phaistos_Disc}) (48)
3287        X \p{Phaistos_Disc}       \p{Block=Phaistos_Disc} (Short:
3288                                    \p{InPhaistos}) (48)
3289          \p{Phli}                \p{Inscriptional_Pahlavi} (=
3290                                    \p{Script_Extensions=
3291                                    Inscriptional_Pahlavi}) (NOT \p{Block=
3292                                    Inscriptional_Pahlavi}) (27)
3293          \p{Phlp}                \p{Psalter_Pahlavi} (=
3294                                    \p{Script_Extensions=Psalter_Pahlavi})
3295                                    (NOT \p{Block=Psalter_Pahlavi}) (30)
3296          \p{Phnx}                \p{Phoenician} (= \p{Script_Extensions=
3297                                    Phoenician}) (NOT \p{Block=Phoenician})
3298                                    (29)
3299          \p{Phoenician}          \p{Script_Extensions=Phoenician} (Short:
3300                                    \p{Phnx}; NOT \p{Block=Phoenician}) (29)
3301        X \p{Phonetic_Ext}        \p{Phonetic_Extensions} (= \p{Block=
3302                                    Phonetic_Extensions}) (128)
3303        X \p{Phonetic_Ext_Sup}    \p{Phonetic_Extensions_Supplement} (=
3304                                    \p{Block=
3305                                    Phonetic_Extensions_Supplement}) (64)
3306        X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
3307                                    \p{InPhoneticExt}) (128)
3308        X \p{Phonetic_Extensions_Supplement} \p{Block=
3309                                    Phonetic_Extensions_Supplement} (Short:
3310                                    \p{InPhoneticExtSup}) (64)
3311          \p{Pi}                  \p{Initial_Punctuation} (=
3312                                    \p{General_Category=
3313                                    Initial_Punctuation}) (12)
3314        X \p{Playing_Cards}       \p{Block=Playing_Cards} (96)
3315          \p{Plrd}                \p{Miao} (= \p{Script_Extensions=Miao})
3316                                    (NOT \p{Block=Miao}) (133)
3317          \p{Po}                  \p{Other_Punctuation} (=
3318                                    \p{General_Category=Other_Punctuation})
3319                                    (544)
3320          \p{PosixAlnum}          [A-Za-z0-9] (62)
3321          \p{PosixAlpha}          [A-Za-z] (52)
3322          \p{PosixBlank}          \t and ' ' (2)
3323          \p{PosixCntrl}          ASCII control characters: NUL, SOH, STX,
3324                                    ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT,
3325                                    FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4,
3326                                    NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS,
3327                                    GS, RS, US, and DEL (33)
3328          \p{PosixDigit}          [0-9] (10)
3329          \p{PosixGraph}          [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~0-9A-Za-z]
3330                                    (94)
3331          \p{PosixLower}          [a-z] (/i= PosixAlpha) (26)
3332          \p{PosixPrint}          [- 0-9A-Za-z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
3333                                    (95)
3334          \p{PosixPunct}          [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~] (32)
3335          \p{PosixSpace}          \t, \n, \cK (vertical tab), \f, \r, and
3336                                    ' ' (Short: \p{PerlSpace}) (6)
3337          \p{PosixUpper}          [A-Z] (/i= PosixAlpha) (26)
3338          \p{PosixWord}           \w, restricted to ASCII = [A-Za-z0-9_]
3339                                    (Short: \p{PerlWord}) (63)
3340          \p{PosixXDigit}         \p{ASCII_Hex_Digit=Y} [0-9A-Fa-f] (Short:
3341                                    \p{AHex}) (22)
3342          \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
3343                                    Y} (Short: \p{PCM}) (10)
3344          \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
3345                                    (1_114_102 plus all above-Unicode code
3346                                    points)
3347          \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
3348                                    (10)
3349        T \p{Present_In: 1.1}     \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
3350                                    extension) (33_979)
3351        T \p{Present_In: 2.0}     Code point's usage introduced in version
3352                                    2.0 or earlier (Short: \p{In=2.0}) (Perl
3353                                    extension) (178_500)
3354        T \p{Present_In: 2.1}     Code point's usage introduced in version
3355                                    2.1 or earlier (Short: \p{In=2.1}) (Perl
3356                                    extension) (178_502)
3357        T \p{Present_In: 3.0}     Code point's usage introduced in version
3358                                    3.0 or earlier (Short: \p{In=3.0}) (Perl
3359                                    extension) (188_809)
3360        T \p{Present_In: 3.1}     Code point's usage introduced in version
3361                                    3.1 or earlier (Short: \p{In=3.1}) (Perl
3362                                    extension) (233_787)
3363        T \p{Present_In: 3.2}     Code point's usage introduced in version
3364                                    3.2 or earlier (Short: \p{In=3.2}) (Perl
3365                                    extension) (234_803)
3366        T \p{Present_In: 4.0}     Code point's usage introduced in version
3367                                    4.0 or earlier (Short: \p{In=4.0}) (Perl
3368                                    extension) (236_029)
3369        T \p{Present_In: 4.1}     Code point's usage introduced in version
3370                                    4.1 or earlier (Short: \p{In=4.1}) (Perl
3371                                    extension) (237_302)
3372        T \p{Present_In: 5.0}     Code point's usage introduced in version
3373                                    5.0 or earlier (Short: \p{In=5.0}) (Perl
3374                                    extension) (238_671)
3375        T \p{Present_In: 5.1}     Code point's usage introduced in version
3376                                    5.1 or earlier (Short: \p{In=5.1}) (Perl
3377                                    extension) (240_295)
3378        T \p{Present_In: 5.2}     Code point's usage introduced in version
3379                                    5.2 or earlier (Short: \p{In=5.2}) (Perl
3380                                    extension) (246_943)
3381        T \p{Present_In: 6.0}     Code point's usage introduced in version
3382                                    6.0 or earlier (Short: \p{In=6.0}) (Perl
3383                                    extension) (249_031)
3384        T \p{Present_In: 6.1}     Code point's usage introduced in version
3385                                    6.1 or earlier (Short: \p{In=6.1}) (Perl
3386                                    extension) (249_763)
3387        T \p{Present_In: 6.2}     Code point's usage introduced in version
3388                                    6.2 or earlier (Short: \p{In=6.2}) (Perl
3389                                    extension) (249_764)
3390        T \p{Present_In: 6.3}     Code point's usage introduced in version
3391                                    6.3 or earlier (Short: \p{In=6.3}) (Perl
3392                                    extension) (249_769)
3393        T \p{Present_In: 7.0}     Code point's usage introduced in version
3394                                    7.0 or earlier (Short: \p{In=7.0}) (Perl
3395                                    extension) (252_603)
3396        T \p{Present_In: 8.0}     Code point's usage introduced in version
3397                                    8.0 or earlier (Short: \p{In=8.0}) (Perl
3398                                    extension) (260_319)
3399        T \p{Present_In: 9.0}     Code point's usage introduced in version
3400                                    9.0 or earlier (Short: \p{In=9.0}) (Perl
3401                                    extension) (267_819)
3402          \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
3403                                    Unassigned}) (Perl extension) (846_293
3404                                    plus all above-Unicode code points)
3405          \p{Print}               \p{XPosixPrint} (265_638)
3406          \p{Private_Use}         \p{General_Category=Private_Use} (Short:
3407                                    \p{Co}; NOT \p{Private_Use_Area})
3408                                    (137_468)
3409        X \p{Private_Use_Area}    \p{Block=Private_Use_Area} (Short:
3410                                    \p{InPUA}) (6400)
3411          \p{Prti}                \p{Inscriptional_Parthian} (=
3412                                    \p{Script_Extensions=
3413                                    Inscriptional_Parthian}) (NOT \p{Block=
3414                                    Inscriptional_Parthian}) (30)
3415          \p{Ps}                  \p{Open_Punctuation} (=
3416                                    \p{General_Category=Open_Punctuation})
3417                                    (75)
3418          \p{Psalter_Pahlavi}     \p{Script_Extensions=Psalter_Pahlavi}
3419                                    (Short: \p{Phlp}; NOT \p{Block=
3420                                    Psalter_Pahlavi}) (30)
3421        X \p{PUA}                 \p{Private_Use_Area} (= \p{Block=
3422                                    Private_Use_Area}) (6400)
3423          \p{Punct}               \p{General_Category=Punctuation} (Short:
3424                                    \p{P}; NOT \p{General_Punctuation}) (748)
3425          \p{Punctuation}         \p{Punct} (= \p{General_Category=
3426                                    Punctuation}) (NOT
3427                                    \p{General_Punctuation}) (748)
3428          \p{Qaac}                \p{Coptic} (= \p{Script_Extensions=
3429                                    Coptic}) (NOT \p{Block=Coptic}) (165)
3430          \p{Qaai}                \p{Inherited} (= \p{Script_Extensions=
3431                                    Inherited}) (496)
3432          \p{QMark}               \p{Quotation_Mark} (= \p{Quotation_Mark=
3433                                    Y}) (30)
3434          \p{QMark: *}            \p{Quotation_Mark: *}
3435          \p{Quotation_Mark}      \p{Quotation_Mark=Y} (Short: \p{QMark})
3436                                    (30)
3437          \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_082
3438                                    plus all above-Unicode code points)
3439          \p{Quotation_Mark: Y*}  (Short: \p{QMark=Y}, \p{QMark}) (30)
3440          \p{Radical}             \p{Radical=Y} (329)
3441          \p{Radical: N*}         (Single: \P{Radical}) (1_113_783 plus all
3442                                    above-Unicode code points)
3443          \p{Radical: Y*}         (Single: \p{Radical}) (329)
3444          \p{Rejang}              \p{Script_Extensions=Rejang} (Short:
3445                                    \p{Rjng}; NOT \p{Block=Rejang}) (37)
3446          \p{Rjng}                \p{Rejang} (= \p{Script_Extensions=
3447                                    Rejang}) (NOT \p{Block=Rejang}) (37)
3448        X \p{Rumi}                \p{Rumi_Numeral_Symbols} (= \p{Block=
3449                                    Rumi_Numeral_Symbols}) (32)
3450        X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
3451                                    \p{InRumi}) (32)
3452          \p{Runic}               \p{Script_Extensions=Runic} (Short:
3453                                    \p{Runr}; NOT \p{Block=Runic}) (86)
3454          \p{Runr}                \p{Runic} (= \p{Script_Extensions=Runic})
3455                                    (NOT \p{Block=Runic}) (86)
3456          \p{S} \pS               \p{Symbol} (= \p{General_Category=Symbol})
3457                                    (6899)
3458          \p{Samaritan}           \p{Script_Extensions=Samaritan} (Short:
3459                                    \p{Samr}; NOT \p{Block=Samaritan}) (61)
3460          \p{Samr}                \p{Samaritan} (= \p{Script_Extensions=
3461                                    Samaritan}) (NOT \p{Block=Samaritan})
3462                                    (61)
3463          \p{Sarb}                \p{Old_South_Arabian} (=
3464                                    \p{Script_Extensions=Old_South_Arabian})
3465                                    (32)
3466          \p{Saur}                \p{Saurashtra} (= \p{Script_Extensions=
3467                                    Saurashtra}) (NOT \p{Block=Saurashtra})
3468                                    (82)
3469          \p{Saurashtra}          \p{Script_Extensions=Saurashtra} (Short:
3470                                    \p{Saur}; NOT \p{Block=Saurashtra}) (82)
3471          \p{SB: *}               \p{Sentence_Break: *}
3472          \p{Sc}                  \p{Currency_Symbol} (=
3473                                    \p{General_Category=Currency_Symbol})
3474                                    (53)
3475          \p{Sc: *}               \p{Script: *}
3476          \p{Script: Adlam}       (Short: \p{Sc=Adlm}) (87)
3477          \p{Script: Adlm}        \p{Script=Adlam} (87)
3478          \p{Script: Aghb}        \p{Script=Caucasian_Albanian} (53)
3479          \p{Script: Ahom}        (Short: \p{Sc=Ahom}) (57)
3480          \p{Script: Anatolian_Hieroglyphs} (Short: \p{Sc=Hluw}) (583)
3481          \p{Script: Arab}        \p{Script=Arabic} (1279)
3482          \p{Script: Arabic}      (Short: \p{Sc=Arab}) (1279)
3483          \p{Script: Armenian}    (Short: \p{Sc=Armn}) (93)
3484          \p{Script: Armi}        \p{Script=Imperial_Aramaic} (31)
3485          \p{Script: Armn}        \p{Script=Armenian} (93)
3486          \p{Script: Avestan}     (Short: \p{Sc=Avst}) (61)
3487          \p{Script: Avst}        \p{Script=Avestan} (61)
3488          \p{Script: Bali}        \p{Script=Balinese} (121)
3489          \p{Script: Balinese}    (Short: \p{Sc=Bali}) (121)
3490          \p{Script: Bamu}        \p{Script=Bamum} (657)
3491          \p{Script: Bamum}       (Short: \p{Sc=Bamu}) (657)
3492          \p{Script: Bass}        \p{Script=Bassa_Vah} (36)
3493          \p{Script: Bassa_Vah}   (Short: \p{Sc=Bass}) (36)
3494          \p{Script: Batak}       (Short: \p{Sc=Batk}) (56)
3495          \p{Script: Batk}        \p{Script=Batak} (56)
3496          \p{Script: Beng}        \p{Script=Bengali} (93)
3497          \p{Script: Bengali}     (Short: \p{Sc=Beng}) (93)
3498          \p{Script: Bhaiksuki}   (Short: \p{Sc=Bhks}) (97)
3499          \p{Script: Bhks}        \p{Script=Bhaiksuki} (97)
3500          \p{Script: Bopo}        \p{Script=Bopomofo} (70)
3501          \p{Script: Bopomofo}    (Short: \p{Sc=Bopo}) (70)
3502          \p{Script: Brah}        \p{Script=Brahmi} (109)
3503          \p{Script: Brahmi}      (Short: \p{Sc=Brah}) (109)
3504          \p{Script: Brai}        \p{Script=Braille} (256)
3505          \p{Script: Braille}     (Short: \p{Sc=Brai}) (256)
3506          \p{Script: Bugi}        \p{Script=Buginese} (30)
3507          \p{Script: Buginese}    (Short: \p{Sc=Bugi}) (30)
3508          \p{Script: Buhd}        \p{Script=Buhid} (20)
3509          \p{Script: Buhid}       (Short: \p{Sc=Buhd}) (20)
3510          \p{Script: Cakm}        \p{Script=Chakma} (67)
3511          \p{Script: Canadian_Aboriginal} (Short: \p{Sc=Cans}) (710)
3512          \p{Script: Cans}        \p{Script=Canadian_Aboriginal} (710)
3513          \p{Script: Cari}        \p{Script=Carian} (49)
3514          \p{Script: Carian}      (Short: \p{Sc=Cari}) (49)
3515          \p{Script: Caucasian_Albanian} (Short: \p{Sc=Aghb}) (53)
3516          \p{Script: Chakma}      (Short: \p{Sc=Cakm}) (67)
3517          \p{Script: Cham}        (Short: \p{Sc=Cham}) (83)
3518          \p{Script: Cher}        \p{Script=Cherokee} (172)
3519          \p{Script: Cherokee}    (Short: \p{Sc=Cher}) (172)
3520          \p{Script: Common}      (Short: \p{Sc=Zyyy}) (7279)
3521          \p{Script: Copt}        \p{Script=Coptic} (137)
3522          \p{Script: Coptic}      (Short: \p{Sc=Copt}) (137)
3523          \p{Script: Cprt}        \p{Script=Cypriot} (55)
3524          \p{Script: Cuneiform}   (Short: \p{Sc=Xsux}) (1234)
3525          \p{Script: Cypriot}     (Short: \p{Sc=Cprt}) (55)
3526          \p{Script: Cyrillic}    (Short: \p{Sc=Cyrl}) (443)
3527          \p{Script: Cyrl}        \p{Script=Cyrillic} (443)
3528          \p{Script: Deseret}     (Short: \p{Sc=Dsrt}) (80)
3529          \p{Script: Deva}        \p{Script=Devanagari} (154)
3530          \p{Script: Devanagari}  (Short: \p{Sc=Deva}) (154)
3531          \p{Script: Dsrt}        \p{Script=Deseret} (80)
3532          \p{Script: Dupl}        \p{Script=Duployan} (143)
3533          \p{Script: Duployan}    (Short: \p{Sc=Dupl}) (143)
3534          \p{Script: Egyp}        \p{Script=Egyptian_Hieroglyphs} (1071)
3535          \p{Script: Egyptian_Hieroglyphs} (Short: \p{Sc=Egyp}) (1071)
3536          \p{Script: Elba}        \p{Script=Elbasan} (40)
3537          \p{Script: Elbasan}     (Short: \p{Sc=Elba}) (40)
3538          \p{Script: Ethi}        \p{Script=Ethiopic} (495)
3539          \p{Script: Ethiopic}    (Short: \p{Sc=Ethi}) (495)
3540          \p{Script: Geor}        \p{Script=Georgian} (127)
3541          \p{Script: Georgian}    (Short: \p{Sc=Geor}) (127)
3542          \p{Script: Glag}        \p{Script=Glagolitic} (132)
3543          \p{Script: Glagolitic}  (Short: \p{Sc=Glag}) (132)
3544          \p{Script: Goth}        \p{Script=Gothic} (27)
3545          \p{Script: Gothic}      (Short: \p{Sc=Goth}) (27)
3546          \p{Script: Gran}        \p{Script=Grantha} (85)
3547          \p{Script: Grantha}     (Short: \p{Sc=Gran}) (85)
3548          \p{Script: Greek}       (Short: \p{Sc=Grek}) (518)
3549          \p{Script: Grek}        \p{Script=Greek} (518)
3550          \p{Script: Gujarati}    (Short: \p{Sc=Gujr}) (85)
3551          \p{Script: Gujr}        \p{Script=Gujarati} (85)
3552          \p{Script: Gurmukhi}    (Short: \p{Sc=Guru}) (79)
3553          \p{Script: Guru}        \p{Script=Gurmukhi} (79)
3554          \p{Script: Han}         (Short: \p{Sc=Han}) (81_734)
3555          \p{Script: Hang}        \p{Script=Hangul} (11_739)
3556          \p{Script: Hangul}      (Short: \p{Sc=Hang}) (11_739)
3557          \p{Script: Hani}        \p{Script=Han} (81_734)
3558          \p{Script: Hano}        \p{Script=Hanunoo} (21)
3559          \p{Script: Hanunoo}     (Short: \p{Sc=Hano}) (21)
3560          \p{Script: Hatr}        \p{Script=Hatran} (26)
3561          \p{Script: Hatran}      (Short: \p{Sc=Hatr}) (26)
3562          \p{Script: Hebr}        \p{Script=Hebrew} (133)
3563          \p{Script: Hebrew}      (Short: \p{Sc=Hebr}) (133)
3564          \p{Script: Hira}        \p{Script=Hiragana} (91)
3565          \p{Script: Hiragana}    (Short: \p{Sc=Hira}) (91)
3566          \p{Script: Hluw}        \p{Script=Anatolian_Hieroglyphs} (583)
3567          \p{Script: Hmng}        \p{Script=Pahawh_Hmong} (127)
3568          \p{Script: Hung}        \p{Script=Old_Hungarian} (108)
3569          \p{Script: Imperial_Aramaic} (Short: \p{Sc=Armi}) (31)
3570          \p{Script: Inherited}   (Short: \p{Sc=Zinh}) (564)
3571          \p{Script: Inscriptional_Pahlavi} (Short: \p{Sc=Phli}) (27)
3572          \p{Script: Inscriptional_Parthian} (Short: \p{Sc=Prti}) (30)
3573          \p{Script: Ital}        \p{Script=Old_Italic} (36)
3574          \p{Script: Java}        \p{Script=Javanese} (90)
3575          \p{Script: Javanese}    (Short: \p{Sc=Java}) (90)
3576          \p{Script: Kaithi}      (Short: \p{Sc=Kthi}) (66)
3577          \p{Script: Kali}        \p{Script=Kayah_Li} (47)
3578          \p{Script: Kana}        \p{Script=Katakana} (300)
3579          \p{Script: Kannada}     (Short: \p{Sc=Knda}) (88)
3580          \p{Script: Katakana}    (Short: \p{Sc=Kana}) (300)
3581          \p{Script: Kayah_Li}    (Short: \p{Sc=Kali}) (47)
3582          \p{Script: Khar}        \p{Script=Kharoshthi} (65)
3583          \p{Script: Kharoshthi}  (Short: \p{Sc=Khar}) (65)
3584          \p{Script: Khmer}       (Short: \p{Sc=Khmr}) (146)
3585          \p{Script: Khmr}        \p{Script=Khmer} (146)
3586          \p{Script: Khoj}        \p{Script=Khojki} (62)
3587          \p{Script: Khojki}      (Short: \p{Sc=Khoj}) (62)
3588          \p{Script: Khudawadi}   (Short: \p{Sc=Sind}) (69)
3589          \p{Script: Knda}        \p{Script=Kannada} (88)
3590          \p{Script: Kthi}        \p{Script=Kaithi} (66)
3591          \p{Script: Lana}        \p{Script=Tai_Tham} (127)
3592          \p{Script: Lao}         (Short: \p{Sc=Lao}) (67)
3593          \p{Script: Laoo}        \p{Script=Lao} (67)
3594          \p{Script: Latin}       (Short: \p{Sc=Latn}) (1350)
3595          \p{Script: Latn}        \p{Script=Latin} (1350)
3596          \p{Script: Lepc}        \p{Script=Lepcha} (74)
3597          \p{Script: Lepcha}      (Short: \p{Sc=Lepc}) (74)
3598          \p{Script: Limb}        \p{Script=Limbu} (68)
3599          \p{Script: Limbu}       (Short: \p{Sc=Limb}) (68)
3600          \p{Script: Lina}        \p{Script=Linear_A} (341)
3601          \p{Script: Linb}        \p{Script=Linear_B} (211)
3602          \p{Script: Linear_A}    (Short: \p{Sc=Lina}) (341)
3603          \p{Script: Linear_B}    (Short: \p{Sc=Linb}) (211)
3604          \p{Script: Lisu}        (Short: \p{Sc=Lisu}) (48)
3605          \p{Script: Lyci}        \p{Script=Lycian} (29)
3606          \p{Script: Lycian}      (Short: \p{Sc=Lyci}) (29)
3607          \p{Script: Lydi}        \p{Script=Lydian} (27)
3608          \p{Script: Lydian}      (Short: \p{Sc=Lydi}) (27)
3609          \p{Script: Mahajani}    (Short: \p{Sc=Mahj}) (39)
3610          \p{Script: Mahj}        \p{Script=Mahajani} (39)
3611          \p{Script: Malayalam}   (Short: \p{Sc=Mlym}) (114)
3612          \p{Script: Mand}        \p{Script=Mandaic} (29)
3613          \p{Script: Mandaic}     (Short: \p{Sc=Mand}) (29)
3614          \p{Script: Mani}        \p{Script=Manichaean} (51)
3615          \p{Script: Manichaean}  (Short: \p{Sc=Mani}) (51)
3616          \p{Script: Marc}        \p{Script=Marchen} (68)
3617          \p{Script: Marchen}     (Short: \p{Sc=Marc}) (68)
3618          \p{Script: Meetei_Mayek} (Short: \p{Sc=Mtei}) (79)
3619          \p{Script: Mend}        \p{Script=Mende_Kikakui} (213)
3620          \p{Script: Mende_Kikakui} (Short: \p{Sc=Mend}) (213)
3621          \p{Script: Merc}        \p{Script=Meroitic_Cursive} (90)
3622          \p{Script: Mero}        \p{Script=Meroitic_Hieroglyphs} (32)
3623          \p{Script: Meroitic_Cursive} (Short: \p{Sc=Merc}) (90)
3624          \p{Script: Meroitic_Hieroglyphs} (Short: \p{Sc=Mero}) (32)
3625          \p{Script: Miao}        (Short: \p{Sc=Miao}) (133)
3626          \p{Script: Mlym}        \p{Script=Malayalam} (114)
3627          \p{Script: Modi}        (Short: \p{Sc=Modi}) (79)
3628          \p{Script: Mong}        \p{Script=Mongolian} (166)
3629          \p{Script: Mongolian}   (Short: \p{Sc=Mong}) (166)
3630          \p{Script: Mro}         (Short: \p{Sc=Mro}) (43)
3631          \p{Script: Mroo}        \p{Script=Mro} (43)
3632          \p{Script: Mtei}        \p{Script=Meetei_Mayek} (79)
3633          \p{Script: Mult}        \p{Script=Multani} (38)
3634          \p{Script: Multani}     (Short: \p{Sc=Mult}) (38)
3635          \p{Script: Myanmar}     (Short: \p{Sc=Mymr}) (223)
3636          \p{Script: Mymr}        \p{Script=Myanmar} (223)
3637          \p{Script: Nabataean}   (Short: \p{Sc=Nbat}) (40)
3638          \p{Script: Narb}        \p{Script=Old_North_Arabian} (32)
3639          \p{Script: Nbat}        \p{Script=Nabataean} (40)
3640          \p{Script: New_Tai_Lue} (Short: \p{Sc=Talu}) (83)
3641          \p{Script: Newa}        (Short: \p{Sc=Newa}) (92)
3642          \p{Script: Nko}         (Short: \p{Sc=Nko}) (59)
3643          \p{Script: Nkoo}        \p{Script=Nko} (59)
3644          \p{Script: Ogam}        \p{Script=Ogham} (29)
3645          \p{Script: Ogham}       (Short: \p{Sc=Ogam}) (29)
3646          \p{Script: Ol_Chiki}    (Short: \p{Sc=Olck}) (48)
3647          \p{Script: Olck}        \p{Script=Ol_Chiki} (48)
3648          \p{Script: Old_Hungarian} (Short: \p{Sc=Hung}) (108)
3649          \p{Script: Old_Italic}  (Short: \p{Sc=Ital}) (36)
3650          \p{Script: Old_North_Arabian} (Short: \p{Sc=Narb}) (32)
3651          \p{Script: Old_Permic}  (Short: \p{Sc=Perm}) (43)
3652          \p{Script: Old_Persian} (Short: \p{Sc=Xpeo}) (50)
3653          \p{Script: Old_South_Arabian} (Short: \p{Sc=Sarb}) (32)
3654          \p{Script: Old_Turkic}  (Short: \p{Sc=Orkh}) (73)
3655          \p{Script: Oriya}       (Short: \p{Sc=Orya}) (90)
3656          \p{Script: Orkh}        \p{Script=Old_Turkic} (73)
3657          \p{Script: Orya}        \p{Script=Oriya} (90)
3658          \p{Script: Osage}       (Short: \p{Sc=Osge}) (72)
3659          \p{Script: Osge}        \p{Script=Osage} (72)
3660          \p{Script: Osma}        \p{Script=Osmanya} (40)
3661          \p{Script: Osmanya}     (Short: \p{Sc=Osma}) (40)
3662          \p{Script: Pahawh_Hmong} (Short: \p{Sc=Hmng}) (127)
3663          \p{Script: Palm}        \p{Script=Palmyrene} (32)
3664          \p{Script: Palmyrene}   (Short: \p{Sc=Palm}) (32)
3665          \p{Script: Pau_Cin_Hau} (Short: \p{Sc=Pauc}) (57)
3666          \p{Script: Pauc}        \p{Script=Pau_Cin_Hau} (57)
3667          \p{Script: Perm}        \p{Script=Old_Permic} (43)
3668          \p{Script: Phag}        \p{Script=Phags_Pa} (56)
3669          \p{Script: Phags_Pa}    (Short: \p{Sc=Phag}) (56)
3670          \p{Script: Phli}        \p{Script=Inscriptional_Pahlavi} (27)
3671          \p{Script: Phlp}        \p{Script=Psalter_Pahlavi} (29)
3672          \p{Script: Phnx}        \p{Script=Phoenician} (29)
3673          \p{Script: Phoenician}  (Short: \p{Sc=Phnx}) (29)
3674          \p{Script: Plrd}        \p{Script=Miao} (133)
3675          \p{Script: Prti}        \p{Script=Inscriptional_Parthian} (30)
3676          \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29)
3677          \p{Script: Qaac}        \p{Script=Coptic} (137)
3678          \p{Script: Qaai}        \p{Script=Inherited} (564)
3679          \p{Script: Rejang}      (Short: \p{Sc=Rjng}) (37)
3680          \p{Script: Rjng}        \p{Script=Rejang} (37)
3681          \p{Script: Runic}       (Short: \p{Sc=Runr}) (86)
3682          \p{Script: Runr}        \p{Script=Runic} (86)
3683          \p{Script: Samaritan}   (Short: \p{Sc=Samr}) (61)
3684          \p{Script: Samr}        \p{Script=Samaritan} (61)
3685          \p{Script: Sarb}        \p{Script=Old_South_Arabian} (32)
3686          \p{Script: Saur}        \p{Script=Saurashtra} (82)
3687          \p{Script: Saurashtra}  (Short: \p{Sc=Saur}) (82)
3688          \p{Script: Sgnw}        \p{Script=SignWriting} (672)
3689          \p{Script: Sharada}     (Short: \p{Sc=Shrd}) (94)
3690          \p{Script: Shavian}     (Short: \p{Sc=Shaw}) (48)
3691          \p{Script: Shaw}        \p{Script=Shavian} (48)
3692          \p{Script: Shrd}        \p{Script=Sharada} (94)
3693          \p{Script: Sidd}        \p{Script=Siddham} (92)
3694          \p{Script: Siddham}     (Short: \p{Sc=Sidd}) (92)
3695          \p{Script: SignWriting} (Short: \p{Sc=Sgnw}) (672)
3696          \p{Script: Sind}        \p{Script=Khudawadi} (69)
3697          \p{Script: Sinh}        \p{Script=Sinhala} (110)
3698          \p{Script: Sinhala}     (Short: \p{Sc=Sinh}) (110)
3699          \p{Script: Sora}        \p{Script=Sora_Sompeng} (35)
3700          \p{Script: Sora_Sompeng} (Short: \p{Sc=Sora}) (35)
3701          \p{Script: Sund}        \p{Script=Sundanese} (72)
3702          \p{Script: Sundanese}   (Short: \p{Sc=Sund}) (72)
3703          \p{Script: Sylo}        \p{Script=Syloti_Nagri} (44)
3704          \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (44)
3705          \p{Script: Syrc}        \p{Script=Syriac} (77)
3706          \p{Script: Syriac}      (Short: \p{Sc=Syrc}) (77)
3707          \p{Script: Tagalog}     (Short: \p{Sc=Tglg}) (20)
3708          \p{Script: Tagb}        \p{Script=Tagbanwa} (18)
3709          \p{Script: Tagbanwa}    (Short: \p{Sc=Tagb}) (18)
3710          \p{Script: Tai_Le}      (Short: \p{Sc=Tale}) (35)
3711          \p{Script: Tai_Tham}    (Short: \p{Sc=Lana}) (127)
3712          \p{Script: Tai_Viet}    (Short: \p{Sc=Tavt}) (72)
3713          \p{Script: Takr}        \p{Script=Takri} (66)
3714          \p{Script: Takri}       (Short: \p{Sc=Takr}) (66)
3715          \p{Script: Tale}        \p{Script=Tai_Le} (35)
3716          \p{Script: Talu}        \p{Script=New_Tai_Lue} (83)
3717          \p{Script: Tamil}       (Short: \p{Sc=Taml}) (72)
3718          \p{Script: Taml}        \p{Script=Tamil} (72)
3719          \p{Script: Tang}        \p{Script=Tangut} (6881)
3720          \p{Script: Tangut}      (Short: \p{Sc=Tang}) (6881)
3721          \p{Script: Tavt}        \p{Script=Tai_Viet} (72)
3722          \p{Script: Telu}        \p{Script=Telugu} (96)
3723          \p{Script: Telugu}      (Short: \p{Sc=Telu}) (96)
3724          \p{Script: Tfng}        \p{Script=Tifinagh} (59)
3725          \p{Script: Tglg}        \p{Script=Tagalog} (20)
3726          \p{Script: Thaa}        \p{Script=Thaana} (50)
3727          \p{Script: Thaana}      (Short: \p{Sc=Thaa}) (50)
3728          \p{Script: Thai}        (Short: \p{Sc=Thai}) (86)
3729          \p{Script: Tibetan}     (Short: \p{Sc=Tibt}) (207)
3730          \p{Script: Tibt}        \p{Script=Tibetan} (207)
3731          \p{Script: Tifinagh}    (Short: \p{Sc=Tfng}) (59)
3732          \p{Script: Tirh}        \p{Script=Tirhuta} (82)
3733          \p{Script: Tirhuta}     (Short: \p{Sc=Tirh}) (82)
3734          \p{Script: Ugar}        \p{Script=Ugaritic} (31)
3735          \p{Script: Ugaritic}    (Short: \p{Sc=Ugar}) (31)
3736          \p{Script: Unknown}     (Short: \p{Sc=Zzzz}) (985_875 plus all
3737                                    above-Unicode code points)
3738          \p{Script: Vai}         (Short: \p{Sc=Vai}) (300)
3739          \p{Script: Vaii}        \p{Script=Vai} (300)
3740          \p{Script: Wara}        \p{Script=Warang_Citi} (84)
3741          \p{Script: Warang_Citi} (Short: \p{Sc=Wara}) (84)
3742          \p{Script: Xpeo}        \p{Script=Old_Persian} (50)
3743          \p{Script: Xsux}        \p{Script=Cuneiform} (1234)
3744          \p{Script: Yi}          (Short: \p{Sc=Yi}) (1220)
3745          \p{Script: Yiii}        \p{Script=Yi} (1220)
3746          \p{Script: Zinh}        \p{Script=Inherited} (564)
3747          \p{Script: Zyyy}        \p{Script=Common} (7279)
3748          \p{Script: Zzzz}        \p{Script=Unknown} (985_875 plus all
3749                                    above-Unicode code points)
3750          \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (88)
3751          \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (88)
3752          \p{Script_Extensions: Aghb} \p{Script_Extensions=
3753                                    Caucasian_Albanian} (53)
3754          \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (57)
3755          \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
3756                                    \p{Hluw}) (583)
3757          \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1323)
3758          \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
3759                                    (1323)
3760          \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
3761                                    (94)
3762          \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
3763                                    (31)
3764          \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (94)
3765          \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst}) (61)
3766          \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
3767          \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
3768          \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
3769                                    (121)
3770          \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
3771          \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657)
3772          \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
3773          \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
3774                                    (36)
3775          \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56)
3776          \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
3777          \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (98)
3778          \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng}) (98)
3779          \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
3780                                    (97)
3781          \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
3782          \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (110)
3783          \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
3784                                    (110)
3785          \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
3786          \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah}) (109)
3787          \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
3788          \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
3789                                    (256)
3790          \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
3791          \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
3792                                    (31)
3793          \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
3794          \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22)
3795          \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (87)
3796          \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
3797                                    \p{Cans}) (710)
3798          \p{Script_Extensions: Cans} \p{Script_Extensions=
3799                                    Canadian_Aboriginal} (710)
3800          \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
3801          \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49)
3802          \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
3803                                    \p{Aghb}) (53)
3804          \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (87)
3805          \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83)
3806          \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
3807          \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
3808                                    (172)
3809          \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
3810                                    (6864)
3811          \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
3812          \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt}) (165)
3813          \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
3814          \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
3815                                    (1234)
3816          \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
3817                                    (112)
3818          \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
3819                                    (446)
3820          \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (446)
3821          \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt}) (80)
3822          \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
3823          \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
3824                                    (210)
3825          \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
3826          \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
3827          \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
3828                                    (147)
3829          \p{Script_Extensions: Egyp} \p{Script_Extensions=
3830                                    Egyptian_Hieroglyphs} (1071)
3831          \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
3832                                    \p{Egyp}) (1071)
3833          \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
3834          \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba}) (40)
3835          \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
3836          \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
3837                                    (495)
3838          \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (129)
3839          \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
3840                                    (129)
3841          \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136)
3842          \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
3843                                    (136)
3844          \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
3845          \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27)
3846          \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (113)
3847          \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
3848                                    (113)
3849          \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522)
3850          \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
3851          \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
3852                                    (99)
3853          \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (99)
3854          \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
3855                                    (93)
3856          \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (93)
3857          \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (82_013)
3858          \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
3859          \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
3860                                    (11_775)
3861          \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (82_013)
3862          \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
3863          \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano}) (23)
3864          \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
3865          \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26)
3866          \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (133)
3867          \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr}) (133)
3868          \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (143)
3869          \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
3870                                    (143)
3871          \p{Script_Extensions: Hluw} \p{Script_Extensions=
3872                                    Anatolian_Hieroglyphs} (583)
3873          \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
3874                                    (127)
3875          \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
3876                                    (108)
3877          \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
3878                                    \p{Armi}) (31)
3879          \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
3880                                    (496)
3881          \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
3882                                    \p{Phli}) (27)
3883          \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
3884                                    Prti}, \p{Prti}) (30)
3885          \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (36)
3886          \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
3887          \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
3888                                    (91)
3889          \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (86)
3890          \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
3891          \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (352)
3892          \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
3893                                    (100)
3894          \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
3895                                    (352)
3896          \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
3897                                    (48)
3898          \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (65)
3899          \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
3900                                    (65)
3901          \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146)
3902          \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
3903          \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (72)
3904          \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (72)
3905          \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
3906                                    (81)
3907          \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (100)
3908          \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (86)
3909          \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
3910          \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (67)
3911          \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (67)
3912          \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn}) (1370)
3913          \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1370)
3914          \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
3915          \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74)
3916          \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
3917          \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69)
3918          \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
3919          \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
3920          \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
3921                                    (386)
3922          \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
3923                                    (268)
3924          \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (48)
3925          \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
3926          \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29)
3927          \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
3928          \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27)
3929          \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
3930                                    (61)
3931          \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
3932          \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
3933                                    (119)
3934          \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
3935          \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand}) (30)
3936          \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
3937          \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
3938                                    (52)
3939          \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
3940          \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc}) (68)
3941          \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
3942                                    \p{Mtei}) (79)
3943          \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
3944                                    (213)
3945          \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
3946                                    \p{Mend}) (213)
3947          \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
3948                                    (90)
3949          \p{Script_Extensions: Mero} \p{Script_Extensions=
3950                                    Meroitic_Hieroglyphs} (32)
3951          \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
3952                                    \p{Merc}) (90)
3953          \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
3954                                    \p{Mero}) (32)
3955          \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (133)
3956          \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (119)
3957          \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89)
3958          \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (169)
3959          \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
3960                                    (169)
3961          \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43)
3962          \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
3963          \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
3964          \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
3965          \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult}) (48)
3966          \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
3967                                    (224)
3968          \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
3969          \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
3970                                    (40)
3971          \p{Script_Extensions: Narb} \p{Script_Extensions=
3972                                    Old_North_Arabian} (32)
3973          \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
3974          \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
3975                                    (83)
3976          \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (92)
3977          \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (59)
3978          \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (59)
3979          \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
3980          \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29)
3981          \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
3982                                    (48)
3983          \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
3984          \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
3985                                    \p{Hung}) (108)
3986          \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
3987                                    (36)
3988          \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
3989                                    \p{Narb}) (32)
3990          \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
3991                                    (44)
3992          \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
3993                                    (50)
3994          \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
3995                                    \p{Sarb}) (32)
3996          \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
3997                                    (73)
3998          \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (94)
3999          \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
4000          \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (94)
4001          \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72)
4002          \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
4003          \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
4004          \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma}) (40)
4005          \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
4006                                    \p{Hmng}) (127)
4007          \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
4008          \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
4009                                    (32)
4010          \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
4011                                    (57)
4012          \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
4013          \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
4014          \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
4015          \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
4016                                    (59)
4017          \p{Script_Extensions: Phli} \p{Script_Extensions=
4018                                    Inscriptional_Pahlavi} (27)
4019          \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
4020                                    (30)
4021          \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
4022          \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
4023                                    (29)
4024          \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (133)
4025          \p{Script_Extensions: Prti} \p{Script_Extensions=
4026                                    Inscriptional_Parthian} (30)
4027          \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
4028                                    \p{Phlp}) (30)
4029          \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
4030          \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (496)
4031          \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37)
4032          \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
4033          \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86)
4034          \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
4035          \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
4036                                    (61)
4037          \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
4038          \p{Script_Extensions: Sarb} \p{Script_Extensions=
4039                                    Old_South_Arabian} (32)
4040          \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
4041          \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
4042                                    (82)
4043          \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
4044          \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
4045                                    (100)
4046          \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw}) (48)
4047          \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
4048          \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100)
4049          \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
4050          \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd}) (92)
4051          \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
4052                                    (672)
4053          \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
4054          \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112)
4055          \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
4056                                    (112)
4057          \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
4058          \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
4059                                    \p{Sora}) (35)
4060          \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
4061          \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
4062                                    (72)
4063          \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56)
4064          \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
4065                                    \p{Sylo}) (56)
4066          \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (93)
4067          \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc}) (93)
4068          \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg}) (22)
4069          \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
4070          \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
4071                                    (20)
4072          \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45)
4073          \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
4074                                    (127)
4075          \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
4076                                    (72)
4077          \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (78)
4078          \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (78)
4079          \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
4080          \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
4081          \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (80)
4082          \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (80)
4083          \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6881)
4084          \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
4085                                    (6881)
4086          \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
4087          \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (101)
4088          \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu}) (101)
4089          \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
4090          \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
4091          \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (65)
4092          \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (65)
4093          \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86)
4094          \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
4095                                    (207)
4096          \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
4097          \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
4098                                    (59)
4099          \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (94)
4100          \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh}) (94)
4101          \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
4102          \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
4103                                    (31)
4104          \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
4105                                    (985_875 plus all above-Unicode code
4106                                    points)
4107          \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300)
4108          \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
4109          \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
4110          \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
4111                                    (84)
4112          \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
4113          \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
4114          \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246)
4115          \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
4116          \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (496)
4117          \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (6864)
4118          \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (985_875
4119                                    plus all above-Unicode code points)
4120          \p{Scx: *}              \p{Script_Extensions: *}
4121          \p{SD}                  \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
4122          \p{SD: *}               \p{Soft_Dotted: *}
4123          \p{Sentence_Break: AT}  \p{Sentence_Break=ATerm} (4)
4124          \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
4125          \p{Sentence_Break: CL}  \p{Sentence_Break=Close} (187)
4126          \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187)
4127          \p{Sentence_Break: CR}  (Short: \p{SB=CR}) (1)
4128          \p{Sentence_Break: EX}  \p{Sentence_Break=Extend} (2197)
4129          \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2197)
4130          \p{Sentence_Break: FO}  \p{Sentence_Break=Format} (53)
4131          \p{Sentence_Break: Format} (Short: \p{SB=FO}) (53)
4132          \p{Sentence_Break: LE}  \p{Sentence_Break=OLetter} (113_027)
4133          \p{Sentence_Break: LF}  (Short: \p{SB=LF}) (1)
4134          \p{Sentence_Break: LO}  \p{Sentence_Break=Lower} (2251)
4135          \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2251)
4136          \p{Sentence_Break: NU}  \p{Sentence_Break=Numeric} (572)
4137          \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (572)
4138          \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (113_027)
4139          \p{Sentence_Break: Other} (Short: \p{SB=XX}) (993_796 plus all
4140                                    above-Unicode code points)
4141          \p{Sentence_Break: SC}  \p{Sentence_Break=SContinue} (26)
4142          \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
4143          \p{Sentence_Break: SE}  \p{Sentence_Break=Sep} (3)
4144          \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
4145          \p{Sentence_Break: Sp}  (Short: \p{SB=Sp}) (20)
4146          \p{Sentence_Break: ST}  \p{Sentence_Break=STerm} (121)
4147          \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (121)
4148          \p{Sentence_Break: UP}  \p{Sentence_Break=Upper} (1853)
4149          \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1853)
4150          \p{Sentence_Break: XX}  \p{Sentence_Break=Other} (993_796 plus all
4151                                    above-Unicode code points)
4152          \p{Sentence_Terminal}   \p{Sentence_Terminal=Y} (Short: \p{STerm})
4153                                    (124)
4154          \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
4155                                    (1_113_988 plus all above-Unicode code
4156                                    points)
4157          \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (124)
4158          \p{Separator}           \p{General_Category=Separator} (Short:
4159                                    \p{Z}) (19)
4160          \p{Sgnw}                \p{SignWriting} (= \p{Script_Extensions=
4161                                    SignWriting}) (672)
4162          \p{Sharada}             \p{Script_Extensions=Sharada} (Short:
4163                                    \p{Shrd}; NOT \p{Block=Sharada}) (100)
4164          \p{Shavian}             \p{Script_Extensions=Shavian} (Short:
4165                                    \p{Shaw}) (48)
4166          \p{Shaw}                \p{Shavian} (= \p{Script_Extensions=
4167                                    Shavian}) (48)
4168        X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
4169                                    (16)
4170          \p{Shrd}                \p{Sharada} (= \p{Script_Extensions=
4171                                    Sharada}) (NOT \p{Block=Sharada}) (100)
4172          \p{Sidd}                \p{Siddham} (= \p{Script_Extensions=
4173                                    Siddham}) (NOT \p{Block=Siddham}) (92)
4174          \p{Siddham}             \p{Script_Extensions=Siddham} (Short:
4175                                    \p{Sidd}; NOT \p{Block=Siddham}) (92)
4176          \p{SignWriting}         \p{Script_Extensions=SignWriting} (Short:
4177                                    \p{Sgnw}) (672)
4178          \p{Sind}                \p{Khudawadi} (= \p{Script_Extensions=
4179                                    Khudawadi}) (NOT \p{Block=Khudawadi})
4180                                    (81)
4181          \p{Sinh}                \p{Sinhala} (= \p{Script_Extensions=
4182                                    Sinhala}) (NOT \p{Block=Sinhala}) (112)
4183          \p{Sinhala}             \p{Script_Extensions=Sinhala} (Short:
4184                                    \p{Sinh}; NOT \p{Block=Sinhala}) (112)
4185        X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
4186          \p{Sk}                  \p{Modifier_Symbol} (=
4187                                    \p{General_Category=Modifier_Symbol})
4188                                    (121)
4189          \p{Sm}                  \p{Math_Symbol} (= \p{General_Category=
4190                                    Math_Symbol}) (948)
4191        X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
4192                                    \p{InSmallForms}) (32)
4193        X \p{Small_Forms}         \p{Small_Form_Variants} (= \p{Block=
4194                                    Small_Form_Variants}) (32)
4195          \p{So}                  \p{Other_Symbol} (= \p{General_Category=
4196                                    Other_Symbol}) (5777)
4197          \p{Soft_Dotted}         \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
4198          \p{Soft_Dotted: N*}     (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
4199                                    all above-Unicode code points)
4200          \p{Soft_Dotted: Y*}     (Short: \p{SD=Y}, \p{SD}) (46)
4201          \p{Sora}                \p{Sora_Sompeng} (= \p{Script_Extensions=
4202                                    Sora_Sompeng}) (NOT \p{Block=
4203                                    Sora_Sompeng}) (35)
4204          \p{Sora_Sompeng}        \p{Script_Extensions=Sora_Sompeng} (Short:
4205                                    \p{Sora}; NOT \p{Block=Sora_Sompeng})
4206                                    (35)
4207          \p{Space}               \p{White_Space} (= \p{White_Space=Y}) (25)
4208          \p{Space: *}            \p{White_Space: *}
4209          \p{Space_Separator}     \p{General_Category=Space_Separator}
4210                                    (Short: \p{Zs}) (17)
4211          \p{SpacePerl}           \p{XPosixSpace} (25)
4212          \p{Spacing_Mark}        \p{General_Category=Spacing_Mark} (Short:
4213                                    \p{Mc}) (394)
4214        X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
4215                                    (Short: \p{InModifierLetters}) (80)
4216        X \p{Specials}            \p{Block=Specials} (16)
4217          \p{STerm}               \p{Sentence_Terminal} (=
4218                                    \p{Sentence_Terminal=Y}) (124)
4219          \p{STerm: *}            \p{Sentence_Terminal: *}
4220          \p{Sund}                \p{Sundanese} (= \p{Script_Extensions=
4221                                    Sundanese}) (NOT \p{Block=Sundanese})
4222                                    (72)
4223          \p{Sundanese}           \p{Script_Extensions=Sundanese} (Short:
4224                                    \p{Sund}; NOT \p{Block=Sundanese}) (72)
4225        X \p{Sundanese_Sup}       \p{Sundanese_Supplement} (= \p{Block=
4226                                    Sundanese_Supplement}) (16)
4227        X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
4228                                    \p{InSundaneseSup}) (16)
4229        X \p{Sup_Arrows_A}        \p{Supplemental_Arrows_A} (= \p{Block=
4230                                    Supplemental_Arrows_A}) (16)
4231        X \p{Sup_Arrows_B}        \p{Supplemental_Arrows_B} (= \p{Block=
4232                                    Supplemental_Arrows_B}) (128)
4233        X \p{Sup_Arrows_C}        \p{Supplemental_Arrows_C} (= \p{Block=
4234                                    Supplemental_Arrows_C}) (256)
4235        X \p{Sup_Math_Operators}  \p{Supplemental_Mathematical_Operators} (=
4236                                    \p{Block=
4237                                    Supplemental_Mathematical_Operators})
4238                                    (256)
4239        X \p{Sup_PUA_A}           \p{Supplementary_Private_Use_Area_A} (=
4240                                    \p{Block=
4241                                    Supplementary_Private_Use_Area_A})
4242                                    (65_536)
4243        X \p{Sup_PUA_B}           \p{Supplementary_Private_Use_Area_B} (=
4244                                    \p{Block=
4245                                    Supplementary_Private_Use_Area_B})
4246                                    (65_536)
4247        X \p{Sup_Punctuation}     \p{Supplemental_Punctuation} (= \p{Block=
4248                                    Supplemental_Punctuation}) (128)
4249        X \p{Sup_Symbols_And_Pictographs}
4250                                    \p{Supplemental_Symbols_And_Pictographs}
4251                                    (= \p{Block=
4252                                    Supplemental_Symbols_And_Pictographs})
4253                                    (256)
4254        X \p{Super_And_Sub}       \p{Superscripts_And_Subscripts} (=
4255                                    \p{Block=Superscripts_And_Subscripts})
4256                                    (48)
4257        X \p{Superscripts_And_Subscripts} \p{Block=
4258                                    Superscripts_And_Subscripts} (Short:
4259                                    \p{InSuperAndSub}) (48)
4260        X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
4261                                    \p{InSupArrowsA}) (16)
4262        X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
4263                                    \p{InSupArrowsB}) (128)
4264        X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
4265                                    \p{InSupArrowsC}) (256)
4266        X \p{Supplemental_Mathematical_Operators} \p{Block=
4267                                    Supplemental_Mathematical_Operators}
4268                                    (Short: \p{InSupMathOperators}) (256)
4269        X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
4270                                    (Short: \p{InSupPunctuation}) (128)
4271        X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
4272                                    Supplemental_Symbols_And_Pictographs}
4273                                    (Short: \p{InSupSymbolsAndPictographs})
4274                                    (256)
4275        X \p{Supplementary_Private_Use_Area_A} \p{Block=
4276                                    Supplementary_Private_Use_Area_A}
4277                                    (Short: \p{InSupPUAA}) (65_536)
4278        X \p{Supplementary_Private_Use_Area_B} \p{Block=
4279                                    Supplementary_Private_Use_Area_B}
4280                                    (Short: \p{InSupPUAB}) (65_536)
4281          \p{Surrogate}           \p{General_Category=Surrogate} (Short:
4282                                    \p{Cs}) (2048)
4283        X \p{Sutton_SignWriting}  \p{Block=Sutton_SignWriting} (688)
4284          \p{Sylo}                \p{Syloti_Nagri} (= \p{Script_Extensions=
4285                                    Syloti_Nagri}) (NOT \p{Block=
4286                                    Syloti_Nagri}) (56)
4287          \p{Syloti_Nagri}        \p{Script_Extensions=Syloti_Nagri} (Short:
4288                                    \p{Sylo}; NOT \p{Block=Syloti_Nagri})
4289                                    (56)
4290          \p{Symbol}              \p{General_Category=Symbol} (Short: \p{S})
4291                                    (6899)
4292          \p{Syrc}                \p{Syriac} (= \p{Script_Extensions=
4293                                    Syriac}) (NOT \p{Block=Syriac}) (93)
4294          \p{Syriac}              \p{Script_Extensions=Syriac} (Short:
4295                                    \p{Syrc}; NOT \p{Block=Syriac}) (93)
4296          \p{Tagalog}             \p{Script_Extensions=Tagalog} (Short:
4297                                    \p{Tglg}; NOT \p{Block=Tagalog}) (22)
4298          \p{Tagb}                \p{Tagbanwa} (= \p{Script_Extensions=
4299                                    Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
4300          \p{Tagbanwa}            \p{Script_Extensions=Tagbanwa} (Short:
4301                                    \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
4302        X \p{Tags}                \p{Block=Tags} (128)
4303          \p{Tai_Le}              \p{Script_Extensions=Tai_Le} (Short:
4304                                    \p{Tale}; NOT \p{Block=Tai_Le}) (45)
4305          \p{Tai_Tham}            \p{Script_Extensions=Tai_Tham} (Short:
4306                                    \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
4307          \p{Tai_Viet}            \p{Script_Extensions=Tai_Viet} (Short:
4308                                    \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
4309        X \p{Tai_Xuan_Jing}       \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
4310                                    Tai_Xuan_Jing_Symbols}) (96)
4311        X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
4312                                    \p{InTaiXuanJing}) (96)
4313          \p{Takr}                \p{Takri} (= \p{Script_Extensions=Takri})
4314                                    (NOT \p{Block=Takri}) (78)
4315          \p{Takri}               \p{Script_Extensions=Takri} (Short:
4316                                    \p{Takr}; NOT \p{Block=Takri}) (78)
4317          \p{Tale}                \p{Tai_Le} (= \p{Script_Extensions=
4318                                    Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
4319          \p{Talu}                \p{New_Tai_Lue} (= \p{Script_Extensions=
4320                                    New_Tai_Lue}) (NOT \p{Block=
4321                                    New_Tai_Lue}) (83)
4322          \p{Tamil}               \p{Script_Extensions=Tamil} (Short:
4323                                    \p{Taml}; NOT \p{Block=Tamil}) (80)
4324          \p{Taml}                \p{Tamil} (= \p{Script_Extensions=Tamil})
4325                                    (NOT \p{Block=Tamil}) (80)
4326          \p{Tang}                \p{Tangut} (= \p{Script_Extensions=
4327                                    Tangut}) (NOT \p{Block=Tangut}) (6881)
4328          \p{Tangut}              \p{Script_Extensions=Tangut} (Short:
4329                                    \p{Tang}; NOT \p{Block=Tangut}) (6881)
4330        X \p{Tangut_Components}   \p{Block=Tangut_Components} (768)
4331          \p{Tavt}                \p{Tai_Viet} (= \p{Script_Extensions=
4332                                    Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
4333          \p{Telu}                \p{Telugu} (= \p{Script_Extensions=
4334                                    Telugu}) (NOT \p{Block=Telugu}) (101)
4335          \p{Telugu}              \p{Script_Extensions=Telugu} (Short:
4336                                    \p{Telu}; NOT \p{Block=Telugu}) (101)
4337          \p{Term}                \p{Terminal_Punctuation} (=
4338                                    \p{Terminal_Punctuation=Y}) (246)
4339          \p{Term: *}             \p{Terminal_Punctuation: *}
4340          \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
4341                                    \p{Term}) (246)
4342          \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
4343                                    (1_113_866 plus all above-Unicode code
4344                                    points)
4345          \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (246)
4346          \p{Tfng}                \p{Tifinagh} (= \p{Script_Extensions=
4347                                    Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
4348          \p{Tglg}                \p{Tagalog} (= \p{Script_Extensions=
4349                                    Tagalog}) (NOT \p{Block=Tagalog}) (22)
4350          \p{Thaa}                \p{Thaana} (= \p{Script_Extensions=
4351                                    Thaana}) (NOT \p{Block=Thaana}) (65)
4352          \p{Thaana}              \p{Script_Extensions=Thaana} (Short:
4353                                    \p{Thaa}; NOT \p{Block=Thaana}) (65)
4354          \p{Thai}                \p{Script_Extensions=Thai} (NOT \p{Block=
4355                                    Thai}) (86)
4356          \p{Tibetan}             \p{Script_Extensions=Tibetan} (Short:
4357                                    \p{Tibt}; NOT \p{Block=Tibetan}) (207)
4358          \p{Tibt}                \p{Tibetan} (= \p{Script_Extensions=
4359                                    Tibetan}) (NOT \p{Block=Tibetan}) (207)
4360          \p{Tifinagh}            \p{Script_Extensions=Tifinagh} (Short:
4361                                    \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
4362          \p{Tirh}                \p{Tirhuta} (= \p{Script_Extensions=
4363                                    Tirhuta}) (NOT \p{Block=Tirhuta}) (94)
4364          \p{Tirhuta}             \p{Script_Extensions=Tirhuta} (Short:
4365                                    \p{Tirh}; NOT \p{Block=Tirhuta}) (94)
4366          \p{Title}               \p{Titlecase} (/i= Cased=Yes) (31)
4367          \p{Titlecase}           (= \p{Gc=Lt}) (Short: \p{Title}; /i=
4368                                    Cased=Yes) (31)
4369          \p{Titlecase_Letter}    \p{General_Category=Titlecase_Letter}
4370                                    (Short: \p{Lt}; /i= General_Category=
4371                                    Cased_Letter) (31)
4372        X \p{Transport_And_Map}   \p{Transport_And_Map_Symbols} (= \p{Block=
4373                                    Transport_And_Map_Symbols}) (128)
4374        X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
4375                                    (Short: \p{InTransportAndMap}) (128)
4376        X \p{UCAS}                \p{Unified_Canadian_Aboriginal_Syllabics}
4377                                    (= \p{Block=
4378                                    Unified_Canadian_Aboriginal_Syllabics})
4379                                    (640)
4380        X \p{UCAS_Ext}            \p{Unified_Canadian_Aboriginal_Syllabics_-
4381                                    Extended} (= \p{Block=
4382                                    Unified_Canadian_Aboriginal_Syllabics_-
4383                                    Extended}) (80)
4384          \p{Ugar}                \p{Ugaritic} (= \p{Script_Extensions=
4385                                    Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
4386          \p{Ugaritic}            \p{Script_Extensions=Ugaritic} (Short:
4387                                    \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
4388          \p{UIdeo}               \p{Unified_Ideograph} (=
4389                                    \p{Unified_Ideograph=Y}) (80_388)
4390          \p{UIdeo: *}            \p{Unified_Ideograph: *}
4391          \p{Unassigned}          \p{General_Category=Unassigned} (Short:
4392                                    \p{Cn}) (846_359 plus all above-Unicode
4393                                    code points)
4394          \p{Unicode}             \p{Any} (1_114_112)
4395        X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
4396                                    Unified_Canadian_Aboriginal_Syllabics}
4397                                    (Short: \p{InUCAS}) (640)
4398        X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
4399                                    Unified_Canadian_Aboriginal_Syllabics_-
4400                                    Extended} (Short: \p{InUCASExt}) (80)
4401          \p{Unified_Ideograph}   \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
4402                                    (80_388)
4403          \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
4404                                    (1_033_724 plus all above-Unicode code
4405                                    points)
4406          \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (80_388)
4407          \p{Unknown}             \p{Script_Extensions=Unknown} (Short:
4408                                    \p{Zzzz}) (985_875 plus all above-
4409                                    Unicode code points)
4410          \p{Upper}               \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
4411                                    Cased=Yes) (1822)
4412          \p{Upper: *}            \p{Uppercase: *}
4413          \p{Uppercase}           \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
4414                                    Cased=Yes) (1822)
4415          \p{Uppercase: N*}       (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
4416                                    No) (1_112_290 plus all above-Unicode
4417                                    code points)
4418          \p{Uppercase: Y*}       (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
4419                                    Yes) (1822)
4420          \p{Uppercase_Letter}    \p{General_Category=Uppercase_Letter}
4421                                    (Short: \p{Lu}; /i= General_Category=
4422                                    Cased_Letter) (1702)
4423          \p{Vai}                 \p{Script_Extensions=Vai} (NOT \p{Block=
4424                                    Vai}) (300)
4425          \p{Vaii}                \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
4426                                    \p{Block=Vai}) (300)
4427          \p{Variation_Selector}  \p{Variation_Selector=Y} (Short: \p{VS};
4428                                    NOT \p{Variation_Selectors}) (259)
4429          \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
4430                                    plus all above-Unicode code points)
4431          \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
4432        X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
4433                                    \p{InVS}) (16)
4434        X \p{Variation_Selectors_Supplement} \p{Block=
4435                                    Variation_Selectors_Supplement} (Short:
4436                                    \p{InVSSup}) (240)
4437        X \p{Vedic_Ext}           \p{Vedic_Extensions} (= \p{Block=
4438                                    Vedic_Extensions}) (48)
4439        X \p{Vedic_Extensions}    \p{Block=Vedic_Extensions} (Short:
4440                                    \p{InVedicExt}) (48)
4441        X \p{Vertical_Forms}      \p{Block=Vertical_Forms} (16)
4442          \p{VertSpace}           \v (7)
4443          \p{VS}                  \p{Variation_Selector} (=
4444                                    \p{Variation_Selector=Y}) (NOT
4445                                    \p{Variation_Selectors}) (259)
4446          \p{VS: *}               \p{Variation_Selector: *}
4447        X \p{VS_Sup}              \p{Variation_Selectors_Supplement} (=
4448                                    \p{Block=
4449                                    Variation_Selectors_Supplement}) (240)
4450          \p{Wara}                \p{Warang_Citi} (= \p{Script_Extensions=
4451                                    Warang_Citi}) (NOT \p{Block=
4452                                    Warang_Citi}) (84)
4453          \p{Warang_Citi}         \p{Script_Extensions=Warang_Citi} (Short:
4454                                    \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
4455          \p{WB: *}               \p{Word_Break: *}
4456          \p{White_Space}         \p{White_Space=Y} (Short: \p{Space}) (25)
4457          \p{White_Space: N*}     (Short: \p{Space=N}, \P{Space}) (1_114_087
4458                                    plus all above-Unicode code points)
4459          \p{White_Space: Y*}     (Short: \p{Space=Y}, \p{Space}) (25)
4460          \p{Word}                \p{XPosixWord} (119_821)
4461          \p{Word_Break: ALetter} (Short: \p{WB=LE}) (27_992)
4462          \p{Word_Break: CR}      (Short: \p{WB=CR}) (1)
4463          \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1)
4464          \p{Word_Break: DQ}      \p{Word_Break=Double_Quote} (1)
4465          \p{Word_Break: E_Base}  (Short: \p{WB=EB}) (79)
4466          \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (4)
4467          \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (5)
4468          \p{Word_Break: EB}      \p{Word_Break=E_Base} (79)
4469          \p{Word_Break: EBG}     \p{Word_Break=E_Base_GAZ} (4)
4470          \p{Word_Break: EM}      \p{Word_Break=E_Modifier} (5)
4471          \p{Word_Break: EX}      \p{Word_Break=ExtendNumLet} (11)
4472          \p{Word_Break: Extend}  (Short: \p{WB=Extend}) (2196)
4473          \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11)
4474          \p{Word_Break: FO}      \p{Word_Break=Format} (52)
4475          \p{Word_Break: Format}  (Short: \p{WB=FO}) (52)
4476          \p{Word_Break: GAZ}     \p{Word_Break=Glue_After_Zwj} (3)
4477          \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (3)
4478          \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (74)
4479          \p{Word_Break: HL}      \p{Word_Break=Hebrew_Letter} (74)
4480          \p{Word_Break: KA}      \p{Word_Break=Katakana} (310)
4481          \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310)
4482          \p{Word_Break: LE}      \p{Word_Break=ALetter} (27_992)
4483          \p{Word_Break: LF}      (Short: \p{WB=LF}) (1)
4484          \p{Word_Break: MB}      \p{Word_Break=MidNumLet} (7)
4485          \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9)
4486          \p{Word_Break: MidNum}  (Short: \p{WB=MN}) (15)
4487          \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7)
4488          \p{Word_Break: ML}      \p{Word_Break=MidLetter} (9)
4489          \p{Word_Break: MN}      \p{Word_Break=MidNum} (15)
4490          \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
4491          \p{Word_Break: NL}      \p{Word_Break=Newline} (5)
4492          \p{Word_Break: NU}      \p{Word_Break=Numeric} (571)
4493          \p{Word_Break: Numeric} (Short: \p{WB=NU}) (571)
4494          \p{Word_Break: Other}   (Short: \p{WB=XX}) (1_082_748 plus all
4495                                    above-Unicode code points)
4496          \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26)
4497          \p{Word_Break: RI}      \p{Word_Break=Regional_Indicator} (26)
4498          \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1)
4499          \p{Word_Break: SQ}      \p{Word_Break=Single_Quote} (1)
4500          \p{Word_Break: XX}      \p{Word_Break=Other} (1_082_748 plus all
4501                                    above-Unicode code points)
4502          \p{Word_Break: ZWJ}     (Short: \p{WB=ZWJ}) (1)
4503          \p{WSpace}              \p{White_Space} (= \p{White_Space=Y}) (25)
4504          \p{WSpace: *}           \p{White_Space: *}
4505          \p{XDigit}              \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
4506          \p{XID_Continue}        \p{XID_Continue=Y} (Short: \p{XIDC})
4507                                    (119_672)
4508          \p{XID_Continue: N*}    (Short: \p{XIDC=N}, \P{XIDC}) (994_440
4509                                    plus all above-Unicode code points)
4510          \p{XID_Continue: Y*}    (Short: \p{XIDC=Y}, \p{XIDC}) (119_672)
4511          \p{XID_Start}           \p{XID_Start=Y} (Short: \p{XIDS}) (116_984)
4512          \p{XID_Start: N*}       (Short: \p{XIDS=N}, \P{XIDS}) (997_128
4513                                    plus all above-Unicode code points)
4514          \p{XID_Start: Y*}       (Short: \p{XIDS=Y}, \p{XIDS}) (116_984)
4515          \p{XIDC}                \p{XID_Continue} (= \p{XID_Continue=Y})
4516                                    (119_672)
4517          \p{XIDC: *}             \p{XID_Continue: *}
4518          \p{XIDS}                \p{XID_Start} (= \p{XID_Start=Y}) (116_984)
4519          \p{XIDS: *}             \p{XID_Start: *}
4520          \p{Xpeo}                \p{Old_Persian} (= \p{Script_Extensions=
4521                                    Old_Persian}) (NOT \p{Block=
4522                                    Old_Persian}) (50)
4523          \p{XPerlSpace}          \p{XPosixSpace} (25)
4524          \p{XPosixAlnum}         Alphabetic and (decimal) Numeric (Short:
4525                                    \p{Alnum}) (118_820)
4526          \p{XPosixAlpha}         \p{Alphabetic=Y} (Short: \p{Alpha})
4527                                    (118_240)
4528          \p{XPosixBlank}         \h, Horizontal white space (Short:
4529                                    \p{Blank}) (18)
4530          \p{XPosixCntrl}         \p{General_Category=Control} Control
4531                                    characters (Short: \p{Cc}) (65)
4532          \p{XPosixDigit}         \p{General_Category=Decimal_Number} [0-9]
4533                                    + all other decimal digits (Short:
4534                                    \p{Nd}) (580)
4535          \p{XPosixGraph}         Characters that are graphical (Short:
4536                                    \p{Graph}) (265_621)
4537          \p{XPosixLower}         \p{Lowercase=Y} (Short: \p{Lower}; /i=
4538                                    Cased=Yes) (2252)
4539          \p{XPosixPrint}         Characters that are graphical plus space
4540                                    characters (but no controls) (Short:
4541                                    \p{Print}) (265_638)
4542          \p{XPosixPunct}         \p{Punct} + ASCII-range \p{Symbol} (757)
4543          \p{XPosixSpace}         \s including beyond ASCII and vertical tab
4544                                    (Short: \p{SpacePerl}) (25)
4545          \p{XPosixUpper}         \p{Uppercase=Y} (Short: \p{Upper}; /i=
4546                                    Cased=Yes) (1822)
4547          \p{XPosixWord}          \w, including beyond ASCII; = \p{Alnum} +
4548                                    \pM + \p{Pc} + \p{Join_Control} (Short:
4549                                    \p{Word}) (119_821)
4550          \p{XPosixXDigit}        \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
4551          \p{Xsux}                \p{Cuneiform} (= \p{Script_Extensions=
4552                                    Cuneiform}) (NOT \p{Block=Cuneiform})
4553                                    (1234)
4554          \p{Yi}                  \p{Script_Extensions=Yi} (1246)
4555        X \p{Yi_Radicals}         \p{Block=Yi_Radicals} (64)
4556        X \p{Yi_Syllables}        \p{Block=Yi_Syllables} (1168)
4557          \p{Yiii}                \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
4558        X \p{Yijing}              \p{Yijing_Hexagram_Symbols} (= \p{Block=
4559                                    Yijing_Hexagram_Symbols}) (64)
4560        X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
4561                                    (Short: \p{InYijing}) (64)
4562          \p{Z} \pZ               \p{Separator} (= \p{General_Category=
4563                                    Separator}) (19)
4564          \p{Zinh}                \p{Inherited} (= \p{Script_Extensions=
4565                                    Inherited}) (496)
4566          \p{Zl}                  \p{Line_Separator} (= \p{General_Category=
4567                                    Line_Separator}) (1)
4568          \p{Zp}                  \p{Paragraph_Separator} (=
4569                                    \p{General_Category=
4570                                    Paragraph_Separator}) (1)
4571          \p{Zs}                  \p{Space_Separator} (=
4572                                    \p{General_Category=Space_Separator})
4573                                    (17)
4574          \p{Zyyy}                \p{Common} (= \p{Script_Extensions=
4575                                    Common}) (6864)
4576          \p{Zzzz}                \p{Unknown} (= \p{Script_Extensions=
4577                                    Unknown}) (985_875 plus all above-
4578                                    Unicode code points)
4579        TX\p{_CanonDCIJ}          (For internal use by Perl, not necessarily
4580                                    stable) (= \p{Soft_Dotted=Y}) (46)
4581        TX\p{_Case_Ignorable}     (For internal use by Perl, not necessarily
4582                                    stable) (= \p{Case_Ignorable=Y}) (2240)
4583        TX\p{_CombAbove}          (For internal use by Perl, not necessarily
4584                                    stable) (= \p{Canonical_Combining_Class=
4585                                    Above}) (461)
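
       As a brief illustration of a few entries in the table above, the
       sketch below shows a single form that is NOT the same as the block of
       the same name, and the effect of "/i" on a property whose entry is
       annotated "/i= Cased=Yes".  The sample code points (U+0E3F, U+0E01)
       and all variable names are assumptions chosen for the example, not
       taken from the table.

```perl
use strict;
use warnings;

# U+0E3F THAI CURRENCY SYMBOL BAHT lies inside the Thai block, but its
# script is Common, so \p{Thai} (a Script_Extensions match) and
# \p{Block=Thai} disagree about it.
my $baht   = "\x{0E3F}";
my $ko_kai = "\x{0E01}";    # THAI CHARACTER KO KAI, a Thai-script letter

my $baht_in_script   = $baht   =~ /\p{Thai}/       ? 1 : 0;
my $baht_in_block    = $baht   =~ /\p{Block=Thai}/ ? 1 : 0;
my $ko_kai_in_script = $ko_kai =~ /\p{Thai}/       ? 1 : 0;

# Without /i, \p{Upper} matches only uppercase characters; with /i the
# table says it behaves like Cased=Yes, so a lowercase letter matches too.
my $lower_is_upper    = "a" =~ /\p{Upper}/  ? 1 : 0;
my $lower_is_upper_ci = "a" =~ /\p{Upper}/i ? 1 : 0;

print "BAHT   \\p{Thai}=$baht_in_script \\p{Block=Thai}=$baht_in_block\n";
print "KO KAI \\p{Thai}=$ko_kai_in_script\n";
print "'a'    \\p{Upper}=$lower_is_upper \\p{Upper}/i=$lower_is_upper_ci\n";
```

       Spelling out "Block=" or "Script_Extensions=" explicitly avoids
       depending on which meaning a single form happens to have.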
4586
4587   Legal "\p{}" and "\P{}" constructs that match no characters
4588       Unicode has some property-value pairs that currently don't match
4589       anything.  This generally happens either because they are obsolete, or
4590       because they exist for symmetry with other forms but no language that
4591       uses them has yet been encoded.  In this version of Unicode, the
4592       following match zero code points:
4593
4594       \p{Canonical_Combining_Class=Attached_Below_Left}
4595       \p{Canonical_Combining_Class=CCC133}
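
       A minimal sketch of what "legal but empty" means in practice: the
       patterns compile, yet a match against any text fails.  The sample
       string is an assumption made for illustration, and the "eval" guards
       against a Perl built with different Unicode tables, where these
       names might not be accepted at all.

```perl
use strict;
use warnings;

my @empty_props = (
    'Canonical_Combining_Class=Attached_Below_Left',
    'Canonical_Combining_Class=CCC133',
);

# Illustrative sample: an ASCII letter, a combining acute accent, and a
# Tibetan mark that has a non-zero combining class.
my $sample = "A\x{0301}\x{0F39}";

# For each property, record 1 (matched), 0 (no match), or undef (this
# Perl rejected the property name).
my %matched;
for my $prop (@empty_props) {
    $matched{$prop} = eval { $sample =~ /\p{$prop}/ ? 1 : 0 };
}

for my $prop (@empty_props) {
    my $result = $matched{$prop};
    printf "%-50s %s\n", $prop,
          !defined $result ? "not accepted by this Perl"
        : $result          ? "matched"
        :                    "no match";
}
```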
4596

Properties accessible through Unicode::UCD

4598       The value of any Unicode (not including Perl extensions) character
4599       property mentioned above for any single code point is available through
4600       "charprop()" in Unicode::UCD.  "charprops_all()" in Unicode::UCD
4601       returns the values of all the Unicode properties for a given code
4602       point.
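
       The following sketch queries one code point both ways.  It assumes a
       Unicode::UCD recent enough to export "charprop()" and
       "charprops_all()"; the code point U+0041 and the variable names are
       chosen only for illustration.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charprop charprops_all);

my $cp = 0x41;    # LATIN CAPITAL LETTER A

# One property at a time; either the full or the short property name
# may be given.
my $gc     = charprop($cp, "General_Category");
my $script = charprop($cp, "Script");

# Every Unicode property at once, as a hash reference keyed by property
# name.
my $all = charprops_all($cp);

printf "U+%04X General_Category=%s Script=%s\n", $cp, $gc, $script;
printf "charprops_all() returned %d properties\n", scalar keys %$all;
```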
4603
4604       Besides these, all the Unicode character properties mentioned above
4605       (except for those marked as for internal use by Perl) are also
4606       accessible by "prop_invlist()" in Unicode::UCD.
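
       "prop_invlist()" returns an inversion list rather than a set of
       characters, so the sketch below counts the code points it describes.
       The helper "count_code_points" is a hypothetical name introduced for
       this example, and capping open-ended lists at U+10FFFF is a choice
       made here, not something Unicode::UCD does.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Elements at even indices begin a range that matches the property;
# elements at odd indices begin a range that does not.  A list with an
# odd number of elements matches everything from its last element upward,
# which this helper caps at the end of the Unicode range.
sub count_code_points {
    my @invlist = @_;
    push @invlist, 0x110000 if @invlist % 2;
    my $count = 0;
    for (my $i = 0; $i < @invlist; $i += 2) {
        $count += $invlist[$i + 1] - $invlist[$i];
    }
    return $count;
}

my $any_count   = count_code_points(prop_invlist("Any"));
my $space_count = count_code_points(prop_invlist("White_Space"));

print "Any:         $any_count code points\n";
print "White_Space: $space_count code points\n";
```

       The counts can be compared with the parenthesized numbers in the
       table above, bearing in mind that they depend on the Unicode version
       the running Perl was built with.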
4607
4608       Due to their nature, not all Unicode character properties are suitable
4609       for regular expression matching or for "prop_invlist()".  The remaining
4610       non-provisional, non-internal ones are accessible via "prop_invmap()"
4611       in Unicode::UCD (except for those that this Perl installation hasn't
4612       included; see below for which those are).
4613
4614       For compatibility with other parts of Perl, all the single forms given
4615       in the table in the section above are recognized.  However, some Perl
4616       extensions are ambiguous with official Unicode properties, and every
4617       such ambiguity is silently resolved in favor of the official Unicode
4618       property.  To avoid surprises, you should only use "prop_invmap()" for
4619       forms listed in the table below, which omits the non-recommended ones.
4620       The affected forms are the Perl single-form equivalents of Unicode
4621       property values.  For example, "\p{sc}" is a single-form equivalent
4622       of "\p{gc=sc}", but "prop_invmap()" treats "sc" as the "Script"
4623       property, whose short name is "sc".  The table indicates the current
4624       ambiguities in the INFO column, beginning with the word "NOT".
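
       To make the "sc" case concrete, the sketch below asks
       "prop_invmap()" for both "sc" and "Script" and looks up the same code
       point in each result.  The helper "lookup" is a hypothetical name
       introduced here; it assumes the inversion list starts at code point
       0, and that the code point passed in is non-negative.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invmap);

# Return the map entry for the range containing $cp, that is, the last
# range whose starting code point is <= $cp (binary search).
sub lookup {
    my ($list, $map, $cp) = @_;
    my ($lo, $hi) = (0, $#{$list});
    while ($lo < $hi) {
        my $mid = int(($lo + $hi + 1) / 2);
        if ($list->[$mid] <= $cp) { $lo = $mid }
        else                      { $hi = $mid - 1 }
    }
    return $map->[$lo];
}

# prop_invmap() returns (list ref, map ref, format, default); only the
# first two are needed here.
my ($sc_list,     $sc_map)     = prop_invmap("sc");
my ($script_list, $script_map) = prop_invmap("Script");

my $via_short = lookup($sc_list,     $sc_map,     0x41);
my $via_full  = lookup($script_list, $script_map, 0x41);

print "sc     -> $via_short\n";
print "Script -> $via_full\n";
```

       If both lines print the same script name, "sc" was resolved to the
       Script property rather than to a General_Category value.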
4625
4626       The standard Unicode properties listed below are documented in
4627       <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
4628       documented in "prop_invmap()" in Unicode::UCD.  The other Perl
4629       extensions are documented in "Other Properties" in perlunicode.
4630
4631       The first column in the table is a name for the property; the second
4632       column is an alternative name, if any, plus possibly some annotations.
4633       The alternative name is the property's full name, unless that would
4634       simply repeat the first column, in which case the second column
4635       indicates the property's short name (if different).  The annotations
4636       are given only in the entry for the full name.  If a property is
4637       obsolete, etc., the entry will be flagged with the same characters used
4638       in the table in the section above, like D or S.
4639
4640          NAME                      INFO
4641
4642          Age
4643          AHex                    ASCII_Hex_Digit
4644          All                     (Perl extension).  All code points,
4645                                  including those above Unicode.  Same as
4646                                  qr/./s
4647          Alnum                   XPosixAlnum.  (Perl extension)
4648          Alpha                   Alphabetic
4649          Alphabetic              (Short: Alpha)
4650          Any                     (Perl extension).  All Unicode code
4651                                  points: [\x{0000}-\x{10FFFF}]
4652          ASCII                   Block=ASCII.  (Perl extension).
4653                                  [[:ASCII:]]
4654          ASCII_Hex_Digit         (Short: AHex)
4655          Assigned                (Perl extension).  All assigned code points
4656          Bc                      Bidi_Class
4657          Bidi_C                  Bidi_Control
4658          Bidi_Class              (Short: bc)
4659          Bidi_Control            (Short: Bidi_C)
4660          Bidi_M                  Bidi_Mirrored
4661          Bidi_Mirrored           (Short: Bidi_M)
4662          Bidi_Mirroring_Glyph    (Short: bmg)
4663          Bidi_Paired_Bracket     (Short: bpb)
4664          Bidi_Paired_Bracket_Type (Short: bpt)
4665          Blank                   XPosixBlank.  (Perl extension)
4666          Blk                     Block
4667          Block                   (Short: blk)
4668          Bmg                     Bidi_Mirroring_Glyph
4669          Bpb                     Bidi_Paired_Bracket
4670          Bpt                     Bidi_Paired_Bracket_Type
4671          Canonical_Combining_Class (Short: ccc)
4672          Case_Folding            (Short: cf)
4673          Case_Ignorable          (Short: CI)
4674          Cased
4675          Category                General_Category
4676          Ccc                     Canonical_Combining_Class
4677          CE                      Composition_Exclusion
4678          Cf                      Case_Folding; NOT 'cf' meaning
4679                                  'General_Category=Format'
4680          Changes_When_Casefolded (Short: CWCF)
4681          Changes_When_Casemapped (Short: CWCM)
4682          Changes_When_Lowercased (Short: CWL)
4683          Changes_When_NFKC_Casefolded (Short: CWKCF)
4684          Changes_When_Titlecased (Short: CWT)
4685          Changes_When_Uppercased (Short: CWU)
4686          CI                      Case_Ignorable
4687          Cntrl                   General_Category=XPosixCntrl.  (Perl
4688                                  extension)
4689          Comp_Ex                 Full_Composition_Exclusion
4690          Composition_Exclusion   (Short: CE)
4691          CWCF                    Changes_When_Casefolded
4692          CWCM                    Changes_When_Casemapped
4693          CWKCF                   Changes_When_NFKC_Casefolded
4694          CWL                     Changes_When_Lowercased
4695          CWT                     Changes_When_Titlecased
4696          CWU                     Changes_When_Uppercased
4697          Dash
4698          Decomposition_Mapping   (Short: dm)
4699          Decomposition_Type      (Short: dt)
4700          Default_Ignorable_Code_Point (Short: DI)
4701          Dep                     Deprecated
4702          Deprecated              (Short: Dep)
4703          DI                      Default_Ignorable_Code_Point
4704          Dia                     Diacritic
4705          Diacritic               (Short: Dia)
4706          Digit                   General_Category=XPosixDigit.  (Perl
4707                                  extension)
4708          Dm                      Decomposition_Mapping
4709          Dt                      Decomposition_Type
4710          Ea                      East_Asian_Width
4711          East_Asian_Width        (Short: ea)
4712          Ext                     Extender
4713          Extender                (Short: Ext)
4714          Full_Composition_Exclusion (Short: Comp_Ex)
4715          Gc                      General_Category
4716          GCB                     Grapheme_Cluster_Break
4717          General_Category        (Short: gc)
4718          Gr_Base                 Grapheme_Base
4719          Gr_Ext                  Grapheme_Extend
4720          Graph                   XPosixGraph.  (Perl extension)
4721          Grapheme_Base           (Short: Gr_Base)
4722          Grapheme_Cluster_Break  (Short: GCB)
4723          Grapheme_Extend         (Short: Gr_Ext)
4724          Hangul_Syllable_Type    (Short: hst)
4725          Hex                     Hex_Digit
4726          Hex_Digit               (Short: Hex)
4727          HorizSpace              XPosixBlank.  (Perl extension)
4728          Hst                     Hangul_Syllable_Type
4729        D Hyphen                  Supplanted by Line_Break property values;
4730                                  see www.unicode.org/reports/tr14
4731          ID_Continue             (Short: IDC)
4732          ID_Start                (Short: IDS)
4733          IDC                     ID_Continue
4734          Ideo                    Ideographic
4735          Ideographic             (Short: Ideo)
4736          IDS                     ID_Start
4737          IDS_Binary_Operator     (Short: IDSB)
4738          IDS_Trinary_Operator    (Short: IDST)
4739          IDSB                    IDS_Binary_Operator
4740          IDST                    IDS_Trinary_Operator
4741          In                      Present_In.  (Perl extension)
4742          Indic_Positional_Category (Short: InPC)
4743          Indic_Syllabic_Category (Short: InSC)
4744          InPC                    Indic_Positional_Category
4745          InSC                    Indic_Syllabic_Category
4746          Isc                     ISO_Comment; NOT 'isc' meaning
4747                                  'General_Category=Other'
4748          ISO_Comment             (Short: isc)
4749          Jg                      Joining_Group
4750          Join_C                  Join_Control
4751          Join_Control            (Short: Join_C)
4752          Joining_Group           (Short: jg)
4753          Joining_Type            (Short: jt)
4754          Jt                      Joining_Type
4755          Lb                      Line_Break
4756          Lc                      Lowercase_Mapping; NOT 'lc' meaning
4757                                  'General_Category=Cased_Letter'
4758          Line_Break              (Short: lb)
4759          LOE                     Logical_Order_Exception
4760          Logical_Order_Exception (Short: LOE)
4761          Lower                   Lowercase
4762          Lowercase               (Short: Lower)
4763          Lowercase_Mapping       (Short: lc)
4764          Math
4765          Na                      Name
4766          Na1                     Unicode_1_Name
4767          Name                    (Short: na)
4768          Name_Alias
4769          NChar                   Noncharacter_Code_Point
4770          NFC_QC                  NFC_Quick_Check
4771          NFC_Quick_Check         (Short: NFC_QC)
4772          NFD_QC                  NFD_Quick_Check
4773          NFD_Quick_Check         (Short: NFD_QC)
4774          NFKC_Casefold           (Short: NFKC_CF)
4775          NFKC_CF                 NFKC_Casefold
4776          NFKC_QC                 NFKC_Quick_Check
4777          NFKC_Quick_Check        (Short: NFKC_QC)
4778          NFKD_QC                 NFKD_Quick_Check
4779          NFKD_Quick_Check        (Short: NFKD_QC)
4780          Noncharacter_Code_Point (Short: NChar)
4781          Nt                      Numeric_Type
4782          Numeric_Type            (Short: nt)
4783          Numeric_Value           (Short: nv)
4784          Nv                      Numeric_Value
4785          Pat_Syn                 Pattern_Syntax
4786          Pat_WS                  Pattern_White_Space
4787          Pattern_Syntax          (Short: Pat_Syn)
4788          Pattern_White_Space     (Short: Pat_WS)
4789          PCM                     Prepended_Concatenation_Mark
4790          Perl_Decimal_Digit      (Perl extension)
4791          PerlSpace               PosixSpace.  (Perl extension)
4792          PerlWord                PosixWord.  (Perl extension)
4793          PosixAlnum              (Perl extension).  [A-Za-z0-9]
4794          PosixAlpha              (Perl extension).  [A-Za-z]
4795          PosixBlank              (Perl extension).  \t and ' '
4796          PosixCntrl              (Perl extension).  ASCII control
4797                                  characters: NUL, SOH, STX, ETX, EOT, ENQ,
4798                                  ACK, BEL, BS, HT, LF, VT, FF, CR, SO, SI,
4799                                  DLE, DC1, DC2, DC3, DC4, NAK, SYN, ETB,
4800                                  CAN, EOM, SUB, ESC, FS, GS, RS, US, and DEL
4801          PosixDigit              (Perl extension).  [0-9]
4802          PosixGraph              (Perl extension).  [-!"#$%&'()*+,./:;<=
4803                                  >?@[\\]^_`{|}~0-9A-Za-z]
4804          PosixLower              (Perl extension).  [a-z]
4805          PosixPrint              (Perl extension).  [- 0-9A-Za-
4806                                  z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
4807          PosixPunct              (Perl extension).  [-!"#$%&'()*+,./:;<=
4808                                  >?@[\\]^_`{|}~]
4809          PosixSpace              (Perl extension).  \t, \n, \cK, \f, \r,
4810                                  and ' '.  (\cK is vertical tab)
4811          PosixUpper              (Perl extension).  [A-Z]
4812          PosixWord               (Perl extension).  \w, restricted to ASCII
4813                                  = [A-Za-z0-9_]
4814          PosixXDigit             (Perl extension).  [0-9A-Fa-f]
4815          Prepended_Concatenation_Mark (Short: PCM)
4816          Present_In              (Short: In).  (Perl extension)
4817          Print                   XPosixPrint.  (Perl extension)
4818          Punct                   General_Category=Punct.  (Perl extension)
4819          QMark                   Quotation_Mark
4820          Quotation_Mark          (Short: QMark)
4821          Radical
4822          SB                      Sentence_Break
4823          Sc                      Script; NOT 'sc' meaning
4824                                  'General_Category=Currency_Symbol'
4825          Scf                     Simple_Case_Folding
4826          Script                  (Short: sc)
4827          Script_Extensions       (Short: scx)
4828          Scx                     Script_Extensions
4829          SD                      Soft_Dotted
4830          Sentence_Break          (Short: SB)
4831          Sentence_Terminal       (Short: STerm)
4832          Sfc                     Simple_Case_Folding
4833          Simple_Case_Folding     (Short: scf)
4834          Simple_Lowercase_Mapping (Short: slc)
4835          Simple_Titlecase_Mapping (Short: stc)
4836          Simple_Uppercase_Mapping (Short: suc)
4837          Slc                     Simple_Lowercase_Mapping
4838          Soft_Dotted             (Short: SD)
4839          Space                   White_Space
4840          SpacePerl               XPosixSpace.  (Perl extension)
4841          Stc                     Simple_Titlecase_Mapping
4842          STerm                   Sentence_Terminal
4843          Suc                     Simple_Uppercase_Mapping
4844          Tc                      Titlecase_Mapping
4845          Term                    Terminal_Punctuation
4846          Terminal_Punctuation    (Short: Term)
4847          Title                   Titlecase.  (Perl extension)
4848          Titlecase               (Short: Title).  (Perl extension).  (=
4849                                  \p{Gc=Lt})
4850          Titlecase_Mapping       (Short: tc)
4851          Uc                      Uppercase_Mapping
4852          UIdeo                   Unified_Ideograph
4853          Unicode                 Any.  (Perl extension)
4854          Unicode_1_Name          (Short: na1)
4855          Unified_Ideograph       (Short: UIdeo)
4856          Upper                   Uppercase
4857          Uppercase               (Short: Upper)
4858          Uppercase_Mapping       (Short: uc)
4859          Variation_Selector      (Short: VS)
4860          VertSpace               (Perl extension).  \v
4861          VS                      Variation_Selector
4862          WB                      Word_Break
4863          White_Space             (Short: WSpace)
4864          Word                    XPosixWord.  (Perl extension)
4865          Word_Break              (Short: WB)
4866          WSpace                  White_Space
4867          XDigit                  XPosixXDigit.  (Perl extension)
4868          XID_Continue            (Short: XIDC)
4869          XID_Start               (Short: XIDS)
4870          XIDC                    XID_Continue
4871          XIDS                    XID_Start
4872          XPerlSpace              XPosixSpace.  (Perl extension)
4873          XPosixAlnum             (Short: Alnum).  (Perl extension).
4874                                  Alphabetic and (decimal) Numeric
4875          XPosixAlpha             (Perl extension)
4876          XPosixBlank             (Short: Blank).  (Perl extension).  \h,
4877                                  Horizontal white space
4878          XPosixCntrl             General_Category=XPosixCntrl  (Short:
4879                                  Cntrl).  (Perl extension).  Control
4880                                  characters
4881          XPosixDigit             General_Category=XPosixDigit  (Short:
4882                                  Digit).  (Perl extension).  [0-9] + all
4883                                  other decimal digits
4884          XPosixGraph             (Short: Graph).  (Perl extension).
4885                                  Characters that are graphical
4886          XPosixLower             (Perl extension)
4887          XPosixPrint             (Short: Print).  (Perl extension).
4888                                  Characters that are graphical plus space
4889                                  characters (but no controls)
4890          XPosixPunct             (Perl extension).  \p{Punct} + ASCII-range
4891                                  \p{Symbol}
4892          XPosixSpace             (Perl extension).  \s including beyond
4893                                  ASCII and vertical tab
4894          XPosixUpper             (Perl extension)
4895          XPosixWord              (Short: Word).  (Perl extension).  \w,
4896                                  including beyond ASCII; = \p{Alnum} + \pM
4897                                  + \p{Pc} + \p{Join_Control}
4898          XPosixXDigit            (Short: XDigit).  (Perl extension)
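
       The alias pairs in the table above can also be looked up at run time.
       The following is a minimal sketch, assuming the "prop_aliases()"
       function exported by Unicode::UCD; the variable names are illustrative
       only.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_aliases);

# In list context prop_aliases() returns the short name first, then the
# full name, then any remaining synonyms.
my ($short_name, $full_name, @other_names) = prop_aliases("space");

print "short: $short_name\n";
print "full:  $full_name\n";
print "other: @other_names\n";

# Any spelling listed in the table can be used as the lookup key.
my ($hex_short, $hex_full) = prop_aliases("Hex_Digit");
print "$hex_full is abbreviated $hex_short\n";
```

       Lookups are loose, so "space", "Space", and "WSpace" should all resolve
       to the same property.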
4899

Properties accessible through other means

4901       Certain properties are also accessible via core function calls.  These
4902       are:
4903
4904        Lowercase_Mapping          lc() and lcfirst()
4905        Titlecase_Mapping          ucfirst()
4906        Uppercase_Mapping          uc()
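
       As a rough illustration of the three mappings listed above, the sketch
       below applies the core functions to a few sample strings.  The sample
       strings and variable names are made up for this example, and the
       "unicode_strings" feature is enabled so that code points 128 to 255 are
       treated with Unicode rules.

```perl
use strict;
use warnings;
use feature 'unicode_strings';

my $lower = lc("\x{C9}COLE");         # Lowercase_Mapping of every character
my $upper = uc("stra\x{DF}e");        # Uppercase_Mapping; U+00DF expands to "SS"
my $title = ucfirst("\x{1C6}ungla");  # Titlecase_Mapping of the first character
my $first = lcfirst("\x{C9}T\x{C9}"); # Lowercase_Mapping of the first character

# Print code points rather than raw characters to avoid encoding issues.
for my $pair ([lc => $lower], [uc => $upper],
              [ucfirst => $title], [lcfirst => $first]) {
    my ($func, $string) = @$pair;
    printf "%-8s %s\n", $func,
        join " ", map { sprintf "U+%04X", ord } split //, $string;
}
```

       Note that U+01C6 has distinct titlecase (U+01C5) and uppercase (U+01C4)
       forms, which is why "ucfirst()" is listed against Titlecase_Mapping
       rather than Uppercase_Mapping.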
4907
4908       Also, Case_Folding is accessible through the "/i" modifier in regular
4909       expressions, the "\F" double-quoted string escape, and the "fc" operator.
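
       The three routes to Case_Folding can be compared side by side.  This is
       a small sketch with invented sample strings; it assumes Perl 5.16 or
       later, where the "fc" feature is available.

```perl
use strict;
use warnings;
use feature qw(fc unicode_strings);

my $mixed = "Stra\x{DF}e";
my $plain = "STRASSE";

# fc() applies full case folding, so U+00DF folds to "ss".
my $folded_by_fc = fc($mixed);

# \F ... \E performs the same folding inside a double-quoted string.
my $folded_by_escape = "\F$mixed\E";

# /i compares the pattern and the target under case folding.
my $matches = "\x{C9}COLE" =~ /\A\x{E9}cole\z/i ? 1 : 0;

print "fc and \\F agree\n"       if $folded_by_fc eq $folded_by_escape;
print "folded forms are equal\n" if $folded_by_fc eq fc($plain);
print "/i matched\n"             if $matches;
```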
4910
4911       And, the Name and Name_Alias properties are accessible through the
4912       "\N{}" interpolation in double-quoted strings and regular expressions;
4913       and the functions "charnames::viacode()", "charnames::vianame()", and
4914       "charnames::string_vianame()" (which require a "use charnames ();" to
4915       be specified).
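
       A brief sketch of these lookups follows.  The character chosen is
       arbitrary, and the variable names are illustrative.

```perl
use strict;
use warnings;
use charnames ();   # load the functions without importing anything

# Code point to name.
my $name = charnames::viacode(0x263A);

# Name to code point (a number).
my $code = charnames::vianame("WHITE SMILING FACE");

# Name to a string containing the character.
my $string = charnames::string_vianame("WHITE SMILING FACE");

# The same character written with \N{} in a double-quoted string.
my $literal = "\N{WHITE SMILING FACE}";

printf "U+%04X is named %s\n", $code, $name;
print "string_vianame() and \\N{} agree\n" if $string eq $literal;

# Name_Alias values are accepted too, for example for control characters.
printf "LINE FEED is U+%04X\n", charnames::vianame("LINE FEED");
```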
4916
4917       Finally, most properties related to decomposition are accessible via
4918       Unicode::Normalize.
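
       For instance, the canonical and compatibility decompositions of a
       character can be obtained with the "NFD()" and "NFKD()" functions of
       Unicode::Normalize.  The sample characters below are chosen only for
       illustration.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFC NFD NFKD);

# Show a string as a list of code points.
sub code_points {
    my ($string) = @_;
    return join " ", map { sprintf "U+%04X", ord } split //, $string;
}

my $composed   = "\x{E9}";          # LATIN SMALL LETTER E WITH ACUTE
my $decomposed = NFD($composed);    # canonical decomposition
my $recomposed = NFC($decomposed);  # and back again

my $ligature = "\x{FB01}";          # LATIN SMALL LIGATURE FI
my $compat   = NFKD($ligature);     # compatibility decomposition

print "NFD:  ", code_points($decomposed), "\n";
print "NFC:  ", code_points($recomposed), "\n";
print "NFKD: ", code_points($compat), "\n";
```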
4919

Unicode character properties that are NOT accepted by Perl

4921       Perl will generate an error for a few character properties in Unicode
4922       when used in a regular expression.  The non-Unihan ones are listed
4923       below, with the reasons they are not accepted, perhaps with work-
4924       arounds.  The short names for the properties are listed enclosed in
4925       (parentheses).  As described after the list, an installation can change
4926       the defaults and choose to accept any of these.  The list is machine
4927       generated based on the choices made for the installation that generated
4928       this document.
4929
4930       Expands_On_NFC (XO_NFC)
4931       Expands_On_NFD (XO_NFD)
4932       Expands_On_NFKC (XO_NFKC)
4933       Expands_On_NFKD (XO_NFKD)
4934           Deprecated by Unicode.  These are characters that expand to more
4935           than one character in the specified normalization form, but whether
4936           they actually take up more bytes or not depends on the encoding
4937           being used.  For example, a UTF-8 encoded character may expand to a
4938           different number of bytes than a UTF-32 encoded character.
4939
4940       Grapheme_Link (Gr_Link)
4941           Deprecated by Unicode:  Duplicates ccc=vr
4942           (Canonical_Combining_Class=Virama)
4943
4944       Jamo_Short_Name (JSN)
4945       Other_Alphabetic (OAlpha)
4946       Other_Default_Ignorable_Code_Point (ODI)
4947       Other_Grapheme_Extend (OGr_Ext)
4948       Other_ID_Continue (OIDC)
4949       Other_ID_Start (OIDS)
4950       Other_Lowercase (OLower)
4951       Other_Math (OMath)
4952       Other_Uppercase (OUpper)
4953           Used by Unicode internally for generating other properties and not
4954           intended to be used stand-alone
4955
4956       Script=Katakana_Or_Hiragana (sc=Hrkt)
4957           Obsolete.  All code points previously matched by this have been
4958           moved to "Script=Common".  Consider instead using
4959           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
4960           both)
4961
4962       Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
4963           All code points that would be matched by this are matched by either
4964           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
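
           A minimal sketch of the suggested work-around, combining the two
           Script_Extensions values in one bracketed character class (the
           variable name $kana is invented for this example):

```perl
use strict;
use warnings;

# Matches any code point whose Script_Extensions include Katakana or Hiragana.
my $kana = qr/[\p{Script_Extensions=Katakana}\p{Script_Extensions=Hiragana}]/;

my %samples = (
    "KATAKANA LETTER KA" => "\x{30AB}",
    "HIRAGANA LETTER A"  => "\x{3042}",
    "LATIN CAPITAL A"    => "A",
);

for my $label (sort keys %samples) {
    my $verdict = $samples{$label} =~ $kana ? "matches" : "does not match";
    print "$label $verdict\n";
}
```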
4965
4966       An installation can choose to allow any of these to be matched by
4967       downloading the Unicode database from <http://www.unicode.org/Public/>
4968       to $Config{privlib}/unicore/ in the Perl source tree, changing the
4969       controlling lists contained in the program
4970       $Config{privlib}/unicore/mktables and then re-compiling and installing.
4971       (%Config is available from the Config module).
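
       The paths mentioned above can be resolved for a given installation with
       a short script such as the following sketch.  Whether the files are
       actually present depends on how Perl was packaged, so the script only
       reports what it finds.

```perl
use strict;
use warnings;
use Config;

# Directory that holds Perl's Unicode tables for this installation.
my $unicore = "$Config{privlib}/unicore";
print "unicore directory: $unicore\n";

# Report whether the files named in the text exist here.
for my $file (qw(mktables README.perl)) {
    my $path = "$unicore/$file";
    printf "%-12s %s\n", $file, -e $path ? "found" : "not installed here";
}
```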
4972
4973       Also, perl can be recompiled to operate on an earlier version of the
4974       Unicode standard.  Further information is at
4975       $Config{privlib}/unicore/README.perl.
4976

Other information in the Unicode data base

4978       The Unicode data base is delivered in two different formats.  The XML
4979       version is valid for more modern Unicode releases.  The other version
4980       is a collection of files.  The two are intended to give equivalent
4981       information.  Perl uses the older form; this allows you to recompile
4982       Perl to use early Unicode releases.
4983
4984       The only non-character property that Perl currently supports is Named
4985       Sequences, in which a sequence of code points is given a name and
4986       generally treated as a single entity.  (Perl supports these via the
4987       "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
4988       in charnames, and "namedseq()" in Unicode::UCD.)
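
       As an illustration, the sketch below looks up one named sequence through
       both modules.  The sequence name is taken from Unicode's
       NamedSequences.txt; the variable names are illustrative.

```perl
use strict;
use warnings;
use charnames ();
use Unicode::UCD qw(namedseq);

my $name = "KATAKANA LETTER AINU P";

# In scalar context namedseq() returns the sequence as a string.
my $from_ucd = namedseq($name);

# string_vianame() accepts named sequences as well as single characters.
my $from_charnames = charnames::string_vianame($name);

printf "%s is %d code points: %s\n",
    $name,
    length $from_ucd,
    join " ", map { sprintf "U+%04X", ord } split //, $from_ucd;

print "both modules agree\n" if $from_ucd eq $from_charnames;
```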
4989
4990       Below is a list of the files in the Unicode data base that Perl doesn't
4991       currently use, along with very brief descriptions of their purposes.
4992       Some of the names of the files have been shortened from those that
4993       Unicode uses, in order to allow them to be distinguishable from
4994       similarly named files on file systems for which only the first 8
4995       characters of a name are significant.
4996
4997       auxiliary/GraphemeBreakTest.html
4998       auxiliary/LineBreakTest.html
4999       auxiliary/SentenceBreakTest.html
5000       auxiliary/WordBreakTest.html
5001           Documentation of validation Tests
5002
5003       BidiCharacterTest.txt
5004       BidiTest.txt
5005       NormTest.txt
5006           Validation Tests
5007
5008       CJKRadicals.txt
5009           Maps the kRSUnicode property values to corresponding code points
5010
5011       EmojiSources.txt
5012           Maps certain Unicode code points to their legacy Japanese cell-
5013           phone values
5014
5015       Index.txt
5016           Alphabetical index of Unicode characters
5017
5018       NamedSqProv.txt
5019           Named sequences proposed for inclusion in a later version of the
5020           Unicode Standard; if you need them now, you can append this file to
5021           NamedSequences.txt and recompile perl
5022
5023       NamesList.html
5024           Describes the format and contents of NamesList.txt
5025
5026       NamesList.txt
5027           Annotated list of characters
5028
5029       NormalizationCorrections.txt
5030           Documentation of corrections already incorporated into the Unicode
5031           data base
5032
5033       ReadMe.txt
5034           Documentation
5035
5036       StandardizedVariants.html
5037           Obsoleted as of Unicode 9.0, but previously provided a visual
5038           display of the standard variant sequences derived from
5039           StandardizedVariants.txt.
5040
5041       StandardizedVariants.txt
5042           Certain glyph variations for character display are standardized.
5043           This lists the non-Unihan ones; the Unihan ones are also not used
5044           by Perl, and are in a separate Unicode data base
5045           <http://www.unicode.org/ivd>
5046
5047       TangutSources.txt
5048           Specifies source mappings for Tangut ideographs and components.
5049           This data file also includes informative radical-stroke values that
5050           are used internally by Unicode
5051
5052       USourceData.txt
5053           Documentation of status and cross reference of proposals for
5054           encoding by Unicode of Unihan characters
5055
5056       USourceGlyphs.pdf
5057           Pictures of the characters in USourceData.txt
5058

SEE ALSO

5060       <http://www.unicode.org/reports/tr44/>
5061
5062       perlrecharclass
5063
5064       perlunicode
5065
5066
5067
5068perl v5.26.3                      2019-05-11                   PERLUNIPROPS(1)