1PERLUNIPROPS(1)        Perl Programmers Reference Guide        PERLUNIPROPS(1)
2
3
4

NAME

6       perluniprops - Index of Unicode Version 10.0.0 character properties in
7       Perl
8

DESCRIPTION

10       This document provides information about the portion of the Unicode
11       database that deals with character properties, that is the portion that
12       is defined on single code points.  ("Other information in the Unicode
13       data base" below briefly mentions other data that Unicode provides.)
14
15       Perl can provide access to all non-provisional Unicode character
16       properties, though not all are enabled by default.  The omitted ones
17       are the Unihan properties (accessible via the CPAN module
18       Unicode::Unihan) and certain deprecated or Unicode-internal properties.
19       (An installation may choose to recompile Perl's tables to change this.
20       See "Unicode character properties that are NOT accepted by Perl".)
21
22       For most purposes, access to Unicode properties from the Perl core is
23       through regular expression matches, as described in the next section.
24       For some special purposes, and to access the properties that are not
25       suitable for regular expression matching, all the Unicode character
26       properties that Perl handles are accessible via the standard
27       Unicode::UCD module, as described in the section "Properties accessible
28       through Unicode::UCD".
29
30       Perl also provides some additional extensions and short-cut synonyms
31       for Unicode properties.
32
33       This document merely lists all available properties and does not
34       attempt to explain what each property really means.  There is a brief
35       description of each Perl extension; see "Other Properties" in
36       perlunicode for more information on these.  There is some detail about
37       Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
38       to find out about the intricacies of the official Unicode properties,
39       refer to the Unicode standard.  A good starting place is
40       <http://www.unicode.org/reports/tr44/>.
41
42       Note that you can define your own properties; see "User-Defined
43       Character Properties" in perlunicode.
44

Properties accessible through "\p{}" and "\P{}"

46       The Perl regular expression "\p{}" and "\P{}" constructs give access to
47       most of the Unicode character properties.  The table below shows all
48       these constructs, both single and compound forms.
49
50       Compound forms consist of two components, separated by an equals sign
51       or a colon.  The first component is the property name, and the second
52       component is the particular value of the property to match against, for
53       example, "\p{Script_Extensions: Greek}" and
54       "\p{Script_Extensions=Greek}" both mean to match characters whose
55       Script_Extensions property value is Greek.  ("Script_Extensions" is an
56       improved version of the "Script" property.)
57
58       Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
59       their equivalent compound forms.  The table shows these equivalences.
60       (In our example, "\p{Greek}" is a just a shortcut for
61       "\p{Script_Extensions=Greek}").  There are also a few Perl-defined
62       single forms that are not shortcuts for a compound form.  One such is
63       "\p{Word}".  These are also listed in the table.
64
65       In parsing these constructs, Perl always ignores Upper/lower case
66       differences everywhere within the {braces}.  Thus "\p{Greek}" means the
67       same thing as "\p{greek}".  But note that changing the case of the "p"
68       or "P" before the left brace completely changes the meaning of the
69       construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
70       Casing in this document is for improved legibility.
71
72       Also, white space, hyphens, and underscores are normally ignored
73       everywhere between the {braces}, and hence can be freely added or
74       removed even if the "/x" modifier hasn't been specified on the regular
75       expression.  But in the table below a 'T' at the beginning of an entry
76       means that tighter (stricter) rules are used for that entry:
77
78           Single form ("\p{name}") tighter rules:
79               White space, hyphens, and underscores ARE significant except
80               for:
81
82               ·   white space adjacent to a non-word character
83
84               ·   underscores separating digits in numbers
85
86               That means, for example, that you can freely add or remove
87               white space adjacent to (but within) the braces without
88               affecting the meaning.
89
90           Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
91               The tighter rules given above for the single form apply to
92               everything to the right of the colon or equals; the looser
93               rules still apply to everything to the left.
94
95               That means, for example, that you can freely add or remove
96               white space adjacent to (but within) the braces and the colon
97               or equal sign.
98
99       Some properties are considered obsolete by Unicode, but still
100       available.  There are several varieties of obsolescence:
101
102           Stabilized
103               A property may be stabilized.  Such a determination does not
104               indicate that the property should or should not be used;
105               instead it is a declaration that the property will not be
106               maintained nor extended for newly encoded characters.  Such
107               properties are marked with an 'S' in the table.
108
109           Deprecated
110               A property may be deprecated, perhaps because its original
111               intent has been replaced by another property, or because its
112               specification was somehow defective.  This means that its use
113               is strongly discouraged, so much so that a warning will be
114               issued if used, unless the regular expression is in the scope
115               of a "no warnings 'deprecated'" statement.  A 'D' flags each
116               such entry in the table, and the entry there for the longest,
117               most descriptive version of the property will give the reason
118               it is deprecated, and perhaps advice.  Perl may issue such a
119               warning, even for properties that aren't officially deprecated
120               by Unicode, when there used to be characters or code points
121               that were matched by them, but no longer.  This is to warn you
122               that your program may not work like it did on earlier Unicode
123               releases.
124
125               A deprecated property may be made unavailable in a future Perl
126               version, so it is best to move away from them.
127
128               A deprecated property may also be stabilized, but this fact is
129               not shown.
130
131           Obsolete
132               Properties marked with an 'O' in the table are considered
133               (plain) obsolete.  Generally this designation is given to
134               properties that Unicode once used for internal purposes (but
135               not any longer).
136
137           Discouraged
138               This is not actually a Unicode-specified obsolescence, but
139               applies to certain Perl extensions that are present for
140               backwards compatibility, but are discouraged from being used.
141               These are not obsolete, but their meanings are not stable.
142               Future Unicode versions could force any of these extensions to
143               be removed without warning, replaced by another property with
144               the same name that means something different.  An 'X' flags
145               each such entry in the table.  Use the equivalent shown
146               instead.
147
148               In particular, matches in the Block property have single forms
149               defined by Perl that begin with "In_", ""Is_", or even with no
150               prefix at all,  Like all DISCOURAGED forms, these are not
151               stable.  For example, "\p{Block=Deseret}" can currently be
152               written as "\p{In_Deseret}", "\p{Is_Deseret}", or
153               "\p{Deseret}".  But, a new Unicode version may come along that
154               would force Perl to change the meaning of one or more of these,
155               and your program would no longer be correct.  Currently there
156               are no such conflicts with the form that begins "In_", but
157               there are many with the other two shortcuts, and Unicode
158               continues to define new properties that begin with "In", so
159               it's quite possible that a conflict will occur in the future.
160               The compound form is guaranteed to not become obsolete, and its
161               meaning is clearer anyway.  See "Blocks" in perlunicode for
162               more information about this.
163
164       The table below has two columns.  The left column contains the "\p{}"
165       constructs to look up, possibly preceded by the flags mentioned above;
166       and the right column contains information about them, like a
167       description, or synonyms.  The table shows both the single and compound
168       forms for each property that has them.  If the left column is a short
169       name for a property, the right column will give its longer, more
170       descriptive name; and if the left column is the longest name, the right
171       column will show any equivalent shortest name, in both single and
172       compound forms if applicable.
173
174       If braces are not needed to specify a property (e.g., "\pL"), the left
175       column contains both forms, with and without braces.
176
177       The right column will also caution you if a property means something
178       different than what might normally be expected.
179
180       All single forms are Perl extensions; a few compound forms are as well,
181       and are noted as such.
182
183       Numbers in (parentheses) indicate the total number of Unicode code
184       points matched by the property.  For the entries that give the longest,
185       most descriptive version of the property, the count is followed by a
186       list of some of the code points matched by it.  The list includes all
187       the matched characters in the 0-255 range, enclosed in the familiar
188       [brackets] the same as a regular expression bracketed character class.
189       Following that, the next few higher matching ranges are also given.  To
190       avoid visual ambiguity, the SPACE character is represented as "\x20".
191
192       For emphasis, those properties that match no code points at all are
193       listed as well in a separate section following the table.
194
195       Most properties match the same code points regardless of whether "/i"
196       case-insensitive matching is specified or not.  But a few properties
197       are affected.  These are shown with the notation "(/i= other_property)"
198       in the second column.  Under case-insensitive matching they match the
199       same code pode points as the property other_property.
200
201       There is no description given for most non-Perl defined properties (See
202       <http://www.unicode.org/reports/tr44/> for that).
203
204       For compactness, '*' is used as a wildcard instead of showing all
205       possible combinations.  For example, entries like:
206
207        \p{Gc: *}                                  \p{General_Category: *}
208
209       mean that 'Gc' is a synonym for 'General_Category', and anything that
210       is valid for the latter is also valid for the former.  Similarly,
211
212        \p{Is_*}                                   \p{*}
213
214       means that if and only if, for example, "\p{Foo}" exists, then
215       "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
216       thing.  And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
217       and "\p{IsFoo=Bar}".  "*" here is restricted to something not beginning
218       with an underscore.
219
220       Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
221       'Y'.  And 'No', 'F', and 'False' are all synonyms for 'N'.  The table
222       shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
223       for the other possibilities.  Note that not all properties which have
224       values 'Yes' and 'No' are binary, and they have all their values
225       spelled out without using this wild card, and a "NOT" clause in their
226       description that highlights their not being binary.  These also require
227       the compound form to match them, whereas true binary properties have
228       both single and compound forms available.
229
230       Note that all non-essential underscores are removed in the display of
231       the short names below.
232
233       Legend summary:
234
235       * is a wild-card
236       (\d+) in the info column gives the number of Unicode code points
237       matched by this property.
238       D means this is deprecated.
239       O means this is obsolete.
240       S means this is stabilized.
241       T means tighter (stricter) name matching applies.
242       X means use of this form is discouraged, and may not be stable.
243
244              NAME                           INFO
245
246          \p{Adlam}               \p{Script_Extensions=Adlam} (Short:
247                                    \p{Adlm}; NOT \p{Block=Adlam}) (88)
248          \p{Adlm}                \p{Adlam} (= \p{Script_Extensions=Adlam})
249                                    (NOT \p{Block=Adlam}) (88)
250        X \p{Aegean_Numbers}      \p{Block=Aegean_Numbers} (64)
251        T \p{Age: 1.1}            \p{Age=V1_1} (33_979)
252          \p{Age: V1_1}           Code point's usage introduced in version
253                                    1.1 (33_979: U+0000..01F5, U+01FA..0217,
254                                    U+0250..02A8, U+02B0..02DE,
255                                    U+02E0..02E9, U+0300..0345 ...)
256        T \p{Age: 2.0}            \p{Age=V2_0} (144_521)
257          \p{Age: V2_0}           Code point's usage was introduced in
258                                    version 2.0; See also Property
259                                    'Present_In' (144_521: U+0591..05A1,
260                                    U+05A3..05AF, U+05C4, U+0F00..0F47,
261                                    U+0F49..0F69, U+0F71..0F8B ...)
262        T \p{Age: 2.1}            \p{Age=V2_1} (2)
263          \p{Age: V2_1}           Code point's usage was introduced in
264                                    version 2.1; See also Property
265                                    'Present_In' (2: U+20AC, U+FFFC)
266        T \p{Age: 3.0}            \p{Age=V3_0} (10_307)
267          \p{Age: V3_0}           Code point's usage was introduced in
268                                    version 3.0; See also Property
269                                    'Present_In' (10_307: U+01F6..01F9,
270                                    U+0218..021F, U+0222..0233,
271                                    U+02A9..02AD, U+02DF, U+02EA..02EE ...)
272        T \p{Age: 3.1}            \p{Age=V3_1} (44_978)
273          \p{Age: V3_1}           Code point's usage was introduced in
274                                    version 3.1; See also Property
275                                    'Present_In' (44_978: U+03F4..03F5,
276                                    U+FDD0..FDEF, U+10300..1031E,
277                                    U+10320..10323, U+10330..1034A,
278                                    U+10400..10425 ...)
279        T \p{Age: 3.2}            \p{Age=V3_2} (1016)
280          \p{Age: V3_2}           Code point's usage was introduced in
281                                    version 3.2; See also Property
282                                    'Present_In' (1016: U+0220, U+034F,
283                                    U+0363..036F, U+03D8..03D9, U+03F6,
284                                    U+048A..048B ...)
285        T \p{Age: 4.0}            \p{Age=V4_0} (1226)
286          \p{Age: V4_0}           Code point's usage was introduced in
287                                    version 4.0; See also Property
288                                    'Present_In' (1226: U+0221,
289                                    U+0234..0236, U+02AE..02AF,
290                                    U+02EF..02FF, U+0350..0357, U+035D..035F
291                                    ...)
292        T \p{Age: 4.1}            \p{Age=V4_1} (1273)
293          \p{Age: V4_1}           Code point's usage was introduced in
294                                    version 4.1; See also Property
295                                    'Present_In' (1273: U+0237..0241,
296                                    U+0358..035C, U+03FC..03FF,
297                                    U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
298        T \p{Age: 5.0}            \p{Age=V5_0} (1369)
299          \p{Age: V5_0}           Code point's usage was introduced in
300                                    version 5.0; See also Property
301                                    'Present_In' (1369: U+0242..024F,
302                                    U+037B..037D, U+04CF, U+04FA..04FF,
303                                    U+0510..0513, U+05BA ...)
304        T \p{Age: 5.1}            \p{Age=V5_1} (1624)
305          \p{Age: V5_1}           Code point's usage was introduced in
306                                    version 5.1; See also Property
307                                    'Present_In' (1624: U+0370..0373,
308                                    U+0376..0377, U+03CF, U+0487,
309                                    U+0514..0523, U+0606..060A ...)
310        T \p{Age: 5.2}            \p{Age=V5_2} (6648)
311          \p{Age: V5_2}           Code point's usage was introduced in
312                                    version 5.2; See also Property
313                                    'Present_In' (6648: U+0524..0525,
314                                    U+0800..082D, U+0830..083E, U+0900,
315                                    U+094E, U+0955 ...)
316        T \p{Age: 6.0}            \p{Age=V6_0} (2088)
317          \p{Age: V6_0}           Code point's usage was introduced in
318                                    version 6.0; See also Property
319                                    'Present_In' (2088: U+0526..0527,
320                                    U+0620, U+065F, U+0840..085B, U+085E,
321                                    U+093A..093B ...)
322        T \p{Age: 6.1}            \p{Age=V6_1} (732)
323          \p{Age: V6_1}           Code point's usage was introduced in
324                                    version 6.1; See also Property
325                                    'Present_In' (732: U+058F, U+0604,
326                                    U+08A0, U+08A2..08AC, U+08E4..08FE,
327                                    U+0AF0 ...)
328        T \p{Age: 6.2}            \p{Age=V6_2} (1)
329          \p{Age: V6_2}           Code point's usage was introduced in
330                                    version 6.2; See also Property
331                                    'Present_In' (1: U+20BA)
332        T \p{Age: 6.3}            \p{Age=V6_3} (5)
333          \p{Age: V6_3}           Code point's usage was introduced in
334                                    version 6.3; See also Property
335                                    'Present_In' (5: U+061C, U+2066..2069)
336        T \p{Age: 7.0}            \p{Age=V7_0} (2834)
337          \p{Age: V7_0}           Code point's usage was introduced in
338                                    version 7.0; See also Property
339                                    'Present_In' (2834: U+037F,
340                                    U+0528..052F, U+058D..058E, U+0605,
341                                    U+08A1, U+08AD..08B2 ...)
342        T \p{Age: 8.0}            \p{Age=V8_0} (7716)
343          \p{Age: V8_0}           Code point's usage was introduced in
344                                    version 8.0; See also Property
345                                    'Present_In' (7716: U+08B3..08B4,
346                                    U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
347                                    ...)
348        T \p{Age: 9.0}            \p{Age=V9_0} (7500)
349          \p{Age: V9_0}           Code point's usage was introduced in
350                                    version 9.0; See also Property
351                                    'Present_In' (7500: U+08B6..08BD,
352                                    U+08D4..08E2, U+0C80, U+0D4F,
353                                    U+0D54..0D56, U+0D58..0D5E ...)
354        T \p{Age: 10.0}           \p{Age=V10_0} (8518)
355          \p{Age: V10_0}          Code point's usage was introduced in
356                                    version 10.0; See also Property
357                                    'Present_In' (8518: U+0860..086A,
358                                    U+09FC..09FD, U+0AFA..0AFF, U+0D00,
359                                    U+0D3B..0D3C, U+1CF7 ...)
360          \p{Age: NA}             \p{Age=Unassigned} (837_775 plus all
361                                    above-Unicode code points)
362          \p{Age: Unassigned}     Code point's usage has not been assigned
363                                    in any Unicode release thus far. (Short:
364                                    \p{Age=NA}) (837_775 plus all above-
365                                    Unicode code points: U+0378..0379,
366                                    U+0380..0383, U+038B, U+038D, U+03A2,
367                                    U+0530 ...)
368          \p{Aghb}                \p{Caucasian_Albanian} (=
369                                    \p{Script_Extensions=
370                                    Caucasian_Albanian}) (NOT \p{Block=
371                                    Caucasian_Albanian}) (53)
372          \p{AHex}                \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
373                                    (22)
374          \p{AHex: *}             \p{ASCII_Hex_Digit: *}
375          \p{Ahom}                \p{Script_Extensions=Ahom} (NOT \p{Block=
376                                    Ahom}) (57)
377        X \p{Alchemical}          \p{Alchemical_Symbols} (= \p{Block=
378                                    Alchemical_Symbols}) (128)
379        X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short:
380                                    \p{InAlchemical}) (128)
381          \p{All}                 All code points, including those above
382                                    Unicode.  Same as qr/./s (1_114_112 plus
383                                    all above-Unicode code points:
384                                    U+0000..infinity)
385          \p{Alnum}               \p{XPosixAlnum} (127_219)
386          \p{Alpha}               \p{XPosixAlpha} (= \p{Alphabetic=Y})
387                                    (126_629)
388          \p{Alpha: *}            \p{Alphabetic: *}
389          \p{Alphabetic}          \p{XPosixAlpha} (= \p{Alphabetic=Y})
390                                    (126_629)
391          \p{Alphabetic: N*}      (Short: \p{Alpha=N}, \P{Alpha}) (987_483
392                                    plus all above-Unicode code points:
393                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
394                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
395                                    \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
396                                    U+02C2..02C5, U+02D2..02DF,
397                                    U+02E5..02EB, U+02ED, U+02EF..0344 ...)
398          \p{Alphabetic: Y*}      (Short: \p{Alpha=Y}, \p{Alpha}) (126_629:
399                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
400                                    \xf6\xf8-\xff], U+0100..02C1,
401                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
402                                    U+02EE ...)
403        X \p{Alphabetic_PF}       \p{Alphabetic_Presentation_Forms} (=
404                                    \p{Block=Alphabetic_Presentation_Forms})
405                                    (80)
406        X \p{Alphabetic_Presentation_Forms} \p{Block=
407                                    Alphabetic_Presentation_Forms} (Short:
408                                    \p{InAlphabeticPF}) (80)
409          \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
410                                    Anatolian_Hieroglyphs} (Short: \p{Hluw};
411                                    NOT \p{Block=Anatolian_Hieroglyphs})
412                                    (583)
413        X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
414                                    \p{Block=
415                                    Ancient_Greek_Musical_Notation}) (80)
416        X \p{Ancient_Greek_Musical_Notation} \p{Block=
417                                    Ancient_Greek_Musical_Notation} (Short:
418                                    \p{InAncientGreekMusic}) (80)
419        X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
420        X \p{Ancient_Symbols}     \p{Block=Ancient_Symbols} (64)
421          \p{Any}                 All Unicode code points (1_114_112:
422                                    U+0000..10FFFF)
423          \p{Arab}                \p{Arabic} (= \p{Script_Extensions=
424                                    Arabic}) (NOT \p{Block=Arabic}) (1324)
425          \p{Arabic}              \p{Script_Extensions=Arabic} (Short:
426                                    \p{Arab}; NOT \p{Block=Arabic}) (1324)
427        X \p{Arabic_Ext_A}        \p{Arabic_Extended_A} (= \p{Block=
428                                    Arabic_Extended_A}) (96)
429        X \p{Arabic_Extended_A}   \p{Block=Arabic_Extended_A} (Short:
430                                    \p{InArabicExtA}) (96)
431        X \p{Arabic_Math}         \p{Arabic_Mathematical_Alphabetic_Symbols}
432                                    (= \p{Block=
433                                    Arabic_Mathematical_Alphabetic_Symbols})
434                                    (256)
435        X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
436                                    Arabic_Mathematical_Alphabetic_Symbols}
437                                    (Short: \p{InArabicMath}) (256)
438        X \p{Arabic_PF_A}         \p{Arabic_Presentation_Forms_A} (=
439                                    \p{Block=Arabic_Presentation_Forms_A})
440                                    (688)
441        X \p{Arabic_PF_B}         \p{Arabic_Presentation_Forms_B} (=
442                                    \p{Block=Arabic_Presentation_Forms_B})
443                                    (144)
444        X \p{Arabic_Presentation_Forms_A} \p{Block=
445                                    Arabic_Presentation_Forms_A} (Short:
446                                    \p{InArabicPFA}) (688)
447        X \p{Arabic_Presentation_Forms_B} \p{Block=
448                                    Arabic_Presentation_Forms_B} (Short:
449                                    \p{InArabicPFB}) (144)
450        X \p{Arabic_Sup}          \p{Arabic_Supplement} (= \p{Block=
451                                    Arabic_Supplement}) (48)
452        X \p{Arabic_Supplement}   \p{Block=Arabic_Supplement} (Short:
453                                    \p{InArabicSup}) (48)
454          \p{Armenian}            \p{Script_Extensions=Armenian} (Short:
455                                    \p{Armn}; NOT \p{Block=Armenian}) (94)
456          \p{Armi}                \p{Imperial_Aramaic} (=
457                                    \p{Script_Extensions=Imperial_Aramaic})
458                                    (NOT \p{Block=Imperial_Aramaic}) (31)
459          \p{Armn}                \p{Armenian} (= \p{Script_Extensions=
460                                    Armenian}) (NOT \p{Block=Armenian}) (94)
461        X \p{Arrows}              \p{Block=Arrows} (112)
462          \p{ASCII}               \p{Block=Basic_Latin} (128)
463          \p{ASCII_Hex_Digit}     \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
464                                    (22)
465          \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
466                                    plus all above-Unicode code points:
467                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
468                                    >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
469                                    U+0100..infinity)
470          \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
471                                    Fa-f])
472          \p{Assigned}            All assigned code points (276_271:
473                                    U+0000..0377, U+037A..037F,
474                                    U+0384..038A, U+038C, U+038E..03A1,
475                                    U+03A3..052F ...)
476          \p{Avestan}             \p{Script_Extensions=Avestan} (Short:
477                                    \p{Avst}; NOT \p{Block=Avestan}) (61)
478          \p{Avst}                \p{Avestan} (= \p{Script_Extensions=
479                                    Avestan}) (NOT \p{Block=Avestan}) (61)
480          \p{Bali}                \p{Balinese} (= \p{Script_Extensions=
481                                    Balinese}) (NOT \p{Block=Balinese}) (121)
482          \p{Balinese}            \p{Script_Extensions=Balinese} (Short:
483                                    \p{Bali}; NOT \p{Block=Balinese}) (121)
484          \p{Bamu}                \p{Bamum} (= \p{Script_Extensions=Bamum})
485                                    (NOT \p{Block=Bamum}) (657)
486          \p{Bamum}               \p{Script_Extensions=Bamum} (Short:
487                                    \p{Bamu}; NOT \p{Block=Bamum}) (657)
488        X \p{Bamum_Sup}           \p{Bamum_Supplement} (= \p{Block=
489                                    Bamum_Supplement}) (576)
490        X \p{Bamum_Supplement}    \p{Block=Bamum_Supplement} (Short:
491                                    \p{InBamumSup}) (576)
492        X \p{Basic_Latin}         \p{ASCII} (= \p{Block=Basic_Latin}) (128)
493          \p{Bass}                \p{Bassa_Vah} (= \p{Script_Extensions=
494                                    Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
495                                    (36)
496          \p{Bassa_Vah}           \p{Script_Extensions=Bassa_Vah} (Short:
497                                    \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
498          \p{Batak}               \p{Script_Extensions=Batak} (Short:
499                                    \p{Batk}; NOT \p{Block=Batak}) (56)
500          \p{Batk}                \p{Batak} (= \p{Script_Extensions=Batak})
501                                    (NOT \p{Block=Batak}) (56)
502          \p{Bc: *}               \p{Bidi_Class: *}
503          \p{Beng}                \p{Bengali} (= \p{Script_Extensions=
504                                    Bengali}) (NOT \p{Block=Bengali}) (101)
505          \p{Bengali}             \p{Script_Extensions=Bengali} (Short:
506                                    \p{Beng}; NOT \p{Block=Bengali}) (101)
507          \p{Bhaiksuki}           \p{Script_Extensions=Bhaiksuki} (Short:
508                                    \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
509          \p{Bhks}                \p{Bhaiksuki} (= \p{Script_Extensions=
510                                    Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
511                                    (97)
512          \p{Bidi_C}              \p{Bidi_Control} (= \p{Bidi_Control=Y})
513                                    (12)
514          \p{Bidi_C: *}           \p{Bidi_Control: *}
515          \p{Bidi_Class: AL}      \p{Bidi_Class=Arabic_Letter} (1436)
516          \p{Bidi_Class: AN}      \p{Bidi_Class=Arabic_Number} (51)
517          \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1436: U+0608,
518                                    U+060B, U+060D, U+061B..064A,
519                                    U+066D..066F, U+0671..06D5 ...)
520          \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (51:
521                                    U+0600..0605, U+0660..0669,
522                                    U+066B..066C, U+06DD, U+08E2,
523                                    U+10E60..10E7E)
524          \p{Bidi_Class: B}       \p{Bidi_Class=Paragraph_Separator} (7)
525          \p{Bidi_Class: BN}      \p{Bidi_Class=Boundary_Neutral} (4016)
526          \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016:
527                                    [^\t\n\cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-
528                                    \xff], U+180E, U+200B..200D,
529                                    U+2060..2065, U+206A..206F, U+FDD0..FDEF
530                                    ...)
531          \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15:
532                                    [,.\/:\xa0], U+060C, U+202F, U+2044,
533                                    U+FE50, U+FE52 ...)
534          \p{Bidi_Class: CS}      \p{Bidi_Class=Common_Separator} (15)
535          \p{Bidi_Class: EN}      \p{Bidi_Class=European_Number} (158)
536          \p{Bidi_Class: ES}      \p{Bidi_Class=European_Separator} (12)
537          \p{Bidi_Class: ET}      \p{Bidi_Class=European_Terminator} (87)
538          \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (158: [0-9\xb2-
539                                    \xb3\xb9], U+06F0..06F9, U+2070,
540                                    U+2074..2079, U+2080..2089, U+2488..249B
541                                    ...)
542          \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
543                                    U+207A..207B, U+208A..208B, U+2212,
544                                    U+FB29, U+FE62..FE63 ...)
545          \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (87:
546                                    [#\$\%\xa2-\xa5\xb0-\xb1], U+058F,
547                                    U+0609..060A, U+066A, U+09F2..09F3,
548                                    U+09FB ...)
549          \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
550                                    U+2068)
551          \p{Bidi_Class: FSI}     \p{Bidi_Class=First_Strong_Isolate} (1)
552          \p{Bidi_Class: L}       \p{Bidi_Class=Left_To_Right} (1_097_124
553                                    plus all above-Unicode code points)
554          \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_097_124 plus
555                                    all above-Unicode code points: [A-Za-
556                                    z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
557                                    \xff], U+0100..02B8, U+02BB..02C1,
558                                    U+02D0..02D1, U+02E0..02E4, U+02EE ...)
559          \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
560                                    U+202A)
561          \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
562                                    U+2066)
563          \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
564                                    U+202D)
565          \p{Bidi_Class: LRE}     \p{Bidi_Class=Left_To_Right_Embedding} (1)
566          \p{Bidi_Class: LRI}     \p{Bidi_Class=Left_To_Right_Isolate} (1)
567          \p{Bidi_Class: LRO}     \p{Bidi_Class=Left_To_Right_Override} (1)
568          \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1773:
569                                    U+0300..036F, U+0483..0489,
570                                    U+0591..05BD, U+05BF, U+05C1..05C2,
571                                    U+05C4..05C5 ...)
572          \p{Bidi_Class: NSM}     \p{Bidi_Class=Nonspacing_Mark} (1773)
573          \p{Bidi_Class: ON}      \p{Bidi_Class=Other_Neutral} (5350)
574          \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5350:
575                                    [!\"&\'\(\)*;<=
576                                    >?\@\[\\\]\^_`\{\|\}~\xa1\xa6-\xa9\xab-
577                                    \xac\xae-\xaf\xb4\xb6-\xb8\xbb-
578                                    \xbf\xd7\xf7], U+02B9..02BA,
579                                    U+02C2..02CF, U+02D2..02DF,
580                                    U+02E5..02ED, U+02EF..02FF ...)
581          \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7:
582                                    [\n\r\x1c-\x1e\x85], U+2029)
583          \p{Bidi_Class: PDF}     \p{Bidi_Class=Pop_Directional_Format} (1)
584          \p{Bidi_Class: PDI}     \p{Bidi_Class=Pop_Directional_Isolate} (1)
585          \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
586                                    U+202C)
587          \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
588                                    U+2069)
589          \p{Bidi_Class: R}       \p{Bidi_Class=Right_To_Left} (4054)
590          \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (4054: U+0590,
591                                    U+05BE, U+05C0, U+05C3, U+05C6,
592                                    U+05C8..05FF ...)
593          \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
594                                    U+202B)
595          \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
596                                    U+2067)
597          \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
598                                    U+202E)
599          \p{Bidi_Class: RLE}     \p{Bidi_Class=Right_To_Left_Embedding} (1)
600          \p{Bidi_Class: RLI}     \p{Bidi_Class=Right_To_Left_Isolate} (1)
601          \p{Bidi_Class: RLO}     \p{Bidi_Class=Right_To_Left_Override} (1)
602          \p{Bidi_Class: S}       \p{Bidi_Class=Segment_Separator} (3)
603          \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3:
604                                    [\t\cK\x1f])
605          \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
606                                    U+1680, U+2000..200A, U+2028, U+205F,
607                                    U+3000)
608          \p{Bidi_Class: WS}      \p{Bidi_Class=White_Space} (17)
609          \p{Bidi_Control}        \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
610          \p{Bidi_Control: N*}    (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
611                                    plus all above-Unicode code points:
612                                    U+0000..061B, U+061D..200D,
613                                    U+2010..2029, U+202F..2065,
614                                    U+206A..infinity)
615          \p{Bidi_Control: Y*}    (Short: \p{BidiC=Y}, \p{BidiC}) (12:
616                                    U+061C, U+200E..200F, U+202A..202E,
617                                    U+2066..2069)
618          \p{Bidi_M}              \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
619                                    (545)
620          \p{Bidi_M: *}           \p{Bidi_Mirrored: *}
621          \p{Bidi_Mirrored}       \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
622                                    (545)
623          \p{Bidi_Mirrored: N*}   (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
624                                    plus all above-Unicode code points:
625                                    [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
626                                    Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
627                                    \xff], U+0100..0F39, U+0F3E..169A,
628                                    U+169D..2038, U+203B..2044, U+2047..207C
629                                    ...)
630          \p{Bidi_Mirrored: Y*}   (Short: \p{BidiM=Y}, \p{BidiM}) (545:
631                                    [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
632                                    U+169B..169C, U+2039..203A,
633                                    U+2045..2046, U+207D..207E ...)
634          \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
635                                    (60)
636          \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60:
637                                    [\)\]\}], U+0F3B, U+0F3D, U+169C,
638                                    U+2046, U+207E ...)
639          \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
640                                    (1_113_992 plus all above-Unicode code
641                                    points)
642          \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
643                                    plus all above-Unicode code points:
644                                    [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=
645                                    >?\@A-Z\\\^_`a-z\|~\x7f-\xff],
646                                    U+0100..0F39, U+0F3E..169A,
647                                    U+169D..2044, U+2047..207C, U+207F..208C
648                                    ...)
649          \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
650                                    (60)
651          \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60:
652                                    [\(\[\{], U+0F3A, U+0F3C, U+169B,
653                                    U+2045, U+207D ...)
654          \p{Blank}               \p{XPosixBlank} (18)
655          \p{Blk: *}              \p{Block: *}
656          \p{Block: Adlam}        (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
657                                    U+1E900..1E95F)
658          \p{Block: Aegean_Numbers} (64: U+10100..1013F)
659          \p{Block: Ahom}         (NOT \p{Ahom} NOR \p{Is_Ahom}) (64:
660                                    U+11700..1173F)
661          \p{Block: Alchemical}   \p{Block=Alchemical_Symbols} (128)
662          \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
663                                    U+1F700..1F77F)
664          \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
665                                    (80)
666          \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
667                                    AlphabeticPF}) (80: U+FB00..FB4F)
668          \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
669                                    NOR \p{Is_Anatolian_Hieroglyphs}) (640:
670                                    U+14400..1467F)
671          \p{Block: Ancient_Greek_Music} \p{Block=
672                                    Ancient_Greek_Musical_Notation} (80)
673          \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
674                                    AncientGreekMusic}) (80: U+1D200..1D24F)
675          \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
676          \p{Block: Ancient_Symbols} (64: U+10190..101CF)
677          \p{Block: Arabic}       (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
678                                    U+0600..06FF)
679          \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
680          \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
681                                    U+08A0..08FF)
682          \p{Block: Arabic_Math}  \p{Block=
683                                    Arabic_Mathematical_Alphabetic_Symbols}
684                                    (256)
685          \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
686                                    ArabicMath}) (256: U+1EE00..1EEFF)
687          \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
688          \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
689          \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
690                                    (688: U+FB50..FDFF)
691          \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
692                                    (144: U+FE70..FEFF)
693          \p{Block: Arabic_Sup}   \p{Block=Arabic_Supplement} (48)
694          \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
695                                    U+0750..077F)
696          \p{Block: Armenian}     (NOT \p{Armenian} NOR \p{Is_Armenian})
697                                    (96: U+0530..058F)
698          \p{Block: Arrows}       (112: U+2190..21FF)
699          \p{Block: ASCII}        \p{Block=Basic_Latin} (128)
700          \p{Block: Avestan}      (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
701                                    U+10B00..10B3F)
702          \p{Block: Balinese}     (NOT \p{Balinese} NOR \p{Is_Balinese})
703                                    (128: U+1B00..1B7F)
704          \p{Block: Bamum}        (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
705                                    U+A6A0..A6FF)
706          \p{Block: Bamum_Sup}    \p{Block=Bamum_Supplement} (576)
707          \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
708                                    U+16800..16A3F)
709          \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
710          \p{Block: Bassa_Vah}    (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
711                                    (48: U+16AD0..16AFF)
712          \p{Block: Batak}        (NOT \p{Batak} NOR \p{Is_Batak}) (64:
713                                    U+1BC0..1BFF)
714          \p{Block: Bengali}      (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
715                                    U+0980..09FF)
716          \p{Block: Bhaiksuki}    (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
717                                    (112: U+11C00..11C6F)
718          \p{Block: Block_Elements} (32: U+2580..259F)
719          \p{Block: Bopomofo}     (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
720                                    (48: U+3100..312F)
721          \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
722          \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
723                                    U+31A0..31BF)
724          \p{Block: Box_Drawing}  (128: U+2500..257F)
725          \p{Block: Brahmi}       (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
726                                    U+11000..1107F)
727          \p{Block: Braille}      \p{Block=Braille_Patterns} (256)
728          \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
729                                    U+2800..28FF)
730          \p{Block: Buginese}     (NOT \p{Buginese} NOR \p{Is_Buginese})
731                                    (32: U+1A00..1A1F)
732          \p{Block: Buhid}        (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
733                                    U+1740..175F)
734          \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
735                                    (256)
736          \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
737                                    ByzantineMusic}) (256: U+1D000..1D0FF)
738          \p{Block: Canadian_Syllabics} \p{Block=
739                                    Unified_Canadian_Aboriginal_Syllabics}
740                                    (640)
741          \p{Block: Carian}       (NOT \p{Carian} NOR \p{Is_Carian}) (64:
742                                    U+102A0..102DF)
743          \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
744                                    \p{Is_Caucasian_Albanian}) (64:
745                                    U+10530..1056F)
746          \p{Block: Chakma}       (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
747                                    U+11100..1114F)
748          \p{Block: Cham}         (NOT \p{Cham} NOR \p{Is_Cham}) (96:
749                                    U+AA00..AA5F)
750          \p{Block: Cherokee}     (NOT \p{Cherokee} NOR \p{Is_Cherokee})
751                                    (96: U+13A0..13FF)
752          \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
753          \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
754                                    U+AB70..ABBF)
755          \p{Block: CJK}          \p{Block=CJK_Unified_Ideographs} (20_992)
756          \p{Block: CJK_Compat}   \p{Block=CJK_Compatibility} (256)
757          \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
758          \p{Block: CJK_Compat_Ideographs} \p{Block=
759                                    CJK_Compatibility_Ideographs} (512)
760          \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
761                                    CJK_Compatibility_Ideographs_Supplement}
762                                    (544)
763          \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
764                                    U+3300..33FF)
765          \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
766                                    (32: U+FE30..FE4F)
767          \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
768                                    CJKCompatIdeographs}) (512: U+F900..FAFF)
769          \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
770                                    CJKCompatIdeographsSup}) (544:
771                                    U+2F800..2FA1F)
772          \p{Block: CJK_Ext_A}    \p{Block=
773                                    CJK_Unified_Ideographs_Extension_A}
774                                    (6592)
775          \p{Block: CJK_Ext_B}    \p{Block=
776                                    CJK_Unified_Ideographs_Extension_B}
777                                    (42_720)
778          \p{Block: CJK_Ext_C}    \p{Block=
779                                    CJK_Unified_Ideographs_Extension_C}
780                                    (4160)
781          \p{Block: CJK_Ext_D}    \p{Block=
782                                    CJK_Unified_Ideographs_Extension_D} (224)
783          \p{Block: CJK_Ext_E}    \p{Block=
784                                    CJK_Unified_Ideographs_Extension_E}
785                                    (5776)
786          \p{Block: CJK_Ext_F}    \p{Block=
787                                    CJK_Unified_Ideographs_Extension_F}
788                                    (7488)
789          \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
790          \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
791                                    (128: U+2E80..2EFF)
792          \p{Block: CJK_Strokes}  (48: U+31C0..31EF)
793          \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
794          \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
795                                    (64: U+3000..303F)
796          \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
797                                    U+4E00..9FFF)
798          \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
799                                    CJKExtA}) (6592: U+3400..4DBF)
800          \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
801                                    CJKExtB}) (42_720: U+20000..2A6DF)
802          \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
803                                    CJKExtC}) (4160: U+2A700..2B73F)
804          \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
805                                    CJKExtD}) (224: U+2B740..2B81F)
806          \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
807                                    CJKExtE}) (5776: U+2B820..2CEAF)
808          \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
809                                    CJKExtF}) (7488: U+2CEB0..2EBEF)
810          \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
811                                    Diacriticals}) (112: U+0300..036F)
812          \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
813                                    DiacriticalsExt}) (80: U+1AB0..1AFF)
814          \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
815                                    DiacriticalsForSymbols}) (48:
816                                    U+20D0..20FF)
817          \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
818                                    DiacriticalsSup}) (64: U+1DC0..1DFF)
819          \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
820                                    U+FE20..FE2F)
821          \p{Block: Combining_Marks_For_Symbols} \p{Block=
822                                    Combining_Diacritical_Marks_For_Symbols}
823                                    (48)
824          \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
825                                    IndicNumberForms}) (16: U+A830..A83F)
826          \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
827          \p{Block: Control_Pictures} (64: U+2400..243F)
828          \p{Block: Coptic}       (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
829                                    U+2C80..2CFF)
830          \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
831          \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
832          \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
833                                    U+1D360..1D37F)
834          \p{Block: Cuneiform}    (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
835                                    (1024: U+12000..123FF)
836          \p{Block: Cuneiform_Numbers} \p{Block=
837                                    Cuneiform_Numbers_And_Punctuation} (128)
838          \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
839                                    CuneiformNumbers}) (128: U+12400..1247F)
840          \p{Block: Currency_Symbols} (48: U+20A0..20CF)
841          \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
842          \p{Block: Cyrillic}     (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
843                                    (256: U+0400..04FF)
844          \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
845          \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
846          \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
847          \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
848                                    U+2DE0..2DFF)
849          \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
850                                    U+A640..A69F)
851          \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
852                                    U+1C80..1C8F)
853          \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
854          \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
855                                    U+0500..052F)
856          \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
857                                    (48)
858          \p{Block: Deseret}      (80: U+10400..1044F)
859          \p{Block: Devanagari}   (NOT \p{Devanagari} NOR \p{Is_Devanagari})
860                                    (128: U+0900..097F)
861          \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
862          \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
863                                    U+A8E0..A8FF)
864          \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
865          \p{Block: Diacriticals_Ext} \p{Block=
866                                    Combining_Diacritical_Marks_Extended}
867                                    (80)
868          \p{Block: Diacriticals_For_Symbols} \p{Block=
869                                    Combining_Diacritical_Marks_For_Symbols}
870                                    (48)
871          \p{Block: Diacriticals_Sup} \p{Block=
872                                    Combining_Diacritical_Marks_Supplement}
873                                    (64)
874          \p{Block: Dingbats}     (192: U+2700..27BF)
875          \p{Block: Domino}       \p{Block=Domino_Tiles} (112)
876          \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
877                                    U+1F030..1F09F)
878          \p{Block: Duployan}     (NOT \p{Duployan} NOR \p{Is_Duployan})
879                                    (160: U+1BC00..1BC9F)
880          \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
881          \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
882                                    \p{Is_Egyptian_Hieroglyphs}) (1072:
883                                    U+13000..1342F)
884          \p{Block: Elbasan}      (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
885                                    U+10500..1052F)
886          \p{Block: Emoticons}    (80: U+1F600..1F64F)
887          \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
888          \p{Block: Enclosed_Alphanum_Sup} \p{Block=
889                                    Enclosed_Alphanumeric_Supplement} (256)
890          \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
891                                    EnclosedAlphanumSup}) (256:
892                                    U+1F100..1F1FF)
893          \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
894                                    EnclosedAlphanum}) (160: U+2460..24FF)
895          \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
896                                    (256)
897          \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
898                                    EnclosedCJK}) (256: U+3200..32FF)
899          \p{Block: Enclosed_Ideographic_Sup} \p{Block=
900                                    Enclosed_Ideographic_Supplement} (256)
901          \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
902                                    EnclosedIdeographicSup}) (256:
903                                    U+1F200..1F2FF)
904          \p{Block: Ethiopic}     (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
905                                    (384: U+1200..137F)
906          \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
907          \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
908          \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
909                                    U+2D80..2DDF)
910          \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
911                                    U+AB00..AB2F)
912          \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
913          \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
914                                    U+1380..139F)
915          \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
916                                    \p{Punct} NOR \p{Is_Punctuation}) (112:
917                                    U+2000..206F)
918          \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
919          \p{Block: Geometric_Shapes_Ext} \p{Block=
920                                    Geometric_Shapes_Extended} (128)
921          \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
922                                    GeometricShapesExt}) (128:
923                                    U+1F780..1F7FF)
924          \p{Block: Georgian}     (NOT \p{Georgian} NOR \p{Is_Georgian})
925                                    (96: U+10A0..10FF)
926          \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
927          \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
928                                    U+2D00..2D2F)
929          \p{Block: Glagolitic}   (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
930                                    (96: U+2C00..2C5F)
931          \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
932          \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
933                                    (48: U+1E000..1E02F)
934          \p{Block: Gothic}       (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
935                                    U+10330..1034F)
936          \p{Block: Grantha}      (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
937                                    U+11300..1137F)
938          \p{Block: Greek}        \p{Block=Greek_And_Coptic} (NOT \p{Greek}
939                                    NOR \p{Is_Greek}) (144)
940          \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
941                                    NOR \p{Is_Greek}) (144: U+0370..03FF)
942          \p{Block: Greek_Ext}    \p{Block=Greek_Extended} (256)
943          \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
944                                    U+1F00..1FFF)
945          \p{Block: Gujarati}     (NOT \p{Gujarati} NOR \p{Is_Gujarati})
946                                    (128: U+0A80..0AFF)
947          \p{Block: Gurmukhi}     (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
948                                    (128: U+0A00..0A7F)
949          \p{Block: Half_And_Full_Forms} \p{Block=
950                                    Halfwidth_And_Fullwidth_Forms} (240)
951          \p{Block: Half_Marks}   \p{Block=Combining_Half_Marks} (16)
952          \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
953                                    HalfAndFullForms}) (240: U+FF00..FFEF)
954          \p{Block: Hangul}       \p{Block=Hangul_Syllables} (NOT \p{Hangul}
955                                    NOR \p{Is_Hangul}) (11_184)
956          \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
957                                    (96: U+3130..318F)
958          \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
959          \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
960                                    U+A960..A97F)
961          \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
962                                    U+D7B0..D7FF)
963          \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
964                                    NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
965          \p{Block: Hanunoo}      (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
966                                    U+1720..173F)
967          \p{Block: Hatran}       (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
968                                    U+108E0..108FF)
969          \p{Block: Hebrew}       (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
970                                    U+0590..05FF)
971          \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
972                                    HighPUSurrogates}) (128: U+DB80..DBFF)
973          \p{Block: High_PU_Surrogates} \p{Block=
974                                    High_Private_Use_Surrogates} (128)
975          \p{Block: High_Surrogates} (896: U+D800..DB7F)
976          \p{Block: Hiragana}     (NOT \p{Hiragana} NOR \p{Is_Hiragana})
977                                    (96: U+3040..309F)
978          \p{Block: IDC}          \p{Block=
979                                    Ideographic_Description_Characters} (NOT
980                                    \p{ID_Continue} NOR \p{Is_IDC}) (16)
981          \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
982                                    NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
983                                    U+2FF0..2FFF)
984          \p{Block: Ideographic_Symbols} \p{Block=
985                                    Ideographic_Symbols_And_Punctuation} (32)
986          \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
987                                    IdeographicSymbols}) (32: U+16FE0..16FFF)
988          \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
989                                    \p{Is_Imperial_Aramaic}) (32:
990                                    U+10840..1085F)
991          \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
992                                    (16)
993          \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
994                                    NOR \p{Is_Inscriptional_Pahlavi}) (32:
995                                    U+10B60..10B7F)
996          \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
997                                    NOR \p{Is_Inscriptional_Parthian}) (32:
998                                    U+10B40..10B5F)
999          \p{Block: IPA_Ext}      \p{Block=IPA_Extensions} (96)
1000          \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
1001                                    U+0250..02AF)
1002          \p{Block: Jamo}         \p{Block=Hangul_Jamo} (256)
1003          \p{Block: Jamo_Ext_A}   \p{Block=Hangul_Jamo_Extended_A} (32)
1004          \p{Block: Jamo_Ext_B}   \p{Block=Hangul_Jamo_Extended_B} (80)
1005          \p{Block: Javanese}     (NOT \p{Javanese} NOR \p{Is_Javanese})
1006                                    (96: U+A980..A9DF)
1007          \p{Block: Kaithi}       (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
1008                                    U+11080..110CF)
1009          \p{Block: Kana_Ext_A}   \p{Block=Kana_Extended_A} (48)
1010          \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
1011                                    U+1B100..1B12F)
1012          \p{Block: Kana_Sup}     \p{Block=Kana_Supplement} (256)
1013          \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
1014                                    U+1B000..1B0FF)
1015          \p{Block: Kanbun}       (16: U+3190..319F)
1016          \p{Block: Kangxi}       \p{Block=Kangxi_Radicals} (224)
1017          \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
1018                                    U+2F00..2FDF)
1019          \p{Block: Kannada}      (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
1020                                    U+0C80..0CFF)
1021          \p{Block: Katakana}     (NOT \p{Katakana} NOR \p{Is_Katakana})
1022                                    (96: U+30A0..30FF)
1023          \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
1024          \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
1025                                    KatakanaExt}) (16: U+31F0..31FF)
1026          \p{Block: Kayah_Li}     (48: U+A900..A92F)
1027          \p{Block: Kharoshthi}   (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
1028                                    (96: U+10A00..10A5F)
1029          \p{Block: Khmer}        (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
1030                                    U+1780..17FF)
1031          \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
1032          \p{Block: Khojki}       (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
1033                                    U+11200..1124F)
1034          \p{Block: Khudawadi}    (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
1035                                    (80: U+112B0..112FF)
1036          \p{Block: Lao}          (NOT \p{Lao} NOR \p{Is_Lao}) (128:
1037                                    U+0E80..0EFF)
1038          \p{Block: Latin_1}      \p{Block=Latin_1_Supplement} (128)
1039          \p{Block: Latin_1_Sup}  \p{Block=Latin_1_Supplement} (128)
1040          \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
1041                                    \xff])
1042          \p{Block: Latin_Ext_A}  \p{Block=Latin_Extended_A} (128)
1043          \p{Block: Latin_Ext_Additional} \p{Block=
1044                                    Latin_Extended_Additional} (256)
1045          \p{Block: Latin_Ext_B}  \p{Block=Latin_Extended_B} (208)
1046          \p{Block: Latin_Ext_C}  \p{Block=Latin_Extended_C} (32)
1047          \p{Block: Latin_Ext_D}  \p{Block=Latin_Extended_D} (224)
1048          \p{Block: Latin_Ext_E}  \p{Block=Latin_Extended_E} (64)
1049          \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
1050                                    U+0100..017F)
1051          \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
1052                                    LatinExtAdditional}) (256: U+1E00..1EFF)
1053          \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
1054                                    U+0180..024F)
1055          \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
1056                                    U+2C60..2C7F)
1057          \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
1058                                    U+A720..A7FF)
1059          \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
1060                                    U+AB30..AB6F)
1061          \p{Block: Lepcha}       (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
1062                                    U+1C00..1C4F)
1063          \p{Block: Letterlike_Symbols} (80: U+2100..214F)
1064          \p{Block: Limbu}        (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
1065                                    U+1900..194F)
1066          \p{Block: Linear_A}     (NOT \p{Linear_A} NOR \p{Is_Linear_A})
1067                                    (384: U+10600..1077F)
1068          \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
1069          \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
1070          \p{Block: Lisu}         (48: U+A4D0..A4FF)
1071          \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
1072          \p{Block: Lycian}       (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
1073                                    U+10280..1029F)
1074          \p{Block: Lydian}       (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
1075                                    U+10920..1093F)
1076          \p{Block: Mahajani}     (NOT \p{Mahajani} NOR \p{Is_Mahajani})
1077                                    (48: U+11150..1117F)
1078          \p{Block: Mahjong}      \p{Block=Mahjong_Tiles} (48)
1079          \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
1080                                    U+1F000..1F02F)
1081          \p{Block: Malayalam}    (NOT \p{Malayalam} NOR \p{Is_Malayalam})
1082                                    (128: U+0D00..0D7F)
1083          \p{Block: Mandaic}      (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
1084                                    U+0840..085F)
1085          \p{Block: Manichaean}   (NOT \p{Manichaean} NOR \p{Is_Manichaean})
1086                                    (64: U+10AC0..10AFF)
1087          \p{Block: Marchen}      (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
1088                                    U+11C70..11CBF)
1089          \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
1090                                    \p{Is_Masaram_Gondi}) (96:
1091                                    U+11D00..11D5F)
1092          \p{Block: Math_Alphanum} \p{Block=
1093                                    Mathematical_Alphanumeric_Symbols} (1024)
1094          \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
1095          \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
1096                                    MathAlphanum}) (1024: U+1D400..1D7FF)
1097          \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
1098                                    (256: U+2200..22FF)
1099          \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
1100                                    \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
1101          \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
1102          \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
1103                                    (32: U+AAE0..AAFF)
1104          \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
1105                                    \p{Is_Mende_Kikakui}) (224:
1106                                    U+1E800..1E8DF)
1107          \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
1108                                    \p{Is_Meroitic_Cursive}) (96:
1109                                    U+109A0..109FF)
1110          \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
1111          \p{Block: Miao}         (NOT \p{Miao} NOR \p{Is_Miao}) (160:
1112                                    U+16F00..16F9F)
1113          \p{Block: Misc_Arrows}  \p{Block=Miscellaneous_Symbols_And_Arrows}
1114                                    (256)
1115          \p{Block: Misc_Math_Symbols_A} \p{Block=
1116                                    Miscellaneous_Mathematical_Symbols_A}
1117                                    (48)
1118          \p{Block: Misc_Math_Symbols_B} \p{Block=
1119                                    Miscellaneous_Mathematical_Symbols_B}
1120                                    (128)
1121          \p{Block: Misc_Pictographs} \p{Block=
1122                                    Miscellaneous_Symbols_And_Pictographs}
1123                                    (768)
1124          \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
1125          \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
1126          \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
1127                                    MiscMathSymbolsA}) (48: U+27C0..27EF)
1128          \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
1129                                    MiscMathSymbolsB}) (128: U+2980..29FF)
1130          \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
1131                                    (256: U+2600..26FF)
1132          \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
1133                                    MiscArrows}) (256: U+2B00..2BFF)
1134          \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
1135                                    MiscPictographs}) (768: U+1F300..1F5FF)
1136          \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
1137                                    (256: U+2300..23FF)
1138          \p{Block: Modi}         (NOT \p{Modi} NOR \p{Is_Modi}) (96:
1139                                    U+11600..1165F)
1140          \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
1141          \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
1142          \p{Block: Mongolian}    (NOT \p{Mongolian} NOR \p{Is_Mongolian})
1143                                    (176: U+1800..18AF)
1144          \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
1145          \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
1146                                    U+11660..1167F)
1147          \p{Block: Mro}          (NOT \p{Mro} NOR \p{Is_Mro}) (48:
1148                                    U+16A40..16A6F)
1149          \p{Block: Multani}      (NOT \p{Multani} NOR \p{Is_Multani}) (48:
1150                                    U+11280..112AF)
1151          \p{Block: Music}        \p{Block=Musical_Symbols} (256)
1152          \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
1153                                    U+1D100..1D1FF)
1154          \p{Block: Myanmar}      (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
1155                                    U+1000..109F)
1156          \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
1157          \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
1158          \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
1159                                    U+AA60..AA7F)
1160          \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
1161                                    U+A9E0..A9FF)
1162          \p{Block: Nabataean}    (NOT \p{Nabataean} NOR \p{Is_Nabataean})
1163                                    (48: U+10880..108AF)
1164          \p{Block: NB}           \p{Block=No_Block} (834_096 plus all
1165                                    above-Unicode code points)
1166          \p{Block: New_Tai_Lue}  (NOT \p{New_Tai_Lue} NOR
1167                                    \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
1168          \p{Block: Newa}         (NOT \p{Newa} NOR \p{Is_Newa}) (128:
1169                                    U+11400..1147F)
1170          \p{Block: NKo}          (NOT \p{Nko} NOR \p{Is_NKo}) (64:
1171                                    U+07C0..07FF)
1172          \p{Block: No_Block}     (Short: \p{Blk=NB}) (834_096 plus all
1173                                    above-Unicode code points: U+0870..089F,
1174                                    U+1C90..1CBF, U+2FE0..2FEF,
1175                                    U+10200..1027F, U+103E0..103FF,
1176                                    U+10570..105FF ...)
1177          \p{Block: Number_Forms} (64: U+2150..218F)
1178          \p{Block: Nushu}        (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
1179                                    U+1B170..1B2FF)
1180          \p{Block: OCR}          \p{Block=Optical_Character_Recognition}
1181                                    (32)
1182          \p{Block: Ogham}        (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
1183                                    U+1680..169F)
1184          \p{Block: Ol_Chiki}     (48: U+1C50..1C7F)
1185          \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
1186                                    \p{Is_Old_Hungarian}) (128:
1187                                    U+10C80..10CFF)
1188          \p{Block: Old_Italic}   (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
1189                                    (48: U+10300..1032F)
1190          \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
1191          \p{Block: Old_Permic}   (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
1192                                    (48: U+10350..1037F)
1193          \p{Block: Old_Persian}  (NOT \p{Old_Persian} NOR
1194                                    \p{Is_Old_Persian}) (64: U+103A0..103DF)
1195          \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
1196          \p{Block: Old_Turkic}   (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
1197                                    (80: U+10C00..10C4F)
1198          \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
1199                                    U+2440..245F)
1200          \p{Block: Oriya}        (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
1201                                    U+0B00..0B7F)
1202          \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
1203          \p{Block: Osage}        (NOT \p{Osage} NOR \p{Is_Osage}) (80:
1204                                    U+104B0..104FF)
1205          \p{Block: Osmanya}      (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
1206                                    U+10480..104AF)
1207          \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
1208                                    \p{Is_Pahawh_Hmong}) (144:
1209                                    U+16B00..16B8F)
1210          \p{Block: Palmyrene}    (32: U+10860..1087F)
1211          \p{Block: Pau_Cin_Hau}  (NOT \p{Pau_Cin_Hau} NOR
1212                                    \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
1213          \p{Block: Phags_Pa}     (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
1214                                    (64: U+A840..A87F)
1215          \p{Block: Phaistos}     \p{Block=Phaistos_Disc} (48)
1216          \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
1217                                    U+101D0..101FF)
1218          \p{Block: Phoenician}   (NOT \p{Phoenician} NOR \p{Is_Phoenician})
1219                                    (32: U+10900..1091F)
1220          \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
1221          \p{Block: Phonetic_Ext_Sup} \p{Block=
1222                                    Phonetic_Extensions_Supplement} (64)
1223          \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
1224                                    U+1D00..1D7F)
1225          \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
1226                                    PhoneticExtSup}) (64: U+1D80..1DBF)
1227          \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
1228          \p{Block: Private_Use}  \p{Block=Private_Use_Area} (NOT
1229                                    \p{Private_Use} NOR \p{Is_Private_Use})
1230                                    (6400)
1231          \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
1232                                    \p{Private_Use} NOR \p{Is_Private_Use})
1233                                    (6400: U+E000..F8FF)
1234          \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
1235                                    \p{Is_Psalter_Pahlavi}) (48:
1236                                    U+10B80..10BAF)
1237          \p{Block: PUA}          \p{Block=Private_Use_Area} (NOT
1238                                    \p{Private_Use} NOR \p{Is_Private_Use})
1239                                    (6400)
1240          \p{Block: Punctuation}  \p{Block=General_Punctuation} (NOT
1241                                    \p{Punct} NOR \p{Is_Punctuation}) (112)
1242          \p{Block: Rejang}       (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
1243                                    U+A930..A95F)
1244          \p{Block: Rumi}         \p{Block=Rumi_Numeral_Symbols} (32)
1245          \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
1246                                    U+10E60..10E7F)
1247          \p{Block: Runic}        (NOT \p{Runic} NOR \p{Is_Runic}) (96:
1248                                    U+16A0..16FF)
1249          \p{Block: Samaritan}    (NOT \p{Samaritan} NOR \p{Is_Samaritan})
1250                                    (64: U+0800..083F)
1251          \p{Block: Saurashtra}   (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
1252                                    (96: U+A880..A8DF)
1253          \p{Block: Sharada}      (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
1254                                    U+11180..111DF)
1255          \p{Block: Shavian}      (48: U+10450..1047F)
1256          \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
1257          \p{Block: Siddham}      (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
1258                                    U+11580..115FF)
1259          \p{Block: Sinhala}      (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
1260                                    U+0D80..0DFF)
1261          \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
1262          \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
1263                                    U+FE50..FE6F)
1264          \p{Block: Small_Forms}  \p{Block=Small_Form_Variants} (32)
1265          \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
1266                                    \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
1267          \p{Block: Soyombo}      (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
1268                                    U+11A50..11AAF)
1269          \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
1270                                    ModifierLetters}) (80: U+02B0..02FF)
1271          \p{Block: Specials}     (16: U+FFF0..FFFF)
1272          \p{Block: Sundanese}    (NOT \p{Sundanese} NOR \p{Is_Sundanese})
1273                                    (64: U+1B80..1BBF)
1274          \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
1275          \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
1276                                    U+1CC0..1CCF)
1277          \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
1278          \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
1279          \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
1280          \p{Block: Sup_Math_Operators} \p{Block=
1281                                    Supplemental_Mathematical_Operators}
1282                                    (256)
1283          \p{Block: Sup_PUA_A}    \p{Block=Supplementary_Private_Use_Area_A}
1284                                    (65_536)
1285          \p{Block: Sup_PUA_B}    \p{Block=Supplementary_Private_Use_Area_B}
1286                                    (65_536)
1287          \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
1288          \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
1289                                    Supplemental_Symbols_And_Pictographs}
1290                                    (256)
1291          \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
1292          \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
1293                                    SuperAndSub}) (48: U+2070..209F)
1294          \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
1295                                    U+27F0..27FF)
1296          \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
1297                                    U+2900..297F)
1298          \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
1299                                    U+1F800..1F8FF)
1300          \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
1301                                    SupMathOperators}) (256: U+2A00..2AFF)
1302          \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
1303                                    SupPunctuation}) (128: U+2E00..2E7F)
1304          \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
1305                                    SupSymbolsAndPictographs}) (256:
1306                                    U+1F900..1F9FF)
1307          \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
1308                                    SupPUAA}) (65_536: U+F0000..FFFFF)
1309          \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
1310                                    SupPUAB}) (65_536: U+100000..10FFFF)
1311          \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
1312          \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
1313                                    \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
1314          \p{Block: Syriac}       (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
1315                                    U+0700..074F)
1316          \p{Block: Syriac_Sup}   \p{Block=Syriac_Supplement} (16)
1317          \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
1318                                    U+0860..086F)
1319          \p{Block: Tagalog}      (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
1320                                    U+1700..171F)
1321          \p{Block: Tagbanwa}     (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
1322                                    (32: U+1760..177F)
1323          \p{Block: Tags}         (128: U+E0000..E007F)
1324          \p{Block: Tai_Le}       (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
1325                                    U+1950..197F)
1326          \p{Block: Tai_Tham}     (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
1327                                    (144: U+1A20..1AAF)
1328          \p{Block: Tai_Viet}     (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
1329                                    (96: U+AA80..AADF)
1330          \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1331          \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
1332                                    U+1D300..1D35F)
1333          \p{Block: Takri}        (NOT \p{Takri} NOR \p{Is_Takri}) (80:
1334                                    U+11680..116CF)
1335          \p{Block: Tamil}        (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
1336                                    U+0B80..0BFF)
1337          \p{Block: Tangut}       (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
1338                                    U+17000..187FF)
1339          \p{Block: Tangut_Components} (768: U+18800..18AFF)
1340          \p{Block: Telugu}       (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
1341                                    U+0C00..0C7F)
1342          \p{Block: Thaana}       (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
1343                                    U+0780..07BF)
1344          \p{Block: Thai}         (NOT \p{Thai} NOR \p{Is_Thai}) (128:
1345                                    U+0E00..0E7F)
1346          \p{Block: Tibetan}      (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
1347                                    U+0F00..0FFF)
1348          \p{Block: Tifinagh}     (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
1349                                    (80: U+2D30..2D7F)
1350          \p{Block: Tirhuta}      (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
1351                                    U+11480..114DF)
1352          \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1353                                    (128)
1354          \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1355                                    TransportAndMap}) (128: U+1F680..1F6FF)
1356          \p{Block: UCAS}         \p{Block=
1357                                    Unified_Canadian_Aboriginal_Syllabics}
1358                                    (640)
1359          \p{Block: UCAS_Ext}     \p{Block=
1360                                    Unified_Canadian_Aboriginal_Syllabics_-
1361                                    Extended} (80)
1362          \p{Block: Ugaritic}     (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
1363                                    (32: U+10380..1039F)
1364          \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1365                                    UCAS}) (640: U+1400..167F)
1366          \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1367                                    \p{Blk=UCASExt}) (80: U+18B0..18FF)
1368          \p{Block: Vai}          (NOT \p{Vai} NOR \p{Is_Vai}) (320:
1369                                    U+A500..A63F)
1370          \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1371                                    \p{Variation_Selector} NOR \p{Is_VS})
1372                                    (16: U+FE00..FE0F)
1373          \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1374                                    (240: U+E0100..E01EF)
1375          \p{Block: Vedic_Ext}    \p{Block=Vedic_Extensions} (48)
1376          \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
1377                                    U+1CD0..1CFF)
1378          \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
1379          \p{Block: VS}           \p{Block=Variation_Selectors} (NOT
1380                                    \p{Variation_Selector} NOR \p{Is_VS})
1381                                    (16)
1382          \p{Block: VS_Sup}       \p{Block=Variation_Selectors_Supplement}
1383                                    (240)
1384          \p{Block: Warang_Citi}  (NOT \p{Warang_Citi} NOR
1385                                    \p{Is_Warang_Citi}) (96: U+118A0..118FF)
1386          \p{Block: Yi_Radicals}  (64: U+A490..A4CF)
1387          \p{Block: Yi_Syllables} (1168: U+A000..A48F)
1388          \p{Block: Yijing}       \p{Block=Yijing_Hexagram_Symbols} (64)
1389          \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
1390                                    U+4DC0..4DFF)
1391          \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
1392                                    \p{Is_Zanabazar_Square}) (80:
1393                                    U+11A00..11A4F)
1394        X \p{Block_Elements}      \p{Block=Block_Elements} (32)
1395          \p{Bopo}                \p{Bopomofo} (= \p{Script_Extensions=
1396                                    Bopomofo}) (NOT \p{Block=Bopomofo}) (111)
1397          \p{Bopomofo}            \p{Script_Extensions=Bopomofo} (Short:
1398                                    \p{Bopo}; NOT \p{Block=Bopomofo}) (111)
1399        X \p{Bopomofo_Ext}        \p{Bopomofo_Extended} (= \p{Block=
1400                                    Bopomofo_Extended}) (32)
1401        X \p{Bopomofo_Extended}   \p{Block=Bopomofo_Extended} (Short:
1402                                    \p{InBopomofoExt}) (32)
1403        X \p{Box_Drawing}         \p{Block=Box_Drawing} (128)
1404          \p{Bpt: *}              \p{Bidi_Paired_Bracket_Type: *}
1405          \p{Brah}                \p{Brahmi} (= \p{Script_Extensions=
1406                                    Brahmi}) (NOT \p{Block=Brahmi}) (109)
1407          \p{Brahmi}              \p{Script_Extensions=Brahmi} (Short:
1408                                    \p{Brah}; NOT \p{Block=Brahmi}) (109)
1409          \p{Brai}                \p{Braille} (= \p{Script_Extensions=
1410                                    Braille}) (256)
1411          \p{Braille}             \p{Script_Extensions=Braille} (Short:
1412                                    \p{Brai}) (256)
1413        X \p{Braille_Patterns}    \p{Block=Braille_Patterns} (Short:
1414                                    \p{InBraille}) (256)
1415          \p{Bugi}                \p{Buginese} (= \p{Script_Extensions=
1416                                    Buginese}) (NOT \p{Block=Buginese}) (31)
1417          \p{Buginese}            \p{Script_Extensions=Buginese} (Short:
1418                                    \p{Bugi}; NOT \p{Block=Buginese}) (31)
1419          \p{Buhd}                \p{Buhid} (= \p{Script_Extensions=Buhid})
1420                                    (NOT \p{Block=Buhid}) (22)
1421          \p{Buhid}               \p{Script_Extensions=Buhid} (Short:
1422                                    \p{Buhd}; NOT \p{Block=Buhid}) (22)
1423        X \p{Byzantine_Music}     \p{Byzantine_Musical_Symbols} (= \p{Block=
1424                                    Byzantine_Musical_Symbols}) (256)
1425        X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1426                                    (Short: \p{InByzantineMusic}) (256)
1427          \p{C} \pC               \p{Other} (= \p{General_Category=Other})
1428                                    (977_573 plus all above-Unicode code
1429                                    points)
1430          \p{Cakm}                \p{Chakma} (= \p{Script_Extensions=
1431                                    Chakma}) (NOT \p{Block=Chakma}) (87)
1432          \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1433                                    (Short: \p{Cans}) (710)
1434        X \p{Canadian_Syllabics}  \p{Unified_Canadian_Aboriginal_Syllabics}
1435                                    (= \p{Block=
1436                                    Unified_Canadian_Aboriginal_Syllabics})
1437                                    (640)
1438        T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1439                                    Not_Reordered} (1_113_286 plus all
1440                                    above-Unicode code points)
1441        T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1442                                    Overlay} (32)
1443        T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1444                                    Nukta} (23)
1445        T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1446                                    Kana_Voicing} (2)
1447        T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1448                                    Virama} (54)
1449        T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1450                                    CCC10} (1)
1451          \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
1452                                    U+05B0)
1453        T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1454                                    CCC11} (1)
1455          \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
1456                                    U+05B1)
1457        T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1458                                    CCC12} (1)
1459          \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
1460                                    U+05B2)
1461        T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1462                                    CCC13} (1)
1463          \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
1464                                    U+05B3)
1465        T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1466                                    CCC14} (1)
1467          \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
1468                                    U+05B4)
1469        T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1470                                    CCC15} (1)
1471          \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
1472                                    U+05B5)
1473        T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1474                                    CCC16} (1)
1475          \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
1476                                    U+05B6)
1477        T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1478                                    CCC17} (1)
1479          \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
1480                                    U+05B7)
1481        T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1482                                    CCC18} (2)
1483          \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
1484                                    U+05B8, U+05C7)
1485        T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1486                                    CCC19} (2)
1487          \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
1488                                    U+05B9..05BA)
1489        T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1490                                    CCC20} (1)
1491          \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
1492                                    U+05BB)
1493        T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1494                                    CCC21} (1)
1495          \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
1496                                    U+05BC)
1497        T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1498                                    CCC22} (1)
1499          \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
1500                                    U+05BD)
1501        T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1502                                    CCC23} (1)
1503          \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
1504                                    U+05BF)
1505        T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1506                                    CCC24} (1)
1507          \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
1508                                    U+05C1)
1509        T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1510                                    CCC25} (1)
1511          \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
1512                                    U+05C2)
1513        T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1514                                    CCC26} (1)
1515          \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
1516                                    U+FB1E)
1517        T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1518                                    CCC27} (2)
1519          \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
1520                                    U+064B, U+08F0)
1521        T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1522                                    CCC28} (2)
1523          \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
1524                                    U+064C, U+08F1)
1525        T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1526                                    CCC29} (2)
1527          \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
1528                                    U+064D, U+08F2)
1529        T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1530                                    CCC30} (2)
1531          \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
1532                                    U+0618, U+064E)
1533        T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1534                                    CCC31} (2)
1535          \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
1536                                    U+0619, U+064F)
1537        T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1538                                    CCC32} (2)
1539          \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
1540                                    U+061A, U+0650)
1541        T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1542                                    CCC33} (1)
1543          \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
1544                                    U+0651)
1545        T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1546                                    CCC34} (1)
1547          \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
1548                                    U+0652)
1549        T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1550                                    CCC35} (1)
1551          \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
1552                                    U+0670)
1553        T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1554                                    CCC36} (1)
1555          \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
1556                                    U+0711)
1557        T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1558                                    CCC84} (1)
1559          \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
1560                                    U+0C55)
1561        T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1562                                    CCC91} (1)
1563          \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
1564                                    U+0C56)
1565        T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1566                                    CCC103} (2)
1567          \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
1568                                    U+0E38..0E39)
1569        T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1570                                    CCC107} (4)
1571          \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
1572                                    U+0E48..0E4B)
1573        T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1574                                    CCC118} (2)
1575          \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
1576                                    U+0EB8..0EB9)
1577        T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1578                                    CCC122} (4)
1579          \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
1580                                    U+0EC8..0ECB)
1581        T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1582                                    CCC129} (1)
1583          \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
1584                                    U+0F71)
1585        T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1586                                    CCC130} (6)
1587          \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
1588                                    U+0F72, U+0F7A..0F7D, U+0F80)
1589        T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1590                                    CCC132} (1)
1591          \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
1592                                    U+0F74)
1593        T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1594                                    CCC133} (0)
1595          \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1596        T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1597                                    Attached_Below_Left} (0)
1598        T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1599                                    Attached_Below} (5)
1600        T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1601                                    Attached_Above} (1)
1602        T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1603                                    Attached_Above_Right} (9)
1604        T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1605                                    Below_Left} (1)
1606        T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1607                                    Below} (154)
1608        T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1609                                    Below_Right} (4)
1610        T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1611                                    Left} (2)
1612        T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1613                                    Right} (1)
1614        T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1615                                    Above_Left} (5)
1616        T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1617                                    Above} (461)
1618        T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1619                                    Above_Right} (5)
1620        T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1621                                    Double_Below} (4)
1622        T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1623                                    Double_Above} (5)
1624        T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1625                                    Iota_Subscript} (1)
1626          \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1627                                    Above} (461)
1628          \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (461:
1629                                    U+0300..0314, U+033D..0344, U+0346,
1630                                    U+034A..034C, U+0350..0352, U+0357 ...)
1631          \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
1632                                    U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
1633          \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (5:
1634                                    U+0315, U+031A, U+0358, U+1DF6, U+302C)
1635          \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1636                                    Above_Left} (5)
1637          \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1638                                    Above_Right} (5)
1639          \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1640                                    Attached_Above} (1)
1641          \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1642                                    Attached_Above_Right} (9)
1643          \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1644                                    Attached_Below} (5)
1645          \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1646                                    Attached_Below_Left} (0)
1647          \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1648                                    (1: U+1DCE)
1649          \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1650                                    \p{Ccc=ATAR}) (9: U+031B, U+0F39,
1651                                    U+1D165..1D166, U+1D16E..1D172)
1652          \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1653                                    (5: U+0321..0322, U+0327..0328, U+1DD0)
1654          \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1655                                    ATBL}) (0)
1656          \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1657                                    Below} (154)
1658          \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (154:
1659                                    U+0316..0319, U+031C..0320,
1660                                    U+0323..0326, U+0329..0333,
1661                                    U+0339..033C, U+0347..0349 ...)
1662          \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1:
1663                                    U+302A)
1664          \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
1665                                    U+059A, U+05AD, U+1939, U+302D)
1666          \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1667                                    Below_Left} (1)
1668          \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1669                                    Below_Right} (4)
1670          \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1671                                    Double_Above} (5)
1672          \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1673                                    Double_Below} (4)
1674          \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
1675                                    (5: U+035D..035E, U+0360..0361, U+1DCD)
1676          \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
1677                                    (4: U+035C, U+035F, U+0362, U+1DFC)
1678          \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1679                                    (1: U+0345)
1680          \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1681                                    Iota_Subscript} (1)
1682          \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
1683                                    (2: U+3099..309A)
1684          \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1685                                    Kana_Voicing} (2)
1686          \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1687                                    Left} (2)
1688          \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
1689                                    U+302E..302F)
1690          \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1691                                    Nukta} (23)
1692          \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1693                                    (1_113_286 plus all above-Unicode code
1694                                    points: U+0000..02FF, U+034F,
1695                                    U+0370..0482, U+0488..0590, U+05BE,
1696                                    U+05C0 ...)
1697          \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1698                                    Not_Reordered} (1_113_286 plus all
1699                                    above-Unicode code points)
1700          \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (23:
1701                                    U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
1702                                    U+0CBC ...)
1703          \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1704                                    Overlay} (32)
1705          \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
1706                                    U+0334..0338, U+1CD4, U+1CE2..1CE8,
1707                                    U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
1708                                    ...)
1709          \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1710                                    Right} (1)
1711          \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
1712                                    U+1D16D)
1713          \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (54:
1714                                    U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
1715                                    U+0BCD ...)
1716          \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1717                                    Virama} (54)
1718          \p{Cans}                \p{Canadian_Aboriginal} (=
1719                                    \p{Script_Extensions=
1720                                    Canadian_Aboriginal}) (710)
1721          \p{Cari}                \p{Carian} (= \p{Script_Extensions=
1722                                    Carian}) (NOT \p{Block=Carian}) (49)
1723          \p{Carian}              \p{Script_Extensions=Carian} (Short:
1724                                    \p{Cari}; NOT \p{Block=Carian}) (49)
1725          \p{Case_Ignorable}      \p{Case_Ignorable=Y} (Short: \p{CI}) (2314)
1726          \p{Case_Ignorable: N*}  (Short: \p{CI=N}, \P{CI}) (1_111_798 plus
1727                                    all above-Unicode code points: [\x00-
1728                                    \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-
1729                                    Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
1730                                    \xac\xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
1731                                    U+0100..02AF, U+0370..0373,
1732                                    U+0376..0379, U+037B..0383, U+0386 ...)
1733          \p{Case_Ignorable: Y*}  (Short: \p{CI=Y}, \p{CI}) (2314:
1734                                    [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
1735                                    U+02B0..036F, U+0374..0375, U+037A,
1736                                    U+0384..0385, U+0387 ...)
1737          \p{Cased}               \p{Cased=Y} (4105)
1738          \p{Cased: N*}           (Single: \P{Cased}) (1_110_007 plus all
1739                                    above-Unicode code points: [\x00-
1740                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1741                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
1742                                    \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7], U+01BB,
1743                                    U+01C0..01C3, U+0294, U+02B9..02BF,
1744                                    U+02C2..02DF ...)
1745          \p{Cased: Y*}           (Single: \p{Cased}) (4105: [A-Za-
1746                                    z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
1747                                    \xff], U+0100..01BA, U+01BC..01BF,
1748                                    U+01C4..0293, U+0295..02B8, U+02C0..02C1
1749                                    ...)
1750          \p{Cased_Letter}        \p{General_Category=Cased_Letter} (Short:
1751                                    \p{LC}) (3796)
1752          \p{Category: *}         \p{General_Category: *}
1753          \p{Caucasian_Albanian}  \p{Script_Extensions=Caucasian_Albanian}
1754                                    (Short: \p{Aghb}; NOT \p{Block=
1755                                    Caucasian_Albanian}) (53)
1756          \p{Cc}                  \p{XPosixCntrl} (= \p{General_Category=
1757                                    Control}) (65)
1758          \p{Ccc: *}              \p{Canonical_Combining_Class: *}
1759          \p{CE}                  \p{Composition_Exclusion} (=
1760                                    \p{Composition_Exclusion=Y}) (81)
1761          \p{CE: *}               \p{Composition_Exclusion: *}
1762          \p{Cf}                  \p{Format} (= \p{General_Category=Format})
1763                                    (151)
1764          \p{Chakma}              \p{Script_Extensions=Chakma} (Short:
1765                                    \p{Cakm}; NOT \p{Block=Chakma}) (87)
1766          \p{Cham}                \p{Script_Extensions=Cham} (NOT \p{Block=
1767                                    Cham}) (83)
1768          \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
1769                                    \p{CWCF}) (1377)
1770          \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
1771                                    (1_112_735 plus all above-Unicode code
1772                                    points: [\x00-
1773                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1774                                    >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\xb4\xb6-
1775                                    \xbf\xd7\xe0-\xff], U+0101, U+0103,
1776                                    U+0105, U+0107, U+0109 ...)
1777          \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
1778                                    (1377: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
1779                                    U+0100, U+0102, U+0104, U+0106, U+0108
1780                                    ...)
1781          \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
1782                                    \p{CWCM}) (2669)
1783          \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
1784                                    (1_111_443 plus all above-Unicode code
1785                                    points: [\x00-
1786                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1787                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1788                                    \xbf\xd7\xf7], U+0138, U+018D, U+019B,
1789                                    U+01AA..01AB, U+01BA..01BB ...)
1790          \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
1791                                    (2669: [A-Za-z\xb5\xc0-\xd6\xd8-
1792                                    \xf6\xf8-\xff], U+0100..0137,
1793                                    U+0139..018C, U+018E..019A,
1794                                    U+019C..01A9, U+01AC..01B9 ...)
1795          \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
1796                                    \p{CWL}) (1304)
1797          \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
1798                                    (1_112_808 plus all above-Unicode code
1799                                    points: [\x00-
1800                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1801                                    >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1802                                    \xbf\xd7\xdf-\xff], U+0101, U+0103,
1803                                    U+0105, U+0107, U+0109 ...)
1804          \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1304:
1805                                    [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
1806                                    U+0104, U+0106, U+0108 ...)
1807          \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
1808                                    Y} (Short: \p{CWKCF}) (10_227)
1809          \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
1810                                    \P{CWKCF}) (1_103_885 plus all above-
1811                                    Unicode code points: [\x00-
1812                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1813                                    >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
1814                                    \xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6-
1815                                    \xb7\xbb\xbf\xd7\xe0-\xff], U+0101,
1816                                    U+0103, U+0105, U+0107, U+0109 ...)
1817          \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
1818                                    \p{CWKCF}) (10_227: [A-
1819                                    Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
1820                                    \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf],
1821                                    U+0100, U+0102, U+0104, U+0106, U+0108
1822                                    ...)
1823          \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
1824                                    \p{CWT}) (1369)
1825          \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
1826                                    (1_112_743 plus all above-Unicode code
1827                                    points: [\x00-
1828                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
1829                                    Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1830                                    \xde\xf7], U+0100, U+0102, U+0104,
1831                                    U+0106, U+0108 ...)
1832          \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1369:
1833                                    [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1834                                    U+0103, U+0105, U+0107, U+0109 ...)
1835          \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
1836                                    \p{CWU}) (1396)
1837          \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
1838                                    (1_112_716 plus all above-Unicode code
1839                                    points: [\x00-
1840                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
1841                                    Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1842                                    \xde\xf7], U+0100, U+0102, U+0104,
1843                                    U+0106, U+0108 ...)
1844          \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1396:
1845                                    [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1846                                    U+0103, U+0105, U+0107, U+0109 ...)
1847          \p{Cher}                \p{Cherokee} (= \p{Script_Extensions=
1848                                    Cherokee}) (NOT \p{Block=Cherokee}) (172)
1849          \p{Cherokee}            \p{Script_Extensions=Cherokee} (Short:
1850                                    \p{Cher}; NOT \p{Block=Cherokee}) (172)
1851        X \p{Cherokee_Sup}        \p{Cherokee_Supplement} (= \p{Block=
1852                                    Cherokee_Supplement}) (80)
1853        X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
1854                                    \p{InCherokeeSup}) (80)
1855          \p{CI}                  \p{Case_Ignorable} (= \p{Case_Ignorable=
1856                                    Y}) (2314)
1857          \p{CI: *}               \p{Case_Ignorable: *}
1858        X \p{CJK}                 \p{CJK_Unified_Ideographs} (= \p{Block=
1859                                    CJK_Unified_Ideographs}) (20_992)
1860        X \p{CJK_Compat}          \p{CJK_Compatibility} (= \p{Block=
1861                                    CJK_Compatibility}) (256)
1862        X \p{CJK_Compat_Forms}    \p{CJK_Compatibility_Forms} (= \p{Block=
1863                                    CJK_Compatibility_Forms}) (32)
1864        X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
1865                                    \p{Block=CJK_Compatibility_Ideographs})
1866                                    (512)
1867        X \p{CJK_Compat_Ideographs_Sup}
1868                                    \p{CJK_Compatibility_Ideographs_-
1869                                    Supplement} (= \p{Block=
1870                                    CJK_Compatibility_Ideographs_-
1871                                    Supplement}) (544)
1872        X \p{CJK_Compatibility}   \p{Block=CJK_Compatibility} (Short:
1873                                    \p{InCJKCompat}) (256)
1874        X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
1875                                    (Short: \p{InCJKCompatForms}) (32)
1876        X \p{CJK_Compatibility_Ideographs} \p{Block=
1877                                    CJK_Compatibility_Ideographs} (Short:
1878                                    \p{InCJKCompatIdeographs}) (512)
1879        X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
1880                                    CJK_Compatibility_Ideographs_Supplement}
1881                                    (Short: \p{InCJKCompatIdeographsSup})
1882                                    (544)
1883        X \p{CJK_Ext_A}           \p{CJK_Unified_Ideographs_Extension_A} (=
1884                                    \p{Block=
1885                                    CJK_Unified_Ideographs_Extension_A})
1886                                    (6592)
1887        X \p{CJK_Ext_B}           \p{CJK_Unified_Ideographs_Extension_B} (=
1888                                    \p{Block=
1889                                    CJK_Unified_Ideographs_Extension_B})
1890                                    (42_720)
1891        X \p{CJK_Ext_C}           \p{CJK_Unified_Ideographs_Extension_C} (=
1892                                    \p{Block=
1893                                    CJK_Unified_Ideographs_Extension_C})
1894                                    (4160)
1895        X \p{CJK_Ext_D}           \p{CJK_Unified_Ideographs_Extension_D} (=
1896                                    \p{Block=
1897                                    CJK_Unified_Ideographs_Extension_D})
1898                                    (224)
1899        X \p{CJK_Ext_E}           \p{CJK_Unified_Ideographs_Extension_E} (=
1900                                    \p{Block=
1901                                    CJK_Unified_Ideographs_Extension_E})
1902                                    (5776)
1903        X \p{CJK_Ext_F}           \p{CJK_Unified_Ideographs_Extension_F} (=
1904                                    \p{Block=
1905                                    CJK_Unified_Ideographs_Extension_F})
1906                                    (7488)
1907        X \p{CJK_Radicals_Sup}    \p{CJK_Radicals_Supplement} (= \p{Block=
1908                                    CJK_Radicals_Supplement}) (128)
1909        X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
1910                                    (Short: \p{InCJKRadicalsSup}) (128)
1911        X \p{CJK_Strokes}         \p{Block=CJK_Strokes} (48)
1912        X \p{CJK_Symbols}         \p{CJK_Symbols_And_Punctuation} (=
1913                                    \p{Block=CJK_Symbols_And_Punctuation})
1914                                    (64)
1915        X \p{CJK_Symbols_And_Punctuation} \p{Block=
1916                                    CJK_Symbols_And_Punctuation} (Short:
1917                                    \p{InCJKSymbols}) (64)
1918        X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
1919                                    (Short: \p{InCJK}) (20_992)
1920        X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
1921                                    CJK_Unified_Ideographs_Extension_A}
1922                                    (Short: \p{InCJKExtA}) (6592)
1923        X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
1924                                    CJK_Unified_Ideographs_Extension_B}
1925                                    (Short: \p{InCJKExtB}) (42_720)
1926        X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
1927                                    CJK_Unified_Ideographs_Extension_C}
1928                                    (Short: \p{InCJKExtC}) (4160)
1929        X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
1930                                    CJK_Unified_Ideographs_Extension_D}
1931                                    (Short: \p{InCJKExtD}) (224)
1932        X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
1933                                    CJK_Unified_Ideographs_Extension_E}
1934                                    (Short: \p{InCJKExtE}) (5776)
1935        X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
1936                                    CJK_Unified_Ideographs_Extension_F}
1937                                    (Short: \p{InCJKExtF}) (7488)
1938          \p{Close_Punctuation}   \p{General_Category=Close_Punctuation}
1939                                    (Short: \p{Pe}) (73)
1940          \p{Cn}                  \p{Unassigned} (= \p{General_Category=
1941                                    Unassigned}) (837_841 plus all above-
1942                                    Unicode code points)
1943          \p{Cntrl}               \p{XPosixCntrl} (= \p{General_Category=
1944                                    Control}) (65)
1945          \p{Co}                  \p{Private_Use} (= \p{General_Category=
1946                                    Private_Use}) (NOT \p{Private_Use_Area})
1947                                    (137_468)
1948        X \p{Combining_Diacritical_Marks} \p{Block=
1949                                    Combining_Diacritical_Marks} (Short:
1950                                    \p{InDiacriticals}) (112)
1951        X \p{Combining_Diacritical_Marks_Extended} \p{Block=
1952                                    Combining_Diacritical_Marks_Extended}
1953                                    (Short: \p{InDiacriticalsExt}) (80)
1954        X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
1955                                    Combining_Diacritical_Marks_For_Symbols}
1956                                    (Short: \p{InDiacriticalsForSymbols})
1957                                    (48)
1958        X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
1959                                    Combining_Diacritical_Marks_Supplement}
1960                                    (Short: \p{InDiacriticalsSup}) (64)
1961        X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
1962                                    \p{InHalfMarks}) (16)
1963          \p{Combining_Mark}      \p{Mark} (= \p{General_Category=Mark})
1964                                    (2177)
1965        X \p{Combining_Marks_For_Symbols}
1966                                    \p{Combining_Diacritical_Marks_For_-
1967                                    Symbols} (= \p{Block=
1968                                    Combining_Diacritical_Marks_For_-
1969                                    Symbols}) (48)
1970          \p{Common}              \p{Script_Extensions=Common} (Short:
1971                                    \p{Zyyy}) (6947)
1972        X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1973                                    (Short: \p{InIndicNumberForms}) (16)
1974          \p{Comp_Ex}             \p{Full_Composition_Exclusion} (=
1975                                    \p{Full_Composition_Exclusion=Y}) (1120)
1976          \p{Comp_Ex: *}          \p{Full_Composition_Exclusion: *}
1977        X \p{Compat_Jamo}         \p{Hangul_Compatibility_Jamo} (= \p{Block=
1978                                    Hangul_Compatibility_Jamo}) (96)
1979          \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
1980                                    \p{CE}) (81)
1981          \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
1982                                    plus all above-Unicode code points:
1983                                    U+0000..0957, U+0960..09DB, U+09DE,
1984                                    U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
1985                                    ...)
1986          \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
1987                                    U+0958..095F, U+09DC..09DD, U+09DF,
1988                                    U+0A33, U+0A36, U+0A59..0A5B ...)
1989          \p{Connector_Punctuation} \p{General_Category=
1990                                    Connector_Punctuation} (Short: \p{Pc})
1991                                    (10)
1992          \p{Control}             \p{XPosixCntrl} (= \p{General_Category=
1993                                    Control}) (65)
1994        X \p{Control_Pictures}    \p{Block=Control_Pictures} (64)
1995          \p{Copt}                \p{Coptic} (= \p{Script_Extensions=
1996                                    Coptic}) (NOT \p{Block=Coptic}) (165)
1997          \p{Coptic}              \p{Script_Extensions=Coptic} (Short:
1998                                    \p{Copt}; NOT \p{Block=Coptic}) (165)
1999        X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
2000        X \p{Counting_Rod}        \p{Counting_Rod_Numerals} (= \p{Block=
2001                                    Counting_Rod_Numerals}) (32)
2002        X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
2003                                    \p{InCountingRod}) (32)
2004          \p{Cprt}                \p{Cypriot} (= \p{Script_Extensions=
2005                                    Cypriot}) (112)
2006          \p{Cs}                  \p{Surrogate} (= \p{General_Category=
2007                                    Surrogate}) (2048)
2008          \p{Cuneiform}           \p{Script_Extensions=Cuneiform} (Short:
2009                                    \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
2010        X \p{Cuneiform_Numbers}   \p{Cuneiform_Numbers_And_Punctuation} (=
2011                                    \p{Block=
2012                                    Cuneiform_Numbers_And_Punctuation}) (128)
2013        X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
2014                                    Cuneiform_Numbers_And_Punctuation}
2015                                    (Short: \p{InCuneiformNumbers}) (128)
2016          \p{Currency_Symbol}     \p{General_Category=Currency_Symbol}
2017                                    (Short: \p{Sc}) (54)
2018        X \p{Currency_Symbols}    \p{Block=Currency_Symbols} (48)
2019          \p{CWCF}                \p{Changes_When_Casefolded} (=
2020                                    \p{Changes_When_Casefolded=Y}) (1377)
2021          \p{CWCF: *}             \p{Changes_When_Casefolded: *}
2022          \p{CWCM}                \p{Changes_When_Casemapped} (=
2023                                    \p{Changes_When_Casemapped=Y}) (2669)
2024          \p{CWCM: *}             \p{Changes_When_Casemapped: *}
2025          \p{CWKCF}               \p{Changes_When_NFKC_Casefolded} (=
2026                                    \p{Changes_When_NFKC_Casefolded=Y})
2027                                    (10_227)
2028          \p{CWKCF: *}            \p{Changes_When_NFKC_Casefolded: *}
2029          \p{CWL}                 \p{Changes_When_Lowercased} (=
2030                                    \p{Changes_When_Lowercased=Y}) (1304)
2031          \p{CWL: *}              \p{Changes_When_Lowercased: *}
2032          \p{CWT}                 \p{Changes_When_Titlecased} (=
2033                                    \p{Changes_When_Titlecased=Y}) (1369)
2034          \p{CWT: *}              \p{Changes_When_Titlecased: *}
2035          \p{CWU}                 \p{Changes_When_Uppercased} (=
2036                                    \p{Changes_When_Uppercased=Y}) (1396)
2037          \p{CWU: *}              \p{Changes_When_Uppercased: *}
2038          \p{Cypriot}             \p{Script_Extensions=Cypriot} (Short:
2039                                    \p{Cprt}) (112)
2040        X \p{Cypriot_Syllabary}   \p{Block=Cypriot_Syllabary} (64)
2041          \p{Cyrillic}            \p{Script_Extensions=Cyrillic} (Short:
2042                                    \p{Cyrl}; NOT \p{Block=Cyrillic}) (446)
2043        X \p{Cyrillic_Ext_A}      \p{Cyrillic_Extended_A} (= \p{Block=
2044                                    Cyrillic_Extended_A}) (32)
2045        X \p{Cyrillic_Ext_B}      \p{Cyrillic_Extended_B} (= \p{Block=
2046                                    Cyrillic_Extended_B}) (96)
2047        X \p{Cyrillic_Ext_C}      \p{Cyrillic_Extended_C} (= \p{Block=
2048                                    Cyrillic_Extended_C}) (16)
2049        X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
2050                                    \p{InCyrillicExtA}) (32)
2051        X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
2052                                    \p{InCyrillicExtB}) (96)
2053        X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
2054                                    \p{InCyrillicExtC}) (16)
2055        X \p{Cyrillic_Sup}        \p{Cyrillic_Supplement} (= \p{Block=
2056                                    Cyrillic_Supplement}) (48)
2057        X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
2058                                    \p{InCyrillicSup}) (48)
2059        X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
2060                                    Cyrillic_Supplement}) (48)
2061          \p{Cyrl}                \p{Cyrillic} (= \p{Script_Extensions=
2062                                    Cyrillic}) (NOT \p{Block=Cyrillic}) (446)
2063          \p{Dash}                \p{Dash=Y} (28)
2064          \p{Dash: N*}            (Single: \P{Dash}) (1_114_084 plus all
2065                                    above-Unicode code points: [\x00-
2066                                    \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
2067                                    Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2068                                    U+0100..0589, U+058B..05BD,
2069                                    U+05BF..13FF, U+1401..1805, U+1807..200F
2070                                    ...)
2071          \p{Dash: Y*}            (Single: \p{Dash}) (28: [\-], U+058A,
2072                                    U+05BE, U+1400, U+1806, U+2010..2015 ...)
2073          \p{Dash_Punctuation}    \p{General_Category=Dash_Punctuation}
2074                                    (Short: \p{Pd}) (24)
2075          \p{Decimal_Number}      \p{XPosixDigit} (= \p{General_Category=
2076                                    Decimal_Number}) (590)
2077          \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
2078                                    (13_232)
2079          \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232:
2080                                    [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-
2081                                    \xdd\xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-
2082                                    \xfd\xff], U+0100..010F, U+0112..0125,
2083                                    U+0128..0130, U+0134..0137, U+0139..013E
2084                                    ...)
2085          \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
2086                                    U+2460..2473, U+24B6..24EA,
2087                                    U+3244..3247, U+3251..327E,
2088                                    U+3280..32BF, U+32D0..32FE ...)
2089          \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
2090          \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720:
2091                                    [\xa8\xaf\xb4-\xb5\xb8], U+0132..0133,
2092                                    U+013F..0140, U+0149, U+017F,
2093                                    U+01C4..01CC ...)
2094          \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
2095          \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
2096          \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
2097                                    U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
2098                                    ...)
2099          \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184: U+2102,
2100                                    U+210A..2113, U+2115, U+2119..211D,
2101                                    U+2124, U+2128 ...)
2102          \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
2103          \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
2104                                    \xbe], U+2150..215F, U+2189)
2105          \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
2106          \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
2107                                    U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
2108                                    ...)
2109          \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
2110          \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
2111                                    U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
2112                                    ...)
2113          \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
2114          \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
2115                                    U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
2116                                    ...)
2117          \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
2118          \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
2119                                    U+FF61..FFBE, U+FFC2..FFC7,
2120                                    U+FFCA..FFCF, U+FFD2..FFD7,
2121                                    U+FFDA..FFDC, U+FFE8..FFEE)
2122          \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
2123          \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
2124                                    U+0F0C, U+2007, U+2011, U+202F)
2125          \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
2126                                    Non_Canonical} (Perl extension) (3662)
2127          \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
2128                                    decompositions (Short: \p{Dt=NonCanon})
2129                                    (Perl extension) (3662:
2130                                    [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-\xba\xbc-
2131                                    \xbe], U+0132..0133, U+013F..0140,
2132                                    U+0149, U+017F, U+01C4..01CC ...)
2133          \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_218 plus
2134                                    all above-Unicode code points: [\x00-
2135                                    \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
2136                                    \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
2137                                    \xdf\xe6\xf0\xf7-\xf8\xfe],
2138                                    U+0110..0111, U+0126..0127, U+0131,
2139                                    U+0138, U+0141..0142 ...)
2140          \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
2141                                    U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
2142          \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
2143          \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (285)
2144          \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (285: U+3250,
2145                                    U+32CC..32CF, U+3300..3357,
2146                                    U+3371..33DF, U+33FF, U+1F130..1F14F ...)
2147          \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38: U+1D62..1D6A,
2148                                    U+2080..208E, U+2090..209C, U+2C7C)
2149          \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (152)
2150          \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (152: [\xaa\xb2-
2151                                    \xb3\xb9-\xba], U+02B0..02B8,
2152                                    U+02E0..02E4, U+10FC, U+1D2C..1D2E,
2153                                    U+1D30..1D3A ...)
2154          \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
2155          \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
2156                                    U+30FF, U+FE10..FE19, U+FE30..FE44,
2157                                    U+FE47..FE48)
2158          \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
2159                                    U+FF01..FF60, U+FFE0..FFE6)
2160          \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
2161                                    Y} (Short: \p{DI}) (4173)
2162          \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
2163                                    (1_109_939 plus all above-Unicode code
2164                                    points: [\x00-\xac\xae-\xff],
2165                                    U+0100..034E, U+0350..061B,
2166                                    U+061D..115E, U+1161..17B3, U+17B6..180A
2167                                    ...)
2168          \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
2169                                    (4173: [\xad], U+034F, U+061C,
2170                                    U+115F..1160, U+17B4..17B5, U+180B..180E
2171                                    ...)
2172          \p{Dep}                 \p{Deprecated} (= \p{Deprecated=Y}) (15)
2173          \p{Dep: *}              \p{Deprecated: *}
2174          \p{Deprecated}          \p{Deprecated=Y} (Short: \p{Dep}) (15)
2175          \p{Deprecated: N*}      (Short: \p{Dep=N}, \P{Dep}) (1_114_097
2176                                    plus all above-Unicode code points:
2177                                    U+0000..0148, U+014A..0672,
2178                                    U+0674..0F76, U+0F78, U+0F7A..17A2,
2179                                    U+17A5..2069 ...)
2180          \p{Deprecated: Y*}      (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
2181                                    U+0673, U+0F77, U+0F79, U+17A3..17A4,
2182                                    U+206A..206F ...)
2183          \p{Deseret}             \p{Script_Extensions=Deseret} (Short:
2184                                    \p{Dsrt}) (80)
2185          \p{Deva}                \p{Devanagari} (= \p{Script_Extensions=
2186                                    Devanagari}) (NOT \p{Block=Devanagari})
2187                                    (210)
2188          \p{Devanagari}          \p{Script_Extensions=Devanagari} (Short:
2189                                    \p{Deva}; NOT \p{Block=Devanagari}) (210)
2190        X \p{Devanagari_Ext}      \p{Devanagari_Extended} (= \p{Block=
2191                                    Devanagari_Extended}) (32)
2192        X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
2193                                    \p{InDevanagariExt}) (32)
2194          \p{DI}                  \p{Default_Ignorable_Code_Point} (=
2195                                    \p{Default_Ignorable_Code_Point=Y})
2196                                    (4173)
2197          \p{DI: *}               \p{Default_Ignorable_Code_Point: *}
2198          \p{Dia}                 \p{Diacritic} (= \p{Diacritic=Y}) (798)
2199          \p{Dia: *}              \p{Diacritic: *}
2200          \p{Diacritic}           \p{Diacritic=Y} (Short: \p{Dia}) (798)
2201          \p{Diacritic: N*}       (Short: \p{Dia=N}, \P{Dia}) (1_113_314
2202                                    plus all above-Unicode code points:
2203                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2204                                    >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
2205                                    \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
2206                                    U+0100..02AF, U+034F, U+0358..035C,
2207                                    U+0363..0373, U+0376..0379 ...)
2208          \p{Diacritic: Y*}       (Short: \p{Dia=Y}, \p{Dia}) (798:
2209                                    [\^`\xa8\xaf\xb4\xb7-\xb8],
2210                                    U+02B0..034E, U+0350..0357,
2211                                    U+035D..0362, U+0374..0375, U+037A ...)
2212        X \p{Diacriticals}        \p{Combining_Diacritical_Marks} (=
2213                                    \p{Block=Combining_Diacritical_Marks})
2214                                    (112)
2215        X \p{Diacriticals_Ext}    \p{Combining_Diacritical_Marks_Extended}
2216                                    (= \p{Block=
2217                                    Combining_Diacritical_Marks_Extended})
2218                                    (80)
2219        X \p{Diacriticals_For_Symbols}
2220                                    \p{Combining_Diacritical_Marks_For_-
2221                                    Symbols} (= \p{Block=
2222                                    Combining_Diacritical_Marks_For_-
2223                                    Symbols}) (48)
2224        X \p{Diacriticals_Sup}    \p{Combining_Diacritical_Marks_Supplement}
2225                                    (= \p{Block=
2226                                    Combining_Diacritical_Marks_Supplement})
2227                                    (64)
2228          \p{Digit}               \p{XPosixDigit} (= \p{General_Category=
2229                                    Decimal_Number}) (590)
2230        X \p{Dingbats}            \p{Block=Dingbats} (192)
2231        X \p{Domino}              \p{Domino_Tiles} (= \p{Block=
2232                                    Domino_Tiles}) (112)
2233        X \p{Domino_Tiles}        \p{Block=Domino_Tiles} (Short:
2234                                    \p{InDomino}) (112)
2235          \p{Dsrt}                \p{Deseret} (= \p{Script_Extensions=
2236                                    Deseret}) (80)
2237          \p{Dt: *}               \p{Decomposition_Type: *}
2238          \p{Dupl}                \p{Duployan} (= \p{Script_Extensions=
2239                                    Duployan}) (NOT \p{Block=Duployan}) (147)
2240          \p{Duployan}            \p{Script_Extensions=Duployan} (Short:
2241                                    \p{Dupl}; NOT \p{Block=Duployan}) (147)
2242          \p{Ea: *}               \p{East_Asian_Width: *}
2243        X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
2244                                    (208)
2245          \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
2246          \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739:
2247                                    [\xa1\xa4\xa7-\xa8\xaa\xad-\xae\xb0-
2248                                    \xb4\xb6-\xba\xbc-\xbf\xc6\xd0\xd7-
2249                                    \xd8\xde-\xe1\xe6\xe8-\xea\xec-
2250                                    \xed\xf0\xf2-\xf3\xf7-\xfa\xfc\xfe],
2251                                    U+0101, U+0111, U+0113, U+011B,
2252                                    U+0126..0127 ...)
2253          \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
2254          \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
2255                                    U+FF01..FF60, U+FFE0..FFE6)
2256          \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
2257          \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
2258                                    U+FF61..FFBE, U+FFC2..FFC7,
2259                                    U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
2260                                    ...)
2261          \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (793_401 plus
2262                                    all above-Unicode code points)
2263          \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
2264          \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-
2265                                    \x7e\xa2-\xa3\xa5-\xa6\xac\xaf],
2266                                    U+27E6..27ED, U+2985..2986)
2267          \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (793_401 plus all
2268                                    above-Unicode code points: [\x00-
2269                                    \x1f\x7f-\xa0\xa9\xab\xb5\xbb\xc0-
2270                                    \xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd\xe2-
2271                                    \xe5\xe7\xeb\xee-\xef\xf1\xf4-
2272                                    \xf6\xfb\xfd\xff], U+00FF..0100,
2273                                    U+0102..0110, U+0112, U+0114..011A,
2274                                    U+011C..0125 ...)
2275          \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (181_634)
2276          \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (181_634:
2277                                    U+1100..115F, U+231A..231B,
2278                                    U+2329..232A, U+23E9..23EC, U+23F0,
2279                                    U+23F3 ...)
2280          \p{Egyp}                \p{Egyptian_Hieroglyphs} (=
2281                                    \p{Script_Extensions=
2282                                    Egyptian_Hieroglyphs}) (NOT \p{Block=
2283                                    Egyptian_Hieroglyphs}) (1071)
2284          \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
2285                                    Egyptian_Hieroglyphs} (Short: \p{Egyp};
2286                                    NOT \p{Block=Egyptian_Hieroglyphs})
2287                                    (1071)
2288          \p{Elba}                \p{Elbasan} (= \p{Script_Extensions=
2289                                    Elbasan}) (NOT \p{Block=Elbasan}) (40)
2290          \p{Elbasan}             \p{Script_Extensions=Elbasan} (Short:
2291                                    \p{Elba}; NOT \p{Block=Elbasan}) (40)
2292        X \p{Emoticons}           \p{Block=Emoticons} (80)
2293        X \p{Enclosed_Alphanum}   \p{Enclosed_Alphanumerics} (= \p{Block=
2294                                    Enclosed_Alphanumerics}) (160)
2295        X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
2296                                    \p{Block=
2297                                    Enclosed_Alphanumeric_Supplement}) (256)
2298        X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
2299                                    Enclosed_Alphanumeric_Supplement}
2300                                    (Short: \p{InEnclosedAlphanumSup}) (256)
2301        X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
2302                                    (Short: \p{InEnclosedAlphanum}) (160)
2303        X \p{Enclosed_CJK}        \p{Enclosed_CJK_Letters_And_Months} (=
2304                                    \p{Block=
2305                                    Enclosed_CJK_Letters_And_Months}) (256)
2306        X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
2307                                    Enclosed_CJK_Letters_And_Months} (Short:
2308                                    \p{InEnclosedCJK}) (256)
2309        X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
2310                                    (= \p{Block=
2311                                    Enclosed_Ideographic_Supplement}) (256)
2312        X \p{Enclosed_Ideographic_Supplement} \p{Block=
2313                                    Enclosed_Ideographic_Supplement} (Short:
2314                                    \p{InEnclosedIdeographicSup}) (256)
2315          \p{Enclosing_Mark}      \p{General_Category=Enclosing_Mark}
2316                                    (Short: \p{Me}) (13)
2317          \p{Ethi}                \p{Ethiopic} (= \p{Script_Extensions=
2318                                    Ethiopic}) (NOT \p{Block=Ethiopic}) (495)
2319          \p{Ethiopic}            \p{Script_Extensions=Ethiopic} (Short:
2320                                    \p{Ethi}; NOT \p{Block=Ethiopic}) (495)
2321        X \p{Ethiopic_Ext}        \p{Ethiopic_Extended} (= \p{Block=
2322                                    Ethiopic_Extended}) (96)
2323        X \p{Ethiopic_Ext_A}      \p{Ethiopic_Extended_A} (= \p{Block=
2324                                    Ethiopic_Extended_A}) (48)
2325        X \p{Ethiopic_Extended}   \p{Block=Ethiopic_Extended} (Short:
2326                                    \p{InEthiopicExt}) (96)
2327        X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
2328                                    \p{InEthiopicExtA}) (48)
2329        X \p{Ethiopic_Sup}        \p{Ethiopic_Supplement} (= \p{Block=
2330                                    Ethiopic_Supplement}) (32)
2331        X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
2332                                    \p{InEthiopicSup}) (32)
2333          \p{Ext}                 \p{Extender} (= \p{Extender=Y}) (44)
2334          \p{Ext: *}              \p{Extender: *}
2335          \p{Extender}            \p{Extender=Y} (Short: \p{Ext}) (44)
2336          \p{Extender: N*}        (Short: \p{Ext=N}, \P{Ext}) (1_114_068
2337                                    plus all above-Unicode code points:
2338                                    [\x00-\xb6\xb8-\xff], U+0100..02CF,
2339                                    U+02D2..063F, U+0641..07F9,
2340                                    U+07FB..0E45, U+0E47..0EC5 ...)
2341          \p{Extender: Y*}        (Short: \p{Ext=Y}, \p{Ext}) (44: [\xb7],
2342                                    U+02D0..02D1, U+0640, U+07FA, U+0E46,
2343                                    U+0EC6 ...)
2344          \p{Final_Punctuation}   \p{General_Category=Final_Punctuation}
2345                                    (Short: \p{Pf}) (10)
2346          \p{Format}              \p{General_Category=Format} (Short:
2347                                    \p{Cf}) (151)
2348          \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
2349                                    (Short: \p{CompEx}) (1120)
2350          \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
2351                                    \P{CompEx}) (1_112_992 plus all above-
2352                                    Unicode code points: U+0000..033F,
2353                                    U+0342, U+0345..0373, U+0375..037D,
2354                                    U+037F..0386, U+0388..0957 ...)
2355          \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
2356                                    \p{CompEx}) (1120: U+0340..0341,
2357                                    U+0343..0344, U+0374, U+037E, U+0387,
2358                                    U+0958..095F ...)
2359          \p{Gc: *}               \p{General_Category: *}
2360          \p{GCB: *}              \p{Grapheme_Cluster_Break: *}
2361          \p{General_Category: C} \p{General_Category=Other} (977_573 plus
2362                                    all above-Unicode code points)
2363          \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
2364                                    \p{Gc=LC}, \p{LC}) (3796: [A-Za-
2365                                    z\xb5\xc0-\xd6\xd8-\xf6\xf8-\xff],
2366                                    U+0100..01BA, U+01BC..01BF,
2367                                    U+01C4..0293, U+0295..02AF, U+0370..0373
2368                                    ...)
2369          \p{General_Category: Cc} \p{General_Category=Control} (65)
2370          \p{General_Category: Cf} \p{General_Category=Format} (151)
2371          \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
2372                                    (73: [\)\]\}], U+0F3B, U+0F3D, U+169C,
2373                                    U+2046, U+207E ...)
2374          \p{General_Category: Cn} \p{General_Category=Unassigned} (837_841
2375                                    plus all above-Unicode code points)
2376          \p{General_Category: Cntrl} \p{General_Category=Control} (65)
2377          \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
2378          \p{General_Category: Combining_Mark} \p{General_Category=Mark}
2379                                    (2177)
2380          \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
2381                                    \p{Pc}) (10: [_], U+203F..2040, U+2054,
2382                                    U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
2383          \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
2384                                    [\x00-\x1f\x7f-\x9f])
2385          \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
2386          \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
2387                                    (54: [\$\xa2-\xa5], U+058F, U+060B,
2388                                    U+09F2..09F3, U+09FB, U+0AF1 ...)
2389          \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
2390                                    (24: [\-], U+058A, U+05BE, U+1400,
2391                                    U+1806, U+2010..2015 ...)
2392          \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
2393                                    (590: [0-9], U+0660..0669, U+06F0..06F9,
2394                                    U+07C0..07C9, U+0966..096F, U+09E6..09EF
2395                                    ...)
2396          \p{General_Category: Digit} \p{General_Category=Decimal_Number}
2397                                    (590)
2398          \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
2399                                    (13: U+0488..0489, U+1ABE, U+20DD..20E0,
2400                                    U+20E2..20E4, U+A670..A672)
2401          \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
2402                                    (10: [\xbb], U+2019, U+201D, U+203A,
2403                                    U+2E03, U+2E05 ...)
2404          \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (151:
2405                                    [\xad], U+0600..0605, U+061C, U+06DD,
2406                                    U+070F, U+08E2 ...)
2407          \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
2408                                    \p{Pi}) (12: [\xab], U+2018,
2409                                    U+201B..201C, U+201F, U+2039, U+2E02 ...)
2410          \p{General_Category: L} \p{General_Category=Letter} (125_093)
2411        X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3796)
2412        X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
2413                                    the trailing '_' matters in spite of
2414                                    loose matching rules. (3796)
2415          \p{General_Category: LC} \p{General_Category=Cased_Letter} (3796)
2416          \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (125_093:
2417                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
2418                                    \xf6\xf8-\xff], U+0100..02C1,
2419                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
2420                                    U+02EE ...)
2421          \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
2422                                    (236: U+16EE..16F0, U+2160..2182,
2423                                    U+2185..2188, U+3007, U+3021..3029,
2424                                    U+3038..303A ...)
2425          \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
2426                                    (1: U+2028)
2427          \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
2428                                    (/i= General_Category=Cased_Letter)
2429                                    (2063)
2430          \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (250)
2431          \p{General_Category: Lo} \p{General_Category=Other_Letter}
2432                                    (121_047)
2433          \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
2434                                    /i= General_Category=Cased_Letter)
2435                                    (2063: [a-z\xb5\xdf-\xf6\xf8-\xff],
2436                                    U+0101, U+0103, U+0105, U+0107, U+0109
2437                                    ...)
2438          \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
2439                                    (/i= General_Category=Cased_Letter) (31)
2440          \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
2441                                    (/i= General_Category=Cased_Letter)
2442                                    (1702)
2443          \p{General_Category: M} \p{General_Category=Mark} (2177)
2444          \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2177:
2445                                    U+0300..036F, U+0483..0489,
2446                                    U+0591..05BD, U+05BF, U+05C1..05C2,
2447                                    U+05C4..05C5 ...)
2448          \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
2449                                    [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
2450                                    U+0606..0608, U+2044, U+2052,
2451                                    U+207A..207C ...)
2452          \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (401)
2453          \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
2454          \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
2455                                    (1763)
2456          \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
2457                                    (250: U+02B0..02C1, U+02C6..02D1,
2458                                    U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
2459          \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
2460                                    (121: [\^`\xa8\xaf\xb4\xb8],
2461                                    U+02C2..02C5, U+02D2..02DF,
2462                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2463          \p{General_Category: N} \p{General_Category=Number} (1502)
2464          \p{General_Category: Nd} \p{General_Category=Decimal_Number} (590)
2465          \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
2466          \p{General_Category: No} \p{General_Category=Other_Number} (676)
2467          \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
2468                                    (1763: U+0300..036F, U+0483..0487,
2469                                    U+0591..05BD, U+05BF, U+05C1..05C2,
2470                                    U+05C4..05C5 ...)
2471          \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1502:
2472                                    [0-9\xb2-\xb3\xb9\xbc-\xbe],
2473                                    U+0660..0669, U+06F0..06F9,
2474                                    U+07C0..07C9, U+0966..096F, U+09E6..09EF
2475                                    ...)
2476          \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
2477                                    (75: [\(\[\{], U+0F3A, U+0F3C, U+169B,
2478                                    U+201A, U+201E ...)
2479          \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (977_573 plus
2480                                    all above-Unicode code points: [\x00-
2481                                    \x1f\x7f-\x9f\xad], U+0378..0379,
2482                                    U+0380..0383, U+038B, U+038D, U+03A2 ...)
2483          \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
2484                                    (121_047: [\xaa\xba], U+01BB,
2485                                    U+01C0..01C3, U+0294, U+05D0..05EA,
2486                                    U+05F0..05F2 ...)
2487          \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
2488                                    (676: [\xb2-\xb3\xb9\xbc-\xbe],
2489                                    U+09F4..09F9, U+0B72..0B77,
2490                                    U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
2491                                    ...)
2492          \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
2493                                    (566:
2494                                    [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7\xb6-
2495                                    \xb7\xbf], U+037E, U+0387, U+055A..055F,
2496                                    U+0589, U+05C0 ...)
2497          \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
2498                                    (5855: [\xa6\xa9\xae\xb0], U+0482,
2499                                    U+058D..058E, U+060E..060F, U+06DE,
2500                                    U+06E9 ...)
2501          \p{General_Category: P} \p{General_Category=Punctuation} (770)
2502          \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
2503                                    \p{Zp}) (1: U+2029)
2504          \p{General_Category: Pc} \p{General_Category=
2505                                    Connector_Punctuation} (10)
2506          \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
2507          \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
2508                                    (73)
2509          \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
2510                                    (10)
2511          \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
2512                                    (12)
2513          \p{General_Category: Po} \p{General_Category=Other_Punctuation}
2514                                    (566)
2515          \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
2516                                    (137_468: U+E000..F8FF, U+F0000..FFFFD,
2517                                    U+100000..10FFFD)
2518          \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
2519          \p{General_Category: Punct} \p{General_Category=Punctuation} (770)
2520          \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (770:
2521                                    [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_-
2522                                    \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
2523                                    U+037E, U+0387, U+055A..055F,
2524                                    U+0589..058A, U+05BE ...)
2525          \p{General_Category: S} \p{General_Category=Symbol} (6978)
2526          \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (54)
2527          \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
2528                                    [\x20\xa0], U+1680, U+2000..200A,
2529                                    U+2028..2029, U+202F, U+205F ...)
2530          \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121)
2531          \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
2532          \p{General_Category: So} \p{General_Category=Other_Symbol} (5855)
2533          \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
2534                                    (17: [\x20\xa0], U+1680, U+2000..200A,
2535                                    U+202F, U+205F, U+3000)
2536          \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
2537                                    (401: U+0903, U+093B, U+093E..0940,
2538                                    U+0949..094C, U+094E..094F, U+0982..0983
2539                                    ...)
2540          \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
2541                                    U+D800..DFFF)
2542          \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (6978:
2543                                    [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
2544                                    \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
2545                                    U+02D2..02DF, U+02E5..02EB, U+02ED,
2546                                    U+02EF..02FF ...)
2547          \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
2548                                    /i= General_Category=Cased_Letter) (31:
2549                                    U+01C5, U+01C8, U+01CB, U+01F2,
2550                                    U+1F88..1F8F, U+1F98..1F9F ...)
2551          \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
2552                                    (837_841 plus all above-Unicode code
2553                                    points: U+0378..0379, U+0380..0383,
2554                                    U+038B, U+038D, U+03A2, U+0530 ...)
2555          \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
2556                                    /i= General_Category=Cased_Letter)
2557                                    (1702: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
2558                                    U+0102, U+0104, U+0106, U+0108 ...)
2559          \p{General_Category: Z} \p{General_Category=Separator} (19)
2560          \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
2561          \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
2562                                    (1)
2563          \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
2564        X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
2565                                    \p{InPunctuation}) (112)
2566        X \p{Geometric_Shapes}    \p{Block=Geometric_Shapes} (96)
2567        X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
2568                                    \p{Block=Geometric_Shapes_Extended})
2569                                    (128)
2570        X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
2571                                    (Short: \p{InGeometricShapesExt}) (128)
2572          \p{Geor}                \p{Georgian} (= \p{Script_Extensions=
2573                                    Georgian}) (NOT \p{Block=Georgian}) (129)
2574          \p{Georgian}            \p{Script_Extensions=Georgian} (Short:
2575                                    \p{Geor}; NOT \p{Block=Georgian}) (129)
2576        X \p{Georgian_Sup}        \p{Georgian_Supplement} (= \p{Block=
2577                                    Georgian_Supplement}) (48)
2578        X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
2579                                    \p{InGeorgianSup}) (48)
2580          \p{Glag}                \p{Glagolitic} (= \p{Script_Extensions=
2581                                    Glagolitic}) (NOT \p{Block=Glagolitic})
2582                                    (136)
2583          \p{Glagolitic}          \p{Script_Extensions=Glagolitic} (Short:
2584                                    \p{Glag}; NOT \p{Block=Glagolitic}) (136)
2585        X \p{Glagolitic_Sup}      \p{Glagolitic_Supplement} (= \p{Block=
2586                                    Glagolitic_Supplement}) (48)
2587        X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
2588                                    \p{InGlagoliticSup}) (48)
2589          \p{Gonm}                \p{Masaram_Gondi} (= \p{Script_Extensions=
2590                                    Masaram_Gondi}) (NOT \p{Block=
2591                                    Masaram_Gondi}) (75)
2592          \p{Goth}                \p{Gothic} (= \p{Script_Extensions=
2593                                    Gothic}) (NOT \p{Block=Gothic}) (27)
2594          \p{Gothic}              \p{Script_Extensions=Gothic} (Short:
2595                                    \p{Goth}; NOT \p{Block=Gothic}) (27)
2596          \p{Gr_Base}             \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
2597                                    (134_733)
2598          \p{Gr_Base: *}          \p{Grapheme_Base: *}
2599          \p{Gr_Ext}              \p{Grapheme_Extend} (= \p{Grapheme_Extend=
2600                                    Y}) (1901)
2601          \p{Gr_Ext: *}           \p{Grapheme_Extend: *}
2602          \p{Gran}                \p{Grantha} (= \p{Script_Extensions=
2603                                    Grantha}) (NOT \p{Block=Grantha}) (113)
2604          \p{Grantha}             \p{Script_Extensions=Grantha} (Short:
2605                                    \p{Gran}; NOT \p{Block=Grantha}) (113)
2606          \p{Graph}               \p{XPosixGraph} (274_139)
2607          \p{Grapheme_Base}       \p{Grapheme_Base=Y} (Short: \p{GrBase})
2608                                    (134_733)
2609          \p{Grapheme_Base: N*}   (Short: \p{GrBase=N}, \P{GrBase}) (979_379
2610                                    plus all above-Unicode code points:
2611                                    [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
2612                                    U+0378..0379, U+0380..0383, U+038B,
2613                                    U+038D ...)
2614          \p{Grapheme_Base: Y*}   (Short: \p{GrBase=Y}, \p{GrBase})
2615                                    (134_733: [\x20-\x7e\xa0-\xac\xae-\xff],
2616                                    U+0100..02FF, U+0370..0377,
2617                                    U+037A..037F, U+0384..038A, U+038C ...)
2618          \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
2619                                    (5925)
2620          \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (5925:
2621                                    [^\n\r\x20-\x7e\xa0-\xac\xae-\xff],
2622                                    U+061C, U+180E, U+200B, U+200E..200F,
2623                                    U+2028..202E ...)
2624          \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
2625          \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (98:
2626                                    U+261D, U+26F9, U+270A..270D, U+1F385,
2627                                    U+1F3C2..1F3C4, U+1F3C7 ...)
2628          \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (4:
2629                                    U+1F466..1F469)
2630          \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (5:
2631                                    U+1F3FB..1F3FF)
2632          \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
2633                                    (98)
2634          \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
2635                                    E_Base_GAZ} (4)
2636          \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
2637                                    E_Modifier} (5)
2638          \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
2639                                    (1901)
2640          \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1901:
2641                                    U+0300..036F, U+0483..0489,
2642                                    U+0591..05BD, U+05BF, U+05C1..05C2,
2643                                    U+05C4..05C5 ...)
2644          \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
2645                                    Glue_After_Zwj} (22)
2646          \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ})
2647                                    (22: U+2640, U+2642, U+2695..2696,
2648                                    U+2708, U+2764, U+1F308 ...)
2649          \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
2650                                    U+1100..115F, U+A960..A97C)
2651          \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
2652          \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
2653                                    U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
2654                                    ...)
2655          \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
2656                                    U+AC01..AC1B, U+AC1D..AC37,
2657                                    U+AC39..AC53, U+AC55..AC6F,
2658                                    U+AC71..AC8B, U+AC8D..ACA7 ...)
2659          \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_232
2660                                    plus all above-Unicode code points:
2661                                    [\x20-\x7e\xa0-\xac\xae-\xff],
2662                                    U+0100..02FF, U+0370..0482,
2663                                    U+048A..0590, U+05BE, U+05C0 ...)
2664          \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
2665                                    (19)
2666          \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (19:
2667                                    U+0600..0605, U+06DD, U+070F, U+08E2,
2668                                    U+0D4E, U+110BD ...)
2669          \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
2670                                    (26: U+1F1E6..1F1FF)
2671          \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
2672                                    Regional_Indicator} (26)
2673          \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
2674                                    SpacingMark} (348)
2675          \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (348:
2676                                    U+0903, U+093B, U+093E..0940,
2677                                    U+0949..094C, U+094E..094F, U+0982..0983
2678                                    ...)
2679          \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
2680                                    U+11A8..11FF, U+D7CB..D7FB)
2681          \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
2682                                    U+1160..11A7, U+D7B0..D7C6)
2683          \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
2684                                    (1_094_232 plus all above-Unicode code
2685                                    points)
2686          \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
2687          \p{Grapheme_Extend}     \p{Grapheme_Extend=Y} (Short: \p{GrExt})
2688                                    (1901)
2689          \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_211
2690                                    plus all above-Unicode code points:
2691                                    U+0000..02FF, U+0370..0482,
2692                                    U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
2693          \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1901:
2694                                    U+0300..036F, U+0483..0489,
2695                                    U+0591..05BD, U+05BF, U+05C1..05C2,
2696                                    U+05C4..05C5 ...)
2697          \p{Greek}               \p{Script_Extensions=Greek} (Short:
2698                                    \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
2699        X \p{Greek_And_Coptic}    \p{Block=Greek_And_Coptic} (Short:
2700                                    \p{InGreek}) (144)
2701        X \p{Greek_Ext}           \p{Greek_Extended} (= \p{Block=
2702                                    Greek_Extended}) (256)
2703        X \p{Greek_Extended}      \p{Block=Greek_Extended} (Short:
2704                                    \p{InGreekExt}) (256)
2705          \p{Grek}                \p{Greek} (= \p{Script_Extensions=Greek})
2706                                    (NOT \p{Greek_And_Coptic}) (522)
2707          \p{Gujarati}            \p{Script_Extensions=Gujarati} (Short:
2708                                    \p{Gujr}; NOT \p{Block=Gujarati}) (105)
2709          \p{Gujr}                \p{Gujarati} (= \p{Script_Extensions=
2710                                    Gujarati}) (NOT \p{Block=Gujarati}) (105)
2711          \p{Gurmukhi}            \p{Script_Extensions=Gurmukhi} (Short:
2712                                    \p{Guru}; NOT \p{Block=Gurmukhi}) (93)
2713          \p{Guru}                \p{Gurmukhi} (= \p{Script_Extensions=
2714                                    Gurmukhi}) (NOT \p{Block=Gurmukhi}) (93)
2715        X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
2716                                    \p{Block=Halfwidth_And_Fullwidth_Forms})
2717                                    (240)
2718        X \p{Half_Marks}          \p{Combining_Half_Marks} (= \p{Block=
2719                                    Combining_Half_Marks}) (16)
2720        X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
2721                                    Halfwidth_And_Fullwidth_Forms} (Short:
2722                                    \p{InHalfAndFullForms}) (240)
2723          \p{Han}                 \p{Script_Extensions=Han} (89_507)
2724          \p{Hang}                \p{Hangul} (= \p{Script_Extensions=
2725                                    Hangul}) (NOT \p{Hangul_Syllables})
2726                                    (11_775)
2727          \p{Hangul}              \p{Script_Extensions=Hangul} (Short:
2728                                    \p{Hang}; NOT \p{Hangul_Syllables})
2729                                    (11_775)
2730        X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
2731                                    (Short: \p{InCompatJamo}) (96)
2732        X \p{Hangul_Jamo}         \p{Block=Hangul_Jamo} (Short: \p{InJamo})
2733                                    (256)
2734        X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
2735                                    (Short: \p{InJamoExtA}) (32)
2736        X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
2737                                    (Short: \p{InJamoExtB}) (80)
2738          \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
2739                                    (125)
2740          \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
2741                                    U+1100..115F, U+A960..A97C)
2742          \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
2743                                    (399)
2744          \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
2745                                    U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
2746                                    U+AC8C ...)
2747          \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
2748                                    LVT_Syllable} (10_773)
2749          \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
2750                                    (10_773: U+AC01..AC1B, U+AC1D..AC37,
2751                                    U+AC39..AC53, U+AC55..AC6F,
2752                                    U+AC71..AC8B, U+AC8D..ACA7 ...)
2753          \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
2754                                    Not_Applicable} (1_102_583 plus all
2755                                    above-Unicode code points)
2756          \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
2757                                    (1_102_583 plus all above-Unicode code
2758                                    points: U+0000..10FF, U+1200..A95F,
2759                                    U+A97D..ABFF, U+D7A4..D7AF,
2760                                    U+D7C7..D7CA, U+D7FC..infinity)
2761          \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
2762                                    (137)
2763          \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
2764                                    U+11A8..11FF, U+D7CB..D7FB)
2765          \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
2766                                    (95)
2767          \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
2768                                    U+1160..11A7, U+D7B0..D7C6)
2769        X \p{Hangul_Syllables}    \p{Block=Hangul_Syllables} (Short:
2770                                    \p{InHangul}) (11_184)
2771          \p{Hani}                \p{Han} (= \p{Script_Extensions=Han})
2772                                    (89_507)
2773          \p{Hano}                \p{Hanunoo} (= \p{Script_Extensions=
2774                                    Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
2775          \p{Hanunoo}             \p{Script_Extensions=Hanunoo} (Short:
2776                                    \p{Hano}; NOT \p{Block=Hanunoo}) (23)
2777          \p{Hatr}                \p{Hatran} (= \p{Script_Extensions=
2778                                    Hatran}) (NOT \p{Block=Hatran}) (26)
2779          \p{Hatran}              \p{Script_Extensions=Hatran} (Short:
2780                                    \p{Hatr}; NOT \p{Block=Hatran}) (26)
2781          \p{Hebr}                \p{Hebrew} (= \p{Script_Extensions=
2782                                    Hebrew}) (NOT \p{Block=Hebrew}) (133)
2783          \p{Hebrew}              \p{Script_Extensions=Hebrew} (Short:
2784                                    \p{Hebr}; NOT \p{Block=Hebrew}) (133)
2785          \p{Hex}                 \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2786          \p{Hex: *}              \p{Hex_Digit: *}
2787          \p{Hex_Digit}           \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2788          \p{Hex_Digit: N*}       (Short: \p{Hex=N}, \P{Hex}) (1_114_068
2789                                    plus all above-Unicode code points:
2790                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
2791                                    >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
2792                                    U+0100..FF0F, U+FF1A..FF20,
2793                                    U+FF27..FF40, U+FF47..infinity)
2794          \p{Hex_Digit: Y*}       (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
2795                                    f], U+FF10..FF19, U+FF21..FF26,
2796                                    U+FF41..FF46)
2797        X \p{High_Private_Use_Surrogates} \p{Block=
2798                                    High_Private_Use_Surrogates} (Short:
2799                                    \p{InHighPUSurrogates}) (128)
2800        X \p{High_PU_Surrogates}  \p{High_Private_Use_Surrogates} (=
2801                                    \p{Block=High_Private_Use_Surrogates})
2802                                    (128)
2803        X \p{High_Surrogates}     \p{Block=High_Surrogates} (896)
2804          \p{Hira}                \p{Hiragana} (= \p{Script_Extensions=
2805                                    Hiragana}) (NOT \p{Block=Hiragana}) (428)
2806          \p{Hiragana}            \p{Script_Extensions=Hiragana} (Short:
2807                                    \p{Hira}; NOT \p{Block=Hiragana}) (428)
2808          \p{Hluw}                \p{Anatolian_Hieroglyphs} (=
2809                                    \p{Script_Extensions=
2810                                    Anatolian_Hieroglyphs}) (NOT \p{Block=
2811                                    Anatolian_Hieroglyphs}) (583)
2812          \p{Hmng}                \p{Pahawh_Hmong} (= \p{Script_Extensions=
2813                                    Pahawh_Hmong}) (NOT \p{Block=
2814                                    Pahawh_Hmong}) (127)
2815          \p{HorizSpace}          \p{XPosixBlank} (18)
2816          \p{Hst: *}              \p{Hangul_Syllable_Type: *}
2817          \p{Hung}                \p{Old_Hungarian} (= \p{Script_Extensions=
2818                                    Old_Hungarian}) (NOT \p{Block=
2819                                    Old_Hungarian}) (108)
2820        D \p{Hyphen}              \p{Hyphen=Y} (11)
2821        D \p{Hyphen: N*}          Supplanted by Line_Break property values;
2822                                    see www.unicode.org/reports/tr14
2823                                    (Single: \P{Hyphen}) (1_114_101 plus all
2824                                    above-Unicode code points: [\x00-
2825                                    \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
2826                                    Z\[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-
2827                                    \xff], U+0100..0589, U+058B..1805,
2828                                    U+1807..200F, U+2012..2E16, U+2E18..30FA
2829                                    ...)
2830        D \p{Hyphen: Y*}          Supplanted by Line_Break property values;
2831                                    see www.unicode.org/reports/tr14
2832                                    (Single: \p{Hyphen}) (11: [\-\xad],
2833                                    U+058A, U+1806, U+2010..2011, U+2E17,
2834                                    U+30FB ...)
2835          \p{ID_Continue}         \p{ID_Continue=Y} (Short: \p{IDC}; NOT
2836                                    \p{Ideographic_Description_Characters})
2837                                    (128_108)
2838          \p{ID_Continue: N*}     (Short: \p{IDC=N}, \P{IDC}) (986_004 plus
2839                                    all above-Unicode code points: [\x00-
2840                                    \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
2841                                    >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
2842                                    \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
2843                                    U+02C2..02C5, U+02D2..02DF,
2844                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2845          \p{ID_Continue: Y*}     (Short: \p{IDC=Y}, \p{IDC}) (128_108:
2846                                    [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
2847                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
2848                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
2849                                    U+02EE ...)
2850          \p{ID_Start}            \p{ID_Start=Y} (Short: \p{IDS}) (125_334)
2851          \p{ID_Start: N*}        (Short: \p{IDS=N}, \P{IDS}) (988_778 plus
2852                                    all above-Unicode code points: [\x00-
2853                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2854                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
2855                                    \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
2856                                    U+02C2..02C5, U+02D2..02DF,
2857                                    U+02E5..02EB, U+02ED, U+02EF..036F ...)
2858          \p{ID_Start: Y*}        (Short: \p{IDS=Y}, \p{IDS}) (125_334: [A-
2859                                    Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
2860                                    \xff], U+0100..02C1, U+02C6..02D1,
2861                                    U+02E0..02E4, U+02EC, U+02EE ...)
2862          \p{IDC}                 \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
2863                                    \p{Ideographic_Description_Characters})
2864                                    (128_108)
2865          \p{IDC: *}              \p{ID_Continue: *}
2866          \p{Ideo}                \p{Ideographic} (= \p{Ideographic=Y})
2867                                    (96_174)
2868          \p{Ideo: *}             \p{Ideographic: *}
2869          \p{Ideographic}         \p{Ideographic=Y} (Short: \p{Ideo})
2870                                    (96_174)
2871          \p{Ideographic: N*}     (Short: \p{Ideo=N}, \P{Ideo}) (1_017_938
2872                                    plus all above-Unicode code points:
2873                                    U+0000..3005, U+3008..3020,
2874                                    U+302A..3037, U+303B..33FF,
2875                                    U+4DB6..4DFF, U+9FEB..F8FF ...)
2876          \p{Ideographic: Y*}     (Short: \p{Ideo=Y}, \p{Ideo}) (96_174:
2877                                    U+3006..3007, U+3021..3029,
2878                                    U+3038..303A, U+3400..4DB5,
2879                                    U+4E00..9FEA, U+F900..FA6D ...)
2880        X \p{Ideographic_Description_Characters} \p{Block=
2881                                    Ideographic_Description_Characters}
2882                                    (Short: \p{InIDC}) (16)
2883        X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
2884                                    \p{Block=
2885                                    Ideographic_Symbols_And_Punctuation})
2886                                    (32)
2887        X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
2888                                    Ideographic_Symbols_And_Punctuation}
2889                                    (Short: \p{InIdeographicSymbols}) (32)
2890          \p{IDS}                 \p{ID_Start} (= \p{ID_Start=Y}) (125_334)
2891          \p{IDS: *}              \p{ID_Start: *}
2892          \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
2893                                    \p{IDSB}) (10)
2894          \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
2895                                    (1_114_102 plus all above-Unicode code
2896                                    points: U+0000..2FEF, U+2FF2..2FF3,
2897                                    U+2FFC..infinity)
2898          \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
2899                                    U+2FF0..2FF1, U+2FF4..2FFB)
2900          \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
2901                                    \p{IDST}) (2)
2902          \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
2903                                    (1_114_110 plus all above-Unicode code
2904                                    points: U+0000..2FF1, U+2FF4..infinity)
2905          \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
2906                                    U+2FF2..2FF3)
2907          \p{IDSB}                \p{IDS_Binary_Operator} (=
2908                                    \p{IDS_Binary_Operator=Y}) (10)
2909          \p{IDSB: *}             \p{IDS_Binary_Operator: *}
2910          \p{IDST}                \p{IDS_Trinary_Operator} (=
2911                                    \p{IDS_Trinary_Operator=Y}) (2)
2912          \p{IDST: *}             \p{IDS_Trinary_Operator: *}
2913          \p{Imperial_Aramaic}    \p{Script_Extensions=Imperial_Aramaic}
2914                                    (Short: \p{Armi}; NOT \p{Block=
2915                                    Imperial_Aramaic}) (31)
2916          \p{In: *}               \p{Present_In: *} (Perl extension)
2917        X \p{In_*}                \p{Block: *}
2918        X \p{Indic_Number_Forms}  \p{Common_Indic_Number_Forms} (= \p{Block=
2919                                    Common_Indic_Number_Forms}) (16)
2920          \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
2921                                    (330: U+093C, U+0941..0944, U+094D,
2922                                    U+0952, U+0956..0957, U+0962..0963 ...)
2923          \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
2924                                    BottomAndLeft}) (1: U+A9BF)
2925          \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
2926                                    BottomAndRight}) (2: U+1B3B, U+A9C0)
2927          \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (57:
2928                                    U+093F, U+094E, U+09BF, U+09C7..09C8,
2929                                    U+0A3F, U+0ABF ...)
2930          \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
2931                                    LeftAndRight}) (21: U+09CB..09CC,
2932                                    U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
2933                                    U+0DDC, U+0DDE ...)
2934          \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_996
2935                                    plus all above-Unicode code points:
2936                                    U+0000..08FF, U+0904..0939, U+093D,
2937                                    U+0950, U+0958..0961, U+0964..0980 ...)
2938          \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
2939                                    Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
2940                                    U+10A01, U+10A06)
2941          \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (262:
2942                                    U+0903, U+093B, U+093E, U+0940,
2943                                    U+0949..094C, U+094F ...)
2944          \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (380:
2945                                    U+0900..0902, U+093A, U+0945..0948,
2946                                    U+0951, U+0953..0955, U+0981 ...)
2947          \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
2948                                    TopAndBottom}) (10: U+0C48, U+0F73,
2949                                    U+0F76..0F79, U+0F81, U+1B3C,
2950                                    U+1112E..1112F)
2951          \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
2952                                    \p{InPC=TopAndBottomAndRight}) (1:
2953                                    U+1B3D)
2954          \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
2955                                    TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
2956                                    U+1C29, U+114BB, U+115B9)
2957          \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
2958                                    \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
2959                                    U+0DDD, U+17BF, U+115BB)
2960          \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
2961                                    TopAndRight}) (13: U+0AC9, U+0B57,
2962                                    U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
2963                                    U+1925..1926 ...)
2964          \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
2965                                    VisualOrderLeft}) (19: U+0E40..0E44,
2966                                    U+0EC0..0EC4, U+19B5..19B7, U+19BA,
2967                                    U+AAB5..AAB6, U+AAB9 ...)
2968          \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
2969                                    (15: U+093D, U+09BD, U+0ABD, U+0B3D,
2970                                    U+0C3D, U+0CBD ...)
2971          \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (80:
2972                                    U+0900..0902, U+0981..0982, U+09FC,
2973                                    U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
2974          \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
2975                                    \p{InSC=BrahmiJoiningNumber}) (20:
2976                                    U+11052..11065)
2977          \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
2978                                    CantillationMark}) (57: U+0951..0952,
2979                                    U+0AFA..0AFC, U+1CD0..1CD2,
2980                                    U+1CD4..1CE1, U+1CF4, U+1CF7..1CF9 ...)
2981          \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
2982                                    (2024: U+0915..0939, U+0958..095F,
2983                                    U+0978..097F, U+0995..09A8,
2984                                    U+09AA..09B0, U+09B2 ...)
2985          \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
2986                                    ConsonantDead}) (10: U+09CE,
2987                                    U+0D54..0D56, U+0D7A..0D7F)
2988          \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
2989                                    ConsonantFinal}) (68: U+1930..1931,
2990                                    U+1933..1939, U+19C1..19C7,
2991                                    U+1A58..1A59, U+1BBE..1BBF, U+1BF0..1BF1
2992                                    ...)
2993          \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
2994                                    \p{InSC=ConsonantHeadLetter}) (5:
2995                                    U+0F88..0F8C)
2996          \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
2997                                    ConsonantKiller}) (2: U+0E4C, U+17CD)
2998          \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
2999                                    ConsonantMedial}) (27: U+0A75,
3000                                    U+0EBC..0EBD, U+103B..103E,
3001                                    U+105E..1060, U+1082, U+1A55..1A56 ...)
3002          \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
3003                                    \p{InSC=ConsonantPlaceholder}) (18: [\-
3004                                    \xa0\xd7], U+0A72..0A73, U+104E, U+1900,
3005                                    U+2010..2014, U+25CC ...)
3006          \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
3007                                    \p{InSC=ConsonantPrecedingRepha}) (2:
3008                                    U+0D4E, U+11D46)
3009          \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
3010                                    ConsonantPrefixed}) (7: U+111C2..111C3,
3011                                    U+11A3A, U+11A86..11A89)
3012          \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
3013                                    ConsonantSubjoined}) (95: U+0F8D..0F97,
3014                                    U+0F99..0FBC, U+1929..192B, U+1A57,
3015                                    U+1A5B..1A5E, U+1BA1..1BA3 ...)
3016          \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
3017                                    \p{InSC=ConsonantSucceedingRepha}) (5:
3018                                    U+17CC, U+1A5A, U+1B03, U+1B81, U+A982)
3019          \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
3020                                    \p{InSC=ConsonantWithStacker}) (4:
3021                                    U+0CF1..0CF2, U+11003..11004)
3022          \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
3023                                    GeminationMark}) (3: U+0A71, U+11237,
3024                                    U+11A98)
3025          \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
3026                                    InvisibleStacker}) (10: U+1039, U+17D2,
3027                                    U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
3028          \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
3029                                    U+200D)
3030          \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
3031                                    ModifyingLetter}) (1: U+0B83)
3032          \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
3033                                    NonJoiner}) (1: U+200C)
3034          \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (28:
3035                                    U+093C, U+09BC, U+0A3C, U+0ABC,
3036                                    U+0AFD..0AFF, U+0B3C ...)
3037          \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (469:
3038                                    [0-9], U+0966..096F, U+09E6..09EF,
3039                                    U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
3040                                    ...)
3041          \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
3042                                    NumberJoiner}) (1: U+1107F)
3043          \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
3044                                    (1_109_901 plus all above-Unicode code
3045                                    points: [\x00-
3046                                    \x20!\"#\$\%&\'\(\)*+,.\/:;<=>?\@A-
3047                                    Z\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
3048                                    \xb1\xb4-\xd6\xd8-\xff], U+0100..08FF,
3049                                    U+0950, U+0953..0954, U+0964..0965,
3050                                    U+0970..0971 ...)
3051          \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
3052                                    PureKiller}) (21: U+0D3B..0D3C, U+0E3A,
3053                                    U+0E4E, U+0F84, U+103A, U+1714 ...)
3054          \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
3055                                    RegisterShifter}) (2: U+17C9..17CA)
3056          \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
3057                                    SyllableModifier}) (22: [\xb2-\xb3],
3058                                    U+0F35, U+0F37, U+0FC6, U+17CB,
3059                                    U+17CE..17D0 ...)
3060          \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
3061                                    ToneLetter}) (7: U+1970..1974, U+AAC0,
3062                                    U+AAC2)
3063          \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
3064                                    (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
3065                                    U+1063..1064, U+1069..106D, U+1087..108D
3066                                    ...)
3067          \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (24:
3068                                    U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3069                                    U+0BCD ...)
3070          \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3071                                    (34: U+0903, U+0983, U+0A03, U+0A83,
3072                                    U+0B03, U+0C03 ...)
3073          \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3074                                    U+1963..196D, U+A85E..A861, U+A866,
3075                                    U+A922..A92A, U+11150..11154)
3076          \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3077                                    VowelDependent}) (633: U+093A..093B,
3078                                    U+093E..094C, U+094E..094F,
3079                                    U+0955..0957, U+0962..0963, U+09BE..09C4
3080                                    ...)
3081          \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3082                                    VowelIndependent}) (443: U+0904..0914,
3083                                    U+0960..0961, U+0972..0977,
3084                                    U+0985..098C, U+098F..0990, U+0993..0994
3085                                    ...)
3086          \p{Inherited}           \p{Script_Extensions=Inherited} (Short:
3087                                    \p{Zinh}) (500)
3088          \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
3089                                    (Short: \p{Pi}) (12)
3090          \p{InPC: *}             \p{Indic_Positional_Category: *}
3091          \p{InSC: *}             \p{Indic_Syllabic_Category: *}
3092          \p{Inscriptional_Pahlavi} \p{Script_Extensions=
3093                                    Inscriptional_Pahlavi} (Short: \p{Phli};
3094                                    NOT \p{Block=Inscriptional_Pahlavi}) (27)
3095          \p{Inscriptional_Parthian} \p{Script_Extensions=
3096                                    Inscriptional_Parthian} (Short:
3097                                    \p{Prti}; NOT \p{Block=
3098                                    Inscriptional_Parthian}) (30)
3099        X \p{IPA_Ext}             \p{IPA_Extensions} (= \p{Block=
3100                                    IPA_Extensions}) (96)
3101        X \p{IPA_Extensions}      \p{Block=IPA_Extensions} (Short:
3102                                    \p{InIPAExt}) (96)
3103          \p{Is_*}                \p{*} (Any exceptions are individually
3104                                    noted beginning with the word NOT.) If
3105                                    an entry has flag(s) at its beginning,
3106                                    like "D", the "Is_" form has the same
3107                                    flag(s)
3108          \p{Ital}                \p{Old_Italic} (= \p{Script_Extensions=
3109                                    Old_Italic}) (NOT \p{Block=Old_Italic})
3110                                    (39)
3111        X \p{Jamo}                \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
3112                                    (256)
3113        X \p{Jamo_Ext_A}          \p{Hangul_Jamo_Extended_A} (= \p{Block=
3114                                    Hangul_Jamo_Extended_A}) (32)
3115        X \p{Jamo_Ext_B}          \p{Hangul_Jamo_Extended_B} (= \p{Block=
3116                                    Hangul_Jamo_Extended_B}) (80)
3117          \p{Java}                \p{Javanese} (= \p{Script_Extensions=
3118                                    Javanese}) (NOT \p{Block=Javanese}) (91)
3119          \p{Javanese}            \p{Script_Extensions=Javanese} (Short:
3120                                    \p{Java}; NOT \p{Block=Javanese}) (91)
3121          \p{Jg: *}               \p{Joining_Group: *}
3122          \p{Join_C}              \p{Join_Control} (= \p{Join_Control=Y}) (2)
3123          \p{Join_C: *}           \p{Join_Control: *}
3124          \p{Join_Control}        \p{Join_Control=Y} (Short: \p{JoinC}) (2)
3125          \p{Join_Control: N*}    (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
3126                                    plus all above-Unicode code points:
3127                                    U+0000..200B, U+200E..infinity)
3128          \p{Join_Control: Y*}    (Short: \p{JoinC=Y}, \p{JoinC}) (2:
3129                                    U+200C..200D)
3130          \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
3131                                    U+08BB)
3132          \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
3133                                    U+08BD)
3134          \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (1:
3135                                    U+08BC)
3136          \p{Joining_Group: Ain}  (Short: \p{Jg=Ain}) (8: U+0639..063A,
3137                                    U+06A0, U+06FC, U+075D..075F, U+08B3)
3138          \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
3139          \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10: U+0622..0623,
3140                                    U+0625, U+0627, U+0671..0673, U+0675,
3141                                    U+0773..0774)
3142          \p{Joining_Group: Beh}  (Short: \p{Jg=Beh}) (24: U+0628,
3143                                    U+062A..062B, U+066E, U+0679..0680,
3144                                    U+0750..0756, U+08A0..08A1 ...)
3145          \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
3146          \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
3147                                    BurushaskiYehBarree}) (2: U+077A..077B)
3148          \p{Joining_Group: Dal}  (Short: \p{Jg=Dal}) (15: U+062F..0630,
3149                                    U+0688..0690, U+06EE, U+0759..075A,
3150                                    U+08AE)
3151          \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
3152                                    U+0715..0716, U+072A, U+072F)
3153          \p{Joining_Group: E}    (Short: \p{Jg=E}) (1: U+0725)
3154          \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
3155                                    U+063D..063F, U+06CC, U+06CE,
3156                                    U+0775..0776)
3157          \p{Joining_Group: Fe}   (Short: \p{Jg=Fe}) (1: U+074F)
3158          \p{Joining_Group: Feh}  (Short: \p{Jg=Feh}) (10: U+0641,
3159                                    U+06A1..06A6, U+0760..0761, U+08A4)
3160          \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
3161                                    U+0724)
3162          \p{Joining_Group: Gaf}  (Short: \p{Jg=Gaf}) (14: U+063B..063C,
3163                                    U+06A9, U+06AB, U+06AF..06B4,
3164                                    U+0762..0764, U+08B0)
3165          \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
3166                                    U+072E)
3167          \p{Joining_Group: Hah}  (Short: \p{Jg=Hah}) (18: U+062C..062E,
3168                                    U+0681..0687, U+06BF, U+0757..0758,
3169                                    U+076E..076F, U+0772 ...)
3170          \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
3171                                    HamzaOnHehGoal}) (1: U+06C3)
3172          \p{Joining_Group: He}   (Short: \p{Jg=He}) (1: U+0717)
3173          \p{Joining_Group: Heh}  (Short: \p{Jg=Heh}) (1: U+0647)
3174          \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
3175                                    U+06C1..06C2)
3176          \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
3177          \p{Joining_Group: Kaf}  (Short: \p{Jg=Kaf}) (6: U+0643,
3178                                    U+06AC..06AE, U+077F, U+08B4)
3179          \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
3180          \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
3181          \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
3182                                    U+06BE, U+06FF)
3183          \p{Joining_Group: Lam}  (Short: \p{Jg=Lam}) (7: U+0644,
3184                                    U+06B5..06B8, U+076A, U+08A6)
3185          \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
3186          \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
3187                                    U+0866)
3188          \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
3189                                    U+0861)
3190          \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
3191                                    U+0868)
3192          \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
3193                                    (1: U+0869)
3194          \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
3195                                    U+0860)
3196          \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
3197                                    U+0864)
3198          \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
3199                                    (1: U+0865)
3200          \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
3201                                    U+0862)
3202          \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
3203                                    U+0867)
3204          \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
3205                                    U+086A)
3206          \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
3207                                    U+0863)
3208          \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
3209                                    ManichaeanAleph}) (1: U+10AC0)
3210          \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
3211                                    (2: U+10AD9..10ADA)
3212          \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
3213                                    (2: U+10AC1..10AC2)
3214          \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
3215                                    ManichaeanDaleth}) (1: U+10AC5)
3216          \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
3217                                    ManichaeanDhamedh}) (1: U+10AD4)
3218          \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
3219                                    (1: U+10AEC)
3220          \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
3221                                    ManichaeanGimel}) (2: U+10AC3..10AC4)
3222          \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
3223                                    (1: U+10ACD)
3224          \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
3225                                    ManichaeanHundred}) (1: U+10AEF)
3226          \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
3227                                    (3: U+10AD0..10AD2)
3228          \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
3229                                    ManichaeanLamedh}) (1: U+10AD3)
3230          \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
3231                                    (1: U+10AD6)
3232          \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
3233                                    (1: U+10AD7)
3234          \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
3235                                    (1: U+10AEB)
3236          \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
3237                                    U+10ADB..10ADC)
3238          \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
3239                                    (3: U+10ADE..10AE0)
3240          \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
3241                                    (1: U+10AE1)
3242          \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
3243                                    ManichaeanSadhe}) (1: U+10ADD)
3244          \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
3245                                    ManichaeanSamekh}) (1: U+10AD8)
3246          \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
3247                                    (1: U+10AE4)
3248          \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
3249                                    (1: U+10AED)
3250          \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
3251                                    (1: U+10ACE)
3252          \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
3253                                    ManichaeanThamedh}) (1: U+10AD5)
3254          \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
3255                                    ManichaeanTwenty}) (1: U+10AEE)
3256          \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
3257                                    (1: U+10AC7)
3258          \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
3259                                    (1: U+10ACF)
3260          \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
3261                                    ManichaeanZayin}) (2: U+10AC9..10ACA)
3262          \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
3263                                    U+0765..0766, U+08A7)
3264          \p{Joining_Group: Mim}  (Short: \p{Jg=Mim}) (1: U+0721)
3265          \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
3266                                    (1_113_807 plus all above-Unicode code
3267                                    points: U+0000..061F, U+0621, U+0640,
3268                                    U+064B..066D, U+0670, U+0674 ...)
3269          \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8: U+0646,
3270                                    U+06B9..06BC, U+0767..0769)
3271          \p{Joining_Group: Nun}  (Short: \p{Jg=Nun}) (1: U+0722)
3272          \p{Joining_Group: Nya}  (Short: \p{Jg=Nya}) (1: U+06BD)
3273          \p{Joining_Group: Pe}   (Short: \p{Jg=Pe}) (1: U+0726)
3274          \p{Joining_Group: Qaf}  (Short: \p{Jg=Qaf}) (5: U+0642, U+066F,
3275                                    U+06A7..06A8, U+08A5)
3276          \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
3277          \p{Joining_Group: Reh}  (Short: \p{Jg=Reh}) (19: U+0631..0632,
3278                                    U+0691..0699, U+06EF, U+075B,
3279                                    U+076B..076C, U+0771 ...)
3280          \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
3281                                    U+0727)
3282          \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
3283                                    U+08AC)
3284          \p{Joining_Group: Sad}  (Short: \p{Jg=Sad}) (6: U+0635..0636,
3285                                    U+069D..069E, U+06FB, U+08AF)
3286          \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
3287          \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
3288                                    U+069A..069C, U+06FA, U+075C, U+076D,
3289                                    U+0770 ...)
3290          \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
3291          \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
3292          \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
3293                                    U+08B1)
3294          \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
3295          \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
3296          \p{Joining_Group: Tah}  (Short: \p{Jg=Tah}) (4: U+0637..0638,
3297                                    U+069F, U+08A3)
3298          \p{Joining_Group: Taw}  (Short: \p{Jg=Taw}) (1: U+072C)
3299          \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
3300                                    U+0629, U+06C0, U+06D5)
3301          \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
3302                                    Hamza_On_Heh_Goal} (1)
3303          \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
3304          \p{Joining_Group: Waw}  (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
3305                                    U+0676..0677, U+06C4..06CB, U+06CF,
3306                                    U+0778..0779 ...)
3307          \p{Joining_Group: Yeh}  (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
3308                                    U+0649..064A, U+0678, U+06D0..06D1,
3309                                    U+0777 ...)
3310          \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
3311                                    U+06D2..06D3)
3312          \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
3313                                    U+06CD)
3314          \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
3315          \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
3316          \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
3317          \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
3318          \p{Joining_Type: C}     \p{Joining_Type=Join_Causing} (4)
3319          \p{Joining_Type: D}     \p{Joining_Type=Dual_Joining} (507)
3320          \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (507: U+0620,
3321                                    U+0626, U+0628, U+062A..062E,
3322                                    U+0633..063F, U+0641..0647 ...)
3323          \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4: U+0640,
3324                                    U+07FA, U+180A, U+200D)
3325          \p{Joining_Type: L}     \p{Joining_Type=Left_Joining} (3)
3326          \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (3: U+A872,
3327                                    U+10ACD, U+10AD7)
3328          \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_571 plus
3329                                    all above-Unicode code points: [\x00-
3330                                    \xac\xae-\xff], U+0100..02FF,
3331                                    U+0370..0482, U+048A..0590, U+05BE,
3332                                    U+05C0 ...)
3333          \p{Joining_Type: R}     \p{Joining_Type=Right_Joining} (115)
3334          \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (115:
3335                                    U+0622..0625, U+0627, U+0629,
3336                                    U+062F..0632, U+0648, U+0671..0673 ...)
3337          \p{Joining_Type: T}     \p{Joining_Type=Transparent} (1912)
3338          \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1912: [\xad],
3339                                    U+0300..036F, U+0483..0489,
3340                                    U+0591..05BD, U+05BF, U+05C1..05C2 ...)
3341          \p{Joining_Type: U}     \p{Joining_Type=Non_Joining} (1_111_571
3342                                    plus all above-Unicode code points)
3343          \p{Jt: *}               \p{Joining_Type: *}
3344          \p{Kaithi}              \p{Script_Extensions=Kaithi} (Short:
3345                                    \p{Kthi}; NOT \p{Block=Kaithi}) (86)
3346          \p{Kali}                \p{Kayah_Li} (= \p{Script_Extensions=
3347                                    Kayah_Li}) (48)
3348          \p{Kana}                \p{Katakana} (= \p{Script_Extensions=
3349                                    Katakana}) (NOT \p{Block=Katakana}) (352)
3350        X \p{Kana_Ext_A}          \p{Kana_Extended_A} (= \p{Block=
3351                                    Kana_Extended_A}) (48)
3352        X \p{Kana_Extended_A}     \p{Block=Kana_Extended_A} (Short:
3353                                    \p{InKanaExtA}) (48)
3354        X \p{Kana_Sup}            \p{Kana_Supplement} (= \p{Block=
3355                                    Kana_Supplement}) (256)
3356        X \p{Kana_Supplement}     \p{Block=Kana_Supplement} (Short:
3357                                    \p{InKanaSup}) (256)
3358        X \p{Kanbun}              \p{Block=Kanbun} (16)
3359        X \p{Kangxi}              \p{Kangxi_Radicals} (= \p{Block=
3360                                    Kangxi_Radicals}) (224)
3361        X \p{Kangxi_Radicals}     \p{Block=Kangxi_Radicals} (Short:
3362                                    \p{InKangxi}) (224)
3363          \p{Kannada}             \p{Script_Extensions=Kannada} (Short:
3364                                    \p{Knda}; NOT \p{Block=Kannada}) (100)
3365          \p{Katakana}            \p{Script_Extensions=Katakana} (Short:
3366                                    \p{Kana}; NOT \p{Block=Katakana}) (352)
3367        X \p{Katakana_Ext}        \p{Katakana_Phonetic_Extensions} (=
3368                                    \p{Block=Katakana_Phonetic_Extensions})
3369                                    (16)
3370        X \p{Katakana_Phonetic_Extensions} \p{Block=
3371                                    Katakana_Phonetic_Extensions} (Short:
3372                                    \p{InKatakanaExt}) (16)
3373          \p{Kayah_Li}            \p{Script_Extensions=Kayah_Li} (Short:
3374                                    \p{Kali}) (48)
3375          \p{Khar}                \p{Kharoshthi} (= \p{Script_Extensions=
3376                                    Kharoshthi}) (NOT \p{Block=Kharoshthi})
3377                                    (65)
3378          \p{Kharoshthi}          \p{Script_Extensions=Kharoshthi} (Short:
3379                                    \p{Khar}; NOT \p{Block=Kharoshthi}) (65)
3380          \p{Khmer}               \p{Script_Extensions=Khmer} (Short:
3381                                    \p{Khmr}; NOT \p{Block=Khmer}) (146)
3382        X \p{Khmer_Symbols}       \p{Block=Khmer_Symbols} (32)
3383          \p{Khmr}                \p{Khmer} (= \p{Script_Extensions=Khmer})
3384                                    (NOT \p{Block=Khmer}) (146)
3385          \p{Khoj}                \p{Khojki} (= \p{Script_Extensions=
3386                                    Khojki}) (NOT \p{Block=Khojki}) (72)
3387          \p{Khojki}              \p{Script_Extensions=Khojki} (Short:
3388                                    \p{Khoj}; NOT \p{Block=Khojki}) (72)
3389          \p{Khudawadi}           \p{Script_Extensions=Khudawadi} (Short:
3390                                    \p{Sind}; NOT \p{Block=Khudawadi}) (81)
3391          \p{Knda}                \p{Kannada} (= \p{Script_Extensions=
3392                                    Kannada}) (NOT \p{Block=Kannada}) (100)
3393          \p{Kthi}                \p{Kaithi} (= \p{Script_Extensions=
3394                                    Kaithi}) (NOT \p{Block=Kaithi}) (86)
3395          \p{L} \pL               \p{Letter} (= \p{General_Category=Letter})
3396                                    (125_093)
3397        X \p{L&}                  \p{Cased_Letter} (= \p{General_Category=
3398                                    Cased_Letter}) (3796)
3399        X \p{L_}                  \p{Cased_Letter} (= \p{General_Category=
3400                                    Cased_Letter}) Note the trailing '_'
3401                                    matters in spite of loose matching
3402                                    rules. (3796)
3403          \p{Lana}                \p{Tai_Tham} (= \p{Script_Extensions=
3404                                    Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
3405          \p{Lao}                 \p{Script_Extensions=Lao} (NOT \p{Block=
3406                                    Lao}) (67)
3407          \p{Laoo}                \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
3408                                    \p{Block=Lao}) (67)
3409          \p{Latin}               \p{Script_Extensions=Latin} (Short:
3410                                    \p{Latn}) (1370)
3411        X \p{Latin_1}             \p{Latin_1_Supplement} (= \p{Block=
3412                                    Latin_1_Supplement}) (128)
3413        X \p{Latin_1_Sup}         \p{Latin_1_Supplement} (= \p{Block=
3414                                    Latin_1_Supplement}) (128)
3415        X \p{Latin_1_Supplement}  \p{Block=Latin_1_Supplement} (Short:
3416                                    \p{InLatin1}) (128)
3417        X \p{Latin_Ext_A}         \p{Latin_Extended_A} (= \p{Block=
3418                                    Latin_Extended_A}) (128)
3419        X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
3420                                    \p{Block=Latin_Extended_Additional})
3421                                    (256)
3422        X \p{Latin_Ext_B}         \p{Latin_Extended_B} (= \p{Block=
3423                                    Latin_Extended_B}) (208)
3424        X \p{Latin_Ext_C}         \p{Latin_Extended_C} (= \p{Block=
3425                                    Latin_Extended_C}) (32)
3426        X \p{Latin_Ext_D}         \p{Latin_Extended_D} (= \p{Block=
3427                                    Latin_Extended_D}) (224)
3428        X \p{Latin_Ext_E}         \p{Latin_Extended_E} (= \p{Block=
3429                                    Latin_Extended_E}) (64)
3430        X \p{Latin_Extended_A}    \p{Block=Latin_Extended_A} (Short:
3431                                    \p{InLatinExtA}) (128)
3432        X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
3433                                    (Short: \p{InLatinExtAdditional}) (256)
3434        X \p{Latin_Extended_B}    \p{Block=Latin_Extended_B} (Short:
3435                                    \p{InLatinExtB}) (208)
3436        X \p{Latin_Extended_C}    \p{Block=Latin_Extended_C} (Short:
3437                                    \p{InLatinExtC}) (32)
3438        X \p{Latin_Extended_D}    \p{Block=Latin_Extended_D} (Short:
3439                                    \p{InLatinExtD}) (224)
3440        X \p{Latin_Extended_E}    \p{Block=Latin_Extended_E} (Short:
3441                                    \p{InLatinExtE}) (64)
3442          \p{Latn}                \p{Latin} (= \p{Script_Extensions=Latin})
3443                                    (1370)
3444          \p{Lb: *}               \p{Line_Break: *}
3445          \p{LC}                  \p{Cased_Letter} (= \p{General_Category=
3446                                    Cased_Letter}) (3796)
3447          \p{Lepc}                \p{Lepcha} (= \p{Script_Extensions=
3448                                    Lepcha}) (NOT \p{Block=Lepcha}) (74)
3449          \p{Lepcha}              \p{Script_Extensions=Lepcha} (Short:
3450                                    \p{Lepc}; NOT \p{Block=Lepcha}) (74)
3451          \p{Letter}              \p{General_Category=Letter} (Short: \p{L})
3452                                    (125_093)
3453          \p{Letter_Number}       \p{General_Category=Letter_Number} (Short:
3454                                    \p{Nl}) (236)
3455        X \p{Letterlike_Symbols}  \p{Block=Letterlike_Symbols} (80)
3456          \p{Limb}                \p{Limbu} (= \p{Script_Extensions=Limbu})
3457                                    (NOT \p{Block=Limbu}) (69)
3458          \p{Limbu}               \p{Script_Extensions=Limbu} (Short:
3459                                    \p{Limb}; NOT \p{Block=Limbu}) (69)
3460          \p{Lina}                \p{Linear_A} (= \p{Script_Extensions=
3461                                    Linear_A}) (NOT \p{Block=Linear_A}) (386)
3462          \p{Linb}                \p{Linear_B} (= \p{Script_Extensions=
3463                                    Linear_B}) (268)
3464          \p{Line_Break: AI}      \p{Line_Break=Ambiguous} (707)
3465          \p{Line_Break: AL}      \p{Line_Break=Alphabetic} (19_690)
3466          \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (19_690: [#&*<=>\@A-
3467                                    Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
3468                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
3469                                    U+02CE..02CF, U+02D1..02D7, U+02DC,
3470                                    U+02DE ...)
3471          \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-
3472                                    \xa8\xaa\xb2-\xb3\xb6-\xba\xbc-
3473                                    \xbe\xd7\xf7], U+02C7, U+02C9..02CB,
3474                                    U+02CD, U+02D0, U+02D8..02DB ...)
3475          \p{Line_Break: B2}      \p{Line_Break=Break_Both} (3)
3476          \p{Line_Break: BA}      \p{Line_Break=Break_After} (232)
3477          \p{Line_Break: BB}      \p{Line_Break=Break_Before} (42)
3478          \p{Line_Break: BK}      \p{Line_Break=Mandatory_Break} (4)
3479          \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (232: [\t\|\xad],
3480                                    U+058A, U+05BE, U+0964..0965,
3481                                    U+0E5A..0E5B, U+0F0B ...)
3482          \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (42: [\xb4],
3483                                    U+02C8, U+02CC, U+02DF, U+0F01..0F04,
3484                                    U+0F06..0F07 ...)
3485          \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
3486                                    U+2E3A..2E3B)
3487          \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
3488          \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
3489          \p{Line_Break: CB}      \p{Line_Break=Contingent_Break} (1)
3490          \p{Line_Break: CJ}      \p{Line_Break=
3491                                    Conditional_Japanese_Starter} (51)
3492          \p{Line_Break: CL}      \p{Line_Break=Close_Punctuation} (90)
3493          \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
3494          \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (90: [\}],
3495                                    U+0F3B, U+0F3D, U+169C, U+2046, U+207E
3496                                    ...)
3497          \p{Line_Break: CM}      \p{Line_Break=Combining_Mark} (2170)
3498          \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2170:
3499                                    [^\t\n\cK\f\r\x20-\x7e\x85\xa0-\xff],
3500                                    U+0300..034E, U+0350..035B,
3501                                    U+0363..036F, U+0483..0489, U+0591..05BD
3502                                    ...)
3503          \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (734:
3504                                    U+0E01..0E3A, U+0E40..0E4E,
3505                                    U+0E81..0E82, U+0E84, U+0E87..0E88,
3506                                    U+0E8A ...)
3507          \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
3508                                    (51: U+3041, U+3043, U+3045, U+3047,
3509                                    U+3049, U+3063 ...)
3510          \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
3511          \p{Line_Break: CP}      \p{Line_Break=Close_Parenthesis} (2)
3512          \p{Line_Break: CR}      \p{Line_Break=Carriage_Return} (1)
3513          \p{Line_Break: E_Base}  (Short: \p{Lb=EB}) (102: U+261D, U+26F9,
3514                                    U+270A..270D, U+1F385, U+1F3C2..1F3C4,
3515                                    U+1F3C7 ...)
3516          \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
3517          \p{Line_Break: EB}      \p{Line_Break=E_Base} (102)
3518          \p{Line_Break: EM}      \p{Line_Break=E_Modifier} (5)
3519          \p{Line_Break: EX}      \p{Line_Break=Exclamation} (37)
3520          \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37: [!?], U+05C6,
3521                                    U+061B, U+061E..061F, U+06D4, U+07F9 ...)
3522          \p{Line_Break: GL}      \p{Line_Break=Glue} (18)
3523          \p{Line_Break: Glue}    (Short: \p{Lb=GL}) (18: [\xa0], U+034F,
3524                                    U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
3525          \p{Line_Break: H2}      (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
3526                                    U+AC38, U+AC54, U+AC70, U+AC8C ...)
3527          \p{Line_Break: H3}      (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
3528                                    U+AC1D..AC37, U+AC39..AC53,
3529                                    U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
3530                                    ...)
3531          \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (74:
3532                                    U+05D0..05EA, U+05F0..05F2, U+FB1D,
3533                                    U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
3534                                    ...)
3535          \p{Line_Break: HL}      \p{Line_Break=Hebrew_Letter} (74)
3536          \p{Line_Break: HY}      \p{Line_Break=Hyphen} (1)
3537          \p{Line_Break: Hyphen}  (Short: \p{Lb=HY}) (1: [\-])
3538          \p{Line_Break: ID}      \p{Line_Break=Ideographic} (172_784)
3539          \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_784:
3540                                    U+231A..231B, U+23F0..23F3,
3541                                    U+2600..2603, U+2614..2615, U+2618,
3542                                    U+261A..261C ...)
3543          \p{Line_Break: IN}      \p{Line_Break=Inseparable} (6)
3544          \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
3545                                    U+037E, U+0589, U+060C..060D, U+07F8,
3546                                    U+2044 ...)
3547          \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
3548                                    U+22EF, U+FE19, U+10AF6)
3549          \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
3550          \p{Line_Break: IS}      \p{Line_Break=Infix_Numeric} (13)
3551          \p{Line_Break: JL}      (Short: \p{Lb=JL}) (125: U+1100..115F,
3552                                    U+A960..A97C)
3553          \p{Line_Break: JT}      (Short: \p{Lb=JT}) (137: U+11A8..11FF,
3554                                    U+D7CB..D7FB)
3555          \p{Line_Break: JV}      (Short: \p{Lb=JV}) (95: U+1160..11A7,
3556                                    U+D7B0..D7C6)
3557          \p{Line_Break: LF}      \p{Line_Break=Line_Feed} (1)
3558          \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
3559          \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
3560                                    U+2028..2029)
3561          \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
3562          \p{Line_Break: NL}      \p{Line_Break=Next_Line} (1)
3563          \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (31: U+17D6,
3564                                    U+203C..203D, U+2047..2049, U+3005,
3565                                    U+301C, U+303B..303C ...)
3566          \p{Line_Break: NS}      \p{Line_Break=Nonstarter} (31)
3567          \p{Line_Break: NU}      \p{Line_Break=Numeric} (582)
3568          \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (582: [0-9],
3569                                    U+0660..0669, U+066B..066C,
3570                                    U+06F0..06F9, U+07C0..07C9, U+0966..096F
3571                                    ...)
3572          \p{Line_Break: OP}      \p{Line_Break=Open_Punctuation} (87)
3573          \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (87:
3574                                    [\(\[\{\xa1\xbf], U+0F3A, U+0F3C,
3575                                    U+169B, U+201A, U+201E ...)
3576          \p{Line_Break: PO}      \p{Line_Break=Postfix_Numeric} (30)
3577          \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (30:
3578                                    [\%\xa2\xb0], U+0609..060B, U+066A,
3579                                    U+09F2..09F3, U+09F9, U+0D79 ...)
3580          \p{Line_Break: PR}      \p{Line_Break=Prefix_Numeric} (65)
3581          \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (65: [\$+\\\xa3-
3582                                    \xa5\xb1], U+058F, U+09FB, U+0AF1,
3583                                    U+0BF9, U+0E3F ...)
3584          \p{Line_Break: QU}      \p{Line_Break=Quotation} (39)
3585          \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
3586                                    U+2018..2019, U+201B..201D, U+201F,
3587                                    U+2039..203A, U+275B..2760 ...)
3588          \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
3589                                    U+1F1E6..1F1FF)
3590          \p{Line_Break: RI}      \p{Line_Break=Regional_Indicator} (26)
3591          \p{Line_Break: SA}      \p{Line_Break=Complex_Context} (734)
3592        D \p{Line_Break: SG}      \p{Line_Break=Surrogate} (2048)
3593          \p{Line_Break: SP}      \p{Line_Break=Space} (1)
3594          \p{Line_Break: Space}   (Short: \p{Lb=SP}) (1: [\x20])
3595        D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
3596                                    should never appear in well-formed text,
3597                                    and therefore shouldn't be the basis for
3598                                    line breaking (Short: \p{Lb=SG}) (2048:
3599                                    U+D800..DFFF)
3600          \p{Line_Break: SY}      \p{Line_Break=Break_Symbols} (1)
3601          \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (902_900 plus all
3602                                    above-Unicode code points: U+0378..0379,
3603                                    U+0380..0383, U+038B, U+038D, U+03A2,
3604                                    U+0530 ...)
3605          \p{Line_Break: WJ}      \p{Line_Break=Word_Joiner} (2)
3606          \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
3607          \p{Line_Break: XX}      \p{Line_Break=Unknown} (902_900 plus all
3608                                    above-Unicode code points)
3609          \p{Line_Break: ZW}      \p{Line_Break=ZWSpace} (1)
3610          \p{Line_Break: ZWJ}     (Short: \p{Lb=ZWJ}) (1: U+200D)
3611          \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
3612          \p{Line_Separator}      \p{General_Category=Line_Separator}
3613                                    (Short: \p{Zl}) (1)
3614          \p{Linear_A}            \p{Script_Extensions=Linear_A} (Short:
3615                                    \p{Lina}; NOT \p{Block=Linear_A}) (386)
3616          \p{Linear_B}            \p{Script_Extensions=Linear_B} (Short:
3617                                    \p{Linb}) (268)
3618        X \p{Linear_B_Ideograms}  \p{Block=Linear_B_Ideograms} (128)
3619        X \p{Linear_B_Syllabary}  \p{Block=Linear_B_Syllabary} (128)
3620          \p{Lisu}                \p{Script_Extensions=Lisu} (48)
3621          \p{Ll}                  \p{Lowercase_Letter} (=
3622                                    \p{General_Category=Lowercase_Letter})
3623                                    (/i= General_Category=Cased_Letter)
3624                                    (2063)
3625          \p{Lm}                  \p{Modifier_Letter} (=
3626                                    \p{General_Category=Modifier_Letter})
3627                                    (250)
3628          \p{Lo}                  \p{Other_Letter} (= \p{General_Category=
3629                                    Other_Letter}) (121_047)
3630          \p{LOE}                 \p{Logical_Order_Exception} (=
3631                                    \p{Logical_Order_Exception=Y}) (19)
3632          \p{LOE: *}              \p{Logical_Order_Exception: *}
3633          \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
3634                                    \p{LOE}) (19)
3635          \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
3636                                    (1_114_093 plus all above-Unicode code
3637                                    points: U+0000..0E3F, U+0E45..0EBF,
3638                                    U+0EC5..19B4, U+19B8..19B9,
3639                                    U+19BB..AAB4, U+AAB7..AAB8 ...)
3640          \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
3641                                    U+0E40..0E44, U+0EC0..0EC4,
3642                                    U+19B5..19B7, U+19BA, U+AAB5..AAB6,
3643                                    U+AAB9 ...)
3644        X \p{Low_Surrogates}      \p{Block=Low_Surrogates} (1024)
3645          \p{Lower}               \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3646                                    Cased=Yes) (2252)
3647          \p{Lower: *}            \p{Lowercase: *}
3648          \p{Lowercase}           \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3649                                    Cased=Yes) (2252)
3650          \p{Lowercase: N*}       (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
3651                                    No) (1_111_860 plus all above-Unicode
3652                                    code points: [\x00-
3653                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
3654                                    Z\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
3655                                    \xb4\xb6-\xb9\xbb-\xde\xf7], U+0100,
3656                                    U+0102, U+0104, U+0106, U+0108 ...)
3657          \p{Lowercase: Y*}       (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
3658                                    Yes) (2252: [a-z\xaa\xb5\xba\xdf-
3659                                    \xf6\xf8-\xff], U+0101, U+0103, U+0105,
3660                                    U+0107, U+0109 ...)
3661          \p{Lowercase_Letter}    \p{General_Category=Lowercase_Letter}
3662                                    (Short: \p{Ll}; /i= General_Category=
3663                                    Cased_Letter) (2063)
3664          \p{Lt}                  \p{Titlecase_Letter} (=
3665                                    \p{General_Category=Titlecase_Letter})
3666                                    (/i= General_Category=Cased_Letter) (31)
3667          \p{Lu}                  \p{Uppercase_Letter} (=
3668                                    \p{General_Category=Uppercase_Letter})
3669                                    (/i= General_Category=Cased_Letter)
3670                                    (1702)
3671          \p{Lyci}                \p{Lycian} (= \p{Script_Extensions=
3672                                    Lycian}) (NOT \p{Block=Lycian}) (29)
3673          \p{Lycian}              \p{Script_Extensions=Lycian} (Short:
3674                                    \p{Lyci}; NOT \p{Block=Lycian}) (29)
3675          \p{Lydi}                \p{Lydian} (= \p{Script_Extensions=
3676                                    Lydian}) (NOT \p{Block=Lydian}) (27)
3677          \p{Lydian}              \p{Script_Extensions=Lydian} (Short:
3678                                    \p{Lydi}; NOT \p{Block=Lydian}) (27)
3679          \p{M} \pM               \p{Mark} (= \p{General_Category=Mark})
3680                                    (2177)
3681          \p{Mahajani}            \p{Script_Extensions=Mahajani} (Short:
3682                                    \p{Mahj}; NOT \p{Block=Mahajani}) (61)
3683          \p{Mahj}                \p{Mahajani} (= \p{Script_Extensions=
3684                                    Mahajani}) (NOT \p{Block=Mahajani}) (61)
3685        X \p{Mahjong}             \p{Mahjong_Tiles} (= \p{Block=
3686                                    Mahjong_Tiles}) (48)
3687        X \p{Mahjong_Tiles}       \p{Block=Mahjong_Tiles} (Short:
3688                                    \p{InMahjong}) (48)
3689          \p{Malayalam}           \p{Script_Extensions=Malayalam} (Short:
3690                                    \p{Mlym}; NOT \p{Block=Malayalam}) (122)
3691          \p{Mand}                \p{Mandaic} (= \p{Script_Extensions=
3692                                    Mandaic}) (NOT \p{Block=Mandaic}) (30)
3693          \p{Mandaic}             \p{Script_Extensions=Mandaic} (Short:
3694                                    \p{Mand}; NOT \p{Block=Mandaic}) (30)
3695          \p{Mani}                \p{Manichaean} (= \p{Script_Extensions=
3696                                    Manichaean}) (NOT \p{Block=Manichaean})
3697                                    (52)
3698          \p{Manichaean}          \p{Script_Extensions=Manichaean} (Short:
3699                                    \p{Mani}; NOT \p{Block=Manichaean}) (52)
3700          \p{Marc}                \p{Marchen} (= \p{Script_Extensions=
3701                                    Marchen}) (NOT \p{Block=Marchen}) (68)
3702          \p{Marchen}             \p{Script_Extensions=Marchen} (Short:
3703                                    \p{Marc}; NOT \p{Block=Marchen}) (68)
3704          \p{Mark}                \p{General_Category=Mark} (Short: \p{M})
3705                                    (2177)
3706          \p{Masaram_Gondi}       \p{Script_Extensions=Masaram_Gondi}
3707                                    (Short: \p{Gonm}; NOT \p{Block=
3708                                    Masaram_Gondi}) (75)
3709          \p{Math}                \p{Math=Y} (2310)
3710          \p{Math: N*}            (Single: \P{Math}) (1_111_802 plus all
3711                                    above-Unicode code points: [\x00-
3712                                    \x20!\"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-
3713                                    Z\[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
3714                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
3715                                    U+03D3..03D4, U+03D6..03EF,
3716                                    U+03F2..03F3, U+03F7..0605 ...)
3717          \p{Math: Y*}            (Single: \p{Math}) (2310: [+<=
3718                                    >\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
3719                                    U+03D5, U+03F0..03F1, U+03F4..03F6,
3720                                    U+0606..0608 ...)
3721        X \p{Math_Alphanum}       \p{Mathematical_Alphanumeric_Symbols} (=
3722                                    \p{Block=
3723                                    Mathematical_Alphanumeric_Symbols})
3724                                    (1024)
3725        X \p{Math_Operators}      \p{Mathematical_Operators} (= \p{Block=
3726                                    Mathematical_Operators}) (256)
3727          \p{Math_Symbol}         \p{General_Category=Math_Symbol} (Short:
3728                                    \p{Sm}) (948)
3729        X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
3730                                    Mathematical_Alphanumeric_Symbols}
3731                                    (Short: \p{InMathAlphanum}) (1024)
3732        X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
3733                                    (Short: \p{InMathOperators}) (256)
3734          \p{Mc}                  \p{Spacing_Mark} (= \p{General_Category=
3735                                    Spacing_Mark}) (401)
3736          \p{Me}                  \p{Enclosing_Mark} (= \p{General_Category=
3737                                    Enclosing_Mark}) (13)
3738          \p{Meetei_Mayek}        \p{Script_Extensions=Meetei_Mayek} (Short:
3739                                    \p{Mtei}; NOT \p{Block=Meetei_Mayek})
3740                                    (79)
3741        X \p{Meetei_Mayek_Ext}    \p{Meetei_Mayek_Extensions} (= \p{Block=
3742                                    Meetei_Mayek_Extensions}) (32)
3743        X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
3744                                    (Short: \p{InMeeteiMayekExt}) (32)
3745          \p{Mend}                \p{Mende_Kikakui} (= \p{Script_Extensions=
3746                                    Mende_Kikakui}) (NOT \p{Block=
3747                                    Mende_Kikakui}) (213)
3748          \p{Mende_Kikakui}       \p{Script_Extensions=Mende_Kikakui}
3749                                    (Short: \p{Mend}; NOT \p{Block=
3750                                    Mende_Kikakui}) (213)
3751          \p{Merc}                \p{Meroitic_Cursive} (=
3752                                    \p{Script_Extensions=Meroitic_Cursive})
3753                                    (NOT \p{Block=Meroitic_Cursive}) (90)
3754          \p{Mero}                \p{Meroitic_Hieroglyphs} (=
3755                                    \p{Script_Extensions=
3756                                    Meroitic_Hieroglyphs}) (32)
3757          \p{Meroitic_Cursive}    \p{Script_Extensions=Meroitic_Cursive}
3758                                    (Short: \p{Merc}; NOT \p{Block=
3759                                    Meroitic_Cursive}) (90)
3760          \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
3761                                    Meroitic_Hieroglyphs} (Short: \p{Mero})
3762                                    (32)
3763          \p{Miao}                \p{Script_Extensions=Miao} (NOT \p{Block=
3764                                    Miao}) (133)
3765        X \p{Misc_Arrows}         \p{Miscellaneous_Symbols_And_Arrows} (=
3766                                    \p{Block=
3767                                    Miscellaneous_Symbols_And_Arrows}) (256)
3768        X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
3769                                    (= \p{Block=
3770                                    Miscellaneous_Mathematical_Symbols_A})
3771                                    (48)
3772        X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
3773                                    (= \p{Block=
3774                                    Miscellaneous_Mathematical_Symbols_B})
3775                                    (128)
3776        X \p{Misc_Pictographs}    \p{Miscellaneous_Symbols_And_Pictographs}
3777                                    (= \p{Block=
3778                                    Miscellaneous_Symbols_And_Pictographs})
3779                                    (768)
3780        X \p{Misc_Symbols}        \p{Miscellaneous_Symbols} (= \p{Block=
3781                                    Miscellaneous_Symbols}) (256)
3782        X \p{Misc_Technical}      \p{Miscellaneous_Technical} (= \p{Block=
3783                                    Miscellaneous_Technical}) (256)
3784        X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
3785                                    Miscellaneous_Mathematical_Symbols_A}
3786                                    (Short: \p{InMiscMathSymbolsA}) (48)
3787        X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
3788                                    Miscellaneous_Mathematical_Symbols_B}
3789                                    (Short: \p{InMiscMathSymbolsB}) (128)
3790        X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
3791                                    \p{InMiscSymbols}) (256)
3792        X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
3793                                    Miscellaneous_Symbols_And_Arrows}
3794                                    (Short: \p{InMiscArrows}) (256)
3795        X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
3796                                    Miscellaneous_Symbols_And_Pictographs}
3797                                    (Short: \p{InMiscPictographs}) (768)
3798        X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
3799                                    (Short: \p{InMiscTechnical}) (256)
3800          \p{Mlym}                \p{Malayalam} (= \p{Script_Extensions=
3801                                    Malayalam}) (NOT \p{Block=Malayalam})
3802                                    (122)
3803          \p{Mn}                  \p{Nonspacing_Mark} (=
3804                                    \p{General_Category=Nonspacing_Mark})
3805                                    (1763)
3806          \p{Modi}                \p{Script_Extensions=Modi} (NOT \p{Block=
3807                                    Modi}) (89)
3808          \p{Modifier_Letter}     \p{General_Category=Modifier_Letter}
3809                                    (Short: \p{Lm}) (250)
3810        X \p{Modifier_Letters}    \p{Spacing_Modifier_Letters} (= \p{Block=
3811                                    Spacing_Modifier_Letters}) (80)
3812          \p{Modifier_Symbol}     \p{General_Category=Modifier_Symbol}
3813                                    (Short: \p{Sk}) (121)
3814        X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
3815          \p{Mong}                \p{Mongolian} (= \p{Script_Extensions=
3816                                    Mongolian}) (NOT \p{Block=Mongolian})
3817                                    (169)
3818          \p{Mongolian}           \p{Script_Extensions=Mongolian} (Short:
3819                                    \p{Mong}; NOT \p{Block=Mongolian}) (169)
3820        X \p{Mongolian_Sup}       \p{Mongolian_Supplement} (= \p{Block=
3821                                    Mongolian_Supplement}) (32)
3822        X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
3823                                    \p{InMongolianSup}) (32)
3824          \p{Mro}                 \p{Script_Extensions=Mro} (NOT \p{Block=
3825                                    Mro}) (43)
3826          \p{Mroo}                \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
3827                                    \p{Block=Mro}) (43)
3828          \p{Mtei}                \p{Meetei_Mayek} (= \p{Script_Extensions=
3829                                    Meetei_Mayek}) (NOT \p{Block=
3830                                    Meetei_Mayek}) (79)
3831          \p{Mult}                \p{Multani} (= \p{Script_Extensions=
3832                                    Multani}) (NOT \p{Block=Multani}) (48)
3833          \p{Multani}             \p{Script_Extensions=Multani} (Short:
3834                                    \p{Mult}; NOT \p{Block=Multani}) (48)
3835        X \p{Music}               \p{Musical_Symbols} (= \p{Block=
3836                                    Musical_Symbols}) (256)
3837        X \p{Musical_Symbols}     \p{Block=Musical_Symbols} (Short:
3838                                    \p{InMusic}) (256)
3839          \p{Myanmar}             \p{Script_Extensions=Myanmar} (Short:
3840                                    \p{Mymr}; NOT \p{Block=Myanmar}) (224)
3841        X \p{Myanmar_Ext_A}       \p{Myanmar_Extended_A} (= \p{Block=
3842                                    Myanmar_Extended_A}) (32)
3843        X \p{Myanmar_Ext_B}       \p{Myanmar_Extended_B} (= \p{Block=
3844                                    Myanmar_Extended_B}) (32)
3845        X \p{Myanmar_Extended_A}  \p{Block=Myanmar_Extended_A} (Short:
3846                                    \p{InMyanmarExtA}) (32)
3847        X \p{Myanmar_Extended_B}  \p{Block=Myanmar_Extended_B} (Short:
3848                                    \p{InMyanmarExtB}) (32)
3849          \p{Mymr}                \p{Myanmar} (= \p{Script_Extensions=
3850                                    Myanmar}) (NOT \p{Block=Myanmar}) (224)
3851          \p{N} \pN               \p{Number} (= \p{General_Category=Number})
3852                                    (1502)
3853          \p{Nabataean}           \p{Script_Extensions=Nabataean} (Short:
3854                                    \p{Nbat}; NOT \p{Block=Nabataean}) (40)
3855          \p{Narb}                \p{Old_North_Arabian} (=
3856                                    \p{Script_Extensions=Old_North_Arabian})
3857                                    (32)
3858        X \p{NB}                  \p{No_Block} (= \p{Block=No_Block})
3859                                    (834_096 plus all above-Unicode code
3860                                    points)
3861          \p{Nbat}                \p{Nabataean} (= \p{Script_Extensions=
3862                                    Nabataean}) (NOT \p{Block=Nabataean})
3863                                    (40)
3864          \p{NChar}               \p{Noncharacter_Code_Point} (=
3865                                    \p{Noncharacter_Code_Point=Y}) (66)
3866          \p{NChar: *}            \p{Noncharacter_Code_Point: *}
3867          \p{Nd}                  \p{XPosixDigit} (= \p{General_Category=
3868                                    Decimal_Number}) (590)
3869          \p{New_Tai_Lue}         \p{Script_Extensions=New_Tai_Lue} (Short:
3870                                    \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
3871          \p{Newa}                \p{Script_Extensions=Newa} (NOT \p{Block=
3872                                    Newa}) (92)
3873          \p{NFC_QC: *}           \p{NFC_Quick_Check: *}
3874          \p{NFC_Quick_Check: M}  \p{NFC_Quick_Check=Maybe} (110)
3875          \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110:
3876                                    U+0300..0304, U+0306..030C, U+030F,
3877                                    U+0311, U+0313..0314, U+031B ...)
3878          \p{NFC_Quick_Check: N}  \p{NFC_Quick_Check=No} (NOT
3879                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
3880                                    (1120)
3881          \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
3882                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
3883                                    (1120: U+0340..0341, U+0343..0344,
3884                                    U+0374, U+037E, U+0387, U+0958..095F ...)
3885          \p{NFC_Quick_Check: Y}  \p{NFC_Quick_Check=Yes} (NOT
3886                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
3887                                    (1_112_882 plus all above-Unicode code
3888                                    points)
3889          \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
3890                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
3891                                    (1_112_882 plus all above-Unicode code
3892                                    points: U+0000..02FF, U+0305,
3893                                    U+030D..030E, U+0310, U+0312,
3894                                    U+0315..031A ...)
3895          \p{NFD_QC: *}           \p{NFD_Quick_Check: *}
3896          \p{NFD_Quick_Check: N}  \p{NFD_Quick_Check=No} (NOT
3897                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
3898                                    (13_232)
3899          \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
3900                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
3901                                    (13_232: [\xc0-\xc5\xc7-\xcf\xd1-
3902                                    \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
3903                                    \xf6\xf9-\xfd\xff], U+0100..010F,
3904                                    U+0112..0125, U+0128..0130,
3905                                    U+0134..0137, U+0139..013E ...)
3906          \p{NFD_Quick_Check: Y}  \p{NFD_Quick_Check=Yes} (NOT
3907                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
3908                                    (1_100_880 plus all above-Unicode code
3909                                    points)
3910          \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
3911                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
3912                                    (1_100_880 plus all above-Unicode code
3913                                    points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
3914                                    \xdf\xe6\xf0\xf7-\xf8\xfe],
3915                                    U+0110..0111, U+0126..0127,
3916                                    U+0131..0133, U+0138, U+013F..0142 ...)
3917          \p{NFKC_QC: *}          \p{NFKC_Quick_Check: *}
3918          \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
3919          \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110:
3920                                    U+0300..0304, U+0306..030C, U+030F,
3921                                    U+0311, U+0313..0314, U+031B ...)
3922          \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
3923                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
3924                                    (4794)
3925          \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
3926                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
3927                                    (4794: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
3928                                    \xba\xbc-\xbe], U+0132..0133,
3929                                    U+013F..0140, U+0149, U+017F,
3930                                    U+01C4..01CC ...)
3931          \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
3932                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
3933                                    (1_109_208 plus all above-Unicode code
3934                                    points)
3935          \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
3936                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
3937                                    (1_109_208 plus all above-Unicode code
3938                                    points: [\x00-\x9f\xa1-\xa7\xa9\xab-
3939                                    \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
3940                                    U+0100..0131, U+0134..013E,
3941                                    U+0141..0148, U+014A..017E, U+0180..01C3
3942                                    ...)
3943          \p{NFKD_QC: *}          \p{NFKD_Quick_Check: *}
3944          \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
3945                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
3946                                    (16_894)
3947          \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
3948                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
3949                                    (16_894: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
3950                                    \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
3951                                    \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
3952                                    \xf6\xf9-\xfd\xff], U+0100..010F,
3953                                    U+0112..0125, U+0128..0130,
3954                                    U+0132..0137, U+0139..0140 ...)
3955          \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
3956                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
3957                                    (1_097_218 plus all above-Unicode code
3958                                    points)
3959          \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
3960                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
3961                                    (1_097_218 plus all above-Unicode code
3962                                    points: [\x00-\x9f\xa1-\xa7\xa9\xab-
3963                                    \xae\xb0-\xb1\xb6-
3964                                    \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
3965                                    \xdf\xe6\xf0\xf7-\xf8\xfe],
3966                                    U+0110..0111, U+0126..0127, U+0131,
3967                                    U+0138, U+0141..0142 ...)
3968          \p{Nko}                 \p{Script_Extensions=Nko} (NOT \p{Block=
3969                                    NKo}) (59)
3970          \p{Nkoo}                \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
3971                                    \p{Block=NKo}) (59)
3972          \p{Nl}                  \p{Letter_Number} (= \p{General_Category=
3973                                    Letter_Number}) (236)
3974          \p{No}                  \p{Other_Number} (= \p{General_Category=
3975                                    Other_Number}) (676)
3976        X \p{No_Block}            \p{Block=No_Block} (Short: \p{InNB})
3977                                    (834_096 plus all above-Unicode code
3978                                    points)
3979          \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
3980                                    \p{NChar}) (66)
3981          \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
3982                                    (1_114_046 plus all above-Unicode code
3983                                    points: U+0000..FDCF, U+FDF0..FFFD,
3984                                    U+10000..1FFFD, U+20000..2FFFD,
3985                                    U+30000..3FFFD, U+40000..4FFFD ...)
3986          \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
3987                                    (66: U+FDD0..FDEF, U+FFFE..FFFF,
3988                                    U+1FFFE..1FFFF, U+2FFFE..2FFFF,
3989                                    U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
3990          \p{Nonspacing_Mark}     \p{General_Category=Nonspacing_Mark}
3991                                    (Short: \p{Mn}) (1763)
3992          \p{Nshu}                \p{Nushu} (= \p{Script_Extensions=Nushu})
3993                                    (NOT \p{Block=Nushu}) (397)
3994          \p{Nt: *}               \p{Numeric_Type: *}
3995          \p{Number}              \p{General_Category=Number} (Short: \p{N})
3996                                    (1502)
3997        X \p{Number_Forms}        \p{Block=Number_Forms} (64)
3998          \p{Numeric_Type: De}    \p{Numeric_Type=Decimal} (590)
3999          \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (590: [0-9],
4000                                    U+0660..0669, U+06F0..06F9,
4001                                    U+07C0..07C9, U+0966..096F, U+09E6..09EF
4002                                    ...)
4003          \p{Numeric_Type: Di}    \p{Numeric_Type=Digit} (128)
4004          \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
4005                                    U+1369..1371, U+19DA, U+2070,
4006                                    U+2074..2079, U+2080..2089 ...)
4007          \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_529 plus all
4008                                    above-Unicode code points: [\x00-
4009                                    \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
4010                                    Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
4011                                    \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
4012                                    U+066A..06EF, U+06FA..07BF,
4013                                    U+07CA..0965, U+0970..09E5 ...)
4014          \p{Numeric_Type: Nu}    \p{Numeric_Type=Numeric} (865)
4015          \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (865: [\xbc-\xbe],
4016                                    U+09F4..09F9, U+0B72..0B77,
4017                                    U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
4018                                    ...)
4019        T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
4020        T \p{Numeric_Value: 0}    (Short: \p{Nv=0}) (75: [0], U+0660,
4021                                    U+06F0, U+07C0, U+0966, U+09E6 ...)
4022        T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (1: U+0D58)
4023        T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (1: U+0D59)
4024        T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (1: U+0D5A)
4025        T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (1: U+0D5B)
4026        T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (4: U+09F4, U+0B75,
4027                                    U+0D76, U+A833)
4028        T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
4029        T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (2: U+0D5C, U+2152)
4030        T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1: U+2151)
4031        T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (6: U+09F5, U+0B76,
4032                                    U+0D77, U+215B, U+A834, U+1245F)
4033        T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1: U+2150)
4034        T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (1: U+0D5D)
4035        T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (3: U+2159, U+109F7,
4036                                    U+12461)
4037        T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (4: U+09F6, U+0B77,
4038                                    U+0D78, U+A835)
4039        T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (2: U+0D5E, U+2155)
4040        T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (12: [\xbc], U+09F7,
4041                                    U+0B72, U+0D73, U+A830, U+10140 ...)
4042        T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
4043                                    U+10E7D, U+1245A, U+1245D, U+12465)
4044        T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1: U+215C)
4045        T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1: U+2156)
4046        T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
4047        T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (13: [\xbd], U+0B73,
4048                                    U+0D74, U+0F2A, U+2CFD, U+A831 ...)
4049        T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
4050        T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1: U+2157)
4051        T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1: U+215D)
4052        T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
4053                                    U+109FD, U+10E7E, U+1245B, U+1245E ...)
4054        T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (7: [\xbe], U+09F8,
4055                                    U+0B74, U+0D75, U+A832, U+10178 ...)
4056        T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1: U+2158)
4057        T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
4058                                    U+1245C)
4059        T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1: U+215E)
4060        T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
4061        T \p{Numeric_Value: 1}    (Short: \p{Nv=1}) (122: [1\xb9], U+0661,
4062                                    U+06F1, U+07C1, U+0967, U+09E7 ...)
4063        T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1: U+0F2B)
4064        T \p{Numeric_Value: 2}    (Short: \p{Nv=2}) (122: [2\xb2], U+0662,
4065                                    U+06F2, U+07C2, U+0968, U+09E8 ...)
4066        T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1: U+0F2C)
4067        T \p{Numeric_Value: 3}    (Short: \p{Nv=3}) (124: [3\xb3], U+0663,
4068                                    U+06F3, U+07C3, U+0969, U+09E9 ...)
4069        T \p{Numeric_Value: 7/2}  (Short: \p{Nv=7/2}) (1: U+0F2D)
4070        T \p{Numeric_Value: 4}    (Short: \p{Nv=4}) (116: [4], U+0664,
4071                                    U+06F4, U+07C4, U+096A, U+09EA ...)
4072        T \p{Numeric_Value: 9/2}  (Short: \p{Nv=9/2}) (1: U+0F2E)
4073        T \p{Numeric_Value: 5}    (Short: \p{Nv=5}) (114: [5], U+0665,
4074                                    U+06F5, U+07C5, U+096B, U+09EB ...)
4075        T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
4076        T \p{Numeric_Value: 6}    (Short: \p{Nv=6}) (101: [6], U+0666,
4077                                    U+06F6, U+07C6, U+096C, U+09EC ...)
4078        T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
4079        T \p{Numeric_Value: 7}    (Short: \p{Nv=7}) (100: [7], U+0667,
4080                                    U+06F7, U+07C7, U+096D, U+09ED ...)
4081        T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
4082        T \p{Numeric_Value: 8}    (Short: \p{Nv=8}) (96: [8], U+0668,
4083                                    U+06F8, U+07C8, U+096E, U+09EE ...)
4084        T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
4085        T \p{Numeric_Value: 9}    (Short: \p{Nv=9}) (100: [9], U+0669,
4086                                    U+06F9, U+07C9, U+096F, U+09EF ...)
4087        T \p{Numeric_Value: 10}   (Short: \p{Nv=10}) (54: U+0BF0, U+0D70,
4088                                    U+1372, U+2169, U+2179, U+2469 ...)
4089        T \p{Numeric_Value: 11}   (Short: \p{Nv=11}) (6: U+216A, U+217A,
4090                                    U+246A, U+247E, U+2492, U+24EB)
4091        T \p{Numeric_Value: 12}   (Short: \p{Nv=12}) (6: U+216B, U+217B,
4092                                    U+246B, U+247F, U+2493, U+24EC)
4093        T \p{Numeric_Value: 13}   (Short: \p{Nv=13}) (4: U+246C, U+2480,
4094                                    U+2494, U+24ED)
4095        T \p{Numeric_Value: 14}   (Short: \p{Nv=14}) (4: U+246D, U+2481,
4096                                    U+2495, U+24EE)
4097        T \p{Numeric_Value: 15}   (Short: \p{Nv=15}) (4: U+246E, U+2482,
4098                                    U+2496, U+24EF)
4099        T \p{Numeric_Value: 16}   (Short: \p{Nv=16}) (5: U+09F9, U+246F,
4100                                    U+2483, U+2497, U+24F0)
4101        T \p{Numeric_Value: 17}   (Short: \p{Nv=17}) (5: U+16EE, U+2470,
4102                                    U+2484, U+2498, U+24F1)
4103        T \p{Numeric_Value: 18}   (Short: \p{Nv=18}) (5: U+16EF, U+2471,
4104                                    U+2485, U+2499, U+24F2)
4105        T \p{Numeric_Value: 19}   (Short: \p{Nv=19}) (5: U+16F0, U+2472,
4106                                    U+2486, U+249A, U+24F3)
4107        T \p{Numeric_Value: 20}   (Short: \p{Nv=20}) (31: U+1373, U+2473,
4108                                    U+2487, U+249B, U+24F4, U+3039 ...)
4109        T \p{Numeric_Value: 21}   (Short: \p{Nv=21}) (1: U+3251)
4110        T \p{Numeric_Value: 22}   (Short: \p{Nv=22}) (1: U+3252)
4111        T \p{Numeric_Value: 23}   (Short: \p{Nv=23}) (1: U+3253)
4112        T \p{Numeric_Value: 24}   (Short: \p{Nv=24}) (1: U+3254)
4113        T \p{Numeric_Value: 25}   (Short: \p{Nv=25}) (1: U+3255)
4114        T \p{Numeric_Value: 26}   (Short: \p{Nv=26}) (1: U+3256)
4115        T \p{Numeric_Value: 27}   (Short: \p{Nv=27}) (1: U+3257)
4116        T \p{Numeric_Value: 28}   (Short: \p{Nv=28}) (1: U+3258)
4117        T \p{Numeric_Value: 29}   (Short: \p{Nv=29}) (1: U+3259)
4118        T \p{Numeric_Value: 30}   (Short: \p{Nv=30}) (16: U+1374, U+303A,
4119                                    U+324A, U+325A, U+5345, U+10112 ...)
4120        T \p{Numeric_Value: 31}   (Short: \p{Nv=31}) (1: U+325B)
4121        T \p{Numeric_Value: 32}   (Short: \p{Nv=32}) (1: U+325C)
4122        T \p{Numeric_Value: 33}   (Short: \p{Nv=33}) (1: U+325D)
4123        T \p{Numeric_Value: 34}   (Short: \p{Nv=34}) (1: U+325E)
4124        T \p{Numeric_Value: 35}   (Short: \p{Nv=35}) (1: U+325F)
4125        T \p{Numeric_Value: 36}   (Short: \p{Nv=36}) (1: U+32B1)
4126        T \p{Numeric_Value: 37}   (Short: \p{Nv=37}) (1: U+32B2)
4127        T \p{Numeric_Value: 38}   (Short: \p{Nv=38}) (1: U+32B3)
4128        T \p{Numeric_Value: 39}   (Short: \p{Nv=39}) (1: U+32B4)
4129        T \p{Numeric_Value: 40}   (Short: \p{Nv=40}) (16: U+1375, U+324B,
4130                                    U+32B5, U+534C, U+10113, U+102ED ...)
4131        T \p{Numeric_Value: 41}   (Short: \p{Nv=41}) (1: U+32B6)
4132        T \p{Numeric_Value: 42}   (Short: \p{Nv=42}) (1: U+32B7)
4133        T \p{Numeric_Value: 43}   (Short: \p{Nv=43}) (1: U+32B8)
4134        T \p{Numeric_Value: 44}   (Short: \p{Nv=44}) (1: U+32B9)
4135        T \p{Numeric_Value: 45}   (Short: \p{Nv=45}) (1: U+32BA)
4136        T \p{Numeric_Value: 46}   (Short: \p{Nv=46}) (1: U+32BB)
4137        T \p{Numeric_Value: 47}   (Short: \p{Nv=47}) (1: U+32BC)
4138        T \p{Numeric_Value: 48}   (Short: \p{Nv=48}) (1: U+32BD)
4139        T \p{Numeric_Value: 49}   (Short: \p{Nv=49}) (1: U+32BE)
4140        T \p{Numeric_Value: 50}   (Short: \p{Nv=50}) (27: U+1376, U+216C,
4141                                    U+217C, U+2186, U+324C, U+32BF ...)
4142        T \p{Numeric_Value: 60}   (Short: \p{Nv=60}) (11: U+1377, U+324D,
4143                                    U+10115, U+102EF, U+109CE, U+10E6E ...)
4144        T \p{Numeric_Value: 70}   (Short: \p{Nv=70}) (11: U+1378, U+324E,
4145                                    U+10116, U+102F0, U+109CF, U+10E6F ...)
4146        T \p{Numeric_Value: 80}   (Short: \p{Nv=80}) (10: U+1379, U+324F,
4147                                    U+10117, U+102F1, U+10E70, U+11062 ...)
4148        T \p{Numeric_Value: 90}   (Short: \p{Nv=90}) (10: U+137A, U+10118,
4149                                    U+102F2, U+10341, U+10E71, U+11063 ...)
4150        T \p{Numeric_Value: 100}  (Short: \p{Nv=100}) (30: U+0BF1, U+0D71,
4151                                    U+137B, U+216D, U+217D, U+4F70 ...)
4152        T \p{Numeric_Value: 200}  (Short: \p{Nv=200}) (4: U+1011A, U+102F4,
4153                                    U+109D3, U+10E73)
4154        T \p{Numeric_Value: 300}  (Short: \p{Nv=300}) (5: U+1011B, U+1016B,
4155                                    U+102F5, U+109D4, U+10E74)
4156        T \p{Numeric_Value: 400}  (Short: \p{Nv=400}) (4: U+1011C, U+102F6,
4157                                    U+109D5, U+10E75)
4158        T \p{Numeric_Value: 500}  (Short: \p{Nv=500}) (14: U+216E, U+217E,
4159                                    U+1011D, U+10145, U+1014C, U+10153 ...)
4160        T \p{Numeric_Value: 600}  (Short: \p{Nv=600}) (4: U+1011E, U+102F8,
4161                                    U+109D7, U+10E77)
4162        T \p{Numeric_Value: 700}  (Short: \p{Nv=700}) (4: U+1011F, U+102F9,
4163                                    U+109D8, U+10E78)
4164        T \p{Numeric_Value: 800}  (Short: \p{Nv=800}) (4: U+10120, U+102FA,
4165                                    U+109D9, U+10E79)
4166        T \p{Numeric_Value: 900}  (Short: \p{Nv=900}) (5: U+10121, U+102FB,
4167                                    U+1034A, U+109DA, U+10E7A)
4168        T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (20: U+0BF2, U+0D72,
4169                                    U+216F, U+217F..2180, U+4EDF, U+5343 ...)
4170        T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (2: U+10123, U+109DC)
4171        T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (2: U+10124, U+109DD)
4172        T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (2: U+10125, U+109DE)
4173        T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (6: U+2181, U+10126,
4174                                    U+10146, U+1014E, U+10172, U+109DF)
4175        T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (2: U+10127, U+109E0)
4176        T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (2: U+10128, U+109E1)
4177        T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (2: U+10129, U+109E2)
4178        T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (2: U+1012A, U+109E3)
4179        T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (9:
4180                                    U+137C, U+2182, U+4E07, U+842C, U+1012B,
4181                                    U+10155 ...)
4182        T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (2:
4183                                    U+1012C, U+109E5)
4184        T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (2:
4185                                    U+1012D, U+109E6)
4186        T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (2:
4187                                    U+1012E, U+109E7)
4188        T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (5:
4189                                    U+2187, U+1012F, U+10147, U+10156,
4190                                    U+109E8)
4191        T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (2:
4192                                    U+10130, U+109E9)
4193        T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (2:
4194                                    U+10131, U+109EA)
4195        T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (2:
4196                                    U+10132, U+109EB)
4197        T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (2:
4198                                    U+10133, U+109EC)
4199        T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (2:
4200                                    U+2188, U+109ED)
4201        T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (1:
4202                                    U+109EE)
4203        T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
4204                                    U+12432)
4205        T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
4206                                    U+109EF)
4207        T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
4208                                    U+109F0)
4209        T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
4210                                    U+12433)
4211        T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
4212                                    U+109F1)
4213        T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
4214                                    U+109F2)
4215        T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
4216                                    U+109F3)
4217        T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
4218                                    U+109F4)
4219        T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
4220                                    U+109F5)
4221        T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
4222                                    U+16B5E)
4223        T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
4224                                    (3: U+4EBF, U+5104, U+16B5F)
4225        T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
4226                                    10000000000}) (1: U+16B60)
4227        T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
4228                                    1000000000000}) (2: U+5146, U+16B61)
4229          \p{Numeric_Value: NaN}  (Short: \p{Nv=NaN}) (1_112_529 plus all
4230                                    above-Unicode code points: [\x00-
4231                                    \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
4232                                    Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
4233                                    \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
4234                                    U+066A..06EF, U+06FA..07BF,
4235                                    U+07CA..0965, U+0970..09E5 ...)
4236          \p{Nushu}               \p{Script_Extensions=Nushu} (Short:
4237                                    \p{Nshu}; NOT \p{Block=Nushu}) (397)
4238          \p{Nv: *}               \p{Numeric_Value: *}
4239        X \p{OCR}                 \p{Optical_Character_Recognition} (=
4240                                    \p{Block=Optical_Character_Recognition})
4241                                    (32)
4242          \p{Ogam}                \p{Ogham} (= \p{Script_Extensions=Ogham})
4243                                    (NOT \p{Block=Ogham}) (29)
4244          \p{Ogham}               \p{Script_Extensions=Ogham} (Short:
4245                                    \p{Ogam}; NOT \p{Block=Ogham}) (29)
4246          \p{Ol_Chiki}            \p{Script_Extensions=Ol_Chiki} (Short:
4247                                    \p{Olck}) (48)
4248          \p{Olck}                \p{Ol_Chiki} (= \p{Script_Extensions=
4249                                    Ol_Chiki}) (48)
4250          \p{Old_Hungarian}       \p{Script_Extensions=Old_Hungarian}
4251                                    (Short: \p{Hung}; NOT \p{Block=
4252                                    Old_Hungarian}) (108)
4253          \p{Old_Italic}          \p{Script_Extensions=Old_Italic} (Short:
4254                                    \p{Ital}; NOT \p{Block=Old_Italic}) (39)
4255          \p{Old_North_Arabian}   \p{Script_Extensions=Old_North_Arabian}
4256                                    (Short: \p{Narb}) (32)
4257          \p{Old_Permic}          \p{Script_Extensions=Old_Permic} (Short:
4258                                    \p{Perm}; NOT \p{Block=Old_Permic}) (44)
4259          \p{Old_Persian}         \p{Script_Extensions=Old_Persian} (Short:
4260                                    \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
4261          \p{Old_South_Arabian}   \p{Script_Extensions=Old_South_Arabian}
4262                                    (Short: \p{Sarb}) (32)
4263          \p{Old_Turkic}          \p{Script_Extensions=Old_Turkic} (Short:
4264                                    \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
4265          \p{Open_Punctuation}    \p{General_Category=Open_Punctuation}
4266                                    (Short: \p{Ps}) (75)
4267        X \p{Optical_Character_Recognition} \p{Block=
4268                                    Optical_Character_Recognition} (Short:
4269                                    \p{InOCR}) (32)
4270          \p{Oriya}               \p{Script_Extensions=Oriya} (Short:
4271                                    \p{Orya}; NOT \p{Block=Oriya}) (94)
4272          \p{Orkh}                \p{Old_Turkic} (= \p{Script_Extensions=
4273                                    Old_Turkic}) (NOT \p{Block=Old_Turkic})
4274                                    (73)
4275        X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
4276          \p{Orya}                \p{Oriya} (= \p{Script_Extensions=Oriya})
4277                                    (NOT \p{Block=Oriya}) (94)
4278          \p{Osage}               \p{Script_Extensions=Osage} (Short:
4279                                    \p{Osge}; NOT \p{Block=Osage}) (72)
4280          \p{Osge}                \p{Osage} (= \p{Script_Extensions=Osage})
4281                                    (NOT \p{Block=Osage}) (72)
4282          \p{Osma}                \p{Osmanya} (= \p{Script_Extensions=
4283                                    Osmanya}) (NOT \p{Block=Osmanya}) (40)
4284          \p{Osmanya}             \p{Script_Extensions=Osmanya} (Short:
4285                                    \p{Osma}; NOT \p{Block=Osmanya}) (40)
4286          \p{Other}               \p{General_Category=Other} (Short: \p{C})
4287                                    (977_573 plus all above-Unicode code
4288                                    points)
4289          \p{Other_Letter}        \p{General_Category=Other_Letter} (Short:
4290                                    \p{Lo}) (121_047)
4291          \p{Other_Number}        \p{General_Category=Other_Number} (Short:
4292                                    \p{No}) (676)
4293          \p{Other_Punctuation}   \p{General_Category=Other_Punctuation}
4294                                    (Short: \p{Po}) (566)
4295          \p{Other_Symbol}        \p{General_Category=Other_Symbol} (Short:
4296                                    \p{So}) (5855)
4297          \p{P} \pP               \p{Punct} (= \p{General_Category=
4298                                    Punctuation}) (NOT
4299                                    \p{General_Punctuation}) (770)
4300          \p{Pahawh_Hmong}        \p{Script_Extensions=Pahawh_Hmong} (Short:
4301                                    \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
4302                                    (127)
4303          \p{Palm}                \p{Palmyrene} (= \p{Script_Extensions=
4304                                    Palmyrene}) (32)
4305          \p{Palmyrene}           \p{Script_Extensions=Palmyrene} (Short:
4306                                    \p{Palm}) (32)
4307          \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
4308                                    (Short: \p{Zp}) (1)
4309          \p{Pat_Syn}             \p{Pattern_Syntax} (= \p{Pattern_Syntax=
4310                                    Y}) (2760)
4311          \p{Pat_Syn: *}          \p{Pattern_Syntax: *}
4312          \p{Pat_WS}              \p{Pattern_White_Space} (=
4313                                    \p{Pattern_White_Space=Y}) (11)
4314          \p{Pat_WS: *}           \p{Pattern_White_Space: *}
4315          \p{Pattern_Syntax}      \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
4316                                    (2760)
4317          \p{Pattern_Syntax: N*}  (Short: \p{PatSyn=N}, \P{PatSyn})
4318                                    (1_111_352 plus all above-Unicode code
4319                                    points: [\x00-\x200-9A-Z_a-z\x7f-
4320                                    \xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb7-
4321                                    \xba\xbc-\xbe\xc0-\xd6\xd8-\xf6\xf8-
4322                                    \xff], U+0100..200F, U+2028..202F,
4323                                    U+203F..2040, U+2054, U+205F..218F ...)
4324          \p{Pattern_Syntax: Y*}  (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
4325                                    [!\"#\$\%&\'\(\)*+,\-.\/:;<=
4326                                    >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
4327                                    \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
4328                                    U+2010..2027, U+2030..203E,
4329                                    U+2041..2053, U+2055..205E, U+2190..245F
4330                                    ...)
4331          \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
4332                                    \p{PatWS}) (11)
4333          \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
4334                                    (1_114_101 plus all above-Unicode code
4335                                    points: [^\t\n\cK\f\r\x20\x85],
4336                                    U+0100..200D, U+2010..2027,
4337                                    U+202A..infinity)
4338          \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
4339                                    [\t\n\cK\f\r\x20\x85], U+200E..200F,
4340                                    U+2028..2029)
4341          \p{Pau_Cin_Hau}         \p{Script_Extensions=Pau_Cin_Hau} (Short:
4342                                    \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
4343          \p{Pauc}                \p{Pau_Cin_Hau} (= \p{Script_Extensions=
4344                                    Pau_Cin_Hau}) (NOT \p{Block=
4345                                    Pau_Cin_Hau}) (57)
4346          \p{Pc}                  \p{Connector_Punctuation} (=
4347                                    \p{General_Category=
4348                                    Connector_Punctuation}) (10)
4349          \p{PCM}                 \p{Prepended_Concatenation_Mark} (=
4350                                    \p{Prepended_Concatenation_Mark=Y}) (10)
4351          \p{PCM: *}              \p{Prepended_Concatenation_Mark: *}
4352          \p{Pd}                  \p{Dash_Punctuation} (=
4353                                    \p{General_Category=Dash_Punctuation})
4354                                    (24)
4355          \p{Pe}                  \p{Close_Punctuation} (=
4356                                    \p{General_Category=Close_Punctuation})
4357                                    (73)
4358          \p{PerlSpace}           \p{PosixSpace} (6)
4359          \p{PerlWord}            \p{PosixWord} (63)
4360          \p{Perm}                \p{Old_Permic} (= \p{Script_Extensions=
4361                                    Old_Permic}) (NOT \p{Block=Old_Permic})
4362                                    (44)
4363          \p{Pf}                  \p{Final_Punctuation} (=
4364                                    \p{General_Category=Final_Punctuation})
4365                                    (10)
4366          \p{Phag}                \p{Phags_Pa} (= \p{Script_Extensions=
4367                                    Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
4368          \p{Phags_Pa}            \p{Script_Extensions=Phags_Pa} (Short:
4369                                    \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
4370        X \p{Phaistos}            \p{Phaistos_Disc} (= \p{Block=
4371                                    Phaistos_Disc}) (48)
4372        X \p{Phaistos_Disc}       \p{Block=Phaistos_Disc} (Short:
4373                                    \p{InPhaistos}) (48)
4374          \p{Phli}                \p{Inscriptional_Pahlavi} (=
4375                                    \p{Script_Extensions=
4376                                    Inscriptional_Pahlavi}) (NOT \p{Block=
4377                                    Inscriptional_Pahlavi}) (27)
4378          \p{Phlp}                \p{Psalter_Pahlavi} (=
4379                                    \p{Script_Extensions=Psalter_Pahlavi})
4380                                    (NOT \p{Block=Psalter_Pahlavi}) (30)
4381          \p{Phnx}                \p{Phoenician} (= \p{Script_Extensions=
4382                                    Phoenician}) (NOT \p{Block=Phoenician})
4383                                    (29)
4384          \p{Phoenician}          \p{Script_Extensions=Phoenician} (Short:
4385                                    \p{Phnx}; NOT \p{Block=Phoenician}) (29)
4386        X \p{Phonetic_Ext}        \p{Phonetic_Extensions} (= \p{Block=
4387                                    Phonetic_Extensions}) (128)
4388        X \p{Phonetic_Ext_Sup}    \p{Phonetic_Extensions_Supplement} (=
4389                                    \p{Block=
4390                                    Phonetic_Extensions_Supplement}) (64)
4391        X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
4392                                    \p{InPhoneticExt}) (128)
4393        X \p{Phonetic_Extensions_Supplement} \p{Block=
4394                                    Phonetic_Extensions_Supplement} (Short:
4395                                    \p{InPhoneticExtSup}) (64)
4396          \p{Pi}                  \p{Initial_Punctuation} (=
4397                                    \p{General_Category=
4398                                    Initial_Punctuation}) (12)
4399        X \p{Playing_Cards}       \p{Block=Playing_Cards} (96)
4400          \p{Plrd}                \p{Miao} (= \p{Script_Extensions=Miao})
4401                                    (NOT \p{Block=Miao}) (133)
4402          \p{Po}                  \p{Other_Punctuation} (=
4403                                    \p{General_Category=Other_Punctuation})
4404                                    (566)
4405          \p{PosixAlnum}          (62: [0-9A-Za-z])
4406          \p{PosixAlpha}          (52: [A-Za-z])
4407          \p{PosixBlank}          (2: [\t\x20])
4408          \p{PosixCntrl}          ASCII control characters (33: ACK, BEL,
4409                                    BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
4410                                    DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
4411                                    FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
4412                                    SOH, STX, SUB, SYN, US, VT)
4413          \p{PosixDigit}          (10: [0-9])
4414          \p{PosixGraph}          (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
4415                                    Z\[\\\]\^_`a-z\{\|\}~])
4416          \p{PosixLower}          (/i= PosixAlpha) (26: [a-z])
4417          \p{PosixPrint}          (95: [\x20-\x7e])
4418          \p{PosixPunct}          (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=
4419                                    >?\@\[\\\]\^_`\{\|\}~])
4420          \p{PosixSpace}          (Short: \p{PerlSpace}) (6:
4421                                    [\t\n\cK\f\r\x20])
4422          \p{PosixUpper}          (/i= PosixAlpha) (26: [A-Z])
4423          \p{PosixWord}           \w, restricted to ASCII (Short:
4424                                    \p{PerlWord}) (63: [0-9A-Z_a-z])
4425          \p{PosixXDigit}         \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
4426                                    (22)
4427          \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
4428                                    Y} (Short: \p{PCM}) (10)
4429          \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
4430                                    (1_114_102 plus all above-Unicode code
4431                                    points: U+0000..05FF, U+0606..06DC,
4432                                    U+06DE..070E, U+0710..08E1,
4433                                    U+08E3..110BC, U+110BE..infinity)
4434          \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
4435                                    (10: U+0600..0605, U+06DD, U+070F,
4436                                    U+08E2, U+110BD)
4437        T \p{Present_In: 1.1}     \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
4438                                    extension) (33_979)
4439        T \p{Present_In: 2.0}     Code point's usage introduced in version
4440                                    2.0 or earlier (Short: \p{In=2.0}) (Perl
4441                                    extension) (178_500: U+0000..01F5,
4442                                    U+01FA..0217, U+0250..02A8,
4443                                    U+02B0..02DE, U+02E0..02E9, U+0300..0345
4444                                    ...)
4445          \p{Present_In: V2_0}    \p{Present_In=2.0} (Perl extension)
4446                                    (178_500)
4447        T \p{Present_In: 2.1}     Code point's usage introduced in version
4448                                    2.1 or earlier (Short: \p{In=2.1}) (Perl
4449                                    extension) (178_502: U+0000..01F5,
4450                                    U+01FA..0217, U+0250..02A8,
4451                                    U+02B0..02DE, U+02E0..02E9, U+0300..0345
4452                                    ...)
4453          \p{Present_In: V2_1}    \p{Present_In=2.1} (Perl extension)
4454                                    (178_502)
4455        T \p{Present_In: 3.0}     Code point's usage introduced in version
4456                                    3.0 or earlier (Short: \p{In=3.0}) (Perl
4457                                    extension) (188_809: U+0000..021F,
4458                                    U+0222..0233, U+0250..02AD,
4459                                    U+02B0..02EE, U+0300..034E, U+0360..0362
4460                                    ...)
4461          \p{Present_In: V3_0}    \p{Present_In=3.0} (Perl extension)
4462                                    (188_809)
4463        T \p{Present_In: 3.1}     Code point's usage introduced in version
4464                                    3.1 or earlier (Short: \p{In=3.1}) (Perl
4465                                    extension) (233_787: U+0000..021F,
4466                                    U+0222..0233, U+0250..02AD,
4467                                    U+02B0..02EE, U+0300..034E, U+0360..0362
4468                                    ...)
4469          \p{Present_In: V3_1}    \p{Present_In=3.1} (Perl extension)
4470                                    (233_787)
4471        T \p{Present_In: 3.2}     Code point's usage introduced in version
4472                                    3.2 or earlier (Short: \p{In=3.2}) (Perl
4473                                    extension) (234_803: U+0000..0220,
4474                                    U+0222..0233, U+0250..02AD,
4475                                    U+02B0..02EE, U+0300..034F, U+0360..036F
4476                                    ...)
4477          \p{Present_In: V3_2}    \p{Present_In=3.2} (Perl extension)
4478                                    (234_803)
4479        T \p{Present_In: 4.0}     Code point's usage introduced in version
4480                                    4.0 or earlier (Short: \p{In=4.0}) (Perl
4481                                    extension) (236_029: U+0000..0236,
4482                                    U+0250..0357, U+035D..036F,
4483                                    U+0374..0375, U+037A, U+037E ...)
4484          \p{Present_In: V4_0}    \p{Present_In=4.0} (Perl extension)
4485                                    (236_029)
4486        T \p{Present_In: 4.1}     Code point's usage introduced in version
4487                                    4.1 or earlier (Short: \p{In=4.1}) (Perl
4488                                    extension) (237_302: U+0000..0241,
4489                                    U+0250..036F, U+0374..0375, U+037A,
4490                                    U+037E, U+0384..038A ...)
4491          \p{Present_In: V4_1}    \p{Present_In=4.1} (Perl extension)
4492                                    (237_302)
4493        T \p{Present_In: 5.0}     Code point's usage introduced in version
4494                                    5.0 or earlier (Short: \p{In=5.0}) (Perl
4495                                    extension) (238_671: U+0000..036F,
4496                                    U+0374..0375, U+037A..037E,
4497                                    U+0384..038A, U+038C, U+038E..03A1 ...)
4498          \p{Present_In: V5_0}    \p{Present_In=5.0} (Perl extension)
4499                                    (238_671)
4500        T \p{Present_In: 5.1}     Code point's usage introduced in version
4501                                    5.1 or earlier (Short: \p{In=5.1}) (Perl
4502                                    extension) (240_295: U+0000..0377,
4503                                    U+037A..037E, U+0384..038A, U+038C,
4504                                    U+038E..03A1, U+03A3..0523 ...)
4505          \p{Present_In: V5_1}    \p{Present_In=5.1} (Perl extension)
4506                                    (240_295)
4507        T \p{Present_In: 5.2}     Code point's usage introduced in version
4508                                    5.2 or earlier (Short: \p{In=5.2}) (Perl
4509                                    extension) (246_943: U+0000..0377,
4510                                    U+037A..037E, U+0384..038A, U+038C,
4511                                    U+038E..03A1, U+03A3..0525 ...)
4512          \p{Present_In: V5_2}    \p{Present_In=5.2} (Perl extension)
4513                                    (246_943)
4514        T \p{Present_In: 6.0}     Code point's usage introduced in version
4515                                    6.0 or earlier (Short: \p{In=6.0}) (Perl
4516                                    extension) (249_031: U+0000..0377,
4517                                    U+037A..037E, U+0384..038A, U+038C,
4518                                    U+038E..03A1, U+03A3..0527 ...)
4519          \p{Present_In: V6_0}    \p{Present_In=6.0} (Perl extension)
4520                                    (249_031)
4521        T \p{Present_In: 6.1}     Code point's usage introduced in version
4522                                    6.1 or earlier (Short: \p{In=6.1}) (Perl
4523                                    extension) (249_763: U+0000..0377,
4524                                    U+037A..037E, U+0384..038A, U+038C,
4525                                    U+038E..03A1, U+03A3..0527 ...)
4526          \p{Present_In: V6_1}    \p{Present_In=6.1} (Perl extension)
4527                                    (249_763)
4528        T \p{Present_In: 6.2}     Code point's usage introduced in version
4529                                    6.2 or earlier (Short: \p{In=6.2}) (Perl
4530                                    extension) (249_764: U+0000..0377,
4531                                    U+037A..037E, U+0384..038A, U+038C,
4532                                    U+038E..03A1, U+03A3..0527 ...)
4533          \p{Present_In: V6_2}    \p{Present_In=6.2} (Perl extension)
4534                                    (249_764)
4535        T \p{Present_In: 6.3}     Code point's usage introduced in version
4536                                    6.3 or earlier (Short: \p{In=6.3}) (Perl
4537                                    extension) (249_769: U+0000..0377,
4538                                    U+037A..037E, U+0384..038A, U+038C,
4539                                    U+038E..03A1, U+03A3..0527 ...)
4540          \p{Present_In: V6_3}    \p{Present_In=6.3} (Perl extension)
4541                                    (249_769)
4542        T \p{Present_In: 7.0}     Code point's usage introduced in version
4543                                    7.0 or earlier (Short: \p{In=7.0}) (Perl
4544                                    extension) (252_603: U+0000..0377,
4545                                    U+037A..037F, U+0384..038A, U+038C,
4546                                    U+038E..03A1, U+03A3..052F ...)
4547          \p{Present_In: V7_0}    \p{Present_In=7.0} (Perl extension)
4548                                    (252_603)
4549        T \p{Present_In: 8.0}     Code point's usage introduced in version
4550                                    8.0 or earlier (Short: \p{In=8.0}) (Perl
4551                                    extension) (260_319: U+0000..0377,
4552                                    U+037A..037F, U+0384..038A, U+038C,
4553                                    U+038E..03A1, U+03A3..052F ...)
4554          \p{Present_In: V8_0}    \p{Present_In=8.0} (Perl extension)
4555                                    (260_319)
4556        T \p{Present_In: 9.0}     Code point's usage introduced in version
4557                                    9.0 or earlier (Short: \p{In=9.0}) (Perl
4558                                    extension) (267_819: U+0000..0377,
4559                                    U+037A..037F, U+0384..038A, U+038C,
4560                                    U+038E..03A1, U+03A3..052F ...)
4561          \p{Present_In: V9_0}    \p{Present_In=9.0} (Perl extension)
4562                                    (267_819)
4563        T \p{Present_In: 10.0}    Code point's usage introduced in version
4564                                    10.0 or earlier (Short: \p{In=10.0})
4565                                    (Perl extension) (276_337: U+0000..0377,
4566                                    U+037A..037F, U+0384..038A, U+038C,
4567                                    U+038E..03A1, U+03A3..052F ...)
4568          \p{Present_In: V10_0}   \p{Present_In=10.0} (Perl extension)
4569                                    (276_337)
4570          \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
4571                                    Unassigned}) (Perl extension) (837_775
4572                                    plus all above-Unicode code points)
4573          \p{Print}               \p{XPosixPrint} (274_156)
4574          \p{Private_Use}         \p{General_Category=Private_Use} (Short:
4575                                    \p{Co}; NOT \p{Private_Use_Area})
4576                                    (137_468)
4577        X \p{Private_Use_Area}    \p{Block=Private_Use_Area} (Short:
4578                                    \p{InPUA}) (6400)
4579          \p{Prti}                \p{Inscriptional_Parthian} (=
4580                                    \p{Script_Extensions=
4581                                    Inscriptional_Parthian}) (NOT \p{Block=
4582                                    Inscriptional_Parthian}) (30)
4583          \p{Ps}                  \p{Open_Punctuation} (=
4584                                    \p{General_Category=Open_Punctuation})
4585                                    (75)
4586          \p{Psalter_Pahlavi}     \p{Script_Extensions=Psalter_Pahlavi}
4587                                    (Short: \p{Phlp}; NOT \p{Block=
4588                                    Psalter_Pahlavi}) (30)
4589        X \p{PUA}                 \p{Private_Use_Area} (= \p{Block=
4590                                    Private_Use_Area}) (6400)
4591          \p{Punct}               \p{General_Category=Punctuation} (Short:
4592                                    \p{P}; NOT \p{General_Punctuation}) (770)
4593          \p{Punctuation}         \p{Punct} (= \p{General_Category=
4594                                    Punctuation}) (NOT
4595                                    \p{General_Punctuation}) (770)
4596          \p{Qaac}                \p{Coptic} (= \p{Script_Extensions=
4597                                    Coptic}) (NOT \p{Block=Coptic}) (165)
4598          \p{Qaai}                \p{Inherited} (= \p{Script_Extensions=
4599                                    Inherited}) (500)
4600          \p{QMark}               \p{Quotation_Mark} (= \p{Quotation_Mark=
4601                                    Y}) (30)
4602          \p{QMark: *}            \p{Quotation_Mark: *}
4603          \p{Quotation_Mark}      \p{Quotation_Mark=Y} (Short: \p{QMark})
4604                                    (30)
4605          \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_082
4606                                    plus all above-Unicode code points:
4607                                    [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=
4608                                    >?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
4609                                    \xaa\xac-\xba\xbc-\xff], U+0100..2017,
4610                                    U+2020..2038, U+203B..2E41,
4611                                    U+2E43..300B, U+3010..301C ...)
4612          \p{Quotation_Mark: Y*}  (Short: \p{QMark=Y}, \p{QMark}) (30:
4613                                    [\"\'\xab\xbb], U+2018..201F,
4614                                    U+2039..203A, U+2E42, U+300C..300F,
4615                                    U+301D..301F ...)
4616          \p{Radical}             \p{Radical=Y} (329)
4617          \p{Radical: N*}         (Single: \P{Radical}) (1_113_783 plus all
4618                                    above-Unicode code points: U+0000..2E7F,
4619                                    U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
4620          \p{Radical: Y*}         (Single: \p{Radical}) (329: U+2E80..2E99,
4621                                    U+2E9B..2EF3, U+2F00..2FD5)
4622          \p{Regional_Indicator}  \p{Regional_Indicator=Y} (Short: \p{RI})
4623                                    (26)
4624          \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
4625                                    plus all above-Unicode code points:
4626                                    U+0000..1F1E5, U+1F200..infinity)
4627          \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
4628                                    U+1F1E6..1F1FF)
4629          \p{Rejang}              \p{Script_Extensions=Rejang} (Short:
4630                                    \p{Rjng}; NOT \p{Block=Rejang}) (37)
4631          \p{RI}                  \p{Regional_Indicator} (=
4632                                    \p{Regional_Indicator=Y}) (26)
4633          \p{RI: *}               \p{Regional_Indicator: *}
4634          \p{Rjng}                \p{Rejang} (= \p{Script_Extensions=
4635                                    Rejang}) (NOT \p{Block=Rejang}) (37)
4636        X \p{Rumi}                \p{Rumi_Numeral_Symbols} (= \p{Block=
4637                                    Rumi_Numeral_Symbols}) (32)
4638        X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
4639                                    \p{InRumi}) (32)
4640          \p{Runic}               \p{Script_Extensions=Runic} (Short:
4641                                    \p{Runr}; NOT \p{Block=Runic}) (86)
4642          \p{Runr}                \p{Runic} (= \p{Script_Extensions=Runic})
4643                                    (NOT \p{Block=Runic}) (86)
4644          \p{S} \pS               \p{Symbol} (= \p{General_Category=Symbol})
4645                                    (6978)
4646          \p{Samaritan}           \p{Script_Extensions=Samaritan} (Short:
4647                                    \p{Samr}; NOT \p{Block=Samaritan}) (61)
4648          \p{Samr}                \p{Samaritan} (= \p{Script_Extensions=
4649                                    Samaritan}) (NOT \p{Block=Samaritan})
4650                                    (61)
4651          \p{Sarb}                \p{Old_South_Arabian} (=
4652                                    \p{Script_Extensions=Old_South_Arabian})
4653                                    (32)
4654          \p{Saur}                \p{Saurashtra} (= \p{Script_Extensions=
4655                                    Saurashtra}) (NOT \p{Block=Saurashtra})
4656                                    (82)
4657          \p{Saurashtra}          \p{Script_Extensions=Saurashtra} (Short:
4658                                    \p{Saur}; NOT \p{Block=Saurashtra}) (82)
4659          \p{SB: *}               \p{Sentence_Break: *}
4660          \p{Sc}                  \p{Currency_Symbol} (=
4661                                    \p{General_Category=Currency_Symbol})
4662                                    (54)
4663          \p{Sc: *}               \p{Script: *}
4664          \p{Script: Adlam}       (Short: \p{Sc=Adlm}) (87: U+1E900..1E94A,
4665                                    U+1E950..1E959, U+1E95E..1E95F)
4666          \p{Script: Adlm}        \p{Script=Adlam} (87)
4667          \p{Script: Aghb}        \p{Script=Caucasian_Albanian} (=
4668                                    \p{Script_Extensions=
4669                                    Caucasian_Albanian}) (53)
4670          \p{Script: Ahom}        \p{Script_Extensions=Ahom} (Short: \p{Sc=
4671                                    Ahom}, \p{Ahom}) (57)
4672          \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
4673                                    Anatolian_Hieroglyphs} (Short: \p{Sc=
4674                                    Hluw}, \p{Hluw}) (583)
4675          \p{Script: Arab}        \p{Script=Arabic} (1280)
4676          \p{Script: Arabic}      (Short: \p{Sc=Arab}) (1280: U+0600..0604,
4677                                    U+0606..060B, U+060D..061A, U+061C,
4678                                    U+061E, U+0620..063F ...)
4679          \p{Script: Armenian}    (Short: \p{Sc=Armn}) (93: U+0531..0556,
4680                                    U+0559..055F, U+0561..0587, U+058A,
4681                                    U+058D..058F, U+FB13..FB17)
4682          \p{Script: Armi}        \p{Script=Imperial_Aramaic} (=
4683                                    \p{Script_Extensions=Imperial_Aramaic})
4684                                    (31)
4685          \p{Script: Armn}        \p{Script=Armenian} (93)
4686          \p{Script: Avestan}     \p{Script_Extensions=Avestan} (Short:
4687                                    \p{Sc=Avst}, \p{Avst}) (61)
4688          \p{Script: Avst}        \p{Script=Avestan} (=
4689                                    \p{Script_Extensions=Avestan}) (61)
4690          \p{Script: Bali}        \p{Script=Balinese} (=
4691                                    \p{Script_Extensions=Balinese}) (121)
4692          \p{Script: Balinese}    \p{Script_Extensions=Balinese} (Short:
4693                                    \p{Sc=Bali}, \p{Bali}) (121)
4694          \p{Script: Bamu}        \p{Script=Bamum} (= \p{Script_Extensions=
4695                                    Bamum}) (657)
4696          \p{Script: Bamum}       \p{Script_Extensions=Bamum} (Short: \p{Sc=
4697                                    Bamu}, \p{Bamu}) (657)
4698          \p{Script: Bass}        \p{Script=Bassa_Vah} (=
4699                                    \p{Script_Extensions=Bassa_Vah}) (36)
4700          \p{Script: Bassa_Vah}   \p{Script_Extensions=Bassa_Vah} (Short:
4701                                    \p{Sc=Bass}, \p{Bass}) (36)
4702          \p{Script: Batak}       \p{Script_Extensions=Batak} (Short: \p{Sc=
4703                                    Batk}, \p{Batk}) (56)
4704          \p{Script: Batk}        \p{Script=Batak} (= \p{Script_Extensions=
4705                                    Batak}) (56)
4706          \p{Script: Beng}        \p{Script=Bengali} (95)
4707          \p{Script: Bengali}     (Short: \p{Sc=Beng}) (95: U+0980..0983,
4708                                    U+0985..098C, U+098F..0990,
4709                                    U+0993..09A8, U+09AA..09B0, U+09B2 ...)
4710          \p{Script: Bhaiksuki}   \p{Script_Extensions=Bhaiksuki} (Short:
4711                                    \p{Sc=Bhks}, \p{Bhks}) (97)
4712          \p{Script: Bhks}        \p{Script=Bhaiksuki} (=
4713                                    \p{Script_Extensions=Bhaiksuki}) (97)
4714          \p{Script: Bopo}        \p{Script=Bopomofo} (71)
4715          \p{Script: Bopomofo}    (Short: \p{Sc=Bopo}) (71: U+02EA..02EB,
4716                                    U+3105..312E, U+31A0..31BA)
4717          \p{Script: Brah}        \p{Script=Brahmi} (= \p{Script_Extensions=
4718                                    Brahmi}) (109)
4719          \p{Script: Brahmi}      \p{Script_Extensions=Brahmi} (Short:
4720                                    \p{Sc=Brah}, \p{Brah}) (109)
4721          \p{Script: Brai}        \p{Script=Braille} (=
4722                                    \p{Script_Extensions=Braille}) (256)
4723          \p{Script: Braille}     \p{Script_Extensions=Braille} (Short:
4724                                    \p{Sc=Brai}, \p{Brai}) (256)
4725          \p{Script: Bugi}        \p{Script=Buginese} (30)
4726          \p{Script: Buginese}    (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
4727                                    U+1A1E..1A1F)
4728          \p{Script: Buhd}        \p{Script=Buhid} (20)
4729          \p{Script: Buhid}       (Short: \p{Sc=Buhd}) (20: U+1740..1753)
4730          \p{Script: Cakm}        \p{Script=Chakma} (67)
4731          \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
4732                                    Canadian_Aboriginal} (Short: \p{Sc=
4733                                    Cans}, \p{Cans}) (710)
4734          \p{Script: Cans}        \p{Script=Canadian_Aboriginal} (=
4735                                    \p{Script_Extensions=
4736                                    Canadian_Aboriginal}) (710)
4737          \p{Script: Cari}        \p{Script=Carian} (= \p{Script_Extensions=
4738                                    Carian}) (49)
4739          \p{Script: Carian}      \p{Script_Extensions=Carian} (Short:
4740                                    \p{Sc=Cari}, \p{Cari}) (49)
4741          \p{Script: Caucasian_Albanian} \p{Script_Extensions=
4742                                    Caucasian_Albanian} (Short: \p{Sc=Aghb},
4743                                    \p{Aghb}) (53)
4744          \p{Script: Chakma}      (Short: \p{Sc=Cakm}) (67: U+11100..11134,
4745                                    U+11136..11143)
4746          \p{Script: Cham}        \p{Script_Extensions=Cham} (Short: \p{Sc=
4747                                    Cham}, \p{Cham}) (83)
4748          \p{Script: Cher}        \p{Script=Cherokee} (=
4749                                    \p{Script_Extensions=Cherokee}) (172)
4750          \p{Script: Cherokee}    \p{Script_Extensions=Cherokee} (Short:
4751                                    \p{Sc=Cher}, \p{Cher}) (172)
4752          \p{Script: Common}      (Short: \p{Sc=Zyyy}) (7363: [\x00-
4753                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
4754                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
4755                                    \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
4756                                    U+02E5..02E9, U+02EC..02FF, U+0374,
4757                                    U+037E ...)
4758          \p{Script: Copt}        \p{Script=Coptic} (137)
4759          \p{Script: Coptic}      (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
4760                                    U+2C80..2CF3, U+2CF9..2CFF)
4761          \p{Script: Cprt}        \p{Script=Cypriot} (55)
4762          \p{Script: Cuneiform}   \p{Script_Extensions=Cuneiform} (Short:
4763                                    \p{Sc=Xsux}, \p{Xsux}) (1234)
4764          \p{Script: Cypriot}     (Short: \p{Sc=Cprt}) (55: U+10800..10805,
4765                                    U+10808, U+1080A..10835, U+10837..10838,
4766                                    U+1083C, U+1083F)
4767          \p{Script: Cyrillic}    (Short: \p{Sc=Cyrl}) (443: U+0400..0484,
4768                                    U+0487..052F, U+1C80..1C88, U+1D2B,
4769                                    U+1D78, U+2DE0..2DFF ...)
4770          \p{Script: Cyrl}        \p{Script=Cyrillic} (443)
4771          \p{Script: Deseret}     \p{Script_Extensions=Deseret} (Short:
4772                                    \p{Sc=Dsrt}, \p{Dsrt}) (80)
4773          \p{Script: Deva}        \p{Script=Devanagari} (154)
4774          \p{Script: Devanagari}  (Short: \p{Sc=Deva}) (154: U+0900..0950,
4775                                    U+0953..0963, U+0966..097F, U+A8E0..A8FD)
4776          \p{Script: Dsrt}        \p{Script=Deseret} (=
4777                                    \p{Script_Extensions=Deseret}) (80)
4778          \p{Script: Dupl}        \p{Script=Duployan} (143)
4779          \p{Script: Duployan}    (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
4780                                    U+1BC70..1BC7C, U+1BC80..1BC88,
4781                                    U+1BC90..1BC99, U+1BC9C..1BC9F)
4782          \p{Script: Egyp}        \p{Script=Egyptian_Hieroglyphs} (=
4783                                    \p{Script_Extensions=
4784                                    Egyptian_Hieroglyphs}) (1071)
4785          \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
4786                                    Egyptian_Hieroglyphs} (Short: \p{Sc=
4787                                    Egyp}, \p{Egyp}) (1071)
4788          \p{Script: Elba}        \p{Script=Elbasan} (=
4789                                    \p{Script_Extensions=Elbasan}) (40)
4790          \p{Script: Elbasan}     \p{Script_Extensions=Elbasan} (Short:
4791                                    \p{Sc=Elba}, \p{Elba}) (40)
4792          \p{Script: Ethi}        \p{Script=Ethiopic} (=
4793                                    \p{Script_Extensions=Ethiopic}) (495)
4794          \p{Script: Ethiopic}    \p{Script_Extensions=Ethiopic} (Short:
4795                                    \p{Sc=Ethi}, \p{Ethi}) (495)
4796          \p{Script: Geor}        \p{Script=Georgian} (127)
4797          \p{Script: Georgian}    (Short: \p{Sc=Geor}) (127: U+10A0..10C5,
4798                                    U+10C7, U+10CD, U+10D0..10FA,
4799                                    U+10FC..10FF, U+2D00..2D25 ...)
4800          \p{Script: Glag}        \p{Script=Glagolitic} (132)
4801          \p{Script: Glagolitic}  (Short: \p{Sc=Glag}) (132: U+2C00..2C2E,
4802                                    U+2C30..2C5E, U+1E000..1E006,
4803                                    U+1E008..1E018, U+1E01B..1E021,
4804                                    U+1E023..1E024 ...)
4805          \p{Script: Gonm}        \p{Script=Masaram_Gondi} (=
4806                                    \p{Script_Extensions=Masaram_Gondi}) (75)
4807          \p{Script: Goth}        \p{Script=Gothic} (= \p{Script_Extensions=
4808                                    Gothic}) (27)
4809          \p{Script: Gothic}      \p{Script_Extensions=Gothic} (Short:
4810                                    \p{Sc=Goth}, \p{Goth}) (27)
4811          \p{Script: Gran}        \p{Script=Grantha} (85)
4812          \p{Script: Grantha}     (Short: \p{Sc=Gran}) (85: U+11300..11303,
4813                                    U+11305..1130C, U+1130F..11310,
4814                                    U+11313..11328, U+1132A..11330,
4815                                    U+11332..11333 ...)
4816          \p{Script: Greek}       (Short: \p{Sc=Grek}) (518: U+0370..0373,
4817                                    U+0375..0377, U+037A..037D, U+037F,
4818                                    U+0384, U+0386 ...)
4819          \p{Script: Grek}        \p{Script=Greek} (518)
4820          \p{Script: Gujarati}    (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
4821                                    U+0A85..0A8D, U+0A8F..0A91,
4822                                    U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
4823                                    ...)
4824          \p{Script: Gujr}        \p{Script=Gujarati} (91)
4825          \p{Script: Gurmukhi}    (Short: \p{Sc=Guru}) (79: U+0A01..0A03,
4826                                    U+0A05..0A0A, U+0A0F..0A10,
4827                                    U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
4828                                    ...)
4829          \p{Script: Guru}        \p{Script=Gurmukhi} (79)
4830          \p{Script: Han}         (Short: \p{Sc=Han}) (89_228: U+2E80..2E99,
4831                                    U+2E9B..2EF3, U+2F00..2FD5, U+3005,
4832                                    U+3007, U+3021..3029 ...)
4833          \p{Script: Hang}        \p{Script=Hangul} (11_739)
4834          \p{Script: Hangul}      (Short: \p{Sc=Hang}) (11_739:
4835                                    U+1100..11FF, U+302E..302F,
4836                                    U+3131..318E, U+3200..321E,
4837                                    U+3260..327E, U+A960..A97C ...)
4838          \p{Script: Hani}        \p{Script=Han} (89_228)
4839          \p{Script: Hano}        \p{Script=Hanunoo} (21)
4840          \p{Script: Hanunoo}     (Short: \p{Sc=Hano}) (21: U+1720..1734)
4841          \p{Script: Hatr}        \p{Script=Hatran} (= \p{Script_Extensions=
4842                                    Hatran}) (26)
4843          \p{Script: Hatran}      \p{Script_Extensions=Hatran} (Short:
4844                                    \p{Sc=Hatr}, \p{Hatr}) (26)
4845          \p{Script: Hebr}        \p{Script=Hebrew} (= \p{Script_Extensions=
4846                                    Hebrew}) (133)
4847          \p{Script: Hebrew}      \p{Script_Extensions=Hebrew} (Short:
4848                                    \p{Sc=Hebr}, \p{Hebr}) (133)
4849          \p{Script: Hira}        \p{Script=Hiragana} (376)
4850          \p{Script: Hiragana}    (Short: \p{Sc=Hira}) (376: U+3041..3096,
4851                                    U+309D..309F, U+1B001..1B11E, U+1F200)
4852          \p{Script: Hluw}        \p{Script=Anatolian_Hieroglyphs} (=
4853                                    \p{Script_Extensions=
4854                                    Anatolian_Hieroglyphs}) (583)
4855          \p{Script: Hmng}        \p{Script=Pahawh_Hmong} (=
4856                                    \p{Script_Extensions=Pahawh_Hmong}) (127)
4857          \p{Script: Hung}        \p{Script=Old_Hungarian} (=
4858                                    \p{Script_Extensions=Old_Hungarian})
4859                                    (108)
4860          \p{Script: Imperial_Aramaic} \p{Script_Extensions=
4861                                    Imperial_Aramaic} (Short: \p{Sc=Armi},
4862                                    \p{Armi}) (31)
4863          \p{Script: Inherited}   (Short: \p{Sc=Zinh}) (568: U+0300..036F,
4864                                    U+0485..0486, U+064B..0655, U+0670,
4865                                    U+0951..0952, U+1AB0..1ABE ...)
4866          \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
4867                                    Inscriptional_Pahlavi} (Short: \p{Sc=
4868                                    Phli}, \p{Phli}) (27)
4869          \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
4870                                    Inscriptional_Parthian} (Short: \p{Sc=
4871                                    Prti}, \p{Prti}) (30)
4872          \p{Script: Ital}        \p{Script=Old_Italic} (=
4873                                    \p{Script_Extensions=Old_Italic}) (39)
4874          \p{Script: Java}        \p{Script=Javanese} (90)
4875          \p{Script: Javanese}    (Short: \p{Sc=Java}) (90: U+A980..A9CD,
4876                                    U+A9D0..A9D9, U+A9DE..A9DF)
4877          \p{Script: Kaithi}      (Short: \p{Sc=Kthi}) (66: U+11080..110C1)
4878          \p{Script: Kali}        \p{Script=Kayah_Li} (47)
4879          \p{Script: Kana}        \p{Script=Katakana} (300)
4880          \p{Script: Kannada}     (Short: \p{Sc=Knda}) (88: U+0C80..0C83,
4881                                    U+0C85..0C8C, U+0C8E..0C90,
4882                                    U+0C92..0CA8, U+0CAA..0CB3, U+0CB5..0CB9
4883                                    ...)
4884          \p{Script: Katakana}    (Short: \p{Sc=Kana}) (300: U+30A1..30FA,
4885                                    U+30FD..30FF, U+31F0..31FF,
4886                                    U+32D0..32FE, U+3300..3357, U+FF66..FF6F
4887                                    ...)
4888          \p{Script: Kayah_Li}    (Short: \p{Sc=Kali}) (47: U+A900..A92D,
4889                                    U+A92F)
4890          \p{Script: Khar}        \p{Script=Kharoshthi} (=
4891                                    \p{Script_Extensions=Kharoshthi}) (65)
4892          \p{Script: Kharoshthi}  \p{Script_Extensions=Kharoshthi} (Short:
4893                                    \p{Sc=Khar}, \p{Khar}) (65)
4894          \p{Script: Khmer}       \p{Script_Extensions=Khmer} (Short: \p{Sc=
4895                                    Khmr}, \p{Khmr}) (146)
4896          \p{Script: Khmr}        \p{Script=Khmer} (= \p{Script_Extensions=
4897                                    Khmer}) (146)
4898          \p{Script: Khoj}        \p{Script=Khojki} (62)
4899          \p{Script: Khojki}      (Short: \p{Sc=Khoj}) (62: U+11200..11211,
4900                                    U+11213..1123E)
4901          \p{Script: Khudawadi}   (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
4902                                    U+112F0..112F9)
4903          \p{Script: Knda}        \p{Script=Kannada} (88)
4904          \p{Script: Kthi}        \p{Script=Kaithi} (66)
4905          \p{Script: Lana}        \p{Script=Tai_Tham} (=
4906                                    \p{Script_Extensions=Tai_Tham}) (127)
4907          \p{Script: Lao}         \p{Script_Extensions=Lao} (Short: \p{Sc=
4908                                    Lao}, \p{Lao}) (67)
4909          \p{Script: Laoo}        \p{Script=Lao} (= \p{Script_Extensions=
4910                                    Lao}) (67)
4911          \p{Script: Latin}       (Short: \p{Sc=Latn}) (1350: [A-Za-
4912                                    z\xaa\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
4913                                    U+0100..02B8, U+02E0..02E4,
4914                                    U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
4915                                    ...)
4916          \p{Script: Latn}        \p{Script=Latin} (1350)
4917          \p{Script: Lepc}        \p{Script=Lepcha} (= \p{Script_Extensions=
4918                                    Lepcha}) (74)
4919          \p{Script: Lepcha}      \p{Script_Extensions=Lepcha} (Short:
4920                                    \p{Sc=Lepc}, \p{Lepc}) (74)
4921          \p{Script: Limb}        \p{Script=Limbu} (68)
4922          \p{Script: Limbu}       (Short: \p{Sc=Limb}) (68: U+1900..191E,
4923                                    U+1920..192B, U+1930..193B, U+1940,
4924                                    U+1944..194F)
4925          \p{Script: Lina}        \p{Script=Linear_A} (341)
4926          \p{Script: Linb}        \p{Script=Linear_B} (211)
4927          \p{Script: Linear_A}    (Short: \p{Sc=Lina}) (341: U+10600..10736,
4928                                    U+10740..10755, U+10760..10767)
4929          \p{Script: Linear_B}    (Short: \p{Sc=Linb}) (211: U+10000..1000B,
4930                                    U+1000D..10026, U+10028..1003A,
4931                                    U+1003C..1003D, U+1003F..1004D,
4932                                    U+10050..1005D ...)
4933          \p{Script: Lisu}        \p{Script_Extensions=Lisu} (Short: \p{Sc=
4934                                    Lisu}, \p{Lisu}) (48)
4935          \p{Script: Lyci}        \p{Script=Lycian} (= \p{Script_Extensions=
4936                                    Lycian}) (29)
4937          \p{Script: Lycian}      \p{Script_Extensions=Lycian} (Short:
4938                                    \p{Sc=Lyci}, \p{Lyci}) (29)
4939          \p{Script: Lydi}        \p{Script=Lydian} (= \p{Script_Extensions=
4940                                    Lydian}) (27)
4941          \p{Script: Lydian}      \p{Script_Extensions=Lydian} (Short:
4942                                    \p{Sc=Lydi}, \p{Lydi}) (27)
4943          \p{Script: Mahajani}    (Short: \p{Sc=Mahj}) (39: U+11150..11176)
4944          \p{Script: Mahj}        \p{Script=Mahajani} (39)
4945          \p{Script: Malayalam}   (Short: \p{Sc=Mlym}) (117: U+0D00..0D03,
4946                                    U+0D05..0D0C, U+0D0E..0D10,
4947                                    U+0D12..0D44, U+0D46..0D48, U+0D4A..0D4F
4948                                    ...)
4949          \p{Script: Mand}        \p{Script=Mandaic} (29)
4950          \p{Script: Mandaic}     (Short: \p{Sc=Mand}) (29: U+0840..085B,
4951                                    U+085E)
4952          \p{Script: Mani}        \p{Script=Manichaean} (51)
4953          \p{Script: Manichaean}  (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
4954                                    U+10AEB..10AF6)
4955          \p{Script: Marc}        \p{Script=Marchen} (=
4956                                    \p{Script_Extensions=Marchen}) (68)
4957          \p{Script: Marchen}     \p{Script_Extensions=Marchen} (Short:
4958                                    \p{Sc=Marc}, \p{Marc}) (68)
4959          \p{Script: Masaram_Gondi} \p{Script_Extensions=Masaram_Gondi}
4960                                    (Short: \p{Sc=Gonm}, \p{Gonm}) (75)
4961          \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
4962                                    (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
4963          \p{Script: Mend}        \p{Script=Mende_Kikakui} (=
4964                                    \p{Script_Extensions=Mende_Kikakui})
4965                                    (213)
4966          \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
4967                                    (Short: \p{Sc=Mend}, \p{Mend}) (213)
4968          \p{Script: Merc}        \p{Script=Meroitic_Cursive} (=
4969                                    \p{Script_Extensions=Meroitic_Cursive})
4970                                    (90)
4971          \p{Script: Mero}        \p{Script=Meroitic_Hieroglyphs} (=
4972                                    \p{Script_Extensions=
4973                                    Meroitic_Hieroglyphs}) (32)
4974          \p{Script: Meroitic_Cursive} \p{Script_Extensions=
4975                                    Meroitic_Cursive} (Short: \p{Sc=Merc},
4976                                    \p{Merc}) (90)
4977          \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
4978                                    Meroitic_Hieroglyphs} (Short: \p{Sc=
4979                                    Mero}, \p{Mero}) (32)
4980          \p{Script: Miao}        \p{Script_Extensions=Miao} (Short: \p{Sc=
4981                                    Miao}, \p{Miao}) (133)
4982          \p{Script: Mlym}        \p{Script=Malayalam} (117)
4983          \p{Script: Modi}        (Short: \p{Sc=Modi}) (79: U+11600..11644,
4984                                    U+11650..11659)
4985          \p{Script: Mong}        \p{Script=Mongolian} (166)
4986          \p{Script: Mongolian}   (Short: \p{Sc=Mong}) (166: U+1800..1801,
4987                                    U+1804, U+1806..180E, U+1810..1819,
4988                                    U+1820..1877, U+1880..18AA ...)
4989          \p{Script: Mro}         \p{Script_Extensions=Mro} (Short: \p{Sc=
4990                                    Mro}, \p{Mro}) (43)
4991          \p{Script: Mroo}        \p{Script=Mro} (= \p{Script_Extensions=
4992                                    Mro}) (43)
4993          \p{Script: Mtei}        \p{Script=Meetei_Mayek} (=
4994                                    \p{Script_Extensions=Meetei_Mayek}) (79)
4995          \p{Script: Mult}        \p{Script=Multani} (38)
4996          \p{Script: Multani}     (Short: \p{Sc=Mult}) (38: U+11280..11286,
4997                                    U+11288, U+1128A..1128D, U+1128F..1129D,
4998                                    U+1129F..112A9)
4999          \p{Script: Myanmar}     (Short: \p{Sc=Mymr}) (223: U+1000..109F,
5000                                    U+A9E0..A9FE, U+AA60..AA7F)
5001          \p{Script: Mymr}        \p{Script=Myanmar} (223)
5002          \p{Script: Nabataean}   \p{Script_Extensions=Nabataean} (Short:
5003                                    \p{Sc=Nbat}, \p{Nbat}) (40)
5004          \p{Script: Narb}        \p{Script=Old_North_Arabian} (=
5005                                    \p{Script_Extensions=Old_North_Arabian})
5006                                    (32)
5007          \p{Script: Nbat}        \p{Script=Nabataean} (=
5008                                    \p{Script_Extensions=Nabataean}) (40)
5009          \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
5010                                    \p{Sc=Talu}, \p{Talu}) (83)
5011          \p{Script: Newa}        \p{Script_Extensions=Newa} (Short: \p{Sc=
5012                                    Newa}, \p{Newa}) (92)
5013          \p{Script: Nko}         \p{Script_Extensions=Nko} (Short: \p{Sc=
5014                                    Nko}, \p{Nko}) (59)
5015          \p{Script: Nkoo}        \p{Script=Nko} (= \p{Script_Extensions=
5016                                    Nko}) (59)
5017          \p{Script: Nshu}        \p{Script=Nushu} (= \p{Script_Extensions=
5018                                    Nushu}) (397)
5019          \p{Script: Nushu}       \p{Script_Extensions=Nushu} (Short: \p{Sc=
5020                                    Nshu}, \p{Nshu}) (397)
5021          \p{Script: Ogam}        \p{Script=Ogham} (= \p{Script_Extensions=
5022                                    Ogham}) (29)
5023          \p{Script: Ogham}       \p{Script_Extensions=Ogham} (Short: \p{Sc=
5024                                    Ogam}, \p{Ogam}) (29)
5025          \p{Script: Ol_Chiki}    \p{Script_Extensions=Ol_Chiki} (Short:
5026                                    \p{Sc=Olck}, \p{Olck}) (48)
5027          \p{Script: Olck}        \p{Script=Ol_Chiki} (=
5028                                    \p{Script_Extensions=Ol_Chiki}) (48)
5029          \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
5030                                    (Short: \p{Sc=Hung}, \p{Hung}) (108)
5031          \p{Script: Old_Italic}  \p{Script_Extensions=Old_Italic} (Short:
5032                                    \p{Sc=Ital}, \p{Ital}) (39)
5033          \p{Script: Old_North_Arabian} \p{Script_Extensions=
5034                                    Old_North_Arabian} (Short: \p{Sc=Narb},
5035                                    \p{Narb}) (32)
5036          \p{Script: Old_Permic}  (Short: \p{Sc=Perm}) (43: U+10350..1037A)
5037          \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
5038                                    \p{Sc=Xpeo}, \p{Xpeo}) (50)
5039          \p{Script: Old_South_Arabian} \p{Script_Extensions=
5040                                    Old_South_Arabian} (Short: \p{Sc=Sarb},
5041                                    \p{Sarb}) (32)
5042          \p{Script: Old_Turkic}  \p{Script_Extensions=Old_Turkic} (Short:
5043                                    \p{Sc=Orkh}, \p{Orkh}) (73)
5044          \p{Script: Oriya}       (Short: \p{Sc=Orya}) (90: U+0B01..0B03,
5045                                    U+0B05..0B0C, U+0B0F..0B10,
5046                                    U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
5047                                    ...)
5048          \p{Script: Orkh}        \p{Script=Old_Turkic} (=
5049                                    \p{Script_Extensions=Old_Turkic}) (73)
5050          \p{Script: Orya}        \p{Script=Oriya} (90)
5051          \p{Script: Osage}       \p{Script_Extensions=Osage} (Short: \p{Sc=
5052                                    Osge}, \p{Osge}) (72)
5053          \p{Script: Osge}        \p{Script=Osage} (= \p{Script_Extensions=
5054                                    Osage}) (72)
5055          \p{Script: Osma}        \p{Script=Osmanya} (=
5056                                    \p{Script_Extensions=Osmanya}) (40)
5057          \p{Script: Osmanya}     \p{Script_Extensions=Osmanya} (Short:
5058                                    \p{Sc=Osma}, \p{Osma}) (40)
5059          \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
5060                                    (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
5061          \p{Script: Palm}        \p{Script=Palmyrene} (=
5062                                    \p{Script_Extensions=Palmyrene}) (32)
5063          \p{Script: Palmyrene}   \p{Script_Extensions=Palmyrene} (Short:
5064                                    \p{Sc=Palm}, \p{Palm}) (32)
5065          \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
5066                                    \p{Sc=Pauc}, \p{Pauc}) (57)
5067          \p{Script: Pauc}        \p{Script=Pau_Cin_Hau} (=
5068                                    \p{Script_Extensions=Pau_Cin_Hau}) (57)
5069          \p{Script: Perm}        \p{Script=Old_Permic} (43)
5070          \p{Script: Phag}        \p{Script=Phags_Pa} (56)
5071          \p{Script: Phags_Pa}    (Short: \p{Sc=Phag}) (56: U+A840..A877)
5072          \p{Script: Phli}        \p{Script=Inscriptional_Pahlavi} (=
5073                                    \p{Script_Extensions=
5074                                    Inscriptional_Pahlavi}) (27)
5075          \p{Script: Phlp}        \p{Script=Psalter_Pahlavi} (29)
5076          \p{Script: Phnx}        \p{Script=Phoenician} (=
5077                                    \p{Script_Extensions=Phoenician}) (29)
5078          \p{Script: Phoenician}  \p{Script_Extensions=Phoenician} (Short:
5079                                    \p{Sc=Phnx}, \p{Phnx}) (29)
5080          \p{Script: Plrd}        \p{Script=Miao} (= \p{Script_Extensions=
5081                                    Miao}) (133)
5082          \p{Script: Prti}        \p{Script=Inscriptional_Parthian} (=
5083                                    \p{Script_Extensions=
5084                                    Inscriptional_Parthian}) (30)
5085          \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
5086                                    U+10B80..10B91, U+10B99..10B9C,
5087                                    U+10BA9..10BAF)
5088          \p{Script: Qaac}        \p{Script=Coptic} (137)
5089          \p{Script: Qaai}        \p{Script=Inherited} (568)
5090          \p{Script: Rejang}      \p{Script_Extensions=Rejang} (Short:
5091                                    \p{Sc=Rjng}, \p{Rjng}) (37)
5092          \p{Script: Rjng}        \p{Script=Rejang} (= \p{Script_Extensions=
5093                                    Rejang}) (37)
5094          \p{Script: Runic}       \p{Script_Extensions=Runic} (Short: \p{Sc=
5095                                    Runr}, \p{Runr}) (86)
5096          \p{Script: Runr}        \p{Script=Runic} (= \p{Script_Extensions=
5097                                    Runic}) (86)
5098          \p{Script: Samaritan}   \p{Script_Extensions=Samaritan} (Short:
5099                                    \p{Sc=Samr}, \p{Samr}) (61)
5100          \p{Script: Samr}        \p{Script=Samaritan} (=
5101                                    \p{Script_Extensions=Samaritan}) (61)
5102          \p{Script: Sarb}        \p{Script=Old_South_Arabian} (=
5103                                    \p{Script_Extensions=Old_South_Arabian})
5104                                    (32)
5105          \p{Script: Saur}        \p{Script=Saurashtra} (=
5106                                    \p{Script_Extensions=Saurashtra}) (82)
5107          \p{Script: Saurashtra}  \p{Script_Extensions=Saurashtra} (Short:
5108                                    \p{Sc=Saur}, \p{Saur}) (82)
5109          \p{Script: Sgnw}        \p{Script=SignWriting} (=
5110                                    \p{Script_Extensions=SignWriting}) (672)
5111          \p{Script: Sharada}     (Short: \p{Sc=Shrd}) (94: U+11180..111CD,
5112                                    U+111D0..111DF)
5113          \p{Script: Shavian}     \p{Script_Extensions=Shavian} (Short:
5114                                    \p{Sc=Shaw}, \p{Shaw}) (48)
5115          \p{Script: Shaw}        \p{Script=Shavian} (=
5116                                    \p{Script_Extensions=Shavian}) (48)
5117          \p{Script: Shrd}        \p{Script=Sharada} (94)
5118          \p{Script: Sidd}        \p{Script=Siddham} (=
5119                                    \p{Script_Extensions=Siddham}) (92)
5120          \p{Script: Siddham}     \p{Script_Extensions=Siddham} (Short:
5121                                    \p{Sc=Sidd}, \p{Sidd}) (92)
5122          \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
5123                                    \p{Sc=Sgnw}, \p{Sgnw}) (672)
5124          \p{Script: Sind}        \p{Script=Khudawadi} (69)
5125          \p{Script: Sinh}        \p{Script=Sinhala} (110)
5126          \p{Script: Sinhala}     (Short: \p{Sc=Sinh}) (110: U+0D82..0D83,
5127                                    U+0D85..0D96, U+0D9A..0DB1,
5128                                    U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
5129          \p{Script: Sora}        \p{Script=Sora_Sompeng} (=
5130                                    \p{Script_Extensions=Sora_Sompeng}) (35)
5131          \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
5132                                    (Short: \p{Sc=Sora}, \p{Sora}) (35)
5133          \p{Script: Soyo}        \p{Script=Soyombo} (=
5134                                    \p{Script_Extensions=Soyombo}) (80)
5135          \p{Script: Soyombo}     \p{Script_Extensions=Soyombo} (Short:
5136                                    \p{Sc=Soyo}, \p{Soyo}) (80)
5137          \p{Script: Sund}        \p{Script=Sundanese} (=
5138                                    \p{Script_Extensions=Sundanese}) (72)
5139          \p{Script: Sundanese}   \p{Script_Extensions=Sundanese} (Short:
5140                                    \p{Sc=Sund}, \p{Sund}) (72)
5141          \p{Script: Sylo}        \p{Script=Syloti_Nagri} (44)
5142          \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (44: U+A800..A82B)
5143          \p{Script: Syrc}        \p{Script=Syriac} (88)
5144          \p{Script: Syriac}      (Short: \p{Sc=Syrc}) (88: U+0700..070D,
5145                                    U+070F..074A, U+074D..074F, U+0860..086A)
5146          \p{Script: Tagalog}     (Short: \p{Sc=Tglg}) (20: U+1700..170C,
5147                                    U+170E..1714)
5148          \p{Script: Tagb}        \p{Script=Tagbanwa} (18)
5149          \p{Script: Tagbanwa}    (Short: \p{Sc=Tagb}) (18: U+1760..176C,
5150                                    U+176E..1770, U+1772..1773)
5151          \p{Script: Tai_Le}      (Short: \p{Sc=Tale}) (35: U+1950..196D,
5152                                    U+1970..1974)
5153          \p{Script: Tai_Tham}    \p{Script_Extensions=Tai_Tham} (Short:
5154                                    \p{Sc=Lana}, \p{Lana}) (127)
5155          \p{Script: Tai_Viet}    \p{Script_Extensions=Tai_Viet} (Short:
5156                                    \p{Sc=Tavt}, \p{Tavt}) (72)
5157          \p{Script: Takr}        \p{Script=Takri} (66)
5158          \p{Script: Takri}       (Short: \p{Sc=Takr}) (66: U+11680..116B7,
5159                                    U+116C0..116C9)
5160          \p{Script: Tale}        \p{Script=Tai_Le} (35)
5161          \p{Script: Talu}        \p{Script=New_Tai_Lue} (=
5162                                    \p{Script_Extensions=New_Tai_Lue}) (83)
5163          \p{Script: Tamil}       (Short: \p{Sc=Taml}) (72: U+0B82..0B83,
5164                                    U+0B85..0B8A, U+0B8E..0B90,
5165                                    U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
5166          \p{Script: Taml}        \p{Script=Tamil} (72)
5167          \p{Script: Tang}        \p{Script=Tangut} (= \p{Script_Extensions=
5168                                    Tangut}) (6881)
5169          \p{Script: Tangut}      \p{Script_Extensions=Tangut} (Short:
5170                                    \p{Sc=Tang}, \p{Tang}) (6881)
5171          \p{Script: Tavt}        \p{Script=Tai_Viet} (=
5172                                    \p{Script_Extensions=Tai_Viet}) (72)
5173          \p{Script: Telu}        \p{Script=Telugu} (96)
5174          \p{Script: Telugu}      (Short: \p{Sc=Telu}) (96: U+0C00..0C03,
5175                                    U+0C05..0C0C, U+0C0E..0C10,
5176                                    U+0C12..0C28, U+0C2A..0C39, U+0C3D..0C44
5177                                    ...)
5178          \p{Script: Tfng}        \p{Script=Tifinagh} (=
5179                                    \p{Script_Extensions=Tifinagh}) (59)
5180          \p{Script: Tglg}        \p{Script=Tagalog} (20)
5181          \p{Script: Thaa}        \p{Script=Thaana} (50)
5182          \p{Script: Thaana}      (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
5183          \p{Script: Thai}        \p{Script_Extensions=Thai} (Short: \p{Sc=
5184                                    Thai}, \p{Thai}) (86)
5185          \p{Script: Tibetan}     \p{Script_Extensions=Tibetan} (Short:
5186                                    \p{Sc=Tibt}, \p{Tibt}) (207)
5187          \p{Script: Tibt}        \p{Script=Tibetan} (=
5188                                    \p{Script_Extensions=Tibetan}) (207)
5189          \p{Script: Tifinagh}    \p{Script_Extensions=Tifinagh} (Short:
5190                                    \p{Sc=Tfng}, \p{Tfng}) (59)
5191          \p{Script: Tirh}        \p{Script=Tirhuta} (82)
5192          \p{Script: Tirhuta}     (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
5193                                    U+114D0..114D9)
5194          \p{Script: Ugar}        \p{Script=Ugaritic} (=
5195                                    \p{Script_Extensions=Ugaritic}) (31)
5196          \p{Script: Ugaritic}    \p{Script_Extensions=Ugaritic} (Short:
5197                                    \p{Sc=Ugar}, \p{Ugar}) (31)
5198          \p{Script: Unknown}     \p{Script_Extensions=Unknown} (Short:
5199                                    \p{Sc=Zzzz}, \p{Zzzz}) (977_357 plus all
5200                                    above-Unicode code points)
5201          \p{Script: Vai}         \p{Script_Extensions=Vai} (Short: \p{Sc=
5202                                    Vai}, \p{Vai}) (300)
5203          \p{Script: Vaii}        \p{Script=Vai} (= \p{Script_Extensions=
5204                                    Vai}) (300)
5205          \p{Script: Wara}        \p{Script=Warang_Citi} (=
5206                                    \p{Script_Extensions=Warang_Citi}) (84)
5207          \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
5208                                    \p{Sc=Wara}, \p{Wara}) (84)
5209          \p{Script: Xpeo}        \p{Script=Old_Persian} (=
5210                                    \p{Script_Extensions=Old_Persian}) (50)
5211          \p{Script: Xsux}        \p{Script=Cuneiform} (=
5212                                    \p{Script_Extensions=Cuneiform}) (1234)
5213          \p{Script: Yi}          (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
5214                                    U+A490..A4C6)
5215          \p{Script: Yiii}        \p{Script=Yi} (1220)
5216          \p{Script: Zanabazar_Square} \p{Script_Extensions=
5217                                    Zanabazar_Square} (Short: \p{Sc=Zanb},
5218                                    \p{Zanb}) (72)
5219          \p{Script: Zanb}        \p{Script=Zanabazar_Square} (=
5220                                    \p{Script_Extensions=Zanabazar_Square})
5221                                    (72)
5222          \p{Script: Zinh}        \p{Script=Inherited} (568)
5223          \p{Script: Zyyy}        \p{Script=Common} (7363)
5224          \p{Script: Zzzz}        \p{Script=Unknown} (=
5225                                    \p{Script_Extensions=Unknown}) (977_357
5226                                    plus all above-Unicode code points)
5227          \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (88:
5228                                    U+0640, U+1E900..1E94A, U+1E950..1E959,
5229                                    U+1E95E..1E95F)
5230          \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (88)
5231          \p{Script_Extensions: Aghb} \p{Script_Extensions=
5232                                    Caucasian_Albanian} (53)
5233          \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (57:
5234                                    U+11700..11719, U+1171D..1172B,
5235                                    U+11730..1173F)
5236          \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
5237                                    \p{Hluw}) (583: U+14400..14646)
5238          \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1324)
5239          \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
5240                                    (1324: U+0600..0604, U+0606..061C,
5241                                    U+061E..06DC, U+06DE..06FF,
5242                                    U+0750..077F, U+08A0..08B4 ...)
5243          \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
5244                                    (94: U+0531..0556, U+0559..055F,
5245                                    U+0561..0587, U+0589..058A,
5246                                    U+058D..058F, U+FB13..FB17)
5247          \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
5248                                    (31)
5249          \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (94)
5250          \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
5251                                    (61: U+10B00..10B35, U+10B39..10B3F)
5252          \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
5253          \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
5254          \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
5255                                    (121: U+1B00..1B4B, U+1B50..1B7C)
5256          \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
5257          \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
5258                                    U+A6A0..A6F7, U+16800..16A38)
5259          \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
5260          \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
5261                                    (36: U+16AD0..16AED, U+16AF0..16AF5)
5262          \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
5263                                    U+1BC0..1BF3, U+1BFC..1BFF)
5264          \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
5265          \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (101)
5266          \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
5267                                    (101: U+0951..0952, U+0964..0965,
5268                                    U+0980..0983, U+0985..098C,
5269                                    U+098F..0990, U+0993..09A8 ...)
5270          \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
5271                                    (97: U+11C00..11C08, U+11C0A..11C36,
5272                                    U+11C38..11C45, U+11C50..11C6C)
5273          \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
5274          \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (111)
5275          \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
5276                                    (111: U+02EA..02EB, U+3001..3003,
5277                                    U+3008..3011, U+3013..301F,
5278                                    U+302A..302D, U+3030 ...)
5279          \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
5280          \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
5281                                    (109: U+11000..1104D, U+11052..1106F,
5282                                    U+1107F)
5283          \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
5284          \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
5285                                    (256: U+2800..28FF)
5286          \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
5287          \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
5288                                    (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
5289          \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
5290          \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
5291                                    U+1735..1736, U+1740..1753)
5292          \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (87)
5293          \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
5294                                    \p{Cans}) (710: U+1400..167F,
5295                                    U+18B0..18F5)
5296          \p{Script_Extensions: Cans} \p{Script_Extensions=
5297                                    Canadian_Aboriginal} (710)
5298          \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
5299          \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
5300                                    U+102A0..102D0)
5301          \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
5302                                    \p{Aghb}) (53: U+10530..10563, U+1056F)
5303          \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (87:
5304                                    U+09E6..09EF, U+1040..1049,
5305                                    U+11100..11134, U+11136..11143)
5306          \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
5307                                    U+AA00..AA36, U+AA40..AA4D,
5308                                    U+AA50..AA59, U+AA5C..AA5F)
5309          \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
5310          \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
5311                                    (172: U+13A0..13F5, U+13F8..13FD,
5312                                    U+AB70..ABBF)
5313          \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
5314                                    (6947: [\x00-
5315                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
5316                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
5317                                    \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
5318                                    U+02E5..02E9, U+02EC..02FF, U+0374,
5319                                    U+037E ...)
5320          \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
5321          \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
5322                                    (165: U+03E2..03EF, U+2C80..2CF3,
5323                                    U+2CF9..2CFF, U+102E0..102FB)
5324          \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
5325          \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
5326                                    (1234: U+12000..12399, U+12400..1246E,
5327                                    U+12470..12474, U+12480..12543)
5328          \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
5329                                    (112: U+10100..10102, U+10107..10133,
5330                                    U+10137..1013F, U+10800..10805, U+10808,
5331                                    U+1080A..10835 ...)
5332          \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
5333                                    (446: U+0400..052F, U+1C80..1C88,
5334                                    U+1D2B, U+1D78, U+2DE0..2DFF, U+2E43 ...)
5335          \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (446)
5336          \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
5337                                    (80: U+10400..1044F)
5338          \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
5339          \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
5340                                    (210: U+0900..097F, U+1CD0..1CF6,
5341                                    U+1CF8..1CF9, U+20F0, U+A830..A839,
5342                                    U+A8E0..A8FD)
5343          \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
5344          \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
5345          \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
5346                                    (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
5347                                    U+1BC80..1BC88, U+1BC90..1BC99,
5348                                    U+1BC9C..1BCA3)
5349          \p{Script_Extensions: Egyp} \p{Script_Extensions=
5350                                    Egyptian_Hieroglyphs} (1071)
5351          \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
5352                                    \p{Egyp}) (1071: U+13000..1342E)
5353          \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
5354          \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
5355                                    (40: U+10500..10527)
5356          \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
5357          \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
5358                                    (495: U+1200..1248, U+124A..124D,
5359                                    U+1250..1256, U+1258, U+125A..125D,
5360                                    U+1260..1288 ...)
5361          \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (129)
5362          \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
5363                                    (129: U+0589, U+10A0..10C5, U+10C7,
5364                                    U+10CD, U+10D0..10FF, U+2D00..2D25 ...)
5365          \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136)
5366          \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
5367                                    (136: U+0484, U+0487, U+2C00..2C2E,
5368                                    U+2C30..2C5E, U+2E43, U+A66F ...)
5369          \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
5370                                    (75)
5371          \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
5372          \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
5373                                    U+10330..1034A)
5374          \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (113)
5375          \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
5376                                    (113: U+0951..0952, U+0964..0965,
5377                                    U+0BAA, U+0BB5, U+0BE6..0BF2, U+1CD0 ...)
5378          \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
5379                                    U+0342, U+0345, U+0370..0373,
5380                                    U+0375..0377, U+037A..037D, U+037F ...)
5381          \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
5382          \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
5383                                    (105: U+0951..0952, U+0964..0965,
5384                                    U+0A81..0A83, U+0A85..0A8D,
5385                                    U+0A8F..0A91, U+0A93..0AA8 ...)
5386          \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
5387          \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
5388                                    (93: U+0951..0952, U+0964..0965,
5389                                    U+0A01..0A03, U+0A05..0A0A,
5390                                    U+0A0F..0A10, U+0A13..0A28 ...)
5391          \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (93)
5392          \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (89_507:
5393                                    U+2E80..2E99, U+2E9B..2EF3,
5394                                    U+2F00..2FD5, U+3001..3003,
5395                                    U+3005..3011, U+3013..301F ...)
5396          \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
5397          \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
5398                                    (11_775: U+1100..11FF, U+3001..3003,
5399                                    U+3008..3011, U+3013..301F,
5400                                    U+302E..3030, U+3037 ...)
5401          \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (89_507)
5402          \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
5403          \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
5404                                    (23: U+1720..1736)
5405          \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
5406          \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
5407                                    U+108E0..108F2, U+108F4..108F5,
5408                                    U+108FB..108FF)
5409          \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (133)
5410          \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
5411                                    (133: U+0591..05C7, U+05D0..05EA,
5412                                    U+05F0..05F4, U+FB1D..FB36,
5413                                    U+FB38..FB3C, U+FB3E ...)
5414          \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (428)
5415          \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
5416                                    (428: U+3001..3003, U+3008..3011,
5417                                    U+3013..301F, U+3030..3035, U+3037,
5418                                    U+303C..303D ...)
5419          \p{Script_Extensions: Hluw} \p{Script_Extensions=
5420                                    Anatolian_Hieroglyphs} (583)
5421          \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
5422                                    (127)
5423          \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
5424                                    (108)
5425          \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
5426                                    \p{Armi}) (31: U+10840..10855,
5427                                    U+10857..1085F)
5428          \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
5429                                    (500: U+0300..0341, U+0343..0344,
5430                                    U+0346..0362, U+1AB0..1ABE,
5431                                    U+1DC2..1DF9, U+1DFB..1DFF ...)
5432          \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
5433                                    \p{Phli}) (27: U+10B60..10B72,
5434                                    U+10B78..10B7F)
5435          \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
5436                                    Prti}, \p{Prti}) (30: U+10B40..10B55,
5437                                    U+10B58..10B5F)
5438          \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
5439          \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
5440          \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
5441                                    (91: U+A980..A9CD, U+A9CF..A9D9,
5442                                    U+A9DE..A9DF)
5443          \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (86:
5444                                    U+0966..096F, U+A830..A839,
5445                                    U+11080..110C1)
5446          \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
5447          \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (352)
5448          \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
5449                                    (100: U+0951..0952, U+0964..0965,
5450                                    U+0C80..0C83, U+0C85..0C8C,
5451                                    U+0C8E..0C90, U+0C92..0CA8 ...)
5452          \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
5453                                    (352: U+3001..3003, U+3008..3011,
5454                                    U+3013..301F, U+3030..3035, U+3037,
5455                                    U+303C..303D ...)
5456          \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
5457                                    (48: U+A900..A92F)
5458          \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (65)
5459          \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
5460                                    (65: U+10A00..10A03, U+10A05..10A06,
5461                                    U+10A0C..10A13, U+10A15..10A17,
5462                                    U+10A19..10A33, U+10A38..10A3A ...)
5463          \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
5464                                    U+1780..17DD, U+17E0..17E9,
5465                                    U+17F0..17F9, U+19E0..19FF)
5466          \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
5467          \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (72)
5468          \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (72:
5469                                    U+0AE6..0AEF, U+11200..11211,
5470                                    U+11213..1123E)
5471          \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
5472                                    (81: U+0964..0965, U+A830..A839,
5473                                    U+112B0..112EA, U+112F0..112F9)
5474          \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (100)
5475          \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (86)
5476          \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
5477          \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (67:
5478                                    U+0E81..0E82, U+0E84, U+0E87..0E88,
5479                                    U+0E8A, U+0E8D, U+0E94..0E97 ...)
5480          \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (67)
5481          \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
5482                                    (1370: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
5483                                    \xf6\xf8-\xff], U+0100..02B8,
5484                                    U+02E0..02E4, U+0363..036F,
5485                                    U+0485..0486, U+0951..0952 ...)
5486          \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1370)
5487          \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
5488          \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
5489                                    U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
5490          \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
5491          \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
5492                                    U+0965, U+1900..191E, U+1920..192B,
5493                                    U+1930..193B, U+1940, U+1944..194F)
5494          \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
5495          \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
5496          \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
5497                                    (386: U+10107..10133, U+10600..10736,
5498                                    U+10740..10755, U+10760..10767)
5499          \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
5500                                    (268: U+10000..1000B, U+1000D..10026,
5501                                    U+10028..1003A, U+1003C..1003D,
5502                                    U+1003F..1004D, U+10050..1005D ...)
5503          \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (48:
5504                                    U+A4D0..A4FF)
5505          \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
5506          \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
5507                                    U+10280..1029C)
5508          \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
5509          \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
5510                                    U+10920..10939, U+1093F)
5511          \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
5512                                    (61: U+0964..096F, U+A830..A839,
5513                                    U+11150..11176)
5514          \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
5515          \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
5516                                    (122: U+0951..0952, U+0964..0965,
5517                                    U+0D00..0D03, U+0D05..0D0C,
5518                                    U+0D0E..0D10, U+0D12..0D44 ...)
5519          \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
5520          \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
5521                                    (30: U+0640, U+0840..085B, U+085E)
5522          \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
5523          \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
5524                                    (52: U+0640, U+10AC0..10AE6,
5525                                    U+10AEB..10AF6)
5526          \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
5527          \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
5528                                    (68: U+11C70..11C8F, U+11C92..11CA7,
5529                                    U+11CA9..11CB6)
5530          \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
5531                                    \p{Gonm}) (75: U+11D00..11D06,
5532                                    U+11D08..11D09, U+11D0B..11D36, U+11D3A,
5533                                    U+11D3C..11D3D, U+11D3F..11D47 ...)
5534          \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
5535                                    \p{Mtei}) (79: U+AAE0..AAF6,
5536                                    U+ABC0..ABED, U+ABF0..ABF9)
5537          \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
5538                                    (213)
5539          \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
5540                                    \p{Mend}) (213: U+1E800..1E8C4,
5541                                    U+1E8C7..1E8D6)
5542          \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
5543                                    (90)
5544          \p{Script_Extensions: Mero} \p{Script_Extensions=
5545                                    Meroitic_Hieroglyphs} (32)
5546          \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
5547                                    \p{Merc}) (90: U+109A0..109B7,
5548                                    U+109BC..109CF, U+109D2..109FF)
5549          \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
5550                                    \p{Mero}) (32: U+10980..1099F)
5551          \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (133:
5552                                    U+16F00..16F44, U+16F50..16F7E,
5553                                    U+16F8F..16F9F)
5554          \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (122)
5555          \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
5556                                    U+A830..A839, U+11600..11644,
5557                                    U+11650..11659)
5558          \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (169)
5559          \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
5560                                    (169: U+1800..180E, U+1810..1819,
5561                                    U+1820..1877, U+1880..18AA,
5562                                    U+11660..1166C)
5563          \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
5564                                    U+16A40..16A5E, U+16A60..16A69,
5565                                    U+16A6E..16A6F)
5566          \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
5567          \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
5568          \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
5569          \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
5570                                    (48: U+0A66..0A6F, U+11280..11286,
5571                                    U+11288, U+1128A..1128D, U+1128F..1129D,
5572                                    U+1129F..112A9)
5573          \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
5574                                    (224: U+1000..109F, U+A92E,
5575                                    U+A9E0..A9FE, U+AA60..AA7F)
5576          \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
5577          \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
5578                                    (40: U+10880..1089E, U+108A7..108AF)
5579          \p{Script_Extensions: Narb} \p{Script_Extensions=
5580                                    Old_North_Arabian} (32)
5581          \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
5582          \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
5583                                    (83: U+1980..19AB, U+19B0..19C9,
5584                                    U+19D0..19DA, U+19DE..19DF)
5585          \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (92:
5586                                    U+11400..11459, U+1145B, U+1145D)
5587          \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (59:
5588                                    U+07C0..07FA)
5589          \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (59)
5590          \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
5591          \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
5592                                    U+16FE1, U+1B170..1B2FB)
5593          \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
5594          \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
5595                                    U+1680..169C)
5596          \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
5597                                    (48: U+1C50..1C7F)
5598          \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
5599          \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
5600                                    \p{Hung}) (108: U+10C80..10CB2,
5601                                    U+10CC0..10CF2, U+10CFA..10CFF)
5602          \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
5603                                    (39: U+10300..10323, U+1032D..1032F)
5604          \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
5605                                    \p{Narb}) (32: U+10A80..10A9F)
5606          \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
5607                                    (44: U+0483, U+10350..1037A)
5608          \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
5609                                    (50: U+103A0..103C3, U+103C8..103D5)
5610          \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
5611                                    \p{Sarb}) (32: U+10A60..10A7F)
5612          \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
5613                                    (73: U+10C00..10C48)
5614          \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (94:
5615                                    U+0951..0952, U+0964..0965,
5616                                    U+0B01..0B03, U+0B05..0B0C,
5617                                    U+0B0F..0B10, U+0B13..0B28 ...)
5618          \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
5619          \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (94)
5620          \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
5621                                    U+104B0..104D3, U+104D8..104FB)
5622          \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
5623          \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
5624          \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
5625                                    (40: U+10480..1049D, U+104A0..104A9)
5626          \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
5627                                    \p{Hmng}) (127: U+16B00..16B45,
5628                                    U+16B50..16B59, U+16B5B..16B61,
5629                                    U+16B63..16B77, U+16B7D..16B8F)
5630          \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
5631          \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
5632                                    (32: U+10860..1087F)
5633          \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
5634                                    (57: U+11AC0..11AF8)
5635          \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
5636          \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
5637          \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
5638          \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
5639                                    (59: U+1802..1803, U+1805, U+A840..A877)
5640          \p{Script_Extensions: Phli} \p{Script_Extensions=
5641                                    Inscriptional_Pahlavi} (27)
5642          \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
5643                                    (30)
5644          \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
5645          \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
5646                                    (29: U+10900..1091B, U+1091F)
5647          \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (133)
5648          \p{Script_Extensions: Prti} \p{Script_Extensions=
5649                                    Inscriptional_Parthian} (30)
5650          \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
5651                                    \p{Phlp}) (30: U+0640, U+10B80..10B91,
5652                                    U+10B99..10B9C, U+10BA9..10BAF)
5653          \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
5654          \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (500)
5655          \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
5656                                    U+A930..A953, U+A95F)
5657          \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
5658          \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
5659                                    U+16A0..16EA, U+16EE..16F8)
5660          \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
5661          \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
5662                                    (61: U+0800..082D, U+0830..083E)
5663          \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
5664          \p{Script_Extensions: Sarb} \p{Script_Extensions=
5665                                    Old_South_Arabian} (32)
5666          \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
5667          \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
5668                                    (82: U+A880..A8C5, U+A8CE..A8D9)
5669          \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
5670          \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
5671                                    (100: U+0951, U+1CD7, U+1CD9,
5672                                    U+1CDC..1CDD, U+1CE0, U+11180..111CD ...)
5673          \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
5674                                    (48: U+10450..1047F)
5675          \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
5676          \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100)
5677          \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
5678          \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
5679                                    (92: U+11580..115B5, U+115B8..115DD)
5680          \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
5681                                    (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
5682                                    U+1DAA1..1DAAF)
5683          \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
5684          \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112)
5685          \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
5686                                    (112: U+0964..0965, U+0D82..0D83,
5687                                    U+0D85..0D96, U+0D9A..0DB1,
5688                                    U+0DB3..0DBB, U+0DBD ...)
5689          \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
5690          \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
5691                                    \p{Sora}) (35: U+110D0..110E8,
5692                                    U+110F0..110F9)
5693          \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (80)
5694          \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
5695                                    (80: U+11A50..11A83, U+11A86..11A9C,
5696                                    U+11A9E..11AA2)
5697          \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
5698          \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
5699                                    (72: U+1B80..1BBF, U+1CC0..1CC7)
5700          \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56)
5701          \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
5702                                    \p{Sylo}) (56: U+0964..0965,
5703                                    U+09E6..09EF, U+A800..A82B)
5704          \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (105)
5705          \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
5706                                    (105: U+060C, U+061B..061C, U+061F,
5707                                    U+0640, U+064B..0655, U+0670 ...)
5708          \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
5709                                    (22: U+1700..170C, U+170E..1714,
5710                                    U+1735..1736)
5711          \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
5712          \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
5713                                    (20: U+1735..1736, U+1760..176C,
5714                                    U+176E..1770, U+1772..1773)
5715          \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
5716                                    U+1040..1049, U+1950..196D, U+1970..1974)
5717          \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
5718                                    (127: U+1A20..1A5E, U+1A60..1A7C,
5719                                    U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
5720          \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
5721                                    (72: U+AA80..AAC2, U+AADB..AADF)
5722          \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (78)
5723          \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (78:
5724                                    U+0964..0965, U+A830..A839,
5725                                    U+11680..116B7, U+116C0..116C9)
5726          \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
5727          \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
5728          \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (81:
5729                                    U+0951..0952, U+0964..0965,
5730                                    U+0B82..0B83, U+0B85..0B8A,
5731                                    U+0B8E..0B90, U+0B92..0B95 ...)
5732          \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (81)
5733          \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6881)
5734          \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
5735                                    (6881: U+16FE0, U+17000..187EC,
5736                                    U+18800..18AF2)
5737          \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
5738          \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (101)
5739          \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
5740                                    (101: U+0951..0952, U+0964..0965,
5741                                    U+0C00..0C03, U+0C05..0C0C,
5742                                    U+0C0E..0C10, U+0C12..0C28 ...)
5743          \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
5744          \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
5745          \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
5746          \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
5747                                    U+060C, U+061B..061C, U+061F,
5748                                    U+0660..0669, U+0780..07B1, U+FDF2 ...)
5749          \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
5750                                    U+0E01..0E3A, U+0E40..0E5B)
5751          \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
5752                                    (207: U+0F00..0F47, U+0F49..0F6C,
5753                                    U+0F71..0F97, U+0F99..0FBC,
5754                                    U+0FBE..0FCC, U+0FCE..0FD4 ...)
5755          \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
5756          \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
5757                                    (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
5758          \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (94)
5759          \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
5760                                    (94: U+0964..0965, U+A830..A839,
5761                                    U+11480..114C7, U+114D0..114D9)
5762          \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
5763          \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
5764                                    (31: U+10380..1039D, U+1039F)
5765          \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
5766                                    (977_357 plus all above-Unicode code
5767                                    points: U+0378..0379, U+0380..0383,
5768                                    U+038B, U+038D, U+03A2, U+0530 ...)
5769          \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
5770                                    U+A500..A62B)
5771          \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
5772          \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
5773          \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
5774                                    (84: U+118A0..118F2, U+118FF)
5775          \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
5776          \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
5777          \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
5778                                    U+3001..3002, U+3008..3011,
5779                                    U+3014..301B, U+30FB, U+A000..A48C,
5780                                    U+A490..A4C6 ...)
5781          \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
5782          \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
5783                                    \p{Zanb}) (72: U+11A00..11A47)
5784          \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
5785                                    (72)
5786          \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (500)
5787          \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (6947)
5788          \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (977_357
5789                                    plus all above-Unicode code points)
5790          \p{Scx: *}              \p{Script_Extensions: *}
5791          \p{SD}                  \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
5792          \p{SD: *}               \p{Soft_Dotted: *}
5793          \p{Sentence_Break: AT}  \p{Sentence_Break=ATerm} (4)
5794          \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
5795                                    U+FE52, U+FF0E)
5796          \p{Sentence_Break: CL}  \p{Sentence_Break=Close} (187)
5797          \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187:
5798                                    [\"\'\(\)\[\]\{\}\xab\xbb],
5799                                    U+0F3A..0F3D, U+169B..169C,
5800                                    U+2018..201F, U+2039..203A, U+2045..2046
5801                                    ...)
5802          \p{Sentence_Break: CR}  (Short: \p{SB=CR}) (1: [\r])
5803          \p{Sentence_Break: EX}  \p{Sentence_Break=Extend} (2277)
5804          \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2277: U+0300..036F,
5805                                    U+0483..0489, U+0591..05BD, U+05BF,
5806                                    U+05C1..05C2, U+05C4..05C5 ...)
5807          \p{Sentence_Break: FO}  \p{Sentence_Break=Format} (53)
5808          \p{Sentence_Break: Format} (Short: \p{SB=FO}) (53: [\xad],
5809                                    U+0600..0605, U+061C, U+06DD, U+070F,
5810                                    U+08E2 ...)
5811          \p{Sentence_Break: LE}  \p{Sentence_Break=OLetter} (121_354)
5812          \p{Sentence_Break: LF}  (Short: \p{SB=LF}) (1: [\n])
5813          \p{Sentence_Break: LO}  \p{Sentence_Break=Lower} (2251)
5814          \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2251: [a-
5815                                    z\xaa\xb5\xba\xdf-\xf6\xf8-\xff],
5816                                    U+0101, U+0103, U+0105, U+0107, U+0109
5817                                    ...)
5818          \p{Sentence_Break: NU}  \p{Sentence_Break=Numeric} (582)
5819          \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (582: [0-9],
5820                                    U+0660..0669, U+066B..066C,
5821                                    U+06F0..06F9, U+07C0..07C9, U+0966..096F
5822                                    ...)
5823          \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (121_354: U+01BB,
5824                                    U+01C0..01C3, U+0294, U+02B9..02BF,
5825                                    U+02C6..02D1, U+02EC ...)
5826          \p{Sentence_Break: Other} (Short: \p{SB=XX}) (985_375 plus all
5827                                    above-Unicode code points:
5828                                    [^\t\n\cK\f\r\x20!\"\'\(\),\-.0-9:?A-
5829                                    Z\[\]a-z\{\}\x85\xa0\xaa-
5830                                    \xab\xad\xb5\xba-\xbb\xc0-\xd6\xd8-
5831                                    \xf6\xf8-\xff], U+02C2..02C5,
5832                                    U+02D2..02DF, U+02E5..02EB, U+02ED,
5833                                    U+02EF..02FF ...)
5834          \p{Sentence_Break: SC}  \p{Sentence_Break=SContinue} (26)
5835          \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
5836                                    U+055D, U+060C..060D, U+07F8, U+1802,
5837                                    U+1808 ...)
5838          \p{Sentence_Break: SE}  \p{Sentence_Break=Sep} (3)
5839          \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
5840                                    U+2028..2029)
5841          \p{Sentence_Break: Sp}  (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
5842                                    U+1680, U+2000..200A, U+202F, U+205F,
5843                                    U+3000)
5844          \p{Sentence_Break: ST}  \p{Sentence_Break=STerm} (125)
5845          \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (125: [!?], U+0589,
5846                                    U+061F, U+06D4, U+0700..0702, U+07F9 ...)
5847          \p{Sentence_Break: UP}  \p{Sentence_Break=Upper} (1853)
5848          \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1853: [A-Z\xc0-
5849                                    \xd6\xd8-\xde], U+0100, U+0102, U+0104,
5850                                    U+0106, U+0108 ...)
5851          \p{Sentence_Break: XX}  \p{Sentence_Break=Other} (985_375 plus all
5852                                    above-Unicode code points)
5853          \p{Sentence_Terminal}   \p{Sentence_Terminal=Y} (Short: \p{STerm})
5854                                    (128)
5855          \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
5856                                    (1_113_984 plus all above-Unicode code
5857                                    points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
5858                                    \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
5859                                    \xff], U+0100..0588, U+058A..061E,
5860                                    U+0620..06D3, U+06D5..06FF, U+0703..07F8
5861                                    ...)
5862          \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (128:
5863                                    [!.?], U+0589, U+061F, U+06D4,
5864                                    U+0700..0702, U+07F9 ...)
5865          \p{Separator}           \p{General_Category=Separator} (Short:
5866                                    \p{Z}) (19)
5867          \p{Sgnw}                \p{SignWriting} (= \p{Script_Extensions=
5868                                    SignWriting}) (672)
5869          \p{Sharada}             \p{Script_Extensions=Sharada} (Short:
5870                                    \p{Shrd}; NOT \p{Block=Sharada}) (100)
5871          \p{Shavian}             \p{Script_Extensions=Shavian} (Short:
5872                                    \p{Shaw}) (48)
5873          \p{Shaw}                \p{Shavian} (= \p{Script_Extensions=
5874                                    Shavian}) (48)
5875        X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
5876                                    (16)
5877          \p{Shrd}                \p{Sharada} (= \p{Script_Extensions=
5878                                    Sharada}) (NOT \p{Block=Sharada}) (100)
5879          \p{Sidd}                \p{Siddham} (= \p{Script_Extensions=
5880                                    Siddham}) (NOT \p{Block=Siddham}) (92)
5881          \p{Siddham}             \p{Script_Extensions=Siddham} (Short:
5882                                    \p{Sidd}; NOT \p{Block=Siddham}) (92)
5883          \p{SignWriting}         \p{Script_Extensions=SignWriting} (Short:
5884                                    \p{Sgnw}) (672)
5885          \p{Sind}                \p{Khudawadi} (= \p{Script_Extensions=
5886                                    Khudawadi}) (NOT \p{Block=Khudawadi})
5887                                    (81)
5888          \p{Sinh}                \p{Sinhala} (= \p{Script_Extensions=
5889                                    Sinhala}) (NOT \p{Block=Sinhala}) (112)
5890          \p{Sinhala}             \p{Script_Extensions=Sinhala} (Short:
5891                                    \p{Sinh}; NOT \p{Block=Sinhala}) (112)
5892        X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
5893          \p{Sk}                  \p{Modifier_Symbol} (=
5894                                    \p{General_Category=Modifier_Symbol})
5895                                    (121)
5896          \p{Sm}                  \p{Math_Symbol} (= \p{General_Category=
5897                                    Math_Symbol}) (948)
5898        X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
5899                                    \p{InSmallForms}) (32)
5900        X \p{Small_Forms}         \p{Small_Form_Variants} (= \p{Block=
5901                                    Small_Form_Variants}) (32)
5902          \p{So}                  \p{Other_Symbol} (= \p{General_Category=
5903                                    Other_Symbol}) (5855)
5904          \p{Soft_Dotted}         \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
5905          \p{Soft_Dotted: N*}     (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
5906                                    all above-Unicode code points: [\x00-
5907                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
5908                                    Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
5909                                    U+0100..012E, U+0130..0248,
5910                                    U+024A..0267, U+0269..029C, U+029E..02B1
5911                                    ...)
5912          \p{Soft_Dotted: Y*}     (Short: \p{SD=Y}, \p{SD}) (46: [i-j],
5913                                    U+012F, U+0249, U+0268, U+029D, U+02B2
5914                                    ...)
5915          \p{Sora}                \p{Sora_Sompeng} (= \p{Script_Extensions=
5916                                    Sora_Sompeng}) (NOT \p{Block=
5917                                    Sora_Sompeng}) (35)
5918          \p{Sora_Sompeng}        \p{Script_Extensions=Sora_Sompeng} (Short:
5919                                    \p{Sora}; NOT \p{Block=Sora_Sompeng})
5920                                    (35)
5921          \p{Soyo}                \p{Soyombo} (= \p{Script_Extensions=
5922                                    Soyombo}) (NOT \p{Block=Soyombo}) (80)
5923          \p{Soyombo}             \p{Script_Extensions=Soyombo} (Short:
5924                                    \p{Soyo}; NOT \p{Block=Soyombo}) (80)
5925          \p{Space}               \p{White_Space} (= \p{White_Space=Y}) (25)
5926          \p{Space: *}            \p{White_Space: *}
5927          \p{Space_Separator}     \p{General_Category=Space_Separator}
5928                                    (Short: \p{Zs}) (17)
5929          \p{SpacePerl}           \p{XPosixSpace} (25)
5930          \p{Spacing_Mark}        \p{General_Category=Spacing_Mark} (Short:
5931                                    \p{Mc}) (401)
5932        X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
5933                                    (Short: \p{InModifierLetters}) (80)
5934        X \p{Specials}            \p{Block=Specials} (16)
5935          \p{STerm}               \p{Sentence_Terminal} (=
5936                                    \p{Sentence_Terminal=Y}) (128)
5937          \p{STerm: *}            \p{Sentence_Terminal: *}
5938          \p{Sund}                \p{Sundanese} (= \p{Script_Extensions=
5939                                    Sundanese}) (NOT \p{Block=Sundanese})
5940                                    (72)
5941          \p{Sundanese}           \p{Script_Extensions=Sundanese} (Short:
5942                                    \p{Sund}; NOT \p{Block=Sundanese}) (72)
5943        X \p{Sundanese_Sup}       \p{Sundanese_Supplement} (= \p{Block=
5944                                    Sundanese_Supplement}) (16)
5945        X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
5946                                    \p{InSundaneseSup}) (16)
5947        X \p{Sup_Arrows_A}        \p{Supplemental_Arrows_A} (= \p{Block=
5948                                    Supplemental_Arrows_A}) (16)
5949        X \p{Sup_Arrows_B}        \p{Supplemental_Arrows_B} (= \p{Block=
5950                                    Supplemental_Arrows_B}) (128)
5951        X \p{Sup_Arrows_C}        \p{Supplemental_Arrows_C} (= \p{Block=
5952                                    Supplemental_Arrows_C}) (256)
5953        X \p{Sup_Math_Operators}  \p{Supplemental_Mathematical_Operators} (=
5954                                    \p{Block=
5955                                    Supplemental_Mathematical_Operators})
5956                                    (256)
5957        X \p{Sup_PUA_A}           \p{Supplementary_Private_Use_Area_A} (=
5958                                    \p{Block=
5959                                    Supplementary_Private_Use_Area_A})
5960                                    (65_536)
5961        X \p{Sup_PUA_B}           \p{Supplementary_Private_Use_Area_B} (=
5962                                    \p{Block=
5963                                    Supplementary_Private_Use_Area_B})
5964                                    (65_536)
5965        X \p{Sup_Punctuation}     \p{Supplemental_Punctuation} (= \p{Block=
5966                                    Supplemental_Punctuation}) (128)
5967        X \p{Sup_Symbols_And_Pictographs}
5968                                    \p{Supplemental_Symbols_And_Pictographs}
5969                                    (= \p{Block=
5970                                    Supplemental_Symbols_And_Pictographs})
5971                                    (256)
5972        X \p{Super_And_Sub}       \p{Superscripts_And_Subscripts} (=
5973                                    \p{Block=Superscripts_And_Subscripts})
5974                                    (48)
5975        X \p{Superscripts_And_Subscripts} \p{Block=
5976                                    Superscripts_And_Subscripts} (Short:
5977                                    \p{InSuperAndSub}) (48)
5978        X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
5979                                    \p{InSupArrowsA}) (16)
5980        X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
5981                                    \p{InSupArrowsB}) (128)
5982        X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
5983                                    \p{InSupArrowsC}) (256)
5984        X \p{Supplemental_Mathematical_Operators} \p{Block=
5985                                    Supplemental_Mathematical_Operators}
5986                                    (Short: \p{InSupMathOperators}) (256)
5987        X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
5988                                    (Short: \p{InSupPunctuation}) (128)
5989        X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
5990                                    Supplemental_Symbols_And_Pictographs}
5991                                    (Short: \p{InSupSymbolsAndPictographs})
5992                                    (256)
5993        X \p{Supplementary_Private_Use_Area_A} \p{Block=
5994                                    Supplementary_Private_Use_Area_A}
5995                                    (Short: \p{InSupPUAA}) (65_536)
5996        X \p{Supplementary_Private_Use_Area_B} \p{Block=
5997                                    Supplementary_Private_Use_Area_B}
5998                                    (Short: \p{InSupPUAB}) (65_536)
5999          \p{Surrogate}           \p{General_Category=Surrogate} (Short:
6000                                    \p{Cs}) (2048)
6001        X \p{Sutton_SignWriting}  \p{Block=Sutton_SignWriting} (688)
6002          \p{Sylo}                \p{Syloti_Nagri} (= \p{Script_Extensions=
6003                                    Syloti_Nagri}) (NOT \p{Block=
6004                                    Syloti_Nagri}) (56)
6005          \p{Syloti_Nagri}        \p{Script_Extensions=Syloti_Nagri} (Short:
6006                                    \p{Sylo}; NOT \p{Block=Syloti_Nagri})
6007                                    (56)
6008          \p{Symbol}              \p{General_Category=Symbol} (Short: \p{S})
6009                                    (6978)
6010          \p{Syrc}                \p{Syriac} (= \p{Script_Extensions=
6011                                    Syriac}) (NOT \p{Block=Syriac}) (105)
6012          \p{Syriac}              \p{Script_Extensions=Syriac} (Short:
6013                                    \p{Syrc}; NOT \p{Block=Syriac}) (105)
6014        X \p{Syriac_Sup}          \p{Syriac_Supplement} (= \p{Block=
6015                                    Syriac_Supplement}) (16)
6016        X \p{Syriac_Supplement}   \p{Block=Syriac_Supplement} (Short:
6017                                    \p{InSyriacSup}) (16)
6018          \p{Tagalog}             \p{Script_Extensions=Tagalog} (Short:
6019                                    \p{Tglg}; NOT \p{Block=Tagalog}) (22)
6020          \p{Tagb}                \p{Tagbanwa} (= \p{Script_Extensions=
6021                                    Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
6022          \p{Tagbanwa}            \p{Script_Extensions=Tagbanwa} (Short:
6023                                    \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
6024        X \p{Tags}                \p{Block=Tags} (128)
6025          \p{Tai_Le}              \p{Script_Extensions=Tai_Le} (Short:
6026                                    \p{Tale}; NOT \p{Block=Tai_Le}) (45)
6027          \p{Tai_Tham}            \p{Script_Extensions=Tai_Tham} (Short:
6028                                    \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
6029          \p{Tai_Viet}            \p{Script_Extensions=Tai_Viet} (Short:
6030                                    \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
6031        X \p{Tai_Xuan_Jing}       \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
6032                                    Tai_Xuan_Jing_Symbols}) (96)
6033        X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
6034                                    \p{InTaiXuanJing}) (96)
6035          \p{Takr}                \p{Takri} (= \p{Script_Extensions=Takri})
6036                                    (NOT \p{Block=Takri}) (78)
6037          \p{Takri}               \p{Script_Extensions=Takri} (Short:
6038                                    \p{Takr}; NOT \p{Block=Takri}) (78)
6039          \p{Tale}                \p{Tai_Le} (= \p{Script_Extensions=
6040                                    Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
6041          \p{Talu}                \p{New_Tai_Lue} (= \p{Script_Extensions=
6042                                    New_Tai_Lue}) (NOT \p{Block=
6043                                    New_Tai_Lue}) (83)
6044          \p{Tamil}               \p{Script_Extensions=Tamil} (Short:
6045                                    \p{Taml}; NOT \p{Block=Tamil}) (81)
6046          \p{Taml}                \p{Tamil} (= \p{Script_Extensions=Tamil})
6047                                    (NOT \p{Block=Tamil}) (81)
6048          \p{Tang}                \p{Tangut} (= \p{Script_Extensions=
6049                                    Tangut}) (NOT \p{Block=Tangut}) (6881)
6050          \p{Tangut}              \p{Script_Extensions=Tangut} (Short:
6051                                    \p{Tang}; NOT \p{Block=Tangut}) (6881)
6052        X \p{Tangut_Components}   \p{Block=Tangut_Components} (768)
6053          \p{Tavt}                \p{Tai_Viet} (= \p{Script_Extensions=
6054                                    Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
6055          \p{Telu}                \p{Telugu} (= \p{Script_Extensions=
6056                                    Telugu}) (NOT \p{Block=Telugu}) (101)
6057          \p{Telugu}              \p{Script_Extensions=Telugu} (Short:
6058                                    \p{Telu}; NOT \p{Block=Telugu}) (101)
6059          \p{Term}                \p{Terminal_Punctuation} (=
6060                                    \p{Terminal_Punctuation=Y}) (252)
6061          \p{Term: *}             \p{Terminal_Punctuation: *}
6062          \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
6063                                    \p{Term}) (252)
6064          \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
6065                                    (1_113_860 plus all above-Unicode code
6066                                    points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
6067                                    9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
6068                                    U+0100..037D, U+037F..0386,
6069                                    U+0388..0588, U+058A..05C2, U+05C4..060B
6070                                    ...)
6071          \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (252:
6072                                    [!,.:;?], U+037E, U+0387, U+0589,
6073                                    U+05C3, U+060C ...)
6074          \p{Tfng}                \p{Tifinagh} (= \p{Script_Extensions=
6075                                    Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
6076          \p{Tglg}                \p{Tagalog} (= \p{Script_Extensions=
6077                                    Tagalog}) (NOT \p{Block=Tagalog}) (22)
6078          \p{Thaa}                \p{Thaana} (= \p{Script_Extensions=
6079                                    Thaana}) (NOT \p{Block=Thaana}) (66)
6080          \p{Thaana}              \p{Script_Extensions=Thaana} (Short:
6081                                    \p{Thaa}; NOT \p{Block=Thaana}) (66)
6082          \p{Thai}                \p{Script_Extensions=Thai} (NOT \p{Block=
6083                                    Thai}) (86)
6084          \p{Tibetan}             \p{Script_Extensions=Tibetan} (Short:
6085                                    \p{Tibt}; NOT \p{Block=Tibetan}) (207)
6086          \p{Tibt}                \p{Tibetan} (= \p{Script_Extensions=
6087                                    Tibetan}) (NOT \p{Block=Tibetan}) (207)
6088          \p{Tifinagh}            \p{Script_Extensions=Tifinagh} (Short:
6089                                    \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
6090          \p{Tirh}                \p{Tirhuta} (= \p{Script_Extensions=
6091                                    Tirhuta}) (NOT \p{Block=Tirhuta}) (94)
6092          \p{Tirhuta}             \p{Script_Extensions=Tirhuta} (Short:
6093                                    \p{Tirh}; NOT \p{Block=Tirhuta}) (94)
6094          \p{Title}               \p{Titlecase} (/i= Cased=Yes) (31)
6095          \p{Titlecase}           (= \p{Gc=Lt}) (Short: \p{Title}; /i=
6096                                    Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
6097                                    U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
6098          \p{Titlecase_Letter}    \p{General_Category=Titlecase_Letter}
6099                                    (Short: \p{Lt}; /i= General_Category=
6100                                    Cased_Letter) (31)
6101        X \p{Transport_And_Map}   \p{Transport_And_Map_Symbols} (= \p{Block=
6102                                    Transport_And_Map_Symbols}) (128)
6103        X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
6104                                    (Short: \p{InTransportAndMap}) (128)
6105        X \p{UCAS}                \p{Unified_Canadian_Aboriginal_Syllabics}
6106                                    (= \p{Block=
6107                                    Unified_Canadian_Aboriginal_Syllabics})
6108                                    (640)
6109        X \p{UCAS_Ext}            \p{Unified_Canadian_Aboriginal_Syllabics_-
6110                                    Extended} (= \p{Block=
6111                                    Unified_Canadian_Aboriginal_Syllabics_-
6112                                    Extended}) (80)
6113          \p{Ugar}                \p{Ugaritic} (= \p{Script_Extensions=
6114                                    Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
6115          \p{Ugaritic}            \p{Script_Extensions=Ugaritic} (Short:
6116                                    \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
6117          \p{UIdeo}               \p{Unified_Ideograph} (=
6118                                    \p{Unified_Ideograph=Y}) (87_882)
6119          \p{UIdeo: *}            \p{Unified_Ideograph: *}
6120          \p{Unassigned}          \p{General_Category=Unassigned} (Short:
6121                                    \p{Cn}) (837_841 plus all above-Unicode
6122                                    code points)
6123          \p{Unicode}             \p{Any} (1_114_112)
6124        X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
6125                                    Unified_Canadian_Aboriginal_Syllabics}
6126                                    (Short: \p{InUCAS}) (640)
6127        X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
6128                                    Unified_Canadian_Aboriginal_Syllabics_-
6129                                    Extended} (Short: \p{InUCASExt}) (80)
6130          \p{Unified_Ideograph}   \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
6131                                    (87_882)
6132          \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
6133                                    (1_026_230 plus all above-Unicode code
6134                                    points: U+0000..33FF, U+4DB6..4DFF,
6135                                    U+9FEB..FA0D, U+FA10, U+FA12,
6136                                    U+FA15..FA1E ...)
6137          \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (87_882:
6138                                    U+3400..4DB5, U+4E00..9FEA,
6139                                    U+FA0E..FA0F, U+FA11, U+FA13..FA14,
6140                                    U+FA1F ...)
6141          \p{Unknown}             \p{Script_Extensions=Unknown} (Short:
6142                                    \p{Zzzz}) (977_357 plus all above-
6143                                    Unicode code points)
6144          \p{Upper}               \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6145                                    Cased=Yes) (1822)
6146          \p{Upper: *}            \p{Uppercase: *}
6147          \p{Uppercase}           \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6148                                    Cased=Yes) (1822)
6149          \p{Uppercase: N*}       (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
6150                                    No) (1_112_290 plus all above-Unicode
6151                                    code points: [\x00-
6152                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6153                                    >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
6154                                    \xbf\xd7\xdf-\xff], U+0101, U+0103,
6155                                    U+0105, U+0107, U+0109 ...)
6156          \p{Uppercase: Y*}       (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
6157                                    Yes) (1822: [A-Z\xc0-\xd6\xd8-\xde],
6158                                    U+0100, U+0102, U+0104, U+0106, U+0108
6159                                    ...)
6160          \p{Uppercase_Letter}    \p{General_Category=Uppercase_Letter}
6161                                    (Short: \p{Lu}; /i= General_Category=
6162                                    Cased_Letter) (1702)
6163          \p{Vai}                 \p{Script_Extensions=Vai} (NOT \p{Block=
6164                                    Vai}) (300)
6165          \p{Vaii}                \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
6166                                    \p{Block=Vai}) (300)
6167          \p{Variation_Selector}  \p{Variation_Selector=Y} (Short: \p{VS};
6168                                    NOT \p{Variation_Selectors}) (259)
6169          \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
6170                                    plus all above-Unicode code points:
6171                                    U+0000..180A, U+180E..FDFF,
6172                                    U+FE10..E00FF, U+E01F0..infinity)
6173          \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259:
6174                                    U+180B..180D, U+FE00..FE0F,
6175                                    U+E0100..E01EF)
6176        X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
6177                                    \p{InVS}) (16)
6178        X \p{Variation_Selectors_Supplement} \p{Block=
6179                                    Variation_Selectors_Supplement} (Short:
6180                                    \p{InVSSup}) (240)
6181        X \p{Vedic_Ext}           \p{Vedic_Extensions} (= \p{Block=
6182                                    Vedic_Extensions}) (48)
6183        X \p{Vedic_Extensions}    \p{Block=Vedic_Extensions} (Short:
6184                                    \p{InVedicExt}) (48)
6185        X \p{Vertical_Forms}      \p{Block=Vertical_Forms} (16)
6186          \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
6187                                    (787_892 plus all above-Unicode code
6188                                    points)
6189          \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (787_892 plus
6190                                    all above-Unicode code points: [\x00-
6191                                    \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
6192                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
6193                                    U+02EC..10FF, U+1200..1400,
6194                                    U+1680..18AF, U+1900..2015 ...)
6195          \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
6196                                    Transformed_Rotated} (47)
6197          \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
6198                                    (47: U+2329..232A, U+3008..3011,
6199                                    U+3014..301F, U+3030, U+30A0, U+30FC ...)
6200          \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
6201                                    (147: U+3001..3002, U+3041, U+3043,
6202                                    U+3045, U+3047, U+3049 ...)
6203          \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
6204                                    Transformed_Upright} (147)
6205          \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
6206                                    (326_026)
6207          \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (326_026:
6208                                    [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
6209                                    U+02EA..02EB, U+1100..11FF,
6210                                    U+1401..167F, U+18B0..18FF, U+2016 ...)
6211          \p{VertSpace}           \v (7: [\n\cK\f\r\x85], U+2028..2029)
6212          \p{Vo: *}               \p{Vertical_Orientation: *}
6213          \p{VS}                  \p{Variation_Selector} (=
6214                                    \p{Variation_Selector=Y}) (NOT
6215                                    \p{Variation_Selectors}) (259)
6216          \p{VS: *}               \p{Variation_Selector: *}
6217        X \p{VS_Sup}              \p{Variation_Selectors_Supplement} (=
6218                                    \p{Block=
6219                                    Variation_Selectors_Supplement}) (240)
6220          \p{Wara}                \p{Warang_Citi} (= \p{Script_Extensions=
6221                                    Warang_Citi}) (NOT \p{Block=
6222                                    Warang_Citi}) (84)
6223          \p{Warang_Citi}         \p{Script_Extensions=Warang_Citi} (Short:
6224                                    \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
6225          \p{WB: *}               \p{Word_Break: *}
6226          \p{White_Space}         \p{White_Space=Y} (Short: \p{Space}) (25)
6227          \p{White_Space: N*}     (Short: \p{Space=N}, \P{Space}) (1_114_087
6228                                    plus all above-Unicode code points:
6229                                    [^\t\n\cK\f\r\x20\x85\xa0],
6230                                    U+0100..167F, U+1681..1FFF,
6231                                    U+200B..2027, U+202A..202E, U+2030..205E
6232                                    ...)
6233          \p{White_Space: Y*}     (Short: \p{Space=Y}, \p{Space}) (25:
6234                                    [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6235                                    U+2000..200A, U+2028..2029, U+202F,
6236                                    U+205F ...)
6237          \p{Word}                \p{XPosixWord} (128_238)
6238          \p{Word_Break: ALetter} (Short: \p{WB=LE}) (28_179: [A-Za-
6239                                    z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6240                                    \xff], U+0100..02D7, U+02DE..02E4,
6241                                    U+02EC..02FF, U+0370..0374, U+0376..0377
6242                                    ...)
6243          \p{Word_Break: CR}      (Short: \p{WB=CR}) (1: [\r])
6244          \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
6245          \p{Word_Break: DQ}      \p{Word_Break=Double_Quote} (1)
6246          \p{Word_Break: E_Base}  (Short: \p{WB=EB}) (98: U+261D, U+26F9,
6247                                    U+270A..270D, U+1F385, U+1F3C2..1F3C4,
6248                                    U+1F3C7 ...)
6249          \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (4: U+1F466..1F469)
6250          \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (5: U+1F3FB..1F3FF)
6251          \p{Word_Break: EB}      \p{Word_Break=E_Base} (98)
6252          \p{Word_Break: EBG}     \p{Word_Break=E_Base_GAZ} (4)
6253          \p{Word_Break: EM}      \p{Word_Break=E_Modifier} (5)
6254          \p{Word_Break: EX}      \p{Word_Break=ExtendNumLet} (11)
6255          \p{Word_Break: Extend}  (Short: \p{WB=Extend}) (2276:
6256                                    U+0300..036F, U+0483..0489,
6257                                    U+0591..05BD, U+05BF, U+05C1..05C2,
6258                                    U+05C4..05C5 ...)
6259          \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
6260                                    U+203F..2040, U+2054, U+FE33..FE34,
6261                                    U+FE4D..FE4F ...)
6262          \p{Word_Break: FO}      \p{Word_Break=Format} (52)
6263          \p{Word_Break: Format}  (Short: \p{WB=FO}) (52: [\xad],
6264                                    U+0600..0605, U+061C, U+06DD, U+070F,
6265                                    U+08E2 ...)
6266          \p{Word_Break: GAZ}     \p{Word_Break=Glue_After_Zwj} (22)
6267          \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (22: U+2640,
6268                                    U+2642, U+2695..2696, U+2708, U+2764,
6269                                    U+1F308 ...)
6270          \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (74:
6271                                    U+05D0..05EA, U+05F0..05F2, U+FB1D,
6272                                    U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
6273                                    ...)
6274          \p{Word_Break: HL}      \p{Word_Break=Hebrew_Letter} (74)
6275          \p{Word_Break: KA}      \p{Word_Break=Katakana} (310)
6276          \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310: U+3031..3035,
6277                                    U+309B..309C, U+30A0..30FA,
6278                                    U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
6279                                    ...)
6280          \p{Word_Break: LE}      \p{Word_Break=ALetter} (28_179)
6281          \p{Word_Break: LF}      (Short: \p{WB=LF}) (1: [\n])
6282          \p{Word_Break: MB}      \p{Word_Break=MidNumLet} (7)
6283          \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8: [:\xb7], U+0387,
6284                                    U+05F4, U+2027, U+FE13, U+FE55 ...)
6285          \p{Word_Break: MidNum}  (Short: \p{WB=MN}) (15: [,;], U+037E,
6286                                    U+0589, U+060C..060D, U+066C, U+07F8 ...)
6287          \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
6288                                    U+2018..2019, U+2024, U+FE52, U+FF07,
6289                                    U+FF0E)
6290          \p{Word_Break: ML}      \p{Word_Break=MidLetter} (8)
6291          \p{Word_Break: MN}      \p{Word_Break=MidNum} (15)
6292          \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
6293                                    U+2028..2029)
6294          \p{Word_Break: NL}      \p{Word_Break=Newline} (5)
6295          \p{Word_Break: NU}      \p{Word_Break=Numeric} (581)
6296          \p{Word_Break: Numeric} (Short: \p{WB=NU}) (581: [0-9],
6297                                    U+0660..0669, U+066B, U+06F0..06F9,
6298                                    U+07C0..07C9, U+0966..096F ...)
6299          \p{Word_Break: Other}   (Short: \p{WB=XX}) (1_082_434 plus all
6300                                    above-Unicode code points:
6301                                    [^\n\cK\f\r\"\',.0-9:;A-Z_a-
6302                                    z\x85\xaa\xad\xb5\xb7\xba\xc0-\xd6\xd8-
6303                                    \xf6\xf8-\xff], U+02D8..02DD,
6304                                    U+02E5..02EB, U+0375, U+0378..0379,
6305                                    U+0380..0385 ...)
6306          \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
6307                                    U+1F1E6..1F1FF)
6308          \p{Word_Break: RI}      \p{Word_Break=Regional_Indicator} (26)
6309          \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
6310          \p{Word_Break: SQ}      \p{Word_Break=Single_Quote} (1)
6311          \p{Word_Break: XX}      \p{Word_Break=Other} (1_082_434 plus all
6312                                    above-Unicode code points)
6313          \p{Word_Break: ZWJ}     (Short: \p{WB=ZWJ}) (1: U+200D)
6314          \p{WSpace}              \p{White_Space} (= \p{White_Space=Y}) (25)
6315          \p{WSpace: *}           \p{White_Space: *}
6316          \p{XDigit}              \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
6317          \p{XID_Continue}        \p{XID_Continue=Y} (Short: \p{XIDC})
6318                                    (128_089)
6319          \p{XID_Continue: N*}    (Short: \p{XIDC=N}, \P{XIDC}) (986_023
6320                                    plus all above-Unicode code points:
6321                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
6322                                    >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
6323                                    \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
6324                                    U+02C2..02C5, U+02D2..02DF,
6325                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
6326          \p{XID_Continue: Y*}    (Short: \p{XIDC=Y}, \p{XIDC}) (128_089:
6327                                    [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
6328                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6329                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
6330                                    U+02EE ...)
6331          \p{XID_Start}           \p{XID_Start=Y} (Short: \p{XIDS}) (125_311)
6332          \p{XID_Start: N*}       (Short: \p{XIDS=N}, \P{XIDS}) (988_801
6333                                    plus all above-Unicode code points:
6334                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6335                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
6336                                    \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
6337                                    U+02C2..02C5, U+02D2..02DF,
6338                                    U+02E5..02EB, U+02ED, U+02EF..036F ...)
6339          \p{XID_Start: Y*}       (Short: \p{XIDS=Y}, \p{XIDS}) (125_311:
6340                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
6341                                    \xf6\xf8-\xff], U+0100..02C1,
6342                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
6343                                    U+02EE ...)
6344          \p{XIDC}                \p{XID_Continue} (= \p{XID_Continue=Y})
6345                                    (128_089)
6346          \p{XIDC: *}             \p{XID_Continue: *}
6347          \p{XIDS}                \p{XID_Start} (= \p{XID_Start=Y}) (125_311)
6348          \p{XIDS: *}             \p{XID_Start: *}
6349          \p{Xpeo}                \p{Old_Persian} (= \p{Script_Extensions=
6350                                    Old_Persian}) (NOT \p{Block=
6351                                    Old_Persian}) (50)
6352          \p{XPerlSpace}          \p{XPosixSpace} (25)
6353          \p{XPosixAlnum}         Alphabetic and (decimal) Numeric (Short:
6354                                    \p{Alnum}) (127_219: [0-9A-Za-
6355                                    z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6356                                    \xff], U+0100..02C1, U+02C6..02D1,
6357                                    U+02E0..02E4, U+02EC, U+02EE ...)
6358          \p{XPosixAlpha}         \p{Alphabetic=Y} (Short: \p{Alpha})
6359                                    (126_629)
6360          \p{XPosixBlank}         \h, Horizontal white space (Short:
6361                                    \p{Blank}) (18: [\t\x20\xa0], U+1680,
6362                                    U+2000..200A, U+202F, U+205F, U+3000)
6363          \p{XPosixCntrl}         \p{General_Category=Control} Control
6364                                    characters (Short: \p{Cc}) (65)
6365          \p{XPosixDigit}         \p{General_Category=Decimal_Number} [0-9]
6366                                    + all other decimal digits (Short:
6367                                    \p{Nd}) (590)
6368          \p{XPosixGraph}         Characters that are graphical (Short:
6369                                    \p{Graph}) (274_139:
6370                                    [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6371                                    Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
6372                                    U+0100..0377, U+037A..037F,
6373                                    U+0384..038A, U+038C, U+038E..03A1 ...)
6374          \p{XPosixLower}         \p{Lowercase=Y} (Short: \p{Lower}; /i=
6375                                    Cased=Yes) (2252)
6376          \p{XPosixPrint}         Characters that are graphical plus space
6377                                    characters (but no controls) (Short:
6378                                    \p{Print}) (274_156: [\x20-\x7e\xa0-
6379                                    \xff], U+0100..0377, U+037A..037F,
6380                                    U+0384..038A, U+038C, U+038E..03A1 ...)
6381          \p{XPosixPunct}         \p{Punct} + ASCII-range \p{Symbol} (779:
6382                                    [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6383                                    >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
6384                                    \xb7\xbb\xbf], U+037E, U+0387,
6385                                    U+055A..055F, U+0589..058A, U+05BE ...)
6386          \p{XPosixSpace}         \s including beyond ASCII and vertical tab
6387                                    (Short: \p{SpacePerl}) (25:
6388                                    [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6389                                    U+2000..200A, U+2028..2029, U+202F,
6390                                    U+205F ...)
6391          \p{XPosixUpper}         \p{Uppercase=Y} (Short: \p{Upper}; /i=
6392                                    Cased=Yes) (1822)
6393          \p{XPosixWord}          \w, including beyond ASCII; = \p{Alnum} +
6394                                    \pM + \p{Pc} + \p{Join_Control} (Short:
6395                                    \p{Word}) (128_238: [0-9A-Z_a-
6396                                    z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6397                                    \xff], U+0100..02C1, U+02C6..02D1,
6398                                    U+02E0..02E4, U+02EC, U+02EE ...)
6399          \p{XPosixXDigit}        \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
6400          \p{Xsux}                \p{Cuneiform} (= \p{Script_Extensions=
6401                                    Cuneiform}) (NOT \p{Block=Cuneiform})
6402                                    (1234)
6403          \p{Yi}                  \p{Script_Extensions=Yi} (1246)
6404        X \p{Yi_Radicals}         \p{Block=Yi_Radicals} (64)
6405        X \p{Yi_Syllables}        \p{Block=Yi_Syllables} (1168)
6406          \p{Yiii}                \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
6407        X \p{Yijing}              \p{Yijing_Hexagram_Symbols} (= \p{Block=
6408                                    Yijing_Hexagram_Symbols}) (64)
6409        X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
6410                                    (Short: \p{InYijing}) (64)
6411          \p{Z} \pZ               \p{Separator} (= \p{General_Category=
6412                                    Separator}) (19)
6413          \p{Zanabazar_Square}    \p{Script_Extensions=Zanabazar_Square}
6414                                    (Short: \p{Zanb}; NOT \p{Block=
6415                                    Zanabazar_Square}) (72)
6416          \p{Zanb}                \p{Zanabazar_Square} (=
6417                                    \p{Script_Extensions=Zanabazar_Square})
6418                                    (NOT \p{Block=Zanabazar_Square}) (72)
6419          \p{Zinh}                \p{Inherited} (= \p{Script_Extensions=
6420                                    Inherited}) (500)
6421          \p{Zl}                  \p{Line_Separator} (= \p{General_Category=
6422                                    Line_Separator}) (1)
6423          \p{Zp}                  \p{Paragraph_Separator} (=
6424                                    \p{General_Category=
6425                                    Paragraph_Separator}) (1)
6426          \p{Zs}                  \p{Space_Separator} (=
6427                                    \p{General_Category=Space_Separator})
6428                                    (17)
6429          \p{Zyyy}                \p{Common} (= \p{Script_Extensions=
6430                                    Common}) (6947)
6431          \p{Zzzz}                \p{Unknown} (= \p{Script_Extensions=
6432                                    Unknown}) (977_357 plus all above-
6433                                    Unicode code points)
6434        TX\p{_CanonDCIJ}          (For internal use by Perl, not necessarily
6435                                    stable) (= \p{Soft_Dotted=Y}) (46)
6436        TX\p{_Case_Ignorable}     (For internal use by Perl, not necessarily
6437                                    stable) (= \p{Case_Ignorable=Y}) (2314)
6438        TX\p{_CombAbove}          (For internal use by Perl, not necessarily
6439                                    stable) (= \p{Canonical_Combining_Class=
6440                                    Above}) (461)
6441
6442   Legal "\p{}" and "\P{}" constructs that match no characters
6443       Unicode has some property-value pairs that currently don't match
6444       anything.  This happens generally either because they are obsolete, or
6445       they exist for symmetry with other forms, but no language has yet been
6446       encoded that uses them.  In this version of Unicode, the following
6447       match zero code points:
6448
6449       \p{Canonical_Combining_Class=Attached_Below_Left}
6450       \p{Canonical_Combining_Class=CCC133}
6451

Properties accessible through Unicode::UCD

6453       The value of any Unicode (not including Perl extensions) character
6454       property mentioned above for any single code point is available through
6455       "charprop()" in Unicode::UCD.  "charprops_all()" in Unicode::UCD
6456       returns the values of all the Unicode properties for a given code
6457       point.
6458
6459       Besides these, all the Unicode character properties mentioned above
6460       (except for those marked as for internal use by Perl) are also
6461       accessible by "prop_invlist()" in Unicode::UCD.
6462
6463       Due to their nature, not all Unicode character properties are suitable
6464       for regular expression matches, nor "prop_invlist()".  The remaining
6465       non-provisional, non-internal ones are accessible via "prop_invmap()"
6466       in Unicode::UCD (except for those that this Perl installation hasn't
6467       included; see below for which those are).
6468
6469       For compatibility with other parts of Perl, all the single forms given
6470       in the table in the section above are recognized.  BUT, there are some
6471       ambiguities between some Perl extensions and the Unicode properties,
6472       all of which are silently resolved in favor of the official Unicode
6473       property.  To avoid surprises, you should only use "prop_invmap()" for
6474       forms listed in the table below, which omits the non-recommended ones.
6475       The affected forms are the Perl single form equivalents of Unicode
6476       properties, such as "\p{sc}" being a single-form equivalent of
6477       "\p{gc=sc}", which is treated by "prop_invmap()" as the "Script"
6478       property, whose short name is "sc".  The table indicates the current
6479       ambiguities in the INFO column, beginning with the word "NOT".
6480
6481       The standard Unicode properties listed below are documented in
6482       <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
6483       documented in "prop_invmap()" in Unicode::UCD.  The other Perl
6484       extensions are in "Other Properties" in perlunicode;
6485
6486       The first column in the table is a name for the property; the second
6487       column is an alternative name, if any, plus possibly some annotations.
6488       The alternative name is the property's full name, unless that would
6489       simply repeat the first column, in which case the second column
6490       indicates the property's short name (if different).  The annotations
6491       are given only in the entry for the full name.  The annotations for
6492       binary properties include a list of the first few ranges that the
6493       property matches.  To avoid any ambiguity, the SPACE character is
6494       represented as "\x20".
6495
6496       If a property is obsolete, etc, the entry will be flagged with the same
6497       characters used in the table in the section above, like D or S.
6498
6499          NAME                      INFO
6500
6501          Age
6502          AHex                    ASCII_Hex_Digit
6503          All                     (Perl extension).  All code points,
6504                                  including those above Unicode.  Same as
6505                                  qr/./s.  U+0000..infinity
6506          Alnum                   XPosixAlnum.  (Perl extension)
6507          Alpha                   Alphabetic
6508          Alphabetic              (Short: Alpha).  [A-Za-z\xaa\xb5\xba\xc0-
6509                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6510                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6511                                  ...
6512          Any                     (Perl extension).  All Unicode code
6513                                  points.  U+0000..10FFFF
6514          ASCII                   Block=Basic_Latin.  (Perl extension).
6515                                  [\x00-\x7f]
6516          ASCII_Hex_Digit         (Short: AHex).  [0-9A-Fa-f]
6517          Assigned                (Perl extension).  All assigned code
6518                                  points.  U+0000..0377, U+037A..037F,
6519                                  U+0384..038A, U+038C, U+038E..03A1,
6520                                  U+03A3..052F ...
6521          Bc                      Bidi_Class
6522          Bidi_C                  Bidi_Control
6523          Bidi_Class              (Short: bc)
6524          Bidi_Control            (Short: Bidi_C).  U+061C, U+200E..200F,
6525                                  U+202A..202E, U+2066..2069
6526          Bidi_M                  Bidi_Mirrored
6527          Bidi_Mirrored           (Short: Bidi_M).
6528                                  [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
6529                                  U+169B..169C, U+2039..203A, U+2045..2046,
6530                                  U+207D..207E ...
6531          Bidi_Mirroring_Glyph    (Short: bmg)
6532          Bidi_Paired_Bracket     (Short: bpb)
6533          Bidi_Paired_Bracket_Type (Short: bpt)
6534          Blank                   XPosixBlank.  (Perl extension)
6535          Blk                     Block
6536          Block                   (Short: blk)
6537          Bmg                     Bidi_Mirroring_Glyph
6538          Bpb                     Bidi_Paired_Bracket
6539          Bpt                     Bidi_Paired_Bracket_Type
6540          Canonical_Combining_Class (Short: ccc)
6541          Case_Folding            (Short: cf)
6542          Case_Ignorable          (Short: CI).
6543                                  [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
6544                                  U+02B0..036F, U+0374..0375, U+037A,
6545                                  U+0384..0385, U+0387 ...
6546          Cased                   [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6547                                  \xff], U+0100..01BA, U+01BC..01BF,
6548                                  U+01C4..0293, U+0295..02B8, U+02C0..02C1
6549                                  ...
6550          Category                General_Category
6551          Ccc                     Canonical_Combining_Class
6552          CE                      Composition_Exclusion
6553          Cf                      Case_Folding; NOT 'cf' meaning
6554                                  'General_Category=Format'
6555          Changes_When_Casefolded (Short: CWCF).  [A-Z\xb5\xc0-\xd6\xd8-
6556                                  \xdf], U+0100, U+0102, U+0104, U+0106,
6557                                  U+0108 ...
6558          Changes_When_Casemapped (Short: CWCM).  [A-Za-z\xb5\xc0-\xd6\xd8-
6559                                  \xf6\xf8-\xff], U+0100..0137,
6560                                  U+0139..018C, U+018E..019A, U+019C..01A9,
6561                                  U+01AC..01B9 ...
6562          Changes_When_Lowercased (Short: CWL).  [A-Z\xc0-\xd6\xd8-\xde],
6563                                  U+0100, U+0102, U+0104, U+0106, U+0108 ...
6564          Changes_When_NFKC_Casefolded (Short: CWKCF).  [A-
6565                                  Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
6566                                  \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf], U+0100,
6567                                  U+0102, U+0104, U+0106, U+0108 ...
6568          Changes_When_Titlecased (Short: CWT).  [a-z\xb5\xdf-\xf6\xf8-
6569                                  \xff], U+0101, U+0103, U+0105, U+0107,
6570                                  U+0109 ...
6571          Changes_When_Uppercased (Short: CWU).  [a-z\xb5\xdf-\xf6\xf8-
6572                                  \xff], U+0101, U+0103, U+0105, U+0107,
6573                                  U+0109 ...
6574          CI                      Case_Ignorable
6575          Cntrl                   XPosixCntrl (=General_Category=Control).
6576                                  (Perl extension)
6577          Comp_Ex                 Full_Composition_Exclusion
6578          Composition_Exclusion   (Short: CE).  U+0958..095F, U+09DC..09DD,
6579                                  U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
6580          CWCF                    Changes_When_Casefolded
6581          CWCM                    Changes_When_Casemapped
6582          CWKCF                   Changes_When_NFKC_Casefolded
6583          CWL                     Changes_When_Lowercased
6584          CWT                     Changes_When_Titlecased
6585          CWU                     Changes_When_Uppercased
6586          Dash                    [\-], U+058A, U+05BE, U+1400, U+1806,
6587                                  U+2010..2015 ...
6588          Decomposition_Mapping   (Short: dm)
6589          Decomposition_Type      (Short: dt)
6590          Default_Ignorable_Code_Point (Short: DI).  [\xad], U+034F, U+061C,
6591                                  U+115F..1160, U+17B4..17B5, U+180B..180E
6592                                  ...
6593          Dep                     Deprecated
6594          Deprecated              (Short: Dep).  U+0149, U+0673, U+0F77,
6595                                  U+0F79, U+17A3..17A4, U+206A..206F ...
6596          DI                      Default_Ignorable_Code_Point
6597          Dia                     Diacritic
6598          Diacritic               (Short: Dia).  [\^`\xa8\xaf\xb4\xb7-\xb8],
6599                                  U+02B0..034E, U+0350..0357, U+035D..0362,
6600                                  U+0374..0375, U+037A ...
6601          Digit                   XPosixDigit (=General_Category=
6602                                  Decimal_Number).  (Perl extension)
6603          Dm                      Decomposition_Mapping
6604          Dt                      Decomposition_Type
6605          Ea                      East_Asian_Width
6606          East_Asian_Width        (Short: ea)
6607          Ext                     Extender
6608          Extender                (Short: Ext).  [\xb7], U+02D0..02D1,
6609                                  U+0640, U+07FA, U+0E46, U+0EC6 ...
6610          Full_Composition_Exclusion (Short: Comp_Ex).  U+0340..0341,
6611                                  U+0343..0344, U+0374, U+037E, U+0387,
6612                                  U+0958..095F ...
6613          Gc                      General_Category
6614          GCB                     Grapheme_Cluster_Break
6615          General_Category        (Short: gc)
6616          Gr_Base                 Grapheme_Base
6617          Gr_Ext                  Grapheme_Extend
6618          Graph                   XPosixGraph.  (Perl extension)
6619          Grapheme_Base           (Short: Gr_Base).  [\x20-\x7e\xa0-
6620                                  \xac\xae-\xff], U+0100..02FF,
6621                                  U+0370..0377, U+037A..037F, U+0384..038A,
6622                                  U+038C ...
6623          Grapheme_Cluster_Break  (Short: GCB)
6624          Grapheme_Extend         (Short: Gr_Ext).  U+0300..036F,
6625                                  U+0483..0489, U+0591..05BD, U+05BF,
6626                                  U+05C1..05C2, U+05C4..05C5 ...
6627          Hangul_Syllable_Type    (Short: hst)
6628          Hex                     Hex_Digit
6629          Hex_Digit               (Short: Hex).  [0-9A-Fa-f], U+FF10..FF19,
6630                                  U+FF21..FF26, U+FF41..FF46
6631          HorizSpace              XPosixBlank.  (Perl extension)
6632          Hst                     Hangul_Syllable_Type
6633        D Hyphen                  [\-\xad], U+058A, U+1806, U+2010..2011,
6634                                  U+2E17, U+30FB ...  Supplanted by
6635                                  Line_Break property values; see
6636                                  www.unicode.org/reports/tr14
6637          ID_Continue             (Short: IDC).  [0-9A-Z_a-
6638                                  z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
6639                                  \xff], U+0100..02C1, U+02C6..02D1,
6640                                  U+02E0..02E4, U+02EC, U+02EE ...
6641          ID_Start                (Short: IDS).  [A-Za-z\xaa\xb5\xba\xc0-
6642                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6643                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6644                                  ...
6645          IDC                     ID_Continue
6646          Ideo                    Ideographic
6647          Ideographic             (Short: Ideo).  U+3006..3007,
6648                                  U+3021..3029, U+3038..303A, U+3400..4DB5,
6649                                  U+4E00..9FEA, U+F900..FA6D ...
6650          IDS                     ID_Start
6651          IDS_Binary_Operator     (Short: IDSB).  U+2FF0..2FF1, U+2FF4..2FFB
6652          IDS_Trinary_Operator    (Short: IDST).  U+2FF2..2FF3
6653          IDSB                    IDS_Binary_Operator
6654          IDST                    IDS_Trinary_Operator
6655          In                      Present_In.  (Perl extension)
6656          Indic_Positional_Category (Short: InPC)
6657          Indic_Syllabic_Category (Short: InSC)
6658          InPC                    Indic_Positional_Category
6659          InSC                    Indic_Syllabic_Category
6660          Isc                     ISO_Comment; NOT 'isc' meaning
6661                                  'General_Category=Other'
6662          ISO_Comment             (Short: isc)
6663          Jg                      Joining_Group
6664          Join_C                  Join_Control
6665          Join_Control            (Short: Join_C).  U+200C..200D
6666          Joining_Group           (Short: jg)
6667          Joining_Type            (Short: jt)
6668          Jt                      Joining_Type
6669          Lb                      Line_Break
6670          Lc                      Lowercase_Mapping; NOT 'lc' meaning
6671                                  'General_Category=Cased_Letter'
6672          Line_Break              (Short: lb)
6673          LOE                     Logical_Order_Exception
6674          Logical_Order_Exception (Short: LOE).  U+0E40..0E44, U+0EC0..0EC4,
6675                                  U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
6676                                  ...
6677          Lower                   Lowercase
6678          Lowercase               (Short: Lower).  [a-z\xaa\xb5\xba\xdf-
6679                                  \xf6\xf8-\xff], U+0101, U+0103, U+0105,
6680                                  U+0107, U+0109 ...
6681          Lowercase_Mapping       (Short: lc)
6682          Math                    [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
6683                                  U+03D5, U+03F0..03F1, U+03F4..03F6,
6684                                  U+0606..0608 ...
6685          Na                      Name
6686          Na1                     Unicode_1_Name
6687          Name                    (Short: na)
6688          Name_Alias
6689          NChar                   Noncharacter_Code_Point
6690          NFC_QC                  NFC_Quick_Check
6691          NFC_Quick_Check         (Short: NFC_QC)
6692          NFD_QC                  NFD_Quick_Check
6693          NFD_Quick_Check         (Short: NFD_QC)
6694          NFKC_Casefold           (Short: NFKC_CF)
6695          NFKC_CF                 NFKC_Casefold
6696          NFKC_QC                 NFKC_Quick_Check
6697          NFKC_Quick_Check        (Short: NFKC_QC)
6698          NFKD_QC                 NFKD_Quick_Check
6699          NFKD_Quick_Check        (Short: NFKD_QC)
6700          Noncharacter_Code_Point (Short: NChar).  U+FDD0..FDEF,
6701                                  U+FFFE..FFFF, U+1FFFE..1FFFF,
6702                                  U+2FFFE..2FFFF, U+3FFFE..3FFFF,
6703                                  U+4FFFE..4FFFF ...
6704          Nt                      Numeric_Type
6705          Numeric_Type            (Short: nt)
6706          Numeric_Value           (Short: nv)
6707          Nv                      Numeric_Value
6708          Pat_Syn                 Pattern_Syntax
6709          Pat_WS                  Pattern_White_Space
6710          Pattern_Syntax          (Short: Pat_Syn).
6711                                  [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6712                                  >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
6713                                  \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
6714                                  U+2010..2027, U+2030..203E, U+2041..2053,
6715                                  U+2055..205E, U+2190..245F ...
6716          Pattern_White_Space     (Short: Pat_WS).  [\t\n\cK\f\r\x20\x85],
6717                                  U+200E..200F, U+2028..2029
6718          PCM                     Prepended_Concatenation_Mark
6719          Perl_Decimal_Digit      (Perl extension)
6720          PerlSpace               PosixSpace.  (Perl extension)
6721          PerlWord                PosixWord.  (Perl extension)
6722          PosixAlnum              (Perl extension).  [0-9A-Za-z]
6723          PosixAlpha              (Perl extension).  [A-Za-z]
6724          PosixBlank              (Perl extension).  [\t\x20]
6725          PosixCntrl              (Perl extension).  ASCII control
6726                                  characters.  ACK, BEL, BS, CAN, CR, DC1,
6727                                  DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
6728                                  ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
6729                                  NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
6730          PosixDigit              (Perl extension).  [0-9]
6731          PosixGraph              (Perl extension).
6732                                  [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6733                                  Z\[\\\]\^_`a-z\{\|\}~]
6734          PosixLower              (Perl extension).  [a-z]
6735          PosixPrint              (Perl extension).  [\x20-\x7e]
6736          PosixPunct              (Perl extension).
6737                                  [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6738                                  >?\@\[\\\]\^_`\{\|\}~]
6739          PosixSpace              (Perl extension).  [\t\n\cK\f\r\x20]
6740          PosixUpper              (Perl extension).  [A-Z]
6741          PosixWord               (Perl extension).  \w, restricted to
6742                                  ASCII.  [0-9A-Z_a-z]
6743          PosixXDigit             ASCII_Hex_Digit.  (Perl extension).
6744                                  [0-9A-Fa-f]
6745          Prepended_Concatenation_Mark (Short: PCM).  U+0600..0605, U+06DD,
6746                                  U+070F, U+08E2, U+110BD
6747          Present_In              (Short: In).  (Perl extension)
6748          Print                   XPosixPrint.  (Perl extension)
6749          Punct                   General_Category=Punctuation.  (Perl
6750                                  extension).
6751                                  [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_-
6752                                  \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
6753                                  U+037E, U+0387, U+055A..055F,
6754                                  U+0589..058A, U+05BE ...
6755          QMark                   Quotation_Mark
6756          Quotation_Mark          (Short: QMark).  [\"\'\xab\xbb],
6757                                  U+2018..201F, U+2039..203A, U+2E42,
6758                                  U+300C..300F, U+301D..301F ...
6759          Radical                 U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
6760          Regional_Indicator      (Short: RI).  U+1F1E6..1F1FF
6761          RI                      Regional_Indicator
6762          SB                      Sentence_Break
6763          Sc                      Script; NOT 'sc' meaning
6764                                  'General_Category=Currency_Symbol'
6765          Scf                     Simple_Case_Folding
6766          Script                  (Short: sc)
6767          Script_Extensions       (Short: scx)
6768          Scx                     Script_Extensions
6769          SD                      Soft_Dotted
6770          Sentence_Break          (Short: SB)
6771          Sentence_Terminal       (Short: STerm).  [!.?], U+0589, U+061F,
6772                                  U+06D4, U+0700..0702, U+07F9 ...
6773          Sfc                     Simple_Case_Folding
6774          Simple_Case_Folding     (Short: scf)
6775          Simple_Lowercase_Mapping (Short: slc)
6776          Simple_Titlecase_Mapping (Short: stc)
6777          Simple_Uppercase_Mapping (Short: suc)
6778          Slc                     Simple_Lowercase_Mapping
6779          Soft_Dotted             (Short: SD).  [i-j], U+012F, U+0249,
6780                                  U+0268, U+029D, U+02B2 ...
6781          Space                   White_Space
6782          SpacePerl               XPosixSpace.  (Perl extension)
6783          Stc                     Simple_Titlecase_Mapping
6784          STerm                   Sentence_Terminal
6785          Suc                     Simple_Uppercase_Mapping
6786          Tc                      Titlecase_Mapping
6787          Term                    Terminal_Punctuation
6788          Terminal_Punctuation    (Short: Term).  [!,.:;?], U+037E, U+0387,
6789                                  U+0589, U+05C3, U+060C ...
6790          Title                   Titlecase.  (Perl extension)
6791          Titlecase               (Short: Title).  (Perl extension).  (=
6792                                  \p{Gc=Lt}).  U+01C5, U+01C8, U+01CB,
6793                                  U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
6794          Titlecase_Mapping       (Short: tc)
6795          Uc                      Uppercase_Mapping
6796          UIdeo                   Unified_Ideograph
6797          Unicode                 Any.  (Perl extension)
6798          Unicode_1_Name          (Short: na1)
6799          Unified_Ideograph       (Short: UIdeo).  U+3400..4DB5,
6800                                  U+4E00..9FEA, U+FA0E..FA0F, U+FA11,
6801                                  U+FA13..FA14, U+FA1F ...
6802          Upper                   Uppercase
6803          Uppercase               (Short: Upper).  [A-Z\xc0-\xd6\xd8-\xde],
6804                                  U+0100, U+0102, U+0104, U+0106, U+0108 ...
6805          Uppercase_Mapping       (Short: uc)
6806          Variation_Selector      (Short: VS).  U+180B..180D, U+FE00..FE0F,
6807                                  U+E0100..E01EF
6808          Vertical_Orientation    (Short: vo)
6809          VertSpace               (Perl extension).  \v.  [\n\cK\f\r\x85],
6810                                  U+2028..2029
6811          Vo                      Vertical_Orientation
6812          VS                      Variation_Selector
6813          WB                      Word_Break
6814          White_Space             (Short: WSpace).
6815                                  [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6816                                  U+2000..200A, U+2028..2029, U+202F, U+205F
6817                                  ...
6818          Word                    XPosixWord.  (Perl extension)
6819          Word_Break              (Short: WB)
6820          WSpace                  White_Space
6821          XDigit                  XPosixXDigit (=Hex_Digit).  (Perl
6822                                  extension)
6823          XID_Continue            (Short: XIDC).  [0-9A-Z_a-
6824                                  z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
6825                                  \xff], U+0100..02C1, U+02C6..02D1,
6826                                  U+02E0..02E4, U+02EC, U+02EE ...
6827          XID_Start               (Short: XIDS).  [A-Za-z\xaa\xb5\xba\xc0-
6828                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6829                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6830                                  ...
6831          XIDC                    XID_Continue
6832          XIDS                    XID_Start
6833          XPerlSpace              XPosixSpace.  (Perl extension)
6834          XPosixAlnum             (Short: Alnum).  (Perl extension).
6835                                  Alphabetic and (decimal) Numeric.  [0-9A-
6836                                  Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6837                                  \xff], U+0100..02C1, U+02C6..02D1,
6838                                  U+02E0..02E4, U+02EC, U+02EE ...
6839          XPosixAlpha             Alphabetic.  (Perl extension).  [A-Za-
6840                                  z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6841                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
6842                                  U+02EC, U+02EE ...
6843          XPosixBlank             (Short: Blank).  (Perl extension).  \h,
6844                                  Horizontal white space.  [\t\x20\xa0],
6845                                  U+1680, U+2000..200A, U+202F, U+205F,
6846                                  U+3000
6847          XPosixCntrl             General_Category=Control  (Short: Cntrl).
6848                                  (Perl extension).  Control characters.
6849                                  [\x00-\x1f\x7f-\x9f]
6850          XPosixDigit             General_Category=Decimal_Number  (Short:
6851                                  Digit).  (Perl extension).  [0-9] + all
6852                                  other decimal digits.  [0-9],
6853                                  U+0660..0669, U+06F0..06F9, U+07C0..07C9,
6854                                  U+0966..096F, U+09E6..09EF ...
6855          XPosixGraph             (Short: Graph).  (Perl extension).
6856                                  Characters that are graphical.
6857                                  [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6858                                  Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
6859                                  U+0100..0377, U+037A..037F, U+0384..038A,
6860                                  U+038C, U+038E..03A1 ...
6861          XPosixLower             Lowercase.  (Perl extension).  [a-
6862                                  z\xaa\xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
6863                                  U+0103, U+0105, U+0107, U+0109 ...
6864          XPosixPrint             (Short: Print).  (Perl extension).
6865                                  Characters that are graphical plus space
6866                                  characters (but no controls).  [\x20-
6867                                  \x7e\xa0-\xff], U+0100..0377,
6868                                  U+037A..037F, U+0384..038A, U+038C,
6869                                  U+038E..03A1 ...
6870          XPosixPunct             (Perl extension).  \p{Punct} + ASCII-range
6871                                  \p{Symbol}.  [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6872                                  >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
6873                                  \xb7\xbb\xbf], U+037E, U+0387,
6874                                  U+055A..055F, U+0589..058A, U+05BE ...
6875          XPosixSpace             (Perl extension).  \s including beyond
6876                                  ASCII and vertical tab.
6877                                  [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6878                                  U+2000..200A, U+2028..2029, U+202F, U+205F
6879                                  ...
6880          XPosixUpper             Uppercase.  (Perl extension).  [A-Z\xc0-
6881                                  \xd6\xd8-\xde], U+0100, U+0102, U+0104,
6882                                  U+0106, U+0108 ...
6883          XPosixWord              (Short: Word).  (Perl extension).  \w,
6884                                  including beyond ASCII; = \p{Alnum} + \pM
6885                                  + \p{Pc} + \p{Join_Control}.  [0-9A-Z_a-
6886                                  z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6887                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
6888                                  U+02EC, U+02EE ...
6889          XPosixXDigit            Hex_Digit  (Short: XDigit).  (Perl
6890                                  extension).  [0-9A-Fa-f], U+FF10..FF19,
6891                                  U+FF21..FF26, U+FF41..FF46
6892

Properties accessible through other means

6894       Certain properties are accessible also via core function calls.  These
6895       are:
6896
6897        Lowercase_Mapping          lc() and lcfirst()
6898        Titlecase_Mapping          ucfirst()
6899        Uppercase_Mapping          uc()
6900
6901       Also, Case_Folding is accessible through the "/i" modifier in regular
6902       expressions, the "\F" transliteration escape, and the "fc" operator.
6903
6904       And, the Name and Name_Aliases properties are accessible through the
6905       "\N{}" interpolation in double-quoted strings and regular expressions;
6906       and functions "charnames::viacode()", "charnames::vianame()", and
6907       "charnames::string_vianame()" (which require a "use charnames ();" to
6908       be specified.
6909
6910       Finally, most properties related to decomposition are accessible via
6911       Unicode::Normalize.
6912

Unicode character properties that are NOT accepted by Perl

6914       Perl will generate an error for a few character properties in Unicode
6915       when used in a regular expression.  The non-Unihan ones are listed
6916       below, with the reasons they are not accepted, perhaps with work-
6917       arounds.  The short names for the properties are listed enclosed in
6918       (parentheses).  As described after the list, an installation can change
6919       the defaults and choose to accept any of these.  The list is machine
6920       generated based on the choices made for the installation that generated
6921       this document.
6922
6923       Expands_On_NFC (XO_NFC)
6924       Expands_On_NFD (XO_NFD)
6925       Expands_On_NFKC (XO_NFKC)
6926       Expands_On_NFKD (XO_NFKD)
6927           Deprecated by Unicode.  These are characters that expand to more
6928           than one character in the specified normalization form, but whether
6929           they actually take up more bytes or not depends on the encoding
6930           being used.  For example, a UTF-8 encoded character may expand to a
6931           different number of bytes than a UTF-32 encoded character.
6932
6933       Grapheme_Link (Gr_Link)
6934           Deprecated by Unicode:  Duplicates ccc=vr
6935           (Canonical_Combining_Class=Virama)
6936
6937       Jamo_Short_Name (JSN)
6938       Other_Alphabetic (OAlpha)
6939       Other_Default_Ignorable_Code_Point (ODI)
6940       Other_Grapheme_Extend (OGr_Ext)
6941       Other_ID_Continue (OIDC)
6942       Other_ID_Start (OIDS)
6943       Other_Lowercase (OLower)
6944       Other_Math (OMath)
6945       Other_Uppercase (OUpper)
6946           Used by Unicode internally for generating other properties and not
6947           intended to be used stand-alone
6948
6949       Script=Katakana_Or_Hiragana (sc=Hrkt)
6950           Obsolete.  All code points previously matched by this have been
6951           moved to "Script=Common".  Consider instead using
6952           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
6953           both)
6954
6955       Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
6956           All code points that would be matched by this are matched by either
6957           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
6958
6959       An installation can choose to allow any of these to be matched by
6960       downloading the Unicode database from <http://www.unicode.org/Public/>
6961       to $Config{privlib}/unicore/ in the Perl source tree, changing the
6962       controlling lists contained in the program
6963       $Config{privlib}/unicore/mktables and then re-compiling and installing.
6964       (%Config is available from the Config module).
6965
6966       Also, perl can be recompiled to operate on an earlier version of the
6967       Unicode standard.  Further information is at
6968       $Config{privlib}/unicore/README.perl.
6969

Other information in the Unicode data base

6971       The Unicode data base is delivered in two different formats.  The XML
6972       version is valid for more modern Unicode releases.  The other version
6973       is a collection of files.  The two are intended to give equivalent
6974       information.  Perl uses the older form; this allows you to recompile
6975       Perl to use early Unicode releases.
6976
6977       The only non-character property that Perl currently supports is Named
6978       Sequences, in which a sequence of code points is given a name and
6979       generally treated as a single entity.  (Perl supports these via the
6980       "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
6981       in charnames, and "namedseq()" in Unicode::UCD.
6982
6983       Below is a list of the files in the Unicode data base that Perl doesn't
6984       currently use, along with very brief descriptions of their purposes.
6985       Some of the names of the files have been shortened from those that
6986       Unicode uses, in order to allow them to be distinguishable from
6987       similarly named files on file systems for which only the first 8
6988       characters of a name are significant.
6989
6990       auxiliary/GraphemeBreakTest.html
6991       auxiliary/LineBreakTest.html
6992       auxiliary/SentenceBreakTest.html
6993       auxiliary/WordBreakTest.html
6994           Documentation of validation Tests
6995
6996       BidiCharacterTest.txt
6997       BidiTest.txt
6998       NormTest.txt
6999           Validation Tests
7000
7001       CJKRadicals.txt
7002           Maps the kRSUnicode property values to corresponding code points
7003
7004       EmojiSources.txt
7005           Maps certain Unicode code points to their legacy Japanese cell-
7006           phone values
7007
7008       extracted/DName.txt
7009           This file adds no new information not already present in other
7010           files
7011
7012       Index.txt
7013           Alphabetical index of Unicode characters
7014
7015       NamedSqProv.txt
7016           Named sequences proposed for inclusion in a later version of the
7017           Unicode Standard; if you need them now, you can append this file to
7018           NamedSequences.txt and recompile perl
7019
7020       NamesList.html
7021           Describes the format and contents of NamesList.txt
7022
7023       NamesList.txt
7024           Annotated list of characters
7025
7026       NormalizationCorrections.txt
7027           Documentation of corrections already incorporated into the Unicode
7028           data base
7029
7030       NushuSources.txt
7031           Specifies source material for Nushu characters
7032
7033       ReadMe.txt
7034           Documentation
7035
7036       StandardizedVariants.html
7037           Obsoleted as of Unicode 9.0, but previously provided a visual
7038           display of the standard variant sequences derived from
7039           StandardizedVariants.txt.
7040
7041       StandardizedVariants.txt
7042           Certain glyph variations for character display are standardized.
7043           This lists the non-Unihan ones; the Unihan ones are also not used
7044           by Perl, and are in a separate Unicode data base
7045           <http://www.unicode.org/ivd>
7046
7047       TangutSources.txt
7048           Specifies source mappings for Tangut ideographs and components.
7049           This data file also includes informative radical-stroke values that
7050           are used internally by Unicode
7051
7052       USourceData.txt
7053           Documentation of status and cross reference of proposals for
7054           encoding by Unicode of Unihan characters
7055
7056       USourceGlyphs.pdf
7057           Pictures of the characters in USourceData.txt
7058

SEE ALSO

7060       <http://www.unicode.org/reports/tr44/>
7061
7062       perlrecharclass
7063
7064       perlunicode
7065
7066
7067
7068perl v5.28.2                      2019-04-23                   PERLUNIPROPS(1)
Impressum