PERLUNIPROPS(1)        Perl Programmers Reference Guide        PERLUNIPROPS(1)

NAME

       perluniprops - Index of Unicode Version 15.0.0 character properties in
       Perl

DESCRIPTION

       This document provides information about the portion of the Unicode
       database that deals with character properties, that is the portion that
       is defined on single code points.  ("Other information in the Unicode
       data base" below briefly mentions other data that Unicode provides.)

       Perl can provide access to all non-provisional Unicode character
       properties, though not all are enabled by default.  The omitted ones
       are the Unihan properties and certain deprecated or Unicode-internal
       properties.  (An installation may choose to recompile Perl's tables to
       change this.  See "Unicode character properties that are NOT accepted
       by Perl".)

       For most purposes, access to Unicode properties from the Perl core is
       through regular expression matches, as described in the next section.
       For some special purposes, and to access the properties that are not
       suitable for regular expression matching, all the Unicode character
       properties that Perl handles are accessible via the standard
       Unicode::UCD module, as described in the section "Properties accessible
       through Unicode::UCD".
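
       As a rough illustration of the Unicode::UCD route, the sketch below
       looks up a few fields for two code points.  It assumes the long-standing
       charinfo() and charblock() functions of Unicode::UCD; the helper name
       describe_code_point is hypothetical and exists only for this example.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo charblock);

# Hypothetical helper: collect a few per-code-point fields.
sub describe_code_point {
    my ($cp) = @_;
    my $info = charinfo($cp);       # hash ref of Unicode database fields
    return unless $info;            # nothing to report for this code point
    return {
        name     => $info->{name},
        category => $info->{category},   # General_Category, short form
        block    => charblock($cp),
    };
}

my $latin_a = describe_code_point(0x41);
my $deseret = describe_code_point(0x10400);
printf "U+0041:  %s (gc=%s, block=%s)\n", @{$latin_a}{qw(name category block)};
printf "U+10400: %s (gc=%s, block=%s)\n", @{$deseret}{qw(name category block)};
```

       The module offers many more functions than the two used here; its own
       documentation is the place to check which one fits a given property.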

       Perl also provides some additional extensions and short-cut synonyms
       for Unicode properties.

       This document merely lists all available properties and does not
       attempt to explain what each property really means.  There is a brief
       description of each Perl extension; see "Other Properties" in
       perlunicode for more information on these.  There is some detail about
       Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
       to find out about the intricacies of the official Unicode properties,
       refer to the Unicode standard.  A good starting place is
       <http://www.unicode.org/reports/tr44/>.

       Note that you can define your own properties; see "User-Defined
       Character Properties" in perlunicode.
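
       A minimal sketch of a user-defined property follows.  The property name
       InLowerHexLetter and the helper is_lower_hex_letter are hypothetical;
       the return format (one hexadecimal range per line, start and end
       separated by a tab) is the one described in perlunicode.

```perl
use strict;
use warnings;

# Hypothetical user-defined property: the ASCII letters a through f.
# Each line of the returned string is "start<TAB>end" in hexadecimal.
sub InLowerHexLetter {
    return "0061\t0066\n";
}

# Hypothetical helper: true if the single character is in the property.
sub is_lower_hex_letter {
    my ($char) = @_;
    return $char =~ /\A\p{InLowerHexLetter}\z/ ? 1 : 0;
}

print "c: ", is_lower_hex_letter("c"), "\n";
print "g: ", is_lower_hex_letter("g"), "\n";
```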

Properties accessible through "\p{}" and "\P{}"

       The Perl regular expression "\p{}" and "\P{}" constructs give access to
       most of the Unicode character properties.  The table below shows all
       these constructs, both single and compound forms.

       Compound forms consist of two components, separated by an equals sign
       or a colon.  The first component is the property name, and the second
       component is the particular value of the property to match against, for
       example, "\p{Script_Extensions: Greek}" and
       "\p{Script_Extensions=Greek}" both mean to match characters whose
       Script_Extensions property value is Greek.  ("Script_Extensions" is an
       improved version of the "Script" property.)

       Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
       their equivalent compound forms.  The table shows these equivalences.
       (In our example, "\p{Greek}" is just a shortcut for
       "\p{Script_Extensions=Greek}").  There are also a few Perl-defined
       single forms that are not shortcuts for a compound form.  One such is
       "\p{Word}".  These are also listed in the table.
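
       The forms just described can be tried directly.  This is only a sketch;
       the sample characters are arbitrary choices made for the example.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA
my $latin = "a";

# Compound forms: the equals sign and the colon are interchangeable.
my $with_equals = $alpha =~ /\p{Script_Extensions=Greek}/  ? 1 : 0;
my $with_colon  = $alpha =~ /\p{Script_Extensions: Greek}/ ? 1 : 0;

# Single form: a shortcut for the compound form above.
my $single      = $alpha =~ /\p{Greek}/ ? 1 : 0;
my $latin_greek = $latin =~ /\p{Greek}/ ? 1 : 0;

# A Perl-defined single form with no compound equivalent.
my $word = $latin =~ /\p{Word}/ ? 1 : 0;

print "equals=$with_equals colon=$with_colon single=$single ",
      "latin=$latin_greek word=$word\n";
```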

       In parsing these constructs, Perl always ignores upper/lower case
       differences everywhere within the {braces}.  Thus "\p{Greek}" means the
       same thing as "\p{greek}".  But note that changing the case of the "p"
       or "P" before the left brace completely changes the meaning of the
       construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
       Casing in this document is for improved legibility.
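
       A short sketch of the casing rule: the spelling inside the braces does
       not matter, while the case of the leading "p" inverts the test.  The
       sample characters are arbitrary.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Case inside the braces is ignored, so all three spellings agree.
my $same = grep { $alpha =~ $_ } (qr/\p{Greek}/, qr/\p{greek}/, qr/\p{GREEK}/);

# Upper-case P negates: it matches characters that are NOT Greek.
my $alpha_negated = $alpha =~ /\P{Greek}/ ? 1 : 0;
my $latin_negated = "a"    =~ /\P{Greek}/ ? 1 : 0;

print "spellings matched: $same; \\P on alpha: $alpha_negated; ",
      "\\P on 'a': $latin_negated\n";
```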

       Also, white space, hyphens, and underscores are normally ignored
       everywhere between the {braces}, and hence can be freely added or
       removed even if the "/x" modifier hasn't been specified on the regular
       expression.  But in the table below a 'T' at the beginning of an entry
       means that tighter (stricter) rules are used for that entry:

           Single form ("\p{name}") tighter rules:
               White space, hyphens, and underscores ARE significant except
               for:

               •   white space adjacent to a non-word character

               •   underscores separating digits in numbers

               That means, for example, that you can freely add or remove
               white space adjacent to (but within) the braces without
               affecting the meaning.

           Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
               The tighter rules given above for the single form apply to
               everything to the right of the colon or equals; the looser
               rules still apply to everything to the left.

               That means, for example, that you can freely add or remove
               white space adjacent to (but within) the braces and the colon
               or equal sign.
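
       The sketch below exercises only spellings that the rules above describe
       as accepted: extra white space and dropped underscores for a loosely
       matched entry, and white space next to the braces and the separator for
       an entry marked 'T' ("\p{Age: 1.1}").  It does not try to show a
       rejected spelling.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Loose rules: white space and underscores may be added or removed.
my @loose = (
    qr/\p{Script_Extensions=Greek}/,
    qr/\p{ Script_Extensions = Greek }/,
    qr/\p{ScriptExtensions=Greek}/,
    qr/\p{ script_extensions : greek }/,
);
my $loose_ok = grep { $alpha =~ $_ } @loose;

# Tighter rules for the value "1.1": white space is still allowed next to
# the braces and next to the colon or equals sign.
my @tight = (
    qr/\p{Age: 1.1}/,
    qr/\p{ Age : 1.1 }/,
    qr/\p{Age=V1_1}/,
);
my $tight_ok = grep { "a" =~ $_ } @tight;

print "loose spellings matched: $loose_ok of ", scalar(@loose), "\n";
print "tight spellings matched: $tight_ok of ", scalar(@tight), "\n";
```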

       Some properties are considered obsolete by Unicode, but still
       available.  There are several varieties of obsolescence:

           Stabilized
               A property may be stabilized.  Such a determination does not
               indicate that the property should or should not be used;
               instead it is a declaration that the property will not be
               maintained nor extended for newly encoded characters.  Such
               properties are marked with an 'S' in the table.

           Deprecated
               A property may be deprecated, perhaps because its original
               intent has been replaced by another property, or because its
               specification was somehow defective.  This means that its use
               is strongly discouraged, so much so that a warning will be
               issued if used, unless the regular expression is in the scope
               of a "no warnings 'deprecated'" statement.  A 'D' flags each
               such entry in the table, and the entry there for the longest,
               most descriptive version of the property will give the reason
               it is deprecated, and perhaps advice.  Perl may issue such a
               warning, even for properties that aren't officially deprecated
               by Unicode, when there used to be characters or code points
               that were matched by them, but no longer.  This is to warn you
               that your program may not work like it did on earlier Unicode
               releases.

               A deprecated property may be made unavailable in a future Perl
               version, so it is best to move away from it.

               A deprecated property may also be stabilized, but this fact is
               not shown.

           Obsolete
               Properties marked with an 'O' in the table are considered
               (plain) obsolete.  Generally this designation is given to
               properties that Unicode once used for internal purposes (but
               not any longer).

           Discouraged
               This is not actually a Unicode-specified obsolescence, but
               applies to certain Perl extensions that are present for
               backwards compatibility, but are discouraged from being used.
               These are not obsolete, but their meanings are not stable.
               Future Unicode versions could force any of these extensions to
               be removed without warning, replaced by another property with
               the same name that means something different.  An 'X' flags
               each such entry in the table.  Use the equivalent shown
               instead.

               In particular, matches in the Block property have single forms
               defined by Perl that begin with "In_", "Is_", or even with no
               prefix at all.  Like all DISCOURAGED forms, these are not
               stable.  For example, "\p{Block=Deseret}" can currently be
               written as "\p{In_Deseret}", "\p{Is_Deseret}", or
               "\p{Deseret}".  But, a new Unicode version may come along that
               would force Perl to change the meaning of one or more of these,
               and your program would no longer be correct.  Currently there
               are no such conflicts with the form that begins "In_", but
               there are many with the other two shortcuts, and Unicode
               continues to define new properties that begin with "In", so
               it's quite possible that a conflict will occur in the future.
               The compound form is guaranteed to not become obsolete, and its
               meaning is clearer anyway.  See "Blocks" in perlunicode for
               more information about this.

               User-defined properties must begin with "In" or "Is".  These
               override any Unicode property of the same name.
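
       To illustrate why the compound Block form is the clearer choice, the
       sketch below compares "\p{Block=Arabic}" with the single form
       "\p{Arabic}", which the table lists as a script shortcut and NOT the
       block.  The two sample code points are choices made for this example.

```perl
use strict;
use warnings;

my $beh      = "\x{0628}";   # ARABIC LETTER BEH, inside the Arabic block
my $alef_iso = "\x{FB50}";   # Arabic script, Arabic Presentation Forms-A block
my $deseret  = "\x{10400}";  # first code point of the Deseret block

my %result = (
    beh_block       => ($beh      =~ /\p{Block=Arabic}/  ? 1 : 0),
    beh_script      => ($beh      =~ /\p{Arabic}/        ? 1 : 0),
    alef_iso_block  => ($alef_iso =~ /\p{Block=Arabic}/  ? 1 : 0),
    alef_iso_script => ($alef_iso =~ /\p{Arabic}/        ? 1 : 0),
    deseret_block   => ($deseret  =~ /\p{Block=Deseret}/ ? 1 : 0),
    deseret_in      => ($deseret  =~ /\p{In_Deseret}/    ? 1 : 0),  # discouraged
);

printf "%-16s %d\n", $_, $result{$_} for sort keys %result;
```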

       The table below has two columns.  The left column contains the "\p{}"
       constructs to look up, possibly preceded by the flags mentioned above;
       and the right column contains information about them, like a
       description, or synonyms.  The table shows both the single and compound
       forms for each property that has them.  If the left column is a short
       name for a property, the right column will give its longer, more
       descriptive name; and if the left column is the longest name, the right
       column will show any equivalent shortest name, in both single and
       compound forms if applicable.

       If braces are not needed to specify a property (e.g., "\pL"), the left
       column contains both forms, with and without braces.
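
       For a one-letter property name such as "L", the braced and unbraced
       spellings can be compared directly, as in this small sketch:

```perl
use strict;
use warnings;

# One-letter names may drop the braces; both spellings mean the same thing.
my $unbraced = "A" =~ /\pL/   ? 1 : 0;
my $braced   = "A" =~ /\p{L}/ ? 1 : 0;

# The negated construct has the same two spellings.
my $digit_unbraced = "1" =~ /\PL/   ? 1 : 0;
my $digit_braced   = "1" =~ /\P{L}/ ? 1 : 0;

print "\\pL=$unbraced \\p{L}=$braced \\PL=$digit_unbraced ",
      "\\P{L}=$digit_braced\n";
```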

       The right column will also caution you if a property means something
       different than what might normally be expected.

       All single forms are Perl extensions; a few compound forms are as well,
       and are noted as such.

       Numbers in (parentheses) indicate the total number of Unicode code
       points matched by the property.  For the entries that give the longest,
       most descriptive version of the property, the count is followed by a
       list of some of the code points matched by it.  The list includes all
       the matched characters in the 0-255 range, enclosed in the familiar
       [brackets] the same as a regular expression bracketed character class.
       Following that, the next few higher matching ranges are also given.  To
       avoid visual ambiguity, the SPACE character is represented as "\x20".
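
       A count in the table can be checked by brute force.  The sketch below
       walks every Unicode code point and counts matches for
       "\p{ASCII_Hex_Digit}", for which the table gives 22.  The helper name
       count_matches is hypothetical, and the loop is written for clarity
       rather than speed.

```perl
use strict;
use warnings;

# Hypothetical helper: count the code points in U+0000..U+10FFFF that
# match the given compiled pattern.
sub count_matches {
    my ($re) = @_;
    no warnings 'utf8';          # surrogates and noncharacters are visited too
    my $count = 0;
    for my $cp (0 .. 0x10FFFF) {
        $count++ if chr($cp) =~ $re;
    }
    return $count;
}

my $ahex = count_matches(qr/\p{ASCII_Hex_Digit}/);
print "ASCII_Hex_Digit matches $ahex code points\n";
```

       Counts for other entries depend on the Unicode version that the running
       Perl was built with, so they may differ from the figures in this
       document on older or newer installations.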

       For emphasis, those properties that match no code points at all are
       listed as well in a separate section following the table.

       Most properties match the same code points regardless of whether "/i"
       case-insensitive matching is specified or not.  But a few properties
       are affected.  These are shown with the notation "(/i= other_property)"
       in the second column.  Under case-insensitive matching they match the
       same code points as the property other_property.
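
       The effect of "/i" can be seen with a casing-related property.  This
       sketch assumes that "\p{Lu}" (Uppercase_Letter) is one of the affected
       entries; its own table entry, which lies outside this part of the
       document, is the place to confirm that.

```perl
use strict;
use warnings;

# Without /i, a lowercase letter is not an Uppercase_Letter.
my $plain = "a" =~ /\p{Lu}/  ? 1 : 0;

# With /i, the property is widened to cover cased letters generally.
my $fold  = "a" =~ /\p{Lu}/i ? 1 : 0;

# A non-letter stays unmatched either way.
my $digit = "1" =~ /\p{Lu}/i ? 1 : 0;

print "plain=$plain fold=$fold digit=$digit\n";
```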

       There is no description given for most non-Perl defined properties (See
       <http://www.unicode.org/reports/tr44/> for that).

       For compactness, '*' is used as a wildcard instead of showing all
       possible combinations.  For example, entries like:

        \p{Gc: *}                                  \p{General_Category: *}

       mean that 'Gc' is a synonym for 'General_Category', and anything that
       is valid for the latter is also valid for the former.  Similarly,

        \p{Is_*}                                   \p{*}

       means that if and only if, for example, "\p{Foo}" exists, then
       "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
       thing.  And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
       and "\p{IsFoo=Bar}".  "*" here is restricted to something not beginning
       with an underscore.
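
       The two wildcard entries above can be spot-checked as follows.  The
       sketch covers the 'Gc' synonym and the "Is" prefix on a single form
       only; "\p{Alpha}" stands in for the placeholder "\p{Foo}".

```perl
use strict;
use warnings;

# 'Gc' is a synonym for 'General_Category'.
my $gc_short = "a" =~ /\p{Gc: L}/               ? 1 : 0;
my $gc_long  = "a" =~ /\p{General_Category: L}/ ? 1 : 0;

# A single form may be written with an "Is_" or "Is" prefix.
my @alpha_forms = (qr/\p{Alpha}/, qr/\p{Is_Alpha}/, qr/\p{IsAlpha}/);
my $letter_hits = grep { "a" =~ $_ } @alpha_forms;
my $digit_hits  = grep { "1" =~ $_ } @alpha_forms;

print "gc_short=$gc_short gc_long=$gc_long ",
      "letter_hits=$letter_hits digit_hits=$digit_hits\n";
```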

       Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
       'Y'.  And 'No', 'F', and 'False' are all synonyms for 'N'.  The table
       shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
       for the other possibilities.  Note that not all properties which have
       values 'Yes' and 'No' are binary, and they have all their values
       spelled out without using this wild card, and a "NOT" clause in their
       description that highlights their not being binary.  These also require
       the compound form to match them, whereas true binary properties have
       both single and compound forms available.
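
       The value synonyms for a binary property can be checked in a loop.  The
       sketch uses the Alphabetic property from the table; the helper name
       matches_all is hypothetical.

```perl
use strict;
use warnings;

# Hypothetical helper: true if $char matches \p{Alphabetic=VALUE} for
# every VALUE given.  The property name is resolved when the pattern runs.
sub matches_all {
    my ($char, @values) = @_;
    for my $value (@values) {
        return 0 unless $char =~ /\p{Alphabetic=$value}/;
    }
    return 1;
}

my $yes_forms = matches_all("a", qw(Y Yes T True));
my $no_forms  = matches_all("1", qw(N No F False));

# The single forms express the same two tests.
my $single = ("a" =~ /\p{Alpha}/ && "1" =~ /\P{Alpha}/) ? 1 : 0;

print "yes_forms=$yes_forms no_forms=$no_forms single=$single\n";
```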

       Note that all non-essential underscores are removed in the display of
       the short names below.

       Legend summary:

       * is a wild-card
       (\d+) in the info column gives the number of Unicode code points
       matched by this property.
       D means this is deprecated.
       O means this is obsolete.
       S means this is stabilized.
       T means tighter (stricter) name matching applies.
       X means use of this form is discouraged, and may not be stable.

              NAME                           INFO

          \p{Adlam}               \p{Script_Extensions=Adlam} (Short:
                                    \p{Adlm}; NOT \p{Block=Adlam}) (90)
          \p{Adlm}                \p{Adlam} (= \p{Script_Extensions=Adlam})
                                    (NOT \p{Block=Adlam}) (90)
        X \p{Aegean_Numbers}      \p{Block=Aegean_Numbers} (64)
        T \p{Age: 1.1}            \p{Age=V1_1} (33_979)
          \p{Age: V1_1}           Code point's usage introduced in version
                                    1.1 (33_979: U+0000..01F5, U+01FA..0217,
                                    U+0250..02A8, U+02B0..02DE,
                                    U+02E0..02E9, U+0300..0345 ...)
        T \p{Age: 2.0}            \p{Age=V2_0} (144_521)
          \p{Age: V2_0}           Code point's usage was introduced in
                                    version 2.0; See also Property
                                    'Present_In' (144_521: U+0591..05A1,
                                    U+05A3..05AF, U+05C4, U+0F00..0F47,
                                    U+0F49..0F69, U+0F71..0F8B ...)
        T \p{Age: 2.1}            \p{Age=V2_1} (2)
          \p{Age: V2_1}           Code point's usage was introduced in
                                    version 2.1; See also Property
                                    'Present_In' (2: U+20AC, U+FFFC)
        T \p{Age: 3.0}            \p{Age=V3_0} (10_307)
          \p{Age: V3_0}           Code point's usage was introduced in
                                    version 3.0; See also Property
                                    'Present_In' (10_307: U+01F6..01F9,
                                    U+0218..021F, U+0222..0233,
                                    U+02A9..02AD, U+02DF, U+02EA..02EE ...)
        T \p{Age: 3.1}            \p{Age=V3_1} (44_978)
          \p{Age: V3_1}           Code point's usage was introduced in
                                    version 3.1; See also Property
                                    'Present_In' (44_978: U+03F4..03F5,
                                    U+FDD0..FDEF, U+10300..1031E,
                                    U+10320..10323, U+10330..1034A,
                                    U+10400..10425 ...)
        T \p{Age: 3.2}            \p{Age=V3_2} (1016)
          \p{Age: V3_2}           Code point's usage was introduced in
                                    version 3.2; See also Property
                                    'Present_In' (1016: U+0220, U+034F,
                                    U+0363..036F, U+03D8..03D9, U+03F6,
                                    U+048A..048B ...)
        T \p{Age: 4.0}            \p{Age=V4_0} (1226)
          \p{Age: V4_0}           Code point's usage was introduced in
                                    version 4.0; See also Property
                                    'Present_In' (1226: U+0221,
                                    U+0234..0236, U+02AE..02AF,
                                    U+02EF..02FF, U+0350..0357, U+035D..035F
                                    ...)
        T \p{Age: 4.1}            \p{Age=V4_1} (1273)
          \p{Age: V4_1}           Code point's usage was introduced in
                                    version 4.1; See also Property
                                    'Present_In' (1273: U+0237..0241,
                                    U+0358..035C, U+03FC..03FF,
                                    U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
        T \p{Age: 5.0}            \p{Age=V5_0} (1369)
          \p{Age: V5_0}           Code point's usage was introduced in
                                    version 5.0; See also Property
                                    'Present_In' (1369: U+0242..024F,
                                    U+037B..037D, U+04CF, U+04FA..04FF,
                                    U+0510..0513, U+05BA ...)
        T \p{Age: 5.1}            \p{Age=V5_1} (1624)
          \p{Age: V5_1}           Code point's usage was introduced in
                                    version 5.1; See also Property
                                    'Present_In' (1624: U+0370..0373,
                                    U+0376..0377, U+03CF, U+0487,
                                    U+0514..0523, U+0606..060A ...)
        T \p{Age: 5.2}            \p{Age=V5_2} (6648)
          \p{Age: V5_2}           Code point's usage was introduced in
                                    version 5.2; See also Property
                                    'Present_In' (6648: U+0524..0525,
                                    U+0800..082D, U+0830..083E, U+0900,
                                    U+094E, U+0955 ...)
        T \p{Age: 6.0}            \p{Age=V6_0} (2088)
          \p{Age: V6_0}           Code point's usage was introduced in
                                    version 6.0; See also Property
                                    'Present_In' (2088: U+0526..0527,
                                    U+0620, U+065F, U+0840..085B, U+085E,
                                    U+093A..093B ...)
        T \p{Age: 6.1}            \p{Age=V6_1} (732)
          \p{Age: V6_1}           Code point's usage was introduced in
                                    version 6.1; See also Property
                                    'Present_In' (732: U+058F, U+0604,
                                    U+08A0, U+08A2..08AC, U+08E4..08FE,
                                    U+0AF0 ...)
        T \p{Age: 6.2}            \p{Age=V6_2} (1)
          \p{Age: V6_2}           Code point's usage was introduced in
                                    version 6.2; See also Property
                                    'Present_In' (1: U+20BA)
        T \p{Age: 6.3}            \p{Age=V6_3} (5)
          \p{Age: V6_3}           Code point's usage was introduced in
                                    version 6.3; See also Property
                                    'Present_In' (5: U+061C, U+2066..2069)
        T \p{Age: 7.0}            \p{Age=V7_0} (2834)
          \p{Age: V7_0}           Code point's usage was introduced in
                                    version 7.0; See also Property
                                    'Present_In' (2834: U+037F,
                                    U+0528..052F, U+058D..058E, U+0605,
                                    U+08A1, U+08AD..08B2 ...)
        T \p{Age: 8.0}            \p{Age=V8_0} (7716)
          \p{Age: V8_0}           Code point's usage was introduced in
                                    version 8.0; See also Property
                                    'Present_In' (7716: U+08B3..08B4,
                                    U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
                                    ...)
        T \p{Age: 9.0}            \p{Age=V9_0} (7500)
          \p{Age: V9_0}           Code point's usage was introduced in
                                    version 9.0; See also Property
                                    'Present_In' (7500: U+08B6..08BD,
                                    U+08D4..08E2, U+0C80, U+0D4F,
                                    U+0D54..0D56, U+0D58..0D5E ...)
        T \p{Age: 10.0}           \p{Age=V10_0} (8518)
          \p{Age: V10_0}          Code point's usage was introduced in
                                    version 10.0; See also Property
                                    'Present_In' (8518: U+0860..086A,
                                    U+09FC..09FD, U+0AFA..0AFF, U+0D00,
                                    U+0D3B..0D3C, U+1CF7 ...)
        T \p{Age: 11.0}           \p{Age=V11_0} (684)
          \p{Age: V11_0}          Code point's usage was introduced in
                                    version 11.0; See also Property
                                    'Present_In' (684: U+0560, U+0588,
                                    U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
        T \p{Age: 12.0}           \p{Age=V12_0} (554)
          \p{Age: V12_0}          Code point's usage was introduced in
                                    version 12.0; See also Property
                                    'Present_In' (554: U+0C77, U+0E86,
                                    U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
        T \p{Age: 12.1}           \p{Age=V12_1} (1)
          \p{Age: V12_1}          Code point's usage was introduced in
                                    version 12.1; See also Property
                                    'Present_In' (1: U+32FF)
        T \p{Age: 13.0}           \p{Age=V13_0} (5930)
          \p{Age: V13_0}          Code point's usage was introduced in
                                    version 13.0; See also Property
                                    'Present_In' (5930: U+08BE..08C7,
                                    U+0B55, U+0D04, U+0D81, U+1ABF..1AC0,
                                    U+2B97 ...)
        T \p{Age: 14.0}           \p{Age=V14_0} (838)
          \p{Age: V14_0}          Code point's usage was introduced in
                                    version 14.0; See also Property
                                    'Present_In' (838: U+061D, U+0870..088E,
                                    U+0890..0891, U+0898..089F, U+08B5,
                                    U+08C8..08D2 ...)
        T \p{Age: 15.0}           \p{Age=V15_0} (4489)
          \p{Age: V15_0}          Code point's usage was introduced in
                                    version 15.0; See also Property
                                    'Present_In' (4489: U+0CF3, U+0ECE,
                                    U+10EFD..10EFF, U+1123F..11241,
                                    U+11B00..11B09, U+11F00..11F10 ...)
          \p{Age: NA}             \p{Age=Unassigned} (825_279 plus all
                                    above-Unicode code points)
          \p{Age: Unassigned}     Code point's usage has not been assigned
                                    in any Unicode release thus far.
                                    (Short: \p{Age=NA}) (825_279 plus all
                                    above-Unicode code points:
                                    U+0378..0379, U+0380..0383, U+038B,
                                    U+038D, U+03A2, U+0530 ...)
          \p{Aghb}                \p{Caucasian_Albanian} (=
                                    \p{Script_Extensions=
                                    Caucasian_Albanian}) (NOT \p{Block=
                                    Caucasian_Albanian}) (53)
          \p{AHex}                \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
                                    (22)
          \p{AHex: *}             \p{ASCII_Hex_Digit: *}
          \p{Ahom}                \p{Script_Extensions=Ahom} (NOT \p{Block=
                                    Ahom}) (65)
        X \p{Alchemical}          \p{Alchemical_Symbols} (= \p{Block=
                                    Alchemical_Symbols}) (128)
        X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short:
                                    \p{InAlchemical}) (128)
          \p{All}                 All code points, including those above
                                    Unicode.  Same as qr/./s (1_114_112 plus
                                    all above-Unicode code points:
                                    U+0000..infinity)
          \p{Alnum}               \p{XPosixAlnum} (138_445)
          \p{Alpha}               \p{XPosixAlpha} (= \p{Alphabetic=Y})
                                    (137_765)
          \p{Alpha: *}            \p{Alphabetic: *}
          \p{Alphabetic}          \p{XPosixAlpha} (= \p{Alphabetic=Y})
                                    (137_765)
          \p{Alphabetic: N*}      (Short: \p{Alpha=N}, \P{Alpha}) (976_347
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
                                    \xb6-\xb9\xbb-\xbf\xd7\xf7],
                                    U+02C2..02C5, U+02D2..02DF,
                                    U+02E5..02EB, U+02ED, U+02EF..0344 ...)
          \p{Alphabetic: Y*}      (Short: \p{Alpha=Y}, \p{Alpha}) (137_765:
                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
                                    \xf8-\xff], U+0100..02C1, U+02C6..02D1,
                                    U+02E0..02E4, U+02EC, U+02EE ...)
        X \p{Alphabetic_PF}       \p{Alphabetic_Presentation_Forms} (=
                                    \p{Block=Alphabetic_Presentation_Forms})
                                    (80)
        X \p{Alphabetic_Presentation_Forms} \p{Block=
                                    Alphabetic_Presentation_Forms} (Short:
                                    \p{InAlphabeticPF}) (80)
          \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
                                    Anatolian_Hieroglyphs} (Short: \p{Hluw};
                                    NOT \p{Block=Anatolian_Hieroglyphs})
                                    (583)
        X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
                                    \p{Block=
                                    Ancient_Greek_Musical_Notation}) (80)
        X \p{Ancient_Greek_Musical_Notation} \p{Block=
                                    Ancient_Greek_Musical_Notation} (Short:
                                    \p{InAncientGreekMusic}) (80)
        X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
        X \p{Ancient_Symbols}     \p{Block=Ancient_Symbols} (64)
          \p{Any}                 All Unicode code points (1_114_112:
                                    U+0000..10FFFF)
          \p{Arab}                \p{Arabic} (= \p{Script_Extensions=
                                    Arabic}) (NOT \p{Block=Arabic}) (1414)
          \p{Arabic}              \p{Script_Extensions=Arabic} (Short:
                                    \p{Arab}; NOT \p{Block=Arabic}) (1414)
        X \p{Arabic_Ext_A}        \p{Arabic_Extended_A} (= \p{Block=
                                    Arabic_Extended_A}) (96)
        X \p{Arabic_Ext_B}        \p{Arabic_Extended_B} (= \p{Block=
                                    Arabic_Extended_B}) (48)
        X \p{Arabic_Ext_C}        \p{Arabic_Extended_C} (= \p{Block=
                                    Arabic_Extended_C}) (64)
        X \p{Arabic_Extended_A}   \p{Block=Arabic_Extended_A} (Short:
                                    \p{InArabicExtA}) (96)
        X \p{Arabic_Extended_B}   \p{Block=Arabic_Extended_B} (Short:
                                    \p{InArabicExtB}) (48)
        X \p{Arabic_Extended_C}   \p{Block=Arabic_Extended_C} (Short:
                                    \p{InArabicExtC}) (64)
        X \p{Arabic_Math}         \p{Arabic_Mathematical_Alphabetic_Symbols}
                                    (= \p{Block=
                                    Arabic_Mathematical_Alphabetic_Symbols})
                                    (256)
        X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
                                    Arabic_Mathematical_Alphabetic_Symbols}
                                    (Short: \p{InArabicMath}) (256)
        X \p{Arabic_PF_A}         \p{Arabic_Presentation_Forms_A} (=
                                    \p{Block=Arabic_Presentation_Forms_A})
                                    (688)
        X \p{Arabic_PF_B}         \p{Arabic_Presentation_Forms_B} (=
                                    \p{Block=Arabic_Presentation_Forms_B})
                                    (144)
        X \p{Arabic_Presentation_Forms_A} \p{Block=
                                    Arabic_Presentation_Forms_A} (Short:
                                    \p{InArabicPFA}) (688)
        X \p{Arabic_Presentation_Forms_B} \p{Block=
                                    Arabic_Presentation_Forms_B} (Short:
                                    \p{InArabicPFB}) (144)
        X \p{Arabic_Sup}          \p{Arabic_Supplement} (= \p{Block=
                                    Arabic_Supplement}) (48)
        X \p{Arabic_Supplement}   \p{Block=Arabic_Supplement} (Short:
                                    \p{InArabicSup}) (48)
          \p{Armenian}            \p{Script_Extensions=Armenian} (Short:
                                    \p{Armn}; NOT \p{Block=Armenian}) (96)
          \p{Armi}                \p{Imperial_Aramaic} (=
                                    \p{Script_Extensions=Imperial_Aramaic})
                                    (NOT \p{Block=Imperial_Aramaic}) (31)
          \p{Armn}                \p{Armenian} (= \p{Script_Extensions=
                                    Armenian}) (NOT \p{Block=Armenian}) (96)
        X \p{Arrows}              \p{Block=Arrows} (112)
          \p{ASCII}               \p{Block=Basic_Latin} (128)
          \p{ASCII_Hex_Digit}     \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
                                    (22)
          \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
                                    \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
                                    U+0100..infinity)
          \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
                                    Fa-f])
          \p{Assigned}            All assigned code points (288_767:
                                    U+0000..0377, U+037A..037F,
                                    U+0384..038A, U+038C, U+038E..03A1,
                                    U+03A3..052F ...)
          \p{Avestan}             \p{Script_Extensions=Avestan} (Short:
                                    \p{Avst}; NOT \p{Block=Avestan}) (61)
          \p{Avst}                \p{Avestan} (= \p{Script_Extensions=
                                    Avestan}) (NOT \p{Block=Avestan}) (61)
          \p{Bali}                \p{Balinese} (= \p{Script_Extensions=
                                    Balinese}) (NOT \p{Block=Balinese}) (124)
          \p{Balinese}            \p{Script_Extensions=Balinese} (Short:
                                    \p{Bali}; NOT \p{Block=Balinese}) (124)
          \p{Bamu}                \p{Bamum} (= \p{Script_Extensions=Bamum})
                                    (NOT \p{Block=Bamum}) (657)
          \p{Bamum}               \p{Script_Extensions=Bamum} (Short:
                                    \p{Bamu}; NOT \p{Block=Bamum}) (657)
        X \p{Bamum_Sup}           \p{Bamum_Supplement} (= \p{Block=
                                    Bamum_Supplement}) (576)
        X \p{Bamum_Supplement}    \p{Block=Bamum_Supplement} (Short:
                                    \p{InBamumSup}) (576)
        X \p{Basic_Latin}         \p{ASCII} (= \p{Block=Basic_Latin}) (128)
          \p{Bass}                \p{Bassa_Vah} (= \p{Script_Extensions=
                                    Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
                                    (36)
          \p{Bassa_Vah}           \p{Script_Extensions=Bassa_Vah} (Short:
                                    \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
          \p{Batak}               \p{Script_Extensions=Batak} (Short:
                                    \p{Batk}; NOT \p{Block=Batak}) (56)
535                                    Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
536                                    (36)
537          \p{Bassa_Vah}           \p{Script_Extensions=Bassa_Vah} (Short:
538                                    \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
539          \p{Batak}               \p{Script_Extensions=Batak} (Short:
540                                    \p{Batk}; NOT \p{Block=Batak}) (56)
541          \p{Batk}                \p{Batak} (= \p{Script_Extensions=Batak})
542                                    (NOT \p{Block=Batak}) (56)
          \p{Bc: *}               \p{Bidi_Class: *}
          \p{Beng}                \p{Bengali} (= \p{Script_Extensions=
                                    Bengali}) (NOT \p{Block=Bengali}) (113)
          \p{Bengali}             \p{Script_Extensions=Bengali} (Short:
                                    \p{Beng}; NOT \p{Block=Bengali}) (113)
          \p{Bhaiksuki}           \p{Script_Extensions=Bhaiksuki} (Short:
                                    \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
          \p{Bhks}                \p{Bhaiksuki} (= \p{Script_Extensions=
                                    Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
                                    (97)
          \p{Bidi_C}              \p{Bidi_Control} (= \p{Bidi_Control=Y})
                                    (12)
          \p{Bidi_C: *}           \p{Bidi_Control: *}
          \p{Bidi_Class: AL}      \p{Bidi_Class=Arabic_Letter} (1769)
          \p{Bidi_Class: AN}      \p{Bidi_Class=Arabic_Number} (63)
          \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1769: U+0608,
                                    U+060B, U+060D, U+061B..064A,
                                    U+066D..066F, U+0671..06D5 ...)
          \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (63:
                                    U+0600..0605, U+0660..0669,
                                    U+066B..066C, U+06DD, U+0890..0891,
                                    U+08E2 ...)
          \p{Bidi_Class: B}       \p{Bidi_Class=Paragraph_Separator} (7)
          \p{Bidi_Class: BN}      \p{Bidi_Class=Boundary_Neutral} (4016)
          \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016: [^\t\n
                                    \cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-\xff],
                                    U+180E, U+200B..200D, U+2060..2065,
                                    U+206A..206F, U+FDD0..FDEF ...)
          \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15: [,.\/:
                                    \xa0], U+060C, U+202F, U+2044, U+FE50,
                                    U+FE52 ...)
          \p{Bidi_Class: CS}      \p{Bidi_Class=Common_Separator} (15)
          \p{Bidi_Class: EN}      \p{Bidi_Class=European_Number} (168)
          \p{Bidi_Class: ES}      \p{Bidi_Class=European_Separator} (12)
          \p{Bidi_Class: ET}      \p{Bidi_Class=European_Terminator} (92)
          \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (168: [0-9\xb2-
                                    \xb3\xb9], U+06F0..06F9, U+2070,
                                    U+2074..2079, U+2080..2089, U+2488..249B
                                    ...)
          \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
                                    U+207A..207B, U+208A..208B, U+2212,
                                    U+FB29, U+FE62..FE63 ...)
          \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92: [#\$
                                    \%\xa2-\xa5\xb0-\xb1], U+058F,
                                    U+0609..060A, U+066A, U+09F2..09F3,
                                    U+09FB ...)
          \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
                                    U+2068)
          \p{Bidi_Class: FSI}     \p{Bidi_Class=First_Strong_Isolate} (1)
          \p{Bidi_Class: L}       \p{Bidi_Class=Left_To_Right} (1_096_272
                                    plus all above-Unicode code points)
          \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_272 plus
                                    all above-Unicode code points: [A-Za-z
                                    \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                                    \xff], U+0100..02B8, U+02BB..02C1,
                                    U+02D0..02D1, U+02E0..02E4, U+02EE ...)
          \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
                                    U+202A)
          \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
                                    U+2066)
          \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
                                    U+202D)
          \p{Bidi_Class: LRE}     \p{Bidi_Class=Left_To_Right_Embedding} (1)
          \p{Bidi_Class: LRI}     \p{Bidi_Class=Left_To_Right_Isolate} (1)
          \p{Bidi_Class: LRO}     \p{Bidi_Class=Left_To_Right_Override} (1)
          \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1993:
                                    U+0300..036F, U+0483..0489,
                                    U+0591..05BD, U+05BF, U+05C1..05C2,
                                    U+05C4..05C5 ...)
          \p{Bidi_Class: NSM}     \p{Bidi_Class=Nonspacing_Mark} (1993)
          \p{Bidi_Class: ON}      \p{Bidi_Class=Other_Neutral} (6029)
          \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (6029: [!\"&\'
                                    \(\)*;<=>?\@\[\\\]\^_`\{\|\}~\xa1\xa6-
                                    \xa9\xab-\xac\xae-\xaf\xb4\xb6-\xb8\xbb-
                                    \xbf\xd7\xf7], U+02B9..02BA,
                                    U+02C2..02CF, U+02D2..02DF,
                                    U+02E5..02ED, U+02EF..02FF ...)
          \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7: [\n\r
                                    \x1c-\x1e\x85], U+2029)
          \p{Bidi_Class: PDF}     \p{Bidi_Class=Pop_Directional_Format} (1)
          \p{Bidi_Class: PDI}     \p{Bidi_Class=Pop_Directional_Isolate} (1)
          \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
                                    U+202C)
          \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
                                    U+2069)
          \p{Bidi_Class: R}       \p{Bidi_Class=Right_To_Left} (3647)
          \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3647: U+0590,
                                    U+05BE, U+05C0, U+05C3, U+05C6,
                                    U+05C8..05FF ...)
          \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
                                    U+202B)
          \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
                                    U+2067)
          \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
                                    U+202E)
          \p{Bidi_Class: RLE}     \p{Bidi_Class=Right_To_Left_Embedding} (1)
          \p{Bidi_Class: RLI}     \p{Bidi_Class=Right_To_Left_Isolate} (1)
          \p{Bidi_Class: RLO}     \p{Bidi_Class=Right_To_Left_Override} (1)
          \p{Bidi_Class: S}       \p{Bidi_Class=Segment_Separator} (3)
          \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3: [\t\cK
                                    \x1f])
          \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
                                    U+1680, U+2000..200A, U+2028, U+205F,
                                    U+3000)
          \p{Bidi_Class: WS}      \p{Bidi_Class=White_Space} (17)
          \p{Bidi_Control}        \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
          \p{Bidi_Control: N*}    (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
                                    plus all above-Unicode code points:
                                    U+0000..061B, U+061D..200D,
                                    U+2010..2029, U+202F..2065,
                                    U+206A..infinity)
          \p{Bidi_Control: Y*}    (Short: \p{BidiC=Y}, \p{BidiC}) (12:
                                    U+061C, U+200E..200F, U+202A..202E,
                                    U+2066..2069)
          \p{Bidi_M}              \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
                                    (553)
          \p{Bidi_M: *}           \p{Bidi_Mirrored: *}
          \p{Bidi_Mirrored}       \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
                                    (553)
          \p{Bidi_Mirrored: N*}   (Short: \p{BidiM=N}, \P{BidiM}) (1_113_559
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
                                    Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
                                    \xff], U+0100..0F39, U+0F3E..169A,
                                    U+169D..2038, U+203B..2044, U+2047..207C
                                    ...)
          \p{Bidi_Mirrored: Y*}   (Short: \p{BidiM=Y}, \p{BidiM}) (553:
                                    [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
                                    U+169B..169C, U+2039..203A,
                                    U+2045..2046, U+207D..207E ...)
          \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
                                    (64)
          \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (64: [\)\]
                                    \}], U+0F3B, U+0F3D, U+169C, U+2046,
                                    U+207E ...)
          \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
                                    (1_113_984 plus all above-Unicode code
                                    points)
          \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_984
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=>?
                                    \@A-Z\\\^_`a-z\|~\x7f-\xff],
                                    U+0100..0F39, U+0F3E..169A,
                                    U+169D..2044, U+2047..207C, U+207F..208C
                                    ...)
          \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
                                    (64)
          \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (64:
                                    [\(\[\{], U+0F3A, U+0F3C, U+169B,
                                    U+2045, U+207D ...)
          \p{Blank}               \p{XPosixBlank} (18)
          \p{Blk: *}              \p{Block: *}
          \p{Block: Adlam}        (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
                                    U+1E900..1E95F)
          \p{Block: Aegean_Numbers} (64: U+10100..1013F)
          \p{Block: Ahom}         (NOT \p{Ahom} NOR \p{Is_Ahom}) (80:
                                    U+11700..1174F)
          \p{Block: Alchemical}   \p{Block=Alchemical_Symbols} (128)
          \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
                                    U+1F700..1F77F)
          \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
                                    (80)
          \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
                                    AlphabeticPF}) (80: U+FB00..FB4F)
          \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
                                    NOR \p{Is_Anatolian_Hieroglyphs}) (640:
                                    U+14400..1467F)
          \p{Block: Ancient_Greek_Music} \p{Block=
                                    Ancient_Greek_Musical_Notation} (80)
          \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
                                    AncientGreekMusic}) (80: U+1D200..1D24F)
          \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
          \p{Block: Ancient_Symbols} (64: U+10190..101CF)
          \p{Block: Arabic}       (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
                                    U+0600..06FF)
          \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
          \p{Block: Arabic_Ext_B} \p{Block=Arabic_Extended_B} (48)
          \p{Block: Arabic_Ext_C} \p{Block=Arabic_Extended_C} (64)
          \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
                                    U+08A0..08FF)
          \p{Block: Arabic_Extended_B} (Short: \p{Blk=ArabicExtB}) (48:
                                    U+0870..089F)
          \p{Block: Arabic_Extended_C} (Short: \p{Blk=ArabicExtC}) (64:
                                    U+10EC0..10EFF)
          \p{Block: Arabic_Math}  \p{Block=
                                    Arabic_Mathematical_Alphabetic_Symbols}
                                    (256)
          \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
                                    ArabicMath}) (256: U+1EE00..1EEFF)
          \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
          \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
          \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
                                    (688: U+FB50..FDFF)
          \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
                                    (144: U+FE70..FEFF)
          \p{Block: Arabic_Sup}   \p{Block=Arabic_Supplement} (48)
          \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
                                    U+0750..077F)
          \p{Block: Armenian}     (NOT \p{Armenian} NOR \p{Is_Armenian})
                                    (96: U+0530..058F)
          \p{Block: Arrows}       (112: U+2190..21FF)
          \p{Block: ASCII}        \p{Block=Basic_Latin} (128)
          \p{Block: Avestan}      (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
                                    U+10B00..10B3F)
          \p{Block: Balinese}     (NOT \p{Balinese} NOR \p{Is_Balinese})
                                    (128: U+1B00..1B7F)
          \p{Block: Bamum}        (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
                                    U+A6A0..A6FF)
          \p{Block: Bamum_Sup}    \p{Block=Bamum_Supplement} (576)
          \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
                                    U+16800..16A3F)
          \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
          \p{Block: Bassa_Vah}    (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
                                    (48: U+16AD0..16AFF)
          \p{Block: Batak}        (NOT \p{Batak} NOR \p{Is_Batak}) (64:
                                    U+1BC0..1BFF)
          \p{Block: Bengali}      (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
                                    U+0980..09FF)
          \p{Block: Bhaiksuki}    (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
                                    (112: U+11C00..11C6F)
          \p{Block: Block_Elements} (32: U+2580..259F)
          \p{Block: Bopomofo}     (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
                                    (48: U+3100..312F)
          \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
          \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
                                    U+31A0..31BF)
          \p{Block: Box_Drawing}  (128: U+2500..257F)
          \p{Block: Brahmi}       (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
                                    U+11000..1107F)
          \p{Block: Braille}      \p{Block=Braille_Patterns} (256)
          \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
                                    U+2800..28FF)
          \p{Block: Buginese}     (NOT \p{Buginese} NOR \p{Is_Buginese})
                                    (32: U+1A00..1A1F)
          \p{Block: Buhid}        (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
                                    U+1740..175F)
          \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
                                    (256)
          \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
                                    ByzantineMusic}) (256: U+1D000..1D0FF)
          \p{Block: Canadian_Syllabics} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics}
                                    (640)
          \p{Block: Carian}       (NOT \p{Carian} NOR \p{Is_Carian}) (64:
                                    U+102A0..102DF)
          \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
                                    \p{Is_Caucasian_Albanian}) (64:
                                    U+10530..1056F)
          \p{Block: Chakma}       (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
                                    U+11100..1114F)
          \p{Block: Cham}         (NOT \p{Cham} NOR \p{Is_Cham}) (96:
                                    U+AA00..AA5F)
          \p{Block: Cherokee}     (NOT \p{Cherokee} NOR \p{Is_Cherokee})
                                    (96: U+13A0..13FF)
          \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
          \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
                                    U+AB70..ABBF)
          \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
          \p{Block: Chorasmian}   (NOT \p{Chorasmian} NOR \p{Is_Chorasmian})
                                    (48: U+10FB0..10FDF)
          \p{Block: CJK}          \p{Block=CJK_Unified_Ideographs} (20_992)
          \p{Block: CJK_Compat}   \p{Block=CJK_Compatibility} (256)
          \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
          \p{Block: CJK_Compat_Ideographs} \p{Block=
                                    CJK_Compatibility_Ideographs} (512)
          \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
                                    CJK_Compatibility_Ideographs_Supplement}
                                    (544)
          \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
                                    U+3300..33FF)
          \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
                                    (32: U+FE30..FE4F)
          \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
                                    CJKCompatIdeographs}) (512: U+F900..FAFF)
          \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
                                    CJKCompatIdeographsSup}) (544:
                                    U+2F800..2FA1F)
          \p{Block: CJK_Ext_A}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_A}
                                    (6592)
          \p{Block: CJK_Ext_B}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_B}
                                    (42_720)
          \p{Block: CJK_Ext_C}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_C}
                                    (4160)
          \p{Block: CJK_Ext_D}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_D} (224)
          \p{Block: CJK_Ext_E}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_E}
                                    (5776)
          \p{Block: CJK_Ext_F}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_F}
                                    (7488)
          \p{Block: CJK_Ext_G}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_G}
                                    (4944)
          \p{Block: CJK_Ext_H}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_H}
                                    (4192)
          \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
          \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
                                    (128: U+2E80..2EFF)
          \p{Block: CJK_Strokes}  (48: U+31C0..31EF)
          \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
          \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
                                    (64: U+3000..303F)
          \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
                                    U+4E00..9FFF)
          \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
                                    CJKExtA}) (6592: U+3400..4DBF)
          \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
                                    CJKExtB}) (42_720: U+20000..2A6DF)
          \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
                                    CJKExtC}) (4160: U+2A700..2B73F)
          \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
                                    CJKExtD}) (224: U+2B740..2B81F)
          \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
                                    CJKExtE}) (5776: U+2B820..2CEAF)
          \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
                                    CJKExtF}) (7488: U+2CEB0..2EBEF)
          \p{Block: CJK_Unified_Ideographs_Extension_G} (Short: \p{Blk=
                                    CJKExtG}) (4944: U+30000..3134F)
          \p{Block: CJK_Unified_Ideographs_Extension_H} (Short: \p{Blk=
                                    CJKExtH}) (4192: U+31350..323AF)
          \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
                                    Diacriticals}) (112: U+0300..036F)
          \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
                                    DiacriticalsExt}) (80: U+1AB0..1AFF)
          \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
                                    DiacriticalsForSymbols}) (48:
                                    U+20D0..20FF)
          \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
                                    DiacriticalsSup}) (64: U+1DC0..1DFF)
          \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
                                    U+FE20..FE2F)
          \p{Block: Combining_Marks_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (48)
          \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
                                    IndicNumberForms}) (16: U+A830..A83F)
          \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
          \p{Block: Control_Pictures} (64: U+2400..243F)
          \p{Block: Coptic}       (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
                                    U+2C80..2CFF)
          \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
          \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
          \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
                                    U+1D360..1D37F)
          \p{Block: Cuneiform}    (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
                                    (1024: U+12000..123FF)
          \p{Block: Cuneiform_Numbers} \p{Block=
                                    Cuneiform_Numbers_And_Punctuation} (128)
          \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
                                    CuneiformNumbers}) (128: U+12400..1247F)
          \p{Block: Currency_Symbols} (48: U+20A0..20CF)
          \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
          \p{Block: Cypro_Minoan} (NOT \p{Cypro_Minoan} NOR
                                    \p{Is_Cypro_Minoan}) (112:
                                    U+12F90..12FFF)
          \p{Block: Cyrillic}     (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
                                    (256: U+0400..04FF)
          \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
          \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
          \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
          \p{Block: Cyrillic_Ext_D} \p{Block=Cyrillic_Extended_D} (96)
          \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
                                    U+2DE0..2DFF)
          \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
                                    U+A640..A69F)
          \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
                                    U+1C80..1C8F)
          \p{Block: Cyrillic_Extended_D} (Short: \p{Blk=CyrillicExtD}) (96:
                                    U+1E030..1E08F)
          \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
          \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
                                    U+0500..052F)
          \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
                                    (48)
          \p{Block: Deseret}      (80: U+10400..1044F)
          \p{Block: Devanagari}   (NOT \p{Devanagari} NOR \p{Is_Devanagari})
                                    (128: U+0900..097F)
          \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
          \p{Block: Devanagari_Ext_A} \p{Block=Devanagari_Extended_A} (96)
          \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
                                    U+A8E0..A8FF)
          \p{Block: Devanagari_Extended_A} (Short: \p{Blk=DevanagariExtA})
                                    (96: U+11B00..11B5F)
          \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
          \p{Block: Diacriticals_Ext} \p{Block=
                                    Combining_Diacritical_Marks_Extended}
                                    (80)
          \p{Block: Diacriticals_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (48)
          \p{Block: Diacriticals_Sup} \p{Block=
                                    Combining_Diacritical_Marks_Supplement}
                                    (64)
          \p{Block: Dingbats}     (192: U+2700..27BF)
          \p{Block: Dives_Akuru}  (NOT \p{Dives_Akuru} NOR
                                    \p{Is_Dives_Akuru}) (96: U+11900..1195F)
          \p{Block: Dogra}        (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
                                    U+11800..1184F)
          \p{Block: Domino}       \p{Block=Domino_Tiles} (112)
          \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
                                    U+1F030..1F09F)
          \p{Block: Duployan}     (NOT \p{Duployan} NOR \p{Is_Duployan})
                                    (160: U+1BC00..1BC9F)
          \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
          \p{Block: Egyptian_Hieroglyph_Format_Controls} (48: U+13430..1345F)
          \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
                                    \p{Is_Egyptian_Hieroglyphs}) (1072:
                                    U+13000..1342F)
          \p{Block: Elbasan}      (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
                                    U+10500..1052F)
          \p{Block: Elymaic}      (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
                                    U+10FE0..10FFF)
          \p{Block: Emoticons}    (80: U+1F600..1F64F)
          \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
          \p{Block: Enclosed_Alphanum_Sup} \p{Block=
                                    Enclosed_Alphanumeric_Supplement} (256)
          \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
                                    EnclosedAlphanumSup}) (256:
                                    U+1F100..1F1FF)
          \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
                                    EnclosedAlphanum}) (160: U+2460..24FF)
          \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
                                    (256)
          \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
                                    EnclosedCJK}) (256: U+3200..32FF)
          \p{Block: Enclosed_Ideographic_Sup} \p{Block=
                                    Enclosed_Ideographic_Supplement} (256)
          \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
                                    EnclosedIdeographicSup}) (256:
                                    U+1F200..1F2FF)
          \p{Block: Ethiopic}     (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
                                    (384: U+1200..137F)
          \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
          \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
          \p{Block: Ethiopic_Ext_B} \p{Block=Ethiopic_Extended_B} (32)
          \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
                                    U+2D80..2DDF)
          \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
                                    U+AB00..AB2F)
          \p{Block: Ethiopic_Extended_B} (Short: \p{Blk=EthiopicExtB}) (32:
                                    U+1E7E0..1E7FF)
          \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
          \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
                                    U+1380..139F)
992          \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
993                                    \p{Punct} NOR \p{Is_Punctuation}) (112:
994                                    U+2000..206F)
995          \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
996          \p{Block: Geometric_Shapes_Ext} \p{Block=
997                                    Geometric_Shapes_Extended} (128)
998          \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
999                                    GeometricShapesExt}) (128:
1000                                    U+1F780..1F7FF)
1001          \p{Block: Georgian}     (NOT \p{Georgian} NOR \p{Is_Georgian})
1002                                    (96: U+10A0..10FF)
1003          \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
1004          \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
1005                                    U+1C90..1CBF)
1006          \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
1007          \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
1008                                    U+2D00..2D2F)
1009          \p{Block: Glagolitic}   (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
1010                                    (96: U+2C00..2C5F)
1011          \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
1012          \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
1013                                    (48: U+1E000..1E02F)
1014          \p{Block: Gothic}       (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
1015                                    U+10330..1034F)
1016          \p{Block: Grantha}      (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
1017                                    U+11300..1137F)
1018          \p{Block: Greek}        \p{Block=Greek_And_Coptic} (NOT \p{Greek}
1019                                    NOR \p{Is_Greek}) (144)
1020          \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
1021                                    NOR \p{Is_Greek}) (144: U+0370..03FF)
1022          \p{Block: Greek_Ext}    \p{Block=Greek_Extended} (256)
1023          \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
1024                                    U+1F00..1FFF)
1025          \p{Block: Gujarati}     (NOT \p{Gujarati} NOR \p{Is_Gujarati})
1026                                    (128: U+0A80..0AFF)
1027          \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
1028                                    \p{Is_Gunjala_Gondi}) (80:
1029                                    U+11D60..11DAF)
1030          \p{Block: Gurmukhi}     (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
1031                                    (128: U+0A00..0A7F)
1032          \p{Block: Half_And_Full_Forms} \p{Block=
1033                                    Halfwidth_And_Fullwidth_Forms} (240)
1034          \p{Block: Half_Marks}   \p{Block=Combining_Half_Marks} (16)
1035          \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
1036                                    HalfAndFullForms}) (240: U+FF00..FFEF)
1037          \p{Block: Hangul}       \p{Block=Hangul_Syllables} (NOT \p{Hangul}
1038                                    NOR \p{Is_Hangul}) (11_184)
1039          \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
1040                                    (96: U+3130..318F)
1041          \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
1042          \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
1043                                    U+A960..A97F)
1044          \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
1045                                    U+D7B0..D7FF)
1046          \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
1047                                    NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
1048          \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
1049                                    \p{Is_Hanifi_Rohingya}) (64:
1050                                    U+10D00..10D3F)
1051          \p{Block: Hanunoo}      (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
1052                                    U+1720..173F)
1053          \p{Block: Hatran}       (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
1054                                    U+108E0..108FF)
1055          \p{Block: Hebrew}       (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
1056                                    U+0590..05FF)
1057          \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
1058                                    HighPUSurrogates}) (128: U+DB80..DBFF)
1059          \p{Block: High_PU_Surrogates} \p{Block=
1060                                    High_Private_Use_Surrogates} (128)
1061          \p{Block: High_Surrogates} (896: U+D800..DB7F)
1062          \p{Block: Hiragana}     (NOT \p{Hiragana} NOR \p{Is_Hiragana})
1063                                    (96: U+3040..309F)
1064          \p{Block: IDC}          \p{Block=
1065                                    Ideographic_Description_Characters} (NOT
1066                                    \p{ID_Continue} NOR \p{Is_IDC}) (16)
1067          \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
1068                                    NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
1069                                    U+2FF0..2FFF)
1070          \p{Block: Ideographic_Symbols} \p{Block=
1071                                    Ideographic_Symbols_And_Punctuation} (32)
1072          \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
1073                                    IdeographicSymbols}) (32: U+16FE0..16FFF)
1074          \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
1075                                    \p{Is_Imperial_Aramaic}) (32:
1076                                    U+10840..1085F)
1077          \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1078                                    (16)
1079          \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
1080          \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
1081                                    NOR \p{Is_Inscriptional_Pahlavi}) (32:
1082                                    U+10B60..10B7F)
1083          \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
1084                                    NOR \p{Is_Inscriptional_Parthian}) (32:
1085                                    U+10B40..10B5F)
1086          \p{Block: IPA_Ext}      \p{Block=IPA_Extensions} (96)
1087          \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
1088                                    U+0250..02AF)
1089          \p{Block: Jamo}         \p{Block=Hangul_Jamo} (256)
1090          \p{Block: Jamo_Ext_A}   \p{Block=Hangul_Jamo_Extended_A} (32)
1091          \p{Block: Jamo_Ext_B}   \p{Block=Hangul_Jamo_Extended_B} (80)
1092          \p{Block: Javanese}     (NOT \p{Javanese} NOR \p{Is_Javanese})
1093                                    (96: U+A980..A9DF)
1094          \p{Block: Kaithi}       (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
1095                                    U+11080..110CF)
1096          \p{Block: Kaktovik_Numerals} (32: U+1D2C0..1D2DF)
1097          \p{Block: Kana_Ext_A}   \p{Block=Kana_Extended_A} (48)
1098          \p{Block: Kana_Ext_B}   \p{Block=Kana_Extended_B} (16)
1099          \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
1100                                    U+1B100..1B12F)
1101          \p{Block: Kana_Extended_B} (Short: \p{Blk=KanaExtB}) (16:
1102                                    U+1AFF0..1AFFF)
1103          \p{Block: Kana_Sup}     \p{Block=Kana_Supplement} (256)
1104          \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
1105                                    U+1B000..1B0FF)
1106          \p{Block: Kanbun}       (16: U+3190..319F)
1107          \p{Block: Kangxi}       \p{Block=Kangxi_Radicals} (224)
1108          \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
1109                                    U+2F00..2FDF)
1110          \p{Block: Kannada}      (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
1111                                    U+0C80..0CFF)
1112          \p{Block: Katakana}     (NOT \p{Katakana} NOR \p{Is_Katakana})
1113                                    (96: U+30A0..30FF)
1114          \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
1115          \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
1116                                    KatakanaExt}) (16: U+31F0..31FF)
1117          \p{Block: Kawi}         (NOT \p{Kawi} NOR \p{Is_Kawi}) (96:
1118                                    U+11F00..11F5F)
1119          \p{Block: Kayah_Li}     (48: U+A900..A92F)
1120          \p{Block: Kharoshthi}   (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
1121                                    (96: U+10A00..10A5F)
1122          \p{Block: Khitan_Small_Script} (NOT \p{Khitan_Small_Script} NOR
1123                                    \p{Is_Khitan_Small_Script}) (512:
1124                                    U+18B00..18CFF)
1125          \p{Block: Khmer}        (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
1126                                    U+1780..17FF)
1127          \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
1128          \p{Block: Khojki}       (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
1129                                    U+11200..1124F)
1130          \p{Block: Khudawadi}    (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
1131                                    (80: U+112B0..112FF)
1132          \p{Block: Lao}          (NOT \p{Lao} NOR \p{Is_Lao}) (128:
1133                                    U+0E80..0EFF)
1134          \p{Block: Latin_1}      \p{Block=Latin_1_Supplement} (128)
1135          \p{Block: Latin_1_Sup}  \p{Block=Latin_1_Supplement} (128)
1136          \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
1137                                    \xff])
1138          \p{Block: Latin_Ext_A}  \p{Block=Latin_Extended_A} (128)
1139          \p{Block: Latin_Ext_Additional} \p{Block=
1140                                    Latin_Extended_Additional} (256)
1141          \p{Block: Latin_Ext_B}  \p{Block=Latin_Extended_B} (208)
1142          \p{Block: Latin_Ext_C}  \p{Block=Latin_Extended_C} (32)
1143          \p{Block: Latin_Ext_D}  \p{Block=Latin_Extended_D} (224)
1144          \p{Block: Latin_Ext_E}  \p{Block=Latin_Extended_E} (64)
1145          \p{Block: Latin_Ext_F}  \p{Block=Latin_Extended_F} (64)
1146          \p{Block: Latin_Ext_G}  \p{Block=Latin_Extended_G} (256)
1147          \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
1148                                    U+0100..017F)
1149          \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
1150                                    LatinExtAdditional}) (256: U+1E00..1EFF)
1151          \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
1152                                    U+0180..024F)
1153          \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
1154                                    U+2C60..2C7F)
1155          \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
1156                                    U+A720..A7FF)
1157          \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
1158                                    U+AB30..AB6F)
1159          \p{Block: Latin_Extended_F} (Short: \p{Blk=LatinExtF}) (64:
1160                                    U+10780..107BF)
1161          \p{Block: Latin_Extended_G} (Short: \p{Blk=LatinExtG}) (256:
1162                                    U+1DF00..1DFFF)
1163          \p{Block: Lepcha}       (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
1164                                    U+1C00..1C4F)
1165          \p{Block: Letterlike_Symbols} (80: U+2100..214F)
1166          \p{Block: Limbu}        (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
1167                                    U+1900..194F)
1168          \p{Block: Linear_A}     (NOT \p{Linear_A} NOR \p{Is_Linear_A})
1169                                    (384: U+10600..1077F)
1170          \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
1171          \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
1172          \p{Block: Lisu}         (NOT \p{Lisu} NOR \p{Is_Lisu}) (48:
1173                                    U+A4D0..A4FF)
1174          \p{Block: Lisu_Sup}     \p{Block=Lisu_Supplement} (16)
1175          \p{Block: Lisu_Supplement} (Short: \p{Blk=LisuSup}) (16:
1176                                    U+11FB0..11FBF)
1177          \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
1178          \p{Block: Lycian}       (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
1179                                    U+10280..1029F)
1180          \p{Block: Lydian}       (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
1181                                    U+10920..1093F)
1182          \p{Block: Mahajani}     (NOT \p{Mahajani} NOR \p{Is_Mahajani})
1183                                    (48: U+11150..1117F)
1184          \p{Block: Mahjong}      \p{Block=Mahjong_Tiles} (48)
1185          \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
1186                                    U+1F000..1F02F)
1187          \p{Block: Makasar}      (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
1188                                    U+11EE0..11EFF)
1189          \p{Block: Malayalam}    (NOT \p{Malayalam} NOR \p{Is_Malayalam})
1190                                    (128: U+0D00..0D7F)
1191          \p{Block: Mandaic}      (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
1192                                    U+0840..085F)
1193          \p{Block: Manichaean}   (NOT \p{Manichaean} NOR \p{Is_Manichaean})
1194                                    (64: U+10AC0..10AFF)
1195          \p{Block: Marchen}      (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
1196                                    U+11C70..11CBF)
1197          \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
1198                                    \p{Is_Masaram_Gondi}) (96:
1199                                    U+11D00..11D5F)
1200          \p{Block: Math_Alphanum} \p{Block=
1201                                    Mathematical_Alphanumeric_Symbols} (1024)
1202          \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
1203          \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
1204                                    MathAlphanum}) (1024: U+1D400..1D7FF)
1205          \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
1206                                    (256: U+2200..22FF)
1207          \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF)
1208          \p{Block: Medefaidrin}  (NOT \p{Medefaidrin} NOR
1209                                    \p{Is_Medefaidrin}) (96: U+16E40..16E9F)
1210          \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
1211                                    \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
1212          \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
1213          \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
1214                                    (32: U+AAE0..AAFF)
1215          \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
1216                                    \p{Is_Mende_Kikakui}) (224:
1217                                    U+1E800..1E8DF)
1218          \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
1219                                    \p{Is_Meroitic_Cursive}) (96:
1220                                    U+109A0..109FF)
1221          \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
1222          \p{Block: Miao}         (NOT \p{Miao} NOR \p{Is_Miao}) (160:
1223                                    U+16F00..16F9F)
1224          \p{Block: Misc_Arrows}  \p{Block=Miscellaneous_Symbols_And_Arrows}
1225                                    (256)
1226          \p{Block: Misc_Math_Symbols_A} \p{Block=
1227                                    Miscellaneous_Mathematical_Symbols_A}
1228                                    (48)
1229          \p{Block: Misc_Math_Symbols_B} \p{Block=
1230                                    Miscellaneous_Mathematical_Symbols_B}
1231                                    (128)
1232          \p{Block: Misc_Pictographs} \p{Block=
1233                                    Miscellaneous_Symbols_And_Pictographs}
1234                                    (768)
1235          \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
1236          \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
1237          \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
1238                                    MiscMathSymbolsA}) (48: U+27C0..27EF)
1239          \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
1240                                    MiscMathSymbolsB}) (128: U+2980..29FF)
1241          \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
1242                                    (256: U+2600..26FF)
1243          \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
1244                                    MiscArrows}) (256: U+2B00..2BFF)
1245          \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
1246                                    MiscPictographs}) (768: U+1F300..1F5FF)
1247          \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
1248                                    (256: U+2300..23FF)
1249          \p{Block: Modi}         (NOT \p{Modi} NOR \p{Is_Modi}) (96:
1250                                    U+11600..1165F)
1251          \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
1252          \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
1253          \p{Block: Mongolian}    (NOT \p{Mongolian} NOR \p{Is_Mongolian})
1254                                    (176: U+1800..18AF)
1255          \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
1256          \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
1257                                    U+11660..1167F)
1258          \p{Block: Mro}          (NOT \p{Mro} NOR \p{Is_Mro}) (48:
1259                                    U+16A40..16A6F)
1260          \p{Block: Multani}      (NOT \p{Multani} NOR \p{Is_Multani}) (48:
1261                                    U+11280..112AF)
1262          \p{Block: Music}        \p{Block=Musical_Symbols} (256)
1263          \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
1264                                    U+1D100..1D1FF)
1265          \p{Block: Myanmar}      (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
1266                                    U+1000..109F)
1267          \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
1268          \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
1269          \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
1270                                    U+AA60..AA7F)
1271          \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
1272                                    U+A9E0..A9FF)
1273          \p{Block: Nabataean}    (NOT \p{Nabataean} NOR \p{Is_Nabataean})
1274                                    (48: U+10880..108AF)
1275          \p{Block: Nag_Mundari}  (NOT \p{Nag_Mundari} NOR
1276                                    \p{Is_Nag_Mundari}) (48: U+1E4D0..1E4FF)
1277          \p{Block: Nandinagari}  (NOT \p{Nandinagari} NOR
1278                                    \p{Is_Nandinagari}) (96: U+119A0..119FF)
1279          \p{Block: NB}           \p{Block=No_Block} (820_944 plus all
1280                                    above-Unicode code points)
1281          \p{Block: New_Tai_Lue}  (NOT \p{New_Tai_Lue} NOR
1282                                    \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
1283          \p{Block: Newa}         (NOT \p{Newa} NOR \p{Is_Newa}) (128:
1284                                    U+11400..1147F)
1285          \p{Block: NKo}          (NOT \p{Nko} NOR \p{Is_NKo}) (64:
1286                                    U+07C0..07FF)
1287          \p{Block: No_Block}     (Short: \p{Blk=NB}) (820_944 plus all
1288                                    above-Unicode code points: U+2FE0..2FEF,
1289                                    U+10200..1027F, U+103E0..103FF,
1290                                    U+105C0..105FF, U+107C0..107FF,
1291                                    U+108B0..108DF ...)
1292          \p{Block: Number_Forms} (64: U+2150..218F)
1293          \p{Block: Nushu}        (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
1294                                    U+1B170..1B2FF)
1295          \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong}
1296                                    NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80:
1297                                    U+1E100..1E14F)
1298          \p{Block: OCR}          \p{Block=Optical_Character_Recognition}
1299                                    (32)
1300          \p{Block: Ogham}        (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
1301                                    U+1680..169F)
1302          \p{Block: Ol_Chiki}     (48: U+1C50..1C7F)
1303          \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
1304                                    \p{Is_Old_Hungarian}) (128:
1305                                    U+10C80..10CFF)
1306          \p{Block: Old_Italic}   (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
1307                                    (48: U+10300..1032F)
1308          \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
1309          \p{Block: Old_Permic}   (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
1310                                    (48: U+10350..1037F)
1311          \p{Block: Old_Persian}  (NOT \p{Old_Persian} NOR
1312                                    \p{Is_Old_Persian}) (64: U+103A0..103DF)
1313          \p{Block: Old_Sogdian}  (NOT \p{Old_Sogdian} NOR
1314                                    \p{Is_Old_Sogdian}) (48: U+10F00..10F2F)
1315          \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
1316          \p{Block: Old_Turkic}   (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
1317                                    (80: U+10C00..10C4F)
1318          \p{Block: Old_Uyghur}   (NOT \p{Old_Uyghur} NOR \p{Is_Old_Uyghur})
1319                                    (64: U+10F70..10FAF)
1320          \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
1321                                    U+2440..245F)
1322          \p{Block: Oriya}        (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
1323                                    U+0B00..0B7F)
1324          \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
1325          \p{Block: Osage}        (NOT \p{Osage} NOR \p{Is_Osage}) (80:
1326                                    U+104B0..104FF)
1327          \p{Block: Osmanya}      (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
1328                                    U+10480..104AF)
1329          \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F)
1330          \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
1331                                    \p{Is_Pahawh_Hmong}) (144:
1332                                    U+16B00..16B8F)
1333          \p{Block: Palmyrene}    (32: U+10860..1087F)
1334          \p{Block: Pau_Cin_Hau}  (NOT \p{Pau_Cin_Hau} NOR
1335                                    \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
1336          \p{Block: Phags_Pa}     (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
1337                                    (64: U+A840..A87F)
1338          \p{Block: Phaistos}     \p{Block=Phaistos_Disc} (48)
1339          \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
1340                                    U+101D0..101FF)
1341          \p{Block: Phoenician}   (NOT \p{Phoenician} NOR \p{Is_Phoenician})
1342                                    (32: U+10900..1091F)
1343          \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
1344          \p{Block: Phonetic_Ext_Sup} \p{Block=
1345                                    Phonetic_Extensions_Supplement} (64)
1346          \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
1347                                    U+1D00..1D7F)
1348          \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
1349                                    PhoneticExtSup}) (64: U+1D80..1DBF)
1350          \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
1351          \p{Block: Private_Use}  \p{Block=Private_Use_Area} (NOT
1352                                    \p{Private_Use} NOR \p{Is_Private_Use})
1353                                    (6400)
1354          \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
1355                                    \p{Private_Use} NOR \p{Is_Private_Use})
1356                                    (6400: U+E000..F8FF)
1357          \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
1358                                    \p{Is_Psalter_Pahlavi}) (48:
1359                                    U+10B80..10BAF)
1360          \p{Block: PUA}          \p{Block=Private_Use_Area} (NOT
1361                                    \p{Private_Use} NOR \p{Is_Private_Use})
1362                                    (6400)
1363          \p{Block: Punctuation}  \p{Block=General_Punctuation} (NOT
1364                                    \p{Punct} NOR \p{Is_Punctuation}) (112)
1365          \p{Block: Rejang}       (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
1366                                    U+A930..A95F)
1367          \p{Block: Rumi}         \p{Block=Rumi_Numeral_Symbols} (32)
1368          \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
1369                                    U+10E60..10E7F)
1370          \p{Block: Runic}        (NOT \p{Runic} NOR \p{Is_Runic}) (96:
1371                                    U+16A0..16FF)
1372          \p{Block: Samaritan}    (NOT \p{Samaritan} NOR \p{Is_Samaritan})
1373                                    (64: U+0800..083F)
1374          \p{Block: Saurashtra}   (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
1375                                    (96: U+A880..A8DF)
1376          \p{Block: Sharada}      (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
1377                                    U+11180..111DF)
1378          \p{Block: Shavian}      (48: U+10450..1047F)
1379          \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
1380          \p{Block: Siddham}      (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
1381                                    U+11580..115FF)
1382          \p{Block: Sinhala}      (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
1383                                    U+0D80..0DFF)
1384          \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
1385          \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
1386                                    U+FE50..FE6F)
1387          \p{Block: Small_Forms}  \p{Block=Small_Form_Variants} (32)
1388          \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64)
1389          \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64:
1390                                    U+1B130..1B16F)
1391          \p{Block: Sogdian}      (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64:
1392                                    U+10F30..10F6F)
1393          \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
1394                                    \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
1395          \p{Block: Soyombo}      (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
1396                                    U+11A50..11AAF)
1397          \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
1398                                    ModifierLetters}) (80: U+02B0..02FF)
1399          \p{Block: Specials}     (16: U+FFF0..FFFF)
1400          \p{Block: Sundanese}    (NOT \p{Sundanese} NOR \p{Is_Sundanese})
1401                                    (64: U+1B80..1BBF)
1402          \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
1403          \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
1404                                    U+1CC0..1CCF)
1405          \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
1406          \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
1407          \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
1408          \p{Block: Sup_Math_Operators} \p{Block=
1409                                    Supplemental_Mathematical_Operators}
1410                                    (256)
1411          \p{Block: Sup_PUA_A}    \p{Block=Supplementary_Private_Use_Area_A}
1412                                    (65_536)
1413          \p{Block: Sup_PUA_B}    \p{Block=Supplementary_Private_Use_Area_B}
1414                                    (65_536)
1415          \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
1416          \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
1417                                    Supplemental_Symbols_And_Pictographs}
1418                                    (256)
1419          \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
1420          \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
1421                                    SuperAndSub}) (48: U+2070..209F)
1422          \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
1423                                    U+27F0..27FF)
1424          \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
1425                                    U+2900..297F)
1426          \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
1427                                    U+1F800..1F8FF)
1428          \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
1429                                    SupMathOperators}) (256: U+2A00..2AFF)
1430          \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
1431                                    SupPunctuation}) (128: U+2E00..2E7F)
1432          \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
1433                                    SupSymbolsAndPictographs}) (256:
1434                                    U+1F900..1F9FF)
1435          \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
1436                                    SupPUAA}) (65_536: U+F0000..FFFFF)
1437          \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
1438                                    SupPUAB}) (65_536: U+100000..10FFFF)
1439          \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
1440          \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
1441                                    \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
1442          \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block=
1443                                    Symbols_And_Pictographs_Extended_A} (144)
1444          \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk=
1445                                    SymbolsAndPictographsExtA}) (144:
1446                                    U+1FA70..1FAFF)
1447          \p{Block: Symbols_For_Legacy_Computing} (256: U+1FB00..1FBFF)
1448          \p{Block: Syriac}       (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
1449                                    U+0700..074F)
1450          \p{Block: Syriac_Sup}   \p{Block=Syriac_Supplement} (16)
1451          \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
1452                                    U+0860..086F)
1453          \p{Block: Tagalog}      (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
1454                                    U+1700..171F)
1455          \p{Block: Tagbanwa}     (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
1456                                    (32: U+1760..177F)
1457          \p{Block: Tags}         (128: U+E0000..E007F)
1458          \p{Block: Tai_Le}       (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
1459                                    U+1950..197F)
1460          \p{Block: Tai_Tham}     (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
1461                                    (144: U+1A20..1AAF)
1462          \p{Block: Tai_Viet}     (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
1463                                    (96: U+AA80..AADF)
1464          \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1465          \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
1466                                    U+1D300..1D35F)
1467          \p{Block: Takri}        (NOT \p{Takri} NOR \p{Is_Takri}) (80:
1468                                    U+11680..116CF)
1469          \p{Block: Tamil}        (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
1470                                    U+0B80..0BFF)
1471          \p{Block: Tamil_Sup}    \p{Block=Tamil_Supplement} (64)
1472          \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
1473                                    U+11FC0..11FFF)
1474          \p{Block: Tangsa}       (NOT \p{Tangsa} NOR \p{Is_Tangsa}) (96:
1475                                    U+16A70..16ACF)
1476          \p{Block: Tangut}       (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
1477                                    U+17000..187FF)
1478          \p{Block: Tangut_Components} (768: U+18800..18AFF)
1479          \p{Block: Tangut_Sup}   \p{Block=Tangut_Supplement} (128)
1480          \p{Block: Tangut_Supplement} (Short: \p{Blk=TangutSup}) (128:
1481                                    U+18D00..18D7F)
1482          \p{Block: Telugu}       (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
1483                                    U+0C00..0C7F)
1484          \p{Block: Thaana}       (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
1485                                    U+0780..07BF)
1486          \p{Block: Thai}         (NOT \p{Thai} NOR \p{Is_Thai}) (128:
1487                                    U+0E00..0E7F)
1488          \p{Block: Tibetan}      (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
1489                                    U+0F00..0FFF)
1490          \p{Block: Tifinagh}     (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
1491                                    (80: U+2D30..2D7F)
1492          \p{Block: Tirhuta}      (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
1493                                    U+11480..114DF)
1494          \p{Block: Toto}         (NOT \p{Toto} NOR \p{Is_Toto}) (48:
1495                                    U+1E290..1E2BF)
1496          \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1497                                    (128)
1498          \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1499                                    TransportAndMap}) (128: U+1F680..1F6FF)
1500          \p{Block: UCAS}         \p{Block=
1501                                    Unified_Canadian_Aboriginal_Syllabics}
1502                                    (640)
1503          \p{Block: UCAS_Ext}     \p{Block=
1504                                    Unified_Canadian_Aboriginal_Syllabics_-
1505                                    Extended} (80)
1506          \p{Block: UCAS_Ext_A}   \p{Block=
1507                                    Unified_Canadian_Aboriginal_Syllabics_-
1508                                    Extended_A} (16)
1509          \p{Block: Ugaritic}     (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
1510                                    (32: U+10380..1039F)
1511          \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1512                                    UCAS}) (640: U+1400..167F)
1513          \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1514                                    \p{Blk=UCASExt}) (80: U+18B0..18FF)
1515          \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended_A}
1516                                    (Short: \p{Blk=UCASExtA}) (16:
1517                                    U+11AB0..11ABF)
1518          \p{Block: Vai}          (NOT \p{Vai} NOR \p{Is_Vai}) (320:
1519                                    U+A500..A63F)
1520          \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1521                                    \p{Variation_Selector} NOR \p{Is_VS})
1522                                    (16: U+FE00..FE0F)
1523          \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1524                                    (240: U+E0100..E01EF)
1525          \p{Block: Vedic_Ext}    \p{Block=Vedic_Extensions} (48)
1526          \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
1527                                    U+1CD0..1CFF)
1528          \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
1529          \p{Block: Vithkuqi}     (NOT \p{Vithkuqi} NOR \p{Is_Vithkuqi})
1530                                    (80: U+10570..105BF)
1531          \p{Block: VS}           \p{Block=Variation_Selectors} (NOT
1532                                    \p{Variation_Selector} NOR \p{Is_VS})
1533                                    (16)
1534          \p{Block: VS_Sup}       \p{Block=Variation_Selectors_Supplement}
1535                                    (240)
1536          \p{Block: Wancho}       (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
1537                                    U+1E2C0..1E2FF)
1538          \p{Block: Warang_Citi}  (NOT \p{Warang_Citi} NOR
1539                                    \p{Is_Warang_Citi}) (96: U+118A0..118FF)
1540          \p{Block: Yezidi}       (NOT \p{Yezidi} NOR \p{Is_Yezidi}) (64:
1541                                    U+10E80..10EBF)
1542          \p{Block: Yi_Radicals}  (64: U+A490..A4CF)
1543          \p{Block: Yi_Syllables} (1168: U+A000..A48F)
1544          \p{Block: Yijing}       \p{Block=Yijing_Hexagram_Symbols} (64)
1545          \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
1546                                    U+4DC0..4DFF)
1547          \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
1548                                    \p{Is_Zanabazar_Square}) (80:
1549                                    U+11A00..11A4F)
1550          \p{Block: Znamenny_Music} \p{Block=Znamenny_Musical_Notation} (208)
1551          \p{Block: Znamenny_Musical_Notation} (Short: \p{Blk=
1552                                    ZnamennyMusic}) (208: U+1CF00..1CFCF)
1553        X \p{Block_Elements}      \p{Block=Block_Elements} (32)
1554          \p{Bopo}                \p{Bopomofo} (= \p{Script_Extensions=
1555                                    Bopomofo}) (NOT \p{Block=Bopomofo}) (117)
1556          \p{Bopomofo}            \p{Script_Extensions=Bopomofo} (Short:
1557                                    \p{Bopo}; NOT \p{Block=Bopomofo}) (117)
1558        X \p{Bopomofo_Ext}        \p{Bopomofo_Extended} (= \p{Block=
1559                                    Bopomofo_Extended}) (32)
1560        X \p{Bopomofo_Extended}   \p{Block=Bopomofo_Extended} (Short:
1561                                    \p{InBopomofoExt}) (32)
1562        X \p{Box_Drawing}         \p{Block=Box_Drawing} (128)
1563          \p{Bpt: *}              \p{Bidi_Paired_Bracket_Type: *}
1564          \p{Brah}                \p{Brahmi} (= \p{Script_Extensions=
1565                                    Brahmi}) (NOT \p{Block=Brahmi}) (115)
1566          \p{Brahmi}              \p{Script_Extensions=Brahmi} (Short:
1567                                    \p{Brah}; NOT \p{Block=Brahmi}) (115)
1568          \p{Brai}                \p{Braille} (= \p{Script_Extensions=
1569                                    Braille}) (256)
1570          \p{Braille}             \p{Script_Extensions=Braille} (Short:
1571                                    \p{Brai}) (256)
1572        X \p{Braille_Patterns}    \p{Block=Braille_Patterns} (Short:
1573                                    \p{InBraille}) (256)
1574          \p{Bugi}                \p{Buginese} (= \p{Script_Extensions=
1575                                    Buginese}) (NOT \p{Block=Buginese}) (31)
1576          \p{Buginese}            \p{Script_Extensions=Buginese} (Short:
1577                                    \p{Bugi}; NOT \p{Block=Buginese}) (31)
1578          \p{Buhd}                \p{Buhid} (= \p{Script_Extensions=Buhid})
1579                                    (NOT \p{Block=Buhid}) (22)
1580          \p{Buhid}               \p{Script_Extensions=Buhid} (Short:
1581                                    \p{Buhd}; NOT \p{Block=Buhid}) (22)
1582        X \p{Byzantine_Music}     \p{Byzantine_Musical_Symbols} (= \p{Block=
1583                                    Byzantine_Musical_Symbols}) (256)
1584        X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1585                                    (Short: \p{InByzantineMusic}) (256)
1586          \p{C} \pC               \p{Other} (= \p{General_Category=Other})
1587                                    (965_096 plus all above-Unicode code
1588                                    points)
1589          \p{Cakm}                \p{Chakma} (= \p{Script_Extensions=
1590                                    Chakma}) (NOT \p{Block=Chakma}) (91)
1591          \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1592                                    (Short: \p{Cans}) (726)
1593        X \p{Canadian_Syllabics}  \p{Unified_Canadian_Aboriginal_Syllabics}
1594                                    (= \p{Block=
1595                                    Unified_Canadian_Aboriginal_Syllabics})
1596                                    (640)
1597        T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1598                                    Not_Reordered} (1_113_190 plus all
1599                                    above-Unicode code points)
1600        T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1601                                    Overlay} (32)
1602        T \p{Canonical_Combining_Class: 6} \p{Canonical_Combining_Class=
1603                                    Han_Reading} (2)
1604        T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1605                                    Nukta} (27)
1606        T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1607                                    Kana_Voicing} (2)
1608        T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1609                                    Virama} (65)
1610        T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1611                                    CCC10} (1)
1612          \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
1613                                    U+05B0)
1614        T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1615                                    CCC11} (1)
1616          \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
1617                                    U+05B1)
1618        T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1619                                    CCC12} (1)
1620          \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
1621                                    U+05B2)
1622        T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1623                                    CCC13} (1)
1624          \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
1625                                    U+05B3)
1626        T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1627                                    CCC14} (1)
1628          \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
1629                                    U+05B4)
1630        T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1631                                    CCC15} (1)
1632          \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
1633                                    U+05B5)
1634        T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1635                                    CCC16} (1)
1636          \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
1637                                    U+05B6)
1638        T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1639                                    CCC17} (1)
1640          \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
1641                                    U+05B7)
1642        T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1643                                    CCC18} (2)
1644          \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
1645                                    U+05B8, U+05C7)
1646        T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1647                                    CCC19} (2)
1648          \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
1649                                    U+05B9..05BA)
1650        T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1651                                    CCC20} (1)
1652          \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
1653                                    U+05BB)
1654        T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1655                                    CCC21} (1)
1656          \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
1657                                    U+05BC)
1658        T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1659                                    CCC22} (1)
1660          \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
1661                                    U+05BD)
1662        T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1663                                    CCC23} (1)
1664          \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
1665                                    U+05BF)
1666        T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1667                                    CCC24} (1)
1668          \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
1669                                    U+05C1)
1670        T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1671                                    CCC25} (1)
1672          \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
1673                                    U+05C2)
1674        T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1675                                    CCC26} (1)
1676          \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
1677                                    U+FB1E)
1678        T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1679                                    CCC27} (2)
1680          \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
1681                                    U+064B, U+08F0)
1682        T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1683                                    CCC28} (2)
1684          \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
1685                                    U+064C, U+08F1)
1686        T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1687                                    CCC29} (2)
1688          \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
1689                                    U+064D, U+08F2)
1690        T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1691                                    CCC30} (2)
1692          \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
1693                                    U+0618, U+064E)
1694        T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1695                                    CCC31} (2)
1696          \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
1697                                    U+0619, U+064F)
1698        T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1699                                    CCC32} (2)
1700          \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
1701                                    U+061A, U+0650)
1702        T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1703                                    CCC33} (1)
1704          \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
1705                                    U+0651)
1706        T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1707                                    CCC34} (1)
1708          \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
1709                                    U+0652)
1710        T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1711                                    CCC35} (1)
1712          \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
1713                                    U+0670)
1714        T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1715                                    CCC36} (1)
1716          \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
1717                                    U+0711)
1718        T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1719                                    CCC84} (1)
1720          \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
1721                                    U+0C55)
1722        T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1723                                    CCC91} (1)
1724          \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
1725                                    U+0C56)
1726        T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1727                                    CCC103} (2)
1728          \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
1729                                    U+0E38..0E39)
1730        T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1731                                    CCC107} (4)
1732          \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
1733                                    U+0E48..0E4B)
1734        T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1735                                    CCC118} (2)
1736          \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
1737                                    U+0EB8..0EB9)
1738        T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1739                                    CCC122} (4)
1740          \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
1741                                    U+0EC8..0ECB)
1742        T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1743                                    CCC129} (1)
1744          \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
1745                                    U+0F71)
1746        T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1747                                    CCC130} (6)
1748          \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
1749                                    U+0F72, U+0F7A..0F7D, U+0F80)
1750        T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1751                                    CCC132} (1)
1752          \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
1753                                    U+0F74)
1754        T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1755                                    CCC133} (0)
1756          \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1757        T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1758                                    Attached_Below_Left} (0)
1759        T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1760                                    Attached_Below} (5)
1761        T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1762                                    Attached_Above} (1)
1763        T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1764                                    Attached_Above_Right} (9)
1765        T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1766                                    Below_Left} (2)
1767        T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1768                                    Below} (181)
1769        T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1770                                    Below_Right} (4)
1771        T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1772                                    Left} (2)
1773        T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1774                                    Right} (1)
1775        T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1776                                    Above_Left} (5)
1777        T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1778                                    Above} (510)
1779        T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1780                                    Above_Right} (7)
1781        T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1782                                    Double_Below} (4)
1783        T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1784                                    Double_Above} (5)
1785        T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1786                                    Iota_Subscript} (1)
1787          \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1788                                    Above} (510)
1789          \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (510:
1790                                    U+0300..0314, U+033D..0344, U+0346,
1791                                    U+034A..034C, U+0350..0352, U+0357 ...)
1792          \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
1793                                    U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
1794          \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (7:
1795                                    U+0315, U+031A, U+0358, U+1DF6, U+302C,
1796                                    U+1E4EC..1E4ED)
1797          \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1798                                    Above_Left} (5)
1799          \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1800                                    Above_Right} (7)
1801          \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1802                                    Attached_Above} (1)
1803          \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1804                                    Attached_Above_Right} (9)
1805          \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1806                                    Attached_Below} (5)
1807          \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1808                                    Attached_Below_Left} (0)
1809          \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1810                                    (1: U+1DCE)
1811          \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1812                                    \p{Ccc=ATAR}) (9: U+031B, U+0F39,
1813                                    U+1D165..1D166, U+1D16E..1D172)
1814          \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1815                                    (5: U+0321..0322, U+0327..0328, U+1DD0)
1816          \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1817                                    ATBL}) (0)
1818          \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1819                                    Below} (181)
1820          \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (181:
1821                                    U+0316..0319, U+031C..0320,
1822                                    U+0323..0326, U+0329..0333,
1823                                    U+0339..033C, U+0347..0349 ...)
1824          \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (2:
1825                                    U+1DFA, U+302A)
1826          \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
1827                                    U+059A, U+05AD, U+1939, U+302D)
1828          \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1829                                    Below_Left} (2)
1830          \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1831                                    Below_Right} (4)
1832          \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1833                                    Double_Above} (5)
1834          \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1835                                    Double_Below} (4)
1836          \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
1837                                    (5: U+035D..035E, U+0360..0361, U+1DCD)
1838          \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
1839                                    (4: U+035C, U+035F, U+0362, U+1DFC)
1840          \p{Canonical_Combining_Class: Han_Reading} (Short: \p{Ccc=HANR})
1841                                    (2: U+16FF0..16FF1)
1842          \p{Canonical_Combining_Class: HANR} \p{Canonical_Combining_Class=
1843                                    Han_Reading} (2)
1844          \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1845                                    (1: U+0345)
1846          \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1847                                    Iota_Subscript} (1)
1848          \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
1849                                    (2: U+3099..309A)
1850          \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1851                                    Kana_Voicing} (2)
1852          \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1853                                    Left} (2)
1854          \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
1855                                    U+302E..302F)
1856          \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1857                                    Nukta} (27)
1858          \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1859                                    (1_113_190 plus all above-Unicode code
1860                                    points: U+0000..02FF, U+034F,
1861                                    U+0370..0482, U+0488..0590, U+05BE,
1862                                    U+05C0 ...)
1863          \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1864                                    Not_Reordered} (1_113_190 plus all
1865                                    above-Unicode code points)
1866          \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (27:
1867                                    U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
1868                                    U+0C3C ...)
1869          \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1870                                    Overlay} (32)
1871          \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
1872                                    U+0334..0338, U+1CD4, U+1CE2..1CE8,
1873                                    U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
1874                                    ...)
1875          \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1876                                    Right} (1)
1877          \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
1878                                    U+1D16D)
1879          \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (65:
1880                                    U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
1881                                    U+0BCD ...)
1882          \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1883                                    Virama} (65)
1884          \p{Cans}                \p{Canadian_Aboriginal} (=
1885                                    \p{Script_Extensions=
1886                                    Canadian_Aboriginal}) (726)
1887          \p{Cari}                \p{Carian} (= \p{Script_Extensions=
1888                                    Carian}) (NOT \p{Block=Carian}) (49)
1889          \p{Carian}              \p{Script_Extensions=Carian} (Short:
1890                                    \p{Cari}; NOT \p{Block=Carian}) (49)
1891          \p{Case_Ignorable}      \p{Case_Ignorable=Y} (Short: \p{CI}) (2707)
1892          \p{Case_Ignorable: N*}  (Short: \p{CI=N}, \P{CI}) (1_111_405 plus
1893                                    all above-Unicode code points: [\x00-
1894                                    \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-Z
1895                                    \[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-\xac\xae
1896                                    \xb0-\xb3\xb5-\xb6\xb9-\xff],
1897                                    U+0100..02AF, U+0370..0373,
                                    U+0376..0379, U+037B..0383, U+0386 ...)
          \p{Case_Ignorable: Y*}  (Short: \p{CI=Y}, \p{CI}) (2707: [\'.:\^`
                                    \xa8\xad\xaf\xb4\xb7-\xb8],
                                    U+02B0..036F, U+0374..0375, U+037A,
                                    U+0384..0385, U+0387 ...)
          \p{Cased}               \p{Cased=Y} (4526)
          \p{Cased: N*}           (Single: \P{Cased}) (1_109_586 plus all
                                    above-Unicode code points: [\x00-\x20!
                                    \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
                                    \^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9
                                    \xbb-\xbf\xd7\xf7], U+01BB,
                                    U+01C0..01C3, U+0294, U+02B9..02BF,
                                    U+02C2..02DF ...)
          \p{Cased: Y*}           (Single: \p{Cased}) (4526: [A-Za-z\xaa
                                    \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
                                    U+0100..01BA, U+01BC..01BF,
                                    U+01C4..0293, U+0295..02B8, U+02C0..02C1
                                    ...)
          \p{Cased_Letter}        \p{General_Category=Cased_Letter} (Short:
                                    \p{LC}) (4095)
          \p{Category: *}         \p{General_Category: *}
          \p{Caucasian_Albanian}  \p{Script_Extensions=Caucasian_Albanian}
                                    (Short: \p{Aghb}; NOT \p{Block=
                                    Caucasian_Albanian}) (53)
          \p{Cc}                  \p{XPosixCntrl} (= \p{General_Category=
                                    Control}) (65)
          \p{Ccc: *}              \p{Canonical_Combining_Class: *}
          \p{CE}                  \p{Composition_Exclusion} (=
                                    \p{Composition_Exclusion=Y}) (81)
          \p{CE: *}               \p{Composition_Exclusion: *}
          \p{Cf}                  \p{Format} (= \p{General_Category=Format})
                                    (170)
          \p{Chakma}              \p{Script_Extensions=Chakma} (Short:
                                    \p{Cakm}; NOT \p{Block=Chakma}) (91)
          \p{Cham}                \p{Script_Extensions=Cham} (NOT \p{Block=
                                    Cham}) (83)
          \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
                                    \p{CWCF}) (1506)
          \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
                                    (1_112_606 plus all above-Unicode code
                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
                                    \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
                                    \xb4\xb6-\xbf\xd7\xe0-\xff], U+0101,
                                    U+0103, U+0105, U+0107, U+0109 ...)
          \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
                                    (1506: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
                                    U+0100, U+0102, U+0104, U+0106, U+0108
                                    ...)
          \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
                                    \p{CWCM}) (2927)
          \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
                                    (1_111_185 plus all above-Unicode code
                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
                                    \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xb4
                                    \xb6-\xbf\xd7\xf7], U+0138, U+018D,
                                    U+019B, U+01AA..01AB, U+01BA..01BB ...)
          \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
                                    (2927: [A-Za-z\xb5\xc0-\xd6\xd8-\xf6
                                    \xf8-\xff], U+0100..0137, U+0139..018C,
                                    U+018E..019A, U+019C..01A9, U+01AC..01B9
                                    ...)
          \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
                                    \p{CWL}) (1433)
          \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
                                    (1_112_679 plus all above-Unicode code
                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
                                    \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
                                    \xbf\xd7\xdf-\xff], U+0101, U+0103,
                                    U+0105, U+0107, U+0109 ...)
          \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1433:
                                    [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
                                    U+0104, U+0106, U+0108 ...)
          \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
                                    Y} (Short: \p{CWKCF}) (10_491)
          \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
                                    \P{CWKCF}) (1_103_621 plus all above-
                                    Unicode code points: [\x00-\x20!\"#\$
                                    \%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-
                                    z\{\|\}~\x7f-\x9f\xa1-\xa7\xa9\xab-\xac
                                    \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xd7\xe0-
                                    \xff], U+0101, U+0103, U+0105, U+0107,
                                    U+0109 ...)
          \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
                                    \p{CWKCF}) (10_491: [A-Z\xa0\xa8\xaa
                                    \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
                                    \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
                                    U+0106, U+0108 ...)
          \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
                                    \p{CWT}) (1452)
          \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
                                    (1_112_660 plus all above-Unicode code
                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
                                    \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
                                    \xb4\xb6-\xde\xf7], U+0100, U+0102,
                                    U+0104, U+0106, U+0108 ...)
          \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1452:
                                    [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
                                    U+0103, U+0105, U+0107, U+0109 ...)
          \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
                                    \p{CWU}) (1525)
          \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
                                    (1_112_587 plus all above-Unicode code
                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
                                    \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
                                    \xb4\xb6-\xde\xf7], U+0100, U+0102,
                                    U+0104, U+0106, U+0108 ...)
          \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1525:
                                    [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
                                    U+0103, U+0105, U+0107, U+0109 ...)
          \p{Cher}                \p{Cherokee} (= \p{Script_Extensions=
                                    Cherokee}) (NOT \p{Block=Cherokee}) (172)
          \p{Cherokee}            \p{Script_Extensions=Cherokee} (Short:
                                    \p{Cher}; NOT \p{Block=Cherokee}) (172)
        X \p{Cherokee_Sup}        \p{Cherokee_Supplement} (= \p{Block=
                                    Cherokee_Supplement}) (80)
        X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
                                    \p{InCherokeeSup}) (80)
        X \p{Chess_Symbols}       \p{Block=Chess_Symbols} (112)
          \p{Chorasmian}          \p{Script_Extensions=Chorasmian} (Short:
                                    \p{Chrs}; NOT \p{Block=Chorasmian}) (28)
          \p{Chrs}                \p{Chorasmian} (= \p{Script_Extensions=
                                    Chorasmian}) (NOT \p{Block=Chorasmian})
                                    (28)
          \p{CI}                  \p{Case_Ignorable} (= \p{Case_Ignorable=
                                    Y}) (2707)
          \p{CI: *}               \p{Case_Ignorable: *}
        X \p{CJK}                 \p{CJK_Unified_Ideographs} (= \p{Block=
                                    CJK_Unified_Ideographs}) (20_992)
        X \p{CJK_Compat}          \p{CJK_Compatibility} (= \p{Block=
                                    CJK_Compatibility}) (256)
        X \p{CJK_Compat_Forms}    \p{CJK_Compatibility_Forms} (= \p{Block=
                                    CJK_Compatibility_Forms}) (32)
        X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
                                    \p{Block=CJK_Compatibility_Ideographs})
                                    (512)
        X \p{CJK_Compat_Ideographs_Sup}
                                    \p{CJK_Compatibility_Ideographs_-
                                    Supplement} (= \p{Block=
                                    CJK_Compatibility_Ideographs_-
                                    Supplement}) (544)
        X \p{CJK_Compatibility}   \p{Block=CJK_Compatibility} (Short:
                                    \p{InCJKCompat}) (256)
        X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
                                    (Short: \p{InCJKCompatForms}) (32)
        X \p{CJK_Compatibility_Ideographs} \p{Block=
                                    CJK_Compatibility_Ideographs} (Short:
                                    \p{InCJKCompatIdeographs}) (512)
        X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
                                    CJK_Compatibility_Ideographs_Supplement}
                                    (Short: \p{InCJKCompatIdeographsSup})
                                    (544)
        X \p{CJK_Ext_A}           \p{CJK_Unified_Ideographs_Extension_A} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_A})
                                    (6592)
        X \p{CJK_Ext_B}           \p{CJK_Unified_Ideographs_Extension_B} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_B})
                                    (42_720)
        X \p{CJK_Ext_C}           \p{CJK_Unified_Ideographs_Extension_C} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_C})
                                    (4160)
        X \p{CJK_Ext_D}           \p{CJK_Unified_Ideographs_Extension_D} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_D})
                                    (224)
        X \p{CJK_Ext_E}           \p{CJK_Unified_Ideographs_Extension_E} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_E})
                                    (5776)
        X \p{CJK_Ext_F}           \p{CJK_Unified_Ideographs_Extension_F} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_F})
                                    (7488)
        X \p{CJK_Ext_G}           \p{CJK_Unified_Ideographs_Extension_G} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_G})
                                    (4944)
        X \p{CJK_Ext_H}           \p{CJK_Unified_Ideographs_Extension_H} (=
                                    \p{Block=
                                    CJK_Unified_Ideographs_Extension_H})
                                    (4192)
        X \p{CJK_Radicals_Sup}    \p{CJK_Radicals_Supplement} (= \p{Block=
                                    CJK_Radicals_Supplement}) (128)
        X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
                                    (Short: \p{InCJKRadicalsSup}) (128)
        X \p{CJK_Strokes}         \p{Block=CJK_Strokes} (48)
        X \p{CJK_Symbols}         \p{CJK_Symbols_And_Punctuation} (=
                                    \p{Block=CJK_Symbols_And_Punctuation})
                                    (64)
        X \p{CJK_Symbols_And_Punctuation} \p{Block=
                                    CJK_Symbols_And_Punctuation} (Short:
                                    \p{InCJKSymbols}) (64)
        X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
                                    (Short: \p{InCJK}) (20_992)
        X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
                                    CJK_Unified_Ideographs_Extension_A}
                                    (Short: \p{InCJKExtA}) (6592)
        X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
                                    CJK_Unified_Ideographs_Extension_B}
                                    (Short: \p{InCJKExtB}) (42_720)
        X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
                                    CJK_Unified_Ideographs_Extension_C}
                                    (Short: \p{InCJKExtC}) (4160)
        X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
                                    CJK_Unified_Ideographs_Extension_D}
                                    (Short: \p{InCJKExtD}) (224)
        X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
                                    CJK_Unified_Ideographs_Extension_E}
                                    (Short: \p{InCJKExtE}) (5776)
        X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
                                    CJK_Unified_Ideographs_Extension_F}
                                    (Short: \p{InCJKExtF}) (7488)
        X \p{CJK_Unified_Ideographs_Extension_G} \p{Block=
                                    CJK_Unified_Ideographs_Extension_G}
                                    (Short: \p{InCJKExtG}) (4944)
        X \p{CJK_Unified_Ideographs_Extension_H} \p{Block=
                                    CJK_Unified_Ideographs_Extension_H}
                                    (Short: \p{InCJKExtH}) (4192)
          \p{Close_Punctuation}   \p{General_Category=Close_Punctuation}
                                    (Short: \p{Pe}) (77)
          \p{Cn}                  \p{Unassigned} (= \p{General_Category=
                                    Unassigned}) (825_345 plus all above-
                                    Unicode code points)
          \p{Cntrl}               \p{XPosixCntrl} (= \p{General_Category=
                                    Control}) (65)
          \p{Co}                  \p{Private_Use} (= \p{General_Category=
                                    Private_Use}) (NOT \p{Private_Use_Area})
                                    (137_468)
        X \p{Combining_Diacritical_Marks} \p{Block=
                                    Combining_Diacritical_Marks} (Short:
                                    \p{InDiacriticals}) (112)
        X \p{Combining_Diacritical_Marks_Extended} \p{Block=
                                    Combining_Diacritical_Marks_Extended}
                                    (Short: \p{InDiacriticalsExt}) (80)
        X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (Short: \p{InDiacriticalsForSymbols})
                                    (48)
        X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
                                    Combining_Diacritical_Marks_Supplement}
                                    (Short: \p{InDiacriticalsSup}) (64)
        X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
                                    \p{InHalfMarks}) (16)
          \p{Combining_Mark}      \p{Mark} (= \p{General_Category=Mark})
                                    (2450)
        X \p{Combining_Marks_For_Symbols}
                                    \p{Combining_Diacritical_Marks_For_-
                                    Symbols} (= \p{Block=
                                    Combining_Diacritical_Marks_For_-
                                    Symbols}) (48)
          \p{Common}              \p{Script_Extensions=Common} (Short:
                                    \p{Zyyy}) (7873)
        X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
                                    (Short: \p{InIndicNumberForms}) (16)
          \p{Comp_Ex}             \p{Full_Composition_Exclusion} (=
                                    \p{Full_Composition_Exclusion=Y}) (1120)
          \p{Comp_Ex: *}          \p{Full_Composition_Exclusion: *}
        X \p{Compat_Jamo}         \p{Hangul_Compatibility_Jamo} (= \p{Block=
                                    Hangul_Compatibility_Jamo}) (96)
          \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
                                    \p{CE}) (81)
          \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
                                    plus all above-Unicode code points:
                                    U+0000..0957, U+0960..09DB, U+09DE,
                                    U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
                                    ...)
          \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
                                    U+0958..095F, U+09DC..09DD, U+09DF,
                                    U+0A33, U+0A36, U+0A59..0A5B ...)
          \p{Connector_Punctuation} \p{General_Category=
                                    Connector_Punctuation} (Short: \p{Pc})
                                    (10)
          \p{Control}             \p{XPosixCntrl} (= \p{General_Category=
                                    Control}) (65)
        X \p{Control_Pictures}    \p{Block=Control_Pictures} (64)
          \p{Copt}                \p{Coptic} (= \p{Script_Extensions=
                                    Coptic}) (NOT \p{Block=Coptic}) (165)
          \p{Coptic}              \p{Script_Extensions=Coptic} (Short:
                                    \p{Copt}; NOT \p{Block=Coptic}) (165)
        X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
        X \p{Counting_Rod}        \p{Counting_Rod_Numerals} (= \p{Block=
                                    Counting_Rod_Numerals}) (32)
        X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
                                    \p{InCountingRod}) (32)
          \p{Cpmn}                \p{Cypro_Minoan} (= \p{Script_Extensions=
                                    Cypro_Minoan}) (NOT \p{Block=
                                    Cypro_Minoan}) (101)
          \p{Cprt}                \p{Cypriot} (= \p{Script_Extensions=
                                    Cypriot}) (112)
          \p{Cs}                  \p{Surrogate} (= \p{General_Category=
                                    Surrogate}) (2048)
          \p{Cuneiform}           \p{Script_Extensions=Cuneiform} (Short:
                                    \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
        X \p{Cuneiform_Numbers}   \p{Cuneiform_Numbers_And_Punctuation} (=
                                    \p{Block=
                                    Cuneiform_Numbers_And_Punctuation}) (128)
        X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
                                    Cuneiform_Numbers_And_Punctuation}
                                    (Short: \p{InCuneiformNumbers}) (128)
          \p{Currency_Symbol}     \p{General_Category=Currency_Symbol}
                                    (Short: \p{Sc}) (63)
        X \p{Currency_Symbols}    \p{Block=Currency_Symbols} (48)
          \p{CWCF}                \p{Changes_When_Casefolded} (=
                                    \p{Changes_When_Casefolded=Y}) (1506)
          \p{CWCF: *}             \p{Changes_When_Casefolded: *}
          \p{CWCM}                \p{Changes_When_Casemapped} (=
                                    \p{Changes_When_Casemapped=Y}) (2927)
          \p{CWCM: *}             \p{Changes_When_Casemapped: *}
          \p{CWKCF}               \p{Changes_When_NFKC_Casefolded} (=
                                    \p{Changes_When_NFKC_Casefolded=Y})
                                    (10_491)
          \p{CWKCF: *}            \p{Changes_When_NFKC_Casefolded: *}
          \p{CWL}                 \p{Changes_When_Lowercased} (=
                                    \p{Changes_When_Lowercased=Y}) (1433)
          \p{CWL: *}              \p{Changes_When_Lowercased: *}
          \p{CWT}                 \p{Changes_When_Titlecased} (=
                                    \p{Changes_When_Titlecased=Y}) (1452)
          \p{CWT: *}              \p{Changes_When_Titlecased: *}
          \p{CWU}                 \p{Changes_When_Uppercased} (=
                                    \p{Changes_When_Uppercased=Y}) (1525)
          \p{CWU: *}              \p{Changes_When_Uppercased: *}
          \p{Cypriot}             \p{Script_Extensions=Cypriot} (Short:
                                    \p{Cprt}) (112)
        X \p{Cypriot_Syllabary}   \p{Block=Cypriot_Syllabary} (64)
          \p{Cypro_Minoan}        \p{Script_Extensions=Cypro_Minoan} (Short:
                                    \p{Cpmn}; NOT \p{Block=Cypro_Minoan})
                                    (101)
          \p{Cyrillic}            \p{Script_Extensions=Cyrillic} (Short:
                                    \p{Cyrl}; NOT \p{Block=Cyrillic}) (510)
        X \p{Cyrillic_Ext_A}      \p{Cyrillic_Extended_A} (= \p{Block=
                                    Cyrillic_Extended_A}) (32)
        X \p{Cyrillic_Ext_B}      \p{Cyrillic_Extended_B} (= \p{Block=
                                    Cyrillic_Extended_B}) (96)
        X \p{Cyrillic_Ext_C}      \p{Cyrillic_Extended_C} (= \p{Block=
                                    Cyrillic_Extended_C}) (16)
        X \p{Cyrillic_Ext_D}      \p{Cyrillic_Extended_D} (= \p{Block=
                                    Cyrillic_Extended_D}) (96)
        X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
                                    \p{InCyrillicExtA}) (32)
        X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
                                    \p{InCyrillicExtB}) (96)
        X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
                                    \p{InCyrillicExtC}) (16)
        X \p{Cyrillic_Extended_D} \p{Block=Cyrillic_Extended_D} (Short:
                                    \p{InCyrillicExtD}) (96)
        X \p{Cyrillic_Sup}        \p{Cyrillic_Supplement} (= \p{Block=
                                    Cyrillic_Supplement}) (48)
        X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
                                    \p{InCyrillicSup}) (48)
        X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
                                    Cyrillic_Supplement}) (48)
          \p{Cyrl}                \p{Cyrillic} (= \p{Script_Extensions=
                                    Cyrillic}) (NOT \p{Block=Cyrillic}) (510)
          \p{Dash}                \p{Dash=Y} (30)
          \p{Dash: N*}            (Single: \P{Dash}) (1_114_082 plus all
                                    above-Unicode code points: [\x00-\x20!
                                    \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
                                    \[\\\]\^_`a-z\{\|\}~\x7f-\xff],
                                    U+0100..0589, U+058B..05BD,
                                    U+05BF..13FF, U+1401..1805, U+1807..200F
                                    ...)
          \p{Dash: Y*}            (Single: \p{Dash}) (30: [\-], U+058A,
                                    U+05BE, U+1400, U+1806, U+2010..2015 ...)
          \p{Dash_Punctuation}    \p{General_Category=Dash_Punctuation}
                                    (Short: \p{Pd}) (26)
          \p{Decimal_Number}      \p{XPosixDigit} (= \p{General_Category=
                                    Decimal_Number}) (680)
          \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
                                    (13_233)
          \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_233:
                                    [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd
                                    \xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-\xfd
                                    \xff], U+0100..010F, U+0112..0125,
                                    U+0128..0130, U+0134..0137, U+0139..013E
                                    ...)
          \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
                                    U+2460..2473, U+24B6..24EA,
                                    U+3244..3247, U+3251..327E,
                                    U+3280..32BF, U+32D0..32FE ...)
          \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
          \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720: [\xa8
                                    \xaf\xb4-\xb5\xb8], U+0132..0133,
                                    U+013F..0140, U+0149, U+017F,
                                    U+01C4..01CC ...)
          \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
          \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
          \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
                                    U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
                                    ...)
          \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1194: U+2102,
                                    U+210A..2113, U+2115, U+2119..211D,
                                    U+2124, U+2128 ...)
          \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
          \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
                                    \xbe], U+2150..215F, U+2189)
          \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
          \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
                                    U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
                                    ...)
          \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
          \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
                                    U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
                                    ...)
          \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
          \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
                                    U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
                                    ...)
          \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
          \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
                                    U+FF61..FFBE, U+FFC2..FFC7,
                                    U+FFCA..FFCF, U+FFD2..FFD7,
                                    U+FFDA..FFDC, U+FFE8..FFEE)
          \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
          \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
                                    U+0F0C, U+2007, U+2011, U+202F)
          \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
                                    Non_Canonical} (Perl extension) (3796)
          \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
                                    decompositions (Short: \p{Dt=NonCanon})
                                    (Perl extension) (3796: [\xa0\xa8\xaa
                                    \xaf\xb2-\xb5\xb8-\xba\xbc-\xbe],
                                    U+0132..0133, U+013F..0140, U+0149,
                                    U+017F, U+01C4..01CC ...)
          \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_083 plus
                                    all above-Unicode code points: [\x00-
                                    \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
                                    \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-\xdf
                                    \xe6\xf0\xf7-\xf8\xfe], U+0110..0111,
                                    U+0126..0127, U+0131, U+0138,
                                    U+0141..0142 ...)
          \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
                                    U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
          \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
          \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
          \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
                                    U+32CC..32CF, U+32FF..3357,
                                    U+3371..33DF, U+33FF, U+1F130..1F14F ...)
          \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (64: U+1D62..1D6A,
                                    U+2080..208E, U+2090..209C, U+2C7C,
                                    U+1E051..1E06A)
          \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (249)
          \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (249: [\xaa\xb2-
                                    \xb3\xb9-\xba], U+02B0..02B8,
                                    U+02E0..02E4, U+10FC, U+1D2C..1D2E,
                                    U+1D30..1D3A ...)
          \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
          \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
                                    U+30FF, U+FE10..FE19, U+FE30..FE44,
                                    U+FE47..FE48)
          \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
                                    U+FF01..FF60, U+FFE0..FFE6)
2351          \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
2352                                    Y} (Short: \p{DI}) (4174)
2353          \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
2354                                    (1_109_938 plus all above-Unicode code
2355                                    points: [\x00-\xac\xae-\xff],
2356                                    U+0100..034E, U+0350..061B,
2357                                    U+061D..115E, U+1161..17B3, U+17B6..180A
2358                                    ...)
2359          \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
2360                                    (4174: [\xad], U+034F, U+061C,
2361                                    U+115F..1160, U+17B4..17B5, U+180B..180F
2362                                    ...)
2363          \p{Dep}                 \p{Deprecated} (= \p{Deprecated=Y}) (15)
2364          \p{Dep: *}              \p{Deprecated: *}
2365          \p{Deprecated}          \p{Deprecated=Y} (Short: \p{Dep}) (15)
2366          \p{Deprecated: N*}      (Short: \p{Dep=N}, \P{Dep}) (1_114_097
2367                                    plus all above-Unicode code points:
2368                                    U+0000..0148, U+014A..0672,
2369                                    U+0674..0F76, U+0F78, U+0F7A..17A2,
2370                                    U+17A5..2069 ...)
2371          \p{Deprecated: Y*}      (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
2372                                    U+0673, U+0F77, U+0F79, U+17A3..17A4,
2373                                    U+206A..206F ...)
          \p{Deseret}             \p{Script_Extensions=Deseret} (Short:
                                    \p{Dsrt}) (80)
          \p{Deva}                \p{Devanagari} (= \p{Script_Extensions=
                                    Devanagari}) (NOT \p{Block=Devanagari})
                                    (220)
          \p{Devanagari}          \p{Script_Extensions=Devanagari} (Short:
                                    \p{Deva}; NOT \p{Block=Devanagari}) (220)
        X \p{Devanagari_Ext}      \p{Devanagari_Extended} (= \p{Block=
                                    Devanagari_Extended}) (32)
        X \p{Devanagari_Ext_A}    \p{Devanagari_Extended_A} (= \p{Block=
                                    Devanagari_Extended_A}) (96)
        X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
                                    \p{InDevanagariExt}) (32)
        X \p{Devanagari_Extended_A} \p{Block=Devanagari_Extended_A} (Short:
                                    \p{InDevanagariExtA}) (96)
          \p{DI}                  \p{Default_Ignorable_Code_Point} (=
                                    \p{Default_Ignorable_Code_Point=Y})
                                    (4174)
          \p{DI: *}               \p{Default_Ignorable_Code_Point: *}
          \p{Dia}                 \p{Diacritic} (= \p{Diacritic=Y}) (1144)
          \p{Dia: *}              \p{Diacritic: *}
          \p{Diacritic}           \p{Diacritic=Y} (Short: \p{Dia}) (1144)
          \p{Diacritic: N*}       (Short: \p{Dia=N}, \P{Dia}) (1_112_968
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                                    >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
                                    \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
                                    U+0100..02AF, U+034F, U+0358..035C,
                                    U+0363..0373, U+0376..0379 ...)
          \p{Diacritic: Y*}       (Short: \p{Dia=Y}, \p{Dia}) (1144: [\^`
                                    \xa8\xaf\xb4\xb7-\xb8], U+02B0..034E,
                                    U+0350..0357, U+035D..0362,
                                    U+0374..0375, U+037A ...)
        X \p{Diacriticals}        \p{Combining_Diacritical_Marks} (=
                                    \p{Block=Combining_Diacritical_Marks})
                                    (112)
        X \p{Diacriticals_Ext}    \p{Combining_Diacritical_Marks_Extended}
                                    (= \p{Block=
                                    Combining_Diacritical_Marks_Extended})
                                    (80)
        X \p{Diacriticals_For_Symbols}
                                    \p{Combining_Diacritical_Marks_For_Symbols}
                                    (= \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols})
                                    (48)
        X \p{Diacriticals_Sup}    \p{Combining_Diacritical_Marks_Supplement}
                                    (= \p{Block=
                                    Combining_Diacritical_Marks_Supplement})
                                    (64)
          \p{Diak}                \p{Dives_Akuru} (= \p{Script_Extensions=
                                    Dives_Akuru}) (NOT \p{Block=
                                    Dives_Akuru}) (72)
          \p{Digit}               \p{XPosixDigit} (= \p{General_Category=
                                    Decimal_Number}) (680)
        X \p{Dingbats}            \p{Block=Dingbats} (192)
          \p{Dives_Akuru}         \p{Script_Extensions=Dives_Akuru} (Short:
                                    \p{Diak}; NOT \p{Block=Dives_Akuru}) (72)
          \p{Dogr}                \p{Dogra} (= \p{Script_Extensions=Dogra})
                                    (NOT \p{Block=Dogra}) (82)
          \p{Dogra}               \p{Script_Extensions=Dogra} (Short:
                                    \p{Dogr}; NOT \p{Block=Dogra}) (82)
        X \p{Domino}              \p{Domino_Tiles} (= \p{Block=
                                    Domino_Tiles}) (112)
        X \p{Domino_Tiles}        \p{Block=Domino_Tiles} (Short:
                                    \p{InDomino}) (112)
          \p{Dsrt}                \p{Deseret} (= \p{Script_Extensions=
                                    Deseret}) (80)
          \p{Dt: *}               \p{Decomposition_Type: *}
          \p{Dupl}                \p{Duployan} (= \p{Script_Extensions=
                                    Duployan}) (NOT \p{Block=Duployan}) (147)
          \p{Duployan}            \p{Script_Extensions=Duployan} (Short:
                                    \p{Dupl}; NOT \p{Block=Duployan}) (147)
          \p{Ea: *}               \p{East_Asian_Width: *}
        X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
                                    (208)
          \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
          \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739: [\xa1
                                    \xa4\xa7-\xa8\xaa\xad-\xae\xb0-\xb4\xb6-
                                    \xba\xbc-\xbf\xc6\xd0\xd7-\xd8\xde-\xe1
                                    \xe6\xe8-\xea\xec-\xed\xf0\xf2-\xf3\xf7-
                                    \xfa\xfc\xfe], U+0101, U+0111, U+0113,
                                    U+011B, U+0126..0127 ...)
          \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
          \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
                                    U+FF01..FF60, U+FFE0..FFE6)
          \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
          \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
                                    U+FF61..FFBE, U+FFC2..FFC7,
                                    U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
                                    ...)
          \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (792_623 plus
                                    all above-Unicode code points)
          \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
          \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-\x7e
                                    \xa2-\xa3\xa5-\xa6\xac\xaf],
                                    U+27E6..27ED, U+2985..2986)
          \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (792_623 plus all
                                    above-Unicode code points: [\x00-\x1f
                                    \x7f-\xa0\xa9\xab\xb5\xbb\xc0-\xc5\xc7-
                                    \xcf\xd1-\xd6\xd9-\xdd\xe2-\xe5\xe7\xeb
                                    \xee-\xef\xf1\xf4-\xf6\xfb\xfd\xff],
                                    U+00FF..0100, U+0102..0110, U+0112,
                                    U+0114..011A, U+011C..0125 ...)
          \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (182_412)
          \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (182_412:
                                    U+1100..115F, U+231A..231B,
                                    U+2329..232A, U+23E9..23EC, U+23F0,
                                    U+23F3 ...)
          \p{EBase}               \p{Emoji_Modifier_Base} (=
                                    \p{Emoji_Modifier_Base=Y}) (134)
          \p{EBase: *}            \p{Emoji_Modifier_Base: *}
          \p{EComp}               \p{Emoji_Component} (= \p{Emoji_Component=
                                    Y}) (146)
          \p{EComp: *}            \p{Emoji_Component: *}
          \p{Egyp}                \p{Egyptian_Hieroglyphs} (=
                                    \p{Script_Extensions=
                                    Egyptian_Hieroglyphs}) (NOT \p{Block=
                                    Egyptian_Hieroglyphs}) (1110)
        X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
                                    Egyptian_Hieroglyph_Format_Controls} (48)
          \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
                                    Egyptian_Hieroglyphs} (Short: \p{Egyp};
                                    NOT \p{Block=Egyptian_Hieroglyphs})
                                    (1110)
          \p{Elba}                \p{Elbasan} (= \p{Script_Extensions=
                                    Elbasan}) (NOT \p{Block=Elbasan}) (40)
          \p{Elbasan}             \p{Script_Extensions=Elbasan} (Short:
                                    \p{Elba}; NOT \p{Block=Elbasan}) (40)
          \p{Elym}                \p{Elymaic} (= \p{Script_Extensions=
                                    Elymaic}) (NOT \p{Block=Elymaic}) (23)
          \p{Elymaic}             \p{Script_Extensions=Elymaic} (Short:
                                    \p{Elym}; NOT \p{Block=Elymaic}) (23)
          \p{EMod}                \p{Emoji_Modifier} (= \p{Emoji_Modifier=
                                    Y}) (5)
          \p{EMod: *}             \p{Emoji_Modifier: *}
          \p{Emoji}               \p{Emoji=Y} (1424)
          \p{Emoji: N*}           (Single: \P{Emoji}) (1_112_688 plus all
                                    above-Unicode code points: [\x00-\x20!
                                    \"\$\%&\'\(\)+,\-.\/:;<=>?\@A-Z\[\\\]
                                    \^_`a-z\{\|\}~\x7f-\xa8\xaa-\xad\xaf-
                                    \xff], U+0100..203B, U+203D..2048,
                                    U+204A..2121, U+2123..2138, U+213A..2193
                                    ...)
          \p{Emoji: Y*}           (Single: \p{Emoji}) (1424: [#*0-9\xa9
                                    \xae], U+203C, U+2049, U+2122, U+2139,
                                    U+2194..2199 ...)
          \p{Emoji_Component}     \p{Emoji_Component=Y} (Short: \p{EComp})
                                    (146)
          \p{Emoji_Component: N*} (Short: \p{EComp=N}, \P{EComp}) (1_113_966
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"\$\%&\'\(\)+,\-.\/:;<=>?
                                    \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
                                    U+0100..200C, U+200E..20E2,
                                    U+20E4..FE0E, U+FE10..1F1E5,
                                    U+1F200..1F3FA ...)
          \p{Emoji_Component: Y*} (Short: \p{EComp=Y}, \p{EComp}) (146:
                                    [#*0-9], U+200D, U+20E3, U+FE0F,
                                    U+1F1E6..1F1FF, U+1F3FB..1F3FF ...)
          \p{Emoji_Modifier}      \p{Emoji_Modifier=Y} (Short: \p{EMod}) (5)
          \p{Emoji_Modifier: N*}  (Short: \p{EMod=N}, \P{EMod}) (1_114_107
                                    plus all above-Unicode code points:
                                    U+0000..1F3FA, U+1F400..infinity)
          \p{Emoji_Modifier: Y*}  (Short: \p{EMod=Y}, \p{EMod}) (5:
                                    U+1F3FB..1F3FF)
          \p{Emoji_Modifier_Base} \p{Emoji_Modifier_Base=Y} (Short:
                                    \p{EBase}) (134)
          \p{Emoji_Modifier_Base: N*} (Short: \p{EBase=N}, \P{EBase})
                                    (1_113_978 plus all above-Unicode code
                                    points: U+0000..261C, U+261E..26F8,
                                    U+26FA..2709, U+270E..1F384,
                                    U+1F386..1F3C1, U+1F3C5..1F3C6 ...)
          \p{Emoji_Modifier_Base: Y*} (Short: \p{EBase=Y}, \p{EBase}) (134:
                                    U+261D, U+26F9, U+270A..270D, U+1F385,
                                    U+1F3C2..1F3C4, U+1F3C7 ...)
          \p{Emoji_Presentation}  \p{Emoji_Presentation=Y} (Short:
                                    \p{EPres}) (1205)
          \p{Emoji_Presentation: N*} (Short: \p{EPres=N}, \P{EPres})
                                    (1_112_907 plus all above-Unicode code
                                    points: U+0000..2319, U+231C..23E8,
                                    U+23ED..23EF, U+23F1..23F2,
                                    U+23F4..25FC, U+25FF..2613 ...)
          \p{Emoji_Presentation: Y*} (Short: \p{EPres=Y}, \p{EPres}) (1205:
                                    U+231A..231B, U+23E9..23EC, U+23F0,
                                    U+23F3, U+25FD..25FE, U+2614..2615 ...)
        X \p{Emoticons}           \p{Block=Emoticons} (80)
        X \p{Enclosed_Alphanum}   \p{Enclosed_Alphanumerics} (= \p{Block=
                                    Enclosed_Alphanumerics}) (160)
        X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
                                    \p{Block=
                                    Enclosed_Alphanumeric_Supplement}) (256)
        X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
                                    Enclosed_Alphanumeric_Supplement}
                                    (Short: \p{InEnclosedAlphanumSup}) (256)
        X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
                                    (Short: \p{InEnclosedAlphanum}) (160)
        X \p{Enclosed_CJK}        \p{Enclosed_CJK_Letters_And_Months} (=
                                    \p{Block=
                                    Enclosed_CJK_Letters_And_Months}) (256)
        X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
                                    Enclosed_CJK_Letters_And_Months} (Short:
                                    \p{InEnclosedCJK}) (256)
        X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
                                    (= \p{Block=
                                    Enclosed_Ideographic_Supplement}) (256)
        X \p{Enclosed_Ideographic_Supplement} \p{Block=
                                    Enclosed_Ideographic_Supplement} (Short:
                                    \p{InEnclosedIdeographicSup}) (256)
          \p{Enclosing_Mark}      \p{General_Category=Enclosing_Mark}
                                    (Short: \p{Me}) (13)
          \p{EPres}               \p{Emoji_Presentation} (=
                                    \p{Emoji_Presentation=Y}) (1205)
          \p{EPres: *}            \p{Emoji_Presentation: *}
          \p{Ethi}                \p{Ethiopic} (= \p{Script_Extensions=
                                    Ethiopic}) (NOT \p{Block=Ethiopic}) (523)
          \p{Ethiopic}            \p{Script_Extensions=Ethiopic} (Short:
                                    \p{Ethi}; NOT \p{Block=Ethiopic}) (523)
        X \p{Ethiopic_Ext}        \p{Ethiopic_Extended} (= \p{Block=
                                    Ethiopic_Extended}) (96)
        X \p{Ethiopic_Ext_A}      \p{Ethiopic_Extended_A} (= \p{Block=
                                    Ethiopic_Extended_A}) (48)
        X \p{Ethiopic_Ext_B}      \p{Ethiopic_Extended_B} (= \p{Block=
                                    Ethiopic_Extended_B}) (32)
        X \p{Ethiopic_Extended}   \p{Block=Ethiopic_Extended} (Short:
                                    \p{InEthiopicExt}) (96)
        X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
                                    \p{InEthiopicExtA}) (48)
        X \p{Ethiopic_Extended_B} \p{Block=Ethiopic_Extended_B} (Short:
                                    \p{InEthiopicExtB}) (32)
        X \p{Ethiopic_Sup}        \p{Ethiopic_Supplement} (= \p{Block=
                                    Ethiopic_Supplement}) (32)
        X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
                                    \p{InEthiopicSup}) (32)
          \p{Ext}                 \p{Extender} (= \p{Extender=Y}) (50)
          \p{Ext: *}              \p{Extender: *}
          \p{Extended_Pictographic} \p{Extended_Pictographic=Y} (Short:
                                    \p{ExtPict}) (3537)
          \p{Extended_Pictographic: N*} (Short: \p{ExtPict=N}, \P{ExtPict})
                                    (1_110_575 plus all above-Unicode code
                                    points: [\x00-\xa8\xaa-\xad\xaf-\xff],
                                    U+0100..203B, U+203D..2048,
                                    U+204A..2121, U+2123..2138, U+213A..2193
                                    ...)
          \p{Extended_Pictographic: Y*} (Short: \p{ExtPict=Y}, \p{ExtPict})
                                    (3537: [\xa9\xae], U+203C, U+2049,
                                    U+2122, U+2139, U+2194..2199 ...)
          \p{Extender}            \p{Extender=Y} (Short: \p{Ext}) (50)
          \p{Extender: N*}        (Short: \p{Ext=N}, \P{Ext}) (1_114_062
                                    plus all above-Unicode code points:
                                    [\x00-\xb6\xb8-\xff], U+0100..02CF,
                                    U+02D2..063F, U+0641..07F9,
                                    U+07FB..0B54, U+0B56..0E45 ...)
          \p{Extender: Y*}        (Short: \p{Ext=Y}, \p{Ext}) (50: [\xb7],
                                    U+02D0..02D1, U+0640, U+07FA, U+0B55,
                                    U+0E46 ...)
          \p{ExtPict}             \p{Extended_Pictographic} (=
                                    \p{Extended_Pictographic=Y}) (3537)
          \p{ExtPict: *}          \p{Extended_Pictographic: *}
          \p{Final_Punctuation}   \p{General_Category=Final_Punctuation}
                                    (Short: \p{Pf}) (10)
          \p{Format}              \p{General_Category=Format} (Short:
                                    \p{Cf}) (170)
          \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
                                    (Short: \p{CompEx}) (1120)
          \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
                                    \P{CompEx}) (1_112_992 plus all above-
                                    Unicode code points: U+0000..033F,
                                    U+0342, U+0345..0373, U+0375..037D,
                                    U+037F..0386, U+0388..0957 ...)
          \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
                                    \p{CompEx}) (1120: U+0340..0341,
                                    U+0343..0344, U+0374, U+037E, U+0387,
                                    U+0958..095F ...)
          \p{Gc: *}               \p{General_Category: *}
          \p{GCB: *}              \p{Grapheme_Cluster_Break: *}
          \p{General_Category: C} \p{General_Category=Other} (965_096 plus
                                    all above-Unicode code points)
          \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
                                    \p{Gc=LC}, \p{LC}) (4095: [A-Za-z\xb5
                                    \xc0-\xd6\xd8-\xf6\xf8-\xff],
                                    U+0100..01BA, U+01BC..01BF,
                                    U+01C4..0293, U+0295..02AF, U+0370..0373
                                    ...)
          \p{General_Category: Cc} \p{General_Category=Control} (65)
          \p{General_Category: Cf} \p{General_Category=Format} (170)
          \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
                                    (77: [\)\]\}], U+0F3B, U+0F3D, U+169C,
                                    U+2046, U+207E ...)
          \p{General_Category: Cn} \p{General_Category=Unassigned} (825_345
                                    plus all above-Unicode code points)
          \p{General_Category: Cntrl} \p{General_Category=Control} (65)
          \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
          \p{General_Category: Combining_Mark} \p{General_Category=Mark}
                                    (2450)
          \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
                                    \p{Pc}) (10: [_], U+203F..2040, U+2054,
                                    U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
          \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
                                    [\x00-\x1f\x7f-\x9f])
          \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
          \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
                                    (63: [\$\xa2-\xa5], U+058F, U+060B,
                                    U+07FE..07FF, U+09F2..09F3, U+09FB ...)
          \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
                                    (26: [\-], U+058A, U+05BE, U+1400,
                                    U+1806, U+2010..2015 ...)
          \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
                                    (680: [0-9], U+0660..0669, U+06F0..06F9,
                                    U+07C0..07C9, U+0966..096F, U+09E6..09EF
                                    ...)
          \p{General_Category: Digit} \p{General_Category=Decimal_Number}
                                    (680)
          \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
                                    (13: U+0488..0489, U+1ABE, U+20DD..20E0,
                                    U+20E2..20E4, U+A670..A672)
          \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
                                    (10: [\xbb], U+2019, U+201D, U+203A,
                                    U+2E03, U+2E05 ...)
          \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (170:
                                    [\xad], U+0600..0605, U+061C, U+06DD,
                                    U+070F, U+0890..0891 ...)
          \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
                                    \p{Pi}) (12: [\xab], U+2018,
                                    U+201B..201C, U+201F, U+2039, U+2E02 ...)
          \p{General_Category: L} \p{General_Category=Letter} (136_104)
        X \p{General_Category: L&} \p{General_Category=Cased_Letter} (4095)
        X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
                                    the trailing '_' matters in spite of
                                    loose matching rules. (4095)
          \p{General_Category: LC} \p{General_Category=Cased_Letter} (4095)
          \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (136_104:
                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
                                    \xf8-\xff], U+0100..02C1, U+02C6..02D1,
                                    U+02E0..02E4, U+02EC, U+02EE ...)
          \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
                                    (236: U+16EE..16F0, U+2160..2182,
                                    U+2185..2188, U+3007, U+3021..3029,
                                    U+3038..303A ...)
          \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
                                    (1: U+2028)
          \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
                                    (/i= General_Category=Cased_Letter)
                                    (2233)
          \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (397)
          \p{General_Category: Lo} \p{General_Category=Other_Letter}
                                    (131_612)
          \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
                                    /i= General_Category=Cased_Letter)
                                    (2233: [a-z\xb5\xdf-\xf6\xf8-\xff],
                                    U+0101, U+0103, U+0105, U+0107, U+0109
                                    ...)
          \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
                                    (/i= General_Category=Cased_Letter) (31)
          \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
                                    (/i= General_Category=Cased_Letter)
                                    (1831)
          \p{General_Category: M} \p{General_Category=Mark} (2450)
          \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2450:
                                    U+0300..036F, U+0483..0489,
                                    U+0591..05BD, U+05BF, U+05C1..05C2,
                                    U+05C4..05C5 ...)
          \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
                                    [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
                                    U+0606..0608, U+2044, U+2052,
                                    U+207A..207C ...)
          \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (452)
          \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
          \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
                                    (1985)
          \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
                                    (397: U+02B0..02C1, U+02C6..02D1,
                                    U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
          \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
                                    (125: [\^`\xa8\xaf\xb4\xb8],
                                    U+02C2..02C5, U+02D2..02DF,
                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
          \p{General_Category: N} \p{General_Category=Number} (1831)
          \p{General_Category: Nd} \p{General_Category=Decimal_Number} (680)
          \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
          \p{General_Category: No} \p{General_Category=Other_Number} (915)
          \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
                                    (1985: U+0300..036F, U+0483..0487,
                                    U+0591..05BD, U+05BF, U+05C1..05C2,
                                    U+05C4..05C5 ...)
          \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1831: [0-9
                                    \xb2-\xb3\xb9\xbc-\xbe], U+0660..0669,
                                    U+06F0..06F9, U+07C0..07C9,
                                    U+0966..096F, U+09E6..09EF ...)
          \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
                                    (79: [\(\[\{], U+0F3A, U+0F3C, U+169B,
                                    U+201A, U+201E ...)
          \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (965_096 plus
                                    all above-Unicode code points: [\x00-
                                    \x1f\x7f-\x9f\xad], U+0378..0379,
                                    U+0380..0383, U+038B, U+038D, U+03A2 ...)
          \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
                                    (131_612: [\xaa\xba], U+01BB,
                                    U+01C0..01C3, U+0294, U+05D0..05EA,
                                    U+05EF..05F2 ...)
          \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
                                    (915: [\xb2-\xb3\xb9\xbc-\xbe],
                                    U+09F4..09F9, U+0B72..0B77,
                                    U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
                                    ...)
          \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
                                    (628: [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7
                                    \xb6-\xb7\xbf], U+037E, U+0387,
                                    U+055A..055F, U+0589, U+05C0 ...)
          \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
                                    (6634: [\xa6\xa9\xae\xb0], U+0482,
                                    U+058D..058E, U+060E..060F, U+06DE,
                                    U+06E9 ...)
          \p{General_Category: P} \p{General_Category=Punctuation} (842)
          \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
                                    \p{Zp}) (1: U+2029)
          \p{General_Category: Pc} \p{General_Category=
                                    Connector_Punctuation} (10)
          \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (26)
          \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
                                    (77)
          \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
                                    (10)
          \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
                                    (12)
          \p{General_Category: Po} \p{General_Category=Other_Punctuation}
                                    (628)
          \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
                                    (137_468: U+E000..F8FF, U+F0000..FFFFD,
                                    U+100000..10FFFD)
          \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (79)
          \p{General_Category: Punct} \p{General_Category=Punctuation} (842)
          \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (842:
                                    [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_\{\}
                                    \xa1\xa7\xab\xb6-\xb7\xbb\xbf], U+037E,
                                    U+0387, U+055A..055F, U+0589..058A,
                                    U+05BE ...)
          \p{General_Category: S} \p{General_Category=Symbol} (7770)
          \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (63)
          \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
                                    [\x20\xa0], U+1680, U+2000..200A,
                                    U+2028..2029, U+202F, U+205F ...)
          \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (125)
          \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
          \p{General_Category: So} \p{General_Category=Other_Symbol} (6634)
          \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
                                    (17: [\x20\xa0], U+1680, U+2000..200A,
                                    U+202F, U+205F, U+3000)
          \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
                                    (452: U+0903, U+093B, U+093E..0940,
                                    U+0949..094C, U+094E..094F, U+0982..0983
                                    ...)
          \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
                                    U+D800..DFFF)
          \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7770:
                                    [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
                                    \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
                                    U+02D2..02DF, U+02E5..02EB, U+02ED,
                                    U+02EF..02FF ...)
          \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
                                    /i= General_Category=Cased_Letter) (31:
                                    U+01C5, U+01C8, U+01CB, U+01F2,
                                    U+1F88..1F8F, U+1F98..1F9F ...)
          \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
                                    (825_345 plus all above-Unicode code
                                    points: U+0378..0379, U+0380..0383,
                                    U+038B, U+038D, U+03A2, U+0530 ...)
          \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
                                    /i= General_Category=Cased_Letter)
                                    (1831: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
                                    U+0102, U+0104, U+0106, U+0108 ...)
          \p{General_Category: Z} \p{General_Category=Separator} (19)
          \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
          \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
                                    (1)
          \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
        X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
                                    \p{InPunctuation}) (112)
        X \p{Geometric_Shapes}    \p{Block=Geometric_Shapes} (96)
        X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
                                    \p{Block=Geometric_Shapes_Extended})
2853                                    (128)
2854        X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
2855                                    (Short: \p{InGeometricShapesExt}) (128)
2856          \p{Geor}                \p{Georgian} (= \p{Script_Extensions=
2857                                    Georgian}) (NOT \p{Block=Georgian}) (174)
2858          \p{Georgian}            \p{Script_Extensions=Georgian} (Short:
2859                                    \p{Geor}; NOT \p{Block=Georgian}) (174)
2860        X \p{Georgian_Ext}        \p{Georgian_Extended} (= \p{Block=
2861                                    Georgian_Extended}) (48)
2862        X \p{Georgian_Extended}   \p{Block=Georgian_Extended} (Short:
2863                                    \p{InGeorgianExt}) (48)
2864        X \p{Georgian_Sup}        \p{Georgian_Supplement} (= \p{Block=
2865                                    Georgian_Supplement}) (48)
2866        X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
2867                                    \p{InGeorgianSup}) (48)
2868          \p{Glag}                \p{Glagolitic} (= \p{Script_Extensions=
2869                                    Glagolitic}) (NOT \p{Block=Glagolitic})
2870                                    (138)
2871          \p{Glagolitic}          \p{Script_Extensions=Glagolitic} (Short:
2872                                    \p{Glag}; NOT \p{Block=Glagolitic}) (138)
2873        X \p{Glagolitic_Sup}      \p{Glagolitic_Supplement} (= \p{Block=
2874                                    Glagolitic_Supplement}) (48)
2875        X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
2876                                    \p{InGlagoliticSup}) (48)
2877          \p{Gong}                \p{Gunjala_Gondi} (= \p{Script_Extensions=
2878                                    Gunjala_Gondi}) (NOT \p{Block=
2879                                    Gunjala_Gondi}) (65)
2880          \p{Gonm}                \p{Masaram_Gondi} (= \p{Script_Extensions=
2881                                    Masaram_Gondi}) (NOT \p{Block=
2882                                    Masaram_Gondi}) (77)
2883          \p{Goth}                \p{Gothic} (= \p{Script_Extensions=
2884                                    Gothic}) (NOT \p{Block=Gothic}) (27)
2885          \p{Gothic}              \p{Script_Extensions=Gothic} (Short:
2886                                    \p{Goth}; NOT \p{Block=Gothic}) (27)
2887          \p{Gr_Base}             \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
2888                                    (146_986)
2889          \p{Gr_Base: *}          \p{Grapheme_Base: *}
2890          \p{Gr_Ext}              \p{Grapheme_Extend} (= \p{Grapheme_Extend=
2891                                    Y}) (2125)
2892          \p{Gr_Ext: *}           \p{Grapheme_Extend: *}
2893          \p{Gran}                \p{Grantha} (= \p{Script_Extensions=
2894                                    Grantha}) (NOT \p{Block=Grantha}) (116)
2895          \p{Grantha}             \p{Script_Extensions=Grantha} (Short:
2896                                    \p{Gran}; NOT \p{Block=Grantha}) (116)
2897          \p{Graph}               \p{XPosixGraph} (286_635)
2898          \p{Grapheme_Base}       \p{Grapheme_Base=Y} (Short: \p{GrBase})
2899                                    (146_986)
2900          \p{Grapheme_Base: N*}   (Short: \p{GrBase=N}, \P{GrBase}) (967_126
2901                                    plus all above-Unicode code points:
2902                                    [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
2903                                    U+0378..0379, U+0380..0383, U+038B,
2904                                    U+038D ...)
2905          \p{Grapheme_Base: Y*}   (Short: \p{GrBase=Y}, \p{GrBase})
2906                                    (146_986: [\x20-\x7e\xa0-\xac\xae-\xff],
2907                                    U+0100..02FF, U+0370..0377,
2908                                    U+037A..037F, U+0384..038A, U+038C ...)
2909          \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
2910                                    (3893)
2911          \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3893: [^
2912                                    \n\r\x20-\x7e\xa0-\xac\xae-\xff],
2913                                    U+061C, U+180E, U+200B, U+200E..200F,
2914                                    U+2028..202E ...)
2915          \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
2916          \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
2917          \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
2918          \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
2919          \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
2920                                    (0)
2921          \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
2922                                    E_Base_GAZ} (0)
2923          \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
2924                                    E_Modifier} (0)
2925          \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
2926                                    (2130)
2927          \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (2130:
2928                                    U+0300..036F, U+0483..0489,
2929                                    U+0591..05BD, U+05BF, U+05C1..05C2,
2930                                    U+05C4..05C5 ...)
2931          \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
2932                                    Glue_After_Zwj} (0)
2933          \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
2934          \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
2935                                    U+1100..115F, U+A960..A97C)
2936          \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
2937          \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
2938                                    U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
2939                                    ...)
2940          \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
2941                                    U+AC01..AC1B, U+AC1D..AC37,
2942                                    U+AC39..AC53, U+AC55..AC6F,
2943                                    U+AC71..AC8B, U+AC8D..ACA7 ...)
2944          \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_109
2945                                    plus all above-Unicode code points:
2946                                    [\x20-\x7e\xa0-\xac\xae-\xff],
2947                                    U+0100..02FF, U+0370..0482,
2948                                    U+048A..0590, U+05BE, U+05C0 ...)
2949          \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
2950                                    (27)
2951          \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (27:
2952                                    U+0600..0605, U+06DD, U+070F,
2953                                    U+0890..0891, U+08E2, U+0D4E ...)
2954          \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
2955                                    (26: U+1F1E6..1F1FF)
2956          \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
2957                                    Regional_Indicator} (26)
2958          \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
2959                                    SpacingMark} (395)
2960          \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (395:
2961                                    U+0903, U+093B, U+093E..0940,
2962                                    U+0949..094C, U+094E..094F, U+0982..0983
2963                                    ...)
2964          \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
2965                                    U+11A8..11FF, U+D7CB..D7FB)
2966          \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
2967                                    U+1160..11A7, U+D7B0..D7C6)
2968          \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
2969                                    (1_096_109 plus all above-Unicode code
2970                                    points)
2971          \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
2972          \p{Grapheme_Extend}     \p{Grapheme_Extend=Y} (Short: \p{GrExt})
2973                                    (2125)
2974          \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_111_987
2975                                    plus all above-Unicode code points:
2976                                    U+0000..02FF, U+0370..0482,
2977                                    U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
2978          \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (2125:
2979                                    U+0300..036F, U+0483..0489,
2980                                    U+0591..05BD, U+05BF, U+05C1..05C2,
2981                                    U+05C4..05C5 ...)
2982          \p{Greek}               \p{Script_Extensions=Greek} (Short:
2983                                    \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
2984        X \p{Greek_And_Coptic}    \p{Block=Greek_And_Coptic} (Short:
2985                                    \p{InGreek}) (144)
2986        X \p{Greek_Ext}           \p{Greek_Extended} (= \p{Block=
2987                                    Greek_Extended}) (256)
2988        X \p{Greek_Extended}      \p{Block=Greek_Extended} (Short:
2989                                    \p{InGreekExt}) (256)
2990          \p{Grek}                \p{Greek} (= \p{Script_Extensions=Greek})
2991                                    (NOT \p{Greek_And_Coptic}) (522)
2992          \p{Gujarati}            \p{Script_Extensions=Gujarati} (Short:
2993                                    \p{Gujr}; NOT \p{Block=Gujarati}) (105)
2994          \p{Gujr}                \p{Gujarati} (= \p{Script_Extensions=
2995                                    Gujarati}) (NOT \p{Block=Gujarati}) (105)
2996          \p{Gunjala_Gondi}       \p{Script_Extensions=Gunjala_Gondi}
2997                                    (Short: \p{Gong}; NOT \p{Block=
2998                                    Gunjala_Gondi}) (65)
2999          \p{Gurmukhi}            \p{Script_Extensions=Gurmukhi} (Short:
3000                                    \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
3001          \p{Guru}                \p{Gurmukhi} (= \p{Script_Extensions=
3002                                    Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
3003        X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
3004                                    \p{Block=Halfwidth_And_Fullwidth_Forms})
3005                                    (240)
3006        X \p{Half_Marks}          \p{Combining_Half_Marks} (= \p{Block=
3007                                    Combining_Half_Marks}) (16)
3008        X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
3009                                    Halfwidth_And_Fullwidth_Forms} (Short:
3010                                    \p{InHalfAndFullForms}) (240)
3011          \p{Han}                 \p{Script_Extensions=Han} (98_696)
3012          \p{Hang}                \p{Hangul} (= \p{Script_Extensions=
3013                                    Hangul}) (NOT \p{Hangul_Syllables})
3014                                    (11_775)
3015          \p{Hangul}              \p{Script_Extensions=Hangul} (Short:
3016                                    \p{Hang}; NOT \p{Hangul_Syllables})
3017                                    (11_775)
3018        X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
3019                                    (Short: \p{InCompatJamo}) (96)
3020        X \p{Hangul_Jamo}         \p{Block=Hangul_Jamo} (Short: \p{InJamo})
3021                                    (256)
3022        X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
3023                                    (Short: \p{InJamoExtA}) (32)
3024        X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
3025                                    (Short: \p{InJamoExtB}) (80)
3026          \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
3027                                    (125)
3028          \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
3029                                    U+1100..115F, U+A960..A97C)
3030          \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
3031                                    (399)
3032          \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
3033                                    U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
3034                                    U+AC8C ...)
3035          \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
3036                                    LVT_Syllable} (10_773)
3037          \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
3038                                    (10_773: U+AC01..AC1B, U+AC1D..AC37,
3039                                    U+AC39..AC53, U+AC55..AC6F,
3040                                    U+AC71..AC8B, U+AC8D..ACA7 ...)
3041          \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
3042                                    Not_Applicable} (1_102_583 plus all
3043                                    above-Unicode code points)
3044          \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
3045                                    (1_102_583 plus all above-Unicode code
3046                                    points: U+0000..10FF, U+1200..A95F,
3047                                    U+A97D..ABFF, U+D7A4..D7AF,
3048                                    U+D7C7..D7CA, U+D7FC..infinity)
3049          \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
3050                                    (137)
3051          \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
3052                                    U+11A8..11FF, U+D7CB..D7FB)
3053          \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
3054                                    (95)
3055          \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
3056                                    U+1160..11A7, U+D7B0..D7C6)
3057        X \p{Hangul_Syllables}    \p{Block=Hangul_Syllables} (Short:
3058                                    \p{InHangul}) (11_184)
3059          \p{Hani}                \p{Han} (= \p{Script_Extensions=Han})
3060                                    (98_696)
3061          \p{Hanifi_Rohingya}     \p{Script_Extensions=Hanifi_Rohingya}
3062                                    (Short: \p{Rohg}; NOT \p{Block=
3063                                    Hanifi_Rohingya}) (55)
3064          \p{Hano}                \p{Hanunoo} (= \p{Script_Extensions=
3065                                    Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
3066          \p{Hanunoo}             \p{Script_Extensions=Hanunoo} (Short:
3067                                    \p{Hano}; NOT \p{Block=Hanunoo}) (23)
3068          \p{Hatr}                \p{Hatran} (= \p{Script_Extensions=
3069                                    Hatran}) (NOT \p{Block=Hatran}) (26)
3070          \p{Hatran}              \p{Script_Extensions=Hatran} (Short:
3071                                    \p{Hatr}; NOT \p{Block=Hatran}) (26)
3072          \p{Hebr}                \p{Hebrew} (= \p{Script_Extensions=
3073                                    Hebrew}) (NOT \p{Block=Hebrew}) (134)
3074          \p{Hebrew}              \p{Script_Extensions=Hebrew} (Short:
3075                                    \p{Hebr}; NOT \p{Block=Hebrew}) (134)
3076          \p{Hex}                 \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
3077          \p{Hex: *}              \p{Hex_Digit: *}
3078          \p{Hex_Digit}           \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
3079          \p{Hex_Digit: N*}       (Short: \p{Hex=N}, \P{Hex}) (1_114_068
3080                                    plus all above-Unicode code points:
3081                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
3082                                    \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
3083                                    U+0100..FF0F, U+FF1A..FF20,
3084                                    U+FF27..FF40, U+FF47..infinity)
3085          \p{Hex_Digit: Y*}       (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
3086                                    f], U+FF10..FF19, U+FF21..FF26,
3087                                    U+FF41..FF46)
3088        X \p{High_Private_Use_Surrogates} \p{Block=
3089                                    High_Private_Use_Surrogates} (Short:
3090                                    \p{InHighPUSurrogates}) (128)
3091        X \p{High_PU_Surrogates}  \p{High_Private_Use_Surrogates} (=
3092                                    \p{Block=High_Private_Use_Surrogates})
3093                                    (128)
3094        X \p{High_Surrogates}     \p{Block=High_Surrogates} (896)
3095          \p{Hira}                \p{Hiragana} (= \p{Script_Extensions=
3096                                    Hiragana}) (NOT \p{Block=Hiragana}) (433)
3097          \p{Hiragana}            \p{Script_Extensions=Hiragana} (Short:
3098                                    \p{Hira}; NOT \p{Block=Hiragana}) (433)
3099          \p{Hluw}                \p{Anatolian_Hieroglyphs} (=
3100                                    \p{Script_Extensions=
3101                                    Anatolian_Hieroglyphs}) (NOT \p{Block=
3102                                    Anatolian_Hieroglyphs}) (583)
3103          \p{Hmng}                \p{Pahawh_Hmong} (= \p{Script_Extensions=
3104                                    Pahawh_Hmong}) (NOT \p{Block=
3105                                    Pahawh_Hmong}) (127)
3106          \p{Hmnp}                \p{Nyiakeng_Puachue_Hmong} (=
3107                                    \p{Script_Extensions=
3108                                    Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
3109                                    Nyiakeng_Puachue_Hmong}) (71)
3110          \p{HorizSpace}          \p{XPosixBlank} (18)
3111          \p{Hst: *}              \p{Hangul_Syllable_Type: *}
3112          \p{Hung}                \p{Old_Hungarian} (= \p{Script_Extensions=
3113                                    Old_Hungarian}) (NOT \p{Block=
3114                                    Old_Hungarian}) (108)
3115        D \p{Hyphen}              \p{Hyphen=Y} (11)
3116        D \p{Hyphen: N*}          Supplanted by Line_Break property values;
3117                                    see www.unicode.org/reports/tr14
3118                                    (Single: \P{Hyphen}) (1_114_101 plus all
3119                                    above-Unicode code points: [\x00-\x20!
3120                                    \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
3121                                    \[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-\xff],
3122                                    U+0100..0589, U+058B..1805,
3123                                    U+1807..200F, U+2012..2E16, U+2E18..30FA
3124                                    ...)
3125        D \p{Hyphen: Y*}          Supplanted by Line_Break property values;
3126                                    see www.unicode.org/reports/tr14
3127                                    (Single: \p{Hyphen}) (11: [\-\xad],
3128                                    U+058A, U+1806, U+2010..2011, U+2E17,
3129                                    U+30FB ...)
3130          \p{ID_Continue}         \p{ID_Continue=Y} (Short: \p{IDC}; NOT
3131                                    \p{Ideographic_Description_Characters})
3132                                    (139_482)
3133          \p{ID_Continue: N*}     (Short: \p{IDC=N}, \P{IDC}) (974_630 plus
3134                                    all above-Unicode code points: [\x00-
3135                                    \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
3136                                    \[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4\xb6
3137                                    \xb8-\xb9\xbb-\xbf\xd7\xf7],
3138                                    U+02C2..02C5, U+02D2..02DF,
3139                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
3140          \p{ID_Continue: Y*}     (Short: \p{IDC=Y}, \p{IDC}) (139_482:
3141                                    [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
3142                                    \xd8-\xf6\xf8-\xff], U+0100..02C1,
3143                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
3144                                    U+02EE ...)
3145          \p{ID_Start}            \p{ID_Start=Y} (Short: \p{IDS}) (136_345)
3146          \p{ID_Start: N*}        (Short: \p{IDS=N}, \P{IDS}) (977_767 plus
3147                                    all above-Unicode code points: [\x00-
3148                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@
3149                                    \[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-
3150                                    \xb9\xbb-\xbf\xd7\xf7], U+02C2..02C5,
3151                                    U+02D2..02DF, U+02E5..02EB, U+02ED,
3152                                    U+02EF..036F ...)
3153          \p{ID_Start: Y*}        (Short: \p{IDS=Y}, \p{IDS}) (136_345: [A-
3154                                    Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
3155                                    \xff], U+0100..02C1, U+02C6..02D1,
3156                                    U+02E0..02E4, U+02EC, U+02EE ...)
3157          \p{IDC}                 \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
3158                                    \p{Ideographic_Description_Characters})
3159                                    (139_482)
3160          \p{IDC: *}              \p{ID_Continue: *}
3161          \p{Identifier_Status: Allowed} (112_159: [\'\-.0-9:A-Z_a-z\xb7
3162                                    \xc0-\xd6\xd8-\xf6\xf8-\xff],
3163                                    U+0100..0131, U+0134..013E,
3164                                    U+0141..0148, U+014A..017E, U+018F ...)
3165          \p{Identifier_Status: Restricted} (1_001_953 plus all above-
3166                                    Unicode code points: [\x00-\x20!\"#\$
3167                                    \%&\(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x7f-
3168                                    \xb6\xb8-\xbf\xd7\xf7], U+0132..0133,
3169                                    U+013F..0140, U+0149, U+017F..018E,
3170                                    U+0190..019F ...)
3171          \p{Identifier_Type: Default_Ignorable} (398: [\xad], U+034F,
3172                                    U+061C, U+115F..1160, U+17B4..17B5,
3173                                    U+180B..180F ...)
3174          \p{Identifier_Type: Deprecated} (15: U+0149, U+0673, U+0F77,
3175                                    U+0F79, U+17A3..17A4, U+206A..206F ...)
3176          \p{Identifier_Type: Exclusion} (17_241: U+03E2..03EF,
3177                                    U+0800..082D, U+0830..083E,
3178                                    U+1680..169C, U+16A0..16EA, U+16EE..16F8
3179                                    ...)
3180          \p{Identifier_Type: Inclusion} (17: [\'\-.:\xb7], U+0375, U+058A,
3181                                    U+05F3..05F4, U+06FD..06FE, U+0F0B ...)
3182          \p{Identifier_Type: Limited_Use} (5268: U+0700..070D,
3183                                    U+070F..074A, U+074D..074F,
3184                                    U+07C0..07FA, U+07FD..07FF, U+0840..085B
3185                                    ...)
3186          \p{Identifier_Type: Not_Character} (964_920 plus all above-Unicode
3187                                    code points: [^\t\n\cK\f\r\x20-\x7e\x85
3188                                    \xa0-\xff], U+0378..0379, U+0380..0383,
3189                                    U+038B, U+038D, U+03A2 ...)
3190          \p{Identifier_Type: Not_NFKC} (4921: [\xa0\xa8\xaa\xaf\xb2-\xb5
3191                                    \xb8-\xba\xbc-\xbe], U+0132..0133,
3192                                    U+013F..0140, U+017F, U+01C4..01CC,
3193                                    U+01F1..01F3 ...)
3194          \p{Identifier_Type: Not_XID} (8277: [\t\n\cK\f\r\x20!\"#\$\%&
3195                                    \(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x85
3196                                    \xa1-\xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6
3197                                    \xbb\xbf\xd7\xf7], U+02C2..02C5,
3198                                    U+02D2..02D7, U+02DE..02DF,
3199                                    U+02E5..02EB, U+02ED ...)
3200          \p{Identifier_Type: Obsolete} (1627: U+018D, U+01AA..01AB,
3201                                    U+01B9..01BB, U+01BE..01BF,
3202                                    U+01F6..01F7, U+021C..021D ...)
3203          \p{Identifier_Type: Recommended} (112_142: [0-9A-Z_a-z\xc0-\xd6
3204                                    \xd8-\xf6\xf8-\xff], U+0100..0131,
3205                                    U+0134..013E, U+0141..0148,
3206                                    U+014A..017E, U+018F ...)
3207          \p{Identifier_Type: Technical} (1660: U+0180, U+018D,
3208                                    U+01AA..01AB, U+01BA..01BB, U+01BE,
3209                                    U+01C0..01C3 ...)
3210          \p{Identifier_Type: Uncommon_Use} (396: U+0181..018C, U+018E,
3211                                    U+0190..019F, U+01A2..01A9,
3212                                    U+01AC..01AE, U+01B1..01B8 ...)
3213          \p{Ideo}                \p{Ideographic} (= \p{Ideographic=Y})
3214                                    (105_854)
3215          \p{Ideo: *}             \p{Ideographic: *}
3216          \p{Ideographic}         \p{Ideographic=Y} (Short: \p{Ideo})
3217                                    (105_854)
3218          \p{Ideographic: N*}     (Short: \p{Ideo=N}, \P{Ideo}) (1_008_258
3219                                    plus all above-Unicode code points:
3220                                    U+0000..3005, U+3008..3020,
3221                                    U+302A..3037, U+303B..33FF,
3222                                    U+4DC0..4DFF, U+A000..F8FF ...)
3223          \p{Ideographic: Y*}     (Short: \p{Ideo=Y}, \p{Ideo}) (105_854:
3224                                    U+3006..3007, U+3021..3029,
3225                                    U+3038..303A, U+3400..4DBF,
3226                                    U+4E00..9FFF, U+F900..FA6D ...)
3227        X \p{Ideographic_Description_Characters} \p{Block=
3228                                    Ideographic_Description_Characters}
3229                                    (Short: \p{InIDC}) (16)
3230        X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
3231                                    \p{Block=
3232                                    Ideographic_Symbols_And_Punctuation})
3233                                    (32)
3234        X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
3235                                    Ideographic_Symbols_And_Punctuation}
3236                                    (Short: \p{InIdeographicSymbols}) (32)
3237          \p{IDS}                 \p{ID_Start} (= \p{ID_Start=Y}) (136_345)
3238          \p{IDS: *}              \p{ID_Start: *}
3239          \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
3240                                    \p{IDSB}) (10)
3241          \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
3242                                    (1_114_102 plus all above-Unicode code
3243                                    points: U+0000..2FEF, U+2FF2..2FF3,
3244                                    U+2FFC..infinity)
3245          \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
3246                                    U+2FF0..2FF1, U+2FF4..2FFB)
3247          \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
3248                                    \p{IDST}) (2)
3249          \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
3250                                    (1_114_110 plus all above-Unicode code
3251                                    points: U+0000..2FF1, U+2FF4..infinity)
3252          \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
3253                                    U+2FF2..2FF3)
3254          \p{IDSB}                \p{IDS_Binary_Operator} (=
3255                                    \p{IDS_Binary_Operator=Y}) (10)
3256          \p{IDSB: *}             \p{IDS_Binary_Operator: *}
3257          \p{IDST}                \p{IDS_Trinary_Operator} (=
3258                                    \p{IDS_Trinary_Operator=Y}) (2)
3259          \p{IDST: *}             \p{IDS_Trinary_Operator: *}
3260          \p{Imperial_Aramaic}    \p{Script_Extensions=Imperial_Aramaic}
3261                                    (Short: \p{Armi}; NOT \p{Block=
3262                                    Imperial_Aramaic}) (31)
3263          \p{In: *}               \p{Present_In: *} (Perl extension)
3264        X \p{In_*}                \p{Block: *}
3265        X \p{Indic_Number_Forms}  \p{Common_Indic_Number_Forms} (= \p{Block=
3266                                    Common_Indic_Number_Forms}) (16)
3267          \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
3268                                    (356: U+093C, U+0941..0944, U+094D,
3269                                    U+0952, U+0956..0957, U+0962..0963 ...)
3270          \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
3271                                    BottomAndLeft}) (1: U+A9BF)
3272          \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
3273                                    BottomAndRight}) (4: U+1B3B, U+A9BE,
3274                                    U+A9C0, U+11942)
3275          \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (66:
3276                                    U+093F, U+094E, U+09BF, U+09C7..09C8,
3277                                    U+0A3F, U+0ABF ...)
3278          \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
3279                                    LeftAndRight}) (22: U+09CB..09CC,
3280                                    U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
3281                                    U+0DDC, U+0DDE ...)
3282          \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_875
3283                                    plus all above-Unicode code points:
3284                                    U+0000..08FF, U+0904..0939, U+093D,
3285                                    U+0950, U+0953..0954, U+0958..0961 ...)
3286          \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
3287                                    Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
3288                                    U+10A01, U+10A06)
3289          \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (295:
3290                                    U+0903, U+093B, U+093E, U+0940,
3291                                    U+0949..094C, U+094F ...)
3292          \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (428:
3293                                    U+0900..0902, U+093A, U+0945..0948,
3294                                    U+0951, U+0955, U+0981 ...)
3295          \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
3296                                    TopAndBottom}) (10: U+0C48, U+0F73,
3297                                    U+0F76..0F79, U+0F81, U+1B3C,
3298                                    U+1112E..1112F)
3299          \p{Indic_Positional_Category: Top_And_Bottom_And_Left} (Short:
3300                                    \p{InPC=TopAndBottomAndLeft}) (2:
3301                                    U+103C, U+1171E)
3302          \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
3303                                    \p{InPC=TopAndBottomAndRight}) (1:
3304                                    U+1B3D)
3305          \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
3306                                    TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
3307                                    U+1C29, U+114BB, U+115B9)
3308          \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
3309                                    \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
3310                                    U+0DDD, U+17BF, U+115BB)
3311          \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
3312                                    TopAndRight}) (13: U+0AC9, U+0B57,
3313                                    U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
3314                                    U+1925..1926 ...)
3315          \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
3316                                    VisualOrderLeft}) (19: U+0E40..0E44,
3317                                    U+0EC0..0EC4, U+19B5..19B7, U+19BA,
3318                                    U+AAB5..AAB6, U+AAB9 ...)
3319        X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
3320          \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
3321                                    (17: U+093D, U+09BD, U+0ABD, U+0B3D,
3322                                    U+0C3D, U+0CBD ...)
3323          \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (94:
3324                                    U+0900..0902, U+0981..0982, U+09FC,
3325                                    U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
3326          \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
3327                                    \p{InSC=BrahmiJoiningNumber}) (20:
3328                                    U+11052..11065)
3329          \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
3330                                    CantillationMark}) (58: U+0951..0952,
3331                                    U+0A51, U+0AFA, U+0AFC, U+1CD0..1CD2,
3332                                    U+1CD4..1CE1 ...)
3333          \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
3334                                    (2241: U+0915..0939, U+0958..095F,
3335                                    U+0978..097F, U+0995..09A8,
3336                                    U+09AA..09B0, U+09B2 ...)
3337          \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
3338                                    ConsonantDead}) (14: U+09CE, U+0C5D,
3339                                    U+0CDD, U+0D54..0D56, U+0D7A..0D7F,
3340                                    U+1CF2..1CF3)
3341          \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
3342                                    ConsonantFinal}) (70: U+1930..1931,
3343                                    U+1933..1939, U+19C1..19C7,
3344                                    U+1A58..1A59, U+1B03, U+1B81 ...)
3345          \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
3346                                    \p{InSC=ConsonantHeadLetter}) (5:
3347                                    U+0F88..0F8C)
3348          \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
3349                                    \p{InSC=ConsonantInitialPostfixed}) (1:
3350                                    U+1A5A)
3351          \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
3352                                    ConsonantKiller}) (2: U+0E4C, U+17CD)
3353          \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
3354                                    ConsonantMedial}) (31: U+0A75,
3355                                    U+0EBC..0EBD, U+103B..103E,
3356                                    U+105E..1060, U+1082, U+1A55..1A56 ...)
3357          \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
3358                                    \p{InSC=ConsonantPlaceholder}) (22: [\-
3359                                    \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
3360                                    U+104E, U+1900 ...)
3361          \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
3362                                    \p{InSC=ConsonantPrecedingRepha}) (4:
3363                                    U+0D4E, U+11941, U+11D46, U+11F02)
3364          \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
3365                                    ConsonantPrefixed}) (10: U+111C2..111C3,
3366                                    U+1193F, U+11A3A, U+11A84..11A89)
3367          \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
3368                                    ConsonantSubjoined}) (94: U+0F8D..0F97,
3369                                    U+0F99..0FBC, U+1929..192B, U+1A57,
3370                                    U+1A5B..1A5E, U+1BA1..1BA3 ...)
3371          \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
3372                                    \p{InSC=ConsonantSucceedingRepha}) (1:
3373                                    U+17CC)
3374          \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
3375                                    \p{InSC=ConsonantWithStacker}) (8:
3376                                    U+0CF1..0CF2, U+1CF5..1CF6,
3377                                    U+11003..11004, U+11460..11461)
3378          \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
3379                                    GeminationMark}) (4: U+0A71, U+0AFB,
3380                                    U+11237, U+11A98)
3381          \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
3382                                    InvisibleStacker}) (13: U+1039, U+17D2,
3383                                    U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
3384          \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
3385                                    U+200D)
3386          \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
3387                                    ModifyingLetter}) (1: U+0B83)
3388          \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
3389                                    NonJoiner}) (1: U+200C)
3390          \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (32:
3391                                    U+093C, U+09BC, U+0A3C, U+0ABC,
3392                                    U+0AFD..0AFF, U+0B3C ...)
3393          \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (501:
3394                                    [0-9], U+0966..096F, U+09E6..09EF,
3395                                    U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
3396                                    ...)
3397          \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
3398                                    NumberJoiner}) (1: U+1107F)
3399          \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
3400                                    (1_109_473 plus all above-Unicode code
3401                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,.
3402                                    \/:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
3403                                    \x9f\xa1-\xb1\xb4-\xd6\xd8-\xff],
3404                                    U+0100..08FF, U+0950, U+0953..0954,
3405                                    U+0964..0965, U+0970..0971 ...)
3406          \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
3407                                    PureKiller}) (26: U+0D3B..0D3C, U+0E3A,
3408                                    U+0E4E, U+0EBA, U+0F84, U+103A ...)
3409          \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
3410                                    RegisterShifter}) (2: U+17C9..17CA)
3411          \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
3412                                    SyllableModifier}) (26: [\xb2-\xb3],
3413                                    U+09FE, U+0ECE, U+0F35, U+0F37, U+0FC6
3414                                    ...)
3415          \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
3416                                    ToneLetter}) (7: U+1970..1974, U+AAC0,
3417                                    U+AAC2)
3418          \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
3419                                    (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
3420                                    U+1063..1064, U+1069..106D, U+1087..108D
3421                                    ...)
3422          \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
3423                                    U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3424                                    U+0BCD ...)
3425          \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3426                                    (36: U+0903, U+0983, U+0A03, U+0A83,
3427                                    U+0B03, U+0C03 ...)
3428          \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3429                                    U+1963..196D, U+A85E..A861, U+A866,
3430                                    U+A922..A92A, U+11150..11154)
3431          \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3432                                    VowelDependent}) (697: U+093A..093B,
3433                                    U+093E..094C, U+094E..094F,
3434                                    U+0955..0957, U+0962..0963, U+09BE..09C4
3435                                    ...)
3436          \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3437                                    VowelIndependent}) (500: U+0904..0914,
3438                                    U+0960..0961, U+0972..0977,
3439                                    U+0985..098C, U+098F..0990, U+0993..0994
3440                                    ...)
3441          \p{Inherited}           \p{Script_Extensions=Inherited} (Short:
3442                                    \p{Zinh}) (586)
3443          \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
3444                                    (Short: \p{Pi}) (12)
3445          \p{InPC: *}             \p{Indic_Positional_Category: *}
3446          \p{InSC: *}             \p{Indic_Syllabic_Category: *}
3447          \p{Inscriptional_Pahlavi} \p{Script_Extensions=
3448                                    Inscriptional_Pahlavi} (Short: \p{Phli};
3449                                    NOT \p{Block=Inscriptional_Pahlavi}) (27)
3450          \p{Inscriptional_Parthian} \p{Script_Extensions=
3451                                    Inscriptional_Parthian} (Short:
3452                                    \p{Prti}; NOT \p{Block=
3453                                    Inscriptional_Parthian}) (30)
3454        X \p{IPA_Ext}             \p{IPA_Extensions} (= \p{Block=
3455                                    IPA_Extensions}) (96)
3456        X \p{IPA_Extensions}      \p{Block=IPA_Extensions} (Short:
3457                                    \p{InIPAExt}) (96)
3458          \p{Is_*}                \p{*} (Any exceptions are individually
3459                                    noted beginning with the word NOT.) If
3460                                    an entry has flag(s) at its beginning,
3461                                    like "D", the "Is_" form has the same
3462                                    flag(s).
3463          \p{Ital}                \p{Old_Italic} (= \p{Script_Extensions=
3464                                    Old_Italic}) (NOT \p{Block=Old_Italic})
3465                                    (39)
3466        X \p{Jamo}                \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
3467                                    (256)
3468        X \p{Jamo_Ext_A}          \p{Hangul_Jamo_Extended_A} (= \p{Block=
3469                                    Hangul_Jamo_Extended_A}) (32)
3470        X \p{Jamo_Ext_B}          \p{Hangul_Jamo_Extended_B} (= \p{Block=
3471                                    Hangul_Jamo_Extended_B}) (80)
3472          \p{Java}                \p{Javanese} (= \p{Script_Extensions=
3473                                    Javanese}) (NOT \p{Block=Javanese}) (91)
3474          \p{Javanese}            \p{Script_Extensions=Javanese} (Short:
3475                                    \p{Java}; NOT \p{Block=Javanese}) (91)
3476          \p{Jg: *}               \p{Joining_Group: *}
3477          \p{Join_C}              \p{Join_Control} (= \p{Join_Control=Y}) (2)
3478          \p{Join_C: *}           \p{Join_Control: *}
3479          \p{Join_Control}        \p{Join_Control=Y} (Short: \p{JoinC}) (2)
3480          \p{Join_Control: N*}    (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
3481                                    plus all above-Unicode code points:
3482                                    U+0000..200B, U+200E..infinity)
3483          \p{Join_Control: Y*}    (Short: \p{JoinC=Y}, \p{JoinC}) (2:
3484                                    U+200C..200D)
3485          \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
3486                                    U+08BB)
3487          \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
3488                                    U+08BD)
3489          \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (2:
3490                                    U+08BC, U+08C4)
3491          \p{Joining_Group: Ain}  (Short: \p{Jg=Ain}) (9: U+0639..063A,
3492                                    U+06A0, U+06FC, U+075D..075F, U+08B3,
3493                                    U+08C3)
3494          \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
3495          \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (29: U+0622..0623,
3496                                    U+0625, U+0627, U+0671..0673, U+0675,
3497                                    U+0773..0774 ...)
3498          \p{Joining_Group: Beh}  (Short: \p{Jg=Beh}) (27: U+0628,
3499                                    U+062A..062B, U+066E, U+0679..0680,
3500                                    U+0750..0756, U+08A0..08A1 ...)
3501          \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
3502          \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
3503                                    BurushaskiYehBarree}) (2: U+077A..077B)
3504          \p{Joining_Group: Dal}  (Short: \p{Jg=Dal}) (15: U+062F..0630,
3505                                    U+0688..0690, U+06EE, U+0759..075A,
3506                                    U+08AE)
3507          \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
3508                                    U+0715..0716, U+072A, U+072F)
3509          \p{Joining_Group: E}    (Short: \p{Jg=E}) (1: U+0725)
3510          \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
3511                                    U+063D..063F, U+06CC, U+06CE,
3512                                    U+0775..0776)
3513          \p{Joining_Group: Fe}   (Short: \p{Jg=Fe}) (1: U+074F)
3514          \p{Joining_Group: Feh}  (Short: \p{Jg=Feh}) (10: U+0641,
3515                                    U+06A1..06A6, U+0760..0761, U+08A4)
3516          \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
3517                                    U+0724)
3518          \p{Joining_Group: Gaf}  (Short: \p{Jg=Gaf}) (17: U+063B..063C,
3519                                    U+06A9, U+06AB, U+06AF..06B4,
3520                                    U+0762..0764, U+088D ...)
3521          \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
3522                                    U+072E)
3523          \p{Joining_Group: Hah}  (Short: \p{Jg=Hah}) (22: U+062C..062E,
3524                                    U+0681..0687, U+06BF, U+0757..0758,
3525                                    U+076E..076F, U+0772 ...)
3526          \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
3527                                    HamzaOnHehGoal}) (1: U+06C3)
3528          \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
3529                                    HanifiRohingyaKinnaYa}) (4: U+10D19,
3530                                    U+10D1E, U+10D20, U+10D23)
3531          \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
3532                                    HanifiRohingyaPa}) (3: U+10D02, U+10D09,
3533                                    U+10D1C)
3534          \p{Joining_Group: He}   (Short: \p{Jg=He}) (1: U+0717)
3535          \p{Joining_Group: Heh}  (Short: \p{Jg=Heh}) (1: U+0647)
3536          \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
3537                                    U+06C1..06C2)
3538          \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
3539          \p{Joining_Group: Kaf}  (Short: \p{Jg=Kaf}) (6: U+0643,
3540                                    U+06AC..06AE, U+077F, U+08B4)
3541          \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
3542          \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
3543          \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
3544                                    U+06BE, U+06FF)
3545          \p{Joining_Group: Lam}  (Short: \p{Jg=Lam}) (8: U+0644,
3546                                    U+06B5..06B8, U+076A, U+08A6, U+08C7)
3547          \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
3548          \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
3549                                    U+0866)
3550          \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
3551                                    U+0861)
3552          \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
3553                                    U+0868)
3554          \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
3555                                    (1: U+0869)
3556          \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
3557                                    U+0860)
3558          \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
3559                                    U+0864)
3560          \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
3561                                    (1: U+0865)
3562          \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
3563                                    U+0862)
3564          \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
3565                                    U+0867)
3566          \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
3567                                    U+086A)
3568          \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
3569                                    U+0863)
3570          \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
3571                                    ManichaeanAleph}) (1: U+10AC0)
3572          \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
3573                                    (2: U+10AD9..10ADA)
3574          \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
3575                                    (2: U+10AC1..10AC2)
3576          \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
3577                                    ManichaeanDaleth}) (1: U+10AC5)
3578          \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
3579                                    ManichaeanDhamedh}) (1: U+10AD4)
3580          \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
3581                                    (1: U+10AEC)
3582          \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
3583                                    ManichaeanGimel}) (2: U+10AC3..10AC4)
3584          \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
3585                                    (1: U+10ACD)
3586          \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
3587                                    ManichaeanHundred}) (1: U+10AEF)
3588          \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
3589                                    (3: U+10AD0..10AD2)
3590          \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
3591                                    ManichaeanLamedh}) (1: U+10AD3)
3592          \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
3593                                    (1: U+10AD6)
3594          \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
3595                                    (1: U+10AD7)
3596          \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
3597                                    (1: U+10AEB)
3598          \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
3599                                    U+10ADB..10ADC)
3600          \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
3601                                    (3: U+10ADE..10AE0)
3602          \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
3603                                    (1: U+10AE1)
3604          \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
3605                                    ManichaeanSadhe}) (1: U+10ADD)
3606          \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
3607                                    ManichaeanSamekh}) (1: U+10AD8)
3608          \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
3609                                    (1: U+10AE4)
3610          \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
3611                                    (1: U+10AED)
3612          \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
3613                                    (1: U+10ACE)
3614          \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
3615                                    ManichaeanThamedh}) (1: U+10AD5)
3616          \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
3617                                    ManichaeanTwenty}) (1: U+10AEE)
3618          \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
3619                                    (1: U+10AC7)
3620          \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
3621                                    (1: U+10ACF)
3622          \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
3623                                    ManichaeanZayin}) (2: U+10AC9..10ACA)
3624          \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
3625                                    U+0765..0766, U+08A7)
3626          \p{Joining_Group: Mim}  (Short: \p{Jg=Mim}) (1: U+0721)
3627          \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
3628                                    (1_113_762 plus all above-Unicode code
3629                                    points: U+0000..061F, U+0621, U+0640,
3630                                    U+064B..066D, U+0670, U+0674 ...)
3631          \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (9: U+0646,
3632                                    U+06B9..06BC, U+0767..0769, U+0889)
3633          \p{Joining_Group: Nun}  (Short: \p{Jg=Nun}) (1: U+0722)
3634          \p{Joining_Group: Nya}  (Short: \p{Jg=Nya}) (1: U+06BD)
3635          \p{Joining_Group: Pe}   (Short: \p{Jg=Pe}) (1: U+0726)
3636          \p{Joining_Group: Qaf}  (Short: \p{Jg=Qaf}) (6: U+0642, U+066F,
3637                                    U+06A7..06A8, U+08A5, U+08B5)
3638          \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
3639          \p{Joining_Group: Reh}  (Short: \p{Jg=Reh}) (19: U+0631..0632,
3640                                    U+0691..0699, U+06EF, U+075B,
3641                                    U+076B..076C, U+0771 ...)
3642          \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
3643                                    U+0727)
3644          \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
3645                                    U+08AC)
3646          \p{Joining_Group: Sad}  (Short: \p{Jg=Sad}) (6: U+0635..0636,
3647                                    U+069D..069E, U+06FB, U+08AF)
3648          \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
3649          \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
3650                                    U+069A..069C, U+06FA, U+075C, U+076D,
3651                                    U+0770 ...)
3652          \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
3653          \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
3654          \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
3655                                    U+08B1)
3656          \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
3657          \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
3658          \p{Joining_Group: Tah}  (Short: \p{Jg=Tah}) (6: U+0637..0638,
3659                                    U+069F, U+088B..088C, U+08A3)
3660          \p{Joining_Group: Taw}  (Short: \p{Jg=Taw}) (1: U+072C)
3661          \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
3662                                    U+0629, U+06C0, U+06D5)
3663          \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
3664                                    Hamza_On_Heh_Goal} (1)
3665          \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
3666          \p{Joining_Group: Thin_Yeh} (Short: \p{Jg=ThinYeh}) (1: U+0886)
3667          \p{Joining_Group: Vertical_Tail} (Short: \p{Jg=VerticalTail}) (1:
3668                                    U+088E)
3669          \p{Joining_Group: Waw}  (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
3670                                    U+0676..0677, U+06C4..06CB, U+06CF,
3671                                    U+0778..0779 ...)
3672          \p{Joining_Group: Yeh}  (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
3673                                    U+0649..064A, U+0678, U+06D0..06D1,
3674                                    U+0777 ...)
3675          \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
3676                                    U+06D2..06D3)
3677          \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
3678                                    U+06CD)
3679          \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
3680          \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
3681          \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
3682          \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
3683          \p{Joining_Type: C}     \p{Joining_Type=Join_Causing} (7)
3684          \p{Joining_Type: D}     \p{Joining_Type=Dual_Joining} (610)
3685          \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (610: U+0620,
3686                                    U+0626, U+0628, U+062A..062E,
3687                                    U+0633..063F, U+0641..0647 ...)
3688          \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (7: U+0640,
3689                                    U+07FA, U+0883..0885, U+180A, U+200D)
3690          \p{Joining_Type: L}     \p{Joining_Type=Left_Joining} (5)
3691          \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (5: U+A872,
3692                                    U+10ACD, U+10AD7, U+10D00, U+10FCB)
3693          \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_188 plus
3694                                    all above-Unicode code points: [\x00-
3695                                    \xac\xae-\xff], U+0100..02FF,
3696                                    U+0370..0482, U+048A..0590, U+05BE,
3697                                    U+05C0 ...)
3698          \p{Joining_Type: R}     \p{Joining_Type=Right_Joining} (152)
3699          \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (152:
3700                                    U+0622..0625, U+0627, U+0629,
3701                                    U+062F..0632, U+0648, U+0671..0673 ...)
3702          \p{Joining_Type: T}     \p{Joining_Type=Transparent} (2150)
3703          \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (2150: [\xad],
3704                                    U+0300..036F, U+0483..0489,
3705                                    U+0591..05BD, U+05BF, U+05C1..05C2 ...)
3706          \p{Joining_Type: U}     \p{Joining_Type=Non_Joining} (1_111_188
3707                                    plus all above-Unicode code points)
3708          \p{Jt: *}               \p{Joining_Type: *}
3709          \p{Kaithi}              \p{Script_Extensions=Kaithi} (Short:
3710                                    \p{Kthi}; NOT \p{Block=Kaithi}) (88)
3711        X \p{Kaktovik_Numerals}   \p{Block=Kaktovik_Numerals} (32)
3712          \p{Kali}                \p{Kayah_Li} (= \p{Script_Extensions=
3713                                    Kayah_Li}) (48)
3714          \p{Kana}                \p{Katakana} (= \p{Script_Extensions=
3715                                    Katakana}) (NOT \p{Block=Katakana}) (373)
3716        X \p{Kana_Ext_A}          \p{Kana_Extended_A} (= \p{Block=
3717                                    Kana_Extended_A}) (48)
3718        X \p{Kana_Ext_B}          \p{Kana_Extended_B} (= \p{Block=
3719                                    Kana_Extended_B}) (16)
3720        X \p{Kana_Extended_A}     \p{Block=Kana_Extended_A} (Short:
3721                                    \p{InKanaExtA}) (48)
3722        X \p{Kana_Extended_B}     \p{Block=Kana_Extended_B} (Short:
3723                                    \p{InKanaExtB}) (16)
3724        X \p{Kana_Sup}            \p{Kana_Supplement} (= \p{Block=
3725                                    Kana_Supplement}) (256)
3726        X \p{Kana_Supplement}     \p{Block=Kana_Supplement} (Short:
3727                                    \p{InKanaSup}) (256)
3728        X \p{Kanbun}              \p{Block=Kanbun} (16)
3729        X \p{Kangxi}              \p{Kangxi_Radicals} (= \p{Block=
3730                                    Kangxi_Radicals}) (224)
3731        X \p{Kangxi_Radicals}     \p{Block=Kangxi_Radicals} (Short:
3732                                    \p{InKangxi}) (224)
3733          \p{Kannada}             \p{Script_Extensions=Kannada} (Short:
3734                                    \p{Knda}; NOT \p{Block=Kannada}) (106)
3735          \p{Katakana}            \p{Script_Extensions=Katakana} (Short:
3736                                    \p{Kana}; NOT \p{Block=Katakana}) (373)
3737        X \p{Katakana_Ext}        \p{Katakana_Phonetic_Extensions} (=
3738                                    \p{Block=Katakana_Phonetic_Extensions})
3739                                    (16)
3740        X \p{Katakana_Phonetic_Extensions} \p{Block=
3741                                    Katakana_Phonetic_Extensions} (Short:
3742                                    \p{InKatakanaExt}) (16)
3743          \p{Kawi}                \p{Script_Extensions=Kawi} (NOT \p{Block=
3744                                    Kawi}) (86)
3745          \p{Kayah_Li}            \p{Script_Extensions=Kayah_Li} (Short:
3746                                    \p{Kali}) (48)
3747          \p{Khar}                \p{Kharoshthi} (= \p{Script_Extensions=
3748                                    Kharoshthi}) (NOT \p{Block=Kharoshthi})
3749                                    (68)
3750          \p{Kharoshthi}          \p{Script_Extensions=Kharoshthi} (Short:
3751                                    \p{Khar}; NOT \p{Block=Kharoshthi}) (68)
3752          \p{Khitan_Small_Script} \p{Script_Extensions=Khitan_Small_Script}
3753                                    (Short: \p{Kits}; NOT \p{Block=
3754                                    Khitan_Small_Script}) (471)
3755          \p{Khmer}               \p{Script_Extensions=Khmer} (Short:
3756                                    \p{Khmr}; NOT \p{Block=Khmer}) (146)
3757        X \p{Khmer_Symbols}       \p{Block=Khmer_Symbols} (32)
3758          \p{Khmr}                \p{Khmer} (= \p{Script_Extensions=Khmer})
3759                                    (NOT \p{Block=Khmer}) (146)
3760          \p{Khoj}                \p{Khojki} (= \p{Script_Extensions=
3761                                    Khojki}) (NOT \p{Block=Khojki}) (85)
3762          \p{Khojki}              \p{Script_Extensions=Khojki} (Short:
3763                                    \p{Khoj}; NOT \p{Block=Khojki}) (85)
3764          \p{Khudawadi}           \p{Script_Extensions=Khudawadi} (Short:
3765                                    \p{Sind}; NOT \p{Block=Khudawadi}) (81)
3766          \p{Kits}                \p{Khitan_Small_Script} (=
3767                                    \p{Script_Extensions=
3768                                    Khitan_Small_Script}) (NOT \p{Block=
3769                                    Khitan_Small_Script}) (471)
3770          \p{Knda}                \p{Kannada} (= \p{Script_Extensions=
3771                                    Kannada}) (NOT \p{Block=Kannada}) (106)
3772          \p{Kthi}                \p{Kaithi} (= \p{Script_Extensions=
3773                                    Kaithi}) (NOT \p{Block=Kaithi}) (88)
3774          \p{L} \pL               \p{Letter} (= \p{General_Category=Letter})
3775                                    (136_104)
3776        X \p{L&}                  \p{Cased_Letter} (= \p{General_Category=
3777                                    Cased_Letter}) (4095)
3778        X \p{L_}                  \p{Cased_Letter} (= \p{General_Category=
3779                                    Cased_Letter}) Note that the trailing
3780                                    '_' matters despite the loose matching
3781                                    rules. (4095)
3782          \p{Lana}                \p{Tai_Tham} (= \p{Script_Extensions=
3783                                    Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
3784          \p{Lao}                 \p{Script_Extensions=Lao} (NOT \p{Block=
3785                                    Lao}) (83)
3786          \p{Laoo}                \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
3787                                    \p{Block=Lao}) (83)
3788          \p{Latin}               \p{Script_Extensions=Latin} (Short:
3789                                    \p{Latn}) (1510)
3790        X \p{Latin_1}             \p{Latin_1_Supplement} (= \p{Block=
3791                                    Latin_1_Supplement}) (128)
3792        X \p{Latin_1_Sup}         \p{Latin_1_Supplement} (= \p{Block=
3793                                    Latin_1_Supplement}) (128)
3794        X \p{Latin_1_Supplement}  \p{Block=Latin_1_Supplement} (Short:
3795                                    \p{InLatin1}) (128)
3796        X \p{Latin_Ext_A}         \p{Latin_Extended_A} (= \p{Block=
3797                                    Latin_Extended_A}) (128)
3798        X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
3799                                    \p{Block=Latin_Extended_Additional})
3800                                    (256)
3801        X \p{Latin_Ext_B}         \p{Latin_Extended_B} (= \p{Block=
3802                                    Latin_Extended_B}) (208)
3803        X \p{Latin_Ext_C}         \p{Latin_Extended_C} (= \p{Block=
3804                                    Latin_Extended_C}) (32)
3805        X \p{Latin_Ext_D}         \p{Latin_Extended_D} (= \p{Block=
3806                                    Latin_Extended_D}) (224)
3807        X \p{Latin_Ext_E}         \p{Latin_Extended_E} (= \p{Block=
3808                                    Latin_Extended_E}) (64)
3809        X \p{Latin_Ext_F}         \p{Latin_Extended_F} (= \p{Block=
3810                                    Latin_Extended_F}) (64)
3811        X \p{Latin_Ext_G}         \p{Latin_Extended_G} (= \p{Block=
3812                                    Latin_Extended_G}) (256)
3813        X \p{Latin_Extended_A}    \p{Block=Latin_Extended_A} (Short:
3814                                    \p{InLatinExtA}) (128)
3815        X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
3816                                    (Short: \p{InLatinExtAdditional}) (256)
3817        X \p{Latin_Extended_B}    \p{Block=Latin_Extended_B} (Short:
3818                                    \p{InLatinExtB}) (208)
3819        X \p{Latin_Extended_C}    \p{Block=Latin_Extended_C} (Short:
3820                                    \p{InLatinExtC}) (32)
3821        X \p{Latin_Extended_D}    \p{Block=Latin_Extended_D} (Short:
3822                                    \p{InLatinExtD}) (224)
3823        X \p{Latin_Extended_E}    \p{Block=Latin_Extended_E} (Short:
3824                                    \p{InLatinExtE}) (64)
3825        X \p{Latin_Extended_F}    \p{Block=Latin_Extended_F} (Short:
3826                                    \p{InLatinExtF}) (64)
3827        X \p{Latin_Extended_G}    \p{Block=Latin_Extended_G} (Short:
3828                                    \p{InLatinExtG}) (256)
3829          \p{Latn}                \p{Latin} (= \p{Script_Extensions=Latin})
3830                                    (1510)
3831          \p{Lb: *}               \p{Line_Break: *}
3832          \p{LC}                  \p{Cased_Letter} (= \p{General_Category=
3833                                    Cased_Letter}) (4095)
3834          \p{Lepc}                \p{Lepcha} (= \p{Script_Extensions=
3835                                    Lepcha}) (NOT \p{Block=Lepcha}) (74)
3836          \p{Lepcha}              \p{Script_Extensions=Lepcha} (Short:
3837                                    \p{Lepc}; NOT \p{Block=Lepcha}) (74)
3838          \p{Letter}              \p{General_Category=Letter} (Short: \p{L})
3839                                    (136_104)
3840          \p{Letter_Number}       \p{General_Category=Letter_Number} (Short:
3841                                    \p{Nl}) (236)
3842        X \p{Letterlike_Symbols}  \p{Block=Letterlike_Symbols} (80)
3843          \p{Limb}                \p{Limbu} (= \p{Script_Extensions=Limbu})
3844                                    (NOT \p{Block=Limbu}) (69)
3845          \p{Limbu}               \p{Script_Extensions=Limbu} (Short:
3846                                    \p{Limb}; NOT \p{Block=Limbu}) (69)
3847          \p{Lina}                \p{Linear_A} (= \p{Script_Extensions=
3848                                    Linear_A}) (NOT \p{Block=Linear_A}) (386)
3849          \p{Linb}                \p{Linear_B} (= \p{Script_Extensions=
3850                                    Linear_B}) (268)
3851          \p{Line_Break: AI}      \p{Line_Break=Ambiguous} (707)
3852          \p{Line_Break: AL}      \p{Line_Break=Alphabetic} (22_215)
3853          \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (22_215: [#&*<=>\@A-
3854                                    Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
3855                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
3856                                    U+02CE..02CF, U+02D1..02D7, U+02DC,
3857                                    U+02DE ...)
3858          \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-\xa8\xaa
3859                                    \xb2-\xb3\xb6-\xba\xbc-\xbe\xd7\xf7],
3860                                    U+02C7, U+02C9..02CB, U+02CD, U+02D0,
3861                                    U+02D8..02DB ...)
3862          \p{Line_Break: B2}      \p{Line_Break=Break_Both} (3)
3863          \p{Line_Break: BA}      \p{Line_Break=Break_After} (249)
3864          \p{Line_Break: BB}      \p{Line_Break=Break_Before} (55)
3865          \p{Line_Break: BK}      \p{Line_Break=Mandatory_Break} (4)
3866          \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (249: [\t\|\xad],
3867                                    U+058A, U+05BE, U+0964..0965,
3868                                    U+0E5A..0E5B, U+0F0B ...)
3869          \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (55: [\xb4],
3870                                    U+02C8, U+02CC, U+02DF, U+0C77, U+0C84
3871                                    ...)
3872          \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
3873                                    U+2E3A..2E3B)
3874          \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
3875          \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
3876          \p{Line_Break: CB}      \p{Line_Break=Contingent_Break} (1)
3877          \p{Line_Break: CJ}      \p{Line_Break=
3878                                    Conditional_Japanese_Starter} (60)
3879          \p{Line_Break: CL}      \p{Line_Break=Close_Punctuation} (97)
3880          \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
3881          \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (97: [\}],
3882                                    U+0F3B, U+0F3D, U+169C, U+2046, U+207E
3883                                    ...)
3884          \p{Line_Break: CM}      \p{Line_Break=Combining_Mark} (2438)
3885          \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2438: [^\t\n
3886                                    \cK\f\r\x20-\x7e\x85\xa0-\xff],
3887                                    U+0300..034E, U+0350..035B,
3888                                    U+0363..036F, U+0483..0489, U+0591..05BD
3889                                    ...)
3890          \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (758:
3891                                    U+0E01..0E3A, U+0E40..0E4E,
3892                                    U+0E81..0E82, U+0E84, U+0E86..0E8A,
3893                                    U+0E8C..0EA3 ...)
3894          \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
3895                                    (60: U+3041, U+3043, U+3045, U+3047,
3896                                    U+3049, U+3063 ...)
3897          \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
3898          \p{Line_Break: CP}      \p{Line_Break=Close_Parenthesis} (2)
3899          \p{Line_Break: CR}      \p{Line_Break=Carriage_Return} (1)
3900          \p{Line_Break: E_Base}  (Short: \p{Lb=EB}) (134: U+261D, U+26F9,
3901                                    U+270A..270D, U+1F385, U+1F3C2..1F3C4,
3902                                    U+1F3C7 ...)
3903          \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
3904          \p{Line_Break: EB}      \p{Line_Break=E_Base} (134)
3905          \p{Line_Break: EM}      \p{Line_Break=E_Modifier} (5)
3906          \p{Line_Break: EX}      \p{Line_Break=Exclamation} (40)
3907          \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (40: [!?], U+05C6,
3908                                    U+061B, U+061D..061F, U+06D4, U+07F9 ...)
3909          \p{Line_Break: GL}      \p{Line_Break=Glue} (31)
3910          \p{Line_Break: Glue}    (Short: \p{Lb=GL}) (31: [\xa0], U+034F,
3911                                    U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
3912          \p{Line_Break: H2}      (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
3913                                    U+AC38, U+AC54, U+AC70, U+AC8C ...)
3914          \p{Line_Break: H3}      (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
3915                                    U+AC1D..AC37, U+AC39..AC53,
3916                                    U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
3917                                    ...)
3918          \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75:
3919                                    U+05D0..05EA, U+05EF..05F2, U+FB1D,
3920                                    U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
3921                                    ...)
3922          \p{Line_Break: HL}      \p{Line_Break=Hebrew_Letter} (75)
3923          \p{Line_Break: HY}      \p{Line_Break=Hyphen} (1)
3924          \p{Line_Break: Hyphen}  (Short: \p{Lb=HY}) (1: [\-])
3925          \p{Line_Break: ID}      \p{Line_Break=Ideographic} (172_465)
3926          \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_465:
3927                                    U+231A..231B, U+23F0..23F3,
3928                                    U+2600..2603, U+2614..2615, U+2618,
3929                                    U+261A..261C ...)
3930          \p{Line_Break: IN}      \p{Line_Break=Inseparable} (6)
3931          \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
3932                                    U+037E, U+0589, U+060C..060D, U+07F8,
3933                                    U+2044 ...)
3934          \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
3935                                    U+22EF, U+FE19, U+10AF6)
3936          \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
3937          \p{Line_Break: IS}      \p{Line_Break=Infix_Numeric} (13)
3938          \p{Line_Break: JL}      (Short: \p{Lb=JL}) (125: U+1100..115F,
3939                                    U+A960..A97C)
3940          \p{Line_Break: JT}      (Short: \p{Lb=JT}) (137: U+11A8..11FF,
3941                                    U+D7CB..D7FB)
3942          \p{Line_Break: JV}      (Short: \p{Lb=JV}) (95: U+1160..11A7,
3943                                    U+D7B0..D7C6)
3944          \p{Line_Break: LF}      \p{Line_Break=Line_Feed} (1)
3945          \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
3946          \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
3947                                    U+2028..2029)
3948          \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
3949          \p{Line_Break: NL}      \p{Line_Break=Next_Line} (1)
3950          \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6,
3951                                    U+203C..203D, U+2047..2049, U+3005,
3952                                    U+301C, U+303B..303C ...)
3953          \p{Line_Break: NS}      \p{Line_Break=Nonstarter} (33)
3954          \p{Line_Break: NU}      \p{Line_Break=Numeric} (672)
3955          \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (672: [0-9],
3956                                    U+0660..0669, U+066B..066C,
3957                                    U+06F0..06F9, U+07C0..07C9, U+0966..096F
3958                                    ...)
3959          \p{Line_Break: OP}      \p{Line_Break=Open_Punctuation} (94)
3960          \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (94: [\(\[\{
3961                                    \xa1\xbf], U+0F3A, U+0F3C, U+169B,
3962                                    U+201A, U+201E ...)
3963          \p{Line_Break: PO}      \p{Line_Break=Postfix_Numeric} (38)
3964          \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (38: [\%\xa2
3965                                    \xb0], U+0609..060B, U+066A,
3966                                    U+09F2..09F3, U+09F9, U+0D79 ...)
3967          \p{Line_Break: PR}      \p{Line_Break=Prefix_Numeric} (67)
3968          \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (67: [\$+\\\xa3-
3969                                    \xa5\xb1], U+058F, U+07FE..07FF, U+09FB,
3970                                    U+0AF1, U+0BF9 ...)
3971          \p{Line_Break: QU}      \p{Line_Break=Quotation} (39)
3972          \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
3973                                    U+2018..2019, U+201B..201D, U+201F,
3974                                    U+2039..203A, U+275B..2760 ...)
3975          \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
3976                                    U+1F1E6..1F1FF)
3977          \p{Line_Break: RI}      \p{Line_Break=Regional_Indicator} (26)
3978          \p{Line_Break: SA}      \p{Line_Break=Complex_Context} (758)
3979        D \p{Line_Break: SG}      \p{Line_Break=Surrogate} (2048)
3980          \p{Line_Break: SP}      \p{Line_Break=Space} (1)
3981          \p{Line_Break: Space}   (Short: \p{Lb=SP}) (1: [\x20])
3982        D \p{Line_Break: Surrogate} Surrogates should never appear in well-
3983                                    formed text, and therefore shouldn't be
3984                                    the basis for line breaking (Short:
3985                                    \p{Lb=SG}) (2048: U+D800..DFFF)
3986          \p{Line_Break: SY}      \p{Line_Break=Break_Symbols} (1)
3987          \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (900_198 plus all
3988                                    above-Unicode code points: U+0378..0379,
3989                                    U+0380..0383, U+038B, U+038D, U+03A2,
3990                                    U+0530 ...)
3991          \p{Line_Break: WJ}      \p{Line_Break=Word_Joiner} (2)
3992          \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
3993          \p{Line_Break: XX}      \p{Line_Break=Unknown} (900_198 plus all
3994                                    above-Unicode code points)
3995          \p{Line_Break: ZW}      \p{Line_Break=ZWSpace} (1)
3996          \p{Line_Break: ZWJ}     (Short: \p{Lb=ZWJ}) (1: U+200D)
3997          \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
3998          \p{Line_Separator}      \p{General_Category=Line_Separator}
3999                                    (Short: \p{Zl}) (1)
4000          \p{Linear_A}            \p{Script_Extensions=Linear_A} (Short:
4001                                    \p{Lina}; NOT \p{Block=Linear_A}) (386)
4002          \p{Linear_B}            \p{Script_Extensions=Linear_B} (Short:
4003                                    \p{Linb}) (268)
4004        X \p{Linear_B_Ideograms}  \p{Block=Linear_B_Ideograms} (128)
4005        X \p{Linear_B_Syllabary}  \p{Block=Linear_B_Syllabary} (128)
4006          \p{Lisu}                \p{Script_Extensions=Lisu} (NOT \p{Block=
4007                                    Lisu}) (49)
4008        X \p{Lisu_Sup}            \p{Lisu_Supplement} (= \p{Block=
4009                                    Lisu_Supplement}) (16)
4010        X \p{Lisu_Supplement}     \p{Block=Lisu_Supplement} (Short:
4011                                    \p{InLisuSup}) (16)
4012          \p{Ll}                  \p{Lowercase_Letter} (=
4013                                    \p{General_Category=Lowercase_Letter})
4014                                    (/i= General_Category=Cased_Letter)
4015                                    (2233)
4016          \p{Lm}                  \p{Modifier_Letter} (=
4017                                    \p{General_Category=Modifier_Letter})
4018                                    (397)
4019          \p{Lo}                  \p{Other_Letter} (= \p{General_Category=
4020                                    Other_Letter}) (131_612)
4021          \p{LOE}                 \p{Logical_Order_Exception} (=
4022                                    \p{Logical_Order_Exception=Y}) (19)
4023          \p{LOE: *}              \p{Logical_Order_Exception: *}
4024          \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
4025                                    \p{LOE}) (19)
4026          \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
4027                                    (1_114_093 plus all above-Unicode code
4028                                    points: U+0000..0E3F, U+0E45..0EBF,
4029                                    U+0EC5..19B4, U+19B8..19B9,
4030                                    U+19BB..AAB4, U+AAB7..AAB8 ...)
4031          \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
4032                                    U+0E40..0E44, U+0EC0..0EC4,
4033                                    U+19B5..19B7, U+19BA, U+AAB5..AAB6,
4034                                    U+AAB9 ...)
4035        X \p{Low_Surrogates}      \p{Block=Low_Surrogates} (1024)
4036          \p{Lower}               \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
4037                                    Cased=Yes) (2544)
4038          \p{Lower: *}            \p{Lowercase: *}
4039          \p{Lowercase}           \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
4040                                    Cased=Yes) (2544)
4041          \p{Lowercase: N*}       (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
4042                                    No) (1_111_568 plus all above-Unicode
4043                                    code points: [\x00-\x20!\"#\$\%&\'
4044                                    \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`\{
4045                                    \|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9\xbb-
4046                                    \xde\xf7], U+0100, U+0102, U+0104,
4047                                    U+0106, U+0108 ...)
4048          \p{Lowercase: Y*}       (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
4049                                    Yes) (2544: [a-z\xaa\xb5\xba\xdf-\xf6
4050                                    \xf8-\xff], U+0101, U+0103, U+0105,
4051                                    U+0107, U+0109 ...)
4052          \p{Lowercase_Letter}    \p{General_Category=Lowercase_Letter}
4053                                    (Short: \p{Ll}; /i= General_Category=
4054                                    Cased_Letter) (2233)
4055          \p{Lt}                  \p{Titlecase_Letter} (=
4056                                    \p{General_Category=Titlecase_Letter})
4057                                    (/i= General_Category=Cased_Letter) (31)
4058          \p{Lu}                  \p{Uppercase_Letter} (=
4059                                    \p{General_Category=Uppercase_Letter})
4060                                    (/i= General_Category=Cased_Letter)
4061                                    (1831)
4062          \p{Lyci}                \p{Lycian} (= \p{Script_Extensions=
4063                                    Lycian}) (NOT \p{Block=Lycian}) (29)
4064          \p{Lycian}              \p{Script_Extensions=Lycian} (Short:
4065                                    \p{Lyci}; NOT \p{Block=Lycian}) (29)
4066          \p{Lydi}                \p{Lydian} (= \p{Script_Extensions=
4067                                    Lydian}) (NOT \p{Block=Lydian}) (27)
4068          \p{Lydian}              \p{Script_Extensions=Lydian} (Short:
4069                                    \p{Lydi}; NOT \p{Block=Lydian}) (27)
4070          \p{M} \pM               \p{Mark} (= \p{General_Category=Mark})
4071                                    (2450)
4072          \p{Mahajani}            \p{Script_Extensions=Mahajani} (Short:
4073                                    \p{Mahj}; NOT \p{Block=Mahajani}) (61)
4074          \p{Mahj}                \p{Mahajani} (= \p{Script_Extensions=
4075                                    Mahajani}) (NOT \p{Block=Mahajani}) (61)
4076        X \p{Mahjong}             \p{Mahjong_Tiles} (= \p{Block=
4077                                    Mahjong_Tiles}) (48)
4078        X \p{Mahjong_Tiles}       \p{Block=Mahjong_Tiles} (Short:
4079                                    \p{InMahjong}) (48)
4080          \p{Maka}                \p{Makasar} (= \p{Script_Extensions=
4081                                    Makasar}) (NOT \p{Block=Makasar}) (25)
4082          \p{Makasar}             \p{Script_Extensions=Makasar} (Short:
4083                                    \p{Maka}; NOT \p{Block=Makasar}) (25)
4084          \p{Malayalam}           \p{Script_Extensions=Malayalam} (Short:
4085                                    \p{Mlym}; NOT \p{Block=Malayalam}) (126)
4086          \p{Mand}                \p{Mandaic} (= \p{Script_Extensions=
4087                                    Mandaic}) (NOT \p{Block=Mandaic}) (30)
4088          \p{Mandaic}             \p{Script_Extensions=Mandaic} (Short:
4089                                    \p{Mand}; NOT \p{Block=Mandaic}) (30)
4090          \p{Mani}                \p{Manichaean} (= \p{Script_Extensions=
4091                                    Manichaean}) (NOT \p{Block=Manichaean})
4092                                    (52)
4093          \p{Manichaean}          \p{Script_Extensions=Manichaean} (Short:
4094                                    \p{Mani}; NOT \p{Block=Manichaean}) (52)
4095          \p{Marc}                \p{Marchen} (= \p{Script_Extensions=
4096                                    Marchen}) (NOT \p{Block=Marchen}) (68)
4097          \p{Marchen}             \p{Script_Extensions=Marchen} (Short:
4098                                    \p{Marc}; NOT \p{Block=Marchen}) (68)
4099          \p{Mark}                \p{General_Category=Mark} (Short: \p{M})
4100                                    (2450)
4101          \p{Masaram_Gondi}       \p{Script_Extensions=Masaram_Gondi}
4102                                    (Short: \p{Gonm}; NOT \p{Block=
4103                                    Masaram_Gondi}) (77)
4104          \p{Math}                \p{Math=Y} (2310)
4105          \p{Math: N*}            (Single: \P{Math}) (1_111_802 plus all
4106                                    above-Unicode code points: [\x00-\x20!
4107                                    \"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-Z
4108                                    \[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
4109                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
4110                                    U+03D3..03D4, U+03D6..03EF,
4111                                    U+03F2..03F3, U+03F7..0605 ...)
4112          \p{Math: Y*}            (Single: \p{Math}) (2310: [+<=>\^\|~\xac
4113                                    \xb1\xd7\xf7], U+03D0..03D2, U+03D5,
4114                                    U+03F0..03F1, U+03F4..03F6, U+0606..0608
4115                                    ...)
4116        X \p{Math_Alphanum}       \p{Mathematical_Alphanumeric_Symbols} (=
4117                                    \p{Block=
4118                                    Mathematical_Alphanumeric_Symbols})
4119                                    (1024)
4120        X \p{Math_Operators}      \p{Mathematical_Operators} (= \p{Block=
4121                                    Mathematical_Operators}) (256)
4122          \p{Math_Symbol}         \p{General_Category=Math_Symbol} (Short:
4123                                    \p{Sm}) (948)
4124        X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
4125                                    Mathematical_Alphanumeric_Symbols}
4126                                    (Short: \p{InMathAlphanum}) (1024)
4127        X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
4128                                    (Short: \p{InMathOperators}) (256)
4129        X \p{Mayan_Numerals}      \p{Block=Mayan_Numerals} (32)
4130          \p{Mc}                  \p{Spacing_Mark} (= \p{General_Category=
4131                                    Spacing_Mark}) (452)
4132          \p{Me}                  \p{Enclosing_Mark} (= \p{General_Category=
4133                                    Enclosing_Mark}) (13)
4134          \p{Medefaidrin}         \p{Script_Extensions=Medefaidrin} (Short:
4135                                    \p{Medf}; NOT \p{Block=Medefaidrin}) (91)
4136          \p{Medf}                \p{Medefaidrin} (= \p{Script_Extensions=
4137                                    Medefaidrin}) (NOT \p{Block=
4138                                    Medefaidrin}) (91)
4139          \p{Meetei_Mayek}        \p{Script_Extensions=Meetei_Mayek} (Short:
4140                                    \p{Mtei}; NOT \p{Block=Meetei_Mayek})
4141                                    (79)
4142        X \p{Meetei_Mayek_Ext}    \p{Meetei_Mayek_Extensions} (= \p{Block=
4143                                    Meetei_Mayek_Extensions}) (32)
4144        X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
4145                                    (Short: \p{InMeeteiMayekExt}) (32)
4146          \p{Mend}                \p{Mende_Kikakui} (= \p{Script_Extensions=
4147                                    Mende_Kikakui}) (NOT \p{Block=
4148                                    Mende_Kikakui}) (213)
4149          \p{Mende_Kikakui}       \p{Script_Extensions=Mende_Kikakui}
4150                                    (Short: \p{Mend}; NOT \p{Block=
4151                                    Mende_Kikakui}) (213)
4152          \p{Merc}                \p{Meroitic_Cursive} (=
4153                                    \p{Script_Extensions=Meroitic_Cursive})
4154                                    (NOT \p{Block=Meroitic_Cursive}) (90)
4155          \p{Mero}                \p{Meroitic_Hieroglyphs} (=
4156                                    \p{Script_Extensions=
4157                                    Meroitic_Hieroglyphs}) (32)
4158          \p{Meroitic_Cursive}    \p{Script_Extensions=Meroitic_Cursive}
4159                                    (Short: \p{Merc}; NOT \p{Block=
4160                                    Meroitic_Cursive}) (90)
4161          \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
4162                                    Meroitic_Hieroglyphs} (Short: \p{Mero})
4163                                    (32)
4164          \p{Miao}                \p{Script_Extensions=Miao} (NOT \p{Block=
4165                                    Miao}) (149)
4166        X \p{Misc_Arrows}         \p{Miscellaneous_Symbols_And_Arrows} (=
4167                                    \p{Block=
4168                                    Miscellaneous_Symbols_And_Arrows}) (256)
4169        X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
4170                                    (= \p{Block=
4171                                    Miscellaneous_Mathematical_Symbols_A})
4172                                    (48)
4173        X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
4174                                    (= \p{Block=
4175                                    Miscellaneous_Mathematical_Symbols_B})
4176                                    (128)
4177        X \p{Misc_Pictographs}    \p{Miscellaneous_Symbols_And_Pictographs}
4178                                    (= \p{Block=
4179                                    Miscellaneous_Symbols_And_Pictographs})
4180                                    (768)
4181        X \p{Misc_Symbols}        \p{Miscellaneous_Symbols} (= \p{Block=
4182                                    Miscellaneous_Symbols}) (256)
4183        X \p{Misc_Technical}      \p{Miscellaneous_Technical} (= \p{Block=
4184                                    Miscellaneous_Technical}) (256)
4185        X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
4186                                    Miscellaneous_Mathematical_Symbols_A}
4187                                    (Short: \p{InMiscMathSymbolsA}) (48)
4188        X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
4189                                    Miscellaneous_Mathematical_Symbols_B}
4190                                    (Short: \p{InMiscMathSymbolsB}) (128)
4191        X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
4192                                    \p{InMiscSymbols}) (256)
4193        X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
4194                                    Miscellaneous_Symbols_And_Arrows}
4195                                    (Short: \p{InMiscArrows}) (256)
4196        X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
4197                                    Miscellaneous_Symbols_And_Pictographs}
4198                                    (Short: \p{InMiscPictographs}) (768)
4199        X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
4200                                    (Short: \p{InMiscTechnical}) (256)
4201          \p{Mlym}                \p{Malayalam} (= \p{Script_Extensions=
4202                                    Malayalam}) (NOT \p{Block=Malayalam})
4203                                    (126)
4204          \p{Mn}                  \p{Nonspacing_Mark} (=
4205                                    \p{General_Category=Nonspacing_Mark})
4206                                    (1985)
4207          \p{Modi}                \p{Script_Extensions=Modi} (NOT \p{Block=
4208                                    Modi}) (89)
4209          \p{Modifier_Letter}     \p{General_Category=Modifier_Letter}
4210                                    (Short: \p{Lm}) (397)
4211        X \p{Modifier_Letters}    \p{Spacing_Modifier_Letters} (= \p{Block=
4212                                    Spacing_Modifier_Letters}) (80)
4213          \p{Modifier_Symbol}     \p{General_Category=Modifier_Symbol}
4214                                    (Short: \p{Sk}) (125)
4215        X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
4216          \p{Mong}                \p{Mongolian} (= \p{Script_Extensions=
4217                                    Mongolian}) (NOT \p{Block=Mongolian})
4218                                    (172)
4219          \p{Mongolian}           \p{Script_Extensions=Mongolian} (Short:
4220                                    \p{Mong}; NOT \p{Block=Mongolian}) (172)
4221        X \p{Mongolian_Sup}       \p{Mongolian_Supplement} (= \p{Block=
4222                                    Mongolian_Supplement}) (32)
4223        X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
4224                                    \p{InMongolianSup}) (32)
4225          \p{Mro}                 \p{Script_Extensions=Mro} (NOT \p{Block=
4226                                    Mro}) (43)
4227          \p{Mroo}                \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
4228                                    \p{Block=Mro}) (43)
4229          \p{Mtei}                \p{Meetei_Mayek} (= \p{Script_Extensions=
4230                                    Meetei_Mayek}) (NOT \p{Block=
4231                                    Meetei_Mayek}) (79)
4232          \p{Mult}                \p{Multani} (= \p{Script_Extensions=
4233                                    Multani}) (NOT \p{Block=Multani}) (48)
4234          \p{Multani}             \p{Script_Extensions=Multani} (Short:
4235                                    \p{Mult}; NOT \p{Block=Multani}) (48)
4236        X \p{Music}               \p{Musical_Symbols} (= \p{Block=
4237                                    Musical_Symbols}) (256)
4238        X \p{Musical_Symbols}     \p{Block=Musical_Symbols} (Short:
4239                                    \p{InMusic}) (256)
4240          \p{Myanmar}             \p{Script_Extensions=Myanmar} (Short:
4241                                    \p{Mymr}; NOT \p{Block=Myanmar}) (224)
4242        X \p{Myanmar_Ext_A}       \p{Myanmar_Extended_A} (= \p{Block=
4243                                    Myanmar_Extended_A}) (32)
4244        X \p{Myanmar_Ext_B}       \p{Myanmar_Extended_B} (= \p{Block=
4245                                    Myanmar_Extended_B}) (32)
4246        X \p{Myanmar_Extended_A}  \p{Block=Myanmar_Extended_A} (Short:
4247                                    \p{InMyanmarExtA}) (32)
4248        X \p{Myanmar_Extended_B}  \p{Block=Myanmar_Extended_B} (Short:
4249                                    \p{InMyanmarExtB}) (32)
4250          \p{Mymr}                \p{Myanmar} (= \p{Script_Extensions=
4251                                    Myanmar}) (NOT \p{Block=Myanmar}) (224)
4252          \p{N} \pN               \p{Number} (= \p{General_Category=Number})
4253                                    (1831)
4254          \p{Na=*}                \p{Name=*}
4255          \p{Nabataean}           \p{Script_Extensions=Nabataean} (Short:
4256                                    \p{Nbat}; NOT \p{Block=Nabataean}) (40)
4257          \p{Nag_Mundari}         \p{Script_Extensions=Nag_Mundari} (Short:
4258                                    \p{Nagm}; NOT \p{Block=Nag_Mundari}) (42)
4259          \p{Nagm}                \p{Nag_Mundari} (= \p{Script_Extensions=
4260                                    Nag_Mundari}) (NOT \p{Block=
4261                                    Nag_Mundari}) (42)
4262          \p{Name=*}              Combination of Name and Name_Alias
4263                                    properties; has special loose matching
4264                                    rules, for which see Unicode UAX #44
4265          \p{Nand}                \p{Nandinagari} (= \p{Script_Extensions=
4266                                    Nandinagari}) (NOT \p{Block=
4267                                    Nandinagari}) (86)
4268          \p{Nandinagari}         \p{Script_Extensions=Nandinagari} (Short:
4269                                    \p{Nand}; NOT \p{Block=Nandinagari}) (86)
4270          \p{Narb}                \p{Old_North_Arabian} (=
4271                                    \p{Script_Extensions=Old_North_Arabian})
4272                                    (32)
4273        X \p{NB}                  \p{No_Block} (= \p{Block=No_Block})
4274                                    (820_944 plus all above-Unicode code
4275                                    points)
4276          \p{Nbat}                \p{Nabataean} (= \p{Script_Extensions=
4277                                    Nabataean}) (NOT \p{Block=Nabataean})
4278                                    (40)
4279          \p{NChar}               \p{Noncharacter_Code_Point} (=
4280                                    \p{Noncharacter_Code_Point=Y}) (66)
4281          \p{NChar: *}            \p{Noncharacter_Code_Point: *}
4282          \p{Nd}                  \p{XPosixDigit} (= \p{General_Category=
4283                                    Decimal_Number}) (680)
4284          \p{New_Tai_Lue}         \p{Script_Extensions=New_Tai_Lue} (Short:
4285                                    \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
4286          \p{Newa}                \p{Script_Extensions=Newa} (NOT \p{Block=
4287                                    Newa}) (97)
4288          \p{NFC_QC: *}           \p{NFC_Quick_Check: *}
4289          \p{NFC_Quick_Check: M}  \p{NFC_Quick_Check=Maybe} (111)
4290          \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (111:
4291                                    U+0300..0304, U+0306..030C, U+030F,
4292                                    U+0311, U+0313..0314, U+031B ...)
4293          \p{NFC_Quick_Check: N}  \p{NFC_Quick_Check=No} (NOT
4294                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
4295                                    (1120)
4296          \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
4297                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
4298                                    (1120: U+0340..0341, U+0343..0344,
4299                                    U+0374, U+037E, U+0387, U+0958..095F ...)
4300          \p{NFC_Quick_Check: Y}  \p{NFC_Quick_Check=Yes} (NOT
4301                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
4302                                    (1_112_881 plus all above-Unicode code
4303                                    points)
4304          \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
4305                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
4306                                    (1_112_881 plus all above-Unicode code
4307                                    points: U+0000..02FF, U+0305,
4308                                    U+030D..030E, U+0310, U+0312,
4309                                    U+0315..031A ...)
4310          \p{NFD_QC: *}           \p{NFD_Quick_Check: *}
4311          \p{NFD_Quick_Check: N}  \p{NFD_Quick_Check=No} (NOT
4312                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
4313                                    (13_233)
4314          \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
4315                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
4316                                    (13_233: [\xc0-\xc5\xc7-\xcf\xd1-\xd6
4317                                    \xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-\xf6
4318                                    \xf9-\xfd\xff], U+0100..010F,
4319                                    U+0112..0125, U+0128..0130,
4320                                    U+0134..0137, U+0139..013E ...)
4321          \p{NFD_Quick_Check: Y}  \p{NFD_Quick_Check=Yes} (NOT
4322                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
4323                                    (1_100_879 plus all above-Unicode code
4324                                    points)
4325          \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
4326                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
4327                                    (1_100_879 plus all above-Unicode code
4328                                    points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
4329                                    \xdf\xe6\xf0\xf7-\xf8\xfe],
4330                                    U+0110..0111, U+0126..0127,
4331                                    U+0131..0133, U+0138, U+013F..0142 ...)
4332          \p{NFKC_QC: *}          \p{NFKC_Quick_Check: *}
4333          \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (111)
4334          \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (111:
4335                                    U+0300..0304, U+0306..030C, U+030F,
4336                                    U+0311, U+0313..0314, U+031B ...)
4337          \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
4338                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4339                                    (4928)
4340          \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
4341                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4342                                    (4928: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4343                                    \xba\xbc-\xbe], U+0132..0133,
4344                                    U+013F..0140, U+0149, U+017F,
4345                                    U+01C4..01CC ...)
4346          \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
4347                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4348                                    (1_109_073 plus all above-Unicode code
4349                                    points)
4350          \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
4351                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4352                                    (1_109_073 plus all above-Unicode code
4353                                    points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4354                                    \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
4355                                    U+0100..0131, U+0134..013E,
4356                                    U+0141..0148, U+014A..017E, U+0180..01C3
4357                                    ...)
4358          \p{NFKD_QC: *}          \p{NFKD_Quick_Check: *}
4359          \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
4360                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4361                                    (17_029)
4362          \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
4363                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4364                                    (17_029: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4365                                    \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
4366                                    \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
4367                                    \xf6\xf9-\xfd\xff], U+0100..010F,
4368                                    U+0112..0125, U+0128..0130,
4369                                    U+0132..0137, U+0139..0140 ...)
4370          \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
4371                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4372                                    (1_097_083 plus all above-Unicode code
4373                                    points)
4374          \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
4375                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4376                                    (1_097_083 plus all above-Unicode code
4377                                    points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4378                                    \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xc6\xd0
4379                                    \xd7-\xd8\xde-\xdf\xe6\xf0\xf7-\xf8
4380                                    \xfe], U+0110..0111, U+0126..0127,
4381                                    U+0131, U+0138, U+0141..0142 ...)
4382          \p{Nko}                 \p{Script_Extensions=Nko} (NOT \p{Block=
4383                                    NKo}) (67)
4384          \p{Nkoo}                \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
4385                                    \p{Block=NKo}) (67)
4386          \p{Nl}                  \p{Letter_Number} (= \p{General_Category=
4387                                    Letter_Number}) (236)
4388          \p{No}                  \p{Other_Number} (= \p{General_Category=
4389                                    Other_Number}) (915)
4390        X \p{No_Block}            \p{Block=No_Block} (Short: \p{InNB})
4391                                    (820_944 plus all above-Unicode code
4392                                    points)
4393          \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
4394                                    \p{NChar}) (66)
4395          \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
4396                                    (1_114_046 plus all above-Unicode code
4397                                    points: U+0000..FDCF, U+FDF0..FFFD,
4398                                    U+10000..1FFFD, U+20000..2FFFD,
4399                                    U+30000..3FFFD, U+40000..4FFFD ...)
4400          \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
4401                                    (66: U+FDD0..FDEF, U+FFFE..FFFF,
4402                                    U+1FFFE..1FFFF, U+2FFFE..2FFFF,
4403                                    U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
4404          \p{Nonspacing_Mark}     \p{General_Category=Nonspacing_Mark}
4405                                    (Short: \p{Mn}) (1985)
4406          \p{Nshu}                \p{Nushu} (= \p{Script_Extensions=Nushu})
4407                                    (NOT \p{Block=Nushu}) (397)
4408          \p{Nt: *}               \p{Numeric_Type: *}
4409          \p{Number}              \p{General_Category=Number} (Short: \p{N})
4410                                    (1831)
4411        X \p{Number_Forms}        \p{Block=Number_Forms} (64)
4412          \p{Numeric_Type: De}    \p{Numeric_Type=Decimal} (680)
4413          \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (680: [0-9],
4414                                    U+0660..0669, U+06F0..06F9,
4415                                    U+07C0..07C9, U+0966..096F, U+09E6..09EF
4416                                    ...)
4417          \p{Numeric_Type: Di}    \p{Numeric_Type=Digit} (128)
4418          \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
4419                                    U+1369..1371, U+19DA, U+2070,
4420                                    U+2074..2079, U+2080..2089 ...)
4421          \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_200 plus all
4422                                    above-Unicode code points: [\x00-\x20!
4423                                    \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4424                                    \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4425                                    \xbb\xbf-\xff], U+0100..065F,
4426                                    U+066A..06EF, U+06FA..07BF,
4427                                    U+07CA..0965, U+0970..09E5 ...)
4428          \p{Numeric_Type: Nu}    \p{Numeric_Type=Numeric} (1104)
4429          \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1104: [\xbc-\xbe],
4430                                    U+09F4..09F9, U+0B72..0B77,
4431                                    U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
4432                                    ...)
4433        T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
4434        T \p{Numeric_Value: 0}    (Short: \p{Nv=0}) (87: [0], U+0660,
4435                                    U+06F0, U+07C0, U+0966, U+09E6 ...)
4436        T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
4437                                    U+11FD4)
4438        T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
4439        T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
4440        T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
4441        T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
4442        T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
4443        T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
4444        T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
4445        T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
4446        T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
4447                                    U+0D76, U+A833, U+11FC9..11FCA)
4448        T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
4449        T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
4450                                    U+11FCB)
4451        T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1: U+2151)
4452        T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
4453                                    U+0D77, U+215B, U+A834, U+11FCC ...)
4454        T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1: U+2150)
4455        T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
4456        T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
4457                                    U+12461, U+1ED3D)
4458        T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
4459                                    U+0D78, U+A835, U+11FCE)
4460        T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
4461                                    U+11FCF)
4462        T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
4463                                    U+0B72, U+0D73, U+A830, U+10140 ...)
4464        T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
4465                                    U+10E7D, U+1245A, U+1245D, U+12465)
4466        T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1: U+215C)
4467        T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1: U+2156)
4468        T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
4469        T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
4470                                    U+0D74, U+0F2A, U+2CFD, U+A831 ...)
4471        T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
4472        T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1: U+2157)
4473        T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1: U+215D)
4474        T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
4475                                    U+109FD, U+10E7E, U+1245B, U+1245E ...)
4476        T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
4477                                    U+0B74, U+0D75, U+A832, U+10178 ...)
4478        T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1: U+2158)
4479        T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
4480                                    U+1245C)
4481        T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1: U+215E)
4482        T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
4483        T \p{Numeric_Value: 1}    (Short: \p{Nv=1}) (144: [1\xb9], U+0661,
4484                                    U+06F1, U+07C1, U+0967, U+09E7 ...)
4485        T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1: U+0F2B)
4486        T \p{Numeric_Value: 2}    (Short: \p{Nv=2}) (143: [2\xb2], U+0662,
4487                                    U+06F2, U+07C2, U+0968, U+09E8 ...)
4488        T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1: U+0F2C)
4489        T \p{Numeric_Value: 3}    (Short: \p{Nv=3}) (144: [3\xb3], U+0663,
4490                                    U+06F3, U+07C3, U+0969, U+09E9 ...)
4491        T \p{Numeric_Value: 7/2}  (Short: \p{Nv=7/2}) (1: U+0F2D)
4492        T \p{Numeric_Value: 4}    (Short: \p{Nv=4}) (135: [4], U+0664,
4493                                    U+06F4, U+07C4, U+096A, U+09EA ...)
4494        T \p{Numeric_Value: 9/2}  (Short: \p{Nv=9/2}) (1: U+0F2E)
4495        T \p{Numeric_Value: 5}    (Short: \p{Nv=5}) (133: [5], U+0665,
4496                                    U+06F5, U+07C5, U+096B, U+09EB ...)
4497        T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
4498        T \p{Numeric_Value: 6}    (Short: \p{Nv=6}) (117: [6], U+0666,
4499                                    U+06F6, U+07C6, U+096C, U+09EC ...)
4500        T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
4501        T \p{Numeric_Value: 7}    (Short: \p{Nv=7}) (116: [7], U+0667,
4502                                    U+06F7, U+07C7, U+096D, U+09ED ...)
4503        T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
4504        T \p{Numeric_Value: 8}    (Short: \p{Nv=8}) (112: [8], U+0668,
4505                                    U+06F8, U+07C8, U+096E, U+09EE ...)
4506        T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
4507        T \p{Numeric_Value: 9}    (Short: \p{Nv=9}) (116: [9], U+0669,
4508                                    U+06F9, U+07C9, U+096F, U+09EF ...)
4509        T \p{Numeric_Value: 10}   (Short: \p{Nv=10}) (63: U+0BF0, U+0D70,
4510                                    U+1372, U+2169, U+2179, U+2469 ...)
4511        T \p{Numeric_Value: 11}   (Short: \p{Nv=11}) (9: U+216A, U+217A,
4512                                    U+246A, U+247E, U+2492, U+24EB ...)
4513        T \p{Numeric_Value: 12}   (Short: \p{Nv=12}) (9: U+216B, U+217B,
4514                                    U+246B, U+247F, U+2493, U+24EC ...)
4515        T \p{Numeric_Value: 13}   (Short: \p{Nv=13}) (7: U+246C, U+2480,
4516                                    U+2494, U+24ED, U+16E8D, U+1D2CD ...)
4517        T \p{Numeric_Value: 14}   (Short: \p{Nv=14}) (7: U+246D, U+2481,
4518                                    U+2495, U+24EE, U+16E8E, U+1D2CE ...)
4519        T \p{Numeric_Value: 15}   (Short: \p{Nv=15}) (7: U+246E, U+2482,
4520                                    U+2496, U+24EF, U+16E8F, U+1D2CF ...)
4521        T \p{Numeric_Value: 16}   (Short: \p{Nv=16}) (8: U+09F9, U+246F,
4522                                    U+2483, U+2497, U+24F0, U+16E90 ...)
4523        T \p{Numeric_Value: 17}   (Short: \p{Nv=17}) (8: U+16EE, U+2470,
4524                                    U+2484, U+2498, U+24F1, U+16E91 ...)
4525        T \p{Numeric_Value: 18}   (Short: \p{Nv=18}) (8: U+16EF, U+2471,
4526                                    U+2485, U+2499, U+24F2, U+16E92 ...)
4527        T \p{Numeric_Value: 19}   (Short: \p{Nv=19}) (8: U+16F0, U+2472,
4528                                    U+2486, U+249A, U+24F3, U+16E93 ...)
4529        T \p{Numeric_Value: 20}   (Short: \p{Nv=20}) (36: U+1373, U+2473,
4530                                    U+2487, U+249B, U+24F4, U+3039 ...)
4531        T \p{Numeric_Value: 21}   (Short: \p{Nv=21}) (1: U+3251)
4532        T \p{Numeric_Value: 22}   (Short: \p{Nv=22}) (1: U+3252)
4533        T \p{Numeric_Value: 23}   (Short: \p{Nv=23}) (1: U+3253)
4534        T \p{Numeric_Value: 24}   (Short: \p{Nv=24}) (1: U+3254)
4535        T \p{Numeric_Value: 25}   (Short: \p{Nv=25}) (1: U+3255)
4536        T \p{Numeric_Value: 26}   (Short: \p{Nv=26}) (1: U+3256)
4537        T \p{Numeric_Value: 27}   (Short: \p{Nv=27}) (1: U+3257)
4538        T \p{Numeric_Value: 28}   (Short: \p{Nv=28}) (1: U+3258)
4539        T \p{Numeric_Value: 29}   (Short: \p{Nv=29}) (1: U+3259)
4540        T \p{Numeric_Value: 30}   (Short: \p{Nv=30}) (19: U+1374, U+303A,
4541                                    U+324A, U+325A, U+5345, U+10112 ...)
4542        T \p{Numeric_Value: 31}   (Short: \p{Nv=31}) (1: U+325B)
4543        T \p{Numeric_Value: 32}   (Short: \p{Nv=32}) (1: U+325C)
4544        T \p{Numeric_Value: 33}   (Short: \p{Nv=33}) (1: U+325D)
4545        T \p{Numeric_Value: 34}   (Short: \p{Nv=34}) (1: U+325E)
4546        T \p{Numeric_Value: 35}   (Short: \p{Nv=35}) (1: U+325F)
4547        T \p{Numeric_Value: 36}   (Short: \p{Nv=36}) (1: U+32B1)
4548        T \p{Numeric_Value: 37}   (Short: \p{Nv=37}) (1: U+32B2)
4549        T \p{Numeric_Value: 38}   (Short: \p{Nv=38}) (1: U+32B3)
4550        T \p{Numeric_Value: 39}   (Short: \p{Nv=39}) (1: U+32B4)
4551        T \p{Numeric_Value: 40}   (Short: \p{Nv=40}) (18: U+1375, U+324B,
4552                                    U+32B5, U+534C, U+10113, U+102ED ...)
4553        T \p{Numeric_Value: 41}   (Short: \p{Nv=41}) (1: U+32B6)
4554        T \p{Numeric_Value: 42}   (Short: \p{Nv=42}) (1: U+32B7)
4555        T \p{Numeric_Value: 43}   (Short: \p{Nv=43}) (1: U+32B8)
4556        T \p{Numeric_Value: 44}   (Short: \p{Nv=44}) (1: U+32B9)
4557        T \p{Numeric_Value: 45}   (Short: \p{Nv=45}) (1: U+32BA)
4558        T \p{Numeric_Value: 46}   (Short: \p{Nv=46}) (1: U+32BB)
4559        T \p{Numeric_Value: 47}   (Short: \p{Nv=47}) (1: U+32BC)
4560        T \p{Numeric_Value: 48}   (Short: \p{Nv=48}) (1: U+32BD)
4561        T \p{Numeric_Value: 49}   (Short: \p{Nv=49}) (1: U+32BE)
4562        T \p{Numeric_Value: 50}   (Short: \p{Nv=50}) (29: U+1376, U+216C,
4563                                    U+217C, U+2186, U+324C, U+32BF ...)
4564        T \p{Numeric_Value: 60}   (Short: \p{Nv=60}) (13: U+1377, U+324D,
4565                                    U+10115, U+102EF, U+109CE, U+10E6E ...)
4566        T \p{Numeric_Value: 70}   (Short: \p{Nv=70}) (13: U+1378, U+324E,
4567                                    U+10116, U+102F0, U+109CF, U+10E6F ...)
4568        T \p{Numeric_Value: 80}   (Short: \p{Nv=80}) (12: U+1379, U+324F,
4569                                    U+10117, U+102F1, U+10E70, U+11062 ...)
4570        T \p{Numeric_Value: 90}   (Short: \p{Nv=90}) (12: U+137A, U+10118,
4571                                    U+102F2, U+10341, U+10E71, U+11063 ...)
4572        T \p{Numeric_Value: 100}  (Short: \p{Nv=100}) (35: U+0BF1, U+0D71,
4573                                    U+137B, U+216D, U+217D, U+4F70 ...)
4574        T \p{Numeric_Value: 200}  (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
4575                                    U+109D3, U+10E73, U+1EC84, U+1ED14)
4576        T \p{Numeric_Value: 300}  (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
4577                                    U+102F5, U+109D4, U+10E74, U+1EC85 ...)
4578        T \p{Numeric_Value: 400}  (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
4579                                    U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
4580        T \p{Numeric_Value: 500}  (Short: \p{Nv=500}) (16: U+216E, U+217E,
4581                                    U+1011D, U+10145, U+1014C, U+10153 ...)
4582        T \p{Numeric_Value: 600}  (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
4583                                    U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
4584        T \p{Numeric_Value: 700}  (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
4585                                    U+109D8, U+10E78, U+1EC89, U+1ED19)
4586        T \p{Numeric_Value: 800}  (Short: \p{Nv=800}) (6: U+10120, U+102FA,
4587                                    U+109D9, U+10E79, U+1EC8A, U+1ED1A)
4588        T \p{Numeric_Value: 900}  (Short: \p{Nv=900}) (7: U+10121, U+102FB,
4589                                    U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
4590        T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
4591                                    U+216F, U+217F..2180, U+4EDF, U+5343 ...)
4592        T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
4593                                    U+1EC8D, U+1ED1D, U+1ED3A)
4594        T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
4595                                    U+1EC8E, U+1ED1E)
4596        T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
4597                                    U+1EC8F, U+1ED1F)
4598        T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
4599                                    U+10146, U+1014E, U+10172, U+109DF ...)
4600        T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
4601                                    U+1EC91, U+1ED21)
4602        T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
4603                                    U+1EC92, U+1ED22)
4604        T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
4605                                    U+1EC93, U+1ED23)
4606        T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
4607                                    U+1EC94, U+1ED24)
4608        T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
4609                                    U+137C, U+2182, U+4E07, U+842C, U+1012B,
4610                                    U+10155 ...)
4611        T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
4612                                    U+1012C, U+109E5, U+1EC96, U+1ED26)
4613        T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
4614                                    U+1012D, U+109E6, U+1EC97, U+1ED27)
4615        T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
4616                                    U+1012E, U+109E7, U+1EC98, U+1ED28)
4617        T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
4618                                    U+2187, U+1012F, U+10147, U+10156,
4619                                    U+109E8, U+1EC99 ...)
4620        T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
4621                                    U+10130, U+109E9, U+1EC9A, U+1ED2A)
4622        T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
4623                                    U+10131, U+109EA, U+1EC9B, U+1ED2B)
4624        T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
4625                                    U+10132, U+109EB, U+1EC9C, U+1ED2C)
4626        T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
4627                                    U+10133, U+109EC, U+1EC9D, U+1ED2D)
4628        T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
4629                                    U+2188, U+109ED, U+1EC9E, U+1ECA0,
4630                                    U+1ECB4)
4631        T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
4632                                    U+109EE, U+1EC9F)
4633        T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
4634                                    U+12432)
4635        T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
4636                                    U+109EF)
4637        T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
4638                                    U+109F0)
4639        T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
4640                                    U+12433)
4641        T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
4642                                    U+109F1)
4643        T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
4644                                    U+109F2)
4645        T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
4646                                    U+109F3)
4647        T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
4648                                    U+109F4)
4649        T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
4650                                    U+109F5)
4651        T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
4652                                    U+16B5E)
4653        T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
4654                                    (1: U+1ECA1)
4655        T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
4656                                    (1: U+1ECA2)
4657        T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
4658                                    (3: U+4EBF, U+5104, U+16B5F)
4659        T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
4660                                    10000000000}) (1: U+16B60)
4661        T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
4662                                    1000000000000}) (2: U+5146, U+16B61)
4663          \p{Numeric_Value: NaN}  (Short: \p{Nv=NaN}) (1_112_200 plus all
4664                                    above-Unicode code points: [\x00-\x20!
4665                                    \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4666                                    \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4667                                    \xbb\xbf-\xff], U+0100..065F,
4668                                    U+066A..06EF, U+06FA..07BF,
4669                                    U+07CA..0965, U+0970..09E5 ...)
4670          \p{Nushu}               \p{Script_Extensions=Nushu} (Short:
4671                                    \p{Nshu}; NOT \p{Block=Nushu}) (397)
4672          \p{Nv: *}               \p{Numeric_Value: *}
4673          \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
4674                                    Nyiakeng_Puachue_Hmong} (Short:
4675                                    \p{Hmnp}; NOT \p{Block=
4676                                    Nyiakeng_Puachue_Hmong}) (71)
4677        X \p{OCR}                 \p{Optical_Character_Recognition} (=
4678                                    \p{Block=Optical_Character_Recognition})
4679                                    (32)
4680          \p{Ogam}                \p{Ogham} (= \p{Script_Extensions=Ogham})
4681                                    (NOT \p{Block=Ogham}) (29)
4682          \p{Ogham}               \p{Script_Extensions=Ogham} (Short:
4683                                    \p{Ogam}; NOT \p{Block=Ogham}) (29)
4684          \p{Ol_Chiki}            \p{Script_Extensions=Ol_Chiki} (Short:
4685                                    \p{Olck}) (48)
4686          \p{Olck}                \p{Ol_Chiki} (= \p{Script_Extensions=
4687                                    Ol_Chiki}) (48)
4688          \p{Old_Hungarian}       \p{Script_Extensions=Old_Hungarian}
4689                                    (Short: \p{Hung}; NOT \p{Block=
4690                                    Old_Hungarian}) (108)
4691          \p{Old_Italic}          \p{Script_Extensions=Old_Italic} (Short:
4692                                    \p{Ital}; NOT \p{Block=Old_Italic}) (39)
4693          \p{Old_North_Arabian}   \p{Script_Extensions=Old_North_Arabian}
4694                                    (Short: \p{Narb}) (32)
4695          \p{Old_Permic}          \p{Script_Extensions=Old_Permic} (Short:
4696                                    \p{Perm}; NOT \p{Block=Old_Permic}) (44)
4697          \p{Old_Persian}         \p{Script_Extensions=Old_Persian} (Short:
4698                                    \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
4699          \p{Old_Sogdian}         \p{Script_Extensions=Old_Sogdian} (Short:
4700                                    \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
4701          \p{Old_South_Arabian}   \p{Script_Extensions=Old_South_Arabian}
4702                                    (Short: \p{Sarb}) (32)
4703          \p{Old_Turkic}          \p{Script_Extensions=Old_Turkic} (Short:
4704                                    \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
4705          \p{Old_Uyghur}          \p{Script_Extensions=Old_Uyghur} (Short:
4706                                    \p{Ougr}; NOT \p{Block=Old_Uyghur}) (28)
4707          \p{Open_Punctuation}    \p{General_Category=Open_Punctuation}
4708                                    (Short: \p{Ps}) (79)
4709        X \p{Optical_Character_Recognition} \p{Block=
4710                                    Optical_Character_Recognition} (Short:
4711                                    \p{InOCR}) (32)
4712          \p{Oriya}               \p{Script_Extensions=Oriya} (Short:
4713                                    \p{Orya}; NOT \p{Block=Oriya}) (97)
4714          \p{Orkh}                \p{Old_Turkic} (= \p{Script_Extensions=
4715                                    Old_Turkic}) (NOT \p{Block=Old_Turkic})
4716                                    (73)
4717        X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
4718          \p{Orya}                \p{Oriya} (= \p{Script_Extensions=Oriya})
4719                                    (NOT \p{Block=Oriya}) (97)
4720          \p{Osage}               \p{Script_Extensions=Osage} (Short:
4721                                    \p{Osge}; NOT \p{Block=Osage}) (72)
4722          \p{Osge}                \p{Osage} (= \p{Script_Extensions=Osage})
4723                                    (NOT \p{Block=Osage}) (72)
4724          \p{Osma}                \p{Osmanya} (= \p{Script_Extensions=
4725                                    Osmanya}) (NOT \p{Block=Osmanya}) (40)
4726          \p{Osmanya}             \p{Script_Extensions=Osmanya} (Short:
4727                                    \p{Osma}; NOT \p{Block=Osmanya}) (40)
4728          \p{Other}               \p{General_Category=Other} (Short: \p{C})
4729                                    (965_096 plus all above-Unicode code
4730                                    points)
4731          \p{Other_Letter}        \p{General_Category=Other_Letter} (Short:
4732                                    \p{Lo}) (131_612)
4733          \p{Other_Number}        \p{General_Category=Other_Number} (Short:
4734                                    \p{No}) (915)
4735          \p{Other_Punctuation}   \p{General_Category=Other_Punctuation}
4736                                    (Short: \p{Po}) (628)
4737          \p{Other_Symbol}        \p{General_Category=Other_Symbol} (Short:
4738                                    \p{So}) (6634)
4739        X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
4740          \p{Ougr}                \p{Old_Uyghur} (= \p{Script_Extensions=
4741                                    Old_Uyghur}) (NOT \p{Block=Old_Uyghur})
4742                                    (28)
4743          \p{P} \pP               \p{Punct} (= \p{General_Category=
4744                                    Punctuation}) (NOT
4745                                    \p{General_Punctuation}) (842)
4746          \p{Pahawh_Hmong}        \p{Script_Extensions=Pahawh_Hmong} (Short:
4747                                    \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
4748                                    (127)
4749          \p{Palm}                \p{Palmyrene} (= \p{Script_Extensions=
4750                                    Palmyrene}) (32)
4751          \p{Palmyrene}           \p{Script_Extensions=Palmyrene} (Short:
4752                                    \p{Palm}) (32)
4753          \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
4754                                    (Short: \p{Zp}) (1)
4755          \p{Pat_Syn}             \p{Pattern_Syntax} (= \p{Pattern_Syntax=
4756                                    Y}) (2760)
4757          \p{Pat_Syn: *}          \p{Pattern_Syntax: *}
4758          \p{Pat_WS}              \p{Pattern_White_Space} (=
4759                                    \p{Pattern_White_Space=Y}) (11)
4760          \p{Pat_WS: *}           \p{Pattern_White_Space: *}
4761          \p{Pattern_Syntax}      \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
4762                                    (2760)
4763          \p{Pattern_Syntax: N*}  (Short: \p{PatSyn=N}, \P{PatSyn})
4764                                    (1_111_352 plus all above-Unicode code
4765                                    points: [\x00-\x200-9A-Z_a-z\x7f-\xa0
4766                                    \xa8\xaa\xad\xaf\xb2-\xb5\xb7-\xba\xbc-
4767                                    \xbe\xc0-\xd6\xd8-\xf6\xf8-\xff],
4768                                    U+0100..200F, U+2028..202F,
4769                                    U+203F..2040, U+2054, U+205F..218F ...)
4770          \p{Pattern_Syntax: Y*}  (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
4771                                    [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
4772                                    \^`\{\|\}~\xa1-\xa7\xa9\xab-\xac\xae
4773                                    \xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
4774                                    U+2010..2027, U+2030..203E,
4775                                    U+2041..2053, U+2055..205E, U+2190..245F
4776                                    ...)
4777          \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
4778                                    \p{PatWS}) (11)
4779          \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
4780                                    (1_114_101 plus all above-Unicode code
4781                                    points: [^\t\n\cK\f\r\x20\x85],
4782                                    U+0100..200D, U+2010..2027,
4783                                    U+202A..infinity)
4784          \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
4785                                    [\t\n\cK\f\r\x20\x85], U+200E..200F,
4786                                    U+2028..2029)
4787          \p{Pau_Cin_Hau}         \p{Script_Extensions=Pau_Cin_Hau} (Short:
4788                                    \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
4789          \p{Pauc}                \p{Pau_Cin_Hau} (= \p{Script_Extensions=
4790                                    Pau_Cin_Hau}) (NOT \p{Block=
4791                                    Pau_Cin_Hau}) (57)
4792          \p{Pc}                  \p{Connector_Punctuation} (=
4793                                    \p{General_Category=
4794                                    Connector_Punctuation}) (10)
4795          \p{PCM}                 \p{Prepended_Concatenation_Mark} (=
4796                                    \p{Prepended_Concatenation_Mark=Y}) (13)
4797          \p{PCM: *}              \p{Prepended_Concatenation_Mark: *}
4798          \p{Pd}                  \p{Dash_Punctuation} (=
4799                                    \p{General_Category=Dash_Punctuation})
4800                                    (26)
4801          \p{Pe}                  \p{Close_Punctuation} (=
4802                                    \p{General_Category=Close_Punctuation})
4803                                    (77)
4804          \p{PerlSpace}           \p{PosixSpace} (6)
4805          \p{PerlWord}            \p{PosixWord} (63)
4806          \p{Perm}                \p{Old_Permic} (= \p{Script_Extensions=
4807                                    Old_Permic}) (NOT \p{Block=Old_Permic})
4808                                    (44)
4809          \p{Pf}                  \p{Final_Punctuation} (=
4810                                    \p{General_Category=Final_Punctuation})
4811                                    (10)
4812          \p{Phag}                \p{Phags_Pa} (= \p{Script_Extensions=
4813                                    Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
4814          \p{Phags_Pa}            \p{Script_Extensions=Phags_Pa} (Short:
4815                                    \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
4816        X \p{Phaistos}            \p{Phaistos_Disc} (= \p{Block=
4817                                    Phaistos_Disc}) (48)
4818        X \p{Phaistos_Disc}       \p{Block=Phaistos_Disc} (Short:
4819                                    \p{InPhaistos}) (48)
4820          \p{Phli}                \p{Inscriptional_Pahlavi} (=
4821                                    \p{Script_Extensions=
4822                                    Inscriptional_Pahlavi}) (NOT \p{Block=
4823                                    Inscriptional_Pahlavi}) (27)
4824          \p{Phlp}                \p{Psalter_Pahlavi} (=
4825                                    \p{Script_Extensions=Psalter_Pahlavi})
4826                                    (NOT \p{Block=Psalter_Pahlavi}) (30)
4827          \p{Phnx}                \p{Phoenician} (= \p{Script_Extensions=
4828                                    Phoenician}) (NOT \p{Block=Phoenician})
4829                                    (29)
4830          \p{Phoenician}          \p{Script_Extensions=Phoenician} (Short:
4831                                    \p{Phnx}; NOT \p{Block=Phoenician}) (29)
4832        X \p{Phonetic_Ext}        \p{Phonetic_Extensions} (= \p{Block=
4833                                    Phonetic_Extensions}) (128)
4834        X \p{Phonetic_Ext_Sup}    \p{Phonetic_Extensions_Supplement} (=
4835                                    \p{Block=
4836                                    Phonetic_Extensions_Supplement}) (64)
4837        X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
4838                                    \p{InPhoneticExt}) (128)
4839        X \p{Phonetic_Extensions_Supplement} \p{Block=
4840                                    Phonetic_Extensions_Supplement} (Short:
4841                                    \p{InPhoneticExtSup}) (64)
4842          \p{Pi}                  \p{Initial_Punctuation} (=
4843                                    \p{General_Category=
4844                                    Initial_Punctuation}) (12)
4845        X \p{Playing_Cards}       \p{Block=Playing_Cards} (96)
4846          \p{Plrd}                \p{Miao} (= \p{Script_Extensions=Miao})
4847                                    (NOT \p{Block=Miao}) (149)
4848          \p{Po}                  \p{Other_Punctuation} (=
4849                                    \p{General_Category=Other_Punctuation})
4850                                    (628)
4851          \p{PosixAlnum}          (62: [0-9A-Za-z])
4852          \p{PosixAlpha}          (52: [A-Za-z])
4853          \p{PosixBlank}          (2: [\t\x20])
4854          \p{PosixCntrl}          ASCII control characters (33: ACK, BEL,
4855                                    BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
4856                                    DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
4857                                    FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
4858                                    SOH, STX, SUB, SYN, US, VT)
4859          \p{PosixDigit}          (10: [0-9])
4860          \p{PosixGraph}          (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
4861                                    Z\[\\\]\^_`a-z\{\|\}~])
4862          \p{PosixLower}          (/i= PosixAlpha) (26: [a-z])
4863          \p{PosixPrint}          (95: [\x20-\x7e])
4864          \p{PosixPunct}          (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
4865                                    \[\\\]\^_`\{\|\}~])
4866          \p{PosixSpace}          (Short: \p{PerlSpace}) (6: [\t\n\cK\f\r
4867                                    \x20])
4868          \p{PosixUpper}          (/i= PosixAlpha) (26: [A-Z])
4869          \p{PosixWord}           \w, restricted to ASCII (Short:
4870                                    \p{PerlWord}) (63: [0-9A-Z_a-z])
4871          \p{PosixXDigit}         \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
4872                                    (22)
4873          \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
4874                                    Y} (Short: \p{PCM}) (13)
4875          \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
4876                                    (1_114_099 plus all above-Unicode code
4877                                    points: U+0000..05FF, U+0606..06DC,
4878                                    U+06DE..070E, U+0710..088F,
4879                                    U+0892..08E1, U+08E3..110BC ...)
4880          \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
4881                                    (13: U+0600..0605, U+06DD, U+070F,
4882                                    U+0890..0891, U+08E2, U+110BD ...)
4883        T \p{Present_In: 1.1}     \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
4884                                    extension) (33_979)
4885          \p{Present_In: V1_1}    \p{Present_In=1.1} (= \p{Age=V1_1}) (Perl
4886                                    extension) (33_979)
4887        T \p{Present_In: 2.0}     Code point's usage introduced in version
4888                                    2.0 or earlier (Short: \p{In=2.0}) (Perl
4889                                    extension) (178_500: U+0000..01F5,
4890                                    U+01FA..0217, U+0250..02A8,
4891                                    U+02B0..02DE, U+02E0..02E9, U+0300..0345
4892                                    ...)
4893          \p{Present_In: V2_0}    \p{Present_In=2.0} (Perl extension)
4894                                    (178_500)
4895        T \p{Present_In: 2.1}     Code point's usage introduced in version
4896                                    2.1 or earlier (Short: \p{In=2.1}) (Perl
4897                                    extension) (178_502: U+0000..01F5,
4898                                    U+01FA..0217, U+0250..02A8,
4899                                    U+02B0..02DE, U+02E0..02E9, U+0300..0345
4900                                    ...)
4901          \p{Present_In: V2_1}    \p{Present_In=2.1} (Perl extension)
4902                                    (178_502)
4903        T \p{Present_In: 3.0}     Code point's usage introduced in version
4904                                    3.0 or earlier (Short: \p{In=3.0}) (Perl
4905                                    extension) (188_809: U+0000..021F,
4906                                    U+0222..0233, U+0250..02AD,
4907                                    U+02B0..02EE, U+0300..034E, U+0360..0362
4908                                    ...)
4909          \p{Present_In: V3_0}    \p{Present_In=3.0} (Perl extension)
4910                                    (188_809)
4911        T \p{Present_In: 3.1}     Code point's usage introduced in version
4912                                    3.1 or earlier (Short: \p{In=3.1}) (Perl
4913                                    extension) (233_787: U+0000..021F,
4914                                    U+0222..0233, U+0250..02AD,
4915                                    U+02B0..02EE, U+0300..034E, U+0360..0362
4916                                    ...)
4917          \p{Present_In: V3_1}    \p{Present_In=3.1} (Perl extension)
4918                                    (233_787)
4919        T \p{Present_In: 3.2}     Code point's usage introduced in version
4920                                    3.2 or earlier (Short: \p{In=3.2}) (Perl
4921                                    extension) (234_803: U+0000..0220,
4922                                    U+0222..0233, U+0250..02AD,
4923                                    U+02B0..02EE, U+0300..034F, U+0360..036F
4924                                    ...)
4925          \p{Present_In: V3_2}    \p{Present_In=3.2} (Perl extension)
4926                                    (234_803)
4927        T \p{Present_In: 4.0}     Code point's usage introduced in version
4928                                    4.0 or earlier (Short: \p{In=4.0}) (Perl
4929                                    extension) (236_029: U+0000..0236,
4930                                    U+0250..0357, U+035D..036F,
4931                                    U+0374..0375, U+037A, U+037E ...)
4932          \p{Present_In: V4_0}    \p{Present_In=4.0} (Perl extension)
4933                                    (236_029)
4934        T \p{Present_In: 4.1}     Code point's usage introduced in version
4935                                    4.1 or earlier (Short: \p{In=4.1}) (Perl
4936                                    extension) (237_302: U+0000..0241,
4937                                    U+0250..036F, U+0374..0375, U+037A,
4938                                    U+037E, U+0384..038A ...)
4939          \p{Present_In: V4_1}    \p{Present_In=4.1} (Perl extension)
4940                                    (237_302)
4941        T \p{Present_In: 5.0}     Code point's usage introduced in version
4942                                    5.0 or earlier (Short: \p{In=5.0}) (Perl
4943                                    extension) (238_671: U+0000..036F,
4944                                    U+0374..0375, U+037A..037E,
4945                                    U+0384..038A, U+038C, U+038E..03A1 ...)
4946          \p{Present_In: V5_0}    \p{Present_In=5.0} (Perl extension)
4947                                    (238_671)
4948        T \p{Present_In: 5.1}     Code point's usage introduced in version
4949                                    5.1 or earlier (Short: \p{In=5.1}) (Perl
4950                                    extension) (240_295: U+0000..0377,
4951                                    U+037A..037E, U+0384..038A, U+038C,
4952                                    U+038E..03A1, U+03A3..0523 ...)
4953          \p{Present_In: V5_1}    \p{Present_In=5.1} (Perl extension)
4954                                    (240_295)
4955        T \p{Present_In: 5.2}     Code point's usage introduced in version
4956                                    5.2 or earlier (Short: \p{In=5.2}) (Perl
4957                                    extension) (246_943: U+0000..0377,
4958                                    U+037A..037E, U+0384..038A, U+038C,
4959                                    U+038E..03A1, U+03A3..0525 ...)
4960          \p{Present_In: V5_2}    \p{Present_In=5.2} (Perl extension)
4961                                    (246_943)
4962        T \p{Present_In: 6.0}     Code point's usage introduced in version
4963                                    6.0 or earlier (Short: \p{In=6.0}) (Perl
4964                                    extension) (249_031: U+0000..0377,
4965                                    U+037A..037E, U+0384..038A, U+038C,
4966                                    U+038E..03A1, U+03A3..0527 ...)
4967          \p{Present_In: V6_0}    \p{Present_In=6.0} (Perl extension)
4968                                    (249_031)
4969        T \p{Present_In: 6.1}     Code point's usage introduced in version
4970                                    6.1 or earlier (Short: \p{In=6.1}) (Perl
4971                                    extension) (249_763: U+0000..0377,
4972                                    U+037A..037E, U+0384..038A, U+038C,
4973                                    U+038E..03A1, U+03A3..0527 ...)
4974          \p{Present_In: V6_1}    \p{Present_In=6.1} (Perl extension)
4975                                    (249_763)
4976        T \p{Present_In: 6.2}     Code point's usage introduced in version
4977                                    6.2 or earlier (Short: \p{In=6.2}) (Perl
4978                                    extension) (249_764: U+0000..0377,
4979                                    U+037A..037E, U+0384..038A, U+038C,
4980                                    U+038E..03A1, U+03A3..0527 ...)
4981          \p{Present_In: V6_2}    \p{Present_In=6.2} (Perl extension)
4982                                    (249_764)
4983        T \p{Present_In: 6.3}     Code point's usage introduced in version
4984                                    6.3 or earlier (Short: \p{In=6.3}) (Perl
4985                                    extension) (249_769: U+0000..0377,
4986                                    U+037A..037E, U+0384..038A, U+038C,
4987                                    U+038E..03A1, U+03A3..0527 ...)
4988          \p{Present_In: V6_3}    \p{Present_In=6.3} (Perl extension)
4989                                    (249_769)
4990        T \p{Present_In: 7.0}     Code point's usage introduced in version
4991                                    7.0 or earlier (Short: \p{In=7.0}) (Perl
4992                                    extension) (252_603: U+0000..0377,
4993                                    U+037A..037F, U+0384..038A, U+038C,
4994                                    U+038E..03A1, U+03A3..052F ...)
4995          \p{Present_In: V7_0}    \p{Present_In=7.0} (Perl extension)
4996                                    (252_603)
4997        T \p{Present_In: 8.0}     Code point's usage introduced in version
4998                                    8.0 or earlier (Short: \p{In=8.0}) (Perl
4999                                    extension) (260_319: U+0000..0377,
5000                                    U+037A..037F, U+0384..038A, U+038C,
5001                                    U+038E..03A1, U+03A3..052F ...)
5002          \p{Present_In: V8_0}    \p{Present_In=8.0} (Perl extension)
5003                                    (260_319)
5004        T \p{Present_In: 9.0}     Code point's usage introduced in version
5005                                    9.0 or earlier (Short: \p{In=9.0}) (Perl
5006                                    extension) (267_819: U+0000..0377,
5007                                    U+037A..037F, U+0384..038A, U+038C,
5008                                    U+038E..03A1, U+03A3..052F ...)
5009          \p{Present_In: V9_0}    \p{Present_In=9.0} (Perl extension)
5010                                    (267_819)
5011        T \p{Present_In: 10.0}    Code point's usage introduced in version
5012                                    10.0 or earlier (Short: \p{In=10.0})
5013                                    (Perl extension) (276_337: U+0000..0377,
5014                                    U+037A..037F, U+0384..038A, U+038C,
5015                                    U+038E..03A1, U+03A3..052F ...)
5016          \p{Present_In: V10_0}   \p{Present_In=10.0} (Perl extension)
5017                                    (276_337)
5018        T \p{Present_In: 11.0}    Code point's usage introduced in version
5019                                    11.0 or earlier (Short: \p{In=11.0})
5020                                    (Perl extension) (277_021: U+0000..0377,
5021                                    U+037A..037F, U+0384..038A, U+038C,
5022                                    U+038E..03A1, U+03A3..052F ...)
5023          \p{Present_In: V11_0}   \p{Present_In=11.0} (Perl extension)
5024                                    (277_021)
5025        T \p{Present_In: 12.0}    Code point's usage introduced in version
5026                                    12.0 or earlier (Short: \p{In=12.0})
5027                                    (Perl extension) (277_575: U+0000..0377,
5028                                    U+037A..037F, U+0384..038A, U+038C,
5029                                    U+038E..03A1, U+03A3..052F ...)
5030          \p{Present_In: V12_0}   \p{Present_In=12.0} (Perl extension)
5031                                    (277_575)
5032        T \p{Present_In: 12.1}    Code point's usage introduced in version
5033                                    12.1 or earlier (Short: \p{In=12.1})
5034                                    (Perl extension) (277_576: U+0000..0377,
5035                                    U+037A..037F, U+0384..038A, U+038C,
5036                                    U+038E..03A1, U+03A3..052F ...)
5037          \p{Present_In: V12_1}   \p{Present_In=12.1} (Perl extension)
5038                                    (277_576)
5039        T \p{Present_In: 13.0}    Code point's usage introduced in version
5040                                    13.0 or earlier (Short: \p{In=13.0})
5041                                    (Perl extension) (283_506: U+0000..0377,
5042                                    U+037A..037F, U+0384..038A, U+038C,
5043                                    U+038E..03A1, U+03A3..052F ...)
5044          \p{Present_In: V13_0}   \p{Present_In=13.0} (Perl extension)
5045                                    (283_506)
5046        T \p{Present_In: 14.0}    Code point's usage introduced in version
5047                                    14.0 or earlier (Short: \p{In=14.0})
5048                                    (Perl extension) (284_344: U+0000..0377,
5049                                    U+037A..037F, U+0384..038A, U+038C,
5050                                    U+038E..03A1, U+03A3..052F ...)
5051          \p{Present_In: V14_0}   \p{Present_In=14.0} (Perl extension)
5052                                    (284_344)
5053        T \p{Present_In: 15.0}    Code point's usage introduced in version
5054                                    15.0 or earlier (Short: \p{In=15.0})
5055                                    (Perl extension) (288_833: U+0000..0377,
5056                                    U+037A..037F, U+0384..038A, U+038C,
5057                                    U+038E..03A1, U+03A3..052F ...)
5058          \p{Present_In: V15_0}   \p{Present_In=15.0} (Perl extension)
5059                                    (288_833)
5060          \p{Present_In: NA}      \p{Present_In=Unassigned} (= \p{Age=
5061                                    Unassigned}) (Perl extension) (825_279
5062                                    plus all above-Unicode code points)
5063          \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=NA})
5064                                    (Perl extension) (825_279 plus all
5065                                    above-Unicode code points)
5066          \p{Print}               \p{XPosixPrint} (286_652)
5067          \p{Private_Use}         \p{General_Category=Private_Use} (Short:
5068                                    \p{Co}; NOT \p{Private_Use_Area})
5069                                    (137_468)
5070        X \p{Private_Use_Area}    \p{Block=Private_Use_Area} (Short:
5071                                    \p{InPUA}) (6400)
5072          \p{Prti}                \p{Inscriptional_Parthian} (=
5073                                    \p{Script_Extensions=
5074                                    Inscriptional_Parthian}) (NOT \p{Block=
5075                                    Inscriptional_Parthian}) (30)
5076          \p{Ps}                  \p{Open_Punctuation} (=
5077                                    \p{General_Category=Open_Punctuation})
5078                                    (79)
5079          \p{Psalter_Pahlavi}     \p{Script_Extensions=Psalter_Pahlavi}
5080                                    (Short: \p{Phlp}; NOT \p{Block=
5081                                    Psalter_Pahlavi}) (30)
5082        X \p{PUA}                 \p{Private_Use_Area} (= \p{Block=
5083                                    Private_Use_Area}) (6400)
5084          \p{Punct}               \p{General_Category=Punctuation} (Short:
5085                                    \p{P}; NOT \p{General_Punctuation}) (842)
5086          \p{Punctuation}         \p{Punct} (= \p{General_Category=
5087                                    Punctuation}) (NOT
5088                                    \p{General_Punctuation}) (842)
5089          \p{Qaac}                \p{Coptic} (= \p{Script_Extensions=
5090                                    Coptic}) (NOT \p{Block=Coptic}) (165)
5091          \p{Qaai}                \p{Inherited} (= \p{Script_Extensions=
5092                                    Inherited}) (586)
5093          \p{QMark}               \p{Quotation_Mark} (= \p{Quotation_Mark=
5094                                    Y}) (30)
5095          \p{QMark: *}            \p{Quotation_Mark: *}
5096          \p{Quotation_Mark}      \p{Quotation_Mark=Y} (Short: \p{QMark})
5097                                    (30)
5098          \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_082
5099                                    plus all above-Unicode code points:
5100                                    [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=>?
5101                                    \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xaa\xac-
5102                                    \xba\xbc-\xff], U+0100..2017,
5103                                    U+2020..2038, U+203B..2E41,
5104                                    U+2E43..300B, U+3010..301C ...)
5105          \p{Quotation_Mark: Y*}  (Short: \p{QMark=Y}, \p{QMark}) (30: [\"
5106                                    \'\xab\xbb], U+2018..201F, U+2039..203A,
5107                                    U+2E42, U+300C..300F, U+301D..301F ...)
5108          \p{Radical}             \p{Radical=Y} (329)
5109          \p{Radical: N*}         (Single: \P{Radical}) (1_113_783 plus all
5110                                    above-Unicode code points: U+0000..2E7F,
5111                                    U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
5112          \p{Radical: Y*}         (Single: \p{Radical}) (329: U+2E80..2E99,
5113                                    U+2E9B..2EF3, U+2F00..2FD5)
5114          \p{Regional_Indicator}  \p{Regional_Indicator=Y} (Short: \p{RI})
5115                                    (26)
5116          \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
5117                                    plus all above-Unicode code points:
5118                                    U+0000..1F1E5, U+1F200..infinity)
5119          \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
5120                                    U+1F1E6..1F1FF)
5121          \p{Rejang}              \p{Script_Extensions=Rejang} (Short:
5122                                    \p{Rjng}; NOT \p{Block=Rejang}) (37)
5123          \p{RI}                  \p{Regional_Indicator} (=
5124                                    \p{Regional_Indicator=Y}) (26)
5125          \p{RI: *}               \p{Regional_Indicator: *}
5126          \p{Rjng}                \p{Rejang} (= \p{Script_Extensions=
5127                                    Rejang}) (NOT \p{Block=Rejang}) (37)
5128          \p{Rohg}                \p{Hanifi_Rohingya} (=
5129                                    \p{Script_Extensions=Hanifi_Rohingya})
5130                                    (NOT \p{Block=Hanifi_Rohingya}) (55)
5131        X \p{Rumi}                \p{Rumi_Numeral_Symbols} (= \p{Block=
5132                                    Rumi_Numeral_Symbols}) (32)
5133        X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
5134                                    \p{InRumi}) (32)
5135          \p{Runic}               \p{Script_Extensions=Runic} (Short:
5136                                    \p{Runr}; NOT \p{Block=Runic}) (86)
5137          \p{Runr}                \p{Runic} (= \p{Script_Extensions=Runic})
5138                                    (NOT \p{Block=Runic}) (86)
5139          \p{S} \pS               \p{Symbol} (= \p{General_Category=Symbol})
5140                                    (7770)
5141          \p{Samaritan}           \p{Script_Extensions=Samaritan} (Short:
5142                                    \p{Samr}; NOT \p{Block=Samaritan}) (61)
5143          \p{Samr}                \p{Samaritan} (= \p{Script_Extensions=
5144                                    Samaritan}) (NOT \p{Block=Samaritan})
5145                                    (61)
5146          \p{Sarb}                \p{Old_South_Arabian} (=
5147                                    \p{Script_Extensions=Old_South_Arabian})
5148                                    (32)
5149          \p{Saur}                \p{Saurashtra} (= \p{Script_Extensions=
5150                                    Saurashtra}) (NOT \p{Block=Saurashtra})
5151                                    (82)
5152          \p{Saurashtra}          \p{Script_Extensions=Saurashtra} (Short:
5153                                    \p{Saur}; NOT \p{Block=Saurashtra}) (82)
5154          \p{SB: *}               \p{Sentence_Break: *}
5155          \p{Sc}                  \p{Currency_Symbol} (=
5156                                    \p{General_Category=Currency_Symbol})
5157                                    (63)
5158          \p{Sc: *}               \p{Script: *}
5159          \p{Script: Adlam}       (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B,
5160                                    U+1E950..1E959, U+1E95E..1E95F)
5161          \p{Script: Adlm}        \p{Script=Adlam} (88)
5162          \p{Script: Aghb}        \p{Script=Caucasian_Albanian} (=
5163                                    \p{Script_Extensions=
5164                                    Caucasian_Albanian}) (53)
5165          \p{Script: Ahom}        \p{Script_Extensions=Ahom} (Short: \p{Sc=
5166                                    Ahom}, \p{Ahom}) (65)
5167          \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
5168                                    Anatolian_Hieroglyphs} (Short: \p{Sc=
5169                                    Hluw}, \p{Hluw}) (583)
5170          \p{Script: Arab}        \p{Script=Arabic} (1368)
5171          \p{Script: Arabic}      (Short: \p{Sc=Arab}) (1368: U+0600..0604,
5172                                    U+0606..060B, U+060D..061A,
5173                                    U+061C..061E, U+0620..063F, U+0641..064A
5174                                    ...)
5175          \p{Script: Armenian}    \p{Script_Extensions=Armenian} (Short:
5176                                    \p{Sc=Armn}, \p{Armn}) (96)
5177          \p{Script: Armi}        \p{Script=Imperial_Aramaic} (=
5178                                    \p{Script_Extensions=Imperial_Aramaic})
5179                                    (31)
5180          \p{Script: Armn}        \p{Script=Armenian} (=
5181                                    \p{Script_Extensions=Armenian}) (96)
5182          \p{Script: Avestan}     \p{Script_Extensions=Avestan} (Short:
5183                                    \p{Sc=Avst}, \p{Avst}) (61)
5184          \p{Script: Avst}        \p{Script=Avestan} (=
5185                                    \p{Script_Extensions=Avestan}) (61)
5186          \p{Script: Bali}        \p{Script=Balinese} (=
5187                                    \p{Script_Extensions=Balinese}) (124)
5188          \p{Script: Balinese}    \p{Script_Extensions=Balinese} (Short:
5189                                    \p{Sc=Bali}, \p{Bali}) (124)
5190          \p{Script: Bamu}        \p{Script=Bamum} (= \p{Script_Extensions=
5191                                    Bamum}) (657)
5192          \p{Script: Bamum}       \p{Script_Extensions=Bamum} (Short: \p{Sc=
5193                                    Bamu}, \p{Bamu}) (657)
5194          \p{Script: Bass}        \p{Script=Bassa_Vah} (=
5195                                    \p{Script_Extensions=Bassa_Vah}) (36)
5196          \p{Script: Bassa_Vah}   \p{Script_Extensions=Bassa_Vah} (Short:
5197                                    \p{Sc=Bass}, \p{Bass}) (36)
5198          \p{Script: Batak}       \p{Script_Extensions=Batak} (Short: \p{Sc=
5199                                    Batk}, \p{Batk}) (56)
5200          \p{Script: Batk}        \p{Script=Batak} (= \p{Script_Extensions=
5201                                    Batak}) (56)
5202          \p{Script: Beng}        \p{Script=Bengali} (96)
5203          \p{Script: Bengali}     (Short: \p{Sc=Beng}) (96: U+0980..0983,
5204                                    U+0985..098C, U+098F..0990,
5205                                    U+0993..09A8, U+09AA..09B0, U+09B2 ...)
5206          \p{Script: Bhaiksuki}   \p{Script_Extensions=Bhaiksuki} (Short:
5207                                    \p{Sc=Bhks}, \p{Bhks}) (97)
5208          \p{Script: Bhks}        \p{Script=Bhaiksuki} (=
5209                                    \p{Script_Extensions=Bhaiksuki}) (97)
5210          \p{Script: Bopo}        \p{Script=Bopomofo} (77)
5211          \p{Script: Bopomofo}    (Short: \p{Sc=Bopo}) (77: U+02EA..02EB,
5212                                    U+3105..312F, U+31A0..31BF)
5213          \p{Script: Brah}        \p{Script=Brahmi} (= \p{Script_Extensions=
5214                                    Brahmi}) (115)
5215          \p{Script: Brahmi}      \p{Script_Extensions=Brahmi} (Short:
5216                                    \p{Sc=Brah}, \p{Brah}) (115)
5217          \p{Script: Brai}        \p{Script=Braille} (=
5218                                    \p{Script_Extensions=Braille}) (256)
5219          \p{Script: Braille}     \p{Script_Extensions=Braille} (Short:
5220                                    \p{Sc=Brai}, \p{Brai}) (256)
5221          \p{Script: Bugi}        \p{Script=Buginese} (30)
5222          \p{Script: Buginese}    (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
5223                                    U+1A1E..1A1F)
5224          \p{Script: Buhd}        \p{Script=Buhid} (20)
5225          \p{Script: Buhid}       (Short: \p{Sc=Buhd}) (20: U+1740..1753)
5226          \p{Script: Cakm}        \p{Script=Chakma} (71)
5227          \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
5228                                    Canadian_Aboriginal} (Short: \p{Sc=
5229                                    Cans}, \p{Cans}) (726)
5230          \p{Script: Cans}        \p{Script=Canadian_Aboriginal} (=
5231                                    \p{Script_Extensions=
5232                                    Canadian_Aboriginal}) (726)
5233          \p{Script: Cari}        \p{Script=Carian} (= \p{Script_Extensions=
5234                                    Carian}) (49)
5235          \p{Script: Carian}      \p{Script_Extensions=Carian} (Short:
5236                                    \p{Sc=Cari}, \p{Cari}) (49)
5237          \p{Script: Caucasian_Albanian} \p{Script_Extensions=
5238                                    Caucasian_Albanian} (Short: \p{Sc=Aghb},
5239                                    \p{Aghb}) (53)
5240          \p{Script: Chakma}      (Short: \p{Sc=Cakm}) (71: U+11100..11134,
5241                                    U+11136..11147)
5242          \p{Script: Cham}        \p{Script_Extensions=Cham} (Short: \p{Sc=
5243                                    Cham}, \p{Cham}) (83)
5244          \p{Script: Cher}        \p{Script=Cherokee} (=
5245                                    \p{Script_Extensions=Cherokee}) (172)
5246          \p{Script: Cherokee}    \p{Script_Extensions=Cherokee} (Short:
5247                                    \p{Sc=Cher}, \p{Cher}) (172)
5248          \p{Script: Chorasmian}  \p{Script_Extensions=Chorasmian} (Short:
5249                                    \p{Sc=Chrs}, \p{Chrs}) (28)
5250          \p{Script: Chrs}        \p{Script=Chorasmian} (=
5251                                    \p{Script_Extensions=Chorasmian}) (28)
5252          \p{Script: Common}      (Short: \p{Sc=Zyyy}) (8301: [\x00-\x20!
5253                                    \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
5254                                    \^_`\{\|\}~\x7f-\xa9\xab-\xb9\xbb-\xbf
5255                                    \xd7\xf7], U+02B9..02DF, U+02E5..02E9,
5256                                    U+02EC..02FF, U+0374, U+037E ...)
5257          \p{Script: Copt}        \p{Script=Coptic} (137)
5258          \p{Script: Coptic}      (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
5259                                    U+2C80..2CF3, U+2CF9..2CFF)
5260          \p{Script: Cpmn}        \p{Script=Cypro_Minoan} (99)
5261          \p{Script: Cprt}        \p{Script=Cypriot} (55)
5262          \p{Script: Cuneiform}   \p{Script_Extensions=Cuneiform} (Short:
5263                                    \p{Sc=Xsux}, \p{Xsux}) (1234)
5264          \p{Script: Cypriot}     (Short: \p{Sc=Cprt}) (55: U+10800..10805,
5265                                    U+10808, U+1080A..10835, U+10837..10838,
5266                                    U+1083C, U+1083F)
5267          \p{Script: Cypro_Minoan} (Short: \p{Sc=Cpmn}) (99: U+12F90..12FF2)
5268          \p{Script: Cyrillic}    (Short: \p{Sc=Cyrl}) (506: U+0400..0484,
5269                                    U+0487..052F, U+1C80..1C88, U+1D2B,
5270                                    U+1D78, U+2DE0..2DFF ...)
5271          \p{Script: Cyrl}        \p{Script=Cyrillic} (506)
5272          \p{Script: Deseret}     \p{Script_Extensions=Deseret} (Short:
5273                                    \p{Sc=Dsrt}, \p{Dsrt}) (80)
5274          \p{Script: Deva}        \p{Script=Devanagari} (164)
5275          \p{Script: Devanagari}  (Short: \p{Sc=Deva}) (164: U+0900..0950,
5276                                    U+0955..0963, U+0966..097F,
5277                                    U+A8E0..A8FF, U+11B00..11B09)
5278          \p{Script: Diak}        \p{Script=Dives_Akuru} (=
5279                                    \p{Script_Extensions=Dives_Akuru}) (72)
5280          \p{Script: Dives_Akuru} \p{Script_Extensions=Dives_Akuru} (Short:
5281                                    \p{Sc=Diak}, \p{Diak}) (72)
5282          \p{Script: Dogr}        \p{Script=Dogra} (60)
5283          \p{Script: Dogra}       (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
5284          \p{Script: Dsrt}        \p{Script=Deseret} (=
5285                                    \p{Script_Extensions=Deseret}) (80)
5286          \p{Script: Dupl}        \p{Script=Duployan} (143)
5287          \p{Script: Duployan}    (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
5288                                    U+1BC70..1BC7C, U+1BC80..1BC88,
5289                                    U+1BC90..1BC99, U+1BC9C..1BC9F)
5290          \p{Script: Egyp}        \p{Script=Egyptian_Hieroglyphs} (=
5291                                    \p{Script_Extensions=
5292                                    Egyptian_Hieroglyphs}) (1110)
5293          \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
5294                                    Egyptian_Hieroglyphs} (Short: \p{Sc=
5295                                    Egyp}, \p{Egyp}) (1110)
5296          \p{Script: Elba}        \p{Script=Elbasan} (=
5297                                    \p{Script_Extensions=Elbasan}) (40)
5298          \p{Script: Elbasan}     \p{Script_Extensions=Elbasan} (Short:
5299                                    \p{Sc=Elba}, \p{Elba}) (40)
5300          \p{Script: Elym}        \p{Script=Elymaic} (=
5301                                    \p{Script_Extensions=Elymaic}) (23)
5302          \p{Script: Elymaic}     \p{Script_Extensions=Elymaic} (Short:
5303                                    \p{Sc=Elym}, \p{Elym}) (23)
5304          \p{Script: Ethi}        \p{Script=Ethiopic} (=
5305                                    \p{Script_Extensions=Ethiopic}) (523)
5306          \p{Script: Ethiopic}    \p{Script_Extensions=Ethiopic} (Short:
5307                                    \p{Sc=Ethi}, \p{Ethi}) (523)
5308          \p{Script: Geor}        \p{Script=Georgian} (173)
5309          \p{Script: Georgian}    (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
5310                                    U+10C7, U+10CD, U+10D0..10FA,
5311                                    U+10FC..10FF, U+1C90..1CBA ...)
5312          \p{Script: Glag}        \p{Script=Glagolitic} (134)
5313          \p{Script: Glagolitic}  (Short: \p{Sc=Glag}) (134: U+2C00..2C5F,
5314                                    U+1E000..1E006, U+1E008..1E018,
5315                                    U+1E01B..1E021, U+1E023..1E024,
5316                                    U+1E026..1E02A)
5317          \p{Script: Gong}        \p{Script=Gunjala_Gondi} (63)
5318          \p{Script: Gonm}        \p{Script=Masaram_Gondi} (75)
5319          \p{Script: Goth}        \p{Script=Gothic} (= \p{Script_Extensions=
5320                                    Gothic}) (27)
5321          \p{Script: Gothic}      \p{Script_Extensions=Gothic} (Short:
5322                                    \p{Sc=Goth}, \p{Goth}) (27)
5323          \p{Script: Gran}        \p{Script=Grantha} (85)
5324          \p{Script: Grantha}     (Short: \p{Sc=Gran}) (85: U+11300..11303,
5325                                    U+11305..1130C, U+1130F..11310,
5326                                    U+11313..11328, U+1132A..11330,
5327                                    U+11332..11333 ...)
5328          \p{Script: Greek}       (Short: \p{Sc=Grek}) (518: U+0370..0373,
5329                                    U+0375..0377, U+037A..037D, U+037F,
5330                                    U+0384, U+0386 ...)
5331          \p{Script: Grek}        \p{Script=Greek} (518)
5332          \p{Script: Gujarati}    (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
5333                                    U+0A85..0A8D, U+0A8F..0A91,
5334                                    U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
5335                                    ...)
5336          \p{Script: Gujr}        \p{Script=Gujarati} (91)
5337          \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
5338                                    U+11D60..11D65, U+11D67..11D68,
5339                                    U+11D6A..11D8E, U+11D90..11D91,
5340                                    U+11D93..11D98, U+11DA0..11DA9)
5341          \p{Script: Gurmukhi}    (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
5342                                    U+0A05..0A0A, U+0A0F..0A10,
5343                                    U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
5344                                    ...)
5345          \p{Script: Guru}        \p{Script=Gurmukhi} (80)
5346          \p{Script: Han}         (Short: \p{Sc=Han}) (98_408: U+2E80..2E99,
5347                                    U+2E9B..2EF3, U+2F00..2FD5, U+3005,
5348                                    U+3007, U+3021..3029 ...)
5349          \p{Script: Hang}        \p{Script=Hangul} (11_739)
5350          \p{Script: Hangul}      (Short: \p{Sc=Hang}) (11_739:
5351                                    U+1100..11FF, U+302E..302F,
5352                                    U+3131..318E, U+3200..321E,
5353                                    U+3260..327E, U+A960..A97C ...)
5354          \p{Script: Hani}        \p{Script=Han} (98_408)
5355          \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
5356                                    U+10D00..10D27, U+10D30..10D39)
5357          \p{Script: Hano}        \p{Script=Hanunoo} (21)
5358          \p{Script: Hanunoo}     (Short: \p{Sc=Hano}) (21: U+1720..1734)
5359          \p{Script: Hatr}        \p{Script=Hatran} (= \p{Script_Extensions=
5360                                    Hatran}) (26)
5361          \p{Script: Hatran}      \p{Script_Extensions=Hatran} (Short:
5362                                    \p{Sc=Hatr}, \p{Hatr}) (26)
5363          \p{Script: Hebr}        \p{Script=Hebrew} (= \p{Script_Extensions=
5364                                    Hebrew}) (134)
5365          \p{Script: Hebrew}      \p{Script_Extensions=Hebrew} (Short:
5366                                    \p{Sc=Hebr}, \p{Hebr}) (134)
5367          \p{Script: Hira}        \p{Script=Hiragana} (381)
5368          \p{Script: Hiragana}    (Short: \p{Sc=Hira}) (381: U+3041..3096,
5369                                    U+309D..309F, U+1B001..1B11F, U+1B132,
5370                                    U+1B150..1B152, U+1F200)
5371          \p{Script: Hluw}        \p{Script=Anatolian_Hieroglyphs} (=
5372                                    \p{Script_Extensions=
5373                                    Anatolian_Hieroglyphs}) (583)
5374          \p{Script: Hmng}        \p{Script=Pahawh_Hmong} (=
5375                                    \p{Script_Extensions=Pahawh_Hmong}) (127)
5376          \p{Script: Hmnp}        \p{Script=Nyiakeng_Puachue_Hmong} (=
5377                                    \p{Script_Extensions=
5378                                    Nyiakeng_Puachue_Hmong}) (71)
5379          \p{Script: Hung}        \p{Script=Old_Hungarian} (=
5380                                    \p{Script_Extensions=Old_Hungarian})
5381                                    (108)
5382          \p{Script: Imperial_Aramaic} \p{Script_Extensions=
5383                                    Imperial_Aramaic} (Short: \p{Sc=Armi},
5384                                    \p{Armi}) (31)
5385          \p{Script: Inherited}   (Short: \p{Sc=Zinh}) (657: U+0300..036F,
5386                                    U+0485..0486, U+064B..0655, U+0670,
5387                                    U+0951..0954, U+1AB0..1ACE ...)
5388          \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
5389                                    Inscriptional_Pahlavi} (Short: \p{Sc=
5390                                    Phli}, \p{Phli}) (27)
5391          \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
5392                                    Inscriptional_Parthian} (Short: \p{Sc=
5393                                    Prti}, \p{Prti}) (30)
5394          \p{Script: Ital}        \p{Script=Old_Italic} (=
5395                                    \p{Script_Extensions=Old_Italic}) (39)
5396          \p{Script: Java}        \p{Script=Javanese} (90)
5397          \p{Script: Javanese}    (Short: \p{Sc=Java}) (90: U+A980..A9CD,
5398                                    U+A9D0..A9D9, U+A9DE..A9DF)
5399          \p{Script: Kaithi}      (Short: \p{Sc=Kthi}) (68: U+11080..110C2,
5400                                    U+110CD)
5401          \p{Script: Kali}        \p{Script=Kayah_Li} (47)
5402          \p{Script: Kana}        \p{Script=Katakana} (321)
5403          \p{Script: Kannada}     (Short: \p{Sc=Knda}) (91: U+0C80..0C8C,
5404                                    U+0C8E..0C90, U+0C92..0CA8,
5405                                    U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
5406                                    ...)
5407          \p{Script: Katakana}    (Short: \p{Sc=Kana}) (321: U+30A1..30FA,
5408                                    U+30FD..30FF, U+31F0..31FF,
5409                                    U+32D0..32FE, U+3300..3357, U+FF66..FF6F
5410                                    ...)
5411          \p{Script: Kawi}        \p{Script_Extensions=Kawi} (Short: \p{Sc=
5412                                    Kawi}, \p{Kawi}) (86)
5413          \p{Script: Kayah_Li}    (Short: \p{Sc=Kali}) (47: U+A900..A92D,
5414                                    U+A92F)
5415          \p{Script: Khar}        \p{Script=Kharoshthi} (=
5416                                    \p{Script_Extensions=Kharoshthi}) (68)
5417          \p{Script: Kharoshthi}  \p{Script_Extensions=Kharoshthi} (Short:
5418                                    \p{Sc=Khar}, \p{Khar}) (68)
5419          \p{Script: Khitan_Small_Script} \p{Script_Extensions=
5420                                    Khitan_Small_Script} (Short: \p{Sc=
5421                                    Kits}, \p{Kits}) (471)
5422          \p{Script: Khmer}       \p{Script_Extensions=Khmer} (Short: \p{Sc=
5423                                    Khmr}, \p{Khmr}) (146)
5424          \p{Script: Khmr}        \p{Script=Khmer} (= \p{Script_Extensions=
5425                                    Khmer}) (146)
5426          \p{Script: Khoj}        \p{Script=Khojki} (65)
5427          \p{Script: Khojki}      (Short: \p{Sc=Khoj}) (65: U+11200..11211,
5428                                    U+11213..11241)
5429          \p{Script: Khudawadi}   (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
5430                                    U+112F0..112F9)
5431          \p{Script: Kits}        \p{Script=Khitan_Small_Script} (=
5432                                    \p{Script_Extensions=
5433                                    Khitan_Small_Script}) (471)
5434          \p{Script: Knda}        \p{Script=Kannada} (91)
5435          \p{Script: Kthi}        \p{Script=Kaithi} (68)
5436          \p{Script: Lana}        \p{Script=Tai_Tham} (=
5437                                    \p{Script_Extensions=Tai_Tham}) (127)
5438          \p{Script: Lao}         \p{Script_Extensions=Lao} (Short: \p{Sc=
5439                                    Lao}, \p{Lao}) (83)
5440          \p{Script: Laoo}        \p{Script=Lao} (= \p{Script_Extensions=
5441                                    Lao}) (83)
5442          \p{Script: Latin}       (Short: \p{Sc=Latn}) (1481: [A-Za-z\xaa
5443                                    \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
5444                                    U+0100..02B8, U+02E0..02E4,
5445                                    U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
5446                                    ...)
5447          \p{Script: Latn}        \p{Script=Latin} (1481)
5448          \p{Script: Lepc}        \p{Script=Lepcha} (= \p{Script_Extensions=
5449                                    Lepcha}) (74)
5450          \p{Script: Lepcha}      \p{Script_Extensions=Lepcha} (Short:
5451                                    \p{Sc=Lepc}, \p{Lepc}) (74)
5452          \p{Script: Limb}        \p{Script=Limbu} (68)
5453          \p{Script: Limbu}       (Short: \p{Sc=Limb}) (68: U+1900..191E,
5454                                    U+1920..192B, U+1930..193B, U+1940,
5455                                    U+1944..194F)
5456          \p{Script: Lina}        \p{Script=Linear_A} (341)
5457          \p{Script: Linb}        \p{Script=Linear_B} (211)
5458          \p{Script: Linear_A}    (Short: \p{Sc=Lina}) (341: U+10600..10736,
5459                                    U+10740..10755, U+10760..10767)
5460          \p{Script: Linear_B}    (Short: \p{Sc=Linb}) (211: U+10000..1000B,
5461                                    U+1000D..10026, U+10028..1003A,
5462                                    U+1003C..1003D, U+1003F..1004D,
5463                                    U+10050..1005D ...)
5464          \p{Script: Lisu}        \p{Script_Extensions=Lisu} (Short: \p{Sc=
5465                                    Lisu}, \p{Lisu}) (49)
5466          \p{Script: Lyci}        \p{Script=Lycian} (= \p{Script_Extensions=
5467                                    Lycian}) (29)
5468          \p{Script: Lycian}      \p{Script_Extensions=Lycian} (Short:
5469                                    \p{Sc=Lyci}, \p{Lyci}) (29)
5470          \p{Script: Lydi}        \p{Script=Lydian} (= \p{Script_Extensions=
5471                                    Lydian}) (27)
5472          \p{Script: Lydian}      \p{Script_Extensions=Lydian} (Short:
5473                                    \p{Sc=Lydi}, \p{Lydi}) (27)
5474          \p{Script: Mahajani}    (Short: \p{Sc=Mahj}) (39: U+11150..11176)
5475          \p{Script: Mahj}        \p{Script=Mahajani} (39)
5476          \p{Script: Maka}        \p{Script=Makasar} (=
5477                                    \p{Script_Extensions=Makasar}) (25)
5478          \p{Script: Makasar}     \p{Script_Extensions=Makasar} (Short:
5479                                    \p{Sc=Maka}, \p{Maka}) (25)
5480          \p{Script: Malayalam}   (Short: \p{Sc=Mlym}) (118: U+0D00..0D0C,
5481                                    U+0D0E..0D10, U+0D12..0D44,
5482                                    U+0D46..0D48, U+0D4A..0D4F, U+0D54..0D63
5483                                    ...)
5484          \p{Script: Mand}        \p{Script=Mandaic} (29)
5485          \p{Script: Mandaic}     (Short: \p{Sc=Mand}) (29: U+0840..085B,
5486                                    U+085E)
5487          \p{Script: Mani}        \p{Script=Manichaean} (51)
5488          \p{Script: Manichaean}  (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
5489                                    U+10AEB..10AF6)
5490          \p{Script: Marc}        \p{Script=Marchen} (=
5491                                    \p{Script_Extensions=Marchen}) (68)
5492          \p{Script: Marchen}     \p{Script_Extensions=Marchen} (Short:
5493                                    \p{Sc=Marc}, \p{Marc}) (68)
5494          \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
5495                                    U+11D00..11D06, U+11D08..11D09,
5496                                    U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
5497                                    U+11D3F..11D47 ...)
5498          \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
5499                                    \p{Sc=Medf}, \p{Medf}) (91)
5500          \p{Script: Medf}        \p{Script=Medefaidrin} (=
5501                                    \p{Script_Extensions=Medefaidrin}) (91)
5502          \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
5503                                    (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
5504          \p{Script: Mend}        \p{Script=Mende_Kikakui} (=
5505                                    \p{Script_Extensions=Mende_Kikakui})
5506                                    (213)
5507          \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
5508                                    (Short: \p{Sc=Mend}, \p{Mend}) (213)
5509          \p{Script: Merc}        \p{Script=Meroitic_Cursive} (=
5510                                    \p{Script_Extensions=Meroitic_Cursive})
5511                                    (90)
5512          \p{Script: Mero}        \p{Script=Meroitic_Hieroglyphs} (=
5513                                    \p{Script_Extensions=
5514                                    Meroitic_Hieroglyphs}) (32)
          \p{Script: Meroitic_Cursive} \p{Script_Extensions=
                                    Meroitic_Cursive} (Short: \p{Sc=Merc},
                                    \p{Merc}) (90)
          \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
                                    Meroitic_Hieroglyphs} (Short: \p{Sc=
                                    Mero}, \p{Mero}) (32)
          \p{Script: Miao}        \p{Script_Extensions=Miao} (Short: \p{Sc=
                                    Miao}, \p{Miao}) (149)
          \p{Script: Mlym}        \p{Script=Malayalam} (118)
          \p{Script: Modi}        (Short: \p{Sc=Modi}) (79: U+11600..11644,
                                    U+11650..11659)
          \p{Script: Mong}        \p{Script=Mongolian} (168)
          \p{Script: Mongolian}   (Short: \p{Sc=Mong}) (168: U+1800..1801,
                                    U+1804, U+1806..1819, U+1820..1878,
                                    U+1880..18AA, U+11660..1166C)
          \p{Script: Mro}         \p{Script_Extensions=Mro} (Short: \p{Sc=
                                    Mro}, \p{Mro}) (43)
          \p{Script: Mroo}        \p{Script=Mro} (= \p{Script_Extensions=
                                    Mro}) (43)
          \p{Script: Mtei}        \p{Script=Meetei_Mayek} (=
                                    \p{Script_Extensions=Meetei_Mayek}) (79)
          \p{Script: Mult}        \p{Script=Multani} (38)
          \p{Script: Multani}     (Short: \p{Sc=Mult}) (38: U+11280..11286,
                                    U+11288, U+1128A..1128D, U+1128F..1129D,
                                    U+1129F..112A9)
          \p{Script: Myanmar}     (Short: \p{Sc=Mymr}) (223: U+1000..109F,
                                    U+A9E0..A9FE, U+AA60..AA7F)
          \p{Script: Mymr}        \p{Script=Myanmar} (223)
          \p{Script: Nabataean}   \p{Script_Extensions=Nabataean} (Short:
                                    \p{Sc=Nbat}, \p{Nbat}) (40)
          \p{Script: Nag_Mundari} \p{Script_Extensions=Nag_Mundari} (Short:
                                    \p{Sc=Nagm}, \p{Nagm}) (42)
          \p{Script: Nagm}        \p{Script=Nag_Mundari} (=
                                    \p{Script_Extensions=Nag_Mundari}) (42)
          \p{Script: Nand}        \p{Script=Nandinagari} (65)
          \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7,
                                    U+119AA..119D7, U+119DA..119E4)
          \p{Script: Narb}        \p{Script=Old_North_Arabian} (=
                                    \p{Script_Extensions=Old_North_Arabian})
                                    (32)
          \p{Script: Nbat}        \p{Script=Nabataean} (=
                                    \p{Script_Extensions=Nabataean}) (40)
          \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
                                    \p{Sc=Talu}, \p{Talu}) (83)
          \p{Script: Newa}        \p{Script_Extensions=Newa} (Short: \p{Sc=
                                    Newa}, \p{Newa}) (97)
          \p{Script: Nko}         (Short: \p{Sc=Nko}) (62: U+07C0..07FA,
                                    U+07FD..07FF)
          \p{Script: Nkoo}        \p{Script=Nko} (62)
          \p{Script: Nshu}        \p{Script=Nushu} (= \p{Script_Extensions=
                                    Nushu}) (397)
          \p{Script: Nushu}       \p{Script_Extensions=Nushu} (Short: \p{Sc=
                                    Nshu}, \p{Nshu}) (397)
          \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
                                    Nyiakeng_Puachue_Hmong} (Short: \p{Sc=
                                    Hmnp}, \p{Hmnp}) (71)
          \p{Script: Ogam}        \p{Script=Ogham} (= \p{Script_Extensions=
                                    Ogham}) (29)
          \p{Script: Ogham}       \p{Script_Extensions=Ogham} (Short: \p{Sc=
                                    Ogam}, \p{Ogam}) (29)
          \p{Script: Ol_Chiki}    \p{Script_Extensions=Ol_Chiki} (Short:
                                    \p{Sc=Olck}, \p{Olck}) (48)
          \p{Script: Olck}        \p{Script=Ol_Chiki} (=
                                    \p{Script_Extensions=Ol_Chiki}) (48)
          \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
                                    (Short: \p{Sc=Hung}, \p{Hung}) (108)
          \p{Script: Old_Italic}  \p{Script_Extensions=Old_Italic} (Short:
                                    \p{Sc=Ital}, \p{Ital}) (39)
          \p{Script: Old_North_Arabian} \p{Script_Extensions=
                                    Old_North_Arabian} (Short: \p{Sc=Narb},
                                    \p{Narb}) (32)
          \p{Script: Old_Permic}  (Short: \p{Sc=Perm}) (43: U+10350..1037A)
          \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
                                    \p{Sc=Xpeo}, \p{Xpeo}) (50)
          \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
                                    \p{Sc=Sogo}, \p{Sogo}) (40)
          \p{Script: Old_South_Arabian} \p{Script_Extensions=
                                    Old_South_Arabian} (Short: \p{Sc=Sarb},
                                    \p{Sarb}) (32)
          \p{Script: Old_Turkic}  \p{Script_Extensions=Old_Turkic} (Short:
                                    \p{Sc=Orkh}, \p{Orkh}) (73)
          \p{Script: Old_Uyghur}  (Short: \p{Sc=Ougr}) (26: U+10F70..10F89)
          \p{Script: Oriya}       (Short: \p{Sc=Orya}) (91: U+0B01..0B03,
                                    U+0B05..0B0C, U+0B0F..0B10,
                                    U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
                                    ...)
          \p{Script: Orkh}        \p{Script=Old_Turkic} (=
                                    \p{Script_Extensions=Old_Turkic}) (73)
          \p{Script: Orya}        \p{Script=Oriya} (91)
          \p{Script: Osage}       \p{Script_Extensions=Osage} (Short: \p{Sc=
                                    Osge}, \p{Osge}) (72)
          \p{Script: Osge}        \p{Script=Osage} (= \p{Script_Extensions=
                                    Osage}) (72)
          \p{Script: Osma}        \p{Script=Osmanya} (=
                                    \p{Script_Extensions=Osmanya}) (40)
          \p{Script: Osmanya}     \p{Script_Extensions=Osmanya} (Short:
                                    \p{Sc=Osma}, \p{Osma}) (40)
          \p{Script: Ougr}        \p{Script=Old_Uyghur} (26)
          \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
                                    (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
          \p{Script: Palm}        \p{Script=Palmyrene} (=
                                    \p{Script_Extensions=Palmyrene}) (32)
          \p{Script: Palmyrene}   \p{Script_Extensions=Palmyrene} (Short:
                                    \p{Sc=Palm}, \p{Palm}) (32)
          \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
                                    \p{Sc=Pauc}, \p{Pauc}) (57)
          \p{Script: Pauc}        \p{Script=Pau_Cin_Hau} (=
                                    \p{Script_Extensions=Pau_Cin_Hau}) (57)
          \p{Script: Perm}        \p{Script=Old_Permic} (43)
          \p{Script: Phag}        \p{Script=Phags_Pa} (56)
          \p{Script: Phags_Pa}    (Short: \p{Sc=Phag}) (56: U+A840..A877)
          \p{Script: Phli}        \p{Script=Inscriptional_Pahlavi} (=
                                    \p{Script_Extensions=
                                    Inscriptional_Pahlavi}) (27)
          \p{Script: Phlp}        \p{Script=Psalter_Pahlavi} (29)
          \p{Script: Phnx}        \p{Script=Phoenician} (=
                                    \p{Script_Extensions=Phoenician}) (29)
          \p{Script: Phoenician}  \p{Script_Extensions=Phoenician} (Short:
                                    \p{Sc=Phnx}, \p{Phnx}) (29)
          \p{Script: Plrd}        \p{Script=Miao} (= \p{Script_Extensions=
                                    Miao}) (149)
          \p{Script: Prti}        \p{Script=Inscriptional_Parthian} (=
                                    \p{Script_Extensions=
                                    Inscriptional_Parthian}) (30)
          \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
                                    U+10B80..10B91, U+10B99..10B9C,
                                    U+10BA9..10BAF)
          \p{Script: Qaac}        \p{Script=Coptic} (137)
          \p{Script: Qaai}        \p{Script=Inherited} (657)
          \p{Script: Rejang}      \p{Script_Extensions=Rejang} (Short:
                                    \p{Sc=Rjng}, \p{Rjng}) (37)
          \p{Script: Rjng}        \p{Script=Rejang} (= \p{Script_Extensions=
                                    Rejang}) (37)
          \p{Script: Rohg}        \p{Script=Hanifi_Rohingya} (50)
          \p{Script: Runic}       \p{Script_Extensions=Runic} (Short: \p{Sc=
                                    Runr}, \p{Runr}) (86)
          \p{Script: Runr}        \p{Script=Runic} (= \p{Script_Extensions=
                                    Runic}) (86)
          \p{Script: Samaritan}   \p{Script_Extensions=Samaritan} (Short:
                                    \p{Sc=Samr}, \p{Samr}) (61)
          \p{Script: Samr}        \p{Script=Samaritan} (=
                                    \p{Script_Extensions=Samaritan}) (61)
          \p{Script: Sarb}        \p{Script=Old_South_Arabian} (=
                                    \p{Script_Extensions=Old_South_Arabian})
                                    (32)
          \p{Script: Saur}        \p{Script=Saurashtra} (=
                                    \p{Script_Extensions=Saurashtra}) (82)
          \p{Script: Saurashtra}  \p{Script_Extensions=Saurashtra} (Short:
                                    \p{Sc=Saur}, \p{Saur}) (82)
          \p{Script: Sgnw}        \p{Script=SignWriting} (=
                                    \p{Script_Extensions=SignWriting}) (672)
          \p{Script: Sharada}     (Short: \p{Sc=Shrd}) (96: U+11180..111DF)
          \p{Script: Shavian}     \p{Script_Extensions=Shavian} (Short:
                                    \p{Sc=Shaw}, \p{Shaw}) (48)
          \p{Script: Shaw}        \p{Script=Shavian} (=
                                    \p{Script_Extensions=Shavian}) (48)
          \p{Script: Shrd}        \p{Script=Sharada} (96)
          \p{Script: Sidd}        \p{Script=Siddham} (=
                                    \p{Script_Extensions=Siddham}) (92)
          \p{Script: Siddham}     \p{Script_Extensions=Siddham} (Short:
                                    \p{Sc=Sidd}, \p{Sidd}) (92)
          \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
                                    \p{Sc=Sgnw}, \p{Sgnw}) (672)
          \p{Script: Sind}        \p{Script=Khudawadi} (69)
          \p{Script: Sinh}        \p{Script=Sinhala} (111)
          \p{Script: Sinhala}     (Short: \p{Sc=Sinh}) (111: U+0D81..0D83,
                                    U+0D85..0D96, U+0D9A..0DB1,
                                    U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
          \p{Script: Sogd}        \p{Script=Sogdian} (42)
          \p{Script: Sogdian}     (Short: \p{Sc=Sogd}) (42: U+10F30..10F59)
          \p{Script: Sogo}        \p{Script=Old_Sogdian} (=
                                    \p{Script_Extensions=Old_Sogdian}) (40)
          \p{Script: Sora}        \p{Script=Sora_Sompeng} (=
                                    \p{Script_Extensions=Sora_Sompeng}) (35)
          \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
                                    (Short: \p{Sc=Sora}, \p{Sora}) (35)
          \p{Script: Soyo}        \p{Script=Soyombo} (=
                                    \p{Script_Extensions=Soyombo}) (83)
          \p{Script: Soyombo}     \p{Script_Extensions=Soyombo} (Short:
                                    \p{Sc=Soyo}, \p{Soyo}) (83)
          \p{Script: Sund}        \p{Script=Sundanese} (=
                                    \p{Script_Extensions=Sundanese}) (72)
          \p{Script: Sundanese}   \p{Script_Extensions=Sundanese} (Short:
                                    \p{Sc=Sund}, \p{Sund}) (72)
          \p{Script: Sylo}        \p{Script=Syloti_Nagri} (45)
          \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (45: U+A800..A82C)
          \p{Script: Syrc}        \p{Script=Syriac} (88)
          \p{Script: Syriac}      (Short: \p{Sc=Syrc}) (88: U+0700..070D,
                                    U+070F..074A, U+074D..074F, U+0860..086A)
          \p{Script: Tagalog}     (Short: \p{Sc=Tglg}) (23: U+1700..1715,
                                    U+171F)
          \p{Script: Tagb}        \p{Script=Tagbanwa} (18)
          \p{Script: Tagbanwa}    (Short: \p{Sc=Tagb}) (18: U+1760..176C,
                                    U+176E..1770, U+1772..1773)
          \p{Script: Tai_Le}      (Short: \p{Sc=Tale}) (35: U+1950..196D,
                                    U+1970..1974)
          \p{Script: Tai_Tham}    \p{Script_Extensions=Tai_Tham} (Short:
                                    \p{Sc=Lana}, \p{Lana}) (127)
          \p{Script: Tai_Viet}    \p{Script_Extensions=Tai_Viet} (Short:
                                    \p{Sc=Tavt}, \p{Tavt}) (72)
          \p{Script: Takr}        \p{Script=Takri} (68)
          \p{Script: Takri}       (Short: \p{Sc=Takr}) (68: U+11680..116B9,
                                    U+116C0..116C9)
          \p{Script: Tale}        \p{Script=Tai_Le} (35)
          \p{Script: Talu}        \p{Script=New_Tai_Lue} (=
                                    \p{Script_Extensions=New_Tai_Lue}) (83)
          \p{Script: Tamil}       (Short: \p{Sc=Taml}) (123: U+0B82..0B83,
                                    U+0B85..0B8A, U+0B8E..0B90,
                                    U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
          \p{Script: Taml}        \p{Script=Tamil} (123)
          \p{Script: Tang}        \p{Script=Tangut} (= \p{Script_Extensions=
                                    Tangut}) (6914)
          \p{Script: Tangsa}      \p{Script_Extensions=Tangsa} (Short:
                                    \p{Sc=Tnsa}, \p{Tnsa}) (89)
          \p{Script: Tangut}      \p{Script_Extensions=Tangut} (Short:
                                    \p{Sc=Tang}, \p{Tang}) (6914)
          \p{Script: Tavt}        \p{Script=Tai_Viet} (=
                                    \p{Script_Extensions=Tai_Viet}) (72)
          \p{Script: Telu}        \p{Script=Telugu} (100)
          \p{Script: Telugu}      (Short: \p{Sc=Telu}) (100: U+0C00..0C0C,
                                    U+0C0E..0C10, U+0C12..0C28,
                                    U+0C2A..0C39, U+0C3C..0C44, U+0C46..0C48
                                    ...)
          \p{Script: Tfng}        \p{Script=Tifinagh} (=
                                    \p{Script_Extensions=Tifinagh}) (59)
          \p{Script: Tglg}        \p{Script=Tagalog} (23)
          \p{Script: Thaa}        \p{Script=Thaana} (50)
          \p{Script: Thaana}      (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
          \p{Script: Thai}        \p{Script_Extensions=Thai} (Short: \p{Sc=
                                    Thai}, \p{Thai}) (86)
          \p{Script: Tibetan}     \p{Script_Extensions=Tibetan} (Short:
                                    \p{Sc=Tibt}, \p{Tibt}) (207)
          \p{Script: Tibt}        \p{Script=Tibetan} (=
                                    \p{Script_Extensions=Tibetan}) (207)
          \p{Script: Tifinagh}    \p{Script_Extensions=Tifinagh} (Short:
                                    \p{Sc=Tfng}, \p{Tfng}) (59)
          \p{Script: Tirh}        \p{Script=Tirhuta} (82)
          \p{Script: Tirhuta}     (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
                                    U+114D0..114D9)
          \p{Script: Tnsa}        \p{Script=Tangsa} (= \p{Script_Extensions=
                                    Tangsa}) (89)
          \p{Script: Toto}        \p{Script_Extensions=Toto} (Short: \p{Sc=
                                    Toto}, \p{Toto}) (31)
          \p{Script: Ugar}        \p{Script=Ugaritic} (=
                                    \p{Script_Extensions=Ugaritic}) (31)
          \p{Script: Ugaritic}    \p{Script_Extensions=Ugaritic} (Short:
                                    \p{Sc=Ugar}, \p{Ugar}) (31)
          \p{Script: Unknown}     \p{Script_Extensions=Unknown} (Short:
                                    \p{Sc=Zzzz}, \p{Zzzz}) (964_861 plus all
                                    above-Unicode code points)
          \p{Script: Vai}         \p{Script_Extensions=Vai} (Short: \p{Sc=
                                    Vai}, \p{Vai}) (300)
          \p{Script: Vaii}        \p{Script=Vai} (= \p{Script_Extensions=
                                    Vai}) (300)
          \p{Script: Vith}        \p{Script=Vithkuqi} (=
                                    \p{Script_Extensions=Vithkuqi}) (70)
          \p{Script: Vithkuqi}    \p{Script_Extensions=Vithkuqi} (Short:
                                    \p{Sc=Vith}, \p{Vith}) (70)
          \p{Script: Wancho}      \p{Script_Extensions=Wancho} (Short:
                                    \p{Sc=Wcho}, \p{Wcho}) (59)
          \p{Script: Wara}        \p{Script=Warang_Citi} (=
                                    \p{Script_Extensions=Warang_Citi}) (84)
          \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
                                    \p{Sc=Wara}, \p{Wara}) (84)
          \p{Script: Wcho}        \p{Script=Wancho} (= \p{Script_Extensions=
                                    Wancho}) (59)
          \p{Script: Xpeo}        \p{Script=Old_Persian} (=
                                    \p{Script_Extensions=Old_Persian}) (50)
          \p{Script: Xsux}        \p{Script=Cuneiform} (=
                                    \p{Script_Extensions=Cuneiform}) (1234)
          \p{Script: Yezi}        \p{Script=Yezidi} (47)
          \p{Script: Yezidi}      (Short: \p{Sc=Yezi}) (47: U+10E80..10EA9,
                                    U+10EAB..10EAD, U+10EB0..10EB1)
          \p{Script: Yi}          (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
                                    U+A490..A4C6)
          \p{Script: Yiii}        \p{Script=Yi} (1220)
          \p{Script: Zanabazar_Square} \p{Script_Extensions=
                                    Zanabazar_Square} (Short: \p{Sc=Zanb},
                                    \p{Zanb}) (72)
          \p{Script: Zanb}        \p{Script=Zanabazar_Square} (=
                                    \p{Script_Extensions=Zanabazar_Square})
                                    (72)
          \p{Script: Zinh}        \p{Script=Inherited} (657)
          \p{Script: Zyyy}        \p{Script=Common} (8301)
          \p{Script: Zzzz}        \p{Script=Unknown} (=
                                    \p{Script_Extensions=Unknown}) (964_861
                                    plus all above-Unicode code points)
          \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (90:
                                    U+061F, U+0640, U+1E900..1E94B,
                                    U+1E950..1E959, U+1E95E..1E95F)
          \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (90)
          \p{Script_Extensions: Aghb} \p{Script_Extensions=
                                    Caucasian_Albanian} (53)
          \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (65:
                                    U+11700..1171A, U+1171D..1172B,
                                    U+11730..11746)
          \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
                                    \p{Hluw}) (583: U+14400..14646)
          \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1414)
          \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
                                    (1414: U+0600..0604, U+0606..06DC,
                                    U+06DE..06FF, U+0750..077F,
                                    U+0870..088E, U+0890..0891 ...)
          \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
                                    (96: U+0531..0556, U+0559..058A,
                                    U+058D..058F, U+FB13..FB17)
          \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
                                    (31)
          \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
          \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
                                    (61: U+10B00..10B35, U+10B39..10B3F)
          \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
          \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (124)
          \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
                                    (124: U+1B00..1B4C, U+1B50..1B7E)
          \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
          \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
                                    U+A6A0..A6F7, U+16800..16A38)
          \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
          \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
                                    (36: U+16AD0..16AED, U+16AF0..16AF5)
          \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
                                    U+1BC0..1BF3, U+1BFC..1BFF)
          \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
          \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
          \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
                                    (113: U+0951..0952, U+0964..0965,
                                    U+0980..0983, U+0985..098C,
                                    U+098F..0990, U+0993..09A8 ...)
          \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
                                    (97: U+11C00..11C08, U+11C0A..11C36,
                                    U+11C38..11C45, U+11C50..11C6C)
          \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
          \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (117)
          \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
                                    (117: U+02EA..02EB, U+3001..3003,
                                    U+3008..3011, U+3013..301F,
                                    U+302A..302D, U+3030 ...)
          \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (115)
          \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
                                    (115: U+11000..1104D, U+11052..11075,
                                    U+1107F)
          \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
          \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
                                    (256: U+2800..28FF)
          \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
          \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
                                    (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
          \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
          \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
                                    U+1735..1736, U+1740..1753)
          \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (91)
          \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
                                    \p{Cans}) (726: U+1400..167F,
                                    U+18B0..18F5, U+11AB0..11ABF)
          \p{Script_Extensions: Cans} \p{Script_Extensions=
                                    Canadian_Aboriginal} (726)
          \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
          \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
                                    U+102A0..102D0)
          \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
                                    \p{Aghb}) (53: U+10530..10563, U+1056F)
          \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (91:
                                    U+09E6..09EF, U+1040..1049,
                                    U+11100..11134, U+11136..11147)
          \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
                                    U+AA00..AA36, U+AA40..AA4D,
                                    U+AA50..AA59, U+AA5C..AA5F)
          \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
          \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
                                    (172: U+13A0..13F5, U+13F8..13FD,
                                    U+AB70..ABBF)
          \p{Script_Extensions: Chorasmian} (Short: \p{Scx=Chrs}, \p{Chrs})
                                    (28: U+10FB0..10FCB)
          \p{Script_Extensions: Chrs} \p{Script_Extensions=Chorasmian} (28)
          \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
                                    (7873: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
                                    \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xa9
                                    \xab-\xb9\xbb-\xbf\xd7\xf7],
                                    U+02B9..02DF, U+02E5..02E9,
                                    U+02EC..02FF, U+0374, U+037E ...)
          \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
          \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
                                    (165: U+03E2..03EF, U+2C80..2CF3,
                                    U+2CF9..2CFF, U+102E0..102FB)
          \p{Script_Extensions: Cpmn} \p{Script_Extensions=Cypro_Minoan}
                                    (101)
          \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
          \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
                                    (1234: U+12000..12399, U+12400..1246E,
                                    U+12470..12474, U+12480..12543)
          \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
                                    (112: U+10100..10102, U+10107..10133,
                                    U+10137..1013F, U+10800..10805, U+10808,
                                    U+1080A..10835 ...)
          \p{Script_Extensions: Cypro_Minoan} (Short: \p{Scx=Cpmn},
                                    \p{Cpmn}) (101: U+10100..10101,
                                    U+12F90..12FF2)
          \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
                                    (510: U+0400..052F, U+1C80..1C88,
                                    U+1D2B, U+1D78, U+1DF8, U+2DE0..2DFF ...)
          \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (510)
          \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
                                    (80: U+10400..1044F)
          \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (220)
          \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
                                    (220: U+0900..0952, U+0955..097F,
                                    U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
                                    U+A830..A839 ...)
          \p{Script_Extensions: Diak} \p{Script_Extensions=Dives_Akuru} (72)
          \p{Script_Extensions: Dives_Akuru} (Short: \p{Scx=Diak}, \p{Diak})
                                    (72: U+11900..11906, U+11909,
                                    U+1190C..11913, U+11915..11916,
                                    U+11918..11935, U+11937..11938 ...)
          \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
          \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
                                    U+0964..096F, U+A830..A839,
                                    U+11800..1183B)
          \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
          \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
          \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
                                    (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
                                    U+1BC80..1BC88, U+1BC90..1BC99,
                                    U+1BC9C..1BCA3)
          \p{Script_Extensions: Egyp} \p{Script_Extensions=
                                    Egyptian_Hieroglyphs} (1110)
          \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
                                    \p{Egyp}) (1110: U+13000..13455)
          \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
          \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
                                    (40: U+10500..10527)
          \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
          \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
                                    (23: U+10FE0..10FF6)
          \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (523)
          \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
                                    (523: U+1200..1248, U+124A..124D,
                                    U+1250..1256, U+1258, U+125A..125D,
                                    U+1260..1288 ...)
          \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (174)
          \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
                                    (174: U+10A0..10C5, U+10C7, U+10CD,
                                    U+10D0..10FF, U+1C90..1CBA, U+1CBD..1CBF
                                    ...)
          \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (138)
          \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
                                    (138: U+0484, U+0487, U+2C00..2C5F,
                                    U+2E43, U+A66F, U+1E000..1E006 ...)
          \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
                                    (65)
          \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
                                    (77)
          \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
          \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
                                    U+10330..1034A)
          \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
          \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
                                    (116: U+0951..0952, U+0964..0965,
                                    U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
                                    U+1CF2..1CF4 ...)
          \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
                                    U+0342, U+0345, U+0370..0373,
                                    U+0375..0377, U+037A..037D, U+037F ...)
          \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
          \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
                                    (105: U+0951..0952, U+0964..0965,
                                    U+0A81..0A83, U+0A85..0A8D,
                                    U+0A8F..0A91, U+0A93..0AA8 ...)
          \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
          \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
                                    \p{Gong}) (65: U+0964..0965,
                                    U+11D60..11D65, U+11D67..11D68,
                                    U+11D6A..11D8E, U+11D90..11D91,
                                    U+11D93..11D98 ...)
          \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
                                    (94: U+0951..0952, U+0964..0965,
                                    U+0A01..0A03, U+0A05..0A0A,
                                    U+0A0F..0A10, U+0A13..0A28 ...)
          \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
          \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (98_696:
                                    U+2E80..2E99, U+2E9B..2EF3,
                                    U+2F00..2FD5, U+3001..3003,
                                    U+3005..3011, U+3013..301F ...)
          \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
          \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
                                    (11_775: U+1100..11FF, U+3001..3003,
                                    U+3008..3011, U+3013..301F,
                                    U+302E..3030, U+3037 ...)
          \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (98_696)
          \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
                                    \p{Rohg}) (55: U+060C, U+061B, U+061F,
                                    U+0640, U+06D4, U+10D00..10D27 ...)
6007          \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
6008          \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
6009                                    (23: U+1720..1736)
6010          \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
6011          \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
6012                                    U+108E0..108F2, U+108F4..108F5,
6013                                    U+108FB..108FF)
          \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
          \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
                                    (134: U+0591..05C7, U+05D0..05EA,
                                    U+05EF..05F4, U+FB1D..FB36,
                                    U+FB38..FB3C, U+FB3E ...)
          \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (433)
          \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
                                    (433: U+3001..3003, U+3008..3011,
                                    U+3013..301F, U+3030..3035, U+3037,
                                    U+303C..303D ...)
          \p{Script_Extensions: Hluw} \p{Script_Extensions=
                                    Anatolian_Hieroglyphs} (583)
          \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
                                    (127)
          \p{Script_Extensions: Hmnp} \p{Script_Extensions=
                                    Nyiakeng_Puachue_Hmong} (71)
          \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
                                    (108)
          \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
                                    \p{Armi}) (31: U+10840..10855,
                                    U+10857..1085F)
          \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
                                    (586: U+0300..0341, U+0343..0344,
                                    U+0346..0362, U+0953..0954,
                                    U+1AB0..1ACE, U+1DC2..1DF7 ...)
          \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
                                    \p{Phli}) (27: U+10B60..10B72,
                                    U+10B78..10B7F)
          \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
                                    Prti}, \p{Prti}) (30: U+10B40..10B55,
                                    U+10B58..10B5F)
          \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
          \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
          \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
                                    (91: U+A980..A9CD, U+A9CF..A9D9,
                                    U+A9DE..A9DF)
          \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (88:
                                    U+0966..096F, U+A830..A839,
                                    U+11080..110C2, U+110CD)
          \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
          \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (373)
          \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
                                    (106: U+0951..0952, U+0964..0965,
                                    U+0C80..0C8C, U+0C8E..0C90,
                                    U+0C92..0CA8, U+0CAA..0CB3 ...)
          \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
                                    (373: U+3001..3003, U+3008..3011,
                                    U+3013..301F, U+3030..3035, U+3037,
                                    U+303C..303D ...)
          \p{Script_Extensions: Kawi} (Short: \p{Scx=Kawi}, \p{Kawi}) (86:
                                    U+11F00..11F10, U+11F12..11F3A,
                                    U+11F3E..11F59)
          \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
                                    (48: U+A900..A92F)
          \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
          \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
                                    (68: U+10A00..10A03, U+10A05..10A06,
                                    U+10A0C..10A13, U+10A15..10A17,
                                    U+10A19..10A35, U+10A38..10A3A ...)
          \p{Script_Extensions: Khitan_Small_Script} (Short: \p{Scx=Kits},
                                    \p{Kits}) (471: U+16FE4, U+18B00..18CD5)
          \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
                                    U+1780..17DD, U+17E0..17E9,
                                    U+17F0..17F9, U+19E0..19FF)
          \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
          \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (85)
          \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (85:
                                    U+0AE6..0AEF, U+A830..A839,
                                    U+11200..11211, U+11213..11241)
          \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
                                    (81: U+0964..0965, U+A830..A839,
                                    U+112B0..112EA, U+112F0..112F9)
          \p{Script_Extensions: Kits} \p{Script_Extensions=
                                    Khitan_Small_Script} (471)
          \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (106)
          \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (88)
          \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
          \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (83:
                                    U+0E81..0E82, U+0E84, U+0E86..0E8A,
                                    U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
          \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (83)
          \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
                                    (1510: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
                                    \xf6\xf8-\xff], U+0100..02B8,
                                    U+02E0..02E4, U+0363..036F,
                                    U+0485..0486, U+0951..0952 ...)
          \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1510)
          \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
          \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
                                    U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
          \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
          \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
                                    U+0965, U+1900..191E, U+1920..192B,
                                    U+1930..193B, U+1940, U+1944..194F)
          \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
          \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
          \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
                                    (386: U+10107..10133, U+10600..10736,
                                    U+10740..10755, U+10760..10767)
          \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
                                    (268: U+10000..1000B, U+1000D..10026,
                                    U+10028..1003A, U+1003C..1003D,
                                    U+1003F..1004D, U+10050..1005D ...)
          \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (49:
                                    U+A4D0..A4FF, U+11FB0)
          \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
          \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
                                    U+10280..1029C)
          \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
          \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
                                    U+10920..10939, U+1093F)
          \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
                                    (61: U+0964..096F, U+A830..A839,
                                    U+11150..11176)
          \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
          \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
          \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
                                    (25: U+11EE0..11EF8)
          \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
                                    (126: U+0951..0952, U+0964..0965,
                                    U+0D00..0D0C, U+0D0E..0D10,
                                    U+0D12..0D44, U+0D46..0D48 ...)
          \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
          \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
                                    (30: U+0640, U+0840..085B, U+085E)
          \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
          \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
                                    (52: U+0640, U+10AC0..10AE6,
                                    U+10AEB..10AF6)
          \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
          \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
                                    (68: U+11C70..11C8F, U+11C92..11CA7,
                                    U+11CA9..11CB6)
          \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
                                    \p{Gonm}) (77: U+0964..0965,
                                    U+11D00..11D06, U+11D08..11D09,
                                    U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
                                    ...)
          \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
                                    (91: U+16E40..16E9A)
          \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
          \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
                                    \p{Mtei}) (79: U+AAE0..AAF6,
                                    U+ABC0..ABED, U+ABF0..ABF9)
          \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
                                    (213)
          \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
                                    \p{Mend}) (213: U+1E800..1E8C4,
                                    U+1E8C7..1E8D6)
          \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
                                    (90)
          \p{Script_Extensions: Mero} \p{Script_Extensions=
                                    Meroitic_Hieroglyphs} (32)
          \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
                                    \p{Merc}) (90: U+109A0..109B7,
                                    U+109BC..109CF, U+109D2..109FF)
          \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
                                    \p{Mero}) (32: U+10980..1099F)
          \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
                                    U+16F00..16F4A, U+16F4F..16F87,
                                    U+16F8F..16F9F)
          \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (126)
          \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
                                    U+A830..A839, U+11600..11644,
                                    U+11650..11659)
          \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (172)
          \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
                                    (172: U+1800..1819, U+1820..1878,
                                    U+1880..18AA, U+202F, U+11660..1166C)
          \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
                                    U+16A40..16A5E, U+16A60..16A69,
                                    U+16A6E..16A6F)
          \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
          \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
          \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
          \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
                                    (48: U+0A66..0A6F, U+11280..11286,
                                    U+11288, U+1128A..1128D, U+1128F..1129D,
                                    U+1129F..112A9)
          \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
                                    (224: U+1000..109F, U+A92E,
                                    U+A9E0..A9FE, U+AA60..AA7F)
          \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
          \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
                                    (40: U+10880..1089E, U+108A7..108AF)
          \p{Script_Extensions: Nag_Mundari} (Short: \p{Scx=Nagm}, \p{Nagm})
                                    (42: U+1E4D0..1E4F9)
          \p{Script_Extensions: Nagm} \p{Script_Extensions=Nag_Mundari} (42)
          \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
          \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
                                    (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
                                    U+1CF2, U+1CFA, U+A830..A835 ...)
          \p{Script_Extensions: Narb} \p{Script_Extensions=
                                    Old_North_Arabian} (32)
          \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
          \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
                                    (83: U+1980..19AB, U+19B0..19C9,
                                    U+19D0..19DA, U+19DE..19DF)
          \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (97:
                                    U+11400..1145B, U+1145D..11461)
          \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (67:
                                    U+060C, U+061B, U+061F, U+07C0..07FA,
                                    U+07FD..07FF, U+FD3E..FD3F)
          \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (67)
          \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
          \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
                                    U+16FE1, U+1B170..1B2FB)
          \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
                                    Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
                                    U+1E130..1E13D, U+1E140..1E149,
                                    U+1E14E..1E14F)
          \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
          \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
                                    U+1680..169C)
          \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
                                    (48: U+1C50..1C7F)
          \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
          \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
                                    \p{Hung}) (108: U+10C80..10CB2,
                                    U+10CC0..10CF2, U+10CFA..10CFF)
          \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
                                    (39: U+10300..10323, U+1032D..1032F)
          \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
                                    \p{Narb}) (32: U+10A80..10A9F)
          \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
                                    (44: U+0483, U+10350..1037A)
          \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
                                    (50: U+103A0..103C3, U+103C8..103D5)
          \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
                                    (40: U+10F00..10F27)
          \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
                                    \p{Sarb}) (32: U+10A60..10A7F)
          \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
                                    (73: U+10C00..10C48)
          \p{Script_Extensions: Old_Uyghur} (Short: \p{Scx=Ougr}, \p{Ougr})
                                    (28: U+0640, U+10AF2, U+10F70..10F89)
          \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (97:
                                    U+0951..0952, U+0964..0965,
                                    U+0B01..0B03, U+0B05..0B0C,
                                    U+0B0F..0B10, U+0B13..0B28 ...)
          \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
          \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (97)
          \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
                                    U+104B0..104D3, U+104D8..104FB)
          \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
          \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
          \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
                                    (40: U+10480..1049D, U+104A0..104A9)
          \p{Script_Extensions: Ougr} \p{Script_Extensions=Old_Uyghur} (28)
          \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
                                    \p{Hmng}) (127: U+16B00..16B45,
                                    U+16B50..16B59, U+16B5B..16B61,
                                    U+16B63..16B77, U+16B7D..16B8F)
          \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
          \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
                                    (32: U+10860..1087F)
          \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
                                    (57: U+11AC0..11AF8)
          \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
          \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
          \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
          \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
                                    (59: U+1802..1803, U+1805, U+A840..A877)
          \p{Script_Extensions: Phli} \p{Script_Extensions=
                                    Inscriptional_Pahlavi} (27)
          \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
                                    (30)
          \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
          \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
                                    (29: U+10900..1091B, U+1091F)
          \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
          \p{Script_Extensions: Prti} \p{Script_Extensions=
                                    Inscriptional_Parthian} (30)
          \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
                                    \p{Phlp}) (30: U+0640, U+10B80..10B91,
                                    U+10B99..10B9C, U+10BA9..10BAF)
          \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
          \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (586)
          \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
                                    U+A930..A953, U+A95F)
          \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
          \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
                                    (55)
          \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
                                    U+16A0..16EA, U+16EE..16F8)
          \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
          \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
                                    (61: U+0800..082D, U+0830..083E)
          \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
          \p{Script_Extensions: Sarb} \p{Script_Extensions=
                                    Old_South_Arabian} (32)
          \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
          \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
                                    (82: U+A880..A8C5, U+A8CE..A8D9)
          \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
          \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
                                    (102: U+0951, U+1CD7, U+1CD9,
                                    U+1CDC..1CDD, U+1CE0, U+11180..111DF)
          \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
                                    (48: U+10450..1047F)
          \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
          \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (102)
          \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
          \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
                                    (92: U+11580..115B5, U+115B8..115DD)
          \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
                                    (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
                                    U+1DAA1..1DAAF)
          \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
          \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (113)
          \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
                                    (113: U+0964..0965, U+0D81..0D83,
                                    U+0D85..0D96, U+0D9A..0DB1,
                                    U+0DB3..0DBB, U+0DBD ...)
          \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
          \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
                                    (43: U+0640, U+10F30..10F59)
          \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
          \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
          \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
                                    \p{Sora}) (35: U+110D0..110E8,
                                    U+110F0..110F9)
          \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
          \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
                                    (83: U+11A50..11AA2)
          \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
          \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
                                    (72: U+1B80..1BBF, U+1CC0..1CC7)
          \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (57)
          \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
                                    \p{Sylo}) (57: U+0964..0965,
                                    U+09E6..09EF, U+A800..A82C)
          \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (107)
          \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
                                    (107: U+060C, U+061B..061C, U+061F,
                                    U+0640, U+064B..0655, U+0670 ...)
          \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
                                    (25: U+1700..1715, U+171F, U+1735..1736)
          \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
          \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
                                    (20: U+1735..1736, U+1760..176C,
                                    U+176E..1770, U+1772..1773)
          \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
                                    U+1040..1049, U+1950..196D, U+1970..1974)
          \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
                                    (127: U+1A20..1A5E, U+1A60..1A7C,
                                    U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
          \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
                                    (72: U+AA80..AAC2, U+AADB..AADF)
          \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (80)
          \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (80:
                                    U+0964..0965, U+A830..A839,
                                    U+11680..116B9, U+116C0..116C9)
          \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
          \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
          \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
                                    U+0951..0952, U+0964..0965,
                                    U+0B82..0B83, U+0B85..0B8A,
                                    U+0B8E..0B90, U+0B92..0B95 ...)
          \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
          \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6914)
          \p{Script_Extensions: Tangsa} (Short: \p{Scx=Tnsa}, \p{Tnsa}) (89:
                                    U+16A70..16ABE, U+16AC0..16AC9)
          \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
                                    (6914: U+16FE0, U+17000..187F7,
                                    U+18800..18AFF, U+18D00..18D08)
          \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
          \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (106)
          \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
                                    (106: U+0951..0952, U+0964..0965,
                                    U+0C00..0C0C, U+0C0E..0C10,
                                    U+0C12..0C28, U+0C2A..0C39 ...)
          \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
          \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (25)
          \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
          \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
                                    U+060C, U+061B..061C, U+061F,
                                    U+0660..0669, U+0780..07B1, U+FDF2 ...)
          \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
                                    U+0E01..0E3A, U+0E40..0E5B)
          \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
                                    (207: U+0F00..0F47, U+0F49..0F6C,
                                    U+0F71..0F97, U+0F99..0FBC,
                                    U+0FBE..0FCC, U+0FCE..0FD4 ...)
          \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
          \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
                                    (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
          \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
          \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
                                    (97: U+0951..0952, U+0964..0965, U+1CF2,
                                    U+A830..A839, U+11480..114C7,
                                    U+114D0..114D9)
          \p{Script_Extensions: Tnsa} \p{Script_Extensions=Tangsa} (89)
          \p{Script_Extensions: Toto} (Short: \p{Scx=Toto}, \p{Toto}) (31:
                                    U+1E290..1E2AE)
          \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
          \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
                                    (31: U+10380..1039D, U+1039F)
          \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
                                    (964_861 plus all above-Unicode code
                                    points: U+0378..0379, U+0380..0383,
                                    U+038B, U+038D, U+03A2, U+0530 ...)
6416          \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
6417                                    U+A500..A62B)
6418          \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
6419          \p{Script_Extensions: Vith} \p{Script_Extensions=Vithkuqi} (70)
6420          \p{Script_Extensions: Vithkuqi} (Short: \p{Scx=Vith}, \p{Vith})
6421                                    (70: U+10570..1057A, U+1057C..1058A,
6422                                    U+1058C..10592, U+10594..10595,
6423                                    U+10597..105A1, U+105A3..105B1 ...)
6424          \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59:
6425                                    U+1E2C0..1E2F9, U+1E2FF)
6426          \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
6427          \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
6428                                    (84: U+118A0..118F2, U+118FF)
6429          \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59)
6430          \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
6431          \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
6432          \p{Script_Extensions: Yezi} \p{Script_Extensions=Yezidi} (60)
6433          \p{Script_Extensions: Yezidi} (Short: \p{Scx=Yezi}, \p{Yezi}) (60:
6434                                    U+060C, U+061B, U+061F, U+0660..0669,
6435                                    U+10E80..10EA9, U+10EAB..10EAD ...)
6436          \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
6437                                    U+3001..3002, U+3008..3011,
6438                                    U+3014..301B, U+30FB, U+A000..A48C,
6439                                    U+A490..A4C6 ...)
6440          \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
6441          \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
6442                                    \p{Zanb}) (72: U+11A00..11A47)
6443          \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
6444                                    (72)
6445          \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (586)
6446          \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7873)
6447          \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (964_861
6448                                    plus all above-Unicode code points)
6449          \p{Scx: *}              \p{Script_Extensions: *}
          \p{SD}                  \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (50)
          \p{SD: *}               \p{Soft_Dotted: *}
          \p{Sentence_Break: AT}  \p{Sentence_Break=ATerm} (4)
          \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
                                    U+FE52, U+FF0E)
          \p{Sentence_Break: CL}  \p{Sentence_Break=Close} (195)
          \p{Sentence_Break: Close} (Short: \p{SB=CL}) (195: [\"\'\(\)\[\]
                                    \{\}\xab\xbb], U+0F3A..0F3D,
                                    U+169B..169C, U+2018..201F,
                                    U+2039..203A, U+2045..2046 ...)
          \p{Sentence_Break: CR}  (Short: \p{SB=CR}) (1: [\r])
          \p{Sentence_Break: EX}  \p{Sentence_Break=Extend} (2550)
          \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2550: U+0300..036F,
                                    U+0483..0489, U+0591..05BD, U+05BF,
                                    U+05C1..05C2, U+05C4..05C5 ...)
          \p{Sentence_Break: FO}  \p{Sentence_Break=Format} (72)
          \p{Sentence_Break: Format} (Short: \p{SB=FO}) (72: [\xad],
                                    U+0600..0605, U+061C, U+06DD, U+070F,
                                    U+0890..0891 ...)
          \p{Sentence_Break: LE}  \p{Sentence_Break=OLetter} (132_036)
          \p{Sentence_Break: LF}  (Short: \p{SB=LF}) (1: [\n])
          \p{Sentence_Break: LO}  \p{Sentence_Break=Lower} (2497)
          \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2497: [a-z\xaa\xb5
                                    \xba\xdf-\xf6\xf8-\xff], U+0101, U+0103,
                                    U+0105, U+0107, U+0109 ...)
          \p{Sentence_Break: NU}  \p{Sentence_Break=Numeric} (682)
          \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (682: [0-9],
                                    U+0660..0669, U+066B..066C,
                                    U+06F0..06F9, U+07C0..07C9, U+0966..096F
                                    ...)
          \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (132_036: U+01BB,
                                    U+01C0..01C3, U+0294, U+02B9..02BF,
                                    U+02C6..02D1, U+02EC ...)
          \p{Sentence_Break: Other} (Short: \p{SB=XX}) (973_938 plus all
                                    above-Unicode code points: [^\t\n\cK\f
                                    \r\x20!\"\'\(\),\-.0-9:?A-Z\[\]a-z\{\}
                                    \x85\xa0\xaa-\xab\xad\xb5\xba-\xbb\xc0-
                                    \xd6\xd8-\xf6\xf8-\xff], U+02C2..02C5,
                                    U+02D2..02DF, U+02E5..02EB, U+02ED,
                                    U+02EF..02FF ...)
          \p{Sentence_Break: SC}  \p{Sentence_Break=SContinue} (26)
          \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
                                    U+055D, U+060C..060D, U+07F8, U+1802,
                                    U+1808 ...)
          \p{Sentence_Break: SE}  \p{Sentence_Break=Sep} (3)
          \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
                                    U+2028..2029)
          \p{Sentence_Break: Sp}  (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
                                    U+1680, U+2000..200A, U+202F, U+205F,
                                    U+3000)
          \p{Sentence_Break: ST}  \p{Sentence_Break=STerm} (151)
          \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (151: [!?], U+0589,
                                    U+061D..061F, U+06D4, U+0700..0702,
                                    U+07F9 ...)
          \p{Sentence_Break: UP}  \p{Sentence_Break=Upper} (1936)
          \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1936: [A-Z\xc0-\xd6
                                    \xd8-\xde], U+0100, U+0102, U+0104,
                                    U+0106, U+0108 ...)
          \p{Sentence_Break: XX}  \p{Sentence_Break=Other} (973_938 plus all
                                    above-Unicode code points)
          \p{Sentence_Terminal}   \p{Sentence_Terminal=Y} (Short: \p{STerm})
                                    (154)
          \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
                                    (1_113_958 plus all above-Unicode code
                                    points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
                                    \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
                                    \xff], U+0100..0588, U+058A..061C,
                                    U+0620..06D3, U+06D5..06FF, U+0703..07F8
                                    ...)
          \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (154:
                                    [!.?], U+0589, U+061D..061F, U+06D4,
                                    U+0700..0702, U+07F9 ...)
          \p{Separator}           \p{General_Category=Separator} (Short:
                                    \p{Z}) (19)
          \p{Sgnw}                \p{SignWriting} (= \p{Script_Extensions=
                                    SignWriting}) (672)
          \p{Sharada}             \p{Script_Extensions=Sharada} (Short:
                                    \p{Shrd}; NOT \p{Block=Sharada}) (102)
          \p{Shavian}             \p{Script_Extensions=Shavian} (Short:
                                    \p{Shaw}) (48)
          \p{Shaw}                \p{Shavian} (= \p{Script_Extensions=
                                    Shavian}) (48)
        X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
                                    (16)
          \p{Shrd}                \p{Sharada} (= \p{Script_Extensions=
                                    Sharada}) (NOT \p{Block=Sharada}) (102)
          \p{Sidd}                \p{Siddham} (= \p{Script_Extensions=
                                    Siddham}) (NOT \p{Block=Siddham}) (92)
          \p{Siddham}             \p{Script_Extensions=Siddham} (Short:
                                    \p{Sidd}; NOT \p{Block=Siddham}) (92)
          \p{SignWriting}         \p{Script_Extensions=SignWriting} (Short:
                                    \p{Sgnw}) (672)
          \p{Sind}                \p{Khudawadi} (= \p{Script_Extensions=
                                    Khudawadi}) (NOT \p{Block=Khudawadi})
                                    (81)
          \p{Sinh}                \p{Sinhala} (= \p{Script_Extensions=
                                    Sinhala}) (NOT \p{Block=Sinhala}) (113)
          \p{Sinhala}             \p{Script_Extensions=Sinhala} (Short:
                                    \p{Sinh}; NOT \p{Block=Sinhala}) (113)
        X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
          \p{Sk}                  \p{Modifier_Symbol} (=
                                    \p{General_Category=Modifier_Symbol})
                                    (125)
          \p{Sm}                  \p{Math_Symbol} (= \p{General_Category=
                                    Math_Symbol}) (948)
        X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
                                    \p{InSmallForms}) (32)
        X \p{Small_Forms}         \p{Small_Form_Variants} (= \p{Block=
                                    Small_Form_Variants}) (32)
        X \p{Small_Kana_Ext}      \p{Small_Kana_Extension} (= \p{Block=
                                    Small_Kana_Extension}) (64)
        X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short:
                                    \p{InSmallKanaExt}) (64)
          \p{So}                  \p{Other_Symbol} (= \p{General_Category=
                                    Other_Symbol}) (6634)
          \p{Soft_Dotted}         \p{Soft_Dotted=Y} (Short: \p{SD}) (50)
          \p{Soft_Dotted: N*}     (Short: \p{SD=N}, \P{SD}) (1_114_062 plus
                                    all above-Unicode code points: [\x00-
                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                                    Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
                                    U+0100..012E, U+0130..0248,
                                    U+024A..0267, U+0269..029C, U+029E..02B1
                                    ...)
          \p{Soft_Dotted: Y*}     (Short: \p{SD=Y}, \p{SD}) (50: [i-j],
                                    U+012F, U+0249, U+0268, U+029D, U+02B2
                                    ...)
          \p{Sogd}                \p{Sogdian} (= \p{Script_Extensions=
                                    Sogdian}) (NOT \p{Block=Sogdian}) (43)
          \p{Sogdian}             \p{Script_Extensions=Sogdian} (Short:
                                    \p{Sogd}; NOT \p{Block=Sogdian}) (43)
          \p{Sogo}                \p{Old_Sogdian} (= \p{Script_Extensions=
                                    Old_Sogdian}) (NOT \p{Block=
                                    Old_Sogdian}) (40)
          \p{Sora}                \p{Sora_Sompeng} (= \p{Script_Extensions=
                                    Sora_Sompeng}) (NOT \p{Block=
                                    Sora_Sompeng}) (35)
          \p{Sora_Sompeng}        \p{Script_Extensions=Sora_Sompeng} (Short:
                                    \p{Sora}; NOT \p{Block=Sora_Sompeng})
                                    (35)
          \p{Soyo}                \p{Soyombo} (= \p{Script_Extensions=
                                    Soyombo}) (NOT \p{Block=Soyombo}) (83)
          \p{Soyombo}             \p{Script_Extensions=Soyombo} (Short:
                                    \p{Soyo}; NOT \p{Block=Soyombo}) (83)
          \p{Space}               \p{White_Space} (= \p{White_Space=Y}) (25)
          \p{Space: *}            \p{White_Space: *}
          \p{Space_Separator}     \p{General_Category=Space_Separator}
                                    (Short: \p{Zs}) (17)
          \p{SpacePerl}           \p{XPosixSpace} (25)
          \p{Spacing_Mark}        \p{General_Category=Spacing_Mark} (Short:
                                    \p{Mc}) (452)
        X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
                                    (Short: \p{InModifierLetters}) (80)
        X \p{Specials}            \p{Block=Specials} (16)
          \p{STerm}               \p{Sentence_Terminal} (=
                                    \p{Sentence_Terminal=Y}) (154)
          \p{STerm: *}            \p{Sentence_Terminal: *}
          \p{Sund}                \p{Sundanese} (= \p{Script_Extensions=
                                    Sundanese}) (NOT \p{Block=Sundanese})
                                    (72)
          \p{Sundanese}           \p{Script_Extensions=Sundanese} (Short:
                                    \p{Sund}; NOT \p{Block=Sundanese}) (72)
        X \p{Sundanese_Sup}       \p{Sundanese_Supplement} (= \p{Block=
                                    Sundanese_Supplement}) (16)
        X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
                                    \p{InSundaneseSup}) (16)
        X \p{Sup_Arrows_A}        \p{Supplemental_Arrows_A} (= \p{Block=
                                    Supplemental_Arrows_A}) (16)
        X \p{Sup_Arrows_B}        \p{Supplemental_Arrows_B} (= \p{Block=
                                    Supplemental_Arrows_B}) (128)
        X \p{Sup_Arrows_C}        \p{Supplemental_Arrows_C} (= \p{Block=
                                    Supplemental_Arrows_C}) (256)
        X \p{Sup_Math_Operators}  \p{Supplemental_Mathematical_Operators} (=
                                    \p{Block=
                                    Supplemental_Mathematical_Operators})
                                    (256)
        X \p{Sup_PUA_A}           \p{Supplementary_Private_Use_Area_A} (=
                                    \p{Block=
                                    Supplementary_Private_Use_Area_A})
                                    (65_536)
        X \p{Sup_PUA_B}           \p{Supplementary_Private_Use_Area_B} (=
                                    \p{Block=
                                    Supplementary_Private_Use_Area_B})
                                    (65_536)
        X \p{Sup_Punctuation}     \p{Supplemental_Punctuation} (= \p{Block=
                                    Supplemental_Punctuation}) (128)
        X \p{Sup_Symbols_And_Pictographs}
                                    \p{Supplemental_Symbols_And_Pictographs}
                                    (= \p{Block=
                                    Supplemental_Symbols_And_Pictographs})
                                    (256)
        X \p{Super_And_Sub}       \p{Superscripts_And_Subscripts} (=
                                    \p{Block=Superscripts_And_Subscripts})
                                    (48)
        X \p{Superscripts_And_Subscripts} \p{Block=
                                    Superscripts_And_Subscripts} (Short:
                                    \p{InSuperAndSub}) (48)
        X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
                                    \p{InSupArrowsA}) (16)
        X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
                                    \p{InSupArrowsB}) (128)
        X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
                                    \p{InSupArrowsC}) (256)
        X \p{Supplemental_Mathematical_Operators} \p{Block=
                                    Supplemental_Mathematical_Operators}
                                    (Short: \p{InSupMathOperators}) (256)
        X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
                                    (Short: \p{InSupPunctuation}) (128)
        X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
                                    Supplemental_Symbols_And_Pictographs}
                                    (Short: \p{InSupSymbolsAndPictographs})
                                    (256)
        X \p{Supplementary_Private_Use_Area_A} \p{Block=
                                    Supplementary_Private_Use_Area_A}
                                    (Short: \p{InSupPUAA}) (65_536)
        X \p{Supplementary_Private_Use_Area_B} \p{Block=
                                    Supplementary_Private_Use_Area_B}
                                    (Short: \p{InSupPUAB}) (65_536)
          \p{Surrogate}           \p{General_Category=Surrogate} (Short:
                                    \p{Cs}) (2048)
        X \p{Sutton_SignWriting}  \p{Block=Sutton_SignWriting} (688)
          \p{Sylo}                \p{Syloti_Nagri} (= \p{Script_Extensions=
                                    Syloti_Nagri}) (NOT \p{Block=
                                    Syloti_Nagri}) (57)
          \p{Syloti_Nagri}        \p{Script_Extensions=Syloti_Nagri} (Short:
                                    \p{Sylo}; NOT \p{Block=Syloti_Nagri})
                                    (57)
          \p{Symbol}              \p{General_Category=Symbol} (Short: \p{S})
                                    (7770)
        X \p{Symbols_And_Pictographs_Ext_A}
                                    \p{Symbols_And_Pictographs_Extended_A}
                                    (= \p{Block=
                                    Symbols_And_Pictographs_Extended_A})
                                    (144)
        X \p{Symbols_And_Pictographs_Extended_A} \p{Block=
                                    Symbols_And_Pictographs_Extended_A} (144)
        X \p{Symbols_For_Legacy_Computing} \p{Block=
                                    Symbols_For_Legacy_Computing} (256)
          \p{Syrc}                \p{Syriac} (= \p{Script_Extensions=
                                    Syriac}) (NOT \p{Block=Syriac}) (107)
          \p{Syriac}              \p{Script_Extensions=Syriac} (Short:
                                    \p{Syrc}; NOT \p{Block=Syriac}) (107)
        X \p{Syriac_Sup}          \p{Syriac_Supplement} (= \p{Block=
                                    Syriac_Supplement}) (16)
        X \p{Syriac_Supplement}   \p{Block=Syriac_Supplement} (Short:
                                    \p{InSyriacSup}) (16)
          \p{Tagalog}             \p{Script_Extensions=Tagalog} (Short:
                                    \p{Tglg}; NOT \p{Block=Tagalog}) (25)
          \p{Tagb}                \p{Tagbanwa} (= \p{Script_Extensions=
                                    Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
          \p{Tagbanwa}            \p{Script_Extensions=Tagbanwa} (Short:
                                    \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
        X \p{Tags}                \p{Block=Tags} (128)
          \p{Tai_Le}              \p{Script_Extensions=Tai_Le} (Short:
                                    \p{Tale}; NOT \p{Block=Tai_Le}) (45)
          \p{Tai_Tham}            \p{Script_Extensions=Tai_Tham} (Short:
                                    \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
          \p{Tai_Viet}            \p{Script_Extensions=Tai_Viet} (Short:
                                    \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
        X \p{Tai_Xuan_Jing}       \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
                                    Tai_Xuan_Jing_Symbols}) (96)
        X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
                                    \p{InTaiXuanJing}) (96)
          \p{Takr}                \p{Takri} (= \p{Script_Extensions=Takri})
                                    (NOT \p{Block=Takri}) (80)
          \p{Takri}               \p{Script_Extensions=Takri} (Short:
                                    \p{Takr}; NOT \p{Block=Takri}) (80)
          \p{Tale}                \p{Tai_Le} (= \p{Script_Extensions=
                                    Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
          \p{Talu}                \p{New_Tai_Lue} (= \p{Script_Extensions=
                                    New_Tai_Lue}) (NOT \p{Block=
                                    New_Tai_Lue}) (83)
          \p{Tamil}               \p{Script_Extensions=Tamil} (Short:
                                    \p{Taml}; NOT \p{Block=Tamil}) (133)
        X \p{Tamil_Sup}           \p{Tamil_Supplement} (= \p{Block=
                                    Tamil_Supplement}) (64)
        X \p{Tamil_Supplement}    \p{Block=Tamil_Supplement} (Short:
                                    \p{InTamilSup}) (64)
          \p{Taml}                \p{Tamil} (= \p{Script_Extensions=Tamil})
                                    (NOT \p{Block=Tamil}) (133)
          \p{Tang}                \p{Tangut} (= \p{Script_Extensions=
                                    Tangut}) (NOT \p{Block=Tangut}) (6914)
          \p{Tangsa}              \p{Script_Extensions=Tangsa} (Short:
                                    \p{Tnsa}; NOT \p{Block=Tangsa}) (89)
          \p{Tangut}              \p{Script_Extensions=Tangut} (Short:
                                    \p{Tang}; NOT \p{Block=Tangut}) (6914)
        X \p{Tangut_Components}   \p{Block=Tangut_Components} (768)
        X \p{Tangut_Sup}          \p{Tangut_Supplement} (= \p{Block=
                                    Tangut_Supplement}) (128)
        X \p{Tangut_Supplement}   \p{Block=Tangut_Supplement} (Short:
                                    \p{InTangutSup}) (128)
          \p{Tavt}                \p{Tai_Viet} (= \p{Script_Extensions=
                                    Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
          \p{Telu}                \p{Telugu} (= \p{Script_Extensions=
                                    Telugu}) (NOT \p{Block=Telugu}) (106)
          \p{Telugu}              \p{Script_Extensions=Telugu} (Short:
                                    \p{Telu}; NOT \p{Block=Telugu}) (106)
          \p{Term}                \p{Terminal_Punctuation} (=
                                    \p{Terminal_Punctuation=Y}) (278)
          \p{Term: *}             \p{Terminal_Punctuation: *}
          \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
                                    \p{Term}) (278)
          \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
                                    (1_113_834 plus all above-Unicode code
                                    points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
                                    9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
                                    U+0100..037D, U+037F..0386,
                                    U+0388..0588, U+058A..05C2, U+05C4..060B
                                    ...)
          \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (278:
                                    [!,.:;?], U+037E, U+0387, U+0589,
                                    U+05C3, U+060C ...)
          \p{Tfng}                \p{Tifinagh} (= \p{Script_Extensions=
                                    Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
          \p{Tglg}                \p{Tagalog} (= \p{Script_Extensions=
                                    Tagalog}) (NOT \p{Block=Tagalog}) (25)
          \p{Thaa}                \p{Thaana} (= \p{Script_Extensions=
                                    Thaana}) (NOT \p{Block=Thaana}) (66)
          \p{Thaana}              \p{Script_Extensions=Thaana} (Short:
                                    \p{Thaa}; NOT \p{Block=Thaana}) (66)
          \p{Thai}                \p{Script_Extensions=Thai} (NOT \p{Block=
                                    Thai}) (86)
          \p{Tibetan}             \p{Script_Extensions=Tibetan} (Short:
                                    \p{Tibt}; NOT \p{Block=Tibetan}) (207)
          \p{Tibt}                \p{Tibetan} (= \p{Script_Extensions=
                                    Tibetan}) (NOT \p{Block=Tibetan}) (207)
          \p{Tifinagh}            \p{Script_Extensions=Tifinagh} (Short:
                                    \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
          \p{Tirh}                \p{Tirhuta} (= \p{Script_Extensions=
                                    Tirhuta}) (NOT \p{Block=Tirhuta}) (97)
          \p{Tirhuta}             \p{Script_Extensions=Tirhuta} (Short:
                                    \p{Tirh}; NOT \p{Block=Tirhuta}) (97)
          \p{Title}               \p{Titlecase} (/i= Cased=Yes) (31)
          \p{Titlecase}           (= \p{Gc=Lt}) (Short: \p{Title}; /i=
                                    Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
                                    U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
          \p{Titlecase_Letter}    \p{General_Category=Titlecase_Letter}
                                    (Short: \p{Lt}; /i= General_Category=
                                    Cased_Letter) (31)
          \p{Tnsa}                \p{Tangsa} (= \p{Script_Extensions=
                                    Tangsa}) (NOT \p{Block=Tangsa}) (89)
          \p{Toto}                \p{Script_Extensions=Toto} (NOT \p{Block=
                                    Toto}) (31)
        X \p{Transport_And_Map}   \p{Transport_And_Map_Symbols} (= \p{Block=
                                    Transport_And_Map_Symbols}) (128)
        X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
                                    (Short: \p{InTransportAndMap}) (128)
        X \p{UCAS}                \p{Unified_Canadian_Aboriginal_Syllabics}
                                    (= \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics})
                                    (640)
        X \p{UCAS_Ext}            \p{Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended} (= \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended}) (80)
        X \p{UCAS_Ext_A}          \p{Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended_A} (= \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended_A}) (16)
          \p{Ugar}                \p{Ugaritic} (= \p{Script_Extensions=
                                    Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
          \p{Ugaritic}            \p{Script_Extensions=Ugaritic} (Short:
                                    \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
          \p{UIdeo}               \p{Unified_Ideograph} (=
                                    \p{Unified_Ideograph=Y}) (97_058)
          \p{UIdeo: *}            \p{Unified_Ideograph: *}
          \p{Unassigned}          \p{General_Category=Unassigned} (Short:
                                    \p{Cn}) (825_345 plus all above-Unicode
                                    code points)
          \p{Unicode}             \p{Any} (1_114_112)
        X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics}
                                    (Short: \p{InUCAS}) (640)
        X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended} (Short: \p{InUCASExt}) (80)
        X \p{Unified_Canadian_Aboriginal_Syllabics_Extended_A} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended_A} (Short: \p{InUCASExtA}) (16)
          \p{Unified_Ideograph}   \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
                                    (97_058)
          \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
                                    (1_017_054 plus all above-Unicode code
                                    points: U+0000..33FF, U+4DC0..4DFF,
                                    U+A000..FA0D, U+FA10, U+FA12,
                                    U+FA15..FA1E ...)
          \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (97_058:
                                    U+3400..4DBF, U+4E00..9FFF,
                                    U+FA0E..FA0F, U+FA11, U+FA13..FA14,
                                    U+FA1F ...)
          \p{Unknown}             \p{Script_Extensions=Unknown} (Short:
                                    \p{Zzzz}) (964_861 plus all above-
                                    Unicode code points)
          \p{Upper}               \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
                                    Cased=Yes) (1951)
          \p{Upper: *}            \p{Uppercase: *}
          \p{Uppercase}           \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
                                    Cased=Yes) (1951)
          \p{Uppercase: N*}       (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
                                    No) (1_112_161 plus all above-Unicode
                                    code points: [\x00-\x20!\"#\$\%&\'
                                    \(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-z\{
                                    \|\}~\x7f-\xbf\xd7\xdf-\xff], U+0101,
                                    U+0103, U+0105, U+0107, U+0109 ...)
          \p{Uppercase: Y*}       (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
                                    Yes) (1951: [A-Z\xc0-\xd6\xd8-\xde],
                                    U+0100, U+0102, U+0104, U+0106, U+0108
                                    ...)
          \p{Uppercase_Letter}    \p{General_Category=Uppercase_Letter}
                                    (Short: \p{Lu}; /i= General_Category=
                                    Cased_Letter) (1831)
          \p{Vai}                 \p{Script_Extensions=Vai} (NOT \p{Block=
                                    Vai}) (300)
          \p{Vaii}                \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
                                    \p{Block=Vai}) (300)
          \p{Variation_Selector}  \p{Variation_Selector=Y} (Short: \p{VS};
                                    NOT \p{Variation_Selectors}) (260)
          \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_852
                                    plus all above-Unicode code points:
                                    U+0000..180A, U+180E, U+1810..FDFF,
                                    U+FE10..E00FF, U+E01F0..infinity)
          \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (260:
                                    U+180B..180D, U+180F, U+FE00..FE0F,
                                    U+E0100..E01EF)
        X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
                                    \p{InVS}) (16)
        X \p{Variation_Selectors_Supplement} \p{Block=
                                    Variation_Selectors_Supplement} (Short:
                                    \p{InVSSup}) (240)
        X \p{Vedic_Ext}           \p{Vedic_Extensions} (= \p{Block=
                                    Vedic_Extensions}) (48)
        X \p{Vedic_Extensions}    \p{Block=Vedic_Extensions} (Short:
                                    \p{InVedicExt}) (48)
        X \p{Vertical_Forms}      \p{Block=Vertical_Forms} (16)
6883          \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
6884                                    (786_609 plus all above-Unicode code
6885                                    points)
6886          \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (786_609 plus
6887                                    all above-Unicode code points: [\x00-
6888                                    \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
6889                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
6890                                    U+02EC..10FF, U+1200..1400,
6891                                    U+1680..18AF, U+1900..2015 ...)
6892          \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
6893                                    Transformed_Rotated} (47)
6894          \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
6895                                    (47: U+2329..232A, U+3008..3011,
6896                                    U+3014..301F, U+3030, U+30A0, U+30FC ...)
6897          \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
6898                                    (148: U+3001..3002, U+3041, U+3043,
6899                                    U+3045, U+3047, U+3049 ...)
6900          \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
6901                                    Transformed_Upright} (148)
6902          \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
6903                                    (327_308)
6904          \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (327_308:
6905                                    [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
6906                                    U+02EA..02EB, U+1100..11FF,
6907                                    U+1401..167F, U+18B0..18FF, U+2016 ...)
6908          \p{VertSpace}           \v (7: [\n\cK\f\r\x85], U+2028..2029)
6909          \p{Vith}                \p{Vithkuqi} (= \p{Script_Extensions=
6910                                    Vithkuqi}) (NOT \p{Block=Vithkuqi}) (70)
6911          \p{Vithkuqi}            \p{Script_Extensions=Vithkuqi} (Short:
6912                                    \p{Vith}; NOT \p{Block=Vithkuqi}) (70)
6913          \p{Vo: *}               \p{Vertical_Orientation: *}
6914          \p{VS}                  \p{Variation_Selector} (=
6915                                    \p{Variation_Selector=Y}) (NOT
6916                                    \p{Variation_Selectors}) (260)
6917          \p{VS: *}               \p{Variation_Selector: *}
6918        X \p{VS_Sup}              \p{Variation_Selectors_Supplement} (=
6919                                    \p{Block=
6920                                    Variation_Selectors_Supplement}) (240)
6921          \p{Wancho}              \p{Script_Extensions=Wancho} (Short:
6922                                    \p{Wcho}; NOT \p{Block=Wancho}) (59)
6923          \p{Wara}                \p{Warang_Citi} (= \p{Script_Extensions=
6924                                    Warang_Citi}) (NOT \p{Block=
6925                                    Warang_Citi}) (84)
6926          \p{Warang_Citi}         \p{Script_Extensions=Warang_Citi} (Short:
6927                                    \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
6928          \p{WB: *}               \p{Word_Break: *}
6929          \p{Wcho}                \p{Wancho} (= \p{Script_Extensions=
6930                                    Wancho}) (NOT \p{Block=Wancho}) (59)
6931          \p{White_Space}         \p{White_Space=Y} (Short: \p{Space}) (25)
6932          \p{White_Space: N*}     (Short: \p{Space=N}, \P{Space}) (1_114_087
6933                                    plus all above-Unicode code points: [^
6934                                    \t\n\cK\f\r\x20\x85\xa0], U+0100..167F,
6935                                    U+1681..1FFF, U+200B..2027,
6936                                    U+202A..202E, U+2030..205E ...)
6937          \p{White_Space: Y*}     (Short: \p{Space=Y}, \p{Space}) (25: [\t
6938                                    \n\cK\f\r\x20\x85\xa0], U+1680,
6939                                    U+2000..200A, U+2028..2029, U+202F,
6940                                    U+205F ...)
6941          \p{Word}                \p{XPosixWord} (139_612)
6942          \p{Word_Break: ALetter} (Short: \p{WB=LE}) (29_489: [A-Za-z\xaa
6943                                    \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6944                                    U+0100..02D7, U+02DE..02FF,
6945                                    U+0370..0374, U+0376..0377, U+037A..037D
6946                                    ...)
6947          \p{Word_Break: CR}      (Short: \p{WB=CR}) (1: [\r])
6948          \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
6949          \p{Word_Break: DQ}      \p{Word_Break=Double_Quote} (1)
6950          \p{Word_Break: E_Base}  (Short: \p{WB=EB}) (0)
6951          \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0)
6952          \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0)
6953          \p{Word_Break: EB}      \p{Word_Break=E_Base} (0)
6954          \p{Word_Break: EBG}     \p{Word_Break=E_Base_GAZ} (0)
6955          \p{Word_Break: EM}      \p{Word_Break=E_Modifier} (0)
6956          \p{Word_Break: EX}      \p{Word_Break=ExtendNumLet} (11)
6957          \p{Word_Break: Extend}  (Short: \p{WB=Extend}) (2554:
6958                                    U+0300..036F, U+0483..0489,
6959                                    U+0591..05BD, U+05BF, U+05C1..05C2,
6960                                    U+05C4..05C5 ...)
6961          \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
6962                                    U+203F..2040, U+2054, U+FE33..FE34,
6963                                    U+FE4D..FE4F ...)
6964          \p{Word_Break: FO}      \p{Word_Break=Format} (71)
6965          \p{Word_Break: Format}  (Short: \p{WB=FO}) (71: [\xad],
6966                                    U+0600..0605, U+061C, U+06DD, U+070F,
6967                                    U+0890..0891 ...)
6968          \p{Word_Break: GAZ}     \p{Word_Break=Glue_After_Zwj} (0)
6969          \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0)
6970          \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75:
6971                                    U+05D0..05EA, U+05EF..05F2, U+FB1D,
6972                                    U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
6973                                    ...)
6974          \p{Word_Break: HL}      \p{Word_Break=Hebrew_Letter} (75)
6975          \p{Word_Break: KA}      \p{Word_Break=Katakana} (331)
6976          \p{Word_Break: Katakana} (Short: \p{WB=KA}) (331: U+3031..3035,
6977                                    U+309B..309C, U+30A0..30FA,
6978                                    U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
6979                                    ...)
6980          \p{Word_Break: LE}      \p{Word_Break=ALetter} (29_489)
6981          \p{Word_Break: LF}      (Short: \p{WB=LF}) (1: [\n])
6982          \p{Word_Break: MB}      \p{Word_Break=MidNumLet} (7)
6983          \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9: [:\xb7], U+0387,
6984                                    U+055F, U+05F4, U+2027, U+FE13 ...)
6985          \p{Word_Break: MidNum}  (Short: \p{WB=MN}) (15: [,;], U+037E,
6986                                    U+0589, U+060C..060D, U+066C, U+07F8 ...)
6987          \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
6988                                    U+2018..2019, U+2024, U+FE52, U+FF07,
6989                                    U+FF0E)
6990          \p{Word_Break: ML}      \p{Word_Break=MidLetter} (9)
6991          \p{Word_Break: MN}      \p{Word_Break=MidNum} (15)
6992          \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
6993                                    U+2028..2029)
6994          \p{Word_Break: NL}      \p{Word_Break=Newline} (5)
6995          \p{Word_Break: NU}      \p{Word_Break=Numeric} (681)
6996          \p{Word_Break: Numeric} (Short: \p{WB=NU}) (681: [0-9],
6997                                    U+0660..0669, U+066B, U+06F0..06F9,
6998                                    U+07C0..07C9, U+0966..096F ...)
6999          \p{Word_Break: Other}   (Short: \p{WB=XX}) (1_080_819 plus all
7000                                    above-Unicode code points: [^\n\cK\f\r
7001                                    \x20\"\',.0-9:;A-Z_a-z\x85\xaa\xad\xb5
7002                                    \xb7\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7003                                    U+02D8..02DD, U+0375, U+0378..0379,
7004                                    U+0380..0385, U+038B ...)
7005          \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
7006                                    U+1F1E6..1F1FF)
7007          \p{Word_Break: RI}      \p{Word_Break=Regional_Indicator} (26)
7008          \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
7009          \p{Word_Break: SQ}      \p{Word_Break=Single_Quote} (1)
7010          \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20],
7011                                    U+1680, U+2000..2006, U+2008..200A,
7012                                    U+205F, U+3000)
7013          \p{Word_Break: XX}      \p{Word_Break=Other} (1_080_819 plus all
7014                                    above-Unicode code points)
7015          \p{Word_Break: ZWJ}     (Short: \p{WB=ZWJ}) (1: U+200D)
7016          \p{WSpace}              \p{White_Space} (= \p{White_Space=Y}) (25)
7017          \p{WSpace: *}           \p{White_Space: *}
7018          \p{XDigit}              \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
7019          \p{XID_Continue}        \p{XID_Continue=Y} (Short: \p{XIDC})
7020                                    (139_463)
7021          \p{XID_Continue: N*}    (Short: \p{XIDC=N}, \P{XIDC}) (974_649
7022                                    plus all above-Unicode code points:
7023                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
7024                                    \@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4
7025                                    \xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
7026                                    U+02C2..02C5, U+02D2..02DF,
7027                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
7028          \p{XID_Continue: Y*}    (Short: \p{XIDC=Y}, \p{XIDC}) (139_463:
7029                                    [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
7030                                    \xd8-\xf6\xf8-\xff], U+0100..02C1,
7031                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
7032                                    U+02EE ...)
7033          \p{XID_Start}           \p{XID_Start=Y} (Short: \p{XIDS}) (136_322)
7034          \p{XID_Start: N*}       (Short: \p{XIDS=N}, \P{XIDS}) (977_790
7035                                    plus all above-Unicode code points:
7036                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
7037                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
7038                                    \xb6-\xb9\xbb-\xbf\xd7\xf7],
7039                                    U+02C2..02C5, U+02D2..02DF,
7040                                    U+02E5..02EB, U+02ED, U+02EF..036F ...)
7041          \p{XID_Start: Y*}       (Short: \p{XIDS=Y}, \p{XIDS}) (136_322:
7042                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
7043                                    \xf8-\xff], U+0100..02C1, U+02C6..02D1,
7044                                    U+02E0..02E4, U+02EC, U+02EE ...)
7045          \p{XIDC}                \p{XID_Continue} (= \p{XID_Continue=Y})
7046                                    (139_463)
7047          \p{XIDC: *}             \p{XID_Continue: *}
7048          \p{XIDS}                \p{XID_Start} (= \p{XID_Start=Y}) (136_322)
7049          \p{XIDS: *}             \p{XID_Start: *}
7050          \p{Xpeo}                \p{Old_Persian} (= \p{Script_Extensions=
7051                                    Old_Persian}) (NOT \p{Block=
7052                                    Old_Persian}) (50)
7053          \p{XPerlSpace}          \p{XPosixSpace} (25)
7054          \p{XPosixAlnum}         Alphabetic and (decimal) Numeric (Short:
7055                                    \p{Alnum}) (138_445: [0-9A-Za-z\xaa\xb5
7056                                    \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7057                                    U+0100..02C1, U+02C6..02D1,
7058                                    U+02E0..02E4, U+02EC, U+02EE ...)
7059          \p{XPosixAlpha}         \p{Alphabetic=Y} (Short: \p{Alpha})
7060                                    (137_765)
7061          \p{XPosixBlank}         \h, Horizontal white space (Short:
7062                                    \p{Blank}) (18: [\t\x20\xa0], U+1680,
7063                                    U+2000..200A, U+202F, U+205F, U+3000)
7064          \p{XPosixCntrl}         \p{General_Category=Control} Control
7065                                    characters (Short: \p{Cc}) (65)
7066          \p{XPosixDigit}         \p{General_Category=Decimal_Number} [0-9]
7067                                    + all other decimal digits (Short:
7068                                    \p{Nd}) (680)
7069          \p{XPosixGraph}         Characters that are graphical (Short:
7070                                    \p{Graph}) (286_635: [!\"#\$\%&\'
7071                                    \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`a-z
7072                                    \{\|\}~\xa1-\xff], U+0100..0377,
7073                                    U+037A..037F, U+0384..038A, U+038C,
7074                                    U+038E..03A1 ...)
7075          \p{XPosixLower}         \p{Lowercase=Y} (Short: \p{Lower}; /i=
7076                                    Cased=Yes) (2544)
7077          \p{XPosixPrint}         Characters that are graphical plus space
7078                                    characters (but no controls) (Short:
7079                                    \p{Print}) (286_652: [\x20-\x7e\xa0-
7080                                    \xff], U+0100..0377, U+037A..037F,
7081                                    U+0384..038A, U+038C, U+038E..03A1 ...)
7082          \p{XPosixPunct}         \p{Punct} + ASCII-range \p{Symbol} (851:
7083                                    [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
7084                                    \^_`\{\|\}~\xa1\xa7\xab\xb6-\xb7\xbb
7085                                    \xbf], U+037E, U+0387, U+055A..055F,
7086                                    U+0589..058A, U+05BE ...)
7087          \p{XPosixSpace}         \s including beyond ASCII and vertical tab
7088                                    (Short: \p{SpacePerl}) (25: [\t\n\cK\f
7089                                    \r\x20\x85\xa0], U+1680, U+2000..200A,
7090                                    U+2028..2029, U+202F, U+205F ...)
7091          \p{XPosixUpper}         \p{Uppercase=Y} (Short: \p{Upper}; /i=
7092                                    Cased=Yes) (1951)
7093          \p{XPosixWord}          \w, including beyond ASCII; = \p{Alnum} +
7094                                    \pM + \p{Pc} + \p{Join_Control} (Short:
7095                                    \p{Word}) (139_612: [0-9A-Z_a-z\xaa\xb5
7096                                    \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7097                                    U+0100..02C1, U+02C6..02D1,
7098                                    U+02E0..02E4, U+02EC, U+02EE ...)
7099          \p{XPosixXDigit}        \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
7100          \p{Xsux}                \p{Cuneiform} (= \p{Script_Extensions=
7101                                    Cuneiform}) (NOT \p{Block=Cuneiform})
7102                                    (1234)
7103          \p{Yezi}                \p{Yezidi} (= \p{Script_Extensions=
7104                                    Yezidi}) (NOT \p{Block=Yezidi}) (60)
7105          \p{Yezidi}              \p{Script_Extensions=Yezidi} (Short:
7106                                    \p{Yezi}; NOT \p{Block=Yezidi}) (60)
7107          \p{Yi}                  \p{Script_Extensions=Yi} (1246)
7108        X \p{Yi_Radicals}         \p{Block=Yi_Radicals} (64)
7109        X \p{Yi_Syllables}        \p{Block=Yi_Syllables} (1168)
7110          \p{Yiii}                \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
7111        X \p{Yijing}              \p{Yijing_Hexagram_Symbols} (= \p{Block=
7112                                    Yijing_Hexagram_Symbols}) (64)
7113        X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
7114                                    (Short: \p{InYijing}) (64)
7115          \p{Z} \pZ               \p{Separator} (= \p{General_Category=
7116                                    Separator}) (19)
7117          \p{Zanabazar_Square}    \p{Script_Extensions=Zanabazar_Square}
7118                                    (Short: \p{Zanb}; NOT \p{Block=
7119                                    Zanabazar_Square}) (72)
7120          \p{Zanb}                \p{Zanabazar_Square} (=
7121                                    \p{Script_Extensions=Zanabazar_Square})
7122                                    (NOT \p{Block=Zanabazar_Square}) (72)
7123          \p{Zinh}                \p{Inherited} (= \p{Script_Extensions=
7124                                    Inherited}) (586)
7125          \p{Zl}                  \p{Line_Separator} (= \p{General_Category=
7126                                    Line_Separator}) (1)
7127        X \p{Znamenny_Music}      \p{Znamenny_Musical_Notation} (= \p{Block=
7128                                    Znamenny_Musical_Notation}) (208)
7129        X \p{Znamenny_Musical_Notation} \p{Block=Znamenny_Musical_Notation}
7130                                    (Short: \p{InZnamennyMusic}) (208)
7131          \p{Zp}                  \p{Paragraph_Separator} (=
7132                                    \p{General_Category=
7133                                    Paragraph_Separator}) (1)
7134          \p{Zs}                  \p{Space_Separator} (=
7135                                    \p{General_Category=Space_Separator})
7136                                    (17)
7137          \p{Zyyy}                \p{Common} (= \p{Script_Extensions=
7138                                    Common}) (7873)
7139          \p{Zzzz}                \p{Unknown} (= \p{Script_Extensions=
7140                                    Unknown}) (964_861 plus all above-
7141                                    Unicode code points)
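
       A few of the annotations in the table above can be checked directly
       in a regular expression.  What follows is only a minimal sketch, not
       part of the generated table: the variable names are illustrative, and
       the results assume a Perl whose Unicode tables agree with the entries
       shown above.

```perl
use strict;
use warnings;

# U+180B (MONGOLIAN FREE VARIATION SELECTOR ONE) has Variation_Selector=Y,
# but it lies outside the Variation_Selectors block (U+FE00..FE0F).
my $fvs1        = chr(0x180B);
my $is_vs       = $fvs1 =~ /\p{VS}/                        ? 1 : 0;
my $in_vs_block = $fvs1 =~ /\p{Block=Variation_Selectors}/ ? 1 : 0;

# Under /i, \p{Uppercase} matches any Cased code point (the "/i=" note).
my $upper_plain = "a" =~ /\p{Uppercase}/  ? 1 : 0;
my $upper_fold  = "a" =~ /\p{Uppercase}/i ? 1 : 0;

# \p{VertSpace} is equivalent to \v, which includes U+2028 and U+2029.
my $vert = "\x{2028}" =~ /\p{VertSpace}/ ? 1 : 0;

print "VS=$is_vs block=$in_vs_block upper=$upper_plain ",
      "upper/i=$upper_fold vert=$vert\n";
```

       The "NOT \p{Block=...}" notes matter for exactly this reason: the
       single form names the property shown in the second column, never the
       block of the same or a similar name.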
7142
7143   Legal "\p{}" and "\P{}" constructs that match no characters
7144       Unicode has some property-value pairs that currently don't match
7145       anything.  This generally happens either because they are obsolete, or
7146       because they exist for symmetry with other forms but no language that
7147       uses them has yet been encoded.  In this version of Unicode, the
7148       following match zero code points:
7149
7150       \p{Canonical_Combining_Class=Attached_Below_Left}
7151       \p{Canonical_Combining_Class=CCC133}
7152       \p{Grapheme_Cluster_Break=E_Base}
7153       \p{Grapheme_Cluster_Break=E_Base_GAZ}
7154       \p{Grapheme_Cluster_Break=E_Modifier}
7155       \p{Grapheme_Cluster_Break=Glue_After_Zwj}
7156       \p{Word_Break=E_Base}
7157       \p{Word_Break=E_Base_GAZ}
7158       \p{Word_Break=E_Modifier}
7159       \p{Word_Break=Glue_After_Zwj}
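
       One way to confirm such a claim on a particular installation is to
       count matches over the whole Unicode range.  The helper
       "count_matches" below is a hypothetical name introduced for this
       sketch; the counts it prints depend on the Unicode version the
       running Perl was built with, and a property value unknown to that
       version is a fatal error when the pattern is compiled.

```perl
use strict;
use warnings;
no warnings 'utf8';    # the loop deliberately visits surrogates and noncharacters

# Count how many Unicode code points (U+0000..U+10FFFF) a pattern matches.
sub count_matches {
    my ($re) = @_;
    my $count = 0;
    for my $cp (0 .. 0x10FFFF) {
        $count++ if chr($cp) =~ $re;
    }
    return $count;
}

# Two of the constructs listed above, plus a control that the table says
# matches exactly one code point (U+000D).
for my $prop ('Word_Break=E_Base', 'Word_Break=Glue_After_Zwj', 'Word_Break=CR') {
    my $pattern = "\\p{$prop}";          # e.g. \p{Word_Break=CR}
    printf "%-30s %d\n", $pattern, count_matches(qr/$pattern/);
}
```

       The brute-force loop is slow but makes no assumptions about how the
       property is stored; it visits only Unicode code points, so the
       above-Unicode range is not counted.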
7160

Properties accessible through Unicode::UCD

7162       The value of any Unicode (not including Perl extensions) character
7163       property mentioned above for any single code point is available through
7164       "charprop()" in Unicode::UCD.  "charprops_all()" in Unicode::UCD
7165       returns the values of all the Unicode properties for a given code
7166       point.
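
       As a minimal sketch of these two functions (assuming a Unicode::UCD
       recent enough to export them, which is an assumption about the
       installation), the following looks up one property and then all
       properties for U+0041.  The exact spelling of the returned values is
       whatever Unicode::UCD chooses; no particular output is promised here.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charprop charprops_all);

# Value of one property for one code point (U+0041, LATIN CAPITAL LETTER A).
my $gc = charprop(0x41, "General_Category");

# Hash reference holding the values of all Unicode properties for U+0041.
my $all = charprops_all(0x41);

print "General_Category of U+0041: $gc\n";

# Show the first few properties in alphabetical order.
for my $name ((sort keys %$all)[0 .. 4]) {
    printf "%-30s %s\n", $name, $all->{$name} // '';
}
```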
7167
7168       Besides these, all the Unicode character properties mentioned above
7169       (except for those marked as for internal use by Perl) are also
7170       accessible by "prop_invlist()" in Unicode::UCD.
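
       "prop_invlist()" returns an inversion list: elements at even indexes
       start a range of code points that is in the set, and elements at odd
       indexes start a range that is not.  The sketch below counts the
       members of such a list; "invlist_count" is a hypothetical helper
       written for this illustration, and capping an open-ended final range
       at U+10FFFF is a choice made here, not something the module imposes.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Count the Unicode code points described by an inversion list.
sub invlist_count {
    my @invlist = @_;
    my $count = 0;
    for (my $i = 0; $i < @invlist; $i += 2) {
        # A final unpaired element means the range is open-ended; cap it
        # just past the last Unicode code point.
        my $end = $i + 1 < @invlist ? $invlist[$i + 1] : 0x110000;
        $count += $end - $invlist[$i];
    }
    return $count;
}

my @vs = prop_invlist("Variation_Selector");
printf "Variation_Selector: %d code points in %d list elements\n",
       invlist_count(@vs), scalar @vs;
```

       The count printed for Variation_Selector should agree with the number
       in parentheses in the table above when the running Perl uses the same
       Unicode version as this document.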
7171
7172       Due to their nature, not all Unicode character properties are suitable
7173       for regular expression matches or for prop_invlist().  The remaining
7174       non-provisional, non-internal ones are accessible via "prop_invmap()"
7175       in Unicode::UCD (except for those that this Perl installation hasn't
7176       included; see below for which those are).
7177
7178       For compatibility with other parts of Perl, all the single forms given
7179       in the table in the section above are recognized.  However, some Perl
7180       extensions are ambiguous with Unicode properties, and every such
7181       ambiguity is silently resolved in favor of the official Unicode
7182       property.  To avoid surprises, you should only use prop_invmap() for
7183       forms listed in the table below, which omits the non-recommended ones.
7184       The affected forms are the Perl single-form equivalents of Unicode
7185       properties.  For example, "\p{sc}" is a single-form equivalent of
7186       "\p{gc=sc}", but prop_invmap() treats "sc" as the "Script" property,
7187       whose short name is "sc".  The table indicates the current
7188       ambiguities in the INFO column, beginning with the word "NOT".
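
       The "sc" ambiguity can be observed with a short sketch.  The helper
       "invmap_lookup" is hypothetical and uses a simple linear scan of the
       inversion map for clarity; the spelling of the Script value that
       prop_invmap() returns is not assumed here.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invmap);

# Return the map entry covering a code point: the entry at index $i applies
# from $list->[$i] up to (but not including) $list->[$i + 1].
sub invmap_lookup {
    my ($prop, $cp) = @_;
    my ($list, $map, $format, $default) = prop_invmap($prop);
    return unless defined $list;    # unknown or unavailable property
    my $i = 0;
    $i++ while $i + 1 < @$list && $list->[$i + 1] <= $cp;
    return $map->[$i];
}

# In a regular expression, \p{Sc} means General_Category=Currency_Symbol ...
my $dollar_is_sc = '$' =~ /\p{Sc}/ ? 1 : 0;

# ... but prop_invmap() resolves "sc" to the Script property.
my $via_short = invmap_lookup("sc",     0x41);
my $via_full  = invmap_lookup("Script", 0x41);

print "\\p{Sc} matches '\$': $dollar_is_sc\n";
print "prop_invmap('sc') at U+0041:     $via_short\n";
print "prop_invmap('Script') at U+0041: $via_full\n";
```

       Spelling out the full property name, as the table below recommends,
       avoids depending on how such ambiguities are resolved.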
7189
7190       The standard Unicode properties listed below are documented in
7191       <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
7192       documented in "prop_invmap()" in Unicode::UCD.  The other Perl
7193       extensions are described in "Other Properties" in perlunicode.
7194
7195       The first column in the table is a name for the property; the second
7196       column is an alternative name, if any, plus possibly some annotations.
7197       The alternative name is the property's full name, unless that would
7198       simply repeat the first column, in which case the second column
7199       indicates the property's short name (if different).  The annotations
7200       are given only in the entry for the full name.  The annotations for
7201       binary properties include a list of the first few ranges that the
7202       property matches.  To avoid any ambiguity, the SPACE character is
7203       represented as "\x20".
7204
7205       If a property is obsolete, etc., the entry will be flagged with the same
7206       characters used in the table in the section above, such as D or S.
7207
7208          NAME                      INFO
7209
7210          Age
7211          AHex                    ASCII_Hex_Digit
7212          All                     (Perl extension).  All code points,
7213                                  including those above Unicode.  Same as
7214                                  qr/./s.  U+0000..infinity
7215          Alnum                   XPosixAlnum.  (Perl extension)
7216          Alpha                   Alphabetic
7217          Alphabetic              (Short: Alpha).  [A-Za-z\xaa\xb5\xba\xc0-
7218                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7219                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7220                                  ...
7221          Any                     (Perl extension).  All Unicode code
7222                                  points.  U+0000..10FFFF
7223          ASCII                   Block=Basic_Latin.  (Perl extension).
7224                                  [\x00-\x7f]
7225          ASCII_Hex_Digit         (Short: AHex).  [0-9A-Fa-f]
7226          Assigned                (Perl extension).  All assigned code
7227                                  points.  U+0000..0377, U+037A..037F,
7228                                  U+0384..038A, U+038C, U+038E..03A1,
7229                                  U+03A3..052F ...
7230          Bc                      Bidi_Class
7231          Bidi_C                  Bidi_Control
7232          Bidi_Class              (Short: bc)
7233          Bidi_Control            (Short: Bidi_C).  U+061C, U+200E..200F,
7234                                  U+202A..202E, U+2066..2069
7235          Bidi_M                  Bidi_Mirrored
7236          Bidi_Mirrored           (Short: Bidi_M).  [\(\)<>\[\]\{\}\xab
7237                                  \xbb], U+0F3A..0F3D, U+169B..169C,
7238                                  U+2039..203A, U+2045..2046, U+207D..207E
7239                                  ...
7240          Bidi_Mirroring_Glyph    (Short: bmg)
7241          Bidi_Paired_Bracket     (Short: bpb)
7242          Bidi_Paired_Bracket_Type (Short: bpt)
7243          Blank                   XPosixBlank.  (Perl extension)
7244          Blk                     Block
7245          Block                   (Short: blk)
7246          Bmg                     Bidi_Mirroring_Glyph
7247          Bpb                     Bidi_Paired_Bracket
7248          Bpt                     Bidi_Paired_Bracket_Type
7249          Canonical_Combining_Class (Short: ccc)
7250          Case_Folding            (Short: cf)
7251          Case_Ignorable          (Short: CI).  [\'.:\^`\xa8\xad\xaf\xb4
7252                                  \xb7-\xb8], U+02B0..036F, U+0374..0375,
7253                                  U+037A, U+0384..0385, U+0387 ...
7254          Cased                   [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7255                                  \xff], U+0100..01BA, U+01BC..01BF,
7256                                  U+01C4..0293, U+0295..02B8, U+02C0..02C1
7257                                  ...
7258          Category                General_Category
7259          Ccc                     Canonical_Combining_Class
7260          CE                      Composition_Exclusion
7261          Cf                      Case_Folding; NOT 'cf' meaning
7262                                  'General_Category=Format'
7263          Changes_When_Casefolded (Short: CWCF).  [A-Z\xb5\xc0-\xd6\xd8-
7264                                  \xdf], U+0100, U+0102, U+0104, U+0106,
7265                                  U+0108 ...
7266          Changes_When_Casemapped (Short: CWCM).  [A-Za-z\xb5\xc0-\xd6\xd8-
7267                                  \xf6\xf8-\xff], U+0100..0137,
7268                                  U+0139..018C, U+018E..019A, U+019C..01A9,
7269                                  U+01AC..01B9 ...
7270          Changes_When_Lowercased (Short: CWL).  [A-Z\xc0-\xd6\xd8-\xde],
7271                                  U+0100, U+0102, U+0104, U+0106, U+0108 ...
7272          Changes_When_NFKC_Casefolded (Short: CWKCF).  [A-Z\xa0\xa8\xaa
7273                                  \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
7274                                  \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
7275                                  U+0106, U+0108 ...
7276          Changes_When_Titlecased (Short: CWT).  [a-z\xb5\xdf-\xf6\xf8-
7277                                  \xff], U+0101, U+0103, U+0105, U+0107,
7278                                  U+0109 ...
7279          Changes_When_Uppercased (Short: CWU).  [a-z\xb5\xdf-\xf6\xf8-
7280                                  \xff], U+0101, U+0103, U+0105, U+0107,
7281                                  U+0109 ...
7282          CI                      Case_Ignorable
7283          Cntrl                   XPosixCntrl (=General_Category=Control).
7284                                  (Perl extension)
7285          Comp_Ex                 Full_Composition_Exclusion
7286          Composition_Exclusion   (Short: CE).  U+0958..095F, U+09DC..09DD,
7287                                  U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
7288          CWCF                    Changes_When_Casefolded
7289          CWCM                    Changes_When_Casemapped
7290          CWKCF                   Changes_When_NFKC_Casefolded
7291          CWL                     Changes_When_Lowercased
7292          CWT                     Changes_When_Titlecased
7293          CWU                     Changes_When_Uppercased
7294          Dash                    [\-], U+058A, U+05BE, U+1400, U+1806,
7295                                  U+2010..2015 ...
7296          Decomposition_Mapping   (Short: dm)
7297          Decomposition_Type      (Short: dt)
7298          Default_Ignorable_Code_Point (Short: DI).  [\xad], U+034F, U+061C,
7299                                  U+115F..1160, U+17B4..17B5, U+180B..180F
7300                                  ...
7301          Dep                     Deprecated
7302          Deprecated              (Short: Dep).  U+0149, U+0673, U+0F77,
7303                                  U+0F79, U+17A3..17A4, U+206A..206F ...
7304          DI                      Default_Ignorable_Code_Point
7305          Dia                     Diacritic
7306          Diacritic               (Short: Dia).  [\^`\xa8\xaf\xb4\xb7-\xb8],
7307                                  U+02B0..034E, U+0350..0357, U+035D..0362,
7308                                  U+0374..0375, U+037A ...
7309          Digit                   XPosixDigit (=General_Category=
7310                                  Decimal_Number).  (Perl extension)
7311          Dm                      Decomposition_Mapping
7312          Dt                      Decomposition_Type
7313          Ea                      East_Asian_Width
7314          East_Asian_Width        (Short: ea)
7315          EBase                   Emoji_Modifier_Base
7316          EComp                   Emoji_Component
7317          EMod                    Emoji_Modifier
7318          Emoji                   [#*0-9\xa9\xae], U+203C, U+2049, U+2122,
7319                                  U+2139, U+2194..2199 ...
7320          Emoji_Component         (Short: EComp).  [#*0-9], U+200D, U+20E3,
7321                                  U+FE0F, U+1F1E6..1F1FF, U+1F3FB..1F3FF ...
7322          Emoji_Modifier          (Short: EMod).  U+1F3FB..1F3FF
7323          Emoji_Modifier_Base     (Short: EBase).  U+261D, U+26F9,
7324                                  U+270A..270D, U+1F385, U+1F3C2..1F3C4,
7325                                  U+1F3C7 ...
7326          Emoji_Presentation      (Short: EPres).  U+231A..231B,
7327                                  U+23E9..23EC, U+23F0, U+23F3,
7328                                  U+25FD..25FE, U+2614..2615 ...
7329          EPres                   Emoji_Presentation
7330          EqUIdeo                 Equivalent_Unified_Ideograph
7331          Equivalent_Unified_Ideograph (Short: EqUIdeo)
7332          Ext                     Extender
7333          Extended_Pictographic   (Short: ExtPict).  [\xa9\xae], U+203C,
7334                                  U+2049, U+2122, U+2139, U+2194..2199 ...
7335          Extender                (Short: Ext).  [\xb7], U+02D0..02D1,
7336                                  U+0640, U+07FA, U+0B55, U+0E46 ...
7337          ExtPict                 Extended_Pictographic
7338          Full_Composition_Exclusion (Short: Comp_Ex).  U+0340..0341,
7339                                  U+0343..0344, U+0374, U+037E, U+0387,
7340                                  U+0958..095F ...
7341          Gc                      General_Category
7342          GCB                     Grapheme_Cluster_Break
7343          General_Category        (Short: gc)
7344          Gr_Base                 Grapheme_Base
7345          Gr_Ext                  Grapheme_Extend
7346          Graph                   XPosixGraph.  (Perl extension)
7347          Grapheme_Base           (Short: Gr_Base).  [\x20-\x7e\xa0-\xac
7348                                  \xae-\xff], U+0100..02FF, U+0370..0377,
7349                                  U+037A..037F, U+0384..038A, U+038C ...
7350          Grapheme_Cluster_Break  (Short: GCB)
7351          Grapheme_Extend         (Short: Gr_Ext).  U+0300..036F,
7352                                  U+0483..0489, U+0591..05BD, U+05BF,
7353                                  U+05C1..05C2, U+05C4..05C5 ...
7354          Hangul_Syllable_Type    (Short: hst)
7355          Hex                     Hex_Digit
7356          Hex_Digit               (Short: Hex).  [0-9A-Fa-f], U+FF10..FF19,
7357                                  U+FF21..FF26, U+FF41..FF46
7358          HorizSpace              XPosixBlank.  (Perl extension)
7359          Hst                     Hangul_Syllable_Type
7360        D Hyphen                  [\-\xad], U+058A, U+1806, U+2010..2011,
7361                                  U+2E17, U+30FB ...  Supplanted by
7362                                  Line_Break property values; see
7363                                  www.unicode.org/reports/tr14
7364          ID_Continue             (Short: IDC).  [0-9A-Z_a-z\xaa\xb5\xb7
7365                                  \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7366                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7367                                  U+02EC, U+02EE ...
7368          ID_Start                (Short: IDS).  [A-Za-z\xaa\xb5\xba\xc0-
7369                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7370                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7371                                  ...
7372          IDC                     ID_Continue
7373          Identifier_Status
7374          Identifier_Type
7375          Ideo                    Ideographic
7376          Ideographic             (Short: Ideo).  U+3006..3007,
7377                                  U+3021..3029, U+3038..303A, U+3400..4DBF,
7378                                  U+4E00..9FFF, U+F900..FA6D ...
7379          IDS                     ID_Start
7380          IDS_Binary_Operator     (Short: IDSB).  U+2FF0..2FF1, U+2FF4..2FFB
7381          IDS_Trinary_Operator    (Short: IDST).  U+2FF2..2FF3
7382          IDSB                    IDS_Binary_Operator
7383          IDST                    IDS_Trinary_Operator
7384          In                      Present_In.  (Perl extension)
7385          Indic_Positional_Category (Short: InPC)
7386          Indic_Syllabic_Category (Short: InSC)
7387          InPC                    Indic_Positional_Category
7388          InSC                    Indic_Syllabic_Category
7389          Isc                     ISO_Comment; NOT 'isc' meaning
7390                                  'General_Category=Other'
7391          ISO_Comment             (Short: isc)
7392          Jg                      Joining_Group
7393          Join_C                  Join_Control
7394          Join_Control            (Short: Join_C).  U+200C..200D
7395          Joining_Group           (Short: jg)
7396          Joining_Type            (Short: jt)
7397          Jt                      Joining_Type
7398          Lb                      Line_Break
7399          Lc                      Lowercase_Mapping; NOT 'lc' meaning
7400                                  'General_Category=Cased_Letter'
7401          Line_Break              (Short: lb)
7402          LOE                     Logical_Order_Exception
7403          Logical_Order_Exception (Short: LOE).  U+0E40..0E44, U+0EC0..0EC4,
7404                                  U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
7405                                  ...
7406          Lower                   Lowercase
7407          Lowercase               (Short: Lower).  [a-z\xaa\xb5\xba\xdf-
7408                                  \xf6\xf8-\xff], U+0101, U+0103, U+0105,
7409                                  U+0107, U+0109 ...
7410          Lowercase_Mapping       (Short: lc)
7411          Math                    [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
7412                                  U+03D5, U+03F0..03F1, U+03F4..03F6,
7413                                  U+0606..0608 ...
7414          Na                      Name
7415          Na1                     Unicode_1_Name
7416          Name                    (Short: na)
7417          Name_Alias
7418          NChar                   Noncharacter_Code_Point
7419          NFC_QC                  NFC_Quick_Check
7420          NFC_Quick_Check         (Short: NFC_QC)
7421          NFD_QC                  NFD_Quick_Check
7422          NFD_Quick_Check         (Short: NFD_QC)
7423          NFKC_Casefold           (Short: NFKC_CF)
7424          NFKC_CF                 NFKC_Casefold
7425          NFKC_QC                 NFKC_Quick_Check
7426          NFKC_Quick_Check        (Short: NFKC_QC)
7427          NFKD_QC                 NFKD_Quick_Check
7428          NFKD_Quick_Check        (Short: NFKD_QC)
7429          Noncharacter_Code_Point (Short: NChar).  U+FDD0..FDEF,
7430                                  U+FFFE..FFFF, U+1FFFE..1FFFF,
7431                                  U+2FFFE..2FFFF, U+3FFFE..3FFFF,
7432                                  U+4FFFE..4FFFF ...
7433          Nt                      Numeric_Type
7434          Numeric_Type            (Short: nt)
7435          Numeric_Value           (Short: nv)
7436          Nv                      Numeric_Value
7437          Pat_Syn                 Pattern_Syntax
7438          Pat_WS                  Pattern_White_Space
7439          Pattern_Syntax          (Short: Pat_Syn).  [!\"#\$\%&\'\(\)*+,\-.
7440                                  \/:;<=>?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9
7441                                  \xab-\xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7
7442                                  \xf7], U+2010..2027, U+2030..203E,
7443                                  U+2041..2053, U+2055..205E, U+2190..245F
7444                                  ...
7445          Pattern_White_Space     (Short: Pat_WS).  [\t\n\cK\f\r\x20\x85],
7446                                  U+200E..200F, U+2028..2029
7447          PCM                     Prepended_Concatenation_Mark
7448          Perl_Decimal_Digit      (Perl extension)
7449          PerlSpace               PosixSpace.  (Perl extension)
7450          PerlWord                PosixWord.  (Perl extension)
7451          PosixAlnum              (Perl extension).  [0-9A-Za-z]
7452          PosixAlpha              (Perl extension).  [A-Za-z]
7453          PosixBlank              (Perl extension).  [\t\x20]
7454          PosixCntrl              (Perl extension).  ASCII control
7455                                  characters.  ACK, BEL, BS, CAN, CR, DC1,
7456                                  DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
7457                                  ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
7458                                  NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
7459          PosixDigit              (Perl extension).  [0-9]
7460          PosixGraph              (Perl extension).  [!\"#\$\%&\'\(\)*+,\-.
7461                                  \/0-9:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~]
7462          PosixLower              (Perl extension).  [a-z]
7463          PosixPrint              (Perl extension).  [\x20-\x7e]
7464          PosixPunct              (Perl extension).  [!\"#\$\%&\'\(\)*+,\-.
7465                                  \/:;<=>?\@\[\\\]\^_`\{\|\}~]
7466          PosixSpace              (Perl extension).  [\t\n\cK\f\r\x20]
7467          PosixUpper              (Perl extension).  [A-Z]
7468          PosixWord               (Perl extension).  \w, restricted to
7469                                  ASCII.  [0-9A-Z_a-z]
7470          PosixXDigit             ASCII_Hex_Digit.  (Perl extension).
7471                                  [0-9A-Fa-f]
7472          Prepended_Concatenation_Mark (Short: PCM).  U+0600..0605, U+06DD,
7473                                  U+070F, U+0890..0891, U+08E2, U+110BD ...
7474          Present_In              (Short: In).  (Perl extension)
7475          Print                   XPosixPrint.  (Perl extension)
7476          Punct                   General_Category=Punctuation.  (Perl
7477                                  extension).  [!\"#\%&\'\(\)*,\-.\/:;?\@
7478                                  \[\\\]_\{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
7479                                  U+037E, U+0387, U+055A..055F,
7480                                  U+0589..058A, U+05BE ...
7481          QMark                   Quotation_Mark
7482          Quotation_Mark          (Short: QMark).  [\"\'\xab\xbb],
7483                                  U+2018..201F, U+2039..203A, U+2E42,
7484                                  U+300C..300F, U+301D..301F ...
7485          Radical                 U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
7486          Regional_Indicator      (Short: RI).  U+1F1E6..1F1FF
7487          RI                      Regional_Indicator
7488          SB                      Sentence_Break
7489          Sc                      Script; NOT 'sc' meaning
7490                                  'General_Category=Currency_Symbol'
7491          Scf                     Simple_Case_Folding
7492          Script                  (Short: sc)
7493          Script_Extensions       (Short: scx)
7494          Scx                     Script_Extensions
7495          SD                      Soft_Dotted
7496          Sentence_Break          (Short: SB)
7497          Sentence_Terminal       (Short: STerm).  [!.?], U+0589,
7498                                  U+061D..061F, U+06D4, U+0700..0702, U+07F9
7499                                  ...
7500          Sfc                     Simple_Case_Folding
7501          Simple_Case_Folding     (Short: scf)
7502          Simple_Lowercase_Mapping (Short: slc)
7503          Simple_Titlecase_Mapping (Short: stc)
7504          Simple_Uppercase_Mapping (Short: suc)
7505          Slc                     Simple_Lowercase_Mapping
7506          Soft_Dotted             (Short: SD).  [i-j], U+012F, U+0249,
7507                                  U+0268, U+029D, U+02B2 ...
7508          Space                   White_Space
7509          SpacePerl               XPosixSpace.  (Perl extension)
7510          Stc                     Simple_Titlecase_Mapping
7511          STerm                   Sentence_Terminal
7512          Suc                     Simple_Uppercase_Mapping
7513          Tc                      Titlecase_Mapping
7514          Term                    Terminal_Punctuation
7515          Terminal_Punctuation    (Short: Term).  [!,.:;?], U+037E, U+0387,
7516                                  U+0589, U+05C3, U+060C ...
7517          Title                   Titlecase.  (Perl extension)
7518          Titlecase               (Short: Title).  (Perl extension).  (=
7519                                  \p{Gc=Lt}).  U+01C5, U+01C8, U+01CB,
7520                                  U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
7521          Titlecase_Mapping       (Short: tc)
7522          Uc                      Uppercase_Mapping
7523          UIdeo                   Unified_Ideograph
7524          Unicode                 Any.  (Perl extension)
7525          Unicode_1_Name          (Short: na1)
7526          Unified_Ideograph       (Short: UIdeo).  U+3400..4DBF,
7527                                  U+4E00..9FFF, U+FA0E..FA0F, U+FA11,
7528                                  U+FA13..FA14, U+FA1F ...
7529          Upper                   Uppercase
7530          Uppercase               (Short: Upper).  [A-Z\xc0-\xd6\xd8-\xde],
7531                                  U+0100, U+0102, U+0104, U+0106, U+0108 ...
7532          Uppercase_Mapping       (Short: uc)
7533          Variation_Selector      (Short: VS).  U+180B..180D, U+180F,
7534                                  U+FE00..FE0F, U+E0100..E01EF
7535          Vertical_Orientation    (Short: vo)
7536          VertSpace               (Perl extension).  \v.  [\n\cK\f\r\x85],
7537                                  U+2028..2029
7538          Vo                      Vertical_Orientation
7539          VS                      Variation_Selector
7540          WB                      Word_Break
7541          White_Space             (Short: WSpace).  [\t\n\cK\f\r\x20\x85
7542                                  \xa0], U+1680, U+2000..200A, U+2028..2029,
7543                                  U+202F, U+205F ...
7544          Word                    XPosixWord.  (Perl extension)
7545          Word_Break              (Short: WB)
7546          WSpace                  White_Space
7547          XDigit                  XPosixXDigit (=Hex_Digit).  (Perl
7548                                  extension)
7549          XID_Continue            (Short: XIDC).  [0-9A-Z_a-z\xaa\xb5\xb7
7550                                  \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7551                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7552                                  U+02EC, U+02EE ...
7553          XID_Start               (Short: XIDS).  [A-Za-z\xaa\xb5\xba\xc0-
7554                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7555                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7556                                  ...
7557          XIDC                    XID_Continue
7558          XIDS                    XID_Start
7559          XPerlSpace              XPosixSpace.  (Perl extension)
7560          XPosixAlnum             (Short: Alnum).  (Perl extension).
7561                                  Alphabetic and (decimal) Numeric.  [0-9A-
7562                                  Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7563                                  \xff], U+0100..02C1, U+02C6..02D1,
7564                                  U+02E0..02E4, U+02EC, U+02EE ...
7565          XPosixAlpha             Alphabetic.  (Perl extension).  [A-Za-z
7566                                  \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7567                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7568                                  U+02EC, U+02EE ...
7569          XPosixBlank             (Short: Blank).  (Perl extension).  \h,
7570                                  Horizontal white space.  [\t\x20\xa0],
7571                                  U+1680, U+2000..200A, U+202F, U+205F,
7572                                  U+3000
7573          XPosixCntrl             General_Category=Control  (Short: Cntrl).
7574                                  (Perl extension).  Control characters.
7575                                  [\x00-\x1f\x7f-\x9f]
7576          XPosixDigit             General_Category=Decimal_Number  (Short:
7577                                  Digit).  (Perl extension).  [0-9] + all
7578                                  other decimal digits.  [0-9],
7579                                  U+0660..0669, U+06F0..06F9, U+07C0..07C9,
7580                                  U+0966..096F, U+09E6..09EF ...
7581          XPosixGraph             (Short: Graph).  (Perl extension).
7582                                  Characters that are graphical.  [!\"#\$
7583                                  \%&\'\(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]
7584                                  \^_`a-z\{\|\}~\xa1-\xff], U+0100..0377,
7585                                  U+037A..037F, U+0384..038A, U+038C,
7586                                  U+038E..03A1 ...
7587          XPosixLower             Lowercase.  (Perl extension).  [a-z\xaa
7588                                  \xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
7589                                  U+0103, U+0105, U+0107, U+0109 ...
7590          XPosixPrint             (Short: Print).  (Perl extension).
7591                                  Characters that are graphical plus space
7592                                  characters (but no controls).  [\x20-\x7e
7593                                  \xa0-\xff], U+0100..0377, U+037A..037F,
7594                                  U+0384..038A, U+038C, U+038E..03A1 ...
7595          XPosixPunct             (Perl extension).  \p{Punct} + ASCII-range
7596                                  \p{Symbol}.  [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7597                                  >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
7598                                  \xb7\xbb\xbf], U+037E, U+0387,
7599                                  U+055A..055F, U+0589..058A, U+05BE ...
7600          XPosixSpace             (Perl extension).  \s including beyond
7601                                  ASCII and vertical tab.  [\t\n\cK\f\r\x20
7602                                  \x85\xa0], U+1680, U+2000..200A,
7603                                  U+2028..2029, U+202F, U+205F ...
7604          XPosixUpper             Uppercase.  (Perl extension).  [A-Z\xc0-
7605                                  \xd6\xd8-\xde], U+0100, U+0102, U+0104,
7606                                  U+0106, U+0108 ...
7607          XPosixWord              (Short: Word).  (Perl extension).  \w,
7608                                  including beyond ASCII; = \p{Alnum} + \pM
7609                                  + \p{Pc} + \p{Join_Control}.  [0-9A-Z_a-z
7610                                  \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7611                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7612                                  U+02EC, U+02EE ...
7613          XPosixXDigit            Hex_Digit  (Short: XDigit).  (Perl
7614                                  extension).  [0-9A-Fa-f], U+FF10..FF19,
7615                                  U+FF21..FF26, U+FF41..FF46
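
       As an illustrative sketch (not part of the generated table), the
       names listed above can be used directly in "\p{}" and "\P{}"
       matches.  The sample code points are taken from the Hex_Digit,
       PosixXDigit, and Lowercase entries; the variable names are
       hypothetical.

```perl
use strict;
use warnings;

# U+FF21 FULLWIDTH LATIN CAPITAL LETTER A is listed under Hex_Digit
# (U+FF21..FF26) but not under PosixXDigit ([0-9A-Fa-f]).
my $fullwidth_a = "\x{FF21}";

my $is_hex_long  = $fullwidth_a =~ /\p{Hex_Digit}/    ? 1 : 0;  # full name
my $is_hex_short = $fullwidth_a =~ /\p{Hex}/          ? 1 : 0;  # short name
my $is_xdigit    = $fullwidth_a =~ /\p{XPosixXDigit}/ ? 1 : 0;  # Perl synonym
my $is_posix     = $fullwidth_a =~ /\p{PosixXDigit}/  ? 1 : 0;  # ASCII only

# U+0101 is listed under Lowercase; \P{...} is the complement of \p{...}.
my $is_lower  = "\x{0101}" =~ /\p{Lower}/ ? 1 : 0;
my $not_upper = "\x{0101}" =~ /\P{Upper}/ ? 1 : 0;

printf "Hex_Digit=%d Hex=%d XPosixXDigit=%d PosixXDigit=%d\n",
    $is_hex_long, $is_hex_short, $is_xdigit, $is_posix;
printf "Lower=%d not-Upper=%d\n", $is_lower, $not_upper;
```

       The full name, the short name, and the Perl synonym select the
       same set of code points, so the choice between them is a matter of
       readability.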
7616

Properties accessible through other means

7618       Certain properties are also accessible via core function calls.  These
7619       are:
7620
7621        Lowercase_Mapping          lc() and lcfirst()
7622        Titlecase_Mapping          ucfirst()
7623        Uppercase_Mapping          uc()
7624
7625       Also, Case_Folding is accessible through the "/i" modifier in regular
7626       expressions, the "\F" double-quotish escape, and the "fc" operator.
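
       A minimal sketch of these core interfaces follows.  It assumes a
       perl recent enough to provide the "fc" feature (5.16 or later);
       the sample code points and variable names are chosen only for
       illustration.

```perl
use strict;
use warnings;
use feature qw(fc unicode_strings);   # fc() and "\F" need the 'fc' feature

# U+01C4, U+01C5, and U+01C6 are the upper-, title-, and lowercase forms
# of the DZ WITH CARON digraph, so they exercise all three mappings.
my $lower = lc      "\x{01C4}";   # Lowercase_Mapping
my $title = ucfirst "\x{01C6}";   # Titlecase_Mapping
my $upper = uc      "\x{01C6}";   # Uppercase_Mapping

# Case_Folding: full folding maps U+00DF (sharp s) to "ss".
my $folded       = fc "Stra\x{DF}e";
my $interpolated = "\FStra\x{DF}e\E";
my $matches      = "\x{01C4}" =~ /\x{01C6}/i ? 1 : 0;

printf "lc=U+%04X ucfirst=U+%04X uc=U+%04X\n",
    ord $lower, ord $title, ord $upper;
print "fc=$folded \\F=$interpolated /i=$matches\n";
```

       Note that ucfirst() yields the titlecase form U+01C5, which
       differs from the uppercase form U+01C4 returned by uc().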
7627
7628       Besides being able to say "\p{Name=...}", the Name and Name_Alias
7629       properties are accessible through the "\N{}" interpolation in double-
7630       quoted strings and regular expressions, and through the functions
7631       charnames::viacode(), charnames::vianame(), and
7632       charnames::string_vianame() (which require a "use charnames ();" to be
7633       specified).
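
       The name lookups can be sketched as follows.  The character WHITE
       SMILING FACE (U+263A) and the variable names are illustrative
       choices.  The sketch loads charnames with ":full", which also
       covers "\N{}"; a bare "use charnames ();" is enough when only the
       functions are needed.

```perl
use strict;
use warnings;
use charnames ':full';   # loads the charnames:: functions and enables \N{}

my $name   = charnames::viacode(0x263A);                     # code point to name
my $code   = charnames::vianame("WHITE SMILING FACE");       # name to code point
my $string = charnames::string_vianame("WHITE SMILING FACE"); # name to string
my $interp = "\N{WHITE SMILING FACE}";                       # compile-time lookup

print "viacode: $name\n";
printf "vianame: U+%04X\n", $code;
printf "string_vianame: U+%04X\n", ord $string;
printf "\\N{}: U+%04X\n", ord $interp;
```

       vianame() returns a number, while string_vianame() and "\N{}"
       return the character itself.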
7634
7635       Finally, most properties related to decomposition are accessible via
7636       Unicode::Normalize.
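
       As a brief sketch of that module, the example below decomposes and
       recomposes U+00E9 (LATIN SMALL LETTER E WITH ACUTE) and looks up a
       combining class.  The sample character and variable names are
       illustrative.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD NFC);

my $composed   = "\x{E9}";              # LATIN SMALL LETTER E WITH ACUTE
my $decomposed = NFD($composed);        # canonical decomposition
my $recomposed = NFC($decomposed);      # canonical composition

# List the code points that make up the decomposed form.
my @code_points = map { sprintf "U+%04X", ord } split //, $decomposed;

# Canonical_Combining_Class of U+0301 COMBINING ACUTE ACCENT.
my $ccc = Unicode::Normalize::getCombinClass(0x0301);

print "NFD: @code_points\n";
printf "NFC: U+%04X\n", ord $recomposed;
print "ccc(U+0301): $ccc\n";
```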
7637

Unicode character properties that are NOT accepted by Perl

7639       Perl will generate an error for a few character properties in Unicode
7640       when used in a regular expression.  The non-Unihan ones are listed
7641       below, with the reasons they are not accepted, perhaps with work-
7642       arounds.  The short names for the properties are listed enclosed in
7643       (parentheses).  As described after the list, an installation can change
7644       the defaults and choose to accept any of these.  The list is machine
7645       generated based on the choices made for the installation that generated
7646       this document.
7647
7648       Expands_On_NFC (XO_NFC)
7649       Expands_On_NFD (XO_NFD)
7650       Expands_On_NFKC (XO_NFKC)
7651       Expands_On_NFKD (XO_NFKD)
7652           Deprecated by Unicode.  These are characters that expand to more
7653           than one character in the specified normalization form, but whether
7654           they actually take up more bytes or not depends on the encoding
7655           being used.  For example, a UTF-8 encoded character may expand to a
7656           different number of bytes than a UTF-32 encoded character.
7657
7658       Grapheme_Link (Gr_Link)
7659           Duplicates ccc=vr (Canonical_Combining_Class=Virama)
7660
7661       Jamo_Short_Name (JSN)
7662       Other_Alphabetic (OAlpha)
7663       Other_Default_Ignorable_Code_Point (ODI)
7664       Other_Grapheme_Extend (OGr_Ext)
7665       Other_ID_Continue (OIDC)
7666       Other_ID_Start (OIDS)
7667       Other_Lowercase (OLower)
7668       Other_Math (OMath)
7669       Other_Uppercase (OUpper)
7670           Used by Unicode internally for generating other properties and not
7671           intended to be used stand-alone
7672
7673       Script=Katakana_Or_Hiragana (sc=Hrkt)
7674           Obsolete.  All code points previously matched by this have been
7675           moved to "Script=Common".  Consider instead using
7676           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
7677           both)
7678
7679       Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
7680           All code points that would be matched by this are matched by either
7681           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
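
       The work-arounds named in the list above can be sketched as
       follows.  The pattern names and sample code points are
       hypothetical choices made for illustration.

```perl
use strict;
use warnings;

# Stand-in for Script_Extensions=Katakana_Or_Hiragana: a bracketed
# class that is the union of the two accepted properties.
my $kana = qr/[\p{Script_Extensions=Katakana}\p{Script_Extensions=Hiragana}]/;

# Stand-in for Grapheme_Link, which the list describes as a duplicate of
# Canonical_Combining_Class=Virama.
my $virama = qr/\p{Canonical_Combining_Class=Virama}/;

my %kana_result = map { sprintf("U+%04X", $_) => (chr($_) =~ $kana ? 1 : 0) }
    0x3042,    # HIRAGANA LETTER A
    0x30A2,    # KATAKANA LETTER A
    0x30FC,    # KATAKANA-HIRAGANA PROLONGED SOUND MARK
    0x0041;    # LATIN CAPITAL LETTER A

my $virama_hit  = "\x{094D}" =~ $virama ? 1 : 0;   # DEVANAGARI SIGN VIRAMA
my $virama_miss = "a"        =~ $virama ? 1 : 0;

print "$_ kana=$kana_result{$_}\n" for sort keys %kana_result;
print "virama: U+094D=$virama_hit a=$virama_miss\n";
```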
7682
7683       An installation can choose to allow any of these to be matched by
7684       downloading the Unicode database from <http://www.unicode.org/Public/>
7685       to $Config{privlib}/unicore/ in the Perl source tree, changing the
7686       controlling lists contained in the program
7687       $Config{privlib}/unicore/mktables and then re-compiling and installing.
7688       (%Config is available from the Config module).
7689
7690       Also, perl can be recompiled to operate on an earlier version of the
7691       Unicode standard.  Further information is at
7692       $Config{privlib}/unicore/README.perl.
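
       To see where these paths resolve on a given installation, and
       which Unicode version the running perl was built with, something
       like the following sketch may help.  The files may be absent from
       an installed tree, so the sketch only reports whether each one
       exists; the variable names are hypothetical.

```perl
use strict;
use warnings;
use Config;            # exports %Config
use File::Spec;
use Unicode::UCD ();

my $unicore  = File::Spec->catdir($Config{privlib}, 'unicore');
my $mktables = File::Spec->catfile($unicore, 'mktables');
my $readme   = File::Spec->catfile($unicore, 'README.perl');

# Version of the Unicode tables that this perl was compiled with.
my $version = Unicode::UCD::UnicodeVersion();

print "Unicode version in use: $version\n";
for my $path ($mktables, $readme) {
    printf "%-8s %s\n", (-e $path ? 'found' : 'missing'), $path;
}
```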
7693

Other information in the Unicode data base

7695       The Unicode data base is delivered in two different formats.  The XML
7696       version is available only for more recent Unicode releases.  The other
7697       version is a collection of files.  The two are intended to give
7698       equivalent information.  Perl uses the older form; this allows you to
7699       recompile Perl to use early Unicode releases.
7700
7701       The only non-character property that Perl currently supports is Named
7702       Sequences, in which a sequence of code points is given a name and
7703       generally treated as a single entity.  (Perl supports these via the
7704       "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
7705       in charnames, and "namedseq()" in Unicode::UCD.)
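
       A minimal sketch of the three access routes follows, using the
       named sequence KATAKANA LETTER AINU P as the sample.  The variable
       names are hypothetical.

```perl
use strict;
use warnings;
use charnames ':full';
use Unicode::UCD qw(namedseq);

my $name = "KATAKANA LETTER AINU P";

my $via_charnames = charnames::string_vianame($name);
my $via_ucd       = namedseq($name);    # scalar context returns the string
my $via_interp    = "\N{KATAKANA LETTER AINU P}";

# A named sequence expands to more than one code point.
my @code_points = map { sprintf "U+%04X", ord } split //, $via_ucd;

print "$name = @code_points\n";
print "all three agree\n"
    if $via_charnames eq $via_ucd && $via_ucd eq $via_interp;
```

       charnames::vianame() is not used here because it returns a single
       code point and so cannot represent a sequence.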
7706
7707       Below is a list of the files in the Unicode data base that Perl doesn't
7708       currently use, along with very brief descriptions of their purposes.
7709       Some of the names of the files have been shortened from those that
7710       Unicode uses, in order to allow them to be distinguishable from
7711       similarly named files on file systems for which only the first 8
7712       characters of a name are significant.
7713
7714       auxiliary/GraphemeBreakTest.html
7715       auxiliary/LineBreakTest.html
7716       auxiliary/SentenceBreakTest.html
7717       auxiliary/WordBreakTest.html
7718           Documentation of validation Tests
7719
7720       BidiCharacterTest.txt
7721       BidiTest.txt
7722           Validation Tests
7723
7724       CJKRadicals.txt
7725           Maps the kRSUnicode property values to corresponding code points
7726
7727       confusables.txt
7728       confusablesSummary.txt
7729       intentional.txt
7730           Currently unused by Perl
7731
7732       emoji/ReadMe.txt
7733       ReadMe.txt
7734           Documentation
7735
7736       EmojiSources.txt
7737           Maps certain Unicode code points to their legacy Japanese cell-
7738           phone values
7739
7740       extracted/DName.txt
7741           This file adds no new information not already present in other
7742           files
7743
7744       Index.txt
7745           Alphabetical index of Unicode characters
7746
7747       NamedSqProv.txt
7748           Named sequences proposed for inclusion in a later version of the
7749           Unicode Standard; if you need them now, you can append this file to
7750           NamedSequences.txt and recompile perl
7751
7752       NamesList.html
7753           Describes the format and contents of NamesList.txt
7754
7755       NamesList.txt
7756           Annotated list of characters
7757
7758       NormalizationCorrections.txt
7759           Documentation of corrections already incorporated into the Unicode
7760           data base
7761
7762       NushuSources.txt
7763           Specifies source material for Nushu characters
7764
7765       StandardizedVariants.html
7766           Obsoleted as of Unicode 9.0, but previously provided a visual
7767           display of the standard variant sequences derived from
7768           StandardizedVariants.txt.
7769
7770       StandardizedVariants.txt
7771           Certain glyph variations for character display are standardized.
7772           This lists the non-Unihan ones; the Unihan ones are also not used
7773           by Perl, and are in a separate Unicode data base
7774           <http://www.unicode.org/ivd>
7775
7776       TangutSources.txt
7777           Specifies source mappings for Tangut ideographs and components.
7778           This data file also includes informative radical-stroke values that
7779           are used internally by Unicode
7780
7781       USourceData.txt
7782           Documentation of status and cross reference of proposals for
7783           encoding by Unicode of Unihan characters
7784
7785       USourceGlyphs.pdf
7786           Pictures of the characters in USourceData.txt
7787

SEE ALSO

7789       <http://www.unicode.org/reports/tr44/>
7790
7791       perlrecharclass
7792
7793       perlunicode
7794
7795
7796
7797perl v5.38.2                      2023-11-30                   PERLUNIPROPS(1)