PERLUNIPROPS(1)        Perl Programmers Reference Guide        PERLUNIPROPS(1)

NAME

       perluniprops - Index of Unicode Version 14.0.0 character properties in
       Perl

DESCRIPTION

       This document provides information about the portion of the Unicode
       database that deals with character properties, that is, the portion
       that is defined on single code points.  ("Other information in the
       Unicode data base" below briefly mentions other data that Unicode
       provides.)

       Perl can provide access to all non-provisional Unicode character
       properties, though not all are enabled by default.  The omitted ones
       are the Unihan properties and certain deprecated or Unicode-internal
       properties.  (An installation may choose to recompile Perl's tables to
       change this.  See "Unicode character properties that are NOT accepted
       by Perl".)

       For most purposes, access to Unicode properties from the Perl core is
       through regular expression matches, as described in the next section.
       For some special purposes, and to access the properties that are not
       suitable for regular expression matching, all the Unicode character
       properties that Perl handles are accessible via the standard
       Unicode::UCD module, as described in the section "Properties accessible
       through Unicode::UCD".
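       As a rough illustration of the Unicode::UCD route, the sketch below
       looks up a few facts about individual code points with charblock(),
       charscript(), and charinfo().  The code points and variable names
       are chosen only for illustration.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo charblock charscript);

# charblock() and charscript() take a code point and return the name of
# the block or script that contains it.
my $block  = charblock(0x2190);     # LEFTWARDS ARROW
my $script = charscript(0x03B1);    # GREEK SMALL LETTER ALPHA

# charinfo() returns a hash reference describing a single code point.
my $info = charinfo(0x0041);        # LATIN CAPITAL LETTER A

print "U+2190 block:  $block\n";
print "U+03B1 script: $script\n";
print "U+0041 name:   $info->{name}, category: $info->{category}\n";
```

       These functions answer questions about one code point at a time,
       which is the kind of lookup that a regular expression match is not
       well suited to.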
       
       Perl also provides some additional extensions and short-cut synonyms
       for Unicode properties.

       This document merely lists all available properties and does not
       attempt to explain what each property really means.  There is a brief
       description of each Perl extension; see "Other Properties" in
       perlunicode for more information on these.  There is some detail about
       Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
       to find out about the intricacies of the official Unicode properties,
       refer to the Unicode standard.  A good starting place is
       <http://www.unicode.org/reports/tr44/>.

       Note that you can define your own properties; see "User-Defined
       Character Properties" in perlunicode.

Properties accessible through "\p{}" and "\P{}"

       The Perl regular expression "\p{}" and "\P{}" constructs give access to
       most of the Unicode character properties.  The table below shows all
       these constructs, both single and compound forms.

       Compound forms consist of two components, separated by an equals sign
       or a colon.  The first component is the property name, and the second
       component is the particular value of the property to match against, for
       example, "\p{Script_Extensions: Greek}" and
       "\p{Script_Extensions=Greek}" both mean to match characters whose
       Script_Extensions property value is Greek.  ("Script_Extensions" is an
       improved version of the "Script" property.)

       Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
       their equivalent compound forms.  The table shows these equivalences.
       (In our example, "\p{Greek}" is just a shortcut for
       "\p{Script_Extensions=Greek}".)  There are also a few Perl-defined
       single forms that are not shortcuts for a compound form.  One such is
       "\p{Word}".  These are also listed in the table.
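       The following is a minimal sketch of the single and compound forms
       just described.  The test characters and variable names are
       illustrative choices only.

```perl
use strict;
use warnings;

my $alpha = "\x{03B1}";    # GREEK SMALL LETTER ALPHA

# Compound forms: property name, then "=" or ":", then the value.
my $equals = $alpha =~ /\p{Script_Extensions=Greek}/  ? 1 : 0;
my $colon  = $alpha =~ /\p{Script_Extensions: Greek}/ ? 1 : 0;

# Single form: a Perl-defined shortcut for the compound form above.
my $single = $alpha =~ /\p{Greek}/ ? 1 : 0;

# \p{Word} is a single form that is not a shortcut for a compound form.
my $word = "_" =~ /\p{Word}/ ? 1 : 0;

print "equals=$equals colon=$colon single=$single word=$word\n";
```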
       
       In parsing these constructs, Perl always ignores upper/lower case
       differences everywhere within the {braces}.  Thus "\p{Greek}" means the
       same thing as "\p{greek}".  But note that changing the case of the "p"
       or "P" before the left brace completely changes the meaning of the
       construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
       Casing in this document is for improved legibility.

       Also, white space, hyphens, and underscores are normally ignored
       everywhere between the {braces}, and hence can be freely added or
       removed even if the "/x" modifier hasn't been specified on the regular
       expression.  But in the table below a 'T' at the beginning of an entry
       means that tighter (stricter) rules are used for that entry:

           Single form ("\p{name}") tighter rules:
               White space, hyphens, and underscores ARE significant except
               for:

               •   white space adjacent to a non-word character

               •   underscores separating digits in numbers

               That means, for example, that you can freely add or remove
               white space adjacent to (but within) the braces without
               affecting the meaning.

           Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
               The tighter rules given above for the single form apply to
               everything to the right of the colon or equals; the looser
               rules still apply to everything to the left.

               That means, for example, that you can freely add or remove
               white space adjacent to (but within) the braces and the colon
               or equal sign.
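       The looser rules can be sketched as follows.  Every spelling in the
       list is meant to name the same property; the particular spellings
       are illustrative.  The two Age spellings are the ones the table
       itself lists for version 6.2, an entry that is marked 'T'.

```perl
use strict;
use warnings;

my $alpha = "\x{03B1}";    # GREEK SMALL LETTER ALPHA

# Under the looser rules, case, white space, hyphens, and underscores
# inside the braces make no difference.
my @spellings = (
    qr/\p{Script_Extensions=Greek}/,
    qr/\p{script_extensions=greek}/,
    qr/\p{ScriptExtensions=GREEK}/,
    qr/\p{ Script Extensions = Greek }/,
    qr/\p{Script-Extensions: Greek}/,
);
my $matched = grep { $alpha =~ $_ } @spellings;

# The case of the letter before the brace is significant: \P negates.
my $negated = $alpha =~ /\P{Greek}/ ? 1 : 0;

# An entry with tighter rules, spelled as it appears in the table.
my $lira    = "\x{20BA}";    # TURKISH LIRA SIGN
my $age_num = $lira =~ /\p{Age: 6.2}/ ? 1 : 0;
my $age_v   = $lira =~ /\p{Age=V6_2}/ ? 1 : 0;

printf "%d of %d spellings matched; negated=%d age_num=%d age_v=%d\n",
    $matched, scalar @spellings, $negated, $age_num, $age_v;
```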
       
       Some properties are considered obsolete by Unicode, but still
       available.  There are several varieties of obsolescence:

           Stabilized
               A property may be stabilized.  Such a determination does not
               indicate that the property should or should not be used;
               instead it is a declaration that the property will not be
               maintained nor extended for newly encoded characters.  Such
               properties are marked with an 'S' in the table.

           Deprecated
               A property may be deprecated, perhaps because its original
               intent has been replaced by another property, or because its
               specification was somehow defective.  This means that its use
               is strongly discouraged, so much so that a warning will be
               issued if used, unless the regular expression is in the scope
               of a "no warnings 'deprecated'" statement.  A 'D' flags each
               such entry in the table, and the entry there for the longest,
               most descriptive version of the property will give the reason
               it is deprecated, and perhaps advice.  Perl may issue such a
               warning, even for properties that aren't officially deprecated
               by Unicode, when there used to be characters or code points
               that were matched by them, but no longer.  This is to warn you
               that your program may not work like it did on earlier Unicode
               releases.

               A deprecated property may be made unavailable in a future Perl
               version, so it is best to move away from it.

               A deprecated property may also be stabilized, but this fact is
               not shown.

           Obsolete
               Properties marked with an 'O' in the table are considered
               (plain) obsolete.  Generally this designation is given to
               properties that Unicode once used for internal purposes (but
               not any longer).

           Discouraged
               This is not actually a Unicode-specified obsolescence, but
               applies to certain Perl extensions that are present for
               backwards compatibility, but whose use is discouraged.
               These are not obsolete, but their meanings are not stable.
               Future Unicode versions could force any of these extensions to
               be removed without warning, replaced by another property with
               the same name that means something different.  An 'X' flags
               each such entry in the table.  Use the equivalent shown
               instead.

               In particular, matches in the Block property have single forms
               defined by Perl that begin with "In_", "Is_", or even with no
               prefix at all.  Like all DISCOURAGED forms, these are not
               stable.  For example, "\p{Block=Deseret}" can currently be
               written as "\p{In_Deseret}", "\p{Is_Deseret}", or
               "\p{Deseret}".  But, a new Unicode version may come along that
               would force Perl to change the meaning of one or more of these,
               and your program would no longer be correct.  Currently there
               are no such conflicts with the form that begins "In_", but
               there are many with the other two shortcuts, and Unicode
               continues to define new properties that begin with "In", so
               it's quite possible that a conflict will occur in the future.
               The compound form is guaranteed to not become obsolete, and its
               meaning is clearer anyway.  See "Blocks" in perlunicode for
               more information about this.

               User-defined properties must begin with "In" or "Is".  These
               override any Unicode property of the same name.
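       To illustrate why the compound Block form is the safer spelling,
       the sketch below contrasts a script match with block matches for the
       same character.  It relies on the table entries for "\p{Arabic}"
       (which is Script_Extensions, NOT Block) and for the Arabic_Supplement
       and Arrows blocks; the sample code points are an illustrative
       choice.

```perl
use strict;
use warnings;

# U+0750 is an Arabic letter that lives in the Arabic Supplement block.
my $beh = "\x{0750}";

# The single form \p{Arabic} means Script_Extensions=Arabic ...
my $in_script = $beh =~ /\p{Arabic}/ ? 1 : 0;

# ... which is not the same set as the block named Arabic.
my $in_arabic_block = $beh =~ /\p{Block=Arabic}/            ? 1 : 0;
my $in_sup_block    = $beh =~ /\p{Block=Arabic_Supplement}/ ? 1 : 0;

# Spelling out "Block=" leaves no doubt about which property is meant.
my $arrow_in_block = "\x{2190}" =~ /\p{Block=Arrows}/ ? 1 : 0;

print "script=$in_script arabic_block=$in_arabic_block ",
      "sup_block=$in_sup_block arrows=$arrow_in_block\n";
```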
       
       The table below has two columns.  The left column contains the "\p{}"
       constructs to look up, possibly preceded by the flags mentioned above;
       and the right column contains information about them, like a
       description, or synonyms.  The table shows both the single and compound
       forms for each property that has them.  If the left column is a short
       name for a property, the right column will give its longer, more
       descriptive name; and if the left column is the longest name, the right
       column will show any equivalent shortest name, in both single and
       compound forms if applicable.

       If braces are not needed to specify a property (e.g., "\pL"), the left
       column contains both forms, with and without braces.

       The right column will also caution you if a property means something
       different than what might normally be expected.

       All single forms are Perl extensions; a few compound forms are as well,
       and are noted as such.

       Numbers in (parentheses) indicate the total number of Unicode code
       points matched by the property.  For the entries that give the longest,
       most descriptive version of the property, the count is followed by a
       list of some of the code points matched by it.  The list includes all
       the matched characters in the 0-255 range, enclosed in the familiar
       [brackets] the same as a regular expression bracketed character class.
       Following that, the next few higher matching ranges are also given.  To
       avoid visual ambiguity, the SPACE character is represented as "\x20".
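       As a rough check of how such a count can be reproduced, the sketch
       below walks every Unicode code point and tallies those matched by
       "\p{ASCII_Hex_Digit}", which the table lists as "(22: [0-9A-Fa-f])".
       The brute-force loop is an illustrative approach, not how the table
       itself was generated.

```perl
use strict;
use warnings;
no warnings 'utf8';    # the loop also visits surrogates and noncharacters

my $count   = 0;
my $matched = '';

# Test each code point from U+0000 through U+10FFFF against the property.
for my $cp (0 .. 0x10FFFF) {
    my $char = chr $cp;
    next unless $char =~ /\p{ASCII_Hex_Digit}/;
    $count++;
    $matched .= $char;
}

print "$count code points matched: $matched\n";
```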
       
       For emphasis, those properties that match no code points at all are
       listed as well in a separate section following the table.

       Most properties match the same code points regardless of whether "/i"
       case-insensitive matching is specified or not.  But a few properties
       are affected.  These are shown with the notation "(/i= other_property)"
       in the second column.  Under case-insensitive matching they match the
       same code points as the property other_property.
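       A small sketch of this effect follows.  It assumes that "\p{Lu}" is
       one of the affected properties, matching the same code points as
       General_Category=Cased_Letter under "/i"; that table entry lies
       outside the portion of the table shown here.

```perl
use strict;
use warnings;

my $lower = "a";

# Without /i, a lowercase letter is not an uppercase letter.
my $plain = $lower =~ /\p{Lu}/ ? 1 : 0;

# With /i, \p{Lu} is assumed to match any cased letter.
my $folded = $lower =~ /\p{Lu}/i ? 1 : 0;

# Characters that are not cased letters are unaffected by /i.
my $digit = "1" =~ /\p{Lu}/i ? 1 : 0;

print "plain=$plain folded=$folded digit=$digit\n";
```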
       
       There is no description given for most non-Perl defined properties (see
       <http://www.unicode.org/reports/tr44/> for that).

       For compactness, '*' is used as a wildcard instead of showing all
       possible combinations.  For example, entries like:

        \p{Gc: *}                                  \p{General_Category: *}

       mean that 'Gc' is a synonym for 'General_Category', and anything that
       is valid for the latter is also valid for the former.  Similarly,

        \p{Is_*}                                   \p{*}

       means that if and only if, for example, "\p{Foo}" exists, then
       "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
       thing.  And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
       and "\p{IsFoo=Bar}".  "*" here is restricted to something not beginning
       with an underscore.

       Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
       'Y'.  And 'No', 'F', and 'False' are all synonyms for 'N'.  The table
       shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
       for the other possibilities.  Note that not all properties which have
       values 'Yes' and 'No' are binary, and they have all their values
       spelled out without using this wild card, and a "NOT" clause in their
       description that highlights their not being binary.  These also require
       the compound form to match them, whereas true binary properties have
       both single and compound forms available.
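       The wildcard conventions can be sketched as follows, using the
       Alphabetic and General_Category entries as examples.  The test
       characters and variable names are illustrative.

```perl
use strict;
use warnings;

my $upper = "A";

# 'Gc' is a synonym for 'General_Category'.
my $gc_short = $upper =~ /\p{Gc=Lu}/                             ? 1 : 0;
my $gc_long  = $upper =~ /\p{General_Category=Uppercase_Letter}/ ? 1 : 0;

# \p{Alpha} exists, so \p{Is_Alpha} and \p{IsAlpha} mean the same thing.
my @is_forms = (qr/\p{Alpha}/, qr/\p{Is_Alpha}/, qr/\p{IsAlpha}/);
my $is_hits  = grep { $upper =~ $_ } @is_forms;

# Binary property values: Y, Yes, T, True and N, No, F, False.
my @yes = (qr/\p{Alphabetic=Y}/, qr/\p{Alphabetic=Yes}/,
           qr/\p{Alpha=T}/,      qr/\p{Alpha: True}/);
my @no  = (qr/\p{Alphabetic=N}/, qr/\p{Alphabetic=No}/,
           qr/\p{Alpha=F}/,      qr/\p{Alpha: False}/, qr/\P{Alpha}/);
my $yes_hits = grep { $upper =~ $_ } @yes;    # "A" is alphabetic
my $no_hits  = grep { "1" =~ $_ } @no;        # "1" is not

print "gc=$gc_short/$gc_long is=$is_hits yes=$yes_hits no=$no_hits\n";
```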
       
       Note that all non-essential underscores are removed in the display of
       the short names below.

       Legend summary:

       * is a wild-card
       (\d+) in the info column gives the number of Unicode code points
       matched by this property.
       D means this is deprecated.
       O means this is obsolete.
       S means this is stabilized.
       T means tighter (stricter) name matching applies.
       X means use of this form is discouraged, and may not be stable.

              NAME                           INFO

          \p{Adlam}               \p{Script_Extensions=Adlam} (Short:
                                    \p{Adlm}; NOT \p{Block=Adlam}) (90)
          \p{Adlm}                \p{Adlam} (= \p{Script_Extensions=Adlam})
                                    (NOT \p{Block=Adlam}) (90)
        X \p{Aegean_Numbers}      \p{Block=Aegean_Numbers} (64)
        T \p{Age: 1.1}            \p{Age=V1_1} (33_979)
          \p{Age: V1_1}           Code point's usage introduced in version
                                    1.1 (33_979: U+0000..01F5, U+01FA..0217,
                                    U+0250..02A8, U+02B0..02DE,
                                    U+02E0..02E9, U+0300..0345 ...)
        T \p{Age: 2.0}            \p{Age=V2_0} (144_521)
          \p{Age: V2_0}           Code point's usage was introduced in
                                    version 2.0; See also Property
                                    'Present_In' (144_521: U+0591..05A1,
                                    U+05A3..05AF, U+05C4, U+0F00..0F47,
                                    U+0F49..0F69, U+0F71..0F8B ...)
        T \p{Age: 2.1}            \p{Age=V2_1} (2)
          \p{Age: V2_1}           Code point's usage was introduced in
                                    version 2.1; See also Property
                                    'Present_In' (2: U+20AC, U+FFFC)
        T \p{Age: 3.0}            \p{Age=V3_0} (10_307)
          \p{Age: V3_0}           Code point's usage was introduced in
                                    version 3.0; See also Property
                                    'Present_In' (10_307: U+01F6..01F9,
                                    U+0218..021F, U+0222..0233,
                                    U+02A9..02AD, U+02DF, U+02EA..02EE ...)
        T \p{Age: 3.1}            \p{Age=V3_1} (44_978)
          \p{Age: V3_1}           Code point's usage was introduced in
                                    version 3.1; See also Property
                                    'Present_In' (44_978: U+03F4..03F5,
                                    U+FDD0..FDEF, U+10300..1031E,
                                    U+10320..10323, U+10330..1034A,
                                    U+10400..10425 ...)
        T \p{Age: 3.2}            \p{Age=V3_2} (1016)
          \p{Age: V3_2}           Code point's usage was introduced in
                                    version 3.2; See also Property
                                    'Present_In' (1016: U+0220, U+034F,
                                    U+0363..036F, U+03D8..03D9, U+03F6,
                                    U+048A..048B ...)
        T \p{Age: 4.0}            \p{Age=V4_0} (1226)
          \p{Age: V4_0}           Code point's usage was introduced in
                                    version 4.0; See also Property
                                    'Present_In' (1226: U+0221,
                                    U+0234..0236, U+02AE..02AF,
                                    U+02EF..02FF, U+0350..0357, U+035D..035F
                                    ...)
        T \p{Age: 4.1}            \p{Age=V4_1} (1273)
          \p{Age: V4_1}           Code point's usage was introduced in
                                    version 4.1; See also Property
                                    'Present_In' (1273: U+0237..0241,
                                    U+0358..035C, U+03FC..03FF,
                                    U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
        T \p{Age: 5.0}            \p{Age=V5_0} (1369)
          \p{Age: V5_0}           Code point's usage was introduced in
                                    version 5.0; See also Property
                                    'Present_In' (1369: U+0242..024F,
                                    U+037B..037D, U+04CF, U+04FA..04FF,
                                    U+0510..0513, U+05BA ...)
        T \p{Age: 5.1}            \p{Age=V5_1} (1624)
          \p{Age: V5_1}           Code point's usage was introduced in
                                    version 5.1; See also Property
                                    'Present_In' (1624: U+0370..0373,
                                    U+0376..0377, U+03CF, U+0487,
                                    U+0514..0523, U+0606..060A ...)
        T \p{Age: 5.2}            \p{Age=V5_2} (6648)
          \p{Age: V5_2}           Code point's usage was introduced in
                                    version 5.2; See also Property
                                    'Present_In' (6648: U+0524..0525,
                                    U+0800..082D, U+0830..083E, U+0900,
                                    U+094E, U+0955 ...)
        T \p{Age: 6.0}            \p{Age=V6_0} (2088)
          \p{Age: V6_0}           Code point's usage was introduced in
                                    version 6.0; See also Property
                                    'Present_In' (2088: U+0526..0527,
                                    U+0620, U+065F, U+0840..085B, U+085E,
                                    U+093A..093B ...)
        T \p{Age: 6.1}            \p{Age=V6_1} (732)
          \p{Age: V6_1}           Code point's usage was introduced in
                                    version 6.1; See also Property
                                    'Present_In' (732: U+058F, U+0604,
                                    U+08A0, U+08A2..08AC, U+08E4..08FE,
                                    U+0AF0 ...)
        T \p{Age: 6.2}            \p{Age=V6_2} (1)
          \p{Age: V6_2}           Code point's usage was introduced in
                                    version 6.2; See also Property
                                    'Present_In' (1: U+20BA)
        T \p{Age: 6.3}            \p{Age=V6_3} (5)
          \p{Age: V6_3}           Code point's usage was introduced in
                                    version 6.3; See also Property
                                    'Present_In' (5: U+061C, U+2066..2069)
        T \p{Age: 7.0}            \p{Age=V7_0} (2834)
          \p{Age: V7_0}           Code point's usage was introduced in
                                    version 7.0; See also Property
                                    'Present_In' (2834: U+037F,
                                    U+0528..052F, U+058D..058E, U+0605,
                                    U+08A1, U+08AD..08B2 ...)
        T \p{Age: 8.0}            \p{Age=V8_0} (7716)
          \p{Age: V8_0}           Code point's usage was introduced in
                                    version 8.0; See also Property
                                    'Present_In' (7716: U+08B3..08B4,
                                    U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
                                    ...)
        T \p{Age: 9.0}            \p{Age=V9_0} (7500)
          \p{Age: V9_0}           Code point's usage was introduced in
                                    version 9.0; See also Property
                                    'Present_In' (7500: U+08B6..08BD,
                                    U+08D4..08E2, U+0C80, U+0D4F,
                                    U+0D54..0D56, U+0D58..0D5E ...)
        T \p{Age: 10.0}           \p{Age=V10_0} (8518)
          \p{Age: V10_0}          Code point's usage was introduced in
                                    version 10.0; See also Property
                                    'Present_In' (8518: U+0860..086A,
                                    U+09FC..09FD, U+0AFA..0AFF, U+0D00,
                                    U+0D3B..0D3C, U+1CF7 ...)
        T \p{Age: 11.0}           \p{Age=V11_0} (684)
          \p{Age: V11_0}          Code point's usage was introduced in
                                    version 11.0; See also Property
                                    'Present_In' (684: U+0560, U+0588,
                                    U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
        T \p{Age: 12.0}           \p{Age=V12_0} (554)
          \p{Age: V12_0}          Code point's usage was introduced in
                                    version 12.0; See also Property
                                    'Present_In' (554: U+0C77, U+0E86,
                                    U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
        T \p{Age: 12.1}           \p{Age=V12_1} (1)
          \p{Age: V12_1}          Code point's usage was introduced in
                                    version 12.1; See also Property
                                    'Present_In' (1: U+32FF)
        T \p{Age: 13.0}           \p{Age=V13_0} (5930)
          \p{Age: V13_0}          Code point's usage was introduced in
                                    version 13.0; See also Property
                                    'Present_In' (5930: U+08BE..08C7,
                                    U+0B55, U+0D04, U+0D81, U+1ABF..1AC0,
                                    U+2B97 ...)
        T \p{Age: 14.0}           \p{Age=V14_0} (838)
          \p{Age: V14_0}          Code point's usage was introduced in
                                    version 14.0; See also Property
                                    'Present_In' (838: U+061D, U+0870..088E,
                                    U+0890..0891, U+0898..089F, U+08B5,
                                    U+08C8..08D2 ...)
          \p{Age: NA}             \p{Age=Unassigned} (829_768 plus all
                                    above-Unicode code points)
          \p{Age: Unassigned}     Code point's usage has not been assigned
                                    in any Unicode release thus far.
                                    (Short: \p{Age=NA}) (829_768 plus all
                                    above-Unicode code points:
                                    U+0378..0379, U+0380..0383, U+038B,
                                    U+038D, U+03A2, U+0530 ...)
          \p{Aghb}                \p{Caucasian_Albanian} (=
                                    \p{Script_Extensions=
                                    Caucasian_Albanian}) (NOT \p{Block=
                                    Caucasian_Albanian}) (53)
          \p{AHex}                \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
                                    (22)
          \p{AHex: *}             \p{ASCII_Hex_Digit: *}
          \p{Ahom}                \p{Script_Extensions=Ahom} (NOT \p{Block=
                                    Ahom}) (65)
        X \p{Alchemical}          \p{Alchemical_Symbols} (= \p{Block=
                                    Alchemical_Symbols}) (128)
        X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short:
                                    \p{InAlchemical}) (128)
          \p{All}                 All code points, including those above
                                    Unicode.  Same as qr/./s (1_114_112 plus
                                    all above-Unicode code points:
                                    U+0000..infinity)
          \p{Alnum}               \p{XPosixAlnum} (134_056)
          \p{Alpha}               \p{XPosixAlpha} (= \p{Alphabetic=Y})
                                    (133_396)
          \p{Alpha: *}            \p{Alphabetic: *}
          \p{Alphabetic}          \p{XPosixAlpha} (= \p{Alphabetic=Y})
                                    (133_396)
          \p{Alphabetic: N*}      (Short: \p{Alpha=N}, \P{Alpha}) (980_716
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
                                    \xb6-\xb9\xbb-\xbf\xd7\xf7],
                                    U+02C2..02C5, U+02D2..02DF,
                                    U+02E5..02EB, U+02ED, U+02EF..0344 ...)
          \p{Alphabetic: Y*}      (Short: \p{Alpha=Y}, \p{Alpha}) (133_396:
                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
                                    \xf8-\xff], U+0100..02C1, U+02C6..02D1,
                                    U+02E0..02E4, U+02EC, U+02EE ...)
        X \p{Alphabetic_PF}       \p{Alphabetic_Presentation_Forms} (=
                                    \p{Block=Alphabetic_Presentation_Forms})
                                    (80)
        X \p{Alphabetic_Presentation_Forms} \p{Block=
                                    Alphabetic_Presentation_Forms} (Short:
                                    \p{InAlphabeticPF}) (80)
          \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
                                    Anatolian_Hieroglyphs} (Short: \p{Hluw};
                                    NOT \p{Block=Anatolian_Hieroglyphs})
                                    (583)
        X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
                                    \p{Block=
                                    Ancient_Greek_Musical_Notation}) (80)
        X \p{Ancient_Greek_Musical_Notation} \p{Block=
                                    Ancient_Greek_Musical_Notation} (Short:
                                    \p{InAncientGreekMusic}) (80)
        X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
        X \p{Ancient_Symbols}     \p{Block=Ancient_Symbols} (64)
          \p{Any}                 All Unicode code points (1_114_112:
                                    U+0000..10FFFF)
          \p{Arab}                \p{Arabic} (= \p{Script_Extensions=
                                    Arabic}) (NOT \p{Block=Arabic}) (1411)
          \p{Arabic}              \p{Script_Extensions=Arabic} (Short:
                                    \p{Arab}; NOT \p{Block=Arabic}) (1411)
        X \p{Arabic_Ext_A}        \p{Arabic_Extended_A} (= \p{Block=
                                    Arabic_Extended_A}) (96)
        X \p{Arabic_Ext_B}        \p{Arabic_Extended_B} (= \p{Block=
                                    Arabic_Extended_B}) (48)
        X \p{Arabic_Extended_A}   \p{Block=Arabic_Extended_A} (Short:
                                    \p{InArabicExtA}) (96)
        X \p{Arabic_Extended_B}   \p{Block=Arabic_Extended_B} (Short:
                                    \p{InArabicExtB}) (48)
        X \p{Arabic_Math}         \p{Arabic_Mathematical_Alphabetic_Symbols}
                                    (= \p{Block=
                                    Arabic_Mathematical_Alphabetic_Symbols})
                                    (256)
        X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
                                    Arabic_Mathematical_Alphabetic_Symbols}
                                    (Short: \p{InArabicMath}) (256)
        X \p{Arabic_PF_A}         \p{Arabic_Presentation_Forms_A} (=
                                    \p{Block=Arabic_Presentation_Forms_A})
                                    (688)
        X \p{Arabic_PF_B}         \p{Arabic_Presentation_Forms_B} (=
                                    \p{Block=Arabic_Presentation_Forms_B})
                                    (144)
        X \p{Arabic_Presentation_Forms_A} \p{Block=
                                    Arabic_Presentation_Forms_A} (Short:
                                    \p{InArabicPFA}) (688)
        X \p{Arabic_Presentation_Forms_B} \p{Block=
                                    Arabic_Presentation_Forms_B} (Short:
                                    \p{InArabicPFB}) (144)
        X \p{Arabic_Sup}          \p{Arabic_Supplement} (= \p{Block=
                                    Arabic_Supplement}) (48)
        X \p{Arabic_Supplement}   \p{Block=Arabic_Supplement} (Short:
                                    \p{InArabicSup}) (48)
          \p{Armenian}            \p{Script_Extensions=Armenian} (Short:
                                    \p{Armn}; NOT \p{Block=Armenian}) (96)
          \p{Armi}                \p{Imperial_Aramaic} (=
                                    \p{Script_Extensions=Imperial_Aramaic})
                                    (NOT \p{Block=Imperial_Aramaic}) (31)
          \p{Armn}                \p{Armenian} (= \p{Script_Extensions=
                                    Armenian}) (NOT \p{Block=Armenian}) (96)
        X \p{Arrows}              \p{Block=Arrows} (112)
          \p{ASCII}               \p{Block=Basic_Latin} (128)
          \p{ASCII_Hex_Digit}     \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
                                    (22)
          \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
                                    \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
                                    U+0100..infinity)
          \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
                                    Fa-f])
          \p{Assigned}            All assigned code points (284_278:
                                    U+0000..0377, U+037A..037F,
                                    U+0384..038A, U+038C, U+038E..03A1,
                                    U+03A3..052F ...)
          \p{Avestan}             \p{Script_Extensions=Avestan} (Short:
                                    \p{Avst}; NOT \p{Block=Avestan}) (61)
          \p{Avst}                \p{Avestan} (= \p{Script_Extensions=
                                    Avestan}) (NOT \p{Block=Avestan}) (61)
511          \p{Bali}                \p{Balinese} (= \p{Script_Extensions=
512                                    Balinese}) (NOT \p{Block=Balinese}) (124)
513          \p{Balinese}            \p{Script_Extensions=Balinese} (Short:
514                                    \p{Bali}; NOT \p{Block=Balinese}) (124)
515          \p{Bamu}                \p{Bamum} (= \p{Script_Extensions=Bamum})
516                                    (NOT \p{Block=Bamum}) (657)
517          \p{Bamum}               \p{Script_Extensions=Bamum} (Short:
518                                    \p{Bamu}; NOT \p{Block=Bamum}) (657)
519        X \p{Bamum_Sup}           \p{Bamum_Supplement} (= \p{Block=
520                                    Bamum_Supplement}) (576)
521        X \p{Bamum_Supplement}    \p{Block=Bamum_Supplement} (Short:
522                                    \p{InBamumSup}) (576)
523        X \p{Basic_Latin}         \p{ASCII} (= \p{Block=Basic_Latin}) (128)
524          \p{Bass}                \p{Bassa_Vah} (= \p{Script_Extensions=
525                                    Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
526                                    (36)
527          \p{Bassa_Vah}           \p{Script_Extensions=Bassa_Vah} (Short:
528                                    \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
529          \p{Batak}               \p{Script_Extensions=Batak} (Short:
530                                    \p{Batk}; NOT \p{Block=Batak}) (56)
531          \p{Batk}                \p{Batak} (= \p{Script_Extensions=Batak})
532                                    (NOT \p{Block=Batak}) (56)
          \p{Bc: *}               \p{Bidi_Class: *}
          \p{Beng}                \p{Bengali} (= \p{Script_Extensions=
                                    Bengali}) (NOT \p{Block=Bengali}) (113)
          \p{Bengali}             \p{Script_Extensions=Bengali} (Short:
                                    \p{Beng}; NOT \p{Block=Bengali}) (113)
          \p{Bhaiksuki}           \p{Script_Extensions=Bhaiksuki} (Short:
                                    \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
          \p{Bhks}                \p{Bhaiksuki} (= \p{Script_Extensions=
                                    Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
                                    (97)
          \p{Bidi_C}              \p{Bidi_Control} (= \p{Bidi_Control=Y})
                                    (12)
          \p{Bidi_C: *}           \p{Bidi_Control: *}
          \p{Bidi_Class: AL}      \p{Bidi_Class=Arabic_Letter} (1708)
          \p{Bidi_Class: AN}      \p{Bidi_Class=Arabic_Number} (63)
          \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1708: U+0608,
                                    U+060B, U+060D, U+061B..064A,
                                    U+066D..066F, U+0671..06D5 ...)
          \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (63:
                                    U+0600..0605, U+0660..0669,
                                    U+066B..066C, U+06DD, U+0890..0891,
                                    U+08E2 ...)
          \p{Bidi_Class: B}       \p{Bidi_Class=Paragraph_Separator} (7)
          \p{Bidi_Class: BN}      \p{Bidi_Class=Boundary_Neutral} (4016)
          \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016: [^\t\n
                                    \cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-\xff],
                                    U+180E, U+200B..200D, U+2060..2065,
                                    U+206A..206F, U+FDD0..FDEF ...)
          \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15: [,.\/:
                                    \xa0], U+060C, U+202F, U+2044, U+FE50,
                                    U+FE52 ...)
          \p{Bidi_Class: CS}      \p{Bidi_Class=Common_Separator} (15)
          \p{Bidi_Class: EN}      \p{Bidi_Class=European_Number} (168)
          \p{Bidi_Class: ES}      \p{Bidi_Class=European_Separator} (12)
          \p{Bidi_Class: ET}      \p{Bidi_Class=European_Terminator} (92)
          \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (168: [0-9\xb2-
                                    \xb3\xb9], U+06F0..06F9, U+2070,
                                    U+2074..2079, U+2080..2089, U+2488..249B
                                    ...)
          \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
                                    U+207A..207B, U+208A..208B, U+2212,
                                    U+FB29, U+FE62..FE63 ...)
          \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92: [#\$
                                    \%\xa2-\xa5\xb0-\xb1], U+058F,
                                    U+0609..060A, U+066A, U+09F2..09F3,
                                    U+09FB ...)
          \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
                                    U+2068)
          \p{Bidi_Class: FSI}     \p{Bidi_Class=First_Strong_Isolate} (1)
          \p{Bidi_Class: L}       \p{Bidi_Class=Left_To_Right} (1_096_333
                                    plus all above-Unicode code points)
          \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_333 plus
                                    all above-Unicode code points: [A-Za-z
                                    \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                                    \xff], U+0100..02B8, U+02BB..02C1,
                                    U+02D0..02D1, U+02E0..02E4, U+02EE ...)
          \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
                                    U+202A)
          \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
                                    U+2066)
          \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
                                    U+202D)
          \p{Bidi_Class: LRE}     \p{Bidi_Class=Left_To_Right_Embedding} (1)
          \p{Bidi_Class: LRI}     \p{Bidi_Class=Left_To_Right_Isolate} (1)
          \p{Bidi_Class: LRO}     \p{Bidi_Class=Left_To_Right_Override} (1)
          \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1958:
                                    U+0300..036F, U+0483..0489,
                                    U+0591..05BD, U+05BF, U+05C1..05C2,
                                    U+05C4..05C5 ...)
          \p{Bidi_Class: NSM}     \p{Bidi_Class=Nonspacing_Mark} (1958)
          \p{Bidi_Class: ON}      \p{Bidi_Class=Other_Neutral} (6000)
          \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (6000: [!\"&\'
                                    \(\)*;<=>?\@\[\\\]\^_`\{\|\}~\xa1\xa6-
                                    \xa9\xab-\xac\xae-\xaf\xb4\xb6-\xb8\xbb-
                                    \xbf\xd7\xf7], U+02B9..02BA,
                                    U+02C2..02CF, U+02D2..02DF,
                                    U+02E5..02ED, U+02EF..02FF ...)
          \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7: [\n\r
                                    \x1c-\x1e\x85], U+2029)
          \p{Bidi_Class: PDF}     \p{Bidi_Class=Pop_Directional_Format} (1)
          \p{Bidi_Class: PDI}     \p{Bidi_Class=Pop_Directional_Isolate} (1)
          \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
                                    U+202C)
          \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
                                    U+2069)
          \p{Bidi_Class: R}       \p{Bidi_Class=Right_To_Left} (3711)
          \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3711: U+0590,
                                    U+05BE, U+05C0, U+05C3, U+05C6,
                                    U+05C8..05FF ...)
          \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
                                    U+202B)
          \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
                                    U+2067)
          \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
                                    U+202E)
          \p{Bidi_Class: RLE}     \p{Bidi_Class=Right_To_Left_Embedding} (1)
          \p{Bidi_Class: RLI}     \p{Bidi_Class=Right_To_Left_Isolate} (1)
          \p{Bidi_Class: RLO}     \p{Bidi_Class=Right_To_Left_Override} (1)
          \p{Bidi_Class: S}       \p{Bidi_Class=Segment_Separator} (3)
          \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3: [\t\cK
                                    \x1f])
          \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
                                    U+1680, U+2000..200A, U+2028, U+205F,
                                    U+3000)
          \p{Bidi_Class: WS}      \p{Bidi_Class=White_Space} (17)
          \p{Bidi_Control}        \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
          \p{Bidi_Control: N*}    (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
                                    plus all above-Unicode code points:
                                    U+0000..061B, U+061D..200D,
                                    U+2010..2029, U+202F..2065,
                                    U+206A..infinity)
          \p{Bidi_Control: Y*}    (Short: \p{BidiC=Y}, \p{BidiC}) (12:
                                    U+061C, U+200E..200F, U+202A..202E,
                                    U+2066..2069)
          \p{Bidi_M}              \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
                                    (553)
          \p{Bidi_M: *}           \p{Bidi_Mirrored: *}
          \p{Bidi_Mirrored}       \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
                                    (553)
          \p{Bidi_Mirrored: N*}   (Short: \p{BidiM=N}, \P{BidiM}) (1_113_559
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
                                    Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
                                    \xff], U+0100..0F39, U+0F3E..169A,
                                    U+169D..2038, U+203B..2044, U+2047..207C
                                    ...)
          \p{Bidi_Mirrored: Y*}   (Short: \p{BidiM=Y}, \p{BidiM}) (553:
                                    [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
                                    U+169B..169C, U+2039..203A,
                                    U+2045..2046, U+207D..207E ...)
          \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
                                    (64)
          \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (64: [\)\]
                                    \}], U+0F3B, U+0F3D, U+169C, U+2046,
                                    U+207E ...)
          \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
                                    (1_113_984 plus all above-Unicode code
                                    points)
          \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_984
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=>?
                                    \@A-Z\\\^_`a-z\|~\x7f-\xff],
                                    U+0100..0F39, U+0F3E..169A,
                                    U+169D..2044, U+2047..207C, U+207F..208C
                                    ...)
          \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
                                    (64)
          \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (64:
                                    [\(\[\{], U+0F3A, U+0F3C, U+169B,
                                    U+2045, U+207D ...)
          \p{Blank}               \p{XPosixBlank} (18)
          \p{Blk: *}              \p{Block: *}
          \p{Block: Adlam}        (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
                                    U+1E900..1E95F)
          \p{Block: Aegean_Numbers} (64: U+10100..1013F)
          \p{Block: Ahom}         (NOT \p{Ahom} NOR \p{Is_Ahom}) (80:
                                    U+11700..1174F)
          \p{Block: Alchemical}   \p{Block=Alchemical_Symbols} (128)
          \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
                                    U+1F700..1F77F)
          \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
                                    (80)
          \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
                                    AlphabeticPF}) (80: U+FB00..FB4F)
          \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
                                    NOR \p{Is_Anatolian_Hieroglyphs}) (640:
                                    U+14400..1467F)
          \p{Block: Ancient_Greek_Music} \p{Block=
                                    Ancient_Greek_Musical_Notation} (80)
          \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
                                    AncientGreekMusic}) (80: U+1D200..1D24F)
          \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
          \p{Block: Ancient_Symbols} (64: U+10190..101CF)
          \p{Block: Arabic}       (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
                                    U+0600..06FF)
          \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
          \p{Block: Arabic_Ext_B} \p{Block=Arabic_Extended_B} (48)
          \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
                                    U+08A0..08FF)
          \p{Block: Arabic_Extended_B} (Short: \p{Blk=ArabicExtB}) (48:
                                    U+0870..089F)
          \p{Block: Arabic_Math}  \p{Block=
                                    Arabic_Mathematical_Alphabetic_Symbols}
                                    (256)
          \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
                                    ArabicMath}) (256: U+1EE00..1EEFF)
          \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
          \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
          \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
                                    (688: U+FB50..FDFF)
          \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
                                    (144: U+FE70..FEFF)
          \p{Block: Arabic_Sup}   \p{Block=Arabic_Supplement} (48)
          \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
                                    U+0750..077F)
          \p{Block: Armenian}     (NOT \p{Armenian} NOR \p{Is_Armenian})
                                    (96: U+0530..058F)
          \p{Block: Arrows}       (112: U+2190..21FF)
          \p{Block: ASCII}        \p{Block=Basic_Latin} (128)
          \p{Block: Avestan}      (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
                                    U+10B00..10B3F)
          \p{Block: Balinese}     (NOT \p{Balinese} NOR \p{Is_Balinese})
                                    (128: U+1B00..1B7F)
          \p{Block: Bamum}        (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
                                    U+A6A0..A6FF)
          \p{Block: Bamum_Sup}    \p{Block=Bamum_Supplement} (576)
          \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
                                    U+16800..16A3F)
          \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
          \p{Block: Bassa_Vah}    (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
                                    (48: U+16AD0..16AFF)
          \p{Block: Batak}        (NOT \p{Batak} NOR \p{Is_Batak}) (64:
                                    U+1BC0..1BFF)
          \p{Block: Bengali}      (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
                                    U+0980..09FF)
          \p{Block: Bhaiksuki}    (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
                                    (112: U+11C00..11C6F)
          \p{Block: Block_Elements} (32: U+2580..259F)
          \p{Block: Bopomofo}     (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
                                    (48: U+3100..312F)
          \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
          \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
                                    U+31A0..31BF)
          \p{Block: Box_Drawing}  (128: U+2500..257F)
          \p{Block: Brahmi}       (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
                                    U+11000..1107F)
          \p{Block: Braille}      \p{Block=Braille_Patterns} (256)
          \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
                                    U+2800..28FF)
          \p{Block: Buginese}     (NOT \p{Buginese} NOR \p{Is_Buginese})
                                    (32: U+1A00..1A1F)
          \p{Block: Buhid}        (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
                                    U+1740..175F)
          \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
                                    (256)
          \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
                                    ByzantineMusic}) (256: U+1D000..1D0FF)
          \p{Block: Canadian_Syllabics} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics}
                                    (640)
          \p{Block: Carian}       (NOT \p{Carian} NOR \p{Is_Carian}) (64:
                                    U+102A0..102DF)
          \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
                                    \p{Is_Caucasian_Albanian}) (64:
                                    U+10530..1056F)
          \p{Block: Chakma}       (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
                                    U+11100..1114F)
          \p{Block: Cham}         (NOT \p{Cham} NOR \p{Is_Cham}) (96:
                                    U+AA00..AA5F)
          \p{Block: Cherokee}     (NOT \p{Cherokee} NOR \p{Is_Cherokee})
                                    (96: U+13A0..13FF)
          \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
          \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
                                    U+AB70..ABBF)
          \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
          \p{Block: Chorasmian}   (NOT \p{Chorasmian} NOR \p{Is_Chorasmian})
                                    (48: U+10FB0..10FDF)
          \p{Block: CJK}          \p{Block=CJK_Unified_Ideographs} (20_992)
          \p{Block: CJK_Compat}   \p{Block=CJK_Compatibility} (256)
          \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
          \p{Block: CJK_Compat_Ideographs} \p{Block=
                                    CJK_Compatibility_Ideographs} (512)
          \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
                                    CJK_Compatibility_Ideographs_Supplement}
                                    (544)
          \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
                                    U+3300..33FF)
          \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
                                    (32: U+FE30..FE4F)
          \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
                                    CJKCompatIdeographs}) (512: U+F900..FAFF)
          \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
                                    CJKCompatIdeographsSup}) (544:
                                    U+2F800..2FA1F)
          \p{Block: CJK_Ext_A}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_A}
                                    (6592)
          \p{Block: CJK_Ext_B}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_B}
                                    (42_720)
          \p{Block: CJK_Ext_C}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_C}
                                    (4160)
          \p{Block: CJK_Ext_D}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_D} (224)
          \p{Block: CJK_Ext_E}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_E}
                                    (5776)
          \p{Block: CJK_Ext_F}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_F}
                                    (7488)
          \p{Block: CJK_Ext_G}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_G}
                                    (4944)
          \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
          \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
                                    (128: U+2E80..2EFF)
          \p{Block: CJK_Strokes}  (48: U+31C0..31EF)
          \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
          \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
                                    (64: U+3000..303F)
          \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
                                    U+4E00..9FFF)
          \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
                                    CJKExtA}) (6592: U+3400..4DBF)
          \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
                                    CJKExtB}) (42_720: U+20000..2A6DF)
          \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
                                    CJKExtC}) (4160: U+2A700..2B73F)
          \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
                                    CJKExtD}) (224: U+2B740..2B81F)
          \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
                                    CJKExtE}) (5776: U+2B820..2CEAF)
          \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
                                    CJKExtF}) (7488: U+2CEB0..2EBEF)
          \p{Block: CJK_Unified_Ideographs_Extension_G} (Short: \p{Blk=
                                    CJKExtG}) (4944: U+30000..3134F)
          \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
                                    Diacriticals}) (112: U+0300..036F)
          \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
                                    DiacriticalsExt}) (80: U+1AB0..1AFF)
          \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
                                    DiacriticalsForSymbols}) (48:
                                    U+20D0..20FF)
          \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
                                    DiacriticalsSup}) (64: U+1DC0..1DFF)
          \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
                                    U+FE20..FE2F)
          \p{Block: Combining_Marks_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (48)
          \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
                                    IndicNumberForms}) (16: U+A830..A83F)
          \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
          \p{Block: Control_Pictures} (64: U+2400..243F)
          \p{Block: Coptic}       (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
                                    U+2C80..2CFF)
          \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
          \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
          \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
                                    U+1D360..1D37F)
          \p{Block: Cuneiform}    (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
                                    (1024: U+12000..123FF)
          \p{Block: Cuneiform_Numbers} \p{Block=
                                    Cuneiform_Numbers_And_Punctuation} (128)
          \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
                                    CuneiformNumbers}) (128: U+12400..1247F)
          \p{Block: Currency_Symbols} (48: U+20A0..20CF)
          \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
          \p{Block: Cypro_Minoan} (NOT \p{Cypro_Minoan} NOR
                                    \p{Is_Cypro_Minoan}) (112:
                                    U+12F90..12FFF)
          \p{Block: Cyrillic}     (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
                                    (256: U+0400..04FF)
          \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
          \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
          \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
          \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
                                    U+2DE0..2DFF)
          \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
                                    U+A640..A69F)
          \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
                                    U+1C80..1C8F)
          \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
          \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
                                    U+0500..052F)
          \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
                                    (48)
          \p{Block: Deseret}      (80: U+10400..1044F)
          \p{Block: Devanagari}   (NOT \p{Devanagari} NOR \p{Is_Devanagari})
                                    (128: U+0900..097F)
          \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
          \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
                                    U+A8E0..A8FF)
          \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
          \p{Block: Diacriticals_Ext} \p{Block=
                                    Combining_Diacritical_Marks_Extended}
                                    (80)
          \p{Block: Diacriticals_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (48)
          \p{Block: Diacriticals_Sup} \p{Block=
                                    Combining_Diacritical_Marks_Supplement}
                                    (64)
          \p{Block: Dingbats}     (192: U+2700..27BF)
          \p{Block: Dives_Akuru}  (NOT \p{Dives_Akuru} NOR
                                    \p{Is_Dives_Akuru}) (96: U+11900..1195F)
          \p{Block: Dogra}        (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
                                    U+11800..1184F)
          \p{Block: Domino}       \p{Block=Domino_Tiles} (112)
          \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
                                    U+1F030..1F09F)
          \p{Block: Duployan}     (NOT \p{Duployan} NOR \p{Is_Duployan})
                                    (160: U+1BC00..1BC9F)
          \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
          \p{Block: Egyptian_Hieroglyph_Format_Controls} (16: U+13430..1343F)
          \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
                                    \p{Is_Egyptian_Hieroglyphs}) (1072:
                                    U+13000..1342F)
          \p{Block: Elbasan}      (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
                                    U+10500..1052F)
          \p{Block: Elymaic}      (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
                                    U+10FE0..10FFF)
          \p{Block: Emoticons}    (80: U+1F600..1F64F)
          \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
          \p{Block: Enclosed_Alphanum_Sup} \p{Block=
                                    Enclosed_Alphanumeric_Supplement} (256)
          \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
                                    EnclosedAlphanumSup}) (256:
                                    U+1F100..1F1FF)
          \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
                                    EnclosedAlphanum}) (160: U+2460..24FF)
          \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
                                    (256)
          \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
                                    EnclosedCJK}) (256: U+3200..32FF)
          \p{Block: Enclosed_Ideographic_Sup} \p{Block=
                                    Enclosed_Ideographic_Supplement} (256)
          \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
                                    EnclosedIdeographicSup}) (256:
                                    U+1F200..1F2FF)
          \p{Block: Ethiopic}     (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
                                    (384: U+1200..137F)
          \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
          \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
          \p{Block: Ethiopic_Ext_B} \p{Block=Ethiopic_Extended_B} (32)
          \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
                                    U+2D80..2DDF)
          \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
                                    U+AB00..AB2F)
          \p{Block: Ethiopic_Extended_B} (Short: \p{Blk=EthiopicExtB}) (32:
                                    U+1E7E0..1E7FF)
          \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
          \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
                                    U+1380..139F)
          \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
                                    \p{Punct} NOR \p{Is_Punctuation}) (112:
                                    U+2000..206F)
          \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
          \p{Block: Geometric_Shapes_Ext} \p{Block=
                                    Geometric_Shapes_Extended} (128)
          \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
                                    GeometricShapesExt}) (128:
                                    U+1F780..1F7FF)
          \p{Block: Georgian}     (NOT \p{Georgian} NOR \p{Is_Georgian})
                                    (96: U+10A0..10FF)
          \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
          \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
                                    U+1C90..1CBF)
          \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
          \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
                                    U+2D00..2D2F)
          \p{Block: Glagolitic}   (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
                                    (96: U+2C00..2C5F)
          \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
          \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
                                    (48: U+1E000..1E02F)
          \p{Block: Gothic}       (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
                                    U+10330..1034F)
          \p{Block: Grantha}      (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
                                    U+11300..1137F)
          \p{Block: Greek}        \p{Block=Greek_And_Coptic} (NOT \p{Greek}
                                    NOR \p{Is_Greek}) (144)
          \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
                                    NOR \p{Is_Greek}) (144: U+0370..03FF)
          \p{Block: Greek_Ext}    \p{Block=Greek_Extended} (256)
          \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
                                    U+1F00..1FFF)
          \p{Block: Gujarati}     (NOT \p{Gujarati} NOR \p{Is_Gujarati})
                                    (128: U+0A80..0AFF)
          \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
                                    \p{Is_Gunjala_Gondi}) (80:
                                    U+11D60..11DAF)
          \p{Block: Gurmukhi}     (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
                                    (128: U+0A00..0A7F)
          \p{Block: Half_And_Full_Forms} \p{Block=
                                    Halfwidth_And_Fullwidth_Forms} (240)
          \p{Block: Half_Marks}   \p{Block=Combining_Half_Marks} (16)
          \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
                                    HalfAndFullForms}) (240: U+FF00..FFEF)
          \p{Block: Hangul}       \p{Block=Hangul_Syllables} (NOT \p{Hangul}
                                    NOR \p{Is_Hangul}) (11_184)
          \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
1016                                    (96: U+3130..318F)
          \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
          \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
                                    U+A960..A97F)
          \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
                                    U+D7B0..D7FF)
          \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
                                    NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
          \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
                                    \p{Is_Hanifi_Rohingya}) (64:
                                    U+10D00..10D3F)
          \p{Block: Hanunoo}      (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
                                    U+1720..173F)
          \p{Block: Hatran}       (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
                                    U+108E0..108FF)
          \p{Block: Hebrew}       (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
                                    U+0590..05FF)
          \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
                                    HighPUSurrogates}) (128: U+DB80..DBFF)
          \p{Block: High_PU_Surrogates} \p{Block=
                                    High_Private_Use_Surrogates} (128)
          \p{Block: High_Surrogates} (896: U+D800..DB7F)
          \p{Block: Hiragana}     (NOT \p{Hiragana} NOR \p{Is_Hiragana})
                                    (96: U+3040..309F)
          \p{Block: IDC}          \p{Block=
                                    Ideographic_Description_Characters} (NOT
                                    \p{ID_Continue} NOR \p{Is_IDC}) (16)
          \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
                                    NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
                                    U+2FF0..2FFF)
          \p{Block: Ideographic_Symbols} \p{Block=
                                    Ideographic_Symbols_And_Punctuation} (32)
          \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
                                    IdeographicSymbols}) (32: U+16FE0..16FFF)
          \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
                                    \p{Is_Imperial_Aramaic}) (32:
                                    U+10840..1085F)
          \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
                                    (16)
          \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
          \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
                                    NOR \p{Is_Inscriptional_Pahlavi}) (32:
                                    U+10B60..10B7F)
          \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
                                    NOR \p{Is_Inscriptional_Parthian}) (32:
                                    U+10B40..10B5F)
          \p{Block: IPA_Ext}      \p{Block=IPA_Extensions} (96)
          \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
                                    U+0250..02AF)
          \p{Block: Jamo}         \p{Block=Hangul_Jamo} (256)
          \p{Block: Jamo_Ext_A}   \p{Block=Hangul_Jamo_Extended_A} (32)
          \p{Block: Jamo_Ext_B}   \p{Block=Hangul_Jamo_Extended_B} (80)
          \p{Block: Javanese}     (NOT \p{Javanese} NOR \p{Is_Javanese})
                                    (96: U+A980..A9DF)
          \p{Block: Kaithi}       (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
                                    U+11080..110CF)
          \p{Block: Kana_Ext_A}   \p{Block=Kana_Extended_A} (48)
          \p{Block: Kana_Ext_B}   \p{Block=Kana_Extended_B} (16)
          \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
                                    U+1B100..1B12F)
          \p{Block: Kana_Extended_B} (Short: \p{Blk=KanaExtB}) (16:
                                    U+1AFF0..1AFFF)
          \p{Block: Kana_Sup}     \p{Block=Kana_Supplement} (256)
          \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
                                    U+1B000..1B0FF)
          \p{Block: Kanbun}       (16: U+3190..319F)
          \p{Block: Kangxi}       \p{Block=Kangxi_Radicals} (224)
          \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
                                    U+2F00..2FDF)
          \p{Block: Kannada}      (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
                                    U+0C80..0CFF)
          \p{Block: Katakana}     (NOT \p{Katakana} NOR \p{Is_Katakana})
                                    (96: U+30A0..30FF)
          \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
          \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
                                    KatakanaExt}) (16: U+31F0..31FF)
          \p{Block: Kayah_Li}     (48: U+A900..A92F)
          \p{Block: Kharoshthi}   (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
                                    (96: U+10A00..10A5F)
          \p{Block: Khitan_Small_Script} (NOT \p{Khitan_Small_Script} NOR
                                    \p{Is_Khitan_Small_Script}) (512:
                                    U+18B00..18CFF)
          \p{Block: Khmer}        (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
                                    U+1780..17FF)
          \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
          \p{Block: Khojki}       (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
                                    U+11200..1124F)
          \p{Block: Khudawadi}    (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
                                    (80: U+112B0..112FF)
          \p{Block: Lao}          (NOT \p{Lao} NOR \p{Is_Lao}) (128:
                                    U+0E80..0EFF)
          \p{Block: Latin_1}      \p{Block=Latin_1_Supplement} (128)
          \p{Block: Latin_1_Sup}  \p{Block=Latin_1_Supplement} (128)
          \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
                                    \xff])
          \p{Block: Latin_Ext_A}  \p{Block=Latin_Extended_A} (128)
          \p{Block: Latin_Ext_Additional} \p{Block=
                                    Latin_Extended_Additional} (256)
          \p{Block: Latin_Ext_B}  \p{Block=Latin_Extended_B} (208)
          \p{Block: Latin_Ext_C}  \p{Block=Latin_Extended_C} (32)
          \p{Block: Latin_Ext_D}  \p{Block=Latin_Extended_D} (224)
          \p{Block: Latin_Ext_E}  \p{Block=Latin_Extended_E} (64)
          \p{Block: Latin_Ext_F}  \p{Block=Latin_Extended_F} (64)
          \p{Block: Latin_Ext_G}  \p{Block=Latin_Extended_G} (256)
          \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
                                    U+0100..017F)
          \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
                                    LatinExtAdditional}) (256: U+1E00..1EFF)
          \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
                                    U+0180..024F)
          \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
                                    U+2C60..2C7F)
          \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
                                    U+A720..A7FF)
          \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
                                    U+AB30..AB6F)
          \p{Block: Latin_Extended_F} (Short: \p{Blk=LatinExtF}) (64:
                                    U+10780..107BF)
          \p{Block: Latin_Extended_G} (Short: \p{Blk=LatinExtG}) (256:
                                    U+1DF00..1DFFF)
          \p{Block: Lepcha}       (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
                                    U+1C00..1C4F)
          \p{Block: Letterlike_Symbols} (80: U+2100..214F)
          \p{Block: Limbu}        (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
                                    U+1900..194F)
          \p{Block: Linear_A}     (NOT \p{Linear_A} NOR \p{Is_Linear_A})
                                    (384: U+10600..1077F)
          \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
          \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
          \p{Block: Lisu}         (NOT \p{Lisu} NOR \p{Is_Lisu}) (48:
                                    U+A4D0..A4FF)
          \p{Block: Lisu_Sup}     \p{Block=Lisu_Supplement} (16)
          \p{Block: Lisu_Supplement} (Short: \p{Blk=LisuSup}) (16:
                                    U+11FB0..11FBF)
          \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
          \p{Block: Lycian}       (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
                                    U+10280..1029F)
          \p{Block: Lydian}       (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
                                    U+10920..1093F)
          \p{Block: Mahajani}     (NOT \p{Mahajani} NOR \p{Is_Mahajani})
                                    (48: U+11150..1117F)
          \p{Block: Mahjong}      \p{Block=Mahjong_Tiles} (48)
          \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
                                    U+1F000..1F02F)
          \p{Block: Makasar}      (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
                                    U+11EE0..11EFF)
          \p{Block: Malayalam}    (NOT \p{Malayalam} NOR \p{Is_Malayalam})
                                    (128: U+0D00..0D7F)
          \p{Block: Mandaic}      (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
                                    U+0840..085F)
          \p{Block: Manichaean}   (NOT \p{Manichaean} NOR \p{Is_Manichaean})
                                    (64: U+10AC0..10AFF)
          \p{Block: Marchen}      (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
                                    U+11C70..11CBF)
          \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
                                    \p{Is_Masaram_Gondi}) (96:
                                    U+11D00..11D5F)
          \p{Block: Math_Alphanum} \p{Block=
                                    Mathematical_Alphanumeric_Symbols} (1024)
          \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
          \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
                                    MathAlphanum}) (1024: U+1D400..1D7FF)
          \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
                                    (256: U+2200..22FF)
          \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF)
          \p{Block: Medefaidrin}  (NOT \p{Medefaidrin} NOR
                                    \p{Is_Medefaidrin}) (96: U+16E40..16E9F)
          \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
                                    \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
          \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
          \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
                                    (32: U+AAE0..AAFF)
          \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
                                    \p{Is_Mende_Kikakui}) (224:
                                    U+1E800..1E8DF)
          \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
                                    \p{Is_Meroitic_Cursive}) (96:
                                    U+109A0..109FF)
          \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
          \p{Block: Miao}         (NOT \p{Miao} NOR \p{Is_Miao}) (160:
                                    U+16F00..16F9F)
          \p{Block: Misc_Arrows}  \p{Block=Miscellaneous_Symbols_And_Arrows}
                                    (256)
          \p{Block: Misc_Math_Symbols_A} \p{Block=
                                    Miscellaneous_Mathematical_Symbols_A}
                                    (48)
          \p{Block: Misc_Math_Symbols_B} \p{Block=
                                    Miscellaneous_Mathematical_Symbols_B}
                                    (128)
          \p{Block: Misc_Pictographs} \p{Block=
                                    Miscellaneous_Symbols_And_Pictographs}
                                    (768)
          \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
          \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
          \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
                                    MiscMathSymbolsA}) (48: U+27C0..27EF)
          \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
                                    MiscMathSymbolsB}) (128: U+2980..29FF)
          \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
                                    (256: U+2600..26FF)
          \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
                                    MiscArrows}) (256: U+2B00..2BFF)
          \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
                                    MiscPictographs}) (768: U+1F300..1F5FF)
          \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
                                    (256: U+2300..23FF)
          \p{Block: Modi}         (NOT \p{Modi} NOR \p{Is_Modi}) (96:
                                    U+11600..1165F)
          \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
          \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
          \p{Block: Mongolian}    (NOT \p{Mongolian} NOR \p{Is_Mongolian})
                                    (176: U+1800..18AF)
          \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
          \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
                                    U+11660..1167F)
          \p{Block: Mro}          (NOT \p{Mro} NOR \p{Is_Mro}) (48:
                                    U+16A40..16A6F)
          \p{Block: Multani}      (NOT \p{Multani} NOR \p{Is_Multani}) (48:
                                    U+11280..112AF)
          \p{Block: Music}        \p{Block=Musical_Symbols} (256)
          \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
                                    U+1D100..1D1FF)
          \p{Block: Myanmar}      (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
                                    U+1000..109F)
          \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
          \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
          \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
                                    U+AA60..AA7F)
          \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
                                    U+A9E0..A9FF)
          \p{Block: Nabataean}    (NOT \p{Nabataean} NOR \p{Is_Nabataean})
                                    (48: U+10880..108AF)
          \p{Block: Nandinagari}  (NOT \p{Nandinagari} NOR
                                    \p{Is_Nandinagari}) (96: U+119A0..119FF)
          \p{Block: NB}           \p{Block=No_Block} (825_600 plus all
                                    above-Unicode code points)
          \p{Block: New_Tai_Lue}  (NOT \p{New_Tai_Lue} NOR
                                    \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
          \p{Block: Newa}         (NOT \p{Newa} NOR \p{Is_Newa}) (128:
                                    U+11400..1147F)
          \p{Block: NKo}          (NOT \p{Nko} NOR \p{Is_NKo}) (64:
                                    U+07C0..07FF)
          \p{Block: No_Block}     (Short: \p{Blk=NB}) (825_600 plus all
                                    above-Unicode code points: U+2FE0..2FEF,
                                    U+10200..1027F, U+103E0..103FF,
                                    U+105C0..105FF, U+107C0..107FF,
                                    U+108B0..108DF ...)
          \p{Block: Number_Forms} (64: U+2150..218F)
          \p{Block: Nushu}        (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
                                    U+1B170..1B2FF)
          \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong}
                                    NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80:
                                    U+1E100..1E14F)
          \p{Block: OCR}          \p{Block=Optical_Character_Recognition}
                                    (32)
          \p{Block: Ogham}        (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
                                    U+1680..169F)
          \p{Block: Ol_Chiki}     (48: U+1C50..1C7F)
          \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
                                    \p{Is_Old_Hungarian}) (128:
                                    U+10C80..10CFF)
          \p{Block: Old_Italic}   (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
                                    (48: U+10300..1032F)
          \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
          \p{Block: Old_Permic}   (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
                                    (48: U+10350..1037F)
          \p{Block: Old_Persian}  (NOT \p{Old_Persian} NOR
                                    \p{Is_Old_Persian}) (64: U+103A0..103DF)
          \p{Block: Old_Sogdian}  (NOT \p{Old_Sogdian} NOR
                                    \p{Is_Old_Sogdian}) (48: U+10F00..10F2F)
          \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
          \p{Block: Old_Turkic}   (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
                                    (80: U+10C00..10C4F)
          \p{Block: Old_Uyghur}   (NOT \p{Old_Uyghur} NOR \p{Is_Old_Uyghur})
                                    (64: U+10F70..10FAF)
          \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
                                    U+2440..245F)
          \p{Block: Oriya}        (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
                                    U+0B00..0B7F)
          \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
          \p{Block: Osage}        (NOT \p{Osage} NOR \p{Is_Osage}) (80:
                                    U+104B0..104FF)
          \p{Block: Osmanya}      (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
                                    U+10480..104AF)
          \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F)
          \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
                                    \p{Is_Pahawh_Hmong}) (144:
                                    U+16B00..16B8F)
          \p{Block: Palmyrene}    (32: U+10860..1087F)
          \p{Block: Pau_Cin_Hau}  (NOT \p{Pau_Cin_Hau} NOR
                                    \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
          \p{Block: Phags_Pa}     (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
                                    (64: U+A840..A87F)
          \p{Block: Phaistos}     \p{Block=Phaistos_Disc} (48)
          \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
                                    U+101D0..101FF)
          \p{Block: Phoenician}   (NOT \p{Phoenician} NOR \p{Is_Phoenician})
                                    (32: U+10900..1091F)
          \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
          \p{Block: Phonetic_Ext_Sup} \p{Block=
                                    Phonetic_Extensions_Supplement} (64)
          \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
                                    U+1D00..1D7F)
          \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
                                    PhoneticExtSup}) (64: U+1D80..1DBF)
          \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
          \p{Block: Private_Use}  \p{Block=Private_Use_Area} (NOT
                                    \p{Private_Use} NOR \p{Is_Private_Use})
                                    (6400)
          \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
                                    \p{Private_Use} NOR \p{Is_Private_Use})
                                    (6400: U+E000..F8FF)
          \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
                                    \p{Is_Psalter_Pahlavi}) (48:
                                    U+10B80..10BAF)
          \p{Block: PUA}          \p{Block=Private_Use_Area} (NOT
                                    \p{Private_Use} NOR \p{Is_Private_Use})
                                    (6400)
          \p{Block: Punctuation}  \p{Block=General_Punctuation} (NOT
                                    \p{Punct} NOR \p{Is_Punctuation}) (112)
          \p{Block: Rejang}       (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
                                    U+A930..A95F)
          \p{Block: Rumi}         \p{Block=Rumi_Numeral_Symbols} (32)
          \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
                                    U+10E60..10E7F)
          \p{Block: Runic}        (NOT \p{Runic} NOR \p{Is_Runic}) (96:
                                    U+16A0..16FF)
          \p{Block: Samaritan}    (NOT \p{Samaritan} NOR \p{Is_Samaritan})
                                    (64: U+0800..083F)
          \p{Block: Saurashtra}   (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
                                    (96: U+A880..A8DF)
          \p{Block: Sharada}      (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
                                    U+11180..111DF)
          \p{Block: Shavian}      (48: U+10450..1047F)
          \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
          \p{Block: Siddham}      (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
                                    U+11580..115FF)
          \p{Block: Sinhala}      (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
                                    U+0D80..0DFF)
          \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
          \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
                                    U+FE50..FE6F)
          \p{Block: Small_Forms}  \p{Block=Small_Form_Variants} (32)
          \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64)
          \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64:
                                    U+1B130..1B16F)
          \p{Block: Sogdian}      (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64:
                                    U+10F30..10F6F)
          \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
                                    \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
          \p{Block: Soyombo}      (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
                                    U+11A50..11AAF)
          \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
                                    ModifierLetters}) (80: U+02B0..02FF)
          \p{Block: Specials}     (16: U+FFF0..FFFF)
          \p{Block: Sundanese}    (NOT \p{Sundanese} NOR \p{Is_Sundanese})
                                    (64: U+1B80..1BBF)
          \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
          \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
                                    U+1CC0..1CCF)
          \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
          \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
          \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
          \p{Block: Sup_Math_Operators} \p{Block=
                                    Supplemental_Mathematical_Operators}
                                    (256)
          \p{Block: Sup_PUA_A}    \p{Block=Supplementary_Private_Use_Area_A}
                                    (65_536)
          \p{Block: Sup_PUA_B}    \p{Block=Supplementary_Private_Use_Area_B}
                                    (65_536)
          \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
          \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
                                    Supplemental_Symbols_And_Pictographs}
                                    (256)
          \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
          \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
                                    SuperAndSub}) (48: U+2070..209F)
          \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
                                    U+27F0..27FF)
          \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
                                    U+2900..297F)
          \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
                                    U+1F800..1F8FF)
          \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
                                    SupMathOperators}) (256: U+2A00..2AFF)
          \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
                                    SupPunctuation}) (128: U+2E00..2E7F)
          \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
                                    SupSymbolsAndPictographs}) (256:
                                    U+1F900..1F9FF)
          \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
                                    SupPUAA}) (65_536: U+F0000..FFFFF)
          \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
                                    SupPUAB}) (65_536: U+100000..10FFFF)
          \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
          \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
                                    \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
          \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block=
                                    Symbols_And_Pictographs_Extended_A} (144)
          \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk=
                                    SymbolsAndPictographsExtA}) (144:
                                    U+1FA70..1FAFF)
          \p{Block: Symbols_For_Legacy_Computing} (256: U+1FB00..1FBFF)
          \p{Block: Syriac}       (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
                                    U+0700..074F)
          \p{Block: Syriac_Sup}   \p{Block=Syriac_Supplement} (16)
          \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
                                    U+0860..086F)
          \p{Block: Tagalog}      (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
                                    U+1700..171F)
          \p{Block: Tagbanwa}     (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
                                    (32: U+1760..177F)
          \p{Block: Tags}         (128: U+E0000..E007F)
          \p{Block: Tai_Le}       (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
                                    U+1950..197F)
          \p{Block: Tai_Tham}     (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
                                    (144: U+1A20..1AAF)
          \p{Block: Tai_Viet}     (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
                                    (96: U+AA80..AADF)
          \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
          \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
                                    U+1D300..1D35F)
          \p{Block: Takri}        (NOT \p{Takri} NOR \p{Is_Takri}) (80:
                                    U+11680..116CF)
          \p{Block: Tamil}        (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
                                    U+0B80..0BFF)
          \p{Block: Tamil_Sup}    \p{Block=Tamil_Supplement} (64)
          \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
                                    U+11FC0..11FFF)
          \p{Block: Tangsa}       (NOT \p{Tangsa} NOR \p{Is_Tangsa}) (96:
                                    U+16A70..16ACF)
          \p{Block: Tangut}       (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
                                    U+17000..187FF)
          \p{Block: Tangut_Components} (768: U+18800..18AFF)
          \p{Block: Tangut_Sup}   \p{Block=Tangut_Supplement} (128)
          \p{Block: Tangut_Supplement} (Short: \p{Blk=TangutSup}) (128:
                                    U+18D00..18D7F)
          \p{Block: Telugu}       (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
                                    U+0C00..0C7F)
          \p{Block: Thaana}       (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
                                    U+0780..07BF)
          \p{Block: Thai}         (NOT \p{Thai} NOR \p{Is_Thai}) (128:
                                    U+0E00..0E7F)
          \p{Block: Tibetan}      (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
                                    U+0F00..0FFF)
          \p{Block: Tifinagh}     (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
                                    (80: U+2D30..2D7F)
          \p{Block: Tirhuta}      (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
                                    U+11480..114DF)
          \p{Block: Toto}         (NOT \p{Toto} NOR \p{Is_Toto}) (48:
                                    U+1E290..1E2BF)
          \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
                                    (128)
          \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
                                    TransportAndMap}) (128: U+1F680..1F6FF)
          \p{Block: UCAS}         \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics}
                                    (640)
          \p{Block: UCAS_Ext}     \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended} (80)
          \p{Block: UCAS_Ext_A}   \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended_A} (16)
          \p{Block: Ugaritic}     (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
                                    (32: U+10380..1039F)
          \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
                                    UCAS}) (640: U+1400..167F)
          \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
                                    \p{Blk=UCASExt}) (80: U+18B0..18FF)
          \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended_A}
1487                                    (Short: \p{Blk=UCASExtA}) (16:
1488                                    U+11AB0..11ABF)
1489          \p{Block: Vai}          (NOT \p{Vai} NOR \p{Is_Vai}) (320:
1490                                    U+A500..A63F)
1491          \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1492                                    \p{Variation_Selector} NOR \p{Is_VS})
1493                                    (16: U+FE00..FE0F)
1494          \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1495                                    (240: U+E0100..E01EF)
1496          \p{Block: Vedic_Ext}    \p{Block=Vedic_Extensions} (48)
1497          \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
1498                                    U+1CD0..1CFF)
1499          \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
1500          \p{Block: Vithkuqi}     (NOT \p{Vithkuqi} NOR \p{Is_Vithkuqi})
1501                                    (80: U+10570..105BF)
1502          \p{Block: VS}           \p{Block=Variation_Selectors} (NOT
1503                                    \p{Variation_Selector} NOR \p{Is_VS})
1504                                    (16)
1505          \p{Block: VS_Sup}       \p{Block=Variation_Selectors_Supplement}
1506                                    (240)
1507          \p{Block: Wancho}       (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
1508                                    U+1E2C0..1E2FF)
1509          \p{Block: Warang_Citi}  (NOT \p{Warang_Citi} NOR
1510                                    \p{Is_Warang_Citi}) (96: U+118A0..118FF)
1511          \p{Block: Yezidi}       (NOT \p{Yezidi} NOR \p{Is_Yezidi}) (64:
1512                                    U+10E80..10EBF)
1513          \p{Block: Yi_Radicals}  (64: U+A490..A4CF)
1514          \p{Block: Yi_Syllables} (1168: U+A000..A48F)
1515          \p{Block: Yijing}       \p{Block=Yijing_Hexagram_Symbols} (64)
1516          \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
1517                                    U+4DC0..4DFF)
1518          \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
1519                                    \p{Is_Zanabazar_Square}) (80:
1520                                    U+11A00..11A4F)
1521          \p{Block: Znamenny_Music} \p{Block=Znamenny_Musical_Notation} (208)
1522          \p{Block: Znamenny_Musical_Notation} (Short: \p{Blk=
1523                                    ZnamennyMusic}) (208: U+1CF00..1CFCF)
1524        X \p{Block_Elements}      \p{Block=Block_Elements} (32)
1525          \p{Bopo}                \p{Bopomofo} (= \p{Script_Extensions=
1526                                    Bopomofo}) (NOT \p{Block=Bopomofo}) (117)
1527          \p{Bopomofo}            \p{Script_Extensions=Bopomofo} (Short:
1528                                    \p{Bopo}; NOT \p{Block=Bopomofo}) (117)
1529        X \p{Bopomofo_Ext}        \p{Bopomofo_Extended} (= \p{Block=
1530                                    Bopomofo_Extended}) (32)
1531        X \p{Bopomofo_Extended}   \p{Block=Bopomofo_Extended} (Short:
1532                                    \p{InBopomofoExt}) (32)
1533        X \p{Box_Drawing}         \p{Block=Box_Drawing} (128)
1534          \p{Bpt: *}              \p{Bidi_Paired_Bracket_Type: *}
1535          \p{Brah}                \p{Brahmi} (= \p{Script_Extensions=
1536                                    Brahmi}) (NOT \p{Block=Brahmi}) (115)
1537          \p{Brahmi}              \p{Script_Extensions=Brahmi} (Short:
1538                                    \p{Brah}; NOT \p{Block=Brahmi}) (115)
1539          \p{Brai}                \p{Braille} (= \p{Script_Extensions=
1540                                    Braille}) (256)
1541          \p{Braille}             \p{Script_Extensions=Braille} (Short:
1542                                    \p{Brai}) (256)
1543        X \p{Braille_Patterns}    \p{Block=Braille_Patterns} (Short:
1544                                    \p{InBraille}) (256)
1545          \p{Bugi}                \p{Buginese} (= \p{Script_Extensions=
1546                                    Buginese}) (NOT \p{Block=Buginese}) (31)
1547          \p{Buginese}            \p{Script_Extensions=Buginese} (Short:
1548                                    \p{Bugi}; NOT \p{Block=Buginese}) (31)
1549          \p{Buhd}                \p{Buhid} (= \p{Script_Extensions=Buhid})
1550                                    (NOT \p{Block=Buhid}) (22)
1551          \p{Buhid}               \p{Script_Extensions=Buhid} (Short:
1552                                    \p{Buhd}; NOT \p{Block=Buhid}) (22)
1553        X \p{Byzantine_Music}     \p{Byzantine_Musical_Symbols} (= \p{Block=
1554                                    Byzantine_Musical_Symbols}) (256)
1555        X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1556                                    (Short: \p{InByzantineMusic}) (256)
1557          \p{C} \pC               \p{Other} (= \p{General_Category=Other})
1558                                    (969_578 plus all above-Unicode code
1559                                    points)
1560          \p{Cakm}                \p{Chakma} (= \p{Script_Extensions=
1561                                    Chakma}) (NOT \p{Block=Chakma}) (91)
1562          \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1563                                    (Short: \p{Cans}) (726)
1564        X \p{Canadian_Syllabics}  \p{Unified_Canadian_Aboriginal_Syllabics}
1565                                    (= \p{Block=
1566                                    Unified_Canadian_Aboriginal_Syllabics})
1567                                    (640)
1568        T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1569                                    Not_Reordered} (1_113_200 plus all
1570                                    above-Unicode code points)
1571        T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1572                                    Overlay} (32)
1573        T \p{Canonical_Combining_Class: 6} \p{Canonical_Combining_Class=
1574                                    Han_Reading} (2)
1575        T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1576                                    Nukta} (27)
1577        T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1578                                    Kana_Voicing} (2)
1579        T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1580                                    Virama} (63)
1581        T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1582                                    CCC10} (1)
1583          \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
1584                                    U+05B0)
1585        T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1586                                    CCC11} (1)
1587          \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
1588                                    U+05B1)
1589        T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1590                                    CCC12} (1)
1591          \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
1592                                    U+05B2)
1593        T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1594                                    CCC13} (1)
1595          \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
1596                                    U+05B3)
1597        T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1598                                    CCC14} (1)
1599          \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
1600                                    U+05B4)
1601        T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1602                                    CCC15} (1)
1603          \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
1604                                    U+05B5)
1605        T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1606                                    CCC16} (1)
1607          \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
1608                                    U+05B6)
1609        T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1610                                    CCC17} (1)
1611          \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
1612                                    U+05B7)
1613        T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1614                                    CCC18} (2)
1615          \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
1616                                    U+05B8, U+05C7)
1617        T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1618                                    CCC19} (2)
1619          \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
1620                                    U+05B9..05BA)
1621        T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1622                                    CCC20} (1)
1623          \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
1624                                    U+05BB)
1625        T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1626                                    CCC21} (1)
1627          \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
1628                                    U+05BC)
1629        T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1630                                    CCC22} (1)
1631          \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
1632                                    U+05BD)
1633        T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1634                                    CCC23} (1)
1635          \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
1636                                    U+05BF)
1637        T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1638                                    CCC24} (1)
1639          \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
1640                                    U+05C1)
1641        T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1642                                    CCC25} (1)
1643          \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
1644                                    U+05C2)
1645        T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1646                                    CCC26} (1)
1647          \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
1648                                    U+FB1E)
1649        T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1650                                    CCC27} (2)
1651          \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
1652                                    U+064B, U+08F0)
1653        T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1654                                    CCC28} (2)
1655          \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
1656                                    U+064C, U+08F1)
1657        T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1658                                    CCC29} (2)
1659          \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
1660                                    U+064D, U+08F2)
1661        T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1662                                    CCC30} (2)
1663          \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
1664                                    U+0618, U+064E)
1665        T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1666                                    CCC31} (2)
1667          \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
1668                                    U+0619, U+064F)
1669        T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1670                                    CCC32} (2)
1671          \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
1672                                    U+061A, U+0650)
1673        T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1674                                    CCC33} (1)
1675          \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
1676                                    U+0651)
1677        T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1678                                    CCC34} (1)
1679          \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
1680                                    U+0652)
1681        T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1682                                    CCC35} (1)
1683          \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
1684                                    U+0670)
1685        T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1686                                    CCC36} (1)
1687          \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
1688                                    U+0711)
1689        T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1690                                    CCC84} (1)
1691          \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
1692                                    U+0C55)
1693        T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1694                                    CCC91} (1)
1695          \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
1696                                    U+0C56)
1697        T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1698                                    CCC103} (2)
1699          \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
1700                                    U+0E38..0E39)
1701        T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1702                                    CCC107} (4)
1703          \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
1704                                    U+0E48..0E4B)
1705        T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1706                                    CCC118} (2)
1707          \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
1708                                    U+0EB8..0EB9)
1709        T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1710                                    CCC122} (4)
1711          \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
1712                                    U+0EC8..0ECB)
1713        T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1714                                    CCC129} (1)
1715          \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
1716                                    U+0F71)
1717        T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1718                                    CCC130} (6)
1719          \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
1720                                    U+0F72, U+0F7A..0F7D, U+0F80)
1721        T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1722                                    CCC132} (1)
1723          \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
1724                                    U+0F74)
1725        T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1726                                    CCC133} (0)
1727          \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1728        T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1729                                    Attached_Below_Left} (0)
1730        T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1731                                    Attached_Below} (5)
1732        T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1733                                    Attached_Above} (1)
1734        T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1735                                    Attached_Above_Right} (9)
1736        T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1737                                    Below_Left} (2)
1738        T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1739                                    Below} (177)
1740        T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1741                                    Below_Right} (4)
1742        T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1743                                    Left} (2)
1744        T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1745                                    Right} (1)
1746        T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1747                                    Above_Left} (5)
1748        T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1749                                    Above} (508)
1750        T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1751                                    Above_Right} (5)
1752        T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1753                                    Double_Below} (4)
1754        T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1755                                    Double_Above} (5)
1756        T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1757                                    Iota_Subscript} (1)
1758          \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1759                                    Above} (508)
1760          \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (508:
1761                                    U+0300..0314, U+033D..0344, U+0346,
1762                                    U+034A..034C, U+0350..0352, U+0357 ...)
1763          \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
1764                                    U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
1765          \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (5:
1766                                    U+0315, U+031A, U+0358, U+1DF6, U+302C)
1767          \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1768                                    Above_Left} (5)
1769          \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1770                                    Above_Right} (5)
1771          \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1772                                    Attached_Above} (1)
1773          \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1774                                    Attached_Above_Right} (9)
1775          \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1776                                    Attached_Below} (5)
1777          \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1778                                    Attached_Below_Left} (0)
1779          \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1780                                    (1: U+1DCE)
1781          \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1782                                    \p{Ccc=ATAR}) (9: U+031B, U+0F39,
1783                                    U+1D165..1D166, U+1D16E..1D172)
1784          \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1785                                    (5: U+0321..0322, U+0327..0328, U+1DD0)
1786          \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1787                                    ATBL}) (0)
1788          \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1789                                    Below} (177)
1790          \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (177:
1791                                    U+0316..0319, U+031C..0320,
1792                                    U+0323..0326, U+0329..0333,
1793                                    U+0339..033C, U+0347..0349 ...)
1794          \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (2:
1795                                    U+1DFA, U+302A)
1796          \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
1797                                    U+059A, U+05AD, U+1939, U+302D)
1798          \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1799                                    Below_Left} (2)
1800          \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1801                                    Below_Right} (4)
1802          \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1803                                    Double_Above} (5)
1804          \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1805                                    Double_Below} (4)
1806          \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
1807                                    (5: U+035D..035E, U+0360..0361, U+1DCD)
1808          \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
1809                                    (4: U+035C, U+035F, U+0362, U+1DFC)
1810          \p{Canonical_Combining_Class: Han_Reading} (Short: \p{Ccc=HANR})
1811                                    (2: U+16FF0..16FF1)
1812          \p{Canonical_Combining_Class: HANR} \p{Canonical_Combining_Class=
1813                                    Han_Reading} (2)
1814          \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1815                                    (1: U+0345)
1816          \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1817                                    Iota_Subscript} (1)
1818          \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
1819                                    (2: U+3099..309A)
1820          \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1821                                    Kana_Voicing} (2)
1822          \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1823                                    Left} (2)
1824          \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
1825                                    U+302E..302F)
1826          \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1827                                    Nukta} (27)
1828          \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1829                                    (1_113_200 plus all above-Unicode code
1830                                    points: U+0000..02FF, U+034F,
1831                                    U+0370..0482, U+0488..0590, U+05BE,
1832                                    U+05C0 ...)
1833          \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1834                                    Not_Reordered} (1_113_200 plus all
1835                                    above-Unicode code points)
1836          \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (27:
1837                                    U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
1838                                    U+0C3C ...)
1839          \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1840                                    Overlay} (32)
1841          \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
1842                                    U+0334..0338, U+1CD4, U+1CE2..1CE8,
1843                                    U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
1844                                    ...)
1845          \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1846                                    Right} (1)
1847          \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
1848                                    U+1D16D)
1849          \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (63:
1850                                    U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
1851                                    U+0BCD ...)
1852          \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1853                                    Virama} (63)
1854          \p{Cans}                \p{Canadian_Aboriginal} (=
1855                                    \p{Script_Extensions=
1856                                    Canadian_Aboriginal}) (726)
1857          \p{Cari}                \p{Carian} (= \p{Script_Extensions=
1858                                    Carian}) (NOT \p{Block=Carian}) (49)
1859          \p{Carian}              \p{Script_Extensions=Carian} (Short:
1860                                    \p{Cari}; NOT \p{Block=Carian}) (49)
1861          \p{Case_Ignorable}      \p{Case_Ignorable=Y} (Short: \p{CI}) (2602)
1862          \p{Case_Ignorable: N*}  (Short: \p{CI=N}, \P{CI}) (1_111_510 plus
1863                                    all above-Unicode code points: [\x00-
1864                                    \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-Z
1865                                    \[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-\xac\xae
1866                                    \xb0-\xb3\xb5-\xb6\xb9-\xff],
1867                                    U+0100..02AF, U+0370..0373,
1868                                    U+0376..0379, U+037B..0383, U+0386 ...)
1869          \p{Case_Ignorable: Y*}  (Short: \p{CI=Y}, \p{CI}) (2602: [\'.:\^`
1870                                    \xa8\xad\xaf\xb4\xb7-\xb8],
1871                                    U+02B0..036F, U+0374..0375, U+037A,
1872                                    U+0384..0385, U+0387 ...)
1873          \p{Cased}               \p{Cased=Y} (4453)
1874          \p{Cased: N*}           (Single: \P{Cased}) (1_109_659 plus all
1875                                    above-Unicode code points: [\x00-\x20!
1876                                    \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
1877                                    \^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9
1878                                    \xbb-\xbf\xd7\xf7], U+01BB,
1879                                    U+01C0..01C3, U+0294, U+02B9..02BF,
1880                                    U+02C2..02DF ...)
1881          \p{Cased: Y*}           (Single: \p{Cased}) (4453: [A-Za-z\xaa
1882                                    \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
1883                                    U+0100..01BA, U+01BC..01BF,
1884                                    U+01C4..0293, U+0295..02B8, U+02C0..02C1
1885                                    ...)
1886          \p{Cased_Letter}        \p{General_Category=Cased_Letter} (Short:
1887                                    \p{LC}) (4089)
1888          \p{Category: *}         \p{General_Category: *}
1889          \p{Caucasian_Albanian}  \p{Script_Extensions=Caucasian_Albanian}
1890                                    (Short: \p{Aghb}; NOT \p{Block=
1891                                    Caucasian_Albanian}) (53)
1892          \p{Cc}                  \p{XPosixCntrl} (= \p{General_Category=
1893                                    Control}) (65)
1894          \p{Ccc: *}              \p{Canonical_Combining_Class: *}
1895          \p{CE}                  \p{Composition_Exclusion} (=
1896                                    \p{Composition_Exclusion=Y}) (81)
1897          \p{CE: *}               \p{Composition_Exclusion: *}
1898          \p{Cf}                  \p{Format} (= \p{General_Category=Format})
1899                                    (163)
1900          \p{Chakma}              \p{Script_Extensions=Chakma} (Short:
1901                                    \p{Cakm}; NOT \p{Block=Chakma}) (91)
1902          \p{Cham}                \p{Script_Extensions=Cham} (NOT \p{Block=
1903                                    Cham}) (83)
1904          \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
1905                                    \p{CWCF}) (1506)
1906          \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
1907                                    (1_112_606 plus all above-Unicode code
1908                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1909                                    \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1910                                    \xb4\xb6-\xbf\xd7\xe0-\xff], U+0101,
1911                                    U+0103, U+0105, U+0107, U+0109 ...)
1912          \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
1913                                    (1506: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
1914                                    U+0100, U+0102, U+0104, U+0106, U+0108
1915                                    ...)
1916          \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
1917                                    \p{CWCM}) (2927)
1918          \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
1919                                    (1_111_185 plus all above-Unicode code
1920                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1921                                    \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xb4
1922                                    \xb6-\xbf\xd7\xf7], U+0138, U+018D,
1923                                    U+019B, U+01AA..01AB, U+01BA..01BB ...)
1924          \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
1925                                    (2927: [A-Za-z\xb5\xc0-\xd6\xd8-\xf6
1926                                    \xf8-\xff], U+0100..0137, U+0139..018C,
1927                                    U+018E..019A, U+019C..01A9, U+01AC..01B9
1928                                    ...)
1929          \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
1930                                    \p{CWL}) (1433)
1931          \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
1932                                    (1_112_679 plus all above-Unicode code
1933                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1934                                    \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1935                                    \xbf\xd7\xdf-\xff], U+0101, U+0103,
1936                                    U+0105, U+0107, U+0109 ...)
1937          \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1433:
1938                                    [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
1939                                    U+0104, U+0106, U+0108 ...)
1940          \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
1941                                    Y} (Short: \p{CWKCF}) (10_429)
1942          \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
1943                                    \P{CWKCF}) (1_103_683 plus all above-
1944                                    Unicode code points: [\x00-\x20!\"#\$
1945                                    \%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-
1946                                    z\{\|\}~\x7f-\x9f\xa1-\xa7\xa9\xab-\xac
1947                                    \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xd7\xe0-
1948                                    \xff], U+0101, U+0103, U+0105, U+0107,
1949                                    U+0109 ...)
1950          \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
1951                                    \p{CWKCF}) (10_429: [A-Z\xa0\xa8\xaa
1952                                    \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
1953                                    \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
1954                                    U+0106, U+0108 ...)
1955          \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
1956                                    \p{CWT}) (1452)
1957          \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
1958                                    (1_112_660 plus all above-Unicode code
1959                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1960                                    \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
1961                                    \xb4\xb6-\xde\xf7], U+0100, U+0102,
1962                                    U+0104, U+0106, U+0108 ...)
1963          \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1452:
1964                                    [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1965                                    U+0103, U+0105, U+0107, U+0109 ...)
1966          \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
1967                                    \p{CWU}) (1525)
1968          \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
1969                                    (1_112_587 plus all above-Unicode code
1970                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1971                                    \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
1972                                    \xb4\xb6-\xde\xf7], U+0100, U+0102,
1973                                    U+0104, U+0106, U+0108 ...)
1974          \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1525:
1975                                    [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1976                                    U+0103, U+0105, U+0107, U+0109 ...)
1977          \p{Cher}                \p{Cherokee} (= \p{Script_Extensions=
1978                                    Cherokee}) (NOT \p{Block=Cherokee}) (172)
1979          \p{Cherokee}            \p{Script_Extensions=Cherokee} (Short:
1980                                    \p{Cher}; NOT \p{Block=Cherokee}) (172)
1981        X \p{Cherokee_Sup}        \p{Cherokee_Supplement} (= \p{Block=
1982                                    Cherokee_Supplement}) (80)
1983        X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
1984                                    \p{InCherokeeSup}) (80)
1985        X \p{Chess_Symbols}       \p{Block=Chess_Symbols} (112)
1986          \p{Chorasmian}          \p{Script_Extensions=Chorasmian} (Short:
1987                                    \p{Chrs}; NOT \p{Block=Chorasmian}) (28)
1988          \p{Chrs}                \p{Chorasmian} (= \p{Script_Extensions=
1989                                    Chorasmian}) (NOT \p{Block=Chorasmian})
1990                                    (28)
1991          \p{CI}                  \p{Case_Ignorable} (= \p{Case_Ignorable=
1992                                    Y}) (2602)
1993          \p{CI: *}               \p{Case_Ignorable: *}
1994        X \p{CJK}                 \p{CJK_Unified_Ideographs} (= \p{Block=
1995                                    CJK_Unified_Ideographs}) (20_992)
1996        X \p{CJK_Compat}          \p{CJK_Compatibility} (= \p{Block=
1997                                    CJK_Compatibility}) (256)
1998        X \p{CJK_Compat_Forms}    \p{CJK_Compatibility_Forms} (= \p{Block=
1999                                    CJK_Compatibility_Forms}) (32)
2000        X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
2001                                    \p{Block=CJK_Compatibility_Ideographs})
2002                                    (512)
2003        X \p{CJK_Compat_Ideographs_Sup}
2004                                    \p{CJK_Compatibility_Ideographs_Supplement}
2005                                    (= \p{Block=
2006                                    CJK_Compatibility_Ideographs_Supplement})
2007                                    (544)
2008        X \p{CJK_Compatibility}   \p{Block=CJK_Compatibility} (Short:
2009                                    \p{InCJKCompat}) (256)
2010        X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
2011                                    (Short: \p{InCJKCompatForms}) (32)
2012        X \p{CJK_Compatibility_Ideographs} \p{Block=
2013                                    CJK_Compatibility_Ideographs} (Short:
2014                                    \p{InCJKCompatIdeographs}) (512)
2015        X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
2016                                    CJK_Compatibility_Ideographs_Supplement}
2017                                    (Short: \p{InCJKCompatIdeographsSup})
2018                                    (544)
2019        X \p{CJK_Ext_A}           \p{CJK_Unified_Ideographs_Extension_A} (=
2020                                    \p{Block=
2021                                    CJK_Unified_Ideographs_Extension_A})
2022                                    (6592)
2023        X \p{CJK_Ext_B}           \p{CJK_Unified_Ideographs_Extension_B} (=
2024                                    \p{Block=
2025                                    CJK_Unified_Ideographs_Extension_B})
2026                                    (42_720)
2027        X \p{CJK_Ext_C}           \p{CJK_Unified_Ideographs_Extension_C} (=
2028                                    \p{Block=
2029                                    CJK_Unified_Ideographs_Extension_C})
2030                                    (4160)
2031        X \p{CJK_Ext_D}           \p{CJK_Unified_Ideographs_Extension_D} (=
2032                                    \p{Block=
2033                                    CJK_Unified_Ideographs_Extension_D})
2034                                    (224)
2035        X \p{CJK_Ext_E}           \p{CJK_Unified_Ideographs_Extension_E} (=
2036                                    \p{Block=
2037                                    CJK_Unified_Ideographs_Extension_E})
2038                                    (5776)
2039        X \p{CJK_Ext_F}           \p{CJK_Unified_Ideographs_Extension_F} (=
2040                                    \p{Block=
2041                                    CJK_Unified_Ideographs_Extension_F})
2042                                    (7488)
2043        X \p{CJK_Ext_G}           \p{CJK_Unified_Ideographs_Extension_G} (=
2044                                    \p{Block=
2045                                    CJK_Unified_Ideographs_Extension_G})
2046                                    (4944)
2047        X \p{CJK_Radicals_Sup}    \p{CJK_Radicals_Supplement} (= \p{Block=
2048                                    CJK_Radicals_Supplement}) (128)
2049        X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
2050                                    (Short: \p{InCJKRadicalsSup}) (128)
2051        X \p{CJK_Strokes}         \p{Block=CJK_Strokes} (48)
2052        X \p{CJK_Symbols}         \p{CJK_Symbols_And_Punctuation} (=
2053                                    \p{Block=CJK_Symbols_And_Punctuation})
2054                                    (64)
2055        X \p{CJK_Symbols_And_Punctuation} \p{Block=
2056                                    CJK_Symbols_And_Punctuation} (Short:
2057                                    \p{InCJKSymbols}) (64)
2058        X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
2059                                    (Short: \p{InCJK}) (20_992)
2060        X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
2061                                    CJK_Unified_Ideographs_Extension_A}
2062                                    (Short: \p{InCJKExtA}) (6592)
2063        X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
2064                                    CJK_Unified_Ideographs_Extension_B}
2065                                    (Short: \p{InCJKExtB}) (42_720)
2066        X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
2067                                    CJK_Unified_Ideographs_Extension_C}
2068                                    (Short: \p{InCJKExtC}) (4160)
2069        X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
2070                                    CJK_Unified_Ideographs_Extension_D}
2071                                    (Short: \p{InCJKExtD}) (224)
2072        X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
2073                                    CJK_Unified_Ideographs_Extension_E}
2074                                    (Short: \p{InCJKExtE}) (5776)
2075        X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
2076                                    CJK_Unified_Ideographs_Extension_F}
2077                                    (Short: \p{InCJKExtF}) (7488)
2078        X \p{CJK_Unified_Ideographs_Extension_G} \p{Block=
2079                                    CJK_Unified_Ideographs_Extension_G}
2080                                    (Short: \p{InCJKExtG}) (4944)
2081          \p{Close_Punctuation}   \p{General_Category=Close_Punctuation}
2082                                    (Short: \p{Pe}) (77)
2083          \p{Cn}                  \p{Unassigned} (= \p{General_Category=
2084                                    Unassigned}) (829_834 plus all above-
2085                                    Unicode code points)
2086          \p{Cntrl}               \p{XPosixCntrl} (= \p{General_Category=
2087                                    Control}) (65)
2088          \p{Co}                  \p{Private_Use} (= \p{General_Category=
2089                                    Private_Use}) (NOT \p{Private_Use_Area})
2090                                    (137_468)
2091        X \p{Combining_Diacritical_Marks} \p{Block=
2092                                    Combining_Diacritical_Marks} (Short:
2093                                    \p{InDiacriticals}) (112)
2094        X \p{Combining_Diacritical_Marks_Extended} \p{Block=
2095                                    Combining_Diacritical_Marks_Extended}
2096                                    (Short: \p{InDiacriticalsExt}) (80)
2097        X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
2098                                    Combining_Diacritical_Marks_For_Symbols}
2099                                    (Short: \p{InDiacriticalsForSymbols})
2100                                    (48)
2101        X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
2102                                    Combining_Diacritical_Marks_Supplement}
2103                                    (Short: \p{InDiacriticalsSup}) (64)
2104        X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
2105                                    \p{InHalfMarks}) (16)
2106          \p{Combining_Mark}      \p{Mark} (= \p{General_Category=Mark})
2107                                    (2408)
2108        X \p{Combining_Marks_For_Symbols}
2109                                    \p{Combining_Diacritical_Marks_For_Symbols}
2110                                    (= \p{Block=
2111                                    Combining_Diacritical_Marks_For_Symbols})
2112                                    (48)
2113          \p{Common}              \p{Script_Extensions=Common} (Short:
2114                                    \p{Zyyy}) (7824)
2115        X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
2116                                    (Short: \p{InIndicNumberForms}) (16)
2117          \p{Comp_Ex}             \p{Full_Composition_Exclusion} (=
2118                                    \p{Full_Composition_Exclusion=Y}) (1120)
2119          \p{Comp_Ex: *}          \p{Full_Composition_Exclusion: *}
2120        X \p{Compat_Jamo}         \p{Hangul_Compatibility_Jamo} (= \p{Block=
2121                                    Hangul_Compatibility_Jamo}) (96)
2122          \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
2123                                    \p{CE}) (81)
2124          \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
2125                                    plus all above-Unicode code points:
2126                                    U+0000..0957, U+0960..09DB, U+09DE,
2127                                    U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
2128                                    ...)
2129          \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
2130                                    U+0958..095F, U+09DC..09DD, U+09DF,
2131                                    U+0A33, U+0A36, U+0A59..0A5B ...)
2132          \p{Connector_Punctuation} \p{General_Category=
2133                                    Connector_Punctuation} (Short: \p{Pc})
2134                                    (10)
2135          \p{Control}             \p{XPosixCntrl} (= \p{General_Category=
2136                                    Control}) (65)
2137        X \p{Control_Pictures}    \p{Block=Control_Pictures} (64)
2138          \p{Copt}                \p{Coptic} (= \p{Script_Extensions=
2139                                    Coptic}) (NOT \p{Block=Coptic}) (165)
2140          \p{Coptic}              \p{Script_Extensions=Coptic} (Short:
2141                                    \p{Copt}; NOT \p{Block=Coptic}) (165)
2142        X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
2143        X \p{Counting_Rod}        \p{Counting_Rod_Numerals} (= \p{Block=
2144                                    Counting_Rod_Numerals}) (32)
2145        X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
2146                                    \p{InCountingRod}) (32)
2147          \p{Cpmn}                \p{Cypro_Minoan} (= \p{Script_Extensions=
2148                                    Cypro_Minoan}) (NOT \p{Block=
2149                                    Cypro_Minoan}) (101)
2150          \p{Cprt}                \p{Cypriot} (= \p{Script_Extensions=
2151                                    Cypriot}) (112)
2152          \p{Cs}                  \p{Surrogate} (= \p{General_Category=
2153                                    Surrogate}) (2048)
2154          \p{Cuneiform}           \p{Script_Extensions=Cuneiform} (Short:
2155                                    \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
2156        X \p{Cuneiform_Numbers}   \p{Cuneiform_Numbers_And_Punctuation} (=
2157                                    \p{Block=
2158                                    Cuneiform_Numbers_And_Punctuation}) (128)
2159        X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
2160                                    Cuneiform_Numbers_And_Punctuation}
2161                                    (Short: \p{InCuneiformNumbers}) (128)
2162          \p{Currency_Symbol}     \p{General_Category=Currency_Symbol}
2163                                    (Short: \p{Sc}) (63)
2164        X \p{Currency_Symbols}    \p{Block=Currency_Symbols} (48)
2165          \p{CWCF}                \p{Changes_When_Casefolded} (=
2166                                    \p{Changes_When_Casefolded=Y}) (1506)
2167          \p{CWCF: *}             \p{Changes_When_Casefolded: *}
2168          \p{CWCM}                \p{Changes_When_Casemapped} (=
2169                                    \p{Changes_When_Casemapped=Y}) (2927)
2170          \p{CWCM: *}             \p{Changes_When_Casemapped: *}
2171          \p{CWKCF}               \p{Changes_When_NFKC_Casefolded} (=
2172                                    \p{Changes_When_NFKC_Casefolded=Y})
2173                                    (10_429)
2174          \p{CWKCF: *}            \p{Changes_When_NFKC_Casefolded: *}
2175          \p{CWL}                 \p{Changes_When_Lowercased} (=
2176                                    \p{Changes_When_Lowercased=Y}) (1433)
2177          \p{CWL: *}              \p{Changes_When_Lowercased: *}
2178          \p{CWT}                 \p{Changes_When_Titlecased} (=
2179                                    \p{Changes_When_Titlecased=Y}) (1452)
2180          \p{CWT: *}              \p{Changes_When_Titlecased: *}
2181          \p{CWU}                 \p{Changes_When_Uppercased} (=
2182                                    \p{Changes_When_Uppercased=Y}) (1525)
2183          \p{CWU: *}              \p{Changes_When_Uppercased: *}
2184          \p{Cypriot}             \p{Script_Extensions=Cypriot} (Short:
2185                                    \p{Cprt}) (112)
2186        X \p{Cypriot_Syllabary}   \p{Block=Cypriot_Syllabary} (64)
2187          \p{Cypro_Minoan}        \p{Script_Extensions=Cypro_Minoan} (Short:
2188                                    \p{Cpmn}; NOT \p{Block=Cypro_Minoan})
2189                                    (101)
2190          \p{Cyrillic}            \p{Script_Extensions=Cyrillic} (Short:
2191                                    \p{Cyrl}; NOT \p{Block=Cyrillic}) (447)
2192        X \p{Cyrillic_Ext_A}      \p{Cyrillic_Extended_A} (= \p{Block=
2193                                    Cyrillic_Extended_A}) (32)
2194        X \p{Cyrillic_Ext_B}      \p{Cyrillic_Extended_B} (= \p{Block=
2195                                    Cyrillic_Extended_B}) (96)
2196        X \p{Cyrillic_Ext_C}      \p{Cyrillic_Extended_C} (= \p{Block=
2197                                    Cyrillic_Extended_C}) (16)
2198        X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
2199                                    \p{InCyrillicExtA}) (32)
2200        X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
2201                                    \p{InCyrillicExtB}) (96)
2202        X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
2203                                    \p{InCyrillicExtC}) (16)
2204        X \p{Cyrillic_Sup}        \p{Cyrillic_Supplement} (= \p{Block=
2205                                    Cyrillic_Supplement}) (48)
2206        X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
2207                                    \p{InCyrillicSup}) (48)
2208        X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
2209                                    Cyrillic_Supplement}) (48)
2210          \p{Cyrl}                \p{Cyrillic} (= \p{Script_Extensions=
2211                                    Cyrillic}) (NOT \p{Block=Cyrillic}) (447)
2212          \p{Dash}                \p{Dash=Y} (30)
2213          \p{Dash: N*}            (Single: \P{Dash}) (1_114_082 plus all
2214                                    above-Unicode code points: [\x00-\x20!
2215                                    \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
2216                                    \[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2217                                    U+0100..0589, U+058B..05BD,
2218                                    U+05BF..13FF, U+1401..1805, U+1807..200F
2219                                    ...)
2220          \p{Dash: Y*}            (Single: \p{Dash}) (30: [\-], U+058A,
2221                                    U+05BE, U+1400, U+1806, U+2010..2015 ...)
2222          \p{Dash_Punctuation}    \p{General_Category=Dash_Punctuation}
2223                                    (Short: \p{Pd}) (26)
2224          \p{Decimal_Number}      \p{XPosixDigit} (= \p{General_Category=
2225                                    Decimal_Number}) (660)
2226          \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
2227                                    (13_233)
2228          \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_233:
2229                                    [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd
2230                                    \xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-\xfd
2231                                    \xff], U+0100..010F, U+0112..0125,
2232                                    U+0128..0130, U+0134..0137, U+0139..013E
2233                                    ...)
2234          \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
2235                                    U+2460..2473, U+24B6..24EA,
2236                                    U+3244..3247, U+3251..327E,
2237                                    U+3280..32BF, U+32D0..32FE ...)
2238          \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
2239          \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720: [\xa8
2240                                    \xaf\xb4-\xb5\xb8], U+0132..0133,
2241                                    U+013F..0140, U+0149, U+017F,
2242                                    U+01C4..01CC ...)
2243          \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
2244          \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
2245          \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
2246                                    U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
2247                                    ...)
2248          \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1194: U+2102,
2249                                    U+210A..2113, U+2115, U+2119..211D,
2250                                    U+2124, U+2128 ...)
2251          \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
2252          \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
2253                                    \xbe], U+2150..215F, U+2189)
2254          \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
2255          \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
2256                                    U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
2257                                    ...)
2258          \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
2259          \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
2260                                    U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
2261                                    ...)
2262          \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
2263          \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
2264                                    U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
2265                                    ...)
2266          \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
2267          \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
2268                                    U+FF61..FFBE, U+FFC2..FFC7,
2269                                    U+FFCA..FFCF, U+FFD2..FFD7,
2270                                    U+FFDA..FFDC, U+FFE8..FFEE)
2271          \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
2272          \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
2273                                    U+0F0C, U+2007, U+2011, U+202F)
2274          \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
2275                                    Non_Canonical} (Perl extension) (3734)
2276          \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
2277                                    decompositions (Short: \p{Dt=NonCanon})
2278                                    (Perl extension) (3734: [\xa0\xa8\xaa
2279                                    \xaf\xb2-\xb5\xb8-\xba\xbc-\xbe],
2280                                    U+0132..0133, U+013F..0140, U+0149,
2281                                    U+017F, U+01C4..01CC ...)
2282          \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_145 plus
2283                                    all above-Unicode code points: [\x00-
2284                                    \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
2285                                    \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-\xdf
2286                                    \xe6\xf0\xf7-\xf8\xfe], U+0110..0111,
2287                                    U+0126..0127, U+0131, U+0138,
2288                                    U+0141..0142 ...)
2289          \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
2290                                    U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
2291          \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
2292          \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
2293          \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
2294                                    U+32CC..32CF, U+32FF..3357,
2295                                    U+3371..33DF, U+33FF, U+1F130..1F14F ...)
2296          \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38: U+1D62..1D6A,
2297                                    U+2080..208E, U+2090..209C, U+2C7C)
2298          \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (213)
2299          \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (213: [\xaa\xb2-
2300                                    \xb3\xb9-\xba], U+02B0..02B8,
2301                                    U+02E0..02E4, U+10FC, U+1D2C..1D2E,
2302                                    U+1D30..1D3A ...)
2303          \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
2304          \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
2305                                    U+30FF, U+FE10..FE19, U+FE30..FE44,
2306                                    U+FE47..FE48)
2307          \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
2308                                    U+FF01..FF60, U+FFE0..FFE6)
2309          \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
2310                                    Y} (Short: \p{DI}) (4174)
2311          \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
2312                                    (1_109_938 plus all above-Unicode code
2313                                    points: [\x00-\xac\xae-\xff],
2314                                    U+0100..034E, U+0350..061B,
2315                                    U+061D..115E, U+1161..17B3, U+17B6..180A
2316                                    ...)
2317          \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
2318                                    (4174: [\xad], U+034F, U+061C,
2319                                    U+115F..1160, U+17B4..17B5, U+180B..180F
2320                                    ...)
2321          \p{Dep}                 \p{Deprecated} (= \p{Deprecated=Y}) (15)
2322          \p{Dep: *}              \p{Deprecated: *}
2323          \p{Deprecated}          \p{Deprecated=Y} (Short: \p{Dep}) (15)
2324          \p{Deprecated: N*}      (Short: \p{Dep=N}, \P{Dep}) (1_114_097
2325                                    plus all above-Unicode code points:
2326                                    U+0000..0148, U+014A..0672,
2327                                    U+0674..0F76, U+0F78, U+0F7A..17A2,
2328                                    U+17A5..2069 ...)
2329          \p{Deprecated: Y*}      (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
2330                                    U+0673, U+0F77, U+0F79, U+17A3..17A4,
2331                                    U+206A..206F ...)
2332          \p{Deseret}             \p{Script_Extensions=Deseret} (Short:
2333                                    \p{Dsrt}) (80)
2334          \p{Deva}                \p{Devanagari} (= \p{Script_Extensions=
2335                                    Devanagari}) (NOT \p{Block=Devanagari})
2336                                    (210)
2337          \p{Devanagari}          \p{Script_Extensions=Devanagari} (Short:
2338                                    \p{Deva}; NOT \p{Block=Devanagari}) (210)
2339        X \p{Devanagari_Ext}      \p{Devanagari_Extended} (= \p{Block=
2340                                    Devanagari_Extended}) (32)
2341        X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
2342                                    \p{InDevanagariExt}) (32)
2343          \p{DI}                  \p{Default_Ignorable_Code_Point} (=
2344                                    \p{Default_Ignorable_Code_Point=Y})
2345                                    (4174)
2346          \p{DI: *}               \p{Default_Ignorable_Code_Point: *}
2347          \p{Dia}                 \p{Diacritic} (= \p{Diacritic=Y}) (1064)
2348          \p{Dia: *}              \p{Diacritic: *}
2349          \p{Diacritic}           \p{Diacritic=Y} (Short: \p{Dia}) (1064)
2350          \p{Diacritic: N*}       (Short: \p{Dia=N}, \P{Dia}) (1_113_048
2351                                    plus all above-Unicode code points:
2352                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2353                                    >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
2354                                    \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
2355                                    U+0100..02AF, U+034F, U+0358..035C,
2356                                    U+0363..0373, U+0376..0379 ...)
2357          \p{Diacritic: Y*}       (Short: \p{Dia=Y}, \p{Dia}) (1064: [\^`
2358                                    \xa8\xaf\xb4\xb7-\xb8], U+02B0..034E,
2359                                    U+0350..0357, U+035D..0362,
2360                                    U+0374..0375, U+037A ...)
2361        X \p{Diacriticals}        \p{Combining_Diacritical_Marks} (=
2362                                    \p{Block=Combining_Diacritical_Marks})
2363                                    (112)
2364        X \p{Diacriticals_Ext}    \p{Combining_Diacritical_Marks_Extended}
2365                                    (= \p{Block=
2366                                    Combining_Diacritical_Marks_Extended})
2367                                    (80)
2368        X \p{Diacriticals_For_Symbols}
2369                                    \p{Combining_Diacritical_Marks_For_Symbols}
2370                                    (= \p{Block=
2371                                    Combining_Diacritical_Marks_For_Symbols})
2372                                    (48)
2373        X \p{Diacriticals_Sup}    \p{Combining_Diacritical_Marks_Supplement}
2374                                    (= \p{Block=
2375                                    Combining_Diacritical_Marks_Supplement})
2376                                    (64)
2377          \p{Diak}                \p{Dives_Akuru} (= \p{Script_Extensions=
2378                                    Dives_Akuru}) (NOT \p{Block=
2379                                    Dives_Akuru}) (72)
2380          \p{Digit}               \p{XPosixDigit} (= \p{General_Category=
2381                                    Decimal_Number}) (660)
2382        X \p{Dingbats}            \p{Block=Dingbats} (192)
2383          \p{Dives_Akuru}         \p{Script_Extensions=Dives_Akuru} (Short:
2384                                    \p{Diak}; NOT \p{Block=Dives_Akuru}) (72)
2385          \p{Dogr}                \p{Dogra} (= \p{Script_Extensions=Dogra})
2386                                    (NOT \p{Block=Dogra}) (82)
2387          \p{Dogra}               \p{Script_Extensions=Dogra} (Short:
2388                                    \p{Dogr}; NOT \p{Block=Dogra}) (82)
2389        X \p{Domino}              \p{Domino_Tiles} (= \p{Block=
2390                                    Domino_Tiles}) (112)
2391        X \p{Domino_Tiles}        \p{Block=Domino_Tiles} (Short:
2392                                    \p{InDomino}) (112)
2393          \p{Dsrt}                \p{Deseret} (= \p{Script_Extensions=
2394                                    Deseret}) (80)
2395          \p{Dt: *}               \p{Decomposition_Type: *}
2396          \p{Dupl}                \p{Duployan} (= \p{Script_Extensions=
2397                                    Duployan}) (NOT \p{Block=Duployan}) (147)
2398          \p{Duployan}            \p{Script_Extensions=Duployan} (Short:
2399                                    \p{Dupl}; NOT \p{Block=Duployan}) (147)
2400          \p{Ea: *}               \p{East_Asian_Width: *}
2401        X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
2402                                    (208)
2403          \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
2404          \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739: [\xa1
2405                                    \xa4\xa7-\xa8\xaa\xad-\xae\xb0-\xb4\xb6-
2406                                    \xba\xbc-\xbf\xc6\xd0\xd7-\xd8\xde-\xe1
2407                                    \xe6\xe8-\xea\xec-\xed\xf0\xf2-\xf3\xf7-
2408                                    \xfa\xfc\xfe], U+0101, U+0111, U+0113,
2409                                    U+011B, U+0126..0127 ...)
2410          \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
2411          \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
2412                                    U+FF01..FF60, U+FFE0..FFE6)
2413          \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
2414          \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
2415                                    U+FF61..FFBE, U+FFC2..FFC7,
2416                                    U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
2417                                    ...)
2418          \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (792_645 plus
2419                                    all above-Unicode code points)
2420          \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
2421          \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-\x7e
2422                                    \xa2-\xa3\xa5-\xa6\xac\xaf],
2423                                    U+27E6..27ED, U+2985..2986)
2424          \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (792_645 plus all
2425                                    above-Unicode code points: [\x00-\x1f
2426                                    \x7f-\xa0\xa9\xab\xb5\xbb\xc0-\xc5\xc7-
2427                                    \xcf\xd1-\xd6\xd9-\xdd\xe2-\xe5\xe7\xeb
2428                                    \xee-\xef\xf1\xf4-\xf6\xfb\xfd\xff],
2429                                    U+0100, U+0102..0110, U+0112,
2430                                    U+0114..011A, U+011C..0125 ...)
2431          \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (182_390)
2432          \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (182_390:
2433                                    U+1100..115F, U+231A..231B,
2434                                    U+2329..232A, U+23E9..23EC, U+23F0,
2435                                    U+23F3 ...)
2436          \p{EBase}               \p{Emoji_Modifier_Base} (=
2437                                    \p{Emoji_Modifier_Base=Y}) (132)
2438          \p{EBase: *}            \p{Emoji_Modifier_Base: *}
2439          \p{EComp}               \p{Emoji_Component} (= \p{Emoji_Component=
2440                                    Y}) (146)
2441          \p{EComp: *}            \p{Emoji_Component: *}
2442          \p{Egyp}                \p{Egyptian_Hieroglyphs} (=
2443                                    \p{Script_Extensions=
2444                                    Egyptian_Hieroglyphs}) (NOT \p{Block=
2445                                    Egyptian_Hieroglyphs}) (1080)
2446        X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
2447                                    Egyptian_Hieroglyph_Format_Controls} (16)
2448          \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
2449                                    Egyptian_Hieroglyphs} (Short: \p{Egyp};
2450                                    NOT \p{Block=Egyptian_Hieroglyphs})
2451                                    (1080)
2452          \p{Elba}                \p{Elbasan} (= \p{Script_Extensions=
2453                                    Elbasan}) (NOT \p{Block=Elbasan}) (40)
2454          \p{Elbasan}             \p{Script_Extensions=Elbasan} (Short:
2455                                    \p{Elba}; NOT \p{Block=Elbasan}) (40)
2456          \p{Elym}                \p{Elymaic} (= \p{Script_Extensions=
2457                                    Elymaic}) (NOT \p{Block=Elymaic}) (23)
2458          \p{Elymaic}             \p{Script_Extensions=Elymaic} (Short:
2459                                    \p{Elym}; NOT \p{Block=Elymaic}) (23)
2460          \p{EMod}                \p{Emoji_Modifier} (= \p{Emoji_Modifier=
2461                                    Y}) (5)
2462          \p{EMod: *}             \p{Emoji_Modifier: *}
2463          \p{Emoji}               \p{Emoji=Y} (1404)
2464          \p{Emoji: N*}           (Single: \P{Emoji}) (1_112_708 plus all
2465                                    above-Unicode code points: [\x00-\x20!
2466                                    \"\$\%&\'\(\)+,\-.\/:;<=>?\@A-Z\[\\\]
2467                                    \^_`a-z\{\|\}~\x7f-\xa8\xaa-\xad\xaf-
2468                                    \xff], U+0100..203B, U+203D..2048,
2469                                    U+204A..2121, U+2123..2138, U+213A..2193
2470                                    ...)
2471          \p{Emoji: Y*}           (Single: \p{Emoji}) (1404: [#*0-9\xa9
2472                                    \xae], U+203C, U+2049, U+2122, U+2139,
2473                                    U+2194..2199 ...)
2474          \p{Emoji_Component}     \p{Emoji_Component=Y} (Short: \p{EComp})
2475                                    (146)
2476          \p{Emoji_Component: N*} (Short: \p{EComp=N}, \P{EComp}) (1_113_966
2477                                    plus all above-Unicode code points:
2478                                    [\x00-\x20!\"\$\%&\'\(\)+,\-.\/:;<=>?
2479                                    \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2480                                    U+0100..200C, U+200E..20E2,
2481                                    U+20E4..FE0E, U+FE10..1F1E5,
2482                                    U+1F200..1F3FA ...)
2483          \p{Emoji_Component: Y*} (Short: \p{EComp=Y}, \p{EComp}) (146:
2484                                    [#*0-9], U+200D, U+20E3, U+FE0F,
2485                                    U+1F1E6..1F1FF, U+1F3FB..1F3FF ...)
2486          \p{Emoji_Modifier}      \p{Emoji_Modifier=Y} (Short: \p{EMod}) (5)
2487          \p{Emoji_Modifier: N*}  (Short: \p{EMod=N}, \P{EMod}) (1_114_107
2488                                    plus all above-Unicode code points:
2489                                    U+0000..1F3FA, U+1F400..infinity)
2490          \p{Emoji_Modifier: Y*}  (Short: \p{EMod=Y}, \p{EMod}) (5:
2491                                    U+1F3FB..1F3FF)
2492          \p{Emoji_Modifier_Base} \p{Emoji_Modifier_Base=Y} (Short:
2493                                    \p{EBase}) (132)
2494          \p{Emoji_Modifier_Base: N*} (Short: \p{EBase=N}, \P{EBase})
2495                                    (1_113_980 plus all above-Unicode code
2496                                    points: U+0000..261C, U+261E..26F8,
2497                                    U+26FA..2709, U+270E..1F384,
2498                                    U+1F386..1F3C1, U+1F3C5..1F3C6 ...)
2499          \p{Emoji_Modifier_Base: Y*} (Short: \p{EBase=Y}, \p{EBase}) (132:
2500                                    U+261D, U+26F9, U+270A..270D, U+1F385,
2501                                    U+1F3C2..1F3C4, U+1F3C7 ...)
2502          \p{Emoji_Presentation}  \p{Emoji_Presentation=Y} (Short:
2503                                    \p{EPres}) (1185)
2504          \p{Emoji_Presentation: N*} (Short: \p{EPres=N}, \P{EPres})
2505                                    (1_112_927 plus all above-Unicode code
2506                                    points: U+0000..2319, U+231C..23E8,
2507                                    U+23ED..23EF, U+23F1..23F2,
2508                                    U+23F4..25FC, U+25FF..2613 ...)
2509          \p{Emoji_Presentation: Y*} (Short: \p{EPres=Y}, \p{EPres}) (1185:
2510                                    U+231A..231B, U+23E9..23EC, U+23F0,
2511                                    U+23F3, U+25FD..25FE, U+2614..2615 ...)
2512        X \p{Emoticons}           \p{Block=Emoticons} (80)
2513        X \p{Enclosed_Alphanum}   \p{Enclosed_Alphanumerics} (= \p{Block=
2514                                    Enclosed_Alphanumerics}) (160)
2515        X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
2516                                    \p{Block=
2517                                    Enclosed_Alphanumeric_Supplement}) (256)
2518        X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
2519                                    Enclosed_Alphanumeric_Supplement}
2520                                    (Short: \p{InEnclosedAlphanumSup}) (256)
2521        X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
2522                                    (Short: \p{InEnclosedAlphanum}) (160)
2523        X \p{Enclosed_CJK}        \p{Enclosed_CJK_Letters_And_Months} (=
2524                                    \p{Block=
2525                                    Enclosed_CJK_Letters_And_Months}) (256)
2526        X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
2527                                    Enclosed_CJK_Letters_And_Months} (Short:
2528                                    \p{InEnclosedCJK}) (256)
2529        X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
2530                                    (= \p{Block=
2531                                    Enclosed_Ideographic_Supplement}) (256)
2532        X \p{Enclosed_Ideographic_Supplement} \p{Block=
2533                                    Enclosed_Ideographic_Supplement} (Short:
2534                                    \p{InEnclosedIdeographicSup}) (256)
2535          \p{Enclosing_Mark}      \p{General_Category=Enclosing_Mark}
2536                                    (Short: \p{Me}) (13)
2537          \p{EPres}               \p{Emoji_Presentation} (=
2538                                    \p{Emoji_Presentation=Y}) (1185)
2539          \p{EPres: *}            \p{Emoji_Presentation: *}
2540          \p{Ethi}                \p{Ethiopic} (= \p{Script_Extensions=
2541                                    Ethiopic}) (NOT \p{Block=Ethiopic}) (523)
2542          \p{Ethiopic}            \p{Script_Extensions=Ethiopic} (Short:
2543                                    \p{Ethi}; NOT \p{Block=Ethiopic}) (523)
2544        X \p{Ethiopic_Ext}        \p{Ethiopic_Extended} (= \p{Block=
2545                                    Ethiopic_Extended}) (96)
2546        X \p{Ethiopic_Ext_A}      \p{Ethiopic_Extended_A} (= \p{Block=
2547                                    Ethiopic_Extended_A}) (48)
2548        X \p{Ethiopic_Ext_B}      \p{Ethiopic_Extended_B} (= \p{Block=
2549                                    Ethiopic_Extended_B}) (32)
2550        X \p{Ethiopic_Extended}   \p{Block=Ethiopic_Extended} (Short:
2551                                    \p{InEthiopicExt}) (96)
2552        X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
2553                                    \p{InEthiopicExtA}) (48)
2554        X \p{Ethiopic_Extended_B} \p{Block=Ethiopic_Extended_B} (Short:
2555                                    \p{InEthiopicExtB}) (32)
2556        X \p{Ethiopic_Sup}        \p{Ethiopic_Supplement} (= \p{Block=
2557                                    Ethiopic_Supplement}) (32)
2558        X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
2559                                    \p{InEthiopicSup}) (32)
2560          \p{Ext}                 \p{Extender} (= \p{Extender=Y}) (50)
2561          \p{Ext: *}              \p{Extender: *}
2562          \p{Extended_Pictographic} \p{Extended_Pictographic=Y} (Short:
2563                                    \p{ExtPict}) (3537)
2564          \p{Extended_Pictographic: N*} (Short: \p{ExtPict=N}, \P{ExtPict})
2565                                    (1_110_575 plus all above-Unicode code
2566                                    points: [\x00-\xa8\xaa-\xad\xaf-\xff],
2567                                    U+0100..203B, U+203D..2048,
2568                                    U+204A..2121, U+2123..2138, U+213A..2193
2569                                    ...)
2570          \p{Extended_Pictographic: Y*} (Short: \p{ExtPict=Y}, \p{ExtPict})
2571                                    (3537: [\xa9\xae], U+203C, U+2049,
2572                                    U+2122, U+2139, U+2194..2199 ...)
2573          \p{Extender}            \p{Extender=Y} (Short: \p{Ext}) (50)
2574          \p{Extender: N*}        (Short: \p{Ext=N}, \P{Ext}) (1_114_062
2575                                    plus all above-Unicode code points:
2576                                    [\x00-\xb6\xb8-\xff], U+0100..02CF,
2577                                    U+02D2..063F, U+0641..07F9,
2578                                    U+07FB..0B54, U+0B56..0E45 ...)
2579          \p{Extender: Y*}        (Short: \p{Ext=Y}, \p{Ext}) (50: [\xb7],
2580                                    U+02D0..02D1, U+0640, U+07FA, U+0B55,
2581                                    U+0E46 ...)
2582          \p{ExtPict}             \p{Extended_Pictographic} (=
2583                                    \p{Extended_Pictographic=Y}) (3537)
2584          \p{ExtPict: *}          \p{Extended_Pictographic: *}
2585          \p{Final_Punctuation}   \p{General_Category=Final_Punctuation}
2586                                    (Short: \p{Pf}) (10)
2587          \p{Format}              \p{General_Category=Format} (Short:
2588                                    \p{Cf}) (163)
2589          \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
2590                                    (Short: \p{CompEx}) (1120)
2591          \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
2592                                    \P{CompEx}) (1_112_992 plus all above-
2593                                    Unicode code points: U+0000..033F,
2594                                    U+0342, U+0345..0373, U+0375..037D,
2595                                    U+037F..0386, U+0388..0957 ...)
2596          \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
2597                                    \p{CompEx}) (1120: U+0340..0341,
2598                                    U+0343..0344, U+0374, U+037E, U+0387,
2599                                    U+0958..095F ...)
2600          \p{Gc: *}               \p{General_Category: *}
2601          \p{GCB: *}              \p{Grapheme_Cluster_Break: *}
2602          \p{General_Category: C} \p{General_Category=Other} (969_578 plus
2603                                    all above-Unicode code points)
2604          \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
2605                                    \p{Gc=LC}, \p{LC}) (4089: [A-Za-z\xb5
2606                                    \xc0-\xd6\xd8-\xf6\xf8-\xff],
2607                                    U+0100..01BA, U+01BC..01BF,
2608                                    U+01C4..0293, U+0295..02AF, U+0370..0373
2609                                    ...)
2610          \p{General_Category: Cc} \p{General_Category=Control} (65)
2611          \p{General_Category: Cf} \p{General_Category=Format} (163)
2612          \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
2613                                    (77: [\)\]\}], U+0F3B, U+0F3D, U+169C,
2614                                    U+2046, U+207E ...)
2615          \p{General_Category: Cn} \p{General_Category=Unassigned} (829_834
2616                                    plus all above-Unicode code points)
2617          \p{General_Category: Cntrl} \p{General_Category=Control} (65)
2618          \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
2619          \p{General_Category: Combining_Mark} \p{General_Category=Mark}
2620                                    (2408)
2621          \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
2622                                    \p{Pc}) (10: [_], U+203F..2040, U+2054,
2623                                    U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
2624          \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
2625                                    [\x00-\x1f\x7f-\x9f])
2626          \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
2627          \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
2628                                    (63: [\$\xa2-\xa5], U+058F, U+060B,
2629                                    U+07FE..07FF, U+09F2..09F3, U+09FB ...)
2630          \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
2631                                    (26: [\-], U+058A, U+05BE, U+1400,
2632                                    U+1806, U+2010..2015 ...)
2633          \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
2634                                    (660: [0-9], U+0660..0669, U+06F0..06F9,
2635                                    U+07C0..07C9, U+0966..096F, U+09E6..09EF
2636                                    ...)
2637          \p{General_Category: Digit} \p{General_Category=Decimal_Number}
2638                                    (660)
2639          \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
2640                                    (13: U+0488..0489, U+1ABE, U+20DD..20E0,
2641                                    U+20E2..20E4, U+A670..A672)
2642          \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
2643                                    (10: [\xbb], U+2019, U+201D, U+203A,
2644                                    U+2E03, U+2E05 ...)
2645          \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (163:
2646                                    [\xad], U+0600..0605, U+061C, U+06DD,
2647                                    U+070F, U+0890..0891 ...)
2648          \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
2649                                    \p{Pi}) (12: [\xab], U+2018,
2650                                    U+201B..201C, U+201F, U+2039, U+2E02 ...)
2651          \p{General_Category: L} \p{General_Category=Letter} (131_756)
2652        X \p{General_Category: L&} \p{General_Category=Cased_Letter} (4089)
2653        X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
2654                                    the trailing '_' matters in spite of
2655                                    loose matching rules. (4089)
2656          \p{General_Category: LC} \p{General_Category=Cased_Letter} (4089)
2657          \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (131_756:
2658                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
2659                                    \xf8-\xff], U+0100..02C1, U+02C6..02D1,
2660                                    U+02E0..02E4, U+02EC, U+02EE ...)
2661          \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
2662                                    (236: U+16EE..16F0, U+2160..2182,
2663                                    U+2185..2188, U+3007, U+3021..3029,
2664                                    U+3038..303A ...)
2665          \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
2666                                    (1: U+2028)
2667          \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
2668                                    (/i= General_Category=Cased_Letter)
2669                                    (2227)
2670          \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (334)
2671          \p{General_Category: Lo} \p{General_Category=Other_Letter}
2672                                    (127_333)
2673          \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
2674                                    /i= General_Category=Cased_Letter)
2675                                    (2227: [a-z\xb5\xdf-\xf6\xf8-\xff],
2676                                    U+0101, U+0103, U+0105, U+0107, U+0109
2677                                    ...)
2678          \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
2679                                    (/i= General_Category=Cased_Letter) (31)
2680          \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
2681                                    (/i= General_Category=Cased_Letter)
2682                                    (1831)
2683          \p{General_Category: M} \p{General_Category=Mark} (2408)
2684          \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2408:
2685                                    U+0300..036F, U+0483..0489,
2686                                    U+0591..05BD, U+05BF, U+05C1..05C2,
2687                                    U+05C4..05C5 ...)
2688          \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
2689                                    [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
2690                                    U+0606..0608, U+2044, U+2052,
2691                                    U+207A..207C ...)
2692          \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (445)
2693          \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
2694          \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
2695                                    (1950)
2696          \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
2697                                    (334: U+02B0..02C1, U+02C6..02D1,
2698                                    U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
2699          \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
2700                                    (125: [\^`\xa8\xaf\xb4\xb8],
2701                                    U+02C2..02C5, U+02D2..02DF,
2702                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2703          \p{General_Category: N} \p{General_Category=Number} (1791)
2704          \p{General_Category: Nd} \p{General_Category=Decimal_Number} (660)
2705          \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
2706          \p{General_Category: No} \p{General_Category=Other_Number} (895)
2707          \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
2708                                    (1950: U+0300..036F, U+0483..0487,
2709                                    U+0591..05BD, U+05BF, U+05C1..05C2,
2710                                    U+05C4..05C5 ...)
2711          \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1791: [0-9
2712                                    \xb2-\xb3\xb9\xbc-\xbe], U+0660..0669,
2713                                    U+06F0..06F9, U+07C0..07C9,
2714                                    U+0966..096F, U+09E6..09EF ...)
2715          \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
2716                                    (79: [\(\[\{], U+0F3A, U+0F3C, U+169B,
2717                                    U+201A, U+201E ...)
2718          \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (969_578 plus
2719                                    all above-Unicode code points: [\x00-
2720                                    \x1f\x7f-\x9f\xad], U+0378..0379,
2721                                    U+0380..0383, U+038B, U+038D, U+03A2 ...)
2722          \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
2723                                    (127_333: [\xaa\xba], U+01BB,
2724                                    U+01C0..01C3, U+0294, U+05D0..05EA,
2725                                    U+05EF..05F2 ...)
2726          \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
2727                                    (895: [\xb2-\xb3\xb9\xbc-\xbe],
2728                                    U+09F4..09F9, U+0B72..0B77,
2729                                    U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
2730                                    ...)
2731          \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
2732                                    (605: [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7
2733                                    \xb6-\xb7\xbf], U+037E, U+0387,
2734                                    U+055A..055F, U+0589, U+05C0 ...)
2735          \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
2736                                    (6605: [\xa6\xa9\xae\xb0], U+0482,
2737                                    U+058D..058E, U+060E..060F, U+06DE,
2738                                    U+06E9 ...)
2739          \p{General_Category: P} \p{General_Category=Punctuation} (819)
2740          \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
2741                                    \p{Zp}) (1: U+2029)
2742          \p{General_Category: Pc} \p{General_Category=
2743                                    Connector_Punctuation} (10)
2744          \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (26)
2745          \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
2746                                    (77)
2747          \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
2748                                    (10)
2749          \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
2750                                    (12)
2751          \p{General_Category: Po} \p{General_Category=Other_Punctuation}
2752                                    (605)
2753          \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
2754                                    (137_468: U+E000..F8FF, U+F0000..FFFFD,
2755                                    U+100000..10FFFD)
2756          \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (79)
2757          \p{General_Category: Punct} \p{General_Category=Punctuation} (819)
2758          \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (819:
2759                                    [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_\{\}
2760                                    \xa1\xa7\xab\xb6-\xb7\xbb\xbf], U+037E,
2761                                    U+0387, U+055A..055F, U+0589..058A,
2762                                    U+05BE ...)
2763          \p{General_Category: S} \p{General_Category=Symbol} (7741)
2764          \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (63)
2765          \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
2766                                    [\x20\xa0], U+1680, U+2000..200A,
2767                                    U+2028..2029, U+202F, U+205F ...)
2768          \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (125)
2769          \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
2770          \p{General_Category: So} \p{General_Category=Other_Symbol} (6605)
2771          \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
2772                                    (17: [\x20\xa0], U+1680, U+2000..200A,
2773                                    U+202F, U+205F, U+3000)
2774          \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
2775                                    (445: U+0903, U+093B, U+093E..0940,
2776                                    U+0949..094C, U+094E..094F, U+0982..0983
2777                                    ...)
2778          \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
2779                                    U+D800..DFFF)
2780          \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7741:
2781                                    [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
2782                                    \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
2783                                    U+02D2..02DF, U+02E5..02EB, U+02ED,
2784                                    U+02EF..02FF ...)
2785          \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
2786                                    /i= General_Category=Cased_Letter) (31:
2787                                    U+01C5, U+01C8, U+01CB, U+01F2,
2788                                    U+1F88..1F8F, U+1F98..1F9F ...)
2789          \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
2790                                    (829_834 plus all above-Unicode code
2791                                    points: U+0378..0379, U+0380..0383,
2792                                    U+038B, U+038D, U+03A2, U+0530 ...)
2793          \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
2794                                    /i= General_Category=Cased_Letter)
2795                                    (1831: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
2796                                    U+0102, U+0104, U+0106, U+0108 ...)
2797          \p{General_Category: Z} \p{General_Category=Separator} (19)
2798          \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
2799          \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
2800                                    (1)
2801          \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
2802        X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
2803                                    \p{InPunctuation}) (112)
2804        X \p{Geometric_Shapes}    \p{Block=Geometric_Shapes} (96)
2805        X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
2806                                    \p{Block=Geometric_Shapes_Extended})
2807                                    (128)
2808        X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
2809                                    (Short: \p{InGeometricShapesExt}) (128)
2810          \p{Geor}                \p{Georgian} (= \p{Script_Extensions=
2811                                    Georgian}) (NOT \p{Block=Georgian}) (174)
2812          \p{Georgian}            \p{Script_Extensions=Georgian} (Short:
2813                                    \p{Geor}; NOT \p{Block=Georgian}) (174)
2814        X \p{Georgian_Ext}        \p{Georgian_Extended} (= \p{Block=
2815                                    Georgian_Extended}) (48)
2816        X \p{Georgian_Extended}   \p{Block=Georgian_Extended} (Short:
2817                                    \p{InGeorgianExt}) (48)
2818        X \p{Georgian_Sup}        \p{Georgian_Supplement} (= \p{Block=
2819                                    Georgian_Supplement}) (48)
2820        X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
2821                                    \p{InGeorgianSup}) (48)
2822          \p{Glag}                \p{Glagolitic} (= \p{Script_Extensions=
2823                                    Glagolitic}) (NOT \p{Block=Glagolitic})
2824                                    (138)
2825          \p{Glagolitic}          \p{Script_Extensions=Glagolitic} (Short:
2826                                    \p{Glag}; NOT \p{Block=Glagolitic}) (138)
2827        X \p{Glagolitic_Sup}      \p{Glagolitic_Supplement} (= \p{Block=
2828                                    Glagolitic_Supplement}) (48)
2829        X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
2830                                    \p{InGlagoliticSup}) (48)
2831          \p{Gong}                \p{Gunjala_Gondi} (= \p{Script_Extensions=
2832                                    Gunjala_Gondi}) (NOT \p{Block=
2833                                    Gunjala_Gondi}) (65)
2834          \p{Gonm}                \p{Masaram_Gondi} (= \p{Script_Extensions=
2835                                    Masaram_Gondi}) (NOT \p{Block=
2836                                    Masaram_Gondi}) (77)
2837          \p{Goth}                \p{Gothic} (= \p{Script_Extensions=
2838                                    Gothic}) (NOT \p{Block=Gothic}) (27)
2839          \p{Gothic}              \p{Script_Extensions=Gothic} (Short:
2840                                    \p{Goth}; NOT \p{Block=Gothic}) (27)
2841          \p{Gr_Base}             \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
2842                                    (142_539)
2843          \p{Gr_Base: *}          \p{Grapheme_Base: *}
2844          \p{Gr_Ext}              \p{Grapheme_Extend} (= \p{Grapheme_Extend=
2845                                    Y}) (2090)
2846          \p{Gr_Ext: *}           \p{Grapheme_Extend: *}
2847          \p{Gran}                \p{Grantha} (= \p{Script_Extensions=
2848                                    Grantha}) (NOT \p{Block=Grantha}) (116)
2849          \p{Grantha}             \p{Script_Extensions=Grantha} (Short:
2850                                    \p{Gran}; NOT \p{Block=Grantha}) (116)
2851          \p{Graph}               \p{XPosixGraph} (282_146)
2852          \p{Grapheme_Base}       \p{Grapheme_Base=Y} (Short: \p{GrBase})
2853                                    (142_539)
2854          \p{Grapheme_Base: N*}   (Short: \p{GrBase=N}, \P{GrBase}) (971_573
2855                                    plus all above-Unicode code points:
2856                                    [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
2857                                    U+0378..0379, U+0380..0383, U+038B,
2858                                    U+038D ...)
2859          \p{Grapheme_Base: Y*}   (Short: \p{GrBase=Y}, \p{GrBase})
2860                                    (142_539: [\x20-\x7e\xa0-\xac\xae-\xff],
2861                                    U+0100..02FF, U+0370..0377,
2862                                    U+037A..037F, U+0384..038A, U+038C ...)
2863          \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
2864                                    (3886)
2865          \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3886: [^
2866                                    \n\r\x20-\x7e\xa0-\xac\xae-\xff],
2867                                    U+061C, U+180E, U+200B, U+200E..200F,
2868                                    U+2028..202E ...)
2869          \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
2870          \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
2871          \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
2872          \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
2873          \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
2874                                    (0)
2875          \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
2876                                    E_Base_GAZ} (0)
2877          \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
2878                                    E_Modifier} (0)
2879          \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
2880                                    (2095)
2881          \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (2095:
2882                                    U+0300..036F, U+0483..0489,
2883                                    U+0591..05BD, U+05BF, U+05C1..05C2,
2884                                    U+05C4..05C5 ...)
2885          \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
2886                                    Glue_After_Zwj} (0)
2887          \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
2888          \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
2889                                    U+1100..115F, U+A960..A97C)
2890          \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
2891          \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
2892                                    U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
2893                                    ...)
2894          \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
2895                                    U+AC01..AC1B, U+AC1D..AC37,
2896                                    U+AC39..AC53, U+AC55..AC6F,
2897                                    U+AC71..AC8B, U+AC8D..ACA7 ...)
2898          \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_159
2899                                    plus all above-Unicode code points:
2900                                    [\x20-\x7e\xa0-\xac\xae-\xff],
2901                                    U+0100..02FF, U+0370..0482,
2902                                    U+048A..0590, U+05BE, U+05C0 ...)
2903          \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
2904                                    (26)
2905          \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (26:
2906                                    U+0600..0605, U+06DD, U+070F,
2907                                    U+0890..0891, U+08E2, U+0D4E ...)
2908          \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
2909                                    (26: U+1F1E6..1F1FF)
2910          \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
2911                                    Regional_Indicator} (26)
2912          \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
2913                                    SpacingMark} (388)
2914          \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (388:
2915                                    U+0903, U+093B, U+093E..0940,
2916                                    U+0949..094C, U+094E..094F, U+0982..0983
2917                                    ...)
2918          \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
2919                                    U+11A8..11FF, U+D7CB..D7FB)
2920          \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
2921                                    U+1160..11A7, U+D7B0..D7C6)
2922          \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
2923                                    (1_096_159 plus all above-Unicode code
2924                                    points)
2925          \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
2926          \p{Grapheme_Extend}     \p{Grapheme_Extend=Y} (Short: \p{GrExt})
2927                                    (2090)
2928          \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_022
2929                                    plus all above-Unicode code points:
2930                                    U+0000..02FF, U+0370..0482,
2931                                    U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
2932          \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (2090:
2933                                    U+0300..036F, U+0483..0489,
2934                                    U+0591..05BD, U+05BF, U+05C1..05C2,
2935                                    U+05C4..05C5 ...)
2936          \p{Greek}               \p{Script_Extensions=Greek} (Short:
2937                                    \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
2938        X \p{Greek_And_Coptic}    \p{Block=Greek_And_Coptic} (Short:
2939                                    \p{InGreek}) (144)
2940        X \p{Greek_Ext}           \p{Greek_Extended} (= \p{Block=
2941                                    Greek_Extended}) (256)
2942        X \p{Greek_Extended}      \p{Block=Greek_Extended} (Short:
2943                                    \p{InGreekExt}) (256)
2944          \p{Grek}                \p{Greek} (= \p{Script_Extensions=Greek})
2945                                    (NOT \p{Greek_And_Coptic}) (522)
2946          \p{Gujarati}            \p{Script_Extensions=Gujarati} (Short:
2947                                    \p{Gujr}; NOT \p{Block=Gujarati}) (105)
2948          \p{Gujr}                \p{Gujarati} (= \p{Script_Extensions=
2949                                    Gujarati}) (NOT \p{Block=Gujarati}) (105)
2950          \p{Gunjala_Gondi}       \p{Script_Extensions=Gunjala_Gondi}
2951                                    (Short: \p{Gong}; NOT \p{Block=
2952                                    Gunjala_Gondi}) (65)
2953          \p{Gurmukhi}            \p{Script_Extensions=Gurmukhi} (Short:
2954                                    \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
2955          \p{Guru}                \p{Gurmukhi} (= \p{Script_Extensions=
2956                                    Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
2957        X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
2958                                    \p{Block=Halfwidth_And_Fullwidth_Forms})
2959                                    (240)
2960        X \p{Half_Marks}          \p{Combining_Half_Marks} (= \p{Block=
2961                                    Combining_Half_Marks}) (16)
2962        X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
2963                                    Halfwidth_And_Fullwidth_Forms} (Short:
2964                                    \p{InHalfAndFullForms}) (240)
2965          \p{Han}                 \p{Script_Extensions=Han} (94_503)
2966          \p{Hang}                \p{Hangul} (= \p{Script_Extensions=
2967                                    Hangul}) (NOT \p{Hangul_Syllables})
2968                                    (11_775)
2969          \p{Hangul}              \p{Script_Extensions=Hangul} (Short:
2970                                    \p{Hang}; NOT \p{Hangul_Syllables})
2971                                    (11_775)
2972        X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
2973                                    (Short: \p{InCompatJamo}) (96)
2974        X \p{Hangul_Jamo}         \p{Block=Hangul_Jamo} (Short: \p{InJamo})
2975                                    (256)
2976        X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
2977                                    (Short: \p{InJamoExtA}) (32)
2978        X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
2979                                    (Short: \p{InJamoExtB}) (80)
2980          \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
2981                                    (125)
2982          \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
2983                                    U+1100..115F, U+A960..A97C)
2984          \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
2985                                    (399)
2986          \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
2987                                    U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
2988                                    U+AC8C ...)
2989          \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
2990                                    LVT_Syllable} (10_773)
2991          \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
2992                                    (10_773: U+AC01..AC1B, U+AC1D..AC37,
2993                                    U+AC39..AC53, U+AC55..AC6F,
2994                                    U+AC71..AC8B, U+AC8D..ACA7 ...)
2995          \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
2996                                    Not_Applicable} (1_102_583 plus all
2997                                    above-Unicode code points)
2998          \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
2999                                    (1_102_583 plus all above-Unicode code
3000                                    points: U+0000..10FF, U+1200..A95F,
3001                                    U+A97D..ABFF, U+D7A4..D7AF,
3002                                    U+D7C7..D7CA, U+D7FC..infinity)
3003          \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
3004                                    (137)
3005          \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
3006                                    U+11A8..11FF, U+D7CB..D7FB)
3007          \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
3008                                    (95)
3009          \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
3010                                    U+1160..11A7, U+D7B0..D7C6)
3011        X \p{Hangul_Syllables}    \p{Block=Hangul_Syllables} (Short:
3012                                    \p{InHangul}) (11_184)
3013          \p{Hani}                \p{Han} (= \p{Script_Extensions=Han})
3014                                    (94_503)
3015          \p{Hanifi_Rohingya}     \p{Script_Extensions=Hanifi_Rohingya}
3016                                    (Short: \p{Rohg}; NOT \p{Block=
3017                                    Hanifi_Rohingya}) (55)
3018          \p{Hano}                \p{Hanunoo} (= \p{Script_Extensions=
3019                                    Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
3020          \p{Hanunoo}             \p{Script_Extensions=Hanunoo} (Short:
3021                                    \p{Hano}; NOT \p{Block=Hanunoo}) (23)
3022          \p{Hatr}                \p{Hatran} (= \p{Script_Extensions=
3023                                    Hatran}) (NOT \p{Block=Hatran}) (26)
3024          \p{Hatran}              \p{Script_Extensions=Hatran} (Short:
3025                                    \p{Hatr}; NOT \p{Block=Hatran}) (26)
3026          \p{Hebr}                \p{Hebrew} (= \p{Script_Extensions=
3027                                    Hebrew}) (NOT \p{Block=Hebrew}) (134)
3028          \p{Hebrew}              \p{Script_Extensions=Hebrew} (Short:
3029                                    \p{Hebr}; NOT \p{Block=Hebrew}) (134)
3030          \p{Hex}                 \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
3031          \p{Hex: *}              \p{Hex_Digit: *}
3032          \p{Hex_Digit}           \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
3033          \p{Hex_Digit: N*}       (Short: \p{Hex=N}, \P{Hex}) (1_114_068
3034                                    plus all above-Unicode code points:
3035                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
3036                                    \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
3037                                    U+0100..FF0F, U+FF1A..FF20,
3038                                    U+FF27..FF40, U+FF47..infinity)
3039          \p{Hex_Digit: Y*}       (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
3040                                    f], U+FF10..FF19, U+FF21..FF26,
3041                                    U+FF41..FF46)
3042        X \p{High_Private_Use_Surrogates} \p{Block=
3043                                    High_Private_Use_Surrogates} (Short:
3044                                    \p{InHighPUSurrogates}) (128)
3045        X \p{High_PU_Surrogates}  \p{High_Private_Use_Surrogates} (=
3046                                    \p{Block=High_Private_Use_Surrogates})
3047                                    (128)
3048        X \p{High_Surrogates}     \p{Block=High_Surrogates} (896)
3049          \p{Hira}                \p{Hiragana} (= \p{Script_Extensions=
3050                                    Hiragana}) (NOT \p{Block=Hiragana}) (432)
3051          \p{Hiragana}            \p{Script_Extensions=Hiragana} (Short:
3052                                    \p{Hira}; NOT \p{Block=Hiragana}) (432)
3053          \p{Hluw}                \p{Anatolian_Hieroglyphs} (=
3054                                    \p{Script_Extensions=
3055                                    Anatolian_Hieroglyphs}) (NOT \p{Block=
3056                                    Anatolian_Hieroglyphs}) (583)
3057          \p{Hmng}                \p{Pahawh_Hmong} (= \p{Script_Extensions=
3058                                    Pahawh_Hmong}) (NOT \p{Block=
3059                                    Pahawh_Hmong}) (127)
3060          \p{Hmnp}                \p{Nyiakeng_Puachue_Hmong} (=
3061                                    \p{Script_Extensions=
3062                                    Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
3063                                    Nyiakeng_Puachue_Hmong}) (71)
3064          \p{HorizSpace}          \p{XPosixBlank} (18)
3065          \p{Hst: *}              \p{Hangul_Syllable_Type: *}
3066          \p{Hung}                \p{Old_Hungarian} (= \p{Script_Extensions=
3067                                    Old_Hungarian}) (NOT \p{Block=
3068                                    Old_Hungarian}) (108)
3069        D \p{Hyphen}              \p{Hyphen=Y} (11)
3070        D \p{Hyphen: N*}          Supplanted by Line_Break property values;
3071                                    see www.unicode.org/reports/tr14
3072                                    (Single: \P{Hyphen}) (1_114_101 plus all
3073                                    above-Unicode code points: [\x00-\x20!
3074                                    \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
3075                                    \[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-\xff],
3076                                    U+0100..0589, U+058B..1805,
3077                                    U+1807..200F, U+2012..2E16, U+2E18..30FA
3078                                    ...)
3079        D \p{Hyphen: Y*}          Supplanted by Line_Break property values;
3080                                    see www.unicode.org/reports/tr14
3081                                    (Single: \p{Hyphen}) (11: [\-\xad],
3082                                    U+058A, U+1806, U+2010..2011, U+2E17,
3083                                    U+30FB ...)
3084          \p{ID_Continue}         \p{ID_Continue=Y} (Short: \p{IDC}; NOT
3085                                    \p{Ideographic_Description_Characters})
3086                                    (135_072)
3087          \p{ID_Continue: N*}     (Short: \p{IDC=N}, \P{IDC}) (979_040 plus
3088                                    all above-Unicode code points: [\x00-
3089                                    \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
3090                                    \[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4\xb6
3091                                    \xb8-\xb9\xbb-\xbf\xd7\xf7],
3092                                    U+02C2..02C5, U+02D2..02DF,
3093                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
3094          \p{ID_Continue: Y*}     (Short: \p{IDC=Y}, \p{IDC}) (135_072:
3095                                    [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
3096                                    \xd8-\xf6\xf8-\xff], U+0100..02C1,
3097                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
3098                                    U+02EE ...)
3099          \p{ID_Start}            \p{ID_Start=Y} (Short: \p{IDS}) (131_997)
3100          \p{ID_Start: N*}        (Short: \p{IDS=N}, \P{IDS}) (982_115 plus
3101                                    all above-Unicode code points: [\x00-
3102                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@
3103                                    \[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-
3104                                    \xb9\xbb-\xbf\xd7\xf7], U+02C2..02C5,
3105                                    U+02D2..02DF, U+02E5..02EB, U+02ED,
3106                                    U+02EF..036F ...)
3107          \p{ID_Start: Y*}        (Short: \p{IDS=Y}, \p{IDS}) (131_997: [A-
3108                                    Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
3109                                    \xff], U+0100..02C1, U+02C6..02D1,
3110                                    U+02E0..02E4, U+02EC, U+02EE ...)
3111          \p{IDC}                 \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
3112                                    \p{Ideographic_Description_Characters})
3113                                    (135_072)
3114          \p{IDC: *}              \p{ID_Continue: *}
3115          \p{Identifier_Status: Allowed} (107_957: [\'\-.0-9:A-Z_a-z\xb7
3116                                    \xc0-\xd6\xd8-\xf6\xf8-\xff],
3117                                    U+0100..0131, U+0134..013E,
3118                                    U+0141..0148, U+014A..017E, U+018F ...)
3119          \p{Identifier_Status: Restricted} (1_006_155 plus all above-
3120                                    Unicode code points: [\x00-\x20!\"#\$
3121                                    \%&\(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x7f-
3122                                    \xb6\xb8-\xbf\xd7\xf7], U+0132..0133,
3123                                    U+013F..0140, U+0149, U+017F..018E,
3124                                    U+0190..019F ...)
3125          \p{Identifier_Type: Default_Ignorable} (396: [\xad], U+034F,
3126                                    U+061C, U+115F..1160, U+17B4..17B5,
3127                                    U+180B..180F ...)
3128          \p{Identifier_Type: Deprecated} (15: U+0149, U+0673, U+0F77,
3129                                    U+0F79, U+17A3..17A4, U+206A..206F ...)
3130          \p{Identifier_Type: Exclusion} (17_080: U+03E2..03EF,
3131                                    U+0800..082D, U+0830..083E,
3132                                    U+1680..169C, U+16A0..16EA, U+16EE..16F8
3133                                    ...)
3134          \p{Identifier_Type: Inclusion} (19: [\'\-.:\xb7], U+0375, U+058A,
3135                                    U+05F3..05F4, U+06FD..06FE, U+0F0B ...)
3136          \p{Identifier_Type: Limited_Use} (5268: U+0700..070D,
3137                                    U+070F..074A, U+074D..074F,
3138                                    U+07C0..07FA, U+07FD..07FF, U+0840..085B
3139                                    ...)
3140          \p{Identifier_Type: Not_Character} (969_409 plus all above-Unicode
3141                                    code points: [^\t\n\cK\f\r\x20-\x7e\x85
3142                                    \xa0-\xff], U+0378..0379, U+0380..0383,
3143                                    U+038B, U+038D, U+03A2 ...)
3144          \p{Identifier_Type: Not_NFKC} (4859: [\xa0\xa8\xaa\xaf\xb2-\xb5
3145                                    \xb8-\xba\xbc-\xbe], U+0132..0133,
3146                                    U+013F..0140, U+017F, U+01C4..01CC,
3147                                    U+01F1..01F3 ...)
3148          \p{Identifier_Type: Not_XID} (8198: [\t\n\cK\f\r\x20!\"#\$\%&
3149                                    \(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x85
3150                                    \xa1-\xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6
3151                                    \xbb\xbf\xd7\xf7], U+02C2..02C5,
3152                                    U+02D2..02D7, U+02DE..02DF,
3153                                    U+02E5..02EB, U+02ED ...)
3154          \p{Identifier_Type: Obsolete} (1627: U+018D, U+01AA..01AB,
3155                                    U+01B9..01BB, U+01BE..01BF,
3156                                    U+01F6..01F7, U+021C..021D ...)
3157          \p{Identifier_Type: Recommended} (107_938: [0-9A-Z_a-z\xc0-\xd6
3158                                    \xd8-\xf6\xf8-\xff], U+0100..0131,
3159                                    U+0134..013E, U+0141..0148,
3160                                    U+014A..017E, U+018F ...)
3161          \p{Identifier_Type: Technical} (1660: U+0180, U+018D,
3162                                    U+01AA..01AB, U+01BA..01BB, U+01BE,
3163                                    U+01C0..01C3 ...)
3164          \p{Identifier_Type: Uncommon_Use} (393: U+0181..018C, U+018E,
3165                                    U+0190..019F, U+01A2..01A9,
3166                                    U+01AC..01AE, U+01B1..01B8 ...)
3167          \p{Ideo}                \p{Ideographic} (= \p{Ideographic=Y})
3168                                    (101_661)
3169          \p{Ideo: *}             \p{Ideographic: *}
3170          \p{Ideographic}         \p{Ideographic=Y} (Short: \p{Ideo})
3171                                    (101_661)
3172          \p{Ideographic: N*}     (Short: \p{Ideo=N}, \P{Ideo}) (1_012_451
3173                                    plus all above-Unicode code points:
3174                                    U+0000..3005, U+3008..3020,
3175                                    U+302A..3037, U+303B..33FF,
3176                                    U+4DC0..4DFF, U+A000..F8FF ...)
3177          \p{Ideographic: Y*}     (Short: \p{Ideo=Y}, \p{Ideo}) (101_661:
3178                                    U+3006..3007, U+3021..3029,
3179                                    U+3038..303A, U+3400..4DBF,
3180                                    U+4E00..9FFF, U+F900..FA6D ...)
3181        X \p{Ideographic_Description_Characters} \p{Block=
3182                                    Ideographic_Description_Characters}
3183                                    (Short: \p{InIDC}) (16)
3184        X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
3185                                    \p{Block=
3186                                    Ideographic_Symbols_And_Punctuation})
3187                                    (32)
3188        X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
3189                                    Ideographic_Symbols_And_Punctuation}
3190                                    (Short: \p{InIdeographicSymbols}) (32)
3191          \p{IDS}                 \p{ID_Start} (= \p{ID_Start=Y}) (131_997)
3192          \p{IDS: *}              \p{ID_Start: *}
3193          \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
3194                                    \p{IDSB}) (10)
3195          \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
3196                                    (1_114_102 plus all above-Unicode code
3197                                    points: U+0000..2FEF, U+2FF2..2FF3,
3198                                    U+2FFC..infinity)
3199          \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
3200                                    U+2FF0..2FF1, U+2FF4..2FFB)
3201          \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
3202                                    \p{IDST}) (2)
3203          \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
3204                                    (1_114_110 plus all above-Unicode code
3205                                    points: U+0000..2FF1, U+2FF4..infinity)
3206          \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
3207                                    U+2FF2..2FF3)
3208          \p{IDSB}                \p{IDS_Binary_Operator} (=
3209                                    \p{IDS_Binary_Operator=Y}) (10)
3210          \p{IDSB: *}             \p{IDS_Binary_Operator: *}
3211          \p{IDST}                \p{IDS_Trinary_Operator} (=
3212                                    \p{IDS_Trinary_Operator=Y}) (2)
3213          \p{IDST: *}             \p{IDS_Trinary_Operator: *}
3214          \p{Imperial_Aramaic}    \p{Script_Extensions=Imperial_Aramaic}
3215                                    (Short: \p{Armi}; NOT \p{Block=
3216                                    Imperial_Aramaic}) (31)
3217          \p{In: *}               \p{Present_In: *} (Perl extension)
3218        X \p{In_*}                \p{Block: *}
3219        X \p{Indic_Number_Forms}  \p{Common_Indic_Number_Forms} (= \p{Block=
3220                                    Common_Indic_Number_Forms}) (16)
3221          \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
3222                                    (352: U+093C, U+0941..0944, U+094D,
3223                                    U+0952, U+0956..0957, U+0962..0963 ...)
3224          \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
3225                                    BottomAndLeft}) (1: U+A9BF)
3226          \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
3227                                    BottomAndRight}) (4: U+1B3B, U+A9BE,
3228                                    U+A9C0, U+11942)
3229          \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (64:
3230                                    U+093F, U+094E, U+09BF, U+09C7..09C8,
3231                                    U+0A3F, U+0ABF ...)
3232          \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
3233                                    LeftAndRight}) (22: U+09CB..09CC,
3234                                    U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
3235                                    U+0DDC, U+0DDE ...)
3236          \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_896
3237                                    plus all above-Unicode code points:
3238                                    U+0000..08FF, U+0904..0939, U+093D,
3239                                    U+0950, U+0958..0961, U+0964..0980 ...)
3240          \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
3241                                    Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
3242                                    U+10A01, U+10A06)
3243          \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (290:
3244                                    U+0903, U+093B, U+093E, U+0940,
3245                                    U+0949..094C, U+094F ...)
3246          \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (418:
3247                                    U+0900..0902, U+093A, U+0945..0948,
3248                                    U+0951, U+0953..0955, U+0981 ...)
3249          \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
3250                                    TopAndBottom}) (10: U+0C48, U+0F73,
3251                                    U+0F76..0F79, U+0F81, U+1B3C,
3252                                    U+1112E..1112F)
3253          \p{Indic_Positional_Category: Top_And_Bottom_And_Left} (Short:
3254                                    \p{InPC=TopAndBottomAndLeft}) (2:
3255                                    U+103C, U+1171E)
3256          \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
3257                                    \p{InPC=TopAndBottomAndRight}) (1:
3258                                    U+1B3D)
3259          \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
3260                                    TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
3261                                    U+1C29, U+114BB, U+115B9)
3262          \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
3263                                    \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
3264                                    U+0DDD, U+17BF, U+115BB)
3265          \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
3266                                    TopAndRight}) (13: U+0AC9, U+0B57,
3267                                    U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
3268                                    U+1925..1926 ...)
3269          \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
3270                                    VisualOrderLeft}) (19: U+0E40..0E44,
3271                                    U+0EC0..0EC4, U+19B5..19B7, U+19BA,
3272                                    U+AAB5..AAB6, U+AAB9 ...)
3273        X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
3274          \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
3275                                    (17: U+093D, U+09BD, U+0ABD, U+0B3D,
3276                                    U+0C3D, U+0CBD ...)
3277          \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (91:
3278                                    U+0900..0902, U+0981..0982, U+09FC,
3279                                    U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
3280          \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
3281                                    \p{InSC=BrahmiJoiningNumber}) (20:
3282                                    U+11052..11065)
3283          \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
3284                                    CantillationMark}) (59: U+0951..0952,
3285                                    U+0A51, U+0AFA..0AFC, U+1CD0..1CD2,
3286                                    U+1CD4..1CE1, U+1CF4 ...)
3287          \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
3288                                    (2206: U+0915..0939, U+0958..095F,
3289                                    U+0978..097F, U+0995..09A8,
3290                                    U+09AA..09B0, U+09B2 ...)
3291          \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
3292                                    ConsonantDead}) (14: U+09CE, U+0C5D,
3293                                    U+0CDD, U+0D54..0D56, U+0D7A..0D7F,
3294                                    U+1CF2..1CF3)
3295          \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
3296                                    ConsonantFinal}) (70: U+1930..1931,
3297                                    U+1933..1939, U+19C1..19C7,
3298                                    U+1A58..1A59, U+1B03, U+1B81 ...)
3299          \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
3300                                    \p{InSC=ConsonantHeadLetter}) (5:
3301                                    U+0F88..0F8C)
3302          \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
3303                                    \p{InSC=ConsonantInitialPostfixed}) (1:
3304                                    U+1A5A)
3305          \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
3306                                    ConsonantKiller}) (2: U+0E4C, U+17CD)
3307          \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
3308                                    ConsonantMedial}) (31: U+0A75,
3309                                    U+0EBC..0EBD, U+103B..103E,
3310                                    U+105E..1060, U+1082, U+1A55..1A56 ...)
3311          \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
3312                                    \p{InSC=ConsonantPlaceholder}) (22: [\-
3313                                    \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
3314                                    U+104E, U+1900 ...)
3315          \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
3316                                    \p{InSC=ConsonantPrecedingRepha}) (3:
3317                                    U+0D4E, U+11941, U+11D46)
3318          \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
3319                                    ConsonantPrefixed}) (10: U+111C2..111C3,
3320                                    U+1193F, U+11A3A, U+11A84..11A89)
3321          \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
3322                                    ConsonantSubjoined}) (94: U+0F8D..0F97,
3323                                    U+0F99..0FBC, U+1929..192B, U+1A57,
3324                                    U+1A5B..1A5E, U+1BA1..1BA3 ...)
3325          \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
3326                                    \p{InSC=ConsonantSucceedingRepha}) (1:
3327                                    U+17CC)
3328          \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
3329                                    \p{InSC=ConsonantWithStacker}) (8:
3330                                    U+0CF1..0CF2, U+1CF5..1CF6,
3331                                    U+11003..11004, U+11460..11461)
3332          \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
3333                                    GeminationMark}) (3: U+0A71, U+11237,
3334                                    U+11A98)
3335          \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
3336                                    InvisibleStacker}) (12: U+1039, U+17D2,
3337                                    U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
3338          \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
3339                                    U+200D)
3340          \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
3341                                    ModifyingLetter}) (1: U+0B83)
3342          \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
3343                                    NonJoiner}) (1: U+200C)
3344          \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (32:
3345                                    U+093C, U+09BC, U+0A3C, U+0ABC,
3346                                    U+0AFD..0AFF, U+0B3C ...)
3347          \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (491:
3348                                    [0-9], U+0966..096F, U+09E6..09EF,
3349                                    U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
3350                                    ...)
3351          \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
3352                                    NumberJoiner}) (1: U+1107F)
3353          \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
3354                                    (1_109_551 plus all above-Unicode code
3355                                    points: [\x00-\x20!\"#\$\%&\'\(\)*+,.
3356                                    \/:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
3357                                    \x9f\xa1-\xb1\xb4-\xd6\xd8-\xff],
3358                                    U+0100..08FF, U+0950, U+0953..0954,
3359                                    U+0964..0965, U+0970..0971 ...)
3360          \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
3361                                    PureKiller}) (25: U+0D3B..0D3C, U+0E3A,
3362                                    U+0E4E, U+0EBA, U+0F84, U+103A ...)
3363          \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
3364                                    RegisterShifter}) (2: U+17C9..17CA)
3365          \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
3366                                    SyllableModifier}) (25: [\xb2-\xb3],
3367                                    U+09FE, U+0F35, U+0F37, U+0FC6, U+17CB
3368                                    ...)
3369          \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
3370                                    ToneLetter}) (7: U+1970..1974, U+AAC0,
3371                                    U+AAC2)
3372          \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
3373                                    (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
3374                                    U+1063..1064, U+1069..106D, U+1087..108D
3375                                    ...)
3376          \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
3377                                    U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3378                                    U+0BCD ...)
3379          \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3380                                    (35: U+0903, U+0983, U+0A03, U+0A83,
3381                                    U+0B03, U+0C03 ...)
3382          \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3383                                    U+1963..196D, U+A85E..A861, U+A866,
3384                                    U+A922..A92A, U+11150..11154)
3385          \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3386                                    VowelDependent}) (686: U+093A..093B,
3387                                    U+093E..094C, U+094E..094F,
3388                                    U+0955..0957, U+0962..0963, U+09BE..09C4
3389                                    ...)
3390          \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3391                                    VowelIndependent}) (486: U+0904..0914,
3392                                    U+0960..0961, U+0972..0977,
3393                                    U+0985..098C, U+098F..0990, U+0993..0994
3394                                    ...)
3395          \p{Inherited}           \p{Script_Extensions=Inherited} (Short:
3396                                    \p{Zinh}) (586)
3397          \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
3398                                    (Short: \p{Pi}) (12)
3399          \p{InPC: *}             \p{Indic_Positional_Category: *}
3400          \p{InSC: *}             \p{Indic_Syllabic_Category: *}
3401          \p{Inscriptional_Pahlavi} \p{Script_Extensions=
3402                                    Inscriptional_Pahlavi} (Short: \p{Phli};
3403                                    NOT \p{Block=Inscriptional_Pahlavi}) (27)
3404          \p{Inscriptional_Parthian} \p{Script_Extensions=
3405                                    Inscriptional_Parthian} (Short:
3406                                    \p{Prti}; NOT \p{Block=
3407                                    Inscriptional_Parthian}) (30)
3408        X \p{IPA_Ext}             \p{IPA_Extensions} (= \p{Block=
3409                                    IPA_Extensions}) (96)
3410        X \p{IPA_Extensions}      \p{Block=IPA_Extensions} (Short:
3411                                    \p{InIPAExt}) (96)
3412          \p{Is_*}                \p{*} (Any exceptions are individually
3413                                    noted beginning with the word NOT.) If
3414                                    an entry has flag(s) at its beginning,
3415                                    like "D", the "Is_" form has the same
3416                                    flag(s).
3417          \p{Ital}                \p{Old_Italic} (= \p{Script_Extensions=
3418                                    Old_Italic}) (NOT \p{Block=Old_Italic})
3419                                    (39)
3420        X \p{Jamo}                \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
3421                                    (256)
3422        X \p{Jamo_Ext_A}          \p{Hangul_Jamo_Extended_A} (= \p{Block=
3423                                    Hangul_Jamo_Extended_A}) (32)
3424        X \p{Jamo_Ext_B}          \p{Hangul_Jamo_Extended_B} (= \p{Block=
3425                                    Hangul_Jamo_Extended_B}) (80)
3426          \p{Java}                \p{Javanese} (= \p{Script_Extensions=
3427                                    Javanese}) (NOT \p{Block=Javanese}) (91)
3428          \p{Javanese}            \p{Script_Extensions=Javanese} (Short:
3429                                    \p{Java}; NOT \p{Block=Javanese}) (91)
3430          \p{Jg: *}               \p{Joining_Group: *}
3431          \p{Join_C}              \p{Join_Control} (= \p{Join_Control=Y}) (2)
3432          \p{Join_C: *}           \p{Join_Control: *}
3433          \p{Join_Control}        \p{Join_Control=Y} (Short: \p{JoinC}) (2)
3434          \p{Join_Control: N*}    (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
3435                                    plus all above-Unicode code points:
3436                                    U+0000..200B, U+200E..infinity)
3437          \p{Join_Control: Y*}    (Short: \p{JoinC=Y}, \p{JoinC}) (2:
3438                                    U+200C..200D)
3439          \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
3440                                    U+08BB)
3441          \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
3442                                    U+08BD)
3443          \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (2:
3444                                    U+08BC, U+08C4)
3445          \p{Joining_Group: Ain}  (Short: \p{Jg=Ain}) (9: U+0639..063A,
3446                                    U+06A0, U+06FC, U+075D..075F, U+08B3,
3447                                    U+08C3)
3448          \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
3449          \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (29: U+0622..0623,
3450                                    U+0625, U+0627, U+0671..0673, U+0675,
3451                                    U+0773..0774 ...)
3452          \p{Joining_Group: Beh}  (Short: \p{Jg=Beh}) (27: U+0628,
3453                                    U+062A..062B, U+066E, U+0679..0680,
3454                                    U+0750..0756, U+08A0..08A1 ...)
3455          \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
3456          \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
3457                                    BurushaskiYehBarree}) (2: U+077A..077B)
3458          \p{Joining_Group: Dal}  (Short: \p{Jg=Dal}) (15: U+062F..0630,
3459                                    U+0688..0690, U+06EE, U+0759..075A,
3460                                    U+08AE)
3461          \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
3462                                    U+0715..0716, U+072A, U+072F)
3463          \p{Joining_Group: E}    (Short: \p{Jg=E}) (1: U+0725)
3464          \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
3465                                    U+063D..063F, U+06CC, U+06CE,
3466                                    U+0775..0776)
3467          \p{Joining_Group: Fe}   (Short: \p{Jg=Fe}) (1: U+074F)
3468          \p{Joining_Group: Feh}  (Short: \p{Jg=Feh}) (10: U+0641,
3469                                    U+06A1..06A6, U+0760..0761, U+08A4)
3470          \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
3471                                    U+0724)
3472          \p{Joining_Group: Gaf}  (Short: \p{Jg=Gaf}) (17: U+063B..063C,
3473                                    U+06A9, U+06AB, U+06AF..06B4,
3474                                    U+0762..0764, U+088D ...)
3475          \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
3476                                    U+072E)
3477          \p{Joining_Group: Hah}  (Short: \p{Jg=Hah}) (22: U+062C..062E,
3478                                    U+0681..0687, U+06BF, U+0757..0758,
3479                                    U+076E..076F, U+0772 ...)
3480          \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
3481                                    HamzaOnHehGoal}) (1: U+06C3)
3482          \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
3483                                    HanifiRohingyaKinnaYa}) (4: U+10D19,
3484                                    U+10D1E, U+10D20, U+10D23)
3485          \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
3486                                    HanifiRohingyaPa}) (3: U+10D02, U+10D09,
3487                                    U+10D1C)
3488          \p{Joining_Group: He}   (Short: \p{Jg=He}) (1: U+0717)
3489          \p{Joining_Group: Heh}  (Short: \p{Jg=Heh}) (1: U+0647)
3490          \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
3491                                    U+06C1..06C2)
3492          \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
3493          \p{Joining_Group: Kaf}  (Short: \p{Jg=Kaf}) (6: U+0643,
3494                                    U+06AC..06AE, U+077F, U+08B4)
3495          \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
3496          \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
3497          \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
3498                                    U+06BE, U+06FF)
3499          \p{Joining_Group: Lam}  (Short: \p{Jg=Lam}) (8: U+0644,
3500                                    U+06B5..06B8, U+076A, U+08A6, U+08C7)
3501          \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
3502          \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
3503                                    U+0866)
3504          \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
3505                                    U+0861)
3506          \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
3507                                    U+0868)
3508          \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
3509                                    (1: U+0869)
3510          \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
3511                                    U+0860)
3512          \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
3513                                    U+0864)
3514          \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
3515                                    (1: U+0865)
3516          \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
3517                                    U+0862)
3518          \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
3519                                    U+0867)
3520          \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
3521                                    U+086A)
3522          \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
3523                                    U+0863)
3524          \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
3525                                    ManichaeanAleph}) (1: U+10AC0)
3526          \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
3527                                    (2: U+10AD9..10ADA)
3528          \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
3529                                    (2: U+10AC1..10AC2)
3530          \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
3531                                    ManichaeanDaleth}) (1: U+10AC5)
3532          \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
3533                                    ManichaeanDhamedh}) (1: U+10AD4)
3534          \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
3535                                    (1: U+10AEC)
3536          \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
3537                                    ManichaeanGimel}) (2: U+10AC3..10AC4)
3538          \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
3539                                    (1: U+10ACD)
3540          \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
3541                                    ManichaeanHundred}) (1: U+10AEF)
3542          \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
3543                                    (3: U+10AD0..10AD2)
3544          \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
3545                                    ManichaeanLamedh}) (1: U+10AD3)
3546          \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
3547                                    (1: U+10AD6)
3548          \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
3549                                    (1: U+10AD7)
3550          \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
3551                                    (1: U+10AEB)
3552          \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
3553                                    U+10ADB..10ADC)
3554          \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
3555                                    (3: U+10ADE..10AE0)
3556          \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
3557                                    (1: U+10AE1)
3558          \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
3559                                    ManichaeanSadhe}) (1: U+10ADD)
3560          \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
3561                                    ManichaeanSamekh}) (1: U+10AD8)
3562          \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
3563                                    (1: U+10AE4)
3564          \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
3565                                    (1: U+10AED)
3566          \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
3567                                    (1: U+10ACE)
3568          \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
3569                                    ManichaeanThamedh}) (1: U+10AD5)
3570          \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
3571                                    ManichaeanTwenty}) (1: U+10AEE)
3572          \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
3573                                    (1: U+10AC7)
3574          \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
3575                                    (1: U+10ACF)
3576          \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
3577                                    ManichaeanZayin}) (2: U+10AC9..10ACA)
3578          \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
3579                                    U+0765..0766, U+08A7)
3580          \p{Joining_Group: Mim}  (Short: \p{Jg=Mim}) (1: U+0721)
3581          \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
3582                                    (1_113_762 plus all above-Unicode code
3583                                    points: U+0000..061F, U+0621, U+0640,
3584                                    U+064B..066D, U+0670, U+0674 ...)
3585          \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (9: U+0646,
3586                                    U+06B9..06BC, U+0767..0769, U+0889)
3587          \p{Joining_Group: Nun}  (Short: \p{Jg=Nun}) (1: U+0722)
3588          \p{Joining_Group: Nya}  (Short: \p{Jg=Nya}) (1: U+06BD)
3589          \p{Joining_Group: Pe}   (Short: \p{Jg=Pe}) (1: U+0726)
3590          \p{Joining_Group: Qaf}  (Short: \p{Jg=Qaf}) (6: U+0642, U+066F,
3591                                    U+06A7..06A8, U+08A5, U+08B5)
3592          \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
3593          \p{Joining_Group: Reh}  (Short: \p{Jg=Reh}) (19: U+0631..0632,
3594                                    U+0691..0699, U+06EF, U+075B,
3595                                    U+076B..076C, U+0771 ...)
3596          \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
3597                                    U+0727)
3598          \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
3599                                    U+08AC)
3600          \p{Joining_Group: Sad}  (Short: \p{Jg=Sad}) (6: U+0635..0636,
3601                                    U+069D..069E, U+06FB, U+08AF)
3602          \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
3603          \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
3604                                    U+069A..069C, U+06FA, U+075C, U+076D,
3605                                    U+0770 ...)
3606          \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
3607          \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
3608          \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
3609                                    U+08B1)
3610          \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
3611          \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
3612          \p{Joining_Group: Tah}  (Short: \p{Jg=Tah}) (6: U+0637..0638,
3613                                    U+069F, U+088B..088C, U+08A3)
3614          \p{Joining_Group: Taw}  (Short: \p{Jg=Taw}) (1: U+072C)
3615          \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
3616                                    U+0629, U+06C0, U+06D5)
3617          \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
3618                                    Hamza_On_Heh_Goal} (1)
3619          \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
3620          \p{Joining_Group: Thin_Yeh} (Short: \p{Jg=ThinYeh}) (1: U+0886)
3621          \p{Joining_Group: Vertical_Tail} (Short: \p{Jg=VerticalTail}) (1:
3622                                    U+088E)
3623          \p{Joining_Group: Waw}  (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
3624                                    U+0676..0677, U+06C4..06CB, U+06CF,
3625                                    U+0778..0779 ...)
3626          \p{Joining_Group: Yeh}  (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
3627                                    U+0649..064A, U+0678, U+06D0..06D1,
3628                                    U+0777 ...)
3629          \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
3630                                    U+06D2..06D3)
3631          \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
3632                                    U+06CD)
3633          \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
3634          \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
3635          \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
3636          \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
3637          \p{Joining_Type: C}     \p{Joining_Type=Join_Causing} (7)
3638          \p{Joining_Type: D}     \p{Joining_Type=Dual_Joining} (610)
3639          \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (610: U+0620,
3640                                    U+0626, U+0628, U+062A..062E,
3641                                    U+0633..063F, U+0641..0647 ...)
3642          \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (7: U+0640,
3643                                    U+07FA, U+0883..0885, U+180A, U+200D)
3644          \p{Joining_Type: L}     \p{Joining_Type=Left_Joining} (5)
3645          \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (5: U+A872,
3646                                    U+10ACD, U+10AD7, U+10D00, U+10FCB)
3647          \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_230 plus
3648                                    all above-Unicode code points: [\x00-
3649                                    \xac\xae-\xff], U+0100..02FF,
3650                                    U+0370..0482, U+048A..0590, U+05BE,
3651                                    U+05C0 ...)
3652          \p{Joining_Type: R}     \p{Joining_Type=Right_Joining} (152)
3653          \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (152:
3654                                    U+0622..0625, U+0627, U+0629,
3655                                    U+062F..0632, U+0648, U+0671..0673 ...)
3656          \p{Joining_Type: T}     \p{Joining_Type=Transparent} (2108)
3657          \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (2108: [\xad],
3658                                    U+0300..036F, U+0483..0489,
3659                                    U+0591..05BD, U+05BF, U+05C1..05C2 ...)
3660          \p{Joining_Type: U}     \p{Joining_Type=Non_Joining} (1_111_230
3661                                    plus all above-Unicode code points)
3662          \p{Jt: *}               \p{Joining_Type: *}
3663          \p{Kaithi}              \p{Script_Extensions=Kaithi} (Short:
3664                                    \p{Kthi}; NOT \p{Block=Kaithi}) (88)
3665          \p{Kali}                \p{Kayah_Li} (= \p{Script_Extensions=
3666                                    Kayah_Li}) (48)
3667          \p{Kana}                \p{Katakana} (= \p{Script_Extensions=
3668                                    Katakana}) (NOT \p{Block=Katakana}) (372)
3669        X \p{Kana_Ext_A}          \p{Kana_Extended_A} (= \p{Block=
3670                                    Kana_Extended_A}) (48)
3671        X \p{Kana_Ext_B}          \p{Kana_Extended_B} (= \p{Block=
3672                                    Kana_Extended_B}) (16)
3673        X \p{Kana_Extended_A}     \p{Block=Kana_Extended_A} (Short:
3674                                    \p{InKanaExtA}) (48)
3675        X \p{Kana_Extended_B}     \p{Block=Kana_Extended_B} (Short:
3676                                    \p{InKanaExtB}) (16)
3677        X \p{Kana_Sup}            \p{Kana_Supplement} (= \p{Block=
3678                                    Kana_Supplement}) (256)
3679        X \p{Kana_Supplement}     \p{Block=Kana_Supplement} (Short:
3680                                    \p{InKanaSup}) (256)
3681        X \p{Kanbun}              \p{Block=Kanbun} (16)
3682        X \p{Kangxi}              \p{Kangxi_Radicals} (= \p{Block=
3683                                    Kangxi_Radicals}) (224)
3684        X \p{Kangxi_Radicals}     \p{Block=Kangxi_Radicals} (Short:
3685                                    \p{InKangxi}) (224)
3686          \p{Kannada}             \p{Script_Extensions=Kannada} (Short:
3687                                    \p{Knda}; NOT \p{Block=Kannada}) (105)
3688          \p{Katakana}            \p{Script_Extensions=Katakana} (Short:
3689                                    \p{Kana}; NOT \p{Block=Katakana}) (372)
3690        X \p{Katakana_Ext}        \p{Katakana_Phonetic_Extensions} (=
3691                                    \p{Block=Katakana_Phonetic_Extensions})
3692                                    (16)
3693        X \p{Katakana_Phonetic_Extensions} \p{Block=
3694                                    Katakana_Phonetic_Extensions} (Short:
3695                                    \p{InKatakanaExt}) (16)
3696          \p{Kayah_Li}            \p{Script_Extensions=Kayah_Li} (Short:
3697                                    \p{Kali}) (48)
3698          \p{Khar}                \p{Kharoshthi} (= \p{Script_Extensions=
3699                                    Kharoshthi}) (NOT \p{Block=Kharoshthi})
3700                                    (68)
3701          \p{Kharoshthi}          \p{Script_Extensions=Kharoshthi} (Short:
3702                                    \p{Khar}; NOT \p{Block=Kharoshthi}) (68)
3703          \p{Khitan_Small_Script} \p{Script_Extensions=Khitan_Small_Script}
3704                                    (Short: \p{Kits}; NOT \p{Block=
3705                                    Khitan_Small_Script}) (471)
3706          \p{Khmer}               \p{Script_Extensions=Khmer} (Short:
3707                                    \p{Khmr}; NOT \p{Block=Khmer}) (146)
3708        X \p{Khmer_Symbols}       \p{Block=Khmer_Symbols} (32)
3709          \p{Khmr}                \p{Khmer} (= \p{Script_Extensions=Khmer})
3710                                    (NOT \p{Block=Khmer}) (146)
3711          \p{Khoj}                \p{Khojki} (= \p{Script_Extensions=
3712                                    Khojki}) (NOT \p{Block=Khojki}) (82)
3713          \p{Khojki}              \p{Script_Extensions=Khojki} (Short:
3714                                    \p{Khoj}; NOT \p{Block=Khojki}) (82)
3715          \p{Khudawadi}           \p{Script_Extensions=Khudawadi} (Short:
3716                                    \p{Sind}; NOT \p{Block=Khudawadi}) (81)
3717          \p{Kits}                \p{Khitan_Small_Script} (=
3718                                    \p{Script_Extensions=
3719                                    Khitan_Small_Script}) (NOT \p{Block=
3720                                    Khitan_Small_Script}) (471)
3721          \p{Knda}                \p{Kannada} (= \p{Script_Extensions=
3722                                    Kannada}) (NOT \p{Block=Kannada}) (105)
3723          \p{Kthi}                \p{Kaithi} (= \p{Script_Extensions=
3724                                    Kaithi}) (NOT \p{Block=Kaithi}) (88)
3725          \p{L} \pL               \p{Letter} (= \p{General_Category=Letter})
3726                                    (131_756)
3727        X \p{L&}                  \p{Cased_Letter} (= \p{General_Category=
3728                                    Cased_Letter}) (4089)
3729        X \p{L_}                  \p{Cased_Letter} (= \p{General_Category=
3730                                    Cased_Letter}) Note the trailing '_'
3731                                    matters in spite of loose matching
3732                                    rules. (4089)
3733          \p{Lana}                \p{Tai_Tham} (= \p{Script_Extensions=
3734                                    Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
3735          \p{Lao}                 \p{Script_Extensions=Lao} (NOT \p{Block=
3736                                    Lao}) (82)
3737          \p{Laoo}                \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
3738                                    \p{Block=Lao}) (82)
3739          \p{Latin}               \p{Script_Extensions=Latin} (Short:
3740                                    \p{Latn}) (1504)
3741        X \p{Latin_1}             \p{Latin_1_Supplement} (= \p{Block=
3742                                    Latin_1_Supplement}) (128)
3743        X \p{Latin_1_Sup}         \p{Latin_1_Supplement} (= \p{Block=
3744                                    Latin_1_Supplement}) (128)
3745        X \p{Latin_1_Supplement}  \p{Block=Latin_1_Supplement} (Short:
3746                                    \p{InLatin1}) (128)
3747        X \p{Latin_Ext_A}         \p{Latin_Extended_A} (= \p{Block=
3748                                    Latin_Extended_A}) (128)
3749        X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
3750                                    \p{Block=Latin_Extended_Additional})
3751                                    (256)
3752        X \p{Latin_Ext_B}         \p{Latin_Extended_B} (= \p{Block=
3753                                    Latin_Extended_B}) (208)
3754        X \p{Latin_Ext_C}         \p{Latin_Extended_C} (= \p{Block=
3755                                    Latin_Extended_C}) (32)
3756        X \p{Latin_Ext_D}         \p{Latin_Extended_D} (= \p{Block=
3757                                    Latin_Extended_D}) (224)
3758        X \p{Latin_Ext_E}         \p{Latin_Extended_E} (= \p{Block=
3759                                    Latin_Extended_E}) (64)
3760        X \p{Latin_Ext_F}         \p{Latin_Extended_F} (= \p{Block=
3761                                    Latin_Extended_F}) (64)
3762        X \p{Latin_Ext_G}         \p{Latin_Extended_G} (= \p{Block=
3763                                    Latin_Extended_G}) (256)
3764        X \p{Latin_Extended_A}    \p{Block=Latin_Extended_A} (Short:
3765                                    \p{InLatinExtA}) (128)
3766        X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
3767                                    (Short: \p{InLatinExtAdditional}) (256)
3768        X \p{Latin_Extended_B}    \p{Block=Latin_Extended_B} (Short:
3769                                    \p{InLatinExtB}) (208)
3770        X \p{Latin_Extended_C}    \p{Block=Latin_Extended_C} (Short:
3771                                    \p{InLatinExtC}) (32)
3772        X \p{Latin_Extended_D}    \p{Block=Latin_Extended_D} (Short:
3773                                    \p{InLatinExtD}) (224)
3774        X \p{Latin_Extended_E}    \p{Block=Latin_Extended_E} (Short:
3775                                    \p{InLatinExtE}) (64)
3776        X \p{Latin_Extended_F}    \p{Block=Latin_Extended_F} (Short:
3777                                    \p{InLatinExtF}) (64)
3778        X \p{Latin_Extended_G}    \p{Block=Latin_Extended_G} (Short:
3779                                    \p{InLatinExtG}) (256)
3780          \p{Latn}                \p{Latin} (= \p{Script_Extensions=Latin})
3781                                    (1504)
3782          \p{Lb: *}               \p{Line_Break: *}
3783          \p{LC}                  \p{Cased_Letter} (= \p{General_Category=
3784                                    Cased_Letter}) (4089)
3785          \p{Lepc}                \p{Lepcha} (= \p{Script_Extensions=
3786                                    Lepcha}) (NOT \p{Block=Lepcha}) (74)
3787          \p{Lepcha}              \p{Script_Extensions=Lepcha} (Short:
3788                                    \p{Lepc}; NOT \p{Block=Lepcha}) (74)
3789          \p{Letter}              \p{General_Category=Letter} (Short: \p{L})
3790                                    (131_756)
3791          \p{Letter_Number}       \p{General_Category=Letter_Number} (Short:
3792                                    \p{Nl}) (236)
3793        X \p{Letterlike_Symbols}  \p{Block=Letterlike_Symbols} (80)
3794          \p{Limb}                \p{Limbu} (= \p{Script_Extensions=Limbu})
3795                                    (NOT \p{Block=Limbu}) (69)
3796          \p{Limbu}               \p{Script_Extensions=Limbu} (Short:
3797                                    \p{Limb}; NOT \p{Block=Limbu}) (69)
3798          \p{Lina}                \p{Linear_A} (= \p{Script_Extensions=
3799                                    Linear_A}) (NOT \p{Block=Linear_A}) (386)
3800          \p{Linb}                \p{Linear_B} (= \p{Script_Extensions=
3801                                    Linear_B}) (268)
3802          \p{Line_Break: AI}      \p{Line_Break=Ambiguous} (707)
3803          \p{Line_Break: AL}      \p{Line_Break=Alphabetic} (22_043)
3804          \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (22_043: [#&*<=>\@A-
3805                                    Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
3806                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
3807                                    U+02CE..02CF, U+02D1..02D7, U+02DC,
3808                                    U+02DE ...)
3809          \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-\xa8\xaa
3810                                    \xb2-\xb3\xb6-\xba\xbc-\xbe\xd7\xf7],
3811                                    U+02C7, U+02C9..02CB, U+02CD, U+02D0,
3812                                    U+02D8..02DB ...)
3813          \p{Line_Break: B2}      \p{Line_Break=Break_Both} (3)
3814          \p{Line_Break: BA}      \p{Line_Break=Break_After} (247)
3815          \p{Line_Break: BB}      \p{Line_Break=Break_Before} (45)
3816          \p{Line_Break: BK}      \p{Line_Break=Mandatory_Break} (4)
3817          \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (247: [\t\|\xad],
3818                                    U+058A, U+05BE, U+0964..0965,
3819                                    U+0E5A..0E5B, U+0F0B ...)
3820          \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (45: [\xb4],
3821                                    U+02C8, U+02CC, U+02DF, U+0C77, U+0C84
3822                                    ...)
3823          \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
3824                                    U+2E3A..2E3B)
3825          \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
3826          \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
3827          \p{Line_Break: CB}      \p{Line_Break=Contingent_Break} (1)
3828          \p{Line_Break: CJ}      \p{Line_Break=
3829                                    Conditional_Japanese_Starter} (58)
3830          \p{Line_Break: CL}      \p{Line_Break=Close_Punctuation} (95)
3831          \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
3832          \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (95: [\}],
3833                                    U+0F3B, U+0F3D, U+169C, U+2046, U+207E
3834                                    ...)
3835          \p{Line_Break: CM}      \p{Line_Break=Combining_Mark} (2399)
3836          \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2399: [^\t\n
3837                                    \cK\f\r\x20-\x7e\x85\xa0-\xff],
3838                                    U+0300..034E, U+0350..035B,
3839                                    U+0363..036F, U+0483..0489, U+0591..05BD
3840                                    ...)
3841          \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (757:
3842                                    U+0E01..0E3A, U+0E40..0E4E,
3843                                    U+0E81..0E82, U+0E84, U+0E86..0E8A,
3844                                    U+0E8C..0EA3 ...)
3845          \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
3846                                    (58: U+3041, U+3043, U+3045, U+3047,
3847                                    U+3049, U+3063 ...)
3848          \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
3849          \p{Line_Break: CP}      \p{Line_Break=Close_Parenthesis} (2)
3850          \p{Line_Break: CR}      \p{Line_Break=Carriage_Return} (1)
3851          \p{Line_Break: E_Base}  (Short: \p{Lb=EB}) (132: U+261D, U+26F9,
3852                                    U+270A..270D, U+1F385, U+1F3C2..1F3C4,
3853                                    U+1F3C7 ...)
3854          \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
3855          \p{Line_Break: EB}      \p{Line_Break=E_Base} (132)
3856          \p{Line_Break: EM}      \p{Line_Break=E_Modifier} (5)
3857          \p{Line_Break: EX}      \p{Line_Break=Exclamation} (40)
3858          \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (40: [!?], U+05C6,
3859                                    U+061B, U+061D..061F, U+06D4, U+07F9 ...)
3860          \p{Line_Break: GL}      \p{Line_Break=Glue} (26)
3861          \p{Line_Break: Glue}    (Short: \p{Lb=GL}) (26: [\xa0], U+034F,
3862                                    U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
3863          \p{Line_Break: H2}      (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
3864                                    U+AC38, U+AC54, U+AC70, U+AC8C ...)
3865          \p{Line_Break: H3}      (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
3866                                    U+AC1D..AC37, U+AC39..AC53,
3867                                    U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
3868                                    ...)
3869          \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75:
3870                                    U+05D0..05EA, U+05EF..05F2, U+FB1D,
3871                                    U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
3872                                    ...)
3873          \p{Line_Break: HL}      \p{Line_Break=Hebrew_Letter} (75)
3874          \p{Line_Break: HY}      \p{Line_Break=Hyphen} (1)
3875          \p{Line_Break: Hyphen}  (Short: \p{Lb=HY}) (1: [\-])
3876          \p{Line_Break: ID}      \p{Line_Break=Ideographic} (172_456)
3877          \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_456:
3878                                    U+231A..231B, U+23F0..23F3,
3879                                    U+2600..2603, U+2614..2615, U+2618,
3880                                    U+261A..261C ...)
3881          \p{Line_Break: IN}      \p{Line_Break=Inseparable} (6)
3882          \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
3883                                    U+037E, U+0589, U+060C..060D, U+07F8,
3884                                    U+2044 ...)
3885          \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
3886                                    U+22EF, U+FE19, U+10AF6)
3887          \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
3888          \p{Line_Break: IS}      \p{Line_Break=Infix_Numeric} (13)
3889          \p{Line_Break: JL}      (Short: \p{Lb=JL}) (125: U+1100..115F,
3890                                    U+A960..A97C)
3891          \p{Line_Break: JT}      (Short: \p{Lb=JT}) (137: U+11A8..11FF,
3892                                    U+D7CB..D7FB)
3893          \p{Line_Break: JV}      (Short: \p{Lb=JV}) (95: U+1160..11A7,
3894                                    U+D7B0..D7C6)
3895          \p{Line_Break: LF}      \p{Line_Break=Line_Feed} (1)
3896          \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
3897          \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
3898                                    U+2028..2029)
3899          \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
3900          \p{Line_Break: NL}      \p{Line_Break=Next_Line} (1)
3901          \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6,
3902                                    U+203C..203D, U+2047..2049, U+3005,
3903                                    U+301C, U+303B..303C ...)
3904          \p{Line_Break: NS}      \p{Line_Break=Nonstarter} (33)
3905          \p{Line_Break: NU}      \p{Line_Break=Numeric} (652)
3906          \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (652: [0-9],
3907                                    U+0660..0669, U+066B..066C,
3908                                    U+06F0..06F9, U+07C0..07C9, U+0966..096F
3909                                    ...)
3910          \p{Line_Break: OP}      \p{Line_Break=Open_Punctuation} (92)
3911          \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (92: [\(\[\{
3912                                    \xa1\xbf], U+0F3A, U+0F3C, U+169B,
3913                                    U+201A, U+201E ...)
3914          \p{Line_Break: PO}      \p{Line_Break=Postfix_Numeric} (37)
3915          \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (37: [\%\xa2
3916                                    \xb0], U+0609..060B, U+066A,
3917                                    U+09F2..09F3, U+09F9, U+0D79 ...)
3918          \p{Line_Break: PR}      \p{Line_Break=Prefix_Numeric} (67)
3919          \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (67: [\$+\\\xa3-
3920                                    \xa5\xb1], U+058F, U+07FE..07FF, U+09FB,
3921                                    U+0AF1, U+0BF9 ...)
3922          \p{Line_Break: QU}      \p{Line_Break=Quotation} (39)
3923          \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
3924                                    U+2018..2019, U+201B..201D, U+201F,
3925                                    U+2039..203A, U+275B..2760 ...)
3926          \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
3927                                    U+1F1E6..1F1FF)
3928          \p{Line_Break: RI}      \p{Line_Break=Regional_Indicator} (26)
3929          \p{Line_Break: SA}      \p{Line_Break=Complex_Context} (757)
3930        D \p{Line_Break: SG}      \p{Line_Break=Surrogate} (2048)
3931          \p{Line_Break: SP}      \p{Line_Break=Space} (1)
3932          \p{Line_Break: Space}   (Short: \p{Lb=SP}) (1: [\x20])
3933        D \p{Line_Break: Surrogate} Surrogates should never appear in well-
3934                                    formed text, and therefore shouldn't be
3935                                    the basis for line breaking (Short:
3936                                    \p{Lb=SG}) (2048: U+D800..DFFF)
3937          \p{Line_Break: SY}      \p{Line_Break=Break_Symbols} (1)
3938          \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (900_465 plus all
3939                                    above-Unicode code points: U+0378..0379,
3940                                    U+0380..0383, U+038B, U+038D, U+03A2,
3941                                    U+0530 ...)
3942          \p{Line_Break: WJ}      \p{Line_Break=Word_Joiner} (2)
3943          \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
3944          \p{Line_Break: XX}      \p{Line_Break=Unknown} (900_465 plus all
3945                                    above-Unicode code points)
3946          \p{Line_Break: ZW}      \p{Line_Break=ZWSpace} (1)
3947          \p{Line_Break: ZWJ}     (Short: \p{Lb=ZWJ}) (1: U+200D)
3948          \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
3949          \p{Line_Separator}      \p{General_Category=Line_Separator}
3950                                    (Short: \p{Zl}) (1)
3951          \p{Linear_A}            \p{Script_Extensions=Linear_A} (Short:
3952                                    \p{Lina}; NOT \p{Block=Linear_A}) (386)
3953          \p{Linear_B}            \p{Script_Extensions=Linear_B} (Short:
3954                                    \p{Linb}) (268)
3955        X \p{Linear_B_Ideograms}  \p{Block=Linear_B_Ideograms} (128)
3956        X \p{Linear_B_Syllabary}  \p{Block=Linear_B_Syllabary} (128)
3957          \p{Lisu}                \p{Script_Extensions=Lisu} (NOT \p{Block=
3958                                    Lisu}) (49)
3959        X \p{Lisu_Sup}            \p{Lisu_Supplement} (= \p{Block=
3960                                    Lisu_Supplement}) (16)
3961        X \p{Lisu_Supplement}     \p{Block=Lisu_Supplement} (Short:
3962                                    \p{InLisuSup}) (16)
3963          \p{Ll}                  \p{Lowercase_Letter} (=
3964                                    \p{General_Category=Lowercase_Letter})
3965                                    (/i= General_Category=Cased_Letter)
3966                                    (2227)
3967          \p{Lm}                  \p{Modifier_Letter} (=
3968                                    \p{General_Category=Modifier_Letter})
3969                                    (334)
3970          \p{Lo}                  \p{Other_Letter} (= \p{General_Category=
3971                                    Other_Letter}) (127_333)
3972          \p{LOE}                 \p{Logical_Order_Exception} (=
3973                                    \p{Logical_Order_Exception=Y}) (19)
3974          \p{LOE: *}              \p{Logical_Order_Exception: *}
3975          \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
3976                                    \p{LOE}) (19)
3977          \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
3978                                    (1_114_093 plus all above-Unicode code
3979                                    points: U+0000..0E3F, U+0E45..0EBF,
3980                                    U+0EC5..19B4, U+19B8..19B9,
3981                                    U+19BB..AAB4, U+AAB7..AAB8 ...)
3982          \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
3983                                    U+0E40..0E44, U+0EC0..0EC4,
3984                                    U+19B5..19B7, U+19BA, U+AAB5..AAB6,
3985                                    U+AAB9 ...)
3986        X \p{Low_Surrogates}      \p{Block=Low_Surrogates} (1024)
3987          \p{Lower}               \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3988                                    Cased=Yes) (2471)
3989          \p{Lower: *}            \p{Lowercase: *}
3990          \p{Lowercase}           \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3991                                    Cased=Yes) (2471)
3992          \p{Lowercase: N*}       (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
3993                                    No) (1_111_641 plus all above-Unicode
3994                                    code points: [\x00-\x20!\"#\$\%&\'
3995                                    \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`\{
3996                                    \|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9\xbb-
3997                                    \xde\xf7], U+0100, U+0102, U+0104,
3998                                    U+0106, U+0108 ...)
3999          \p{Lowercase: Y*}       (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
4000                                    Yes) (2471: [a-z\xaa\xb5\xba\xdf-\xf6
4001                                    \xf8-\xff], U+0101, U+0103, U+0105,
4002                                    U+0107, U+0109 ...)
4003          \p{Lowercase_Letter}    \p{General_Category=Lowercase_Letter}
4004                                    (Short: \p{Ll}; /i= General_Category=
4005                                    Cased_Letter) (2227)
4006          \p{Lt}                  \p{Titlecase_Letter} (=
4007                                    \p{General_Category=Titlecase_Letter})
4008                                    (/i= General_Category=Cased_Letter) (31)
4009          \p{Lu}                  \p{Uppercase_Letter} (=
4010                                    \p{General_Category=Uppercase_Letter})
4011                                    (/i= General_Category=Cased_Letter)
4012                                    (1831)
4013          \p{Lyci}                \p{Lycian} (= \p{Script_Extensions=
4014                                    Lycian}) (NOT \p{Block=Lycian}) (29)
4015          \p{Lycian}              \p{Script_Extensions=Lycian} (Short:
4016                                    \p{Lyci}; NOT \p{Block=Lycian}) (29)
4017          \p{Lydi}                \p{Lydian} (= \p{Script_Extensions=
4018                                    Lydian}) (NOT \p{Block=Lydian}) (27)
4019          \p{Lydian}              \p{Script_Extensions=Lydian} (Short:
4020                                    \p{Lydi}; NOT \p{Block=Lydian}) (27)
4021          \p{M} \pM               \p{Mark} (= \p{General_Category=Mark})
4022                                    (2408)
4023          \p{Mahajani}            \p{Script_Extensions=Mahajani} (Short:
4024                                    \p{Mahj}; NOT \p{Block=Mahajani}) (61)
4025          \p{Mahj}                \p{Mahajani} (= \p{Script_Extensions=
4026                                    Mahajani}) (NOT \p{Block=Mahajani}) (61)
4027        X \p{Mahjong}             \p{Mahjong_Tiles} (= \p{Block=
4028                                    Mahjong_Tiles}) (48)
4029        X \p{Mahjong_Tiles}       \p{Block=Mahjong_Tiles} (Short:
4030                                    \p{InMahjong}) (48)
4031          \p{Maka}                \p{Makasar} (= \p{Script_Extensions=
4032                                    Makasar}) (NOT \p{Block=Makasar}) (25)
4033          \p{Makasar}             \p{Script_Extensions=Makasar} (Short:
4034                                    \p{Maka}; NOT \p{Block=Makasar}) (25)
4035          \p{Malayalam}           \p{Script_Extensions=Malayalam} (Short:
4036                                    \p{Mlym}; NOT \p{Block=Malayalam}) (126)
4037          \p{Mand}                \p{Mandaic} (= \p{Script_Extensions=
4038                                    Mandaic}) (NOT \p{Block=Mandaic}) (30)
4039          \p{Mandaic}             \p{Script_Extensions=Mandaic} (Short:
4040                                    \p{Mand}; NOT \p{Block=Mandaic}) (30)
4041          \p{Mani}                \p{Manichaean} (= \p{Script_Extensions=
4042                                    Manichaean}) (NOT \p{Block=Manichaean})
4043                                    (52)
4044          \p{Manichaean}          \p{Script_Extensions=Manichaean} (Short:
4045                                    \p{Mani}; NOT \p{Block=Manichaean}) (52)
4046          \p{Marc}                \p{Marchen} (= \p{Script_Extensions=
4047                                    Marchen}) (NOT \p{Block=Marchen}) (68)
4048          \p{Marchen}             \p{Script_Extensions=Marchen} (Short:
4049                                    \p{Marc}; NOT \p{Block=Marchen}) (68)
4050          \p{Mark}                \p{General_Category=Mark} (Short: \p{M})
4051                                    (2408)
4052          \p{Masaram_Gondi}       \p{Script_Extensions=Masaram_Gondi}
4053                                    (Short: \p{Gonm}; NOT \p{Block=
4054                                    Masaram_Gondi}) (77)
4055          \p{Math}                \p{Math=Y} (2310)
4056          \p{Math: N*}            (Single: \P{Math}) (1_111_802 plus all
4057                                    above-Unicode code points: [\x00-\x20!
4058                                    \"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-Z
4059                                    \[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
4060                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
4061                                    U+03D3..03D4, U+03D6..03EF,
4062                                    U+03F2..03F3, U+03F7..0605 ...)
4063          \p{Math: Y*}            (Single: \p{Math}) (2310: [+<=>\^\|~\xac
4064                                    \xb1\xd7\xf7], U+03D0..03D2, U+03D5,
4065                                    U+03F0..03F1, U+03F4..03F6, U+0606..0608
4066                                    ...)
4067        X \p{Math_Alphanum}       \p{Mathematical_Alphanumeric_Symbols} (=
4068                                    \p{Block=
4069                                    Mathematical_Alphanumeric_Symbols})
4070                                    (1024)
4071        X \p{Math_Operators}      \p{Mathematical_Operators} (= \p{Block=
4072                                    Mathematical_Operators}) (256)
4073          \p{Math_Symbol}         \p{General_Category=Math_Symbol} (Short:
4074                                    \p{Sm}) (948)
4075        X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
4076                                    Mathematical_Alphanumeric_Symbols}
4077                                    (Short: \p{InMathAlphanum}) (1024)
4078        X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
4079                                    (Short: \p{InMathOperators}) (256)
4080        X \p{Mayan_Numerals}      \p{Block=Mayan_Numerals} (32)
4081          \p{Mc}                  \p{Spacing_Mark} (= \p{General_Category=
4082                                    Spacing_Mark}) (445)
4083          \p{Me}                  \p{Enclosing_Mark} (= \p{General_Category=
4084                                    Enclosing_Mark}) (13)
4085          \p{Medefaidrin}         \p{Script_Extensions=Medefaidrin} (Short:
4086                                    \p{Medf}; NOT \p{Block=Medefaidrin}) (91)
4087          \p{Medf}                \p{Medefaidrin} (= \p{Script_Extensions=
4088                                    Medefaidrin}) (NOT \p{Block=
4089                                    Medefaidrin}) (91)
4090          \p{Meetei_Mayek}        \p{Script_Extensions=Meetei_Mayek} (Short:
4091                                    \p{Mtei}; NOT \p{Block=Meetei_Mayek})
4092                                    (79)
4093        X \p{Meetei_Mayek_Ext}    \p{Meetei_Mayek_Extensions} (= \p{Block=
4094                                    Meetei_Mayek_Extensions}) (32)
4095        X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
4096                                    (Short: \p{InMeeteiMayekExt}) (32)
4097          \p{Mend}                \p{Mende_Kikakui} (= \p{Script_Extensions=
4098                                    Mende_Kikakui}) (NOT \p{Block=
4099                                    Mende_Kikakui}) (213)
4100          \p{Mende_Kikakui}       \p{Script_Extensions=Mende_Kikakui}
4101                                    (Short: \p{Mend}; NOT \p{Block=
4102                                    Mende_Kikakui}) (213)
4103          \p{Merc}                \p{Meroitic_Cursive} (=
4104                                    \p{Script_Extensions=Meroitic_Cursive})
4105                                    (NOT \p{Block=Meroitic_Cursive}) (90)
4106          \p{Mero}                \p{Meroitic_Hieroglyphs} (=
4107                                    \p{Script_Extensions=
4108                                    Meroitic_Hieroglyphs}) (32)
4109          \p{Meroitic_Cursive}    \p{Script_Extensions=Meroitic_Cursive}
4110                                    (Short: \p{Merc}; NOT \p{Block=
4111                                    Meroitic_Cursive}) (90)
4112          \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
4113                                    Meroitic_Hieroglyphs} (Short: \p{Mero})
4114                                    (32)
4115          \p{Miao}                \p{Script_Extensions=Miao} (NOT \p{Block=
4116                                    Miao}) (149)
4117        X \p{Misc_Arrows}         \p{Miscellaneous_Symbols_And_Arrows} (=
4118                                    \p{Block=
4119                                    Miscellaneous_Symbols_And_Arrows}) (256)
4120        X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
4121                                    (= \p{Block=
4122                                    Miscellaneous_Mathematical_Symbols_A})
4123                                    (48)
4124        X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
4125                                    (= \p{Block=
4126                                    Miscellaneous_Mathematical_Symbols_B})
4127                                    (128)
4128        X \p{Misc_Pictographs}    \p{Miscellaneous_Symbols_And_Pictographs}
4129                                    (= \p{Block=
4130                                    Miscellaneous_Symbols_And_Pictographs})
4131                                    (768)
4132        X \p{Misc_Symbols}        \p{Miscellaneous_Symbols} (= \p{Block=
4133                                    Miscellaneous_Symbols}) (256)
4134        X \p{Misc_Technical}      \p{Miscellaneous_Technical} (= \p{Block=
4135                                    Miscellaneous_Technical}) (256)
4136        X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
4137                                    Miscellaneous_Mathematical_Symbols_A}
4138                                    (Short: \p{InMiscMathSymbolsA}) (48)
4139        X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
4140                                    Miscellaneous_Mathematical_Symbols_B}
4141                                    (Short: \p{InMiscMathSymbolsB}) (128)
4142        X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
4143                                    \p{InMiscSymbols}) (256)
4144        X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
4145                                    Miscellaneous_Symbols_And_Arrows}
4146                                    (Short: \p{InMiscArrows}) (256)
4147        X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
4148                                    Miscellaneous_Symbols_And_Pictographs}
4149                                    (Short: \p{InMiscPictographs}) (768)
4150        X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
4151                                    (Short: \p{InMiscTechnical}) (256)
4152          \p{Mlym}                \p{Malayalam} (= \p{Script_Extensions=
4153                                    Malayalam}) (NOT \p{Block=Malayalam})
4154                                    (126)
4155          \p{Mn}                  \p{Nonspacing_Mark} (=
4156                                    \p{General_Category=Nonspacing_Mark})
4157                                    (1950)
4158          \p{Modi}                \p{Script_Extensions=Modi} (NOT \p{Block=
4159                                    Modi}) (89)
4160          \p{Modifier_Letter}     \p{General_Category=Modifier_Letter}
4161                                    (Short: \p{Lm}) (334)
4162        X \p{Modifier_Letters}    \p{Spacing_Modifier_Letters} (= \p{Block=
4163                                    Spacing_Modifier_Letters}) (80)
4164          \p{Modifier_Symbol}     \p{General_Category=Modifier_Symbol}
4165                                    (Short: \p{Sk}) (125)
4166        X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
4167          \p{Mong}                \p{Mongolian} (= \p{Script_Extensions=
4168                                    Mongolian}) (NOT \p{Block=Mongolian})
4169                                    (172)
4170          \p{Mongolian}           \p{Script_Extensions=Mongolian} (Short:
4171                                    \p{Mong}; NOT \p{Block=Mongolian}) (172)
4172        X \p{Mongolian_Sup}       \p{Mongolian_Supplement} (= \p{Block=
4173                                    Mongolian_Supplement}) (32)
4174        X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
4175                                    \p{InMongolianSup}) (32)
4176          \p{Mro}                 \p{Script_Extensions=Mro} (NOT \p{Block=
4177                                    Mro}) (43)
4178          \p{Mroo}                \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
4179                                    \p{Block=Mro}) (43)
4180          \p{Mtei}                \p{Meetei_Mayek} (= \p{Script_Extensions=
4181                                    Meetei_Mayek}) (NOT \p{Block=
4182                                    Meetei_Mayek}) (79)
4183          \p{Mult}                \p{Multani} (= \p{Script_Extensions=
4184                                    Multani}) (NOT \p{Block=Multani}) (48)
4185          \p{Multani}             \p{Script_Extensions=Multani} (Short:
4186                                    \p{Mult}; NOT \p{Block=Multani}) (48)
4187        X \p{Music}               \p{Musical_Symbols} (= \p{Block=
4188                                    Musical_Symbols}) (256)
4189        X \p{Musical_Symbols}     \p{Block=Musical_Symbols} (Short:
4190                                    \p{InMusic}) (256)
4191          \p{Myanmar}             \p{Script_Extensions=Myanmar} (Short:
4192                                    \p{Mymr}; NOT \p{Block=Myanmar}) (224)
4193        X \p{Myanmar_Ext_A}       \p{Myanmar_Extended_A} (= \p{Block=
4194                                    Myanmar_Extended_A}) (32)
4195        X \p{Myanmar_Ext_B}       \p{Myanmar_Extended_B} (= \p{Block=
4196                                    Myanmar_Extended_B}) (32)
4197        X \p{Myanmar_Extended_A}  \p{Block=Myanmar_Extended_A} (Short:
4198                                    \p{InMyanmarExtA}) (32)
4199        X \p{Myanmar_Extended_B}  \p{Block=Myanmar_Extended_B} (Short:
4200                                    \p{InMyanmarExtB}) (32)
4201          \p{Mymr}                \p{Myanmar} (= \p{Script_Extensions=
4202                                    Myanmar}) (NOT \p{Block=Myanmar}) (224)
4203          \p{N} \pN               \p{Number} (= \p{General_Category=Number})
4204                                    (1791)
4205          \p{Na=*}                \p{Name=*}
4206          \p{Nabataean}           \p{Script_Extensions=Nabataean} (Short:
4207                                    \p{Nbat}; NOT \p{Block=Nabataean}) (40)
4208          \p{Name=*}              Combination of Name and Name_Alias
4209                                    properties; has special loose matching
4210                                    rules, for which see Unicode UAX #44
4211          \p{Nand}                \p{Nandinagari} (= \p{Script_Extensions=
4212                                    Nandinagari}) (NOT \p{Block=
4213                                    Nandinagari}) (86)
4214          \p{Nandinagari}         \p{Script_Extensions=Nandinagari} (Short:
4215                                    \p{Nand}; NOT \p{Block=Nandinagari}) (86)
4216          \p{Narb}                \p{Old_North_Arabian} (=
4217                                    \p{Script_Extensions=Old_North_Arabian})
4218                                    (32)
4219        X \p{NB}                  \p{No_Block} (= \p{Block=No_Block})
4220                                    (825_600 plus all above-Unicode code
4221                                    points)
4222          \p{Nbat}                \p{Nabataean} (= \p{Script_Extensions=
4223                                    Nabataean}) (NOT \p{Block=Nabataean})
4224                                    (40)
4225          \p{NChar}               \p{Noncharacter_Code_Point} (=
4226                                    \p{Noncharacter_Code_Point=Y}) (66)
4227          \p{NChar: *}            \p{Noncharacter_Code_Point: *}
4228          \p{Nd}                  \p{XPosixDigit} (= \p{General_Category=
4229                                    Decimal_Number}) (660)
4230          \p{New_Tai_Lue}         \p{Script_Extensions=New_Tai_Lue} (Short:
4231                                    \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
4232          \p{Newa}                \p{Script_Extensions=Newa} (NOT \p{Block=
4233                                    Newa}) (97)
4234          \p{NFC_QC: *}           \p{NFC_Quick_Check: *}
4235          \p{NFC_Quick_Check: M}  \p{NFC_Quick_Check=Maybe} (111)
4236          \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (111:
4237                                    U+0300..0304, U+0306..030C, U+030F,
4238                                    U+0311, U+0313..0314, U+031B ...)
4239          \p{NFC_Quick_Check: N}  \p{NFC_Quick_Check=No} (NOT
4240                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
4241                                    (1120)
4242          \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
4243                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
4244                                    (1120: U+0340..0341, U+0343..0344,
4245                                    U+0374, U+037E, U+0387, U+0958..095F ...)
4246          \p{NFC_Quick_Check: Y}  \p{NFC_Quick_Check=Yes} (NOT
4247                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
4248                                    (1_112_881 plus all above-Unicode code
4249                                    points)
4250          \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
4251                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
4252                                    (1_112_881 plus all above-Unicode code
4253                                    points: U+0000..02FF, U+0305,
4254                                    U+030D..030E, U+0310, U+0312,
4255                                    U+0315..031A ...)
4256          \p{NFD_QC: *}           \p{NFD_Quick_Check: *}
4257          \p{NFD_Quick_Check: N}  \p{NFD_Quick_Check=No} (NOT
4258                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
4259                                    (13_233)
4260          \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
4261                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
4262                                    (13_233: [\xc0-\xc5\xc7-\xcf\xd1-\xd6
4263                                    \xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-\xf6
4264                                    \xf9-\xfd\xff], U+0100..010F,
4265                                    U+0112..0125, U+0128..0130,
4266                                    U+0134..0137, U+0139..013E ...)
4267          \p{NFD_Quick_Check: Y}  \p{NFD_Quick_Check=Yes} (NOT
4268                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
4269                                    (1_100_879 plus all above-Unicode code
4270                                    points)
4271          \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
4272                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
4273                                    (1_100_879 plus all above-Unicode code
4274                                    points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
4275                                    \xdf\xe6\xf0\xf7-\xf8\xfe],
4276                                    U+0110..0111, U+0126..0127,
4277                                    U+0131..0133, U+0138, U+013F..0142 ...)
4278          \p{NFKC_QC: *}          \p{NFKC_Quick_Check: *}
4279          \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (111)
4280          \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (111:
4281                                    U+0300..0304, U+0306..030C, U+030F,
4282                                    U+0311, U+0313..0314, U+031B ...)
4283          \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
4284                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4285                                    (4866)
4286          \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
4287                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4288                                    (4866: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4289                                    \xba\xbc-\xbe], U+0132..0133,
4290                                    U+013F..0140, U+0149, U+017F,
4291                                    U+01C4..01CC ...)
4292          \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
4293                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4294                                    (1_109_135 plus all above-Unicode code
4295                                    points)
4296          \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
4297                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4298                                    (1_109_135 plus all above-Unicode code
4299                                    points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4300                                    \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
4301                                    U+0100..0131, U+0134..013E,
4302                                    U+0141..0148, U+014A..017E, U+0180..01C3
4303                                    ...)
4304          \p{NFKD_QC: *}          \p{NFKD_Quick_Check: *}
4305          \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
4306                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4307                                    (16_967)
4308          \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
4309                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4310                                    (16_967: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4311                                    \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
4312                                    \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
4313                                    \xf6\xf9-\xfd\xff], U+0100..010F,
4314                                    U+0112..0125, U+0128..0130,
4315                                    U+0132..0137, U+0139..0140 ...)
4316          \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
4317                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4318                                    (1_097_145 plus all above-Unicode code
4319                                    points)
4320          \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
4321                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4322                                    (1_097_145 plus all above-Unicode code
4323                                    points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4324                                    \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xc6\xd0
4325                                    \xd7-\xd8\xde-\xdf\xe6\xf0\xf7-\xf8
4326                                    \xfe], U+0110..0111, U+0126..0127,
4327                                    U+0131, U+0138, U+0141..0142 ...)
4328          \p{Nko}                 \p{Script_Extensions=Nko} (NOT \p{Block=
4329                                    NKo}) (67)
4330          \p{Nkoo}                \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
4331                                    \p{Block=NKo}) (67)
4332          \p{Nl}                  \p{Letter_Number} (= \p{General_Category=
4333                                    Letter_Number}) (236)
4334          \p{No}                  \p{Other_Number} (= \p{General_Category=
4335                                    Other_Number}) (895)
4336        X \p{No_Block}            \p{Block=No_Block} (Short: \p{InNB})
4337                                    (825_600 plus all above-Unicode code
4338                                    points)
4339          \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
4340                                    \p{NChar}) (66)
4341          \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
4342                                    (1_114_046 plus all above-Unicode code
4343                                    points: U+0000..FDCF, U+FDF0..FFFD,
4344                                    U+10000..1FFFD, U+20000..2FFFD,
4345                                    U+30000..3FFFD, U+40000..4FFFD ...)
4346          \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
4347                                    (66: U+FDD0..FDEF, U+FFFE..FFFF,
4348                                    U+1FFFE..1FFFF, U+2FFFE..2FFFF,
4349                                    U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
4350          \p{Nonspacing_Mark}     \p{General_Category=Nonspacing_Mark}
4351                                    (Short: \p{Mn}) (1950)
4352          \p{Nshu}                \p{Nushu} (= \p{Script_Extensions=Nushu})
4353                                    (NOT \p{Block=Nushu}) (397)
4354          \p{Nt: *}               \p{Numeric_Type: *}
4355          \p{Number}              \p{General_Category=Number} (Short: \p{N})
4356                                    (1791)
4357        X \p{Number_Forms}        \p{Block=Number_Forms} (64)
4358          \p{Numeric_Type: De}    \p{Numeric_Type=Decimal} (660)
4359          \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (660: [0-9],
4360                                    U+0660..0669, U+06F0..06F9,
4361                                    U+07C0..07C9, U+0966..096F, U+09E6..09EF
4362                                    ...)
4363          \p{Numeric_Type: Di}    \p{Numeric_Type=Digit} (128)
4364          \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
4365                                    U+1369..1371, U+19DA, U+2070,
4366                                    U+2074..2079, U+2080..2089 ...)
4367          \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_240 plus all
4368                                    above-Unicode code points: [\x00-\x20!
4369                                    \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4370                                    \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4371                                    \xbb\xbf-\xff], U+0100..065F,
4372                                    U+066A..06EF, U+06FA..07BF,
4373                                    U+07CA..0965, U+0970..09E5 ...)
4374          \p{Numeric_Type: Nu}    \p{Numeric_Type=Numeric} (1084)
4375          \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1084: [\xbc-\xbe],
4376                                    U+09F4..09F9, U+0B72..0B77,
4377                                    U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
4378                                    ...)
4379        T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
4380        T \p{Numeric_Value: 0}    (Short: \p{Nv=0}) (84: [0], U+0660,
4381                                    U+06F0, U+07C0, U+0966, U+09E6 ...)
4382        T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
4383                                    U+11FD4)
4384        T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
4385        T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
4386        T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
4387        T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
4388        T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
4389        T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
4390        T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
4391        T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
4392        T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
4393                                    U+0D76, U+A833, U+11FC9..11FCA)
4394        T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
4395        T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
4396                                    U+11FCB)
4397        T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1: U+2151)
4398        T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
4399                                    U+0D77, U+215B, U+A834, U+11FCC ...)
4400        T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1: U+2150)
4401        T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
4402        T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
4403                                    U+12461, U+1ED3D)
4404        T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
4405                                    U+0D78, U+A835, U+11FCE)
4406        T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
4407                                    U+11FCF)
4408        T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
4409                                    U+0B72, U+0D73, U+A830, U+10140 ...)
4410        T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
4411                                    U+10E7D, U+1245A, U+1245D, U+12465)
4412        T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1: U+215C)
4413        T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1: U+2156)
4414        T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
4415        T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
4416                                    U+0D74, U+0F2A, U+2CFD, U+A831 ...)
4417        T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
4418        T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1: U+2157)
4419        T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1: U+215D)
4420        T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
4421                                    U+109FD, U+10E7E, U+1245B, U+1245E ...)
4422        T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
4423                                    U+0B74, U+0D75, U+A832, U+10178 ...)
4424        T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1: U+2158)
4425        T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
4426                                    U+1245C)
4427        T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1: U+215E)
4428        T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
4429        T \p{Numeric_Value: 1}    (Short: \p{Nv=1}) (141: [1\xb9], U+0661,
4430                                    U+06F1, U+07C1, U+0967, U+09E7 ...)
4431        T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1: U+0F2B)
4432        T \p{Numeric_Value: 2}    (Short: \p{Nv=2}) (140: [2\xb2], U+0662,
4433                                    U+06F2, U+07C2, U+0968, U+09E8 ...)
4434        T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1: U+0F2C)
4435        T \p{Numeric_Value: 3}    (Short: \p{Nv=3}) (141: [3\xb3], U+0663,
4436                                    U+06F3, U+07C3, U+0969, U+09E9 ...)
4437        T \p{Numeric_Value: 7/2}  (Short: \p{Nv=7/2}) (1: U+0F2D)
4438        T \p{Numeric_Value: 4}    (Short: \p{Nv=4}) (132: [4], U+0664,
4439                                    U+06F4, U+07C4, U+096A, U+09EA ...)
4440        T \p{Numeric_Value: 9/2}  (Short: \p{Nv=9/2}) (1: U+0F2E)
4441        T \p{Numeric_Value: 5}    (Short: \p{Nv=5}) (130: [5], U+0665,
4442                                    U+06F5, U+07C5, U+096B, U+09EB ...)
4443        T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
4444        T \p{Numeric_Value: 6}    (Short: \p{Nv=6}) (114: [6], U+0666,
4445                                    U+06F6, U+07C6, U+096C, U+09EC ...)
4446        T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
4447        T \p{Numeric_Value: 7}    (Short: \p{Nv=7}) (113: [7], U+0667,
4448                                    U+06F7, U+07C7, U+096D, U+09ED ...)
4449        T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
4450        T \p{Numeric_Value: 8}    (Short: \p{Nv=8}) (109: [8], U+0668,
4451                                    U+06F8, U+07C8, U+096E, U+09EE ...)
4452        T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
4453        T \p{Numeric_Value: 9}    (Short: \p{Nv=9}) (113: [9], U+0669,
4454                                    U+06F9, U+07C9, U+096F, U+09EF ...)
4455        T \p{Numeric_Value: 10}   (Short: \p{Nv=10}) (62: U+0BF0, U+0D70,
4456                                    U+1372, U+2169, U+2179, U+2469 ...)
4457        T \p{Numeric_Value: 11}   (Short: \p{Nv=11}) (8: U+216A, U+217A,
4458                                    U+246A, U+247E, U+2492, U+24EB ...)
4459        T \p{Numeric_Value: 12}   (Short: \p{Nv=12}) (8: U+216B, U+217B,
4460                                    U+246B, U+247F, U+2493, U+24EC ...)
4461        T \p{Numeric_Value: 13}   (Short: \p{Nv=13}) (6: U+246C, U+2480,
4462                                    U+2494, U+24ED, U+16E8D, U+1D2ED)
4463        T \p{Numeric_Value: 14}   (Short: \p{Nv=14}) (6: U+246D, U+2481,
4464                                    U+2495, U+24EE, U+16E8E, U+1D2EE)
4465        T \p{Numeric_Value: 15}   (Short: \p{Nv=15}) (6: U+246E, U+2482,
4466                                    U+2496, U+24EF, U+16E8F, U+1D2EF)
4467        T \p{Numeric_Value: 16}   (Short: \p{Nv=16}) (7: U+09F9, U+246F,
4468                                    U+2483, U+2497, U+24F0, U+16E90 ...)
4469        T \p{Numeric_Value: 17}   (Short: \p{Nv=17}) (7: U+16EE, U+2470,
4470                                    U+2484, U+2498, U+24F1, U+16E91 ...)
4471        T \p{Numeric_Value: 18}   (Short: \p{Nv=18}) (7: U+16EF, U+2471,
4472                                    U+2485, U+2499, U+24F2, U+16E92 ...)
4473        T \p{Numeric_Value: 19}   (Short: \p{Nv=19}) (7: U+16F0, U+2472,
4474                                    U+2486, U+249A, U+24F3, U+16E93 ...)
4475        T \p{Numeric_Value: 20}   (Short: \p{Nv=20}) (36: U+1373, U+2473,
4476                                    U+2487, U+249B, U+24F4, U+3039 ...)
4477        T \p{Numeric_Value: 21}   (Short: \p{Nv=21}) (1: U+3251)
4478        T \p{Numeric_Value: 22}   (Short: \p{Nv=22}) (1: U+3252)
4479        T \p{Numeric_Value: 23}   (Short: \p{Nv=23}) (1: U+3253)
4480        T \p{Numeric_Value: 24}   (Short: \p{Nv=24}) (1: U+3254)
4481        T \p{Numeric_Value: 25}   (Short: \p{Nv=25}) (1: U+3255)
4482        T \p{Numeric_Value: 26}   (Short: \p{Nv=26}) (1: U+3256)
4483        T \p{Numeric_Value: 27}   (Short: \p{Nv=27}) (1: U+3257)
4484        T \p{Numeric_Value: 28}   (Short: \p{Nv=28}) (1: U+3258)
4485        T \p{Numeric_Value: 29}   (Short: \p{Nv=29}) (1: U+3259)
4486        T \p{Numeric_Value: 30}   (Short: \p{Nv=30}) (19: U+1374, U+303A,
4487                                    U+324A, U+325A, U+5345, U+10112 ...)
4488        T \p{Numeric_Value: 31}   (Short: \p{Nv=31}) (1: U+325B)
4489        T \p{Numeric_Value: 32}   (Short: \p{Nv=32}) (1: U+325C)
4490        T \p{Numeric_Value: 33}   (Short: \p{Nv=33}) (1: U+325D)
4491        T \p{Numeric_Value: 34}   (Short: \p{Nv=34}) (1: U+325E)
4492        T \p{Numeric_Value: 35}   (Short: \p{Nv=35}) (1: U+325F)
4493        T \p{Numeric_Value: 36}   (Short: \p{Nv=36}) (1: U+32B1)
4494        T \p{Numeric_Value: 37}   (Short: \p{Nv=37}) (1: U+32B2)
4495        T \p{Numeric_Value: 38}   (Short: \p{Nv=38}) (1: U+32B3)
4496        T \p{Numeric_Value: 39}   (Short: \p{Nv=39}) (1: U+32B4)
4497        T \p{Numeric_Value: 40}   (Short: \p{Nv=40}) (18: U+1375, U+324B,
4498                                    U+32B5, U+534C, U+10113, U+102ED ...)
4499        T \p{Numeric_Value: 41}   (Short: \p{Nv=41}) (1: U+32B6)
4500        T \p{Numeric_Value: 42}   (Short: \p{Nv=42}) (1: U+32B7)
4501        T \p{Numeric_Value: 43}   (Short: \p{Nv=43}) (1: U+32B8)
4502        T \p{Numeric_Value: 44}   (Short: \p{Nv=44}) (1: U+32B9)
4503        T \p{Numeric_Value: 45}   (Short: \p{Nv=45}) (1: U+32BA)
4504        T \p{Numeric_Value: 46}   (Short: \p{Nv=46}) (1: U+32BB)
4505        T \p{Numeric_Value: 47}   (Short: \p{Nv=47}) (1: U+32BC)
4506        T \p{Numeric_Value: 48}   (Short: \p{Nv=48}) (1: U+32BD)
4507        T \p{Numeric_Value: 49}   (Short: \p{Nv=49}) (1: U+32BE)
4508        T \p{Numeric_Value: 50}   (Short: \p{Nv=50}) (29: U+1376, U+216C,
4509                                    U+217C, U+2186, U+324C, U+32BF ...)
4510        T \p{Numeric_Value: 60}   (Short: \p{Nv=60}) (13: U+1377, U+324D,
4511                                    U+10115, U+102EF, U+109CE, U+10E6E ...)
4512        T \p{Numeric_Value: 70}   (Short: \p{Nv=70}) (13: U+1378, U+324E,
4513                                    U+10116, U+102F0, U+109CF, U+10E6F ...)
4514        T \p{Numeric_Value: 80}   (Short: \p{Nv=80}) (12: U+1379, U+324F,
4515                                    U+10117, U+102F1, U+10E70, U+11062 ...)
4516        T \p{Numeric_Value: 90}   (Short: \p{Nv=90}) (12: U+137A, U+10118,
4517                                    U+102F2, U+10341, U+10E71, U+11063 ...)
4518        T \p{Numeric_Value: 100}  (Short: \p{Nv=100}) (35: U+0BF1, U+0D71,
4519                                    U+137B, U+216D, U+217D, U+4F70 ...)
4520        T \p{Numeric_Value: 200}  (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
4521                                    U+109D3, U+10E73, U+1EC84, U+1ED14)
4522        T \p{Numeric_Value: 300}  (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
4523                                    U+102F5, U+109D4, U+10E74, U+1EC85 ...)
4524        T \p{Numeric_Value: 400}  (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
4525                                    U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
4526        T \p{Numeric_Value: 500}  (Short: \p{Nv=500}) (16: U+216E, U+217E,
4527                                    U+1011D, U+10145, U+1014C, U+10153 ...)
4528        T \p{Numeric_Value: 600}  (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
4529                                    U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
4530        T \p{Numeric_Value: 700}  (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
4531                                    U+109D8, U+10E78, U+1EC89, U+1ED19)
4532        T \p{Numeric_Value: 800}  (Short: \p{Nv=800}) (6: U+10120, U+102FA,
4533                                    U+109D9, U+10E79, U+1EC8A, U+1ED1A)
4534        T \p{Numeric_Value: 900}  (Short: \p{Nv=900}) (7: U+10121, U+102FB,
4535                                    U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
4536        T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
4537                                    U+216F, U+217F..2180, U+4EDF, U+5343 ...)
4538        T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
4539                                    U+1EC8D, U+1ED1D, U+1ED3A)
4540        T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
4541                                    U+1EC8E, U+1ED1E)
4542        T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
4543                                    U+1EC8F, U+1ED1F)
4544        T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
4545                                    U+10146, U+1014E, U+10172, U+109DF ...)
4546        T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
4547                                    U+1EC91, U+1ED21)
4548        T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
4549                                    U+1EC92, U+1ED22)
4550        T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
4551                                    U+1EC93, U+1ED23)
4552        T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
4553                                    U+1EC94, U+1ED24)
4554        T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
4555                                    U+137C, U+2182, U+4E07, U+842C, U+1012B,
4556                                    U+10155 ...)
4557        T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
4558                                    U+1012C, U+109E5, U+1EC96, U+1ED26)
4559        T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
4560                                    U+1012D, U+109E6, U+1EC97, U+1ED27)
4561        T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
4562                                    U+1012E, U+109E7, U+1EC98, U+1ED28)
4563        T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
4564                                    U+2187, U+1012F, U+10147, U+10156,
4565                                    U+109E8, U+1EC99 ...)
4566        T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
4567                                    U+10130, U+109E9, U+1EC9A, U+1ED2A)
4568        T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
4569                                    U+10131, U+109EA, U+1EC9B, U+1ED2B)
4570        T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
4571                                    U+10132, U+109EB, U+1EC9C, U+1ED2C)
4572        T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
4573                                    U+10133, U+109EC, U+1EC9D, U+1ED2D)
4574        T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
4575                                    U+2188, U+109ED, U+1EC9E, U+1ECA0,
4576                                    U+1ECB4)
4577        T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
4578                                    U+109EE, U+1EC9F)
4579        T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
4580                                    U+12432)
4581        T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
4582                                    U+109EF)
4583        T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
4584                                    U+109F0)
4585        T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
4586                                    U+12433)
4587        T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
4588                                    U+109F1)
4589        T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
4590                                    U+109F2)
4591        T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
4592                                    U+109F3)
4593        T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
4594                                    U+109F4)
4595        T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
4596                                    U+109F5)
4597        T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
4598                                    U+16B5E)
4599        T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
4600                                    (1: U+1ECA1)
4601        T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
4602                                    (1: U+1ECA2)
4603        T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
4604                                    (3: U+4EBF, U+5104, U+16B5F)
4605        T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
4606                                    10000000000}) (1: U+16B60)
4607        T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
4608                                    1000000000000}) (2: U+5146, U+16B61)
4609          \p{Numeric_Value: NaN}  (Short: \p{Nv=NaN}) (1_112_240 plus all
4610                                    above-Unicode code points: [\x00-\x20!
4611                                    \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4612                                    \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4613                                    \xbb\xbf-\xff], U+0100..065F,
4614                                    U+066A..06EF, U+06FA..07BF,
4615                                    U+07CA..0965, U+0970..09E5 ...)
4616          \p{Nushu}               \p{Script_Extensions=Nushu} (Short:
4617                                    \p{Nshu}; NOT \p{Block=Nushu}) (397)
4618          \p{Nv: *}               \p{Numeric_Value: *}
4619          \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
4620                                    Nyiakeng_Puachue_Hmong} (Short:
4621                                    \p{Hmnp}; NOT \p{Block=
4622                                    Nyiakeng_Puachue_Hmong}) (71)
4623        X \p{OCR}                 \p{Optical_Character_Recognition} (=
4624                                    \p{Block=Optical_Character_Recognition})
4625                                    (32)
4626          \p{Ogam}                \p{Ogham} (= \p{Script_Extensions=Ogham})
4627                                    (NOT \p{Block=Ogham}) (29)
4628          \p{Ogham}               \p{Script_Extensions=Ogham} (Short:
4629                                    \p{Ogam}; NOT \p{Block=Ogham}) (29)
4630          \p{Ol_Chiki}            \p{Script_Extensions=Ol_Chiki} (Short:
4631                                    \p{Olck}) (48)
4632          \p{Olck}                \p{Ol_Chiki} (= \p{Script_Extensions=
4633                                    Ol_Chiki}) (48)
4634          \p{Old_Hungarian}       \p{Script_Extensions=Old_Hungarian}
4635                                    (Short: \p{Hung}; NOT \p{Block=
4636                                    Old_Hungarian}) (108)
4637          \p{Old_Italic}          \p{Script_Extensions=Old_Italic} (Short:
4638                                    \p{Ital}; NOT \p{Block=Old_Italic}) (39)
4639          \p{Old_North_Arabian}   \p{Script_Extensions=Old_North_Arabian}
4640                                    (Short: \p{Narb}) (32)
4641          \p{Old_Permic}          \p{Script_Extensions=Old_Permic} (Short:
4642                                    \p{Perm}; NOT \p{Block=Old_Permic}) (44)
4643          \p{Old_Persian}         \p{Script_Extensions=Old_Persian} (Short:
4644                                    \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
4645          \p{Old_Sogdian}         \p{Script_Extensions=Old_Sogdian} (Short:
4646                                    \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
4647          \p{Old_South_Arabian}   \p{Script_Extensions=Old_South_Arabian}
4648                                    (Short: \p{Sarb}) (32)
4649          \p{Old_Turkic}          \p{Script_Extensions=Old_Turkic} (Short:
4650                                    \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
4651          \p{Old_Uyghur}          \p{Script_Extensions=Old_Uyghur} (Short:
4652                                    \p{Ougr}; NOT \p{Block=Old_Uyghur}) (28)
4653          \p{Open_Punctuation}    \p{General_Category=Open_Punctuation}
4654                                    (Short: \p{Ps}) (79)
4655        X \p{Optical_Character_Recognition} \p{Block=
4656                                    Optical_Character_Recognition} (Short:
4657                                    \p{InOCR}) (32)
4658          \p{Oriya}               \p{Script_Extensions=Oriya} (Short:
4659                                    \p{Orya}; NOT \p{Block=Oriya}) (97)
4660          \p{Orkh}                \p{Old_Turkic} (= \p{Script_Extensions=
4661                                    Old_Turkic}) (NOT \p{Block=Old_Turkic})
4662                                    (73)
4663        X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
4664          \p{Orya}                \p{Oriya} (= \p{Script_Extensions=Oriya})
4665                                    (NOT \p{Block=Oriya}) (97)
4666          \p{Osage}               \p{Script_Extensions=Osage} (Short:
4667                                    \p{Osge}; NOT \p{Block=Osage}) (72)
4668          \p{Osge}                \p{Osage} (= \p{Script_Extensions=Osage})
4669                                    (NOT \p{Block=Osage}) (72)
4670          \p{Osma}                \p{Osmanya} (= \p{Script_Extensions=
4671                                    Osmanya}) (NOT \p{Block=Osmanya}) (40)
4672          \p{Osmanya}             \p{Script_Extensions=Osmanya} (Short:
4673                                    \p{Osma}; NOT \p{Block=Osmanya}) (40)
4674          \p{Other}               \p{General_Category=Other} (Short: \p{C})
4675                                    (969_578 plus all above-Unicode code
4676                                    points)
4677          \p{Other_Letter}        \p{General_Category=Other_Letter} (Short:
4678                                    \p{Lo}) (127_333)
4679          \p{Other_Number}        \p{General_Category=Other_Number} (Short:
4680                                    \p{No}) (895)
4681          \p{Other_Punctuation}   \p{General_Category=Other_Punctuation}
4682                                    (Short: \p{Po}) (605)
4683          \p{Other_Symbol}        \p{General_Category=Other_Symbol} (Short:
4684                                    \p{So}) (6605)
4685        X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
4686          \p{Ougr}                \p{Old_Uyghur} (= \p{Script_Extensions=
4687                                    Old_Uyghur}) (NOT \p{Block=Old_Uyghur})
4688                                    (28)
4689          \p{P} \pP               \p{Punct} (= \p{General_Category=
4690                                    Punctuation}) (NOT
4691                                    \p{General_Punctuation}) (819)
4692          \p{Pahawh_Hmong}        \p{Script_Extensions=Pahawh_Hmong} (Short:
4693                                    \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
4694                                    (127)
4695          \p{Palm}                \p{Palmyrene} (= \p{Script_Extensions=
4696                                    Palmyrene}) (32)
4697          \p{Palmyrene}           \p{Script_Extensions=Palmyrene} (Short:
4698                                    \p{Palm}) (32)
4699          \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
4700                                    (Short: \p{Zp}) (1)
4701          \p{Pat_Syn}             \p{Pattern_Syntax} (= \p{Pattern_Syntax=
4702                                    Y}) (2760)
4703          \p{Pat_Syn: *}          \p{Pattern_Syntax: *}
4704          \p{Pat_WS}              \p{Pattern_White_Space} (=
4705                                    \p{Pattern_White_Space=Y}) (11)
4706          \p{Pat_WS: *}           \p{Pattern_White_Space: *}
4707          \p{Pattern_Syntax}      \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
4708                                    (2760)
4709          \p{Pattern_Syntax: N*}  (Short: \p{PatSyn=N}, \P{PatSyn})
4710                                    (1_111_352 plus all above-Unicode code
4711                                    points: [\x00-\x200-9A-Z_a-z\x7f-\xa0
4712                                    \xa8\xaa\xad\xaf\xb2-\xb5\xb7-\xba\xbc-
4713                                    \xbe\xc0-\xd6\xd8-\xf6\xf8-\xff],
4714                                    U+0100..200F, U+2028..202F,
4715                                    U+203F..2040, U+2054, U+205F..218F ...)
4716          \p{Pattern_Syntax: Y*}  (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
4717                                    [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
4718                                    \^`\{\|\}~\xa1-\xa7\xa9\xab-\xac\xae
4719                                    \xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
4720                                    U+2010..2027, U+2030..203E,
4721                                    U+2041..2053, U+2055..205E, U+2190..245F
4722                                    ...)
4723          \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
4724                                    \p{PatWS}) (11)
4725          \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
4726                                    (1_114_101 plus all above-Unicode code
4727                                    points: [^\t\n\cK\f\r\x20\x85],
4728                                    U+0100..200D, U+2010..2027,
4729                                    U+202A..infinity)
4730          \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
4731                                    [\t\n\cK\f\r\x20\x85], U+200E..200F,
4732                                    U+2028..2029)
4733          \p{Pau_Cin_Hau}         \p{Script_Extensions=Pau_Cin_Hau} (Short:
4734                                    \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
4735          \p{Pauc}                \p{Pau_Cin_Hau} (= \p{Script_Extensions=
4736                                    Pau_Cin_Hau}) (NOT \p{Block=
4737                                    Pau_Cin_Hau}) (57)
4738          \p{Pc}                  \p{Connector_Punctuation} (=
4739                                    \p{General_Category=
4740                                    Connector_Punctuation}) (10)
4741          \p{PCM}                 \p{Prepended_Concatenation_Mark} (=
4742                                    \p{Prepended_Concatenation_Mark=Y}) (13)
4743          \p{PCM: *}              \p{Prepended_Concatenation_Mark: *}
4744          \p{Pd}                  \p{Dash_Punctuation} (=
4745                                    \p{General_Category=Dash_Punctuation})
4746                                    (26)
4747          \p{Pe}                  \p{Close_Punctuation} (=
4748                                    \p{General_Category=Close_Punctuation})
4749                                    (77)
4750          \p{PerlSpace}           \p{PosixSpace} (6)
4751          \p{PerlWord}            \p{PosixWord} (63)
4752          \p{Perm}                \p{Old_Permic} (= \p{Script_Extensions=
4753                                    Old_Permic}) (NOT \p{Block=Old_Permic})
4754                                    (44)
4755          \p{Pf}                  \p{Final_Punctuation} (=
4756                                    \p{General_Category=Final_Punctuation})
4757                                    (10)
4758          \p{Phag}                \p{Phags_Pa} (= \p{Script_Extensions=
4759                                    Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
4760          \p{Phags_Pa}            \p{Script_Extensions=Phags_Pa} (Short:
4761                                    \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
4762        X \p{Phaistos}            \p{Phaistos_Disc} (= \p{Block=
4763                                    Phaistos_Disc}) (48)
4764        X \p{Phaistos_Disc}       \p{Block=Phaistos_Disc} (Short:
4765                                    \p{InPhaistos}) (48)
4766          \p{Phli}                \p{Inscriptional_Pahlavi} (=
4767                                    \p{Script_Extensions=
4768                                    Inscriptional_Pahlavi}) (NOT \p{Block=
4769                                    Inscriptional_Pahlavi}) (27)
4770          \p{Phlp}                \p{Psalter_Pahlavi} (=
4771                                    \p{Script_Extensions=Psalter_Pahlavi})
4772                                    (NOT \p{Block=Psalter_Pahlavi}) (30)
4773          \p{Phnx}                \p{Phoenician} (= \p{Script_Extensions=
4774                                    Phoenician}) (NOT \p{Block=Phoenician})
4775                                    (29)
4776          \p{Phoenician}          \p{Script_Extensions=Phoenician} (Short:
4777                                    \p{Phnx}; NOT \p{Block=Phoenician}) (29)
4778        X \p{Phonetic_Ext}        \p{Phonetic_Extensions} (= \p{Block=
4779                                    Phonetic_Extensions}) (128)
4780        X \p{Phonetic_Ext_Sup}    \p{Phonetic_Extensions_Supplement} (=
4781                                    \p{Block=
4782                                    Phonetic_Extensions_Supplement}) (64)
4783        X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
4784                                    \p{InPhoneticExt}) (128)
4785        X \p{Phonetic_Extensions_Supplement} \p{Block=
4786                                    Phonetic_Extensions_Supplement} (Short:
4787                                    \p{InPhoneticExtSup}) (64)
4788          \p{Pi}                  \p{Initial_Punctuation} (=
4789                                    \p{General_Category=
4790                                    Initial_Punctuation}) (12)
4791        X \p{Playing_Cards}       \p{Block=Playing_Cards} (96)
4792          \p{Plrd}                \p{Miao} (= \p{Script_Extensions=Miao})
4793                                    (NOT \p{Block=Miao}) (149)
4794          \p{Po}                  \p{Other_Punctuation} (=
4795                                    \p{General_Category=Other_Punctuation})
4796                                    (605)
4797          \p{PosixAlnum}          (62: [0-9A-Za-z])
4798          \p{PosixAlpha}          (52: [A-Za-z])
4799          \p{PosixBlank}          (2: [\t\x20])
4800          \p{PosixCntrl}          ASCII control characters (33: ACK, BEL,
4801                                    BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
4802                                    DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
4803                                    FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
4804                                    SOH, STX, SUB, SYN, US, VT)
4805          \p{PosixDigit}          (10: [0-9])
4806          \p{PosixGraph}          (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
4807                                    Z\[\\\]\^_`a-z\{\|\}~])
4808          \p{PosixLower}          (/i= PosixAlpha) (26: [a-z])
4809          \p{PosixPrint}          (95: [\x20-\x7e])
4810          \p{PosixPunct}          (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
4811                                    \[\\\]\^_`\{\|\}~])
4812          \p{PosixSpace}          (Short: \p{PerlSpace}) (6: [\t\n\cK\f\r
4813                                    \x20])
4814          \p{PosixUpper}          (/i= PosixAlpha) (26: [A-Z])
4815          \p{PosixWord}           \w, restricted to ASCII (Short:
4816                                    \p{PerlWord}) (63: [0-9A-Z_a-z])
4817          \p{PosixXDigit}         \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
4818                                    (22)
4819          \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
4820                                    Y} (Short: \p{PCM}) (13)
4821          \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
4822                                    (1_114_099 plus all above-Unicode code
4823                                    points: U+0000..05FF, U+0606..06DC,
4824                                    U+06DE..070E, U+0710..088F,
4825                                    U+0892..08E1, U+08E3..110BC ...)
4826          \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
4827                                    (13: U+0600..0605, U+06DD, U+070F,
4828                                    U+0890..0891, U+08E2, U+110BD ...)
4829        T \p{Present_In: 1.1}     \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
4830                                    extension) (33_979)
4831          \p{Present_In: V1_1}    \p{Present_In=1.1} (= \p{Age=V1_1}) (Perl
4832                                    extension) (33_979)
4833        T \p{Present_In: 2.0}     Code point's usage introduced in version
4834                                    2.0 or earlier (Short: \p{In=2.0}) (Perl
4835                                    extension) (178_500: U+0000..01F5,
4836                                    U+01FA..0217, U+0250..02A8,
4837                                    U+02B0..02DE, U+02E0..02E9, U+0300..0345
4838                                    ...)
4839          \p{Present_In: V2_0}    \p{Present_In=2.0} (Perl extension)
4840                                    (178_500)
4841        T \p{Present_In: 2.1}     Code point's usage introduced in version
4842                                    2.1 or earlier (Short: \p{In=2.1}) (Perl
4843                                    extension) (178_502: U+0000..01F5,
4844                                    U+01FA..0217, U+0250..02A8,
4845                                    U+02B0..02DE, U+02E0..02E9, U+0300..0345
4846                                    ...)
4847          \p{Present_In: V2_1}    \p{Present_In=2.1} (Perl extension)
4848                                    (178_502)
4849        T \p{Present_In: 3.0}     Code point's usage introduced in version
4850                                    3.0 or earlier (Short: \p{In=3.0}) (Perl
4851                                    extension) (188_809: U+0000..021F,
4852                                    U+0222..0233, U+0250..02AD,
4853                                    U+02B0..02EE, U+0300..034E, U+0360..0362
4854                                    ...)
4855          \p{Present_In: V3_0}    \p{Present_In=3.0} (Perl extension)
4856                                    (188_809)
4857        T \p{Present_In: 3.1}     Code point's usage introduced in version
4858                                    3.1 or earlier (Short: \p{In=3.1}) (Perl
4859                                    extension) (233_787: U+0000..021F,
4860                                    U+0222..0233, U+0250..02AD,
4861                                    U+02B0..02EE, U+0300..034E, U+0360..0362
4862                                    ...)
4863          \p{Present_In: V3_1}    \p{Present_In=3.1} (Perl extension)
4864                                    (233_787)
4865        T \p{Present_In: 3.2}     Code point's usage introduced in version
4866                                    3.2 or earlier (Short: \p{In=3.2}) (Perl
4867                                    extension) (234_803: U+0000..0220,
4868                                    U+0222..0233, U+0250..02AD,
4869                                    U+02B0..02EE, U+0300..034F, U+0360..036F
4870                                    ...)
4871          \p{Present_In: V3_2}    \p{Present_In=3.2} (Perl extension)
4872                                    (234_803)
4873        T \p{Present_In: 4.0}     Code point's usage introduced in version
4874                                    4.0 or earlier (Short: \p{In=4.0}) (Perl
4875                                    extension) (236_029: U+0000..0236,
4876                                    U+0250..0357, U+035D..036F,
4877                                    U+0374..0375, U+037A, U+037E ...)
4878          \p{Present_In: V4_0}    \p{Present_In=4.0} (Perl extension)
4879                                    (236_029)
4880        T \p{Present_In: 4.1}     Code point's usage introduced in version
4881                                    4.1 or earlier (Short: \p{In=4.1}) (Perl
4882                                    extension) (237_302: U+0000..0241,
4883                                    U+0250..036F, U+0374..0375, U+037A,
4884                                    U+037E, U+0384..038A ...)
4885          \p{Present_In: V4_1}    \p{Present_In=4.1} (Perl extension)
4886                                    (237_302)
4887        T \p{Present_In: 5.0}     Code point's usage introduced in version
4888                                    5.0 or earlier (Short: \p{In=5.0}) (Perl
4889                                    extension) (238_671: U+0000..036F,
4890                                    U+0374..0375, U+037A..037E,
4891                                    U+0384..038A, U+038C, U+038E..03A1 ...)
4892          \p{Present_In: V5_0}    \p{Present_In=5.0} (Perl extension)
4893                                    (238_671)
4894        T \p{Present_In: 5.1}     Code point's usage introduced in version
4895                                    5.1 or earlier (Short: \p{In=5.1}) (Perl
4896                                    extension) (240_295: U+0000..0377,
4897                                    U+037A..037E, U+0384..038A, U+038C,
4898                                    U+038E..03A1, U+03A3..0523 ...)
4899          \p{Present_In: V5_1}    \p{Present_In=5.1} (Perl extension)
4900                                    (240_295)
4901        T \p{Present_In: 5.2}     Code point's usage introduced in version
4902                                    5.2 or earlier (Short: \p{In=5.2}) (Perl
4903                                    extension) (246_943: U+0000..0377,
4904                                    U+037A..037E, U+0384..038A, U+038C,
4905                                    U+038E..03A1, U+03A3..0525 ...)
4906          \p{Present_In: V5_2}    \p{Present_In=5.2} (Perl extension)
4907                                    (246_943)
4908        T \p{Present_In: 6.0}     Code point's usage introduced in version
4909                                    6.0 or earlier (Short: \p{In=6.0}) (Perl
4910                                    extension) (249_031: U+0000..0377,
4911                                    U+037A..037E, U+0384..038A, U+038C,
4912                                    U+038E..03A1, U+03A3..0527 ...)
4913          \p{Present_In: V6_0}    \p{Present_In=6.0} (Perl extension)
4914                                    (249_031)
4915        T \p{Present_In: 6.1}     Code point's usage introduced in version
4916                                    6.1 or earlier (Short: \p{In=6.1}) (Perl
4917                                    extension) (249_763: U+0000..0377,
4918                                    U+037A..037E, U+0384..038A, U+038C,
4919                                    U+038E..03A1, U+03A3..0527 ...)
4920          \p{Present_In: V6_1}    \p{Present_In=6.1} (Perl extension)
4921                                    (249_763)
4922        T \p{Present_In: 6.2}     Code point's usage introduced in version
4923                                    6.2 or earlier (Short: \p{In=6.2}) (Perl
4924                                    extension) (249_764: U+0000..0377,
4925                                    U+037A..037E, U+0384..038A, U+038C,
4926                                    U+038E..03A1, U+03A3..0527 ...)
4927          \p{Present_In: V6_2}    \p{Present_In=6.2} (Perl extension)
4928                                    (249_764)
4929        T \p{Present_In: 6.3}     Code point's usage introduced in version
4930                                    6.3 or earlier (Short: \p{In=6.3}) (Perl
4931                                    extension) (249_769: U+0000..0377,
4932                                    U+037A..037E, U+0384..038A, U+038C,
4933                                    U+038E..03A1, U+03A3..0527 ...)
4934          \p{Present_In: V6_3}    \p{Present_In=6.3} (Perl extension)
4935                                    (249_769)
4936        T \p{Present_In: 7.0}     Code point's usage introduced in version
4937                                    7.0 or earlier (Short: \p{In=7.0}) (Perl
4938                                    extension) (252_603: U+0000..0377,
4939                                    U+037A..037F, U+0384..038A, U+038C,
4940                                    U+038E..03A1, U+03A3..052F ...)
4941          \p{Present_In: V7_0}    \p{Present_In=7.0} (Perl extension)
4942                                    (252_603)
4943        T \p{Present_In: 8.0}     Code point's usage introduced in version
4944                                    8.0 or earlier (Short: \p{In=8.0}) (Perl
4945                                    extension) (260_319: U+0000..0377,
4946                                    U+037A..037F, U+0384..038A, U+038C,
4947                                    U+038E..03A1, U+03A3..052F ...)
4948          \p{Present_In: V8_0}    \p{Present_In=8.0} (Perl extension)
4949                                    (260_319)
4950        T \p{Present_In: 9.0}     Code point's usage introduced in version
4951                                    9.0 or earlier (Short: \p{In=9.0}) (Perl
4952                                    extension) (267_819: U+0000..0377,
4953                                    U+037A..037F, U+0384..038A, U+038C,
4954                                    U+038E..03A1, U+03A3..052F ...)
4955          \p{Present_In: V9_0}    \p{Present_In=9.0} (Perl extension)
4956                                    (267_819)
4957        T \p{Present_In: 10.0}    Code point's usage introduced in version
4958                                    10.0 or earlier (Short: \p{In=10.0})
4959                                    (Perl extension) (276_337: U+0000..0377,
4960                                    U+037A..037F, U+0384..038A, U+038C,
4961                                    U+038E..03A1, U+03A3..052F ...)
4962          \p{Present_In: V10_0}   \p{Present_In=10.0} (Perl extension)
4963                                    (276_337)
4964        T \p{Present_In: 11.0}    Code point's usage introduced in version
4965                                    11.0 or earlier (Short: \p{In=11.0})
4966                                    (Perl extension) (277_021: U+0000..0377,
4967                                    U+037A..037F, U+0384..038A, U+038C,
4968                                    U+038E..03A1, U+03A3..052F ...)
4969          \p{Present_In: V11_0}   \p{Present_In=11.0} (Perl extension)
4970                                    (277_021)
4971        T \p{Present_In: 12.0}    Code point's usage introduced in version
4972                                    12.0 or earlier (Short: \p{In=12.0})
4973                                    (Perl extension) (277_575: U+0000..0377,
4974                                    U+037A..037F, U+0384..038A, U+038C,
4975                                    U+038E..03A1, U+03A3..052F ...)
4976          \p{Present_In: V12_0}   \p{Present_In=12.0} (Perl extension)
4977                                    (277_575)
4978        T \p{Present_In: 12.1}    Code point's usage introduced in version
4979                                    12.1 or earlier (Short: \p{In=12.1})
4980                                    (Perl extension) (277_576: U+0000..0377,
4981                                    U+037A..037F, U+0384..038A, U+038C,
4982                                    U+038E..03A1, U+03A3..052F ...)
4983          \p{Present_In: V12_1}   \p{Present_In=12.1} (Perl extension)
4984                                    (277_576)
4985        T \p{Present_In: 13.0}    Code point's usage introduced in version
4986                                    13.0 or earlier (Short: \p{In=13.0})
4987                                    (Perl extension) (283_506: U+0000..0377,
4988                                    U+037A..037F, U+0384..038A, U+038C,
4989                                    U+038E..03A1, U+03A3..052F ...)
4990          \p{Present_In: V13_0}   \p{Present_In=13.0} (Perl extension)
4991                                    (283_506)
4992        T \p{Present_In: 14.0}    Code point's usage introduced in version
4993                                    14.0 or earlier (Short: \p{In=14.0})
4994                                    (Perl extension) (284_344: U+0000..0377,
4995                                    U+037A..037F, U+0384..038A, U+038C,
4996                                    U+038E..03A1, U+03A3..052F ...)
4997          \p{Present_In: V14_0}   \p{Present_In=14.0} (Perl extension)
4998                                    (284_344)
4999          \p{Present_In: NA}      \p{Present_In=Unassigned} (= \p{Age=
5000                                    Unassigned}) (Perl extension) (829_768
5001                                    plus all above-Unicode code points)
5002          \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=NA})
5003                                    (Perl extension) (829_768 plus all
5004                                    above-Unicode code points)
5005          \p{Print}               \p{XPosixPrint} (282_163)
5006          \p{Private_Use}         \p{General_Category=Private_Use} (Short:
5007                                    \p{Co}; NOT \p{Private_Use_Area})
5008                                    (137_468)
5009        X \p{Private_Use_Area}    \p{Block=Private_Use_Area} (Short:
5010                                    \p{InPUA}) (6400)
5011          \p{Prti}                \p{Inscriptional_Parthian} (=
5012                                    \p{Script_Extensions=
5013                                    Inscriptional_Parthian}) (NOT \p{Block=
5014                                    Inscriptional_Parthian}) (30)
5015          \p{Ps}                  \p{Open_Punctuation} (=
5016                                    \p{General_Category=Open_Punctuation})
5017                                    (79)
5018          \p{Psalter_Pahlavi}     \p{Script_Extensions=Psalter_Pahlavi}
5019                                    (Short: \p{Phlp}; NOT \p{Block=
5020                                    Psalter_Pahlavi}) (30)
5021        X \p{PUA}                 \p{Private_Use_Area} (= \p{Block=
5022                                    Private_Use_Area}) (6400)
5023          \p{Punct}               \p{General_Category=Punctuation} (Short:
5024                                    \p{P}; NOT \p{General_Punctuation}) (819)
5025          \p{Punctuation}         \p{Punct} (= \p{General_Category=
5026                                    Punctuation}) (NOT
5027                                    \p{General_Punctuation}) (819)
5028          \p{Qaac}                \p{Coptic} (= \p{Script_Extensions=
5029                                    Coptic}) (NOT \p{Block=Coptic}) (165)
5030          \p{Qaai}                \p{Inherited} (= \p{Script_Extensions=
5031                                    Inherited}) (586)
5032          \p{QMark}               \p{Quotation_Mark} (= \p{Quotation_Mark=
5033                                    Y}) (30)
5034          \p{QMark: *}            \p{Quotation_Mark: *}
5035          \p{Quotation_Mark}      \p{Quotation_Mark=Y} (Short: \p{QMark})
5036                                    (30)
5037          \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_082
5038                                    plus all above-Unicode code points:
5039                                    [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=>?
5040                                    \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xaa\xac-
5041                                    \xba\xbc-\xff], U+0100..2017,
5042                                    U+2020..2038, U+203B..2E41,
5043                                    U+2E43..300B, U+3010..301C ...)
5044          \p{Quotation_Mark: Y*}  (Short: \p{QMark=Y}, \p{QMark}) (30: [\"
5045                                    \'\xab\xbb], U+2018..201F, U+2039..203A,
5046                                    U+2E42, U+300C..300F, U+301D..301F ...)
5047          \p{Radical}             \p{Radical=Y} (329)
5048          \p{Radical: N*}         (Single: \P{Radical}) (1_113_783 plus all
5049                                    above-Unicode code points: U+0000..2E7F,
5050                                    U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
5051          \p{Radical: Y*}         (Single: \p{Radical}) (329: U+2E80..2E99,
5052                                    U+2E9B..2EF3, U+2F00..2FD5)
5053          \p{Regional_Indicator}  \p{Regional_Indicator=Y} (Short: \p{RI})
5054                                    (26)
5055          \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
5056                                    plus all above-Unicode code points:
5057                                    U+0000..1F1E5, U+1F200..infinity)
5058          \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
5059                                    U+1F1E6..1F1FF)
5060          \p{Rejang}              \p{Script_Extensions=Rejang} (Short:
5061                                    \p{Rjng}; NOT \p{Block=Rejang}) (37)
5062          \p{RI}                  \p{Regional_Indicator} (=
5063                                    \p{Regional_Indicator=Y}) (26)
5064          \p{RI: *}               \p{Regional_Indicator: *}
5065          \p{Rjng}                \p{Rejang} (= \p{Script_Extensions=
5066                                    Rejang}) (NOT \p{Block=Rejang}) (37)
5067          \p{Rohg}                \p{Hanifi_Rohingya} (=
5068                                    \p{Script_Extensions=Hanifi_Rohingya})
5069                                    (NOT \p{Block=Hanifi_Rohingya}) (55)
5070        X \p{Rumi}                \p{Rumi_Numeral_Symbols} (= \p{Block=
5071                                    Rumi_Numeral_Symbols}) (32)
5072        X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
5073                                    \p{InRumi}) (32)
5074          \p{Runic}               \p{Script_Extensions=Runic} (Short:
5075                                    \p{Runr}; NOT \p{Block=Runic}) (86)
5076          \p{Runr}                \p{Runic} (= \p{Script_Extensions=Runic})
5077                                    (NOT \p{Block=Runic}) (86)
5078          \p{S} \pS               \p{Symbol} (= \p{General_Category=Symbol})
5079                                    (7741)
5080          \p{Samaritan}           \p{Script_Extensions=Samaritan} (Short:
5081                                    \p{Samr}; NOT \p{Block=Samaritan}) (61)
5082          \p{Samr}                \p{Samaritan} (= \p{Script_Extensions=
5083                                    Samaritan}) (NOT \p{Block=Samaritan})
5084                                    (61)
5085          \p{Sarb}                \p{Old_South_Arabian} (=
5086                                    \p{Script_Extensions=Old_South_Arabian})
5087                                    (32)
5088          \p{Saur}                \p{Saurashtra} (= \p{Script_Extensions=
5089                                    Saurashtra}) (NOT \p{Block=Saurashtra})
5090                                    (82)
5091          \p{Saurashtra}          \p{Script_Extensions=Saurashtra} (Short:
5092                                    \p{Saur}; NOT \p{Block=Saurashtra}) (82)
5093          \p{SB: *}               \p{Sentence_Break: *}
5094          \p{Sc}                  \p{Currency_Symbol} (=
5095                                    \p{General_Category=Currency_Symbol})
5096                                    (63)
5097          \p{Sc: *}               \p{Script: *}
5098          \p{Script: Adlam}       (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B,
5099                                    U+1E950..1E959, U+1E95E..1E95F)
5100          \p{Script: Adlm}        \p{Script=Adlam} (88)
5101          \p{Script: Aghb}        \p{Script=Caucasian_Albanian} (=
5102                                    \p{Script_Extensions=
5103                                    Caucasian_Albanian}) (53)
5104          \p{Script: Ahom}        \p{Script_Extensions=Ahom} (Short: \p{Sc=
5105                                    Ahom}, \p{Ahom}) (65)
5106          \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
5107                                    Anatolian_Hieroglyphs} (Short: \p{Sc=
5108                                    Hluw}, \p{Hluw}) (583)
5109          \p{Script: Arab}        \p{Script=Arabic} (1365)
5110          \p{Script: Arabic}      (Short: \p{Sc=Arab}) (1365: U+0600..0604,
5111                                    U+0606..060B, U+060D..061A,
5112                                    U+061C..061E, U+0620..063F, U+0641..064A
5113                                    ...)
5114          \p{Script: Armenian}    \p{Script_Extensions=Armenian} (Short:
5115                                    \p{Sc=Armn}, \p{Armn}) (96)
5116          \p{Script: Armi}        \p{Script=Imperial_Aramaic} (=
5117                                    \p{Script_Extensions=Imperial_Aramaic})
5118                                    (31)
5119          \p{Script: Armn}        \p{Script=Armenian} (=
5120                                    \p{Script_Extensions=Armenian}) (96)
5121          \p{Script: Avestan}     \p{Script_Extensions=Avestan} (Short:
5122                                    \p{Sc=Avst}, \p{Avst}) (61)
5123          \p{Script: Avst}        \p{Script=Avestan} (=
5124                                    \p{Script_Extensions=Avestan}) (61)
5125          \p{Script: Bali}        \p{Script=Balinese} (=
5126                                    \p{Script_Extensions=Balinese}) (124)
5127          \p{Script: Balinese}    \p{Script_Extensions=Balinese} (Short:
5128                                    \p{Sc=Bali}, \p{Bali}) (124)
5129          \p{Script: Bamu}        \p{Script=Bamum} (= \p{Script_Extensions=
5130                                    Bamum}) (657)
5131          \p{Script: Bamum}       \p{Script_Extensions=Bamum} (Short: \p{Sc=
5132                                    Bamu}, \p{Bamu}) (657)
5133          \p{Script: Bass}        \p{Script=Bassa_Vah} (=
5134                                    \p{Script_Extensions=Bassa_Vah}) (36)
5135          \p{Script: Bassa_Vah}   \p{Script_Extensions=Bassa_Vah} (Short:
5136                                    \p{Sc=Bass}, \p{Bass}) (36)
5137          \p{Script: Batak}       \p{Script_Extensions=Batak} (Short: \p{Sc=
5138                                    Batk}, \p{Batk}) (56)
5139          \p{Script: Batk}        \p{Script=Batak} (= \p{Script_Extensions=
5140                                    Batak}) (56)
5141          \p{Script: Beng}        \p{Script=Bengali} (96)
5142          \p{Script: Bengali}     (Short: \p{Sc=Beng}) (96: U+0980..0983,
5143                                    U+0985..098C, U+098F..0990,
5144                                    U+0993..09A8, U+09AA..09B0, U+09B2 ...)
5145          \p{Script: Bhaiksuki}   \p{Script_Extensions=Bhaiksuki} (Short:
5146                                    \p{Sc=Bhks}, \p{Bhks}) (97)
5147          \p{Script: Bhks}        \p{Script=Bhaiksuki} (=
5148                                    \p{Script_Extensions=Bhaiksuki}) (97)
5149          \p{Script: Bopo}        \p{Script=Bopomofo} (77)
5150          \p{Script: Bopomofo}    (Short: \p{Sc=Bopo}) (77: U+02EA..02EB,
5151                                    U+3105..312F, U+31A0..31BF)
5152          \p{Script: Brah}        \p{Script=Brahmi} (= \p{Script_Extensions=
5153                                    Brahmi}) (115)
5154          \p{Script: Brahmi}      \p{Script_Extensions=Brahmi} (Short:
5155                                    \p{Sc=Brah}, \p{Brah}) (115)
5156          \p{Script: Brai}        \p{Script=Braille} (=
5157                                    \p{Script_Extensions=Braille}) (256)
5158          \p{Script: Braille}     \p{Script_Extensions=Braille} (Short:
5159                                    \p{Sc=Brai}, \p{Brai}) (256)
5160          \p{Script: Bugi}        \p{Script=Buginese} (30)
5161          \p{Script: Buginese}    (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
5162                                    U+1A1E..1A1F)
5163          \p{Script: Buhd}        \p{Script=Buhid} (20)
5164          \p{Script: Buhid}       (Short: \p{Sc=Buhd}) (20: U+1740..1753)
5165          \p{Script: Cakm}        \p{Script=Chakma} (71)
5166          \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
5167                                    Canadian_Aboriginal} (Short: \p{Sc=
5168                                    Cans}, \p{Cans}) (726)
5169          \p{Script: Cans}        \p{Script=Canadian_Aboriginal} (=
5170                                    \p{Script_Extensions=
5171                                    Canadian_Aboriginal}) (726)
5172          \p{Script: Cari}        \p{Script=Carian} (= \p{Script_Extensions=
5173                                    Carian}) (49)
5174          \p{Script: Carian}      \p{Script_Extensions=Carian} (Short:
5175                                    \p{Sc=Cari}, \p{Cari}) (49)
5176          \p{Script: Caucasian_Albanian} \p{Script_Extensions=
5177                                    Caucasian_Albanian} (Short: \p{Sc=Aghb},
5178                                    \p{Aghb}) (53)
5179          \p{Script: Chakma}      (Short: \p{Sc=Cakm}) (71: U+11100..11134,
5180                                    U+11136..11147)
5181          \p{Script: Cham}        \p{Script_Extensions=Cham} (Short: \p{Sc=
5182                                    Cham}, \p{Cham}) (83)
5183          \p{Script: Cher}        \p{Script=Cherokee} (=
5184                                    \p{Script_Extensions=Cherokee}) (172)
5185          \p{Script: Cherokee}    \p{Script_Extensions=Cherokee} (Short:
5186                                    \p{Sc=Cher}, \p{Cher}) (172)
5187          \p{Script: Chorasmian}  \p{Script_Extensions=Chorasmian} (Short:
5188                                    \p{Sc=Chrs}, \p{Chrs}) (28)
5189          \p{Script: Chrs}        \p{Script=Chorasmian} (=
5190                                    \p{Script_Extensions=Chorasmian}) (28)
5191          \p{Script: Common}      (Short: \p{Sc=Zyyy}) (8252: [\x00-\x20!
5192                                    \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
5193                                    \^_`\{\|\}~\x7f-\xa9\xab-\xb9\xbb-\xbf
5194                                    \xd7\xf7], U+02B9..02DF, U+02E5..02E9,
5195                                    U+02EC..02FF, U+0374, U+037E ...)
5196          \p{Script: Copt}        \p{Script=Coptic} (137)
5197          \p{Script: Coptic}      (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
5198                                    U+2C80..2CF3, U+2CF9..2CFF)
5199          \p{Script: Cpmn}        \p{Script=Cypro_Minoan} (99)
5200          \p{Script: Cprt}        \p{Script=Cypriot} (55)
5201          \p{Script: Cuneiform}   \p{Script_Extensions=Cuneiform} (Short:
5202                                    \p{Sc=Xsux}, \p{Xsux}) (1234)
5203          \p{Script: Cypriot}     (Short: \p{Sc=Cprt}) (55: U+10800..10805,
5204                                    U+10808, U+1080A..10835, U+10837..10838,
5205                                    U+1083C, U+1083F)
5206          \p{Script: Cypro_Minoan} (Short: \p{Sc=Cpmn}) (99: U+12F90..12FF2)
5207          \p{Script: Cyrillic}    (Short: \p{Sc=Cyrl}) (443: U+0400..0484,
5208                                    U+0487..052F, U+1C80..1C88, U+1D2B,
5209                                    U+1D78, U+2DE0..2DFF ...)
5210          \p{Script: Cyrl}        \p{Script=Cyrillic} (443)
5211          \p{Script: Deseret}     \p{Script_Extensions=Deseret} (Short:
5212                                    \p{Sc=Dsrt}, \p{Dsrt}) (80)
5213          \p{Script: Deva}        \p{Script=Devanagari} (154)
5214          \p{Script: Devanagari}  (Short: \p{Sc=Deva}) (154: U+0900..0950,
5215                                    U+0955..0963, U+0966..097F, U+A8E0..A8FF)
5216          \p{Script: Diak}        \p{Script=Dives_Akuru} (=
5217                                    \p{Script_Extensions=Dives_Akuru}) (72)
5218          \p{Script: Dives_Akuru} \p{Script_Extensions=Dives_Akuru} (Short:
5219                                    \p{Sc=Diak}, \p{Diak}) (72)
5220          \p{Script: Dogr}        \p{Script=Dogra} (60)
5221          \p{Script: Dogra}       (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
5222          \p{Script: Dsrt}        \p{Script=Deseret} (=
5223                                    \p{Script_Extensions=Deseret}) (80)
5224          \p{Script: Dupl}        \p{Script=Duployan} (143)
5225          \p{Script: Duployan}    (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
5226                                    U+1BC70..1BC7C, U+1BC80..1BC88,
5227                                    U+1BC90..1BC99, U+1BC9C..1BC9F)
5228          \p{Script: Egyp}        \p{Script=Egyptian_Hieroglyphs} (=
5229                                    \p{Script_Extensions=
5230                                    Egyptian_Hieroglyphs}) (1080)
5231          \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
5232                                    Egyptian_Hieroglyphs} (Short: \p{Sc=
5233                                    Egyp}, \p{Egyp}) (1080)
5234          \p{Script: Elba}        \p{Script=Elbasan} (=
5235                                    \p{Script_Extensions=Elbasan}) (40)
5236          \p{Script: Elbasan}     \p{Script_Extensions=Elbasan} (Short:
5237                                    \p{Sc=Elba}, \p{Elba}) (40)
5238          \p{Script: Elym}        \p{Script=Elymaic} (=
5239                                    \p{Script_Extensions=Elymaic}) (23)
5240          \p{Script: Elymaic}     \p{Script_Extensions=Elymaic} (Short:
5241                                    \p{Sc=Elym}, \p{Elym}) (23)
5242          \p{Script: Ethi}        \p{Script=Ethiopic} (=
5243                                    \p{Script_Extensions=Ethiopic}) (523)
5244          \p{Script: Ethiopic}    \p{Script_Extensions=Ethiopic} (Short:
5245                                    \p{Sc=Ethi}, \p{Ethi}) (523)
5246          \p{Script: Geor}        \p{Script=Georgian} (173)
5247          \p{Script: Georgian}    (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
5248                                    U+10C7, U+10CD, U+10D0..10FA,
5249                                    U+10FC..10FF, U+1C90..1CBA ...)
5250          \p{Script: Glag}        \p{Script=Glagolitic} (134)
5251          \p{Script: Glagolitic}  (Short: \p{Sc=Glag}) (134: U+2C00..2C5F,
5252                                    U+1E000..1E006, U+1E008..1E018,
5253                                    U+1E01B..1E021, U+1E023..1E024,
5254                                    U+1E026..1E02A)
5255          \p{Script: Gong}        \p{Script=Gunjala_Gondi} (63)
5256          \p{Script: Gonm}        \p{Script=Masaram_Gondi} (75)
5257          \p{Script: Goth}        \p{Script=Gothic} (= \p{Script_Extensions=
5258                                    Gothic}) (27)
5259          \p{Script: Gothic}      \p{Script_Extensions=Gothic} (Short:
5260                                    \p{Sc=Goth}, \p{Goth}) (27)
5261          \p{Script: Gran}        \p{Script=Grantha} (85)
5262          \p{Script: Grantha}     (Short: \p{Sc=Gran}) (85: U+11300..11303,
5263                                    U+11305..1130C, U+1130F..11310,
5264                                    U+11313..11328, U+1132A..11330,
5265                                    U+11332..11333 ...)
5266          \p{Script: Greek}       (Short: \p{Sc=Grek}) (518: U+0370..0373,
5267                                    U+0375..0377, U+037A..037D, U+037F,
5268                                    U+0384, U+0386 ...)
5269          \p{Script: Grek}        \p{Script=Greek} (518)
5270          \p{Script: Gujarati}    (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
5271                                    U+0A85..0A8D, U+0A8F..0A91,
5272                                    U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
5273                                    ...)
5274          \p{Script: Gujr}        \p{Script=Gujarati} (91)
5275          \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
5276                                    U+11D60..11D65, U+11D67..11D68,
5277                                    U+11D6A..11D8E, U+11D90..11D91,
5278                                    U+11D93..11D98, U+11DA0..11DA9)
5279          \p{Script: Gurmukhi}    (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
5280                                    U+0A05..0A0A, U+0A0F..0A10,
5281                                    U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
5282                                    ...)
5283          \p{Script: Guru}        \p{Script=Gurmukhi} (80)
5284          \p{Script: Han}         (Short: \p{Sc=Han}) (94_215: U+2E80..2E99,
5285                                    U+2E9B..2EF3, U+2F00..2FD5, U+3005,
5286                                    U+3007, U+3021..3029 ...)
5287          \p{Script: Hang}        \p{Script=Hangul} (11_739)
5288          \p{Script: Hangul}      (Short: \p{Sc=Hang}) (11_739:
5289                                    U+1100..11FF, U+302E..302F,
5290                                    U+3131..318E, U+3200..321E,
5291                                    U+3260..327E, U+A960..A97C ...)
5292          \p{Script: Hani}        \p{Script=Han} (94_215)
5293          \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
5294                                    U+10D00..10D27, U+10D30..10D39)
5295          \p{Script: Hano}        \p{Script=Hanunoo} (21)
5296          \p{Script: Hanunoo}     (Short: \p{Sc=Hano}) (21: U+1720..1734)
5297          \p{Script: Hatr}        \p{Script=Hatran} (= \p{Script_Extensions=
5298                                    Hatran}) (26)
5299          \p{Script: Hatran}      \p{Script_Extensions=Hatran} (Short:
5300                                    \p{Sc=Hatr}, \p{Hatr}) (26)
5301          \p{Script: Hebr}        \p{Script=Hebrew} (= \p{Script_Extensions=
5302                                    Hebrew}) (134)
5303          \p{Script: Hebrew}      \p{Script_Extensions=Hebrew} (Short:
5304                                    \p{Sc=Hebr}, \p{Hebr}) (134)
5305          \p{Script: Hira}        \p{Script=Hiragana} (380)
5306          \p{Script: Hiragana}    (Short: \p{Sc=Hira}) (380: U+3041..3096,
5307                                    U+309D..309F, U+1B001..1B11F,
5308                                    U+1B150..1B152, U+1F200)
5309          \p{Script: Hluw}        \p{Script=Anatolian_Hieroglyphs} (=
5310                                    \p{Script_Extensions=
5311                                    Anatolian_Hieroglyphs}) (583)
5312          \p{Script: Hmng}        \p{Script=Pahawh_Hmong} (=
5313                                    \p{Script_Extensions=Pahawh_Hmong}) (127)
5314          \p{Script: Hmnp}        \p{Script=Nyiakeng_Puachue_Hmong} (=
5315                                    \p{Script_Extensions=
5316                                    Nyiakeng_Puachue_Hmong}) (71)
5317          \p{Script: Hung}        \p{Script=Old_Hungarian} (=
5318                                    \p{Script_Extensions=Old_Hungarian})
5319                                    (108)
5320          \p{Script: Imperial_Aramaic} \p{Script_Extensions=
5321                                    Imperial_Aramaic} (Short: \p{Sc=Armi},
5322                                    \p{Armi}) (31)
5323          \p{Script: Inherited}   (Short: \p{Sc=Zinh}) (657: U+0300..036F,
5324                                    U+0485..0486, U+064B..0655, U+0670,
5325                                    U+0951..0954, U+1AB0..1ACE ...)
5326          \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
5327                                    Inscriptional_Pahlavi} (Short: \p{Sc=
5328                                    Phli}, \p{Phli}) (27)
5329          \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
5330                                    Inscriptional_Parthian} (Short: \p{Sc=
5331                                    Prti}, \p{Prti}) (30)
5332          \p{Script: Ital}        \p{Script=Old_Italic} (=
5333                                    \p{Script_Extensions=Old_Italic}) (39)
5334          \p{Script: Java}        \p{Script=Javanese} (90)
5335          \p{Script: Javanese}    (Short: \p{Sc=Java}) (90: U+A980..A9CD,
5336                                    U+A9D0..A9D9, U+A9DE..A9DF)
5337          \p{Script: Kaithi}      (Short: \p{Sc=Kthi}) (68: U+11080..110C2,
5338                                    U+110CD)
5339          \p{Script: Kali}        \p{Script=Kayah_Li} (47)
5340          \p{Script: Kana}        \p{Script=Katakana} (320)
5341          \p{Script: Kannada}     (Short: \p{Sc=Knda}) (90: U+0C80..0C8C,
5342                                    U+0C8E..0C90, U+0C92..0CA8,
5343                                    U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
5344                                    ...)
5345          \p{Script: Katakana}    (Short: \p{Sc=Kana}) (320: U+30A1..30FA,
5346                                    U+30FD..30FF, U+31F0..31FF,
5347                                    U+32D0..32FE, U+3300..3357, U+FF66..FF6F
5348                                    ...)
5349          \p{Script: Kayah_Li}    (Short: \p{Sc=Kali}) (47: U+A900..A92D,
5350                                    U+A92F)
5351          \p{Script: Khar}        \p{Script=Kharoshthi} (=
5352                                    \p{Script_Extensions=Kharoshthi}) (68)
5353          \p{Script: Kharoshthi}  \p{Script_Extensions=Kharoshthi} (Short:
5354                                    \p{Sc=Khar}, \p{Khar}) (68)
5355          \p{Script: Khitan_Small_Script} \p{Script_Extensions=
5356                                    Khitan_Small_Script} (Short: \p{Sc=
5357                                    Kits}, \p{Kits}) (471)
5358          \p{Script: Khmer}       \p{Script_Extensions=Khmer} (Short: \p{Sc=
5359                                    Khmr}, \p{Khmr}) (146)
5360          \p{Script: Khmr}        \p{Script=Khmer} (= \p{Script_Extensions=
5361                                    Khmer}) (146)
5362          \p{Script: Khoj}        \p{Script=Khojki} (62)
5363          \p{Script: Khojki}      (Short: \p{Sc=Khoj}) (62: U+11200..11211,
5364                                    U+11213..1123E)
5365          \p{Script: Khudawadi}   (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
5366                                    U+112F0..112F9)
5367          \p{Script: Kits}        \p{Script=Khitan_Small_Script} (=
5368                                    \p{Script_Extensions=
5369                                    Khitan_Small_Script}) (471)
5370          \p{Script: Knda}        \p{Script=Kannada} (90)
5371          \p{Script: Kthi}        \p{Script=Kaithi} (68)
5372          \p{Script: Lana}        \p{Script=Tai_Tham} (=
5373                                    \p{Script_Extensions=Tai_Tham}) (127)
5374          \p{Script: Lao}         \p{Script_Extensions=Lao} (Short: \p{Sc=
5375                                    Lao}, \p{Lao}) (82)
5376          \p{Script: Laoo}        \p{Script=Lao} (= \p{Script_Extensions=
5377                                    Lao}) (82)
5378          \p{Script: Latin}       (Short: \p{Sc=Latn}) (1475: [A-Za-z\xaa
5379                                    \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
5380                                    U+0100..02B8, U+02E0..02E4,
5381                                    U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
5382                                    ...)
5383          \p{Script: Latn}        \p{Script=Latin} (1475)
5384          \p{Script: Lepc}        \p{Script=Lepcha} (= \p{Script_Extensions=
5385                                    Lepcha}) (74)
5386          \p{Script: Lepcha}      \p{Script_Extensions=Lepcha} (Short:
5387                                    \p{Sc=Lepc}, \p{Lepc}) (74)
5388          \p{Script: Limb}        \p{Script=Limbu} (68)
5389          \p{Script: Limbu}       (Short: \p{Sc=Limb}) (68: U+1900..191E,
5390                                    U+1920..192B, U+1930..193B, U+1940,
5391                                    U+1944..194F)
5392          \p{Script: Lina}        \p{Script=Linear_A} (341)
5393          \p{Script: Linb}        \p{Script=Linear_B} (211)
5394          \p{Script: Linear_A}    (Short: \p{Sc=Lina}) (341: U+10600..10736,
5395                                    U+10740..10755, U+10760..10767)
5396          \p{Script: Linear_B}    (Short: \p{Sc=Linb}) (211: U+10000..1000B,
5397                                    U+1000D..10026, U+10028..1003A,
5398                                    U+1003C..1003D, U+1003F..1004D,
5399                                    U+10050..1005D ...)
5400          \p{Script: Lisu}        \p{Script_Extensions=Lisu} (Short: \p{Sc=
5401                                    Lisu}, \p{Lisu}) (49)
5402          \p{Script: Lyci}        \p{Script=Lycian} (= \p{Script_Extensions=
5403                                    Lycian}) (29)
5404          \p{Script: Lycian}      \p{Script_Extensions=Lycian} (Short:
5405                                    \p{Sc=Lyci}, \p{Lyci}) (29)
5406          \p{Script: Lydi}        \p{Script=Lydian} (= \p{Script_Extensions=
5407                                    Lydian}) (27)
5408          \p{Script: Lydian}      \p{Script_Extensions=Lydian} (Short:
5409                                    \p{Sc=Lydi}, \p{Lydi}) (27)
5410          \p{Script: Mahajani}    (Short: \p{Sc=Mahj}) (39: U+11150..11176)
5411          \p{Script: Mahj}        \p{Script=Mahajani} (39)
5412          \p{Script: Maka}        \p{Script=Makasar} (=
5413                                    \p{Script_Extensions=Makasar}) (25)
5414          \p{Script: Makasar}     \p{Script_Extensions=Makasar} (Short:
5415                                    \p{Sc=Maka}, \p{Maka}) (25)
5416          \p{Script: Malayalam}   (Short: \p{Sc=Mlym}) (118: U+0D00..0D0C,
5417                                    U+0D0E..0D10, U+0D12..0D44,
5418                                    U+0D46..0D48, U+0D4A..0D4F, U+0D54..0D63
5419                                    ...)
5420          \p{Script: Mand}        \p{Script=Mandaic} (29)
5421          \p{Script: Mandaic}     (Short: \p{Sc=Mand}) (29: U+0840..085B,
5422                                    U+085E)
5423          \p{Script: Mani}        \p{Script=Manichaean} (51)
5424          \p{Script: Manichaean}  (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
5425                                    U+10AEB..10AF6)
5426          \p{Script: Marc}        \p{Script=Marchen} (=
5427                                    \p{Script_Extensions=Marchen}) (68)
5428          \p{Script: Marchen}     \p{Script_Extensions=Marchen} (Short:
5429                                    \p{Sc=Marc}, \p{Marc}) (68)
5430          \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
5431                                    U+11D00..11D06, U+11D08..11D09,
5432                                    U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
5433                                    U+11D3F..11D47 ...)
5434          \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
5435                                    \p{Sc=Medf}, \p{Medf}) (91)
5436          \p{Script: Medf}        \p{Script=Medefaidrin} (=
5437                                    \p{Script_Extensions=Medefaidrin}) (91)
5438          \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
5439                                    (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
5440          \p{Script: Mend}        \p{Script=Mende_Kikakui} (=
5441                                    \p{Script_Extensions=Mende_Kikakui})
5442                                    (213)
5443          \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
5444                                    (Short: \p{Sc=Mend}, \p{Mend}) (213)
5445          \p{Script: Merc}        \p{Script=Meroitic_Cursive} (=
5446                                    \p{Script_Extensions=Meroitic_Cursive})
5447                                    (90)
5448          \p{Script: Mero}        \p{Script=Meroitic_Hieroglyphs} (=
5449                                    \p{Script_Extensions=
5450                                    Meroitic_Hieroglyphs}) (32)
5451          \p{Script: Meroitic_Cursive} \p{Script_Extensions=
5452                                    Meroitic_Cursive} (Short: \p{Sc=Merc},
5453                                    \p{Merc}) (90)
5454          \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
5455                                    Meroitic_Hieroglyphs} (Short: \p{Sc=
5456                                    Mero}, \p{Mero}) (32)
5457          \p{Script: Miao}        \p{Script_Extensions=Miao} (Short: \p{Sc=
5458                                    Miao}, \p{Miao}) (149)
5459          \p{Script: Mlym}        \p{Script=Malayalam} (118)
5460          \p{Script: Modi}        (Short: \p{Sc=Modi}) (79: U+11600..11644,
5461                                    U+11650..11659)
5462          \p{Script: Mong}        \p{Script=Mongolian} (168)
5463          \p{Script: Mongolian}   (Short: \p{Sc=Mong}) (168: U+1800..1801,
5464                                    U+1804, U+1806..1819, U+1820..1878,
5465                                    U+1880..18AA, U+11660..1166C)
5466          \p{Script: Mro}         \p{Script_Extensions=Mro} (Short: \p{Sc=
5467                                    Mro}, \p{Mro}) (43)
5468          \p{Script: Mroo}        \p{Script=Mro} (= \p{Script_Extensions=
5469                                    Mro}) (43)
5470          \p{Script: Mtei}        \p{Script=Meetei_Mayek} (=
5471                                    \p{Script_Extensions=Meetei_Mayek}) (79)
5472          \p{Script: Mult}        \p{Script=Multani} (38)
5473          \p{Script: Multani}     (Short: \p{Sc=Mult}) (38: U+11280..11286,
5474                                    U+11288, U+1128A..1128D, U+1128F..1129D,
5475                                    U+1129F..112A9)
5476          \p{Script: Myanmar}     (Short: \p{Sc=Mymr}) (223: U+1000..109F,
5477                                    U+A9E0..A9FE, U+AA60..AA7F)
5478          \p{Script: Mymr}        \p{Script=Myanmar} (223)
5479          \p{Script: Nabataean}   \p{Script_Extensions=Nabataean} (Short:
5480                                    \p{Sc=Nbat}, \p{Nbat}) (40)
5481          \p{Script: Nand}        \p{Script=Nandinagari} (65)
5482          \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7,
5483                                    U+119AA..119D7, U+119DA..119E4)
5484          \p{Script: Narb}        \p{Script=Old_North_Arabian} (=
5485                                    \p{Script_Extensions=Old_North_Arabian})
5486                                    (32)
5487          \p{Script: Nbat}        \p{Script=Nabataean} (=
5488                                    \p{Script_Extensions=Nabataean}) (40)
5489          \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
5490                                    \p{Sc=Talu}, \p{Talu}) (83)
5491          \p{Script: Newa}        \p{Script_Extensions=Newa} (Short: \p{Sc=
5492                                    Newa}, \p{Newa}) (97)
5493          \p{Script: Nko}         (Short: \p{Sc=Nko}) (62: U+07C0..07FA,
5494                                    U+07FD..07FF)
5495          \p{Script: Nkoo}        \p{Script=Nko} (62)
5496          \p{Script: Nshu}        \p{Script=Nushu} (= \p{Script_Extensions=
5497                                    Nushu}) (397)
5498          \p{Script: Nushu}       \p{Script_Extensions=Nushu} (Short: \p{Sc=
5499                                    Nshu}, \p{Nshu}) (397)
5500          \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
5501                                    Nyiakeng_Puachue_Hmong} (Short: \p{Sc=
5502                                    Hmnp}, \p{Hmnp}) (71)
5503          \p{Script: Ogam}        \p{Script=Ogham} (= \p{Script_Extensions=
5504                                    Ogham}) (29)
5505          \p{Script: Ogham}       \p{Script_Extensions=Ogham} (Short: \p{Sc=
5506                                    Ogam}, \p{Ogam}) (29)
5507          \p{Script: Ol_Chiki}    \p{Script_Extensions=Ol_Chiki} (Short:
5508                                    \p{Sc=Olck}, \p{Olck}) (48)
5509          \p{Script: Olck}        \p{Script=Ol_Chiki} (=
5510                                    \p{Script_Extensions=Ol_Chiki}) (48)
5511          \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
5512                                    (Short: \p{Sc=Hung}, \p{Hung}) (108)
5513          \p{Script: Old_Italic}  \p{Script_Extensions=Old_Italic} (Short:
5514                                    \p{Sc=Ital}, \p{Ital}) (39)
5515          \p{Script: Old_North_Arabian} \p{Script_Extensions=
5516                                    Old_North_Arabian} (Short: \p{Sc=Narb},
                                    \p{Narb}) (32)
          \p{Script: Old_Permic}  (Short: \p{Sc=Perm}) (43: U+10350..1037A)
          \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
                                    \p{Sc=Xpeo}, \p{Xpeo}) (50)
          \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
                                    \p{Sc=Sogo}, \p{Sogo}) (40)
          \p{Script: Old_South_Arabian} \p{Script_Extensions=
                                    Old_South_Arabian} (Short: \p{Sc=Sarb},
                                    \p{Sarb}) (32)
          \p{Script: Old_Turkic}  \p{Script_Extensions=Old_Turkic} (Short:
                                    \p{Sc=Orkh}, \p{Orkh}) (73)
          \p{Script: Old_Uyghur}  (Short: \p{Sc=Ougr}) (26: U+10F70..10F89)
          \p{Script: Oriya}       (Short: \p{Sc=Orya}) (91: U+0B01..0B03,
                                    U+0B05..0B0C, U+0B0F..0B10,
                                    U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
                                    ...)
          \p{Script: Orkh}        \p{Script=Old_Turkic} (=
                                    \p{Script_Extensions=Old_Turkic}) (73)
          \p{Script: Orya}        \p{Script=Oriya} (91)
          \p{Script: Osage}       \p{Script_Extensions=Osage} (Short: \p{Sc=
                                    Osge}, \p{Osge}) (72)
          \p{Script: Osge}        \p{Script=Osage} (= \p{Script_Extensions=
                                    Osage}) (72)
          \p{Script: Osma}        \p{Script=Osmanya} (=
                                    \p{Script_Extensions=Osmanya}) (40)
          \p{Script: Osmanya}     \p{Script_Extensions=Osmanya} (Short:
                                    \p{Sc=Osma}, \p{Osma}) (40)
          \p{Script: Ougr}        \p{Script=Old_Uyghur} (26)
          \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
                                    (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
          \p{Script: Palm}        \p{Script=Palmyrene} (=
                                    \p{Script_Extensions=Palmyrene}) (32)
          \p{Script: Palmyrene}   \p{Script_Extensions=Palmyrene} (Short:
                                    \p{Sc=Palm}, \p{Palm}) (32)
          \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
                                    \p{Sc=Pauc}, \p{Pauc}) (57)
          \p{Script: Pauc}        \p{Script=Pau_Cin_Hau} (=
                                    \p{Script_Extensions=Pau_Cin_Hau}) (57)
          \p{Script: Perm}        \p{Script=Old_Permic} (43)
          \p{Script: Phag}        \p{Script=Phags_Pa} (56)
          \p{Script: Phags_Pa}    (Short: \p{Sc=Phag}) (56: U+A840..A877)
          \p{Script: Phli}        \p{Script=Inscriptional_Pahlavi} (=
                                    \p{Script_Extensions=
                                    Inscriptional_Pahlavi}) (27)
          \p{Script: Phlp}        \p{Script=Psalter_Pahlavi} (29)
          \p{Script: Phnx}        \p{Script=Phoenician} (=
                                    \p{Script_Extensions=Phoenician}) (29)
          \p{Script: Phoenician}  \p{Script_Extensions=Phoenician} (Short:
                                    \p{Sc=Phnx}, \p{Phnx}) (29)
          \p{Script: Plrd}        \p{Script=Miao} (= \p{Script_Extensions=
                                    Miao}) (149)
          \p{Script: Prti}        \p{Script=Inscriptional_Parthian} (=
                                    \p{Script_Extensions=
                                    Inscriptional_Parthian}) (30)
          \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
                                    U+10B80..10B91, U+10B99..10B9C,
                                    U+10BA9..10BAF)
          \p{Script: Qaac}        \p{Script=Coptic} (137)
          \p{Script: Qaai}        \p{Script=Inherited} (657)
          \p{Script: Rejang}      \p{Script_Extensions=Rejang} (Short:
                                    \p{Sc=Rjng}, \p{Rjng}) (37)
          \p{Script: Rjng}        \p{Script=Rejang} (= \p{Script_Extensions=
                                    Rejang}) (37)
          \p{Script: Rohg}        \p{Script=Hanifi_Rohingya} (50)
          \p{Script: Runic}       \p{Script_Extensions=Runic} (Short: \p{Sc=
                                    Runr}, \p{Runr}) (86)
          \p{Script: Runr}        \p{Script=Runic} (= \p{Script_Extensions=
                                    Runic}) (86)
          \p{Script: Samaritan}   \p{Script_Extensions=Samaritan} (Short:
                                    \p{Sc=Samr}, \p{Samr}) (61)
          \p{Script: Samr}        \p{Script=Samaritan} (=
                                    \p{Script_Extensions=Samaritan}) (61)
          \p{Script: Sarb}        \p{Script=Old_South_Arabian} (=
                                    \p{Script_Extensions=Old_South_Arabian})
                                    (32)
          \p{Script: Saur}        \p{Script=Saurashtra} (=
                                    \p{Script_Extensions=Saurashtra}) (82)
          \p{Script: Saurashtra}  \p{Script_Extensions=Saurashtra} (Short:
                                    \p{Sc=Saur}, \p{Saur}) (82)
          \p{Script: Sgnw}        \p{Script=SignWriting} (=
                                    \p{Script_Extensions=SignWriting}) (672)
          \p{Script: Sharada}     (Short: \p{Sc=Shrd}) (96: U+11180..111DF)
          \p{Script: Shavian}     \p{Script_Extensions=Shavian} (Short:
                                    \p{Sc=Shaw}, \p{Shaw}) (48)
          \p{Script: Shaw}        \p{Script=Shavian} (=
                                    \p{Script_Extensions=Shavian}) (48)
          \p{Script: Shrd}        \p{Script=Sharada} (96)
          \p{Script: Sidd}        \p{Script=Siddham} (=
                                    \p{Script_Extensions=Siddham}) (92)
          \p{Script: Siddham}     \p{Script_Extensions=Siddham} (Short:
                                    \p{Sc=Sidd}, \p{Sidd}) (92)
          \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
                                    \p{Sc=Sgnw}, \p{Sgnw}) (672)
          \p{Script: Sind}        \p{Script=Khudawadi} (69)
          \p{Script: Sinh}        \p{Script=Sinhala} (111)
          \p{Script: Sinhala}     (Short: \p{Sc=Sinh}) (111: U+0D81..0D83,
                                    U+0D85..0D96, U+0D9A..0DB1,
                                    U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
          \p{Script: Sogd}        \p{Script=Sogdian} (42)
          \p{Script: Sogdian}     (Short: \p{Sc=Sogd}) (42: U+10F30..10F59)
          \p{Script: Sogo}        \p{Script=Old_Sogdian} (=
                                    \p{Script_Extensions=Old_Sogdian}) (40)
          \p{Script: Sora}        \p{Script=Sora_Sompeng} (=
                                    \p{Script_Extensions=Sora_Sompeng}) (35)
          \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
                                    (Short: \p{Sc=Sora}, \p{Sora}) (35)
          \p{Script: Soyo}        \p{Script=Soyombo} (=
                                    \p{Script_Extensions=Soyombo}) (83)
          \p{Script: Soyombo}     \p{Script_Extensions=Soyombo} (Short:
                                    \p{Sc=Soyo}, \p{Soyo}) (83)
          \p{Script: Sund}        \p{Script=Sundanese} (=
                                    \p{Script_Extensions=Sundanese}) (72)
          \p{Script: Sundanese}   \p{Script_Extensions=Sundanese} (Short:
                                    \p{Sc=Sund}, \p{Sund}) (72)
          \p{Script: Sylo}        \p{Script=Syloti_Nagri} (45)
          \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (45: U+A800..A82C)
          \p{Script: Syrc}        \p{Script=Syriac} (88)
          \p{Script: Syriac}      (Short: \p{Sc=Syrc}) (88: U+0700..070D,
                                    U+070F..074A, U+074D..074F, U+0860..086A)
          \p{Script: Tagalog}     (Short: \p{Sc=Tglg}) (23: U+1700..1715,
                                    U+171F)
          \p{Script: Tagb}        \p{Script=Tagbanwa} (18)
          \p{Script: Tagbanwa}    (Short: \p{Sc=Tagb}) (18: U+1760..176C,
                                    U+176E..1770, U+1772..1773)
          \p{Script: Tai_Le}      (Short: \p{Sc=Tale}) (35: U+1950..196D,
                                    U+1970..1974)
          \p{Script: Tai_Tham}    \p{Script_Extensions=Tai_Tham} (Short:
                                    \p{Sc=Lana}, \p{Lana}) (127)
          \p{Script: Tai_Viet}    \p{Script_Extensions=Tai_Viet} (Short:
                                    \p{Sc=Tavt}, \p{Tavt}) (72)
          \p{Script: Takr}        \p{Script=Takri} (68)
          \p{Script: Takri}       (Short: \p{Sc=Takr}) (68: U+11680..116B9,
                                    U+116C0..116C9)
          \p{Script: Tale}        \p{Script=Tai_Le} (35)
          \p{Script: Talu}        \p{Script=New_Tai_Lue} (=
                                    \p{Script_Extensions=New_Tai_Lue}) (83)
          \p{Script: Tamil}       (Short: \p{Sc=Taml}) (123: U+0B82..0B83,
                                    U+0B85..0B8A, U+0B8E..0B90,
                                    U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
          \p{Script: Taml}        \p{Script=Tamil} (123)
          \p{Script: Tang}        \p{Script=Tangut} (= \p{Script_Extensions=
                                    Tangut}) (6914)
          \p{Script: Tangsa}      \p{Script_Extensions=Tangsa} (Short:
                                    \p{Sc=Tnsa}, \p{Tnsa}) (89)
          \p{Script: Tangut}      \p{Script_Extensions=Tangut} (Short:
                                    \p{Sc=Tang}, \p{Tang}) (6914)
          \p{Script: Tavt}        \p{Script=Tai_Viet} (=
                                    \p{Script_Extensions=Tai_Viet}) (72)
          \p{Script: Telu}        \p{Script=Telugu} (100)
          \p{Script: Telugu}      (Short: \p{Sc=Telu}) (100: U+0C00..0C0C,
                                    U+0C0E..0C10, U+0C12..0C28,
                                    U+0C2A..0C39, U+0C3C..0C44, U+0C46..0C48
                                    ...)
          \p{Script: Tfng}        \p{Script=Tifinagh} (=
                                    \p{Script_Extensions=Tifinagh}) (59)
          \p{Script: Tglg}        \p{Script=Tagalog} (23)
          \p{Script: Thaa}        \p{Script=Thaana} (50)
          \p{Script: Thaana}      (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
          \p{Script: Thai}        \p{Script_Extensions=Thai} (Short: \p{Sc=
                                    Thai}, \p{Thai}) (86)
          \p{Script: Tibetan}     \p{Script_Extensions=Tibetan} (Short:
                                    \p{Sc=Tibt}, \p{Tibt}) (207)
          \p{Script: Tibt}        \p{Script=Tibetan} (=
                                    \p{Script_Extensions=Tibetan}) (207)
          \p{Script: Tifinagh}    \p{Script_Extensions=Tifinagh} (Short:
                                    \p{Sc=Tfng}, \p{Tfng}) (59)
          \p{Script: Tirh}        \p{Script=Tirhuta} (82)
          \p{Script: Tirhuta}     (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
                                    U+114D0..114D9)
          \p{Script: Tnsa}        \p{Script=Tangsa} (= \p{Script_Extensions=
                                    Tangsa}) (89)
          \p{Script: Toto}        \p{Script_Extensions=Toto} (Short: \p{Sc=
                                    Toto}, \p{Toto}) (31)
          \p{Script: Ugar}        \p{Script=Ugaritic} (=
                                    \p{Script_Extensions=Ugaritic}) (31)
          \p{Script: Ugaritic}    \p{Script_Extensions=Ugaritic} (Short:
                                    \p{Sc=Ugar}, \p{Ugar}) (31)
          \p{Script: Unknown}     \p{Script_Extensions=Unknown} (Short:
                                    \p{Sc=Zzzz}, \p{Zzzz}) (969_350 plus all
                                    above-Unicode code points)
          \p{Script: Vai}         \p{Script_Extensions=Vai} (Short: \p{Sc=
                                    Vai}, \p{Vai}) (300)
          \p{Script: Vaii}        \p{Script=Vai} (= \p{Script_Extensions=
                                    Vai}) (300)
          \p{Script: Vith}        \p{Script=Vithkuqi} (=
                                    \p{Script_Extensions=Vithkuqi}) (70)
          \p{Script: Vithkuqi}    \p{Script_Extensions=Vithkuqi} (Short:
                                    \p{Sc=Vith}, \p{Vith}) (70)
          \p{Script: Wancho}      \p{Script_Extensions=Wancho} (Short:
                                    \p{Sc=Wcho}, \p{Wcho}) (59)
          \p{Script: Wara}        \p{Script=Warang_Citi} (=
                                    \p{Script_Extensions=Warang_Citi}) (84)
          \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
                                    \p{Sc=Wara}, \p{Wara}) (84)
          \p{Script: Wcho}        \p{Script=Wancho} (= \p{Script_Extensions=
                                    Wancho}) (59)
          \p{Script: Xpeo}        \p{Script=Old_Persian} (=
                                    \p{Script_Extensions=Old_Persian}) (50)
          \p{Script: Xsux}        \p{Script=Cuneiform} (=
                                    \p{Script_Extensions=Cuneiform}) (1234)
          \p{Script: Yezi}        \p{Script=Yezidi} (47)
          \p{Script: Yezidi}      (Short: \p{Sc=Yezi}) (47: U+10E80..10EA9,
                                    U+10EAB..10EAD, U+10EB0..10EB1)
          \p{Script: Yi}          (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
                                    U+A490..A4C6)
          \p{Script: Yiii}        \p{Script=Yi} (1220)
          \p{Script: Zanabazar_Square} \p{Script_Extensions=
                                    Zanabazar_Square} (Short: \p{Sc=Zanb},
                                    \p{Zanb}) (72)
          \p{Script: Zanb}        \p{Script=Zanabazar_Square} (=
                                    \p{Script_Extensions=Zanabazar_Square})
                                    (72)
          \p{Script: Zinh}        \p{Script=Inherited} (657)
          \p{Script: Zyyy}        \p{Script=Common} (8252)
          \p{Script: Zzzz}        \p{Script=Unknown} (=
                                    \p{Script_Extensions=Unknown}) (969_350
                                    plus all above-Unicode code points)
          \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (90:
                                    U+061F, U+0640, U+1E900..1E94B,
                                    U+1E950..1E959, U+1E95E..1E95F)
          \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (90)
          \p{Script_Extensions: Aghb} \p{Script_Extensions=
                                    Caucasian_Albanian} (53)
          \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (65:
                                    U+11700..1171A, U+1171D..1172B,
                                    U+11730..11746)
          \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
                                    \p{Hluw}) (583: U+14400..14646)
          \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1411)
          \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
                                    (1411: U+0600..0604, U+0606..06DC,
                                    U+06DE..06FF, U+0750..077F,
                                    U+0870..088E, U+0890..0891 ...)
          \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
                                    (96: U+0531..0556, U+0559..058A,
                                    U+058D..058F, U+FB13..FB17)
          \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
                                    (31)
          \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
          \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
                                    (61: U+10B00..10B35, U+10B39..10B3F)
          \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
          \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (124)
          \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
                                    (124: U+1B00..1B4C, U+1B50..1B7E)
          \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
          \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
                                    U+A6A0..A6F7, U+16800..16A38)
          \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
          \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
                                    (36: U+16AD0..16AED, U+16AF0..16AF5)
          \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
                                    U+1BC0..1BF3, U+1BFC..1BFF)
          \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
          \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
          \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
                                    (113: U+0951..0952, U+0964..0965,
                                    U+0980..0983, U+0985..098C,
                                    U+098F..0990, U+0993..09A8 ...)
          \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
                                    (97: U+11C00..11C08, U+11C0A..11C36,
                                    U+11C38..11C45, U+11C50..11C6C)
          \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
          \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (117)
          \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
                                    (117: U+02EA..02EB, U+3001..3003,
                                    U+3008..3011, U+3013..301F,
                                    U+302A..302D, U+3030 ...)
          \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (115)
          \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
                                    (115: U+11000..1104D, U+11052..11075,
                                    U+1107F)
          \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
          \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
                                    (256: U+2800..28FF)
          \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
          \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
                                    (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
          \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
          \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
                                    U+1735..1736, U+1740..1753)
          \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (91)
          \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
                                    \p{Cans}) (726: U+1400..167F,
                                    U+18B0..18F5, U+11AB0..11ABF)
          \p{Script_Extensions: Cans} \p{Script_Extensions=
                                    Canadian_Aboriginal} (726)
          \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
          \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
                                    U+102A0..102D0)
          \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
                                    \p{Aghb}) (53: U+10530..10563, U+1056F)
          \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (91:
                                    U+09E6..09EF, U+1040..1049,
                                    U+11100..11134, U+11136..11147)
          \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
                                    U+AA00..AA36, U+AA40..AA4D,
                                    U+AA50..AA59, U+AA5C..AA5F)
          \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
          \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
                                    (172: U+13A0..13F5, U+13F8..13FD,
                                    U+AB70..ABBF)
          \p{Script_Extensions: Chorasmian} (Short: \p{Scx=Chrs}, \p{Chrs})
                                    (28: U+10FB0..10FCB)
          \p{Script_Extensions: Chrs} \p{Script_Extensions=Chorasmian} (28)
          \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
                                    (7824: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
                                    \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xa9
                                    \xab-\xb9\xbb-\xbf\xd7\xf7],
                                    U+02B9..02DF, U+02E5..02E9,
                                    U+02EC..02FF, U+0374, U+037E ...)
          \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
          \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
                                    (165: U+03E2..03EF, U+2C80..2CF3,
                                    U+2CF9..2CFF, U+102E0..102FB)
          \p{Script_Extensions: Cpmn} \p{Script_Extensions=Cypro_Minoan}
                                    (101)
          \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
          \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
                                    (1234: U+12000..12399, U+12400..1246E,
                                    U+12470..12474, U+12480..12543)
          \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
                                    (112: U+10100..10102, U+10107..10133,
                                    U+10137..1013F, U+10800..10805, U+10808,
                                    U+1080A..10835 ...)
          \p{Script_Extensions: Cypro_Minoan} (Short: \p{Scx=Cpmn},
                                    \p{Cpmn}) (101: U+10100..10101,
                                    U+12F90..12FF2)
          \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
                                    (447: U+0400..052F, U+1C80..1C88,
                                    U+1D2B, U+1D78, U+1DF8, U+2DE0..2DFF ...)
          \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (447)
          \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
                                    (80: U+10400..1044F)
          \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
          \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
                                    (210: U+0900..0952, U+0955..097F,
                                    U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
                                    U+A830..A839 ...)
          \p{Script_Extensions: Diak} \p{Script_Extensions=Dives_Akuru} (72)
          \p{Script_Extensions: Dives_Akuru} (Short: \p{Scx=Diak}, \p{Diak})
                                    (72: U+11900..11906, U+11909,
                                    U+1190C..11913, U+11915..11916,
                                    U+11918..11935, U+11937..11938 ...)
          \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
          \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
                                    U+0964..096F, U+A830..A839,
                                    U+11800..1183B)
          \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
          \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
          \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
                                    (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
                                    U+1BC80..1BC88, U+1BC90..1BC99,
                                    U+1BC9C..1BCA3)
          \p{Script_Extensions: Egyp} \p{Script_Extensions=
                                    Egyptian_Hieroglyphs} (1080)
          \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
                                    \p{Egyp}) (1080: U+13000..1342E,
                                    U+13430..13438)
          \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
          \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
                                    (40: U+10500..10527)
          \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
          \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
                                    (23: U+10FE0..10FF6)
          \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (523)
          \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
                                    (523: U+1200..1248, U+124A..124D,
                                    U+1250..1256, U+1258, U+125A..125D,
                                    U+1260..1288 ...)
          \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (174)
          \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
                                    (174: U+10A0..10C5, U+10C7, U+10CD,
                                    U+10D0..10FF, U+1C90..1CBA, U+1CBD..1CBF
                                    ...)
          \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (138)
          \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
                                    (138: U+0484, U+0487, U+2C00..2C5F,
                                    U+2E43, U+A66F, U+1E000..1E006 ...)
          \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
                                    (65)
          \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
                                    (77)
          \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
          \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
                                    U+10330..1034A)
          \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
          \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
                                    (116: U+0951..0952, U+0964..0965,
                                    U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
                                    U+1CF2..1CF4 ...)
          \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
                                    U+0342, U+0345, U+0370..0373,
                                    U+0375..0377, U+037A..037D, U+037F ...)
          \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
          \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
                                    (105: U+0951..0952, U+0964..0965,
                                    U+0A81..0A83, U+0A85..0A8D,
                                    U+0A8F..0A91, U+0A93..0AA8 ...)
          \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
          \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
                                    \p{Gong}) (65: U+0964..0965,
                                    U+11D60..11D65, U+11D67..11D68,
                                    U+11D6A..11D8E, U+11D90..11D91,
                                    U+11D93..11D98 ...)
          \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
                                    (94: U+0951..0952, U+0964..0965,
                                    U+0A01..0A03, U+0A05..0A0A,
                                    U+0A0F..0A10, U+0A13..0A28 ...)
          \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
          \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (94_503:
                                    U+2E80..2E99, U+2E9B..2EF3,
                                    U+2F00..2FD5, U+3001..3003,
                                    U+3005..3011, U+3013..301F ...)
          \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
          \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
                                    (11_775: U+1100..11FF, U+3001..3003,
                                    U+3008..3011, U+3013..301F,
                                    U+302E..3030, U+3037 ...)
          \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (94_503)
          \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
                                    \p{Rohg}) (55: U+060C, U+061B, U+061F,
                                    U+0640, U+06D4, U+10D00..10D27 ...)
          \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
          \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
                                    (23: U+1720..1736)
          \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
          \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
                                    U+108E0..108F2, U+108F4..108F5,
                                    U+108FB..108FF)
          \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
          \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
                                    (134: U+0591..05C7, U+05D0..05EA,
                                    U+05EF..05F4, U+FB1D..FB36,
                                    U+FB38..FB3C, U+FB3E ...)
          \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (432)
          \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
                                    (432: U+3001..3003, U+3008..3011,
                                    U+3013..301F, U+3030..3035, U+3037,
                                    U+303C..303D ...)
          \p{Script_Extensions: Hluw} \p{Script_Extensions=
                                    Anatolian_Hieroglyphs} (583)
          \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
                                    (127)
          \p{Script_Extensions: Hmnp} \p{Script_Extensions=
                                    Nyiakeng_Puachue_Hmong} (71)
          \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
                                    (108)
          \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
                                    \p{Armi}) (31: U+10840..10855,
                                    U+10857..1085F)
          \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
                                    (586: U+0300..0341, U+0343..0344,
                                    U+0346..0362, U+0953..0954,
                                    U+1AB0..1ACE, U+1DC2..1DF7 ...)
          \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
                                    \p{Phli}) (27: U+10B60..10B72,
                                    U+10B78..10B7F)
          \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
                                    Prti}, \p{Prti}) (30: U+10B40..10B55,
                                    U+10B58..10B5F)
          \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
          \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
          \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
                                    (91: U+A980..A9CD, U+A9CF..A9D9,
                                    U+A9DE..A9DF)
          \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (88:
                                    U+0966..096F, U+A830..A839,
                                    U+11080..110C2, U+110CD)
          \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
          \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (372)
          \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
                                    (105: U+0951..0952, U+0964..0965,
                                    U+0C80..0C8C, U+0C8E..0C90,
                                    U+0C92..0CA8, U+0CAA..0CB3 ...)
          \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
                                    (372: U+3001..3003, U+3008..3011,
                                    U+3013..301F, U+3030..3035, U+3037,
                                    U+303C..303D ...)
          \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
                                    (48: U+A900..A92F)
          \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
          \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
                                    (68: U+10A00..10A03, U+10A05..10A06,
                                    U+10A0C..10A13, U+10A15..10A17,
                                    U+10A19..10A35, U+10A38..10A3A ...)
          \p{Script_Extensions: Khitan_Small_Script} (Short: \p{Scx=Kits},
                                    \p{Kits}) (471: U+16FE4, U+18B00..18CD5)
          \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
                                    U+1780..17DD, U+17E0..17E9,
                                    U+17F0..17F9, U+19E0..19FF)
          \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
          \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (82)
          \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (82:
                                    U+0AE6..0AEF, U+A830..A839,
                                    U+11200..11211, U+11213..1123E)
          \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
                                    (81: U+0964..0965, U+A830..A839,
                                    U+112B0..112EA, U+112F0..112F9)
          \p{Script_Extensions: Kits} \p{Script_Extensions=
                                    Khitan_Small_Script} (471)
          \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (105)
          \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (88)
          \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
          \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (82:
                                    U+0E81..0E82, U+0E84, U+0E86..0E8A,
6023                                    U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
6024          \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (82)
          \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
                                    (1504: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
                                    \xf6\xf8-\xff], U+0100..02B8,
                                    U+02E0..02E4, U+0363..036F,
                                    U+0485..0486, U+0951..0952 ...)
          \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1504)
          \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
          \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
                                    U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
          \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
          \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
                                    U+0965, U+1900..191E, U+1920..192B,
                                    U+1930..193B, U+1940, U+1944..194F)
          \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
          \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
          \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
                                    (386: U+10107..10133, U+10600..10736,
                                    U+10740..10755, U+10760..10767)
          \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
                                    (268: U+10000..1000B, U+1000D..10026,
                                    U+10028..1003A, U+1003C..1003D,
                                    U+1003F..1004D, U+10050..1005D ...)
          \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (49:
                                    U+A4D0..A4FF, U+11FB0)
          \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
          \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
                                    U+10280..1029C)
          \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
          \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
                                    U+10920..10939, U+1093F)
          \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
                                    (61: U+0964..096F, U+A830..A839,
                                    U+11150..11176)
          \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
          \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
          \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
                                    (25: U+11EE0..11EF8)
          \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
                                    (126: U+0951..0952, U+0964..0965,
                                    U+0D00..0D0C, U+0D0E..0D10,
                                    U+0D12..0D44, U+0D46..0D48 ...)
          \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
          \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
                                    (30: U+0640, U+0840..085B, U+085E)
          \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
          \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
                                    (52: U+0640, U+10AC0..10AE6,
                                    U+10AEB..10AF6)
          \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
          \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
                                    (68: U+11C70..11C8F, U+11C92..11CA7,
                                    U+11CA9..11CB6)
          \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
                                    \p{Gonm}) (77: U+0964..0965,
                                    U+11D00..11D06, U+11D08..11D09,
                                    U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
                                    ...)
          \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
                                    (91: U+16E40..16E9A)
          \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
          \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
                                    \p{Mtei}) (79: U+AAE0..AAF6,
                                    U+ABC0..ABED, U+ABF0..ABF9)
          \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
                                    (213)
          \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
                                    \p{Mend}) (213: U+1E800..1E8C4,
                                    U+1E8C7..1E8D6)
          \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
                                    (90)
          \p{Script_Extensions: Mero} \p{Script_Extensions=
                                    Meroitic_Hieroglyphs} (32)
          \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
                                    \p{Merc}) (90: U+109A0..109B7,
                                    U+109BC..109CF, U+109D2..109FF)
          \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
                                    \p{Mero}) (32: U+10980..1099F)
          \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
                                    U+16F00..16F4A, U+16F4F..16F87,
                                    U+16F8F..16F9F)
          \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (126)
          \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
                                    U+A830..A839, U+11600..11644,
                                    U+11650..11659)
          \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (172)
          \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
                                    (172: U+1800..1819, U+1820..1878,
                                    U+1880..18AA, U+202F, U+11660..1166C)
          \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
                                    U+16A40..16A5E, U+16A60..16A69,
                                    U+16A6E..16A6F)
          \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
          \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
          \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
          \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
                                    (48: U+0A66..0A6F, U+11280..11286,
                                    U+11288, U+1128A..1128D, U+1128F..1129D,
                                    U+1129F..112A9)
          \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
                                    (224: U+1000..109F, U+A92E,
                                    U+A9E0..A9FE, U+AA60..AA7F)
          \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
          \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
                                    (40: U+10880..1089E, U+108A7..108AF)
          \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
          \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
                                    (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
                                    U+1CF2, U+1CFA, U+A830..A835 ...)
          \p{Script_Extensions: Narb} \p{Script_Extensions=
                                    Old_North_Arabian} (32)
          \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
          \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
                                    (83: U+1980..19AB, U+19B0..19C9,
                                    U+19D0..19DA, U+19DE..19DF)
          \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (97:
                                    U+11400..1145B, U+1145D..11461)
          \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (67:
                                    U+060C, U+061B, U+061F, U+07C0..07FA,
                                    U+07FD..07FF, U+FD3E..FD3F)
          \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (67)
          \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
          \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
                                    U+16FE1, U+1B170..1B2FB)
          \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
                                    Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
                                    U+1E130..1E13D, U+1E140..1E149,
                                    U+1E14E..1E14F)
          \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
          \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
                                    U+1680..169C)
          \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
                                    (48: U+1C50..1C7F)
          \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
          \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
                                    \p{Hung}) (108: U+10C80..10CB2,
                                    U+10CC0..10CF2, U+10CFA..10CFF)
          \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
                                    (39: U+10300..10323, U+1032D..1032F)
          \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
                                    \p{Narb}) (32: U+10A80..10A9F)
          \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
                                    (44: U+0483, U+10350..1037A)
          \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
                                    (50: U+103A0..103C3, U+103C8..103D5)
          \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
                                    (40: U+10F00..10F27)
          \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
                                    \p{Sarb}) (32: U+10A60..10A7F)
          \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
                                    (73: U+10C00..10C48)
          \p{Script_Extensions: Old_Uyghur} (Short: \p{Scx=Ougr}, \p{Ougr})
                                    (28: U+0640, U+10AF2, U+10F70..10F89)
          \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (97:
                                    U+0951..0952, U+0964..0965,
                                    U+0B01..0B03, U+0B05..0B0C,
                                    U+0B0F..0B10, U+0B13..0B28 ...)
          \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
          \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (97)
          \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
                                    U+104B0..104D3, U+104D8..104FB)
          \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
          \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
          \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
                                    (40: U+10480..1049D, U+104A0..104A9)
          \p{Script_Extensions: Ougr} \p{Script_Extensions=Old_Uyghur} (28)
          \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
                                    \p{Hmng}) (127: U+16B00..16B45,
                                    U+16B50..16B59, U+16B5B..16B61,
                                    U+16B63..16B77, U+16B7D..16B8F)
          \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
          \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
                                    (32: U+10860..1087F)
          \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
                                    (57: U+11AC0..11AF8)
          \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
          \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
          \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
          \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
                                    (59: U+1802..1803, U+1805, U+A840..A877)
          \p{Script_Extensions: Phli} \p{Script_Extensions=
                                    Inscriptional_Pahlavi} (27)
          \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
                                    (30)
          \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
          \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
                                    (29: U+10900..1091B, U+1091F)
          \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
          \p{Script_Extensions: Prti} \p{Script_Extensions=
                                    Inscriptional_Parthian} (30)
          \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
                                    \p{Phlp}) (30: U+0640, U+10B80..10B91,
                                    U+10B99..10B9C, U+10BA9..10BAF)
          \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
          \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (586)
          \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
                                    U+A930..A953, U+A95F)
          \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
          \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
                                    (55)
          \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
                                    U+16A0..16EA, U+16EE..16F8)
          \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
          \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
                                    (61: U+0800..082D, U+0830..083E)
          \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
          \p{Script_Extensions: Sarb} \p{Script_Extensions=
                                    Old_South_Arabian} (32)
          \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
          \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
                                    (82: U+A880..A8C5, U+A8CE..A8D9)
          \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
          \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
                                    (102: U+0951, U+1CD7, U+1CD9,
                                    U+1CDC..1CDD, U+1CE0, U+11180..111DF)
          \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
                                    (48: U+10450..1047F)
          \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
          \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (102)
          \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
          \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
                                    (92: U+11580..115B5, U+115B8..115DD)
          \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
                                    (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
                                    U+1DAA1..1DAAF)
          \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
          \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (113)
          \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
                                    (113: U+0964..0965, U+0D81..0D83,
                                    U+0D85..0D96, U+0D9A..0DB1,
                                    U+0DB3..0DBB, U+0DBD ...)
          \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
          \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
                                    (43: U+0640, U+10F30..10F59)
          \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
          \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
          \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
                                    \p{Sora}) (35: U+110D0..110E8,
                                    U+110F0..110F9)
          \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
          \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
                                    (83: U+11A50..11AA2)
          \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
          \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
                                    (72: U+1B80..1BBF, U+1CC0..1CC7)
          \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (57)
          \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
                                    \p{Sylo}) (57: U+0964..0965,
                                    U+09E6..09EF, U+A800..A82C)
          \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (107)
          \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
                                    (107: U+060C, U+061B..061C, U+061F,
                                    U+0640, U+064B..0655, U+0670 ...)
          \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
                                    (25: U+1700..1715, U+171F, U+1735..1736)
          \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
          \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
                                    (20: U+1735..1736, U+1760..176C,
                                    U+176E..1770, U+1772..1773)
          \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
                                    U+1040..1049, U+1950..196D, U+1970..1974)
          \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
                                    (127: U+1A20..1A5E, U+1A60..1A7C,
                                    U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
          \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
                                    (72: U+AA80..AAC2, U+AADB..AADF)
          \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (80)
          \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (80:
                                    U+0964..0965, U+A830..A839,
                                    U+11680..116B9, U+116C0..116C9)
          \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
          \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
          \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
                                    U+0951..0952, U+0964..0965,
                                    U+0B82..0B83, U+0B85..0B8A,
                                    U+0B8E..0B90, U+0B92..0B95 ...)
          \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
          \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6914)
          \p{Script_Extensions: Tangsa} (Short: \p{Scx=Tnsa}, \p{Tnsa}) (89:
                                    U+16A70..16ABE, U+16AC0..16AC9)
          \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
                                    (6914: U+16FE0, U+17000..187F7,
                                    U+18800..18AFF, U+18D00..18D08)
          \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
          \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (106)
          \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
                                    (106: U+0951..0952, U+0964..0965,
                                    U+0C00..0C0C, U+0C0E..0C10,
                                    U+0C12..0C28, U+0C2A..0C39 ...)
          \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
          \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (25)
          \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
          \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
                                    U+060C, U+061B..061C, U+061F,
                                    U+0660..0669, U+0780..07B1, U+FDF2 ...)
          \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
                                    U+0E01..0E3A, U+0E40..0E5B)
          \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
                                    (207: U+0F00..0F47, U+0F49..0F6C,
                                    U+0F71..0F97, U+0F99..0FBC,
                                    U+0FBE..0FCC, U+0FCE..0FD4 ...)
          \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
          \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
                                    (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
          \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
          \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
                                    (97: U+0951..0952, U+0964..0965, U+1CF2,
                                    U+A830..A839, U+11480..114C7,
                                    U+114D0..114D9)
          \p{Script_Extensions: Tnsa} \p{Script_Extensions=Tangsa} (89)
          \p{Script_Extensions: Toto} (Short: \p{Scx=Toto}, \p{Toto}) (31:
                                    U+1E290..1E2AE)
          \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
          \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
                                    (31: U+10380..1039D, U+1039F)
          \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
                                    (969_350 plus all above-Unicode code
                                    points: U+0378..0379, U+0380..0383,
                                    U+038B, U+038D, U+03A2, U+0530 ...)
          \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
                                    U+A500..A62B)
          \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
          \p{Script_Extensions: Vith} \p{Script_Extensions=Vithkuqi} (70)
          \p{Script_Extensions: Vithkuqi} (Short: \p{Scx=Vith}, \p{Vith})
                                    (70: U+10570..1057A, U+1057C..1058A,
                                    U+1058C..10592, U+10594..10595,
                                    U+10597..105A1, U+105A3..105B1 ...)
          \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59:
                                    U+1E2C0..1E2F9, U+1E2FF)
          \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
          \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
                                    (84: U+118A0..118F2, U+118FF)
          \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59)
          \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
          \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
          \p{Script_Extensions: Yezi} \p{Script_Extensions=Yezidi} (60)
          \p{Script_Extensions: Yezidi} (Short: \p{Scx=Yezi}, \p{Yezi}) (60:
                                    U+060C, U+061B, U+061F, U+0660..0669,
                                    U+10E80..10EA9, U+10EAB..10EAD ...)
          \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
                                    U+3001..3002, U+3008..3011,
                                    U+3014..301B, U+30FB, U+A000..A48C,
                                    U+A490..A4C6 ...)
          \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
          \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
                                    \p{Zanb}) (72: U+11A00..11A47)
          \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
                                    (72)
          \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (586)
          \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7824)
          \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (969_350
                                    plus all above-Unicode code points)
          \p{Scx: *}              \p{Script_Extensions: *}
          \p{SD}                  \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (47)
          \p{SD: *}               \p{Soft_Dotted: *}
          \p{Sentence_Break: AT}  \p{Sentence_Break=ATerm} (4)
          \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
                                    U+FE52, U+FF0E)
          \p{Sentence_Break: CL}  \p{Sentence_Break=Close} (195)
          \p{Sentence_Break: Close} (Short: \p{SB=CL}) (195: [\"\'\(\)\[\]
                                    \{\}\xab\xbb], U+0F3A..0F3D,
                                    U+169B..169C, U+2018..201F,
                                    U+2039..203A, U+2045..2046 ...)
          \p{Sentence_Break: CR}  (Short: \p{SB=CR}) (1: [\r])
          \p{Sentence_Break: EX}  \p{Sentence_Break=Extend} (2508)
          \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2508: U+0300..036F,
                                    U+0483..0489, U+0591..05BD, U+05BF,
                                    U+05C1..05C2, U+05C4..05C5 ...)
          \p{Sentence_Break: FO}  \p{Sentence_Break=Format} (65)
          \p{Sentence_Break: Format} (Short: \p{SB=FO}) (65: [\xad],
                                    U+0600..0605, U+061C, U+06DD, U+070F,
                                    U+0890..0891 ...)
          \p{Sentence_Break: LE}  \p{Sentence_Break=OLetter} (127_761)
          \p{Sentence_Break: LF}  (Short: \p{SB=LF}) (1: [\n])
          \p{Sentence_Break: LO}  \p{Sentence_Break=Lower} (2424)
          \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2424: [a-z\xaa\xb5
                                    \xba\xdf-\xf6\xf8-\xff], U+0101, U+0103,
                                    U+0105, U+0107, U+0109 ...)
          \p{Sentence_Break: NU}  \p{Sentence_Break=Numeric} (662)
          \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (662: [0-9],
                                    U+0660..0669, U+066B..066C,
                                    U+06F0..06F9, U+07C0..07C9, U+0966..096F
                                    ...)
          \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (127_761: U+01BB,
                                    U+01C0..01C3, U+0294, U+02B9..02BF,
                                    U+02C6..02D1, U+02EC ...)
          \p{Sentence_Break: Other} (Short: \p{SB=XX}) (978_357 plus all
                                    above-Unicode code points: [^\t\n\cK\f
                                    \r\x20!\"\'\(\),\-.0-9:?A-Z\[\]a-z\{\}
                                    \x85\xa0\xaa-\xab\xad\xb5\xba-\xbb\xc0-
                                    \xd6\xd8-\xf6\xf8-\xff], U+02C2..02C5,
                                    U+02D2..02DF, U+02E5..02EB, U+02ED,
                                    U+02EF..02FF ...)
          \p{Sentence_Break: SC}  \p{Sentence_Break=SContinue} (26)
          \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
                                    U+055D, U+060C..060D, U+07F8, U+1802,
                                    U+1808 ...)
          \p{Sentence_Break: SE}  \p{Sentence_Break=Sep} (3)
          \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
                                    U+2028..2029)
          \p{Sentence_Break: Sp}  (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
                                    U+1680, U+2000..200A, U+202F, U+205F,
                                    U+3000)
          \p{Sentence_Break: ST}  \p{Sentence_Break=STerm} (149)
          \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (149: [!?], U+0589,
                                    U+061D..061F, U+06D4, U+0700..0702,
                                    U+07F9 ...)
          \p{Sentence_Break: UP}  \p{Sentence_Break=Upper} (1936)
          \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1936: [A-Z\xc0-\xd6
                                    \xd8-\xde], U+0100, U+0102, U+0104,
                                    U+0106, U+0108 ...)
          \p{Sentence_Break: XX}  \p{Sentence_Break=Other} (978_357 plus all
                                    above-Unicode code points)
6437          \p{Sentence_Terminal}   \p{Sentence_Terminal=Y} (Short: \p{STerm})
6438                                    (152)
6439          \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
6440                                    (1_113_960 plus all above-Unicode code
6441                                    points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
6442                                    \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
6443                                    \xff], U+0100..0588, U+058A..061C,
6444                                    U+0620..06D3, U+06D5..06FF, U+0703..07F8
6445                                    ...)
6446          \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (152:
6447                                    [!.?], U+0589, U+061D..061F, U+06D4,
6448                                    U+0700..0702, U+07F9 ...)
          \p{Separator}           \p{General_Category=Separator} (Short:
                                    \p{Z}) (19)
          \p{Sgnw}                \p{SignWriting} (= \p{Script_Extensions=
                                    SignWriting}) (672)
          \p{Sharada}             \p{Script_Extensions=Sharada} (Short:
                                    \p{Shrd}; NOT \p{Block=Sharada}) (102)
          \p{Shavian}             \p{Script_Extensions=Shavian} (Short:
                                    \p{Shaw}) (48)
          \p{Shaw}                \p{Shavian} (= \p{Script_Extensions=
                                    Shavian}) (48)
        X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
                                    (16)
          \p{Shrd}                \p{Sharada} (= \p{Script_Extensions=
                                    Sharada}) (NOT \p{Block=Sharada}) (102)
          \p{Sidd}                \p{Siddham} (= \p{Script_Extensions=
                                    Siddham}) (NOT \p{Block=Siddham}) (92)
          \p{Siddham}             \p{Script_Extensions=Siddham} (Short:
                                    \p{Sidd}; NOT \p{Block=Siddham}) (92)
          \p{SignWriting}         \p{Script_Extensions=SignWriting} (Short:
                                    \p{Sgnw}) (672)
          \p{Sind}                \p{Khudawadi} (= \p{Script_Extensions=
                                    Khudawadi}) (NOT \p{Block=Khudawadi})
                                    (81)
          \p{Sinh}                \p{Sinhala} (= \p{Script_Extensions=
                                    Sinhala}) (NOT \p{Block=Sinhala}) (113)
          \p{Sinhala}             \p{Script_Extensions=Sinhala} (Short:
                                    \p{Sinh}; NOT \p{Block=Sinhala}) (113)
        X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
          \p{Sk}                  \p{Modifier_Symbol} (=
                                    \p{General_Category=Modifier_Symbol})
                                    (125)
          \p{Sm}                  \p{Math_Symbol} (= \p{General_Category=
                                    Math_Symbol}) (948)
        X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
                                    \p{InSmallForms}) (32)
        X \p{Small_Forms}         \p{Small_Form_Variants} (= \p{Block=
                                    Small_Form_Variants}) (32)
        X \p{Small_Kana_Ext}      \p{Small_Kana_Extension} (= \p{Block=
                                    Small_Kana_Extension}) (64)
        X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short:
                                    \p{InSmallKanaExt}) (64)
          \p{So}                  \p{Other_Symbol} (= \p{General_Category=
                                    Other_Symbol}) (6605)
          \p{Soft_Dotted}         \p{Soft_Dotted=Y} (Short: \p{SD}) (47)
          \p{Soft_Dotted: N*}     (Short: \p{SD=N}, \P{SD}) (1_114_065 plus
                                    all above-Unicode code points: [\x00-
                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                                    Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
                                    U+0100..012E, U+0130..0248,
                                    U+024A..0267, U+0269..029C, U+029E..02B1
                                    ...)
          \p{Soft_Dotted: Y*}     (Short: \p{SD=Y}, \p{SD}) (47: [i-j],
                                    U+012F, U+0249, U+0268, U+029D, U+02B2
                                    ...)
          \p{Sogd}                \p{Sogdian} (= \p{Script_Extensions=
                                    Sogdian}) (NOT \p{Block=Sogdian}) (43)
          \p{Sogdian}             \p{Script_Extensions=Sogdian} (Short:
                                    \p{Sogd}; NOT \p{Block=Sogdian}) (43)
          \p{Sogo}                \p{Old_Sogdian} (= \p{Script_Extensions=
                                    Old_Sogdian}) (NOT \p{Block=
                                    Old_Sogdian}) (40)
          \p{Sora}                \p{Sora_Sompeng} (= \p{Script_Extensions=
                                    Sora_Sompeng}) (NOT \p{Block=
                                    Sora_Sompeng}) (35)
          \p{Sora_Sompeng}        \p{Script_Extensions=Sora_Sompeng} (Short:
                                    \p{Sora}; NOT \p{Block=Sora_Sompeng})
                                    (35)
          \p{Soyo}                \p{Soyombo} (= \p{Script_Extensions=
                                    Soyombo}) (NOT \p{Block=Soyombo}) (83)
          \p{Soyombo}             \p{Script_Extensions=Soyombo} (Short:
                                    \p{Soyo}; NOT \p{Block=Soyombo}) (83)
          \p{Space}               \p{White_Space} (= \p{White_Space=Y}) (25)
          \p{Space: *}            \p{White_Space: *}
          \p{Space_Separator}     \p{General_Category=Space_Separator}
                                    (Short: \p{Zs}) (17)
          \p{SpacePerl}           \p{XPosixSpace} (25)
          \p{Spacing_Mark}        \p{General_Category=Spacing_Mark} (Short:
                                    \p{Mc}) (445)
        X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
                                    (Short: \p{InModifierLetters}) (80)
        X \p{Specials}            \p{Block=Specials} (16)
          \p{STerm}               \p{Sentence_Terminal} (=
                                    \p{Sentence_Terminal=Y}) (152)
          \p{STerm: *}            \p{Sentence_Terminal: *}
          \p{Sund}                \p{Sundanese} (= \p{Script_Extensions=
                                    Sundanese}) (NOT \p{Block=Sundanese})
                                    (72)
          \p{Sundanese}           \p{Script_Extensions=Sundanese} (Short:
                                    \p{Sund}; NOT \p{Block=Sundanese}) (72)
        X \p{Sundanese_Sup}       \p{Sundanese_Supplement} (= \p{Block=
                                    Sundanese_Supplement}) (16)
        X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
                                    \p{InSundaneseSup}) (16)
        X \p{Sup_Arrows_A}        \p{Supplemental_Arrows_A} (= \p{Block=
                                    Supplemental_Arrows_A}) (16)
        X \p{Sup_Arrows_B}        \p{Supplemental_Arrows_B} (= \p{Block=
                                    Supplemental_Arrows_B}) (128)
        X \p{Sup_Arrows_C}        \p{Supplemental_Arrows_C} (= \p{Block=
                                    Supplemental_Arrows_C}) (256)
        X \p{Sup_Math_Operators}  \p{Supplemental_Mathematical_Operators} (=
                                    \p{Block=
                                    Supplemental_Mathematical_Operators})
                                    (256)
        X \p{Sup_PUA_A}           \p{Supplementary_Private_Use_Area_A} (=
                                    \p{Block=
                                    Supplementary_Private_Use_Area_A})
                                    (65_536)
        X \p{Sup_PUA_B}           \p{Supplementary_Private_Use_Area_B} (=
                                    \p{Block=
                                    Supplementary_Private_Use_Area_B})
                                    (65_536)
        X \p{Sup_Punctuation}     \p{Supplemental_Punctuation} (= \p{Block=
                                    Supplemental_Punctuation}) (128)
        X \p{Sup_Symbols_And_Pictographs}
                                    \p{Supplemental_Symbols_And_Pictographs}
                                    (= \p{Block=
                                    Supplemental_Symbols_And_Pictographs})
                                    (256)
        X \p{Super_And_Sub}       \p{Superscripts_And_Subscripts} (=
                                    \p{Block=Superscripts_And_Subscripts})
                                    (48)
        X \p{Superscripts_And_Subscripts} \p{Block=
                                    Superscripts_And_Subscripts} (Short:
                                    \p{InSuperAndSub}) (48)
        X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
                                    \p{InSupArrowsA}) (16)
        X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
                                    \p{InSupArrowsB}) (128)
        X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
                                    \p{InSupArrowsC}) (256)
        X \p{Supplemental_Mathematical_Operators} \p{Block=
                                    Supplemental_Mathematical_Operators}
                                    (Short: \p{InSupMathOperators}) (256)
        X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
                                    (Short: \p{InSupPunctuation}) (128)
        X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
                                    Supplemental_Symbols_And_Pictographs}
                                    (Short: \p{InSupSymbolsAndPictographs})
                                    (256)
        X \p{Supplementary_Private_Use_Area_A} \p{Block=
                                    Supplementary_Private_Use_Area_A}
                                    (Short: \p{InSupPUAA}) (65_536)
        X \p{Supplementary_Private_Use_Area_B} \p{Block=
                                    Supplementary_Private_Use_Area_B}
                                    (Short: \p{InSupPUAB}) (65_536)
          \p{Surrogate}           \p{General_Category=Surrogate} (Short:
                                    \p{Cs}) (2048)
        X \p{Sutton_SignWriting}  \p{Block=Sutton_SignWriting} (688)
          \p{Sylo}                \p{Syloti_Nagri} (= \p{Script_Extensions=
                                    Syloti_Nagri}) (NOT \p{Block=
                                    Syloti_Nagri}) (57)
          \p{Syloti_Nagri}        \p{Script_Extensions=Syloti_Nagri} (Short:
                                    \p{Sylo}; NOT \p{Block=Syloti_Nagri})
                                    (57)
          \p{Symbol}              \p{General_Category=Symbol} (Short: \p{S})
                                    (7741)
        X \p{Symbols_And_Pictographs_Ext_A}
                                    \p{Symbols_And_Pictographs_Extended_A}
                                    (= \p{Block=
                                    Symbols_And_Pictographs_Extended_A})
                                    (144)
        X \p{Symbols_And_Pictographs_Extended_A} \p{Block=
                                    Symbols_And_Pictographs_Extended_A} (144)
        X \p{Symbols_For_Legacy_Computing} \p{Block=
                                    Symbols_For_Legacy_Computing} (256)
          \p{Syrc}                \p{Syriac} (= \p{Script_Extensions=
                                    Syriac}) (NOT \p{Block=Syriac}) (107)
          \p{Syriac}              \p{Script_Extensions=Syriac} (Short:
                                    \p{Syrc}; NOT \p{Block=Syriac}) (107)
        X \p{Syriac_Sup}          \p{Syriac_Supplement} (= \p{Block=
                                    Syriac_Supplement}) (16)
        X \p{Syriac_Supplement}   \p{Block=Syriac_Supplement} (Short:
                                    \p{InSyriacSup}) (16)
          \p{Tagalog}             \p{Script_Extensions=Tagalog} (Short:
                                    \p{Tglg}; NOT \p{Block=Tagalog}) (25)
          \p{Tagb}                \p{Tagbanwa} (= \p{Script_Extensions=
                                    Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
          \p{Tagbanwa}            \p{Script_Extensions=Tagbanwa} (Short:
                                    \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
        X \p{Tags}                \p{Block=Tags} (128)
          \p{Tai_Le}              \p{Script_Extensions=Tai_Le} (Short:
                                    \p{Tale}; NOT \p{Block=Tai_Le}) (45)
          \p{Tai_Tham}            \p{Script_Extensions=Tai_Tham} (Short:
                                    \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
          \p{Tai_Viet}            \p{Script_Extensions=Tai_Viet} (Short:
                                    \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
        X \p{Tai_Xuan_Jing}       \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
                                    Tai_Xuan_Jing_Symbols}) (96)
        X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
                                    \p{InTaiXuanJing}) (96)
          \p{Takr}                \p{Takri} (= \p{Script_Extensions=Takri})
                                    (NOT \p{Block=Takri}) (80)
          \p{Takri}               \p{Script_Extensions=Takri} (Short:
                                    \p{Takr}; NOT \p{Block=Takri}) (80)
          \p{Tale}                \p{Tai_Le} (= \p{Script_Extensions=
                                    Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
          \p{Talu}                \p{New_Tai_Lue} (= \p{Script_Extensions=
                                    New_Tai_Lue}) (NOT \p{Block=
                                    New_Tai_Lue}) (83)
          \p{Tamil}               \p{Script_Extensions=Tamil} (Short:
                                    \p{Taml}; NOT \p{Block=Tamil}) (133)
        X \p{Tamil_Sup}           \p{Tamil_Supplement} (= \p{Block=
                                    Tamil_Supplement}) (64)
        X \p{Tamil_Supplement}    \p{Block=Tamil_Supplement} (Short:
                                    \p{InTamilSup}) (64)
          \p{Taml}                \p{Tamil} (= \p{Script_Extensions=Tamil})
                                    (NOT \p{Block=Tamil}) (133)
          \p{Tang}                \p{Tangut} (= \p{Script_Extensions=
                                    Tangut}) (NOT \p{Block=Tangut}) (6914)
          \p{Tangsa}              \p{Script_Extensions=Tangsa} (Short:
                                    \p{Tnsa}; NOT \p{Block=Tangsa}) (89)
          \p{Tangut}              \p{Script_Extensions=Tangut} (Short:
                                    \p{Tang}; NOT \p{Block=Tangut}) (6914)
        X \p{Tangut_Components}   \p{Block=Tangut_Components} (768)
        X \p{Tangut_Sup}          \p{Tangut_Supplement} (= \p{Block=
                                    Tangut_Supplement}) (128)
        X \p{Tangut_Supplement}   \p{Block=Tangut_Supplement} (Short:
                                    \p{InTangutSup}) (128)
          \p{Tavt}                \p{Tai_Viet} (= \p{Script_Extensions=
                                    Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
          \p{Telu}                \p{Telugu} (= \p{Script_Extensions=
                                    Telugu}) (NOT \p{Block=Telugu}) (106)
          \p{Telugu}              \p{Script_Extensions=Telugu} (Short:
                                    \p{Telu}; NOT \p{Block=Telugu}) (106)
          \p{Term}                \p{Terminal_Punctuation} (=
                                    \p{Terminal_Punctuation=Y}) (276)
          \p{Term: *}             \p{Terminal_Punctuation: *}
          \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
                                    \p{Term}) (276)
          \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
                                    (1_113_836 plus all above-Unicode code
                                    points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
                                    9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
                                    U+0100..037D, U+037F..0386,
                                    U+0388..0588, U+058A..05C2, U+05C4..060B
                                    ...)
          \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (276:
                                    [!,.:;?], U+037E, U+0387, U+0589,
                                    U+05C3, U+060C ...)
          \p{Tfng}                \p{Tifinagh} (= \p{Script_Extensions=
                                    Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
          \p{Tglg}                \p{Tagalog} (= \p{Script_Extensions=
                                    Tagalog}) (NOT \p{Block=Tagalog}) (25)
          \p{Thaa}                \p{Thaana} (= \p{Script_Extensions=
                                    Thaana}) (NOT \p{Block=Thaana}) (66)
          \p{Thaana}              \p{Script_Extensions=Thaana} (Short:
                                    \p{Thaa}; NOT \p{Block=Thaana}) (66)
          \p{Thai}                \p{Script_Extensions=Thai} (NOT \p{Block=
                                    Thai}) (86)
          \p{Tibetan}             \p{Script_Extensions=Tibetan} (Short:
                                    \p{Tibt}; NOT \p{Block=Tibetan}) (207)
          \p{Tibt}                \p{Tibetan} (= \p{Script_Extensions=
                                    Tibetan}) (NOT \p{Block=Tibetan}) (207)
          \p{Tifinagh}            \p{Script_Extensions=Tifinagh} (Short:
                                    \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
          \p{Tirh}                \p{Tirhuta} (= \p{Script_Extensions=
                                    Tirhuta}) (NOT \p{Block=Tirhuta}) (97)
          \p{Tirhuta}             \p{Script_Extensions=Tirhuta} (Short:
                                    \p{Tirh}; NOT \p{Block=Tirhuta}) (97)
          \p{Title}               \p{Titlecase} (/i= Cased=Yes) (31)
          \p{Titlecase}           (= \p{Gc=Lt}) (Short: \p{Title}; /i=
                                    Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
                                    U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
          \p{Titlecase_Letter}    \p{General_Category=Titlecase_Letter}
                                    (Short: \p{Lt}; /i= General_Category=
                                    Cased_Letter) (31)
          \p{Tnsa}                \p{Tangsa} (= \p{Script_Extensions=
                                    Tangsa}) (NOT \p{Block=Tangsa}) (89)
          \p{Toto}                \p{Script_Extensions=Toto} (NOT \p{Block=
                                    Toto}) (31)
        X \p{Transport_And_Map}   \p{Transport_And_Map_Symbols} (= \p{Block=
                                    Transport_And_Map_Symbols}) (128)
        X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
                                    (Short: \p{InTransportAndMap}) (128)
        X \p{UCAS}                \p{Unified_Canadian_Aboriginal_Syllabics}
                                    (= \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics})
                                    (640)
        X \p{UCAS_Ext}            \p{Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended} (= \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended}) (80)
        X \p{UCAS_Ext_A}          \p{Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended_A} (= \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended_A}) (16)
          \p{Ugar}                \p{Ugaritic} (= \p{Script_Extensions=
                                    Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
          \p{Ugaritic}            \p{Script_Extensions=Ugaritic} (Short:
                                    \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
          \p{UIdeo}               \p{Unified_Ideograph} (=
                                    \p{Unified_Ideograph=Y}) (92_865)
          \p{UIdeo: *}            \p{Unified_Ideograph: *}
          \p{Unassigned}          \p{General_Category=Unassigned} (Short:
                                    \p{Cn}) (829_834 plus all above-Unicode
                                    code points)
          \p{Unicode}             \p{Any} (1_114_112)
        X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics}
                                    (Short: \p{InUCAS}) (640)
        X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended} (Short: \p{InUCASExt}) (80)
        X \p{Unified_Canadian_Aboriginal_Syllabics_Extended_A} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics_-
                                    Extended_A} (Short: \p{InUCASExtA}) (16)
          \p{Unified_Ideograph}   \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
                                    (92_865)
          \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
                                    (1_021_247 plus all above-Unicode code
                                    points: U+0000..33FF, U+4DC0..4DFF,
                                    U+A000..FA0D, U+FA10, U+FA12,
                                    U+FA15..FA1E ...)
          \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (92_865:
                                    U+3400..4DBF, U+4E00..9FFF,
                                    U+FA0E..FA0F, U+FA11, U+FA13..FA14,
                                    U+FA1F ...)
          \p{Unknown}             \p{Script_Extensions=Unknown} (Short:
                                    \p{Zzzz}) (969_350 plus all above-
                                    Unicode code points)
          \p{Upper}               \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
                                    Cased=Yes) (1951)
          \p{Upper: *}            \p{Uppercase: *}
          \p{Uppercase}           \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
                                    Cased=Yes) (1951)
          \p{Uppercase: N*}       (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
                                    No) (1_112_161 plus all above-Unicode
                                    code points: [\x00-\x20!\"#\$\%&\'
                                    \(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-z\{
                                    \|\}~\x7f-\xbf\xd7\xdf-\xff], U+0101,
                                    U+0103, U+0105, U+0107, U+0109 ...)
          \p{Uppercase: Y*}       (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
                                    Yes) (1951: [A-Z\xc0-\xd6\xd8-\xde],
                                    U+0100, U+0102, U+0104, U+0106, U+0108
                                    ...)
          \p{Uppercase_Letter}    \p{General_Category=Uppercase_Letter}
                                    (Short: \p{Lu}; /i= General_Category=
                                    Cased_Letter) (1831)
          \p{Vai}                 \p{Script_Extensions=Vai} (NOT \p{Block=
                                    Vai}) (300)
          \p{Vaii}                \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
                                    \p{Block=Vai}) (300)
          \p{Variation_Selector}  \p{Variation_Selector=Y} (Short: \p{VS};
                                    NOT \p{Variation_Selectors}) (260)
          \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_852
                                    plus all above-Unicode code points:
                                    U+0000..180A, U+180E, U+1810..FDFF,
                                    U+FE10..E00FF, U+E01F0..infinity)
          \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (260:
                                    U+180B..180D, U+180F, U+FE00..FE0F,
                                    U+E0100..E01EF)
        X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
                                    \p{InVS}) (16)
        X \p{Variation_Selectors_Supplement} \p{Block=
                                    Variation_Selectors_Supplement} (Short:
                                    \p{InVSSup}) (240)
        X \p{Vedic_Ext}           \p{Vedic_Extensions} (= \p{Block=
                                    Vedic_Extensions}) (48)
        X \p{Vedic_Extensions}    \p{Block=Vedic_Extensions} (Short:
                                    \p{InVedicExt}) (48)
        X \p{Vertical_Forms}      \p{Block=Vertical_Forms} (16)
          \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
                                    (786_641 plus all above-Unicode code
                                    points)
          \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (786_641 plus
                                    all above-Unicode code points: [\x00-
                                    \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
                                    U+02EC..10FF, U+1200..1400,
                                    U+1680..18AF, U+1900..2015 ...)
          \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
                                    Transformed_Rotated} (47)
          \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
                                    (47: U+2329..232A, U+3008..3011,
                                    U+3014..301F, U+3030, U+30A0, U+30FC ...)
          \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
                                    (148: U+3001..3002, U+3041, U+3043,
                                    U+3045, U+3047, U+3049 ...)
          \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
                                    Transformed_Upright} (148)
          \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
                                    (327_276)
          \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (327_276:
                                    [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
                                    U+02EA..02EB, U+1100..11FF,
                                    U+1401..167F, U+18B0..18FF, U+2016 ...)
          \p{VertSpace}           \v (7: [\n\cK\f\r\x85], U+2028..2029)
          \p{Vith}                \p{Vithkuqi} (= \p{Script_Extensions=
                                    Vithkuqi}) (NOT \p{Block=Vithkuqi}) (70)
          \p{Vithkuqi}            \p{Script_Extensions=Vithkuqi} (Short:
                                    \p{Vith}; NOT \p{Block=Vithkuqi}) (70)
          \p{Vo: *}               \p{Vertical_Orientation: *}
          \p{VS}                  \p{Variation_Selector} (=
                                    \p{Variation_Selector=Y}) (NOT
                                    \p{Variation_Selectors}) (260)
          \p{VS: *}               \p{Variation_Selector: *}
        X \p{VS_Sup}              \p{Variation_Selectors_Supplement} (=
                                    \p{Block=
                                    Variation_Selectors_Supplement}) (240)
          \p{Wancho}              \p{Script_Extensions=Wancho} (Short:
                                    \p{Wcho}; NOT \p{Block=Wancho}) (59)
          \p{Wara}                \p{Warang_Citi} (= \p{Script_Extensions=
                                    Warang_Citi}) (NOT \p{Block=
                                    Warang_Citi}) (84)
          \p{Warang_Citi}         \p{Script_Extensions=Warang_Citi} (Short:
                                    \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
          \p{WB: *}               \p{Word_Break: *}
          \p{Wcho}                \p{Wancho} (= \p{Script_Extensions=
                                    Wancho}) (NOT \p{Block=Wancho}) (59)
          \p{White_Space}         \p{White_Space=Y} (Short: \p{Space}) (25)
          \p{White_Space: N*}     (Short: \p{Space=N}, \P{Space}) (1_114_087
                                    plus all above-Unicode code points: [^
                                    \t\n\cK\f\r\x20\x85\xa0], U+0100..167F,
                                    U+1681..1FFF, U+200B..2027,
                                    U+202A..202E, U+2030..205E ...)
          \p{White_Space: Y*}     (Short: \p{Space=Y}, \p{Space}) (25: [\t
                                    \n\cK\f\r\x20\x85\xa0], U+1680,
                                    U+2000..200A, U+2028..2029, U+202F,
                                    U+205F ...)
          \p{Word}                \p{XPosixWord} (135_202)
          \p{Word_Break: ALetter} (Short: \p{WB=LE}) (29_336: [A-Za-z\xaa
                                    \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
                                    U+0100..02D7, U+02DE..02FF,
                                    U+0370..0374, U+0376..0377, U+037A..037D
                                    ...)
          \p{Word_Break: CR}      (Short: \p{WB=CR}) (1: [\r])
          \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
          \p{Word_Break: DQ}      \p{Word_Break=Double_Quote} (1)
          \p{Word_Break: E_Base}  (Short: \p{WB=EB}) (0)
          \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0)
          \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0)
          \p{Word_Break: EB}      \p{Word_Break=E_Base} (0)
          \p{Word_Break: EBG}     \p{Word_Break=E_Base_GAZ} (0)
          \p{Word_Break: EM}      \p{Word_Break=E_Modifier} (0)
          \p{Word_Break: EX}      \p{Word_Break=ExtendNumLet} (11)
          \p{Word_Break: Extend}  (Short: \p{WB=Extend}) (2512:
                                    U+0300..036F, U+0483..0489,
                                    U+0591..05BD, U+05BF, U+05C1..05C2,
                                    U+05C4..05C5 ...)
          \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
                                    U+203F..2040, U+2054, U+FE33..FE34,
                                    U+FE4D..FE4F ...)
          \p{Word_Break: FO}      \p{Word_Break=Format} (64)
          \p{Word_Break: Format}  (Short: \p{WB=FO}) (64: [\xad],
                                    U+0600..0605, U+061C, U+06DD, U+070F,
                                    U+0890..0891 ...)
          \p{Word_Break: GAZ}     \p{Word_Break=Glue_After_Zwj} (0)
          \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0)
          \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75:
                                    U+05D0..05EA, U+05EF..05F2, U+FB1D,
                                    U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
                                    ...)
          \p{Word_Break: HL}      \p{Word_Break=Hebrew_Letter} (75)
          \p{Word_Break: KA}      \p{Word_Break=Katakana} (330)
          \p{Word_Break: Katakana} (Short: \p{WB=KA}) (330: U+3031..3035,
                                    U+309B..309C, U+30A0..30FA,
                                    U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
                                    ...)
          \p{Word_Break: LE}      \p{Word_Break=ALetter} (29_336)
          \p{Word_Break: LF}      (Short: \p{WB=LF}) (1: [\n])
          \p{Word_Break: MB}      \p{Word_Break=MidNumLet} (7)
          \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9: [:\xb7], U+0387,
6905                                    U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
6906                                    ...)
6907          \p{Word_Break: LE}      \p{Word_Break=ALetter} (29_336)
6908          \p{Word_Break: LF}      (Short: \p{WB=LF}) (1: [\n])
6909          \p{Word_Break: MB}      \p{Word_Break=MidNumLet} (7)
6910          \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9: [:\xb7], U+0387,
6911                                    U+055F, U+05F4, U+2027, U+FE13 ...)
6912          \p{Word_Break: MidNum}  (Short: \p{WB=MN}) (15: [,;], U+037E,
6913                                    U+0589, U+060C..060D, U+066C, U+07F8 ...)
6914          \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
6915                                    U+2018..2019, U+2024, U+FE52, U+FF07,
6916                                    U+FF0E)
6917          \p{Word_Break: ML}      \p{Word_Break=MidLetter} (9)
6918          \p{Word_Break: MN}      \p{Word_Break=MidNum} (15)
6919          \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
6920                                    U+2028..2029)
6921          \p{Word_Break: NL}      \p{Word_Break=Newline} (5)
6922          \p{Word_Break: NU}      \p{Word_Break=Numeric} (661)
6923          \p{Word_Break: Numeric} (Short: \p{WB=NU}) (661: [0-9],
6924                                    U+0660..0669, U+066B, U+06F0..06F9,
6925                                    U+07C0..07C9, U+0966..096F ...)
6926          \p{Word_Break: Other}   (Short: \p{WB=XX}) (1_081_042 plus all
6927                                    above-Unicode code points: [^\n\cK\f\r
6928                                    \x20\"\',.0-9:;A-Z_a-z\x85\xaa\xad\xb5
6929                                    \xb7\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6930                                    U+02D8..02DD, U+0375, U+0378..0379,
6931                                    U+0380..0385, U+038B ...)
6932          \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
6933                                    U+1F1E6..1F1FF)
6934          \p{Word_Break: RI}      \p{Word_Break=Regional_Indicator} (26)
6935          \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
6936          \p{Word_Break: SQ}      \p{Word_Break=Single_Quote} (1)
6937          \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20],
6938                                    U+1680, U+2000..2006, U+2008..200A,
6939                                    U+205F, U+3000)
6940          \p{Word_Break: XX}      \p{Word_Break=Other} (1_081_042 plus all
6941                                    above-Unicode code points)
6942          \p{Word_Break: ZWJ}     (Short: \p{WB=ZWJ}) (1: U+200D)
6943          \p{WSpace}              \p{White_Space} (= \p{White_Space=Y}) (25)
6944          \p{WSpace: *}           \p{White_Space: *}
6945          \p{XDigit}              \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
6946          \p{XID_Continue}        \p{XID_Continue=Y} (Short: \p{XIDC})
6947                                    (135_053)
6948          \p{XID_Continue: N*}    (Short: \p{XIDC=N}, \P{XIDC}) (979_059
6949                                    plus all above-Unicode code points:
6950                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
6951                                    \@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4
6952                                    \xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
6953                                    U+02C2..02C5, U+02D2..02DF,
6954                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
6955          \p{XID_Continue: Y*}    (Short: \p{XIDC=Y}, \p{XIDC}) (135_053:
6956                                    [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
6957                                    \xd8-\xf6\xf8-\xff], U+0100..02C1,
6958                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
6959                                    U+02EE ...)
6960          \p{XID_Start}           \p{XID_Start=Y} (Short: \p{XIDS}) (131_974)
6961          \p{XID_Start: N*}       (Short: \p{XIDS=N}, \P{XIDS}) (982_138
6962                                    plus all above-Unicode code points:
6963                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6964                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
6965                                    \xb6-\xb9\xbb-\xbf\xd7\xf7],
6966                                    U+02C2..02C5, U+02D2..02DF,
6967                                    U+02E5..02EB, U+02ED, U+02EF..036F ...)
6968          \p{XID_Start: Y*}       (Short: \p{XIDS=Y}, \p{XIDS}) (131_974:
6969                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
6970                                    \xf8-\xff], U+0100..02C1, U+02C6..02D1,
6971                                    U+02E0..02E4, U+02EC, U+02EE ...)
6972          \p{XIDC}                \p{XID_Continue} (= \p{XID_Continue=Y})
6973                                    (135_053)
6974          \p{XIDC: *}             \p{XID_Continue: *}
6975          \p{XIDS}                \p{XID_Start} (= \p{XID_Start=Y}) (131_974)
6976          \p{XIDS: *}             \p{XID_Start: *}
6977          \p{Xpeo}                \p{Old_Persian} (= \p{Script_Extensions=
6978                                    Old_Persian}) (NOT \p{Block=
6979                                    Old_Persian}) (50)
6980          \p{XPerlSpace}          \p{XPosixSpace} (25)
6981          \p{XPosixAlnum}         Alphabetic and (decimal) Numeric (Short:
6982                                    \p{Alnum}) (134_056: [0-9A-Za-z\xaa\xb5
6983                                    \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6984                                    U+0100..02C1, U+02C6..02D1,
6985                                    U+02E0..02E4, U+02EC, U+02EE ...)
6986          \p{XPosixAlpha}         \p{Alphabetic=Y} (Short: \p{Alpha})
6987                                    (133_396)
6988          \p{XPosixBlank}         \h, Horizontal white space (Short:
6989                                    \p{Blank}) (18: [\t\x20\xa0], U+1680,
6990                                    U+2000..200A, U+202F, U+205F, U+3000)
6991          \p{XPosixCntrl}         \p{General_Category=Control} Control
6992                                    characters (Short: \p{Cc}) (65)
6993          \p{XPosixDigit}         \p{General_Category=Decimal_Number} [0-9]
6994                                    + all other decimal digits (Short:
6995                                    \p{Nd}) (660)
6996          \p{XPosixGraph}         Characters that are graphical (Short:
6997                                    \p{Graph}) (282_146: [!\"#\$\%&\'
6998                                    \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`a-z
6999                                    \{\|\}~\xa1-\xff], U+0100..0377,
7000                                    U+037A..037F, U+0384..038A, U+038C,
7001                                    U+038E..03A1 ...)
7002          \p{XPosixLower}         \p{Lowercase=Y} (Short: \p{Lower}; /i=
7003                                    Cased=Yes) (2471)
7004          \p{XPosixPrint}         Characters that are graphical plus space
7005                                    characters (but no controls) (Short:
7006                                    \p{Print}) (282_163: [\x20-\x7e\xa0-
7007                                    \xff], U+0100..0377, U+037A..037F,
7008                                    U+0384..038A, U+038C, U+038E..03A1 ...)
7009          \p{XPosixPunct}         \p{Punct} + ASCII-range \p{Symbol} (828:
7010                                    [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
7011                                    \^_`\{\|\}~\xa1\xa7\xab\xb6-\xb7\xbb
7012                                    \xbf], U+037E, U+0387, U+055A..055F,
7013                                    U+0589..058A, U+05BE ...)
7014          \p{XPosixSpace}         \s including beyond ASCII and vertical tab
7015                                    (Short: \p{SpacePerl}) (25: [\t\n\cK\f
7016                                    \r\x20\x85\xa0], U+1680, U+2000..200A,
7017                                    U+2028..2029, U+202F, U+205F ...)
7018          \p{XPosixUpper}         \p{Uppercase=Y} (Short: \p{Upper}; /i=
7019                                    Cased=Yes) (1951)
7020          \p{XPosixWord}          \w, including beyond ASCII; = \p{Alnum} +
7021                                    \pM + \p{Pc} + \p{Join_Control} (Short:
7022                                    \p{Word}) (135_202: [0-9A-Z_a-z\xaa\xb5
7023                                    \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7024                                    U+0100..02C1, U+02C6..02D1,
7025                                    U+02E0..02E4, U+02EC, U+02EE ...)
7026          \p{XPosixXDigit}        \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
7027          \p{Xsux}                \p{Cuneiform} (= \p{Script_Extensions=
7028                                    Cuneiform}) (NOT \p{Block=Cuneiform})
7029                                    (1234)
7030          \p{Yezi}                \p{Yezidi} (= \p{Script_Extensions=
7031                                    Yezidi}) (NOT \p{Block=Yezidi}) (60)
7032          \p{Yezidi}              \p{Script_Extensions=Yezidi} (Short:
7033                                    \p{Yezi}; NOT \p{Block=Yezidi}) (60)
7034          \p{Yi}                  \p{Script_Extensions=Yi} (1246)
7035        X \p{Yi_Radicals}         \p{Block=Yi_Radicals} (64)
7036        X \p{Yi_Syllables}        \p{Block=Yi_Syllables} (1168)
7037          \p{Yiii}                \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
7038        X \p{Yijing}              \p{Yijing_Hexagram_Symbols} (= \p{Block=
7039                                    Yijing_Hexagram_Symbols}) (64)
7040        X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
7041                                    (Short: \p{InYijing}) (64)
7042          \p{Z} \pZ               \p{Separator} (= \p{General_Category=
7043                                    Separator}) (19)
7044          \p{Zanabazar_Square}    \p{Script_Extensions=Zanabazar_Square}
7045                                    (Short: \p{Zanb}; NOT \p{Block=
7046                                    Zanabazar_Square}) (72)
7047          \p{Zanb}                \p{Zanabazar_Square} (=
7048                                    \p{Script_Extensions=Zanabazar_Square})
7049                                    (NOT \p{Block=Zanabazar_Square}) (72)
7050          \p{Zinh}                \p{Inherited} (= \p{Script_Extensions=
7051                                    Inherited}) (586)
7052          \p{Zl}                  \p{Line_Separator} (= \p{General_Category=
7053                                    Line_Separator}) (1)
7054        X \p{Znamenny_Music}      \p{Znamenny_Musical_Notation} (= \p{Block=
7055                                    Znamenny_Musical_Notation}) (208)
7056        X \p{Znamenny_Musical_Notation} \p{Block=Znamenny_Musical_Notation}
7057                                    (Short: \p{InZnamennyMusic}) (208)
7058          \p{Zp}                  \p{Paragraph_Separator} (=
7059                                    \p{General_Category=
7060                                    Paragraph_Separator}) (1)
7061          \p{Zs}                  \p{Space_Separator} (=
7062                                    \p{General_Category=Space_Separator})
7063                                    (17)
7064          \p{Zyyy}                \p{Common} (= \p{Script_Extensions=
7065                                    Common}) (7824)
7066          \p{Zzzz}                \p{Unknown} (= \p{Script_Extensions=
7067                                    Unknown}) (969_350 plus all above-
7068                                    Unicode code points)
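
       As an illustration of how entries in the table above are used in a
       pattern, the following is a minimal sketch rather than a recommended
       implementation.  The helper name is_xid_identifier() is hypothetical
       and not part of Perl; only the \p{XIDS}, \p{XIDC}, and \p{Space}
       forms come from the table.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Perl): true if $name starts with an
# XID_Start character and continues with XID_Continue characters only.
sub is_xid_identifier {
    my ($name) = @_;
    return $name =~ /\A\p{XIDS}\p{XIDC}*\z/ ? 1 : 0;
}

# \p{Space} is the short form of \p{White_Space}; U+2028 (LINE SEPARATOR)
# is one of the code points it matches.
my $has_space = "a\x{2028}b" =~ /\p{Space}/ ? 1 : 0;

for my $case (["caf\\x{e9}", "caf\x{e9}"],
              ["9lives",     "9lives"],
              ["_private",   "_private"]) {
    my ($label, $name) = @$case;
    printf "%-10s %s\n", $label,
        is_xid_identifier($name) ? "identifier" : "not an identifier";
}
print "U+2028 is white space: $has_space\n";
```

       Note that "_" is listed under XID_Continue but not under XID_Start,
       so a name with a leading underscore is rejected by this sketch.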
7069
7070   Legal "\p{}" and "\P{}" constructs that match no characters
7071       Unicode has some property-value pairs that currently don't match
7072       anything.  This generally happens either because they are obsolete, or
7073       because they exist for symmetry with other forms but no language that
7074       uses them has yet been encoded.  In this version of Unicode, the
7075       following match zero code points:
7076
7077       \p{Canonical_Combining_Class=Attached_Below_Left}
7078       \p{Canonical_Combining_Class=CCC133}
7079       \p{Grapheme_Cluster_Break=E_Base}
7080       \p{Grapheme_Cluster_Break=E_Base_GAZ}
7081       \p{Grapheme_Cluster_Break=E_Modifier}
7082       \p{Grapheme_Cluster_Break=Glue_After_Zwj}
7083       \p{Word_Break=E_Base}
7084       \p{Word_Break=E_Base_GAZ}
7085       \p{Word_Break=E_Modifier}
7086       \p{Word_Break=Glue_After_Zwj}
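
       One way to check this on a given installation is sketched below,
       assuming "prop_invlist()" from Unicode::UCD is available.  The helper
       name matches_nothing() is hypothetical.  Because "prop_invlist()"
       also returns nothing for a pair that the running Perl does not know,
       an empty result here means "empty or unknown", and the answers depend
       on the Unicode version Perl was built with.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Hypothetical helper: true if the property-value pair yields an empty
# inversion list, i.e. it matches no code points (or is not known to
# this Perl, which prop_invlist() reports the same way in list context).
sub matches_nothing {
    my ($pair) = @_;
    my @invlist = prop_invlist($pair);
    return @invlist ? 0 : 1;
}

for my $pair ("Word_Break=E_Base",
              "Word_Break=Glue_After_Zwj",
              "White_Space") {
    printf "%-28s %s\n", $pair,
        matches_nothing($pair) ? "no code points" : "at least one code point";
}
```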
7087

Properties accessible through Unicode::UCD

7089       The value of any Unicode (not including Perl extensions) character
7090       property mentioned above for any single code point is available through
7091       "charprop()" in Unicode::UCD.  "charprops_all()" in Unicode::UCD
7092       returns the values of all the Unicode properties for a given code
7093       point.
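
       A minimal sketch of both calls follows, assuming a Unicode::UCD
       recent enough to export "charprop()" and "charprops_all()".  The
       choice of code point and of which properties to print is only for
       illustration.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charprop charprops_all);

my $cp = ord("A");    # U+0041 LATIN CAPITAL LETTER A

# One property for one code point.
my $script = charprop($cp, "Script");
print "Script of U+0041: $script\n";

# All Unicode properties for the same code point, as a hash reference
# keyed by property name.
my $all = charprops_all($cp);
printf "charprops_all() returned %d properties\n", scalar keys %$all;

# Print a few of them; the exists() guard skips any name that this
# installation does not report.
for my $name (grep { exists $all->{$_} } qw(General_Category Script Block)) {
    printf "%-18s %s\n", $name, $all->{$name} // '';
}
```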
7094
7095       Besides these, all the Unicode character properties mentioned above
7096       (except for those marked as for internal use by Perl) are also
7097       accessible by "prop_invlist()" in Unicode::UCD.
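
       "prop_invlist()" returns an inversion list: the elements at even
       indexes start a range that matches, and the elements at odd indexes
       are the first code point after that range.  The sketch below, whose
       helper name invlist_to_ranges() is hypothetical, converts such a list
       into the "U+XXXX..YYYY" notation used in the tables of this document.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Hypothetical helper: turn an inversion list into [start, end] pairs.
# An odd-length list ends with an open-ended range, whose end is undef.
sub invlist_to_ranges {
    my @invlist = @_;
    my @ranges;
    for (my $i = 0; $i < @invlist; $i += 2) {
        my $end = $i + 1 < @invlist ? $invlist[$i + 1] - 1 : undef;
        push @ranges, [ $invlist[$i], $end ];
    }
    return @ranges;
}

my @ranges = invlist_to_ranges(prop_invlist("White_Space"));
for my $range (@ranges) {
    my ($start, $end) = @$range;
    if    (!defined $end)  { printf "U+%04X..infinity\n", $start }
    elsif ($start == $end) { printf "U+%04X\n", $start }
    else                   { printf "U+%04X..%04X\n", $start, $end }
}
```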
7098
7099       Due to their nature, not all Unicode character properties are suitable
7100       for regular expression matches or for "prop_invlist()".  The remaining
7101       non-provisional, non-internal ones are accessible via "prop_invmap()"
7102       in Unicode::UCD (except for those that this Perl installation hasn't
7103       included; see below for which those are).
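
       For a property whose values are not simply yes or no, a lookup with
       "prop_invmap()" might look like the following sketch.  It assumes
       that "search_invlist()" is also imported from Unicode::UCD; the
       helper name script_of() is hypothetical.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invmap search_invlist);

# prop_invmap() returns two parallel array references, a format code,
# and a default value.  $ranges->[$i] is the first code point of range
# $i; $values->[$i] is the property value for that whole range.
my ($ranges, $values, $format, $default) = prop_invmap("Script");

# Hypothetical helper: look up the Script value of a single code point.
sub script_of {
    my ($cp) = @_;
    my $i = search_invlist($ranges, $cp);
    return defined $i ? $values->[$i] : $default;
}

printf "U+%04X => %s\n", $_, script_of($_) for 0x41, 0x3B1, 0x5D0;
```

       "search_invlist()" does a binary search, so a single lookup stays
       cheap even though the Script map has many ranges.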
7104
7105       For compatibility with other parts of Perl, all the single forms given
7106       in the table in the section above are recognized.  But there are some
7107       ambiguities between some Perl extensions and the Unicode properties,
7108       all of which are silently resolved in favor of the official Unicode
7109       property.  To avoid surprises, you should only use "prop_invmap()" for
7110       forms listed in the table below, which omits the non-recommended ones.
7111       The affected forms are the Perl single-form equivalents of Unicode
7112       properties, such as "\p{sc}" being a single-form equivalent of
7113       "\p{gc=sc}", which is treated by "prop_invmap()" as the "Script"
7114       property, whose short name is "sc".  The table indicates the current
7115       ambiguities in the INFO column, beginning with the word "NOT".
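
       The "sc" ambiguity can be observed with a short sketch like the one
       below.  It assumes that "prop_aliases()" and "search_invlist()" are
       imported from Unicode::UCD alongside "prop_invmap()"; the variable
       names are only illustrative.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_aliases prop_invmap search_invlist);

my $dollar = ord('$');    # U+0024 DOLLAR SIGN

# In a regular expression, the single form \p{Sc} means
# General_Category=Currency_Symbol, so it matches a dollar sign.
my $regex_sees_currency = chr($dollar) =~ /\p{Sc}/ ? 1 : 0;

# Unicode::UCD treats the same name as a property name instead:
# "sc" is the short name of the Script property.
my ($short, $full) = prop_aliases("sc");

# Accordingly, prop_invmap("sc") maps code points to script names,
# not to General_Category values.
my ($ranges, $values) = prop_invmap("sc");
my $value_for_dollar = $values->[ search_invlist($ranges, $dollar) ];

print "regex \\p{Sc} matches U+0024: $regex_sees_currency\n";
print "prop_aliases('sc'): short '$short', full '$full'\n";
print "prop_invmap('sc') value for U+0024: $value_for_dollar\n";
```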
7116
7117       The standard Unicode properties listed below are documented in
7118       <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
7119       documented in "prop_invmap()" in Unicode::UCD.  The other Perl
7120       extensions are documented in "Other Properties" in perlunicode.
7121
7122       The first column in the table is a name for the property; the second
7123       column is an alternative name, if any, plus possibly some annotations.
7124       The alternative name is the property's full name, unless that would
7125       simply repeat the first column, in which case the second column
7126       indicates the property's short name (if different).  The annotations
7127       are given only in the entry for the full name.  The annotations for
7128       binary properties include a list of the first few ranges that the
7129       property matches.  To avoid any ambiguity, the SPACE character is
7130       represented as "\x20".
7131
7132       If a property is obsolete, etc., the entry will be flagged with the
7133       same characters used in the table in the section above, like D or S.
7134
7135          NAME                      INFO
7136
7137          Age
7138          AHex                    ASCII_Hex_Digit
7139          All                     (Perl extension).  All code points,
7140                                  including those above Unicode.  Same as
7141                                  qr/./s.  U+0000..infinity
7142          Alnum                   XPosixAlnum.  (Perl extension)
7143          Alpha                   Alphabetic
7144          Alphabetic              (Short: Alpha).  [A-Za-z\xaa\xb5\xba\xc0-
7145                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7146                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7147                                  ...
7148          Any                     (Perl extension).  All Unicode code
7149                                  points.  U+0000..10FFFF
7150          ASCII                   Block=Basic_Latin.  (Perl extension).
7151                                  [\x00-\x7f]
7152          ASCII_Hex_Digit         (Short: AHex).  [0-9A-Fa-f]
7153          Assigned                (Perl extension).  All assigned code
7154                                  points.  U+0000..0377, U+037A..037F,
7155                                  U+0384..038A, U+038C, U+038E..03A1,
7156                                  U+03A3..052F ...
7157          Bc                      Bidi_Class
7158          Bidi_C                  Bidi_Control
7159          Bidi_Class              (Short: bc)
7160          Bidi_Control            (Short: Bidi_C).  U+061C, U+200E..200F,
7161                                  U+202A..202E, U+2066..2069
7162          Bidi_M                  Bidi_Mirrored
7163          Bidi_Mirrored           (Short: Bidi_M).  [\(\)<>\[\]\{\}\xab
7164                                  \xbb], U+0F3A..0F3D, U+169B..169C,
7165                                  U+2039..203A, U+2045..2046, U+207D..207E
7166                                  ...
7167          Bidi_Mirroring_Glyph    (Short: bmg)
7168          Bidi_Paired_Bracket     (Short: bpb)
7169          Bidi_Paired_Bracket_Type (Short: bpt)
7170          Blank                   XPosixBlank.  (Perl extension)
7171          Blk                     Block
7172          Block                   (Short: blk)
7173          Bmg                     Bidi_Mirroring_Glyph
7174          Bpb                     Bidi_Paired_Bracket
7175          Bpt                     Bidi_Paired_Bracket_Type
7176          Canonical_Combining_Class (Short: ccc)
7177          Case_Folding            (Short: cf)
7178          Case_Ignorable          (Short: CI).  [\'.:\^`\xa8\xad\xaf\xb4
7179                                  \xb7-\xb8], U+02B0..036F, U+0374..0375,
7180                                  U+037A, U+0384..0385, U+0387 ...
7181          Cased                   [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7182                                  \xff], U+0100..01BA, U+01BC..01BF,
7183                                  U+01C4..0293, U+0295..02B8, U+02C0..02C1
7184                                  ...
7185          Category                General_Category
7186          Ccc                     Canonical_Combining_Class
7187          CE                      Composition_Exclusion
7188          Cf                      Case_Folding; NOT 'cf' meaning
7189                                  'General_Category=Format'
7190          Changes_When_Casefolded (Short: CWCF).  [A-Z\xb5\xc0-\xd6\xd8-
7191                                  \xdf], U+0100, U+0102, U+0104, U+0106,
7192                                  U+0108 ...
7193          Changes_When_Casemapped (Short: CWCM).  [A-Za-z\xb5\xc0-\xd6\xd8-
7194                                  \xf6\xf8-\xff], U+0100..0137,
7195                                  U+0139..018C, U+018E..019A, U+019C..01A9,
7196                                  U+01AC..01B9 ...
7197          Changes_When_Lowercased (Short: CWL).  [A-Z\xc0-\xd6\xd8-\xde],
7198                                  U+0100, U+0102, U+0104, U+0106, U+0108 ...
7199          Changes_When_NFKC_Casefolded (Short: CWKCF).  [A-Z\xa0\xa8\xaa
7200                                  \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
7201                                  \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
7202                                  U+0106, U+0108 ...
7203          Changes_When_Titlecased (Short: CWT).  [a-z\xb5\xdf-\xf6\xf8-
7204                                  \xff], U+0101, U+0103, U+0105, U+0107,
7205                                  U+0109 ...
7206          Changes_When_Uppercased (Short: CWU).  [a-z\xb5\xdf-\xf6\xf8-
7207                                  \xff], U+0101, U+0103, U+0105, U+0107,
7208                                  U+0109 ...
7209          CI                      Case_Ignorable
7210          Cntrl                   XPosixCntrl (=General_Category=Control).
7211                                  (Perl extension)
7212          Comp_Ex                 Full_Composition_Exclusion
7213          Composition_Exclusion   (Short: CE).  U+0958..095F, U+09DC..09DD,
7214                                  U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
7215          CWCF                    Changes_When_Casefolded
7216          CWCM                    Changes_When_Casemapped
7217          CWKCF                   Changes_When_NFKC_Casefolded
7218          CWL                     Changes_When_Lowercased
7219          CWT                     Changes_When_Titlecased
7220          CWU                     Changes_When_Uppercased
7221          Dash                    [\-], U+058A, U+05BE, U+1400, U+1806,
7222                                  U+2010..2015 ...
7223          Decomposition_Mapping   (Short: dm)
7224          Decomposition_Type      (Short: dt)
7225          Default_Ignorable_Code_Point (Short: DI).  [\xad], U+034F, U+061C,
7226                                  U+115F..1160, U+17B4..17B5, U+180B..180F
7227                                  ...
7228          Dep                     Deprecated
7229          Deprecated              (Short: Dep).  U+0149, U+0673, U+0F77,
7230                                  U+0F79, U+17A3..17A4, U+206A..206F ...
7231          DI                      Default_Ignorable_Code_Point
7232          Dia                     Diacritic
7233          Diacritic               (Short: Dia).  [\^`\xa8\xaf\xb4\xb7-\xb8],
7234                                  U+02B0..034E, U+0350..0357, U+035D..0362,
7235                                  U+0374..0375, U+037A ...
7236          Digit                   XPosixDigit (=General_Category=
7237                                  Decimal_Number).  (Perl extension)
7238          Dm                      Decomposition_Mapping
7239          Dt                      Decomposition_Type
7240          Ea                      East_Asian_Width
7241          East_Asian_Width        (Short: ea)
7242          EBase                   Emoji_Modifier_Base
7243          EComp                   Emoji_Component
7244          EMod                    Emoji_Modifier
7245          Emoji                   [#*0-9\xa9\xae], U+203C, U+2049, U+2122,
7246                                  U+2139, U+2194..2199 ...
7247          Emoji_Component         (Short: EComp).  [#*0-9], U+200D, U+20E3,
7248                                  U+FE0F, U+1F1E6..1F1FF, U+1F3FB..1F3FF ...
7249          Emoji_Modifier          (Short: EMod).  U+1F3FB..1F3FF
7250          Emoji_Modifier_Base     (Short: EBase).  U+261D, U+26F9,
7251                                  U+270A..270D, U+1F385, U+1F3C2..1F3C4,
7252                                  U+1F3C7 ...
7253          Emoji_Presentation      (Short: EPres).  U+231A..231B,
7254                                  U+23E9..23EC, U+23F0, U+23F3,
7255                                  U+25FD..25FE, U+2614..2615 ...
7256          EPres                   Emoji_Presentation
7257          EqUIdeo                 Equivalent_Unified_Ideograph
7258          Equivalent_Unified_Ideograph (Short: EqUIdeo)
7259          Ext                     Extender
7260          Extended_Pictographic   (Short: ExtPict).  [\xa9\xae], U+203C,
7261                                  U+2049, U+2122, U+2139, U+2194..2199 ...
7262          Extender                (Short: Ext).  [\xb7], U+02D0..02D1,
7263                                  U+0640, U+07FA, U+0B55, U+0E46 ...
7264          ExtPict                 Extended_Pictographic
7265          Full_Composition_Exclusion (Short: Comp_Ex).  U+0340..0341,
7266                                  U+0343..0344, U+0374, U+037E, U+0387,
7267                                  U+0958..095F ...
7268          Gc                      General_Category
7269          GCB                     Grapheme_Cluster_Break
7270          General_Category        (Short: gc)
7271          Gr_Base                 Grapheme_Base
7272          Gr_Ext                  Grapheme_Extend
7273          Graph                   XPosixGraph.  (Perl extension)
7274          Grapheme_Base           (Short: Gr_Base).  [\x20-\x7e\xa0-\xac
7275                                  \xae-\xff], U+0100..02FF, U+0370..0377,
7276                                  U+037A..037F, U+0384..038A, U+038C ...
7277          Grapheme_Cluster_Break  (Short: GCB)
7278          Grapheme_Extend         (Short: Gr_Ext).  U+0300..036F,
7279                                  U+0483..0489, U+0591..05BD, U+05BF,
7280                                  U+05C1..05C2, U+05C4..05C5 ...
7281          Hangul_Syllable_Type    (Short: hst)
7282          Hex                     Hex_Digit
7283          Hex_Digit               (Short: Hex).  [0-9A-Fa-f], U+FF10..FF19,
7284                                  U+FF21..FF26, U+FF41..FF46
7285          HorizSpace              XPosixBlank.  (Perl extension)
7286          Hst                     Hangul_Syllable_Type
7287        D Hyphen                  [\-\xad], U+058A, U+1806, U+2010..2011,
7288                                  U+2E17, U+30FB ...  Supplanted by
7289                                  Line_Break property values; see
7290                                  www.unicode.org/reports/tr14
7291          ID_Continue             (Short: IDC).  [0-9A-Z_a-z\xaa\xb5\xb7
7292                                  \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7293                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7294                                  U+02EC, U+02EE ...
7295          ID_Start                (Short: IDS).  [A-Za-z\xaa\xb5\xba\xc0-
7296                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7297                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7298                                  ...
7299          IDC                     ID_Continue
7300          Identifier_Status
7301          Identifier_Type
7302          Ideo                    Ideographic
7303          Ideographic             (Short: Ideo).  U+3006..3007,
7304                                  U+3021..3029, U+3038..303A, U+3400..4DBF,
7305                                  U+4E00..9FFF, U+F900..FA6D ...
7306          IDS                     ID_Start
7307          IDS_Binary_Operator     (Short: IDSB).  U+2FF0..2FF1, U+2FF4..2FFB
7308          IDS_Trinary_Operator    (Short: IDST).  U+2FF2..2FF3
7309          IDSB                    IDS_Binary_Operator
7310          IDST                    IDS_Trinary_Operator
7311          In                      Present_In.  (Perl extension)
7312          Indic_Positional_Category (Short: InPC)
7313          Indic_Syllabic_Category (Short: InSC)
7314          InPC                    Indic_Positional_Category
7315          InSC                    Indic_Syllabic_Category
7316          Isc                     ISO_Comment; NOT 'isc' meaning
7317                                  'General_Category=Other'
7318          ISO_Comment             (Short: isc)
7319          Jg                      Joining_Group
7320          Join_C                  Join_Control
7321          Join_Control            (Short: Join_C).  U+200C..200D
7322          Joining_Group           (Short: jg)
7323          Joining_Type            (Short: jt)
7324          Jt                      Joining_Type
7325          Lb                      Line_Break
7326          Lc                      Lowercase_Mapping; NOT 'lc' meaning
7327                                  'General_Category=Cased_Letter'
7328          Line_Break              (Short: lb)
7329          LOE                     Logical_Order_Exception
7330          Logical_Order_Exception (Short: LOE).  U+0E40..0E44, U+0EC0..0EC4,
7331                                  U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
7332                                  ...
7333          Lower                   Lowercase
7334          Lowercase               (Short: Lower).  [a-z\xaa\xb5\xba\xdf-
7335                                  \xf6\xf8-\xff], U+0101, U+0103, U+0105,
7336                                  U+0107, U+0109 ...
7337          Lowercase_Mapping       (Short: lc)
7338          Math                    [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
7339                                  U+03D5, U+03F0..03F1, U+03F4..03F6,
7340                                  U+0606..0608 ...
7341          Na                      Name
7342          Na1                     Unicode_1_Name
7343          Name                    (Short: na)
7344          Name_Alias
7345          NChar                   Noncharacter_Code_Point
7346          NFC_QC                  NFC_Quick_Check
7347          NFC_Quick_Check         (Short: NFC_QC)
7348          NFD_QC                  NFD_Quick_Check
7349          NFD_Quick_Check         (Short: NFD_QC)
7350          NFKC_Casefold           (Short: NFKC_CF)
7351          NFKC_CF                 NFKC_Casefold
7352          NFKC_QC                 NFKC_Quick_Check
7353          NFKC_Quick_Check        (Short: NFKC_QC)
7354          NFKD_QC                 NFKD_Quick_Check
7355          NFKD_Quick_Check        (Short: NFKD_QC)
7356          Noncharacter_Code_Point (Short: NChar).  U+FDD0..FDEF,
7357                                  U+FFFE..FFFF, U+1FFFE..1FFFF,
7358                                  U+2FFFE..2FFFF, U+3FFFE..3FFFF,
7359                                  U+4FFFE..4FFFF ...
7360          Nt                      Numeric_Type
7361          Numeric_Type            (Short: nt)
7362          Numeric_Value           (Short: nv)
7363          Nv                      Numeric_Value
7364          Pat_Syn                 Pattern_Syntax
7365          Pat_WS                  Pattern_White_Space
7366          Pattern_Syntax          (Short: Pat_Syn).  [!\"#\$\%&\'\(\)*+,\-.
7367                                  \/:;<=>?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9
7368                                  \xab-\xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7
7369                                  \xf7], U+2010..2027, U+2030..203E,
7370                                  U+2041..2053, U+2055..205E, U+2190..245F
7371                                  ...
7372          Pattern_White_Space     (Short: Pat_WS).  [\t\n\cK\f\r\x20\x85],
7373                                  U+200E..200F, U+2028..2029
7374          PCM                     Prepended_Concatenation_Mark
7375          Perl_Decimal_Digit      (Perl extension)
7376          PerlSpace               PosixSpace.  (Perl extension)
7377          PerlWord                PosixWord.  (Perl extension)
7378          PosixAlnum              (Perl extension).  [0-9A-Za-z]
7379          PosixAlpha              (Perl extension).  [A-Za-z]
7380          PosixBlank              (Perl extension).  [\t\x20]
7381          PosixCntrl              (Perl extension).  ASCII control
7382                                  characters.  ACK, BEL, BS, CAN, CR, DC1,
7383                                  DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
7384                                  ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
7385                                  NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
7386          PosixDigit              (Perl extension).  [0-9]
7387          PosixGraph              (Perl extension).  [!\"#\$\%&\'\(\)*+,\-.
7388                                  \/0-9:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~]
7389          PosixLower              (Perl extension).  [a-z]
7390          PosixPrint              (Perl extension).  [\x20-\x7e]
7391          PosixPunct              (Perl extension).  [!\"#\$\%&\'\(\)*+,\-.
7392                                  \/:;<=>?\@\[\\\]\^_`\{\|\}~]
7393          PosixSpace              (Perl extension).  [\t\n\cK\f\r\x20]
7394          PosixUpper              (Perl extension).  [A-Z]
7395          PosixWord               (Perl extension).  \w, restricted to
7396                                  ASCII.  [0-9A-Z_a-z]
7397          PosixXDigit             ASCII_Hex_Digit.  (Perl extension).
7398                                  [0-9A-Fa-f]
7399          Prepended_Concatenation_Mark (Short: PCM).  U+0600..0605, U+06DD,
7400                                  U+070F, U+0890..0891, U+08E2, U+110BD ...
7401          Present_In              (Short: In).  (Perl extension)
7402          Print                   XPosixPrint.  (Perl extension)
7403          Punct                   General_Category=Punctuation.  (Perl
7404                                  extension).  [!\"#\%&\'\(\)*,\-.\/:;?\@
7405                                  \[\\\]_\{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
7406                                  U+037E, U+0387, U+055A..055F,
7407                                  U+0589..058A, U+05BE ...
7408          QMark                   Quotation_Mark
7409          Quotation_Mark          (Short: QMark).  [\"\'\xab\xbb],
7410                                  U+2018..201F, U+2039..203A, U+2E42,
7411                                  U+300C..300F, U+301D..301F ...
7412          Radical                 U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
7413          Regional_Indicator      (Short: RI).  U+1F1E6..1F1FF
7414          RI                      Regional_Indicator
7415          SB                      Sentence_Break
7416          Sc                      Script; NOT 'sc' meaning
7417                                  'General_Category=Currency_Symbol'
7418          Scf                     Simple_Case_Folding
7419          Script                  (Short: sc)
7420          Script_Extensions       (Short: scx)
7421          Scx                     Script_Extensions
7422          SD                      Soft_Dotted
7423          Sentence_Break          (Short: SB)
7424          Sentence_Terminal       (Short: STerm).  [!.?], U+0589,
7425                                  U+061D..061F, U+06D4, U+0700..0702, U+07F9
7426                                  ...
7427          Sfc                     Simple_Case_Folding
7428          Simple_Case_Folding     (Short: scf)
7429          Simple_Lowercase_Mapping (Short: slc)
7430          Simple_Titlecase_Mapping (Short: stc)
7431          Simple_Uppercase_Mapping (Short: suc)
7432          Slc                     Simple_Lowercase_Mapping
7433          Soft_Dotted             (Short: SD).  [i-j], U+012F, U+0249,
7434                                  U+0268, U+029D, U+02B2 ...
7435          Space                   White_Space
7436          SpacePerl               XPosixSpace.  (Perl extension)
7437          Stc                     Simple_Titlecase_Mapping
7438          STerm                   Sentence_Terminal
7439          Suc                     Simple_Uppercase_Mapping
7440          Tc                      Titlecase_Mapping
7441          Term                    Terminal_Punctuation
7442          Terminal_Punctuation    (Short: Term).  [!,.:;?], U+037E, U+0387,
7443                                  U+0589, U+05C3, U+060C ...
7444          Title                   Titlecase.  (Perl extension)
7445          Titlecase               (Short: Title).  (Perl extension).  (=
7446                                  \p{Gc=Lt}).  U+01C5, U+01C8, U+01CB,
7447                                  U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
7448          Titlecase_Mapping       (Short: tc)
7449          Uc                      Uppercase_Mapping
7450          UIdeo                   Unified_Ideograph
7451          Unicode                 Any.  (Perl extension)
7452          Unicode_1_Name          (Short: na1)
7453          Unified_Ideograph       (Short: UIdeo).  U+3400..4DBF,
7454                                  U+4E00..9FFF, U+FA0E..FA0F, U+FA11,
7455                                  U+FA13..FA14, U+FA1F ...
7456          Upper                   Uppercase
7457          Uppercase               (Short: Upper).  [A-Z\xc0-\xd6\xd8-\xde],
7458                                  U+0100, U+0102, U+0104, U+0106, U+0108 ...
7459          Uppercase_Mapping       (Short: uc)
7460          Variation_Selector      (Short: VS).  U+180B..180D, U+180F,
7461                                  U+FE00..FE0F, U+E0100..E01EF
7462          Vertical_Orientation    (Short: vo)
7463          VertSpace               (Perl extension).  \v.  [\n\cK\f\r\x85],
7464                                  U+2028..2029
7465          Vo                      Vertical_Orientation
7466          VS                      Variation_Selector
7467          WB                      Word_Break
7468          White_Space             (Short: WSpace).  [\t\n\cK\f\r\x20\x85
7469                                  \xa0], U+1680, U+2000..200A, U+2028..2029,
7470                                  U+202F, U+205F ...
7471          Word                    XPosixWord.  (Perl extension)
7472          Word_Break              (Short: WB)
7473          WSpace                  White_Space
7474          XDigit                  XPosixXDigit (=Hex_Digit).  (Perl
7475                                  extension)
7476          XID_Continue            (Short: XIDC).  [0-9A-Z_a-z\xaa\xb5\xb7
7477                                  \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7478                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7479                                  U+02EC, U+02EE ...
7480          XID_Start               (Short: XIDS).  [A-Za-z\xaa\xb5\xba\xc0-
7481                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7482                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7483                                  ...
7484          XIDC                    XID_Continue
7485          XIDS                    XID_Start
7486          XPerlSpace              XPosixSpace.  (Perl extension)
7487          XPosixAlnum             (Short: Alnum).  (Perl extension).
7488                                  Alphabetic and (decimal) Numeric.  [0-9A-
7489                                  Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7490                                  \xff], U+0100..02C1, U+02C6..02D1,
7491                                  U+02E0..02E4, U+02EC, U+02EE ...
7492          XPosixAlpha             Alphabetic.  (Perl extension).  [A-Za-z
7493                                  \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7494                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7495                                  U+02EC, U+02EE ...
7496          XPosixBlank             (Short: Blank).  (Perl extension).  \h,
7497                                  Horizontal white space.  [\t\x20\xa0],
7498                                  U+1680, U+2000..200A, U+202F, U+205F,
7499                                  U+3000
7500          XPosixCntrl             General_Category=Control  (Short: Cntrl).
7501                                  (Perl extension).  Control characters.
7502                                  [\x00-\x1f\x7f-\x9f]
7503          XPosixDigit             General_Category=Decimal_Number  (Short:
7504                                  Digit).  (Perl extension).  [0-9] + all
7505                                  other decimal digits.  [0-9],
7506                                  U+0660..0669, U+06F0..06F9, U+07C0..07C9,
7507                                  U+0966..096F, U+09E6..09EF ...
7508          XPosixGraph             (Short: Graph).  (Perl extension).
7509                                  Characters that are graphical.  [!\"#\$
7510                                  \%&\'\(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]
7511                                  \^_`a-z\{\|\}~\xa1-\xff], U+0100..0377,
7512                                  U+037A..037F, U+0384..038A, U+038C,
7513                                  U+038E..03A1 ...
7514          XPosixLower             Lowercase.  (Perl extension).  [a-z\xaa
7515                                  \xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
7516                                  U+0103, U+0105, U+0107, U+0109 ...
7517          XPosixPrint             (Short: Print).  (Perl extension).
7518                                  Characters that are graphical plus space
7519                                  characters (but no controls).  [\x20-\x7e
7520                                  \xa0-\xff], U+0100..0377, U+037A..037F,
7521                                  U+0384..038A, U+038C, U+038E..03A1 ...
7522          XPosixPunct             (Perl extension).  \p{Punct} + ASCII-range
7523                                  \p{Symbol}.  [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7524                                  >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
7525                                  \xb7\xbb\xbf], U+037E, U+0387,
7526                                  U+055A..055F, U+0589..058A, U+05BE ...
7527          XPosixSpace             (Perl extension).  \s including beyond
7528                                  ASCII and vertical tab.  [\t\n\cK\f\r\x20
7529                                  \x85\xa0], U+1680, U+2000..200A,
7530                                  U+2028..2029, U+202F, U+205F ...
7531          XPosixUpper             Uppercase.  (Perl extension).  [A-Z\xc0-
7532                                  \xd6\xd8-\xde], U+0100, U+0102, U+0104,
7533                                  U+0106, U+0108 ...
7534          XPosixWord              (Short: Word).  (Perl extension).  \w,
7535                                  including beyond ASCII; = \p{Alnum} + \pM
7536                                  + \p{Pc} + \p{Join_Control}.  [0-9A-Z_a-z
7537                                  \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7538                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7539                                  U+02EC, U+02EE ...
7540          XPosixXDigit            Hex_Digit  (Short: XDigit).  (Perl
7541                                  extension).  [0-9A-Fa-f], U+FF10..FF19,
7542                                  U+FF21..FF26, U+FF41..FF46
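
       As an illustration of how the names in the table are used, the
       following minimal sketch checks a few sample characters against a
       short name and its full name, which should give the same answer.
       The helper "has_property" and the sample characters are
       assumptions made for this example only.

```perl
use strict;
use warnings;

# Each pair is (short or synonym name, full name) as listed in the table.
my @pairs = (
    [ 'Upper',  'Uppercase'    ],
    [ 'XDigit', 'XPosixXDigit' ],
    [ 'Title',  'Titlecase'    ],
    [ 'Blank',  'XPosixBlank'  ],
);

# Hypothetical helper: true if the character matches \p{$property}.
sub has_property {
    my ($char, $property) = @_;
    return $char =~ /\p{$property}/ ? 1 : 0;
}

# U+00C9 is an uppercase letter outside ASCII, U+FF21 a fullwidth hex
# digit, U+01C5 a titlecase letter, and U+00A0 a no-break space.
my @samples = ("\x{C9}", "\x{FF21}", "\x{1C5}", "\x{A0}");

for my $i (0 .. $#pairs) {
    my ($short, $long) = @{ $pairs[$i] };
    printf "U+%04X  %-7s %d   %-13s %d\n",
        ord $samples[$i],
        $short, has_property($samples[$i], $short),
        $long,  has_property($samples[$i], $long);
}
```

       The Posix variants are restricted to ASCII, so for instance
       "\p{PosixUpper}" does not match U+00C9 even though "\p{Upper}"
       does.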
7543

Properties accessible through other means

7545       Certain properties are accessible also via core function calls.  These
7546       are:
7547
7548        Lowercase_Mapping          lc() and lcfirst()
7549        Titlecase_Mapping          ucfirst()
7550        Uppercase_Mapping          uc()
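
       A small sketch of these functions follows.  It uses the digraph
       U+01C6, whose titlecase and uppercase mappings differ, so the
       distinction between ucfirst() and uc() is visible.  The helper
       "code_points" is an assumption made for display purposes only.

```perl
use strict;
use warnings;

# Hypothetical helper: render a string as a list of U+XXXX code points,
# which avoids printing wide characters to the terminal.
sub code_points {
    my ($string) = @_;
    return join ' ', map { sprintf 'U+%04X', ord } split //, $string;
}

my $small = "\x{1C6}";   # LATIN SMALL LETTER DZ WITH CARON
my $big   = "\x{1C4}";   # LATIN CAPITAL LETTER DZ WITH CARON

print "ucfirst: ", code_points(ucfirst "${small}abc"), "\n";  # Titlecase_Mapping
print "uc:      ", code_points(uc      "${small}abc"), "\n";  # Uppercase_Mapping
print "lc:      ", code_points(lc      "${big}ABC"),   "\n";  # Lowercase_Mapping
print "lcfirst: ", code_points(lcfirst "${big}ABC"),   "\n";  # first character only
```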
7551
7552       Also, Case_Folding is accessible through the "/i" modifier in regular
7553       expressions, the "\F" double-quotish escape, and the "fc" operator.
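
       The three routes can be compared in a short sketch.  It assumes
       Perl 5.16 or later, and relies on "use v5.16" to enable both the
       "fc" and "unicode_strings" features so that U+00DF is folded
       under Unicode rules.  The helper "same_folded" is hypothetical.

```perl
use v5.16;      # enables strict plus the fc and unicode_strings features
use warnings;

my $upper = "STRASSE";
my $sharp = "stra\x{DF}e";      # contains U+00DF LATIN SMALL LETTER SHARP S

# Hypothetical helper: compare two strings after full case folding.
sub same_folded {
    my ($x, $y) = @_;
    return fc($x) eq fc($y) ? 1 : 0;
}

my $via_fc     = same_folded($upper, $sharp);
my $via_escape = ("\F$upper" eq "\F$sharp")         ? 1 : 0;
my $via_i      = ($upper =~ /\Astra\x{DF}e\z/iu)    ? 1 : 0;

say "fc():       $via_fc";
say "\\F escape:  $via_escape";
say "/i match:   $via_i";
```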
7554
7555       Besides being able to say "\p{Name=...}", the Name and Name_Alias
7556       properties are accessible through the "\N{}" interpolation in double-
7557       quoted strings and regular expressions, and through the functions
7558       "charnames::viacode()", "charnames::vianame()", and
7559       "charnames::string_vianame()" (which require a "use charnames ();" to
7560       be specified).
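
       A minimal sketch of these interfaces follows.  It loads charnames
       with ":full" rather than an empty list, an assumption made here so
       that the "\N{}" line works in the same file on older perls as
       well; the functions are called by their fully qualified names
       either way.

```perl
use strict;
use warnings;
use charnames ':full';

my $name   = charnames::viacode(0x00E9);    # code point to name
my $ord    = charnames::vianame('LATIN SMALL LETTER E WITH ACUTE');
my $string = charnames::string_vianame('LATIN SMALL LETTER E WITH ACUTE');
my $interp = "\N{LATIN SMALL LETTER E WITH ACUTE}";

printf "viacode(0xE9)     : %s\n", $name;
printf "vianame(...)      : U+%04X\n", $ord;
printf "string_vianame    : U+%04X\n", ord $string;
printf "\\N{} interpolation: U+%04X\n", ord $interp;
```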
7561
7562       Finally, most properties related to decomposition are accessible via
7563       Unicode::Normalize.
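
       For example, a canonical or compatibility decomposition can be
       obtained as in the sketch below.  The sample characters are
       chosen for illustration; "code_points" is a hypothetical display
       helper.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFC NFD NFKD);

# Hypothetical helper: render a string as a list of U+XXXX code points.
sub code_points {
    my ($string) = @_;
    return join ' ', map { sprintf 'U+%04X', ord } split //, $string;
}

my $composed = "\x{E9}";    # LATIN SMALL LETTER E WITH ACUTE
my $ligature = "\x{FB01}";  # LATIN SMALL LIGATURE FI

print "NFD : ", code_points(NFD($composed)),      "\n";  # canonical decomposition
print "NFC : ", code_points(NFC(NFD($composed))), "\n";  # recomposed
print "NFKD: ", code_points(NFKD($ligature)),     "\n";  # compatibility decomposition
```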
7564

Unicode character properties that are NOT accepted by Perl

7566       Perl raises an error when one of a few Unicode character properties is
7567       used in a regular expression.  The non-Unihan ones are listed
7568       below, with the reasons they are not accepted, perhaps with work-
7569       arounds.  The short names for the properties are listed enclosed in
7570       (parentheses).  As described after the list, an installation can change
7571       the defaults and choose to accept any of these.  The list is machine
7572       generated based on the choices made for the installation that generated
7573       this document.
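
       Because the list depends on the installation, it can be useful to
       probe whether a given perl accepts a property.  The following is
       a minimal sketch under that assumption; "property_accepted" is a
       hypothetical helper, and the property name is interpolated so
       that the pattern is compiled at run time, where eval can trap the
       error.

```perl
use strict;
use warnings;

# Hypothetical helper: returns 1 if this perl accepts \p{$property} in a
# regular expression, 0 if compiling or running the pattern dies.
sub property_accepted {
    my ($property) = @_;
    local $@;
    my $ok = eval {
        no warnings;                      # keep the probe quiet
        my $unused = "a" =~ /\p{$property}/;
        1;
    };
    return $ok ? 1 : 0;
}

for my $property (qw(Alphabetic Other_Alphabetic Grapheme_Link XO_NFC)) {
    printf "%-18s %s\n", $property,
        property_accepted($property) ? 'accepted' : 'rejected';
}
```

       On a perl built with the default choices, only Alphabetic in this
       loop would be expected to be reported as accepted.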
7574
7575       Expands_On_NFC (XO_NFC)
7576       Expands_On_NFD (XO_NFD)
7577       Expands_On_NFKC (XO_NFKC)
7578       Expands_On_NFKD (XO_NFKD)
7579           Deprecated by Unicode.  These are characters that expand to more
7580           than one character in the specified normalization form, but whether
7581           they actually take up more bytes or not depends on the encoding
7582           being used.  For example, a UTF-8 encoded character may expand to a
7583           different number of bytes than a UTF-32 encoded character.
7584
7585       Grapheme_Link (Gr_Link)
7586           Duplicates ccc=vr (Canonical_Combining_Class=Virama)
7587
7588       Jamo_Short_Name (JSN)
7589       Other_Alphabetic (OAlpha)
7590       Other_Default_Ignorable_Code_Point (ODI)
7591       Other_Grapheme_Extend (OGr_Ext)
7592       Other_ID_Continue (OIDC)
7593       Other_ID_Start (OIDS)
7594       Other_Lowercase (OLower)
7595       Other_Math (OMath)
7596       Other_Uppercase (OUpper)
7597           Used by Unicode internally for generating other properties and not
7598           intended to be used stand-alone
7599
7600       Script=Katakana_Or_Hiragana (sc=Hrkt)
7601           Obsolete.  All code points previously matched by this have been
7602           moved to "Script=Common".  Consider instead using
7603           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
7604           both)
7605
7606       Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
7607           All code points that would be matched by this are matched by either
7608           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
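
       The suggested replacement for Katakana_Or_Hiragana can be
       sketched as a bracketed character class that combines the two
       accepted properties.  The names "$kana" and "is_kana" are
       assumptions made for this example.

```perl
use strict;
use warnings;

# Union of the two accepted properties, standing in for scx=Hrkt.
my $kana = qr/[\p{Script_Extensions=Katakana}\p{Script_Extensions=Hiragana}]/;

# Hypothetical helper: true if the single character is kana by this test.
sub is_kana {
    my ($char) = @_;
    return $char =~ /\A$kana\z/ ? 1 : 0;
}

my @samples = (
    [ "\x{30A2}", 'KATAKANA LETTER A' ],
    [ "\x{3042}", 'HIRAGANA LETTER A' ],
    [ "\x{30FC}", 'KATAKANA-HIRAGANA PROLONGED SOUND MARK' ],
    [ "A",        'LATIN CAPITAL LETTER A' ],
);

for my $sample (@samples) {
    my ($char, $name) = @$sample;
    printf "U+%04X %-40s %s\n", ord $char, $name,
        is_kana($char) ? 'kana' : 'not kana';
}
```

       U+30FC has Script=Common, so it is found through
       Script_Extensions but would be missed by "\p{Script=Katakana}".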
7609
7610       An installation can choose to allow any of these to be matched by
7611       downloading the Unicode database from <http://www.unicode.org/Public/>
7612       to $Config{privlib}/unicore/ in the Perl source tree, changing the
7613       controlling lists contained in the program
7614       $Config{privlib}/unicore/mktables and then re-compiling and installing.
7615       (%Config is available from the Config module).
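
       To see where these paths point on a particular installation, a
       sketch such as the following can be used.  The helper
       "unicore_dir" is hypothetical, and the mktables program may not
       be present outside the Perl source tree, so the script only
       reports whether it finds the file.

```perl
use strict;
use warnings;
use Config;          # exports the read-only %Config hash
use File::Spec;

# Hypothetical helper: the unicore directory under $Config{privlib}.
sub unicore_dir {
    return File::Spec->catdir($Config{privlib}, 'unicore');
}

my $mktables = File::Spec->catfile(unicore_dir(), 'mktables');

printf "privlib : %s\n", $Config{privlib};
printf "unicore : %s\n", unicore_dir();
printf "mktables: %s\n", -e $mktables ? $mktables : 'not found at this location';
```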
7616
7617       Also, perl can be recompiled to operate on an earlier version of the
7618       Unicode standard.  Further information is at
7619       $Config{privlib}/unicore/README.perl.
7620

Other information in the Unicode data base

7622       The Unicode data base is delivered in two different formats.  The XML
7623       version is valid for more modern Unicode releases.  The other version
7624       is a collection of files.  The two are intended to give equivalent
7625       information.  Perl uses the older form; this allows you to recompile
7626       Perl to use early Unicode releases.
7627
7628       The only non-character property that Perl currently supports is Named
7629       Sequences, in which a sequence of code points is given a name and
7630       generally treated as a single entity.  (Perl supports these via the
7631       "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
7632       in charnames, and "namedseq()" in Unicode::UCD.)
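
       A named sequence can be looked up as in the sketch below, which
       uses KATAKANA LETTER AINU P as the sample name.  The helper
       "code_points" is an assumption made for display purposes.

```perl
use strict;
use warnings;
use charnames ':full';
use Unicode::UCD qw(namedseq);

# Hypothetical helper: render a string as a list of U+XXXX code points.
sub code_points {
    my ($string) = @_;
    return join ' ', map { sprintf 'U+%04X', ord } split //, $string;
}

my $name = 'KATAKANA LETTER AINU P';

my $via_charnames = charnames::string_vianame($name);
my $via_ucd       = namedseq($name);   # scalar context: the sequence as a string
my @ordinals      = namedseq($name);   # list context: the code point ordinals

print "string_vianame: ", code_points($via_charnames), "\n";
print "namedseq      : ", code_points($via_ucd), "\n";
print "ordinals      : ", join(' ', map { sprintf '%04X', $_ } @ordinals), "\n";
```

       Both lookups return a string of more than one code point, which
       is what distinguishes a named sequence from a character name.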
7633
7634       Below is a list of the files in the Unicode data base that Perl doesn't
7635       currently use, along with very brief descriptions of their purposes.
7636       Some of the names of the files have been shortened from those that
7637       Unicode uses, in order to allow them to be distinguishable from
7638       similarly named files on file systems for which only the first 8
7639       characters of a name are significant.
7640
7641       auxiliary/GraphemeBreakTest.html
7642       auxiliary/LineBreakTest.html
7643       auxiliary/SentenceBreakTest.html
7644       auxiliary/WordBreakTest.html
7645           Documentation of validation Tests
7646
7647       BidiCharacterTest.txt
7648       BidiTest.txt
7649       NormTest.txt
7650           Validation Tests
7651
7652       CJKRadicals.txt
7653           Maps the kRSUnicode property values to corresponding code points
7654
7655       emoji/ReadMe.txt
7656       ReadMe.txt
7657           Documentation
7658
7659       EmojiSources.txt
7660           Maps certain Unicode code points to their legacy Japanese cell-
7661           phone values
7662
7663       extracted/DName.txt
7664           This file adds no new information not already present in other
7665           files
7666
7667       Index.txt
7668           Alphabetical index of Unicode characters
7669
7670       NamedSqProv.txt
7671           Named sequences proposed for inclusion in a later version of the
7672           Unicode Standard; if you need them now, you can append this file to
7673           NamedSequences.txt and recompile perl
7674
7675       NamesList.html
7676           Describes the format and contents of NamesList.txt
7677
7678       NamesList.txt
7679           Annotated list of characters
7680
7681       NormalizationCorrections.txt
7682           Documentation of corrections already incorporated into the Unicode
7683           data base
7684
7685       NushuSources.txt
7686           Specifies source material for Nushu characters
7687
7688       StandardizedVariants.html
7689           Obsoleted as of Unicode 9.0, but previously provided a visual
7690           display of the standard variant sequences derived from
7691           StandardizedVariants.txt.
7692
7693       StandardizedVariants.txt
7694           Certain glyph variations for character display are standardized.
7695           This lists the non-Unihan ones; the Unihan ones are also not used
7696           by Perl, and are in a separate Unicode data base
7697           <http://www.unicode.org/ivd>
7698
7699       TangutSources.txt
7700           Specifies source mappings for Tangut ideographs and components.
7701           This data file also includes informative radical-stroke values that
7702           are used internally by Unicode
7703
7704       USourceData.txt
7705           Documentation of status and cross reference of proposals for
7706           encoding by Unicode of Unihan characters
7707
7708       USourceGlyphs.pdf
7709           Pictures of the characters in USourceData.txt
7710

SEE ALSO

7712       <http://www.unicode.org/reports/tr44/>
7713
7714       perlrecharclass
7715
7716       perlunicode
7717
7718
7719
7720perl v5.36.0                      2022-08-30                   PERLUNIPROPS(1)