1PERLUNIPROPS(1)        Perl Programmers Reference Guide        PERLUNIPROPS(1)

NAME

6       perluniprops - Index of Unicode Version 12.1.0 character properties in
7       Perl
8

DESCRIPTION

10       This document provides information about the portion of the Unicode
11       database that deals with character properties, that is the portion that
12       is defined on single code points.  ("Other information in the Unicode
13       data base" below briefly mentions other data that Unicode provides.)
14
15       Perl can provide access to all non-provisional Unicode character
16       properties, though not all are enabled by default.  The omitted ones
17       are the Unihan properties (accessible via the CPAN module
18       Unicode::Unihan) and certain deprecated or Unicode-internal properties.
19       (An installation may choose to recompile Perl's tables to change this.
20       See "Unicode character properties that are NOT accepted by Perl".)
21
22       For most purposes, access to Unicode properties from the Perl core is
23       through regular expression matches, as described in the next section.
24       For some special purposes, and to access the properties that are not
25       suitable for regular expression matching, all the Unicode character
26       properties that Perl handles are accessible via the standard
27       Unicode::UCD module, as described in the section "Properties accessible
28       through Unicode::UCD".
29
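As a minimal sketch of the Unicode::UCD route described above, the following uses two of the module's long-standing functions, charinfo() and charblock(). The variable names and the two code points are chosen only for illustration.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo charblock);

# charinfo() returns a hash reference describing a single code point.
my $info     = charinfo(0x41);       # U+0041
my $name     = $info->{name};        # the character's name
my $category = $info->{category};    # short General_Category value

# charblock() returns the name of the block that contains the code point.
my $block = charblock(0x10400);

print "$name / $category / $block\n";
# prints "LATIN CAPITAL LETTER A / Lu / Deseret"
```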
30       Perl also provides some additional extensions and short-cut synonyms
31       for Unicode properties.
32
33       This document merely lists all available properties and does not
34       attempt to explain what each property really means.  There is a brief
35       description of each Perl extension; see "Other Properties" in
36       perlunicode for more information on these.  There is some detail about
37       Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
38       to find out about the intricacies of the official Unicode properties,
39       refer to the Unicode standard.  A good starting place is
40       <http://www.unicode.org/reports/tr44/>.
41
42       Note that you can define your own properties; see "User-Defined
43       Character Properties" in perlunicode.
44

Properties accessible through "\p{}" and "\P{}"

46       The Perl regular expression "\p{}" and "\P{}" constructs give access to
47       most of the Unicode character properties.  The table below shows all
48       these constructs, both single and compound forms.
49
50       Compound forms consist of two components, separated by an equals sign
51       or a colon.  The first component is the property name, and the second
52       component is the particular value of the property to match against, for
53       example, "\p{Script_Extensions: Greek}" and
54       "\p{Script_Extensions=Greek}" both mean to match characters whose
55       Script_Extensions property value is Greek.  ("Script_Extensions" is an
56       improved version of the "Script" property.)
57
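The two compound spellings above can be checked against each other with a short script. This is only a sketch: the test characters (GREEK SMALL LETTER ALPHA and a Latin "a") and the variable names are assumptions made for the example.

```perl
use strict;
use warnings;

my $alpha = "\x{03B1}";    # GREEK SMALL LETTER ALPHA
my $latin = "a";

# The colon and the equals sign are interchangeable separators.
my $colon_form  = $alpha =~ /\p{Script_Extensions: Greek}/ ? 1 : 0;
my $equals_form = $alpha =~ /\p{Script_Extensions=Greek}/  ? 1 : 0;

# A Latin letter does not have the Greek property value.
my $latin_match = $latin =~ /\p{Script_Extensions=Greek}/ ? 1 : 0;

print "$colon_form $equals_form $latin_match\n";    # prints "1 1 0"
```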
58       Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
59       their equivalent compound forms.  The table shows these equivalences.
60       (In our example, "\p{Greek}" is just a shortcut for
61       "\p{Script_Extensions=Greek}".)  There are also a few Perl-defined
62       single forms that are not shortcuts for a compound form.  One such is
63       "\p{Word}".  These are also listed in the table.
64
65       In parsing these constructs, Perl always ignores upper/lower case
66       differences everywhere within the {braces}.  Thus "\p{Greek}" means the
67       same thing as "\p{greek}".  But note that changing the case of the "p"
68       or "P" before the left brace completely changes the meaning of the
69       construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
70       Casing in this document is for improved legibility.
71
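The following sketch illustrates the single form together with the case rules just described: case inside the braces is ignored, while the case of the leading "p" selects match or non-match. The test character and variable names are illustrative assumptions.

```perl
use strict;
use warnings;

my $alpha = "\x{03B1}";    # GREEK SMALL LETTER ALPHA

# Case inside the braces does not matter.
my $mixed_case = $alpha =~ /\p{Greek}/ ? 1 : 0;
my $lower_case = $alpha =~ /\p{greek}/ ? 1 : 0;
my $upper_case = $alpha =~ /\p{GREEK}/ ? 1 : 0;

# An upper case "P" inverts the test: match anything that is NOT Greek.
my $negated = $alpha =~ /\P{Greek}/ ? 1 : 0;

print "$mixed_case $lower_case $upper_case $negated\n";    # prints "1 1 1 0"
```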
72       Also, white space, hyphens, and underscores are normally ignored
73       everywhere between the {braces}, and hence can be freely added or
74       removed even if the "/x" modifier hasn't been specified on the regular
75       expression.  But in the table below a 'T' at the beginning of an entry
76       means that tighter (stricter) rules are used for that entry:
77
78           Single form ("\p{name}") tighter rules:
79               White space, hyphens, and underscores ARE significant except
80               for:
81
82               ·   white space adjacent to a non-word character
83
84               ·   underscores separating digits in numbers
85
86               That means, for example, that you can freely add or remove
87               white space adjacent to (but within) the braces without
88               affecting the meaning.
89
90           Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
91               The tighter rules given above for the single form apply to
92               everything to the right of the colon or equals; the looser
93               rules still apply to everything to the left.
94
95               That means, for example, that you can freely add or remove
96               white space adjacent to (but within) the braces and the colon
97               or equal sign.
98
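A small sketch of the looser rules follows, using several spellings of the same Script_Extensions test, plus two spellings of an Age entry that the table marks with 'T'. The particular spellings and variable names are assumptions picked for illustration; only forms that the rules above describe as equivalent are used.

```perl
use strict;
use warnings;

my $alpha = "\x{03B1}";    # GREEK SMALL LETTER ALPHA

# Under the looser rules, white space, hyphens, underscores, and case
# can be added or removed freely inside the braces.
my @loose_patterns = (
    qr/\p{Script_Extensions=Greek}/,
    qr/\p{ScriptExtensions=Greek}/,
    qr/\p{ script extensions = greek }/,
    qr/\p{Script-Extensions: Greek}/,
);
my $loose_hits = grep { $alpha =~ $_ } @loose_patterns;    # count of matches

# Age values follow the tighter rules; these two spellings both appear
# in the table.  U+20AC (EURO SIGN) is listed there under Age 2.1.
my $euro        = "\x{20AC}";
my $age_numeric = $euro =~ /\p{Age: 2.1}/  ? 1 : 0;
my $age_named   = $euro =~ /\p{Age=V2_1}/ ? 1 : 0;

print "$loose_hits $age_numeric $age_named\n";    # prints "4 1 1"
```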
99       Some properties are considered obsolete by Unicode, but are still
100       available.  There are several varieties of obsolescence:
101
102           Stabilized
103               A property may be stabilized.  Such a determination does not
104               indicate that the property should or should not be used;
105               instead it is a declaration that the property will not be
106               maintained nor extended for newly encoded characters.  Such
107               properties are marked with an 'S' in the table.
108
109           Deprecated
110               A property may be deprecated, perhaps because its original
111               intent has been replaced by another property, or because its
112               specification was somehow defective.  This means that its use
113               is strongly discouraged, so much so that a warning will be
114               issued if used, unless the regular expression is in the scope
115               of a "no warnings 'deprecated'" statement.  A 'D' flags each
116               such entry in the table, and the entry there for the longest,
117               most descriptive version of the property will give the reason
118               it is deprecated, and perhaps advice.  Perl may issue such a
119               warning, even for properties that aren't officially deprecated
120               by Unicode, when there used to be characters or code points
121               that were matched by them, but no longer.  This is to warn you
122               that your program may not work like it did on earlier Unicode
123               releases.
124
125               A deprecated property may be made unavailable in a future Perl
126               version, so it is best to move away from it.
127
128               A deprecated property may also be stabilized, but this fact is
129               not shown.
130
131           Obsolete
132               Properties marked with an 'O' in the table are considered
133               (plain) obsolete.  Generally this designation is given to
134               properties that Unicode once used for internal purposes (but
135               not any longer).
136
137           Discouraged
138               This is not actually a Unicode-specified obsolescence, but
139               applies to certain Perl extensions that are present for
140               backwards compatibility, but are discouraged from being used.
141               These are not obsolete, but their meanings are not stable.
142               Future Unicode versions could force any of these extensions to
143               be removed without warning, replaced by another property with
144               the same name that means something different.  An 'X' flags
145               each such entry in the table.  Use the equivalent shown
146               instead.
147
148               In particular, matches in the Block property have single forms
149               defined by Perl that begin with "In_", "Is_", or even with no
150               prefix at all.  Like all DISCOURAGED forms, these are not
151               stable.  For example, "\p{Block=Deseret}" can currently be
152               written as "\p{In_Deseret}", "\p{Is_Deseret}", or
153               "\p{Deseret}".  But, a new Unicode version may come along that
154               would force Perl to change the meaning of one or more of these,
155               and your program would no longer be correct.  Currently there
156               are no such conflicts with the form that begins "In_", but
157               there are many with the other two shortcuts, and Unicode
158               continues to define new properties that begin with "In", so
159               it's quite possible that a conflict will occur in the future.
160               The compound form is guaranteed to not become obsolete, and its
161               meaning is clearer anyway.  See "Blocks" in perlunicode for
162               more information about this.
163
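As a sketch of the advice above, the compound Block form can be compared with the discouraged "In_" shortcut. The test character (U+10400, the first code point of the Deseret block) and the variable names are illustrative assumptions. The "In_" form is shown only for comparison and is not a recommendation.

```perl
use strict;
use warnings;

my $deseret = "\x{10400}";    # DESERET CAPITAL LETTER LONG I

# Preferred: the compound form, whose meaning is stable.
my $compound_form = $deseret =~ /\p{Block=Deseret}/ ? 1 : 0;

# Discouraged: the Perl-defined "In_" shortcut for the same block.
my $in_form = $deseret =~ /\p{In_Deseret}/ ? 1 : 0;

# A code point outside the block fails the compound test.
my $outside = "A" =~ /\p{Block=Deseret}/ ? 1 : 0;

print "$compound_form $in_form $outside\n";    # prints "1 1 0"
```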
164       The table below has two columns.  The left column contains the "\p{}"
165       constructs to look up, possibly preceded by the flags mentioned above;
166       and the right column contains information about them, like a
167       description, or synonyms.  The table shows both the single and compound
168       forms for each property that has them.  If the left column is a short
169       name for a property, the right column will give its longer, more
170       descriptive name; and if the left column is the longest name, the right
171       column will show any equivalent shortest name, in both single and
172       compound forms if applicable.
173
174       If braces are not needed to specify a property (e.g., "\pL"), the left
175       column contains both forms, with and without braces.
176
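For a one-letter property name such as L (Letter), the forms with and without braces can be compared directly. A minimal sketch, with illustrative variable names:

```perl
use strict;
use warnings;

# "\pL" and "\p{L}" are the same test; "\PL" is its negation.
my $with_braces    = "a" =~ /\p{L}/ ? 1 : 0;
my $without_braces = "a" =~ /\pL/   ? 1 : 0;
my $negated_digit  = "1" =~ /\PL/   ? 1 : 0;    # a digit is not a letter

print "$with_braces $without_braces $negated_digit\n";    # prints "1 1 1"
```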
177       The right column will also caution you if a property means something
178       different than what might normally be expected.
179
180       All single forms are Perl extensions; a few compound forms are as well,
181       and are noted as such.
182
183       Numbers in (parentheses) indicate the total number of Unicode code
184       points matched by the property.  For the entries that give the longest,
185       most descriptive version of the property, the count is followed by a
186       list of some of the code points matched by it.  The list includes all
187       the matched characters in the 0-255 range, enclosed in the familiar
188       [brackets] the same as a regular expression bracketed character class.
189       Following that, the next few higher matching ranges are also given.  To
190       avoid visual ambiguity, the SPACE character is represented as "\x20".
191
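The counts in parentheses can be reproduced, at least approximately, from the inversion list that Unicode::UCD's prop_invlist() returns for a property. The sketch below is one way to do that, not how the table itself is generated. The helper name count_code_points is hypothetical, and the cap at U+10FFFF is an assumption made so that open-ended lists yield a finite number.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Hypothetical helper: count the Unicode code points a property matches.
# An inversion list alternates between the first code point of a matching
# range and the first code point past that range.
sub count_code_points {
    my ($property) = @_;
    my @invlist = prop_invlist($property);
    my $count   = 0;
    while (@invlist) {
        my $start = shift @invlist;
        # A missing end means the range runs to infinity; stop at U+10FFFF.
        my $end = @invlist ? shift @invlist : 0x110000;
        $count += $end - $start;
    }
    return $count;
}

my $ahex_count = count_code_points("ASCII_Hex_Digit");
print "$ahex_count\n";    # prints "22", the count shown for \p{AHex}
```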
192       For emphasis, those properties that match no code points at all are
193       listed as well in a separate section following the table.
194
195       Most properties match the same code points regardless of whether "/i"
196       case-insensitive matching is specified or not.  But a few properties
197       are affected.  These are shown with the notation "(/i= other_property)"
198       in the second column.  Under case-insensitive matching they match the
199       same code points as the property other_property.
200
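The effect of "/i" on an affected property can be sketched with "\p{Lu}" (Uppercase_Letter). That entry lies outside this excerpt, so treat the choice of property as an assumption; the behavior shown is that under "/i" it also matches a lower case letter, but still not a digit.

```perl
use strict;
use warnings;

# Without /i, a lower case letter is not an Uppercase_Letter.
my $plain = "a" =~ /\p{Lu}/ ? 1 : 0;

# With /i, the property matches cased letters of either case.
my $caseless = "a" =~ /\p{Lu}/i ? 1 : 0;

# Characters without case are unaffected by /i.
my $digit = "1" =~ /\p{Lu}/i ? 1 : 0;

print "$plain $caseless $digit\n";    # prints "0 1 0"
```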
201       There is no description given for most non-Perl defined properties (see
202       <http://www.unicode.org/reports/tr44/> for that).
203
204       For compactness, '*' is used as a wildcard instead of showing all
205       possible combinations.  For example, entries like:
206
207        \p{Gc: *}                                  \p{General_Category: *}
208
209       mean that 'Gc' is a synonym for 'General_Category', and anything that
210       is valid for the latter is also valid for the former.  Similarly,
211
212        \p{Is_*}                                   \p{*}
213
214       means that if and only if, for example, "\p{Foo}" exists, then
215       "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
216       thing.  And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
217       and "\p{IsFoo=Bar}".  "*" here is restricted to something not beginning
218       with an underscore.
219
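The two wildcard entries above can be illustrated with concrete names substituted for '*'. In this sketch, "Lu" and "Uppercase_Letter" stand in for a General_Category value and "Greek" stands in for "Foo"; these substitutions and the variable names are assumptions chosen for the example.

```perl
use strict;
use warnings;

# 'Gc' is a synonym for 'General_Category', for any valid value.
my $short_name = "A" =~ /\p{Gc=Lu}/                                ? 1 : 0;
my $long_name  = "A" =~ /\p{General_Category=Uppercase_Letter}/   ? 1 : 0;

# Because \p{Greek} exists, the "Is" spellings are valid as well.
my $alpha         = "\x{03B1}";    # GREEK SMALL LETTER ALPHA
my $bare          = $alpha =~ /\p{Greek}/    ? 1 : 0;
my $is_underscore = $alpha =~ /\p{Is_Greek}/ ? 1 : 0;
my $is_joined     = $alpha =~ /\p{IsGreek}/  ? 1 : 0;

print "$short_name $long_name $bare $is_underscore $is_joined\n";
# prints "1 1 1 1 1"
```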
220       Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
221       'Y'.  And 'No', 'F', and 'False' are all synonyms for 'N'.  The table
222       shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
223       for the other possibilities.  Note that not all properties which have
224       values 'Yes' and 'No' are binary, and they have all their values
225       spelled out without using this wild card, and a "NOT" clause in their
226       description that highlights their not being binary.  These also require
227       the compound form to match them, whereas true binary properties have
228       both single and compound forms available.
229
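The value synonyms for a binary property can be checked with Alphabetic, which appears in the table below. This is a minimal sketch; the test characters and variable names are illustrative.

```perl
use strict;
use warnings;

# Every spelling of the "yes" value, plus the single form, is equivalent.
my @yes_forms = (
    qr/\p{Alphabetic=Y}/,
    qr/\p{Alphabetic=Yes}/,
    qr/\p{Alphabetic=T}/,
    qr/\p{Alphabetic=True}/,
    qr/\p{Alpha}/,
);
my $yes_hits = grep { "a" =~ $_ } @yes_forms;    # count of matches

# Likewise for the "no" value and the negated single form.
my @no_forms = (
    qr/\p{Alphabetic=N}/,
    qr/\p{Alphabetic=No}/,
    qr/\p{Alphabetic=F}/,
    qr/\p{Alphabetic=False}/,
    qr/\P{Alpha}/,
);
my $no_hits = grep { "1" =~ $_ } @no_forms;

print "$yes_hits $no_hits\n";    # prints "5 5"
```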
230       Note that all non-essential underscores are removed in the display of
231       the short names below.
232
233       Legend summary:
234
235       * is a wild-card
236       (\d+) in the info column gives the number of Unicode code points
237       matched by this property.
238       D means this is deprecated.
239       O means this is obsolete.
240       S means this is stabilized.
241       T means tighter (stricter) name matching applies.
242       X means use of this form is discouraged, and may not be stable.
243
244              NAME                           INFO
245
246          \p{Adlam}               \p{Script_Extensions=Adlam} (Short:
247                                    \p{Adlm}; NOT \p{Block=Adlam}) (89)
248          \p{Adlm}                \p{Adlam} (= \p{Script_Extensions=Adlam})
249                                    (NOT \p{Block=Adlam}) (89)
250        X \p{Aegean_Numbers}      \p{Block=Aegean_Numbers} (64)
251        T \p{Age: 1.1}            \p{Age=V1_1} (33_979)
252          \p{Age: V1_1}           Code point's usage introduced in version
253                                    1.1 (33_979: U+0000..01F5, U+01FA..0217,
254                                    U+0250..02A8, U+02B0..02DE,
255                                    U+02E0..02E9, U+0300..0345 ...)
256        T \p{Age: 2.0}            \p{Age=V2_0} (144_521)
257          \p{Age: V2_0}           Code point's usage was introduced in
258                                    version 2.0; See also Property
259                                    'Present_In' (144_521: U+0591..05A1,
260                                    U+05A3..05AF, U+05C4, U+0F00..0F47,
261                                    U+0F49..0F69, U+0F71..0F8B ...)
262        T \p{Age: 2.1}            \p{Age=V2_1} (2)
263          \p{Age: V2_1}           Code point's usage was introduced in
264                                    version 2.1; See also Property
265                                    'Present_In' (2: U+20AC, U+FFFC)
266        T \p{Age: 3.0}            \p{Age=V3_0} (10_307)
267          \p{Age: V3_0}           Code point's usage was introduced in
268                                    version 3.0; See also Property
269                                    'Present_In' (10_307: U+01F6..01F9,
270                                    U+0218..021F, U+0222..0233,
271                                    U+02A9..02AD, U+02DF, U+02EA..02EE ...)
272        T \p{Age: 3.1}            \p{Age=V3_1} (44_978)
273          \p{Age: V3_1}           Code point's usage was introduced in
274                                    version 3.1; See also Property
275                                    'Present_In' (44_978: U+03F4..03F5,
276                                    U+FDD0..FDEF, U+10300..1031E,
277                                    U+10320..10323, U+10330..1034A,
278                                    U+10400..10425 ...)
279        T \p{Age: 3.2}            \p{Age=V3_2} (1016)
280          \p{Age: V3_2}           Code point's usage was introduced in
281                                    version 3.2; See also Property
282                                    'Present_In' (1016: U+0220, U+034F,
283                                    U+0363..036F, U+03D8..03D9, U+03F6,
284                                    U+048A..048B ...)
285        T \p{Age: 4.0}            \p{Age=V4_0} (1226)
286          \p{Age: V4_0}           Code point's usage was introduced in
287                                    version 4.0; See also Property
288                                    'Present_In' (1226: U+0221,
289                                    U+0234..0236, U+02AE..02AF,
290                                    U+02EF..02FF, U+0350..0357, U+035D..035F
291                                    ...)
292        T \p{Age: 4.1}            \p{Age=V4_1} (1273)
293          \p{Age: V4_1}           Code point's usage was introduced in
294                                    version 4.1; See also Property
295                                    'Present_In' (1273: U+0237..0241,
296                                    U+0358..035C, U+03FC..03FF,
297                                    U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
298        T \p{Age: 5.0}            \p{Age=V5_0} (1369)
299          \p{Age: V5_0}           Code point's usage was introduced in
300                                    version 5.0; See also Property
301                                    'Present_In' (1369: U+0242..024F,
302                                    U+037B..037D, U+04CF, U+04FA..04FF,
303                                    U+0510..0513, U+05BA ...)
304        T \p{Age: 5.1}            \p{Age=V5_1} (1624)
305          \p{Age: V5_1}           Code point's usage was introduced in
306                                    version 5.1; See also Property
307                                    'Present_In' (1624: U+0370..0373,
308                                    U+0376..0377, U+03CF, U+0487,
309                                    U+0514..0523, U+0606..060A ...)
310        T \p{Age: 5.2}            \p{Age=V5_2} (6648)
311          \p{Age: V5_2}           Code point's usage was introduced in
312                                    version 5.2; See also Property
313                                    'Present_In' (6648: U+0524..0525,
314                                    U+0800..082D, U+0830..083E, U+0900,
315                                    U+094E, U+0955 ...)
316        T \p{Age: 6.0}            \p{Age=V6_0} (2088)
317          \p{Age: V6_0}           Code point's usage was introduced in
318                                    version 6.0; See also Property
319                                    'Present_In' (2088: U+0526..0527,
320                                    U+0620, U+065F, U+0840..085B, U+085E,
321                                    U+093A..093B ...)
322        T \p{Age: 6.1}            \p{Age=V6_1} (732)
323          \p{Age: V6_1}           Code point's usage was introduced in
324                                    version 6.1; See also Property
325                                    'Present_In' (732: U+058F, U+0604,
326                                    U+08A0, U+08A2..08AC, U+08E4..08FE,
327                                    U+0AF0 ...)
328        T \p{Age: 6.2}            \p{Age=V6_2} (1)
329          \p{Age: V6_2}           Code point's usage was introduced in
330                                    version 6.2; See also Property
331                                    'Present_In' (1: U+20BA)
332        T \p{Age: 6.3}            \p{Age=V6_3} (5)
333          \p{Age: V6_3}           Code point's usage was introduced in
334                                    version 6.3; See also Property
335                                    'Present_In' (5: U+061C, U+2066..2069)
336        T \p{Age: 7.0}            \p{Age=V7_0} (2834)
337          \p{Age: V7_0}           Code point's usage was introduced in
338                                    version 7.0; See also Property
339                                    'Present_In' (2834: U+037F,
340                                    U+0528..052F, U+058D..058E, U+0605,
341                                    U+08A1, U+08AD..08B2 ...)
342        T \p{Age: 8.0}            \p{Age=V8_0} (7716)
343          \p{Age: V8_0}           Code point's usage was introduced in
344                                    version 8.0; See also Property
345                                    'Present_In' (7716: U+08B3..08B4,
346                                    U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
347                                    ...)
348        T \p{Age: 9.0}            \p{Age=V9_0} (7500)
349          \p{Age: V9_0}           Code point's usage was introduced in
350                                    version 9.0; See also Property
351                                    'Present_In' (7500: U+08B6..08BD,
352                                    U+08D4..08E2, U+0C80, U+0D4F,
353                                    U+0D54..0D56, U+0D58..0D5E ...)
354        T \p{Age: 10.0}           \p{Age=V10_0} (8518)
355          \p{Age: V10_0}          Code point's usage was introduced in
356                                    version 10.0; See also Property
357                                    'Present_In' (8518: U+0860..086A,
358                                    U+09FC..09FD, U+0AFA..0AFF, U+0D00,
359                                    U+0D3B..0D3C, U+1CF7 ...)
360        T \p{Age: 11.0}           \p{Age=V11_0} (684)
361          \p{Age: V11_0}          Code point's usage was introduced in
362                                    version 11.0; See also Property
363                                    'Present_In' (684: U+0560, U+0588,
364                                    U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
365        T \p{Age: 12.0}           \p{Age=V12_0} (554)
366          \p{Age: V12_0}          Code point's usage was introduced in
367                                    version 12.0; See also Property
368                                    'Present_In' (554: U+0C77, U+0E86,
369                                    U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
370        T \p{Age: 12.1}           \p{Age=V12_1} (1)
371          \p{Age: V12_1}          Code point's usage was introduced in
372                                    version 12.1; See also Property
373                                    'Present_In' (1: U+32FF)
374          \p{Age: NA}             \p{Age=Unassigned} (836_536 plus all
375                                    above-Unicode code points)
376          \p{Age: Unassigned}     Code point's usage has not been assigned
377                                    in any Unicode release thus far. (Short:
378                                    \p{Age=NA}) (836_536 plus all above-
379                                    Unicode code points: U+0378..0379,
380                                    U+0380..0383, U+038B, U+038D, U+03A2,
381                                    U+0530 ...)
382          \p{Aghb}                \p{Caucasian_Albanian} (=
383                                    \p{Script_Extensions=
384                                    Caucasian_Albanian}) (NOT \p{Block=
385                                    Caucasian_Albanian}) (53)
386          \p{AHex}                \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
387                                    (22)
388          \p{AHex: *}             \p{ASCII_Hex_Digit: *}
389          \p{Ahom}                \p{Script_Extensions=Ahom} (NOT \p{Block=
390                                    Ahom}) (58)
391        X \p{Alchemical}          \p{Alchemical_Symbols} (= \p{Block=
392                                    Alchemical_Symbols}) (128)
393        X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short:
394                                    \p{InAlchemical}) (128)
395          \p{All}                 All code points, including those above
396                                    Unicode.  Same as qr/./s (1_114_112 plus
397                                    all above-Unicode code points:
398                                    U+0000..infinity)
399          \p{Alnum}               \p{XPosixAlnum} (127_886)
400          \p{Alpha}               \p{XPosixAlpha} (= \p{Alphabetic=Y})
401                                    (127_256)
402          \p{Alpha: *}            \p{Alphabetic: *}
403          \p{Alphabetic}          \p{XPosixAlpha} (= \p{Alphabetic=Y})
404                                    (127_256)
405          \p{Alphabetic: N*}      (Short: \p{Alpha=N}, \P{Alpha}) (986_856
406                                    plus all above-Unicode code points:
407                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
408                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
409                                    \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
410                                    U+02C2..02C5, U+02D2..02DF,
411                                    U+02E5..02EB, U+02ED, U+02EF..0344 ...)
412          \p{Alphabetic: Y*}      (Short: \p{Alpha=Y}, \p{Alpha}) (127_256:
413                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
414                                    \xf6\xf8-\xff], U+0100..02C1,
415                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
416                                    U+02EE ...)
417        X \p{Alphabetic_PF}       \p{Alphabetic_Presentation_Forms} (=
418                                    \p{Block=Alphabetic_Presentation_Forms})
419                                    (80)
420        X \p{Alphabetic_Presentation_Forms} \p{Block=
421                                    Alphabetic_Presentation_Forms} (Short:
422                                    \p{InAlphabeticPF}) (80)
423          \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
424                                    Anatolian_Hieroglyphs} (Short: \p{Hluw};
425                                    NOT \p{Block=Anatolian_Hieroglyphs})
426                                    (583)
427        X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
428                                    \p{Block=
429                                    Ancient_Greek_Musical_Notation}) (80)
430        X \p{Ancient_Greek_Musical_Notation} \p{Block=
431                                    Ancient_Greek_Musical_Notation} (Short:
432                                    \p{InAncientGreekMusic}) (80)
433        X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
434        X \p{Ancient_Symbols}     \p{Block=Ancient_Symbols} (64)
435          \p{Any}                 All Unicode code points (1_114_112:
436                                    U+0000..10FFFF)
437          \p{Arab}                \p{Arabic} (= \p{Script_Extensions=
438                                    Arabic}) (NOT \p{Block=Arabic}) (1325)
439          \p{Arabic}              \p{Script_Extensions=Arabic} (Short:
440                                    \p{Arab}; NOT \p{Block=Arabic}) (1325)
441        X \p{Arabic_Ext_A}        \p{Arabic_Extended_A} (= \p{Block=
442                                    Arabic_Extended_A}) (96)
443        X \p{Arabic_Extended_A}   \p{Block=Arabic_Extended_A} (Short:
444                                    \p{InArabicExtA}) (96)
445        X \p{Arabic_Math}         \p{Arabic_Mathematical_Alphabetic_Symbols}
446                                    (= \p{Block=
447                                    Arabic_Mathematical_Alphabetic_Symbols})
448                                    (256)
449        X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
450                                    Arabic_Mathematical_Alphabetic_Symbols}
451                                    (Short: \p{InArabicMath}) (256)
452        X \p{Arabic_PF_A}         \p{Arabic_Presentation_Forms_A} (=
453                                    \p{Block=Arabic_Presentation_Forms_A})
454                                    (688)
455        X \p{Arabic_PF_B}         \p{Arabic_Presentation_Forms_B} (=
456                                    \p{Block=Arabic_Presentation_Forms_B})
457                                    (144)
458        X \p{Arabic_Presentation_Forms_A} \p{Block=
459                                    Arabic_Presentation_Forms_A} (Short:
460                                    \p{InArabicPFA}) (688)
461        X \p{Arabic_Presentation_Forms_B} \p{Block=
462                                    Arabic_Presentation_Forms_B} (Short:
463                                    \p{InArabicPFB}) (144)
464        X \p{Arabic_Sup}          \p{Arabic_Supplement} (= \p{Block=
465                                    Arabic_Supplement}) (48)
466        X \p{Arabic_Supplement}   \p{Block=Arabic_Supplement} (Short:
467                                    \p{InArabicSup}) (48)
468          \p{Armenian}            \p{Script_Extensions=Armenian} (Short:
469                                    \p{Armn}; NOT \p{Block=Armenian}) (96)
470          \p{Armi}                \p{Imperial_Aramaic} (=
471                                    \p{Script_Extensions=Imperial_Aramaic})
472                                    (NOT \p{Block=Imperial_Aramaic}) (31)
473          \p{Armn}                \p{Armenian} (= \p{Script_Extensions=
474                                    Armenian}) (NOT \p{Block=Armenian}) (96)
475        X \p{Arrows}              \p{Block=Arrows} (112)
476          \p{ASCII}               \p{Block=Basic_Latin} (128)
477          \p{ASCII_Hex_Digit}     \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
478                                    (22)
479          \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
480                                    plus all above-Unicode code points:
481                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
                                    >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
                                    U+0100..infinity)
          \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
                                    Fa-f])
          \p{Assigned}            All assigned code points (277_510:
                                    U+0000..0377, U+037A..037F,
                                    U+0384..038A, U+038C, U+038E..03A1,
                                    U+03A3..052F ...)
          \p{Avestan}             \p{Script_Extensions=Avestan} (Short:
                                    \p{Avst}; NOT \p{Block=Avestan}) (61)
          \p{Avst}                \p{Avestan} (= \p{Script_Extensions=
                                    Avestan}) (NOT \p{Block=Avestan}) (61)
          \p{Bali}                \p{Balinese} (= \p{Script_Extensions=
                                    Balinese}) (NOT \p{Block=Balinese}) (121)
          \p{Balinese}            \p{Script_Extensions=Balinese} (Short:
                                    \p{Bali}; NOT \p{Block=Balinese}) (121)
          \p{Bamu}                \p{Bamum} (= \p{Script_Extensions=Bamum})
                                    (NOT \p{Block=Bamum}) (657)
          \p{Bamum}               \p{Script_Extensions=Bamum} (Short:
                                    \p{Bamu}; NOT \p{Block=Bamum}) (657)
        X \p{Bamum_Sup}           \p{Bamum_Supplement} (= \p{Block=
                                    Bamum_Supplement}) (576)
        X \p{Bamum_Supplement}    \p{Block=Bamum_Supplement} (Short:
                                    \p{InBamumSup}) (576)
        X \p{Basic_Latin}         \p{ASCII} (= \p{Block=Basic_Latin}) (128)
          \p{Bass}                \p{Bassa_Vah} (= \p{Script_Extensions=
                                    Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
                                    (36)
          \p{Bassa_Vah}           \p{Script_Extensions=Bassa_Vah} (Short:
                                    \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
          \p{Batak}               \p{Script_Extensions=Batak} (Short:
                                    \p{Batk}; NOT \p{Block=Batak}) (56)
          \p{Batk}                \p{Batak} (= \p{Script_Extensions=Batak})
                                    (NOT \p{Block=Batak}) (56)
          \p{Bc: *}               \p{Bidi_Class: *}
          \p{Beng}                \p{Bengali} (= \p{Script_Extensions=
                                    Bengali}) (NOT \p{Block=Bengali}) (113)
          \p{Bengali}             \p{Script_Extensions=Bengali} (Short:
                                    \p{Beng}; NOT \p{Block=Bengali}) (113)
          \p{Bhaiksuki}           \p{Script_Extensions=Bhaiksuki} (Short:
                                    \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
          \p{Bhks}                \p{Bhaiksuki} (= \p{Script_Extensions=
                                    Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
                                    (97)
          \p{Bidi_C}              \p{Bidi_Control} (= \p{Bidi_Control=Y})
                                    (12)
          \p{Bidi_C: *}           \p{Bidi_Control: *}
          \p{Bidi_Class: AL}      \p{Bidi_Class=Arabic_Letter} (1698)
          \p{Bidi_Class: AN}      \p{Bidi_Class=Arabic_Number} (61)
          \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1698: U+0608,
                                    U+060B, U+060D, U+061B..064A,
                                    U+066D..066F, U+0671..06D5 ...)
          \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (61:
                                    U+0600..0605, U+0660..0669,
                                    U+066B..066C, U+06DD, U+08E2,
                                    U+10D30..10D39 ...)
          \p{Bidi_Class: B}       \p{Bidi_Class=Paragraph_Separator} (7)
          \p{Bidi_Class: BN}      \p{Bidi_Class=Boundary_Neutral} (4016)
          \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016:
                                    [^\t\n\cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-
                                    \xff], U+180E, U+200B..200D,
                                    U+2060..2065, U+206A..206F, U+FDD0..FDEF
                                    ...)
          \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15:
                                    [,.\/:\xa0], U+060C, U+202F, U+2044,
                                    U+FE50, U+FE52 ...)
          \p{Bidi_Class: CS}      \p{Bidi_Class=Common_Separator} (15)
          \p{Bidi_Class: EN}      \p{Bidi_Class=European_Number} (158)
          \p{Bidi_Class: ES}      \p{Bidi_Class=European_Separator} (12)
          \p{Bidi_Class: ET}      \p{Bidi_Class=European_Terminator} (92)
          \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (158: [0-9\xb2-
                                    \xb3\xb9], U+06F0..06F9, U+2070,
                                    U+2074..2079, U+2080..2089, U+2488..249B
                                    ...)
          \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
                                    U+207A..207B, U+208A..208B, U+2212,
                                    U+FB29, U+FE62..FE63 ...)
          \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92:
                                    [#\$\%\xa2-\xa5\xb0-\xb1], U+058F,
                                    U+0609..060A, U+066A, U+09F2..09F3,
                                    U+09FB ...)
          \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
                                    U+2068)
          \p{Bidi_Class: FSI}     \p{Bidi_Class=First_Strong_Isolate} (1)
          \p{Bidi_Class: L}       \p{Bidi_Class=Left_To_Right} (1_096_767
                                    plus all above-Unicode code points)
          \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_767 plus
                                    all above-Unicode code points: [A-Za-
                                    z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                                    \xff], U+0100..02B8, U+02BB..02C1,
                                    U+02D0..02D1, U+02E0..02E4, U+02EE ...)
          \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
                                    U+202A)
          \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
                                    U+2066)
          \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
                                    U+202D)
          \p{Bidi_Class: LRE}     \p{Bidi_Class=Left_To_Right_Embedding} (1)
          \p{Bidi_Class: LRI}     \p{Bidi_Class=Left_To_Right_Isolate} (1)
          \p{Bidi_Class: LRO}     \p{Bidi_Class=Left_To_Right_Override} (1)
          \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1834:
                                    U+0300..036F, U+0483..0489,
                                    U+0591..05BD, U+05BF, U+05C1..05C2,
                                    U+05C4..05C5 ...)
          \p{Bidi_Class: NSM}     \p{Bidi_Class=Nonspacing_Mark} (1834)
          \p{Bidi_Class: ON}      \p{Bidi_Class=Other_Neutral} (5658)
          \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5658:
                                    [!\"&\'\(\)*;<=
                                    >?\@\[\\\]\^_`\{\|\}~\xa1\xa6-\xa9\xab-
                                    \xac\xae-\xaf\xb4\xb6-\xb8\xbb-
                                    \xbf\xd7\xf7], U+02B9..02BA,
                                    U+02C2..02CF, U+02D2..02DF,
                                    U+02E5..02ED, U+02EF..02FF ...)
          \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7:
                                    [\n\r\x1c-\x1e\x85], U+2029)
          \p{Bidi_Class: PDF}     \p{Bidi_Class=Pop_Directional_Format} (1)
          \p{Bidi_Class: PDI}     \p{Bidi_Class=Pop_Directional_Isolate} (1)
          \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
                                    U+202C)
          \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
                                    U+2069)
          \p{Bidi_Class: R}       \p{Bidi_Class=Right_To_Left} (3765)
          \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3765: U+0590,
                                    U+05BE, U+05C0, U+05C3, U+05C6,
                                    U+05C8..05FF ...)
          \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
                                    U+202B)
          \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
                                    U+2067)
          \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
                                    U+202E)
          \p{Bidi_Class: RLE}     \p{Bidi_Class=Right_To_Left_Embedding} (1)
          \p{Bidi_Class: RLI}     \p{Bidi_Class=Right_To_Left_Isolate} (1)
          \p{Bidi_Class: RLO}     \p{Bidi_Class=Right_To_Left_Override} (1)
          \p{Bidi_Class: S}       \p{Bidi_Class=Segment_Separator} (3)
          \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3:
                                    [\t\cK\x1f])
          \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
                                    U+1680, U+2000..200A, U+2028, U+205F,
                                    U+3000)
          \p{Bidi_Class: WS}      \p{Bidi_Class=White_Space} (17)
          \p{Bidi_Control}        \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
          \p{Bidi_Control: N*}    (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
                                    plus all above-Unicode code points:
                                    U+0000..061B, U+061D..200D,
                                    U+2010..2029, U+202F..2065,
                                    U+206A..infinity)
          \p{Bidi_Control: Y*}    (Short: \p{BidiC=Y}, \p{BidiC}) (12:
                                    U+061C, U+200E..200F, U+202A..202E,
                                    U+2066..2069)
          \p{Bidi_M}              \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
                                    (545)
          \p{Bidi_M: *}           \p{Bidi_Mirrored: *}
          \p{Bidi_Mirrored}       \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
                                    (545)
          \p{Bidi_Mirrored: N*}   (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
                                    Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
                                    \xff], U+0100..0F39, U+0F3E..169A,
                                    U+169D..2038, U+203B..2044, U+2047..207C
                                    ...)
          \p{Bidi_Mirrored: Y*}   (Short: \p{BidiM=Y}, \p{BidiM}) (545:
                                    [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
                                    U+169B..169C, U+2039..203A,
                                    U+2045..2046, U+207D..207E ...)
          \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
                                    (60)
          \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60:
                                    [\)\]\}], U+0F3B, U+0F3D, U+169C,
                                    U+2046, U+207E ...)
          \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
                                    (1_113_992 plus all above-Unicode code
                                    points)
          \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
                                    plus all above-Unicode code points:
                                    [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=
                                    >?\@A-Z\\\^_`a-z\|~\x7f-\xff],
                                    U+0100..0F39, U+0F3E..169A,
                                    U+169D..2044, U+2047..207C, U+207F..208C
                                    ...)
          \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
                                    (60)
          \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60:
                                    [\(\[\{], U+0F3A, U+0F3C, U+169B,
                                    U+2045, U+207D ...)
          \p{Blank}               \p{XPosixBlank} (18)
          \p{Blk: *}              \p{Block: *}
          \p{Block: Adlam}        (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
                                    U+1E900..1E95F)
          \p{Block: Aegean_Numbers} (64: U+10100..1013F)
          \p{Block: Ahom}         (NOT \p{Ahom} NOR \p{Is_Ahom}) (64:
                                    U+11700..1173F)
          \p{Block: Alchemical}   \p{Block=Alchemical_Symbols} (128)
          \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
                                    U+1F700..1F77F)
          \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
                                    (80)
          \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
                                    AlphabeticPF}) (80: U+FB00..FB4F)
          \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
                                    NOR \p{Is_Anatolian_Hieroglyphs}) (640:
                                    U+14400..1467F)
          \p{Block: Ancient_Greek_Music} \p{Block=
                                    Ancient_Greek_Musical_Notation} (80)
          \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
                                    AncientGreekMusic}) (80: U+1D200..1D24F)
          \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
          \p{Block: Ancient_Symbols} (64: U+10190..101CF)
          \p{Block: Arabic}       (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
                                    U+0600..06FF)
          \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
          \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
                                    U+08A0..08FF)
          \p{Block: Arabic_Math}  \p{Block=
                                    Arabic_Mathematical_Alphabetic_Symbols}
                                    (256)
          \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
                                    ArabicMath}) (256: U+1EE00..1EEFF)
          \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
          \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
          \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
                                    (688: U+FB50..FDFF)
          \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
                                    (144: U+FE70..FEFF)
          \p{Block: Arabic_Sup}   \p{Block=Arabic_Supplement} (48)
          \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
                                    U+0750..077F)
          \p{Block: Armenian}     (NOT \p{Armenian} NOR \p{Is_Armenian})
                                    (96: U+0530..058F)
          \p{Block: Arrows}       (112: U+2190..21FF)
          \p{Block: ASCII}        \p{Block=Basic_Latin} (128)
          \p{Block: Avestan}      (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
                                    U+10B00..10B3F)
          \p{Block: Balinese}     (NOT \p{Balinese} NOR \p{Is_Balinese})
                                    (128: U+1B00..1B7F)
          \p{Block: Bamum}        (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
                                    U+A6A0..A6FF)
          \p{Block: Bamum_Sup}    \p{Block=Bamum_Supplement} (576)
          \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
                                    U+16800..16A3F)
          \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
          \p{Block: Bassa_Vah}    (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
                                    (48: U+16AD0..16AFF)
          \p{Block: Batak}        (NOT \p{Batak} NOR \p{Is_Batak}) (64:
                                    U+1BC0..1BFF)
          \p{Block: Bengali}      (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
                                    U+0980..09FF)
          \p{Block: Bhaiksuki}    (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
                                    (112: U+11C00..11C6F)
          \p{Block: Block_Elements} (32: U+2580..259F)
          \p{Block: Bopomofo}     (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
                                    (48: U+3100..312F)
          \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
          \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
                                    U+31A0..31BF)
          \p{Block: Box_Drawing}  (128: U+2500..257F)
          \p{Block: Brahmi}       (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
                                    U+11000..1107F)
          \p{Block: Braille}      \p{Block=Braille_Patterns} (256)
          \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
                                    U+2800..28FF)
          \p{Block: Buginese}     (NOT \p{Buginese} NOR \p{Is_Buginese})
                                    (32: U+1A00..1A1F)
          \p{Block: Buhid}        (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
                                    U+1740..175F)
          \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
                                    (256)
          \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
                                    ByzantineMusic}) (256: U+1D000..1D0FF)
          \p{Block: Canadian_Syllabics} \p{Block=
                                    Unified_Canadian_Aboriginal_Syllabics}
                                    (640)
          \p{Block: Carian}       (NOT \p{Carian} NOR \p{Is_Carian}) (64:
                                    U+102A0..102DF)
          \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
                                    \p{Is_Caucasian_Albanian}) (64:
                                    U+10530..1056F)
          \p{Block: Chakma}       (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
                                    U+11100..1114F)
          \p{Block: Cham}         (NOT \p{Cham} NOR \p{Is_Cham}) (96:
                                    U+AA00..AA5F)
          \p{Block: Cherokee}     (NOT \p{Cherokee} NOR \p{Is_Cherokee})
                                    (96: U+13A0..13FF)
          \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
          \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
                                    U+AB70..ABBF)
          \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
          \p{Block: CJK}          \p{Block=CJK_Unified_Ideographs} (20_992)
          \p{Block: CJK_Compat}   \p{Block=CJK_Compatibility} (256)
          \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
          \p{Block: CJK_Compat_Ideographs} \p{Block=
                                    CJK_Compatibility_Ideographs} (512)
          \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
                                    CJK_Compatibility_Ideographs_Supplement}
                                    (544)
          \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
                                    U+3300..33FF)
          \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
                                    (32: U+FE30..FE4F)
          \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
                                    CJKCompatIdeographs}) (512: U+F900..FAFF)
          \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
                                    CJKCompatIdeographsSup}) (544:
                                    U+2F800..2FA1F)
          \p{Block: CJK_Ext_A}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_A}
                                    (6592)
          \p{Block: CJK_Ext_B}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_B}
                                    (42_720)
          \p{Block: CJK_Ext_C}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_C}
                                    (4160)
          \p{Block: CJK_Ext_D}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_D} (224)
          \p{Block: CJK_Ext_E}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_E}
                                    (5776)
          \p{Block: CJK_Ext_F}    \p{Block=
                                    CJK_Unified_Ideographs_Extension_F}
                                    (7488)
          \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
          \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
                                    (128: U+2E80..2EFF)
          \p{Block: CJK_Strokes}  (48: U+31C0..31EF)
          \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
          \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
                                    (64: U+3000..303F)
          \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
                                    U+4E00..9FFF)
          \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
                                    CJKExtA}) (6592: U+3400..4DBF)
          \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
                                    CJKExtB}) (42_720: U+20000..2A6DF)
          \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
                                    CJKExtC}) (4160: U+2A700..2B73F)
          \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
                                    CJKExtD}) (224: U+2B740..2B81F)
          \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
                                    CJKExtE}) (5776: U+2B820..2CEAF)
          \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
                                    CJKExtF}) (7488: U+2CEB0..2EBEF)
          \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
                                    Diacriticals}) (112: U+0300..036F)
          \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
                                    DiacriticalsExt}) (80: U+1AB0..1AFF)
          \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
                                    DiacriticalsForSymbols}) (48:
                                    U+20D0..20FF)
          \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
                                    DiacriticalsSup}) (64: U+1DC0..1DFF)
          \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
                                    U+FE20..FE2F)
          \p{Block: Combining_Marks_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (48)
          \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
                                    IndicNumberForms}) (16: U+A830..A83F)
          \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
          \p{Block: Control_Pictures} (64: U+2400..243F)
          \p{Block: Coptic}       (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
                                    U+2C80..2CFF)
          \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
          \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
          \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
                                    U+1D360..1D37F)
          \p{Block: Cuneiform}    (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
                                    (1024: U+12000..123FF)
          \p{Block: Cuneiform_Numbers} \p{Block=
                                    Cuneiform_Numbers_And_Punctuation} (128)
          \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
                                    CuneiformNumbers}) (128: U+12400..1247F)
          \p{Block: Currency_Symbols} (48: U+20A0..20CF)
          \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
          \p{Block: Cyrillic}     (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
                                    (256: U+0400..04FF)
          \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
          \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
          \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
          \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
                                    U+2DE0..2DFF)
          \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
                                    U+A640..A69F)
          \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
                                    U+1C80..1C8F)
          \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
          \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
                                    U+0500..052F)
          \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
                                    (48)
          \p{Block: Deseret}      (80: U+10400..1044F)
          \p{Block: Devanagari}   (NOT \p{Devanagari} NOR \p{Is_Devanagari})
                                    (128: U+0900..097F)
          \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
          \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
                                    U+A8E0..A8FF)
          \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
          \p{Block: Diacriticals_Ext} \p{Block=
                                    Combining_Diacritical_Marks_Extended}
                                    (80)
          \p{Block: Diacriticals_For_Symbols} \p{Block=
                                    Combining_Diacritical_Marks_For_Symbols}
                                    (48)
          \p{Block: Diacriticals_Sup} \p{Block=
                                    Combining_Diacritical_Marks_Supplement}
                                    (64)
          \p{Block: Dingbats}     (192: U+2700..27BF)
          \p{Block: Dogra}        (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
                                    U+11800..1184F)
          \p{Block: Domino}       \p{Block=Domino_Tiles} (112)
          \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
                                    U+1F030..1F09F)
          \p{Block: Duployan}     (NOT \p{Duployan} NOR \p{Is_Duployan})
                                    (160: U+1BC00..1BC9F)
          \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
          \p{Block: Egyptian_Hieroglyph_Format_Controls} (16: U+13430..1343F)
          \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
                                    \p{Is_Egyptian_Hieroglyphs}) (1072:
                                    U+13000..1342F)
          \p{Block: Elbasan}      (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
                                    U+10500..1052F)
          \p{Block: Elymaic}      (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
                                    U+10FE0..10FFF)
          \p{Block: Emoticons}    (80: U+1F600..1F64F)
          \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
          \p{Block: Enclosed_Alphanum_Sup} \p{Block=
                                    Enclosed_Alphanumeric_Supplement} (256)
          \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
                                    EnclosedAlphanumSup}) (256:
                                    U+1F100..1F1FF)
          \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
                                    EnclosedAlphanum}) (160: U+2460..24FF)
          \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
                                    (256)
          \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
                                    EnclosedCJK}) (256: U+3200..32FF)
          \p{Block: Enclosed_Ideographic_Sup} \p{Block=
                                    Enclosed_Ideographic_Supplement} (256)
          \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
                                    EnclosedIdeographicSup}) (256:
                                    U+1F200..1F2FF)
          \p{Block: Ethiopic}     (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
                                    (384: U+1200..137F)
          \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
          \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
          \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
                                    U+2D80..2DDF)
          \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
                                    U+AB00..AB2F)
          \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
          \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
                                    U+1380..139F)
          \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
                                    \p{Punct} NOR \p{Is_Punctuation}) (112:
                                    U+2000..206F)
          \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
          \p{Block: Geometric_Shapes_Ext} \p{Block=
                                    Geometric_Shapes_Extended} (128)
          \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
                                    GeometricShapesExt}) (128:
                                    U+1F780..1F7FF)
          \p{Block: Georgian}     (NOT \p{Georgian} NOR \p{Is_Georgian})
                                    (96: U+10A0..10FF)
          \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
          \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
                                    U+1C90..1CBF)
          \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
          \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
                                    U+2D00..2D2F)
          \p{Block: Glagolitic}   (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
                                    (96: U+2C00..2C5F)
          \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
          \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
                                    (48: U+1E000..1E02F)
          \p{Block: Gothic}       (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
                                    U+10330..1034F)
          \p{Block: Grantha}      (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
                                    U+11300..1137F)
          \p{Block: Greek}        \p{Block=Greek_And_Coptic} (NOT \p{Greek}
                                    NOR \p{Is_Greek}) (144)
          \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
                                    NOR \p{Is_Greek}) (144: U+0370..03FF)
          \p{Block: Greek_Ext}    \p{Block=Greek_Extended} (256)
          \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
                                    U+1F00..1FFF)
          \p{Block: Gujarati}     (NOT \p{Gujarati} NOR \p{Is_Gujarati})
                                    (128: U+0A80..0AFF)
          \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
                                    \p{Is_Gunjala_Gondi}) (80:
                                    U+11D60..11DAF)
          \p{Block: Gurmukhi}     (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
                                    (128: U+0A00..0A7F)
          \p{Block: Half_And_Full_Forms} \p{Block=
                                    Halfwidth_And_Fullwidth_Forms} (240)
          \p{Block: Half_Marks}   \p{Block=Combining_Half_Marks} (16)
          \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
                                    HalfAndFullForms}) (240: U+FF00..FFEF)
          \p{Block: Hangul}       \p{Block=Hangul_Syllables} (NOT \p{Hangul}
                                    NOR \p{Is_Hangul}) (11_184)
          \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
                                    (96: U+3130..318F)
          \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
          \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
                                    U+A960..A97F)
          \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
                                    U+D7B0..D7FF)
          \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
                                    NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
          \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
                                    \p{Is_Hanifi_Rohingya}) (64:
                                    U+10D00..10D3F)
          \p{Block: Hanunoo}      (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
                                    U+1720..173F)
          \p{Block: Hatran}       (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
                                    U+108E0..108FF)
          \p{Block: Hebrew}       (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
                                    U+0590..05FF)
          \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
                                    HighPUSurrogates}) (128: U+DB80..DBFF)
          \p{Block: High_PU_Surrogates} \p{Block=
                                    High_Private_Use_Surrogates} (128)
          \p{Block: High_Surrogates} (896: U+D800..DB7F)
          \p{Block: Hiragana}     (NOT \p{Hiragana} NOR \p{Is_Hiragana})
                                    (96: U+3040..309F)
          \p{Block: IDC}          \p{Block=
                                    Ideographic_Description_Characters} (NOT
                                    \p{ID_Continue} NOR \p{Is_IDC}) (16)
          \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
                                    NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
                                    U+2FF0..2FFF)
          \p{Block: Ideographic_Symbols} \p{Block=
                                    Ideographic_Symbols_And_Punctuation} (32)
1015          \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
1016                                    IdeographicSymbols}) (32: U+16FE0..16FFF)
1017          \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
1018                                    \p{Is_Imperial_Aramaic}) (32:
1019                                    U+10840..1085F)
1020          \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1021                                    (16)
1022          \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
1023          \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
1024                                    NOR \p{Is_Inscriptional_Pahlavi}) (32:
1025                                    U+10B60..10B7F)
1026          \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
1027                                    NOR \p{Is_Inscriptional_Parthian}) (32:
1028                                    U+10B40..10B5F)
1029          \p{Block: IPA_Ext}      \p{Block=IPA_Extensions} (96)
1030          \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
1031                                    U+0250..02AF)
1032          \p{Block: Jamo}         \p{Block=Hangul_Jamo} (256)
1033          \p{Block: Jamo_Ext_A}   \p{Block=Hangul_Jamo_Extended_A} (32)
1034          \p{Block: Jamo_Ext_B}   \p{Block=Hangul_Jamo_Extended_B} (80)
1035          \p{Block: Javanese}     (NOT \p{Javanese} NOR \p{Is_Javanese})
1036                                    (96: U+A980..A9DF)
1037          \p{Block: Kaithi}       (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
1038                                    U+11080..110CF)
1039          \p{Block: Kana_Ext_A}   \p{Block=Kana_Extended_A} (48)
1040          \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
1041                                    U+1B100..1B12F)
1042          \p{Block: Kana_Sup}     \p{Block=Kana_Supplement} (256)
1043          \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
1044                                    U+1B000..1B0FF)
1045          \p{Block: Kanbun}       (16: U+3190..319F)
1046          \p{Block: Kangxi}       \p{Block=Kangxi_Radicals} (224)
1047          \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
1048                                    U+2F00..2FDF)
1049          \p{Block: Kannada}      (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
1050                                    U+0C80..0CFF)
1051          \p{Block: Katakana}     (NOT \p{Katakana} NOR \p{Is_Katakana})
1052                                    (96: U+30A0..30FF)
1053          \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
1054          \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
1055                                    KatakanaExt}) (16: U+31F0..31FF)
1056          \p{Block: Kayah_Li}     (48: U+A900..A92F)
1057          \p{Block: Kharoshthi}   (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
1058                                    (96: U+10A00..10A5F)
1059          \p{Block: Khmer}        (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
1060                                    U+1780..17FF)
1061          \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
1062          \p{Block: Khojki}       (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
1063                                    U+11200..1124F)
1064          \p{Block: Khudawadi}    (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
1065                                    (80: U+112B0..112FF)
1066          \p{Block: Lao}          (NOT \p{Lao} NOR \p{Is_Lao}) (128:
1067                                    U+0E80..0EFF)
1068          \p{Block: Latin_1}      \p{Block=Latin_1_Supplement} (128)
1069          \p{Block: Latin_1_Sup}  \p{Block=Latin_1_Supplement} (128)
1070          \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
1071                                    \xff])
1072          \p{Block: Latin_Ext_A}  \p{Block=Latin_Extended_A} (128)
1073          \p{Block: Latin_Ext_Additional} \p{Block=
1074                                    Latin_Extended_Additional} (256)
1075          \p{Block: Latin_Ext_B}  \p{Block=Latin_Extended_B} (208)
1076          \p{Block: Latin_Ext_C}  \p{Block=Latin_Extended_C} (32)
1077          \p{Block: Latin_Ext_D}  \p{Block=Latin_Extended_D} (224)
1078          \p{Block: Latin_Ext_E}  \p{Block=Latin_Extended_E} (64)
1079          \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
1080                                    U+0100..017F)
1081          \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
1082                                    LatinExtAdditional}) (256: U+1E00..1EFF)
1083          \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
1084                                    U+0180..024F)
1085          \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
1086                                    U+2C60..2C7F)
1087          \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
1088                                    U+A720..A7FF)
1089          \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
1090                                    U+AB30..AB6F)
1091          \p{Block: Lepcha}       (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
1092                                    U+1C00..1C4F)
1093          \p{Block: Letterlike_Symbols} (80: U+2100..214F)
1094          \p{Block: Limbu}        (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
1095                                    U+1900..194F)
1096          \p{Block: Linear_A}     (NOT \p{Linear_A} NOR \p{Is_Linear_A})
1097                                    (384: U+10600..1077F)
1098          \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
1099          \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
1100          \p{Block: Lisu}         (48: U+A4D0..A4FF)
1101          \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
1102          \p{Block: Lycian}       (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
1103                                    U+10280..1029F)
1104          \p{Block: Lydian}       (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
1105                                    U+10920..1093F)
1106          \p{Block: Mahajani}     (NOT \p{Mahajani} NOR \p{Is_Mahajani})
1107                                    (48: U+11150..1117F)
1108          \p{Block: Mahjong}      \p{Block=Mahjong_Tiles} (48)
1109          \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
1110                                    U+1F000..1F02F)
1111          \p{Block: Makasar}      (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
1112                                    U+11EE0..11EFF)
1113          \p{Block: Malayalam}    (NOT \p{Malayalam} NOR \p{Is_Malayalam})
1114                                    (128: U+0D00..0D7F)
1115          \p{Block: Mandaic}      (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
1116                                    U+0840..085F)
1117          \p{Block: Manichaean}   (NOT \p{Manichaean} NOR \p{Is_Manichaean})
1118                                    (64: U+10AC0..10AFF)
1119          \p{Block: Marchen}      (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
1120                                    U+11C70..11CBF)
1121          \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
1122                                    \p{Is_Masaram_Gondi}) (96:
1123                                    U+11D00..11D5F)
1124          \p{Block: Math_Alphanum} \p{Block=
1125                                    Mathematical_Alphanumeric_Symbols} (1024)
1126          \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
1127          \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
1128                                    MathAlphanum}) (1024: U+1D400..1D7FF)
1129          \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
1130                                    (256: U+2200..22FF)
1131          \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF)
1132          \p{Block: Medefaidrin}  (NOT \p{Medefaidrin} NOR
1133                                    \p{Is_Medefaidrin}) (96: U+16E40..16E9F)
1134          \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
1135                                    \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
1136          \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
1137          \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
1138                                    (32: U+AAE0..AAFF)
1139          \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
1140                                    \p{Is_Mende_Kikakui}) (224:
1141                                    U+1E800..1E8DF)
1142          \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
1143                                    \p{Is_Meroitic_Cursive}) (96:
1144                                    U+109A0..109FF)
1145          \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
1146          \p{Block: Miao}         (NOT \p{Miao} NOR \p{Is_Miao}) (160:
1147                                    U+16F00..16F9F)
1148          \p{Block: Misc_Arrows}  \p{Block=Miscellaneous_Symbols_And_Arrows}
1149                                    (256)
1150          \p{Block: Misc_Math_Symbols_A} \p{Block=
1151                                    Miscellaneous_Mathematical_Symbols_A}
1152                                    (48)
1153          \p{Block: Misc_Math_Symbols_B} \p{Block=
1154                                    Miscellaneous_Mathematical_Symbols_B}
1155                                    (128)
1156          \p{Block: Misc_Pictographs} \p{Block=
1157                                    Miscellaneous_Symbols_And_Pictographs}
1158                                    (768)
1159          \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
1160          \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
1161          \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
1162                                    MiscMathSymbolsA}) (48: U+27C0..27EF)
1163          \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
1164                                    MiscMathSymbolsB}) (128: U+2980..29FF)
1165          \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
1166                                    (256: U+2600..26FF)
1167          \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
1168                                    MiscArrows}) (256: U+2B00..2BFF)
1169          \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
1170                                    MiscPictographs}) (768: U+1F300..1F5FF)
1171          \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
1172                                    (256: U+2300..23FF)
1173          \p{Block: Modi}         (NOT \p{Modi} NOR \p{Is_Modi}) (96:
1174                                    U+11600..1165F)
1175          \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
1176          \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
1177          \p{Block: Mongolian}    (NOT \p{Mongolian} NOR \p{Is_Mongolian})
1178                                    (176: U+1800..18AF)
1179          \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
1180          \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
1181                                    U+11660..1167F)
1182          \p{Block: Mro}          (NOT \p{Mro} NOR \p{Is_Mro}) (48:
1183                                    U+16A40..16A6F)
1184          \p{Block: Multani}      (NOT \p{Multani} NOR \p{Is_Multani}) (48:
1185                                    U+11280..112AF)
1186          \p{Block: Music}        \p{Block=Musical_Symbols} (256)
1187          \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
1188                                    U+1D100..1D1FF)
1189          \p{Block: Myanmar}      (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
1190                                    U+1000..109F)
1191          \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
1192          \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
1193          \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
1194                                    U+AA60..AA7F)
1195          \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
1196                                    U+A9E0..A9FF)
1197          \p{Block: Nabataean}    (NOT \p{Nabataean} NOR \p{Is_Nabataean})
1198                                    (48: U+10880..108AF)
1199          \p{Block: Nandinagari}  (NOT \p{Nandinagari} NOR
1200                                    \p{Is_Nandinagari}) (96: U+119A0..119FF)
1201          \p{Block: NB}           \p{Block=No_Block} (832_720 plus all
1202                                    above-Unicode code points)
1203          \p{Block: New_Tai_Lue}  (NOT \p{New_Tai_Lue} NOR
1204                                    \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
1205          \p{Block: Newa}         (NOT \p{Newa} NOR \p{Is_Newa}) (128:
1206                                    U+11400..1147F)
1207          \p{Block: NKo}          (NOT \p{Nko} NOR \p{Is_NKo}) (64:
1208                                    U+07C0..07FF)
1209          \p{Block: No_Block}     (Short: \p{Blk=NB}) (832_720 plus all
1210                                    above-Unicode code points: U+0870..089F,
1211                                    U+2FE0..2FEF, U+10200..1027F,
1212                                    U+103E0..103FF, U+10570..105FF,
1213                                    U+10780..107FF ...)
1214          \p{Block: Number_Forms} (64: U+2150..218F)
1215          \p{Block: Nushu}        (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
1216                                    U+1B170..1B2FF)
1217          \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong}
1218                                    NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80:
1219                                    U+1E100..1E14F)
1220          \p{Block: OCR}          \p{Block=Optical_Character_Recognition}
1221                                    (32)
1222          \p{Block: Ogham}        (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
1223                                    U+1680..169F)
1224          \p{Block: Ol_Chiki}     (48: U+1C50..1C7F)
1225          \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
1226                                    \p{Is_Old_Hungarian}) (128:
1227                                    U+10C80..10CFF)
1228          \p{Block: Old_Italic}   (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
1229                                    (48: U+10300..1032F)
1230          \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
1231          \p{Block: Old_Permic}   (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
1232                                    (48: U+10350..1037F)
1233          \p{Block: Old_Persian}  (NOT \p{Old_Persian} NOR
1234                                    \p{Is_Old_Persian}) (64: U+103A0..103DF)
1235          \p{Block: Old_Sogdian}  (NOT \p{Old_Sogdian} NOR
1236                                    \p{Is_Old_Sogdian}) (48: U+10F00..10F2F)
1237          \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
1238          \p{Block: Old_Turkic}   (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
1239                                    (80: U+10C00..10C4F)
1240          \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
1241                                    U+2440..245F)
1242          \p{Block: Oriya}        (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
1243                                    U+0B00..0B7F)
1244          \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
1245          \p{Block: Osage}        (NOT \p{Osage} NOR \p{Is_Osage}) (80:
1246                                    U+104B0..104FF)
1247          \p{Block: Osmanya}      (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
1248                                    U+10480..104AF)
1249          \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F)
1250          \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
1251                                    \p{Is_Pahawh_Hmong}) (144:
1252                                    U+16B00..16B8F)
1253          \p{Block: Palmyrene}    (32: U+10860..1087F)
1254          \p{Block: Pau_Cin_Hau}  (NOT \p{Pau_Cin_Hau} NOR
1255                                    \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
1256          \p{Block: Phags_Pa}     (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
1257                                    (64: U+A840..A87F)
1258          \p{Block: Phaistos}     \p{Block=Phaistos_Disc} (48)
1259          \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
1260                                    U+101D0..101FF)
1261          \p{Block: Phoenician}   (NOT \p{Phoenician} NOR \p{Is_Phoenician})
1262                                    (32: U+10900..1091F)
1263          \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
1264          \p{Block: Phonetic_Ext_Sup} \p{Block=
1265                                    Phonetic_Extensions_Supplement} (64)
1266          \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
1267                                    U+1D00..1D7F)
1268          \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
1269                                    PhoneticExtSup}) (64: U+1D80..1DBF)
1270          \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
1271          \p{Block: Private_Use}  \p{Block=Private_Use_Area} (NOT
1272                                    \p{Private_Use} NOR \p{Is_Private_Use})
1273                                    (6400)
1274          \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
1275                                    \p{Private_Use} NOR \p{Is_Private_Use})
1276                                    (6400: U+E000..F8FF)
1277          \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
1278                                    \p{Is_Psalter_Pahlavi}) (48:
1279                                    U+10B80..10BAF)
1280          \p{Block: PUA}          \p{Block=Private_Use_Area} (NOT
1281                                    \p{Private_Use} NOR \p{Is_Private_Use})
1282                                    (6400)
1283          \p{Block: Punctuation}  \p{Block=General_Punctuation} (NOT
1284                                    \p{Punct} NOR \p{Is_Punctuation}) (112)
1285          \p{Block: Rejang}       (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
1286                                    U+A930..A95F)
1287          \p{Block: Rumi}         \p{Block=Rumi_Numeral_Symbols} (32)
1288          \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
1289                                    U+10E60..10E7F)
1290          \p{Block: Runic}        (NOT \p{Runic} NOR \p{Is_Runic}) (96:
1291                                    U+16A0..16FF)
1292          \p{Block: Samaritan}    (NOT \p{Samaritan} NOR \p{Is_Samaritan})
1293                                    (64: U+0800..083F)
1294          \p{Block: Saurashtra}   (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
1295                                    (96: U+A880..A8DF)
1296          \p{Block: Sharada}      (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
1297                                    U+11180..111DF)
1298          \p{Block: Shavian}      (48: U+10450..1047F)
1299          \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
1300          \p{Block: Siddham}      (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
1301                                    U+11580..115FF)
1302          \p{Block: Sinhala}      (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
1303                                    U+0D80..0DFF)
1304          \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
1305          \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
1306                                    U+FE50..FE6F)
1307          \p{Block: Small_Forms}  \p{Block=Small_Form_Variants} (32)
1308          \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64)
1309          \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64:
1310                                    U+1B130..1B16F)
1311          \p{Block: Sogdian}      (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64:
1312                                    U+10F30..10F6F)
1313          \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
1314                                    \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
1315          \p{Block: Soyombo}      (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
1316                                    U+11A50..11AAF)
1317          \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
1318                                    ModifierLetters}) (80: U+02B0..02FF)
1319          \p{Block: Specials}     (16: U+FFF0..FFFF)
1320          \p{Block: Sundanese}    (NOT \p{Sundanese} NOR \p{Is_Sundanese})
1321                                    (64: U+1B80..1BBF)
1322          \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
1323          \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
1324                                    U+1CC0..1CCF)
1325          \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
1326          \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
1327          \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
1328          \p{Block: Sup_Math_Operators} \p{Block=
1329                                    Supplemental_Mathematical_Operators}
1330                                    (256)
1331          \p{Block: Sup_PUA_A}    \p{Block=Supplementary_Private_Use_Area_A}
1332                                    (65_536)
1333          \p{Block: Sup_PUA_B}    \p{Block=Supplementary_Private_Use_Area_B}
1334                                    (65_536)
1335          \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
1336          \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
1337                                    Supplemental_Symbols_And_Pictographs}
1338                                    (256)
1339          \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
1340          \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
1341                                    SuperAndSub}) (48: U+2070..209F)
1342          \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
1343                                    U+27F0..27FF)
1344          \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
1345                                    U+2900..297F)
1346          \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
1347                                    U+1F800..1F8FF)
1348          \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
1349                                    SupMathOperators}) (256: U+2A00..2AFF)
1350          \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
1351                                    SupPunctuation}) (128: U+2E00..2E7F)
1352          \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
1353                                    SupSymbolsAndPictographs}) (256:
1354                                    U+1F900..1F9FF)
1355          \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
1356                                    SupPUAA}) (65_536: U+F0000..FFFFF)
1357          \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
1358                                    SupPUAB}) (65_536: U+100000..10FFFF)
1359          \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
1360          \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
1361                                    \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
1362          \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block=
1363                                    Symbols_And_Pictographs_Extended_A} (144)
1364          \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk=
1365                                    SymbolsAndPictographsExtA}) (144:
1366                                    U+1FA70..1FAFF)
1367          \p{Block: Syriac}       (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
1368                                    U+0700..074F)
1369          \p{Block: Syriac_Sup}   \p{Block=Syriac_Supplement} (16)
1370          \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
1371                                    U+0860..086F)
1372          \p{Block: Tagalog}      (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
1373                                    U+1700..171F)
1374          \p{Block: Tagbanwa}     (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
1375                                    (32: U+1760..177F)
1376          \p{Block: Tags}         (128: U+E0000..E007F)
1377          \p{Block: Tai_Le}       (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
1378                                    U+1950..197F)
1379          \p{Block: Tai_Tham}     (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
1380                                    (144: U+1A20..1AAF)
1381          \p{Block: Tai_Viet}     (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
1382                                    (96: U+AA80..AADF)
1383          \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1384          \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
1385                                    U+1D300..1D35F)
1386          \p{Block: Takri}        (NOT \p{Takri} NOR \p{Is_Takri}) (80:
1387                                    U+11680..116CF)
1388          \p{Block: Tamil}        (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
1389                                    U+0B80..0BFF)
1390          \p{Block: Tamil_Sup}    \p{Block=Tamil_Supplement} (64)
1391          \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
1392                                    U+11FC0..11FFF)
1393          \p{Block: Tangut}       (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
1394                                    U+17000..187FF)
1395          \p{Block: Tangut_Components} (768: U+18800..18AFF)
1396          \p{Block: Telugu}       (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
1397                                    U+0C00..0C7F)
1398          \p{Block: Thaana}       (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
1399                                    U+0780..07BF)
1400          \p{Block: Thai}         (NOT \p{Thai} NOR \p{Is_Thai}) (128:
1401                                    U+0E00..0E7F)
1402          \p{Block: Tibetan}      (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
1403                                    U+0F00..0FFF)
1404          \p{Block: Tifinagh}     (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
1405                                    (80: U+2D30..2D7F)
1406          \p{Block: Tirhuta}      (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
1407                                    U+11480..114DF)
1408          \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1409                                    (128)
1410          \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1411                                    TransportAndMap}) (128: U+1F680..1F6FF)
1412          \p{Block: UCAS}         \p{Block=
1413                                    Unified_Canadian_Aboriginal_Syllabics}
1414                                    (640)
1415          \p{Block: UCAS_Ext}     \p{Block=
1416                                    Unified_Canadian_Aboriginal_Syllabics_-
1417                                    Extended} (80)
1418          \p{Block: Ugaritic}     (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
1419                                    (32: U+10380..1039F)
1420          \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1421                                    UCAS}) (640: U+1400..167F)
1422          \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1423                                    \p{Blk=UCASExt}) (80: U+18B0..18FF)
1424          \p{Block: Vai}          (NOT \p{Vai} NOR \p{Is_Vai}) (320:
1425                                    U+A500..A63F)
1426          \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1427                                    \p{Variation_Selector} NOR \p{Is_VS})
1428                                    (16: U+FE00..FE0F)
1429          \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1430                                    (240: U+E0100..E01EF)
1431          \p{Block: Vedic_Ext}    \p{Block=Vedic_Extensions} (48)
1432          \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
1433                                    U+1CD0..1CFF)
1434          \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
1435          \p{Block: VS}           \p{Block=Variation_Selectors} (NOT
1436                                    \p{Variation_Selector} NOR \p{Is_VS})
1437                                    (16)
1438          \p{Block: VS_Sup}       \p{Block=Variation_Selectors_Supplement}
1439                                    (240)
1440          \p{Block: Wancho}       (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
1441                                    U+1E2C0..1E2FF)
1442          \p{Block: Warang_Citi}  (NOT \p{Warang_Citi} NOR
1443                                    \p{Is_Warang_Citi}) (96: U+118A0..118FF)
1444          \p{Block: Yi_Radicals}  (64: U+A490..A4CF)
1445          \p{Block: Yi_Syllables} (1168: U+A000..A48F)
1446          \p{Block: Yijing}       \p{Block=Yijing_Hexagram_Symbols} (64)
1447          \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
1448                                    U+4DC0..4DFF)
1449          \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
1450                                    \p{Is_Zanabazar_Square}) (80:
1451                                    U+11A00..11A4F)
1452        X \p{Block_Elements}      \p{Block=Block_Elements} (32)
1453          \p{Bopo}                \p{Bopomofo} (= \p{Script_Extensions=
1454                                    Bopomofo}) (NOT \p{Block=Bopomofo}) (112)
1455          \p{Bopomofo}            \p{Script_Extensions=Bopomofo} (Short:
1456                                    \p{Bopo}; NOT \p{Block=Bopomofo}) (112)
1457        X \p{Bopomofo_Ext}        \p{Bopomofo_Extended} (= \p{Block=
1458                                    Bopomofo_Extended}) (32)
1459        X \p{Bopomofo_Extended}   \p{Block=Bopomofo_Extended} (Short:
1460                                    \p{InBopomofoExt}) (32)
1461        X \p{Box_Drawing}         \p{Block=Box_Drawing} (128)
1462          \p{Bpt: *}              \p{Bidi_Paired_Bracket_Type: *}
1463          \p{Brah}                \p{Brahmi} (= \p{Script_Extensions=
1464                                    Brahmi}) (NOT \p{Block=Brahmi}) (109)
1465          \p{Brahmi}              \p{Script_Extensions=Brahmi} (Short:
1466                                    \p{Brah}; NOT \p{Block=Brahmi}) (109)
1467          \p{Brai}                \p{Braille} (= \p{Script_Extensions=
1468                                    Braille}) (256)
1469          \p{Braille}             \p{Script_Extensions=Braille} (Short:
1470                                    \p{Brai}) (256)
1471        X \p{Braille_Patterns}    \p{Block=Braille_Patterns} (Short:
1472                                    \p{InBraille}) (256)
1473          \p{Bugi}                \p{Buginese} (= \p{Script_Extensions=
1474                                    Buginese}) (NOT \p{Block=Buginese}) (31)
1475          \p{Buginese}            \p{Script_Extensions=Buginese} (Short:
1476                                    \p{Bugi}; NOT \p{Block=Buginese}) (31)
1477          \p{Buhd}                \p{Buhid} (= \p{Script_Extensions=Buhid})
1478                                    (NOT \p{Block=Buhid}) (22)
1479          \p{Buhid}               \p{Script_Extensions=Buhid} (Short:
1480                                    \p{Buhd}; NOT \p{Block=Buhid}) (22)
1481        X \p{Byzantine_Music}     \p{Byzantine_Musical_Symbols} (= \p{Block=
1482                                    Byzantine_Musical_Symbols}) (256)
1483        X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1484                                    (Short: \p{InByzantineMusic}) (256)
1485          \p{C} \pC               \p{Other} (= \p{General_Category=Other})
1486                                    (976_344 plus all above-Unicode code
1487                                    points)
1488          \p{Cakm}                \p{Chakma} (= \p{Script_Extensions=
1489                                    Chakma}) (NOT \p{Block=Chakma}) (90)
1490          \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1491                                    (Short: \p{Cans}) (710)
1492        X \p{Canadian_Syllabics}  \p{Unified_Canadian_Aboriginal_Syllabics}
1493                                    (= \p{Block=
1494                                    Unified_Canadian_Aboriginal_Syllabics})
1495                                    (640)
1496        T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1497                                    Not_Reordered} (1_113_250 plus all
1498                                    above-Unicode code points)
1499        T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1500                                    Overlay} (32)
1501        T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1502                                    Nukta} (25)
1503        T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1504                                    Kana_Voicing} (2)
1505        T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1506                                    Virama} (58)
1507        T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1508                                    CCC10} (1)
1509          \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
1510                                    U+05B0)
1511        T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1512                                    CCC11} (1)
1513          \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
1514                                    U+05B1)
1515        T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1516                                    CCC12} (1)
1517          \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
1518                                    U+05B2)
1519        T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1520                                    CCC13} (1)
1521          \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
1522                                    U+05B3)
1523        T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1524                                    CCC14} (1)
1525          \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
1526                                    U+05B4)
1527        T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1528                                    CCC15} (1)
1529          \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
1530                                    U+05B5)
1531        T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1532                                    CCC16} (1)
1533          \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
1534                                    U+05B6)
1535        T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1536                                    CCC17} (1)
1537          \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
1538                                    U+05B7)
1539        T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1540                                    CCC18} (2)
1541          \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
1542                                    U+05B8, U+05C7)
1543        T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1544                                    CCC19} (2)
1545          \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
1546                                    U+05B9..05BA)
1547        T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1548                                    CCC20} (1)
1549          \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
1550                                    U+05BB)
1551        T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1552                                    CCC21} (1)
1553          \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
1554                                    U+05BC)
1555        T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1556                                    CCC22} (1)
1557          \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
1558                                    U+05BD)
1559        T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1560                                    CCC23} (1)
1561          \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
1562                                    U+05BF)
1563        T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1564                                    CCC24} (1)
1565          \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
1566                                    U+05C1)
1567        T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1568                                    CCC25} (1)
1569          \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
1570                                    U+05C2)
1571        T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1572                                    CCC26} (1)
1573          \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
1574                                    U+FB1E)
1575        T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1576                                    CCC27} (2)
1577          \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
1578                                    U+064B, U+08F0)
1579        T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1580                                    CCC28} (2)
1581          \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
1582                                    U+064C, U+08F1)
1583        T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1584                                    CCC29} (2)
1585          \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
1586                                    U+064D, U+08F2)
1587        T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1588                                    CCC30} (2)
1589          \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
1590                                    U+0618, U+064E)
1591        T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1592                                    CCC31} (2)
1593          \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
1594                                    U+0619, U+064F)
1595        T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1596                                    CCC32} (2)
1597          \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
1598                                    U+061A, U+0650)
1599        T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1600                                    CCC33} (1)
1601          \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
1602                                    U+0651)
1603        T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1604                                    CCC34} (1)
1605          \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
1606                                    U+0652)
1607        T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1608                                    CCC35} (1)
1609          \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
1610                                    U+0670)
1611        T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1612                                    CCC36} (1)
1613          \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
1614                                    U+0711)
1615        T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1616                                    CCC84} (1)
1617          \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
1618                                    U+0C55)
1619        T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1620                                    CCC91} (1)
1621          \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
1622                                    U+0C56)
1623        T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1624                                    CCC103} (2)
1625          \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
1626                                    U+0E38..0E39)
1627        T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1628                                    CCC107} (4)
1629          \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
1630                                    U+0E48..0E4B)
1631        T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1632                                    CCC118} (2)
1633          \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
1634                                    U+0EB8..0EB9)
1635        T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1636                                    CCC122} (4)
1637          \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
1638                                    U+0EC8..0ECB)
1639        T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1640                                    CCC129} (1)
1641          \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
1642                                    U+0F71)
1643        T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1644                                    CCC130} (6)
1645          \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
1646                                    U+0F72, U+0F7A..0F7D, U+0F80)
1647        T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1648                                    CCC132} (1)
1649          \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
1650                                    U+0F74)
1651        T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1652                                    CCC133} (0)
1653          \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1654        T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1655                                    Attached_Below_Left} (0)
1656        T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1657                                    Attached_Below} (5)
1658        T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1659                                    Attached_Above} (1)
1660        T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1661                                    Attached_Above_Right} (9)
1662        T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1663                                    Below_Left} (1)
1664        T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1665                                    Below} (163)
1666        T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1667                                    Below_Right} (4)
1668        T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1669                                    Left} (2)
1670        T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1671                                    Right} (1)
1672        T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1673                                    Above_Left} (5)
1674        T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1675                                    Above} (482)
1676        T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1677                                    Above_Right} (5)
1678        T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1679                                    Double_Below} (4)
1680        T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1681                                    Double_Above} (5)
1682        T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1683                                    Iota_Subscript} (1)
1684          \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1685                                    Above} (482)
1686          \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (482:
1687                                    U+0300..0314, U+033D..0344, U+0346,
1688                                    U+034A..034C, U+0350..0352, U+0357 ...)
1689          \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
1690                                    U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
1691          \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (5:
1692                                    U+0315, U+031A, U+0358, U+1DF6, U+302C)
1693          \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1694                                    Above_Left} (5)
1695          \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1696                                    Above_Right} (5)
1697          \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1698                                    Attached_Above} (1)
1699          \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1700                                    Attached_Above_Right} (9)
1701          \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1702                                    Attached_Below} (5)
1703          \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1704                                    Attached_Below_Left} (0)
1705          \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1706                                    (1: U+1DCE)
1707          \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1708                                    \p{Ccc=ATAR}) (9: U+031B, U+0F39,
1709                                    U+1D165..1D166, U+1D16E..1D172)
1710          \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1711                                    (5: U+0321..0322, U+0327..0328, U+1DD0)
1712          \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1713                                    ATBL}) (0)
1714          \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1715                                    Below} (163)
1716          \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (163:
1717                                    U+0316..0319, U+031C..0320,
1718                                    U+0323..0326, U+0329..0333,
1719                                    U+0339..033C, U+0347..0349 ...)
1720          \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1:
1721                                    U+302A)
1722          \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
1723                                    U+059A, U+05AD, U+1939, U+302D)
1724          \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1725                                    Below_Left} (1)
1726          \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1727                                    Below_Right} (4)
1728          \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1729                                    Double_Above} (5)
1730          \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1731                                    Double_Below} (4)
1732          \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
1733                                    (5: U+035D..035E, U+0360..0361, U+1DCD)
1734          \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
1735                                    (4: U+035C, U+035F, U+0362, U+1DFC)
1736          \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1737                                    (1: U+0345)
1738          \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1739                                    Iota_Subscript} (1)
1740          \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
1741                                    (2: U+3099..309A)
1742          \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1743                                    Kana_Voicing} (2)
1744          \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1745                                    Left} (2)
1746          \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
1747                                    U+302E..302F)
1748          \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1749                                    Nukta} (25)
1750          \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1751                                    (1_113_250 plus all above-Unicode code
1752                                    points: U+0000..02FF, U+034F,
1753                                    U+0370..0482, U+0488..0590, U+05BE,
1754                                    U+05C0 ...)
1755          \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1756                                    Not_Reordered} (1_113_250 plus all
1757                                    above-Unicode code points)
1758          \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (25:
1759                                    U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
1760                                    U+0CBC ...)
1761          \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1762                                    Overlay} (32)
1763          \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
1764                                    U+0334..0338, U+1CD4, U+1CE2..1CE8,
1765                                    U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
1766                                    ...)
1767          \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1768                                    Right} (1)
1769          \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
1770                                    U+1D16D)
1771          \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (58:
1772                                    U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
1773                                    U+0BCD ...)
1774          \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1775                                    Virama} (58)
1776          \p{Cans}                \p{Canadian_Aboriginal} (=
1777                                    \p{Script_Extensions=
1778                                    Canadian_Aboriginal}) (710)
1779          \p{Cari}                \p{Carian} (= \p{Script_Extensions=
1780                                    Carian}) (NOT \p{Block=Carian}) (49)
1781          \p{Carian}              \p{Script_Extensions=Carian} (Short:
1782                                    \p{Cari}; NOT \p{Block=Carian}) (49)
1783          \p{Case_Ignorable}      \p{Case_Ignorable=Y} (Short: \p{CI}) (2396)
1784          \p{Case_Ignorable: N*}  (Short: \p{CI=N}, \P{CI}) (1_111_716 plus
1785                                    all above-Unicode code points: [\x00-
1786                                    \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-
1787                                    Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
1788                                    \xac\xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
1789                                    U+0100..02AF, U+0370..0373,
1790                                    U+0376..0379, U+037B..0383, U+0386 ...)
1791          \p{Case_Ignorable: Y*}  (Short: \p{CI=Y}, \p{CI}) (2396:
1792                                    [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
1793                                    U+02B0..036F, U+0374..0375, U+037A,
1794                                    U+0384..0385, U+0387 ...)
1795          \p{Cased}               \p{Cased=Y} (4279)
1796          \p{Cased: N*}           (Single: \P{Cased}) (1_109_833 plus all
1797                                    above-Unicode code points: [\x00-
1798                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1799                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
1800                                    \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7], U+01BB,
1801                                    U+01C0..01C3, U+0294, U+02B9..02BF,
1802                                    U+02C2..02DF ...)
1803          \p{Cased: Y*}           (Single: \p{Cased}) (4279: [A-Za-
1804                                    z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
1805                                    \xff], U+0100..01BA, U+01BC..01BF,
1806                                    U+01C4..0293, U+0295..02B8, U+02C0..02C1
1807                                    ...)
1808          \p{Cased_Letter}        \p{General_Category=Cased_Letter} (Short:
1809                                    \p{LC}) (3970)
1810          \p{Category: *}         \p{General_Category: *}
1811          \p{Caucasian_Albanian}  \p{Script_Extensions=Caucasian_Albanian}
1812                                    (Short: \p{Aghb}; NOT \p{Block=
1813                                    Caucasian_Albanian}) (53)
1814          \p{Cc}                  \p{XPosixCntrl} (= \p{General_Category=
1815                                    Control}) (65)
1816          \p{Ccc: *}              \p{Canonical_Combining_Class: *}
1817          \p{CE}                  \p{Composition_Exclusion} (=
1818                                    \p{Composition_Exclusion=Y}) (81)
1819          \p{CE: *}               \p{Composition_Exclusion: *}
1820          \p{Cf}                  \p{Format} (= \p{General_Category=Format})
1821                                    (161)
1822          \p{Chakma}              \p{Script_Extensions=Chakma} (Short:
1823                                    \p{Cakm}; NOT \p{Block=Chakma}) (90)
1824          \p{Cham}                \p{Script_Extensions=Cham} (NOT \p{Block=
1825                                    Cham}) (83)
1826          \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
1827                                    \p{CWCF}) (1463)
1828          \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
1829                                    (1_112_649 plus all above-Unicode code
1830                                    points: [\x00-
1831                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1832                                    >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\xb4\xb6-
1833                                    \xbf\xd7\xe0-\xff], U+0101, U+0103,
1834                                    U+0105, U+0107, U+0109 ...)
1835          \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
1836                                    (1463: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
1837                                    U+0100, U+0102, U+0104, U+0106, U+0108
1838                                    ...)
1839          \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
1840                                    \p{CWCM}) (2841)
1841          \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
1842                                    (1_111_271 plus all above-Unicode code
1843                                    points: [\x00-
1844                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1845                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1846                                    \xbf\xd7\xf7], U+0138, U+018D, U+019B,
1847                                    U+01AA..01AB, U+01BA..01BB ...)
1848          \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
1849                                    (2841: [A-Za-z\xb5\xc0-\xd6\xd8-
1850                                    \xf6\xf8-\xff], U+0100..0137,
1851                                    U+0139..018C, U+018E..019A,
1852                                    U+019C..01A9, U+01AC..01B9 ...)
1853          \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
1854                                    \p{CWL}) (1390)
1855          \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
1856                                    (1_112_722 plus all above-Unicode code
1857                                    points: [\x00-
1858                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1859                                    >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1860                                    \xbf\xd7\xdf-\xff], U+0101, U+0103,
1861                                    U+0105, U+0107, U+0109 ...)
1862          \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1390:
1863                                    [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
1864                                    U+0104, U+0106, U+0108 ...)
1865          \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
1866                                    Y} (Short: \p{CWKCF}) (10_315)
1867          \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
1868                                    \P{CWKCF}) (1_103_797 plus all above-
1869                                    Unicode code points: [\x00-
1870                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1871                                    >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
1872                                    \xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6-
1873                                    \xb7\xbb\xbf\xd7\xe0-\xff], U+0101,
1874                                    U+0103, U+0105, U+0107, U+0109 ...)
1875          \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
1876                                    \p{CWKCF}) (10_315: [A-
1877                                    Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
1878                                    \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf],
1879                                    U+0100, U+0102, U+0104, U+0106, U+0108
1880                                    ...)
1881          \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
1882                                    \p{CWT}) (1409)
1883          \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
1884                                    (1_112_703 plus all above-Unicode code
1885                                    points: [\x00-
1886                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
1887                                    Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1888                                    \xde\xf7], U+0100, U+0102, U+0104,
1889                                    U+0106, U+0108 ...)
1890          \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1409:
1891                                    [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1892                                    U+0103, U+0105, U+0107, U+0109 ...)
1893          \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
1894                                    \p{CWU}) (1482)
1895          \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
1896                                    (1_112_630 plus all above-Unicode code
1897                                    points: [\x00-
1898                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
1899                                    Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1900                                    \xde\xf7], U+0100, U+0102, U+0104,
1901                                    U+0106, U+0108 ...)
1902          \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1482:
1903                                    [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1904                                    U+0103, U+0105, U+0107, U+0109 ...)
1905          \p{Cher}                \p{Cherokee} (= \p{Script_Extensions=
1906                                    Cherokee}) (NOT \p{Block=Cherokee}) (172)
1907          \p{Cherokee}            \p{Script_Extensions=Cherokee} (Short:
1908                                    \p{Cher}; NOT \p{Block=Cherokee}) (172)
1909        X \p{Cherokee_Sup}        \p{Cherokee_Supplement} (= \p{Block=
1910                                    Cherokee_Supplement}) (80)
1911        X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
1912                                    \p{InCherokeeSup}) (80)
1913        X \p{Chess_Symbols}       \p{Block=Chess_Symbols} (112)
1914          \p{CI}                  \p{Case_Ignorable} (= \p{Case_Ignorable=
1915                                    Y}) (2396)
1916          \p{CI: *}               \p{Case_Ignorable: *}
1917        X \p{CJK}                 \p{CJK_Unified_Ideographs} (= \p{Block=
1918                                    CJK_Unified_Ideographs}) (20_992)
1919        X \p{CJK_Compat}          \p{CJK_Compatibility} (= \p{Block=
1920                                    CJK_Compatibility}) (256)
1921        X \p{CJK_Compat_Forms}    \p{CJK_Compatibility_Forms} (= \p{Block=
1922                                    CJK_Compatibility_Forms}) (32)
1923        X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
1924                                    \p{Block=CJK_Compatibility_Ideographs})
1925                                    (512)
1926        X \p{CJK_Compat_Ideographs_Sup}
1927                                    \p{CJK_Compatibility_Ideographs_-
1928                                    Supplement} (= \p{Block=
1929                                    CJK_Compatibility_Ideographs_-
1930                                    Supplement}) (544)
1931        X \p{CJK_Compatibility}   \p{Block=CJK_Compatibility} (Short:
1932                                    \p{InCJKCompat}) (256)
1933        X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
1934                                    (Short: \p{InCJKCompatForms}) (32)
1935        X \p{CJK_Compatibility_Ideographs} \p{Block=
1936                                    CJK_Compatibility_Ideographs} (Short:
1937                                    \p{InCJKCompatIdeographs}) (512)
1938        X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
1939                                    CJK_Compatibility_Ideographs_Supplement}
1940                                    (Short: \p{InCJKCompatIdeographsSup})
1941                                    (544)
1942        X \p{CJK_Ext_A}           \p{CJK_Unified_Ideographs_Extension_A} (=
1943                                    \p{Block=
1944                                    CJK_Unified_Ideographs_Extension_A})
1945                                    (6592)
1946        X \p{CJK_Ext_B}           \p{CJK_Unified_Ideographs_Extension_B} (=
1947                                    \p{Block=
1948                                    CJK_Unified_Ideographs_Extension_B})
1949                                    (42_720)
1950        X \p{CJK_Ext_C}           \p{CJK_Unified_Ideographs_Extension_C} (=
1951                                    \p{Block=
1952                                    CJK_Unified_Ideographs_Extension_C})
1953                                    (4160)
1954        X \p{CJK_Ext_D}           \p{CJK_Unified_Ideographs_Extension_D} (=
1955                                    \p{Block=
1956                                    CJK_Unified_Ideographs_Extension_D})
1957                                    (224)
1958        X \p{CJK_Ext_E}           \p{CJK_Unified_Ideographs_Extension_E} (=
1959                                    \p{Block=
1960                                    CJK_Unified_Ideographs_Extension_E})
1961                                    (5776)
1962        X \p{CJK_Ext_F}           \p{CJK_Unified_Ideographs_Extension_F} (=
1963                                    \p{Block=
1964                                    CJK_Unified_Ideographs_Extension_F})
1965                                    (7488)
1966        X \p{CJK_Radicals_Sup}    \p{CJK_Radicals_Supplement} (= \p{Block=
1967                                    CJK_Radicals_Supplement}) (128)
1968        X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
1969                                    (Short: \p{InCJKRadicalsSup}) (128)
1970        X \p{CJK_Strokes}         \p{Block=CJK_Strokes} (48)
1971        X \p{CJK_Symbols}         \p{CJK_Symbols_And_Punctuation} (=
1972                                    \p{Block=CJK_Symbols_And_Punctuation})
1973                                    (64)
1974        X \p{CJK_Symbols_And_Punctuation} \p{Block=
1975                                    CJK_Symbols_And_Punctuation} (Short:
1976                                    \p{InCJKSymbols}) (64)
1977        X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
1978                                    (Short: \p{InCJK}) (20_992)
1979        X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
1980                                    CJK_Unified_Ideographs_Extension_A}
1981                                    (Short: \p{InCJKExtA}) (6592)
1982        X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
1983                                    CJK_Unified_Ideographs_Extension_B}
1984                                    (Short: \p{InCJKExtB}) (42_720)
1985        X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
1986                                    CJK_Unified_Ideographs_Extension_C}
1987                                    (Short: \p{InCJKExtC}) (4160)
1988        X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
1989                                    CJK_Unified_Ideographs_Extension_D}
1990                                    (Short: \p{InCJKExtD}) (224)
1991        X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
1992                                    CJK_Unified_Ideographs_Extension_E}
1993                                    (Short: \p{InCJKExtE}) (5776)
1994        X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
1995                                    CJK_Unified_Ideographs_Extension_F}
1996                                    (Short: \p{InCJKExtF}) (7488)
1997          \p{Close_Punctuation}   \p{General_Category=Close_Punctuation}
1998                                    (Short: \p{Pe}) (73)
1999          \p{Cn}                  \p{Unassigned} (= \p{General_Category=
2000                                    Unassigned}) (836_602 plus all above-
2001                                    Unicode code points)
2002          \p{Cntrl}               \p{XPosixCntrl} (= \p{General_Category=
2003                                    Control}) (65)
2004          \p{Co}                  \p{Private_Use} (= \p{General_Category=
2005                                    Private_Use}) (NOT \p{Private_Use_Area})
2006                                    (137_468)
2007        X \p{Combining_Diacritical_Marks} \p{Block=
2008                                    Combining_Diacritical_Marks} (Short:
2009                                    \p{InDiacriticals}) (112)
2010        X \p{Combining_Diacritical_Marks_Extended} \p{Block=
2011                                    Combining_Diacritical_Marks_Extended}
2012                                    (Short: \p{InDiacriticalsExt}) (80)
2013        X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
2014                                    Combining_Diacritical_Marks_For_Symbols}
2015                                    (Short: \p{InDiacriticalsForSymbols})
2016                                    (48)
2017        X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
2018                                    Combining_Diacritical_Marks_Supplement}
2019                                    (Short: \p{InDiacriticalsSup}) (64)
2020        X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
2021                                    \p{InHalfMarks}) (16)
2022          \p{Combining_Mark}      \p{Mark} (= \p{General_Category=Mark})
2023                                    (2268)
2024        X \p{Combining_Marks_For_Symbols}
2025                                    \p{Combining_Diacritical_Marks_For_-
2026                                    Symbols} (= \p{Block=
2027                                    Combining_Diacritical_Marks_For_-
2028                                    Symbols}) (48)
2029          \p{Common}              \p{Script_Extensions=Common} (Short:
2030                                    \p{Zyyy}) (7386)
2031        X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
2032                                    (Short: \p{InIndicNumberForms}) (16)
2033          \p{Comp_Ex}             \p{Full_Composition_Exclusion} (=
2034                                    \p{Full_Composition_Exclusion=Y}) (1120)
2035          \p{Comp_Ex: *}          \p{Full_Composition_Exclusion: *}
2036        X \p{Compat_Jamo}         \p{Hangul_Compatibility_Jamo} (= \p{Block=
2037                                    Hangul_Compatibility_Jamo}) (96)
2038          \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
2039                                    \p{CE}) (81)
2040          \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
2041                                    plus all above-Unicode code points:
2042                                    U+0000..0957, U+0960..09DB, U+09DE,
2043                                    U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
2044                                    ...)
2045          \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
2046                                    U+0958..095F, U+09DC..09DD, U+09DF,
2047                                    U+0A33, U+0A36, U+0A59..0A5B ...)
2048          \p{Connector_Punctuation} \p{General_Category=
2049                                    Connector_Punctuation} (Short: \p{Pc})
2050                                    (10)
2051          \p{Control}             \p{XPosixCntrl} (= \p{General_Category=
2052                                    Control}) (65)
2053        X \p{Control_Pictures}    \p{Block=Control_Pictures} (64)
2054          \p{Copt}                \p{Coptic} (= \p{Script_Extensions=
2055                                    Coptic}) (NOT \p{Block=Coptic}) (165)
2056          \p{Coptic}              \p{Script_Extensions=Coptic} (Short:
2057                                    \p{Copt}; NOT \p{Block=Coptic}) (165)
2058        X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
2059        X \p{Counting_Rod}        \p{Counting_Rod_Numerals} (= \p{Block=
2060                                    Counting_Rod_Numerals}) (32)
2061        X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
2062                                    \p{InCountingRod}) (32)
2063          \p{Cprt}                \p{Cypriot} (= \p{Script_Extensions=
2064                                    Cypriot}) (112)
2065          \p{Cs}                  \p{Surrogate} (= \p{General_Category=
2066                                    Surrogate}) (2048)
2067          \p{Cuneiform}           \p{Script_Extensions=Cuneiform} (Short:
2068                                    \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
2069        X \p{Cuneiform_Numbers}   \p{Cuneiform_Numbers_And_Punctuation} (=
2070                                    \p{Block=
2071                                    Cuneiform_Numbers_And_Punctuation}) (128)
2072        X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
2073                                    Cuneiform_Numbers_And_Punctuation}
2074                                    (Short: \p{InCuneiformNumbers}) (128)
2075          \p{Currency_Symbol}     \p{General_Category=Currency_Symbol}
2076                                    (Short: \p{Sc}) (62)
2077        X \p{Currency_Symbols}    \p{Block=Currency_Symbols} (48)
2078          \p{CWCF}                \p{Changes_When_Casefolded} (=
2079                                    \p{Changes_When_Casefolded=Y}) (1463)
2080          \p{CWCF: *}             \p{Changes_When_Casefolded: *}
2081          \p{CWCM}                \p{Changes_When_Casemapped} (=
2082                                    \p{Changes_When_Casemapped=Y}) (2841)
2083          \p{CWCM: *}             \p{Changes_When_Casemapped: *}
2084          \p{CWKCF}               \p{Changes_When_NFKC_Casefolded} (=
2085                                    \p{Changes_When_NFKC_Casefolded=Y})
2086                                    (10_315)
2087          \p{CWKCF: *}            \p{Changes_When_NFKC_Casefolded: *}
2088          \p{CWL}                 \p{Changes_When_Lowercased} (=
2089                                    \p{Changes_When_Lowercased=Y}) (1390)
2090          \p{CWL: *}              \p{Changes_When_Lowercased: *}
2091          \p{CWT}                 \p{Changes_When_Titlecased} (=
2092                                    \p{Changes_When_Titlecased=Y}) (1409)
2093          \p{CWT: *}              \p{Changes_When_Titlecased: *}
2094          \p{CWU}                 \p{Changes_When_Uppercased} (=
2095                                    \p{Changes_When_Uppercased=Y}) (1482)
2096          \p{CWU: *}              \p{Changes_When_Uppercased: *}
2097          \p{Cypriot}             \p{Script_Extensions=Cypriot} (Short:
2098                                    \p{Cprt}) (112)
2099        X \p{Cypriot_Syllabary}   \p{Block=Cypriot_Syllabary} (64)
2100          \p{Cyrillic}            \p{Script_Extensions=Cyrillic} (Short:
2101                                    \p{Cyrl}; NOT \p{Block=Cyrillic}) (446)
2102        X \p{Cyrillic_Ext_A}      \p{Cyrillic_Extended_A} (= \p{Block=
2103                                    Cyrillic_Extended_A}) (32)
2104        X \p{Cyrillic_Ext_B}      \p{Cyrillic_Extended_B} (= \p{Block=
2105                                    Cyrillic_Extended_B}) (96)
2106        X \p{Cyrillic_Ext_C}      \p{Cyrillic_Extended_C} (= \p{Block=
2107                                    Cyrillic_Extended_C}) (16)
2108        X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
2109                                    \p{InCyrillicExtA}) (32)
2110        X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
2111                                    \p{InCyrillicExtB}) (96)
2112        X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
2113                                    \p{InCyrillicExtC}) (16)
2114        X \p{Cyrillic_Sup}        \p{Cyrillic_Supplement} (= \p{Block=
2115                                    Cyrillic_Supplement}) (48)
2116        X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
2117                                    \p{InCyrillicSup}) (48)
2118        X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
2119                                    Cyrillic_Supplement}) (48)
2120          \p{Cyrl}                \p{Cyrillic} (= \p{Script_Extensions=
2121                                    Cyrillic}) (NOT \p{Block=Cyrillic}) (446)
2122          \p{Dash}                \p{Dash=Y} (28)
2123          \p{Dash: N*}            (Single: \P{Dash}) (1_114_084 plus all
2124                                    above-Unicode code points: [\x00-
2125                                    \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
2126                                    Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2127                                    U+0100..0589, U+058B..05BD,
2128                                    U+05BF..13FF, U+1401..1805, U+1807..200F
2129                                    ...)
2130          \p{Dash: Y*}            (Single: \p{Dash}) (28: [\-], U+058A,
2131                                    U+05BE, U+1400, U+1806, U+2010..2015 ...)
2132          \p{Dash_Punctuation}    \p{General_Category=Dash_Punctuation}
2133                                    (Short: \p{Pd}) (24)
2134          \p{Decimal_Number}      \p{XPosixDigit} (= \p{General_Category=
2135                                    Decimal_Number}) (630)
2136          \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
2137                                    (13_232)
2138          \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232:
2139                                    [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-
2140                                    \xdd\xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-
2141                                    \xfd\xff], U+0100..010F, U+0112..0125,
2142                                    U+0128..0130, U+0134..0137, U+0139..013E
2143                                    ...)
2144          \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
2145                                    U+2460..2473, U+24B6..24EA,
2146                                    U+3244..3247, U+3251..327E,
2147                                    U+3280..32BF, U+32D0..32FE ...)
2148          \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
2149          \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720:
2150                                    [\xa8\xaf\xb4-\xb5\xb8], U+0132..0133,
2151                                    U+013F..0140, U+0149, U+017F,
2152                                    U+01C4..01CC ...)
2153          \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
2154          \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
2155          \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
2156                                    U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
2157                                    ...)
2158          \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184: U+2102,
2159                                    U+210A..2113, U+2115, U+2119..211D,
2160                                    U+2124, U+2128 ...)
2161          \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
2162          \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
2163                                    \xbe], U+2150..215F, U+2189)
2164          \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
2165          \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
2166                                    U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
2167                                    ...)
2168          \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
2169          \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
2170                                    U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
2171                                    ...)
2172          \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
2173          \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
2174                                    U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
2175                                    ...)
2176          \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
2177          \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
2178                                    U+FF61..FFBE, U+FFC2..FFC7,
2179                                    U+FFCA..FFCF, U+FFD2..FFD7,
2180                                    U+FFDA..FFDC, U+FFE8..FFEE)
2181          \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
2182          \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
2183                                    U+0F0C, U+2007, U+2011, U+202F)
2184          \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
2185                                    Non_Canonical} (Perl extension) (3664)
2186          \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
2187                                    decompositions (Short: \p{Dt=NonCanon})
2188                                    (Perl extension) (3664:
2189                                    [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-\xba\xbc-
2190                                    \xbe], U+0132..0133, U+013F..0140,
2191                                    U+0149, U+017F, U+01C4..01CC ...)
2192          \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_216 plus
2193                                    all above-Unicode code points: [\x00-
2194                                    \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
2195                                    \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
2196                                    \xdf\xe6\xf0\xf7-\xf8\xfe],
2197                                    U+0110..0111, U+0126..0127, U+0131,
2198                                    U+0138, U+0141..0142 ...)
2199          \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
2200                                    U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
2201          \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
2202          \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
2203          \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
2204                                    U+32CC..32CF, U+32FF..3357,
2205                                    U+3371..33DF, U+33FF, U+1F130..1F14F ...)
2206          \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38: U+1D62..1D6A,
2207                                    U+2080..208E, U+2090..209C, U+2C7C)
2208          \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (153)
2209          \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (153: [\xaa\xb2-
2210                                    \xb3\xb9-\xba], U+02B0..02B8,
2211                                    U+02E0..02E4, U+10FC, U+1D2C..1D2E,
2212                                    U+1D30..1D3A ...)
2213          \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
2214          \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
2215                                    U+30FF, U+FE10..FE19, U+FE30..FE44,
2216                                    U+FE47..FE48)
2217          \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
2218                                    U+FF01..FF60, U+FFE0..FFE6)
2219          \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
2220                                    Y} (Short: \p{DI}) (4173)
2221          \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
2222                                    (1_109_939 plus all above-Unicode code
2223                                    points: [\x00-\xac\xae-\xff],
2224                                    U+0100..034E, U+0350..061B,
2225                                    U+061D..115E, U+1161..17B3, U+17B6..180A
2226                                    ...)
2227          \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
2228                                    (4173: [\xad], U+034F, U+061C,
2229                                    U+115F..1160, U+17B4..17B5, U+180B..180E
2230                                    ...)
2231          \p{Dep}                 \p{Deprecated} (= \p{Deprecated=Y}) (15)
2232          \p{Dep: *}              \p{Deprecated: *}
2233          \p{Deprecated}          \p{Deprecated=Y} (Short: \p{Dep}) (15)
2234          \p{Deprecated: N*}      (Short: \p{Dep=N}, \P{Dep}) (1_114_097
2235                                    plus all above-Unicode code points:
2236                                    U+0000..0148, U+014A..0672,
2237                                    U+0674..0F76, U+0F78, U+0F7A..17A2,
2238                                    U+17A5..2069 ...)
2239          \p{Deprecated: Y*}      (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
2240                                    U+0673, U+0F77, U+0F79, U+17A3..17A4,
2241                                    U+206A..206F ...)
2242          \p{Deseret}             \p{Script_Extensions=Deseret} (Short:
2243                                    \p{Dsrt}) (80)
2244          \p{Deva}                \p{Devanagari} (= \p{Script_Extensions=
2245                                    Devanagari}) (NOT \p{Block=Devanagari})
2246                                    (210)
2247          \p{Devanagari}          \p{Script_Extensions=Devanagari} (Short:
2248                                    \p{Deva}; NOT \p{Block=Devanagari}) (210)
2249        X \p{Devanagari_Ext}      \p{Devanagari_Extended} (= \p{Block=
2250                                    Devanagari_Extended}) (32)
2251        X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
2252                                    \p{InDevanagariExt}) (32)
2253          \p{DI}                  \p{Default_Ignorable_Code_Point} (=
2254                                    \p{Default_Ignorable_Code_Point=Y})
2255                                    (4173)
2256          \p{DI: *}               \p{Default_Ignorable_Code_Point: *}
2257          \p{Dia}                 \p{Diacritic} (= \p{Diacritic=Y}) (873)
2258          \p{Dia: *}              \p{Diacritic: *}
2259          \p{Diacritic}           \p{Diacritic=Y} (Short: \p{Dia}) (873)
2260          \p{Diacritic: N*}       (Short: \p{Dia=N}, \P{Dia}) (1_113_239
2261                                    plus all above-Unicode code points:
2262                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2263                                    >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
2264                                    \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
2265                                    U+0100..02AF, U+034F, U+0358..035C,
2266                                    U+0363..0373, U+0376..0379 ...)
2267          \p{Diacritic: Y*}       (Short: \p{Dia=Y}, \p{Dia}) (873:
2268                                    [\^`\xa8\xaf\xb4\xb7-\xb8],
2269                                    U+02B0..034E, U+0350..0357,
2270                                    U+035D..0362, U+0374..0375, U+037A ...)
2271        X \p{Diacriticals}        \p{Combining_Diacritical_Marks} (=
2272                                    \p{Block=Combining_Diacritical_Marks})
2273                                    (112)
2274        X \p{Diacriticals_Ext}    \p{Combining_Diacritical_Marks_Extended}
2275                                    (= \p{Block=
2276                                    Combining_Diacritical_Marks_Extended})
2277                                    (80)
2278        X \p{Diacriticals_For_Symbols}
2279                                    \p{Combining_Diacritical_Marks_For_-
2280                                    Symbols} (= \p{Block=
2281                                    Combining_Diacritical_Marks_For_-
2282                                    Symbols}) (48)
2283        X \p{Diacriticals_Sup}    \p{Combining_Diacritical_Marks_Supplement}
2284                                    (= \p{Block=
2285                                    Combining_Diacritical_Marks_Supplement})
2286                                    (64)
2287          \p{Digit}               \p{XPosixDigit} (= \p{General_Category=
2288                                    Decimal_Number}) (630)
2289        X \p{Dingbats}            \p{Block=Dingbats} (192)
2290          \p{Dogr}                \p{Dogra} (= \p{Script_Extensions=Dogra})
2291                                    (NOT \p{Block=Dogra}) (82)
2292          \p{Dogra}               \p{Script_Extensions=Dogra} (Short:
2293                                    \p{Dogr}; NOT \p{Block=Dogra}) (82)
2294        X \p{Domino}              \p{Domino_Tiles} (= \p{Block=
2295                                    Domino_Tiles}) (112)
2296        X \p{Domino_Tiles}        \p{Block=Domino_Tiles} (Short:
2297                                    \p{InDomino}) (112)
2298          \p{Dsrt}                \p{Deseret} (= \p{Script_Extensions=
2299                                    Deseret}) (80)
2300          \p{Dt: *}               \p{Decomposition_Type: *}
2301          \p{Dupl}                \p{Duployan} (= \p{Script_Extensions=
2302                                    Duployan}) (NOT \p{Block=Duployan}) (147)
2303          \p{Duployan}            \p{Script_Extensions=Duployan} (Short:
2304                                    \p{Dupl}; NOT \p{Block=Duployan}) (147)
2305          \p{Ea: *}               \p{East_Asian_Width: *}
2306        X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
2307                                    (208)
2308          \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
2309          \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739:
2310                                    [\xa1\xa4\xa7-\xa8\xaa\xad-\xae\xb0-
2311                                    \xb4\xb6-\xba\xbc-\xbf\xc6\xd0\xd7-
2312                                    \xd8\xde-\xe1\xe6\xe8-\xea\xec-
2313                                    \xed\xf0\xf2-\xf3\xf7-\xfa\xfc\xfe],
2314                                    U+0101, U+0111, U+0113, U+011B,
2315                                    U+0126..0127 ...)
2316          \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
2317          \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
2318                                    U+FF01..FF60, U+FFE0..FFE6)
2319          \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
2320          \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
2321                                    U+FF61..FFBE, U+FFC2..FFC7,
2322                                    U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
2323                                    ...)
2324          \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (793_252 plus
2325                                    all above-Unicode code points)
2326          \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
2327          \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-
2328                                    \x7e\xa2-\xa3\xa5-\xa6\xac\xaf],
2329                                    U+27E6..27ED, U+2985..2986)
2330          \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (793_252 plus all
2331                                    above-Unicode code points: [\x00-
2332                                    \x1f\x7f-\xa0\xa9\xab\xb5\xbb\xc0-
2333                                    \xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd\xe2-
2334                                    \xe5\xe7\xeb\xee-\xef\xf1\xf4-
2335                                    \xf6\xfb\xfd\xff], U+00FF..0100,
2336                                    U+0102..0110, U+0112, U+0114..011A,
2337                                    U+011C..0125 ...)
2338          \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (181_783)
2339          \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (181_783:
2340                                    U+1100..115F, U+231A..231B,
2341                                    U+2329..232A, U+23E9..23EC, U+23F0,
2342                                    U+23F3 ...)
2343          \p{Egyp}                \p{Egyptian_Hieroglyphs} (=
2344                                    \p{Script_Extensions=
2345                                    Egyptian_Hieroglyphs}) (NOT \p{Block=
2346                                    Egyptian_Hieroglyphs}) (1080)
2347        X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
2348                                    Egyptian_Hieroglyph_Format_Controls} (16)
          \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
                                    Egyptian_Hieroglyphs} (Short: \p{Egyp};
                                    NOT \p{Block=Egyptian_Hieroglyphs})
                                    (1080)
          \p{Elba}                \p{Elbasan} (= \p{Script_Extensions=
                                    Elbasan}) (NOT \p{Block=Elbasan}) (40)
          \p{Elbasan}             \p{Script_Extensions=Elbasan} (Short:
                                    \p{Elba}; NOT \p{Block=Elbasan}) (40)
          \p{Elym}                \p{Elymaic} (= \p{Script_Extensions=
                                    Elymaic}) (NOT \p{Block=Elymaic}) (23)
          \p{Elymaic}             \p{Script_Extensions=Elymaic} (Short:
                                    \p{Elym}; NOT \p{Block=Elymaic}) (23)
        X \p{Emoticons}           \p{Block=Emoticons} (80)
        X \p{Enclosed_Alphanum}   \p{Enclosed_Alphanumerics} (= \p{Block=
                                    Enclosed_Alphanumerics}) (160)
        X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
                                    \p{Block=
                                    Enclosed_Alphanumeric_Supplement}) (256)
        X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
                                    Enclosed_Alphanumeric_Supplement}
                                    (Short: \p{InEnclosedAlphanumSup}) (256)
        X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
                                    (Short: \p{InEnclosedAlphanum}) (160)
        X \p{Enclosed_CJK}        \p{Enclosed_CJK_Letters_And_Months} (=
                                    \p{Block=
                                    Enclosed_CJK_Letters_And_Months}) (256)
        X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
                                    Enclosed_CJK_Letters_And_Months} (Short:
                                    \p{InEnclosedCJK}) (256)
        X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
                                    (= \p{Block=
                                    Enclosed_Ideographic_Supplement}) (256)
        X \p{Enclosed_Ideographic_Supplement} \p{Block=
                                    Enclosed_Ideographic_Supplement} (Short:
                                    \p{InEnclosedIdeographicSup}) (256)
          \p{Enclosing_Mark}      \p{General_Category=Enclosing_Mark}
                                    (Short: \p{Me}) (13)
          \p{Ethi}                \p{Ethiopic} (= \p{Script_Extensions=
                                    Ethiopic}) (NOT \p{Block=Ethiopic}) (495)
          \p{Ethiopic}            \p{Script_Extensions=Ethiopic} (Short:
                                    \p{Ethi}; NOT \p{Block=Ethiopic}) (495)
        X \p{Ethiopic_Ext}        \p{Ethiopic_Extended} (= \p{Block=
                                    Ethiopic_Extended}) (96)
        X \p{Ethiopic_Ext_A}      \p{Ethiopic_Extended_A} (= \p{Block=
                                    Ethiopic_Extended_A}) (48)
        X \p{Ethiopic_Extended}   \p{Block=Ethiopic_Extended} (Short:
                                    \p{InEthiopicExt}) (96)
        X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
                                    \p{InEthiopicExtA}) (48)
        X \p{Ethiopic_Sup}        \p{Ethiopic_Supplement} (= \p{Block=
                                    Ethiopic_Supplement}) (32)
        X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
                                    \p{InEthiopicSup}) (32)
          \p{Ext}                 \p{Extender} (= \p{Extender=Y}) (47)
          \p{Ext: *}              \p{Extender: *}
          \p{Extender}            \p{Extender=Y} (Short: \p{Ext}) (47)
          \p{Extender: N*}        (Short: \p{Ext=N}, \P{Ext}) (1_114_065
                                    plus all above-Unicode code points:
                                    [\x00-\xb6\xb8-\xff], U+0100..02CF,
                                    U+02D2..063F, U+0641..07F9,
                                    U+07FB..0E45, U+0E47..0EC5 ...)
          \p{Extender: Y*}        (Short: \p{Ext=Y}, \p{Ext}) (47: [\xb7],
                                    U+02D0..02D1, U+0640, U+07FA, U+0E46,
                                    U+0EC6 ...)
          \p{Final_Punctuation}   \p{General_Category=Final_Punctuation}
                                    (Short: \p{Pf}) (10)
          \p{Format}              \p{General_Category=Format} (Short:
                                    \p{Cf}) (161)
          \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
                                    (Short: \p{CompEx}) (1120)
          \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
                                    \P{CompEx}) (1_112_992 plus all above-
                                    Unicode code points: U+0000..033F,
                                    U+0342, U+0345..0373, U+0375..037D,
                                    U+037F..0386, U+0388..0957 ...)
          \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
                                    \p{CompEx}) (1120: U+0340..0341,
                                    U+0343..0344, U+0374, U+037E, U+0387,
                                    U+0958..095F ...)
          \p{Gc: *}               \p{General_Category: *}
          \p{GCB: *}              \p{Grapheme_Cluster_Break: *}
          \p{General_Category: C} \p{General_Category=Other} (976_344 plus
                                    all above-Unicode code points)
          \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
                                    \p{Gc=LC}, \p{LC}) (3970: [A-Za-
                                    z\xb5\xc0-\xd6\xd8-\xf6\xf8-\xff],
                                    U+0100..01BA, U+01BC..01BF,
                                    U+01C4..0293, U+0295..02AF, U+0370..0373
                                    ...)
          \p{General_Category: Cc} \p{General_Category=Control} (65)
          \p{General_Category: Cf} \p{General_Category=Format} (161)
          \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
                                    (73: [\)\]\}], U+0F3B, U+0F3D, U+169C,
                                    U+2046, U+207E ...)
          \p{General_Category: Cn} \p{General_Category=Unassigned} (836_602
                                    plus all above-Unicode code points)
          \p{General_Category: Cntrl} \p{General_Category=Control} (65)
          \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
          \p{General_Category: Combining_Mark} \p{General_Category=Mark}
                                    (2268)
          \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
                                    \p{Pc}) (10: [_], U+203F..2040, U+2054,
                                    U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
          \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
                                    [\x00-\x1f\x7f-\x9f])
          \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
          \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
                                    (62: [\$\xa2-\xa5], U+058F, U+060B,
                                    U+07FE..07FF, U+09F2..09F3, U+09FB ...)
          \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
                                    (24: [\-], U+058A, U+05BE, U+1400,
                                    U+1806, U+2010..2015 ...)
          \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
                                    (630: [0-9], U+0660..0669, U+06F0..06F9,
                                    U+07C0..07C9, U+0966..096F, U+09E6..09EF
                                    ...)
          \p{General_Category: Digit} \p{General_Category=Decimal_Number}
                                    (630)
          \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
                                    (13: U+0488..0489, U+1ABE, U+20DD..20E0,
                                    U+20E2..20E4, U+A670..A672)
          \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
                                    (10: [\xbb], U+2019, U+201D, U+203A,
                                    U+2E03, U+2E05 ...)
          \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (161:
                                    [\xad], U+0600..0605, U+061C, U+06DD,
                                    U+070F, U+08E2 ...)
          \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
                                    \p{Pi}) (12: [\xab], U+2018,
                                    U+201B..201C, U+201F, U+2039, U+2E02 ...)
          \p{General_Category: L} \p{General_Category=Letter} (125_643)
        X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3970)
        X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
                                    the trailing '_' matters in spite of
                                    loose matching rules. (3970)
          \p{General_Category: LC} \p{General_Category=Cased_Letter} (3970)
          \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (125_643:
                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
                                    \xf6\xf8-\xff], U+0100..02C1,
                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
                                    U+02EE ...)
          \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
                                    (236: U+16EE..16F0, U+2160..2182,
                                    U+2185..2188, U+3007, U+3021..3029,
                                    U+3038..303A ...)
          \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
                                    (1: U+2028)
          \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
                                    (/i= General_Category=Cased_Letter)
                                    (2151)
          \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (259)
          \p{General_Category: Lo} \p{General_Category=Other_Letter}
                                    (121_414)
          \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
                                    /i= General_Category=Cased_Letter)
                                    (2151: [a-z\xb5\xdf-\xf6\xf8-\xff],
                                    U+0101, U+0103, U+0105, U+0107, U+0109
                                    ...)
          \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
                                    (/i= General_Category=Cased_Letter) (31)
          \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
                                    (/i= General_Category=Cased_Letter)
                                    (1788)
          \p{General_Category: M} \p{General_Category=Mark} (2268)
          \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2268:
                                    U+0300..036F, U+0483..0489,
                                    U+0591..05BD, U+05BF, U+05C1..05C2,
                                    U+05C4..05C5 ...)
          \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
                                    [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
                                    U+0606..0608, U+2044, U+2052,
                                    U+207A..207C ...)
          \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (429)
          \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
          \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
                                    (1826)
          \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
                                    (259: U+02B0..02C1, U+02C6..02D1,
                                    U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
          \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
                                    (121: [\^`\xa8\xaf\xb4\xb8],
                                    U+02C2..02C5, U+02D2..02DF,
                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
          \p{General_Category: N} \p{General_Category=Number} (1754)
          \p{General_Category: Nd} \p{General_Category=Decimal_Number} (630)
          \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
          \p{General_Category: No} \p{General_Category=Other_Number} (888)
          \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
                                    (1826: U+0300..036F, U+0483..0487,
                                    U+0591..05BD, U+05BF, U+05C1..05C2,
                                    U+05C4..05C5 ...)
          \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1754:
                                    [0-9\xb2-\xb3\xb9\xbc-\xbe],
                                    U+0660..0669, U+06F0..06F9,
                                    U+07C0..07C9, U+0966..096F, U+09E6..09EF
                                    ...)
          \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
                                    (75: [\(\[\{], U+0F3A, U+0F3C, U+169B,
                                    U+201A, U+201E ...)
          \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (976_344 plus
                                    all above-Unicode code points: [\x00-
                                    \x1f\x7f-\x9f\xad], U+0378..0379,
                                    U+0380..0383, U+038B, U+038D, U+03A2 ...)
          \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
                                    (121_414: [\xaa\xba], U+01BB,
                                    U+01C0..01C3, U+0294, U+05D0..05EA,
                                    U+05EF..05F2 ...)
          \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
                                    (888: [\xb2-\xb3\xb9\xbc-\xbe],
                                    U+09F4..09F9, U+0B72..0B77,
                                    U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
                                    ...)
          \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
                                    (588:
                                    [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7\xb6-
                                    \xb7\xbf], U+037E, U+0387, U+055A..055F,
                                    U+0589, U+05C0 ...)
          \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
                                    (6161: [\xa6\xa9\xae\xb0], U+0482,
                                    U+058D..058E, U+060E..060F, U+06DE,
                                    U+06E9 ...)
          \p{General_Category: P} \p{General_Category=Punctuation} (792)
          \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
                                    \p{Zp}) (1: U+2029)
          \p{General_Category: Pc} \p{General_Category=
                                    Connector_Punctuation} (10)
          \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
          \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
                                    (73)
          \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
                                    (10)
          \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
                                    (12)
          \p{General_Category: Po} \p{General_Category=Other_Punctuation}
                                    (588)
          \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
                                    (137_468: U+E000..F8FF, U+F0000..FFFFD,
                                    U+100000..10FFFD)
          \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
          \p{General_Category: Punct} \p{General_Category=Punctuation} (792)
          \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (792:
                                    [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_-
                                    \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
                                    U+037E, U+0387, U+055A..055F,
                                    U+0589..058A, U+05BE ...)
          \p{General_Category: S} \p{General_Category=Symbol} (7292)
          \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (62)
          \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
                                    [\x20\xa0], U+1680, U+2000..200A,
                                    U+2028..2029, U+202F, U+205F ...)
          \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121)
          \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
          \p{General_Category: So} \p{General_Category=Other_Symbol} (6161)
          \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
                                    (17: [\x20\xa0], U+1680, U+2000..200A,
                                    U+202F, U+205F, U+3000)
          \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
                                    (429: U+0903, U+093B, U+093E..0940,
                                    U+0949..094C, U+094E..094F, U+0982..0983
                                    ...)
          \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
                                    U+D800..DFFF)
          \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7292:
                                    [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
                                    \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
                                    U+02D2..02DF, U+02E5..02EB, U+02ED,
                                    U+02EF..02FF ...)
          \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
                                    /i= General_Category=Cased_Letter) (31:
                                    U+01C5, U+01C8, U+01CB, U+01F2,
                                    U+1F88..1F8F, U+1F98..1F9F ...)
          \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
                                    (836_602 plus all above-Unicode code
                                    points: U+0378..0379, U+0380..0383,
                                    U+038B, U+038D, U+03A2, U+0530 ...)
          \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
                                    /i= General_Category=Cased_Letter)
                                    (1788: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
                                    U+0102, U+0104, U+0106, U+0108 ...)
          \p{General_Category: Z} \p{General_Category=Separator} (19)
          \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
          \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
                                    (1)
          \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
        X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
                                    \p{InPunctuation}) (112)
        X \p{Geometric_Shapes}    \p{Block=Geometric_Shapes} (96)
        X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
                                    \p{Block=Geometric_Shapes_Extended})
                                    (128)
        X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
                                    (Short: \p{InGeometricShapesExt}) (128)
          \p{Geor}                \p{Georgian} (= \p{Script_Extensions=
                                    Georgian}) (NOT \p{Block=Georgian}) (175)
          \p{Georgian}            \p{Script_Extensions=Georgian} (Short:
                                    \p{Geor}; NOT \p{Block=Georgian}) (175)
        X \p{Georgian_Ext}        \p{Georgian_Extended} (= \p{Block=
                                    Georgian_Extended}) (48)
        X \p{Georgian_Extended}   \p{Block=Georgian_Extended} (Short:
                                    \p{InGeorgianExt}) (48)
        X \p{Georgian_Sup}        \p{Georgian_Supplement} (= \p{Block=
                                    Georgian_Supplement}) (48)
        X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
                                    \p{InGeorgianSup}) (48)
          \p{Glag}                \p{Glagolitic} (= \p{Script_Extensions=
                                    Glagolitic}) (NOT \p{Block=Glagolitic})
                                    (136)
          \p{Glagolitic}          \p{Script_Extensions=Glagolitic} (Short:
                                    \p{Glag}; NOT \p{Block=Glagolitic}) (136)
        X \p{Glagolitic_Sup}      \p{Glagolitic_Supplement} (= \p{Block=
                                    Glagolitic_Supplement}) (48)
        X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
                                    \p{InGlagoliticSup}) (48)
          \p{Gong}                \p{Gunjala_Gondi} (= \p{Script_Extensions=
                                    Gunjala_Gondi}) (NOT \p{Block=
                                    Gunjala_Gondi}) (65)
          \p{Gonm}                \p{Masaram_Gondi} (= \p{Script_Extensions=
                                    Masaram_Gondi}) (NOT \p{Block=
                                    Masaram_Gondi}) (77)
          \p{Goth}                \p{Gothic} (= \p{Script_Extensions=
                                    Gothic}) (NOT \p{Block=Gothic}) (27)
          \p{Gothic}              \p{Script_Extensions=Gothic} (Short:
                                    \p{Goth}; NOT \p{Block=Gothic}) (27)
          \p{Gr_Base}             \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
                                    (135_898)
          \p{Gr_Base: *}          \p{Grapheme_Base: *}
          \p{Gr_Ext}              \p{Grapheme_Extend} (= \p{Grapheme_Extend=
                                    Y}) (1965)
          \p{Gr_Ext: *}           \p{Grapheme_Extend: *}
          \p{Gran}                \p{Grantha} (= \p{Script_Extensions=
                                    Grantha}) (NOT \p{Block=Grantha}) (116)
          \p{Grantha}             \p{Script_Extensions=Grantha} (Short:
                                    \p{Gran}; NOT \p{Block=Grantha}) (116)
          \p{Graph}               \p{XPosixGraph} (275_378)
          \p{Grapheme_Base}       \p{Grapheme_Base=Y} (Short: \p{GrBase})
                                    (135_898)
          \p{Grapheme_Base: N*}   (Short: \p{GrBase=N}, \P{GrBase}) (978_214
                                    plus all above-Unicode code points:
                                    [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
                                    U+0378..0379, U+0380..0383, U+038B,
                                    U+038D ...)
          \p{Grapheme_Base: Y*}   (Short: \p{GrBase=Y}, \p{GrBase})
                                    (135_898: [\x20-\x7e\xa0-\xac\xae-\xff],
                                    U+0100..02FF, U+0370..0377,
                                    U+037A..037F, U+0384..038A, U+038C ...)
          \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
                                    (3886)
          \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3886:
                                    [^\n\r\x20-\x7e\xa0-\xac\xae-\xff],
                                    U+061C, U+180E, U+200B, U+200E..200F,
                                    U+2028..202E ...)
          \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
          \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
          \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
          \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
          \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
                                    (0)
          \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
                                    E_Base_GAZ} (0)
          \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
                                    E_Modifier} (0)
          \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
                                    (1970)
          \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1970:
                                    U+0300..036F, U+0483..0489,
                                    U+0591..05BD, U+05BF, U+05C1..05C2,
                                    U+05C4..05C5 ...)
          \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
                                    Glue_After_Zwj} (0)
          \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
          \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
                                    U+1100..115F, U+A960..A97C)
          \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
          \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
                                    U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
                                    ...)
          \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
                                    U+AC01..AC1B, U+AC1D..AC37,
                                    U+AC39..AC53, U+AC55..AC6F,
                                    U+AC71..AC8B, U+AC8D..ACA7 ...)
          \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_301
                                    plus all above-Unicode code points:
                                    [\x20-\x7e\xa0-\xac\xae-\xff],
                                    U+0100..02FF, U+0370..0482,
                                    U+048A..0590, U+05BE, U+05C0 ...)
          \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
                                    (22)
          \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (22:
                                    U+0600..0605, U+06DD, U+070F, U+08E2,
                                    U+0D4E, U+110BD ...)
          \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
                                    (26: U+1F1E6..1F1FF)
          \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
                                    Regional_Indicator} (26)
          \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
                                    SpacingMark} (375)
          \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (375:
                                    U+0903, U+093B, U+093E..0940,
                                    U+0949..094C, U+094E..094F, U+0982..0983
                                    ...)
          \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
                                    U+11A8..11FF, U+D7CB..D7FB)
          \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
                                    U+1160..11A7, U+D7B0..D7C6)
          \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
                                    (1_096_301 plus all above-Unicode code
                                    points)
          \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
          \p{Grapheme_Extend}     \p{Grapheme_Extend=Y} (Short: \p{GrExt})
                                    (1965)
          \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_147
                                    plus all above-Unicode code points:
                                    U+0000..02FF, U+0370..0482,
                                    U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
          \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1965:
                                    U+0300..036F, U+0483..0489,
                                    U+0591..05BD, U+05BF, U+05C1..05C2,
                                    U+05C4..05C5 ...)
          \p{Greek}               \p{Script_Extensions=Greek} (Short:
                                    \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
        X \p{Greek_And_Coptic}    \p{Block=Greek_And_Coptic} (Short:
                                    \p{InGreek}) (144)
        X \p{Greek_Ext}           \p{Greek_Extended} (= \p{Block=
                                    Greek_Extended}) (256)
        X \p{Greek_Extended}      \p{Block=Greek_Extended} (Short:
                                    \p{InGreekExt}) (256)
          \p{Grek}                \p{Greek} (= \p{Script_Extensions=Greek})
                                    (NOT \p{Greek_And_Coptic}) (522)
          \p{Gujarati}            \p{Script_Extensions=Gujarati} (Short:
                                    \p{Gujr}; NOT \p{Block=Gujarati}) (105)
          \p{Gujr}                \p{Gujarati} (= \p{Script_Extensions=
                                    Gujarati}) (NOT \p{Block=Gujarati}) (105)
          \p{Gunjala_Gondi}       \p{Script_Extensions=Gunjala_Gondi}
                                    (Short: \p{Gong}; NOT \p{Block=
                                    Gunjala_Gondi}) (65)
          \p{Gurmukhi}            \p{Script_Extensions=Gurmukhi} (Short:
                                    \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
          \p{Guru}                \p{Gurmukhi} (= \p{Script_Extensions=
                                    Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
        X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
                                    \p{Block=Halfwidth_And_Fullwidth_Forms})
                                    (240)
        X \p{Half_Marks}          \p{Combining_Half_Marks} (= \p{Block=
                                    Combining_Half_Marks}) (16)
        X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
                                    Halfwidth_And_Fullwidth_Forms} (Short:
                                    \p{InHalfAndFullForms}) (240)
          \p{Han}                 \p{Script_Extensions=Han} (89_513)
          \p{Hang}                \p{Hangul} (= \p{Script_Extensions=
                                    Hangul}) (NOT \p{Hangul_Syllables})
                                    (11_775)
          \p{Hangul}              \p{Script_Extensions=Hangul} (Short:
                                    \p{Hang}; NOT \p{Hangul_Syllables})
                                    (11_775)
        X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
                                    (Short: \p{InCompatJamo}) (96)
        X \p{Hangul_Jamo}         \p{Block=Hangul_Jamo} (Short: \p{InJamo})
                                    (256)
        X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
                                    (Short: \p{InJamoExtA}) (32)
        X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
                                    (Short: \p{InJamoExtB}) (80)
          \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
                                    (125)
          \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
                                    U+1100..115F, U+A960..A97C)
          \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
                                    (399)
          \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
                                    U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
                                    U+AC8C ...)
          \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
                                    LVT_Syllable} (10_773)
          \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
                                    (10_773: U+AC01..AC1B, U+AC1D..AC37,
                                    U+AC39..AC53, U+AC55..AC6F,
                                    U+AC71..AC8B, U+AC8D..ACA7 ...)
          \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
                                    Not_Applicable} (1_102_583 plus all
                                    above-Unicode code points)
          \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
                                    (1_102_583 plus all above-Unicode code
                                    points: U+0000..10FF, U+1200..A95F,
                                    U+A97D..ABFF, U+D7A4..D7AF,
                                    U+D7C7..D7CA, U+D7FC..infinity)
          \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
                                    (137)
          \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
                                    U+11A8..11FF, U+D7CB..D7FB)
          \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
                                    (95)
          \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
                                    U+1160..11A7, U+D7B0..D7C6)
        X \p{Hangul_Syllables}    \p{Block=Hangul_Syllables} (Short:
                                    \p{InHangul}) (11_184)
          \p{Hani}                \p{Han} (= \p{Script_Extensions=Han})
                                    (89_513)
          \p{Hanifi_Rohingya}     \p{Script_Extensions=Hanifi_Rohingya}
                                    (Short: \p{Rohg}; NOT \p{Block=
                                    Hanifi_Rohingya}) (55)
          \p{Hano}                \p{Hanunoo} (= \p{Script_Extensions=
                                    Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
          \p{Hanunoo}             \p{Script_Extensions=Hanunoo} (Short:
                                    \p{Hano}; NOT \p{Block=Hanunoo}) (23)
          \p{Hatr}                \p{Hatran} (= \p{Script_Extensions=
                                    Hatran}) (NOT \p{Block=Hatran}) (26)
2855          \p{Hatran}              \p{Script_Extensions=Hatran} (Short:
2856                                    \p{Hatr}; NOT \p{Block=Hatran}) (26)
2857          \p{Hebr}                \p{Hebrew} (= \p{Script_Extensions=
2858                                    Hebrew}) (NOT \p{Block=Hebrew}) (134)
2859          \p{Hebrew}              \p{Script_Extensions=Hebrew} (Short:
2860                                    \p{Hebr}; NOT \p{Block=Hebrew}) (134)
2861          \p{Hex}                 \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2862          \p{Hex: *}              \p{Hex_Digit: *}
2863          \p{Hex_Digit}           \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2864          \p{Hex_Digit: N*}       (Short: \p{Hex=N}, \P{Hex}) (1_114_068
2865                                    plus all above-Unicode code points:
2866                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
2867                                    >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
2868                                    U+0100..FF0F, U+FF1A..FF20,
2869                                    U+FF27..FF40, U+FF47..infinity)
2870          \p{Hex_Digit: Y*}       (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
2871                                    f], U+FF10..FF19, U+FF21..FF26,
2872                                    U+FF41..FF46)
2873        X \p{High_Private_Use_Surrogates} \p{Block=
2874                                    High_Private_Use_Surrogates} (Short:
2875                                    \p{InHighPUSurrogates}) (128)
2876        X \p{High_PU_Surrogates}  \p{High_Private_Use_Surrogates} (=
2877                                    \p{Block=High_Private_Use_Surrogates})
2878                                    (128)
2879        X \p{High_Surrogates}     \p{Block=High_Surrogates} (896)
2880          \p{Hira}                \p{Hiragana} (= \p{Script_Extensions=
2881                                    Hiragana}) (NOT \p{Block=Hiragana}) (431)
2882          \p{Hiragana}            \p{Script_Extensions=Hiragana} (Short:
2883                                    \p{Hira}; NOT \p{Block=Hiragana}) (431)
2884          \p{Hluw}                \p{Anatolian_Hieroglyphs} (=
2885                                    \p{Script_Extensions=
2886                                    Anatolian_Hieroglyphs}) (NOT \p{Block=
2887                                    Anatolian_Hieroglyphs}) (583)
2888          \p{Hmng}                \p{Pahawh_Hmong} (= \p{Script_Extensions=
2889                                    Pahawh_Hmong}) (NOT \p{Block=
2890                                    Pahawh_Hmong}) (127)
2891          \p{Hmnp}                \p{Nyiakeng_Puachue_Hmong} (=
2892                                    \p{Script_Extensions=
2893                                    Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
2894                                    Nyiakeng_Puachue_Hmong}) (71)
2895          \p{HorizSpace}          \p{XPosixBlank} (18)
2896          \p{Hst: *}              \p{Hangul_Syllable_Type: *}
2897          \p{Hung}                \p{Old_Hungarian} (= \p{Script_Extensions=
2898                                    Old_Hungarian}) (NOT \p{Block=
2899                                    Old_Hungarian}) (108)
2900        D \p{Hyphen}              \p{Hyphen=Y} (11)
2901        D \p{Hyphen: N*}          Supplanted by Line_Break property values;
2902                                    see www.unicode.org/reports/tr14
2903                                    (Single: \P{Hyphen}) (1_114_101 plus all
2904                                    above-Unicode code points: [\x00-
2905                                    \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
2906                                    Z\[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-
2907                                    \xff], U+0100..0589, U+058B..1805,
2908                                    U+1807..200F, U+2012..2E16, U+2E18..30FA
2909                                    ...)
2910        D \p{Hyphen: Y*}          Supplanted by Line_Break property values;
2911                                    see www.unicode.org/reports/tr14
2912                                    (Single: \p{Hyphen}) (11: [\-\xad],
2913                                    U+058A, U+1806, U+2010..2011, U+2E17,
2914                                    U+30FB ...)
2915          \p{ID_Continue}         \p{ID_Continue=Y} (Short: \p{IDC}; NOT
2916                                    \p{Ideographic_Description_Characters})
2917                                    (128_789)
2918          \p{ID_Continue: N*}     (Short: \p{IDC=N}, \P{IDC}) (985_323 plus
2919                                    all above-Unicode code points: [\x00-
2920                                    \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
2921                                    >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
2922                                    \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
2923                                    U+02C2..02C5, U+02D2..02DF,
2924                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2925          \p{ID_Continue: Y*}     (Short: \p{IDC=Y}, \p{IDC}) (128_789:
2926                                    [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
2927                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
2928                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
2929                                    U+02EE ...)
2930          \p{ID_Start}            \p{ID_Start=Y} (Short: \p{IDS}) (125_884)
2931          \p{ID_Start: N*}        (Short: \p{IDS=N}, \P{IDS}) (988_228 plus
2932                                    all above-Unicode code points: [\x00-
2933                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2934                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
2935                                    \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
2936                                    U+02C2..02C5, U+02D2..02DF,
2937                                    U+02E5..02EB, U+02ED, U+02EF..036F ...)
2938          \p{ID_Start: Y*}        (Short: \p{IDS=Y}, \p{IDS}) (125_884: [A-
2939                                    Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
2940                                    \xff], U+0100..02C1, U+02C6..02D1,
2941                                    U+02E0..02E4, U+02EC, U+02EE ...)
2942          \p{IDC}                 \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
2943                                    \p{Ideographic_Description_Characters})
2944                                    (128_789)
2945          \p{IDC: *}              \p{ID_Continue: *}
2946          \p{Ideo}                \p{Ideographic} (= \p{Ideographic=Y})
2947                                    (96_190)
2948          \p{Ideo: *}             \p{Ideographic: *}
2949          \p{Ideographic}         \p{Ideographic=Y} (Short: \p{Ideo})
2950                                    (96_190)
2951          \p{Ideographic: N*}     (Short: \p{Ideo=N}, \P{Ideo}) (1_017_922
2952                                    plus all above-Unicode code points:
2953                                    U+0000..3005, U+3008..3020,
2954                                    U+302A..3037, U+303B..33FF,
2955                                    U+4DB6..4DFF, U+9FF0..F8FF ...)
2956          \p{Ideographic: Y*}     (Short: \p{Ideo=Y}, \p{Ideo}) (96_190:
2957                                    U+3006..3007, U+3021..3029,
2958                                    U+3038..303A, U+3400..4DB5,
2959                                    U+4E00..9FEF, U+F900..FA6D ...)
2960        X \p{Ideographic_Description_Characters} \p{Block=
2961                                    Ideographic_Description_Characters}
2962                                    (Short: \p{InIDC}) (16)
2963        X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
2964                                    \p{Block=
2965                                    Ideographic_Symbols_And_Punctuation})
2966                                    (32)
2967        X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
2968                                    Ideographic_Symbols_And_Punctuation}
2969                                    (Short: \p{InIdeographicSymbols}) (32)
2970          \p{IDS}                 \p{ID_Start} (= \p{ID_Start=Y}) (125_884)
2971          \p{IDS: *}              \p{ID_Start: *}
2972          \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
2973                                    \p{IDSB}) (10)
2974          \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
2975                                    (1_114_102 plus all above-Unicode code
2976                                    points: U+0000..2FEF, U+2FF2..2FF3,
2977                                    U+2FFC..infinity)
2978          \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
2979                                    U+2FF0..2FF1, U+2FF4..2FFB)
2980          \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
2981                                    \p{IDST}) (2)
2982          \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
2983                                    (1_114_110 plus all above-Unicode code
2984                                    points: U+0000..2FF1, U+2FF4..infinity)
2985          \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
2986                                    U+2FF2..2FF3)
2987          \p{IDSB}                \p{IDS_Binary_Operator} (=
2988                                    \p{IDS_Binary_Operator=Y}) (10)
2989          \p{IDSB: *}             \p{IDS_Binary_Operator: *}
2990          \p{IDST}                \p{IDS_Trinary_Operator} (=
2991                                    \p{IDS_Trinary_Operator=Y}) (2)
2992          \p{IDST: *}             \p{IDS_Trinary_Operator: *}
2993          \p{Imperial_Aramaic}    \p{Script_Extensions=Imperial_Aramaic}
2994                                    (Short: \p{Armi}; NOT \p{Block=
2995                                    Imperial_Aramaic}) (31)
2996          \p{In: *}               \p{Present_In: *} (Perl extension)
2997        X \p{In_*}                \p{Block: *}
2998        X \p{Indic_Number_Forms}  \p{Common_Indic_Number_Forms} (= \p{Block=
2999                                    Common_Indic_Number_Forms}) (16)
3000          \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
3001                                    (349: U+093C, U+0941..0944, U+094D,
3002                                    U+0952, U+0956..0957, U+0962..0963 ...)
3003          \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
3004                                    BottomAndLeft}) (1: U+A9BF)
3005          \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
3006                                    BottomAndRight}) (2: U+1B3B, U+A9C0)
3007          \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (61:
3008                                    U+093F, U+094E, U+09BF, U+09C7..09C8,
3009                                    U+0A3F, U+0ABF ...)
3010          \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
3011                                    LeftAndRight}) (21: U+09CB..09CC,
3012                                    U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
3013                                    U+0DDC, U+0DDE ...)
3014          \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_936
3015                                    plus all above-Unicode code points:
3016                                    U+0000..08FF, U+0904..0939, U+093D,
3017                                    U+0950, U+0958..0961, U+0964..0980 ...)
3018          \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
3019                                    Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
3020                                    U+10A01, U+10A06)
3021          \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (281:
3022                                    U+0903, U+093B, U+093E, U+0940,
3023                                    U+0949..094C, U+094F ...)
3024          \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (398:
3025                                    U+0900..0902, U+093A, U+0945..0948,
3026                                    U+0951, U+0953..0955, U+0981 ...)
3027          \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
3028                                    TopAndBottom}) (10: U+0C48, U+0F73,
3029                                    U+0F76..0F79, U+0F81, U+1B3C,
3030                                    U+1112E..1112F)
3031          \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
3032                                    \p{InPC=TopAndBottomAndRight}) (1:
3033                                    U+1B3D)
3034          \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
3035                                    TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
3036                                    U+1C29, U+114BB, U+115B9)
3037          \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
3038                                    \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
3039                                    U+0DDD, U+17BF, U+115BB)
3040          \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
3041                                    TopAndRight}) (13: U+0AC9, U+0B57,
3042                                    U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
3043                                    U+1925..1926 ...)
3044          \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
3045                                    VisualOrderLeft}) (19: U+0E40..0E44,
3046                                    U+0EC0..0EC4, U+19B5..19B7, U+19BA,
3047                                    U+AAB5..AAB6, U+AAB9 ...)
3048        X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
3049          \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
3050                                    (17: U+093D, U+09BD, U+0ABD, U+0B3D,
3051                                    U+0C3D, U+0CBD ...)
3052          \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (86:
3053                                    U+0900..0902, U+0981..0982, U+09FC,
3054                                    U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
3055          \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
3056                                    \p{InSC=BrahmiJoiningNumber}) (20:
3057                                    U+11052..11065)
3058          \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
3059                                    CantillationMark}) (59: U+0951..0952,
3060                                    U+0A51, U+0AFA..0AFC, U+1CD0..1CD2,
3061                                    U+1CD4..1CE1, U+1CF4 ...)
3062          \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
3063                                    (2160: U+0915..0939, U+0958..095F,
3064                                    U+0978..097F, U+0995..09A8,
3065                                    U+09AA..09B0, U+09B2 ...)
3066          \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
3067                                    ConsonantDead}) (12: U+09CE,
3068                                    U+0D54..0D56, U+0D7A..0D7F, U+1CF2..1CF3)
3069          \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
3070                                    ConsonantFinal}) (67: U+1930..1931,
3071                                    U+1933..1939, U+19C1..19C7,
3072                                    U+1A58..1A59, U+1BBE..1BBF, U+1BF0..1BF1
3073                                    ...)
3074          \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
3075                                    \p{InSC=ConsonantHeadLetter}) (5:
3076                                    U+0F88..0F8C)
3077          \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
3078                                    \p{InSC=ConsonantInitialPostfixed}) (1:
3079                                    U+1A5A)
3080          \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
3081                                    ConsonantKiller}) (2: U+0E4C, U+17CD)
3082          \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
3083                                    ConsonantMedial}) (29: U+0A75,
3084                                    U+0EBC..0EBD, U+103B..103E,
3085                                    U+105E..1060, U+1082, U+1A55..1A56 ...)
3086          \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
3087                                    \p{InSC=ConsonantPlaceholder}) (22: [\-
3088                                    \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
3089                                    U+104E, U+1900 ...)
3090          \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
3091                                    \p{InSC=ConsonantPrecedingRepha}) (2:
3092                                    U+0D4E, U+11D46)
3093          \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
3094                                    ConsonantPrefixed}) (9: U+111C2..111C3,
3095                                    U+11A3A, U+11A84..11A89)
3096          \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
3097                                    ConsonantSubjoined}) (94: U+0F8D..0F97,
3098                                    U+0F99..0FBC, U+1929..192B, U+1A57,
3099                                    U+1A5B..1A5E, U+1BA1..1BA3 ...)
3100          \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
3101                                    \p{InSC=ConsonantSucceedingRepha}) (4:
3102                                    U+17CC, U+1B03, U+1B81, U+A982)
3103          \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
3104                                    \p{InSC=ConsonantWithStacker}) (6:
3105                                    U+0CF1..0CF2, U+1CF5..1CF6,
3106                                    U+11003..11004)
3107          \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
3108                                    GeminationMark}) (3: U+0A71, U+11237,
3109                                    U+11A98)
3110          \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
3111                                    InvisibleStacker}) (11: U+1039, U+17D2,
3112                                    U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
3113          \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
3114                                    U+200D)
3115          \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
3116                                    ModifyingLetter}) (1: U+0B83)
3117          \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
3118                                    NonJoiner}) (1: U+200C)
3119          \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (30:
3120                                    U+093C, U+09BC, U+0A3C, U+0ABC,
3121                                    U+0AFD..0AFF, U+0B3C ...)
3122          \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (481:
3123                                    [0-9], U+0966..096F, U+09E6..09EF,
3124                                    U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
3125                                    ...)
3126          \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
3127                                    NumberJoiner}) (1: U+1107F)
3128          \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
3129                                    (1_109_650 plus all above-Unicode code
3130                                    points: [\x00-
3131                                    \x20!\"#\$\%&\'\(\)*+,.\/:;<=>?\@A-
3132                                    Z\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
3133                                    \xb1\xb4-\xd6\xd8-\xff], U+0100..08FF,
3134                                    U+0950, U+0953..0954, U+0964..0965,
3135                                    U+0970..0971 ...)
3136          \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
3137                                    PureKiller}) (21: U+0D3B..0D3C, U+0E3A,
3138                                    U+0E4E, U+0EBA, U+0F84, U+103A ...)
3139          \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
3140                                    RegisterShifter}) (2: U+17C9..17CA)
3141          \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
3142                                    SyllableModifier}) (25: [\xb2-\xb3],
3143                                    U+09FE, U+0F35, U+0F37, U+0FC6, U+17CB
3144                                    ...)
3145          \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
3146                                    ToneLetter}) (7: U+1970..1974, U+AAC0,
3147                                    U+AAC2)
3148          \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
3149                                    (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
3150                                    U+1063..1064, U+1069..106D, U+1087..108D
3151                                    ...)
3152          \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
3153                                    U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3154                                    U+0BCD ...)
3155          \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3156                                    (35: U+0903, U+0983, U+0A03, U+0A83,
3157                                    U+0B03, U+0C03 ...)
3158          \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3159                                    U+1963..196D, U+A85E..A861, U+A866,
3160                                    U+A922..A92A, U+11150..11154)
3161          \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3162                                    VowelDependent}) (673: U+093A..093B,
3163                                    U+093E..094C, U+094E..094F,
3164                                    U+0955..0957, U+0962..0963, U+09BE..09C4
3165                                    ...)
3166          \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3167                                    VowelIndependent}) (476: U+0904..0914,
3168                                    U+0960..0961, U+0972..0977,
3169                                    U+0985..098C, U+098F..0990, U+0993..0994
3170                                    ...)
3171          \p{Inherited}           \p{Script_Extensions=Inherited} (Short:
3172                                    \p{Zinh}) (502)
3173          \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
3174                                    (Short: \p{Pi}) (12)
3175          \p{InPC: *}             \p{Indic_Positional_Category: *}
3176          \p{InSC: *}             \p{Indic_Syllabic_Category: *}
3177          \p{Inscriptional_Pahlavi} \p{Script_Extensions=
3178                                    Inscriptional_Pahlavi} (Short: \p{Phli};
3179                                    NOT \p{Block=Inscriptional_Pahlavi}) (27)
3180          \p{Inscriptional_Parthian} \p{Script_Extensions=
3181                                    Inscriptional_Parthian} (Short:
3182                                    \p{Prti}; NOT \p{Block=
3183                                    Inscriptional_Parthian}) (30)
3184        X \p{IPA_Ext}             \p{IPA_Extensions} (= \p{Block=
3185                                    IPA_Extensions}) (96)
3186        X \p{IPA_Extensions}      \p{Block=IPA_Extensions} (Short:
3187                                    \p{InIPAExt}) (96)
3188          \p{Is_*}                \p{*} (Any exceptions are individually
3189                                    noted beginning with the word NOT.) If
3190                                    an entry has flag(s) at its beginning,
3191                                    like "D", the "Is_" form has the same
3192                                    flag(s).
3193          \p{Ital}                \p{Old_Italic} (= \p{Script_Extensions=
3194                                    Old_Italic}) (NOT \p{Block=Old_Italic})
3195                                    (39)
3196        X \p{Jamo}                \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
3197                                    (256)
3198        X \p{Jamo_Ext_A}          \p{Hangul_Jamo_Extended_A} (= \p{Block=
3199                                    Hangul_Jamo_Extended_A}) (32)
3200        X \p{Jamo_Ext_B}          \p{Hangul_Jamo_Extended_B} (= \p{Block=
3201                                    Hangul_Jamo_Extended_B}) (80)
3202          \p{Java}                \p{Javanese} (= \p{Script_Extensions=
3203                                    Javanese}) (NOT \p{Block=Javanese}) (91)
3204          \p{Javanese}            \p{Script_Extensions=Javanese} (Short:
3205                                    \p{Java}; NOT \p{Block=Javanese}) (91)
3206          \p{Jg: *}               \p{Joining_Group: *}
3207          \p{Join_C}              \p{Join_Control} (= \p{Join_Control=Y}) (2)
3208          \p{Join_C: *}           \p{Join_Control: *}
3209          \p{Join_Control}        \p{Join_Control=Y} (Short: \p{JoinC}) (2)
3210          \p{Join_Control: N*}    (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
3211                                    plus all above-Unicode code points:
3212                                    U+0000..200B, U+200E..infinity)
3213          \p{Join_Control: Y*}    (Short: \p{JoinC=Y}, \p{JoinC}) (2:
3214                                    U+200C..200D)
3215          \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
3216                                    U+08BB)
3217          \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
3218                                    U+08BD)
3219          \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (1:
3220                                    U+08BC)
3221          \p{Joining_Group: Ain}  (Short: \p{Jg=Ain}) (8: U+0639..063A,
3222                                    U+06A0, U+06FC, U+075D..075F, U+08B3)
3223          \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
3224          \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10: U+0622..0623,
3225                                    U+0625, U+0627, U+0671..0673, U+0675,
3226                                    U+0773..0774)
3227          \p{Joining_Group: Beh}  (Short: \p{Jg=Beh}) (24: U+0628,
3228                                    U+062A..062B, U+066E, U+0679..0680,
3229                                    U+0750..0756, U+08A0..08A1 ...)
3230          \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
3231          \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
3232                                    BurushaskiYehBarree}) (2: U+077A..077B)
3233          \p{Joining_Group: Dal}  (Short: \p{Jg=Dal}) (15: U+062F..0630,
3234                                    U+0688..0690, U+06EE, U+0759..075A,
3235                                    U+08AE)
3236          \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
3237                                    U+0715..0716, U+072A, U+072F)
3238          \p{Joining_Group: E}    (Short: \p{Jg=E}) (1: U+0725)
3239          \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
3240                                    U+063D..063F, U+06CC, U+06CE,
3241                                    U+0775..0776)
3242          \p{Joining_Group: Fe}   (Short: \p{Jg=Fe}) (1: U+074F)
3243          \p{Joining_Group: Feh}  (Short: \p{Jg=Feh}) (10: U+0641,
3244                                    U+06A1..06A6, U+0760..0761, U+08A4)
3245          \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
                                    U+0724)
          \p{Joining_Group: Gaf}  (Short: \p{Jg=Gaf}) (14: U+063B..063C,
                                    U+06A9, U+06AB, U+06AF..06B4,
                                    U+0762..0764, U+08B0)
          \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
                                    U+072E)
          \p{Joining_Group: Hah}  (Short: \p{Jg=Hah}) (18: U+062C..062E,
                                    U+0681..0687, U+06BF, U+0757..0758,
                                    U+076E..076F, U+0772 ...)
          \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
                                    HamzaOnHehGoal}) (1: U+06C3)
          \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
                                    HanifiRohingyaKinnaYa}) (4: U+10D19,
                                    U+10D1E, U+10D20, U+10D23)
          \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
                                    HanifiRohingyaPa}) (3: U+10D02, U+10D09,
                                    U+10D1C)
          \p{Joining_Group: He}   (Short: \p{Jg=He}) (1: U+0717)
          \p{Joining_Group: Heh}  (Short: \p{Jg=Heh}) (1: U+0647)
          \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
                                    U+06C1..06C2)
          \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
          \p{Joining_Group: Kaf}  (Short: \p{Jg=Kaf}) (6: U+0643,
                                    U+06AC..06AE, U+077F, U+08B4)
          \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
          \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
          \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
                                    U+06BE, U+06FF)
          \p{Joining_Group: Lam}  (Short: \p{Jg=Lam}) (7: U+0644,
                                    U+06B5..06B8, U+076A, U+08A6)
          \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
          \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
                                    U+0866)
          \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
                                    U+0861)
          \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
                                    U+0868)
          \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
                                    (1: U+0869)
          \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
                                    U+0860)
          \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
                                    U+0864)
          \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
                                    (1: U+0865)
          \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
                                    U+0862)
          \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
                                    U+0867)
          \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
                                    U+086A)
          \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
                                    U+0863)
          \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
                                    ManichaeanAleph}) (1: U+10AC0)
          \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
                                    (2: U+10AD9..10ADA)
          \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
                                    (2: U+10AC1..10AC2)
          \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
                                    ManichaeanDaleth}) (1: U+10AC5)
          \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
                                    ManichaeanDhamedh}) (1: U+10AD4)
          \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
                                    (1: U+10AEC)
          \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
                                    ManichaeanGimel}) (2: U+10AC3..10AC4)
          \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
                                    (1: U+10ACD)
          \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
                                    ManichaeanHundred}) (1: U+10AEF)
          \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
                                    (3: U+10AD0..10AD2)
          \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
                                    ManichaeanLamedh}) (1: U+10AD3)
          \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
                                    (1: U+10AD6)
          \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
                                    (1: U+10AD7)
          \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
                                    (1: U+10AEB)
          \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
                                    U+10ADB..10ADC)
          \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
                                    (3: U+10ADE..10AE0)
          \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
                                    (1: U+10AE1)
          \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
                                    ManichaeanSadhe}) (1: U+10ADD)
          \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
                                    ManichaeanSamekh}) (1: U+10AD8)
          \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
                                    (1: U+10AE4)
          \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
                                    (1: U+10AED)
          \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
                                    (1: U+10ACE)
          \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
                                    ManichaeanThamedh}) (1: U+10AD5)
          \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
                                    ManichaeanTwenty}) (1: U+10AEE)
          \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
                                    (1: U+10AC7)
          \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
                                    (1: U+10ACF)
          \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
                                    ManichaeanZayin}) (2: U+10AC9..10ACA)
          \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
                                    U+0765..0766, U+08A7)
          \p{Joining_Group: Mim}  (Short: \p{Jg=Mim}) (1: U+0721)
          \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
                                    (1_113_800 plus all above-Unicode code
                                    points: U+0000..061F, U+0621, U+0640,
                                    U+064B..066D, U+0670, U+0674 ...)
          \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8: U+0646,
                                    U+06B9..06BC, U+0767..0769)
          \p{Joining_Group: Nun}  (Short: \p{Jg=Nun}) (1: U+0722)
          \p{Joining_Group: Nya}  (Short: \p{Jg=Nya}) (1: U+06BD)
          \p{Joining_Group: Pe}   (Short: \p{Jg=Pe}) (1: U+0726)
          \p{Joining_Group: Qaf}  (Short: \p{Jg=Qaf}) (5: U+0642, U+066F,
                                    U+06A7..06A8, U+08A5)
          \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
          \p{Joining_Group: Reh}  (Short: \p{Jg=Reh}) (19: U+0631..0632,
                                    U+0691..0699, U+06EF, U+075B,
                                    U+076B..076C, U+0771 ...)
          \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
                                    U+0727)
          \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
                                    U+08AC)
          \p{Joining_Group: Sad}  (Short: \p{Jg=Sad}) (6: U+0635..0636,
                                    U+069D..069E, U+06FB, U+08AF)
          \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
          \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
                                    U+069A..069C, U+06FA, U+075C, U+076D,
                                    U+0770 ...)
          \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
          \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
          \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
                                    U+08B1)
          \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
          \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
          \p{Joining_Group: Tah}  (Short: \p{Jg=Tah}) (4: U+0637..0638,
                                    U+069F, U+08A3)
          \p{Joining_Group: Taw}  (Short: \p{Jg=Taw}) (1: U+072C)
          \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
                                    U+0629, U+06C0, U+06D5)
          \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
                                    Hamza_On_Heh_Goal} (1)
          \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
          \p{Joining_Group: Waw}  (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
                                    U+0676..0677, U+06C4..06CB, U+06CF,
                                    U+0778..0779 ...)
          \p{Joining_Group: Yeh}  (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
                                    U+0649..064A, U+0678, U+06D0..06D1,
                                    U+0777 ...)
          \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
                                    U+06D2..06D3)
          \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
                                    U+06CD)
          \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
          \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
          \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
          \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
          \p{Joining_Type: C}     \p{Joining_Type=Join_Causing} (4)
          \p{Joining_Type: D}     \p{Joining_Type=Dual_Joining} (565)
          \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (565: U+0620,
                                    U+0626, U+0628, U+062A..062E,
                                    U+0633..063F, U+0641..0647 ...)
          \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4: U+0640,
                                    U+07FA, U+180A, U+200D)
          \p{Joining_Type: L}     \p{Joining_Type=Left_Joining} (4)
          \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (4: U+A872,
                                    U+10ACD, U+10AD7, U+10D00)
          \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_437 plus
                                    all above-Unicode code points: [\x00-
                                    \xac\xae-\xff], U+0100..02FF,
                                    U+0370..0482, U+048A..0590, U+05BE,
                                    U+05C0 ...)
          \p{Joining_Type: R}     \p{Joining_Type=Right_Joining} (118)
          \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (118:
                                    U+0622..0625, U+0627, U+0629,
                                    U+062F..0632, U+0648, U+0671..0673 ...)
          \p{Joining_Type: T}     \p{Joining_Type=Transparent} (1984)
          \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1984: [\xad],
                                    U+0300..036F, U+0483..0489,
                                    U+0591..05BD, U+05BF, U+05C1..05C2 ...)
          \p{Joining_Type: U}     \p{Joining_Type=Non_Joining} (1_111_437
                                    plus all above-Unicode code points)
          \p{Jt: *}               \p{Joining_Type: *}
          \p{Kaithi}              \p{Script_Extensions=Kaithi} (Short:
                                    \p{Kthi}; NOT \p{Block=Kaithi}) (87)
          \p{Kali}                \p{Kayah_Li} (= \p{Script_Extensions=
                                    Kayah_Li}) (48)
          \p{Kana}                \p{Katakana} (= \p{Script_Extensions=
                                    Katakana}) (NOT \p{Block=Katakana}) (356)
        X \p{Kana_Ext_A}          \p{Kana_Extended_A} (= \p{Block=
                                    Kana_Extended_A}) (48)
        X \p{Kana_Extended_A}     \p{Block=Kana_Extended_A} (Short:
                                    \p{InKanaExtA}) (48)
        X \p{Kana_Sup}            \p{Kana_Supplement} (= \p{Block=
                                    Kana_Supplement}) (256)
        X \p{Kana_Supplement}     \p{Block=Kana_Supplement} (Short:
                                    \p{InKanaSup}) (256)
        X \p{Kanbun}              \p{Block=Kanbun} (16)
        X \p{Kangxi}              \p{Kangxi_Radicals} (= \p{Block=
                                    Kangxi_Radicals}) (224)
        X \p{Kangxi_Radicals}     \p{Block=Kangxi_Radicals} (Short:
                                    \p{InKangxi}) (224)
          \p{Kannada}             \p{Script_Extensions=Kannada} (Short:
                                    \p{Knda}; NOT \p{Block=Kannada}) (104)
          \p{Katakana}            \p{Script_Extensions=Katakana} (Short:
                                    \p{Kana}; NOT \p{Block=Katakana}) (356)
        X \p{Katakana_Ext}        \p{Katakana_Phonetic_Extensions} (=
                                    \p{Block=Katakana_Phonetic_Extensions})
                                    (16)
        X \p{Katakana_Phonetic_Extensions} \p{Block=
                                    Katakana_Phonetic_Extensions} (Short:
                                    \p{InKatakanaExt}) (16)
          \p{Kayah_Li}            \p{Script_Extensions=Kayah_Li} (Short:
                                    \p{Kali}) (48)
          \p{Khar}                \p{Kharoshthi} (= \p{Script_Extensions=
                                    Kharoshthi}) (NOT \p{Block=Kharoshthi})
                                    (68)
          \p{Kharoshthi}          \p{Script_Extensions=Kharoshthi} (Short:
                                    \p{Khar}; NOT \p{Block=Kharoshthi}) (68)
          \p{Khmer}               \p{Script_Extensions=Khmer} (Short:
                                    \p{Khmr}; NOT \p{Block=Khmer}) (146)
        X \p{Khmer_Symbols}       \p{Block=Khmer_Symbols} (32)
          \p{Khmr}                \p{Khmer} (= \p{Script_Extensions=Khmer})
                                    (NOT \p{Block=Khmer}) (146)
          \p{Khoj}                \p{Khojki} (= \p{Script_Extensions=
                                    Khojki}) (NOT \p{Block=Khojki}) (82)
          \p{Khojki}              \p{Script_Extensions=Khojki} (Short:
                                    \p{Khoj}; NOT \p{Block=Khojki}) (82)
          \p{Khudawadi}           \p{Script_Extensions=Khudawadi} (Short:
                                    \p{Sind}; NOT \p{Block=Khudawadi}) (81)
          \p{Knda}                \p{Kannada} (= \p{Script_Extensions=
                                    Kannada}) (NOT \p{Block=Kannada}) (104)
          \p{Kthi}                \p{Kaithi} (= \p{Script_Extensions=
                                    Kaithi}) (NOT \p{Block=Kaithi}) (87)
          \p{L} \pL               \p{Letter} (= \p{General_Category=Letter})
                                    (125_643)
        X \p{L&}                  \p{Cased_Letter} (= \p{General_Category=
                                    Cased_Letter}) (3970)
        X \p{L_}                  \p{Cased_Letter} (= \p{General_Category=
                                    Cased_Letter}) Note the trailing '_'
                                    matters in spite of loose matching
                                    rules. (3970)
          \p{Lana}                \p{Tai_Tham} (= \p{Script_Extensions=
                                    Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
          \p{Lao}                 \p{Script_Extensions=Lao} (NOT \p{Block=
                                    Lao}) (82)
          \p{Laoo}                \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
                                    \p{Block=Lao}) (82)
          \p{Latin}               \p{Script_Extensions=Latin} (Short:
                                    \p{Latn}) (1387)
        X \p{Latin_1}             \p{Latin_1_Supplement} (= \p{Block=
                                    Latin_1_Supplement}) (128)
        X \p{Latin_1_Sup}         \p{Latin_1_Supplement} (= \p{Block=
                                    Latin_1_Supplement}) (128)
        X \p{Latin_1_Supplement}  \p{Block=Latin_1_Supplement} (Short:
                                    \p{InLatin1}) (128)
        X \p{Latin_Ext_A}         \p{Latin_Extended_A} (= \p{Block=
                                    Latin_Extended_A}) (128)
        X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
                                    \p{Block=Latin_Extended_Additional})
                                    (256)
        X \p{Latin_Ext_B}         \p{Latin_Extended_B} (= \p{Block=
                                    Latin_Extended_B}) (208)
        X \p{Latin_Ext_C}         \p{Latin_Extended_C} (= \p{Block=
                                    Latin_Extended_C}) (32)
        X \p{Latin_Ext_D}         \p{Latin_Extended_D} (= \p{Block=
                                    Latin_Extended_D}) (224)
        X \p{Latin_Ext_E}         \p{Latin_Extended_E} (= \p{Block=
                                    Latin_Extended_E}) (64)
        X \p{Latin_Extended_A}    \p{Block=Latin_Extended_A} (Short:
                                    \p{InLatinExtA}) (128)
        X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
                                    (Short: \p{InLatinExtAdditional}) (256)
        X \p{Latin_Extended_B}    \p{Block=Latin_Extended_B} (Short:
                                    \p{InLatinExtB}) (208)
        X \p{Latin_Extended_C}    \p{Block=Latin_Extended_C} (Short:
                                    \p{InLatinExtC}) (32)
        X \p{Latin_Extended_D}    \p{Block=Latin_Extended_D} (Short:
                                    \p{InLatinExtD}) (224)
        X \p{Latin_Extended_E}    \p{Block=Latin_Extended_E} (Short:
                                    \p{InLatinExtE}) (64)
          \p{Latn}                \p{Latin} (= \p{Script_Extensions=Latin})
                                    (1387)
          \p{Lb: *}               \p{Line_Break: *}
          \p{LC}                  \p{Cased_Letter} (= \p{General_Category=
                                    Cased_Letter}) (3970)
          \p{Lepc}                \p{Lepcha} (= \p{Script_Extensions=
                                    Lepcha}) (NOT \p{Block=Lepcha}) (74)
          \p{Lepcha}              \p{Script_Extensions=Lepcha} (Short:
                                    \p{Lepc}; NOT \p{Block=Lepcha}) (74)
          \p{Letter}              \p{General_Category=Letter} (Short: \p{L})
                                    (125_643)
          \p{Letter_Number}       \p{General_Category=Letter_Number} (Short:
                                    \p{Nl}) (236)
        X \p{Letterlike_Symbols}  \p{Block=Letterlike_Symbols} (80)
          \p{Limb}                \p{Limbu} (= \p{Script_Extensions=Limbu})
                                    (NOT \p{Block=Limbu}) (69)
          \p{Limbu}               \p{Script_Extensions=Limbu} (Short:
                                    \p{Limb}; NOT \p{Block=Limbu}) (69)
          \p{Lina}                \p{Linear_A} (= \p{Script_Extensions=
                                    Linear_A}) (NOT \p{Block=Linear_A}) (386)
          \p{Linb}                \p{Linear_B} (= \p{Script_Extensions=
                                    Linear_B}) (268)
          \p{Line_Break: AI}      \p{Line_Break=Ambiguous} (707)
          \p{Line_Break: AL}      \p{Line_Break=Alphabetic} (20_582)
          \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (20_582: [#&*<=>\@A-
                                    Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
                                    U+02CE..02CF, U+02D1..02D7, U+02DC,
                                    U+02DE ...)
          \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-
                                    \xa8\xaa\xb2-\xb3\xb6-\xba\xbc-
                                    \xbe\xd7\xf7], U+02C7, U+02C9..02CB,
                                    U+02CD, U+02D0, U+02D8..02DB ...)
          \p{Line_Break: B2}      \p{Line_Break=Break_Both} (3)
          \p{Line_Break: BA}      \p{Line_Break=Break_After} (239)
          \p{Line_Break: BB}      \p{Line_Break=Break_Before} (45)
          \p{Line_Break: BK}      \p{Line_Break=Mandatory_Break} (4)
          \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (239: [\t\|\xad],
                                    U+058A, U+05BE, U+0964..0965,
                                    U+0E5A..0E5B, U+0F0B ...)
          \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (45: [\xb4],
                                    U+02C8, U+02CC, U+02DF, U+0C77, U+0C84
                                    ...)
          \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
                                    U+2E3A..2E3B)
          \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
          \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
          \p{Line_Break: CB}      \p{Line_Break=Contingent_Break} (1)
          \p{Line_Break: CJ}      \p{Line_Break=
                                    Conditional_Japanese_Starter} (58)
          \p{Line_Break: CL}      \p{Line_Break=Close_Punctuation} (91)
          \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
          \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (91: [\}],
                                    U+0F3B, U+0F3D, U+169C, U+2046, U+207E
                                    ...)
          \p{Line_Break: CM}      \p{Line_Break=Combining_Mark} (2260)
          \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2260:
                                    [^\t\n\cK\f\r\x20-\x7e\x85\xa0-\xff],
                                    U+0300..034E, U+0350..035B,
                                    U+0363..036F, U+0483..0489, U+0591..05BD
                                    ...)
          \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (750:
                                    U+0E01..0E3A, U+0E40..0E4E,
                                    U+0E81..0E82, U+0E84, U+0E86..0E8A,
                                    U+0E8C..0EA3 ...)
          \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
                                    (58: U+3041, U+3043, U+3045, U+3047,
                                    U+3049, U+3063 ...)
          \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
          \p{Line_Break: CP}      \p{Line_Break=Close_Parenthesis} (2)
          \p{Line_Break: CR}      \p{Line_Break=Carriage_Return} (1)
          \p{Line_Break: E_Base}  (Short: \p{Lb=EB}) (120: U+261D, U+26F9,
                                    U+270A..270D, U+1F385, U+1F3C2..1F3C4,
                                    U+1F3C7 ...)
          \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
          \p{Line_Break: EB}      \p{Line_Break=E_Base} (120)
          \p{Line_Break: EM}      \p{Line_Break=E_Modifier} (5)
          \p{Line_Break: EX}      \p{Line_Break=Exclamation} (37)
          \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37: [!?], U+05C6,
                                    U+061B, U+061E..061F, U+06D4, U+07F9 ...)
          \p{Line_Break: GL}      \p{Line_Break=Glue} (25)
          \p{Line_Break: Glue}    (Short: \p{Lb=GL}) (25: [\xa0], U+034F,
                                    U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
          \p{Line_Break: H2}      (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
                                    U+AC38, U+AC54, U+AC70, U+AC8C ...)
          \p{Line_Break: H3}      (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
                                    U+AC1D..AC37, U+AC39..AC53,
                                    U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
                                    ...)
          \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75:
                                    U+05D0..05EA, U+05EF..05F2, U+FB1D,
                                    U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
                                    ...)
          \p{Line_Break: HL}      \p{Line_Break=Hebrew_Letter} (75)
          \p{Line_Break: HY}      \p{Line_Break=Hyphen} (1)
          \p{Line_Break: Hyphen}  (Short: \p{Lb=HY}) (1: [\-])
          \p{Line_Break: ID}      \p{Line_Break=Ideographic} (172_693)
          \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_693:
                                    U+231A..231B, U+23F0..23F3,
                                    U+2600..2603, U+2614..2615, U+2618,
                                    U+261A..261C ...)
          \p{Line_Break: IN}      \p{Line_Break=Inseparable} (6)
          \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
                                    U+037E, U+0589, U+060C..060D, U+07F8,
                                    U+2044 ...)
          \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
                                    U+22EF, U+FE19, U+10AF6)
          \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
          \p{Line_Break: IS}      \p{Line_Break=Infix_Numeric} (13)
          \p{Line_Break: JL}      (Short: \p{Lb=JL}) (125: U+1100..115F,
                                    U+A960..A97C)
          \p{Line_Break: JT}      (Short: \p{Lb=JT}) (137: U+11A8..11FF,
                                    U+D7CB..D7FB)
          \p{Line_Break: JV}      (Short: \p{Lb=JV}) (95: U+1160..11A7,
                                    U+D7B0..D7C6)
          \p{Line_Break: LF}      \p{Line_Break=Line_Feed} (1)
          \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
          \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
                                    U+2028..2029)
          \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
          \p{Line_Break: NL}      \p{Line_Break=Next_Line} (1)
          \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6,
                                    U+203C..203D, U+2047..2049, U+3005,
                                    U+301C, U+303B..303C ...)
          \p{Line_Break: NS}      \p{Line_Break=Nonstarter} (33)
          \p{Line_Break: NU}      \p{Line_Break=Numeric} (622)
          \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (622: [0-9],
                                    U+0660..0669, U+066B..066C,
                                    U+06F0..06F9, U+07C0..07C9, U+0966..096F
                                    ...)
          \p{Line_Break: OP}      \p{Line_Break=Open_Punctuation} (88)
          \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (88:
                                    [\(\[\{\xa1\xbf], U+0F3A, U+0F3C,
                                    U+169B, U+201A, U+201E ...)
          \p{Line_Break: PO}      \p{Line_Break=Postfix_Numeric} (36)
          \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (36:
                                    [\%\xa2\xb0], U+0609..060B, U+066A,
                                    U+09F2..09F3, U+09F9, U+0D79 ...)
          \p{Line_Break: PR}      \p{Line_Break=Prefix_Numeric} (68)
          \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (68: [\$+\\\xa3-
                                    \xa5\xb1], U+058F, U+07FE..07FF, U+09FB,
                                    U+0AF1, U+0BF9 ...)
          \p{Line_Break: QU}      \p{Line_Break=Quotation} (39)
          \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
                                    U+2018..2019, U+201B..201D, U+201F,
                                    U+2039..203A, U+275B..2760 ...)
          \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
                                    U+1F1E6..1F1FF)
          \p{Line_Break: RI}      \p{Line_Break=Regional_Indicator} (26)
          \p{Line_Break: SA}      \p{Line_Break=Complex_Context} (750)
        D \p{Line_Break: SG}      \p{Line_Break=Surrogate} (2048)
          \p{Line_Break: SP}      \p{Line_Break=Space} (1)
          \p{Line_Break: Space}   (Short: \p{Lb=SP}) (1: [\x20])
        D \p{Line_Break: Surrogate} Surrogates should never appear in well-
                                    formed text, and therefore shouldn't be
                                    the basis for line breaking (Short:
                                    \p{Lb=SG}) (2048: U+D800..DFFF)
          \p{Line_Break: SY}      \p{Line_Break=Break_Symbols} (1)
          \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (901_897 plus all
                                    above-Unicode code points: U+0378..0379,
                                    U+0380..0383, U+038B, U+038D, U+03A2,
                                    U+0530 ...)
          \p{Line_Break: WJ}      \p{Line_Break=Word_Joiner} (2)
          \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
          \p{Line_Break: XX}      \p{Line_Break=Unknown} (901_897 plus all
                                    above-Unicode code points)
          \p{Line_Break: ZW}      \p{Line_Break=ZWSpace} (1)
          \p{Line_Break: ZWJ}     (Short: \p{Lb=ZWJ}) (1: U+200D)
          \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
          \p{Line_Separator}      \p{General_Category=Line_Separator}
                                    (Short: \p{Zl}) (1)
          \p{Linear_A}            \p{Script_Extensions=Linear_A} (Short:
                                    \p{Lina}; NOT \p{Block=Linear_A}) (386)
          \p{Linear_B}            \p{Script_Extensions=Linear_B} (Short:
                                    \p{Linb}) (268)
        X \p{Linear_B_Ideograms}  \p{Block=Linear_B_Ideograms} (128)
        X \p{Linear_B_Syllabary}  \p{Block=Linear_B_Syllabary} (128)
          \p{Lisu}                \p{Script_Extensions=Lisu} (48)
          \p{Ll}                  \p{Lowercase_Letter} (=
                                    \p{General_Category=Lowercase_Letter})
                                    (/i= General_Category=Cased_Letter)
                                    (2151)
          \p{Lm}                  \p{Modifier_Letter} (=
                                    \p{General_Category=Modifier_Letter})
                                    (259)
          \p{Lo}                  \p{Other_Letter} (= \p{General_Category=
                                    Other_Letter}) (121_414)
          \p{LOE}                 \p{Logical_Order_Exception} (=
                                    \p{Logical_Order_Exception=Y}) (19)
          \p{LOE: *}              \p{Logical_Order_Exception: *}
          \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
                                    \p{LOE}) (19)
          \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
                                    (1_114_093 plus all above-Unicode code
                                    points: U+0000..0E3F, U+0E45..0EBF,
                                    U+0EC5..19B4, U+19B8..19B9,
                                    U+19BB..AAB4, U+AAB7..AAB8 ...)
          \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
                                    U+0E40..0E44, U+0EC0..0EC4,
                                    U+19B5..19B7, U+19BA, U+AAB5..AAB6,
                                    U+AAB9 ...)
        X \p{Low_Surrogates}      \p{Block=Low_Surrogates} (1024)
          \p{Lower}               \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
                                    Cased=Yes) (2340)
          \p{Lower: *}            \p{Lowercase: *}
          \p{Lowercase}           \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
                                    Cased=Yes) (2340)
          \p{Lowercase: N*}       (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
                                    No) (1_111_772 plus all above-Unicode
                                    code points: [\x00-
                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                                    Z\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
                                    \xb4\xb6-\xb9\xbb-\xde\xf7], U+0100,
                                    U+0102, U+0104, U+0106, U+0108 ...)
          \p{Lowercase: Y*}       (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
                                    Yes) (2340: [a-z\xaa\xb5\xba\xdf-
                                    \xf6\xf8-\xff], U+0101, U+0103, U+0105,
                                    U+0107, U+0109 ...)
          \p{Lowercase_Letter}    \p{General_Category=Lowercase_Letter}
                                    (Short: \p{Ll}; /i= General_Category=
                                    Cased_Letter) (2151)
          \p{Lt}                  \p{Titlecase_Letter} (=
                                    \p{General_Category=Titlecase_Letter})
                                    (/i= General_Category=Cased_Letter) (31)
          \p{Lu}                  \p{Uppercase_Letter} (=
                                    \p{General_Category=Uppercase_Letter})
                                    (/i= General_Category=Cased_Letter)
                                    (1788)
          \p{Lyci}                \p{Lycian} (= \p{Script_Extensions=
                                    Lycian}) (NOT \p{Block=Lycian}) (29)
3763          \p{Lycian}              \p{Script_Extensions=Lycian} (Short:
3764                                    \p{Lyci}; NOT \p{Block=Lycian}) (29)
3765          \p{Lydi}                \p{Lydian} (= \p{Script_Extensions=
3766                                    Lydian}) (NOT \p{Block=Lydian}) (27)
3767          \p{Lydian}              \p{Script_Extensions=Lydian} (Short:
3768                                    \p{Lydi}; NOT \p{Block=Lydian}) (27)
3769          \p{M} \pM               \p{Mark} (= \p{General_Category=Mark})
3770                                    (2268)
3771          \p{Mahajani}            \p{Script_Extensions=Mahajani} (Short:
3772                                    \p{Mahj}; NOT \p{Block=Mahajani}) (61)
3773          \p{Mahj}                \p{Mahajani} (= \p{Script_Extensions=
3774                                    Mahajani}) (NOT \p{Block=Mahajani}) (61)
3775        X \p{Mahjong}             \p{Mahjong_Tiles} (= \p{Block=
3776                                    Mahjong_Tiles}) (48)
3777        X \p{Mahjong_Tiles}       \p{Block=Mahjong_Tiles} (Short:
3778                                    \p{InMahjong}) (48)
3779          \p{Maka}                \p{Makasar} (= \p{Script_Extensions=
3780                                    Makasar}) (NOT \p{Block=Makasar}) (25)
3781          \p{Makasar}             \p{Script_Extensions=Makasar} (Short:
3782                                    \p{Maka}; NOT \p{Block=Makasar}) (25)
3783          \p{Malayalam}           \p{Script_Extensions=Malayalam} (Short:
3784                                    \p{Mlym}; NOT \p{Block=Malayalam}) (125)
3785          \p{Mand}                \p{Mandaic} (= \p{Script_Extensions=
3786                                    Mandaic}) (NOT \p{Block=Mandaic}) (30)
3787          \p{Mandaic}             \p{Script_Extensions=Mandaic} (Short:
3788                                    \p{Mand}; NOT \p{Block=Mandaic}) (30)
3789          \p{Mani}                \p{Manichaean} (= \p{Script_Extensions=
3790                                    Manichaean}) (NOT \p{Block=Manichaean})
3791                                    (52)
3792          \p{Manichaean}          \p{Script_Extensions=Manichaean} (Short:
3793                                    \p{Mani}; NOT \p{Block=Manichaean}) (52)
3794          \p{Marc}                \p{Marchen} (= \p{Script_Extensions=
3795                                    Marchen}) (NOT \p{Block=Marchen}) (68)
3796          \p{Marchen}             \p{Script_Extensions=Marchen} (Short:
3797                                    \p{Marc}; NOT \p{Block=Marchen}) (68)
3798          \p{Mark}                \p{General_Category=Mark} (Short: \p{M})
3799                                    (2268)
3800          \p{Masaram_Gondi}       \p{Script_Extensions=Masaram_Gondi}
3801                                    (Short: \p{Gonm}; NOT \p{Block=
3802                                    Masaram_Gondi}) (77)
3803          \p{Math}                \p{Math=Y} (2310)
3804          \p{Math: N*}            (Single: \P{Math}) (1_111_802 plus all
3805                                    above-Unicode code points: [\x00-
3806                                    \x20!\"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-
3807                                    Z\[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
3808                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
3809                                    U+03D3..03D4, U+03D6..03EF,
3810                                    U+03F2..03F3, U+03F7..0605 ...)
3811          \p{Math: Y*}            (Single: \p{Math}) (2310: [+<=
3812                                    >\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
3813                                    U+03D5, U+03F0..03F1, U+03F4..03F6,
3814                                    U+0606..0608 ...)
3815        X \p{Math_Alphanum}       \p{Mathematical_Alphanumeric_Symbols} (=
3816                                    \p{Block=
3817                                    Mathematical_Alphanumeric_Symbols})
3818                                    (1024)
3819        X \p{Math_Operators}      \p{Mathematical_Operators} (= \p{Block=
3820                                    Mathematical_Operators}) (256)
3821          \p{Math_Symbol}         \p{General_Category=Math_Symbol} (Short:
3822                                    \p{Sm}) (948)
3823        X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
3824                                    Mathematical_Alphanumeric_Symbols}
3825                                    (Short: \p{InMathAlphanum}) (1024)
3826        X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
3827                                    (Short: \p{InMathOperators}) (256)
3828        X \p{Mayan_Numerals}      \p{Block=Mayan_Numerals} (32)
3829          \p{Mc}                  \p{Spacing_Mark} (= \p{General_Category=
3830                                    Spacing_Mark}) (429)
3831          \p{Me}                  \p{Enclosing_Mark} (= \p{General_Category=
3832                                    Enclosing_Mark}) (13)
3833          \p{Medefaidrin}         \p{Script_Extensions=Medefaidrin} (Short:
3834                                    \p{Medf}; NOT \p{Block=Medefaidrin}) (91)
3835          \p{Medf}                \p{Medefaidrin} (= \p{Script_Extensions=
3836                                    Medefaidrin}) (NOT \p{Block=
3837                                    Medefaidrin}) (91)
3838          \p{Meetei_Mayek}        \p{Script_Extensions=Meetei_Mayek} (Short:
3839                                    \p{Mtei}; NOT \p{Block=Meetei_Mayek})
3840                                    (79)
3841        X \p{Meetei_Mayek_Ext}    \p{Meetei_Mayek_Extensions} (= \p{Block=
3842                                    Meetei_Mayek_Extensions}) (32)
3843        X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
3844                                    (Short: \p{InMeeteiMayekExt}) (32)
3845          \p{Mend}                \p{Mende_Kikakui} (= \p{Script_Extensions=
3846                                    Mende_Kikakui}) (NOT \p{Block=
3847                                    Mende_Kikakui}) (213)
3848          \p{Mende_Kikakui}       \p{Script_Extensions=Mende_Kikakui}
3849                                    (Short: \p{Mend}; NOT \p{Block=
3850                                    Mende_Kikakui}) (213)
3851          \p{Merc}                \p{Meroitic_Cursive} (=
3852                                    \p{Script_Extensions=Meroitic_Cursive})
3853                                    (NOT \p{Block=Meroitic_Cursive}) (90)
3854          \p{Mero}                \p{Meroitic_Hieroglyphs} (=
3855                                    \p{Script_Extensions=
3856                                    Meroitic_Hieroglyphs}) (32)
3857          \p{Meroitic_Cursive}    \p{Script_Extensions=Meroitic_Cursive}
3858                                    (Short: \p{Merc}; NOT \p{Block=
3859                                    Meroitic_Cursive}) (90)
3860          \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
3861                                    Meroitic_Hieroglyphs} (Short: \p{Mero})
3862                                    (32)
3863          \p{Miao}                \p{Script_Extensions=Miao} (NOT \p{Block=
3864                                    Miao}) (149)
3865        X \p{Misc_Arrows}         \p{Miscellaneous_Symbols_And_Arrows} (=
3866                                    \p{Block=
3867                                    Miscellaneous_Symbols_And_Arrows}) (256)
3868        X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
3869                                    (= \p{Block=
3870                                    Miscellaneous_Mathematical_Symbols_A})
3871                                    (48)
3872        X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
3873                                    (= \p{Block=
3874                                    Miscellaneous_Mathematical_Symbols_B})
3875                                    (128)
3876        X \p{Misc_Pictographs}    \p{Miscellaneous_Symbols_And_Pictographs}
3877                                    (= \p{Block=
3878                                    Miscellaneous_Symbols_And_Pictographs})
3879                                    (768)
3880        X \p{Misc_Symbols}        \p{Miscellaneous_Symbols} (= \p{Block=
3881                                    Miscellaneous_Symbols}) (256)
3882        X \p{Misc_Technical}      \p{Miscellaneous_Technical} (= \p{Block=
3883                                    Miscellaneous_Technical}) (256)
3884        X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
3885                                    Miscellaneous_Mathematical_Symbols_A}
3886                                    (Short: \p{InMiscMathSymbolsA}) (48)
3887        X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
3888                                    Miscellaneous_Mathematical_Symbols_B}
3889                                    (Short: \p{InMiscMathSymbolsB}) (128)
3890        X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
3891                                    \p{InMiscSymbols}) (256)
3892        X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
3893                                    Miscellaneous_Symbols_And_Arrows}
3894                                    (Short: \p{InMiscArrows}) (256)
3895        X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
3896                                    Miscellaneous_Symbols_And_Pictographs}
3897                                    (Short: \p{InMiscPictographs}) (768)
3898        X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
3899                                    (Short: \p{InMiscTechnical}) (256)
3900          \p{Mlym}                \p{Malayalam} (= \p{Script_Extensions=
3901                                    Malayalam}) (NOT \p{Block=Malayalam})
3902                                    (125)
3903          \p{Mn}                  \p{Nonspacing_Mark} (=
3904                                    \p{General_Category=Nonspacing_Mark})
3905                                    (1826)
3906          \p{Modi}                \p{Script_Extensions=Modi} (NOT \p{Block=
3907                                    Modi}) (89)
3908          \p{Modifier_Letter}     \p{General_Category=Modifier_Letter}
3909                                    (Short: \p{Lm}) (259)
3910        X \p{Modifier_Letters}    \p{Spacing_Modifier_Letters} (= \p{Block=
3911                                    Spacing_Modifier_Letters}) (80)
3912          \p{Modifier_Symbol}     \p{General_Category=Modifier_Symbol}
3913                                    (Short: \p{Sk}) (121)
3914        X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
3915          \p{Mong}                \p{Mongolian} (= \p{Script_Extensions=
3916                                    Mongolian}) (NOT \p{Block=Mongolian})
3917                                    (171)
3918          \p{Mongolian}           \p{Script_Extensions=Mongolian} (Short:
3919                                    \p{Mong}; NOT \p{Block=Mongolian}) (171)
3920        X \p{Mongolian_Sup}       \p{Mongolian_Supplement} (= \p{Block=
3921                                    Mongolian_Supplement}) (32)
3922        X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
3923                                    \p{InMongolianSup}) (32)
3924          \p{Mro}                 \p{Script_Extensions=Mro} (NOT \p{Block=
3925                                    Mro}) (43)
3926          \p{Mroo}                \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
3927                                    \p{Block=Mro}) (43)
3928          \p{Mtei}                \p{Meetei_Mayek} (= \p{Script_Extensions=
3929                                    Meetei_Mayek}) (NOT \p{Block=
3930                                    Meetei_Mayek}) (79)
3931          \p{Mult}                \p{Multani} (= \p{Script_Extensions=
3932                                    Multani}) (NOT \p{Block=Multani}) (48)
3933          \p{Multani}             \p{Script_Extensions=Multani} (Short:
3934                                    \p{Mult}; NOT \p{Block=Multani}) (48)
3935        X \p{Music}               \p{Musical_Symbols} (= \p{Block=
3936                                    Musical_Symbols}) (256)
3937        X \p{Musical_Symbols}     \p{Block=Musical_Symbols} (Short:
3938                                    \p{InMusic}) (256)
3939          \p{Myanmar}             \p{Script_Extensions=Myanmar} (Short:
3940                                    \p{Mymr}; NOT \p{Block=Myanmar}) (224)
3941        X \p{Myanmar_Ext_A}       \p{Myanmar_Extended_A} (= \p{Block=
3942                                    Myanmar_Extended_A}) (32)
3943        X \p{Myanmar_Ext_B}       \p{Myanmar_Extended_B} (= \p{Block=
3944                                    Myanmar_Extended_B}) (32)
3945        X \p{Myanmar_Extended_A}  \p{Block=Myanmar_Extended_A} (Short:
3946                                    \p{InMyanmarExtA}) (32)
3947        X \p{Myanmar_Extended_B}  \p{Block=Myanmar_Extended_B} (Short:
3948                                    \p{InMyanmarExtB}) (32)
3949          \p{Mymr}                \p{Myanmar} (= \p{Script_Extensions=
3950                                    Myanmar}) (NOT \p{Block=Myanmar}) (224)
3951          \p{N} \pN               \p{Number} (= \p{General_Category=Number})
3952                                    (1754)
3953          \p{Nabataean}           \p{Script_Extensions=Nabataean} (Short:
3954                                    \p{Nbat}; NOT \p{Block=Nabataean}) (40)
3955          \p{Nand}                \p{Nandinagari} (= \p{Script_Extensions=
3956                                    Nandinagari}) (NOT \p{Block=
3957                                    Nandinagari}) (86)
3958          \p{Nandinagari}         \p{Script_Extensions=Nandinagari} (Short:
3959                                    \p{Nand}; NOT \p{Block=Nandinagari}) (86)
3960          \p{Narb}                \p{Old_North_Arabian} (=
3961                                    \p{Script_Extensions=Old_North_Arabian})
3962                                    (32)
3963        X \p{NB}                  \p{No_Block} (= \p{Block=No_Block})
3964                                    (832_720 plus all above-Unicode code
3965                                    points)
3966          \p{Nbat}                \p{Nabataean} (= \p{Script_Extensions=
3967                                    Nabataean}) (NOT \p{Block=Nabataean})
3968                                    (40)
3969          \p{NChar}               \p{Noncharacter_Code_Point} (=
3970                                    \p{Noncharacter_Code_Point=Y}) (66)
3971          \p{NChar: *}            \p{Noncharacter_Code_Point: *}
3972          \p{Nd}                  \p{XPosixDigit} (= \p{General_Category=
3973                                    Decimal_Number}) (630)
3974          \p{New_Tai_Lue}         \p{Script_Extensions=New_Tai_Lue} (Short:
3975                                    \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
3976          \p{Newa}                \p{Script_Extensions=Newa} (NOT \p{Block=
3977                                    Newa}) (94)
3978          \p{NFC_QC: *}           \p{NFC_Quick_Check: *}
3979          \p{NFC_Quick_Check: M}  \p{NFC_Quick_Check=Maybe} (110)
3980          \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110:
3981                                    U+0300..0304, U+0306..030C, U+030F,
3982                                    U+0311, U+0313..0314, U+031B ...)
3983          \p{NFC_Quick_Check: N}  \p{NFC_Quick_Check=No} (NOT
3984                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
3985                                    (1120)
3986          \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
3987                                    \P{NFC_Quick_Check} NOR \P{NFC_QC})
3988                                    (1120: U+0340..0341, U+0343..0344,
3989                                    U+0374, U+037E, U+0387, U+0958..095F ...)
3990          \p{NFC_Quick_Check: Y}  \p{NFC_Quick_Check=Yes} (NOT
3991                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
3992                                    (1_112_882 plus all above-Unicode code
3993                                    points)
3994          \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
3995                                    \p{NFC_Quick_Check} NOR \p{NFC_QC})
3996                                    (1_112_882 plus all above-Unicode code
3997                                    points: U+0000..02FF, U+0305,
3998                                    U+030D..030E, U+0310, U+0312,
3999                                    U+0315..031A ...)
4000          \p{NFD_QC: *}           \p{NFD_Quick_Check: *}
4001          \p{NFD_Quick_Check: N}  \p{NFD_Quick_Check=No} (NOT
4002                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
4003                                    (13_232)
4004          \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
4005                                    \P{NFD_Quick_Check} NOR \P{NFD_QC})
4006                                    (13_232: [\xc0-\xc5\xc7-\xcf\xd1-
4007                                    \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
4008                                    \xf6\xf9-\xfd\xff], U+0100..010F,
4009                                    U+0112..0125, U+0128..0130,
4010                                    U+0134..0137, U+0139..013E ...)
4011          \p{NFD_Quick_Check: Y}  \p{NFD_Quick_Check=Yes} (NOT
4012                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
4013                                    (1_100_880 plus all above-Unicode code
4014                                    points)
4015          \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
4016                                    \p{NFD_Quick_Check} NOR \p{NFD_QC})
4017                                    (1_100_880 plus all above-Unicode code
4018                                    points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
4019                                    \xdf\xe6\xf0\xf7-\xf8\xfe],
4020                                    U+0110..0111, U+0126..0127,
4021                                    U+0131..0133, U+0138, U+013F..0142 ...)
4022          \p{NFKC_QC: *}          \p{NFKC_Quick_Check: *}
4023          \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
4024          \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110:
4025                                    U+0300..0304, U+0306..030C, U+030F,
4026                                    U+0311, U+0313..0314, U+031B ...)
4027          \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
4028                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4029                                    (4796)
4030          \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
4031                                    \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4032                                    (4796: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4033                                    \xba\xbc-\xbe], U+0132..0133,
4034                                    U+013F..0140, U+0149, U+017F,
4035                                    U+01C4..01CC ...)
4036          \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
4037                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4038                                    (1_109_206 plus all above-Unicode code
4039                                    points)
4040          \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
4041                                    \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4042                                    (1_109_206 plus all above-Unicode code
4043                                    points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4044                                    \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
4045                                    U+0100..0131, U+0134..013E,
4046                                    U+0141..0148, U+014A..017E, U+0180..01C3
4047                                    ...)
4048          \p{NFKD_QC: *}          \p{NFKD_Quick_Check: *}
4049          \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
4050                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4051                                    (16_896)
4052          \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
4053                                    \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4054                                    (16_896: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4055                                    \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
4056                                    \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
4057                                    \xf6\xf9-\xfd\xff], U+0100..010F,
4058                                    U+0112..0125, U+0128..0130,
4059                                    U+0132..0137, U+0139..0140 ...)
4060          \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
4061                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4062                                    (1_097_216 plus all above-Unicode code
4063                                    points)
4064          \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
4065                                    \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4066                                    (1_097_216 plus all above-Unicode code
4067                                    points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4068                                    \xae\xb0-\xb1\xb6-
4069                                    \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
4070                                    \xdf\xe6\xf0\xf7-\xf8\xfe],
4071                                    U+0110..0111, U+0126..0127, U+0131,
4072                                    U+0138, U+0141..0142 ...)
4073          \p{Nko}                 \p{Script_Extensions=Nko} (NOT \p{Block=
4074                                    NKo}) (62)
4075          \p{Nkoo}                \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
4076                                    \p{Block=NKo}) (62)
4077          \p{Nl}                  \p{Letter_Number} (= \p{General_Category=
4078                                    Letter_Number}) (236)
4079          \p{No}                  \p{Other_Number} (= \p{General_Category=
4080                                    Other_Number}) (888)
4081        X \p{No_Block}            \p{Block=No_Block} (Short: \p{InNB})
4082                                    (832_720 plus all above-Unicode code
4083                                    points)
4084          \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
4085                                    \p{NChar}) (66)
4086          \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
4087                                    (1_114_046 plus all above-Unicode code
4088                                    points: U+0000..FDCF, U+FDF0..FFFD,
4089                                    U+10000..1FFFD, U+20000..2FFFD,
4090                                    U+30000..3FFFD, U+40000..4FFFD ...)
4091          \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
4092                                    (66: U+FDD0..FDEF, U+FFFE..FFFF,
4093                                    U+1FFFE..1FFFF, U+2FFFE..2FFFF,
4094                                    U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
4095          \p{Nonspacing_Mark}     \p{General_Category=Nonspacing_Mark}
4096                                    (Short: \p{Mn}) (1826)
4097          \p{Nshu}                \p{Nushu} (= \p{Script_Extensions=Nushu})
4098                                    (NOT \p{Block=Nushu}) (397)
4099          \p{Nt: *}               \p{Numeric_Type: *}
4100          \p{Number}              \p{General_Category=Number} (Short: \p{N})
4101                                    (1754)
4102        X \p{Number_Forms}        \p{Block=Number_Forms} (64)
4103          \p{Numeric_Type: De}    \p{Numeric_Type=Decimal} (630)
4104          \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (630: [0-9],
4105                                    U+0660..0669, U+06F0..06F9,
4106                                    U+07C0..07C9, U+0966..096F, U+09E6..09EF
4107                                    ...)
4108          \p{Numeric_Type: Di}    \p{Numeric_Type=Digit} (128)
4109          \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
4110                                    U+1369..1371, U+19DA, U+2070,
4111                                    U+2074..2079, U+2080..2089 ...)
4112          \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_277 plus all
4113                                    above-Unicode code points: [\x00-
4114                                    \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
4115                                    Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
4116                                    \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
4117                                    U+066A..06EF, U+06FA..07BF,
4118                                    U+07CA..0965, U+0970..09E5 ...)
4119          \p{Numeric_Type: Nu}    \p{Numeric_Type=Numeric} (1077)
4120          \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1077: [\xbc-\xbe],
4121                                    U+09F4..09F9, U+0B72..0B77,
4122                                    U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
4123                                    ...)
4124        T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
4125        T \p{Numeric_Value: 0}    (Short: \p{Nv=0}) (81: [0], U+0660,
4126                                    U+06F0, U+07C0, U+0966, U+09E6 ...)
4127        T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
4128                                    U+11FD4)
4129        T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
4130        T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
4131        T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
4132        T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
4133        T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
4134        T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
4135        T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
4136        T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
4137        T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
4138                                    U+0D76, U+A833, U+11FC9..11FCA)
4139        T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
4140        T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
4141                                    U+11FCB)
4142        T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1: U+2151)
4143        T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
4144                                    U+0D77, U+215B, U+A834, U+11FCC ...)
4145        T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1: U+2150)
4146        T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
4147        T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
4148                                    U+12461, U+1ED3D)
4149        T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
4150                                    U+0D78, U+A835, U+11FCE)
4151        T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
4152                                    U+11FCF)
4153        T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
4154                                    U+0B72, U+0D73, U+A830, U+10140 ...)
4155        T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
4156                                    U+10E7D, U+1245A, U+1245D, U+12465)
4157        T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1: U+215C)
        T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1: U+2156)
        T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
        T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
                                    U+0D74, U+0F2A, U+2CFD, U+A831 ...)
        T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
        T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1: U+2157)
        T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1: U+215D)
        T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
                                    U+109FD, U+10E7E, U+1245B, U+1245E ...)
        T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
                                    U+0B74, U+0D75, U+A832, U+10178 ...)
        T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1: U+2158)
        T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
                                    U+1245C)
        T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1: U+215E)
        T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
        T \p{Numeric_Value: 1}    (Short: \p{Nv=1}) (137: [1\xb9], U+0661,
                                    U+06F1, U+07C1, U+0967, U+09E7 ...)
        T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1: U+0F2B)
        T \p{Numeric_Value: 2}    (Short: \p{Nv=2}) (136: [2\xb2], U+0662,
                                    U+06F2, U+07C2, U+0968, U+09E8 ...)
        T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1: U+0F2C)
        T \p{Numeric_Value: 3}    (Short: \p{Nv=3}) (137: [3\xb3], U+0663,
                                    U+06F3, U+07C3, U+0969, U+09E9 ...)
        T \p{Numeric_Value: 7/2}  (Short: \p{Nv=7/2}) (1: U+0F2D)
        T \p{Numeric_Value: 4}    (Short: \p{Nv=4}) (128: [4], U+0664,
                                    U+06F4, U+07C4, U+096A, U+09EA ...)
        T \p{Numeric_Value: 9/2}  (Short: \p{Nv=9/2}) (1: U+0F2E)
        T \p{Numeric_Value: 5}    (Short: \p{Nv=5}) (127: [5], U+0665,
                                    U+06F5, U+07C5, U+096B, U+09EB ...)
        T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
        T \p{Numeric_Value: 6}    (Short: \p{Nv=6}) (111: [6], U+0666,
                                    U+06F6, U+07C6, U+096C, U+09EC ...)
        T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
        T \p{Numeric_Value: 7}    (Short: \p{Nv=7}) (110: [7], U+0667,
                                    U+06F7, U+07C7, U+096D, U+09ED ...)
        T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
        T \p{Numeric_Value: 8}    (Short: \p{Nv=8}) (106: [8], U+0668,
                                    U+06F8, U+07C8, U+096E, U+09EE ...)
        T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
        T \p{Numeric_Value: 9}    (Short: \p{Nv=9}) (110: [9], U+0669,
                                    U+06F9, U+07C9, U+096F, U+09EF ...)
        T \p{Numeric_Value: 10}   (Short: \p{Nv=10}) (61: U+0BF0, U+0D70,
                                    U+1372, U+2169, U+2179, U+2469 ...)
        T \p{Numeric_Value: 11}   (Short: \p{Nv=11}) (8: U+216A, U+217A,
                                    U+246A, U+247E, U+2492, U+24EB ...)
        T \p{Numeric_Value: 12}   (Short: \p{Nv=12}) (8: U+216B, U+217B,
                                    U+246B, U+247F, U+2493, U+24EC ...)
        T \p{Numeric_Value: 13}   (Short: \p{Nv=13}) (6: U+246C, U+2480,
                                    U+2494, U+24ED, U+16E8D, U+1D2ED)
        T \p{Numeric_Value: 14}   (Short: \p{Nv=14}) (6: U+246D, U+2481,
                                    U+2495, U+24EE, U+16E8E, U+1D2EE)
        T \p{Numeric_Value: 15}   (Short: \p{Nv=15}) (6: U+246E, U+2482,
                                    U+2496, U+24EF, U+16E8F, U+1D2EF)
        T \p{Numeric_Value: 16}   (Short: \p{Nv=16}) (7: U+09F9, U+246F,
                                    U+2483, U+2497, U+24F0, U+16E90 ...)
        T \p{Numeric_Value: 17}   (Short: \p{Nv=17}) (7: U+16EE, U+2470,
                                    U+2484, U+2498, U+24F1, U+16E91 ...)
        T \p{Numeric_Value: 18}   (Short: \p{Nv=18}) (7: U+16EF, U+2471,
                                    U+2485, U+2499, U+24F2, U+16E92 ...)
        T \p{Numeric_Value: 19}   (Short: \p{Nv=19}) (7: U+16F0, U+2472,
                                    U+2486, U+249A, U+24F3, U+16E93 ...)
        T \p{Numeric_Value: 20}   (Short: \p{Nv=20}) (35: U+1373, U+2473,
                                    U+2487, U+249B, U+24F4, U+3039 ...)
        T \p{Numeric_Value: 21}   (Short: \p{Nv=21}) (1: U+3251)
        T \p{Numeric_Value: 22}   (Short: \p{Nv=22}) (1: U+3252)
        T \p{Numeric_Value: 23}   (Short: \p{Nv=23}) (1: U+3253)
        T \p{Numeric_Value: 24}   (Short: \p{Nv=24}) (1: U+3254)
        T \p{Numeric_Value: 25}   (Short: \p{Nv=25}) (1: U+3255)
        T \p{Numeric_Value: 26}   (Short: \p{Nv=26}) (1: U+3256)
        T \p{Numeric_Value: 27}   (Short: \p{Nv=27}) (1: U+3257)
        T \p{Numeric_Value: 28}   (Short: \p{Nv=28}) (1: U+3258)
        T \p{Numeric_Value: 29}   (Short: \p{Nv=29}) (1: U+3259)
        T \p{Numeric_Value: 30}   (Short: \p{Nv=30}) (19: U+1374, U+303A,
                                    U+324A, U+325A, U+5345, U+10112 ...)
        T \p{Numeric_Value: 31}   (Short: \p{Nv=31}) (1: U+325B)
        T \p{Numeric_Value: 32}   (Short: \p{Nv=32}) (1: U+325C)
        T \p{Numeric_Value: 33}   (Short: \p{Nv=33}) (1: U+325D)
        T \p{Numeric_Value: 34}   (Short: \p{Nv=34}) (1: U+325E)
        T \p{Numeric_Value: 35}   (Short: \p{Nv=35}) (1: U+325F)
        T \p{Numeric_Value: 36}   (Short: \p{Nv=36}) (1: U+32B1)
        T \p{Numeric_Value: 37}   (Short: \p{Nv=37}) (1: U+32B2)
        T \p{Numeric_Value: 38}   (Short: \p{Nv=38}) (1: U+32B3)
        T \p{Numeric_Value: 39}   (Short: \p{Nv=39}) (1: U+32B4)
        T \p{Numeric_Value: 40}   (Short: \p{Nv=40}) (18: U+1375, U+324B,
                                    U+32B5, U+534C, U+10113, U+102ED ...)
        T \p{Numeric_Value: 41}   (Short: \p{Nv=41}) (1: U+32B6)
        T \p{Numeric_Value: 42}   (Short: \p{Nv=42}) (1: U+32B7)
        T \p{Numeric_Value: 43}   (Short: \p{Nv=43}) (1: U+32B8)
        T \p{Numeric_Value: 44}   (Short: \p{Nv=44}) (1: U+32B9)
        T \p{Numeric_Value: 45}   (Short: \p{Nv=45}) (1: U+32BA)
        T \p{Numeric_Value: 46}   (Short: \p{Nv=46}) (1: U+32BB)
        T \p{Numeric_Value: 47}   (Short: \p{Nv=47}) (1: U+32BC)
        T \p{Numeric_Value: 48}   (Short: \p{Nv=48}) (1: U+32BD)
        T \p{Numeric_Value: 49}   (Short: \p{Nv=49}) (1: U+32BE)
        T \p{Numeric_Value: 50}   (Short: \p{Nv=50}) (29: U+1376, U+216C,
                                    U+217C, U+2186, U+324C, U+32BF ...)
        T \p{Numeric_Value: 60}   (Short: \p{Nv=60}) (13: U+1377, U+324D,
                                    U+10115, U+102EF, U+109CE, U+10E6E ...)
        T \p{Numeric_Value: 70}   (Short: \p{Nv=70}) (13: U+1378, U+324E,
                                    U+10116, U+102F0, U+109CF, U+10E6F ...)
        T \p{Numeric_Value: 80}   (Short: \p{Nv=80}) (12: U+1379, U+324F,
                                    U+10117, U+102F1, U+10E70, U+11062 ...)
        T \p{Numeric_Value: 90}   (Short: \p{Nv=90}) (12: U+137A, U+10118,
                                    U+102F2, U+10341, U+10E71, U+11063 ...)
        T \p{Numeric_Value: 100}  (Short: \p{Nv=100}) (34: U+0BF1, U+0D71,
                                    U+137B, U+216D, U+217D, U+4F70 ...)
        T \p{Numeric_Value: 200}  (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
                                    U+109D3, U+10E73, U+1EC84, U+1ED14)
        T \p{Numeric_Value: 300}  (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
                                    U+102F5, U+109D4, U+10E74, U+1EC85 ...)
        T \p{Numeric_Value: 400}  (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
                                    U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
        T \p{Numeric_Value: 500}  (Short: \p{Nv=500}) (16: U+216E, U+217E,
                                    U+1011D, U+10145, U+1014C, U+10153 ...)
        T \p{Numeric_Value: 600}  (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
                                    U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
        T \p{Numeric_Value: 700}  (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
                                    U+109D8, U+10E78, U+1EC89, U+1ED19)
        T \p{Numeric_Value: 800}  (Short: \p{Nv=800}) (6: U+10120, U+102FA,
                                    U+109D9, U+10E79, U+1EC8A, U+1ED1A)
        T \p{Numeric_Value: 900}  (Short: \p{Nv=900}) (7: U+10121, U+102FB,
                                    U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
        T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
                                    U+216F, U+217F..2180, U+4EDF, U+5343 ...)
        T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
                                    U+1EC8D, U+1ED1D, U+1ED3A)
        T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
                                    U+1EC8E, U+1ED1E)
        T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
                                    U+1EC8F, U+1ED1F)
        T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
                                    U+10146, U+1014E, U+10172, U+109DF ...)
        T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
                                    U+1EC91, U+1ED21)
        T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
                                    U+1EC92, U+1ED22)
        T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
                                    U+1EC93, U+1ED23)
        T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
                                    U+1EC94, U+1ED24)
        T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
                                    U+137C, U+2182, U+4E07, U+842C, U+1012B,
                                    U+10155 ...)
        T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
                                    U+1012C, U+109E5, U+1EC96, U+1ED26)
        T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
                                    U+1012D, U+109E6, U+1EC97, U+1ED27)
        T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
                                    U+1012E, U+109E7, U+1EC98, U+1ED28)
        T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
                                    U+2187, U+1012F, U+10147, U+10156,
                                    U+109E8, U+1EC99 ...)
        T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
                                    U+10130, U+109E9, U+1EC9A, U+1ED2A)
        T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
                                    U+10131, U+109EA, U+1EC9B, U+1ED2B)
        T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
                                    U+10132, U+109EB, U+1EC9C, U+1ED2C)
        T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
                                    U+10133, U+109EC, U+1EC9D, U+1ED2D)
        T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
                                    U+2188, U+109ED, U+1EC9E, U+1ECA0,
                                    U+1ECB4)
        T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
                                    U+109EE, U+1EC9F)
        T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
                                    U+12432)
        T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
                                    U+109EF)
        T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
                                    U+109F0)
        T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
                                    U+12433)
        T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
                                    U+109F1)
        T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
                                    U+109F2)
        T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
                                    U+109F3)
        T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
                                    U+109F4)
        T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
                                    U+109F5)
        T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
                                    U+16B5E)
        T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
                                    (1: U+1ECA1)
        T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
                                    (1: U+1ECA2)
        T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
                                    (3: U+4EBF, U+5104, U+16B5F)
        T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
                                    10000000000}) (1: U+16B60)
        T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
                                    1000000000000}) (2: U+5146, U+16B61)
          \p{Numeric_Value: NaN}  (Short: \p{Nv=NaN}) (1_112_277 plus all
                                    above-Unicode code points: [\x00-
                                    \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
                                    Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
                                    \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
                                    U+066A..06EF, U+06FA..07BF,
                                    U+07CA..0965, U+0970..09E5 ...)
          \p{Nushu}               \p{Script_Extensions=Nushu} (Short:
                                    \p{Nshu}; NOT \p{Block=Nushu}) (397)
          \p{Nv: *}               \p{Numeric_Value: *}
          \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
                                    Nyiakeng_Puachue_Hmong} (Short:
                                    \p{Hmnp}; NOT \p{Block=
                                    Nyiakeng_Puachue_Hmong}) (71)
        X \p{OCR}                 \p{Optical_Character_Recognition} (=
                                    \p{Block=Optical_Character_Recognition})
                                    (32)
          \p{Ogam}                \p{Ogham} (= \p{Script_Extensions=Ogham})
                                    (NOT \p{Block=Ogham}) (29)
          \p{Ogham}               \p{Script_Extensions=Ogham} (Short:
                                    \p{Ogam}; NOT \p{Block=Ogham}) (29)
          \p{Ol_Chiki}            \p{Script_Extensions=Ol_Chiki} (Short:
                                    \p{Olck}) (48)
          \p{Olck}                \p{Ol_Chiki} (= \p{Script_Extensions=
                                    Ol_Chiki}) (48)
          \p{Old_Hungarian}       \p{Script_Extensions=Old_Hungarian}
                                    (Short: \p{Hung}; NOT \p{Block=
                                    Old_Hungarian}) (108)
          \p{Old_Italic}          \p{Script_Extensions=Old_Italic} (Short:
                                    \p{Ital}; NOT \p{Block=Old_Italic}) (39)
          \p{Old_North_Arabian}   \p{Script_Extensions=Old_North_Arabian}
                                    (Short: \p{Narb}) (32)
          \p{Old_Permic}          \p{Script_Extensions=Old_Permic} (Short:
                                    \p{Perm}; NOT \p{Block=Old_Permic}) (44)
          \p{Old_Persian}         \p{Script_Extensions=Old_Persian} (Short:
                                    \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
          \p{Old_Sogdian}         \p{Script_Extensions=Old_Sogdian} (Short:
                                    \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
          \p{Old_South_Arabian}   \p{Script_Extensions=Old_South_Arabian}
                                    (Short: \p{Sarb}) (32)
          \p{Old_Turkic}          \p{Script_Extensions=Old_Turkic} (Short:
                                    \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
          \p{Open_Punctuation}    \p{General_Category=Open_Punctuation}
                                    (Short: \p{Ps}) (75)
        X \p{Optical_Character_Recognition} \p{Block=
                                    Optical_Character_Recognition} (Short:
                                    \p{InOCR}) (32)
          \p{Oriya}               \p{Script_Extensions=Oriya} (Short:
                                    \p{Orya}; NOT \p{Block=Oriya}) (96)
          \p{Orkh}                \p{Old_Turkic} (= \p{Script_Extensions=
                                    Old_Turkic}) (NOT \p{Block=Old_Turkic})
                                    (73)
        X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
          \p{Orya}                \p{Oriya} (= \p{Script_Extensions=Oriya})
                                    (NOT \p{Block=Oriya}) (96)
          \p{Osage}               \p{Script_Extensions=Osage} (Short:
                                    \p{Osge}; NOT \p{Block=Osage}) (72)
          \p{Osge}                \p{Osage} (= \p{Script_Extensions=Osage})
                                    (NOT \p{Block=Osage}) (72)
          \p{Osma}                \p{Osmanya} (= \p{Script_Extensions=
                                    Osmanya}) (NOT \p{Block=Osmanya}) (40)
          \p{Osmanya}             \p{Script_Extensions=Osmanya} (Short:
                                    \p{Osma}; NOT \p{Block=Osmanya}) (40)
          \p{Other}               \p{General_Category=Other} (Short: \p{C})
                                    (976_344 plus all above-Unicode code
                                    points)
          \p{Other_Letter}        \p{General_Category=Other_Letter} (Short:
                                    \p{Lo}) (121_414)
          \p{Other_Number}        \p{General_Category=Other_Number} (Short:
                                    \p{No}) (888)
          \p{Other_Punctuation}   \p{General_Category=Other_Punctuation}
                                    (Short: \p{Po}) (588)
          \p{Other_Symbol}        \p{General_Category=Other_Symbol} (Short:
                                    \p{So}) (6161)
        X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
          \p{P} \pP               \p{Punct} (= \p{General_Category=
                                    Punctuation}) (NOT
                                    \p{General_Punctuation}) (792)
          \p{Pahawh_Hmong}        \p{Script_Extensions=Pahawh_Hmong} (Short:
                                    \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
                                    (127)
          \p{Palm}                \p{Palmyrene} (= \p{Script_Extensions=
                                    Palmyrene}) (32)
          \p{Palmyrene}           \p{Script_Extensions=Palmyrene} (Short:
                                    \p{Palm}) (32)
          \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
                                    (Short: \p{Zp}) (1)
          \p{Pat_Syn}             \p{Pattern_Syntax} (= \p{Pattern_Syntax=
                                    Y}) (2760)
          \p{Pat_Syn: *}          \p{Pattern_Syntax: *}
          \p{Pat_WS}              \p{Pattern_White_Space} (=
                                    \p{Pattern_White_Space=Y}) (11)
          \p{Pat_WS: *}           \p{Pattern_White_Space: *}
          \p{Pattern_Syntax}      \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
                                    (2760)
          \p{Pattern_Syntax: N*}  (Short: \p{PatSyn=N}, \P{PatSyn})
                                    (1_111_352 plus all above-Unicode code
                                    points: [\x00-\x200-9A-Z_a-z\x7f-
                                    \xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb7-
                                    \xba\xbc-\xbe\xc0-\xd6\xd8-\xf6\xf8-
                                    \xff], U+0100..200F, U+2028..202F,
                                    U+203F..2040, U+2054, U+205F..218F ...)
          \p{Pattern_Syntax: Y*}  (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
                                    [!\"#\$\%&\'\(\)*+,\-.\/:;<=
                                    >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
                                    \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
                                    U+2010..2027, U+2030..203E,
                                    U+2041..2053, U+2055..205E, U+2190..245F
                                    ...)
          \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
                                    \p{PatWS}) (11)
          \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
                                    (1_114_101 plus all above-Unicode code
                                    points: [^\t\n\cK\f\r\x20\x85],
                                    U+0100..200D, U+2010..2027,
                                    U+202A..infinity)
          \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
                                    [\t\n\cK\f\r\x20\x85], U+200E..200F,
                                    U+2028..2029)
          \p{Pau_Cin_Hau}         \p{Script_Extensions=Pau_Cin_Hau} (Short:
                                    \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
          \p{Pauc}                \p{Pau_Cin_Hau} (= \p{Script_Extensions=
                                    Pau_Cin_Hau}) (NOT \p{Block=
                                    Pau_Cin_Hau}) (57)
          \p{Pc}                  \p{Connector_Punctuation} (=
                                    \p{General_Category=
                                    Connector_Punctuation}) (10)
          \p{PCM}                 \p{Prepended_Concatenation_Mark} (=
                                    \p{Prepended_Concatenation_Mark=Y}) (11)
          \p{PCM: *}              \p{Prepended_Concatenation_Mark: *}
          \p{Pd}                  \p{Dash_Punctuation} (=
                                    \p{General_Category=Dash_Punctuation})
                                    (24)
          \p{Pe}                  \p{Close_Punctuation} (=
                                    \p{General_Category=Close_Punctuation})
                                    (73)
          \p{PerlSpace}           \p{PosixSpace} (6)
          \p{PerlWord}            \p{PosixWord} (63)
          \p{Perm}                \p{Old_Permic} (= \p{Script_Extensions=
                                    Old_Permic}) (NOT \p{Block=Old_Permic})
                                    (44)
          \p{Pf}                  \p{Final_Punctuation} (=
                                    \p{General_Category=Final_Punctuation})
                                    (10)
          \p{Phag}                \p{Phags_Pa} (= \p{Script_Extensions=
                                    Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
          \p{Phags_Pa}            \p{Script_Extensions=Phags_Pa} (Short:
                                    \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
        X \p{Phaistos}            \p{Phaistos_Disc} (= \p{Block=
                                    Phaistos_Disc}) (48)
        X \p{Phaistos_Disc}       \p{Block=Phaistos_Disc} (Short:
                                    \p{InPhaistos}) (48)
          \p{Phli}                \p{Inscriptional_Pahlavi} (=
                                    \p{Script_Extensions=
                                    Inscriptional_Pahlavi}) (NOT \p{Block=
                                    Inscriptional_Pahlavi}) (27)
          \p{Phlp}                \p{Psalter_Pahlavi} (=
                                    \p{Script_Extensions=Psalter_Pahlavi})
                                    (NOT \p{Block=Psalter_Pahlavi}) (30)
          \p{Phnx}                \p{Phoenician} (= \p{Script_Extensions=
                                    Phoenician}) (NOT \p{Block=Phoenician})
                                    (29)
          \p{Phoenician}          \p{Script_Extensions=Phoenician} (Short:
                                    \p{Phnx}; NOT \p{Block=Phoenician}) (29)
        X \p{Phonetic_Ext}        \p{Phonetic_Extensions} (= \p{Block=
                                    Phonetic_Extensions}) (128)
        X \p{Phonetic_Ext_Sup}    \p{Phonetic_Extensions_Supplement} (=
                                    \p{Block=
                                    Phonetic_Extensions_Supplement}) (64)
        X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
                                    \p{InPhoneticExt}) (128)
        X \p{Phonetic_Extensions_Supplement} \p{Block=
                                    Phonetic_Extensions_Supplement} (Short:
                                    \p{InPhoneticExtSup}) (64)
          \p{Pi}                  \p{Initial_Punctuation} (=
                                    \p{General_Category=
                                    Initial_Punctuation}) (12)
        X \p{Playing_Cards}       \p{Block=Playing_Cards} (96)
          \p{Plrd}                \p{Miao} (= \p{Script_Extensions=Miao})
                                    (NOT \p{Block=Miao}) (149)
          \p{Po}                  \p{Other_Punctuation} (=
                                    \p{General_Category=Other_Punctuation})
                                    (588)
          \p{PosixAlnum}          (62: [0-9A-Za-z])
          \p{PosixAlpha}          (52: [A-Za-z])
          \p{PosixBlank}          (2: [\t\x20])
          \p{PosixCntrl}          ASCII control characters (33: ACK, BEL,
                                    BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
                                    DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
                                    FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
                                    SOH, STX, SUB, SYN, US, VT)
          \p{PosixDigit}          (10: [0-9])
          \p{PosixGraph}          (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                                    Z\[\\\]\^_`a-z\{\|\}~])
          \p{PosixLower}          (/i= PosixAlpha) (26: [a-z])
          \p{PosixPrint}          (95: [\x20-\x7e])
          \p{PosixPunct}          (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=
                                    >?\@\[\\\]\^_`\{\|\}~])
          \p{PosixSpace}          (Short: \p{PerlSpace}) (6:
                                    [\t\n\cK\f\r\x20])
          \p{PosixUpper}          (/i= PosixAlpha) (26: [A-Z])
          \p{PosixWord}           \w, restricted to ASCII (Short:
                                    \p{PerlWord}) (63: [0-9A-Z_a-z])
          \p{PosixXDigit}         \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
                                    (22)
          \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
                                    Y} (Short: \p{PCM}) (11)
          \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
                                    (1_114_101 plus all above-Unicode code
                                    points: U+0000..05FF, U+0606..06DC,
                                    U+06DE..070E, U+0710..08E1,
                                    U+08E3..110BC, U+110BE..110CC ...)
          \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
                                    (11: U+0600..0605, U+06DD, U+070F,
                                    U+08E2, U+110BD, U+110CD)
        T \p{Present_In: 1.1}     \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
                                    extension) (33_979)
        T \p{Present_In: 2.0}     Code point's usage introduced in version
                                    2.0 or earlier (Short: \p{In=2.0}) (Perl
                                    extension) (178_500: U+0000..01F5,
                                    U+01FA..0217, U+0250..02A8,
                                    U+02B0..02DE, U+02E0..02E9, U+0300..0345
                                    ...)
          \p{Present_In: V2_0}    \p{Present_In=2.0} (Perl extension)
                                    (178_500)
        T \p{Present_In: 2.1}     Code point's usage introduced in version
                                    2.1 or earlier (Short: \p{In=2.1}) (Perl
                                    extension) (178_502: U+0000..01F5,
                                    U+01FA..0217, U+0250..02A8,
                                    U+02B0..02DE, U+02E0..02E9, U+0300..0345
                                    ...)
          \p{Present_In: V2_1}    \p{Present_In=2.1} (Perl extension)
                                    (178_502)
        T \p{Present_In: 3.0}     Code point's usage introduced in version
                                    3.0 or earlier (Short: \p{In=3.0}) (Perl
                                    extension) (188_809: U+0000..021F,
                                    U+0222..0233, U+0250..02AD,
                                    U+02B0..02EE, U+0300..034E, U+0360..0362
                                    ...)
          \p{Present_In: V3_0}    \p{Present_In=3.0} (Perl extension)
                                    (188_809)
        T \p{Present_In: 3.1}     Code point's usage introduced in version
                                    3.1 or earlier (Short: \p{In=3.1}) (Perl
                                    extension) (233_787: U+0000..021F,
                                    U+0222..0233, U+0250..02AD,
                                    U+02B0..02EE, U+0300..034E, U+0360..0362
                                    ...)
          \p{Present_In: V3_1}    \p{Present_In=3.1} (Perl extension)
                                    (233_787)
        T \p{Present_In: 3.2}     Code point's usage introduced in version
                                    3.2 or earlier (Short: \p{In=3.2}) (Perl
                                    extension) (234_803: U+0000..0220,
                                    U+0222..0233, U+0250..02AD,
                                    U+02B0..02EE, U+0300..034F, U+0360..036F
                                    ...)
          \p{Present_In: V3_2}    \p{Present_In=3.2} (Perl extension)
                                    (234_803)
4611        T \p{Present_In: 4.0}     Code point's usage introduced in version
4612                                    4.0 or earlier (Short: \p{In=4.0}) (Perl
4613                                    extension) (236_029: U+0000..0236,
4614                                    U+0250..0357, U+035D..036F,
4615                                    U+0374..0375, U+037A, U+037E ...)
4616          \p{Present_In: V4_0}    \p{Present_In=4.0} (Perl extension)
4617                                    (236_029)
4618        T \p{Present_In: 4.1}     Code point's usage introduced in version
4619                                    4.1 or earlier (Short: \p{In=4.1}) (Perl
4620                                    extension) (237_302: U+0000..0241,
4621                                    U+0250..036F, U+0374..0375, U+037A,
4622                                    U+037E, U+0384..038A ...)
4623          \p{Present_In: V4_1}    \p{Present_In=4.1} (Perl extension)
4624                                    (237_302)
4625        T \p{Present_In: 5.0}     Code point's usage introduced in version
4626                                    5.0 or earlier (Short: \p{In=5.0}) (Perl
4627                                    extension) (238_671: U+0000..036F,
4628                                    U+0374..0375, U+037A..037E,
4629                                    U+0384..038A, U+038C, U+038E..03A1 ...)
4630          \p{Present_In: V5_0}    \p{Present_In=5.0} (Perl extension)
4631                                    (238_671)
4632        T \p{Present_In: 5.1}     Code point's usage introduced in version
4633                                    5.1 or earlier (Short: \p{In=5.1}) (Perl
4634                                    extension) (240_295: U+0000..0377,
4635                                    U+037A..037E, U+0384..038A, U+038C,
4636                                    U+038E..03A1, U+03A3..0523 ...)
4637          \p{Present_In: V5_1}    \p{Present_In=5.1} (Perl extension)
4638                                    (240_295)
4639        T \p{Present_In: 5.2}     Code point's usage introduced in version
4640                                    5.2 or earlier (Short: \p{In=5.2}) (Perl
4641                                    extension) (246_943: U+0000..0377,
4642                                    U+037A..037E, U+0384..038A, U+038C,
4643                                    U+038E..03A1, U+03A3..0525 ...)
4644          \p{Present_In: V5_2}    \p{Present_In=5.2} (Perl extension)
4645                                    (246_943)
4646        T \p{Present_In: 6.0}     Code point's usage introduced in version
4647                                    6.0 or earlier (Short: \p{In=6.0}) (Perl
4648                                    extension) (249_031: U+0000..0377,
4649                                    U+037A..037E, U+0384..038A, U+038C,
4650                                    U+038E..03A1, U+03A3..0527 ...)
4651          \p{Present_In: V6_0}    \p{Present_In=6.0} (Perl extension)
4652                                    (249_031)
4653        T \p{Present_In: 6.1}     Code point's usage introduced in version
4654                                    6.1 or earlier (Short: \p{In=6.1}) (Perl
4655                                    extension) (249_763: U+0000..0377,
4656                                    U+037A..037E, U+0384..038A, U+038C,
4657                                    U+038E..03A1, U+03A3..0527 ...)
4658          \p{Present_In: V6_1}    \p{Present_In=6.1} (Perl extension)
4659                                    (249_763)
        T \p{Present_In: 6.2}     Code point's usage introduced in version
                                    6.2 or earlier (Short: \p{In=6.2}) (Perl
                                    extension) (249_764: U+0000..0377,
                                    U+037A..037E, U+0384..038A, U+038C,
                                    U+038E..03A1, U+03A3..0527 ...)
          \p{Present_In: V6_2}    \p{Present_In=6.2} (Perl extension)
                                    (249_764)
        T \p{Present_In: 6.3}     Code point's usage introduced in version
                                    6.3 or earlier (Short: \p{In=6.3}) (Perl
                                    extension) (249_769: U+0000..0377,
                                    U+037A..037E, U+0384..038A, U+038C,
                                    U+038E..03A1, U+03A3..0527 ...)
          \p{Present_In: V6_3}    \p{Present_In=6.3} (Perl extension)
                                    (249_769)
        T \p{Present_In: 7.0}     Code point's usage introduced in version
                                    7.0 or earlier (Short: \p{In=7.0}) (Perl
                                    extension) (252_603: U+0000..0377,
                                    U+037A..037F, U+0384..038A, U+038C,
                                    U+038E..03A1, U+03A3..052F ...)
          \p{Present_In: V7_0}    \p{Present_In=7.0} (Perl extension)
                                    (252_603)
        T \p{Present_In: 8.0}     Code point's usage introduced in version
                                    8.0 or earlier (Short: \p{In=8.0}) (Perl
                                    extension) (260_319: U+0000..0377,
                                    U+037A..037F, U+0384..038A, U+038C,
                                    U+038E..03A1, U+03A3..052F ...)
          \p{Present_In: V8_0}    \p{Present_In=8.0} (Perl extension)
                                    (260_319)
        T \p{Present_In: 9.0}     Code point's usage introduced in version
                                    9.0 or earlier (Short: \p{In=9.0}) (Perl
                                    extension) (267_819: U+0000..0377,
                                    U+037A..037F, U+0384..038A, U+038C,
                                    U+038E..03A1, U+03A3..052F ...)
          \p{Present_In: V9_0}    \p{Present_In=9.0} (Perl extension)
                                    (267_819)
        T \p{Present_In: 10.0}    Code point's usage introduced in version
                                    10.0 or earlier (Short: \p{In=10.0})
                                    (Perl extension) (276_337: U+0000..0377,
                                    U+037A..037F, U+0384..038A, U+038C,
                                    U+038E..03A1, U+03A3..052F ...)
          \p{Present_In: V10_0}   \p{Present_In=10.0} (Perl extension)
                                    (276_337)
        T \p{Present_In: 11.0}    Code point's usage introduced in version
                                    11.0 or earlier (Short: \p{In=11.0})
                                    (Perl extension) (277_021: U+0000..0377,
                                    U+037A..037F, U+0384..038A, U+038C,
                                    U+038E..03A1, U+03A3..052F ...)
          \p{Present_In: V11_0}   \p{Present_In=11.0} (Perl extension)
                                    (277_021)
        T \p{Present_In: 12.0}    Code point's usage introduced in version
                                    12.0 or earlier (Short: \p{In=12.0})
                                    (Perl extension) (277_575: U+0000..0377,
                                    U+037A..037F, U+0384..038A, U+038C,
                                    U+038E..03A1, U+03A3..052F ...)
          \p{Present_In: V12_0}   \p{Present_In=12.0} (Perl extension)
                                    (277_575)
        T \p{Present_In: 12.1}    Code point's usage introduced in version
                                    12.1 or earlier (Short: \p{In=12.1})
                                    (Perl extension) (277_576: U+0000..0377,
                                    U+037A..037F, U+0384..038A, U+038C,
                                    U+038E..03A1, U+03A3..052F ...)
          \p{Present_In: V12_1}   \p{Present_In=12.1} (Perl extension)
                                    (277_576)
          \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
                                    Unassigned}) (Perl extension) (836_536
                                    plus all above-Unicode code points)
          \p{Print}               \p{XPosixPrint} (275_395)
          \p{Private_Use}         \p{General_Category=Private_Use} (Short:
                                    \p{Co}; NOT \p{Private_Use_Area})
                                    (137_468)
        X \p{Private_Use_Area}    \p{Block=Private_Use_Area} (Short:
                                    \p{InPUA}) (6400)
          \p{Prti}                \p{Inscriptional_Parthian} (=
                                    \p{Script_Extensions=
                                    Inscriptional_Parthian}) (NOT \p{Block=
                                    Inscriptional_Parthian}) (30)
          \p{Ps}                  \p{Open_Punctuation} (=
                                    \p{General_Category=Open_Punctuation})
                                    (75)
          \p{Psalter_Pahlavi}     \p{Script_Extensions=Psalter_Pahlavi}
                                    (Short: \p{Phlp}; NOT \p{Block=
                                    Psalter_Pahlavi}) (30)
        X \p{PUA}                 \p{Private_Use_Area} (= \p{Block=
                                    Private_Use_Area}) (6400)
          \p{Punct}               \p{General_Category=Punctuation} (Short:
                                    \p{P}; NOT \p{General_Punctuation}) (792)
          \p{Punctuation}         \p{Punct} (= \p{General_Category=
                                    Punctuation}) (NOT
                                    \p{General_Punctuation}) (792)
          \p{Qaac}                \p{Coptic} (= \p{Script_Extensions=
                                    Coptic}) (NOT \p{Block=Coptic}) (165)
          \p{Qaai}                \p{Inherited} (= \p{Script_Extensions=
                                    Inherited}) (502)
          \p{QMark}               \p{Quotation_Mark} (= \p{Quotation_Mark=
                                    Y}) (30)
          \p{QMark: *}            \p{Quotation_Mark: *}
          \p{Quotation_Mark}      \p{Quotation_Mark=Y} (Short: \p{QMark})
                                    (30)
          \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_082
                                    plus all above-Unicode code points:
                                    [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=
                                    >?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
                                    \xaa\xac-\xba\xbc-\xff], U+0100..2017,
                                    U+2020..2038, U+203B..2E41,
                                    U+2E43..300B, U+3010..301C ...)
          \p{Quotation_Mark: Y*}  (Short: \p{QMark=Y}, \p{QMark}) (30:
                                    [\"\'\xab\xbb], U+2018..201F,
                                    U+2039..203A, U+2E42, U+300C..300F,
                                    U+301D..301F ...)
          \p{Radical}             \p{Radical=Y} (329)
          \p{Radical: N*}         (Single: \P{Radical}) (1_113_783 plus all
                                    above-Unicode code points: U+0000..2E7F,
                                    U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
          \p{Radical: Y*}         (Single: \p{Radical}) (329: U+2E80..2E99,
                                    U+2E9B..2EF3, U+2F00..2FD5)
          \p{Regional_Indicator}  \p{Regional_Indicator=Y} (Short: \p{RI})
                                    (26)
          \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
                                    plus all above-Unicode code points:
                                    U+0000..1F1E5, U+1F200..infinity)
          \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
                                    U+1F1E6..1F1FF)
          \p{Rejang}              \p{Script_Extensions=Rejang} (Short:
                                    \p{Rjng}; NOT \p{Block=Rejang}) (37)
          \p{RI}                  \p{Regional_Indicator} (=
                                    \p{Regional_Indicator=Y}) (26)
          \p{RI: *}               \p{Regional_Indicator: *}
          \p{Rjng}                \p{Rejang} (= \p{Script_Extensions=
                                    Rejang}) (NOT \p{Block=Rejang}) (37)
          \p{Rohg}                \p{Hanifi_Rohingya} (=
                                    \p{Script_Extensions=Hanifi_Rohingya})
                                    (NOT \p{Block=Hanifi_Rohingya}) (55)
        X \p{Rumi}                \p{Rumi_Numeral_Symbols} (= \p{Block=
                                    Rumi_Numeral_Symbols}) (32)
        X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
                                    \p{InRumi}) (32)
          \p{Runic}               \p{Script_Extensions=Runic} (Short:
                                    \p{Runr}; NOT \p{Block=Runic}) (86)
          \p{Runr}                \p{Runic} (= \p{Script_Extensions=Runic})
                                    (NOT \p{Block=Runic}) (86)
          \p{S} \pS               \p{Symbol} (= \p{General_Category=Symbol})
                                    (7292)
          \p{Samaritan}           \p{Script_Extensions=Samaritan} (Short:
                                    \p{Samr}; NOT \p{Block=Samaritan}) (61)
          \p{Samr}                \p{Samaritan} (= \p{Script_Extensions=
                                    Samaritan}) (NOT \p{Block=Samaritan})
                                    (61)
          \p{Sarb}                \p{Old_South_Arabian} (=
                                    \p{Script_Extensions=Old_South_Arabian})
                                    (32)
          \p{Saur}                \p{Saurashtra} (= \p{Script_Extensions=
                                    Saurashtra}) (NOT \p{Block=Saurashtra})
                                    (82)
          \p{Saurashtra}          \p{Script_Extensions=Saurashtra} (Short:
                                    \p{Saur}; NOT \p{Block=Saurashtra}) (82)
          \p{SB: *}               \p{Sentence_Break: *}
          \p{Sc}                  \p{Currency_Symbol} (=
                                    \p{General_Category=Currency_Symbol})
                                    (62)
          \p{Sc: *}               \p{Script: *}
          \p{Script: Adlam}       (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B,
                                    U+1E950..1E959, U+1E95E..1E95F)
          \p{Script: Adlm}        \p{Script=Adlam} (88)
          \p{Script: Aghb}        \p{Script=Caucasian_Albanian} (=
                                    \p{Script_Extensions=
                                    Caucasian_Albanian}) (53)
          \p{Script: Ahom}        \p{Script_Extensions=Ahom} (Short: \p{Sc=
                                    Ahom}, \p{Ahom}) (58)
          \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
                                    Anatolian_Hieroglyphs} (Short: \p{Sc=
                                    Hluw}, \p{Hluw}) (583)
          \p{Script: Arab}        \p{Script=Arabic} (1281)
          \p{Script: Arabic}      (Short: \p{Sc=Arab}) (1281: U+0600..0604,
                                    U+0606..060B, U+060D..061A, U+061C,
                                    U+061E, U+0620..063F ...)
          \p{Script: Armenian}    (Short: \p{Sc=Armn}) (95: U+0531..0556,
                                    U+0559..0588, U+058A, U+058D..058F,
                                    U+FB13..FB17)
          \p{Script: Armi}        \p{Script=Imperial_Aramaic} (=
                                    \p{Script_Extensions=Imperial_Aramaic})
                                    (31)
          \p{Script: Armn}        \p{Script=Armenian} (95)
          \p{Script: Avestan}     \p{Script_Extensions=Avestan} (Short:
                                    \p{Sc=Avst}, \p{Avst}) (61)
          \p{Script: Avst}        \p{Script=Avestan} (=
                                    \p{Script_Extensions=Avestan}) (61)
          \p{Script: Bali}        \p{Script=Balinese} (=
                                    \p{Script_Extensions=Balinese}) (121)
          \p{Script: Balinese}    \p{Script_Extensions=Balinese} (Short:
                                    \p{Sc=Bali}, \p{Bali}) (121)
          \p{Script: Bamu}        \p{Script=Bamum} (= \p{Script_Extensions=
                                    Bamum}) (657)
          \p{Script: Bamum}       \p{Script_Extensions=Bamum} (Short: \p{Sc=
                                    Bamu}, \p{Bamu}) (657)
          \p{Script: Bass}        \p{Script=Bassa_Vah} (=
                                    \p{Script_Extensions=Bassa_Vah}) (36)
          \p{Script: Bassa_Vah}   \p{Script_Extensions=Bassa_Vah} (Short:
                                    \p{Sc=Bass}, \p{Bass}) (36)
          \p{Script: Batak}       \p{Script_Extensions=Batak} (Short: \p{Sc=
                                    Batk}, \p{Batk}) (56)
          \p{Script: Batk}        \p{Script=Batak} (= \p{Script_Extensions=
                                    Batak}) (56)
          \p{Script: Beng}        \p{Script=Bengali} (96)
          \p{Script: Bengali}     (Short: \p{Sc=Beng}) (96: U+0980..0983,
                                    U+0985..098C, U+098F..0990,
                                    U+0993..09A8, U+09AA..09B0, U+09B2 ...)
          \p{Script: Bhaiksuki}   \p{Script_Extensions=Bhaiksuki} (Short:
                                    \p{Sc=Bhks}, \p{Bhks}) (97)
          \p{Script: Bhks}        \p{Script=Bhaiksuki} (=
                                    \p{Script_Extensions=Bhaiksuki}) (97)
          \p{Script: Bopo}        \p{Script=Bopomofo} (72)
          \p{Script: Bopomofo}    (Short: \p{Sc=Bopo}) (72: U+02EA..02EB,
                                    U+3105..312F, U+31A0..31BA)
          \p{Script: Brah}        \p{Script=Brahmi} (= \p{Script_Extensions=
                                    Brahmi}) (109)
          \p{Script: Brahmi}      \p{Script_Extensions=Brahmi} (Short:
                                    \p{Sc=Brah}, \p{Brah}) (109)
          \p{Script: Brai}        \p{Script=Braille} (=
                                    \p{Script_Extensions=Braille}) (256)
          \p{Script: Braille}     \p{Script_Extensions=Braille} (Short:
                                    \p{Sc=Brai}, \p{Brai}) (256)
          \p{Script: Bugi}        \p{Script=Buginese} (30)
          \p{Script: Buginese}    (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
                                    U+1A1E..1A1F)
          \p{Script: Buhd}        \p{Script=Buhid} (20)
          \p{Script: Buhid}       (Short: \p{Sc=Buhd}) (20: U+1740..1753)
          \p{Script: Cakm}        \p{Script=Chakma} (70)
          \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
                                    Canadian_Aboriginal} (Short: \p{Sc=
                                    Cans}, \p{Cans}) (710)
          \p{Script: Cans}        \p{Script=Canadian_Aboriginal} (=
                                    \p{Script_Extensions=
                                    Canadian_Aboriginal}) (710)
          \p{Script: Cari}        \p{Script=Carian} (= \p{Script_Extensions=
                                    Carian}) (49)
          \p{Script: Carian}      \p{Script_Extensions=Carian} (Short:
                                    \p{Sc=Cari}, \p{Cari}) (49)
          \p{Script: Caucasian_Albanian} \p{Script_Extensions=
                                    Caucasian_Albanian} (Short: \p{Sc=Aghb},
                                    \p{Aghb}) (53)
          \p{Script: Chakma}      (Short: \p{Sc=Cakm}) (70: U+11100..11134,
                                    U+11136..11146)
          \p{Script: Cham}        \p{Script_Extensions=Cham} (Short: \p{Sc=
                                    Cham}, \p{Cham}) (83)
          \p{Script: Cher}        \p{Script=Cherokee} (=
                                    \p{Script_Extensions=Cherokee}) (172)
          \p{Script: Cherokee}    \p{Script_Extensions=Cherokee} (Short:
                                    \p{Sc=Cher}, \p{Cher}) (172)
          \p{Script: Common}      (Short: \p{Sc=Zyyy}) (7805: [\x00-
                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
                                    \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
                                    U+02E5..02E9, U+02EC..02FF, U+0374,
                                    U+037E ...)
          \p{Script: Copt}        \p{Script=Coptic} (137)
          \p{Script: Coptic}      (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
                                    U+2C80..2CF3, U+2CF9..2CFF)
          \p{Script: Cprt}        \p{Script=Cypriot} (55)
          \p{Script: Cuneiform}   \p{Script_Extensions=Cuneiform} (Short:
                                    \p{Sc=Xsux}, \p{Xsux}) (1234)
          \p{Script: Cypriot}     (Short: \p{Sc=Cprt}) (55: U+10800..10805,
                                    U+10808, U+1080A..10835, U+10837..10838,
                                    U+1083C, U+1083F)
          \p{Script: Cyrillic}    (Short: \p{Sc=Cyrl}) (443: U+0400..0484,
                                    U+0487..052F, U+1C80..1C88, U+1D2B,
                                    U+1D78, U+2DE0..2DFF ...)
          \p{Script: Cyrl}        \p{Script=Cyrillic} (443)
          \p{Script: Deseret}     \p{Script_Extensions=Deseret} (Short:
                                    \p{Sc=Dsrt}, \p{Dsrt}) (80)
          \p{Script: Deva}        \p{Script=Devanagari} (154)
          \p{Script: Devanagari}  (Short: \p{Sc=Deva}) (154: U+0900..0950,
                                    U+0955..0963, U+0966..097F, U+A8E0..A8FF)
          \p{Script: Dogr}        \p{Script=Dogra} (60)
          \p{Script: Dogra}       (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
          \p{Script: Dsrt}        \p{Script=Deseret} (=
                                    \p{Script_Extensions=Deseret}) (80)
          \p{Script: Dupl}        \p{Script=Duployan} (143)
          \p{Script: Duployan}    (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
                                    U+1BC70..1BC7C, U+1BC80..1BC88,
                                    U+1BC90..1BC99, U+1BC9C..1BC9F)
          \p{Script: Egyp}        \p{Script=Egyptian_Hieroglyphs} (=
                                    \p{Script_Extensions=
                                    Egyptian_Hieroglyphs}) (1080)
          \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
                                    Egyptian_Hieroglyphs} (Short: \p{Sc=
                                    Egyp}, \p{Egyp}) (1080)
          \p{Script: Elba}        \p{Script=Elbasan} (=
                                    \p{Script_Extensions=Elbasan}) (40)
          \p{Script: Elbasan}     \p{Script_Extensions=Elbasan} (Short:
                                    \p{Sc=Elba}, \p{Elba}) (40)
          \p{Script: Elym}        \p{Script=Elymaic} (=
                                    \p{Script_Extensions=Elymaic}) (23)
          \p{Script: Elymaic}     \p{Script_Extensions=Elymaic} (Short:
                                    \p{Sc=Elym}, \p{Elym}) (23)
          \p{Script: Ethi}        \p{Script=Ethiopic} (=
                                    \p{Script_Extensions=Ethiopic}) (495)
          \p{Script: Ethiopic}    \p{Script_Extensions=Ethiopic} (Short:
                                    \p{Sc=Ethi}, \p{Ethi}) (495)
          \p{Script: Geor}        \p{Script=Georgian} (173)
          \p{Script: Georgian}    (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
                                    U+10C7, U+10CD, U+10D0..10FA,
                                    U+10FC..10FF, U+1C90..1CBA ...)
          \p{Script: Glag}        \p{Script=Glagolitic} (132)
          \p{Script: Glagolitic}  (Short: \p{Sc=Glag}) (132: U+2C00..2C2E,
                                    U+2C30..2C5E, U+1E000..1E006,
                                    U+1E008..1E018, U+1E01B..1E021,
                                    U+1E023..1E024 ...)
          \p{Script: Gong}        \p{Script=Gunjala_Gondi} (63)
          \p{Script: Gonm}        \p{Script=Masaram_Gondi} (75)
          \p{Script: Goth}        \p{Script=Gothic} (= \p{Script_Extensions=
                                    Gothic}) (27)
          \p{Script: Gothic}      \p{Script_Extensions=Gothic} (Short:
                                    \p{Sc=Goth}, \p{Goth}) (27)
          \p{Script: Gran}        \p{Script=Grantha} (85)
          \p{Script: Grantha}     (Short: \p{Sc=Gran}) (85: U+11300..11303,
                                    U+11305..1130C, U+1130F..11310,
                                    U+11313..11328, U+1132A..11330,
                                    U+11332..11333 ...)
          \p{Script: Greek}       (Short: \p{Sc=Grek}) (518: U+0370..0373,
                                    U+0375..0377, U+037A..037D, U+037F,
                                    U+0384, U+0386 ...)
          \p{Script: Grek}        \p{Script=Greek} (518)
          \p{Script: Gujarati}    (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
                                    U+0A85..0A8D, U+0A8F..0A91,
                                    U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
                                    ...)
          \p{Script: Gujr}        \p{Script=Gujarati} (91)
          \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
                                    U+11D60..11D65, U+11D67..11D68,
                                    U+11D6A..11D8E, U+11D90..11D91,
                                    U+11D93..11D98, U+11DA0..11DA9)
          \p{Script: Gurmukhi}    (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
                                    U+0A05..0A0A, U+0A0F..0A10,
                                    U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
                                    ...)
          \p{Script: Guru}        \p{Script=Gurmukhi} (80)
          \p{Script: Han}         (Short: \p{Sc=Han}) (89_233: U+2E80..2E99,
                                    U+2E9B..2EF3, U+2F00..2FD5, U+3005,
                                    U+3007, U+3021..3029 ...)
          \p{Script: Hang}        \p{Script=Hangul} (11_739)
          \p{Script: Hangul}      (Short: \p{Sc=Hang}) (11_739:
                                    U+1100..11FF, U+302E..302F,
                                    U+3131..318E, U+3200..321E,
                                    U+3260..327E, U+A960..A97C ...)
          \p{Script: Hani}        \p{Script=Han} (89_233)
          \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
                                    U+10D00..10D27, U+10D30..10D39)
          \p{Script: Hano}        \p{Script=Hanunoo} (21)
          \p{Script: Hanunoo}     (Short: \p{Sc=Hano}) (21: U+1720..1734)
          \p{Script: Hatr}        \p{Script=Hatran} (= \p{Script_Extensions=
                                    Hatran}) (26)
          \p{Script: Hatran}      \p{Script_Extensions=Hatran} (Short:
                                    \p{Sc=Hatr}, \p{Hatr}) (26)
          \p{Script: Hebr}        \p{Script=Hebrew} (= \p{Script_Extensions=
                                    Hebrew}) (134)
          \p{Script: Hebrew}      \p{Script_Extensions=Hebrew} (Short:
                                    \p{Sc=Hebr}, \p{Hebr}) (134)
          \p{Script: Hira}        \p{Script=Hiragana} (379)
          \p{Script: Hiragana}    (Short: \p{Sc=Hira}) (379: U+3041..3096,
                                    U+309D..309F, U+1B001..1B11E,
                                    U+1B150..1B152, U+1F200)
          \p{Script: Hluw}        \p{Script=Anatolian_Hieroglyphs} (=
                                    \p{Script_Extensions=
                                    Anatolian_Hieroglyphs}) (583)
          \p{Script: Hmng}        \p{Script=Pahawh_Hmong} (=
                                    \p{Script_Extensions=Pahawh_Hmong}) (127)
          \p{Script: Hmnp}        \p{Script=Nyiakeng_Puachue_Hmong} (=
                                    \p{Script_Extensions=
                                    Nyiakeng_Puachue_Hmong}) (71)
          \p{Script: Hung}        \p{Script=Old_Hungarian} (=
                                    \p{Script_Extensions=Old_Hungarian})
                                    (108)
          \p{Script: Imperial_Aramaic} \p{Script_Extensions=
                                    Imperial_Aramaic} (Short: \p{Sc=Armi},
                                    \p{Armi}) (31)
          \p{Script: Inherited}   (Short: \p{Sc=Zinh}) (571: U+0300..036F,
                                    U+0485..0486, U+064B..0655, U+0670,
                                    U+0951..0954, U+1AB0..1ABE ...)
          \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
                                    Inscriptional_Pahlavi} (Short: \p{Sc=
                                    Phli}, \p{Phli}) (27)
          \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
                                    Inscriptional_Parthian} (Short: \p{Sc=
                                    Prti}, \p{Prti}) (30)
          \p{Script: Ital}        \p{Script=Old_Italic} (=
                                    \p{Script_Extensions=Old_Italic}) (39)
          \p{Script: Java}        \p{Script=Javanese} (90)
          \p{Script: Javanese}    (Short: \p{Sc=Java}) (90: U+A980..A9CD,
                                    U+A9D0..A9D9, U+A9DE..A9DF)
          \p{Script: Kaithi}      (Short: \p{Sc=Kthi}) (67: U+11080..110C1,
                                    U+110CD)
          \p{Script: Kali}        \p{Script=Kayah_Li} (47)
          \p{Script: Kana}        \p{Script=Katakana} (304)
          \p{Script: Kannada}     (Short: \p{Sc=Knda}) (89: U+0C80..0C8C,
                                    U+0C8E..0C90, U+0C92..0CA8,
                                    U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
                                    ...)
          \p{Script: Katakana}    (Short: \p{Sc=Kana}) (304: U+30A1..30FA,
                                    U+30FD..30FF, U+31F0..31FF,
                                    U+32D0..32FE, U+3300..3357, U+FF66..FF6F
                                    ...)
          \p{Script: Kayah_Li}    (Short: \p{Sc=Kali}) (47: U+A900..A92D,
                                    U+A92F)
          \p{Script: Khar}        \p{Script=Kharoshthi} (=
                                    \p{Script_Extensions=Kharoshthi}) (68)
          \p{Script: Kharoshthi}  \p{Script_Extensions=Kharoshthi} (Short:
                                    \p{Sc=Khar}, \p{Khar}) (68)
          \p{Script: Khmer}       \p{Script_Extensions=Khmer} (Short: \p{Sc=
                                    Khmr}, \p{Khmr}) (146)
          \p{Script: Khmr}        \p{Script=Khmer} (= \p{Script_Extensions=
                                    Khmer}) (146)
          \p{Script: Khoj}        \p{Script=Khojki} (62)
          \p{Script: Khojki}      (Short: \p{Sc=Khoj}) (62: U+11200..11211,
                                    U+11213..1123E)
          \p{Script: Khudawadi}   (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
                                    U+112F0..112F9)
          \p{Script: Knda}        \p{Script=Kannada} (89)
          \p{Script: Kthi}        \p{Script=Kaithi} (67)
          \p{Script: Lana}        \p{Script=Tai_Tham} (=
                                    \p{Script_Extensions=Tai_Tham}) (127)
          \p{Script: Lao}         \p{Script_Extensions=Lao} (Short: \p{Sc=
                                    Lao}, \p{Lao}) (82)
          \p{Script: Laoo}        \p{Script=Lao} (= \p{Script_Extensions=
                                    Lao}) (82)
          \p{Script: Latin}       (Short: \p{Sc=Latn}) (1366: [A-Za-
                                    z\xaa\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
                                    U+0100..02B8, U+02E0..02E4,
                                    U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
                                    ...)
          \p{Script: Latn}        \p{Script=Latin} (1366)
          \p{Script: Lepc}        \p{Script=Lepcha} (= \p{Script_Extensions=
                                    Lepcha}) (74)
          \p{Script: Lepcha}      \p{Script_Extensions=Lepcha} (Short:
                                    \p{Sc=Lepc}, \p{Lepc}) (74)
          \p{Script: Limb}        \p{Script=Limbu} (68)
          \p{Script: Limbu}       (Short: \p{Sc=Limb}) (68: U+1900..191E,
                                    U+1920..192B, U+1930..193B, U+1940,
                                    U+1944..194F)
          \p{Script: Lina}        \p{Script=Linear_A} (341)
          \p{Script: Linb}        \p{Script=Linear_B} (211)
          \p{Script: Linear_A}    (Short: \p{Sc=Lina}) (341: U+10600..10736,
                                    U+10740..10755, U+10760..10767)
          \p{Script: Linear_B}    (Short: \p{Sc=Linb}) (211: U+10000..1000B,
                                    U+1000D..10026, U+10028..1003A,
                                    U+1003C..1003D, U+1003F..1004D,
                                    U+10050..1005D ...)
          \p{Script: Lisu}        \p{Script_Extensions=Lisu} (Short: \p{Sc=
                                    Lisu}, \p{Lisu}) (48)
          \p{Script: Lyci}        \p{Script=Lycian} (= \p{Script_Extensions=
                                    Lycian}) (29)
          \p{Script: Lycian}      \p{Script_Extensions=Lycian} (Short:
                                    \p{Sc=Lyci}, \p{Lyci}) (29)
          \p{Script: Lydi}        \p{Script=Lydian} (= \p{Script_Extensions=
                                    Lydian}) (27)
          \p{Script: Lydian}      \p{Script_Extensions=Lydian} (Short:
                                    \p{Sc=Lydi}, \p{Lydi}) (27)
          \p{Script: Mahajani}    (Short: \p{Sc=Mahj}) (39: U+11150..11176)
          \p{Script: Mahj}        \p{Script=Mahajani} (39)
          \p{Script: Maka}        \p{Script=Makasar} (=
                                    \p{Script_Extensions=Makasar}) (25)
          \p{Script: Makasar}     \p{Script_Extensions=Makasar} (Short:
                                    \p{Sc=Maka}, \p{Maka}) (25)
          \p{Script: Malayalam}   (Short: \p{Sc=Mlym}) (117: U+0D00..0D03,
                                    U+0D05..0D0C, U+0D0E..0D10,
5124                                    U+0D12..0D44, U+0D46..0D48, U+0D4A..0D4F
5125                                    ...)
5126          \p{Script: Mand}        \p{Script=Mandaic} (29)
5127          \p{Script: Mandaic}     (Short: \p{Sc=Mand}) (29: U+0840..085B,
5128                                    U+085E)
5129          \p{Script: Mani}        \p{Script=Manichaean} (51)
5130          \p{Script: Manichaean}  (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
5131                                    U+10AEB..10AF6)
5132          \p{Script: Marc}        \p{Script=Marchen} (=
5133                                    \p{Script_Extensions=Marchen}) (68)
5134          \p{Script: Marchen}     \p{Script_Extensions=Marchen} (Short:
5135                                    \p{Sc=Marc}, \p{Marc}) (68)
5136          \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
5137                                    U+11D00..11D06, U+11D08..11D09,
5138                                    U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
5139                                    U+11D3F..11D47 ...)
5140          \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
5141                                    \p{Sc=Medf}, \p{Medf}) (91)
5142          \p{Script: Medf}        \p{Script=Medefaidrin} (=
5143                                    \p{Script_Extensions=Medefaidrin}) (91)
5144          \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
5145                                    (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
5146          \p{Script: Mend}        \p{Script=Mende_Kikakui} (=
5147                                    \p{Script_Extensions=Mende_Kikakui})
5148                                    (213)
5149          \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
5150                                    (Short: \p{Sc=Mend}, \p{Mend}) (213)
5151          \p{Script: Merc}        \p{Script=Meroitic_Cursive} (=
5152                                    \p{Script_Extensions=Meroitic_Cursive})
5153                                    (90)
5154          \p{Script: Mero}        \p{Script=Meroitic_Hieroglyphs} (=
5155                                    \p{Script_Extensions=
5156                                    Meroitic_Hieroglyphs}) (32)
5157          \p{Script: Meroitic_Cursive} \p{Script_Extensions=
5158                                    Meroitic_Cursive} (Short: \p{Sc=Merc},
5159                                    \p{Merc}) (90)
5160          \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
5161                                    Meroitic_Hieroglyphs} (Short: \p{Sc=
5162                                    Mero}, \p{Mero}) (32)
5163          \p{Script: Miao}        \p{Script_Extensions=Miao} (Short: \p{Sc=
5164                                    Miao}, \p{Miao}) (149)
5165          \p{Script: Mlym}        \p{Script=Malayalam} (117)
5166          \p{Script: Modi}        (Short: \p{Sc=Modi}) (79: U+11600..11644,
5167                                    U+11650..11659)
5168          \p{Script: Mong}        \p{Script=Mongolian} (167)
5169          \p{Script: Mongolian}   (Short: \p{Sc=Mong}) (167: U+1800..1801,
5170                                    U+1804, U+1806..180E, U+1810..1819,
5171                                    U+1820..1878, U+1880..18AA ...)
5172          \p{Script: Mro}         \p{Script_Extensions=Mro} (Short: \p{Sc=
5173                                    Mro}, \p{Mro}) (43)
5174          \p{Script: Mroo}        \p{Script=Mro} (= \p{Script_Extensions=
5175                                    Mro}) (43)
5176          \p{Script: Mtei}        \p{Script=Meetei_Mayek} (=
5177                                    \p{Script_Extensions=Meetei_Mayek}) (79)
5178          \p{Script: Mult}        \p{Script=Multani} (38)
5179          \p{Script: Multani}     (Short: \p{Sc=Mult}) (38: U+11280..11286,
5180                                    U+11288, U+1128A..1128D, U+1128F..1129D,
5181                                    U+1129F..112A9)
5182          \p{Script: Myanmar}     (Short: \p{Sc=Mymr}) (223: U+1000..109F,
5183                                    U+A9E0..A9FE, U+AA60..AA7F)
5184          \p{Script: Mymr}        \p{Script=Myanmar} (223)
5185          \p{Script: Nabataean}   \p{Script_Extensions=Nabataean} (Short:
5186                                    \p{Sc=Nbat}, \p{Nbat}) (40)
5187          \p{Script: Nand}        \p{Script=Nandinagari} (65)
5188          \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7,
5189                                    U+119AA..119D7, U+119DA..119E4)
5190          \p{Script: Narb}        \p{Script=Old_North_Arabian} (=
5191                                    \p{Script_Extensions=Old_North_Arabian})
5192                                    (32)
5193          \p{Script: Nbat}        \p{Script=Nabataean} (=
5194                                    \p{Script_Extensions=Nabataean}) (40)
5195          \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
5196                                    \p{Sc=Talu}, \p{Talu}) (83)
5197          \p{Script: Newa}        \p{Script_Extensions=Newa} (Short: \p{Sc=
5198                                    Newa}, \p{Newa}) (94)
5199          \p{Script: Nko}         \p{Script_Extensions=Nko} (Short: \p{Sc=
5200                                    Nko}, \p{Nko}) (62)
5201          \p{Script: Nkoo}        \p{Script=Nko} (= \p{Script_Extensions=
5202                                    Nko}) (62)
5203          \p{Script: Nshu}        \p{Script=Nushu} (= \p{Script_Extensions=
5204                                    Nushu}) (397)
5205          \p{Script: Nushu}       \p{Script_Extensions=Nushu} (Short: \p{Sc=
5206                                    Nshu}, \p{Nshu}) (397)
5207          \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
5208                                    Nyiakeng_Puachue_Hmong} (Short: \p{Sc=
5209                                    Hmnp}, \p{Hmnp}) (71)
5210          \p{Script: Ogam}        \p{Script=Ogham} (= \p{Script_Extensions=
5211                                    Ogham}) (29)
5212          \p{Script: Ogham}       \p{Script_Extensions=Ogham} (Short: \p{Sc=
5213                                    Ogam}, \p{Ogam}) (29)
5214          \p{Script: Ol_Chiki}    \p{Script_Extensions=Ol_Chiki} (Short:
5215                                    \p{Sc=Olck}, \p{Olck}) (48)
5216          \p{Script: Olck}        \p{Script=Ol_Chiki} (=
5217                                    \p{Script_Extensions=Ol_Chiki}) (48)
5218          \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
5219                                    (Short: \p{Sc=Hung}, \p{Hung}) (108)
5220          \p{Script: Old_Italic}  \p{Script_Extensions=Old_Italic} (Short:
5221                                    \p{Sc=Ital}, \p{Ital}) (39)
5222          \p{Script: Old_North_Arabian} \p{Script_Extensions=
5223                                    Old_North_Arabian} (Short: \p{Sc=Narb},
5224                                    \p{Narb}) (32)
5225          \p{Script: Old_Permic}  (Short: \p{Sc=Perm}) (43: U+10350..1037A)
5226          \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
5227                                    \p{Sc=Xpeo}, \p{Xpeo}) (50)
5228          \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
5229                                    \p{Sc=Sogo}, \p{Sogo}) (40)
5230          \p{Script: Old_South_Arabian} \p{Script_Extensions=
5231                                    Old_South_Arabian} (Short: \p{Sc=Sarb},
5232                                    \p{Sarb}) (32)
5233          \p{Script: Old_Turkic}  \p{Script_Extensions=Old_Turkic} (Short:
5234                                    \p{Sc=Orkh}, \p{Orkh}) (73)
5235          \p{Script: Oriya}       (Short: \p{Sc=Orya}) (90: U+0B01..0B03,
5236                                    U+0B05..0B0C, U+0B0F..0B10,
5237                                    U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
5238                                    ...)
5239          \p{Script: Orkh}        \p{Script=Old_Turkic} (=
5240                                    \p{Script_Extensions=Old_Turkic}) (73)
5241          \p{Script: Orya}        \p{Script=Oriya} (90)
5242          \p{Script: Osage}       \p{Script_Extensions=Osage} (Short: \p{Sc=
5243                                    Osge}, \p{Osge}) (72)
5244          \p{Script: Osge}        \p{Script=Osage} (= \p{Script_Extensions=
5245                                    Osage}) (72)
5246          \p{Script: Osma}        \p{Script=Osmanya} (=
5247                                    \p{Script_Extensions=Osmanya}) (40)
5248          \p{Script: Osmanya}     \p{Script_Extensions=Osmanya} (Short:
5249                                    \p{Sc=Osma}, \p{Osma}) (40)
5250          \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
5251                                    (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
5252          \p{Script: Palm}        \p{Script=Palmyrene} (=
5253                                    \p{Script_Extensions=Palmyrene}) (32)
5254          \p{Script: Palmyrene}   \p{Script_Extensions=Palmyrene} (Short:
5255                                    \p{Sc=Palm}, \p{Palm}) (32)
5256          \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
5257                                    \p{Sc=Pauc}, \p{Pauc}) (57)
5258          \p{Script: Pauc}        \p{Script=Pau_Cin_Hau} (=
5259                                    \p{Script_Extensions=Pau_Cin_Hau}) (57)
5260          \p{Script: Perm}        \p{Script=Old_Permic} (43)
5261          \p{Script: Phag}        \p{Script=Phags_Pa} (56)
5262          \p{Script: Phags_Pa}    (Short: \p{Sc=Phag}) (56: U+A840..A877)
5263          \p{Script: Phli}        \p{Script=Inscriptional_Pahlavi} (=
5264                                    \p{Script_Extensions=
5265                                    Inscriptional_Pahlavi}) (27)
5266          \p{Script: Phlp}        \p{Script=Psalter_Pahlavi} (29)
5267          \p{Script: Phnx}        \p{Script=Phoenician} (=
5268                                    \p{Script_Extensions=Phoenician}) (29)
5269          \p{Script: Phoenician}  \p{Script_Extensions=Phoenician} (Short:
5270                                    \p{Sc=Phnx}, \p{Phnx}) (29)
5271          \p{Script: Plrd}        \p{Script=Miao} (= \p{Script_Extensions=
5272                                    Miao}) (149)
5273          \p{Script: Prti}        \p{Script=Inscriptional_Parthian} (=
5274                                    \p{Script_Extensions=
5275                                    Inscriptional_Parthian}) (30)
5276          \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
5277                                    U+10B80..10B91, U+10B99..10B9C,
5278                                    U+10BA9..10BAF)
5279          \p{Script: Qaac}        \p{Script=Coptic} (137)
5280          \p{Script: Qaai}        \p{Script=Inherited} (571)
5281          \p{Script: Rejang}      \p{Script_Extensions=Rejang} (Short:
5282                                    \p{Sc=Rjng}, \p{Rjng}) (37)
5283          \p{Script: Rjng}        \p{Script=Rejang} (= \p{Script_Extensions=
5284                                    Rejang}) (37)
5285          \p{Script: Rohg}        \p{Script=Hanifi_Rohingya} (50)
5286          \p{Script: Runic}       \p{Script_Extensions=Runic} (Short: \p{Sc=
5287                                    Runr}, \p{Runr}) (86)
5288          \p{Script: Runr}        \p{Script=Runic} (= \p{Script_Extensions=
5289                                    Runic}) (86)
5290          \p{Script: Samaritan}   \p{Script_Extensions=Samaritan} (Short:
5291                                    \p{Sc=Samr}, \p{Samr}) (61)
5292          \p{Script: Samr}        \p{Script=Samaritan} (=
5293                                    \p{Script_Extensions=Samaritan}) (61)
5294          \p{Script: Sarb}        \p{Script=Old_South_Arabian} (=
5295                                    \p{Script_Extensions=Old_South_Arabian})
5296                                    (32)
5297          \p{Script: Saur}        \p{Script=Saurashtra} (=
5298                                    \p{Script_Extensions=Saurashtra}) (82)
5299          \p{Script: Saurashtra}  \p{Script_Extensions=Saurashtra} (Short:
5300                                    \p{Sc=Saur}, \p{Saur}) (82)
5301          \p{Script: Sgnw}        \p{Script=SignWriting} (=
5302                                    \p{Script_Extensions=SignWriting}) (672)
5303          \p{Script: Sharada}     (Short: \p{Sc=Shrd}) (94: U+11180..111CD,
5304                                    U+111D0..111DF)
5305          \p{Script: Shavian}     \p{Script_Extensions=Shavian} (Short:
5306                                    \p{Sc=Shaw}, \p{Shaw}) (48)
5307          \p{Script: Shaw}        \p{Script=Shavian} (=
5308                                    \p{Script_Extensions=Shavian}) (48)
5309          \p{Script: Shrd}        \p{Script=Sharada} (94)
5310          \p{Script: Sidd}        \p{Script=Siddham} (=
5311                                    \p{Script_Extensions=Siddham}) (92)
5312          \p{Script: Siddham}     \p{Script_Extensions=Siddham} (Short:
5313                                    \p{Sc=Sidd}, \p{Sidd}) (92)
5314          \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
5315                                    \p{Sc=Sgnw}, \p{Sgnw}) (672)
5316          \p{Script: Sind}        \p{Script=Khudawadi} (69)
5317          \p{Script: Sinh}        \p{Script=Sinhala} (110)
5318          \p{Script: Sinhala}     (Short: \p{Sc=Sinh}) (110: U+0D82..0D83,
5319                                    U+0D85..0D96, U+0D9A..0DB1,
5320                                    U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
5321          \p{Script: Sogd}        \p{Script=Sogdian} (42)
5322          \p{Script: Sogdian}     (Short: \p{Sc=Sogd}) (42: U+10F30..10F59)
5323          \p{Script: Sogo}        \p{Script=Old_Sogdian} (=
5324                                    \p{Script_Extensions=Old_Sogdian}) (40)
5325          \p{Script: Sora}        \p{Script=Sora_Sompeng} (=
5326                                    \p{Script_Extensions=Sora_Sompeng}) (35)
5327          \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
5328                                    (Short: \p{Sc=Sora}, \p{Sora}) (35)
5329          \p{Script: Soyo}        \p{Script=Soyombo} (=
5330                                    \p{Script_Extensions=Soyombo}) (83)
5331          \p{Script: Soyombo}     \p{Script_Extensions=Soyombo} (Short:
5332                                    \p{Sc=Soyo}, \p{Soyo}) (83)
5333          \p{Script: Sund}        \p{Script=Sundanese} (=
5334                                    \p{Script_Extensions=Sundanese}) (72)
5335          \p{Script: Sundanese}   \p{Script_Extensions=Sundanese} (Short:
5336                                    \p{Sc=Sund}, \p{Sund}) (72)
5337          \p{Script: Sylo}        \p{Script=Syloti_Nagri} (44)
5338          \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (44: U+A800..A82B)
5339          \p{Script: Syrc}        \p{Script=Syriac} (88)
5340          \p{Script: Syriac}      (Short: \p{Sc=Syrc}) (88: U+0700..070D,
5341                                    U+070F..074A, U+074D..074F, U+0860..086A)
5342          \p{Script: Tagalog}     (Short: \p{Sc=Tglg}) (20: U+1700..170C,
5343                                    U+170E..1714)
5344          \p{Script: Tagb}        \p{Script=Tagbanwa} (18)
5345          \p{Script: Tagbanwa}    (Short: \p{Sc=Tagb}) (18: U+1760..176C,
5346                                    U+176E..1770, U+1772..1773)
5347          \p{Script: Tai_Le}      (Short: \p{Sc=Tale}) (35: U+1950..196D,
5348                                    U+1970..1974)
5349          \p{Script: Tai_Tham}    \p{Script_Extensions=Tai_Tham} (Short:
5350                                    \p{Sc=Lana}, \p{Lana}) (127)
5351          \p{Script: Tai_Viet}    \p{Script_Extensions=Tai_Viet} (Short:
5352                                    \p{Sc=Tavt}, \p{Tavt}) (72)
5353          \p{Script: Takr}        \p{Script=Takri} (67)
5354          \p{Script: Takri}       (Short: \p{Sc=Takr}) (67: U+11680..116B8,
5355                                    U+116C0..116C9)
5356          \p{Script: Tale}        \p{Script=Tai_Le} (35)
5357          \p{Script: Talu}        \p{Script=New_Tai_Lue} (=
5358                                    \p{Script_Extensions=New_Tai_Lue}) (83)
5359          \p{Script: Tamil}       (Short: \p{Sc=Taml}) (123: U+0B82..0B83,
5360                                    U+0B85..0B8A, U+0B8E..0B90,
5361                                    U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
5362          \p{Script: Taml}        \p{Script=Tamil} (123)
5363          \p{Script: Tang}        \p{Script=Tangut} (= \p{Script_Extensions=
5364                                    Tangut}) (6892)
5365          \p{Script: Tangut}      \p{Script_Extensions=Tangut} (Short:
5366                                    \p{Sc=Tang}, \p{Tang}) (6892)
5367          \p{Script: Tavt}        \p{Script=Tai_Viet} (=
5368                                    \p{Script_Extensions=Tai_Viet}) (72)
5369          \p{Script: Telu}        \p{Script=Telugu} (98)
5370          \p{Script: Telugu}      (Short: \p{Sc=Telu}) (98: U+0C00..0C0C,
5371                                    U+0C0E..0C10, U+0C12..0C28,
5372                                    U+0C2A..0C39, U+0C3D..0C44, U+0C46..0C48
5373                                    ...)
5374          \p{Script: Tfng}        \p{Script=Tifinagh} (=
5375                                    \p{Script_Extensions=Tifinagh}) (59)
5376          \p{Script: Tglg}        \p{Script=Tagalog} (20)
5377          \p{Script: Thaa}        \p{Script=Thaana} (50)
5378          \p{Script: Thaana}      (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
5379          \p{Script: Thai}        \p{Script_Extensions=Thai} (Short: \p{Sc=
5380                                    Thai}, \p{Thai}) (86)
5381          \p{Script: Tibetan}     \p{Script_Extensions=Tibetan} (Short:
5382                                    \p{Sc=Tibt}, \p{Tibt}) (207)
5383          \p{Script: Tibt}        \p{Script=Tibetan} (=
5384                                    \p{Script_Extensions=Tibetan}) (207)
5385          \p{Script: Tifinagh}    \p{Script_Extensions=Tifinagh} (Short:
5386                                    \p{Sc=Tfng}, \p{Tfng}) (59)
5387          \p{Script: Tirh}        \p{Script=Tirhuta} (82)
5388          \p{Script: Tirhuta}     (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
5389                                    U+114D0..114D9)
5390          \p{Script: Ugar}        \p{Script=Ugaritic} (=
5391                                    \p{Script_Extensions=Ugaritic}) (31)
5392          \p{Script: Ugaritic}    \p{Script_Extensions=Ugaritic} (Short:
5393                                    \p{Sc=Ugar}, \p{Ugar}) (31)
5394          \p{Script: Unknown}     \p{Script_Extensions=Unknown} (Short:
5395                                    \p{Sc=Zzzz}, \p{Zzzz}) (976_118 plus all
5396                                    above-Unicode code points)
5397          \p{Script: Vai}         \p{Script_Extensions=Vai} (Short: \p{Sc=
5398                                    Vai}, \p{Vai}) (300)
5399          \p{Script: Vaii}        \p{Script=Vai} (= \p{Script_Extensions=
5400                                    Vai}) (300)
5401          \p{Script: Wancho}      \p{Script_Extensions=Wancho} (Short:
5402                                    \p{Sc=Wcho}, \p{Wcho}) (59)
5403          \p{Script: Wara}        \p{Script=Warang_Citi} (=
5404                                    \p{Script_Extensions=Warang_Citi}) (84)
5405          \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
5406                                    \p{Sc=Wara}, \p{Wara}) (84)
5407          \p{Script: Wcho}        \p{Script=Wancho} (= \p{Script_Extensions=
5408                                    Wancho}) (59)
5409          \p{Script: Xpeo}        \p{Script=Old_Persian} (=
5410                                    \p{Script_Extensions=Old_Persian}) (50)
5411          \p{Script: Xsux}        \p{Script=Cuneiform} (=
5412                                    \p{Script_Extensions=Cuneiform}) (1234)
5413          \p{Script: Yi}          (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
5414                                    U+A490..A4C6)
5415          \p{Script: Yiii}        \p{Script=Yi} (1220)
5416          \p{Script: Zanabazar_Square} \p{Script_Extensions=
5417                                    Zanabazar_Square} (Short: \p{Sc=Zanb},
5418                                    \p{Zanb}) (72)
5419          \p{Script: Zanb}        \p{Script=Zanabazar_Square} (=
5420                                    \p{Script_Extensions=Zanabazar_Square})
5421                                    (72)
5422          \p{Script: Zinh}        \p{Script=Inherited} (571)
5423          \p{Script: Zyyy}        \p{Script=Common} (7805)
5424          \p{Script: Zzzz}        \p{Script=Unknown} (=
5425                                    \p{Script_Extensions=Unknown}) (976_118
5426                                    plus all above-Unicode code points)
5427          \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (89:
5428                                    U+0640, U+1E900..1E94B, U+1E950..1E959,
5429                                    U+1E95E..1E95F)
5430          \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (89)
5431          \p{Script_Extensions: Aghb} \p{Script_Extensions=
5432                                    Caucasian_Albanian} (53)
5433          \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (58:
5434                                    U+11700..1171A, U+1171D..1172B,
5435                                    U+11730..1173F)
5436          \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
5437                                    \p{Hluw}) (583: U+14400..14646)
5438          \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1325)
5439          \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
5440                                    (1325: U+0600..0604, U+0606..061C,
5441                                    U+061E..06DC, U+06DE..06FF,
5442                                    U+0750..077F, U+08A0..08B4 ...)
5443          \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
5444                                    (96: U+0531..0556, U+0559..058A,
5445                                    U+058D..058F, U+FB13..FB17)
5446          \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
5447                                    (31)
5448          \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
5449          \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
5450                                    (61: U+10B00..10B35, U+10B39..10B3F)
5451          \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
5452          \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
5453          \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
5454                                    (121: U+1B00..1B4B, U+1B50..1B7C)
5455          \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
5456          \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
5457                                    U+A6A0..A6F7, U+16800..16A38)
5458          \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
5459          \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
5460                                    (36: U+16AD0..16AED, U+16AF0..16AF5)
5461          \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
5462                                    U+1BC0..1BF3, U+1BFC..1BFF)
5463          \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
5464          \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
5465          \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
5466                                    (113: U+0951..0952, U+0964..0965,
5467                                    U+0980..0983, U+0985..098C,
5468                                    U+098F..0990, U+0993..09A8 ...)
5469          \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
5470                                    (97: U+11C00..11C08, U+11C0A..11C36,
5471                                    U+11C38..11C45, U+11C50..11C6C)
5472          \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
5473          \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (112)
5474          \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
5475                                    (112: U+02EA..02EB, U+3001..3003,
5476                                    U+3008..3011, U+3013..301F,
5477                                    U+302A..302D, U+3030 ...)
5478          \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
5479          \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
5480                                    (109: U+11000..1104D, U+11052..1106F,
5481                                    U+1107F)
5482          \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
5483          \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
5484                                    (256: U+2800..28FF)
5485          \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
5486          \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
5487                                    (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
5488          \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
5489          \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
5490                                    U+1735..1736, U+1740..1753)
5491          \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (90)
5492          \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
5493                                    \p{Cans}) (710: U+1400..167F,
5494                                    U+18B0..18F5)
5495          \p{Script_Extensions: Cans} \p{Script_Extensions=
5496                                    Canadian_Aboriginal} (710)
5497          \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
5498          \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
5499                                    U+102A0..102D0)
5500          \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
5501                                    \p{Aghb}) (53: U+10530..10563, U+1056F)
5502          \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (90:
5503                                    U+09E6..09EF, U+1040..1049,
5504                                    U+11100..11134, U+11136..11146)
5505          \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
5506                                    U+AA00..AA36, U+AA40..AA4D,
5507                                    U+AA50..AA59, U+AA5C..AA5F)
5508          \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
5509          \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
5510                                    (172: U+13A0..13F5, U+13F8..13FD,
5511                                    U+AB70..ABBF)
5512          \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
5513                                    (7386: [\x00-
5514                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
5515                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
5516                                    \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
5517                                    U+02E5..02E9, U+02EC..02FF, U+0374,
5518                                    U+037E ...)
5519          \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
5520          \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
                                    (165: U+03E2..03EF, U+2C80..2CF3,
                                    U+2CF9..2CFF, U+102E0..102FB)
          \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
          \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
                                    (1234: U+12000..12399, U+12400..1246E,
                                    U+12470..12474, U+12480..12543)
          \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
                                    (112: U+10100..10102, U+10107..10133,
                                    U+10137..1013F, U+10800..10805, U+10808,
                                    U+1080A..10835 ...)
          \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
                                    (446: U+0400..052F, U+1C80..1C88,
                                    U+1D2B, U+1D78, U+2DE0..2DFF, U+2E43 ...)
          \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (446)
          \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
                                    (80: U+10400..1044F)
          \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
          \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
                                    (210: U+0900..0952, U+0955..097F,
                                    U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
                                    U+A830..A839 ...)
          \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
          \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
                                    U+0964..096F, U+A830..A839,
                                    U+11800..1183B)
          \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
          \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
          \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
                                    (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
                                    U+1BC80..1BC88, U+1BC90..1BC99,
                                    U+1BC9C..1BCA3)
          \p{Script_Extensions: Egyp} \p{Script_Extensions=
                                    Egyptian_Hieroglyphs} (1080)
          \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
                                    \p{Egyp}) (1080: U+13000..1342E,
                                    U+13430..13438)
          \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
          \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
                                    (40: U+10500..10527)
          \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
          \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
                                    (23: U+10FE0..10FF6)
          \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
          \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
                                    (495: U+1200..1248, U+124A..124D,
                                    U+1250..1256, U+1258, U+125A..125D,
                                    U+1260..1288 ...)
          \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (175)
          \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
                                    (175: U+0589, U+10A0..10C5, U+10C7,
                                    U+10CD, U+10D0..10FF, U+1C90..1CBA ...)
          \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136)
          \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
                                    (136: U+0484, U+0487, U+2C00..2C2E,
                                    U+2C30..2C5E, U+2E43, U+A66F ...)
          \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
                                    (65)
          \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
                                    (77)
          \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
          \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
                                    U+10330..1034A)
          \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
          \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
                                    (116: U+0951..0952, U+0964..0965,
                                    U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
                                    U+1CF2..1CF4 ...)
          \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
                                    U+0342, U+0345, U+0370..0373,
                                    U+0375..0377, U+037A..037D, U+037F ...)
          \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
          \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
                                    (105: U+0951..0952, U+0964..0965,
                                    U+0A81..0A83, U+0A85..0A8D,
                                    U+0A8F..0A91, U+0A93..0AA8 ...)
          \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
          \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
                                    \p{Gong}) (65: U+0964..0965,
                                    U+11D60..11D65, U+11D67..11D68,
                                    U+11D6A..11D8E, U+11D90..11D91,
                                    U+11D93..11D98 ...)
          \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
                                    (94: U+0951..0952, U+0964..0965,
                                    U+0A01..0A03, U+0A05..0A0A,
                                    U+0A0F..0A10, U+0A13..0A28 ...)
          \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
          \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (89_513:
                                    U+2E80..2E99, U+2E9B..2EF3,
                                    U+2F00..2FD5, U+3001..3003,
                                    U+3005..3011, U+3013..301F ...)
          \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
          \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
                                    (11_775: U+1100..11FF, U+3001..3003,
                                    U+3008..3011, U+3013..301F,
                                    U+302E..3030, U+3037 ...)
          \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (89_513)
          \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
                                    \p{Rohg}) (55: U+060C, U+061B, U+061F,
                                    U+0640, U+06D4, U+10D00..10D27 ...)
          \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
          \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
                                    (23: U+1720..1736)
          \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
          \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
                                    U+108E0..108F2, U+108F4..108F5,
                                    U+108FB..108FF)
          \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
          \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
                                    (134: U+0591..05C7, U+05D0..05EA,
                                    U+05EF..05F4, U+FB1D..FB36,
                                    U+FB38..FB3C, U+FB3E ...)
          \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (431)
          \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
                                    (431: U+3001..3003, U+3008..3011,
                                    U+3013..301F, U+3030..3035, U+3037,
                                    U+303C..303D ...)
          \p{Script_Extensions: Hluw} \p{Script_Extensions=
                                    Anatolian_Hieroglyphs} (583)
          \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
                                    (127)
          \p{Script_Extensions: Hmnp} \p{Script_Extensions=
                                    Nyiakeng_Puachue_Hmong} (71)
          \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
                                    (108)
          \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
                                    \p{Armi}) (31: U+10840..10855,
                                    U+10857..1085F)
          \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
                                    (502: U+0300..0341, U+0343..0344,
                                    U+0346..0362, U+0953..0954,
                                    U+1AB0..1ABE, U+1DC2..1DF9 ...)
          \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
                                    \p{Phli}) (27: U+10B60..10B72,
                                    U+10B78..10B7F)
          \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
                                    Prti}, \p{Prti}) (30: U+10B40..10B55,
                                    U+10B58..10B5F)
          \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
          \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
          \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
                                    (91: U+A980..A9CD, U+A9CF..A9D9,
                                    U+A9DE..A9DF)
          \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (87:
                                    U+0966..096F, U+A830..A839,
                                    U+11080..110C1, U+110CD)
          \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
          \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (356)
          \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
                                    (104: U+0951..0952, U+0964..0965,
                                    U+0C80..0C8C, U+0C8E..0C90,
                                    U+0C92..0CA8, U+0CAA..0CB3 ...)
          \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
                                    (356: U+3001..3003, U+3008..3011,
                                    U+3013..301F, U+3030..3035, U+3037,
                                    U+303C..303D ...)
          \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
                                    (48: U+A900..A92F)
          \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
          \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
                                    (68: U+10A00..10A03, U+10A05..10A06,
                                    U+10A0C..10A13, U+10A15..10A17,
                                    U+10A19..10A35, U+10A38..10A3A ...)
          \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
                                    U+1780..17DD, U+17E0..17E9,
                                    U+17F0..17F9, U+19E0..19FF)
          \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
          \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (82)
          \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (82:
                                    U+0AE6..0AEF, U+A830..A839,
                                    U+11200..11211, U+11213..1123E)
          \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
                                    (81: U+0964..0965, U+A830..A839,
                                    U+112B0..112EA, U+112F0..112F9)
          \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (104)
          \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (87)
          \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
          \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (82:
                                    U+0E81..0E82, U+0E84, U+0E86..0E8A,
                                    U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
          \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (82)
          \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
                                    (1387: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
                                    \xf6\xf8-\xff], U+0100..02B8,
                                    U+02E0..02E4, U+0363..036F,
                                    U+0485..0486, U+0951..0952 ...)
          \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1387)
          \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
          \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
                                    U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
          \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
          \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
                                    U+0965, U+1900..191E, U+1920..192B,
                                    U+1930..193B, U+1940, U+1944..194F)
          \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
          \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
          \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
                                    (386: U+10107..10133, U+10600..10736,
                                    U+10740..10755, U+10760..10767)
          \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
                                    (268: U+10000..1000B, U+1000D..10026,
                                    U+10028..1003A, U+1003C..1003D,
                                    U+1003F..1004D, U+10050..1005D ...)
          \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (48:
                                    U+A4D0..A4FF)
          \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
          \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
                                    U+10280..1029C)
          \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
          \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
                                    U+10920..10939, U+1093F)
          \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
                                    (61: U+0964..096F, U+A830..A839,
                                    U+11150..11176)
          \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
          \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
          \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
                                    (25: U+11EE0..11EF8)
          \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
                                    (125: U+0951..0952, U+0964..0965,
                                    U+0D00..0D03, U+0D05..0D0C,
                                    U+0D0E..0D10, U+0D12..0D44 ...)
          \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
          \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
                                    (30: U+0640, U+0840..085B, U+085E)
          \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
          \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
                                    (52: U+0640, U+10AC0..10AE6,
                                    U+10AEB..10AF6)
          \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
          \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
                                    (68: U+11C70..11C8F, U+11C92..11CA7,
                                    U+11CA9..11CB6)
          \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
                                    \p{Gonm}) (77: U+0964..0965,
                                    U+11D00..11D06, U+11D08..11D09,
                                    U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
                                    ...)
          \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
                                    (91: U+16E40..16E9A)
          \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
          \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
                                    \p{Mtei}) (79: U+AAE0..AAF6,
                                    U+ABC0..ABED, U+ABF0..ABF9)
          \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
                                    (213)
          \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
                                    \p{Mend}) (213: U+1E800..1E8C4,
                                    U+1E8C7..1E8D6)
          \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
                                    (90)
          \p{Script_Extensions: Mero} \p{Script_Extensions=
                                    Meroitic_Hieroglyphs} (32)
          \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
                                    \p{Merc}) (90: U+109A0..109B7,
                                    U+109BC..109CF, U+109D2..109FF)
          \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
                                    \p{Mero}) (32: U+10980..1099F)
          \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
                                    U+16F00..16F4A, U+16F4F..16F87,
                                    U+16F8F..16F9F)
          \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (125)
          \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
                                    U+A830..A839, U+11600..11644,
                                    U+11650..11659)
          \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (171)
          \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
                                    (171: U+1800..180E, U+1810..1819,
                                    U+1820..1878, U+1880..18AA, U+202F,
                                    U+11660..1166C)
          \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
                                    U+16A40..16A5E, U+16A60..16A69,
                                    U+16A6E..16A6F)
          \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
          \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
          \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
          \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
                                    (48: U+0A66..0A6F, U+11280..11286,
                                    U+11288, U+1128A..1128D, U+1128F..1129D,
                                    U+1129F..112A9)
          \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
                                    (224: U+1000..109F, U+A92E,
                                    U+A9E0..A9FE, U+AA60..AA7F)
          \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
          \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
                                    (40: U+10880..1089E, U+108A7..108AF)
          \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
          \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
                                    (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
                                    U+1CF2, U+1CFA, U+A830..A835 ...)
          \p{Script_Extensions: Narb} \p{Script_Extensions=
                                    Old_North_Arabian} (32)
          \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
          \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
                                    (83: U+1980..19AB, U+19B0..19C9,
                                    U+19D0..19DA, U+19DE..19DF)
          \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (94:
                                    U+11400..11459, U+1145B, U+1145D..1145F)
          \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (62:
                                    U+07C0..07FA, U+07FD..07FF)
          \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (62)
          \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
          \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
                                    U+16FE1, U+1B170..1B2FB)
          \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
                                    Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
                                    U+1E130..1E13D, U+1E140..1E149,
                                    U+1E14E..1E14F)
          \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
          \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
                                    U+1680..169C)
          \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
                                    (48: U+1C50..1C7F)
          \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
          \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
                                    \p{Hung}) (108: U+10C80..10CB2,
                                    U+10CC0..10CF2, U+10CFA..10CFF)
          \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
                                    (39: U+10300..10323, U+1032D..1032F)
          \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
                                    \p{Narb}) (32: U+10A80..10A9F)
          \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
                                    (44: U+0483, U+10350..1037A)
          \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
                                    (50: U+103A0..103C3, U+103C8..103D5)
          \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
                                    (40: U+10F00..10F27)
          \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
                                    \p{Sarb}) (32: U+10A60..10A7F)
          \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
                                    (73: U+10C00..10C48)
          \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (96:
                                    U+0951..0952, U+0964..0965,
                                    U+0B01..0B03, U+0B05..0B0C,
                                    U+0B0F..0B10, U+0B13..0B28 ...)
          \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
          \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (96)
          \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
                                    U+104B0..104D3, U+104D8..104FB)
          \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
          \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
          \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
                                    (40: U+10480..1049D, U+104A0..104A9)
          \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
                                    \p{Hmng}) (127: U+16B00..16B45,
                                    U+16B50..16B59, U+16B5B..16B61,
                                    U+16B63..16B77, U+16B7D..16B8F)
          \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
          \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
                                    (32: U+10860..1087F)
          \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
                                    (57: U+11AC0..11AF8)
          \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
          \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
          \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
          \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
                                    (59: U+1802..1803, U+1805, U+A840..A877)
          \p{Script_Extensions: Phli} \p{Script_Extensions=
                                    Inscriptional_Pahlavi} (27)
          \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
                                    (30)
          \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
          \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
                                    (29: U+10900..1091B, U+1091F)
          \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
          \p{Script_Extensions: Prti} \p{Script_Extensions=
                                    Inscriptional_Parthian} (30)
          \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
                                    \p{Phlp}) (30: U+0640, U+10B80..10B91,
                                    U+10B99..10B9C, U+10BA9..10BAF)
          \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
          \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (502)
          \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
                                    U+A930..A953, U+A95F)
          \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
          \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
                                    (55)
          \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
                                    U+16A0..16EA, U+16EE..16F8)
          \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
          \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
                                    (61: U+0800..082D, U+0830..083E)
          \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
          \p{Script_Extensions: Sarb} \p{Script_Extensions=
                                    Old_South_Arabian} (32)
          \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
          \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
                                    (82: U+A880..A8C5, U+A8CE..A8D9)
          \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
          \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
                                    (100: U+0951, U+1CD7, U+1CD9,
                                    U+1CDC..1CDD, U+1CE0, U+11180..111CD ...)
          \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
                                    (48: U+10450..1047F)
          \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
          \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100)
          \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
          \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
                                    (92: U+11580..115B5, U+115B8..115DD)
          \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
                                    (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
                                    U+1DAA1..1DAAF)
          \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
          \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112)
          \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
                                    (112: U+0964..0965, U+0D82..0D83,
                                    U+0D85..0D96, U+0D9A..0DB1,
                                    U+0DB3..0DBB, U+0DBD ...)
          \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
          \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
                                    (43: U+0640, U+10F30..10F59)
          \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
          \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
          \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
                                    \p{Sora}) (35: U+110D0..110E8,
                                    U+110F0..110F9)
          \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
          \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
                                    (83: U+11A50..11AA2)
          \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
          \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
                                    (72: U+1B80..1BBF, U+1CC0..1CC7)
          \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56)
          \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
                                    \p{Sylo}) (56: U+0964..0965,
                                    U+09E6..09EF, U+A800..A82B)
          \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (105)
          \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
                                    (105: U+060C, U+061B..061C, U+061F,
                                    U+0640, U+064B..0655, U+0670 ...)
          \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
                                    (22: U+1700..170C, U+170E..1714,
                                    U+1735..1736)
          \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
          \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
                                    (20: U+1735..1736, U+1760..176C,
                                    U+176E..1770, U+1772..1773)
          \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
                                    U+1040..1049, U+1950..196D, U+1970..1974)
          \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
                                    (127: U+1A20..1A5E, U+1A60..1A7C,
                                    U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
          \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
                                    (72: U+AA80..AAC2, U+AADB..AADF)
          \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (79)
          \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (79:
                                    U+0964..0965, U+A830..A839,
                                    U+11680..116B8, U+116C0..116C9)
          \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
          \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
          \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
                                    U+0951..0952, U+0964..0965,
                                    U+0B82..0B83, U+0B85..0B8A,
                                    U+0B8E..0B90, U+0B92..0B95 ...)
          \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
          \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6892)
          \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
                                    (6892: U+16FE0, U+17000..187F7,
                                    U+18800..18AF2)
          \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
          \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (104)
          \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
                                    (104: U+0951..0952, U+0964..0965,
                                    U+0C00..0C0C, U+0C0E..0C10,
                                    U+0C12..0C28, U+0C2A..0C39 ...)
          \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
          \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
          \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
          \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
                                    U+060C, U+061B..061C, U+061F,
                                    U+0660..0669, U+0780..07B1, U+FDF2 ...)
          \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
                                    U+0E01..0E3A, U+0E40..0E5B)
          \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
                                    (207: U+0F00..0F47, U+0F49..0F6C,
                                    U+0F71..0F97, U+0F99..0FBC,
                                    U+0FBE..0FCC, U+0FCE..0FD4 ...)
          \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
          \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
                                    (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
          \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
          \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
                                    (97: U+0951..0952, U+0964..0965, U+1CF2,
                                    U+A830..A839, U+11480..114C7,
                                    U+114D0..114D9)
          \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
          \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
                                    (31: U+10380..1039D, U+1039F)
          \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
                                    (976_118 plus all above-Unicode code
                                    points: U+0378..0379, U+0380..0383,
                                    U+038B, U+038D, U+03A2, U+0530 ...)
          \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
                                    U+A500..A62B)
6014          \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
6015          \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59:
6016                                    U+1E2C0..1E2F9, U+1E2FF)
6017          \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
6018          \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
6019                                    (84: U+118A0..118F2, U+118FF)
6020          \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59)
6021          \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
6022          \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
6023          \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
6024                                    U+3001..3002, U+3008..3011,
6025                                    U+3014..301B, U+30FB, U+A000..A48C,
6026                                    U+A490..A4C6 ...)
6027          \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
6028          \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
6029                                    \p{Zanb}) (72: U+11A00..11A47)
6030          \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
6031                                    (72)
6032          \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (502)
6033          \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7386)
6034          \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (976_118
6035                                    plus all above-Unicode code points)
6036          \p{Scx: *}              \p{Script_Extensions: *}
6037          \p{SD}                  \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
6038          \p{SD: *}               \p{Soft_Dotted: *}
6039          \p{Sentence_Break: AT}  \p{Sentence_Break=ATerm} (4)
6040          \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
6041                                    U+FE52, U+FF0E)
6042          \p{Sentence_Break: CL}  \p{Sentence_Break=Close} (187)
6043          \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187:
6044                                    [\"\'\(\)\[\]\{\}\xab\xbb],
6045                                    U+0F3A..0F3D, U+169B..169C,
6046                                    U+2018..201F, U+2039..203A, U+2045..2046
6047                                    ...)
6048          \p{Sentence_Break: CR}  (Short: \p{SB=CR}) (1: [\r])
6049          \p{Sentence_Break: EX}  \p{Sentence_Break=Extend} (2368)
6050          \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2368: U+0300..036F,
6051                                    U+0483..0489, U+0591..05BD, U+05BF,
6052                                    U+05C1..05C2, U+05C4..05C5 ...)
6053          \p{Sentence_Break: FO}  \p{Sentence_Break=Format} (63)
6054          \p{Sentence_Break: Format} (Short: \p{SB=FO}) (63: [\xad],
6055                                    U+0600..0605, U+061C, U+06DD, U+070F,
6056                                    U+08E2 ...)
6057          \p{Sentence_Break: LE}  \p{Sentence_Break=OLetter} (121_822)
6058          \p{Sentence_Break: LF}  (Short: \p{SB=LF}) (1: [\n])
6059          \p{Sentence_Break: LO}  \p{Sentence_Break=Lower} (2293)
6060          \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2293: [a-
6061                                    z\xaa\xb5\xba\xdf-\xf6\xf8-\xff],
6062                                    U+0101, U+0103, U+0105, U+0107, U+0109
6063                                    ...)
6064          \p{Sentence_Break: NU}  \p{Sentence_Break=Numeric} (632)
6065          \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (632: [0-9],
6066                                    U+0660..0669, U+066B..066C,
6067                                    U+06F0..06F9, U+07C0..07C9, U+0966..096F
6068                                    ...)
6069          \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (121_822: U+01BB,
6070                                    U+01C0..01C3, U+0294, U+02B9..02BF,
6071                                    U+02C6..02D1, U+02EC ...)
6072          \p{Sentence_Break: Other} (Short: \p{SB=XX}) (984_661 plus all
6073                                    above-Unicode code points:
6074                                    [^\t\n\cK\f\r\x20!\"\'\(\),\-.0-9:?A-
6075                                    Z\[\]a-z\{\}\x85\xa0\xaa-
6076                                    \xab\xad\xb5\xba-\xbb\xc0-\xd6\xd8-
6077                                    \xf6\xf8-\xff], U+02C2..02C5,
6078                                    U+02D2..02DF, U+02E5..02EB, U+02ED,
6079                                    U+02EF..02FF ...)
6080          \p{Sentence_Break: SC}  \p{Sentence_Break=SContinue} (26)
6081          \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
6082                                    U+055D, U+060C..060D, U+07F8, U+1802,
6083                                    U+1808 ...)
6084          \p{Sentence_Break: SE}  \p{Sentence_Break=Sep} (3)
6085          \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
6086                                    U+2028..2029)
6087          \p{Sentence_Break: Sp}  (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
6088                                    U+1680, U+2000..200A, U+202F, U+205F,
6089                                    U+3000)
6090          \p{Sentence_Break: ST}  \p{Sentence_Break=STerm} (138)
6091          \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (138: [!?], U+0589,
6092                                    U+061E..061F, U+06D4, U+0700..0702,
6093                                    U+07F9 ...)
6094          \p{Sentence_Break: UP}  \p{Sentence_Break=Upper} (1893)
6095          \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1893: [A-Z\xc0-
6096                                    \xd6\xd8-\xde], U+0100, U+0102, U+0104,
6097                                    U+0106, U+0108 ...)
6098          \p{Sentence_Break: XX}  \p{Sentence_Break=Other} (984_661 plus all
6099                                    above-Unicode code points)
6100          \p{Sentence_Terminal}   \p{Sentence_Terminal=Y} (Short: \p{STerm})
6101                                    (141)
6102          \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
6103                                    (1_113_971 plus all above-Unicode code
6104                                    points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
6105                                    \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
6106                                    \xff], U+0100..0588, U+058A..061D,
6107                                    U+0620..06D3, U+06D5..06FF, U+0703..07F8
6108                                    ...)
6109          \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (141:
6110                                    [!.?], U+0589, U+061E..061F, U+06D4,
6111                                    U+0700..0702, U+07F9 ...)
6112          \p{Separator}           \p{General_Category=Separator} (Short:
6113                                    \p{Z}) (19)
6114          \p{Sgnw}                \p{SignWriting} (= \p{Script_Extensions=
6115                                    SignWriting}) (672)
6116          \p{Sharada}             \p{Script_Extensions=Sharada} (Short:
6117                                    \p{Shrd}; NOT \p{Block=Sharada}) (100)
6118          \p{Shavian}             \p{Script_Extensions=Shavian} (Short:
6119                                    \p{Shaw}) (48)
6120          \p{Shaw}                \p{Shavian} (= \p{Script_Extensions=
6121                                    Shavian}) (48)
6122        X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
6123                                    (16)
6124          \p{Shrd}                \p{Sharada} (= \p{Script_Extensions=
6125                                    Sharada}) (NOT \p{Block=Sharada}) (100)
6126          \p{Sidd}                \p{Siddham} (= \p{Script_Extensions=
6127                                    Siddham}) (NOT \p{Block=Siddham}) (92)
6128          \p{Siddham}             \p{Script_Extensions=Siddham} (Short:
6129                                    \p{Sidd}; NOT \p{Block=Siddham}) (92)
6130          \p{SignWriting}         \p{Script_Extensions=SignWriting} (Short:
6131                                    \p{Sgnw}) (672)
6132          \p{Sind}                \p{Khudawadi} (= \p{Script_Extensions=
6133                                    Khudawadi}) (NOT \p{Block=Khudawadi})
6134                                    (81)
6135          \p{Sinh}                \p{Sinhala} (= \p{Script_Extensions=
6136                                    Sinhala}) (NOT \p{Block=Sinhala}) (112)
6137          \p{Sinhala}             \p{Script_Extensions=Sinhala} (Short:
6138                                    \p{Sinh}; NOT \p{Block=Sinhala}) (112)
6139        X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
6140          \p{Sk}                  \p{Modifier_Symbol} (=
6141                                    \p{General_Category=Modifier_Symbol})
6142                                    (121)
6143          \p{Sm}                  \p{Math_Symbol} (= \p{General_Category=
6144                                    Math_Symbol}) (948)
6145        X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
6146                                    \p{InSmallForms}) (32)
6147        X \p{Small_Forms}         \p{Small_Form_Variants} (= \p{Block=
6148                                    Small_Form_Variants}) (32)
6149        X \p{Small_Kana_Ext}      \p{Small_Kana_Extension} (= \p{Block=
6150                                    Small_Kana_Extension}) (64)
6151        X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short:
6152                                    \p{InSmallKanaExt}) (64)
6153          \p{So}                  \p{Other_Symbol} (= \p{General_Category=
6154                                    Other_Symbol}) (6161)
6155          \p{Soft_Dotted}         \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
6156          \p{Soft_Dotted: N*}     (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
6157                                    all above-Unicode code points: [\x00-
6158                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6159                                    Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
6160                                    U+0100..012E, U+0130..0248,
6161                                    U+024A..0267, U+0269..029C, U+029E..02B1
6162                                    ...)
6163          \p{Soft_Dotted: Y*}     (Short: \p{SD=Y}, \p{SD}) (46: [i-j],
6164                                    U+012F, U+0249, U+0268, U+029D, U+02B2
6165                                    ...)
6166          \p{Sogd}                \p{Sogdian} (= \p{Script_Extensions=
6167                                    Sogdian}) (NOT \p{Block=Sogdian}) (43)
6168          \p{Sogdian}             \p{Script_Extensions=Sogdian} (Short:
6169                                    \p{Sogd}; NOT \p{Block=Sogdian}) (43)
6170          \p{Sogo}                \p{Old_Sogdian} (= \p{Script_Extensions=
6171                                    Old_Sogdian}) (NOT \p{Block=
6172                                    Old_Sogdian}) (40)
6173          \p{Sora}                \p{Sora_Sompeng} (= \p{Script_Extensions=
6174                                    Sora_Sompeng}) (NOT \p{Block=
6175                                    Sora_Sompeng}) (35)
6176          \p{Sora_Sompeng}        \p{Script_Extensions=Sora_Sompeng} (Short:
6177                                    \p{Sora}; NOT \p{Block=Sora_Sompeng})
6178                                    (35)
6179          \p{Soyo}                \p{Soyombo} (= \p{Script_Extensions=
6180                                    Soyombo}) (NOT \p{Block=Soyombo}) (83)
6181          \p{Soyombo}             \p{Script_Extensions=Soyombo} (Short:
6182                                    \p{Soyo}; NOT \p{Block=Soyombo}) (83)
6183          \p{Space}               \p{White_Space} (= \p{White_Space=Y}) (25)
6184          \p{Space: *}            \p{White_Space: *}
6185          \p{Space_Separator}     \p{General_Category=Space_Separator}
6186                                    (Short: \p{Zs}) (17)
6187          \p{SpacePerl}           \p{XPosixSpace} (25)
6188          \p{Spacing_Mark}        \p{General_Category=Spacing_Mark} (Short:
6189                                    \p{Mc}) (429)
6190        X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
6191                                    (Short: \p{InModifierLetters}) (80)
6192        X \p{Specials}            \p{Block=Specials} (16)
6193          \p{STerm}               \p{Sentence_Terminal} (=
6194                                    \p{Sentence_Terminal=Y}) (141)
6195          \p{STerm: *}            \p{Sentence_Terminal: *}
6196          \p{Sund}                \p{Sundanese} (= \p{Script_Extensions=
6197                                    Sundanese}) (NOT \p{Block=Sundanese})
6198                                    (72)
6199          \p{Sundanese}           \p{Script_Extensions=Sundanese} (Short:
6200                                    \p{Sund}; NOT \p{Block=Sundanese}) (72)
6201        X \p{Sundanese_Sup}       \p{Sundanese_Supplement} (= \p{Block=
6202                                    Sundanese_Supplement}) (16)
6203        X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
6204                                    \p{InSundaneseSup}) (16)
6205        X \p{Sup_Arrows_A}        \p{Supplemental_Arrows_A} (= \p{Block=
6206                                    Supplemental_Arrows_A}) (16)
6207        X \p{Sup_Arrows_B}        \p{Supplemental_Arrows_B} (= \p{Block=
6208                                    Supplemental_Arrows_B}) (128)
6209        X \p{Sup_Arrows_C}        \p{Supplemental_Arrows_C} (= \p{Block=
6210                                    Supplemental_Arrows_C}) (256)
6211        X \p{Sup_Math_Operators}  \p{Supplemental_Mathematical_Operators} (=
6212                                    \p{Block=
6213                                    Supplemental_Mathematical_Operators})
6214                                    (256)
6215        X \p{Sup_PUA_A}           \p{Supplementary_Private_Use_Area_A} (=
6216                                    \p{Block=
6217                                    Supplementary_Private_Use_Area_A})
6218                                    (65_536)
6219        X \p{Sup_PUA_B}           \p{Supplementary_Private_Use_Area_B} (=
6220                                    \p{Block=
6221                                    Supplementary_Private_Use_Area_B})
6222                                    (65_536)
6223        X \p{Sup_Punctuation}     \p{Supplemental_Punctuation} (= \p{Block=
6224                                    Supplemental_Punctuation}) (128)
6225        X \p{Sup_Symbols_And_Pictographs}
6226                                    \p{Supplemental_Symbols_And_Pictographs}
6227                                    (= \p{Block=
6228                                    Supplemental_Symbols_And_Pictographs})
6229                                    (256)
6230        X \p{Super_And_Sub}       \p{Superscripts_And_Subscripts} (=
6231                                    \p{Block=Superscripts_And_Subscripts})
6232                                    (48)
6233        X \p{Superscripts_And_Subscripts} \p{Block=
6234                                    Superscripts_And_Subscripts} (Short:
6235                                    \p{InSuperAndSub}) (48)
6236        X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
6237                                    \p{InSupArrowsA}) (16)
6238        X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
6239                                    \p{InSupArrowsB}) (128)
6240        X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
6241                                    \p{InSupArrowsC}) (256)
6242        X \p{Supplemental_Mathematical_Operators} \p{Block=
6243                                    Supplemental_Mathematical_Operators}
6244                                    (Short: \p{InSupMathOperators}) (256)
6245        X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
6246                                    (Short: \p{InSupPunctuation}) (128)
6247        X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
6248                                    Supplemental_Symbols_And_Pictographs}
6249                                    (Short: \p{InSupSymbolsAndPictographs})
6250                                    (256)
6251        X \p{Supplementary_Private_Use_Area_A} \p{Block=
6252                                    Supplementary_Private_Use_Area_A}
6253                                    (Short: \p{InSupPUAA}) (65_536)
6254        X \p{Supplementary_Private_Use_Area_B} \p{Block=
6255                                    Supplementary_Private_Use_Area_B}
6256                                    (Short: \p{InSupPUAB}) (65_536)
6257          \p{Surrogate}           \p{General_Category=Surrogate} (Short:
6258                                    \p{Cs}) (2048)
6259        X \p{Sutton_SignWriting}  \p{Block=Sutton_SignWriting} (688)
6260          \p{Sylo}                \p{Syloti_Nagri} (= \p{Script_Extensions=
6261                                    Syloti_Nagri}) (NOT \p{Block=
6262                                    Syloti_Nagri}) (56)
6263          \p{Syloti_Nagri}        \p{Script_Extensions=Syloti_Nagri} (Short:
6264                                    \p{Sylo}; NOT \p{Block=Syloti_Nagri})
6265                                    (56)
6266          \p{Symbol}              \p{General_Category=Symbol} (Short: \p{S})
6267                                    (7292)
6268        X \p{Symbols_And_Pictographs_Ext_A}
6269                                    \p{Symbols_And_Pictographs_Extended_A}
6270                                    (= \p{Block=
6271                                    Symbols_And_Pictographs_Extended_A})
6272                                    (144)
6273        X \p{Symbols_And_Pictographs_Extended_A} \p{Block=
6274                                    Symbols_And_Pictographs_Extended_A} (144)
6275          \p{Syrc}                \p{Syriac} (= \p{Script_Extensions=
6276                                    Syriac}) (NOT \p{Block=Syriac}) (105)
6277          \p{Syriac}              \p{Script_Extensions=Syriac} (Short:
6278                                    \p{Syrc}; NOT \p{Block=Syriac}) (105)
6279        X \p{Syriac_Sup}          \p{Syriac_Supplement} (= \p{Block=
6280                                    Syriac_Supplement}) (16)
6281        X \p{Syriac_Supplement}   \p{Block=Syriac_Supplement} (Short:
6282                                    \p{InSyriacSup}) (16)
6283          \p{Tagalog}             \p{Script_Extensions=Tagalog} (Short:
6284                                    \p{Tglg}; NOT \p{Block=Tagalog}) (22)
6285          \p{Tagb}                \p{Tagbanwa} (= \p{Script_Extensions=
6286                                    Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
6287          \p{Tagbanwa}            \p{Script_Extensions=Tagbanwa} (Short:
6288                                    \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
6289        X \p{Tags}                \p{Block=Tags} (128)
6290          \p{Tai_Le}              \p{Script_Extensions=Tai_Le} (Short:
6291                                    \p{Tale}; NOT \p{Block=Tai_Le}) (45)
6292          \p{Tai_Tham}            \p{Script_Extensions=Tai_Tham} (Short:
6293                                    \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
6294          \p{Tai_Viet}            \p{Script_Extensions=Tai_Viet} (Short:
6295                                    \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
6296        X \p{Tai_Xuan_Jing}       \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
6297                                    Tai_Xuan_Jing_Symbols}) (96)
6298        X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
6299                                    \p{InTaiXuanJing}) (96)
6300          \p{Takr}                \p{Takri} (= \p{Script_Extensions=Takri})
6301                                    (NOT \p{Block=Takri}) (79)
6302          \p{Takri}               \p{Script_Extensions=Takri} (Short:
6303                                    \p{Takr}; NOT \p{Block=Takri}) (79)
6304          \p{Tale}                \p{Tai_Le} (= \p{Script_Extensions=
6305                                    Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
6306          \p{Talu}                \p{New_Tai_Lue} (= \p{Script_Extensions=
6307                                    New_Tai_Lue}) (NOT \p{Block=
6308                                    New_Tai_Lue}) (83)
6309          \p{Tamil}               \p{Script_Extensions=Tamil} (Short:
6310                                    \p{Taml}; NOT \p{Block=Tamil}) (133)
6311        X \p{Tamil_Sup}           \p{Tamil_Supplement} (= \p{Block=
6312                                    Tamil_Supplement}) (64)
6313        X \p{Tamil_Supplement}    \p{Block=Tamil_Supplement} (Short:
6314                                    \p{InTamilSup}) (64)
6315          \p{Taml}                \p{Tamil} (= \p{Script_Extensions=Tamil})
6316                                    (NOT \p{Block=Tamil}) (133)
6317          \p{Tang}                \p{Tangut} (= \p{Script_Extensions=
6318                                    Tangut}) (NOT \p{Block=Tangut}) (6892)
6319          \p{Tangut}              \p{Script_Extensions=Tangut} (Short:
6320                                    \p{Tang}; NOT \p{Block=Tangut}) (6892)
6321        X \p{Tangut_Components}   \p{Block=Tangut_Components} (768)
6322          \p{Tavt}                \p{Tai_Viet} (= \p{Script_Extensions=
6323                                    Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
6324          \p{Telu}                \p{Telugu} (= \p{Script_Extensions=
6325                                    Telugu}) (NOT \p{Block=Telugu}) (104)
6326          \p{Telugu}              \p{Script_Extensions=Telugu} (Short:
6327                                    \p{Telu}; NOT \p{Block=Telugu}) (104)
6328          \p{Term}                \p{Terminal_Punctuation} (=
6329                                    \p{Terminal_Punctuation=Y}) (264)
6330          \p{Term: *}             \p{Terminal_Punctuation: *}
6331          \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
6332                                    \p{Term}) (264)
6333          \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
6334                                    (1_113_848 plus all above-Unicode code
6335                                    points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
6336                                    9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
6337                                    U+0100..037D, U+037F..0386,
6338                                    U+0388..0588, U+058A..05C2, U+05C4..060B
6339                                    ...)
6340          \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (264:
6341                                    [!,.:;?], U+037E, U+0387, U+0589,
6342                                    U+05C3, U+060C ...)
6343          \p{Tfng}                \p{Tifinagh} (= \p{Script_Extensions=
6344                                    Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
6345          \p{Tglg}                \p{Tagalog} (= \p{Script_Extensions=
6346                                    Tagalog}) (NOT \p{Block=Tagalog}) (22)
6347          \p{Thaa}                \p{Thaana} (= \p{Script_Extensions=
6348                                    Thaana}) (NOT \p{Block=Thaana}) (66)
6349          \p{Thaana}              \p{Script_Extensions=Thaana} (Short:
6350                                    \p{Thaa}; NOT \p{Block=Thaana}) (66)
6351          \p{Thai}                \p{Script_Extensions=Thai} (NOT \p{Block=
6352                                    Thai}) (86)
6353          \p{Tibetan}             \p{Script_Extensions=Tibetan} (Short:
6354                                    \p{Tibt}; NOT \p{Block=Tibetan}) (207)
6355          \p{Tibt}                \p{Tibetan} (= \p{Script_Extensions=
6356                                    Tibetan}) (NOT \p{Block=Tibetan}) (207)
6357          \p{Tifinagh}            \p{Script_Extensions=Tifinagh} (Short:
6358                                    \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
6359          \p{Tirh}                \p{Tirhuta} (= \p{Script_Extensions=
6360                                    Tirhuta}) (NOT \p{Block=Tirhuta}) (97)
6361          \p{Tirhuta}             \p{Script_Extensions=Tirhuta} (Short:
6362                                    \p{Tirh}; NOT \p{Block=Tirhuta}) (97)
6363          \p{Title}               \p{Titlecase} (/i= Cased=Yes) (31)
6364          \p{Titlecase}           (= \p{Gc=Lt}) (Short: \p{Title}; /i=
6365                                    Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
6366                                    U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
6367          \p{Titlecase_Letter}    \p{General_Category=Titlecase_Letter}
6368                                    (Short: \p{Lt}; /i= General_Category=
6369                                    Cased_Letter) (31)
6370        X \p{Transport_And_Map}   \p{Transport_And_Map_Symbols} (= \p{Block=
6371                                    Transport_And_Map_Symbols}) (128)
6372        X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
6373                                    (Short: \p{InTransportAndMap}) (128)
6374        X \p{UCAS}                \p{Unified_Canadian_Aboriginal_Syllabics}
6375                                    (= \p{Block=
6376                                    Unified_Canadian_Aboriginal_Syllabics})
6377                                    (640)
6378        X \p{UCAS_Ext}            \p{Unified_Canadian_Aboriginal_Syllabics_-
6379                                    Extended} (= \p{Block=
6380                                    Unified_Canadian_Aboriginal_Syllabics_-
6381                                    Extended}) (80)
6382          \p{Ugar}                \p{Ugaritic} (= \p{Script_Extensions=
6383                                    Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
6384          \p{Ugaritic}            \p{Script_Extensions=Ugaritic} (Short:
6385                                    \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
6386          \p{UIdeo}               \p{Unified_Ideograph} (=
6387                                    \p{Unified_Ideograph=Y}) (87_887)
6388          \p{UIdeo: *}            \p{Unified_Ideograph: *}
6389          \p{Unassigned}          \p{General_Category=Unassigned} (Short:
6390                                    \p{Cn}) (836_602 plus all above-Unicode
6391                                    code points)
6392          \p{Unicode}             \p{Any} (1_114_112)
6393        X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
6394                                    Unified_Canadian_Aboriginal_Syllabics}
6395                                    (Short: \p{InUCAS}) (640)
6396        X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
6397                                    Unified_Canadian_Aboriginal_Syllabics_-
6398                                    Extended} (Short: \p{InUCASExt}) (80)
6399          \p{Unified_Ideograph}   \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
6400                                    (87_887)
6401          \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
6402                                    (1_026_225 plus all above-Unicode code
6403                                    points: U+0000..33FF, U+4DB6..4DFF,
6404                                    U+9FF0..FA0D, U+FA10, U+FA12,
6405                                    U+FA15..FA1E ...)
6406          \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (87_887:
6407                                    U+3400..4DB5, U+4E00..9FEF,
6408                                    U+FA0E..FA0F, U+FA11, U+FA13..FA14,
6409                                    U+FA1F ...)
6410          \p{Unknown}             \p{Script_Extensions=Unknown} (Short:
6411                                    \p{Zzzz}) (976_118 plus all above-
6412                                    Unicode code points)
6413          \p{Upper}               \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6414                                    Cased=Yes) (1908)
6415          \p{Upper: *}            \p{Uppercase: *}
6416          \p{Uppercase}           \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6417                                    Cased=Yes) (1908)
6418          \p{Uppercase: N*}       (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
6419                                    No) (1_112_204 plus all above-Unicode
6420                                    code points: [\x00-
6421                                    \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6422                                    >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
6423                                    \xbf\xd7\xdf-\xff], U+0101, U+0103,
6424                                    U+0105, U+0107, U+0109 ...)
6425          \p{Uppercase: Y*}       (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
6426                                    Yes) (1908: [A-Z\xc0-\xd6\xd8-\xde],
6427                                    U+0100, U+0102, U+0104, U+0106, U+0108
6428                                    ...)
6429          \p{Uppercase_Letter}    \p{General_Category=Uppercase_Letter}
6430                                    (Short: \p{Lu}; /i= General_Category=
6431                                    Cased_Letter) (1788)
6432          \p{Vai}                 \p{Script_Extensions=Vai} (NOT \p{Block=
6433                                    Vai}) (300)
6434          \p{Vaii}                \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
6435                                    \p{Block=Vai}) (300)
6436          \p{Variation_Selector}  \p{Variation_Selector=Y} (Short: \p{VS};
6437                                    NOT \p{Variation_Selectors}) (259)
6438          \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
6439                                    plus all above-Unicode code points:
6440                                    U+0000..180A, U+180E..FDFF,
6441                                    U+FE10..E00FF, U+E01F0..infinity)
6442          \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259:
6443                                    U+180B..180D, U+FE00..FE0F,
6444                                    U+E0100..E01EF)
6445        X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
6446                                    \p{InVS}) (16)
6447        X \p{Variation_Selectors_Supplement} \p{Block=
6448                                    Variation_Selectors_Supplement} (Short:
6449                                    \p{InVSSup}) (240)
6450        X \p{Vedic_Ext}           \p{Vedic_Extensions} (= \p{Block=
6451                                    Vedic_Extensions}) (48)
6452        X \p{Vedic_Extensions}    \p{Block=Vedic_Extensions} (Short:
6453                                    \p{InVedicExt}) (48)
6454        X \p{Vertical_Forms}      \p{Block=Vertical_Forms} (16)
6455          \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
6456                                    (787_620 plus all above-Unicode code
6457                                    points)
6458          \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (787_620 plus
6459                                    all above-Unicode code points: [\x00-
6460                                    \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
6461                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
6462                                    U+02EC..10FF, U+1200..1400,
6463                                    U+1680..18AF, U+1900..2015 ...)
6464          \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
6465                                    Transformed_Rotated} (47)
6466          \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
6467                                    (47: U+2329..232A, U+3008..3011,
6468                                    U+3014..301F, U+3030, U+30A0, U+30FC ...)
6469          \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
6470                                    (148: U+3001..3002, U+3041, U+3043,
6471                                    U+3045, U+3047, U+3049 ...)
6472          \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
6473                                    Transformed_Upright} (148)
6474          \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
6475                                    (326_297)
6476          \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (326_297:
6477                                    [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
6478                                    U+02EA..02EB, U+1100..11FF,
6479                                    U+1401..167F, U+18B0..18FF, U+2016 ...)
6480          \p{VertSpace}           \v (7: [\n\cK\f\r\x85], U+2028..2029)
6481          \p{Vo: *}               \p{Vertical_Orientation: *}
6482          \p{VS}                  \p{Variation_Selector} (=
6483                                    \p{Variation_Selector=Y}) (NOT
6484                                    \p{Variation_Selectors}) (259)
6485          \p{VS: *}               \p{Variation_Selector: *}
6486        X \p{VS_Sup}              \p{Variation_Selectors_Supplement} (=
6487                                    \p{Block=
6488                                    Variation_Selectors_Supplement}) (240)
6489          \p{Wancho}              \p{Script_Extensions=Wancho} (Short:
6490                                    \p{Wcho}; NOT \p{Block=Wancho}) (59)
6491          \p{Wara}                \p{Warang_Citi} (= \p{Script_Extensions=
6492                                    Warang_Citi}) (NOT \p{Block=
6493                                    Warang_Citi}) (84)
6494          \p{Warang_Citi}         \p{Script_Extensions=Warang_Citi} (Short:
6495                                    \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
6496          \p{WB: *}               \p{Word_Break: *}
6497          \p{Wcho}                \p{Wancho} (= \p{Script_Extensions=
6498                                    Wancho}) (NOT \p{Block=Wancho}) (59)
6499          \p{White_Space}         \p{White_Space=Y} (Short: \p{Space}) (25)
6500          \p{White_Space: N*}     (Short: \p{Space=N}, \P{Space}) (1_114_087
6501                                    plus all above-Unicode code points:
6502                                    [^\t\n\cK\f\r\x20\x85\xa0],
6503                                    U+0100..167F, U+1681..1FFF,
6504                                    U+200B..2027, U+202A..202E, U+2030..205E
6505                                    ...)
6506          \p{White_Space: Y*}     (Short: \p{Space=Y}, \p{Space}) (25:
6507                                    [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6508                                    U+2000..200A, U+2028..2029, U+202F,
6509                                    U+205F ...)
6510          \p{Word}                \p{XPosixWord} (128_919)
6511          \p{Word_Break: ALetter} (Short: \p{WB=LE}) (28_693: [A-Za-
6512                                    z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6513                                    \xff], U+0100..02D7, U+02DE..02E4,
6514                                    U+02EC..02FF, U+0370..0374, U+0376..0377
6515                                    ...)
6516          \p{Word_Break: CR}      (Short: \p{WB=CR}) (1: [\r])
6517          \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
6518          \p{Word_Break: DQ}      \p{Word_Break=Double_Quote} (1)
6519          \p{Word_Break: E_Base}  (Short: \p{WB=EB}) (0)
6520          \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0)
6521          \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0)
6522          \p{Word_Break: EB}      \p{Word_Break=E_Base} (0)
6523          \p{Word_Break: EBG}     \p{Word_Break=E_Base_GAZ} (0)
6524          \p{Word_Break: EM}      \p{Word_Break=E_Modifier} (0)
6525          \p{Word_Break: EX}      \p{Word_Break=ExtendNumLet} (11)
6526          \p{Word_Break: Extend}  (Short: \p{WB=Extend}) (2372:
6527                                    U+0300..036F, U+0483..0489,
6528                                    U+0591..05BD, U+05BF, U+05C1..05C2,
6529                                    U+05C4..05C5 ...)
6530          \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
6531                                    U+203F..2040, U+2054, U+FE33..FE34,
6532                                    U+FE4D..FE4F ...)
6533          \p{Word_Break: FO}      \p{Word_Break=Format} (62)
6534          \p{Word_Break: Format}  (Short: \p{WB=FO}) (62: [\xad],
6535                                    U+0600..0605, U+061C, U+06DD, U+070F,
6536                                    U+08E2 ...)
6537          \p{Word_Break: GAZ}     \p{Word_Break=Glue_After_Zwj} (0)
6538          \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0)
6539          \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75:
6540                                    U+05D0..05EA, U+05EF..05F2, U+FB1D,
6541                                    U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
6542                                    ...)
6543          \p{Word_Break: HL}      \p{Word_Break=Hebrew_Letter} (75)
6544          \p{Word_Break: KA}      \p{Word_Break=Katakana} (314)
6545          \p{Word_Break: Katakana} (Short: \p{WB=KA}) (314: U+3031..3035,
6546                                    U+309B..309C, U+30A0..30FA,
6547                                    U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
6548                                    ...)
6549          \p{Word_Break: LE}      \p{Word_Break=ALetter} (28_693)
6550          \p{Word_Break: LF}      (Short: \p{WB=LF}) (1: [\n])
6551          \p{Word_Break: MB}      \p{Word_Break=MidNumLet} (7)
6552          \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8: [:\xb7], U+0387,
6553                                    U+05F4, U+2027, U+FE13, U+FE55 ...)
6554          \p{Word_Break: MidNum}  (Short: \p{WB=MN}) (15: [,;], U+037E,
6555                                    U+0589, U+060C..060D, U+066C, U+07F8 ...)
6556          \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
6557                                    U+2018..2019, U+2024, U+FE52, U+FF07,
6558                                    U+FF0E)
6559          \p{Word_Break: ML}      \p{Word_Break=MidLetter} (8)
6560          \p{Word_Break: MN}      \p{Word_Break=MidNum} (15)
6561          \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
6562                                    U+2028..2029)
6563          \p{Word_Break: NL}      \p{Word_Break=Newline} (5)
6564          \p{Word_Break: NU}      \p{Word_Break=Numeric} (631)
6565          \p{Word_Break: Numeric} (Short: \p{WB=NU}) (631: [0-9],
6566                                    U+0660..0669, U+066B, U+06F0..06F9,
6567                                    U+07C0..07C9, U+0966..096F ...)
6568          \p{Word_Break: Other}   (Short: \p{WB=XX}) (1_081_874 plus all
6569                                    above-Unicode code points:
6570                                    [^\n\cK\f\r\x20\"\',.0-9:;A-Z_a-
6571                                    z\x85\xaa\xad\xb5\xb7\xba\xc0-\xd6\xd8-
6572                                    \xf6\xf8-\xff], U+02D8..02DD,
6573                                    U+02E5..02EB, U+0375, U+0378..0379,
6574                                    U+0380..0385 ...)
6575          \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
6576                                    U+1F1E6..1F1FF)
6577          \p{Word_Break: RI}      \p{Word_Break=Regional_Indicator} (26)
6578          \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
6579          \p{Word_Break: SQ}      \p{Word_Break=Single_Quote} (1)
6580          \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20],
6581                                    U+1680, U+2000..2006, U+2008..200A,
6582                                    U+205F, U+3000)
6583          \p{Word_Break: XX}      \p{Word_Break=Other} (1_081_874 plus all
6584                                    above-Unicode code points)
6585          \p{Word_Break: ZWJ}     (Short: \p{WB=ZWJ}) (1: U+200D)
6586          \p{WSpace}              \p{White_Space} (= \p{White_Space=Y}) (25)
6587          \p{WSpace: *}           \p{White_Space: *}
6588          \p{XDigit}              \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
6589          \p{XID_Continue}        \p{XID_Continue=Y} (Short: \p{XIDC})
6590                                    (128_770)
6591          \p{XID_Continue: N*}    (Short: \p{XIDC=N}, \P{XIDC}) (985_342
6592                                    plus all above-Unicode code points:
6593                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
6594                                    >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
6595                                    \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
6596                                    U+02C2..02C5, U+02D2..02DF,
6597                                    U+02E5..02EB, U+02ED, U+02EF..02FF ...)
6598          \p{XID_Continue: Y*}    (Short: \p{XIDC=Y}, \p{XIDC}) (128_770:
6599                                    [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
6600                                    \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6601                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
6602                                    U+02EE ...)
6603          \p{XID_Start}           \p{XID_Start=Y} (Short: \p{XIDS}) (125_861)
6604          \p{XID_Start: N*}       (Short: \p{XIDS=N}, \P{XIDS}) (988_251
6605                                    plus all above-Unicode code points:
6606                                    [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6607                                    >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
6608                                    \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
6609                                    U+02C2..02C5, U+02D2..02DF,
6610                                    U+02E5..02EB, U+02ED, U+02EF..036F ...)
6611          \p{XID_Start: Y*}       (Short: \p{XIDS=Y}, \p{XIDS}) (125_861:
6612                                    [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
6613                                    \xf6\xf8-\xff], U+0100..02C1,
6614                                    U+02C6..02D1, U+02E0..02E4, U+02EC,
6615                                    U+02EE ...)
6616          \p{XIDC}                \p{XID_Continue} (= \p{XID_Continue=Y})
6617                                    (128_770)
6618          \p{XIDC: *}             \p{XID_Continue: *}
6619          \p{XIDS}                \p{XID_Start} (= \p{XID_Start=Y}) (125_861)
6620          \p{XIDS: *}             \p{XID_Start: *}
6621          \p{Xpeo}                \p{Old_Persian} (= \p{Script_Extensions=
6622                                    Old_Persian}) (NOT \p{Block=
6623                                    Old_Persian}) (50)
6624          \p{XPerlSpace}          \p{XPosixSpace} (25)
6625          \p{XPosixAlnum}         Alphabetic and (decimal) Numeric (Short:
6626                                    \p{Alnum}) (127_886: [0-9A-Za-
6627                                    z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6628                                    \xff], U+0100..02C1, U+02C6..02D1,
6629                                    U+02E0..02E4, U+02EC, U+02EE ...)
6630          \p{XPosixAlpha}         \p{Alphabetic=Y} (Short: \p{Alpha})
6631                                    (127_256)
6632          \p{XPosixBlank}         \h, Horizontal white space (Short:
6633                                    \p{Blank}) (18: [\t\x20\xa0], U+1680,
6634                                    U+2000..200A, U+202F, U+205F, U+3000)
6635          \p{XPosixCntrl}         \p{General_Category=Control} Control
6636                                    characters (Short: \p{Cc}) (65)
6637          \p{XPosixDigit}         \p{General_Category=Decimal_Number} [0-9]
6638                                    + all other decimal digits (Short:
6639                                    \p{Nd}) (630)
6640          \p{XPosixGraph}         Characters that are graphical (Short:
6641                                    \p{Graph}) (275_378:
6642                                    [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6643                                    Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
6644                                    U+0100..0377, U+037A..037F,
6645                                    U+0384..038A, U+038C, U+038E..03A1 ...)
6646          \p{XPosixLower}         \p{Lowercase=Y} (Short: \p{Lower}; /i=
6647                                    Cased=Yes) (2340)
6648          \p{XPosixPrint}         Characters that are graphical plus space
6649                                    characters (but no controls) (Short:
6650                                    \p{Print}) (275_395: [\x20-\x7e\xa0-
6651                                    \xff], U+0100..0377, U+037A..037F,
6652                                    U+0384..038A, U+038C, U+038E..03A1 ...)
6653          \p{XPosixPunct}         \p{Punct} + ASCII-range \p{Symbol} (801:
6654                                    [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6655                                    >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
6656                                    \xb7\xbb\xbf], U+037E, U+0387,
6657                                    U+055A..055F, U+0589..058A, U+05BE ...)
6658          \p{XPosixSpace}         \s including beyond ASCII and vertical tab
6659                                    (Short: \p{SpacePerl}) (25:
6660                                    [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6661                                    U+2000..200A, U+2028..2029, U+202F,
6662                                    U+205F ...)
6663          \p{XPosixUpper}         \p{Uppercase=Y} (Short: \p{Upper}; /i=
6664                                    Cased=Yes) (1908)
6665          \p{XPosixWord}          \w, including beyond ASCII; = \p{Alnum} +
6666                                    \pM + \p{Pc} + \p{Join_Control} (Short:
6667                                    \p{Word}) (128_919: [0-9A-Z_a-
6668                                    z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6669                                    \xff], U+0100..02C1, U+02C6..02D1,
6670                                    U+02E0..02E4, U+02EC, U+02EE ...)
6671          \p{XPosixXDigit}        \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
6672          \p{Xsux}                \p{Cuneiform} (= \p{Script_Extensions=
6673                                    Cuneiform}) (NOT \p{Block=Cuneiform})
6674                                    (1234)
6675          \p{Yi}                  \p{Script_Extensions=Yi} (1246)
6676        X \p{Yi_Radicals}         \p{Block=Yi_Radicals} (64)
6677        X \p{Yi_Syllables}        \p{Block=Yi_Syllables} (1168)
6678          \p{Yiii}                \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
6679        X \p{Yijing}              \p{Yijing_Hexagram_Symbols} (= \p{Block=
6680                                    Yijing_Hexagram_Symbols}) (64)
6681        X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
6682                                    (Short: \p{InYijing}) (64)
6683          \p{Z} \pZ               \p{Separator} (= \p{General_Category=
6684                                    Separator}) (19)
6685          \p{Zanabazar_Square}    \p{Script_Extensions=Zanabazar_Square}
6686                                    (Short: \p{Zanb}; NOT \p{Block=
6687                                    Zanabazar_Square}) (72)
6688          \p{Zanb}                \p{Zanabazar_Square} (=
6689                                    \p{Script_Extensions=Zanabazar_Square})
6690                                    (NOT \p{Block=Zanabazar_Square}) (72)
6691          \p{Zinh}                \p{Inherited} (= \p{Script_Extensions=
6692                                    Inherited}) (502)
6693          \p{Zl}                  \p{Line_Separator} (= \p{General_Category=
6694                                    Line_Separator}) (1)
6695          \p{Zp}                  \p{Paragraph_Separator} (=
6696                                    \p{General_Category=
6697                                    Paragraph_Separator}) (1)
6698          \p{Zs}                  \p{Space_Separator} (=
6699                                    \p{General_Category=Space_Separator})
6700                                    (17)
6701          \p{Zyyy}                \p{Common} (= \p{Script_Extensions=
6702                                    Common}) (7386)
6703          \p{Zzzz}                \p{Unknown} (= \p{Script_Extensions=
6704                                    Unknown}) (976_118 plus all above-
6705                                    Unicode code points)
6706
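       The forms listed in the table above can be used directly in regular
       expressions.  The following is a minimal sketch, not taken from the
       Perl distribution: the helper name "matching_forms" and the sample
       code points are chosen here for illustration, and the results depend
       on the Unicode version your perl was built with.

```perl
use strict;
use warnings;

# Return the forms (from a small, hand-picked set) that one character matches.
# The set mixes single forms, a compound form, and short names from the table.
sub matching_forms {
    my ($ch) = @_;
    my @forms;
    push @forms, '\p{Upper}'       if $ch =~ /\p{Upper}/;
    push @forms, '\p{Uppercase=Y}' if $ch =~ /\p{Uppercase=Y}/;
    push @forms, '\p{VS}'          if $ch =~ /\p{VS}/;
    push @forms, '\p{VertSpace}'   if $ch =~ /\p{VertSpace}/;
    push @forms, '\p{White_Space}' if $ch =~ /\p{White_Space}/;
    push @forms, '\p{XIDS}'        if $ch =~ /\p{XIDS}/;
    return @forms;
}

# Sample code points taken from ranges shown in the table.
for my $cp (0x41, 0x100, 0x61, 0xFE00, 0x2028, 0x20) {
    my @forms = matching_forms(chr $cp);
    printf "U+%04X: %s\n", $cp, @forms ? join(', ', @forms) : '(none of these)';
}
```

       "\p{Upper}" and "\p{Uppercase=Y}" always agree, since the table lists
       them as the same property.
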
6707   Legal "\p{}" and "\P{}" constructs that match no characters
6708       Unicode has some property-value pairs that currently don't match
6709       anything.  This happens generally either because they are obsolete, or
6710       they exist for symmetry with other forms, but no language has yet been
6711       encoded that uses them.  In this version of Unicode, the following
6712       match zero code points:
6713
6714       \p{Canonical_Combining_Class=Attached_Below_Left}
6715       \p{Canonical_Combining_Class=CCC133}
6716       \p{Grapheme_Cluster_Break=E_Base}
6717       \p{Grapheme_Cluster_Break=E_Base_GAZ}
6718       \p{Grapheme_Cluster_Break=E_Modifier}
6719       \p{Grapheme_Cluster_Break=Glue_After_Zwj}
6720       \p{Word_Break=E_Base}
6721       \p{Word_Break=E_Base_GAZ}
6722       \p{Word_Break=E_Modifier}
6723       \p{Word_Break=Glue_After_Zwj}
6724
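       One way to check whether a legal construct is empty on a particular
       installation is to ask "prop_invlist()" in Unicode::UCD for its
       inversion list.  This is a sketch under the assumption that, in scalar
       context, "prop_invlist()" returns the number of list elements for a
       known property and "undef" for an unknown one; the helper name
       "classify_property" is hypothetical.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Classify a property (or "property=value" pair) as 'unknown', 'empty',
# or 'nonempty' according to the Unicode tables of the running perl.
sub classify_property {
    my ($prop) = @_;

    # Scalar context: element count of the inversion list, or undef when
    # the property is not known to this perl.
    my $elements = prop_invlist($prop);

    return 'unknown' unless defined $elements;
    return $elements == 0 ? 'empty' : 'nonempty';
}

for my $prop ('Word_Break=E_Base',
              'Grapheme_Cluster_Break=Glue_After_Zwj',
              'Word_Break=ZWJ')
{
    printf "%-42s %s\n", $prop, classify_property($prop);
}
```

       The answers can differ between Perl releases, because the set of empty
       property-value pairs changes from one Unicode version to the next.
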

Properties accessible through Unicode::UCD

6726       The value of any Unicode (not including Perl extensions) character
6727       property mentioned above for any single code point is available through
6728       "charprop()" in Unicode::UCD.  "charprops_all()" in Unicode::UCD
6729       returns the values of all the Unicode properties for a given code
6730       point.
6731
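       As a brief illustration of these two functions, the sketch below looks
       up a few properties of U+FE00.  The code point and the property names
       are chosen here for illustration; the exact spelling of the returned
       values is decided by Unicode::UCD, so none is shown.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charprop charprops_all);

my $cp = 0xFE00;    # VARIATION SELECTOR-1

# Look up individual properties by name.
for my $prop (qw(Variation_Selector Word_Break General_Category)) {
    my $value = charprop($cp, $prop);
    printf "%-20s %s\n", $prop, defined $value ? $value : '(unavailable)';
}

# charprops_all() returns a reference to a hash of property name => value.
my $all = charprops_all($cp);
printf "%d properties reported for U+%04X\n", scalar(keys %$all), $cp;
```
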
6732       Besides these, all the Unicode character properties mentioned above
6733       (except for those marked as for internal use by Perl) are also
6734       accessible by "prop_invlist()" in Unicode::UCD.
6735
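       "prop_invlist()" returns an inversion list: elements at even indices
       start a range that matches the property, and elements at odd indices
       start the gap that follows.  The sketch below expands such a list into
       ranges.  The helper "invlist_ranges" is hypothetical, and capping an
       open-ended final range at U+10FFFF is a choice made here, not
       something the module does.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Expand an inversion list into [first, last] pairs.
sub invlist_ranges {
    my @invlist = @_;
    my @ranges;
    for (my $i = 0; $i < @invlist; $i += 2) {
        # An odd number of elements means the final range is open-ended;
        # this sketch caps it at the highest Unicode code point.
        my $last = $i + 1 < @invlist ? $invlist[$i + 1] - 1 : 0x10FFFF;
        push @ranges, [ $invlist[$i], $last ];
    }
    return @ranges;
}

my @ranges = invlist_ranges(prop_invlist("White_Space"));
my $total  = 0;
for my $range (@ranges) {
    printf "U+%04X..U+%04X\n", @$range;
    $total += $range->[1] - $range->[0] + 1;
}
print "$total code points\n";
```

       For Unicode 12.1.0 the total should agree with the count given for
       "\p{White_Space}" in the table above.
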
6736       Due to their nature, not all Unicode character properties are suitable
6737       for regular expression matches, nor for "prop_invlist()".  The remaining
6738       non-provisional, non-internal ones are accessible via "prop_invmap()"
6739       in Unicode::UCD (except for those that this Perl installation hasn't
6740       included; see below for which those are).
6741
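       For a property whose values are plain strings, "prop_invmap()" can be
       combined with "search_invlist()" (also in Unicode::UCD) to find the
       value for a single code point.  This is a minimal sketch: the helper
       "property_value" is hypothetical, and it ignores the returned format,
       so it is only suitable for string-valued properties such as
       General_Category or Line_Break.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invmap search_invlist);

# Return the value of a string-valued property for one code point, or
# undef if the property is unknown to (or omitted from) this perl.
sub property_value {
    my ($prop, $cp) = @_;

    # $ranges is an inversion list of range starts; $values holds the
    # property value for the range beginning at the same index.
    my ($ranges, $values, $format, $default) = prop_invmap($prop);
    return unless defined $ranges;

    # Index of the range containing $cp, or undef if $cp is below the
    # first range start.
    my $i = search_invlist($ranges, $cp);
    return defined $i ? $values->[$i] : $default;
}

for my $cp (0x41, 0x5F, 0x2028) {
    printf "U+%04X  gc=%s  lb=%s\n", $cp,
        property_value('General_Category', $cp),
        property_value('Line_Break', $cp);
}
```
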
6742       For compatibility with other parts of Perl, all the single forms given
6743       in the table in the section above are recognized.  BUT, there are some
6744       ambiguities between some Perl extensions and the Unicode properties,
6745       all of which are silently resolved in favor of the official Unicode
6746       property.  To avoid surprises, you should only use "prop_invmap()" for
6747       forms listed in the table below, which omits the non-recommended ones.
6748       The affected forms are the Perl single-form equivalents of Unicode
6749       properties.  For example, "\p{sc}" is a single-form equivalent of
6750       "\p{gc=sc}", but "prop_invmap()" treats "sc" as the "Script"
6751       property, whose short name is "sc".  The table indicates the current
6752       ambiguities in the INFO column, beginning with the word "NOT".
6753
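       The "sc" ambiguity can be observed directly.  The sketch below assumes
       that "prop_aliases()" in Unicode::UCD returns the short name followed
       by the full name, and uses the DOLLAR SIGN as a sample character.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_aliases charprop);

# In a regular expression, the single form \p{Sc} means
# General_Category=Currency_Symbol.
print "'\$' matches \\p{Sc}\n" if '$' =~ /\p{Sc}/;

# Unicode::UCD resolves the bare name "sc" to the Script property instead.
my ($short, $full) = prop_aliases("sc");
print "prop_aliases('sc') gives: $short, $full\n";

# So these two lookups answer different questions about the same character.
print "charprop('\$', 'sc'): ", charprop(ord('$'), "sc"), "\n";
print "charprop('\$', 'gc'): ", charprop(ord('$'), "gc"), "\n";
```

       Spelling out the full property name, as in "Script" or
       "General_Category", sidesteps the ambiguity altogether.
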
6754       The standard Unicode properties listed below are documented in
6755       <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
6756       documented in "prop_invmap()" in Unicode::UCD.  The other Perl
6757       extensions are in "Other Properties" in perlunicode.
6758
6759       The first column in the table is a name for the property; the second
6760       column is an alternative name, if any, plus possibly some annotations.
6761       The alternative name is the property's full name, unless that would
6762       simply repeat the first column, in which case the second column
6763       indicates the property's short name (if different).  The annotations
6764       are given only in the entry for the full name.  The annotations for
6765       binary properties include a list of the first few ranges that the
6766       property matches.  To avoid any ambiguity, the SPACE character is
6767       represented as "\x20".
6768
6769       If a property is obsolete, etc., the entry will be flagged with the same
6770       characters used in the table in the section above, like D or S.
6771
6772          NAME                      INFO
6773
6774          Age
6775          AHex                    ASCII_Hex_Digit
6776          All                     (Perl extension).  All code points,
6777                                  including those above Unicode.  Same as
6778                                  qr/./s.  U+0000..infinity
6779          Alnum                   XPosixAlnum.  (Perl extension)
6780          Alpha                   Alphabetic
6781          Alphabetic              (Short: Alpha).  [A-Za-z\xaa\xb5\xba\xc0-
6782                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6783                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6784                                  ...
6785          Any                     (Perl extension).  All Unicode code
6786                                  points.  U+0000..10FFFF
6787          ASCII                   Block=Basic_Latin.  (Perl extension).
6788                                  [\x00-\x7f]
6789          ASCII_Hex_Digit         (Short: AHex).  [0-9A-Fa-f]
6790          Assigned                (Perl extension).  All assigned code
6791                                  points.  U+0000..0377, U+037A..037F,
6792                                  U+0384..038A, U+038C, U+038E..03A1,
6793                                  U+03A3..052F ...
6794          Bc                      Bidi_Class
6795          Bidi_C                  Bidi_Control
6796          Bidi_Class              (Short: bc)
6797          Bidi_Control            (Short: Bidi_C).  U+061C, U+200E..200F,
6798                                  U+202A..202E, U+2066..2069
6799          Bidi_M                  Bidi_Mirrored
6800          Bidi_Mirrored           (Short: Bidi_M).
6801                                  [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
6802                                  U+169B..169C, U+2039..203A, U+2045..2046,
6803                                  U+207D..207E ...
6804          Bidi_Mirroring_Glyph    (Short: bmg)
6805          Bidi_Paired_Bracket     (Short: bpb)
6806          Bidi_Paired_Bracket_Type (Short: bpt)
6807          Blank                   XPosixBlank.  (Perl extension)
6808          Blk                     Block
6809          Block                   (Short: blk)
6810          Bmg                     Bidi_Mirroring_Glyph
6811          Bpb                     Bidi_Paired_Bracket
6812          Bpt                     Bidi_Paired_Bracket_Type
6813          Canonical_Combining_Class (Short: ccc)
6814          Case_Folding            (Short: cf)
6815          Case_Ignorable          (Short: CI).
6816                                  [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
6817                                  U+02B0..036F, U+0374..0375, U+037A,
6818                                  U+0384..0385, U+0387 ...
6819          Cased                   [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6820                                  \xff], U+0100..01BA, U+01BC..01BF,
6821                                  U+01C4..0293, U+0295..02B8, U+02C0..02C1
6822                                  ...
6823          Category                General_Category
6824          Ccc                     Canonical_Combining_Class
6825          CE                      Composition_Exclusion
6826          Cf                      Case_Folding; NOT 'cf' meaning
6827                                  'General_Category=Format'
6828          Changes_When_Casefolded (Short: CWCF).  [A-Z\xb5\xc0-\xd6\xd8-
6829                                  \xdf], U+0100, U+0102, U+0104, U+0106,
6830                                  U+0108 ...
6831          Changes_When_Casemapped (Short: CWCM).  [A-Za-z\xb5\xc0-\xd6\xd8-
6832                                  \xf6\xf8-\xff], U+0100..0137,
6833                                  U+0139..018C, U+018E..019A, U+019C..01A9,
6834                                  U+01AC..01B9 ...
6835          Changes_When_Lowercased (Short: CWL).  [A-Z\xc0-\xd6\xd8-\xde],
6836                                  U+0100, U+0102, U+0104, U+0106, U+0108 ...
6837          Changes_When_NFKC_Casefolded (Short: CWKCF).  [A-
6838                                  Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
6839                                  \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf], U+0100,
6840                                  U+0102, U+0104, U+0106, U+0108 ...
6841          Changes_When_Titlecased (Short: CWT).  [a-z\xb5\xdf-\xf6\xf8-
6842                                  \xff], U+0101, U+0103, U+0105, U+0107,
6843                                  U+0109 ...
6844          Changes_When_Uppercased (Short: CWU).  [a-z\xb5\xdf-\xf6\xf8-
6845                                  \xff], U+0101, U+0103, U+0105, U+0107,
6846                                  U+0109 ...
6847          CI                      Case_Ignorable
6848          Cntrl                   XPosixCntrl (=General_Category=Control).
6849                                  (Perl extension)
6850          Comp_Ex                 Full_Composition_Exclusion
6851          Composition_Exclusion   (Short: CE).  U+0958..095F, U+09DC..09DD,
6852                                  U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
6853          CWCF                    Changes_When_Casefolded
6854          CWCM                    Changes_When_Casemapped
6855          CWKCF                   Changes_When_NFKC_Casefolded
6856          CWL                     Changes_When_Lowercased
6857          CWT                     Changes_When_Titlecased
6858          CWU                     Changes_When_Uppercased
6859          Dash                    [\-], U+058A, U+05BE, U+1400, U+1806,
6860                                  U+2010..2015 ...
6861          Decomposition_Mapping   (Short: dm)
6862          Decomposition_Type      (Short: dt)
6863          Default_Ignorable_Code_Point (Short: DI).  [\xad], U+034F, U+061C,
6864                                  U+115F..1160, U+17B4..17B5, U+180B..180E
6865                                  ...
6866          Dep                     Deprecated
6867          Deprecated              (Short: Dep).  U+0149, U+0673, U+0F77,
6868                                  U+0F79, U+17A3..17A4, U+206A..206F ...
6869          DI                      Default_Ignorable_Code_Point
6870          Dia                     Diacritic
6871          Diacritic               (Short: Dia).  [\^`\xa8\xaf\xb4\xb7-\xb8],
6872                                  U+02B0..034E, U+0350..0357, U+035D..0362,
6873                                  U+0374..0375, U+037A ...
6874          Digit                   XPosixDigit (=General_Category=
6875                                  Decimal_Number).  (Perl extension)
6876          Dm                      Decomposition_Mapping
6877          Dt                      Decomposition_Type
6878          Ea                      East_Asian_Width
6879          East_Asian_Width        (Short: ea)
6880          EqUIdeo                 Equivalent_Unified_Ideograph
6881          Equivalent_Unified_Ideograph (Short: EqUIdeo)
6882          Ext                     Extender
6883          Extender                (Short: Ext).  [\xb7], U+02D0..02D1,
6884                                  U+0640, U+07FA, U+0E46, U+0EC6 ...
6885          Full_Composition_Exclusion (Short: Comp_Ex).  U+0340..0341,
                                  U+0343..0344, U+0374, U+037E, U+0387,
                                  U+0958..095F ...
          Gc                      General_Category
          GCB                     Grapheme_Cluster_Break
          General_Category        (Short: gc)
          Gr_Base                 Grapheme_Base
          Gr_Ext                  Grapheme_Extend
          Graph                   XPosixGraph.  (Perl extension)
          Grapheme_Base           (Short: Gr_Base).  [\x20-\x7e\xa0-
                                  \xac\xae-\xff], U+0100..02FF,
                                  U+0370..0377, U+037A..037F, U+0384..038A,
                                  U+038C ...
          Grapheme_Cluster_Break  (Short: GCB)
          Grapheme_Extend         (Short: Gr_Ext).  U+0300..036F,
                                  U+0483..0489, U+0591..05BD, U+05BF,
                                  U+05C1..05C2, U+05C4..05C5 ...
          Hangul_Syllable_Type    (Short: hst)
          Hex                     Hex_Digit
          Hex_Digit               (Short: Hex).  [0-9A-Fa-f], U+FF10..FF19,
                                  U+FF21..FF26, U+FF41..FF46
          HorizSpace              XPosixBlank.  (Perl extension)
          Hst                     Hangul_Syllable_Type
        D Hyphen                  [\-\xad], U+058A, U+1806, U+2010..2011,
                                  U+2E17, U+30FB ...  Supplanted by
                                  Line_Break property values; see
                                  www.unicode.org/reports/tr14
          ID_Continue             (Short: IDC).  [0-9A-Z_a-
                                  z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
                                  \xff], U+0100..02C1, U+02C6..02D1,
                                  U+02E0..02E4, U+02EC, U+02EE ...
          ID_Start                (Short: IDS).  [A-Za-z\xaa\xb5\xba\xc0-
                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
                                  ...
          IDC                     ID_Continue
          Ideo                    Ideographic
          Ideographic             (Short: Ideo).  U+3006..3007,
                                  U+3021..3029, U+3038..303A, U+3400..4DB5,
                                  U+4E00..9FEF, U+F900..FA6D ...
          IDS                     ID_Start
          IDS_Binary_Operator     (Short: IDSB).  U+2FF0..2FF1, U+2FF4..2FFB
          IDS_Trinary_Operator    (Short: IDST).  U+2FF2..2FF3
          IDSB                    IDS_Binary_Operator
          IDST                    IDS_Trinary_Operator
          In                      Present_In.  (Perl extension)
          Indic_Positional_Category (Short: InPC)
          Indic_Syllabic_Category (Short: InSC)
          InPC                    Indic_Positional_Category
          InSC                    Indic_Syllabic_Category
          Isc                     ISO_Comment; NOT 'isc' meaning
                                  'General_Category=Other'
          ISO_Comment             (Short: isc)
          Jg                      Joining_Group
          Join_C                  Join_Control
          Join_Control            (Short: Join_C).  U+200C..200D
          Joining_Group           (Short: jg)
          Joining_Type            (Short: jt)
          Jt                      Joining_Type
          Lb                      Line_Break
          Lc                      Lowercase_Mapping; NOT 'lc' meaning
                                  'General_Category=Cased_Letter'
          Line_Break              (Short: lb)
          LOE                     Logical_Order_Exception
          Logical_Order_Exception (Short: LOE).  U+0E40..0E44, U+0EC0..0EC4,
                                  U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
                                  ...
          Lower                   Lowercase
          Lowercase               (Short: Lower).  [a-z\xaa\xb5\xba\xdf-
                                  \xf6\xf8-\xff], U+0101, U+0103, U+0105,
                                  U+0107, U+0109 ...
          Lowercase_Mapping       (Short: lc)
          Math                    [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
                                  U+03D5, U+03F0..03F1, U+03F4..03F6,
                                  U+0606..0608 ...
          Na                      Name
          Na1                     Unicode_1_Name
          Name                    (Short: na)
          Name_Alias
          NChar                   Noncharacter_Code_Point
          NFC_QC                  NFC_Quick_Check
          NFC_Quick_Check         (Short: NFC_QC)
          NFD_QC                  NFD_Quick_Check
          NFD_Quick_Check         (Short: NFD_QC)
          NFKC_Casefold           (Short: NFKC_CF)
          NFKC_CF                 NFKC_Casefold
          NFKC_QC                 NFKC_Quick_Check
          NFKC_Quick_Check        (Short: NFKC_QC)
          NFKD_QC                 NFKD_Quick_Check
          NFKD_Quick_Check        (Short: NFKD_QC)
          Noncharacter_Code_Point (Short: NChar).  U+FDD0..FDEF,
                                  U+FFFE..FFFF, U+1FFFE..1FFFF,
                                  U+2FFFE..2FFFF, U+3FFFE..3FFFF,
                                  U+4FFFE..4FFFF ...
          Nt                      Numeric_Type
          Numeric_Type            (Short: nt)
          Numeric_Value           (Short: nv)
          Nv                      Numeric_Value
          Pat_Syn                 Pattern_Syntax
          Pat_WS                  Pattern_White_Space
          Pattern_Syntax          (Short: Pat_Syn).
                                  [!\"#\$\%&\'\(\)*+,\-.\/:;<=
                                  >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
                                  \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
                                  U+2010..2027, U+2030..203E, U+2041..2053,
                                  U+2055..205E, U+2190..245F ...
          Pattern_White_Space     (Short: Pat_WS).  [\t\n\cK\f\r\x20\x85],
                                  U+200E..200F, U+2028..2029
          PCM                     Prepended_Concatenation_Mark
          Perl_Decimal_Digit      (Perl extension)
          PerlSpace               PosixSpace.  (Perl extension)
          PerlWord                PosixWord.  (Perl extension)
          PosixAlnum              (Perl extension).  [0-9A-Za-z]
          PosixAlpha              (Perl extension).  [A-Za-z]
          PosixBlank              (Perl extension).  [\t\x20]
          PosixCntrl              (Perl extension).  ASCII control
                                  characters.  ACK, BEL, BS, CAN, CR, DC1,
                                  DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
                                  ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
                                  NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
          PosixDigit              (Perl extension).  [0-9]
          PosixGraph              (Perl extension).
                                  [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                                  Z\[\\\]\^_`a-z\{\|\}~]
          PosixLower              (Perl extension).  [a-z]
          PosixPrint              (Perl extension).  [\x20-\x7e]
          PosixPunct              (Perl extension).
                                  [!\"#\$\%&\'\(\)*+,\-.\/:;<=
                                  >?\@\[\\\]\^_`\{\|\}~]
          PosixSpace              (Perl extension).  [\t\n\cK\f\r\x20]
          PosixUpper              (Perl extension).  [A-Z]
          PosixWord               (Perl extension).  \w, restricted to
                                  ASCII.  [0-9A-Z_a-z]
          PosixXDigit             ASCII_Hex_Digit.  (Perl extension).
                                  [0-9A-Fa-f]
          Prepended_Concatenation_Mark (Short: PCM).  U+0600..0605, U+06DD,
                                  U+070F, U+08E2, U+110BD, U+110CD
          Present_In              (Short: In).  (Perl extension)
          Print                   XPosixPrint.  (Perl extension)
          Punct                   General_Category=Punctuation.  (Perl
                                  extension).
                                  [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_-
                                  \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
                                  U+037E, U+0387, U+055A..055F,
                                  U+0589..058A, U+05BE ...
          QMark                   Quotation_Mark
          Quotation_Mark          (Short: QMark).  [\"\'\xab\xbb],
                                  U+2018..201F, U+2039..203A, U+2E42,
                                  U+300C..300F, U+301D..301F ...
          Radical                 U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
          Regional_Indicator      (Short: RI).  U+1F1E6..1F1FF
          RI                      Regional_Indicator
          SB                      Sentence_Break
          Sc                      Script; NOT 'sc' meaning
                                  'General_Category=Currency_Symbol'
          Scf                     Simple_Case_Folding
          Script                  (Short: sc)
          Script_Extensions       (Short: scx)
          Scx                     Script_Extensions
          SD                      Soft_Dotted
          Sentence_Break          (Short: SB)
          Sentence_Terminal       (Short: STerm).  [!.?], U+0589,
                                  U+061E..061F, U+06D4, U+0700..0702, U+07F9
                                  ...
          Sfc                     Simple_Case_Folding
          Simple_Case_Folding     (Short: scf)
          Simple_Lowercase_Mapping (Short: slc)
          Simple_Titlecase_Mapping (Short: stc)
          Simple_Uppercase_Mapping (Short: suc)
          Slc                     Simple_Lowercase_Mapping
          Soft_Dotted             (Short: SD).  [i-j], U+012F, U+0249,
                                  U+0268, U+029D, U+02B2 ...
          Space                   White_Space
          SpacePerl               XPosixSpace.  (Perl extension)
          Stc                     Simple_Titlecase_Mapping
          STerm                   Sentence_Terminal
          Suc                     Simple_Uppercase_Mapping
          Tc                      Titlecase_Mapping
          Term                    Terminal_Punctuation
          Terminal_Punctuation    (Short: Term).  [!,.:;?], U+037E, U+0387,
                                  U+0589, U+05C3, U+060C ...
          Title                   Titlecase.  (Perl extension)
          Titlecase               (Short: Title).  (Perl extension).  (=
                                  \p{Gc=Lt}).  U+01C5, U+01C8, U+01CB,
                                  U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
          Titlecase_Mapping       (Short: tc)
          Uc                      Uppercase_Mapping
          UIdeo                   Unified_Ideograph
          Unicode                 Any.  (Perl extension)
          Unicode_1_Name          (Short: na1)
          Unified_Ideograph       (Short: UIdeo).  U+3400..4DB5,
                                  U+4E00..9FEF, U+FA0E..FA0F, U+FA11,
                                  U+FA13..FA14, U+FA1F ...
          Upper                   Uppercase
          Uppercase               (Short: Upper).  [A-Z\xc0-\xd6\xd8-\xde],
                                  U+0100, U+0102, U+0104, U+0106, U+0108 ...
          Uppercase_Mapping       (Short: uc)
          Variation_Selector      (Short: VS).  U+180B..180D, U+FE00..FE0F,
                                  U+E0100..E01EF
          Vertical_Orientation    (Short: vo)
          VertSpace               (Perl extension).  \v.  [\n\cK\f\r\x85],
                                  U+2028..2029
          Vo                      Vertical_Orientation
          VS                      Variation_Selector
          WB                      Word_Break
          White_Space             (Short: WSpace).
                                  [\t\n\cK\f\r\x20\x85\xa0], U+1680,
                                  U+2000..200A, U+2028..2029, U+202F, U+205F
                                  ...
          Word                    XPosixWord.  (Perl extension)
          Word_Break              (Short: WB)
          WSpace                  White_Space
          XDigit                  XPosixXDigit (=Hex_Digit).  (Perl
                                  extension)
          XID_Continue            (Short: XIDC).  [0-9A-Z_a-
                                  z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
                                  \xff], U+0100..02C1, U+02C6..02D1,
                                  U+02E0..02E4, U+02EC, U+02EE ...
          XID_Start               (Short: XIDS).  [A-Za-z\xaa\xb5\xba\xc0-
                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
                                  ...
          XIDC                    XID_Continue
          XIDS                    XID_Start
          XPerlSpace              XPosixSpace.  (Perl extension)
          XPosixAlnum             (Short: Alnum).  (Perl extension).
                                  Alphabetic and (decimal) Numeric.  [0-9A-
                                  Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                                  \xff], U+0100..02C1, U+02C6..02D1,
                                  U+02E0..02E4, U+02EC, U+02EE ...
          XPosixAlpha             Alphabetic.  (Perl extension).  [A-Za-
                                  z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
                                  U+02EC, U+02EE ...
          XPosixBlank             (Short: Blank).  (Perl extension).  \h,
                                  Horizontal white space.  [\t\x20\xa0],
                                  U+1680, U+2000..200A, U+202F, U+205F,
                                  U+3000
          XPosixCntrl             General_Category=Control  (Short: Cntrl).
                                  (Perl extension).  Control characters.
                                  [\x00-\x1f\x7f-\x9f]
          XPosixDigit             General_Category=Decimal_Number  (Short:
                                  Digit).  (Perl extension).  [0-9] + all
                                  other decimal digits.  [0-9],
                                  U+0660..0669, U+06F0..06F9, U+07C0..07C9,
                                  U+0966..096F, U+09E6..09EF ...
          XPosixGraph             (Short: Graph).  (Perl extension).
                                  Characters that are graphical.
                                  [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                                  Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
                                  U+0100..0377, U+037A..037F, U+0384..038A,
                                  U+038C, U+038E..03A1 ...
          XPosixLower             Lowercase.  (Perl extension).  [a-
                                  z\xaa\xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
                                  U+0103, U+0105, U+0107, U+0109 ...
          XPosixPrint             (Short: Print).  (Perl extension).
                                  Characters that are graphical plus space
                                  characters (but no controls).  [\x20-
                                  \x7e\xa0-\xff], U+0100..0377,
                                  U+037A..037F, U+0384..038A, U+038C,
                                  U+038E..03A1 ...
          XPosixPunct             (Perl extension).  \p{Punct} + ASCII-range
                                  \p{Symbol}.  [!\"#\$\%&\'\(\)*+,\-.\/:;<=
                                  >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
                                  \xb7\xbb\xbf], U+037E, U+0387,
                                  U+055A..055F, U+0589..058A, U+05BE ...
          XPosixSpace             (Perl extension).  \s including beyond
                                  ASCII and vertical tab.
                                  [\t\n\cK\f\r\x20\x85\xa0], U+1680,
                                  U+2000..200A, U+2028..2029, U+202F, U+205F
                                  ...
          XPosixUpper             Uppercase.  (Perl extension).  [A-Z\xc0-
                                  \xd6\xd8-\xde], U+0100, U+0102, U+0104,
                                  U+0106, U+0108 ...
          XPosixWord              (Short: Word).  (Perl extension).  \w,
                                  including beyond ASCII; = \p{Alnum} + \pM
                                  + \p{Pc} + \p{Join_Control}.  [0-9A-Z_a-
                                  z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
                                  U+02EC, U+02EE ...
          XPosixXDigit            Hex_Digit  (Short: XDigit).  (Perl
                                  extension).  [0-9A-Fa-f], U+FF10..FF19,
                                  U+FF21..FF26, U+FF41..FF46
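
       As an illustration of how the names in the table above can be used,
       the following sketch matches a few of the listed binary properties
       with "\p{...}".  The sample code points and variable names are chosen
       here for illustration only; the results it prints follow from the
       ranges shown in the table.

```perl
use strict;
use warnings;

# U+FF21 FULLWIDTH LATIN CAPITAL LETTER A is in Hex_Digit (U+FF21..FF26)
# but not in the ASCII-only PosixXDigit ([0-9A-Fa-f]).
my $fullwidth_a     = "\x{FF21}";
my $in_hex_digit    = $fullwidth_a =~ /\A\p{Hex_Digit}\z/   ? 1 : 0;
my $in_posix_xdigit = $fullwidth_a =~ /\A\p{PosixXDigit}\z/ ? 1 : 0;

# A short name is interchangeable with its long form: Join_C is Join_Control.
my $zwj        = "\x{200D}";
my $long_form  = $zwj =~ /\p{Join_Control}/ ? 1 : 0;
my $short_form = $zwj =~ /\p{Join_C}/       ? 1 : 0;

# \P{...} (capital P) negates: "A" is not a Regional_Indicator.
my $not_ri = "A" =~ /\P{Regional_Indicator}/ ? 1 : 0;

print "Hex_Digit=$in_hex_digit PosixXDigit=$in_posix_xdigit ",
      "Join_Control=$long_form Join_C=$short_form not_RI=$not_ri\n";
```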

Properties accessible through other means

       Certain properties are also accessible via core function calls.  These
       are:

        Lowercase_Mapping          lc() and lcfirst()
        Titlecase_Mapping          ucfirst()
        Uppercase_Mapping          uc()

       Also, Case_Folding is accessible through the "/i" modifier in regular
       expressions, the "\F" escape in double-quotish strings, and the "fc"
       operator.
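
       A minimal sketch of these core functions, assuming Perl 5.16 or later
       so that "fc" and "\F" are available and strings get Unicode
       semantics.  The sample characters and variable names are chosen here
       for illustration.

```perl
use v5.16;          # enables strict, say, fc, "\F", and unicode_strings
use warnings;

my $alpha_upper = "\x{0391}";   # GREEK CAPITAL LETTER ALPHA
my $dz_lower    = "\x{01C6}";   # LATIN SMALL LETTER DZ WITH CARON

my $lowered    = lc($alpha_upper);     # Lowercase_Mapping
my $uppered    = uc($dz_lower);        # Uppercase_Mapping
my $titlecased = ucfirst($dz_lower);   # Titlecase_Mapping (differs from uc here)

# Case_Folding three ways: fc(), "\F...\E", and the /i modifier.
my $folded_fc     = fc("STRASSE");
my $folded_escape = "\Fstra\x{DF}e\E";   # U+00DF folds to "ss"
my $matches_i     = $alpha_upper =~ /\A\x{03B1}\z/i ? 1 : 0;

printf "lc=U+%04X uc=U+%04X ucfirst=U+%04X\n",
    ord($lowered), ord($uppered), ord($titlecased);
say "folds equal: ", ($folded_fc eq $folded_escape ? "yes" : "no"),
    ", /i match: $matches_i";
```

       Comparing "fc" results, rather than "lc" or "uc" results, is the
       usual way to test two strings for caseless equality.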

       And, the Name and Name_Alias properties are accessible through the
       "\N{}" interpolation in double-quoted strings and regular expressions,
       and through the functions "charnames::viacode()",
       "charnames::vianame()", and "charnames::string_vianame()" (which
       require a "use charnames ();" to be specified).
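
       The following sketch shows the three functions next to "\N{}".  It
       loads the module with "use charnames ':full'" instead of the empty
       import list, an assumption made here so that "\N{}" by full name
       works on any Perl version; the sample characters are illustrative.

```perl
use strict;
use warnings;
use charnames ':full';   # loads charnames and enables \N{...} by full name

my $e_acute = "\N{LATIN SMALL LETTER E WITH ACUTE}";   # U+00E9

# Code point to name.
my $name = charnames::viacode(0xE9);

# Name to code point (a number), and name to a one-character string.
my $codepoint = charnames::vianame("GREEK SMALL LETTER ALPHA");
my $string    = charnames::string_vianame("GREEK SMALL LETTER ALPHA");

printf "%s\n", $name;
printf "U+%04X\n", $codepoint;
printf "length %d, first character U+%04X\n", length($string), ord($string);
```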

       Finally, most properties related to decomposition are accessible via
       Unicode::Normalize.
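
       For instance, a canonical decomposition can be obtained with the
       module's "NFD" function and undone with "NFC".  This is only a small
       sketch; the sample character is chosen here for illustration.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD NFC);

my $composed   = "\x{E9}";          # LATIN SMALL LETTER E WITH ACUTE
my $decomposed = NFD($composed);    # base letter followed by combining acute

my @parts = map { sprintf "U+%04X", ord } split //, $decomposed;
print "NFD: @parts\n";
print "NFC restores the original\n" if NFC($decomposed) eq $composed;
```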

Unicode character properties that are NOT accepted by Perl

       Perl will generate an error for a few character properties in Unicode
       when used in a regular expression.  The non-Unihan ones are listed
       below, with the reasons they are not accepted, perhaps with
       workarounds.  The short names for the properties are listed enclosed in
       (parentheses).  As described after the list, an installation can change
       the defaults and choose to accept any of these.  The list is machine
       generated based on the choices made for the installation that generated
       this document.

       Expands_On_NFC (XO_NFC)
       Expands_On_NFD (XO_NFD)
       Expands_On_NFKC (XO_NFKC)
       Expands_On_NFKD (XO_NFKD)
           Deprecated by Unicode.  These are characters that expand to more
           than one character in the specified normalization form, but whether
           they actually take up more bytes or not depends on the encoding
           being used.  For example, a UTF-8 encoded character may expand to a
           different number of bytes than a UTF-32 encoded character.

       Extended_Pictographic (XPG)
           Not part of the Unicode Character Database

       Grapheme_Link (Gr_Link)
           Duplicates ccc=vr (Canonical_Combining_Class=Virama)

       Jamo_Short_Name (JSN)
       Other_Alphabetic (OAlpha)
       Other_Default_Ignorable_Code_Point (ODI)
       Other_Grapheme_Extend (OGr_Ext)
       Other_ID_Continue (OIDC)
       Other_ID_Start (OIDS)
       Other_Lowercase (OLower)
       Other_Math (OMath)
       Other_Uppercase (OUpper)
           Used by Unicode internally for generating other properties and not
           intended to be used stand-alone

       Script=Katakana_Or_Hiragana (sc=Hrkt)
           Obsolete.  All code points previously matched by this have been
           moved to "Script=Common".  Consider instead using
           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
           both)

       Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
           All code points that would be matched by this are matched by either
           "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
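
       The workaround suggested for the two Katakana_Or_Hiragana entries can
       be sketched as a bracketed character class that accepts either script
       extension.  The pattern name and the sample code points are chosen
       here for illustration.

```perl
use strict;
use warnings;

# Accept either script extension; stands in for the unsupported Hrkt value.
my $kana = qr/[\p{Script_Extensions=Katakana}\p{Script_Extensions=Hiragana}]/;

my $hiragana_a = "\x{3042}" =~ /\A$kana\z/ ? 1 : 0;   # HIRAGANA LETTER A
my $katakana_a = "\x{30A2}" =~ /\A$kana\z/ ? 1 : 0;   # KATAKANA LETTER A
my $latin_a    = "A"        =~ /\A$kana\z/ ? 1 : 0;

# U+30FC KATAKANA-HIRAGANA PROLONGED SOUND MARK has Script=Common, so it
# is found only through Script_Extensions.
my $mark_via_scx    = "\x{30FC}" =~ /\A$kana\z/               ? 1 : 0;
my $mark_via_script = "\x{30FC}" =~ /\A\p{Script=Katakana}\z/ ? 1 : 0;

print "hiragana=$hiragana_a katakana=$katakana_a latin=$latin_a ",
      "mark(scx)=$mark_via_scx mark(sc)=$mark_via_script\n";
```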

       An installation can choose to allow any of these to be matched by
       downloading the Unicode database from <http://www.unicode.org/Public/>
       to $Config{privlib}/unicore/ in the Perl source tree, changing the
       controlling lists contained in the program
       $Config{privlib}/unicore/mktables and then re-compiling and installing.
       (%Config is available from the Config module).
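
       To see where those paths point on a given installation, something
       like the following can be used.  It only reports the locations; the
       variable names are chosen here for illustration, and whether mktables
       is actually shipped with an installed Perl is not guaranteed.

```perl
use strict;
use warnings;
use Config;        # exports %Config
use File::Spec;

my $unicore  = File::Spec->catdir($Config{privlib}, 'unicore');
my $mktables = File::Spec->catfile($unicore, 'mktables');

print "unicore directory: $unicore\n";
print "mktables ", (-e $mktables ? "is" : "is not"),
      " present in this installation\n";
```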

       Also, perl can be recompiled to operate on an earlier version of the
       Unicode standard.  Further information is at
       $Config{privlib}/unicore/README.perl.

Other information in the Unicode data base

       The Unicode data base is delivered in two different formats.  The XML
       version is valid for more modern Unicode releases.  The other version
       is a collection of files.  The two are intended to give equivalent
       information.  Perl uses the older form; this allows you to recompile
       Perl to use early Unicode releases.

       The only non-character property that Perl currently supports is Named
       Sequences, in which a sequence of code points is given a name and
       generally treated as a single entity.  (Perl supports these via the
       "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
       in charnames, and "namedseq()" in Unicode::UCD.)
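
       A short sketch of the three access routes, using the named sequence
       KATAKANA LETTER AINU P (U+31F7 followed by U+309A) as the sample.
       The variable names are chosen here for illustration.

```perl
use strict;
use warnings;
use charnames ':full';            # for \N{...} and string_vianame()
use Unicode::UCD qw(namedseq);

my $name = "KATAKANA LETTER AINU P";

my $via_escape    = "\N{KATAKANA LETTER AINU P}";
my $via_charnames = charnames::string_vianame($name);
my $via_ucd       = namedseq($name);   # scalar context: sequence as a string

# Each result is a two-character string, not a single code point.
for my $seq ($via_escape, $via_charnames, $via_ucd) {
    print join(" ", map { sprintf "U+%04X", ord } split //, $seq), "\n";
}
```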

       Below is a list of the files in the Unicode data base that Perl doesn't
       currently use, along with very brief descriptions of their purposes.
       Some of the names of the files have been shortened from those that
       Unicode uses, in order to allow them to be distinguishable from
       similarly named files on file systems for which only the first 8
       characters of a name are significant.

       auxiliary/GraphemeBreakTest.html
       auxiliary/LineBreakTest.html
       auxiliary/SentenceBreakTest.html
       auxiliary/WordBreakTest.html
           Documentation of validation Tests

       BidiCharacterTest.txt
       BidiTest.txt
       NormTest.txt
           Validation Tests

       CJKRadicals.txt
           Maps the kRSUnicode property values to corresponding code points

       EmojiSources.txt
           Maps certain Unicode code points to their legacy Japanese
           cell-phone values

       extracted/DName.txt
           This file adds no new information not already present in other
           files

       Index.txt
           Alphabetical index of Unicode characters

       NamedSqProv.txt
           Named sequences proposed for inclusion in a later version of the
           Unicode Standard; if you need them now, you can append this file to
           NamedSequences.txt and recompile perl

       NamesList.html
           Describes the format and contents of NamesList.txt

       NamesList.txt
           Annotated list of characters

       NormalizationCorrections.txt
           Documentation of corrections already incorporated into the Unicode
           data base

       NushuSources.txt
           Specifies source material for Nushu characters

       ReadMe.txt
           Documentation

       StandardizedVariants.html
           Obsoleted as of Unicode 9.0, but previously provided a visual
           display of the standard variant sequences derived from
           StandardizedVariants.txt.

       StandardizedVariants.txt
           Certain glyph variations for character display are standardized.
           This lists the non-Unihan ones; the Unihan ones are also not used
           by Perl, and are in a separate Unicode data base
           <http://www.unicode.org/ivd>

       TangutSources.txt
           Specifies source mappings for Tangut ideographs and components.
           This data file also includes informative radical-stroke values that
           are used internally by Unicode

       USourceData.txt
           Documentation of status and cross reference of proposals for
           encoding by Unicode of Unihan characters

       USourceGlyphs.pdf
           Pictures of the characters in USourceData.txt

SEE ALSO

       <http://www.unicode.org/reports/tr44/>

       perlrecharclass

       perlunicode



perl v5.30.1                      2019-11-29                   PERLUNIPROPS(1)