1PERLUNIPROPS(1) Perl Programmers Reference Guide PERLUNIPROPS(1)
2
3
4
6 perluniprops - Index of Unicode Version 10.0.0 character properties in
7 Perl
8
10 This document provides information about the portion of the Unicode
11 database that deals with character properties, that is, the portion that
12 is defined on single code points. ("Other information in the Unicode
13 data base" below briefly mentions other data that Unicode provides.)
14
15 Perl can provide access to all non-provisional Unicode character
16 properties, though not all are enabled by default. The omitted ones
17 are the Unihan properties (accessible via the CPAN module
18 Unicode::Unihan) and certain deprecated or Unicode-internal properties.
19 (An installation may choose to recompile Perl's tables to change this.
20 See "Unicode character properties that are NOT accepted by Perl".)
21
22 For most purposes, access to Unicode properties from the Perl core is
23 through regular expression matches, as described in the next section.
24 For some special purposes, and to access the properties that are not
25 suitable for regular expression matching, all the Unicode character
26 properties that Perl handles are accessible via the standard
27 Unicode::UCD module, as described in the section "Properties accessible
28 through Unicode::UCD".
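
As a minimal sketch of the non-regex route, the standard Unicode::UCD module can report several properties of one code point at once. The choice of U+0041 and of the fields printed is purely illustrative.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo);

# charinfo() returns a hash reference describing a single code point.
my $info = charinfo(0x41);    # U+0041

printf "name:     %s\n", $info->{name};        # LATIN CAPITAL LETTER A
printf "category: %s\n", $info->{category};    # Lu (General_Category)
printf "script:   %s\n", $info->{script};
printf "block:    %s\n", $info->{block};
```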
29
30 Perl also provides some additional extensions and short-cut synonyms
31 for Unicode properties.
32
33 This document merely lists all available properties and does not
34 attempt to explain what each property really means. There is a brief
35 description of each Perl extension; see "Other Properties" in
36 perlunicode for more information on these. There is some detail about
37 Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
38 to find out about the intricacies of the official Unicode properties,
39 refer to the Unicode standard. A good starting place is
40 <http://www.unicode.org/reports/tr44/>.
41
42 Note that you can define your own properties; see "User-Defined
43 Character Properties" in perlunicode.
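
A user-defined property might look like the following sketch. The name "InAsciiVowel" and its contents are hypothetical, made up for this illustration; perlunicode remains the authority on the exact rules.

```perl
use strict;
use warnings;

# A user-defined property is a subroutine whose name begins with "In" or
# "Is". It returns one hexadecimal code point (or range) per line.
# "InAsciiVowel" is a hypothetical name invented for this sketch.
sub InAsciiVowel {
    return join "\n", qw(41 45 49 4F 55 61 65 69 6F 75), "";
}

# Keep only the characters of the string that match the new property.
my @vowels = grep { /\p{InAsciiVowel}/ } split //, "Unicode";
print "@vowels\n";    # U i o e
```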
44
46 The Perl regular expression "\p{}" and "\P{}" constructs give access to
47 most of the Unicode character properties. The table below shows all
48 these constructs, both single and compound forms.
49
50 Compound forms consist of two components, separated by an equals sign
51 or a colon. The first component is the property name, and the second
52 component is the particular value of the property to match against, for
53 example, "\p{Script_Extensions: Greek}" and
54 "\p{Script_Extensions=Greek}" both mean to match characters whose
55 Script_Extensions property value is Greek. ("Script_Extensions" is an
56 improved version of the "Script" property.)
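
The two spellings of the compound form can be compared directly, as in this small sketch. The test characters are arbitrary choices.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# The colon and equals-sign spellings are interchangeable.
my $colon  = ($alpha =~ /\p{Script_Extensions: Greek}/) ? 1 : 0;
my $equals = ($alpha =~ /\p{Script_Extensions=Greek}/)  ? 1 : 0;
print "colon form: $colon, equals form: $equals\n";    # colon form: 1, equals form: 1

# A Latin letter has a different Script_Extensions value.
my $latin = ("a" =~ /\p{Script_Extensions=Greek}/) ? 1 : 0;
print "Latin 'a' matches Greek: $latin\n";             # Latin 'a' matches Greek: 0
```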
57
58 Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
59 their equivalent compound forms. The table shows these equivalences.
60 (In our example, "\p{Greek}" is just a shortcut for
61 "\p{Script_Extensions=Greek}"). There are also a few Perl-defined
62 single forms that are not shortcuts for a compound form. One such is
63 "\p{Word}". These are also listed in the table.
64
65 In parsing these constructs, Perl always ignores upper/lower case
66 differences everywhere within the {braces}. Thus "\p{Greek}" means the
67 same thing as "\p{greek}". But note that changing the case of the "p"
68 or "P" before the left brace completely changes the meaning of the
69 construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
70 Casing in this document is for improved legibility.
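
A short sketch of the casing rule: case inside the braces does not matter, while the case of the leading "p" inverts the match. The sample character is an arbitrary choice.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Case inside the braces is ignored; \P (capital) negates the match.
my @results = map { $alpha =~ $_ ? 1 : 0 }
    qr/\p{Greek}/, qr/\p{greek}/, qr/\p{GREEK}/, qr/\P{Greek}/;

print "@results\n";    # 1 1 1 0
```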
71
72 Also, white space, hyphens, and underscores are normally ignored
73 everywhere between the {braces}, and hence can be freely added or
74 removed even if the "/x" modifier hasn't been specified on the regular
75 expression. But in the table below a 'T' at the beginning of an entry
76 means that tighter (stricter) rules are used for that entry:
77
78 Single form ("\p{name}") tighter rules:
79 White space, hyphens, and underscores ARE significant except
80 for:
81
82 · white space adjacent to a non-word character
83
84 · underscores separating digits in numbers
85
86 That means, for example, that you can freely add or remove
87 white space adjacent to (but within) the braces without
88 affecting the meaning.
89
90 Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
91 The tighter rules given above for the single form apply to
92 everything to the right of the colon or equals; the looser
93 rules still apply to everything to the left.
94
95 That means, for example, that you can freely add or remove
96 white space adjacent to (but within) the braces and the colon
97 or equal sign.
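
The loose and tighter rules can be contrasted with a sketch along these lines. The particular spellings tried here are illustrative and are not an exhaustive list of what Perl accepts.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Loose rules: spaces, hyphens, and underscores inside the braces are ignored.
my @loose = (
    qr/\p{Script_Extensions=Greek}/,
    qr/\p{ScriptExtensions=Greek}/,
    qr/\p{ script extensions = greek }/,
    qr/\p{Script-Extensions=Greek}/,
);
my $loose_hits = grep { $alpha =~ $_ } @loose;
print "loose spellings that match: $loose_hits of ", scalar(@loose), "\n";

# Tighter rules ('T' entries): punctuation in the value is significant, so
# the value is written here exactly as in the table.
my $tight_ok = ("A" =~ /\p{Age: 1.1}/ && "A" =~ /\p{Age=V1_1}/) ? 1 : 0;
print "U+0041 has Age 1.1: $tight_ok\n";
```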
98
99 Some properties are considered obsolete by Unicode, but are still
100 available. There are several varieties of obsolescence:
101
102 Stabilized
103 A property may be stabilized. Such a determination does not
104 indicate that the property should or should not be used;
105 instead it is a declaration that the property will not be
106 maintained nor extended for newly encoded characters. Such
107 properties are marked with an 'S' in the table.
108
109 Deprecated
110 A property may be deprecated, perhaps because its original
111 intent has been replaced by another property, or because its
112 specification was somehow defective. This means that its use
113 is strongly discouraged, so much so that a warning will be
114 issued if used, unless the regular expression is in the scope
115 of a "no warnings 'deprecated'" statement. A 'D' flags each
116 such entry in the table, and the entry there for the longest,
117 most descriptive version of the property will give the reason
118 it is deprecated, and perhaps advice. Perl may issue such a
119 warning, even for properties that aren't officially deprecated
120 by Unicode, when there used to be characters or code points
121 that were matched by them, but no longer. This is to warn you
122 that your program may not work like it did on earlier Unicode
123 releases.
124
125 A deprecated property may be made unavailable in a future Perl
126 version, so it is best to move away from it.
127
128 A deprecated property may also be stabilized, but this fact is
129 not shown.
130
131 Obsolete
132 Properties marked with an 'O' in the table are considered
133 (plain) obsolete. Generally this designation is given to
134 properties that Unicode once used for internal purposes (but
135 not any longer).
136
137 Discouraged
138 This is not actually a Unicode-specified obsolescence, but
139 applies to certain Perl extensions that are present for
140 backwards compatibility, but are discouraged from being used.
141 These are not obsolete, but their meanings are not stable.
142 Future Unicode versions could force any of these extensions to
143 be removed without warning, replaced by another property with
144 the same name that means something different. An 'X' flags
145 each such entry in the table. Use the equivalent shown
146 instead.
147
148 In particular, matches in the Block property have single forms
149 defined by Perl that begin with "In_", "Is_", or even with no
150 prefix at all. Like all DISCOURAGED forms, these are not
151 stable. For example, "\p{Block=Deseret}" can currently be
152 written as "\p{In_Deseret}", "\p{Is_Deseret}", or
153 "\p{Deseret}". But, a new Unicode version may come along that
154 would force Perl to change the meaning of one or more of these,
155 and your program would no longer be correct. Currently there
156 are no such conflicts with the form that begins "In_", but
157 there are many with the other two shortcuts, and Unicode
158 continues to define new properties that begin with "In", so
159 it's quite possible that a conflict will occur in the future.
160 The compound form is guaranteed to not become obsolete, and its
161 meaning is clearer anyway. See "Blocks" in perlunicode for
162 more information about this.
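
To illustrate why the compound form is the safer spelling, the following sketch compares a script match with a block match. The two sample characters are arbitrary choices: U+0750 is an Arabic-script letter that lies outside the Arabic block.

```perl
use strict;
use warnings;

my $alef = "\x{0627}";    # ARABIC LETTER ALEF, inside the Arabic block
my $beh  = "\x{0750}";    # Arabic-script letter in the Arabic_Supplement block

for my $pair ([alef => $alef], [beh => $beh]) {
    my ($name, $ch) = @$pair;
    printf "%-4s script=%d block=%d\n", $name,
        ($ch =~ /\p{Script_Extensions=Arabic}/ ? 1 : 0),
        ($ch =~ /\p{Block=Arabic}/             ? 1 : 0);
}
# alef script=1 block=1
# beh  script=1 block=0
```

Spelling out "Block=" or "Script_Extensions=" leaves no doubt about which of the two properties is intended.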
163
164 The table below has two columns. The left column contains the "\p{}"
165 constructs to look up, possibly preceded by the flags mentioned above;
166 and the right column contains information about them, like a
167 description, or synonyms. The table shows both the single and compound
168 forms for each property that has them. If the left column is a short
169 name for a property, the right column will give its longer, more
170 descriptive name; and if the left column is the longest name, the right
171 column will show any equivalent shortest name, in both single and
172 compound forms if applicable.
173
174 If braces are not needed to specify a property (e.g., "\pL"), the left
175 column contains both forms, with and without braces.
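
For a single-letter property name the braces are optional, as this sketch shows. The sample string is arbitrary.

```perl
use strict;
use warnings;

my $text = "a1 b2";

my @braced   = $text =~ /\p{L}/g;    # letters, braced form
my @unbraced = $text =~ /\pL/g;      # same property, no braces
my @negated  = $text =~ /\PL/g;      # everything that is not a letter

print "braced:   @braced\n";                           # a b
print "unbraced: @unbraced\n";                         # a b
print "negated count: ", scalar(@negated), "\n";       # 3
```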
176
177 The right column will also caution you if a property means something
178 different than what might normally be expected.
179
180 All single forms are Perl extensions; a few compound forms are as well,
181 and are noted as such.
182
183 Numbers in (parentheses) indicate the total number of Unicode code
184 points matched by the property. For the entries that give the longest,
185 most descriptive version of the property, the count is followed by a
186 list of some of the code points matched by it. The list includes all
187 the matched characters in the 0-255 range, enclosed in the familiar
188 [brackets], the same as a regular expression bracketed character class.
189 Following that, the next few higher matching ranges are also given. To
190 avoid visual ambiguity, the SPACE character is represented as "\x20".
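
The parenthesized counts can be reproduced from an inversion list, as in this sketch using prop_invlist() from Unicode::UCD. The helper count_code_points() is a hypothetical name introduced here, and it deliberately ignores an open-ended final range.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# An inversion list alternates between the first code point of a matching
# range and the first code point after it. count_code_points() is a
# hypothetical helper that sums the closed ranges only; an odd trailing
# element (a range running to infinity) is ignored.
sub count_code_points {
    my @invlist = @_;
    my $count   = 0;
    while (@invlist >= 2) {
        my ($start, $after) = splice @invlist, 0, 2;
        $count += $after - $start;
    }
    return $count;
}

my $ahex   = count_code_points(prop_invlist("ASCII_Hex_Digit"));
my $arrows = count_code_points(prop_invlist("Block=Arrows"));

print "ASCII_Hex_Digit: $ahex\n";      # 22
print "Block=Arrows:    $arrows\n";    # 112
```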
191
192 For emphasis, those properties that match no code points at all are
193 listed as well in a separate section following the table.
194
195 Most properties match the same code points regardless of whether "/i"
196 case-insensitive matching is specified or not. But a few properties
197 are affected. These are shown with the notation "(/i= other_property)"
198 in the second column. Under case-insensitive matching they match the
199 same code points as the property other_property.
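
As an illustration, \p{Lu} (uppercase letters) is one of the properties affected by "/i": under case-insensitive matching it behaves like the cased-letter category. The sketch below assumes that behavior; the sample characters are arbitrary.

```perl
use strict;
use warnings;

for my $ch ("A", "a", "1") {
    printf "%s: without /i = %d, with /i = %d\n", $ch,
        ($ch =~ /\p{Lu}/  ? 1 : 0),
        ($ch =~ /\p{Lu}/i ? 1 : 0);
}
# A: without /i = 1, with /i = 1
# a: without /i = 0, with /i = 1
# 1: without /i = 0, with /i = 0
```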
200
201 There is no description given for most non-Perl defined properties (see
202 <http://www.unicode.org/reports/tr44/> for that).
203
204 For compactness, '*' is used as a wildcard instead of showing all
205 possible combinations. For example, entries like:
206
207 \p{Gc: *} \p{General_Category: *}
208
209 mean that 'Gc' is a synonym for 'General_Category', and anything that
210 is valid for the latter is also valid for the former. Similarly,
211
212 \p{Is_*} \p{*}
213
214 means that if and only if, for example, "\p{Foo}" exists, then
215 "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
216 thing. And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
217 and "\p{IsFoo=Bar}". "*" here is restricted to something not beginning
218 with an underscore.
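
A brief sketch of the "Is_" equivalence for a single form, using \p{Greek} as the example property:

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# The optional "Is" prefix, with or without an underscore, changes nothing.
my @forms = (qr/\p{Greek}/, qr/\p{Is_Greek}/, qr/\p{IsGreek}/);
my $hits  = grep { $alpha =~ $_ } @forms;

print "$hits of ", scalar(@forms), " spellings match\n";    # 3 of 3 spellings match
```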
219
220 Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
221 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table
222 shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
223 for the other possibilities. Note that not all properties which have
224 values 'Yes' and 'No' are binary. The non-binary ones have all their
225 values spelled out without using this wild card, and a "NOT" clause in
226 their description highlights that they are not binary. They also
227 require the compound form to match them, whereas true binary properties
228 have both single and compound forms available.
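
The synonyms for binary values can be checked with a sketch like this one, which uses the Alphabetic property and two arbitrary sample characters.

```perl
use strict;
use warnings;

# Every spelling of the "yes" value selects the same set of code points.
my @yes = (qr/\p{Alphabetic=Y}/, qr/\p{Alphabetic=Yes}/,
           qr/\p{Alphabetic=T}/, qr/\p{Alphabetic=True}/, qr/\p{Alpha}/);

# Likewise for "no"; \P{Alpha} is the negated single form.
my @no  = (qr/\p{Alphabetic=N}/, qr/\p{Alphabetic=No}/,
           qr/\p{Alphabetic=F}/, qr/\p{Alphabetic=False}/, qr/\P{Alpha}/);

my $yes_hits = grep { "a" =~ $_ } @yes;    # "a" is alphabetic
my $no_hits  = grep { "1" =~ $_ } @no;     # "1" is not

print "yes forms matching 'a': $yes_hits\n";    # 5
print "no forms matching '1': $no_hits\n";      # 5
```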
229
230 Note that all non-essential underscores are removed in the display of
231 the short names below.
232
233 Legend summary:
234
235 * is a wild-card
236 (\d+) in the info column gives the number of Unicode code points
237 matched by this property.
238 D means this is deprecated.
239 O means this is obsolete.
240 S means this is stabilized.
241 T means tighter (stricter) name matching applies.
242 X means use of this form is discouraged, and may not be stable.
243
244 NAME INFO
245
246 \p{Adlam} \p{Script_Extensions=Adlam} (Short:
247 \p{Adlm}; NOT \p{Block=Adlam}) (88)
248 \p{Adlm} \p{Adlam} (= \p{Script_Extensions=Adlam})
249 (NOT \p{Block=Adlam}) (88)
250 X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64)
251 T \p{Age: 1.1} \p{Age=V1_1} (33_979)
252 \p{Age: V1_1} Code point's usage introduced in version
253 1.1 (33_979: U+0000..01F5, U+01FA..0217,
254 U+0250..02A8, U+02B0..02DE,
255 U+02E0..02E9, U+0300..0345 ...)
256 T \p{Age: 2.0} \p{Age=V2_0} (144_521)
257 \p{Age: V2_0} Code point's usage was introduced in
258 version 2.0; See also Property
259 'Present_In' (144_521: U+0591..05A1,
260 U+05A3..05AF, U+05C4, U+0F00..0F47,
261 U+0F49..0F69, U+0F71..0F8B ...)
262 T \p{Age: 2.1} \p{Age=V2_1} (2)
263 \p{Age: V2_1} Code point's usage was introduced in
264 version 2.1; See also Property
265 'Present_In' (2: U+20AC, U+FFFC)
266 T \p{Age: 3.0} \p{Age=V3_0} (10_307)
267 \p{Age: V3_0} Code point's usage was introduced in
268 version 3.0; See also Property
269 'Present_In' (10_307: U+01F6..01F9,
270 U+0218..021F, U+0222..0233,
271 U+02A9..02AD, U+02DF, U+02EA..02EE ...)
272 T \p{Age: 3.1} \p{Age=V3_1} (44_978)
273 \p{Age: V3_1} Code point's usage was introduced in
274 version 3.1; See also Property
275 'Present_In' (44_978: U+03F4..03F5,
276 U+FDD0..FDEF, U+10300..1031E,
277 U+10320..10323, U+10330..1034A,
278 U+10400..10425 ...)
279 T \p{Age: 3.2} \p{Age=V3_2} (1016)
280 \p{Age: V3_2} Code point's usage was introduced in
281 version 3.2; See also Property
282 'Present_In' (1016: U+0220, U+034F,
283 U+0363..036F, U+03D8..03D9, U+03F6,
284 U+048A..048B ...)
285 T \p{Age: 4.0} \p{Age=V4_0} (1226)
286 \p{Age: V4_0} Code point's usage was introduced in
287 version 4.0; See also Property
288 'Present_In' (1226: U+0221,
289 U+0234..0236, U+02AE..02AF,
290 U+02EF..02FF, U+0350..0357, U+035D..035F
291 ...)
292 T \p{Age: 4.1} \p{Age=V4_1} (1273)
293 \p{Age: V4_1} Code point's usage was introduced in
294 version 4.1; See also Property
295 'Present_In' (1273: U+0237..0241,
296 U+0358..035C, U+03FC..03FF,
297 U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
298 T \p{Age: 5.0} \p{Age=V5_0} (1369)
299 \p{Age: V5_0} Code point's usage was introduced in
300 version 5.0; See also Property
301 'Present_In' (1369: U+0242..024F,
302 U+037B..037D, U+04CF, U+04FA..04FF,
303 U+0510..0513, U+05BA ...)
304 T \p{Age: 5.1} \p{Age=V5_1} (1624)
305 \p{Age: V5_1} Code point's usage was introduced in
306 version 5.1; See also Property
307 'Present_In' (1624: U+0370..0373,
308 U+0376..0377, U+03CF, U+0487,
309 U+0514..0523, U+0606..060A ...)
310 T \p{Age: 5.2} \p{Age=V5_2} (6648)
311 \p{Age: V5_2} Code point's usage was introduced in
312 version 5.2; See also Property
313 'Present_In' (6648: U+0524..0525,
314 U+0800..082D, U+0830..083E, U+0900,
315 U+094E, U+0955 ...)
316 T \p{Age: 6.0} \p{Age=V6_0} (2088)
317 \p{Age: V6_0} Code point's usage was introduced in
318 version 6.0; See also Property
319 'Present_In' (2088: U+0526..0527,
320 U+0620, U+065F, U+0840..085B, U+085E,
321 U+093A..093B ...)
322 T \p{Age: 6.1} \p{Age=V6_1} (732)
323 \p{Age: V6_1} Code point's usage was introduced in
324 version 6.1; See also Property
325 'Present_In' (732: U+058F, U+0604,
326 U+08A0, U+08A2..08AC, U+08E4..08FE,
327 U+0AF0 ...)
328 T \p{Age: 6.2} \p{Age=V6_2} (1)
329 \p{Age: V6_2} Code point's usage was introduced in
330 version 6.2; See also Property
331 'Present_In' (1: U+20BA)
332 T \p{Age: 6.3} \p{Age=V6_3} (5)
333 \p{Age: V6_3} Code point's usage was introduced in
334 version 6.3; See also Property
335 'Present_In' (5: U+061C, U+2066..2069)
336 T \p{Age: 7.0} \p{Age=V7_0} (2834)
337 \p{Age: V7_0} Code point's usage was introduced in
338 version 7.0; See also Property
339 'Present_In' (2834: U+037F,
340 U+0528..052F, U+058D..058E, U+0605,
341 U+08A1, U+08AD..08B2 ...)
342 T \p{Age: 8.0} \p{Age=V8_0} (7716)
343 \p{Age: V8_0} Code point's usage was introduced in
344 version 8.0; See also Property
345 'Present_In' (7716: U+08B3..08B4,
346 U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
347 ...)
348 T \p{Age: 9.0} \p{Age=V9_0} (7500)
349 \p{Age: V9_0} Code point's usage was introduced in
350 version 9.0; See also Property
351 'Present_In' (7500: U+08B6..08BD,
352 U+08D4..08E2, U+0C80, U+0D4F,
353 U+0D54..0D56, U+0D58..0D5E ...)
354 T \p{Age: 10.0} \p{Age=V10_0} (8518)
355 \p{Age: V10_0} Code point's usage was introduced in
356 version 10.0; See also Property
357 'Present_In' (8518: U+0860..086A,
358 U+09FC..09FD, U+0AFA..0AFF, U+0D00,
359 U+0D3B..0D3C, U+1CF7 ...)
360 \p{Age: NA} \p{Age=Unassigned} (837_775 plus all
361 above-Unicode code points)
362 \p{Age: Unassigned} Code point's usage has not been assigned
363 in any Unicode release thus far. (Short:
364 \p{Age=NA}) (837_775 plus all above-
365 Unicode code points: U+0378..0379,
366 U+0380..0383, U+038B, U+038D, U+03A2,
367 U+0530 ...)
368 \p{Aghb} \p{Caucasian_Albanian} (=
369 \p{Script_Extensions=
370 Caucasian_Albanian}) (NOT \p{Block=
371 Caucasian_Albanian}) (53)
372 \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
373 (22)
374 \p{AHex: *} \p{ASCII_Hex_Digit: *}
375 \p{Ahom} \p{Script_Extensions=Ahom} (NOT \p{Block=
376 Ahom}) (57)
377 X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block=
378 Alchemical_Symbols}) (128)
379 X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
380 \p{InAlchemical}) (128)
381 \p{All} All code points, including those above
382 Unicode. Same as qr/./s (1_114_112 plus
383 all above-Unicode code points:
384 U+0000..infinity)
385 \p{Alnum} \p{XPosixAlnum} (127_219)
386 \p{Alpha} \p{XPosixAlpha} (= \p{Alphabetic=Y})
387 (126_629)
388 \p{Alpha: *} \p{Alphabetic: *}
389 \p{Alphabetic} \p{XPosixAlpha} (= \p{Alphabetic=Y})
390 (126_629)
391 \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (987_483
392 plus all above-Unicode code points:
393 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
394 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
395 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
396 U+02C2..02C5, U+02D2..02DF,
397 U+02E5..02EB, U+02ED, U+02EF..0344 ...)
398 \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (126_629:
399 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
400 \xf6\xf8-\xff], U+0100..02C1,
401 U+02C6..02D1, U+02E0..02E4, U+02EC,
402 U+02EE ...)
403 X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (=
404 \p{Block=Alphabetic_Presentation_Forms})
405 (80)
406 X \p{Alphabetic_Presentation_Forms} \p{Block=
407 Alphabetic_Presentation_Forms} (Short:
408 \p{InAlphabeticPF}) (80)
409 \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
410 Anatolian_Hieroglyphs} (Short: \p{Hluw};
411 NOT \p{Block=Anatolian_Hieroglyphs})
412 (583)
413 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
414 \p{Block=
415 Ancient_Greek_Musical_Notation}) (80)
416 X \p{Ancient_Greek_Musical_Notation} \p{Block=
417 Ancient_Greek_Musical_Notation} (Short:
418 \p{InAncientGreekMusic}) (80)
419 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
420 X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64)
421 \p{Any} All Unicode code points (1_114_112:
422 U+0000..10FFFF)
423 \p{Arab} \p{Arabic} (= \p{Script_Extensions=
424 Arabic}) (NOT \p{Block=Arabic}) (1324)
425 \p{Arabic} \p{Script_Extensions=Arabic} (Short:
426 \p{Arab}; NOT \p{Block=Arabic}) (1324)
427 X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block=
428 Arabic_Extended_A}) (96)
429 X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short:
430 \p{InArabicExtA}) (96)
431 X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols}
432 (= \p{Block=
433 Arabic_Mathematical_Alphabetic_Symbols})
434 (256)
435 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
436 Arabic_Mathematical_Alphabetic_Symbols}
437 (Short: \p{InArabicMath}) (256)
438 X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (=
439 \p{Block=Arabic_Presentation_Forms_A})
440 (688)
441 X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (=
442 \p{Block=Arabic_Presentation_Forms_B})
443 (144)
444 X \p{Arabic_Presentation_Forms_A} \p{Block=
445 Arabic_Presentation_Forms_A} (Short:
446 \p{InArabicPFA}) (688)
447 X \p{Arabic_Presentation_Forms_B} \p{Block=
448 Arabic_Presentation_Forms_B} (Short:
449 \p{InArabicPFB}) (144)
450 X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block=
451 Arabic_Supplement}) (48)
452 X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short:
453 \p{InArabicSup}) (48)
454 \p{Armenian} \p{Script_Extensions=Armenian} (Short:
455 \p{Armn}; NOT \p{Block=Armenian}) (94)
456 \p{Armi} \p{Imperial_Aramaic} (=
457 \p{Script_Extensions=Imperial_Aramaic})
458 (NOT \p{Block=Imperial_Aramaic}) (31)
459 \p{Armn} \p{Armenian} (= \p{Script_Extensions=
460 Armenian}) (NOT \p{Block=Armenian}) (94)
461 X \p{Arrows} \p{Block=Arrows} (112)
462 \p{ASCII} \p{Block=Basic_Latin} (128)
463 \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
464 (22)
465 \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
466 plus all above-Unicode code points:
467 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
468 >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
469 U+0100..infinity)
470 \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
471 Fa-f])
472 \p{Assigned} All assigned code points (276_271:
473 U+0000..0377, U+037A..037F,
474 U+0384..038A, U+038C, U+038E..03A1,
475 U+03A3..052F ...)
476 \p{Avestan} \p{Script_Extensions=Avestan} (Short:
477 \p{Avst}; NOT \p{Block=Avestan}) (61)
478 \p{Avst} \p{Avestan} (= \p{Script_Extensions=
479 Avestan}) (NOT \p{Block=Avestan}) (61)
480 \p{Bali} \p{Balinese} (= \p{Script_Extensions=
481 Balinese}) (NOT \p{Block=Balinese}) (121)
482 \p{Balinese} \p{Script_Extensions=Balinese} (Short:
483 \p{Bali}; NOT \p{Block=Balinese}) (121)
484 \p{Bamu} \p{Bamum} (= \p{Script_Extensions=Bamum})
485 (NOT \p{Block=Bamum}) (657)
486 \p{Bamum} \p{Script_Extensions=Bamum} (Short:
487 \p{Bamu}; NOT \p{Block=Bamum}) (657)
488 X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block=
489 Bamum_Supplement}) (576)
490 X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short:
491 \p{InBamumSup}) (576)
492 X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128)
493 \p{Bass} \p{Bassa_Vah} (= \p{Script_Extensions=
494 Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
495 (36)
496 \p{Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
497 \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
498 \p{Batak} \p{Script_Extensions=Batak} (Short:
499 \p{Batk}; NOT \p{Block=Batak}) (56)
500 \p{Batk} \p{Batak} (= \p{Script_Extensions=Batak})
501 (NOT \p{Block=Batak}) (56)
502 \p{Bc: *} \p{Bidi_Class: *}
503 \p{Beng} \p{Bengali} (= \p{Script_Extensions=
504 Bengali}) (NOT \p{Block=Bengali}) (101)
505 \p{Bengali} \p{Script_Extensions=Bengali} (Short:
506 \p{Beng}; NOT \p{Block=Bengali}) (101)
507 \p{Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
508 \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
509 \p{Bhks} \p{Bhaiksuki} (= \p{Script_Extensions=
510 Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
511 (97)
512 \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y})
513 (12)
514 \p{Bidi_C: *} \p{Bidi_Control: *}
515 \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1436)
516 \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (51)
517 \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1436: U+0608,
518 U+060B, U+060D, U+061B..064A,
519 U+066D..066F, U+0671..06D5 ...)
520 \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (51:
521 U+0600..0605, U+0660..0669,
522 U+066B..066C, U+06DD, U+08E2,
523 U+10E60..10E7E)
524 \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7)
525 \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4016)
526 \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016:
527 [^\t\n\cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-
528 \xff], U+180E, U+200B..200D,
529 U+2060..2065, U+206A..206F, U+FDD0..FDEF
530 ...)
531 \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15:
532 [,.\/:\xa0], U+060C, U+202F, U+2044,
533 U+FE50, U+FE52 ...)
534 \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15)
535 \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (158)
536 \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12)
537 \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (87)
538 \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (158: [0-9\xb2-
539 \xb3\xb9], U+06F0..06F9, U+2070,
540 U+2074..2079, U+2080..2089, U+2488..249B
541 ...)
542 \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
543 U+207A..207B, U+208A..208B, U+2212,
544 U+FB29, U+FE62..FE63 ...)
545 \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (87:
546 [#\$\%\xa2-\xa5\xb0-\xb1], U+058F,
547 U+0609..060A, U+066A, U+09F2..09F3,
548 U+09FB ...)
549 \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
550 U+2068)
551 \p{Bidi_Class: FSI} \p{Bidi_Class=First_Strong_Isolate} (1)
552 \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_097_124
553 plus all above-Unicode code points)
554 \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_097_124 plus
555 all above-Unicode code points: [A-Za-
556 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
557 \xff], U+0100..02B8, U+02BB..02C1,
558 U+02D0..02D1, U+02E0..02E4, U+02EE ...)
559 \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
560 U+202A)
561 \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
562 U+2066)
563 \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
564 U+202D)
565 \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1)
566 \p{Bidi_Class: LRI} \p{Bidi_Class=Left_To_Right_Isolate} (1)
567 \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1)
568 \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1773:
569 U+0300..036F, U+0483..0489,
570 U+0591..05BD, U+05BF, U+05C1..05C2,
571 U+05C4..05C5 ...)
572 \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1773)
573 \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (5350)
574 \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5350:
575 [!\"&\'\(\)*;<=
576 >?\@\[\\\]\^_`\{\|\}~\xa1\xa6-\xa9\xab-
577 \xac\xae-\xaf\xb4\xb6-\xb8\xbb-
578 \xbf\xd7\xf7], U+02B9..02BA,
579 U+02C2..02CF, U+02D2..02DF,
580 U+02E5..02ED, U+02EF..02FF ...)
581 \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7:
582 [\n\r\x1c-\x1e\x85], U+2029)
583 \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1)
584 \p{Bidi_Class: PDI} \p{Bidi_Class=Pop_Directional_Isolate} (1)
585 \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
586 U+202C)
587 \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
588 U+2069)
589 \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (4054)
590 \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (4054: U+0590,
591 U+05BE, U+05C0, U+05C3, U+05C6,
592 U+05C8..05FF ...)
593 \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
594 U+202B)
595 \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
596 U+2067)
597 \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
598 U+202E)
599 \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1)
600 \p{Bidi_Class: RLI} \p{Bidi_Class=Right_To_Left_Isolate} (1)
601 \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1)
602 \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3)
603 \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3:
604 [\t\cK\x1f])
605 \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
606 U+1680, U+2000..200A, U+2028, U+205F,
607 U+3000)
608 \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (17)
609 \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
610 \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
611 plus all above-Unicode code points:
612 U+0000..061B, U+061D..200D,
613 U+2010..2029, U+202F..2065,
614 U+206A..infinity)
615 \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (12:
616 U+061C, U+200E..200F, U+202A..202E,
617 U+2066..2069)
618 \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
619 (545)
620 \p{Bidi_M: *} \p{Bidi_Mirrored: *}
621 \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
622 (545)
623 \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
624 plus all above-Unicode code points:
625 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
626 Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
627 \xff], U+0100..0F39, U+0F3E..169A,
628 U+169D..2038, U+203B..2044, U+2047..207C
629 ...)
630 \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (545:
631 [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
632 U+169B..169C, U+2039..203A,
633 U+2045..2046, U+207D..207E ...)
634 \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
635 (60)
636 \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60:
637 [\)\]\}], U+0F3B, U+0F3D, U+169C,
638 U+2046, U+207E ...)
639 \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
640 (1_113_992 plus all above-Unicode code
641 points)
642 \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
643 plus all above-Unicode code points:
644 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=
645 >?\@A-Z\\\^_`a-z\|~\x7f-\xff],
646 U+0100..0F39, U+0F3E..169A,
647 U+169D..2044, U+2047..207C, U+207F..208C
648 ...)
649 \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
650 (60)
651 \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60:
652 [\(\[\{], U+0F3A, U+0F3C, U+169B,
653 U+2045, U+207D ...)
654 \p{Blank} \p{XPosixBlank} (18)
655 \p{Blk: *} \p{Block: *}
656 \p{Block: Adlam} (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
657 U+1E900..1E95F)
658 \p{Block: Aegean_Numbers} (64: U+10100..1013F)
659 \p{Block: Ahom} (NOT \p{Ahom} NOR \p{Is_Ahom}) (64:
660 U+11700..1173F)
661 \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128)
662 \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
663 U+1F700..1F77F)
664 \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
665 (80)
666 \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
667 AlphabeticPF}) (80: U+FB00..FB4F)
668 \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
669 NOR \p{Is_Anatolian_Hieroglyphs}) (640:
670 U+14400..1467F)
671 \p{Block: Ancient_Greek_Music} \p{Block=
672 Ancient_Greek_Musical_Notation} (80)
673 \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
674 AncientGreekMusic}) (80: U+1D200..1D24F)
675 \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
676 \p{Block: Ancient_Symbols} (64: U+10190..101CF)
677 \p{Block: Arabic} (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
678 U+0600..06FF)
679 \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
680 \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
681 U+08A0..08FF)
682 \p{Block: Arabic_Math} \p{Block=
683 Arabic_Mathematical_Alphabetic_Symbols}
684 (256)
685 \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
686 ArabicMath}) (256: U+1EE00..1EEFF)
687 \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688)
688 \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144)
689 \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
690 (688: U+FB50..FDFF)
691 \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
692 (144: U+FE70..FEFF)
693 \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48)
694 \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
695 U+0750..077F)
696 \p{Block: Armenian} (NOT \p{Armenian} NOR \p{Is_Armenian})
697 (96: U+0530..058F)
698 \p{Block: Arrows} (112: U+2190..21FF)
699 \p{Block: ASCII} \p{Block=Basic_Latin} (128)
700 \p{Block: Avestan} (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
701 U+10B00..10B3F)
702 \p{Block: Balinese} (NOT \p{Balinese} NOR \p{Is_Balinese})
703 (128: U+1B00..1B7F)
704 \p{Block: Bamum} (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
705 U+A6A0..A6FF)
706 \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576)
707 \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
708 U+16800..16A3F)
709 \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
710 \p{Block: Bassa_Vah} (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
711 (48: U+16AD0..16AFF)
712 \p{Block: Batak} (NOT \p{Batak} NOR \p{Is_Batak}) (64:
713 U+1BC0..1BFF)
714 \p{Block: Bengali} (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
715 U+0980..09FF)
716 \p{Block: Bhaiksuki} (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
717 (112: U+11C00..11C6F)
718 \p{Block: Block_Elements} (32: U+2580..259F)
719 \p{Block: Bopomofo} (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
720 (48: U+3100..312F)
721 \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
722 \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
723 U+31A0..31BF)
724 \p{Block: Box_Drawing} (128: U+2500..257F)
725 \p{Block: Brahmi} (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
726 U+11000..1107F)
727 \p{Block: Braille} \p{Block=Braille_Patterns} (256)
728 \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
729 U+2800..28FF)
730 \p{Block: Buginese} (NOT \p{Buginese} NOR \p{Is_Buginese})
731 (32: U+1A00..1A1F)
732 \p{Block: Buhid} (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
733 U+1740..175F)
734 \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
735 (256)
736 \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
737 ByzantineMusic}) (256: U+1D000..1D0FF)
738 \p{Block: Canadian_Syllabics} \p{Block=
739 Unified_Canadian_Aboriginal_Syllabics}
740 (640)
741 \p{Block: Carian} (NOT \p{Carian} NOR \p{Is_Carian}) (64:
742 U+102A0..102DF)
743 \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
744 \p{Is_Caucasian_Albanian}) (64:
745 U+10530..1056F)
746 \p{Block: Chakma} (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
747 U+11100..1114F)
748 \p{Block: Cham} (NOT \p{Cham} NOR \p{Is_Cham}) (96:
749 U+AA00..AA5F)
750 \p{Block: Cherokee} (NOT \p{Cherokee} NOR \p{Is_Cherokee})
751 (96: U+13A0..13FF)
752 \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
753 \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
754 U+AB70..ABBF)
755 \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992)
756 \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256)
757 \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
758 \p{Block: CJK_Compat_Ideographs} \p{Block=
759 CJK_Compatibility_Ideographs} (512)
760 \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
761 CJK_Compatibility_Ideographs_Supplement}
762 (544)
763 \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
764 U+3300..33FF)
765 \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
766 (32: U+FE30..FE4F)
767 \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
768 CJKCompatIdeographs}) (512: U+F900..FAFF)
769 \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
770 CJKCompatIdeographsSup}) (544:
771 U+2F800..2FA1F)
772 \p{Block: CJK_Ext_A} \p{Block=
773 CJK_Unified_Ideographs_Extension_A}
774 (6592)
775 \p{Block: CJK_Ext_B} \p{Block=
776 CJK_Unified_Ideographs_Extension_B}
777 (42_720)
778 \p{Block: CJK_Ext_C} \p{Block=
779 CJK_Unified_Ideographs_Extension_C}
780 (4160)
781 \p{Block: CJK_Ext_D} \p{Block=
782 CJK_Unified_Ideographs_Extension_D} (224)
783 \p{Block: CJK_Ext_E} \p{Block=
784 CJK_Unified_Ideographs_Extension_E}
785 (5776)
786 \p{Block: CJK_Ext_F} \p{Block=
787 CJK_Unified_Ideographs_Extension_F}
788 (7488)
789 \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
790 \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
791 (128: U+2E80..2EFF)
792 \p{Block: CJK_Strokes} (48: U+31C0..31EF)
793 \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64)
794 \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
795 (64: U+3000..303F)
796 \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
797 U+4E00..9FFF)
798 \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
799 CJKExtA}) (6592: U+3400..4DBF)
800 \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
801 CJKExtB}) (42_720: U+20000..2A6DF)
802 \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
803 CJKExtC}) (4160: U+2A700..2B73F)
804 \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
805 CJKExtD}) (224: U+2B740..2B81F)
806 \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
807 CJKExtE}) (5776: U+2B820..2CEAF)
808 \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
809 CJKExtF}) (7488: U+2CEB0..2EBEF)
810 \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
811 Diacriticals}) (112: U+0300..036F)
812 \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
813 DiacriticalsExt}) (80: U+1AB0..1AFF)
814 \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
815 DiacriticalsForSymbols}) (48:
816 U+20D0..20FF)
817 \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
818 DiacriticalsSup}) (64: U+1DC0..1DFF)
819 \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
820 U+FE20..FE2F)
821 \p{Block: Combining_Marks_For_Symbols} \p{Block=
822 Combining_Diacritical_Marks_For_Symbols}
823 (48)
824 \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
825 IndicNumberForms}) (16: U+A830..A83F)
826 \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96)
827 \p{Block: Control_Pictures} (64: U+2400..243F)
828 \p{Block: Coptic} (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
829 U+2C80..2CFF)
830 \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
831 \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
832 \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
833 U+1D360..1D37F)
834 \p{Block: Cuneiform} (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
835 (1024: U+12000..123FF)
836 \p{Block: Cuneiform_Numbers} \p{Block=
837 Cuneiform_Numbers_And_Punctuation} (128)
838 \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
839 CuneiformNumbers}) (128: U+12400..1247F)
840 \p{Block: Currency_Symbols} (48: U+20A0..20CF)
841 \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
842 \p{Block: Cyrillic} (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
843 (256: U+0400..04FF)
844 \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
845 \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
846 \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
847 \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
848 U+2DE0..2DFF)
849 \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
850 U+A640..A69F)
851 \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
852 U+1C80..1C8F)
853 \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
854 \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
855 U+0500..052F)
856 \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
857 (48)
858 \p{Block: Deseret} (80: U+10400..1044F)
859 \p{Block: Devanagari} (NOT \p{Devanagari} NOR \p{Is_Devanagari})
860 (128: U+0900..097F)
861 \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
862 \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
863 U+A8E0..A8FF)
864 \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
865 \p{Block: Diacriticals_Ext} \p{Block=
866 Combining_Diacritical_Marks_Extended}
867 (80)
868 \p{Block: Diacriticals_For_Symbols} \p{Block=
869 Combining_Diacritical_Marks_For_Symbols}
870 (48)
871 \p{Block: Diacriticals_Sup} \p{Block=
872 Combining_Diacritical_Marks_Supplement}
873 (64)
874 \p{Block: Dingbats} (192: U+2700..27BF)
875 \p{Block: Domino} \p{Block=Domino_Tiles} (112)
876 \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
877 U+1F030..1F09F)
878 \p{Block: Duployan} (NOT \p{Duployan} NOR \p{Is_Duployan})
879 (160: U+1BC00..1BC9F)
880 \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
881 \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
882 \p{Is_Egyptian_Hieroglyphs}) (1072:
883 U+13000..1342F)
884 \p{Block: Elbasan} (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
885 U+10500..1052F)
886 \p{Block: Emoticons} (80: U+1F600..1F64F)
887 \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
888 \p{Block: Enclosed_Alphanum_Sup} \p{Block=
889 Enclosed_Alphanumeric_Supplement} (256)
890 \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
891 EnclosedAlphanumSup}) (256:
892 U+1F100..1F1FF)
893 \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
894 EnclosedAlphanum}) (160: U+2460..24FF)
895 \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
896 (256)
897 \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
898 EnclosedCJK}) (256: U+3200..32FF)
899 \p{Block: Enclosed_Ideographic_Sup} \p{Block=
900 Enclosed_Ideographic_Supplement} (256)
901 \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
902 EnclosedIdeographicSup}) (256:
903 U+1F200..1F2FF)
904 \p{Block: Ethiopic} (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
905 (384: U+1200..137F)
906 \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
907 \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
908 \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
909 U+2D80..2DDF)
910 \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
911 U+AB00..AB2F)
912 \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
913 \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
914 U+1380..139F)
915 \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
916 \p{Punct} NOR \p{Is_Punctuation}) (112:
917 U+2000..206F)
918 \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
919 \p{Block: Geometric_Shapes_Ext} \p{Block=
920 Geometric_Shapes_Extended} (128)
921 \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
922 GeometricShapesExt}) (128:
923 U+1F780..1F7FF)
924 \p{Block: Georgian} (NOT \p{Georgian} NOR \p{Is_Georgian})
925 (96: U+10A0..10FF)
926 \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
927 \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
928 U+2D00..2D2F)
929 \p{Block: Glagolitic} (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
930 (96: U+2C00..2C5F)
931 \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
932 \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
933 (48: U+1E000..1E02F)
934 \p{Block: Gothic} (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
935 U+10330..1034F)
936 \p{Block: Grantha} (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
937 U+11300..1137F)
938 \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek}
939 NOR \p{Is_Greek}) (144)
940 \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
941 NOR \p{Is_Greek}) (144: U+0370..03FF)
942 \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256)
943 \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
944 U+1F00..1FFF)
945 \p{Block: Gujarati} (NOT \p{Gujarati} NOR \p{Is_Gujarati})
946 (128: U+0A80..0AFF)
947 \p{Block: Gurmukhi} (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
948 (128: U+0A00..0A7F)
949 \p{Block: Half_And_Full_Forms} \p{Block=
950 Halfwidth_And_Fullwidth_Forms} (240)
951 \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16)
952 \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
953 HalfAndFullForms}) (240: U+FF00..FFEF)
954 \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
955 NOR \p{Is_Hangul}) (11_184)
956 \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
957 (96: U+3130..318F)
958 \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
959 \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
960 U+A960..A97F)
961 \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
962 U+D7B0..D7FF)
963 \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
964 NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
965 \p{Block: Hanunoo} (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
966 U+1720..173F)
967 \p{Block: Hatran} (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
968 U+108E0..108FF)
969 \p{Block: Hebrew} (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
970 U+0590..05FF)
971 \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
972 HighPUSurrogates}) (128: U+DB80..DBFF)
973 \p{Block: High_PU_Surrogates} \p{Block=
974 High_Private_Use_Surrogates} (128)
975 \p{Block: High_Surrogates} (896: U+D800..DB7F)
976 \p{Block: Hiragana} (NOT \p{Hiragana} NOR \p{Is_Hiragana})
977 (96: U+3040..309F)
978 \p{Block: IDC} \p{Block=
979 Ideographic_Description_Characters} (NOT
980 \p{ID_Continue} NOR \p{Is_IDC}) (16)
981 \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
982 NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
983 U+2FF0..2FFF)
984 \p{Block: Ideographic_Symbols} \p{Block=
985 Ideographic_Symbols_And_Punctuation} (32)
986 \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
987 IdeographicSymbols}) (32: U+16FE0..16FFF)
988 \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
989 \p{Is_Imperial_Aramaic}) (32:
990 U+10840..1085F)
991 \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
992 (16)
993 \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
994 NOR \p{Is_Inscriptional_Pahlavi}) (32:
995 U+10B60..10B7F)
996 \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
997 NOR \p{Is_Inscriptional_Parthian}) (32:
998 U+10B40..10B5F)
999 \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
1000 \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
1001 U+0250..02AF)
1002 \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
1003 \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
1004 \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
1005 \p{Block: Javanese} (NOT \p{Javanese} NOR \p{Is_Javanese})
1006 (96: U+A980..A9DF)
1007 \p{Block: Kaithi} (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
1008 U+11080..110CF)
1009 \p{Block: Kana_Ext_A} \p{Block=Kana_Extended_A} (48)
1010 \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
1011 U+1B100..1B12F)
1012 \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
1013 \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
1014 U+1B000..1B0FF)
1015 \p{Block: Kanbun} (16: U+3190..319F)
1016 \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
1017 \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
1018 U+2F00..2FDF)
1019 \p{Block: Kannada} (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
1020 U+0C80..0CFF)
1021 \p{Block: Katakana} (NOT \p{Katakana} NOR \p{Is_Katakana})
1022 (96: U+30A0..30FF)
1023 \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
1024 \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
1025 KatakanaExt}) (16: U+31F0..31FF)
1026 \p{Block: Kayah_Li} (48: U+A900..A92F)
1027 \p{Block: Kharoshthi} (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
1028 (96: U+10A00..10A5F)
1029 \p{Block: Khmer} (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
1030 U+1780..17FF)
1031 \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
1032 \p{Block: Khojki} (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
1033 U+11200..1124F)
1034 \p{Block: Khudawadi} (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
1035 (80: U+112B0..112FF)
1036 \p{Block: Lao} (NOT \p{Lao} NOR \p{Is_Lao}) (128:
1037 U+0E80..0EFF)
1038 \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
1039 \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
1040 \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128:
1041 [\x80-\xff])
1042 \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
1043 \p{Block: Latin_Ext_Additional} \p{Block=
1044 Latin_Extended_Additional} (256)
1045 \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
1046 \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
1047 \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
1048 \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64)
1049 \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
1050 U+0100..017F)
1051 \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
1052 LatinExtAdditional}) (256: U+1E00..1EFF)
1053 \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
1054 U+0180..024F)
1055 \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
1056 U+2C60..2C7F)
1057 \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
1058 U+A720..A7FF)
1059 \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
1060 U+AB30..AB6F)
1061 \p{Block: Lepcha} (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
1062 U+1C00..1C4F)
1063 \p{Block: Letterlike_Symbols} (80: U+2100..214F)
1064 \p{Block: Limbu} (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
1065 U+1900..194F)
1066 \p{Block: Linear_A} (NOT \p{Linear_A} NOR \p{Is_Linear_A})
1067 (384: U+10600..1077F)
1068 \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
1069 \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
1070 \p{Block: Lisu} (48: U+A4D0..A4FF)
1071 \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
1072 \p{Block: Lycian} (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
1073 U+10280..1029F)
1074 \p{Block: Lydian} (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
1075 U+10920..1093F)
1076 \p{Block: Mahajani} (NOT \p{Mahajani} NOR \p{Is_Mahajani})
1077 (48: U+11150..1117F)
1078 \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
1079 \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
1080 U+1F000..1F02F)
1081 \p{Block: Malayalam} (NOT \p{Malayalam} NOR \p{Is_Malayalam})
1082 (128: U+0D00..0D7F)
1083 \p{Block: Mandaic} (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
1084 U+0840..085F)
1085 \p{Block: Manichaean} (NOT \p{Manichaean} NOR \p{Is_Manichaean})
1086 (64: U+10AC0..10AFF)
1087 \p{Block: Marchen} (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
1088 U+11C70..11CBF)
1089 \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
1090 \p{Is_Masaram_Gondi}) (96:
1091 U+11D00..11D5F)
1092 \p{Block: Math_Alphanum} \p{Block=
1093 Mathematical_Alphanumeric_Symbols} (1024)
1094 \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
1095 \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
1096 MathAlphanum}) (1024: U+1D400..1D7FF)
1097 \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
1098 (256: U+2200..22FF)
1099 \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
1100 \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
1101 \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
1102 \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
1103 (32: U+AAE0..AAFF)
1104 \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
1105 \p{Is_Mende_Kikakui}) (224:
1106 U+1E800..1E8DF)
1107 \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
1108 \p{Is_Meroitic_Cursive}) (96:
1109 U+109A0..109FF)
1110 \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
1111 \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160:
1112 U+16F00..16F9F)
1113 \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
1114 (256)
1115 \p{Block: Misc_Math_Symbols_A} \p{Block=
1116 Miscellaneous_Mathematical_Symbols_A}
1117 (48)
1118 \p{Block: Misc_Math_Symbols_B} \p{Block=
1119 Miscellaneous_Mathematical_Symbols_B}
1120 (128)
1121 \p{Block: Misc_Pictographs} \p{Block=
1122 Miscellaneous_Symbols_And_Pictographs}
1123 (768)
1124 \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
1125 \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
1126 \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
1127 MiscMathSymbolsA}) (48: U+27C0..27EF)
1128 \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
1129 MiscMathSymbolsB}) (128: U+2980..29FF)
1130 \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
1131 (256: U+2600..26FF)
1132 \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
1133 MiscArrows}) (256: U+2B00..2BFF)
1134 \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
1135 MiscPictographs}) (768: U+1F300..1F5FF)
1136 \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
1137 (256: U+2300..23FF)
1138 \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96:
1139 U+11600..1165F)
1140 \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
1141 \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
1142 \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian})
1143 (176: U+1800..18AF)
1144 \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
1145 \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
1146 U+11660..1167F)
1147 \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48:
1148 U+16A40..16A6F)
1149 \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48:
1150 U+11280..112AF)
1151 \p{Block: Music} \p{Block=Musical_Symbols} (256)
1152 \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
1153 U+1D100..1D1FF)
1154 \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
1155 U+1000..109F)
1156 \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
1157 \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
1158 \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
1159 U+AA60..AA7F)
1160 \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
1161 U+A9E0..A9FF)
1162 \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean})
1163 (48: U+10880..108AF)
1164 \p{Block: NB} \p{Block=No_Block} (834_096 plus all
1165 above-Unicode code points)
1166 \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR
1167 \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
1168 \p{Block: Newa} (NOT \p{Newa} NOR \p{Is_Newa}) (128:
1169 U+11400..1147F)
1170 \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64:
1171 U+07C0..07FF)
1172 \p{Block: No_Block} (Short: \p{Blk=NB}) (834_096 plus all
1173 above-Unicode code points: U+0870..089F,
1174 U+1C90..1CBF, U+2FE0..2FEF,
1175 U+10200..1027F, U+103E0..103FF,
1176 U+10570..105FF ...)
1177 \p{Block: Number_Forms} (64: U+2150..218F)
1178 \p{Block: Nushu} (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
1179 U+1B170..1B2FF)
1180 \p{Block: OCR} \p{Block=Optical_Character_Recognition}
1181 (32)
1182 \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
1183 U+1680..169F)
1184 \p{Block: Ol_Chiki} (48: U+1C50..1C7F)
1185 \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
1186 \p{Is_Old_Hungarian}) (128:
1187 U+10C80..10CFF)
1188 \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
1189 (48: U+10300..1032F)
1190 \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
1191 \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
1192 (48: U+10350..1037F)
1193 \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR
1194 \p{Is_Old_Persian}) (64: U+103A0..103DF)
1195 \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
1196 \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
1197 (80: U+10C00..10C4F)
1198 \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
1199 U+2440..245F)
1200 \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
1201 U+0B00..0B7F)
1202 \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
1203 \p{Block: Osage} (NOT \p{Osage} NOR \p{Is_Osage}) (80:
1204 U+104B0..104FF)
1205 \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
1206 U+10480..104AF)
1207 \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
1208 \p{Is_Pahawh_Hmong}) (144:
1209 U+16B00..16B8F)
1210 \p{Block: Palmyrene} (32: U+10860..1087F)
1211 \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} NOR
1212 \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
1213 \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
1214 (64: U+A840..A87F)
1215 \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
1216 \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
1217 U+101D0..101FF)
1218 \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician})
1219 (32: U+10900..1091F)
1220 \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
1221 \p{Block: Phonetic_Ext_Sup} \p{Block=
1222 Phonetic_Extensions_Supplement} (64)
1223 \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
1224 U+1D00..1D7F)
1225 \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
1226 PhoneticExtSup}) (64: U+1D80..1DBF)
1227 \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
1228 \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
1229 \p{Private_Use} NOR \p{Is_Private_Use})
1230 (6400)
1231 \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
1232 \p{Private_Use} NOR \p{Is_Private_Use})
1233 (6400: U+E000..F8FF)
1234 \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
1235 \p{Is_Psalter_Pahlavi}) (48:
1236 U+10B80..10BAF)
1237 \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
1238 \p{Private_Use} NOR \p{Is_Private_Use})
1239 (6400)
1240 \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
1241 \p{Punct} NOR \p{Is_Punctuation}) (112)
1242 \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
1243 U+A930..A95F)
1244 \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
1245 \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
1246 U+10E60..10E7F)
1247 \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96:
1248 U+16A0..16FF)
1249 \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan})
1250 (64: U+0800..083F)
1251 \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
1252 (96: U+A880..A8DF)
1253 \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
1254 U+11180..111DF)
1255 \p{Block: Shavian} (48: U+10450..1047F)
1256 \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
1257 \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
1258 U+11580..115FF)
1259 \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
1260 U+0D80..0DFF)
1261 \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
1262 \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
1263 U+FE50..FE6F)
1264 \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
1265 \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
1266 \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
1267 \p{Block: Soyombo} (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
1268 U+11A50..11AAF)
1269 \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
1270 ModifierLetters}) (80: U+02B0..02FF)
1271 \p{Block: Specials} (16: U+FFF0..FFFF)
1272 \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese})
1273 (64: U+1B80..1BBF)
1274 \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
1275 \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
1276 U+1CC0..1CCF)
1277 \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
1278 \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
1279 \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
1280 \p{Block: Sup_Math_Operators} \p{Block=
1281 Supplemental_Mathematical_Operators}
1282 (256)
1283 \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
1284 (65_536)
1285 \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
1286 (65_536)
1287 \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
1288 \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
1289 Supplemental_Symbols_And_Pictographs}
1290 (256)
1291 \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
1292 \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
1293 SuperAndSub}) (48: U+2070..209F)
1294 \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
1295 U+27F0..27FF)
1296 \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
1297 U+2900..297F)
1298 \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
1299 U+1F800..1F8FF)
1300 \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
1301 SupMathOperators}) (256: U+2A00..2AFF)
1302 \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
1303 SupPunctuation}) (128: U+2E00..2E7F)
1304 \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
1305 SupSymbolsAndPictographs}) (256:
1306 U+1F900..1F9FF)
1307 \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
1308 SupPUAA}) (65_536: U+F0000..FFFFF)
1309 \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
1310 SupPUAB}) (65_536: U+100000..10FFFF)
1311 \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
1312 \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
1313 \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
1314 \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
1315 U+0700..074F)
1316 \p{Block: Syriac_Sup} \p{Block=Syriac_Supplement} (16)
1317 \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
1318 U+0860..086F)
1319 \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
1320 U+1700..171F)
1321 \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
1322 (32: U+1760..177F)
1323 \p{Block: Tags} (128: U+E0000..E007F)
1324 \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
1325 U+1950..197F)
1326 \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
1327 (144: U+1A20..1AAF)
1328 \p{Block: Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
1329 (96: U+AA80..AADF)
1330 \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1331 \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
1332 U+1D300..1D35F)
1333 \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80:
1334 U+11680..116CF)
1335 \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
1336 U+0B80..0BFF)
1337 \p{Block: Tangut} (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
1338 U+17000..187FF)
1339 \p{Block: Tangut_Components} (768: U+18800..18AFF)
1340 \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
1341 U+0C00..0C7F)
1342 \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
1343 U+0780..07BF)
1344 \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128:
1345 U+0E00..0E7F)
1346 \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
1347 U+0F00..0FFF)
1348 \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
1349 (80: U+2D30..2D7F)
1350 \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
1351 U+11480..114DF)
1352 \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1353 (128)
1354 \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1355 TransportAndMap}) (128: U+1F680..1F6FF)
1356 \p{Block: UCAS} \p{Block=
1357 Unified_Canadian_Aboriginal_Syllabics}
1358 (640)
1359 \p{Block: UCAS_Ext} \p{Block=
1360 Unified_Canadian_Aboriginal_Syllabics_Extended}
1361 (80)
1362 \p{Block: Ugaritic} (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
1363 (32: U+10380..1039F)
1364 \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1365 UCAS}) (640: U+1400..167F)
1366 \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1367 \p{Blk=UCASExt}) (80: U+18B0..18FF)
1368 \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320:
1369 U+A500..A63F)
1370 \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1371 \p{Variation_Selector} NOR \p{Is_VS})
1372 (16: U+FE00..FE0F)
1373 \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1374 (240: U+E0100..E01EF)
1375 \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
1376 \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
1377 U+1CD0..1CFF)
1378 \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
1379 \p{Block: VS} \p{Block=Variation_Selectors} (NOT
1380 \p{Variation_Selector} NOR \p{Is_VS})
1381 (16)
1382 \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
1383 (240)
1384 \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR
1385 \p{Is_Warang_Citi}) (96: U+118A0..118FF)
1386 \p{Block: Yi_Radicals} (64: U+A490..A4CF)
1387 \p{Block: Yi_Syllables} (1168: U+A000..A48F)
1388 \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
1389 \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
1390 U+4DC0..4DFF)
1391 \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
1392 \p{Is_Zanabazar_Square}) (80:
1393 U+11A00..11A4F)
1394 X \p{Block_Elements} \p{Block=Block_Elements} (32)
1395 \p{Bopo} \p{Bopomofo} (= \p{Script_Extensions=
1396 Bopomofo}) (NOT \p{Block=Bopomofo}) (111)
1397 \p{Bopomofo} \p{Script_Extensions=Bopomofo} (Short:
1398 \p{Bopo}; NOT \p{Block=Bopomofo}) (111)
1399 X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
1400 Bopomofo_Extended}) (32)
1401 X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
1402 \p{InBopomofoExt}) (32)
1403 X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
1404 \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
1405 \p{Brah} \p{Brahmi} (= \p{Script_Extensions=
1406 Brahmi}) (NOT \p{Block=Brahmi}) (109)
1407 \p{Brahmi} \p{Script_Extensions=Brahmi} (Short:
1408 \p{Brah}; NOT \p{Block=Brahmi}) (109)
1409 \p{Brai} \p{Braille} (= \p{Script_Extensions=
1410 Braille}) (256)
1411 \p{Braille} \p{Script_Extensions=Braille} (Short:
1412 \p{Brai}) (256)
1413 X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
1414 \p{InBraille}) (256)
1415 \p{Bugi} \p{Buginese} (= \p{Script_Extensions=
1416 Buginese}) (NOT \p{Block=Buginese}) (31)
1417 \p{Buginese} \p{Script_Extensions=Buginese} (Short:
1418 \p{Bugi}; NOT \p{Block=Buginese}) (31)
1419 \p{Buhd} \p{Buhid} (= \p{Script_Extensions=Buhid})
1420 (NOT \p{Block=Buhid}) (22)
1421 \p{Buhid} \p{Script_Extensions=Buhid} (Short:
1422 \p{Buhd}; NOT \p{Block=Buhid}) (22)
1423 X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
1424 Byzantine_Musical_Symbols}) (256)
1425 X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1426 (Short: \p{InByzantineMusic}) (256)
1427 \p{C} \pC \p{Other} (= \p{General_Category=Other})
1428 (977_573 plus all above-Unicode code
1429 points)
1430 \p{Cakm} \p{Chakma} (= \p{Script_Extensions=
1431 Chakma}) (NOT \p{Block=Chakma}) (87)
1432 \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1433 (Short: \p{Cans}) (710)
1434 X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
1435 (= \p{Block=
1436 Unified_Canadian_Aboriginal_Syllabics})
1437 (640)
1438 T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1439 Not_Reordered} (1_113_286 plus all
1440 above-Unicode code points)
1441 T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1442 Overlay} (32)
1443 T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1444 Nukta} (23)
1445 T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1446 Kana_Voicing} (2)
1447 T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1448 Virama} (54)
1449 T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1450 CCC10} (1)
1451 \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
1452 U+05B0)
1453 T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1454 CCC11} (1)
1455 \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
1456 U+05B1)
1457 T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1458 CCC12} (1)
1459 \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
1460 U+05B2)
1461 T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1462 CCC13} (1)
1463 \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
1464 U+05B3)
1465 T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1466 CCC14} (1)
1467 \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
1468 U+05B4)
1469 T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1470 CCC15} (1)
1471 \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
1472 U+05B5)
1473 T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1474 CCC16} (1)
1475 \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
1476 U+05B6)
1477 T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1478 CCC17} (1)
1479 \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
1480 U+05B7)
1481 T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1482 CCC18} (2)
1483 \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
1484 U+05B8, U+05C7)
1485 T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1486 CCC19} (2)
1487 \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
1488 U+05B9..05BA)
1489 T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1490 CCC20} (1)
1491 \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
1492 U+05BB)
1493 T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1494 CCC21} (1)
1495 \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
1496 U+05BC)
1497 T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1498 CCC22} (1)
1499 \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
1500 U+05BD)
1501 T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1502 CCC23} (1)
1503 \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
1504 U+05BF)
1505 T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1506 CCC24} (1)
1507 \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
1508 U+05C1)
1509 T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1510 CCC25} (1)
1511 \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
1512 U+05C2)
1513 T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1514 CCC26} (1)
1515 \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
1516 U+FB1E)
1517 T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1518 CCC27} (2)
1519 \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
1520 U+064B, U+08F0)
1521 T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1522 CCC28} (2)
1523 \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
1524 U+064C, U+08F1)
1525 T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1526 CCC29} (2)
1527 \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
1528 U+064D, U+08F2)
1529 T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1530 CCC30} (2)
1531 \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
1532 U+0618, U+064E)
1533 T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1534 CCC31} (2)
1535 \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
1536 U+0619, U+064F)
1537 T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1538 CCC32} (2)
1539 \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
1540 U+061A, U+0650)
1541 T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1542 CCC33} (1)
1543 \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
1544 U+0651)
1545 T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1546 CCC34} (1)
1547 \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
1548 U+0652)
1549 T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1550 CCC35} (1)
1551 \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
1552 U+0670)
1553 T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1554 CCC36} (1)
1555 \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
1556 U+0711)
1557 T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1558 CCC84} (1)
1559 \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
1560 U+0C55)
1561 T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1562 CCC91} (1)
1563 \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
1564 U+0C56)
1565 T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1566 CCC103} (2)
1567 \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
1568 U+0E38..0E39)
1569 T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1570 CCC107} (4)
1571 \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
1572 U+0E48..0E4B)
1573 T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1574 CCC118} (2)
1575 \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
1576 U+0EB8..0EB9)
1577 T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1578 CCC122} (4)
1579 \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
1580 U+0EC8..0ECB)
1581 T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1582 CCC129} (1)
1583 \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
1584 U+0F71)
1585 T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1586 CCC130} (6)
1587 \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
1588 U+0F72, U+0F7A..0F7D, U+0F80)
1589 T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1590 CCC132} (1)
1591 \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
1592 U+0F74)
1593 T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1594 CCC133} (0)
1595 \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1596 T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1597 Attached_Below_Left} (0)
1598 T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1599 Attached_Below} (5)
1600 T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1601 Attached_Above} (1)
1602 T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1603 Attached_Above_Right} (9)
1604 T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1605 Below_Left} (1)
1606 T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1607 Below} (154)
1608 T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1609 Below_Right} (4)
1610 T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1611 Left} (2)
1612 T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1613 Right} (1)
1614 T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1615 Above_Left} (5)
1616 T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1617 Above} (461)
1618 T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1619 Above_Right} (5)
1620 T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1621 Double_Below} (4)
1622 T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1623 Double_Above} (5)
1624 T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1625 Iota_Subscript} (1)
1626 \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1627 Above} (461)
1628 \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (461:
1629 U+0300..0314, U+033D..0344, U+0346,
1630 U+034A..034C, U+0350..0352, U+0357 ...)
1631 \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
1632 U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
1633 \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (5:
1634 U+0315, U+031A, U+0358, U+1DF6, U+302C)
1635 \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1636 Above_Left} (5)
1637 \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1638 Above_Right} (5)
1639 \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1640 Attached_Above} (1)
1641 \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1642 Attached_Above_Right} (9)
1643 \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1644 Attached_Below} (5)
1645 \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1646 Attached_Below_Left} (0)
1647 \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1648 (1: U+1DCE)
1649 \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1650 \p{Ccc=ATAR}) (9: U+031B, U+0F39,
1651 U+1D165..1D166, U+1D16E..1D172)
1652 \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1653 (5: U+0321..0322, U+0327..0328, U+1DD0)
1654 \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1655 ATBL}) (0)
1656 \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1657 Below} (154)
1658 \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (154:
1659 U+0316..0319, U+031C..0320,
1660 U+0323..0326, U+0329..0333,
1661 U+0339..033C, U+0347..0349 ...)
1662 \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1:
1663 U+302A)
1664 \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
1665 U+059A, U+05AD, U+1939, U+302D)
1666 \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1667 Below_Left} (1)
1668 \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1669 Below_Right} (4)
1670 \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1671 Double_Above} (5)
1672 \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1673 Double_Below} (4)
1674 \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
1675 (5: U+035D..035E, U+0360..0361, U+1DCD)
1676 \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
1677 (4: U+035C, U+035F, U+0362, U+1DFC)
1678 \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1679 (1: U+0345)
1680 \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1681 Iota_Subscript} (1)
1682 \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
1683 (2: U+3099..309A)
1684 \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1685 Kana_Voicing} (2)
1686 \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1687 Left} (2)
1688 \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
1689 U+302E..302F)
1690 \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1691 Nukta} (23)
1692 \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1693 (1_113_286 plus all above-Unicode code
1694 points: U+0000..02FF, U+034F,
1695 U+0370..0482, U+0488..0590, U+05BE,
1696 U+05C0 ...)
1697 \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1698 Not_Reordered} (1_113_286 plus all
1699 above-Unicode code points)
1700 \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (23:
1701 U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
1702 U+0CBC ...)
1703 \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1704 Overlay} (32)
1705 \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
1706 U+0334..0338, U+1CD4, U+1CE2..1CE8,
1707 U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
1708 ...)
1709 \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1710 Right} (1)
1711 \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
1712 U+1D16D)
1713 \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (54:
1714 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
1715 U+0BCD ...)
1716 \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1717 Virama} (54)
1718 \p{Cans} \p{Canadian_Aboriginal} (=
1719 \p{Script_Extensions=
1720 Canadian_Aboriginal}) (710)
1721 \p{Cari} \p{Carian} (= \p{Script_Extensions=
1722 Carian}) (NOT \p{Block=Carian}) (49)
1723 \p{Carian} \p{Script_Extensions=Carian} (Short:
1724 \p{Cari}; NOT \p{Block=Carian}) (49)
1725 \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2314)
1726 \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_798 plus
1727 all above-Unicode code points: [\x00-
1728 \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-
1729 Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
1730 \xac\xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
1731 U+0100..02AF, U+0370..0373,
1732 U+0376..0379, U+037B..0383, U+0386 ...)
1733 \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2314:
1734 [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
1735 U+02B0..036F, U+0374..0375, U+037A,
1736 U+0384..0385, U+0387 ...)
1737 \p{Cased} \p{Cased=Y} (4105)
1738 \p{Cased: N*} (Single: \P{Cased}) (1_110_007 plus all
1739 above-Unicode code points: [\x00-
1740 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1741 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
1742 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7], U+01BB,
1743 U+01C0..01C3, U+0294, U+02B9..02BF,
1744 U+02C2..02DF ...)
1745 \p{Cased: Y*} (Single: \p{Cased}) (4105: [A-Za-
1746 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
1747 \xff], U+0100..01BA, U+01BC..01BF,
1748 U+01C4..0293, U+0295..02B8, U+02C0..02C1
1749 ...)
1750 \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
1751 \p{LC}) (3796)
1752 \p{Category: *} \p{General_Category: *}
1753 \p{Caucasian_Albanian} \p{Script_Extensions=Caucasian_Albanian}
1754 (Short: \p{Aghb}; NOT \p{Block=
1755 Caucasian_Albanian}) (53)
1756 \p{Cc} \p{XPosixCntrl} (= \p{General_Category=
1757 Control}) (65)
1758 \p{Ccc: *} \p{Canonical_Combining_Class: *}
1759 \p{CE} \p{Composition_Exclusion} (=
1760 \p{Composition_Exclusion=Y}) (81)
1761 \p{CE: *} \p{Composition_Exclusion: *}
1762 \p{Cf} \p{Format} (= \p{General_Category=Format})
1763 (151)
1764 \p{Chakma} \p{Script_Extensions=Chakma} (Short:
1765 \p{Cakm}; NOT \p{Block=Chakma}) (87)
1766 \p{Cham} \p{Script_Extensions=Cham} (NOT \p{Block=
1767 Cham}) (83)
1768 \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
1769 \p{CWCF}) (1377)
1770 \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
1771 (1_112_735 plus all above-Unicode code
1772 points: [\x00-
1773 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1774 >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\xb4\xb6-
1775 \xbf\xd7\xe0-\xff], U+0101, U+0103,
1776 U+0105, U+0107, U+0109 ...)
1777 \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
1778 (1377: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
1779 U+0100, U+0102, U+0104, U+0106, U+0108
1780 ...)
1781 \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
1782 \p{CWCM}) (2669)
1783 \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
1784 (1_111_443 plus all above-Unicode code
1785 points: [\x00-
1786 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1787 >?\@\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1788 \xbf\xd7\xf7], U+0138, U+018D, U+019B,
1789 U+01AA..01AB, U+01BA..01BB ...)
1790 \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
1791 (2669: [A-Za-z\xb5\xc0-\xd6\xd8-
1792 \xf6\xf8-\xff], U+0100..0137,
1793 U+0139..018C, U+018E..019A,
1794 U+019C..01A9, U+01AC..01B9 ...)
1795 \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
1796 \p{CWL}) (1304)
1797 \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
1798 (1_112_808 plus all above-Unicode code
1799 points: [\x00-
1800 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1801 >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1802 \xbf\xd7\xdf-\xff], U+0101, U+0103,
1803 U+0105, U+0107, U+0109 ...)
1804 \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1304:
1805 [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
1806 U+0104, U+0106, U+0108 ...)
1807 \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
1808 Y} (Short: \p{CWKCF}) (10_227)
1809 \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
1810 \P{CWKCF}) (1_103_885 plus all above-
1811 Unicode code points: [\x00-
1812 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1813 >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
1814 \xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6-
1815 \xb7\xbb\xbf\xd7\xe0-\xff], U+0101,
1816 U+0103, U+0105, U+0107, U+0109 ...)
1817 \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
1818 \p{CWKCF}) (10_227: [A-
1819 Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
1820 \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf],
1821 U+0100, U+0102, U+0104, U+0106, U+0108
1822 ...)
1823 \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
1824 \p{CWT}) (1369)
1825 \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
1826 (1_112_743 plus all above-Unicode code
1827 points: [\x00-
1828 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
1829 Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1830 \xde\xf7], U+0100, U+0102, U+0104,
1831 U+0106, U+0108 ...)
1832 \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1369:
1833 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1834 U+0103, U+0105, U+0107, U+0109 ...)
1835 \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
1836 \p{CWU}) (1396)
1837 \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
1838 (1_112_716 plus all above-Unicode code
1839 points: [\x00-
1840 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
1841 Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1842 \xde\xf7], U+0100, U+0102, U+0104,
1843 U+0106, U+0108 ...)
1844 \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1396:
1845 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1846 U+0103, U+0105, U+0107, U+0109 ...)
1847 \p{Cher} \p{Cherokee} (= \p{Script_Extensions=
1848 Cherokee}) (NOT \p{Block=Cherokee}) (172)
1849 \p{Cherokee} \p{Script_Extensions=Cherokee} (Short:
1850 \p{Cher}; NOT \p{Block=Cherokee}) (172)
1851 X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block=
1852 Cherokee_Supplement}) (80)
1853 X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
1854 \p{InCherokeeSup}) (80)
1855 \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
1856 Y}) (2314)
1857 \p{CI: *} \p{Case_Ignorable: *}
1858 X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
1859 CJK_Unified_Ideographs}) (20_992)
1860 X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
1861 CJK_Compatibility}) (256)
1862 X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
1863 CJK_Compatibility_Forms}) (32)
1864 X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
1865 \p{Block=CJK_Compatibility_Ideographs})
1866 (512)
1867 X \p{CJK_Compat_Ideographs_Sup}
1868 \p{CJK_Compatibility_Ideographs_Supplement}
1869 (= \p{Block=
1870 CJK_Compatibility_Ideographs_Supplement})
1871 (544)
1872 X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
1873 \p{InCJKCompat}) (256)
1874 X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
1875 (Short: \p{InCJKCompatForms}) (32)
1876 X \p{CJK_Compatibility_Ideographs} \p{Block=
1877 CJK_Compatibility_Ideographs} (Short:
1878 \p{InCJKCompatIdeographs}) (512)
1879 X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
1880 CJK_Compatibility_Ideographs_Supplement}
1881 (Short: \p{InCJKCompatIdeographsSup})
1882 (544)
1883 X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
1884 \p{Block=
1885 CJK_Unified_Ideographs_Extension_A})
1886 (6592)
1887 X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
1888 \p{Block=
1889 CJK_Unified_Ideographs_Extension_B})
1890 (42_720)
1891 X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
1892 \p{Block=
1893 CJK_Unified_Ideographs_Extension_C})
1894 (4160)
1895 X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
1896 \p{Block=
1897 CJK_Unified_Ideographs_Extension_D})
1898 (224)
1899 X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (=
1900 \p{Block=
1901 CJK_Unified_Ideographs_Extension_E})
1902 (5776)
1903 X \p{CJK_Ext_F} \p{CJK_Unified_Ideographs_Extension_F} (=
1904 \p{Block=
1905 CJK_Unified_Ideographs_Extension_F})
1906 (7488)
1907 X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
1908 CJK_Radicals_Supplement}) (128)
1909 X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
1910 (Short: \p{InCJKRadicalsSup}) (128)
1911 X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
1912 X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
1913 \p{Block=CJK_Symbols_And_Punctuation})
1914 (64)
1915 X \p{CJK_Symbols_And_Punctuation} \p{Block=
1916 CJK_Symbols_And_Punctuation} (Short:
1917 \p{InCJKSymbols}) (64)
1918 X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
1919 (Short: \p{InCJK}) (20_992)
1920 X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
1921 CJK_Unified_Ideographs_Extension_A}
1922 (Short: \p{InCJKExtA}) (6592)
1923 X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
1924 CJK_Unified_Ideographs_Extension_B}
1925 (Short: \p{InCJKExtB}) (42_720)
1926 X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
1927 CJK_Unified_Ideographs_Extension_C}
1928 (Short: \p{InCJKExtC}) (4160)
1929 X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
1930 CJK_Unified_Ideographs_Extension_D}
1931 (Short: \p{InCJKExtD}) (224)
1932 X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
1933 CJK_Unified_Ideographs_Extension_E}
1934 (Short: \p{InCJKExtE}) (5776)
1935 X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
1936 CJK_Unified_Ideographs_Extension_F}
1937 (Short: \p{InCJKExtF}) (7488)
1938 \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
1939 (Short: \p{Pe}) (73)
1940 \p{Cn} \p{Unassigned} (= \p{General_Category=
1941 Unassigned}) (837_841 plus all above-
1942 Unicode code points)
1943 \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
1944 Control}) (65)
1945 \p{Co} \p{Private_Use} (= \p{General_Category=
1946 Private_Use}) (NOT \p{Private_Use_Area})
1947 (137_468)
1948 X \p{Combining_Diacritical_Marks} \p{Block=
1949 Combining_Diacritical_Marks} (Short:
1950 \p{InDiacriticals}) (112)
1951 X \p{Combining_Diacritical_Marks_Extended} \p{Block=
1952 Combining_Diacritical_Marks_Extended}
1953 (Short: \p{InDiacriticalsExt}) (80)
1954 X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
1955 Combining_Diacritical_Marks_For_Symbols}
1956 (Short: \p{InDiacriticalsForSymbols})
1957 (48)
1958 X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
1959 Combining_Diacritical_Marks_Supplement}
1960 (Short: \p{InDiacriticalsSup}) (64)
1961 X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
1962 \p{InHalfMarks}) (16)
1963 \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
1964 (2177)
1965 X \p{Combining_Marks_For_Symbols}
1966 \p{Combining_Diacritical_Marks_For_Symbols}
1967 (= \p{Block=
1968 Combining_Diacritical_Marks_For_Symbols})
1969 (48)
1970 \p{Common} \p{Script_Extensions=Common} (Short:
1971 \p{Zyyy}) (6947)
1972 X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1973 (Short: \p{InIndicNumberForms}) (16)
1974 \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
1975 \p{Full_Composition_Exclusion=Y}) (1120)
1976 \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
1977 X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
1978 Hangul_Compatibility_Jamo}) (96)
1979 \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
1980 \p{CE}) (81)
1981 \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
1982 plus all above-Unicode code points:
1983 U+0000..0957, U+0960..09DB, U+09DE,
1984 U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
1985 ...)
1986 \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
1987 U+0958..095F, U+09DC..09DD, U+09DF,
1988 U+0A33, U+0A36, U+0A59..0A5B ...)
1989 \p{Connector_Punctuation} \p{General_Category=
1990 Connector_Punctuation} (Short: \p{Pc})
1991 (10)
1992 \p{Control} \p{XPosixCntrl} (= \p{General_Category=
1993 Control}) (65)
1994 X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
1995 \p{Copt} \p{Coptic} (= \p{Script_Extensions=
1996 Coptic}) (NOT \p{Block=Coptic}) (165)
1997 \p{Coptic} \p{Script_Extensions=Coptic} (Short:
1998 \p{Copt}; NOT \p{Block=Coptic}) (165)
1999 X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
2000 X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
2001 Counting_Rod_Numerals}) (32)
2002 X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
2003 \p{InCountingRod}) (32)
2004 \p{Cprt} \p{Cypriot} (= \p{Script_Extensions=
2005 Cypriot}) (112)
2006 \p{Cs} \p{Surrogate} (= \p{General_Category=
2007 Surrogate}) (2048)
2008 \p{Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
2009 \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
2010 X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
2011 \p{Block=
2012 Cuneiform_Numbers_And_Punctuation}) (128)
2013 X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
2014 Cuneiform_Numbers_And_Punctuation}
2015 (Short: \p{InCuneiformNumbers}) (128)
2016 \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
2017 (Short: \p{Sc}) (54)
2018 X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
2019 \p{CWCF} \p{Changes_When_Casefolded} (=
2020 \p{Changes_When_Casefolded=Y}) (1377)
2021 \p{CWCF: *} \p{Changes_When_Casefolded: *}
2022 \p{CWCM} \p{Changes_When_Casemapped} (=
2023 \p{Changes_When_Casemapped=Y}) (2669)
2024 \p{CWCM: *} \p{Changes_When_Casemapped: *}
2025 \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
2026 \p{Changes_When_NFKC_Casefolded=Y})
2027 (10_227)
2028 \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
2029 \p{CWL} \p{Changes_When_Lowercased} (=
2030 \p{Changes_When_Lowercased=Y}) (1304)
2031 \p{CWL: *} \p{Changes_When_Lowercased: *}
2032 \p{CWT} \p{Changes_When_Titlecased} (=
2033 \p{Changes_When_Titlecased=Y}) (1369)
2034 \p{CWT: *} \p{Changes_When_Titlecased: *}
2035 \p{CWU} \p{Changes_When_Uppercased} (=
2036 \p{Changes_When_Uppercased=Y}) (1396)
2037 \p{CWU: *} \p{Changes_When_Uppercased: *}
2038 \p{Cypriot} \p{Script_Extensions=Cypriot} (Short:
2039 \p{Cprt}) (112)
2040 X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
2041 \p{Cyrillic} \p{Script_Extensions=Cyrillic} (Short:
2042 \p{Cyrl}; NOT \p{Block=Cyrillic}) (446)
2043 X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
2044 Cyrillic_Extended_A}) (32)
2045 X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
2046 Cyrillic_Extended_B}) (96)
2047 X \p{Cyrillic_Ext_C} \p{Cyrillic_Extended_C} (= \p{Block=
2048 Cyrillic_Extended_C}) (16)
2049 X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
2050 \p{InCyrillicExtA}) (32)
2051 X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
2052 \p{InCyrillicExtB}) (96)
2053 X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
2054 \p{InCyrillicExtC}) (16)
2055 X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
2056 Cyrillic_Supplement}) (48)
2057 X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
2058 \p{InCyrillicSup}) (48)
2059 X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
2060 Cyrillic_Supplement}) (48)
2061 \p{Cyrl} \p{Cyrillic} (= \p{Script_Extensions=
2062 Cyrillic}) (NOT \p{Block=Cyrillic}) (446)
2063 \p{Dash} \p{Dash=Y} (28)
2064 \p{Dash: N*} (Single: \P{Dash}) (1_114_084 plus all
2065 above-Unicode code points: [\x00-
2066 \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
2067 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2068 U+0100..0589, U+058B..05BD,
2069 U+05BF..13FF, U+1401..1805, U+1807..200F
2070 ...)
2071 \p{Dash: Y*} (Single: \p{Dash}) (28: [\-], U+058A,
2072 U+05BE, U+1400, U+1806, U+2010..2015 ...)
2073 \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
2074 (Short: \p{Pd}) (24)
2075 \p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category=
2076 Decimal_Number}) (590)
2077 \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
2078 (13_232)
2079 \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232:
2080 [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-
2081 \xdd\xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-
2082 \xfd\xff], U+0100..010F, U+0112..0125,
2083 U+0128..0130, U+0134..0137, U+0139..013E
2084 ...)
2085 \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
2086 U+2460..2473, U+24B6..24EA,
2087 U+3244..3247, U+3251..327E,
2088 U+3280..32BF, U+32D0..32FE ...)
2089 \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
2090 \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720:
2091 [\xa8\xaf\xb4-\xb5\xb8], U+0132..0133,
2092 U+013F..0140, U+0149, U+017F,
2093 U+01C4..01CC ...)
2094 \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
2095 \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
2096 \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
2097 U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
2098 ...)
2099 \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184: U+2102,
2100 U+210A..2113, U+2115, U+2119..211D,
2101 U+2124, U+2128 ...)
2102 \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
2103 \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
2104 \xbe], U+2150..215F, U+2189)
2105 \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
2106 \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
2107 U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
2108 ...)
2109 \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
2110 \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
2111 U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
2112 ...)
2113 \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
2114 \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
2115 U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
2116 ...)
2117 \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
2118 \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
2119 U+FF61..FFBE, U+FFC2..FFC7,
2120 U+FFCA..FFCF, U+FFD2..FFD7,
2121 U+FFDA..FFDC, U+FFE8..FFEE)
2122 \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
2123 \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
2124 U+0F0C, U+2007, U+2011, U+202F)
2125 \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
2126 Non_Canonical} (Perl extension) (3662)
2127 \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
2128 decompositions (Short: \p{Dt=NonCanon})
2129 (Perl extension) (3662:
2130 [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-\xba\xbc-
2131 \xbe], U+0132..0133, U+013F..0140,
2132 U+0149, U+017F, U+01C4..01CC ...)
2133 \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_218 plus
2134 all above-Unicode code points: [\x00-
2135 \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
2136 \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
2137 \xdf\xe6\xf0\xf7-\xf8\xfe],
2138 U+0110..0111, U+0126..0127, U+0131,
2139 U+0138, U+0141..0142 ...)
2140 \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
2141 U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
2142 \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
2143 \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (285)
2144 \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (285: U+3250,
2145 U+32CC..32CF, U+3300..3357,
2146 U+3371..33DF, U+33FF, U+1F130..1F14F ...)
2147 \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38: U+1D62..1D6A,
2148 U+2080..208E, U+2090..209C, U+2C7C)
2149 \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (152)
2150 \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (152: [\xaa\xb2-
2151 \xb3\xb9-\xba], U+02B0..02B8,
2152 U+02E0..02E4, U+10FC, U+1D2C..1D2E,
2153 U+1D30..1D3A ...)
2154 \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
2155 \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
2156 U+30FF, U+FE10..FE19, U+FE30..FE44,
2157 U+FE47..FE48)
2158 \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
2159 U+FF01..FF60, U+FFE0..FFE6)
2160 \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
2161 Y} (Short: \p{DI}) (4173)
2162 \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
2163 (1_109_939 plus all above-Unicode code
2164 points: [\x00-\xac\xae-\xff],
2165 U+0100..034E, U+0350..061B,
2166 U+061D..115E, U+1161..17B3, U+17B6..180A
2167 ...)
2168 \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
2169 (4173: [\xad], U+034F, U+061C,
2170 U+115F..1160, U+17B4..17B5, U+180B..180E
2171 ...)
2172 \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (15)
2173 \p{Dep: *} \p{Deprecated: *}
2174 \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (15)
2175 \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_097
2176 plus all above-Unicode code points:
2177 U+0000..0148, U+014A..0672,
2178 U+0674..0F76, U+0F78, U+0F7A..17A2,
2179 U+17A5..2069 ...)
2180 \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
2181 U+0673, U+0F77, U+0F79, U+17A3..17A4,
2182 U+206A..206F ...)
2183 \p{Deseret} \p{Script_Extensions=Deseret} (Short:
2184 \p{Dsrt}) (80)
2185 \p{Deva} \p{Devanagari} (= \p{Script_Extensions=
2186 Devanagari}) (NOT \p{Block=Devanagari})
2187 (210)
2188 \p{Devanagari} \p{Script_Extensions=Devanagari} (Short:
2189 \p{Deva}; NOT \p{Block=Devanagari}) (210)
2190 X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
2191 Devanagari_Extended}) (32)
2192 X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
2193 \p{InDevanagariExt}) (32)
2194 \p{DI} \p{Default_Ignorable_Code_Point} (=
2195 \p{Default_Ignorable_Code_Point=Y})
2196 (4173)
2197 \p{DI: *} \p{Default_Ignorable_Code_Point: *}
2198 \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (798)
2199 \p{Dia: *} \p{Diacritic: *}
2200 \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (798)
2201 \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_314
2202 plus all above-Unicode code points:
2203 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2204 >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
2205 \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
2206 U+0100..02AF, U+034F, U+0358..035C,
2207 U+0363..0373, U+0376..0379 ...)
2208 \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (798:
2209 [\^`\xa8\xaf\xb4\xb7-\xb8],
2210 U+02B0..034E, U+0350..0357,
2211 U+035D..0362, U+0374..0375, U+037A ...)
2212 X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
2213 \p{Block=Combining_Diacritical_Marks})
2214 (112)
2215 X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended}
2216 (= \p{Block=
2217 Combining_Diacritical_Marks_Extended})
2218 (80)
2219 X \p{Diacriticals_For_Symbols}
2220 \p{Combining_Diacritical_Marks_For_Symbols}
2221 (= \p{Block=
2222 Combining_Diacritical_Marks_For_Symbols})
2223 (48)
2224 X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
2225 (= \p{Block=
2226 Combining_Diacritical_Marks_Supplement})
2227 (64)
2228 \p{Digit} \p{XPosixDigit} (= \p{General_Category=
2229 Decimal_Number}) (590)
2230 X \p{Dingbats} \p{Block=Dingbats} (192)
2231 X \p{Domino} \p{Domino_Tiles} (= \p{Block=
2232 Domino_Tiles}) (112)
2233 X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
2234 \p{InDomino}) (112)
2235 \p{Dsrt} \p{Deseret} (= \p{Script_Extensions=
2236 Deseret}) (80)
2237 \p{Dt: *} \p{Decomposition_Type: *}
2238 \p{Dupl} \p{Duployan} (= \p{Script_Extensions=
2239 Duployan}) (NOT \p{Block=Duployan}) (147)
2240 \p{Duployan} \p{Script_Extensions=Duployan} (Short:
2241 \p{Dupl}; NOT \p{Block=Duployan}) (147)
2242 \p{Ea: *} \p{East_Asian_Width: *}
2243 X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
2244 (208)
2245 \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
2246 \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739:
2247 [\xa1\xa4\xa7-\xa8\xaa\xad-\xae\xb0-
2248 \xb4\xb6-\xba\xbc-\xbf\xc6\xd0\xd7-
2249 \xd8\xde-\xe1\xe6\xe8-\xea\xec-
2250 \xed\xf0\xf2-\xf3\xf7-\xfa\xfc\xfe],
2251 U+0101, U+0111, U+0113, U+011B,
2252 U+0126..0127 ...)
2253 \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
2254 \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
2255 U+FF01..FF60, U+FFE0..FFE6)
2256 \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
2257 \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
2258 U+FF61..FFBE, U+FFC2..FFC7,
2259 U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
2260 ...)
2261 \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (793_401 plus
2262 all above-Unicode code points)
2263 \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
2264 \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-
2265 \x7e\xa2-\xa3\xa5-\xa6\xac\xaf],
2266 U+27E6..27ED, U+2985..2986)
2267 \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (793_401 plus all
2268 above-Unicode code points: [\x00-
2269 \x1f\x7f-\xa0\xa9\xab\xb5\xbb\xc0-
2270 \xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd\xe2-
2271 \xe5\xe7\xeb\xee-\xef\xf1\xf4-
2272 \xf6\xfb\xfd\xff], U+0100,
2273 U+0102..0110, U+0112, U+0114..011A,
2274 U+011C..0125 ...)
2275 \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (181_634)
2276 \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (181_634:
2277 U+1100..115F, U+231A..231B,
2278 U+2329..232A, U+23E9..23EC, U+23F0,
2279 U+23F3 ...)
2280 \p{Egyp} \p{Egyptian_Hieroglyphs} (=
2281 \p{Script_Extensions=
2282 Egyptian_Hieroglyphs}) (NOT \p{Block=
2283 Egyptian_Hieroglyphs}) (1071)
2284 \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
2285 Egyptian_Hieroglyphs} (Short: \p{Egyp};
2286 NOT \p{Block=Egyptian_Hieroglyphs})
2287 (1071)
2288 \p{Elba} \p{Elbasan} (= \p{Script_Extensions=
2289 Elbasan}) (NOT \p{Block=Elbasan}) (40)
2290 \p{Elbasan} \p{Script_Extensions=Elbasan} (Short:
2291 \p{Elba}; NOT \p{Block=Elbasan}) (40)
2292 X \p{Emoticons} \p{Block=Emoticons} (80)
2293 X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
2294 Enclosed_Alphanumerics}) (160)
2295 X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
2296 \p{Block=
2297 Enclosed_Alphanumeric_Supplement}) (256)
2298 X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
2299 Enclosed_Alphanumeric_Supplement}
2300 (Short: \p{InEnclosedAlphanumSup}) (256)
2301 X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
2302 (Short: \p{InEnclosedAlphanum}) (160)
2303 X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
2304 \p{Block=
2305 Enclosed_CJK_Letters_And_Months}) (256)
2306 X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
2307 Enclosed_CJK_Letters_And_Months} (Short:
2308 \p{InEnclosedCJK}) (256)
2309 X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
2310 (= \p{Block=
2311 Enclosed_Ideographic_Supplement}) (256)
2312 X \p{Enclosed_Ideographic_Supplement} \p{Block=
2313 Enclosed_Ideographic_Supplement} (Short:
2314 \p{InEnclosedIdeographicSup}) (256)
2315 \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
2316 (Short: \p{Me}) (13)
2317 \p{Ethi} \p{Ethiopic} (= \p{Script_Extensions=
2318 Ethiopic}) (NOT \p{Block=Ethiopic}) (495)
2319 \p{Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
2320 \p{Ethi}; NOT \p{Block=Ethiopic}) (495)
2321 X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
2322 Ethiopic_Extended}) (96)
2323 X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
2324 Ethiopic_Extended_A}) (48)
2325 X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
2326 \p{InEthiopicExt}) (96)
2327 X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
2328 \p{InEthiopicExtA}) (48)
2329 X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
2330 Ethiopic_Supplement}) (32)
2331 X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
2332 \p{InEthiopicSup}) (32)
2333 \p{Ext} \p{Extender} (= \p{Extender=Y}) (44)
2334 \p{Ext: *} \p{Extender: *}
2335 \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (44)
2336 \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_068
2337 plus all above-Unicode code points:
2338 [\x00-\xb6\xb8-\xff], U+0100..02CF,
2339 U+02D2..063F, U+0641..07F9,
2340 U+07FB..0E45, U+0E47..0EC5 ...)
2341 \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (44: [\xb7],
2342 U+02D0..02D1, U+0640, U+07FA, U+0E46,
2343 U+0EC6 ...)
2344 \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
2345 (Short: \p{Pf}) (10)
2346 \p{Format} \p{General_Category=Format} (Short:
2347 \p{Cf}) (151)
2348 \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
2349 (Short: \p{CompEx}) (1120)
2350 \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
2351 \P{CompEx}) (1_112_992 plus all above-
2352 Unicode code points: U+0000..033F,
2353 U+0342, U+0345..0373, U+0375..037D,
2354 U+037F..0386, U+0388..0957 ...)
2355 \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
2356 \p{CompEx}) (1120: U+0340..0341,
2357 U+0343..0344, U+0374, U+037E, U+0387,
2358 U+0958..095F ...)
2359 \p{Gc: *} \p{General_Category: *}
2360 \p{GCB: *} \p{Grapheme_Cluster_Break: *}
2361 \p{General_Category: C} \p{General_Category=Other} (977_573 plus
2362 all above-Unicode code points)
2363 \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
2364 \p{Gc=LC}, \p{LC}) (3796: [A-Za-
2365 z\xb5\xc0-\xd6\xd8-\xf6\xf8-\xff],
2366 U+0100..01BA, U+01BC..01BF,
2367 U+01C4..0293, U+0295..02AF, U+0370..0373
2368 ...)
2369 \p{General_Category: Cc} \p{General_Category=Control} (65)
2370 \p{General_Category: Cf} \p{General_Category=Format} (151)
2371 \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
2372 (73: [\)\]\}], U+0F3B, U+0F3D, U+169C,
2373 U+2046, U+207E ...)
2374 \p{General_Category: Cn} \p{General_Category=Unassigned} (837_841
2375 plus all above-Unicode code points)
2376 \p{General_Category: Cntrl} \p{General_Category=Control} (65)
2377 \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
2378 \p{General_Category: Combining_Mark} \p{General_Category=Mark}
2379 (2177)
2380 \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
2381 \p{Pc}) (10: [_], U+203F..2040, U+2054,
2382 U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
2383 \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
2384 [\x00-\x1f\x7f-\x9f])
2385 \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
2386 \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
2387 (54: [\$\xa2-\xa5], U+058F, U+060B,
2388 U+09F2..09F3, U+09FB, U+0AF1 ...)
2389 \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
2390 (24: [\-], U+058A, U+05BE, U+1400,
2391 U+1806, U+2010..2015 ...)
2392 \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
2393 (590: [0-9], U+0660..0669, U+06F0..06F9,
2394 U+07C0..07C9, U+0966..096F, U+09E6..09EF
2395 ...)
2396 \p{General_Category: Digit} \p{General_Category=Decimal_Number}
2397 (590)
2398 \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
2399 (13: U+0488..0489, U+1ABE, U+20DD..20E0,
2400 U+20E2..20E4, U+A670..A672)
2401 \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
2402 (10: [\xbb], U+2019, U+201D, U+203A,
2403 U+2E03, U+2E05 ...)
2404 \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (151:
2405 [\xad], U+0600..0605, U+061C, U+06DD,
2406 U+070F, U+08E2 ...)
2407 \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
2408 \p{Pi}) (12: [\xab], U+2018,
2409 U+201B..201C, U+201F, U+2039, U+2E02 ...)
2410 \p{General_Category: L} \p{General_Category=Letter} (125_093)
2411 X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3796)
2412 X \p{General_Category: L_} \p{General_Category=Cased_Letter} (Note:
2413 the trailing '_' matters in spite of
2414 the loose matching rules.) (3796)
2415 \p{General_Category: LC} \p{General_Category=Cased_Letter} (3796)
2416 \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (125_093:
2417 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
2418 \xf6\xf8-\xff], U+0100..02C1,
2419 U+02C6..02D1, U+02E0..02E4, U+02EC,
2420 U+02EE ...)
2421 \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
2422 (236: U+16EE..16F0, U+2160..2182,
2423 U+2185..2188, U+3007, U+3021..3029,
2424 U+3038..303A ...)
2425 \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
2426 (1: U+2028)
2427 \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
2428 (/i= General_Category=Cased_Letter)
2429 (2063)
2430 \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (250)
2431 \p{General_Category: Lo} \p{General_Category=Other_Letter}
2432 (121_047)
2433 \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
2434 /i= General_Category=Cased_Letter)
2435 (2063: [a-z\xb5\xdf-\xf6\xf8-\xff],
2436 U+0101, U+0103, U+0105, U+0107, U+0109
2437 ...)
2438 \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
2439 (/i= General_Category=Cased_Letter) (31)
2440 \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
2441 (/i= General_Category=Cased_Letter)
2442 (1702)
2443 \p{General_Category: M} \p{General_Category=Mark} (2177)
2444 \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2177:
2445 U+0300..036F, U+0483..0489,
2446 U+0591..05BD, U+05BF, U+05C1..05C2,
2447 U+05C4..05C5 ...)
2448 \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
2449 [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
2450 U+0606..0608, U+2044, U+2052,
2451 U+207A..207C ...)
2452 \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (401)
2453 \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
2454 \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
2455 (1763)
2456 \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
2457 (250: U+02B0..02C1, U+02C6..02D1,
2458 U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
2459 \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
2460 (121: [\^`\xa8\xaf\xb4\xb8],
2461 U+02C2..02C5, U+02D2..02DF,
2462 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2463 \p{General_Category: N} \p{General_Category=Number} (1502)
2464 \p{General_Category: Nd} \p{General_Category=Decimal_Number} (590)
2465 \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
2466 \p{General_Category: No} \p{General_Category=Other_Number} (676)
2467 \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
2468 (1763: U+0300..036F, U+0483..0487,
2469 U+0591..05BD, U+05BF, U+05C1..05C2,
2470 U+05C4..05C5 ...)
2471 \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1502:
2472 [0-9\xb2-\xb3\xb9\xbc-\xbe],
2473 U+0660..0669, U+06F0..06F9,
2474 U+07C0..07C9, U+0966..096F, U+09E6..09EF
2475 ...)
2476 \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
2477 (75: [\(\[\{], U+0F3A, U+0F3C, U+169B,
2478 U+201A, U+201E ...)
2479 \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (977_573 plus
2480 all above-Unicode code points: [\x00-
2481 \x1f\x7f-\x9f\xad], U+0378..0379,
2482 U+0380..0383, U+038B, U+038D, U+03A2 ...)
2483 \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
2484 (121_047: [\xaa\xba], U+01BB,
2485 U+01C0..01C3, U+0294, U+05D0..05EA,
2486 U+05F0..05F2 ...)
2487 \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
2488 (676: [\xb2-\xb3\xb9\xbc-\xbe],
2489 U+09F4..09F9, U+0B72..0B77,
2490 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
2491 ...)
2492 \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
2493 (566:
2494 [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7\xb6-
2495 \xb7\xbf], U+037E, U+0387, U+055A..055F,
2496 U+0589, U+05C0 ...)
2497 \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
2498 (5855: [\xa6\xa9\xae\xb0], U+0482,
2499 U+058D..058E, U+060E..060F, U+06DE,
2500 U+06E9 ...)
2501 \p{General_Category: P} \p{General_Category=Punctuation} (770)
2502 \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
2503 \p{Zp}) (1: U+2029)
2504 \p{General_Category: Pc} \p{General_Category=
2505 Connector_Punctuation} (10)
2506 \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
2507 \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
2508 (73)
2509 \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
2510 (10)
2511 \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
2512 (12)
2513 \p{General_Category: Po} \p{General_Category=Other_Punctuation}
2514 (566)
2515 \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
2516 (137_468: U+E000..F8FF, U+F0000..FFFFD,
2517 U+100000..10FFFD)
2518 \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
2519 \p{General_Category: Punct} \p{General_Category=Punctuation} (770)
2520 \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (770:
2521 [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_
2522 \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
2523 U+037E, U+0387, U+055A..055F,
2524 U+0589..058A, U+05BE ...)
2525 \p{General_Category: S} \p{General_Category=Symbol} (6978)
2526 \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (54)
2527 \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
2528 [\x20\xa0], U+1680, U+2000..200A,
2529 U+2028..2029, U+202F, U+205F ...)
2530 \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121)
2531 \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
2532 \p{General_Category: So} \p{General_Category=Other_Symbol} (5855)
2533 \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
2534 (17: [\x20\xa0], U+1680, U+2000..200A,
2535 U+202F, U+205F, U+3000)
2536 \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
2537 (401: U+0903, U+093B, U+093E..0940,
2538 U+0949..094C, U+094E..094F, U+0982..0983
2539 ...)
2540 \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
2541 U+D800..DFFF)
2542 \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (6978:
2543 [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
2544 \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
2545 U+02D2..02DF, U+02E5..02EB, U+02ED,
2546 U+02EF..02FF ...)
2547 \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
2548 /i= General_Category=Cased_Letter) (31:
2549 U+01C5, U+01C8, U+01CB, U+01F2,
2550 U+1F88..1F8F, U+1F98..1F9F ...)
2551 \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
2552 (837_841 plus all above-Unicode code
2553 points: U+0378..0379, U+0380..0383,
2554 U+038B, U+038D, U+03A2, U+0530 ...)
2555 \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
2556 /i= General_Category=Cased_Letter)
2557 (1702: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
2558 U+0102, U+0104, U+0106, U+0108 ...)
2559 \p{General_Category: Z} \p{General_Category=Separator} (19)
2560 \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
2561 \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
2562 (1)
2563 \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
2564 X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
2565 \p{InPunctuation}) (112)
2566 X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
2567 X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
2568 \p{Block=Geometric_Shapes_Extended})
2569 (128)
2570 X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
2571 (Short: \p{InGeometricShapesExt}) (128)
2572 \p{Geor} \p{Georgian} (= \p{Script_Extensions=
2573 Georgian}) (NOT \p{Block=Georgian}) (129)
2574 \p{Georgian} \p{Script_Extensions=Georgian} (Short:
2575 \p{Geor}; NOT \p{Block=Georgian}) (129)
2576 X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
2577 Georgian_Supplement}) (48)
2578 X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
2579 \p{InGeorgianSup}) (48)
2580 \p{Glag} \p{Glagolitic} (= \p{Script_Extensions=
2581 Glagolitic}) (NOT \p{Block=Glagolitic})
2582 (136)
2583 \p{Glagolitic} \p{Script_Extensions=Glagolitic} (Short:
2584 \p{Glag}; NOT \p{Block=Glagolitic}) (136)
2585 X \p{Glagolitic_Sup} \p{Glagolitic_Supplement} (= \p{Block=
2586 Glagolitic_Supplement}) (48)
2587 X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
2588 \p{InGlagoliticSup}) (48)
2589 \p{Gonm} \p{Masaram_Gondi} (= \p{Script_Extensions=
2590 Masaram_Gondi}) (NOT \p{Block=
2591 Masaram_Gondi}) (75)
2592 \p{Goth} \p{Gothic} (= \p{Script_Extensions=
2593 Gothic}) (NOT \p{Block=Gothic}) (27)
2594 \p{Gothic} \p{Script_Extensions=Gothic} (Short:
2595 \p{Goth}; NOT \p{Block=Gothic}) (27)
2596 \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
2597 (134_733)
2598 \p{Gr_Base: *} \p{Grapheme_Base: *}
2599 \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
2600 Y}) (1901)
2601 \p{Gr_Ext: *} \p{Grapheme_Extend: *}
2602 \p{Gran} \p{Grantha} (= \p{Script_Extensions=
2603 Grantha}) (NOT \p{Block=Grantha}) (113)
2604 \p{Grantha} \p{Script_Extensions=Grantha} (Short:
2605 \p{Gran}; NOT \p{Block=Grantha}) (113)
2606 \p{Graph} \p{XPosixGraph} (274_139)
2607 \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
2608 (134_733)
2609 \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase}) (979_379
2610 plus all above-Unicode code points:
2611 [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
2612 U+0378..0379, U+0380..0383, U+038B,
2613 U+038D ...)
2614 \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase})
2615 (134_733: [\x20-\x7e\xa0-\xac\xae-\xff],
2616 U+0100..02FF, U+0370..0377,
2617 U+037A..037F, U+0384..038A, U+038C ...)
2618 \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
2619 (5925)
2620 \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (5925:
2621 [^\n\r\x20-\x7e\xa0-\xac\xae-\xff],
2622 U+061C, U+180E, U+200B, U+200E..200F,
2623 U+2028..202E ...)
2624 \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
2625 \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (98:
2626 U+261D, U+26F9, U+270A..270D, U+1F385,
2627 U+1F3C2..1F3C4, U+1F3C7 ...)
2628 \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (4:
2629 U+1F466..1F469)
2630 \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (5:
2631 U+1F3FB..1F3FF)
2632 \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
2633 (98)
2634 \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
2635 E_Base_GAZ} (4)
2636 \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
2637 E_Modifier} (5)
2638 \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
2639 (1901)
2640 \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1901:
2641 U+0300..036F, U+0483..0489,
2642 U+0591..05BD, U+05BF, U+05C1..05C2,
2643 U+05C4..05C5 ...)
2644 \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
2645 Glue_After_Zwj} (22)
2646 \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ})
2647 (22: U+2640, U+2642, U+2695..2696,
2648 U+2708, U+2764, U+1F308 ...)
2649 \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
2650 U+1100..115F, U+A960..A97C)
2651 \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
2652 \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
2653 U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
2654 ...)
2655 \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
2656 U+AC01..AC1B, U+AC1D..AC37,
2657 U+AC39..AC53, U+AC55..AC6F,
2658 U+AC71..AC8B, U+AC8D..ACA7 ...)
2659 \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_232
2660 plus all above-Unicode code points:
2661 [\x20-\x7e\xa0-\xac\xae-\xff],
2662 U+0100..02FF, U+0370..0482,
2663 U+048A..0590, U+05BE, U+05C0 ...)
2664 \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
2665 (19)
2666 \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (19:
2667 U+0600..0605, U+06DD, U+070F, U+08E2,
2668 U+0D4E, U+110BD ...)
2669 \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
2670 (26: U+1F1E6..1F1FF)
2671 \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
2672 Regional_Indicator} (26)
2673 \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
2674 SpacingMark} (348)
2675 \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (348:
2676 U+0903, U+093B, U+093E..0940,
2677 U+0949..094C, U+094E..094F, U+0982..0983
2678 ...)
2679 \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
2680 U+11A8..11FF, U+D7CB..D7FB)
2681 \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
2682 U+1160..11A7, U+D7B0..D7C6)
2683 \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
2684 (1_094_232 plus all above-Unicode code
2685 points)
2686 \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
2687 \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
2688 (1901)
2689 \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_211
2690 plus all above-Unicode code points:
2691 U+0000..02FF, U+0370..0482,
2692 U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
2693 \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1901:
2694 U+0300..036F, U+0483..0489,
2695 U+0591..05BD, U+05BF, U+05C1..05C2,
2696 U+05C4..05C5 ...)
2697 \p{Greek} \p{Script_Extensions=Greek} (Short:
2698 \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
2699 X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
2700 \p{InGreek}) (144)
2701 X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
2702 Greek_Extended}) (256)
2703 X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
2704 \p{InGreekExt}) (256)
2705 \p{Grek} \p{Greek} (= \p{Script_Extensions=Greek})
2706 (NOT \p{Greek_And_Coptic}) (522)
2707 \p{Gujarati} \p{Script_Extensions=Gujarati} (Short:
2708 \p{Gujr}; NOT \p{Block=Gujarati}) (105)
2709 \p{Gujr} \p{Gujarati} (= \p{Script_Extensions=
2710 Gujarati}) (NOT \p{Block=Gujarati}) (105)
2711 \p{Gurmukhi} \p{Script_Extensions=Gurmukhi} (Short:
2712 \p{Guru}; NOT \p{Block=Gurmukhi}) (93)
2713 \p{Guru} \p{Gurmukhi} (= \p{Script_Extensions=
2714 Gurmukhi}) (NOT \p{Block=Gurmukhi}) (93)
2715 X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
2716 \p{Block=Halfwidth_And_Fullwidth_Forms})
2717 (240)
2718 X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
2719 Combining_Half_Marks}) (16)
2720 X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
2721 Halfwidth_And_Fullwidth_Forms} (Short:
2722 \p{InHalfAndFullForms}) (240)
2723 \p{Han} \p{Script_Extensions=Han} (89_507)
2724 \p{Hang} \p{Hangul} (= \p{Script_Extensions=
2725 Hangul}) (NOT \p{Hangul_Syllables})
2726 (11_775)
2727 \p{Hangul} \p{Script_Extensions=Hangul} (Short:
2728 \p{Hang}; NOT \p{Hangul_Syllables})
2729 (11_775)
2730 X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
2731 (Short: \p{InCompatJamo}) (96)
2732 X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
2733 (256)
2734 X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
2735 (Short: \p{InJamoExtA}) (32)
2736 X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
2737 (Short: \p{InJamoExtB}) (80)
2738 \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
2739 (125)
2740 \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
2741 U+1100..115F, U+A960..A97C)
2742 \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
2743 (399)
2744 \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
2745 U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
2746 U+AC8C ...)
2747 \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
2748 LVT_Syllable} (10_773)
2749 \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
2750 (10_773: U+AC01..AC1B, U+AC1D..AC37,
2751 U+AC39..AC53, U+AC55..AC6F,
2752 U+AC71..AC8B, U+AC8D..ACA7 ...)
2753 \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
2754 Not_Applicable} (1_102_583 plus all
2755 above-Unicode code points)
2756 \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
2757 (1_102_583 plus all above-Unicode code
2758 points: U+0000..10FF, U+1200..A95F,
2759 U+A97D..ABFF, U+D7A4..D7AF,
2760 U+D7C7..D7CA, U+D7FC..infinity)
2761 \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
2762 (137)
2763 \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
2764 U+11A8..11FF, U+D7CB..D7FB)
2765 \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
2766 (95)
2767 \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
2768 U+1160..11A7, U+D7B0..D7C6)
2769 X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
2770 \p{InHangul}) (11_184)
2771 \p{Hani} \p{Han} (= \p{Script_Extensions=Han})
2772 (89_507)
2773 \p{Hano} \p{Hanunoo} (= \p{Script_Extensions=
2774 Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
2775 \p{Hanunoo} \p{Script_Extensions=Hanunoo} (Short:
2776 \p{Hano}; NOT \p{Block=Hanunoo}) (23)
2777 \p{Hatr} \p{Hatran} (= \p{Script_Extensions=
2778 Hatran}) (NOT \p{Block=Hatran}) (26)
2779 \p{Hatran} \p{Script_Extensions=Hatran} (Short:
2780 \p{Hatr}; NOT \p{Block=Hatran}) (26)
2781 \p{Hebr} \p{Hebrew} (= \p{Script_Extensions=
2782 Hebrew}) (NOT \p{Block=Hebrew}) (133)
2783 \p{Hebrew} \p{Script_Extensions=Hebrew} (Short:
2784 \p{Hebr}; NOT \p{Block=Hebrew}) (133)
2785 \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2786 \p{Hex: *} \p{Hex_Digit: *}
2787 \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2788 \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
2789 plus all above-Unicode code points:
2790 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
2791 >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
2792 U+0100..FF0F, U+FF1A..FF20,
2793 U+FF27..FF40, U+FF47..infinity)
2794 \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
2795 f], U+FF10..FF19, U+FF21..FF26,
2796 U+FF41..FF46)
2797 X \p{High_Private_Use_Surrogates} \p{Block=
2798 High_Private_Use_Surrogates} (Short:
2799 \p{InHighPUSurrogates}) (128)
2800 X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
2801 \p{Block=High_Private_Use_Surrogates})
2802 (128)
2803 X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
2804 \p{Hira} \p{Hiragana} (= \p{Script_Extensions=
2805 Hiragana}) (NOT \p{Block=Hiragana}) (428)
2806 \p{Hiragana} \p{Script_Extensions=Hiragana} (Short:
2807 \p{Hira}; NOT \p{Block=Hiragana}) (428)
2808 \p{Hluw} \p{Anatolian_Hieroglyphs} (=
2809 \p{Script_Extensions=
2810 Anatolian_Hieroglyphs}) (NOT \p{Block=
2811 Anatolian_Hieroglyphs}) (583)
2812 \p{Hmng} \p{Pahawh_Hmong} (= \p{Script_Extensions=
2813 Pahawh_Hmong}) (NOT \p{Block=
2814 Pahawh_Hmong}) (127)
2815 \p{HorizSpace} \p{XPosixBlank} (18)
2816 \p{Hst: *} \p{Hangul_Syllable_Type: *}
2817 \p{Hung} \p{Old_Hungarian} (= \p{Script_Extensions=
2818 Old_Hungarian}) (NOT \p{Block=
2819 Old_Hungarian}) (108)
2820 D \p{Hyphen} \p{Hyphen=Y} (11)
2821 D \p{Hyphen: N*} Supplanted by Line_Break property values;
2822 see www.unicode.org/reports/tr14
2823 (Single: \P{Hyphen}) (1_114_101 plus all
2824 above-Unicode code points: [\x00-
2825 \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
2826 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-
2827 \xff], U+0100..0589, U+058B..1805,
2828 U+1807..200F, U+2012..2E16, U+2E18..30FA
2829 ...)
2830 D \p{Hyphen: Y*} Supplanted by Line_Break property values;
2831 see www.unicode.org/reports/tr14
2832 (Single: \p{Hyphen}) (11: [\-\xad],
2833 U+058A, U+1806, U+2010..2011, U+2E17,
2834 U+30FB ...)
2835 \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
2836 \p{Ideographic_Description_Characters})
2837 (128_108)
2838 \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (986_004 plus
2839 all above-Unicode code points: [\x00-
2840 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
2841 >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
2842 \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
2843 U+02C2..02C5, U+02D2..02DF,
2844 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2845 \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (128_108:
2846 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
2847 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
2848 U+02C6..02D1, U+02E0..02E4, U+02EC,
2849 U+02EE ...)
2850 \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (125_334)
2851 \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (988_778 plus
2852 all above-Unicode code points: [\x00-
2853 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2854 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
2855 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
2856 U+02C2..02C5, U+02D2..02DF,
2857 U+02E5..02EB, U+02ED, U+02EF..036F ...)
2858 \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (125_334: [A-
2859 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
2860 \xff], U+0100..02C1, U+02C6..02D1,
2861 U+02E0..02E4, U+02EC, U+02EE ...)
2862 \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
2863 \p{Ideographic_Description_Characters})
2864 (128_108)
2865 \p{IDC: *} \p{ID_Continue: *}
2866 \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
2867 (96_174)
2868 \p{Ideo: *} \p{Ideographic: *}
2869 \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
2870 (96_174)
2871 \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_017_938
2872 plus all above-Unicode code points:
2873 U+0000..3005, U+3008..3020,
2874 U+302A..3037, U+303B..33FF,
2875 U+4DB6..4DFF, U+9FEB..F8FF ...)
2876 \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (96_174:
2877 U+3006..3007, U+3021..3029,
2878 U+3038..303A, U+3400..4DB5,
2879 U+4E00..9FEA, U+F900..FA6D ...)
2880 X \p{Ideographic_Description_Characters} \p{Block=
2881 Ideographic_Description_Characters}
2882 (Short: \p{InIDC}) (16)
2883 X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
2884 \p{Block=
2885 Ideographic_Symbols_And_Punctuation})
2886 (32)
2887 X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
2888 Ideographic_Symbols_And_Punctuation}
2889 (Short: \p{InIdeographicSymbols}) (32)
2890 \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (125_334)
2891 \p{IDS: *} \p{ID_Start: *}
2892 \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
2893 \p{IDSB}) (10)
2894 \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
2895 (1_114_102 plus all above-Unicode code
2896 points: U+0000..2FEF, U+2FF2..2FF3,
2897 U+2FFC..infinity)
2898 \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
2899 U+2FF0..2FF1, U+2FF4..2FFB)
2900 \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
2901 \p{IDST}) (2)
2902 \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
2903 (1_114_110 plus all above-Unicode code
2904 points: U+0000..2FF1, U+2FF4..infinity)
2905 \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
2906 U+2FF2..2FF3)
2907 \p{IDSB} \p{IDS_Binary_Operator} (=
2908 \p{IDS_Binary_Operator=Y}) (10)
2909 \p{IDSB: *} \p{IDS_Binary_Operator: *}
2910 \p{IDST} \p{IDS_Trinary_Operator} (=
2911 \p{IDS_Trinary_Operator=Y}) (2)
2912 \p{IDST: *} \p{IDS_Trinary_Operator: *}
2913 \p{Imperial_Aramaic} \p{Script_Extensions=Imperial_Aramaic}
2914 (Short: \p{Armi}; NOT \p{Block=
2915 Imperial_Aramaic}) (31)
2916 \p{In: *} \p{Present_In: *} (Perl extension)
2917 X \p{In_*} \p{Block: *}
2918 X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
2919 Common_Indic_Number_Forms}) (16)
2920 \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
2921 (330: U+093C, U+0941..0944, U+094D,
2922 U+0952, U+0956..0957, U+0962..0963 ...)
2923 \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
2924 BottomAndLeft}) (1: U+A9BF)
2925 \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
2926 BottomAndRight}) (2: U+1B3B, U+A9C0)
2927 \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (57:
2928 U+093F, U+094E, U+09BF, U+09C7..09C8,
2929 U+0A3F, U+0ABF ...)
2930 \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
2931 LeftAndRight}) (21: U+09CB..09CC,
2932 U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
2933 U+0DDC, U+0DDE ...)
2934 \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_996
2935 plus all above-Unicode code points:
2936 U+0000..08FF, U+0904..0939, U+093D,
2937 U+0950, U+0958..0961, U+0964..0980 ...)
2938 \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
2939 Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
2940 U+10A01, U+10A06)
2941 \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (262:
2942 U+0903, U+093B, U+093E, U+0940,
2943 U+0949..094C, U+094F ...)
2944 \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (380:
2945 U+0900..0902, U+093A, U+0945..0948,
2946 U+0951, U+0953..0955, U+0981 ...)
2947 \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
2948 TopAndBottom}) (10: U+0C48, U+0F73,
2949 U+0F76..0F79, U+0F81, U+1B3C,
2950 U+1112E..1112F)
2951 \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
2952 \p{InPC=TopAndBottomAndRight}) (1:
2953 U+1B3D)
2954 \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
2955 TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
2956 U+1C29, U+114BB, U+115B9)
2957 \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
2958 \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
2959 U+0DDD, U+17BF, U+115BB)
2960 \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
2961 TopAndRight}) (13: U+0AC9, U+0B57,
2962 U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
2963 U+1925..1926 ...)
2964 \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
2965 VisualOrderLeft}) (19: U+0E40..0E44,
2966 U+0EC0..0EC4, U+19B5..19B7, U+19BA,
2967 U+AAB5..AAB6, U+AAB9 ...)
2968 \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
2969 (15: U+093D, U+09BD, U+0ABD, U+0B3D,
2970 U+0C3D, U+0CBD ...)
2971 \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (80:
2972 U+0900..0902, U+0981..0982, U+09FC,
2973 U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
2974 \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
2975 \p{InSC=BrahmiJoiningNumber}) (20:
2976 U+11052..11065)
2977 \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
2978 CantillationMark}) (57: U+0951..0952,
2979 U+0AFA..0AFC, U+1CD0..1CD2,
2980 U+1CD4..1CE1, U+1CF4, U+1CF7..1CF9 ...)
2981 \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
2982 (2024: U+0915..0939, U+0958..095F,
2983 U+0978..097F, U+0995..09A8,
2984 U+09AA..09B0, U+09B2 ...)
2985 \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
2986 ConsonantDead}) (10: U+09CE,
2987 U+0D54..0D56, U+0D7A..0D7F)
2988 \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
2989 ConsonantFinal}) (68: U+1930..1931,
2990 U+1933..1939, U+19C1..19C7,
2991 U+1A58..1A59, U+1BBE..1BBF, U+1BF0..1BF1
2992 ...)
2993 \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
2994 \p{InSC=ConsonantHeadLetter}) (5:
2995 U+0F88..0F8C)
2996 \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
2997 ConsonantKiller}) (2: U+0E4C, U+17CD)
2998 \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
2999 ConsonantMedial}) (27: U+0A75,
3000 U+0EBC..0EBD, U+103B..103E,
3001 U+105E..1060, U+1082, U+1A55..1A56 ...)
3002 \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
3003 \p{InSC=ConsonantPlaceholder}) (18: [\-
3004 \xa0\xd7], U+0A72..0A73, U+104E, U+1900,
3005 U+2010..2014, U+25CC ...)
3006 \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
3007 \p{InSC=ConsonantPrecedingRepha}) (2:
3008 U+0D4E, U+11D46)
3009 \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
3010 ConsonantPrefixed}) (7: U+111C2..111C3,
3011 U+11A3A, U+11A86..11A89)
3012 \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
3013 ConsonantSubjoined}) (95: U+0F8D..0F97,
3014 U+0F99..0FBC, U+1929..192B, U+1A57,
3015 U+1A5B..1A5E, U+1BA1..1BA3 ...)
3016 \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
3017 \p{InSC=ConsonantSucceedingRepha}) (5:
3018 U+17CC, U+1A5A, U+1B03, U+1B81, U+A982)
3019 \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
3020 \p{InSC=ConsonantWithStacker}) (4:
3021 U+0CF1..0CF2, U+11003..11004)
3022 \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
3023 GeminationMark}) (3: U+0A71, U+11237,
3024 U+11A98)
3025 \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
3026 InvisibleStacker}) (10: U+1039, U+17D2,
3027 U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
3028 \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
3029 U+200D)
3030 \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
3031 ModifyingLetter}) (1: U+0B83)
3032 \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
3033 NonJoiner}) (1: U+200C)
3034 \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (28:
3035 U+093C, U+09BC, U+0A3C, U+0ABC,
3036 U+0AFD..0AFF, U+0B3C ...)
3037 \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (469:
3038 [0-9], U+0966..096F, U+09E6..09EF,
3039 U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
3040 ...)
3041 \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
3042 NumberJoiner}) (1: U+1107F)
3043 \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
3044 (1_109_901 plus all above-Unicode code
3045 points: [\x00-
3046 \x20!\"#\$\%&\'\(\)*+,.\/:;<=>?\@A-
3047 Z\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
3048 \xb1\xb4-\xd6\xd8-\xff], U+0100..08FF,
3049 U+0950, U+0953..0954, U+0964..0965,
3050 U+0970..0971 ...)
3051 \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
3052 PureKiller}) (21: U+0D3B..0D3C, U+0E3A,
3053 U+0E4E, U+0F84, U+103A, U+1714 ...)
3054 \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
3055 RegisterShifter}) (2: U+17C9..17CA)
3056 \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
3057 SyllableModifier}) (22: [\xb2-\xb3],
3058 U+0F35, U+0F37, U+0FC6, U+17CB,
3059 U+17CE..17D0 ...)
3060 \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
3061 ToneLetter}) (7: U+1970..1974, U+AAC0,
3062 U+AAC2)
3063 \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
3064 (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
3065 U+1063..1064, U+1069..106D, U+1087..108D
3066 ...)
3067 \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (24:
3068 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3069 U+0BCD ...)
3070 \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3071 (34: U+0903, U+0983, U+0A03, U+0A83,
3072 U+0B03, U+0C03 ...)
3073 \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3074 U+1963..196D, U+A85E..A861, U+A866,
3075 U+A922..A92A, U+11150..11154)
3076 \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3077 VowelDependent}) (633: U+093A..093B,
3078 U+093E..094C, U+094E..094F,
3079 U+0955..0957, U+0962..0963, U+09BE..09C4
3080 ...)
3081 \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3082 VowelIndependent}) (443: U+0904..0914,
3083 U+0960..0961, U+0972..0977,
3084 U+0985..098C, U+098F..0990, U+0993..0994
3085 ...)
3086 \p{Inherited} \p{Script_Extensions=Inherited} (Short:
3087 \p{Zinh}) (500)
3088 \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
3089 (Short: \p{Pi}) (12)
3090 \p{InPC: *} \p{Indic_Positional_Category: *}
3091 \p{InSC: *} \p{Indic_Syllabic_Category: *}
3092 \p{Inscriptional_Pahlavi} \p{Script_Extensions=
3093 Inscriptional_Pahlavi} (Short: \p{Phli};
3094 NOT \p{Block=Inscriptional_Pahlavi}) (27)
3095 \p{Inscriptional_Parthian} \p{Script_Extensions=
3096 Inscriptional_Parthian} (Short:
3097 \p{Prti}; NOT \p{Block=
3098 Inscriptional_Parthian}) (30)
3099 X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
3100 IPA_Extensions}) (96)
3101 X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
3102 \p{InIPAExt}) (96)
3103 \p{Is_*} \p{*} (Any exceptions are individually
3104 noted beginning with the word NOT.) If
3105 an entry has flag(s) at its beginning,
3106 like "D", the "Is_" form has the same
3107 flag(s).
3108 \p{Ital} \p{Old_Italic} (= \p{Script_Extensions=
3109 Old_Italic}) (NOT \p{Block=Old_Italic})
3110 (39)
3111 X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
3112 (256)
3113 X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
3114 Hangul_Jamo_Extended_A}) (32)
3115 X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
3116 Hangul_Jamo_Extended_B}) (80)
3117 \p{Java} \p{Javanese} (= \p{Script_Extensions=
3118 Javanese}) (NOT \p{Block=Javanese}) (91)
3119 \p{Javanese} \p{Script_Extensions=Javanese} (Short:
3120 \p{Java}; NOT \p{Block=Javanese}) (91)
3121 \p{Jg: *} \p{Joining_Group: *}
3122 \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
3123 \p{Join_C: *} \p{Join_Control: *}
3124 \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
3125 \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
3126 plus all above-Unicode code points:
3127 U+0000..200B, U+200E..infinity)
3128 \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2:
3129 U+200C..200D)
3130 \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
3131 U+08BB)
3132 \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
3133 U+08BD)
3134 \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (1:
3135 U+08BC)
3136 \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (8: U+0639..063A,
3137 U+06A0, U+06FC, U+075D..075F, U+08B3)
3138 \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
3139 \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10: U+0622..0623,
3140 U+0625, U+0627, U+0671..0673, U+0675,
3141 U+0773..0774)
3142 \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (24: U+0628,
3143 U+062A..062B, U+066E, U+0679..0680,
3144 U+0750..0756, U+08A0..08A1 ...)
3145 \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
3146 \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
3147 BurushaskiYehBarree}) (2: U+077A..077B)
3148 \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15: U+062F..0630,
3149 U+0688..0690, U+06EE, U+0759..075A,
3150 U+08AE)
3151 \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
3152 U+0715..0716, U+072A, U+072F)
3153 \p{Joining_Group: E} (Short: \p{Jg=E}) (1: U+0725)
3154 \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
3155 U+063D..063F, U+06CC, U+06CE,
3156 U+0775..0776)
3157 \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1: U+074F)
3158 \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10: U+0641,
3159 U+06A1..06A6, U+0760..0761, U+08A4)
3160 \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
3161 U+0724)
3162 \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (14: U+063B..063C,
3163 U+06A9, U+06AB, U+06AF..06B4,
3164 U+0762..0764, U+08B0)
3165 \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
3166 U+072E)
3167 \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (18: U+062C..062E,
3168 U+0681..0687, U+06BF, U+0757..0758,
3169 U+076E..076F, U+0772 ...)
3170 \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
3171 HamzaOnHehGoal}) (1: U+06C3)
3172 \p{Joining_Group: He} (Short: \p{Jg=He}) (1: U+0717)
3173 \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1: U+0647)
3174 \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
3175 U+06C1..06C2)
3176 \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
3177 \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6: U+0643,
3178 U+06AC..06AE, U+077F, U+08B4)
3179 \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
3180 \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
3181 \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
3182 U+06BE, U+06FF)
3183 \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (7: U+0644,
3184 U+06B5..06B8, U+076A, U+08A6)
3185 \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
3186 \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
3187 U+0866)
3188 \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
3189 U+0861)
3190 \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
3191 U+0868)
3192 \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
3193 (1: U+0869)
3194 \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
3195 U+0860)
3196 \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
3197 U+0864)
3198 \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
3199 (1: U+0865)
3200 \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
3201 U+0862)
3202 \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
3203 U+0867)
3204 \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
3205 U+086A)
3206 \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
3207 U+0863)
3208 \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
3209 ManichaeanAleph}) (1: U+10AC0)
3210 \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
3211 (2: U+10AD9..10ADA)
3212 \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
3213 (2: U+10AC1..10AC2)
3214 \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
3215 ManichaeanDaleth}) (1: U+10AC5)
3216 \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
3217 ManichaeanDhamedh}) (1: U+10AD4)
3218 \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
3219 (1: U+10AEC)
3220 \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
3221 ManichaeanGimel}) (2: U+10AC3..10AC4)
3222 \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
3223 (1: U+10ACD)
3224 \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
3225 ManichaeanHundred}) (1: U+10AEF)
3226 \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
3227 (3: U+10AD0..10AD2)
3228 \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
3229 ManichaeanLamedh}) (1: U+10AD3)
3230 \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
3231 (1: U+10AD6)
3232 \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
3233 (1: U+10AD7)
3234 \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
3235 (1: U+10AEB)
3236 \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
3237 U+10ADB..10ADC)
3238 \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
3239 (3: U+10ADE..10AE0)
3240 \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
3241 (1: U+10AE1)
3242 \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
3243 ManichaeanSadhe}) (1: U+10ADD)
3244 \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
3245 ManichaeanSamekh}) (1: U+10AD8)
3246 \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
3247 (1: U+10AE4)
3248 \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
3249 (1: U+10AED)
3250 \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
3251 (1: U+10ACE)
3252 \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
3253 ManichaeanThamedh}) (1: U+10AD5)
3254 \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
3255 ManichaeanTwenty}) (1: U+10AEE)
3256 \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
3257 (1: U+10AC7)
3258 \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
3259 (1: U+10ACF)
3260 \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
3261 ManichaeanZayin}) (2: U+10AC9..10ACA)
3262 \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
3263 U+0765..0766, U+08A7)
3264 \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1: U+0721)
3265 \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
3266 (1_113_807 plus all above-Unicode code
3267 points: U+0000..061F, U+0621, U+0640,
3268 U+064B..066D, U+0670, U+0674 ...)
3269 \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8: U+0646,
3270 U+06B9..06BC, U+0767..0769)
3271 \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1: U+0722)
3272 \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1: U+06BD)
3273 \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1: U+0726)
3274 \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5: U+0642, U+066F,
3275 U+06A7..06A8, U+08A5)
3276 \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
3277 \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (19: U+0631..0632,
3278 U+0691..0699, U+06EF, U+075B,
3279 U+076B..076C, U+0771 ...)
3280 \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
3281 U+0727)
3282 \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
3283 U+08AC)
3284 \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6: U+0635..0636,
3285 U+069D..069E, U+06FB, U+08AF)
3286 \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
3287 \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
3288 U+069A..069C, U+06FA, U+075C, U+076D,
3289 U+0770 ...)
3290 \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
3291 \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
3292 \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
3293 U+08B1)
3294 \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
3295 \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
3296 \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4: U+0637..0638,
3297 U+069F, U+08A3)
3298 \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1: U+072C)
3299 \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
3300 U+0629, U+06C0, U+06D5)
3301 \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
3302 Hamza_On_Heh_Goal} (1)
3303 \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
3304 \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
3305 U+0676..0677, U+06C4..06CB, U+06CF,
3306 U+0778..0779 ...)
3307 \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
3308 U+0649..064A, U+0678, U+06D0..06D1,
3309 U+0777 ...)
3310 \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
3311 U+06D2..06D3)
3312 \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
3313 U+06CD)
3314 \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
3315 \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
3316 \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
3317 \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
3318 \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (4)
3319 \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (507)
3320 \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (507: U+0620,
3321 U+0626, U+0628, U+062A..062E,
3322 U+0633..063F, U+0641..0647 ...)
3323 \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4: U+0640,
3324 U+07FA, U+180A, U+200D)
3325 \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (3)
3326 \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (3: U+A872,
3327 U+10ACD, U+10AD7)
3328 \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_571 plus
3329 all above-Unicode code points: [\x00-
3330 \xac\xae-\xff], U+0100..02FF,
3331 U+0370..0482, U+048A..0590, U+05BE,
3332 U+05C0 ...)
3333 \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (115)
3334 \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (115:
3335 U+0622..0625, U+0627, U+0629,
3336 U+062F..0632, U+0648, U+0671..0673 ...)
3337 \p{Joining_Type: T} \p{Joining_Type=Transparent} (1912)
3338 \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1912: [\xad],
3339 U+0300..036F, U+0483..0489,
3340 U+0591..05BD, U+05BF, U+05C1..05C2 ...)
3341 \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_571
3342 plus all above-Unicode code points)
3343 \p{Jt: *} \p{Joining_Type: *}
3344 \p{Kaithi} \p{Script_Extensions=Kaithi} (Short:
3345 \p{Kthi}; NOT \p{Block=Kaithi}) (86)
3346 \p{Kali} \p{Kayah_Li} (= \p{Script_Extensions=
3347 Kayah_Li}) (48)
3348 \p{Kana} \p{Katakana} (= \p{Script_Extensions=
3349 Katakana}) (NOT \p{Block=Katakana}) (352)
3350 X \p{Kana_Ext_A} \p{Kana_Extended_A} (= \p{Block=
3351 Kana_Extended_A}) (48)
3352 X \p{Kana_Extended_A} \p{Block=Kana_Extended_A} (Short:
3353 \p{InKanaExtA}) (48)
3354 X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
3355 Kana_Supplement}) (256)
3356 X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
3357 \p{InKanaSup}) (256)
3358 X \p{Kanbun} \p{Block=Kanbun} (16)
3359 X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
3360 Kangxi_Radicals}) (224)
3361 X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
3362 \p{InKangxi}) (224)
3363 \p{Kannada} \p{Script_Extensions=Kannada} (Short:
3364 \p{Knda}; NOT \p{Block=Kannada}) (100)
3365 \p{Katakana} \p{Script_Extensions=Katakana} (Short:
3366 \p{Kana}; NOT \p{Block=Katakana}) (352)
3367 X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
3368 \p{Block=Katakana_Phonetic_Extensions})
3369 (16)
3370 X \p{Katakana_Phonetic_Extensions} \p{Block=
3371 Katakana_Phonetic_Extensions} (Short:
3372 \p{InKatakanaExt}) (16)
3373 \p{Kayah_Li} \p{Script_Extensions=Kayah_Li} (Short:
3374 \p{Kali}) (48)
3375 \p{Khar} \p{Kharoshthi} (= \p{Script_Extensions=
3376 Kharoshthi}) (NOT \p{Block=Kharoshthi})
3377 (65)
3378 \p{Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
3379 \p{Khar}; NOT \p{Block=Kharoshthi}) (65)
3380 \p{Khmer} \p{Script_Extensions=Khmer} (Short:
3381 \p{Khmr}; NOT \p{Block=Khmer}) (146)
3382 X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
3383 \p{Khmr} \p{Khmer} (= \p{Script_Extensions=Khmer})
3384 (NOT \p{Block=Khmer}) (146)
3385 \p{Khoj} \p{Khojki} (= \p{Script_Extensions=
3386 Khojki}) (NOT \p{Block=Khojki}) (72)
3387 \p{Khojki} \p{Script_Extensions=Khojki} (Short:
3388 \p{Khoj}; NOT \p{Block=Khojki}) (72)
3389 \p{Khudawadi} \p{Script_Extensions=Khudawadi} (Short:
3390 \p{Sind}; NOT \p{Block=Khudawadi}) (81)
3391 \p{Knda} \p{Kannada} (= \p{Script_Extensions=
3392 Kannada}) (NOT \p{Block=Kannada}) (100)
3393 \p{Kthi} \p{Kaithi} (= \p{Script_Extensions=
3394 Kaithi}) (NOT \p{Block=Kaithi}) (86)
3395 \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
3396 (125_093)
3397 X \p{L&} \p{Cased_Letter} (= \p{General_Category=
3398 Cased_Letter}) (3796)
3399 X \p{L_} \p{Cased_Letter} (= \p{General_Category=
3400 Cased_Letter}) Note the trailing '_'
3401 matters in spite of loose matching
3402 rules. (3796)
3403 \p{Lana} \p{Tai_Tham} (= \p{Script_Extensions=
3404 Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
3405 \p{Lao} \p{Script_Extensions=Lao} (NOT \p{Block=
3406 Lao}) (67)
3407 \p{Laoo} \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
3408 \p{Block=Lao}) (67)
3409 \p{Latin} \p{Script_Extensions=Latin} (Short:
3410 \p{Latn}) (1370)
3411 X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
3412 Latin_1_Supplement}) (128)
3413 X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
3414 Latin_1_Supplement}) (128)
3415 X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
3416 \p{InLatin1}) (128)
3417 X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
3418 Latin_Extended_A}) (128)
3419 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
3420 \p{Block=Latin_Extended_Additional})
3421 (256)
3422 X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
3423 Latin_Extended_B}) (208)
3424 X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
3425 Latin_Extended_C}) (32)
3426 X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
3427 Latin_Extended_D}) (224)
3428 X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block=
3429 Latin_Extended_E}) (64)
3430 X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
3431 \p{InLatinExtA}) (128)
3432 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
3433 (Short: \p{InLatinExtAdditional}) (256)
3434 X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
3435 \p{InLatinExtB}) (208)
3436 X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
3437 \p{InLatinExtC}) (32)
3438 X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
3439 \p{InLatinExtD}) (224)
3440 X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short:
3441 \p{InLatinExtE}) (64)
3442 \p{Latn} \p{Latin} (= \p{Script_Extensions=Latin})
3443 (1370)
3444 \p{Lb: *} \p{Line_Break: *}
3445 \p{LC} \p{Cased_Letter} (= \p{General_Category=
3446 Cased_Letter}) (3796)
3447 \p{Lepc} \p{Lepcha} (= \p{Script_Extensions=
3448 Lepcha}) (NOT \p{Block=Lepcha}) (74)
3449 \p{Lepcha} \p{Script_Extensions=Lepcha} (Short:
3450 \p{Lepc}; NOT \p{Block=Lepcha}) (74)
3451 \p{Letter} \p{General_Category=Letter} (Short: \p{L})
3452 (125_093)
3453 \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
3454 \p{Nl}) (236)
3455 X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
3456 \p{Limb} \p{Limbu} (= \p{Script_Extensions=Limbu})
3457 (NOT \p{Block=Limbu}) (69)
3458 \p{Limbu} \p{Script_Extensions=Limbu} (Short:
3459 \p{Limb}; NOT \p{Block=Limbu}) (69)
3460 \p{Lina} \p{Linear_A} (= \p{Script_Extensions=
3461 Linear_A}) (NOT \p{Block=Linear_A}) (386)
3462 \p{Linb} \p{Linear_B} (= \p{Script_Extensions=
3463 Linear_B}) (268)
3464 \p{Line_Break: AI} \p{Line_Break=Ambiguous} (707)
3465 \p{Line_Break: AL} \p{Line_Break=Alphabetic} (19_690)
3466 \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (19_690: [#&*<=>\@A-
3467 Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
3468 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
3469 U+02CE..02CF, U+02D1..02D7, U+02DC,
3470 U+02DE ...)
3471 \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-
3472 \xa8\xaa\xb2-\xb3\xb6-\xba\xbc-
3473 \xbe\xd7\xf7], U+02C7, U+02C9..02CB,
3474 U+02CD, U+02D0, U+02D8..02DB ...)
3475 \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
3476 \p{Line_Break: BA} \p{Line_Break=Break_After} (232)
3477 \p{Line_Break: BB} \p{Line_Break=Break_Before} (42)
3478 \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
3479 \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (232: [\t\|\xad],
3480 U+058A, U+05BE, U+0964..0965,
3481 U+0E5A..0E5B, U+0F0B ...)
3482 \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (42: [\xb4],
3483 U+02C8, U+02CC, U+02DF, U+0F01..0F04,
3484 U+0F06..0F07 ...)
3485 \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
3486 U+2E3A..2E3B)
3487 \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
3488 \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
3489 \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
3490 \p{Line_Break: CJ} \p{Line_Break=
3491 Conditional_Japanese_Starter} (51)
3492 \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (90)
3493 \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
3494 \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (90: [\}],
3495 U+0F3B, U+0F3D, U+169C, U+2046, U+207E
3496 ...)
3497 \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (2170)
3498 \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2170:
3499 [^\t\n\cK\f\r\x20-\x7e\x85\xa0-\xff],
3500 U+0300..034E, U+0350..035B,
3501 U+0363..036F, U+0483..0489, U+0591..05BD
3502 ...)
3503 \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (734:
3504 U+0E01..0E3A, U+0E40..0E4E,
3505 U+0E81..0E82, U+0E84, U+0E87..0E88,
3506 U+0E8A ...)
3507 \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
3508 (51: U+3041, U+3043, U+3045, U+3047,
3509 U+3049, U+3063 ...)
3510 \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
3511 \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
3512 \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
3513 \p{Line_Break: E_Base} (Short: \p{Lb=EB}) (102: U+261D, U+26F9,
3514 U+270A..270D, U+1F385, U+1F3C2..1F3C4,
3515 U+1F3C7 ...)
3516 \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
3517 \p{Line_Break: EB} \p{Line_Break=E_Base} (102)
3518 \p{Line_Break: EM} \p{Line_Break=E_Modifier} (5)
3519 \p{Line_Break: EX} \p{Line_Break=Exclamation} (37)
3520 \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37: [!?], U+05C6,
3521 U+061B, U+061E..061F, U+06D4, U+07F9 ...)
3522 \p{Line_Break: GL} \p{Line_Break=Glue} (18)
3523 \p{Line_Break: Glue} (Short: \p{Lb=GL}) (18: [\xa0], U+034F,
3524 U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
3525 \p{Line_Break: H2} (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
3526 U+AC38, U+AC54, U+AC70, U+AC8C ...)
3527 \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
3528 U+AC1D..AC37, U+AC39..AC53,
3529 U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
3530 ...)
3531 \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (74:
3532 U+05D0..05EA, U+05F0..05F2, U+FB1D,
3533 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
3534 ...)
3535 \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (74)
3536 \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
3537 \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1: [\-])
3538 \p{Line_Break: ID} \p{Line_Break=Ideographic} (172_784)
3539 \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_784:
3540 U+231A..231B, U+23F0..23F3,
3541 U+2600..2603, U+2614..2615, U+2618,
3542 U+261A..261C ...)
3543 \p{Line_Break: IN} \p{Line_Break=Inseparable} (6)
3544 \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
3545 U+037E, U+0589, U+060C..060D, U+07F8,
3546 U+2044 ...)
3547 \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
3548 U+22EF, U+FE19, U+10AF6)
3549 \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
3550 \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
3551 \p{Line_Break: JL} (Short: \p{Lb=JL}) (125: U+1100..115F,
3552 U+A960..A97C)
3553 \p{Line_Break: JT} (Short: \p{Lb=JT}) (137: U+11A8..11FF,
3554 U+D7CB..D7FB)
3555 \p{Line_Break: JV} (Short: \p{Lb=JV}) (95: U+1160..11A7,
3556 U+D7B0..D7C6)
3557 \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
3558 \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
3559 \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
3560 U+2028..2029)
3561 \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
3562 \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
3563 \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (31: U+17D6,
3564 U+203C..203D, U+2047..2049, U+3005,
3565 U+301C, U+303B..303C ...)
3566 \p{Line_Break: NS} \p{Line_Break=Nonstarter} (31)
3567 \p{Line_Break: NU} \p{Line_Break=Numeric} (582)
3568 \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (582: [0-9],
3569 U+0660..0669, U+066B..066C,
3570 U+06F0..06F9, U+07C0..07C9, U+0966..096F
3571 ...)
3572 \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (87)
3573 \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (87:
3574 [\(\[\{\xa1\xbf], U+0F3A, U+0F3C,
3575 U+169B, U+201A, U+201E ...)
3576 \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (30)
3577 \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (30:
3578 [\%\xa2\xb0], U+0609..060B, U+066A,
3579 U+09F2..09F3, U+09F9, U+0D79 ...)
3580 \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (65)
3581 \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (65: [\$+\\\xa3-
3582 \xa5\xb1], U+058F, U+09FB, U+0AF1,
3583 U+0BF9, U+0E3F ...)
3584 \p{Line_Break: QU} \p{Line_Break=Quotation} (39)
3585 \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
3586 U+2018..2019, U+201B..201D, U+201F,
3587 U+2039..203A, U+275B..2760 ...)
3588 \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
3589 U+1F1E6..1F1FF)
3590 \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
3591 \p{Line_Break: SA} \p{Line_Break=Complex_Context} (734)
3592 D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
3593 \p{Line_Break: SP} \p{Line_Break=Space} (1)
3594 \p{Line_Break: Space} (Short: \p{Lb=SP}) (1: [\x20])
3595 D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
3596 should never appear in well-formed text,
3597 and therefore shouldn't be the basis for
3598 line breaking (Short: \p{Lb=SG}) (2048:
3599 U+D800..DFFF)
3600 \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
3601 \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (902_900 plus all
3602 above-Unicode code points: U+0378..0379,
3603 U+0380..0383, U+038B, U+038D, U+03A2,
3604 U+0530 ...)
3605 \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
3606 \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
3607 \p{Line_Break: XX} \p{Line_Break=Unknown} (902_900 plus all
3608 above-Unicode code points)
3609 \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
3610 \p{Line_Break: ZWJ} (Short: \p{Lb=ZWJ}) (1: U+200D)
3611 \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
3612 \p{Line_Separator} \p{General_Category=Line_Separator}
3613 (Short: \p{Zl}) (1)
3614 \p{Linear_A} \p{Script_Extensions=Linear_A} (Short:
3615 \p{Lina}; NOT \p{Block=Linear_A}) (386)
3616 \p{Linear_B} \p{Script_Extensions=Linear_B} (Short:
3617 \p{Linb}) (268)
3618 X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
3619 X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
3620 \p{Lisu} \p{Script_Extensions=Lisu} (48)
3621 \p{Ll} \p{Lowercase_Letter} (=
3622 \p{General_Category=Lowercase_Letter})
3623 (/i= General_Category=Cased_Letter)
3624 (2063)
3625 \p{Lm} \p{Modifier_Letter} (=
3626 \p{General_Category=Modifier_Letter})
3627 (250)
3628 \p{Lo} \p{Other_Letter} (= \p{General_Category=
3629 Other_Letter}) (121_047)
3630 \p{LOE} \p{Logical_Order_Exception} (=
3631 \p{Logical_Order_Exception=Y}) (19)
3632 \p{LOE: *} \p{Logical_Order_Exception: *}
3633 \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
3634 \p{LOE}) (19)
3635 \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
3636 (1_114_093 plus all above-Unicode code
3637 points: U+0000..0E3F, U+0E45..0EBF,
3638 U+0EC5..19B4, U+19B8..19B9,
3639 U+19BB..AAB4, U+AAB7..AAB8 ...)
3640 \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
3641 U+0E40..0E44, U+0EC0..0EC4,
3642 U+19B5..19B7, U+19BA, U+AAB5..AAB6,
3643 U+AAB9 ...)
3644 X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
3645 \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3646 Cased=Yes) (2252)
3647 \p{Lower: *} \p{Lowercase: *}
3648 \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3649 Cased=Yes) (2252)
3650 \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
3651 No) (1_111_860 plus all above-Unicode
3652 code points: [\x00-
3653 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
3654 Z\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
3655 \xb4\xb6-\xb9\xbb-\xde\xf7], U+0100,
3656 U+0102, U+0104, U+0106, U+0108 ...)
3657 \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
3658 Yes) (2252: [a-z\xaa\xb5\xba\xdf-
3659 \xf6\xf8-\xff], U+0101, U+0103, U+0105,
3660 U+0107, U+0109 ...)
3661 \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
3662 (Short: \p{Ll}; /i= General_Category=
3663 Cased_Letter) (2063)
3664 \p{Lt} \p{Titlecase_Letter} (=
3665 \p{General_Category=Titlecase_Letter})
3666 (/i= General_Category=Cased_Letter) (31)
3667 \p{Lu} \p{Uppercase_Letter} (=
3668 \p{General_Category=Uppercase_Letter})
3669 (/i= General_Category=Cased_Letter)
3670 (1702)
3671 \p{Lyci} \p{Lycian} (= \p{Script_Extensions=
3672 Lycian}) (NOT \p{Block=Lycian}) (29)
3673 \p{Lycian} \p{Script_Extensions=Lycian} (Short:
3674 \p{Lyci}; NOT \p{Block=Lycian}) (29)
3675 \p{Lydi} \p{Lydian} (= \p{Script_Extensions=
3676 Lydian}) (NOT \p{Block=Lydian}) (27)
3677 \p{Lydian} \p{Script_Extensions=Lydian} (Short:
3678 \p{Lydi}; NOT \p{Block=Lydian}) (27)
3679 \p{M} \pM \p{Mark} (= \p{General_Category=Mark})
3680 (2177)
3681 \p{Mahajani} \p{Script_Extensions=Mahajani} (Short:
3682 \p{Mahj}; NOT \p{Block=Mahajani}) (61)
3683 \p{Mahj} \p{Mahajani} (= \p{Script_Extensions=
3684 Mahajani}) (NOT \p{Block=Mahajani}) (61)
3685 X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
3686 Mahjong_Tiles}) (48)
3687 X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
3688 \p{InMahjong}) (48)
3689 \p{Malayalam} \p{Script_Extensions=Malayalam} (Short:
3690 \p{Mlym}; NOT \p{Block=Malayalam}) (122)
3691 \p{Mand} \p{Mandaic} (= \p{Script_Extensions=
3692 Mandaic}) (NOT \p{Block=Mandaic}) (30)
3693 \p{Mandaic} \p{Script_Extensions=Mandaic} (Short:
3694 \p{Mand}; NOT \p{Block=Mandaic}) (30)
3695 \p{Mani} \p{Manichaean} (= \p{Script_Extensions=
3696 Manichaean}) (NOT \p{Block=Manichaean})
3697 (52)
3698 \p{Manichaean} \p{Script_Extensions=Manichaean} (Short:
3699 \p{Mani}; NOT \p{Block=Manichaean}) (52)
3700 \p{Marc} \p{Marchen} (= \p{Script_Extensions=
3701 Marchen}) (NOT \p{Block=Marchen}) (68)
3702 \p{Marchen} \p{Script_Extensions=Marchen} (Short:
3703 \p{Marc}; NOT \p{Block=Marchen}) (68)
3704 \p{Mark} \p{General_Category=Mark} (Short: \p{M})
3705 (2177)
3706 \p{Masaram_Gondi} \p{Script_Extensions=Masaram_Gondi}
3707 (Short: \p{Gonm}; NOT \p{Block=
3708 Masaram_Gondi}) (75)
3709 \p{Math} \p{Math=Y} (2310)
3710 \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all
3711 above-Unicode code points: [\x00-
3712 \x20!\"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-
3713 Z\[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
3714 \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
3715 U+03D3..03D4, U+03D6..03EF,
3716 U+03F2..03F3, U+03F7..0605 ...)
3717 \p{Math: Y*} (Single: \p{Math}) (2310: [+<=
3718 >\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
3719 U+03D5, U+03F0..03F1, U+03F4..03F6,
3720 U+0606..0608 ...)
3721 X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
3722 \p{Block=
3723 Mathematical_Alphanumeric_Symbols})
3724 (1024)
3725 X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
3726 Mathematical_Operators}) (256)
3727 \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
3728 \p{Sm}) (948)
3729 X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
3730 Mathematical_Alphanumeric_Symbols}
3731 (Short: \p{InMathAlphanum}) (1024)
3732 X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
3733 (Short: \p{InMathOperators}) (256)
3734 \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
3735 Spacing_Mark}) (401)
3736 \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
3737 Enclosing_Mark}) (13)
3738 \p{Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} (Short:
3739 \p{Mtei}; NOT \p{Block=Meetei_Mayek})
3740 (79)
3741 X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
3742 Meetei_Mayek_Extensions}) (32)
3743 X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
3744 (Short: \p{InMeeteiMayekExt}) (32)
3745 \p{Mend} \p{Mende_Kikakui} (= \p{Script_Extensions=
3746 Mende_Kikakui}) (NOT \p{Block=
3747 Mende_Kikakui}) (213)
3748 \p{Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
3749 (Short: \p{Mend}; NOT \p{Block=
3750 Mende_Kikakui}) (213)
3751 \p{Merc} \p{Meroitic_Cursive} (=
3752 \p{Script_Extensions=Meroitic_Cursive})
3753 (NOT \p{Block=Meroitic_Cursive}) (90)
3754 \p{Mero} \p{Meroitic_Hieroglyphs} (=
3755 \p{Script_Extensions=
3756 Meroitic_Hieroglyphs}) (32)
3757 \p{Meroitic_Cursive} \p{Script_Extensions=Meroitic_Cursive}
3758 (Short: \p{Merc}; NOT \p{Block=
3759 Meroitic_Cursive}) (90)
3760 \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
3761 Meroitic_Hieroglyphs} (Short: \p{Mero})
3762 (32)
3763 \p{Miao} \p{Script_Extensions=Miao} (NOT \p{Block=
3764 Miao}) (133)
3765 X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
3766 \p{Block=
3767 Miscellaneous_Symbols_And_Arrows}) (256)
3768 X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
3769 (= \p{Block=
3770 Miscellaneous_Mathematical_Symbols_A})
3771 (48)
3772 X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
3773 (= \p{Block=
3774 Miscellaneous_Mathematical_Symbols_B})
3775 (128)
3776 X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
3777 (= \p{Block=
3778 Miscellaneous_Symbols_And_Pictographs})
3779 (768)
3780 X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
3781 Miscellaneous_Symbols}) (256)
3782 X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
3783 Miscellaneous_Technical}) (256)
3784 X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
3785 Miscellaneous_Mathematical_Symbols_A}
3786 (Short: \p{InMiscMathSymbolsA}) (48)
3787 X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
3788 Miscellaneous_Mathematical_Symbols_B}
3789 (Short: \p{InMiscMathSymbolsB}) (128)
3790 X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
3791 \p{InMiscSymbols}) (256)
3792 X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
3793 Miscellaneous_Symbols_And_Arrows}
3794 (Short: \p{InMiscArrows}) (256)
3795 X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
3796 Miscellaneous_Symbols_And_Pictographs}
3797 (Short: \p{InMiscPictographs}) (768)
3798 X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
3799 (Short: \p{InMiscTechnical}) (256)
3800 \p{Mlym} \p{Malayalam} (= \p{Script_Extensions=
3801 Malayalam}) (NOT \p{Block=Malayalam})
3802 (122)
3803 \p{Mn} \p{Nonspacing_Mark} (=
3804 \p{General_Category=Nonspacing_Mark})
3805 (1763)
3806 \p{Modi} \p{Script_Extensions=Modi} (NOT \p{Block=
3807 Modi}) (89)
3808 \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
3809 (Short: \p{Lm}) (250)
3810 X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
3811 Spacing_Modifier_Letters}) (80)
3812 \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
3813 (Short: \p{Sk}) (121)
3814 X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
3815 \p{Mong} \p{Mongolian} (= \p{Script_Extensions=
3816 Mongolian}) (NOT \p{Block=Mongolian})
3817 (169)
3818 \p{Mongolian} \p{Script_Extensions=Mongolian} (Short:
3819 \p{Mong}; NOT \p{Block=Mongolian}) (169)
3820 X \p{Mongolian_Sup} \p{Mongolian_Supplement} (= \p{Block=
3821 Mongolian_Supplement}) (32)
3822 X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
3823 \p{InMongolianSup}) (32)
3824 \p{Mro} \p{Script_Extensions=Mro} (NOT \p{Block=
3825 Mro}) (43)
3826 \p{Mroo} \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
3827 \p{Block=Mro}) (43)
3828 \p{Mtei} \p{Meetei_Mayek} (= \p{Script_Extensions=
3829 Meetei_Mayek}) (NOT \p{Block=
3830 Meetei_Mayek}) (79)
3831 \p{Mult} \p{Multani} (= \p{Script_Extensions=
3832 Multani}) (NOT \p{Block=Multani}) (48)
3833 \p{Multani} \p{Script_Extensions=Multani} (Short:
3834 \p{Mult}; NOT \p{Block=Multani}) (48)
3835 X \p{Music} \p{Musical_Symbols} (= \p{Block=
3836 Musical_Symbols}) (256)
3837 X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
3838 \p{InMusic}) (256)
3839 \p{Myanmar} \p{Script_Extensions=Myanmar} (Short:
3840 \p{Mymr}; NOT \p{Block=Myanmar}) (224)
3841 X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
3842 Myanmar_Extended_A}) (32)
3843 X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block=
3844 Myanmar_Extended_B}) (32)
3845 X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
3846 \p{InMyanmarExtA}) (32)
3847 X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short:
3848 \p{InMyanmarExtB}) (32)
3849 \p{Mymr} \p{Myanmar} (= \p{Script_Extensions=
3850 Myanmar}) (NOT \p{Block=Myanmar}) (224)
3851 \p{N} \pN \p{Number} (= \p{General_Category=Number})
3852 (1502)
3853 \p{Nabataean} \p{Script_Extensions=Nabataean} (Short:
3854 \p{Nbat}; NOT \p{Block=Nabataean}) (40)
3855 \p{Narb} \p{Old_North_Arabian} (=
3856 \p{Script_Extensions=Old_North_Arabian})
3857 (32)
3858 X \p{NB} \p{No_Block} (= \p{Block=No_Block})
3859 (834_096 plus all above-Unicode code
3860 points)
3861 \p{Nbat} \p{Nabataean} (= \p{Script_Extensions=
3862 Nabataean}) (NOT \p{Block=Nabataean})
3863 (40)
3864 \p{NChar} \p{Noncharacter_Code_Point} (=
3865 \p{Noncharacter_Code_Point=Y}) (66)
3866 \p{NChar: *} \p{Noncharacter_Code_Point: *}
3867 \p{Nd} \p{XPosixDigit} (= \p{General_Category=
3868 Decimal_Number}) (590)
3869 \p{New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
3870 \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
3871 \p{Newa} \p{Script_Extensions=Newa} (NOT \p{Block=
3872 Newa}) (92)
3873 \p{NFC_QC: *} \p{NFC_Quick_Check: *}
3874 \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (110)
3875 \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110:
3876 U+0300..0304, U+0306..030C, U+030F,
3877 U+0311, U+0313..0314, U+031B ...)
3878 \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
3879 \P{NFC_Quick_Check} NOR \P{NFC_QC})
3880 (1120)
3881 \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
3882 \P{NFC_Quick_Check} NOR \P{NFC_QC})
3883 (1120: U+0340..0341, U+0343..0344,
3884 U+0374, U+037E, U+0387, U+0958..095F ...)
3885 \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
3886 \p{NFC_Quick_Check} NOR \p{NFC_QC})
3887 (1_112_882 plus all above-Unicode code
3888 points)
3889 \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
3890 \p{NFC_Quick_Check} NOR \p{NFC_QC})
3891 (1_112_882 plus all above-Unicode code
3892 points: U+0000..02FF, U+0305,
3893 U+030D..030E, U+0310, U+0312,
3894 U+0315..031A ...)
3895 \p{NFD_QC: *} \p{NFD_Quick_Check: *}
3896 \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
3897 \P{NFD_Quick_Check} NOR \P{NFD_QC})
3898 (13_232)
3899 \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
3900 \P{NFD_Quick_Check} NOR \P{NFD_QC})
3901 (13_232: [\xc0-\xc5\xc7-\xcf\xd1-
3902 \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
3903 \xf6\xf9-\xfd\xff], U+0100..010F,
3904 U+0112..0125, U+0128..0130,
3905 U+0134..0137, U+0139..013E ...)
3906 \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
3907 \p{NFD_Quick_Check} NOR \p{NFD_QC})
3908 (1_100_880 plus all above-Unicode code
3909 points)
3910 \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
3911 \p{NFD_Quick_Check} NOR \p{NFD_QC})
3912 (1_100_880 plus all above-Unicode code
3913 points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
3914 \xdf\xe6\xf0\xf7-\xf8\xfe],
3915 U+0110..0111, U+0126..0127,
3916 U+0131..0133, U+0138, U+013F..0142 ...)
3917 \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
3918 \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
3919 \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110:
3920 U+0300..0304, U+0306..030C, U+030F,
3921 U+0311, U+0313..0314, U+031B ...)
3922 \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
3923 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
3924 (4794)
3925 \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
3926 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
3927 (4794: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
3928 \xba\xbc-\xbe], U+0132..0133,
3929 U+013F..0140, U+0149, U+017F,
3930 U+01C4..01CC ...)
3931 \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
3932 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
3933 (1_109_208 plus all above-Unicode code
3934 points)
3935 \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
3936 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
3937 (1_109_208 plus all above-Unicode code
3938 points: [\x00-\x9f\xa1-\xa7\xa9\xab-
3939 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
3940 U+0100..0131, U+0134..013E,
3941 U+0141..0148, U+014A..017E, U+0180..01C3
3942 ...)
3943 \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
3944 \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
3945 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
3946 (16_894)
3947 \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
3948 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
3949 (16_894: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
3950 \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
3951 \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
3952 \xf6\xf9-\xfd\xff], U+0100..010F,
3953 U+0112..0125, U+0128..0130,
3954 U+0132..0137, U+0139..0140 ...)
3955 \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
3956 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
3957 (1_097_218 plus all above-Unicode code
3958 points)
3959 \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
3960 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
3961 (1_097_218 plus all above-Unicode code
3962 points: [\x00-\x9f\xa1-\xa7\xa9\xab-
3963 \xae\xb0-\xb1\xb6-
3964 \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
3965 \xdf\xe6\xf0\xf7-\xf8\xfe],
3966 U+0110..0111, U+0126..0127, U+0131,
3967 U+0138, U+0141..0142 ...)
3968 \p{Nko} \p{Script_Extensions=Nko} (NOT \p{Block=
3969 NKo}) (59)
3970 \p{Nkoo} \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
3971 \p{Block=NKo}) (59)
3972 \p{Nl} \p{Letter_Number} (= \p{General_Category=
3973 Letter_Number}) (236)
3974 \p{No} \p{Other_Number} (= \p{General_Category=
3975 Other_Number}) (676)
3976 X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
3977 (834_096 plus all above-Unicode code
3978 points)
3979 \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
3980 \p{NChar}) (66)
3981 \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
3982 (1_114_046 plus all above-Unicode code
3983 points: U+0000..FDCF, U+FDF0..FFFD,
3984 U+10000..1FFFD, U+20000..2FFFD,
3985 U+30000..3FFFD, U+40000..4FFFD ...)
3986 \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
3987 (66: U+FDD0..FDEF, U+FFFE..FFFF,
3988 U+1FFFE..1FFFF, U+2FFFE..2FFFF,
3989 U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
3990 \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
3991 (Short: \p{Mn}) (1763)
3992 \p{Nshu} \p{Nushu} (= \p{Script_Extensions=Nushu})
3993 (NOT \p{Block=Nushu}) (397)
3994 \p{Nt: *} \p{Numeric_Type: *}
3995 \p{Number} \p{General_Category=Number} (Short: \p{N})
3996 (1502)
3997 X \p{Number_Forms} \p{Block=Number_Forms} (64)
3998 \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (590)
3999 \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (590: [0-9],
4000 U+0660..0669, U+06F0..06F9,
4001 U+07C0..07C9, U+0966..096F, U+09E6..09EF
4002 ...)
4003 \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
4004 \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
4005 U+1369..1371, U+19DA, U+2070,
4006 U+2074..2079, U+2080..2089 ...)
4007 \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_529 plus all
4008 above-Unicode code points: [\x00-
4009 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
4010 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
4011 \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
4012 U+066A..06EF, U+06FA..07BF,
4013 U+07CA..0965, U+0970..09E5 ...)
4014 \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (865)
4015 \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (865: [\xbc-\xbe],
4016 U+09F4..09F9, U+0B72..0B77,
4017 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
4018 ...)
4019 T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
4020 T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (75: [0], U+0660,
4021 U+06F0, U+07C0, U+0966, U+09E6 ...)
4022 T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (1: U+0D58)
4023 T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (1: U+0D59)
4024 T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (1: U+0D5A)
4025 T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (1: U+0D5B)
4026 T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (4: U+09F4, U+0B75,
4027 U+0D76, U+A833)
4028 T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
4029 T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (2: U+0D5C, U+2152)
4030 T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1: U+2151)
4031 T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (6: U+09F5, U+0B76,
4032 U+0D77, U+215B, U+A834, U+1245F)
4033 T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1: U+2150)
4034 T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (1: U+0D5D)
4035 T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (3: U+2159, U+109F7,
4036 U+12461)
4037 T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (4: U+09F6, U+0B77,
4038 U+0D78, U+A835)
4039 T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (2: U+0D5E, U+2155)
4040 T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (12: [\xbc], U+09F7,
4041 U+0B72, U+0D73, U+A830, U+10140 ...)
4042 T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
4043 U+10E7D, U+1245A, U+1245D, U+12465)
4044 T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1: U+215C)
4045 T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1: U+2156)
4046 T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
4047 T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (13: [\xbd], U+0B73,
4048 U+0D74, U+0F2A, U+2CFD, U+A831 ...)
4049 T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
4050 T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1: U+2157)
4051 T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1: U+215D)
4052 T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
4053 U+109FD, U+10E7E, U+1245B, U+1245E ...)
4054 T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (7: [\xbe], U+09F8,
4055 U+0B74, U+0D75, U+A832, U+10178 ...)
4056 T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1: U+2158)
4057 T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
4058 U+1245C)
4059 T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1: U+215E)
4060 T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
4061 T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (122: [1\xb9], U+0661,
4062 U+06F1, U+07C1, U+0967, U+09E7 ...)
4063 T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1: U+0F2B)
4064 T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (122: [2\xb2], U+0662,
4065 U+06F2, U+07C2, U+0968, U+09E8 ...)
4066 T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1: U+0F2C)
4067 T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (124: [3\xb3], U+0663,
4068 U+06F3, U+07C3, U+0969, U+09E9 ...)
4069 T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1: U+0F2D)
4070 T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (116: [4], U+0664,
4071 U+06F4, U+07C4, U+096A, U+09EA ...)
4072 T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1: U+0F2E)
4073 T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (114: [5], U+0665,
4074 U+06F5, U+07C5, U+096B, U+09EB ...)
4075 T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
4076 T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (101: [6], U+0666,
4077 U+06F6, U+07C6, U+096C, U+09EC ...)
4078 T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
4079 T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (100: [7], U+0667,
4080 U+06F7, U+07C7, U+096D, U+09ED ...)
4081 T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
4082 T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (96: [8], U+0668,
4083 U+06F8, U+07C8, U+096E, U+09EE ...)
4084 T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
4085 T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (100: [9], U+0669,
4086 U+06F9, U+07C9, U+096F, U+09EF ...)
4087 T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (54: U+0BF0, U+0D70,
4088 U+1372, U+2169, U+2179, U+2469 ...)
4089 T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (6: U+216A, U+217A,
4090 U+246A, U+247E, U+2492, U+24EB)
4091 T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (6: U+216B, U+217B,
4092 U+246B, U+247F, U+2493, U+24EC)
4093 T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (4: U+246C, U+2480,
4094 U+2494, U+24ED)
4095 T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (4: U+246D, U+2481,
4096 U+2495, U+24EE)
4097 T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (4: U+246E, U+2482,
4098 U+2496, U+24EF)
4099 T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (5: U+09F9, U+246F,
4100 U+2483, U+2497, U+24F0)
4101 T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (5: U+16EE, U+2470,
4102 U+2484, U+2498, U+24F1)
4103 T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (5: U+16EF, U+2471,
4104 U+2485, U+2499, U+24F2)
4105 T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (5: U+16F0, U+2472,
4106 U+2486, U+249A, U+24F3)
4107 T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (31: U+1373, U+2473,
4108 U+2487, U+249B, U+24F4, U+3039 ...)
4109 T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1: U+3251)
4110 T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1: U+3252)
4111 T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1: U+3253)
4112 T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1: U+3254)
4113 T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1: U+3255)
4114 T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1: U+3256)
4115 T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1: U+3257)
4116 T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1: U+3258)
4117 T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1: U+3259)
4118 T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (16: U+1374, U+303A,
4119 U+324A, U+325A, U+5345, U+10112 ...)
4120 T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1: U+325B)
4121 T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1: U+325C)
4122 T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1: U+325D)
4123 T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1: U+325E)
4124 T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1: U+325F)
4125 T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1: U+32B1)
4126 T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1: U+32B2)
4127 T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1: U+32B3)
4128 T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1: U+32B4)
4129 T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (16: U+1375, U+324B,
4130 U+32B5, U+534C, U+10113, U+102ED ...)
4131 T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1: U+32B6)
4132 T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1: U+32B7)
4133 T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1: U+32B8)
4134 T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1: U+32B9)
4135 T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1: U+32BA)
4136 T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1: U+32BB)
4137 T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1: U+32BC)
4138 T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1: U+32BD)
4139 T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1: U+32BE)
4140 T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (27: U+1376, U+216C,
4141 U+217C, U+2186, U+324C, U+32BF ...)
4142 T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (11: U+1377, U+324D,
4143 U+10115, U+102EF, U+109CE, U+10E6E ...)
4144 T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (11: U+1378, U+324E,
4145 U+10116, U+102F0, U+109CF, U+10E6F ...)
4146 T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (10: U+1379, U+324F,
4147 U+10117, U+102F1, U+10E70, U+11062 ...)
4148 T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (10: U+137A, U+10118,
4149 U+102F2, U+10341, U+10E71, U+11063 ...)
4150 T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (30: U+0BF1, U+0D71,
4151 U+137B, U+216D, U+217D, U+4F70 ...)
4152 T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (4: U+1011A, U+102F4,
4153 U+109D3, U+10E73)
4154 T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (5: U+1011B, U+1016B,
4155 U+102F5, U+109D4, U+10E74)
4156 T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (4: U+1011C, U+102F6,
4157 U+109D5, U+10E75)
4158 T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (14: U+216E, U+217E,
4159 U+1011D, U+10145, U+1014C, U+10153 ...)
4160 T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (4: U+1011E, U+102F8,
4161 U+109D7, U+10E77)
4162 T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (4: U+1011F, U+102F9,
4163 U+109D8, U+10E78)
4164 T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (4: U+10120, U+102FA,
4165 U+109D9, U+10E79)
4166 T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (5: U+10121, U+102FB,
4167 U+1034A, U+109DA, U+10E7A)
4168 T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (20: U+0BF2, U+0D72,
4169 U+216F, U+217F..2180, U+4EDF, U+5343 ...)
4170 T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (2: U+10123, U+109DC)
4171 T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (2: U+10124, U+109DD)
4172 T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (2: U+10125, U+109DE)
4173 T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (6: U+2181, U+10126,
4174 U+10146, U+1014E, U+10172, U+109DF)
4175 T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (2: U+10127, U+109E0)
4176 T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (2: U+10128, U+109E1)
4177 T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (2: U+10129, U+109E2)
4178 T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (2: U+1012A, U+109E3)
4179 T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (9:
4180 U+137C, U+2182, U+4E07, U+842C, U+1012B,
4181 U+10155 ...)
4182 T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (2:
4183 U+1012C, U+109E5)
4184 T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (2:
4185 U+1012D, U+109E6)
4186 T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (2:
4187 U+1012E, U+109E7)
4188 T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (5:
4189 U+2187, U+1012F, U+10147, U+10156,
4190 U+109E8)
4191 T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (2:
4192 U+10130, U+109E9)
4193 T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (2:
4194 U+10131, U+109EA)
4195 T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (2:
4196 U+10132, U+109EB)
4197 T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (2:
4198 U+10133, U+109EC)
4199 T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (2:
4200 U+2188, U+109ED)
4201 T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (1:
4202 U+109EE)
4203 T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
4204 U+12432)
4205 T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
4206 U+109EF)
4207 T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
4208 U+109F0)
4209 T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
4210 U+12433)
4211 T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
4212 U+109F1)
4213 T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
4214 U+109F2)
4215 T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
4216 U+109F3)
4217 T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
4218 U+109F4)
4219 T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
4220 U+109F5)
4221 T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
4222 U+16B5E)
4223 T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
4224 (3: U+4EBF, U+5104, U+16B5F)
4225 T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
4226 10000000000}) (1: U+16B60)
4227 T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
4228 1000000000000}) (2: U+5146, U+16B61)
4229 \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_529 plus all
4230 above-Unicode code points: [\x00-
4231 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
4232 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
4233 \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
4234 U+066A..06EF, U+06FA..07BF,
4235 U+07CA..0965, U+0970..09E5 ...)
4236 \p{Nushu} \p{Script_Extensions=Nushu} (Short:
4237 \p{Nshu}; NOT \p{Block=Nushu}) (397)
4238 \p{Nv: *} \p{Numeric_Value: *}
4239 X \p{OCR} \p{Optical_Character_Recognition} (=
4240 \p{Block=Optical_Character_Recognition})
4241 (32)
4242 \p{Ogam} \p{Ogham} (= \p{Script_Extensions=Ogham})
4243 (NOT \p{Block=Ogham}) (29)
4244 \p{Ogham} \p{Script_Extensions=Ogham} (Short:
4245 \p{Ogam}; NOT \p{Block=Ogham}) (29)
4246 \p{Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
4247 \p{Olck}) (48)
4248 \p{Olck} \p{Ol_Chiki} (= \p{Script_Extensions=
4249 Ol_Chiki}) (48)
4250 \p{Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
4251 (Short: \p{Hung}; NOT \p{Block=
4252 Old_Hungarian}) (108)
4253 \p{Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
4254 \p{Ital}; NOT \p{Block=Old_Italic}) (39)
4255 \p{Old_North_Arabian} \p{Script_Extensions=Old_North_Arabian}
4256 (Short: \p{Narb}) (32)
4257 \p{Old_Permic} \p{Script_Extensions=Old_Permic} (Short:
4258 \p{Perm}; NOT \p{Block=Old_Permic}) (44)
4259 \p{Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
4260 \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
4261 \p{Old_South_Arabian} \p{Script_Extensions=Old_South_Arabian}
4262 (Short: \p{Sarb}) (32)
4263 \p{Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
4264 \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
4265 \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
4266 (Short: \p{Ps}) (75)
4267 X \p{Optical_Character_Recognition} \p{Block=
4268 Optical_Character_Recognition} (Short:
4269 \p{InOCR}) (32)
4270 \p{Oriya} \p{Script_Extensions=Oriya} (Short:
4271 \p{Orya}; NOT \p{Block=Oriya}) (94)
4272 \p{Orkh} \p{Old_Turkic} (= \p{Script_Extensions=
4273 Old_Turkic}) (NOT \p{Block=Old_Turkic})
4274 (73)
4275 X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
4276 \p{Orya} \p{Oriya} (= \p{Script_Extensions=Oriya})
4277 (NOT \p{Block=Oriya}) (94)
4278 \p{Osage} \p{Script_Extensions=Osage} (Short:
4279 \p{Osge}; NOT \p{Block=Osage}) (72)
4280 \p{Osge} \p{Osage} (= \p{Script_Extensions=Osage})
4281 (NOT \p{Block=Osage}) (72)
4282 \p{Osma} \p{Osmanya} (= \p{Script_Extensions=
4283 Osmanya}) (NOT \p{Block=Osmanya}) (40)
4284 \p{Osmanya} \p{Script_Extensions=Osmanya} (Short:
4285 \p{Osma}; NOT \p{Block=Osmanya}) (40)
4286 \p{Other} \p{General_Category=Other} (Short: \p{C})
4287 (977_573 plus all above-Unicode code
4288 points)
4289 \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
4290 \p{Lo}) (121_047)
4291 \p{Other_Number} \p{General_Category=Other_Number} (Short:
4292 \p{No}) (676)
4293 \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
4294 (Short: \p{Po}) (566)
4295 \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
4296 \p{So}) (5855)
4297 \p{P} \pP \p{Punct} (= \p{General_Category=
4298 Punctuation}) (NOT
4299 \p{General_Punctuation}) (770)
4300 \p{Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} (Short:
4301 \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
4302 (127)
4303 \p{Palm} \p{Palmyrene} (= \p{Script_Extensions=
4304 Palmyrene}) (32)
4305 \p{Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
4306 \p{Palm}) (32)
4307 \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
4308 (Short: \p{Zp}) (1)
4309 \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
4310 Y}) (2760)
4311 \p{Pat_Syn: *} \p{Pattern_Syntax: *}
4312 \p{Pat_WS} \p{Pattern_White_Space} (=
4313 \p{Pattern_White_Space=Y}) (11)
4314 \p{Pat_WS: *} \p{Pattern_White_Space: *}
4315 \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
4316 (2760)
4317 \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
4318 (1_111_352 plus all above-Unicode code
4319 points: [\x00-\x200-9A-Z_a-z\x7f-
4320 \xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb7-
4321 \xba\xbc-\xbe\xc0-\xd6\xd8-\xf6\xf8-
4322 \xff], U+0100..200F, U+2028..202F,
4323 U+203F..2040, U+2054, U+205F..218F ...)
4324 \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
4325 [!\"#\$\%&\'\(\)*+,\-.\/:;<=
4326 >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
4327 \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
4328 U+2010..2027, U+2030..203E,
4329 U+2041..2053, U+2055..205E, U+2190..245F
4330 ...)
4331 \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
4332 \p{PatWS}) (11)
4333 \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
4334 (1_114_101 plus all above-Unicode code
4335 points: [^\t\n\cK\f\r\x20\x85],
4336 U+0100..200D, U+2010..2027,
4337 U+202A..infinity)
4338 \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
4339 [\t\n\cK\f\r\x20\x85], U+200E..200F,
4340 U+2028..2029)
4341 \p{Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
4342 \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
4343 \p{Pauc} \p{Pau_Cin_Hau} (= \p{Script_Extensions=
4344 Pau_Cin_Hau}) (NOT \p{Block=
4345 Pau_Cin_Hau}) (57)
4346 \p{Pc} \p{Connector_Punctuation} (=
4347 \p{General_Category=
4348 Connector_Punctuation}) (10)
4349 \p{PCM} \p{Prepended_Concatenation_Mark} (=
4350 \p{Prepended_Concatenation_Mark=Y}) (10)
4351 \p{PCM: *} \p{Prepended_Concatenation_Mark: *}
4352 \p{Pd} \p{Dash_Punctuation} (=
4353 \p{General_Category=Dash_Punctuation})
4354 (24)
4355 \p{Pe} \p{Close_Punctuation} (=
4356 \p{General_Category=Close_Punctuation})
4357 (73)
4358 \p{PerlSpace} \p{PosixSpace} (6)
4359 \p{PerlWord} \p{PosixWord} (63)
4360 \p{Perm} \p{Old_Permic} (= \p{Script_Extensions=
4361 Old_Permic}) (NOT \p{Block=Old_Permic})
4362 (44)
4363 \p{Pf} \p{Final_Punctuation} (=
4364 \p{General_Category=Final_Punctuation})
4365 (10)
4366 \p{Phag} \p{Phags_Pa} (= \p{Script_Extensions=
4367 Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
4368 \p{Phags_Pa} \p{Script_Extensions=Phags_Pa} (Short:
4369 \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
4370 X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
4371 Phaistos_Disc}) (48)
4372 X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
4373 \p{InPhaistos}) (48)
4374 \p{Phli} \p{Inscriptional_Pahlavi} (=
4375 \p{Script_Extensions=
4376 Inscriptional_Pahlavi}) (NOT \p{Block=
4377 Inscriptional_Pahlavi}) (27)
4378 \p{Phlp} \p{Psalter_Pahlavi} (=
4379 \p{Script_Extensions=Psalter_Pahlavi})
4380 (NOT \p{Block=Psalter_Pahlavi}) (30)
4381 \p{Phnx} \p{Phoenician} (= \p{Script_Extensions=
4382 Phoenician}) (NOT \p{Block=Phoenician})
4383 (29)
4384 \p{Phoenician} \p{Script_Extensions=Phoenician} (Short:
4385 \p{Phnx}; NOT \p{Block=Phoenician}) (29)
4386 X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
4387 Phonetic_Extensions}) (128)
4388 X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
4389 \p{Block=
4390 Phonetic_Extensions_Supplement}) (64)
4391 X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
4392 \p{InPhoneticExt}) (128)
4393 X \p{Phonetic_Extensions_Supplement} \p{Block=
4394 Phonetic_Extensions_Supplement} (Short:
4395 \p{InPhoneticExtSup}) (64)
4396 \p{Pi} \p{Initial_Punctuation} (=
4397 \p{General_Category=
4398 Initial_Punctuation}) (12)
4399 X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
4400 \p{Plrd} \p{Miao} (= \p{Script_Extensions=Miao})
4401 (NOT \p{Block=Miao}) (133)
4402 \p{Po} \p{Other_Punctuation} (=
4403 \p{General_Category=Other_Punctuation})
4404 (566)
4405 \p{PosixAlnum} (62: [0-9A-Za-z])
4406 \p{PosixAlpha} (52: [A-Za-z])
4407 \p{PosixBlank} (2: [\t\x20])
4408 \p{PosixCntrl} ASCII control characters (33: ACK, BEL,
4409 BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
4410 DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
4411 FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
4412 SOH, STX, SUB, SYN, US, VT)
4413 \p{PosixDigit} (10: [0-9])
4414 \p{PosixGraph} (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
4415 Z\[\\\]\^_`a-z\{\|\}~])
4416 \p{PosixLower} (/i= PosixAlpha) (26: [a-z])
4417 \p{PosixPrint} (95: [\x20-\x7e])
4418 \p{PosixPunct} (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=
4419 >?\@\[\\\]\^_`\{\|\}~])
4420 \p{PosixSpace} (Short: \p{PerlSpace}) (6:
4421 [\t\n\cK\f\r\x20])
4422 \p{PosixUpper} (/i= PosixAlpha) (26: [A-Z])
4423 \p{PosixWord} \w, restricted to ASCII (Short:
4424 \p{PerlWord}) (63: [0-9A-Z_a-z])
4425 \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
4426 (22)
4427 \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
4428 Y} (Short: \p{PCM}) (10)
4429 \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
4430 (1_114_102 plus all above-Unicode code
4431 points: U+0000..05FF, U+0606..06DC,
4432 U+06DE..070E, U+0710..08E1,
4433 U+08E3..110BC, U+110BE..infinity)
4434 \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
4435 (10: U+0600..0605, U+06DD, U+070F,
4436 U+08E2, U+110BD)
4437 T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
4438 extension) (33_979)
4439 T \p{Present_In: 2.0} Code point's usage introduced in version
4440 2.0 or earlier (Short: \p{In=2.0}) (Perl
4441 extension) (178_500: U+0000..01F5,
4442 U+01FA..0217, U+0250..02A8,
4443 U+02B0..02DE, U+02E0..02E9, U+0300..0345
4444 ...)
4445 \p{Present_In: V2_0} \p{Present_In=2.0} (Perl extension)
4446 (178_500)
4447 T \p{Present_In: 2.1} Code point's usage introduced in version
4448 2.1 or earlier (Short: \p{In=2.1}) (Perl
4449 extension) (178_502: U+0000..01F5,
4450 U+01FA..0217, U+0250..02A8,
4451 U+02B0..02DE, U+02E0..02E9, U+0300..0345
4452 ...)
4453 \p{Present_In: V2_1} \p{Present_In=2.1} (Perl extension)
4454 (178_502)
4455 T \p{Present_In: 3.0} Code point's usage introduced in version
4456 3.0 or earlier (Short: \p{In=3.0}) (Perl
4457 extension) (188_809: U+0000..021F,
4458 U+0222..0233, U+0250..02AD,
4459 U+02B0..02EE, U+0300..034E, U+0360..0362
4460 ...)
4461 \p{Present_In: V3_0} \p{Present_In=3.0} (Perl extension)
4462 (188_809)
4463 T \p{Present_In: 3.1} Code point's usage introduced in version
4464 3.1 or earlier (Short: \p{In=3.1}) (Perl
4465 extension) (233_787: U+0000..021F,
4466 U+0222..0233, U+0250..02AD,
4467 U+02B0..02EE, U+0300..034E, U+0360..0362
4468 ...)
4469 \p{Present_In: V3_1} \p{Present_In=3.1} (Perl extension)
4470 (233_787)
4471 T \p{Present_In: 3.2} Code point's usage introduced in version
4472 3.2 or earlier (Short: \p{In=3.2}) (Perl
4473 extension) (234_803: U+0000..0220,
4474 U+0222..0233, U+0250..02AD,
4475 U+02B0..02EE, U+0300..034F, U+0360..036F
4476 ...)
4477 \p{Present_In: V3_2} \p{Present_In=3.2} (Perl extension)
4478 (234_803)
4479 T \p{Present_In: 4.0} Code point's usage introduced in version
4480 4.0 or earlier (Short: \p{In=4.0}) (Perl
4481 extension) (236_029: U+0000..0236,
4482 U+0250..0357, U+035D..036F,
4483 U+0374..0375, U+037A, U+037E ...)
4484 \p{Present_In: V4_0} \p{Present_In=4.0} (Perl extension)
4485 (236_029)
4486 T \p{Present_In: 4.1} Code point's usage introduced in version
4487 4.1 or earlier (Short: \p{In=4.1}) (Perl
4488 extension) (237_302: U+0000..0241,
4489 U+0250..036F, U+0374..0375, U+037A,
4490 U+037E, U+0384..038A ...)
4491 \p{Present_In: V4_1} \p{Present_In=4.1} (Perl extension)
4492 (237_302)
4493 T \p{Present_In: 5.0} Code point's usage introduced in version
4494 5.0 or earlier (Short: \p{In=5.0}) (Perl
4495 extension) (238_671: U+0000..036F,
4496 U+0374..0375, U+037A..037E,
4497 U+0384..038A, U+038C, U+038E..03A1 ...)
4498 \p{Present_In: V5_0} \p{Present_In=5.0} (Perl extension)
4499 (238_671)
4500 T \p{Present_In: 5.1} Code point's usage introduced in version
4501 5.1 or earlier (Short: \p{In=5.1}) (Perl
4502 extension) (240_295: U+0000..0377,
4503 U+037A..037E, U+0384..038A, U+038C,
4504 U+038E..03A1, U+03A3..0523 ...)
4505 \p{Present_In: V5_1} \p{Present_In=5.1} (Perl extension)
4506 (240_295)
4507 T \p{Present_In: 5.2} Code point's usage introduced in version
4508 5.2 or earlier (Short: \p{In=5.2}) (Perl
4509 extension) (246_943: U+0000..0377,
4510 U+037A..037E, U+0384..038A, U+038C,
4511 U+038E..03A1, U+03A3..0525 ...)
4512 \p{Present_In: V5_2} \p{Present_In=5.2} (Perl extension)
4513 (246_943)
4514 T \p{Present_In: 6.0} Code point's usage introduced in version
4515 6.0 or earlier (Short: \p{In=6.0}) (Perl
4516 extension) (249_031: U+0000..0377,
4517 U+037A..037E, U+0384..038A, U+038C,
4518 U+038E..03A1, U+03A3..0527 ...)
4519 \p{Present_In: V6_0} \p{Present_In=6.0} (Perl extension)
4520 (249_031)
4521 T \p{Present_In: 6.1} Code point's usage introduced in version
4522 6.1 or earlier (Short: \p{In=6.1}) (Perl
4523 extension) (249_763: U+0000..0377,
4524 U+037A..037E, U+0384..038A, U+038C,
4525 U+038E..03A1, U+03A3..0527 ...)
4526 \p{Present_In: V6_1} \p{Present_In=6.1} (Perl extension)
4527 (249_763)
4528 T \p{Present_In: 6.2} Code point's usage introduced in version
4529 6.2 or earlier (Short: \p{In=6.2}) (Perl
4530 extension) (249_764: U+0000..0377,
4531 U+037A..037E, U+0384..038A, U+038C,
4532 U+038E..03A1, U+03A3..0527 ...)
4533 \p{Present_In: V6_2} \p{Present_In=6.2} (Perl extension)
4534 (249_764)
4535 T \p{Present_In: 6.3} Code point's usage introduced in version
4536 6.3 or earlier (Short: \p{In=6.3}) (Perl
4537 extension) (249_769: U+0000..0377,
4538 U+037A..037E, U+0384..038A, U+038C,
4539 U+038E..03A1, U+03A3..0527 ...)
4540 \p{Present_In: V6_3} \p{Present_In=6.3} (Perl extension)
4541 (249_769)
4542 T \p{Present_In: 7.0} Code point's usage introduced in version
4543 7.0 or earlier (Short: \p{In=7.0}) (Perl
4544 extension) (252_603: U+0000..0377,
4545 U+037A..037F, U+0384..038A, U+038C,
4546 U+038E..03A1, U+03A3..052F ...)
4547 \p{Present_In: V7_0} \p{Present_In=7.0} (Perl extension)
4548 (252_603)
4549 T \p{Present_In: 8.0} Code point's usage introduced in version
4550 8.0 or earlier (Short: \p{In=8.0}) (Perl
4551 extension) (260_319: U+0000..0377,
4552 U+037A..037F, U+0384..038A, U+038C,
4553 U+038E..03A1, U+03A3..052F ...)
4554 \p{Present_In: V8_0} \p{Present_In=8.0} (Perl extension)
4555 (260_319)
4556 T \p{Present_In: 9.0} Code point's usage introduced in version
4557 9.0 or earlier (Short: \p{In=9.0}) (Perl
4558 extension) (267_819: U+0000..0377,
4559 U+037A..037F, U+0384..038A, U+038C,
4560 U+038E..03A1, U+03A3..052F ...)
4561 \p{Present_In: V9_0} \p{Present_In=9.0} (Perl extension)
4562 (267_819)
4563 T \p{Present_In: 10.0} Code point's usage introduced in version
4564 10.0 or earlier (Short: \p{In=10.0})
4565 (Perl extension) (276_337: U+0000..0377,
4566 U+037A..037F, U+0384..038A, U+038C,
4567 U+038E..03A1, U+03A3..052F ...)
4568 \p{Present_In: V10_0} \p{Present_In=10.0} (Perl extension)
4569 (276_337)
4570 \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
4571 Unassigned}) (Perl extension) (837_775
4572 plus all above-Unicode code points)
4573 \p{Print} \p{XPosixPrint} (274_156)
4574 \p{Private_Use} \p{General_Category=Private_Use} (Short:
4575 \p{Co}; NOT \p{Private_Use_Area})
4576 (137_468)
4577 X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
4578 \p{InPUA}) (6400)
4579 \p{Prti} \p{Inscriptional_Parthian} (=
4580 \p{Script_Extensions=
4581 Inscriptional_Parthian}) (NOT \p{Block=
4582 Inscriptional_Parthian}) (30)
4583 \p{Ps} \p{Open_Punctuation} (=
4584 \p{General_Category=Open_Punctuation})
4585 (75)
4586 \p{Psalter_Pahlavi} \p{Script_Extensions=Psalter_Pahlavi}
4587 (Short: \p{Phlp}; NOT \p{Block=
4588 Psalter_Pahlavi}) (30)
4589 X \p{PUA} \p{Private_Use_Area} (= \p{Block=
4590 Private_Use_Area}) (6400)
4591 \p{Punct} \p{General_Category=Punctuation} (Short:
4592 \p{P}; NOT \p{General_Punctuation}) (770)
4593 \p{Punctuation} \p{Punct} (= \p{General_Category=
4594 Punctuation}) (NOT
4595 \p{General_Punctuation}) (770)
4596 \p{Qaac} \p{Coptic} (= \p{Script_Extensions=
4597 Coptic}) (NOT \p{Block=Coptic}) (165)
4598 \p{Qaai} \p{Inherited} (= \p{Script_Extensions=
4599 Inherited}) (500)
4600 \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
4601 Y}) (30)
4602 \p{QMark: *} \p{Quotation_Mark: *}
4603 \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
4604 (30)
4605 \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_082
4606 plus all above-Unicode code points:
4607 [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=
4608 >?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
4609 \xaa\xac-\xba\xbc-\xff], U+0100..2017,
4610 U+2020..2038, U+203B..2E41,
4611 U+2E43..300B, U+3010..301C ...)
4612 \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30:
4613 [\"\'\xab\xbb], U+2018..201F,
4614 U+2039..203A, U+2E42, U+300C..300F,
4615 U+301D..301F ...)
4616 \p{Radical} \p{Radical=Y} (329)
4617 \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all
4618 above-Unicode code points: U+0000..2E7F,
4619 U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
4620 \p{Radical: Y*} (Single: \p{Radical}) (329: U+2E80..2E99,
4621 U+2E9B..2EF3, U+2F00..2FD5)
4622 \p{Regional_Indicator} \p{Regional_Indicator=Y} (Short: \p{RI})
4623 (26)
4624 \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
4625 plus all above-Unicode code points:
4626 U+0000..1F1E5, U+1F200..infinity)
4627 \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
4628 U+1F1E6..1F1FF)
4629 \p{Rejang} \p{Script_Extensions=Rejang} (Short:
4630 \p{Rjng}; NOT \p{Block=Rejang}) (37)
4631 \p{RI} \p{Regional_Indicator} (=
4632 \p{Regional_Indicator=Y}) (26)
4633 \p{RI: *} \p{Regional_Indicator: *}
4634 \p{Rjng} \p{Rejang} (= \p{Script_Extensions=
4635 Rejang}) (NOT \p{Block=Rejang}) (37)
4636 X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
4637 Rumi_Numeral_Symbols}) (32)
4638 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
4639 \p{InRumi}) (32)
4640 \p{Runic} \p{Script_Extensions=Runic} (Short:
4641 \p{Runr}; NOT \p{Block=Runic}) (86)
4642 \p{Runr} \p{Runic} (= \p{Script_Extensions=Runic})
4643 (NOT \p{Block=Runic}) (86)
4644 \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol})
4645 (6978)
4646 \p{Samaritan} \p{Script_Extensions=Samaritan} (Short:
4647 \p{Samr}; NOT \p{Block=Samaritan}) (61)
4648 \p{Samr} \p{Samaritan} (= \p{Script_Extensions=
4649 Samaritan}) (NOT \p{Block=Samaritan})
4650 (61)
4651 \p{Sarb} \p{Old_South_Arabian} (=
4652 \p{Script_Extensions=Old_South_Arabian})
4653 (32)
4654 \p{Saur} \p{Saurashtra} (= \p{Script_Extensions=
4655 Saurashtra}) (NOT \p{Block=Saurashtra})
4656 (82)
4657 \p{Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
4658 \p{Saur}; NOT \p{Block=Saurashtra}) (82)
4659 \p{SB: *} \p{Sentence_Break: *}
4660 \p{Sc} \p{Currency_Symbol} (=
4661 \p{General_Category=Currency_Symbol})
4662 (54)
4663 \p{Sc: *} \p{Script: *}
4664 \p{Script: Adlam} (Short: \p{Sc=Adlm}) (87: U+1E900..1E94A,
4665 U+1E950..1E959, U+1E95E..1E95F)
4666 \p{Script: Adlm} \p{Script=Adlam} (87)
4667 \p{Script: Aghb} \p{Script=Caucasian_Albanian} (=
4668 \p{Script_Extensions=
4669 Caucasian_Albanian}) (53)
4670 \p{Script: Ahom} \p{Script_Extensions=Ahom} (Short: \p{Sc=
4671 Ahom}, \p{Ahom}) (57)
4672 \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
4673 Anatolian_Hieroglyphs} (Short: \p{Sc=
4674 Hluw}, \p{Hluw}) (583)
4675 \p{Script: Arab} \p{Script=Arabic} (1280)
4676 \p{Script: Arabic} (Short: \p{Sc=Arab}) (1280: U+0600..0604,
4677 U+0606..060B, U+060D..061A, U+061C,
4678 U+061E, U+0620..063F ...)
4679 \p{Script: Armenian} (Short: \p{Sc=Armn}) (93: U+0531..0556,
4680 U+0559..055F, U+0561..0587, U+058A,
4681 U+058D..058F, U+FB13..FB17)
4682 \p{Script: Armi} \p{Script=Imperial_Aramaic} (=
4683 \p{Script_Extensions=Imperial_Aramaic})
4684 (31)
4685 \p{Script: Armn} \p{Script=Armenian} (93)
4686 \p{Script: Avestan} \p{Script_Extensions=Avestan} (Short:
4687 \p{Sc=Avst}, \p{Avst}) (61)
4688 \p{Script: Avst} \p{Script=Avestan} (=
4689 \p{Script_Extensions=Avestan}) (61)
4690 \p{Script: Bali} \p{Script=Balinese} (=
4691 \p{Script_Extensions=Balinese}) (121)
4692 \p{Script: Balinese} \p{Script_Extensions=Balinese} (Short:
4693 \p{Sc=Bali}, \p{Bali}) (121)
4694 \p{Script: Bamu} \p{Script=Bamum} (= \p{Script_Extensions=
4695 Bamum}) (657)
4696 \p{Script: Bamum} \p{Script_Extensions=Bamum} (Short: \p{Sc=
4697 Bamu}, \p{Bamu}) (657)
4698 \p{Script: Bass} \p{Script=Bassa_Vah} (=
4699 \p{Script_Extensions=Bassa_Vah}) (36)
4700 \p{Script: Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
4701 \p{Sc=Bass}, \p{Bass}) (36)
4702 \p{Script: Batak} \p{Script_Extensions=Batak} (Short: \p{Sc=
4703 Batk}, \p{Batk}) (56)
4704 \p{Script: Batk} \p{Script=Batak} (= \p{Script_Extensions=
4705 Batak}) (56)
4706 \p{Script: Beng} \p{Script=Bengali} (95)
4707 \p{Script: Bengali} (Short: \p{Sc=Beng}) (95: U+0980..0983,
4708 U+0985..098C, U+098F..0990,
4709 U+0993..09A8, U+09AA..09B0, U+09B2 ...)
4710 \p{Script: Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
4711 \p{Sc=Bhks}, \p{Bhks}) (97)
4712 \p{Script: Bhks} \p{Script=Bhaiksuki} (=
4713 \p{Script_Extensions=Bhaiksuki}) (97)
4714 \p{Script: Bopo} \p{Script=Bopomofo} (71)
4715 \p{Script: Bopomofo} (Short: \p{Sc=Bopo}) (71: U+02EA..02EB,
4716 U+3105..312E, U+31A0..31BA)
4717 \p{Script: Brah} \p{Script=Brahmi} (= \p{Script_Extensions=
4718 Brahmi}) (109)
4719 \p{Script: Brahmi} \p{Script_Extensions=Brahmi} (Short:
4720 \p{Sc=Brah}, \p{Brah}) (109)
4721 \p{Script: Brai} \p{Script=Braille} (=
4722 \p{Script_Extensions=Braille}) (256)
4723 \p{Script: Braille} \p{Script_Extensions=Braille} (Short:
4724 \p{Sc=Brai}, \p{Brai}) (256)
4725 \p{Script: Bugi} \p{Script=Buginese} (30)
4726 \p{Script: Buginese} (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
4727 U+1A1E..1A1F)
4728 \p{Script: Buhd} \p{Script=Buhid} (20)
4729 \p{Script: Buhid} (Short: \p{Sc=Buhd}) (20: U+1740..1753)
4730 \p{Script: Cakm} \p{Script=Chakma} (67)
4731 \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
4732 Canadian_Aboriginal} (Short: \p{Sc=
4733 Cans}, \p{Cans}) (710)
4734 \p{Script: Cans} \p{Script=Canadian_Aboriginal} (=
4735 \p{Script_Extensions=
4736 Canadian_Aboriginal}) (710)
4737 \p{Script: Cari} \p{Script=Carian} (= \p{Script_Extensions=
4738 Carian}) (49)
4739 \p{Script: Carian} \p{Script_Extensions=Carian} (Short:
4740 \p{Sc=Cari}, \p{Cari}) (49)
4741 \p{Script: Caucasian_Albanian} \p{Script_Extensions=
4742 Caucasian_Albanian} (Short: \p{Sc=Aghb},
4743 \p{Aghb}) (53)
4744 \p{Script: Chakma} (Short: \p{Sc=Cakm}) (67: U+11100..11134,
4745 U+11136..11143)
4746 \p{Script: Cham} \p{Script_Extensions=Cham} (Short: \p{Sc=
4747 Cham}, \p{Cham}) (83)
4748 \p{Script: Cher} \p{Script=Cherokee} (=
4749 \p{Script_Extensions=Cherokee}) (172)
4750 \p{Script: Cherokee} \p{Script_Extensions=Cherokee} (Short:
4751 \p{Sc=Cher}, \p{Cher}) (172)
4752 \p{Script: Common} (Short: \p{Sc=Zyyy}) (7363: [\x00-
4753 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
4754 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
4755 \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
4756 U+02E5..02E9, U+02EC..02FF, U+0374,
4757 U+037E ...)
4758 \p{Script: Copt} \p{Script=Coptic} (137)
4759 \p{Script: Coptic} (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
4760 U+2C80..2CF3, U+2CF9..2CFF)
4761 \p{Script: Cprt} \p{Script=Cypriot} (55)
4762 \p{Script: Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
4763 \p{Sc=Xsux}, \p{Xsux}) (1234)
4764 \p{Script: Cypriot} (Short: \p{Sc=Cprt}) (55: U+10800..10805,
4765 U+10808, U+1080A..10835, U+10837..10838,
4766 U+1083C, U+1083F)
4767 \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}) (443: U+0400..0484,
4768 U+0487..052F, U+1C80..1C88, U+1D2B,
4769 U+1D78, U+2DE0..2DFF ...)
4770 \p{Script: Cyrl} \p{Script=Cyrillic} (443)
4771 \p{Script: Deseret} \p{Script_Extensions=Deseret} (Short:
4772 \p{Sc=Dsrt}, \p{Dsrt}) (80)
4773 \p{Script: Deva} \p{Script=Devanagari} (154)
4774 \p{Script: Devanagari} (Short: \p{Sc=Deva}) (154: U+0900..0950,
4775 U+0953..0963, U+0966..097F, U+A8E0..A8FD)
4776 \p{Script: Dsrt} \p{Script=Deseret} (=
4777 \p{Script_Extensions=Deseret}) (80)
4778 \p{Script: Dupl} \p{Script=Duployan} (143)
4779 \p{Script: Duployan} (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
4780 U+1BC70..1BC7C, U+1BC80..1BC88,
4781 U+1BC90..1BC99, U+1BC9C..1BC9F)
4782 \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (=
4783 \p{Script_Extensions=
4784 Egyptian_Hieroglyphs}) (1071)
4785 \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
4786 Egyptian_Hieroglyphs} (Short: \p{Sc=
4787 Egyp}, \p{Egyp}) (1071)
4788 \p{Script: Elba} \p{Script=Elbasan} (=
4789 \p{Script_Extensions=Elbasan}) (40)
4790 \p{Script: Elbasan} \p{Script_Extensions=Elbasan} (Short:
4791 \p{Sc=Elba}, \p{Elba}) (40)
4792 \p{Script: Ethi} \p{Script=Ethiopic} (=
4793 \p{Script_Extensions=Ethiopic}) (495)
4794 \p{Script: Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
4795 \p{Sc=Ethi}, \p{Ethi}) (495)
4796 \p{Script: Geor} \p{Script=Georgian} (127)
4797 \p{Script: Georgian} (Short: \p{Sc=Geor}) (127: U+10A0..10C5,
4798 U+10C7, U+10CD, U+10D0..10FA,
4799 U+10FC..10FF, U+2D00..2D25 ...)
4800 \p{Script: Glag} \p{Script=Glagolitic} (132)
4801 \p{Script: Glagolitic} (Short: \p{Sc=Glag}) (132: U+2C00..2C2E,
4802 U+2C30..2C5E, U+1E000..1E006,
4803 U+1E008..1E018, U+1E01B..1E021,
4804 U+1E023..1E024 ...)
4805 \p{Script: Gonm} \p{Script=Masaram_Gondi} (=
4806 \p{Script_Extensions=Masaram_Gondi}) (75)
4807 \p{Script: Goth} \p{Script=Gothic} (= \p{Script_Extensions=
4808 Gothic}) (27)
4809 \p{Script: Gothic} \p{Script_Extensions=Gothic} (Short:
4810 \p{Sc=Goth}, \p{Goth}) (27)
4811 \p{Script: Gran} \p{Script=Grantha} (85)
4812 \p{Script: Grantha} (Short: \p{Sc=Gran}) (85: U+11300..11303,
4813 U+11305..1130C, U+1130F..11310,
4814 U+11313..11328, U+1132A..11330,
4815 U+11332..11333 ...)
4816 \p{Script: Greek} (Short: \p{Sc=Grek}) (518: U+0370..0373,
4817 U+0375..0377, U+037A..037D, U+037F,
4818 U+0384, U+0386 ...)
4819 \p{Script: Grek} \p{Script=Greek} (518)
4820 \p{Script: Gujarati} (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
4821 U+0A85..0A8D, U+0A8F..0A91,
4822 U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
4823 ...)
4824 \p{Script: Gujr} \p{Script=Gujarati} (91)
4825 \p{Script: Gurmukhi} (Short: \p{Sc=Guru}) (79: U+0A01..0A03,
4826 U+0A05..0A0A, U+0A0F..0A10,
4827 U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
4828 ...)
4829 \p{Script: Guru} \p{Script=Gurmukhi} (79)
4830 \p{Script: Han} (Short: \p{Sc=Han}) (89_228: U+2E80..2E99,
4831 U+2E9B..2EF3, U+2F00..2FD5, U+3005,
4832 U+3007, U+3021..3029 ...)
4833 \p{Script: Hang} \p{Script=Hangul} (11_739)
4834 \p{Script: Hangul} (Short: \p{Sc=Hang}) (11_739:
4835 U+1100..11FF, U+302E..302F,
4836 U+3131..318E, U+3200..321E,
4837 U+3260..327E, U+A960..A97C ...)
4838 \p{Script: Hani} \p{Script=Han} (89_228)
4839 \p{Script: Hano} \p{Script=Hanunoo} (21)
4840 \p{Script: Hanunoo} (Short: \p{Sc=Hano}) (21: U+1720..1734)
4841 \p{Script: Hatr} \p{Script=Hatran} (= \p{Script_Extensions=
4842 Hatran}) (26)
4843 \p{Script: Hatran} \p{Script_Extensions=Hatran} (Short:
4844 \p{Sc=Hatr}, \p{Hatr}) (26)
4845 \p{Script: Hebr} \p{Script=Hebrew} (= \p{Script_Extensions=
4846 Hebrew}) (133)
4847 \p{Script: Hebrew} \p{Script_Extensions=Hebrew} (Short:
4848 \p{Sc=Hebr}, \p{Hebr}) (133)
4849 \p{Script: Hira} \p{Script=Hiragana} (376)
4850 \p{Script: Hiragana} (Short: \p{Sc=Hira}) (376: U+3041..3096,
4851 U+309D..309F, U+1B001..1B11E, U+1F200)
4852 \p{Script: Hluw} \p{Script=Anatolian_Hieroglyphs} (=
4853 \p{Script_Extensions=
4854 Anatolian_Hieroglyphs}) (583)
4855 \p{Script: Hmng} \p{Script=Pahawh_Hmong} (=
4856 \p{Script_Extensions=Pahawh_Hmong}) (127)
4857 \p{Script: Hung} \p{Script=Old_Hungarian} (=
4858 \p{Script_Extensions=Old_Hungarian})
4859 (108)
4860 \p{Script: Imperial_Aramaic} \p{Script_Extensions=
4861 Imperial_Aramaic} (Short: \p{Sc=Armi},
4862 \p{Armi}) (31)
4863 \p{Script: Inherited} (Short: \p{Sc=Zinh}) (568: U+0300..036F,
4864 U+0485..0486, U+064B..0655, U+0670,
4865 U+0951..0952, U+1AB0..1ABE ...)
4866 \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
4867 Inscriptional_Pahlavi} (Short: \p{Sc=
4868 Phli}, \p{Phli}) (27)
4869 \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
4870 Inscriptional_Parthian} (Short: \p{Sc=
4871 Prti}, \p{Prti}) (30)
4872 \p{Script: Ital} \p{Script=Old_Italic} (=
4873 \p{Script_Extensions=Old_Italic}) (39)
4874 \p{Script: Java} \p{Script=Javanese} (90)
4875 \p{Script: Javanese} (Short: \p{Sc=Java}) (90: U+A980..A9CD,
4876 U+A9D0..A9D9, U+A9DE..A9DF)
4877 \p{Script: Kaithi} (Short: \p{Sc=Kthi}) (66: U+11080..110C1)
4878 \p{Script: Kali} \p{Script=Kayah_Li} (47)
4879 \p{Script: Kana} \p{Script=Katakana} (300)
4880 \p{Script: Kannada} (Short: \p{Sc=Knda}) (88: U+0C80..0C83,
4881 U+0C85..0C8C, U+0C8E..0C90,
4882 U+0C92..0CA8, U+0CAA..0CB3, U+0CB5..0CB9
4883 ...)
4884 \p{Script: Katakana} (Short: \p{Sc=Kana}) (300: U+30A1..30FA,
4885 U+30FD..30FF, U+31F0..31FF,
4886 U+32D0..32FE, U+3300..3357, U+FF66..FF6F
4887 ...)
4888 \p{Script: Kayah_Li} (Short: \p{Sc=Kali}) (47: U+A900..A92D,
4889 U+A92F)
4890 \p{Script: Khar} \p{Script=Kharoshthi} (=
4891 \p{Script_Extensions=Kharoshthi}) (65)
4892 \p{Script: Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
4893 \p{Sc=Khar}, \p{Khar}) (65)
4894 \p{Script: Khmer} \p{Script_Extensions=Khmer} (Short: \p{Sc=
4895 Khmr}, \p{Khmr}) (146)
4896 \p{Script: Khmr} \p{Script=Khmer} (= \p{Script_Extensions=
4897 Khmer}) (146)
4898 \p{Script: Khoj} \p{Script=Khojki} (62)
4899 \p{Script: Khojki} (Short: \p{Sc=Khoj}) (62: U+11200..11211,
4900 U+11213..1123E)
4901 \p{Script: Khudawadi} (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
4902 U+112F0..112F9)
4903 \p{Script: Knda} \p{Script=Kannada} (88)
4904 \p{Script: Kthi} \p{Script=Kaithi} (66)
4905 \p{Script: Lana} \p{Script=Tai_Tham} (=
4906 \p{Script_Extensions=Tai_Tham}) (127)
4907 \p{Script: Lao} \p{Script_Extensions=Lao} (Short: \p{Sc=
4908 Lao}, \p{Lao}) (67)
4909 \p{Script: Laoo} \p{Script=Lao} (= \p{Script_Extensions=
4910 Lao}) (67)
4911 \p{Script: Latin} (Short: \p{Sc=Latn}) (1350: [A-Za-
4912 z\xaa\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
4913 U+0100..02B8, U+02E0..02E4,
4914 U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
4915 ...)
4916 \p{Script: Latn} \p{Script=Latin} (1350)
4917 \p{Script: Lepc} \p{Script=Lepcha} (= \p{Script_Extensions=
4918 Lepcha}) (74)
4919 \p{Script: Lepcha} \p{Script_Extensions=Lepcha} (Short:
4920 \p{Sc=Lepc}, \p{Lepc}) (74)
4921 \p{Script: Limb} \p{Script=Limbu} (68)
4922 \p{Script: Limbu} (Short: \p{Sc=Limb}) (68: U+1900..191E,
4923 U+1920..192B, U+1930..193B, U+1940,
4924 U+1944..194F)
4925 \p{Script: Lina} \p{Script=Linear_A} (341)
4926 \p{Script: Linb} \p{Script=Linear_B} (211)
4927 \p{Script: Linear_A} (Short: \p{Sc=Lina}) (341: U+10600..10736,
4928 U+10740..10755, U+10760..10767)
4929 \p{Script: Linear_B} (Short: \p{Sc=Linb}) (211: U+10000..1000B,
4930 U+1000D..10026, U+10028..1003A,
4931 U+1003C..1003D, U+1003F..1004D,
4932 U+10050..1005D ...)
4933 \p{Script: Lisu} \p{Script_Extensions=Lisu} (Short: \p{Sc=
4934 Lisu}, \p{Lisu}) (48)
4935 \p{Script: Lyci} \p{Script=Lycian} (= \p{Script_Extensions=
4936 Lycian}) (29)
4937 \p{Script: Lycian} \p{Script_Extensions=Lycian} (Short:
4938 \p{Sc=Lyci}, \p{Lyci}) (29)
4939 \p{Script: Lydi} \p{Script=Lydian} (= \p{Script_Extensions=
4940 Lydian}) (27)
4941 \p{Script: Lydian} \p{Script_Extensions=Lydian} (Short:
4942 \p{Sc=Lydi}, \p{Lydi}) (27)
4943 \p{Script: Mahajani} (Short: \p{Sc=Mahj}) (39: U+11150..11176)
4944 \p{Script: Mahj} \p{Script=Mahajani} (39)
4945 \p{Script: Malayalam} (Short: \p{Sc=Mlym}) (117: U+0D00..0D03,
4946 U+0D05..0D0C, U+0D0E..0D10,
4947 U+0D12..0D44, U+0D46..0D48, U+0D4A..0D4F
4948 ...)
4949 \p{Script: Mand} \p{Script=Mandaic} (29)
4950 \p{Script: Mandaic} (Short: \p{Sc=Mand}) (29: U+0840..085B,
4951 U+085E)
4952 \p{Script: Mani} \p{Script=Manichaean} (51)
4953 \p{Script: Manichaean} (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
4954 U+10AEB..10AF6)
4955 \p{Script: Marc} \p{Script=Marchen} (=
4956 \p{Script_Extensions=Marchen}) (68)
4957 \p{Script: Marchen} \p{Script_Extensions=Marchen} (Short:
4958 \p{Sc=Marc}, \p{Marc}) (68)
4959 \p{Script: Masaram_Gondi} \p{Script_Extensions=Masaram_Gondi}
4960 (Short: \p{Sc=Gonm}, \p{Gonm}) (75)
4961 \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
4962 (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
4963 \p{Script: Mend} \p{Script=Mende_Kikakui} (=
4964 \p{Script_Extensions=Mende_Kikakui})
4965 (213)
4966 \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
4967 (Short: \p{Sc=Mend}, \p{Mend}) (213)
4968 \p{Script: Merc} \p{Script=Meroitic_Cursive} (=
4969 \p{Script_Extensions=Meroitic_Cursive})
4970 (90)
4971 \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (=
4972 \p{Script_Extensions=
4973 Meroitic_Hieroglyphs}) (32)
4974 \p{Script: Meroitic_Cursive} \p{Script_Extensions=
4975 Meroitic_Cursive} (Short: \p{Sc=Merc},
4976 \p{Merc}) (90)
4977 \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
4978 Meroitic_Hieroglyphs} (Short: \p{Sc=
4979 Mero}, \p{Mero}) (32)
4980 \p{Script: Miao} \p{Script_Extensions=Miao} (Short: \p{Sc=
4981 Miao}, \p{Miao}) (133)
4982 \p{Script: Mlym} \p{Script=Malayalam} (117)
4983 \p{Script: Modi} (Short: \p{Sc=Modi}) (79: U+11600..11644,
4984 U+11650..11659)
4985 \p{Script: Mong} \p{Script=Mongolian} (166)
4986 \p{Script: Mongolian} (Short: \p{Sc=Mong}) (166: U+1800..1801,
4987 U+1804, U+1806..180E, U+1810..1819,
4988 U+1820..1877, U+1880..18AA ...)
4989 \p{Script: Mro} \p{Script_Extensions=Mro} (Short: \p{Sc=
4990 Mro}, \p{Mro}) (43)
4991 \p{Script: Mroo} \p{Script=Mro} (= \p{Script_Extensions=
4992 Mro}) (43)
4993 \p{Script: Mtei} \p{Script=Meetei_Mayek} (=
4994 \p{Script_Extensions=Meetei_Mayek}) (79)
4995 \p{Script: Mult} \p{Script=Multani} (38)
4996 \p{Script: Multani} (Short: \p{Sc=Mult}) (38: U+11280..11286,
4997 U+11288, U+1128A..1128D, U+1128F..1129D,
4998 U+1129F..112A9)
4999 \p{Script: Myanmar} (Short: \p{Sc=Mymr}) (223: U+1000..109F,
5000 U+A9E0..A9FE, U+AA60..AA7F)
5001 \p{Script: Mymr} \p{Script=Myanmar} (223)
5002 \p{Script: Nabataean} \p{Script_Extensions=Nabataean} (Short:
5003 \p{Sc=Nbat}, \p{Nbat}) (40)
5004 \p{Script: Narb} \p{Script=Old_North_Arabian} (=
5005 \p{Script_Extensions=Old_North_Arabian})
5006 (32)
5007 \p{Script: Nbat} \p{Script=Nabataean} (=
5008 \p{Script_Extensions=Nabataean}) (40)
5009 \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
5010 \p{Sc=Talu}, \p{Talu}) (83)
5011 \p{Script: Newa} \p{Script_Extensions=Newa} (Short: \p{Sc=
5012 Newa}, \p{Newa}) (92)
5013 \p{Script: Nko} \p{Script_Extensions=Nko} (Short: \p{Sc=
5014 Nko}, \p{Nko}) (59)
5015 \p{Script: Nkoo} \p{Script=Nko} (= \p{Script_Extensions=
5016 Nko}) (59)
5017 \p{Script: Nshu} \p{Script=Nushu} (= \p{Script_Extensions=
5018 Nushu}) (397)
5019 \p{Script: Nushu} \p{Script_Extensions=Nushu} (Short: \p{Sc=
5020 Nshu}, \p{Nshu}) (397)
5021 \p{Script: Ogam} \p{Script=Ogham} (= \p{Script_Extensions=
5022 Ogham}) (29)
5023 \p{Script: Ogham} \p{Script_Extensions=Ogham} (Short: \p{Sc=
5024 Ogam}, \p{Ogam}) (29)
5025 \p{Script: Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
5026 \p{Sc=Olck}, \p{Olck}) (48)
5027 \p{Script: Olck} \p{Script=Ol_Chiki} (=
5028 \p{Script_Extensions=Ol_Chiki}) (48)
5029 \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
5030 (Short: \p{Sc=Hung}, \p{Hung}) (108)
5031 \p{Script: Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
5032 \p{Sc=Ital}, \p{Ital}) (39)
5033 \p{Script: Old_North_Arabian} \p{Script_Extensions=
5034 Old_North_Arabian} (Short: \p{Sc=Narb},
5035 \p{Narb}) (32)
5036 \p{Script: Old_Permic} (Short: \p{Sc=Perm}) (43: U+10350..1037A)
5037 \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
5038 \p{Sc=Xpeo}, \p{Xpeo}) (50)
5039 \p{Script: Old_South_Arabian} \p{Script_Extensions=
5040 Old_South_Arabian} (Short: \p{Sc=Sarb},
5041 \p{Sarb}) (32)
5042 \p{Script: Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
5043 \p{Sc=Orkh}, \p{Orkh}) (73)
5044 \p{Script: Oriya} (Short: \p{Sc=Orya}) (90: U+0B01..0B03,
5045 U+0B05..0B0C, U+0B0F..0B10,
5046 U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
5047 ...)
5048 \p{Script: Orkh} \p{Script=Old_Turkic} (=
5049 \p{Script_Extensions=Old_Turkic}) (73)
5050 \p{Script: Orya} \p{Script=Oriya} (90)
5051 \p{Script: Osage} \p{Script_Extensions=Osage} (Short: \p{Sc=
5052 Osge}, \p{Osge}) (72)
5053 \p{Script: Osge} \p{Script=Osage} (= \p{Script_Extensions=
5054 Osage}) (72)
5055 \p{Script: Osma} \p{Script=Osmanya} (=
5056 \p{Script_Extensions=Osmanya}) (40)
5057 \p{Script: Osmanya} \p{Script_Extensions=Osmanya} (Short:
5058 \p{Sc=Osma}, \p{Osma}) (40)
5059 \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
5060 (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
5061 \p{Script: Palm} \p{Script=Palmyrene} (=
5062 \p{Script_Extensions=Palmyrene}) (32)
5063 \p{Script: Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
5064 \p{Sc=Palm}, \p{Palm}) (32)
5065 \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
5066 \p{Sc=Pauc}, \p{Pauc}) (57)
5067 \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (=
5068 \p{Script_Extensions=Pau_Cin_Hau}) (57)
5069 \p{Script: Perm} \p{Script=Old_Permic} (43)
5070 \p{Script: Phag} \p{Script=Phags_Pa} (56)
5071 \p{Script: Phags_Pa} (Short: \p{Sc=Phag}) (56: U+A840..A877)
5072 \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (=
5073 \p{Script_Extensions=
5074 Inscriptional_Pahlavi}) (27)
5075 \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29)
5076 \p{Script: Phnx} \p{Script=Phoenician} (=
5077 \p{Script_Extensions=Phoenician}) (29)
5078 \p{Script: Phoenician} \p{Script_Extensions=Phoenician} (Short:
5079 \p{Sc=Phnx}, \p{Phnx}) (29)
5080 \p{Script: Plrd} \p{Script=Miao} (= \p{Script_Extensions=
5081 Miao}) (133)
5082 \p{Script: Prti} \p{Script=Inscriptional_Parthian} (=
5083 \p{Script_Extensions=
5084 Inscriptional_Parthian}) (30)
5085 \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
5086 U+10B80..10B91, U+10B99..10B9C,
5087 U+10BA9..10BAF)
5088 \p{Script: Qaac} \p{Script=Coptic} (137)
5089 \p{Script: Qaai} \p{Script=Inherited} (568)
5090 \p{Script: Rejang} \p{Script_Extensions=Rejang} (Short:
5091 \p{Sc=Rjng}, \p{Rjng}) (37)
5092 \p{Script: Rjng} \p{Script=Rejang} (= \p{Script_Extensions=
5093 Rejang}) (37)
5094 \p{Script: Runic} \p{Script_Extensions=Runic} (Short: \p{Sc=
5095 Runr}, \p{Runr}) (86)
5096 \p{Script: Runr} \p{Script=Runic} (= \p{Script_Extensions=
5097 Runic}) (86)
5098 \p{Script: Samaritan} \p{Script_Extensions=Samaritan} (Short:
5099 \p{Sc=Samr}, \p{Samr}) (61)
5100 \p{Script: Samr} \p{Script=Samaritan} (=
5101 \p{Script_Extensions=Samaritan}) (61)
5102 \p{Script: Sarb} \p{Script=Old_South_Arabian} (=
5103 \p{Script_Extensions=Old_South_Arabian})
5104 (32)
5105 \p{Script: Saur} \p{Script=Saurashtra} (=
5106 \p{Script_Extensions=Saurashtra}) (82)
5107 \p{Script: Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
5108 \p{Sc=Saur}, \p{Saur}) (82)
5109 \p{Script: Sgnw} \p{Script=SignWriting} (=
5110 \p{Script_Extensions=SignWriting}) (672)
5111 \p{Script: Sharada} (Short: \p{Sc=Shrd}) (94: U+11180..111CD,
5112 U+111D0..111DF)
5113 \p{Script: Shavian} \p{Script_Extensions=Shavian} (Short:
5114 \p{Sc=Shaw}, \p{Shaw}) (48)
5115 \p{Script: Shaw} \p{Script=Shavian} (=
5116 \p{Script_Extensions=Shavian}) (48)
5117 \p{Script: Shrd} \p{Script=Sharada} (94)
5118 \p{Script: Sidd} \p{Script=Siddham} (=
5119 \p{Script_Extensions=Siddham}) (92)
5120 \p{Script: Siddham} \p{Script_Extensions=Siddham} (Short:
5121 \p{Sc=Sidd}, \p{Sidd}) (92)
5122 \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
5123 \p{Sc=Sgnw}, \p{Sgnw}) (672)
5124 \p{Script: Sind} \p{Script=Khudawadi} (69)
5125 \p{Script: Sinh} \p{Script=Sinhala} (110)
5126 \p{Script: Sinhala} (Short: \p{Sc=Sinh}) (110: U+0D82..0D83,
5127 U+0D85..0D96, U+0D9A..0DB1,
5128 U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
5129 \p{Script: Sora} \p{Script=Sora_Sompeng} (=
5130 \p{Script_Extensions=Sora_Sompeng}) (35)
5131 \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
5132 (Short: \p{Sc=Sora}, \p{Sora}) (35)
5133 \p{Script: Soyo} \p{Script=Soyombo} (=
5134 \p{Script_Extensions=Soyombo}) (80)
5135 \p{Script: Soyombo} \p{Script_Extensions=Soyombo} (Short:
5136 \p{Sc=Soyo}, \p{Soyo}) (80)
5137 \p{Script: Sund} \p{Script=Sundanese} (=
5138 \p{Script_Extensions=Sundanese}) (72)
5139 \p{Script: Sundanese} \p{Script_Extensions=Sundanese} (Short:
5140 \p{Sc=Sund}, \p{Sund}) (72)
5141 \p{Script: Sylo} \p{Script=Syloti_Nagri} (44)
5142 \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (44: U+A800..A82B)
5143 \p{Script: Syrc} \p{Script=Syriac} (88)
5144 \p{Script: Syriac} (Short: \p{Sc=Syrc}) (88: U+0700..070D,
5145 U+070F..074A, U+074D..074F, U+0860..086A)
5146 \p{Script: Tagalog} (Short: \p{Sc=Tglg}) (20: U+1700..170C,
5147 U+170E..1714)
5148 \p{Script: Tagb} \p{Script=Tagbanwa} (18)
5149 \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}) (18: U+1760..176C,
5150 U+176E..1770, U+1772..1773)
5151 \p{Script: Tai_Le} (Short: \p{Sc=Tale}) (35: U+1950..196D,
5152 U+1970..1974)
5153 \p{Script: Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
5154 \p{Sc=Lana}, \p{Lana}) (127)
5155 \p{Script: Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
5156 \p{Sc=Tavt}, \p{Tavt}) (72)
5157 \p{Script: Takr} \p{Script=Takri} (66)
5158 \p{Script: Takri} (Short: \p{Sc=Takr}) (66: U+11680..116B7,
5159 U+116C0..116C9)
5160 \p{Script: Tale} \p{Script=Tai_Le} (35)
5161 \p{Script: Talu} \p{Script=New_Tai_Lue} (=
5162 \p{Script_Extensions=New_Tai_Lue}) (83)
5163 \p{Script: Tamil} (Short: \p{Sc=Taml}) (72: U+0B82..0B83,
5164 U+0B85..0B8A, U+0B8E..0B90,
5165 U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
5166 \p{Script: Taml} \p{Script=Tamil} (72)
5167 \p{Script: Tang} \p{Script=Tangut} (= \p{Script_Extensions=
5168 Tangut}) (6881)
5169 \p{Script: Tangut} \p{Script_Extensions=Tangut} (Short:
5170 \p{Sc=Tang}, \p{Tang}) (6881)
5171 \p{Script: Tavt} \p{Script=Tai_Viet} (=
5172 \p{Script_Extensions=Tai_Viet}) (72)
5173 \p{Script: Telu} \p{Script=Telugu} (96)
5174 \p{Script: Telugu} (Short: \p{Sc=Telu}) (96: U+0C00..0C03,
5175 U+0C05..0C0C, U+0C0E..0C10,
5176 U+0C12..0C28, U+0C2A..0C39, U+0C3D..0C44
5177 ...)
5178 \p{Script: Tfng} \p{Script=Tifinagh} (=
5179 \p{Script_Extensions=Tifinagh}) (59)
5180 \p{Script: Tglg} \p{Script=Tagalog} (20)
5181 \p{Script: Thaa} \p{Script=Thaana} (50)
5182 \p{Script: Thaana} (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
5183 \p{Script: Thai} \p{Script_Extensions=Thai} (Short: \p{Sc=
5184 Thai}, \p{Thai}) (86)
5185 \p{Script: Tibetan} \p{Script_Extensions=Tibetan} (Short:
5186 \p{Sc=Tibt}, \p{Tibt}) (207)
5187 \p{Script: Tibt} \p{Script=Tibetan} (=
5188 \p{Script_Extensions=Tibetan}) (207)
5189 \p{Script: Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
5190 \p{Sc=Tfng}, \p{Tfng}) (59)
5191 \p{Script: Tirh} \p{Script=Tirhuta} (82)
5192 \p{Script: Tirhuta} (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
5193 U+114D0..114D9)
5194 \p{Script: Ugar} \p{Script=Ugaritic} (=
5195 \p{Script_Extensions=Ugaritic}) (31)
5196 \p{Script: Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
5197 \p{Sc=Ugar}, \p{Ugar}) (31)
5198 \p{Script: Unknown} \p{Script_Extensions=Unknown} (Short:
5199 \p{Sc=Zzzz}, \p{Zzzz}) (977_357 plus all
5200 above-Unicode code points)
5201 \p{Script: Vai} \p{Script_Extensions=Vai} (Short: \p{Sc=
5202 Vai}, \p{Vai}) (300)
5203 \p{Script: Vaii} \p{Script=Vai} (= \p{Script_Extensions=
5204 Vai}) (300)
5205 \p{Script: Wara} \p{Script=Warang_Citi} (=
5206 \p{Script_Extensions=Warang_Citi}) (84)
5207 \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
5208 \p{Sc=Wara}, \p{Wara}) (84)
5209 \p{Script: Xpeo} \p{Script=Old_Persian} (=
5210 \p{Script_Extensions=Old_Persian}) (50)
5211 \p{Script: Xsux} \p{Script=Cuneiform} (=
5212 \p{Script_Extensions=Cuneiform}) (1234)
5213 \p{Script: Yi} (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
5214 U+A490..A4C6)
5215 \p{Script: Yiii} \p{Script=Yi} (1220)
5216 \p{Script: Zanabazar_Square} \p{Script_Extensions=
5217 Zanabazar_Square} (Short: \p{Sc=Zanb},
5218 \p{Zanb}) (72)
5219 \p{Script: Zanb} \p{Script=Zanabazar_Square} (=
5220 \p{Script_Extensions=Zanabazar_Square})
5221 (72)
5222 \p{Script: Zinh} \p{Script=Inherited} (568)
5223 \p{Script: Zyyy} \p{Script=Common} (7363)
5224 \p{Script: Zzzz} \p{Script=Unknown} (=
5225 \p{Script_Extensions=Unknown}) (977_357
5226 plus all above-Unicode code points)
5227 \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (88:
5228 U+0640, U+1E900..1E94A, U+1E950..1E959,
5229 U+1E95E..1E95F)
5230 \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (88)
5231 \p{Script_Extensions: Aghb} \p{Script_Extensions=
5232 Caucasian_Albanian} (53)
5233 \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (57:
5234 U+11700..11719, U+1171D..1172B,
5235 U+11730..1173F)
5236 \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
5237 \p{Hluw}) (583: U+14400..14646)
5238 \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1324)
5239 \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
5240 (1324: U+0600..0604, U+0606..061C,
5241 U+061E..06DC, U+06DE..06FF,
5242 U+0750..077F, U+08A0..08B4 ...)
5243 \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
5244 (94: U+0531..0556, U+0559..055F,
5245 U+0561..0587, U+0589..058A,
5246 U+058D..058F, U+FB13..FB17)
5247 \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
5248 (31)
5249 \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (94)
5250 \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
5251 (61: U+10B00..10B35, U+10B39..10B3F)
5252 \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
5253 \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
5254 \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
5255 (121: U+1B00..1B4B, U+1B50..1B7C)
5256 \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
5257 \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
5258 U+A6A0..A6F7, U+16800..16A38)
5259 \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
5260 \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
5261 (36: U+16AD0..16AED, U+16AF0..16AF5)
5262 \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
5263 U+1BC0..1BF3, U+1BFC..1BFF)
5264 \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
5265 \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (101)
5266 \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
5267 (101: U+0951..0952, U+0964..0965,
5268 U+0980..0983, U+0985..098C,
5269 U+098F..0990, U+0993..09A8 ...)
5270 \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
5271 (97: U+11C00..11C08, U+11C0A..11C36,
5272 U+11C38..11C45, U+11C50..11C6C)
5273 \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
5274 \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (111)
5275 \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
5276 (111: U+02EA..02EB, U+3001..3003,
5277 U+3008..3011, U+3013..301F,
5278 U+302A..302D, U+3030 ...)
5279 \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
5280 \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
5281 (109: U+11000..1104D, U+11052..1106F,
5282 U+1107F)
5283 \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
5284 \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
5285 (256: U+2800..28FF)
5286 \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
5287 \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
5288 (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
5289 \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
5290 \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
5291 U+1735..1736, U+1740..1753)
5292 \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (87)
5293 \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
5294 \p{Cans}) (710: U+1400..167F,
5295 U+18B0..18F5)
5296 \p{Script_Extensions: Cans} \p{Script_Extensions=
5297 Canadian_Aboriginal} (710)
5298 \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
5299 \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
5300 U+102A0..102D0)
5301 \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
5302 \p{Aghb}) (53: U+10530..10563, U+1056F)
5303 \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (87:
5304 U+09E6..09EF, U+1040..1049,
5305 U+11100..11134, U+11136..11143)
5306 \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
5307 U+AA00..AA36, U+AA40..AA4D,
5308 U+AA50..AA59, U+AA5C..AA5F)
5309 \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
5310 \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
5311 (172: U+13A0..13F5, U+13F8..13FD,
5312 U+AB70..ABBF)
5313 \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
5314 (6947: [\x00-
5315 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
5316 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
5317 \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
5318 U+02E5..02E9, U+02EC..02FF, U+0374,
5319 U+037E ...)
5320 \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
5321 \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
5322 (165: U+03E2..03EF, U+2C80..2CF3,
5323 U+2CF9..2CFF, U+102E0..102FB)
5324 \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
5325 \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
5326 (1234: U+12000..12399, U+12400..1246E,
5327 U+12470..12474, U+12480..12543)
5328 \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
5329 (112: U+10100..10102, U+10107..10133,
5330 U+10137..1013F, U+10800..10805, U+10808,
5331 U+1080A..10835 ...)
5332 \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
5333 (446: U+0400..052F, U+1C80..1C88,
5334 U+1D2B, U+1D78, U+2DE0..2DFF, U+2E43 ...)
5335 \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (446)
5336 \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
5337 (80: U+10400..1044F)
5338 \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
5339 \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
5340 (210: U+0900..097F, U+1CD0..1CF6,
5341 U+1CF8..1CF9, U+20F0, U+A830..A839,
5342 U+A8E0..A8FD)
5343 \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
5344 \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
5345 \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
5346 (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
5347 U+1BC80..1BC88, U+1BC90..1BC99,
5348 U+1BC9C..1BCA3)
5349 \p{Script_Extensions: Egyp} \p{Script_Extensions=
5350 Egyptian_Hieroglyphs} (1071)
5351 \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
5352 \p{Egyp}) (1071: U+13000..1342E)
5353 \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
5354 \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
5355 (40: U+10500..10527)
5356 \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
5357 \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
5358 (495: U+1200..1248, U+124A..124D,
5359 U+1250..1256, U+1258, U+125A..125D,
5360 U+1260..1288 ...)
5361 \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (129)
5362 \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
5363 (129: U+0589, U+10A0..10C5, U+10C7,
5364 U+10CD, U+10D0..10FF, U+2D00..2D25 ...)
5365 \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136)
5366 \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
5367 (136: U+0484, U+0487, U+2C00..2C2E,
5368 U+2C30..2C5E, U+2E43, U+A66F ...)
5369 \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
5370 (75)
5371 \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
5372 \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
5373 U+10330..1034A)
5374 \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (113)
5375 \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
5376 (113: U+0951..0952, U+0964..0965,
5377 U+0BAA, U+0BB5, U+0BE6..0BF2, U+1CD0 ...)
5378 \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
5379 U+0342, U+0345, U+0370..0373,
5380 U+0375..0377, U+037A..037D, U+037F ...)
5381 \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
5382 \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
5383 (105: U+0951..0952, U+0964..0965,
5384 U+0A81..0A83, U+0A85..0A8D,
5385 U+0A8F..0A91, U+0A93..0AA8 ...)
5386 \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
5387 \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
5388 (93: U+0951..0952, U+0964..0965,
5389 U+0A01..0A03, U+0A05..0A0A,
5390 U+0A0F..0A10, U+0A13..0A28 ...)
5391 \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (93)
5392 \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (89_507:
5393 U+2E80..2E99, U+2E9B..2EF3,
5394 U+2F00..2FD5, U+3001..3003,
5395 U+3005..3011, U+3013..301F ...)
5396 \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
5397 \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
5398 (11_775: U+1100..11FF, U+3001..3003,
5399 U+3008..3011, U+3013..301F,
5400 U+302E..3030, U+3037 ...)
5401 \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (89_507)
5402 \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
5403 \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
5404 (23: U+1720..1736)
5405 \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
5406 \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
5407 U+108E0..108F2, U+108F4..108F5,
5408 U+108FB..108FF)
5409 \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (133)
5410 \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
5411 (133: U+0591..05C7, U+05D0..05EA,
5412 U+05F0..05F4, U+FB1D..FB36,
5413 U+FB38..FB3C, U+FB3E ...)
5414 \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (428)
5415 \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
5416 (428: U+3001..3003, U+3008..3011,
5417 U+3013..301F, U+3030..3035, U+3037,
5418 U+303C..303D ...)
5419 \p{Script_Extensions: Hluw} \p{Script_Extensions=
5420 Anatolian_Hieroglyphs} (583)
5421 \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
5422 (127)
5423 \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
5424 (108)
5425 \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
5426 \p{Armi}) (31: U+10840..10855,
5427 U+10857..1085F)
5428 \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
5429 (500: U+0300..0341, U+0343..0344,
5430 U+0346..0362, U+1AB0..1ABE,
5431 U+1DC2..1DF9, U+1DFB..1DFF ...)
5432 \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
5433 \p{Phli}) (27: U+10B60..10B72,
5434 U+10B78..10B7F)
5435 \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
5436 Prti}, \p{Prti}) (30: U+10B40..10B55,
5437 U+10B58..10B5F)
5438 \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
5439 \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
5440 \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
5441 (91: U+A980..A9CD, U+A9CF..A9D9,
5442 U+A9DE..A9DF)
5443 \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (86:
5444 U+0966..096F, U+A830..A839,
5445 U+11080..110C1)
5446 \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
5447 \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (352)
5448 \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
5449 (100: U+0951..0952, U+0964..0965,
5450 U+0C80..0C83, U+0C85..0C8C,
5451 U+0C8E..0C90, U+0C92..0CA8 ...)
5452 \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
5453 (352: U+3001..3003, U+3008..3011,
5454 U+3013..301F, U+3030..3035, U+3037,
5455 U+303C..303D ...)
5456 \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
5457 (48: U+A900..A92F)
5458 \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (65)
5459 \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
5460 (65: U+10A00..10A03, U+10A05..10A06,
5461 U+10A0C..10A13, U+10A15..10A17,
5462 U+10A19..10A33, U+10A38..10A3A ...)
5463 \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
5464 U+1780..17DD, U+17E0..17E9,
5465 U+17F0..17F9, U+19E0..19FF)
5466 \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
5467 \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (72)
5468 \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (72:
5469 U+0AE6..0AEF, U+11200..11211,
5470 U+11213..1123E)
5471 \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
5472 (81: U+0964..0965, U+A830..A839,
5473 U+112B0..112EA, U+112F0..112F9)
5474 \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (100)
5475 \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (86)
5476 \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
5477 \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (67:
5478 U+0E81..0E82, U+0E84, U+0E87..0E88,
5479 U+0E8A, U+0E8D, U+0E94..0E97 ...)
5480 \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (67)
5481 \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
5482 (1370: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
5483 \xf6\xf8-\xff], U+0100..02B8,
5484 U+02E0..02E4, U+0363..036F,
5485 U+0485..0486, U+0951..0952 ...)
5486 \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1370)
5487 \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
5488 \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
5489 U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
5490 \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
5491 \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
5492 U+0965, U+1900..191E, U+1920..192B,
5493 U+1930..193B, U+1940, U+1944..194F)
5494 \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
5495 \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
5496 \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
5497 (386: U+10107..10133, U+10600..10736,
5498 U+10740..10755, U+10760..10767)
5499 \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
5500 (268: U+10000..1000B, U+1000D..10026,
5501 U+10028..1003A, U+1003C..1003D,
5502 U+1003F..1004D, U+10050..1005D ...)
5503 \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (48:
5504 U+A4D0..A4FF)
5505 \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
5506 \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
5507 U+10280..1029C)
5508 \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
5509 \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
5510 U+10920..10939, U+1093F)
5511 \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
5512 (61: U+0964..096F, U+A830..A839,
5513 U+11150..11176)
5514 \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
5515 \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
5516 (122: U+0951..0952, U+0964..0965,
5517 U+0D00..0D03, U+0D05..0D0C,
5518 U+0D0E..0D10, U+0D12..0D44 ...)
5519 \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
5520 \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
5521 (30: U+0640, U+0840..085B, U+085E)
5522 \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
5523 \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
5524 (52: U+0640, U+10AC0..10AE6,
5525 U+10AEB..10AF6)
5526 \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
5527 \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
5528 (68: U+11C70..11C8F, U+11C92..11CA7,
5529 U+11CA9..11CB6)
5530 \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
5531 \p{Gonm}) (75: U+11D00..11D06,
5532 U+11D08..11D09, U+11D0B..11D36, U+11D3A,
5533 U+11D3C..11D3D, U+11D3F..11D47 ...)
5534 \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
5535 \p{Mtei}) (79: U+AAE0..AAF6,
5536 U+ABC0..ABED, U+ABF0..ABF9)
5537 \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
5538 (213)
5539 \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
5540 \p{Mend}) (213: U+1E800..1E8C4,
5541 U+1E8C7..1E8D6)
5542 \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
5543 (90)
5544 \p{Script_Extensions: Mero} \p{Script_Extensions=
5545 Meroitic_Hieroglyphs} (32)
5546 \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
5547 \p{Merc}) (90: U+109A0..109B7,
5548 U+109BC..109CF, U+109D2..109FF)
5549 \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
5550 \p{Mero}) (32: U+10980..1099F)
5551 \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (133:
5552 U+16F00..16F44, U+16F50..16F7E,
5553 U+16F8F..16F9F)
5554 \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (122)
5555 \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
5556 U+A830..A839, U+11600..11644,
5557 U+11650..11659)
5558 \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (169)
5559 \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
5560 (169: U+1800..180E, U+1810..1819,
5561 U+1820..1877, U+1880..18AA,
5562 U+11660..1166C)
5563 \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
5564 U+16A40..16A5E, U+16A60..16A69,
5565 U+16A6E..16A6F)
5566 \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
5567 \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
5568 \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
5569 \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
5570 (48: U+0A66..0A6F, U+11280..11286,
5571 U+11288, U+1128A..1128D, U+1128F..1129D,
5572 U+1129F..112A9)
5573 \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
5574 (224: U+1000..109F, U+A92E,
5575 U+A9E0..A9FE, U+AA60..AA7F)
5576 \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
5577 \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
5578 (40: U+10880..1089E, U+108A7..108AF)
5579 \p{Script_Extensions: Narb} \p{Script_Extensions=
5580 Old_North_Arabian} (32)
5581 \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
5582 \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
5583 (83: U+1980..19AB, U+19B0..19C9,
5584 U+19D0..19DA, U+19DE..19DF)
5585 \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (92:
5586 U+11400..11459, U+1145B, U+1145D)
5587 \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (59:
5588 U+07C0..07FA)
5589 \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (59)
5590 \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
5591 \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
5592 U+16FE1, U+1B170..1B2FB)
5593 \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
5594 \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
5595 U+1680..169C)
5596 \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
5597 (48: U+1C50..1C7F)
5598 \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
5599 \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
5600 \p{Hung}) (108: U+10C80..10CB2,
5601 U+10CC0..10CF2, U+10CFA..10CFF)
5602 \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
5603 (39: U+10300..10323, U+1032D..1032F)
5604 \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
5605 \p{Narb}) (32: U+10A80..10A9F)
5606 \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
5607 (44: U+0483, U+10350..1037A)
5608 \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
5609 (50: U+103A0..103C3, U+103C8..103D5)
5610 \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
5611 \p{Sarb}) (32: U+10A60..10A7F)
5612 \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
5613 (73: U+10C00..10C48)
5614 \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (94:
5615 U+0951..0952, U+0964..0965,
5616 U+0B01..0B03, U+0B05..0B0C,
5617 U+0B0F..0B10, U+0B13..0B28 ...)
5618 \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
5619 \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (94)
5620 \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
5621 U+104B0..104D3, U+104D8..104FB)
5622 \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
5623 \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
5624 \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
5625 (40: U+10480..1049D, U+104A0..104A9)
5626 \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
5627 \p{Hmng}) (127: U+16B00..16B45,
5628 U+16B50..16B59, U+16B5B..16B61,
5629 U+16B63..16B77, U+16B7D..16B8F)
5630 \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
5631 \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
5632 (32: U+10860..1087F)
5633 \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
5634 (57: U+11AC0..11AF8)
5635 \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
5636 \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
5637 \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
5638 \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
5639 (59: U+1802..1803, U+1805, U+A840..A877)
5640 \p{Script_Extensions: Phli} \p{Script_Extensions=
5641 Inscriptional_Pahlavi} (27)
5642 \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
5643 (30)
5644 \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
5645 \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
5646 (29: U+10900..1091B, U+1091F)
5647 \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (133)
5648 \p{Script_Extensions: Prti} \p{Script_Extensions=
5649 Inscriptional_Parthian} (30)
5650 \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
5651 \p{Phlp}) (30: U+0640, U+10B80..10B91,
5652 U+10B99..10B9C, U+10BA9..10BAF)
5653 \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
5654 \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (500)
5655 \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
5656 U+A930..A953, U+A95F)
5657 \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
5658 \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
5659 U+16A0..16EA, U+16EE..16F8)
5660 \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
5661 \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
5662 (61: U+0800..082D, U+0830..083E)
5663 \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
5664 \p{Script_Extensions: Sarb} \p{Script_Extensions=
5665 Old_South_Arabian} (32)
5666 \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
5667 \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
5668 (82: U+A880..A8C5, U+A8CE..A8D9)
5669 \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
5670 \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
5671 (100: U+0951, U+1CD7, U+1CD9,
5672 U+1CDC..1CDD, U+1CE0, U+11180..111CD ...)
5673 \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
5674 (48: U+10450..1047F)
5675 \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
5676 \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100)
5677 \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
5678 \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
5679 (92: U+11580..115B5, U+115B8..115DD)
5680 \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
5681 (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
5682 U+1DAA1..1DAAF)
5683 \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
5684 \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112)
5685 \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
5686 (112: U+0964..0965, U+0D82..0D83,
5687 U+0D85..0D96, U+0D9A..0DB1,
5688 U+0DB3..0DBB, U+0DBD ...)
5689 \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
5690 \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
5691 \p{Sora}) (35: U+110D0..110E8,
5692 U+110F0..110F9)
5693 \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (80)
5694 \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
5695 (80: U+11A50..11A83, U+11A86..11A9C,
5696 U+11A9E..11AA2)
5697 \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
5698 \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
5699 (72: U+1B80..1BBF, U+1CC0..1CC7)
5700 \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56)
5701 \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
5702 \p{Sylo}) (56: U+0964..0965,
5703 U+09E6..09EF, U+A800..A82B)
5704 \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (105)
5705 \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
5706 (105: U+060C, U+061B..061C, U+061F,
5707 U+0640, U+064B..0655, U+0670 ...)
5708 \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
5709 (22: U+1700..170C, U+170E..1714,
5710 U+1735..1736)
5711 \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
5712 \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
5713 (20: U+1735..1736, U+1760..176C,
5714 U+176E..1770, U+1772..1773)
5715 \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
5716 U+1040..1049, U+1950..196D, U+1970..1974)
5717 \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
5718 (127: U+1A20..1A5E, U+1A60..1A7C,
5719 U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
5720 \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
5721 (72: U+AA80..AAC2, U+AADB..AADF)
5722 \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (78)
5723 \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (78:
5724 U+0964..0965, U+A830..A839,
5725 U+11680..116B7, U+116C0..116C9)
5726 \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
5727 \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
5728 \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (81:
5729 U+0951..0952, U+0964..0965,
5730 U+0B82..0B83, U+0B85..0B8A,
5731 U+0B8E..0B90, U+0B92..0B95 ...)
5732 \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (81)
5733 \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6881)
5734 \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
5735 (6881: U+16FE0, U+17000..187EC,
5736 U+18800..18AF2)
5737 \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
5738 \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (101)
5739 \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
5740 (101: U+0951..0952, U+0964..0965,
5741 U+0C00..0C03, U+0C05..0C0C,
5742 U+0C0E..0C10, U+0C12..0C28 ...)
5743 \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
5744 \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
5745 \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
5746 \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
5747 U+060C, U+061B..061C, U+061F,
5748 U+0660..0669, U+0780..07B1, U+FDF2 ...)
5749 \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
5750 U+0E01..0E3A, U+0E40..0E5B)
5751 \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
5752 (207: U+0F00..0F47, U+0F49..0F6C,
5753 U+0F71..0F97, U+0F99..0FBC,
5754 U+0FBE..0FCC, U+0FCE..0FD4 ...)
5755 \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
5756 \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
5757 (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
5758 \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (94)
5759 \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
5760 (94: U+0964..0965, U+A830..A839,
5761 U+11480..114C7, U+114D0..114D9)
5762 \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
5763 \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
5764 (31: U+10380..1039D, U+1039F)
5765 \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
5766 (977_357 plus all above-Unicode code
5767 points: U+0378..0379, U+0380..0383,
5768 U+038B, U+038D, U+03A2, U+0530 ...)
5769 \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
5770 U+A500..A62B)
5771 \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
5772 \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
5773 \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
5774 (84: U+118A0..118F2, U+118FF)
5775 \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
5776 \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
5777 \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
5778 U+3001..3002, U+3008..3011,
5779 U+3014..301B, U+30FB, U+A000..A48C,
5780 U+A490..A4C6 ...)
5781 \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
5782 \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
5783 \p{Zanb}) (72: U+11A00..11A47)
5784 \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
5785 (72)
5786 \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (500)
5787 \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (6947)
5788 \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (977_357
5789 plus all above-Unicode code points)
5790 \p{Scx: *} \p{Script_Extensions: *}
5791 \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
5792 \p{SD: *} \p{Soft_Dotted: *}
5793 \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
5794 \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
5795 U+FE52, U+FF0E)
5796 \p{Sentence_Break: CL} \p{Sentence_Break=Close} (187)
5797 \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187:
5798 [\"\'\(\)\[\]\{\}\xab\xbb],
5799 U+0F3A..0F3D, U+169B..169C,
5800 U+2018..201F, U+2039..203A, U+2045..2046
5801 ...)
5802 \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1: [\r])
5803 \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (2277)
5804 \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2277: U+0300..036F,
5805 U+0483..0489, U+0591..05BD, U+05BF,
5806 U+05C1..05C2, U+05C4..05C5 ...)
5807 \p{Sentence_Break: FO} \p{Sentence_Break=Format} (53)
5808 \p{Sentence_Break: Format} (Short: \p{SB=FO}) (53: [\xad],
5809 U+0600..0605, U+061C, U+06DD, U+070F,
5810 U+08E2 ...)
5811 \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (121_354)
5812 \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1: [\n])
5813 \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2251)
5814 \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2251: [a-
5815 z\xaa\xb5\xba\xdf-\xf6\xf8-\xff],
5816 U+0101, U+0103, U+0105, U+0107, U+0109
5817 ...)
5818 \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (582)
5819 \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (582: [0-9],
5820 U+0660..0669, U+066B..066C,
5821 U+06F0..06F9, U+07C0..07C9, U+0966..096F
5822 ...)
5823 \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (121_354: U+01BB,
5824 U+01C0..01C3, U+0294, U+02B9..02BF,
5825 U+02C6..02D1, U+02EC ...)
5826 \p{Sentence_Break: Other} (Short: \p{SB=XX}) (985_375 plus all
5827 above-Unicode code points:
5828 [^\t\n\cK\f\r\x20!\"\'\(\),\-.0-9:?A-
5829 Z\[\]a-z\{\}\x85\xa0\xaa-
5830 \xab\xad\xb5\xba-\xbb\xc0-\xd6\xd8-
5831 \xf6\xf8-\xff], U+02C2..02C5,
5832 U+02D2..02DF, U+02E5..02EB, U+02ED,
5833 U+02EF..02FF ...)
5834 \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
5835 \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
5836 U+055D, U+060C..060D, U+07F8, U+1802,
5837 U+1808 ...)
5838 \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
5839 \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
5840 U+2028..2029)
5841 \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
5842 U+1680, U+2000..200A, U+202F, U+205F,
5843 U+3000)
5844 \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (125)
5845 \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (125: [!?], U+0589,
5846 U+061F, U+06D4, U+0700..0702, U+07F9 ...)
5847 \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1853)
5848 \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1853: [A-Z\xc0-
5849 \xd6\xd8-\xde], U+0100, U+0102, U+0104,
5850 U+0106, U+0108 ...)
5851 \p{Sentence_Break: XX} \p{Sentence_Break=Other} (985_375 plus all
5852 above-Unicode code points)
5853 \p{Sentence_Terminal} \p{Sentence_Terminal=Y} (Short: \p{STerm})
5854 (128)
5855 \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
5856 (1_113_984 plus all above-Unicode code
5857 points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
5858 \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
5859 \xff], U+0100..0588, U+058A..061E,
5860 U+0620..06D3, U+06D5..06FF, U+0703..07F8
5861 ...)
5862 \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (128:
5863 [!.?], U+0589, U+061F, U+06D4,
5864 U+0700..0702, U+07F9 ...)
5865 \p{Separator} \p{General_Category=Separator} (Short:
5866 \p{Z}) (19)
5867 \p{Sgnw} \p{SignWriting} (= \p{Script_Extensions=
5868 SignWriting}) (672)
5869 \p{Sharada} \p{Script_Extensions=Sharada} (Short:
5870 \p{Shrd}; NOT \p{Block=Sharada}) (100)
5871 \p{Shavian} \p{Script_Extensions=Shavian} (Short:
5872 \p{Shaw}) (48)
5873 \p{Shaw} \p{Shavian} (= \p{Script_Extensions=
5874 Shavian}) (48)
5875 X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
5876 (16)
5877 \p{Shrd} \p{Sharada} (= \p{Script_Extensions=
5878 Sharada}) (NOT \p{Block=Sharada}) (100)
5879 \p{Sidd} \p{Siddham} (= \p{Script_Extensions=
5880 Siddham}) (NOT \p{Block=Siddham}) (92)
5881 \p{Siddham} \p{Script_Extensions=Siddham} (Short:
5882 \p{Sidd}; NOT \p{Block=Siddham}) (92)
5883 \p{SignWriting} \p{Script_Extensions=SignWriting} (Short:
5884 \p{Sgnw}) (672)
5885 \p{Sind} \p{Khudawadi} (= \p{Script_Extensions=
5886 Khudawadi}) (NOT \p{Block=Khudawadi})
5887 (81)
5888 \p{Sinh} \p{Sinhala} (= \p{Script_Extensions=
5889 Sinhala}) (NOT \p{Block=Sinhala}) (112)
5890 \p{Sinhala} \p{Script_Extensions=Sinhala} (Short:
5891 \p{Sinh}; NOT \p{Block=Sinhala}) (112)
5892 X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
5893 \p{Sk} \p{Modifier_Symbol} (=
5894 \p{General_Category=Modifier_Symbol})
5895 (121)
5896 \p{Sm} \p{Math_Symbol} (= \p{General_Category=
5897 Math_Symbol}) (948)
5898 X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
5899 \p{InSmallForms}) (32)
5900 X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
5901 Small_Form_Variants}) (32)
5902 \p{So} \p{Other_Symbol} (= \p{General_Category=
5903 Other_Symbol}) (5855)
5904 \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
5905 \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
5906 all above-Unicode code points: [\x00-
5907 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
5908 Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
5909 U+0100..012E, U+0130..0248,
5910 U+024A..0267, U+0269..029C, U+029E..02B1
5911 ...)
5912 \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46: [i-j],
5913 U+012F, U+0249, U+0268, U+029D, U+02B2
5914 ...)
5915 \p{Sora} \p{Sora_Sompeng} (= \p{Script_Extensions=
5916 Sora_Sompeng}) (NOT \p{Block=
5917 Sora_Sompeng}) (35)
5918 \p{Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} (Short:
5919 \p{Sora}; NOT \p{Block=Sora_Sompeng})
5920 (35)
5921 \p{Soyo} \p{Soyombo} (= \p{Script_Extensions=
5922 Soyombo}) (NOT \p{Block=Soyombo}) (80)
5923 \p{Soyombo} \p{Script_Extensions=Soyombo} (Short:
5924 \p{Soyo}; NOT \p{Block=Soyombo}) (80)
5925 \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25)
5926 \p{Space: *} \p{White_Space: *}
5927 \p{Space_Separator} \p{General_Category=Space_Separator}
5928 (Short: \p{Zs}) (17)
5929 \p{SpacePerl} \p{XPosixSpace} (25)
5930 \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
5931 \p{Mc}) (401)
5932 X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
5933 (Short: \p{InModifierLetters}) (80)
5934 X \p{Specials} \p{Block=Specials} (16)
5935 \p{STerm} \p{Sentence_Terminal} (=
5936 \p{Sentence_Terminal=Y}) (128)
5937 \p{STerm: *} \p{Sentence_Terminal: *}
5938 \p{Sund} \p{Sundanese} (= \p{Script_Extensions=
5939 Sundanese}) (NOT \p{Block=Sundanese})
5940 (72)
5941 \p{Sundanese} \p{Script_Extensions=Sundanese} (Short:
5942 \p{Sund}; NOT \p{Block=Sundanese}) (72)
5943 X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
5944 Sundanese_Supplement}) (16)
5945 X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
5946 \p{InSundaneseSup}) (16)
5947 X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
5948 Supplemental_Arrows_A}) (16)
5949 X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
5950 Supplemental_Arrows_B}) (128)
5951 X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block=
5952 Supplemental_Arrows_C}) (256)
5953 X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
5954 \p{Block=
5955 Supplemental_Mathematical_Operators})
5956 (256)
5957 X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
5958 \p{Block=
5959 Supplementary_Private_Use_Area_A})
5960 (65_536)
5961 X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
5962 \p{Block=
5963 Supplementary_Private_Use_Area_B})
5964 (65_536)
5965 X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
5966 Supplemental_Punctuation}) (128)
5967 X \p{Sup_Symbols_And_Pictographs}
5968 \p{Supplemental_Symbols_And_Pictographs}
5969 (= \p{Block=
5970 Supplemental_Symbols_And_Pictographs})
5971 (256)
5972 X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
5973 \p{Block=Superscripts_And_Subscripts})
5974 (48)
5975 X \p{Superscripts_And_Subscripts} \p{Block=
5976 Superscripts_And_Subscripts} (Short:
5977 \p{InSuperAndSub}) (48)
5978 X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
5979 \p{InSupArrowsA}) (16)
5980 X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
5981 \p{InSupArrowsB}) (128)
5982 X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
5983 \p{InSupArrowsC}) (256)
5984 X \p{Supplemental_Mathematical_Operators} \p{Block=
5985 Supplemental_Mathematical_Operators}
5986 (Short: \p{InSupMathOperators}) (256)
5987 X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
5988 (Short: \p{InSupPunctuation}) (128)
5989 X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
5990 Supplemental_Symbols_And_Pictographs}
5991 (Short: \p{InSupSymbolsAndPictographs})
5992 (256)
5993 X \p{Supplementary_Private_Use_Area_A} \p{Block=
5994 Supplementary_Private_Use_Area_A}
5995 (Short: \p{InSupPUAA}) (65_536)
5996 X \p{Supplementary_Private_Use_Area_B} \p{Block=
5997 Supplementary_Private_Use_Area_B}
5998 (Short: \p{InSupPUAB}) (65_536)
5999 \p{Surrogate} \p{General_Category=Surrogate} (Short:
6000 \p{Cs}) (2048)
6001 X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688)
6002 \p{Sylo} \p{Syloti_Nagri} (= \p{Script_Extensions=
6003 Syloti_Nagri}) (NOT \p{Block=
6004 Syloti_Nagri}) (56)
6005 \p{Syloti_Nagri} \p{Script_Extensions=Syloti_Nagri} (Short:
6006 \p{Sylo}; NOT \p{Block=Syloti_Nagri})
6007 (56)
6008 \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
6009 (6978)
6010 \p{Syrc} \p{Syriac} (= \p{Script_Extensions=
6011 Syriac}) (NOT \p{Block=Syriac}) (105)
6012 \p{Syriac} \p{Script_Extensions=Syriac} (Short:
6013 \p{Syrc}; NOT \p{Block=Syriac}) (105)
6014 X \p{Syriac_Sup} \p{Syriac_Supplement} (= \p{Block=
6015 Syriac_Supplement}) (16)
6016 X \p{Syriac_Supplement} \p{Block=Syriac_Supplement} (Short:
6017 \p{InSyriacSup}) (16)
6018 \p{Tagalog} \p{Script_Extensions=Tagalog} (Short:
6019 \p{Tglg}; NOT \p{Block=Tagalog}) (22)
6020 \p{Tagb} \p{Tagbanwa} (= \p{Script_Extensions=
6021 Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
6022 \p{Tagbanwa} \p{Script_Extensions=Tagbanwa} (Short:
6023 \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
6024 X \p{Tags} \p{Block=Tags} (128)
6025 \p{Tai_Le} \p{Script_Extensions=Tai_Le} (Short:
6026 \p{Tale}; NOT \p{Block=Tai_Le}) (45)
6027 \p{Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
6028 \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
6029 \p{Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
6030 \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
6031 X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
6032 Tai_Xuan_Jing_Symbols}) (96)
6033 X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
6034 \p{InTaiXuanJing}) (96)
6035 \p{Takr} \p{Takri} (= \p{Script_Extensions=Takri})
6036 (NOT \p{Block=Takri}) (78)
6037 \p{Takri} \p{Script_Extensions=Takri} (Short:
6038 \p{Takr}; NOT \p{Block=Takri}) (78)
6039 \p{Tale} \p{Tai_Le} (= \p{Script_Extensions=
6040 Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
6041 \p{Talu} \p{New_Tai_Lue} (= \p{Script_Extensions=
6042 New_Tai_Lue}) (NOT \p{Block=
6043 New_Tai_Lue}) (83)
6044 \p{Tamil} \p{Script_Extensions=Tamil} (Short:
6045 \p{Taml}; NOT \p{Block=Tamil}) (81)
6046 \p{Taml} \p{Tamil} (= \p{Script_Extensions=Tamil})
6047 (NOT \p{Block=Tamil}) (81)
6048 \p{Tang} \p{Tangut} (= \p{Script_Extensions=
6049 Tangut}) (NOT \p{Block=Tangut}) (6881)
6050 \p{Tangut} \p{Script_Extensions=Tangut} (Short:
6051 \p{Tang}; NOT \p{Block=Tangut}) (6881)
6052 X \p{Tangut_Components} \p{Block=Tangut_Components} (768)
6053 \p{Tavt} \p{Tai_Viet} (= \p{Script_Extensions=
6054 Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
6055 \p{Telu} \p{Telugu} (= \p{Script_Extensions=
6056 Telugu}) (NOT \p{Block=Telugu}) (101)
6057 \p{Telugu} \p{Script_Extensions=Telugu} (Short:
6058 \p{Telu}; NOT \p{Block=Telugu}) (101)
6059 \p{Term} \p{Terminal_Punctuation} (=
6060 \p{Terminal_Punctuation=Y}) (252)
6061 \p{Term: *} \p{Terminal_Punctuation: *}
6062 \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
6063 \p{Term}) (252)
6064 \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
6065 (1_113_860 plus all above-Unicode code
6066 points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
6067 9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
6068 U+0100..037D, U+037F..0386,
6069 U+0388..0588, U+058A..05C2, U+05C4..060B
6070 ...)
6071 \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (252:
6072 [!,.:;?], U+037E, U+0387, U+0589,
6073 U+05C3, U+060C ...)
6074 \p{Tfng} \p{Tifinagh} (= \p{Script_Extensions=
6075 Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
6076 \p{Tglg} \p{Tagalog} (= \p{Script_Extensions=
6077 Tagalog}) (NOT \p{Block=Tagalog}) (22)
6078 \p{Thaa} \p{Thaana} (= \p{Script_Extensions=
6079 Thaana}) (NOT \p{Block=Thaana}) (66)
6080 \p{Thaana} \p{Script_Extensions=Thaana} (Short:
6081 \p{Thaa}; NOT \p{Block=Thaana}) (66)
6082 \p{Thai} \p{Script_Extensions=Thai} (NOT \p{Block=
6083 Thai}) (86)
6084 \p{Tibetan} \p{Script_Extensions=Tibetan} (Short:
6085 \p{Tibt}; NOT \p{Block=Tibetan}) (207)
6086 \p{Tibt} \p{Tibetan} (= \p{Script_Extensions=
6087 Tibetan}) (NOT \p{Block=Tibetan}) (207)
6088 \p{Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
6089 \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
6090 \p{Tirh} \p{Tirhuta} (= \p{Script_Extensions=
6091 Tirhuta}) (NOT \p{Block=Tirhuta}) (94)
6092 \p{Tirhuta} \p{Script_Extensions=Tirhuta} (Short:
6093 \p{Tirh}; NOT \p{Block=Tirhuta}) (94)
6094 \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
6095 \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
6096 Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
6097 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
6098 \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
6099 (Short: \p{Lt}; /i= General_Category=
6100 Cased_Letter) (31)
6101 X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
6102 Transport_And_Map_Symbols}) (128)
6103 X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
6104 (Short: \p{InTransportAndMap}) (128)
6105 X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
6106 (= \p{Block=
6107 Unified_Canadian_Aboriginal_Syllabics})
6108 (640)
6109 X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
6110 Extended} (= \p{Block=
6111 Unified_Canadian_Aboriginal_Syllabics_-
6112 Extended}) (80)
6113 \p{Ugar} \p{Ugaritic} (= \p{Script_Extensions=
6114 Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
6115 \p{Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
6116 \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
6117 \p{UIdeo} \p{Unified_Ideograph} (=
6118 \p{Unified_Ideograph=Y}) (87_882)
6119 \p{UIdeo: *} \p{Unified_Ideograph: *}
6120 \p{Unassigned} \p{General_Category=Unassigned} (Short:
6121 \p{Cn}) (837_841 plus all above-Unicode
6122 code points)
6123 \p{Unicode} \p{Any} (1_114_112)
6124 X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
6125 Unified_Canadian_Aboriginal_Syllabics}
6126 (Short: \p{InUCAS}) (640)
6127 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
6128 Unified_Canadian_Aboriginal_Syllabics_-
6129 Extended} (Short: \p{InUCASExt}) (80)
6130 \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
6131 (87_882)
6132 \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
6133 (1_026_230 plus all above-Unicode code
6134 points: U+0000..33FF, U+4DB6..4DFF,
6135 U+9FEB..FA0D, U+FA10, U+FA12,
6136 U+FA15..FA1E ...)
6137 \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (87_882:
6138 U+3400..4DB5, U+4E00..9FEA,
6139 U+FA0E..FA0F, U+FA11, U+FA13..FA14,
6140 U+FA1F ...)
6141 \p{Unknown} \p{Script_Extensions=Unknown} (Short:
6142 \p{Zzzz}) (977_357 plus all above-
6143 Unicode code points)
6144 \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6145 Cased=Yes) (1822)
6146 \p{Upper: *} \p{Uppercase: *}
6147 \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6148 Cased=Yes) (1822)
6149 \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
6150 No) (1_112_290 plus all above-Unicode
6151 code points: [\x00-
6152 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6153 >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
6154 \xbf\xd7\xdf-\xff], U+0101, U+0103,
6155 U+0105, U+0107, U+0109 ...)
6156 \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
6157 Yes) (1822: [A-Z\xc0-\xd6\xd8-\xde],
6158 U+0100, U+0102, U+0104, U+0106, U+0108
6159 ...)
6160 \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
6161 (Short: \p{Lu}; /i= General_Category=
6162 Cased_Letter) (1702)
6163 \p{Vai} \p{Script_Extensions=Vai} (NOT \p{Block=
6164 Vai}) (300)
6165 \p{Vaii} \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
6166 \p{Block=Vai}) (300)
6167 \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
6168 NOT \p{Variation_Selectors}) (259)
6169 \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
6170 plus all above-Unicode code points:
6171 U+0000..180A, U+180E..FDFF,
6172 U+FE10..E00FF, U+E01F0..infinity)
6173 \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259:
6174 U+180B..180D, U+FE00..FE0F,
6175 U+E0100..E01EF)
6176 X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
6177 \p{InVS}) (16)
6178 X \p{Variation_Selectors_Supplement} \p{Block=
6179 Variation_Selectors_Supplement} (Short:
6180 \p{InVSSup}) (240)
6181 X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
6182 Vedic_Extensions}) (48)
6183 X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
6184 \p{InVedicExt}) (48)
6185 X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
6186 \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
6187 (787_892 plus all above-Unicode code
6188 points)
6189 \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (787_892 plus
6190 all above-Unicode code points: [\x00-
6191 \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
6192 \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
6193 U+02EC..10FF, U+1200..1400,
6194 U+1680..18AF, U+1900..2015 ...)
6195 \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
6196 Transformed_Rotated} (47)
6197 \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
6198 (47: U+2329..232A, U+3008..3011,
6199 U+3014..301F, U+3030, U+30A0, U+30FC ...)
6200 \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
6201 (147: U+3001..3002, U+3041, U+3043,
6202 U+3045, U+3047, U+3049 ...)
6203 \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
6204 Transformed_Upright} (147)
6205 \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
6206 (326_026)
6207 \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (326_026:
6208 [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
6209 U+02EA..02EB, U+1100..11FF,
6210 U+1401..167F, U+18B0..18FF, U+2016 ...)
6211 \p{VertSpace} \v (7: [\n\cK\f\r\x85], U+2028..2029)
6212 \p{Vo: *} \p{Vertical_Orientation: *}
6213 \p{VS} \p{Variation_Selector} (=
6214 \p{Variation_Selector=Y}) (NOT
6215 \p{Variation_Selectors}) (259)
6216 \p{VS: *} \p{Variation_Selector: *}
6217 X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
6218 \p{Block=
6219 Variation_Selectors_Supplement}) (240)
6220 \p{Wara} \p{Warang_Citi} (= \p{Script_Extensions=
6221 Warang_Citi}) (NOT \p{Block=
6222 Warang_Citi}) (84)
6223 \p{Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
6224 \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
6225 \p{WB: *} \p{Word_Break: *}
6226 \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25)
6227 \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087
6228 plus all above-Unicode code points:
6229 [^\t\n\cK\f\r\x20\x85\xa0],
6230 U+0100..167F, U+1681..1FFF,
6231 U+200B..2027, U+202A..202E, U+2030..205E
6232 ...)
6233 \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25:
6234 [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6235 U+2000..200A, U+2028..2029, U+202F,
6236 U+205F ...)
6237 \p{Word} \p{XPosixWord} (128_238)
6238 \p{Word_Break: ALetter} (Short: \p{WB=LE}) (28_179: [A-Za-
6239 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6240 \xff], U+0100..02D7, U+02DE..02E4,
6241 U+02EC..02FF, U+0370..0374, U+0376..0377
6242 ...)
6243 \p{Word_Break: CR} (Short: \p{WB=CR}) (1: [\r])
6244 \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
6245 \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
6246 \p{Word_Break: E_Base} (Short: \p{WB=EB}) (98: U+261D, U+26F9,
6247 U+270A..270D, U+1F385, U+1F3C2..1F3C4,
6248 U+1F3C7 ...)
6249 \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (4: U+1F466..1F469)
6250 \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (5: U+1F3FB..1F3FF)
6251 \p{Word_Break: EB} \p{Word_Break=E_Base} (98)
6252 \p{Word_Break: EBG} \p{Word_Break=E_Base_GAZ} (4)
6253 \p{Word_Break: EM} \p{Word_Break=E_Modifier} (5)
6254 \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (11)
6255 \p{Word_Break: Extend} (Short: \p{WB=Extend}) (2276:
6256 U+0300..036F, U+0483..0489,
6257 U+0591..05BD, U+05BF, U+05C1..05C2,
6258 U+05C4..05C5 ...)
6259 \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
6260 U+203F..2040, U+2054, U+FE33..FE34,
6261 U+FE4D..FE4F ...)
6262 \p{Word_Break: FO} \p{Word_Break=Format} (52)
6263 \p{Word_Break: Format} (Short: \p{WB=FO}) (52: [\xad],
6264 U+0600..0605, U+061C, U+06DD, U+070F,
6265 U+08E2 ...)
6266 \p{Word_Break: GAZ} \p{Word_Break=Glue_After_Zwj} (22)
6267 \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (22: U+2640,
6268 U+2642, U+2695..2696, U+2708, U+2764,
6269 U+1F308 ...)
6270 \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (74:
6271 U+05D0..05EA, U+05F0..05F2, U+FB1D,
6272 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
6273 ...)
6274 \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (74)
6275 \p{Word_Break: KA} \p{Word_Break=Katakana} (310)
6276 \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310: U+3031..3035,
6277 U+309B..309C, U+30A0..30FA,
6278 U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
6279 ...)
6280 \p{Word_Break: LE} \p{Word_Break=ALetter} (28_179)
6281 \p{Word_Break: LF} (Short: \p{WB=LF}) (1: [\n])
6282 \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
6283 \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8: [:\xb7], U+0387,
6284 U+05F4, U+2027, U+FE13, U+FE55 ...)
6285 \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15: [,;], U+037E,
6286 U+0589, U+060C..060D, U+066C, U+07F8 ...)
6287 \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
6288 U+2018..2019, U+2024, U+FE52, U+FF07,
6289 U+FF0E)
6290 \p{Word_Break: ML} \p{Word_Break=MidLetter} (8)
6291 \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
6292 \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
6293 U+2028..2029)
6294 \p{Word_Break: NL} \p{Word_Break=Newline} (5)
6295 \p{Word_Break: NU} \p{Word_Break=Numeric} (581)
6296 \p{Word_Break: Numeric} (Short: \p{WB=NU}) (581: [0-9],
6297 U+0660..0669, U+066B, U+06F0..06F9,
6298 U+07C0..07C9, U+0966..096F ...)
6299 \p{Word_Break: Other} (Short: \p{WB=XX}) (1_082_434 plus all
6300 above-Unicode code points:
6301 [^\n\cK\f\r\"\',.0-9:;A-Z_a-
6302 z\x85\xaa\xad\xb5\xb7\xba\xc0-\xd6\xd8-
6303 \xf6\xf8-\xff], U+02D8..02DD,
6304 U+02E5..02EB, U+0375, U+0378..0379,
6305 U+0380..0385 ...)
6306 \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
6307 U+1F1E6..1F1FF)
6308 \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
6309 \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
6310 \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
6311 \p{Word_Break: XX} \p{Word_Break=Other} (1_082_434 plus all
6312 above-Unicode code points)
6313 \p{Word_Break: ZWJ} (Short: \p{WB=ZWJ}) (1: U+200D)
6314 \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
6315 \p{WSpace: *} \p{White_Space: *}
6316 \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
6317 \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
6318 (128_089)
6319 \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (986_023
6320 plus all above-Unicode code points:
6321 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
6322 >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
6323 \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
6324 U+02C2..02C5, U+02D2..02DF,
6325 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
6326 \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (128_089:
6327 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
6328 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6329 U+02C6..02D1, U+02E0..02E4, U+02EC,
6330 U+02EE ...)
6331 \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (125_311)
6332 \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (988_801
6333 plus all above-Unicode code points:
6334 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6335 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
6336 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
6337 U+02C2..02C5, U+02D2..02DF,
6338 U+02E5..02EB, U+02ED, U+02EF..036F ...)
6339 \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (125_311:
6340 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
6341 \xf6\xf8-\xff], U+0100..02C1,
6342 U+02C6..02D1, U+02E0..02E4, U+02EC,
6343 U+02EE ...)
6344 \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
6345 (128_089)
6346 \p{XIDC: *} \p{XID_Continue: *}
6347 \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (125_311)
6348 \p{XIDS: *} \p{XID_Start: *}
6349 \p{Xpeo} \p{Old_Persian} (= \p{Script_Extensions=
6350 Old_Persian}) (NOT \p{Block=
6351 Old_Persian}) (50)
6352 \p{XPerlSpace} \p{XPosixSpace} (25)
6353 \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short:
6354 \p{Alnum}) (127_219: [0-9A-Za-
6355 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6356 \xff], U+0100..02C1, U+02C6..02D1,
6357 U+02E0..02E4, U+02EC, U+02EE ...)
6358 \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha})
6359 (126_629)
6360 \p{XPosixBlank} \h, Horizontal white space (Short:
6361 \p{Blank}) (18: [\t\x20\xa0], U+1680,
6362 U+2000..200A, U+202F, U+205F, U+3000)
6363 \p{XPosixCntrl} \p{General_Category=Control} Control
6364 characters (Short: \p{Cc}) (65)
6365 \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9]
6366 + all other decimal digits (Short:
6367 \p{Nd}) (590)
6368 \p{XPosixGraph} Characters that are graphical (Short:
6369 \p{Graph}) (274_139:
6370 [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6371 Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
6372 U+0100..0377, U+037A..037F,
6373 U+0384..038A, U+038C, U+038E..03A1 ...)
6374 \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i=
6375 Cased=Yes) (2252)
6376 \p{XPosixPrint} Characters that are graphical plus space
6377 characters (but no controls) (Short:
6378 \p{Print}) (274_156: [\x20-\x7e\xa0-
6379 \xff], U+0100..0377, U+037A..037F,
6380 U+0384..038A, U+038C, U+038E..03A1 ...)
6381 \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (779:
6382 [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6383 >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
6384 \xb7\xbb\xbf], U+037E, U+0387,
6385 U+055A..055F, U+0589..058A, U+05BE ...)
6386 \p{XPosixSpace} \s including beyond ASCII and vertical tab
6387 (Short: \p{SpacePerl}) (25:
6388 [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6389 U+2000..200A, U+2028..2029, U+202F,
6390 U+205F ...)
6391 \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i=
6392 Cased=Yes) (1822)
6393 \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} +
6394 \pM + \p{Pc} + \p{Join_Control} (Short:
6395 \p{Word}) (128_238: [0-9A-Z_a-
6396 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6397 \xff], U+0100..02C1, U+02C6..02D1,
6398 U+02E0..02E4, U+02EC, U+02EE ...)
6399 \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
6400 \p{Xsux} \p{Cuneiform} (= \p{Script_Extensions=
6401 Cuneiform}) (NOT \p{Block=Cuneiform})
6402 (1234)
6403 \p{Yi} \p{Script_Extensions=Yi} (1246)
6404 X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
6405 X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
6406 \p{Yiii} \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
6407 X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
6408 Yijing_Hexagram_Symbols}) (64)
6409 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
6410 (Short: \p{InYijing}) (64)
6411 \p{Z} \pZ \p{Separator} (= \p{General_Category=
6412 Separator}) (19)
6413 \p{Zanabazar_Square} \p{Script_Extensions=Zanabazar_Square}
6414 (Short: \p{Zanb}; NOT \p{Block=
6415 Zanabazar_Square}) (72)
6416 \p{Zanb} \p{Zanabazar_Square} (=
6417 \p{Script_Extensions=Zanabazar_Square})
6418 (NOT \p{Block=Zanabazar_Square}) (72)
6419 \p{Zinh} \p{Inherited} (= \p{Script_Extensions=
6420 Inherited}) (500)
6421 \p{Zl} \p{Line_Separator} (= \p{General_Category=
6422 Line_Separator}) (1)
6423 \p{Zp} \p{Paragraph_Separator} (=
6424 \p{General_Category=
6425 Paragraph_Separator}) (1)
6426 \p{Zs} \p{Space_Separator} (=
6427 \p{General_Category=Space_Separator})
6428 (17)
6429 \p{Zyyy} \p{Common} (= \p{Script_Extensions=
6430 Common}) (6947)
6431 \p{Zzzz} \p{Unknown} (= \p{Script_Extensions=
6432 Unknown}) (977_357 plus all above-
6433 Unicode code points)
6434 TX\p{_CanonDCIJ} (For internal use by Perl, not necessarily
6435 stable) (= \p{Soft_Dotted=Y}) (46)
6436 TX\p{_Case_Ignorable} (For internal use by Perl, not necessarily
6437 stable) (= \p{Case_Ignorable=Y}) (2314)
6438 TX\p{_CombAbove} (For internal use by Perl, not necessarily
6439 stable) (= \p{Canonical_Combining_Class=
6440 Above}) (461)
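
The forms in the table above can be used directly inside a pattern. The following is a minimal sketch, assuming a Perl recent enough to know the XIDS, XIDC and XPosixSpace forms listed above; the helper names "is_xid_identifier" and "count_spaces" are hypothetical and exist only for illustration.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Perl): true if $name starts with an
# XID_Start character and continues with XID_Continue characters.
sub is_xid_identifier {
    my ($name) = @_;
    return $name =~ /\A\p{XIDS}\p{XIDC}*\z/ ? 1 : 0;
}

# Hypothetical helper: count the characters matched by \p{XPosixSpace}.
sub count_spaces {
    my ($text) = @_;
    my $count = () = $text =~ /\p{XPosixSpace}/g;
    return $count;
}

for my $candidate ("caf\x{E9}", "1abc", "_tmp") {
    # Show the candidate as code points to avoid output-encoding issues.
    my $shown = join " ", map { sprintf "U+%04X", ord } split //, $candidate;
    printf "%-30s %s\n", $shown,
        is_xid_identifier($candidate) ? "identifier" : "not an identifier";
}
printf "space characters: %d\n", count_spaces("a\tb\x{2003}c");
```

Note that "_" is in XID_Continue but not in XID_Start, so a name beginning with an underscore is rejected by this particular check.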
6441
6442 Legal "\p{}" and "\P{}" constructs that match no characters
6443 Unicode has some property-value pairs that currently don't match
6444 anything. This happens generally either because they are obsolete, or
6445 they exist for symmetry with other forms, but no language has yet been
6446 encoded that uses them. In this version of Unicode, the following
6447 match zero code points:
6448
6449 \p{Canonical_Combining_Class=Attached_Below_Left}
6450 \p{Canonical_Combining_Class=CCC133}
6451
Properties accessible through Unicode::UCD
6453 The value of any Unicode (not including Perl extensions) character
6454 property mentioned above for any single code point is available through
6455 "charprop()" in Unicode::UCD. "charprops_all()" in Unicode::UCD
6456 returns the values of all the Unicode properties for a given code
6457 point.
6458
6459 Besides these, all the Unicode character properties mentioned above
6460 (except for those marked as for internal use by Perl) are also
6461 accessible by "prop_invlist()" in Unicode::UCD.
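
As an illustration of the two access paths just described, here is a short sketch using "charprop()" and "prop_invlist()". It assumes a Perl new enough to provide "charprop()" in Unicode::UCD; the helper "invlist_to_ranges" is hypothetical and only converts the returned inversion list into start/end pairs.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charprop prop_invlist);

# Hypothetical helper: turn an inversion list into [start, end] pairs.
# An undefined end means the range runs to the highest code point.
sub invlist_to_ranges {
    my @invlist = @_;
    my @ranges;
    while (@invlist) {
        my $start = shift @invlist;
        my $end   = @invlist ? shift(@invlist) - 1 : undef;
        push @ranges, [ $start, $end ];
    }
    return @ranges;
}

# Value of one property for one code point.
my $category = charprop(ord("A"), "General_Category");
print "General_Category of U+0041: $category\n";

# All code points matching a binary property, as an inversion list.
my @join_control = invlist_to_ranges(prop_invlist("Join_Control"));
for my $range (@join_control) {
    my ($start, $end) = @$range;
    printf "U+%04X..%s\n", $start,
        defined $end ? sprintf("%04X", $end) : "infinity";
}
```

An inversion list holds the code points at which membership flips, which is why each pair of elements is read as "first code point in the range" and "first code point after it".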
6462
Due to their nature, not all Unicode character properties are suitable
for regular expression matches or for "prop_invlist()". The remaining
non-provisional, non-internal ones are accessible via "prop_invmap()"
in Unicode::UCD (except for those that this Perl installation hasn't
included; see below for which those are).
6468
6469 For compatibility with other parts of Perl, all the single forms given
6470 in the table in the section above are recognized. BUT, there are some
6471 ambiguities between some Perl extensions and the Unicode properties,
6472 all of which are silently resolved in favor of the official Unicode
6473 property. To avoid surprises, you should only use "prop_invmap()" for
6474 forms listed in the table below, which omits the non-recommended ones.
6475 The affected forms are the Perl single form equivalents of Unicode
6476 properties, such as "\p{sc}" being a single-form equivalent of
6477 "\p{gc=sc}", which is treated by "prop_invmap()" as the "Script"
6478 property, whose short name is "sc". The table indicates the current
6479 ambiguities in the INFO column, beginning with the word "NOT".
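
The resolution of "sc" described above can be observed with a small sketch. It assumes that "prop_invmap()" returns its inversion list and map as the first two elements of its list-context result; the helper "invmap_lookup" is hypothetical and performs a plain binary search over those two parallel arrays.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invmap);

# Hypothetical helper: return the map value that applies to $code_point,
# given the parallel arrays returned by prop_invmap().
sub invmap_lookup {
    my ($list_ref, $map_ref, $code_point) = @_;
    my ($low, $high) = (0, $#{$list_ref});
    # Find the last index whose range start is <= $code_point.
    while ($low < $high) {
        my $mid = int(($low + $high + 1) / 2);
        if ($list_ref->[$mid] <= $code_point) { $low  = $mid }
        else                                  { $high = $mid - 1 }
    }
    return $map_ref->[$low];
}

my ($sc_list,     $sc_map)     = prop_invmap("sc");
my ($script_list, $script_map) = prop_invmap("Script");

my $via_short = invmap_lookup($sc_list,     $sc_map,     ord("A"));
my $via_full  = invmap_lookup($script_list, $script_map, ord("A"));
print "sc:     $via_short\n";
print "Script: $via_full\n";
```

If both lines print the same script name, "sc" was taken as the Script property rather than as General_Category=Currency_Symbol.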
6480
The standard Unicode properties listed below are documented in
<http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
documented in "prop_invmap()" in Unicode::UCD. The other Perl
extensions are described in "Other Properties" in perlunicode.
6485
6486 The first column in the table is a name for the property; the second
6487 column is an alternative name, if any, plus possibly some annotations.
6488 The alternative name is the property's full name, unless that would
6489 simply repeat the first column, in which case the second column
6490 indicates the property's short name (if different). The annotations
6491 are given only in the entry for the full name. The annotations for
6492 binary properties include a list of the first few ranges that the
6493 property matches. To avoid any ambiguity, the SPACE character is
6494 represented as "\x20".
6495
If a property is obsolete, etc., the entry will be flagged with the same
characters used in the table in the section above, like D or S.
6498
6499 NAME INFO
6500
6501 Age
6502 AHex ASCII_Hex_Digit
6503 All (Perl extension). All code points,
6504 including those above Unicode. Same as
6505 qr/./s. U+0000..infinity
6506 Alnum XPosixAlnum. (Perl extension)
6507 Alpha Alphabetic
6508 Alphabetic (Short: Alpha). [A-Za-z\xaa\xb5\xba\xc0-
6509 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6510 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6511 ...
6512 Any (Perl extension). All Unicode code
6513 points. U+0000..10FFFF
6514 ASCII Block=Basic_Latin. (Perl extension).
6515 [\x00-\x7f]
6516 ASCII_Hex_Digit (Short: AHex). [0-9A-Fa-f]
6517 Assigned (Perl extension). All assigned code
6518 points. U+0000..0377, U+037A..037F,
6519 U+0384..038A, U+038C, U+038E..03A1,
6520 U+03A3..052F ...
6521 Bc Bidi_Class
6522 Bidi_C Bidi_Control
6523 Bidi_Class (Short: bc)
6524 Bidi_Control (Short: Bidi_C). U+061C, U+200E..200F,
6525 U+202A..202E, U+2066..2069
6526 Bidi_M Bidi_Mirrored
6527 Bidi_Mirrored (Short: Bidi_M).
6528 [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
6529 U+169B..169C, U+2039..203A, U+2045..2046,
6530 U+207D..207E ...
6531 Bidi_Mirroring_Glyph (Short: bmg)
6532 Bidi_Paired_Bracket (Short: bpb)
6533 Bidi_Paired_Bracket_Type (Short: bpt)
6534 Blank XPosixBlank. (Perl extension)
6535 Blk Block
6536 Block (Short: blk)
6537 Bmg Bidi_Mirroring_Glyph
6538 Bpb Bidi_Paired_Bracket
6539 Bpt Bidi_Paired_Bracket_Type
6540 Canonical_Combining_Class (Short: ccc)
6541 Case_Folding (Short: cf)
6542 Case_Ignorable (Short: CI).
6543 [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
6544 U+02B0..036F, U+0374..0375, U+037A,
6545 U+0384..0385, U+0387 ...
6546 Cased [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6547 \xff], U+0100..01BA, U+01BC..01BF,
6548 U+01C4..0293, U+0295..02B8, U+02C0..02C1
6549 ...
6550 Category General_Category
6551 Ccc Canonical_Combining_Class
6552 CE Composition_Exclusion
6553 Cf Case_Folding; NOT 'cf' meaning
6554 'General_Category=Format'
6555 Changes_When_Casefolded (Short: CWCF). [A-Z\xb5\xc0-\xd6\xd8-
6556 \xdf], U+0100, U+0102, U+0104, U+0106,
6557 U+0108 ...
6558 Changes_When_Casemapped (Short: CWCM). [A-Za-z\xb5\xc0-\xd6\xd8-
6559 \xf6\xf8-\xff], U+0100..0137,
6560 U+0139..018C, U+018E..019A, U+019C..01A9,
6561 U+01AC..01B9 ...
6562 Changes_When_Lowercased (Short: CWL). [A-Z\xc0-\xd6\xd8-\xde],
6563 U+0100, U+0102, U+0104, U+0106, U+0108 ...
6564 Changes_When_NFKC_Casefolded (Short: CWKCF). [A-
6565 Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
6566 \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf], U+0100,
6567 U+0102, U+0104, U+0106, U+0108 ...
6568 Changes_When_Titlecased (Short: CWT). [a-z\xb5\xdf-\xf6\xf8-
6569 \xff], U+0101, U+0103, U+0105, U+0107,
6570 U+0109 ...
6571 Changes_When_Uppercased (Short: CWU). [a-z\xb5\xdf-\xf6\xf8-
6572 \xff], U+0101, U+0103, U+0105, U+0107,
6573 U+0109 ...
6574 CI Case_Ignorable
6575 Cntrl XPosixCntrl (=General_Category=Control).
6576 (Perl extension)
6577 Comp_Ex Full_Composition_Exclusion
6578 Composition_Exclusion (Short: CE). U+0958..095F, U+09DC..09DD,
6579 U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
6580 CWCF Changes_When_Casefolded
6581 CWCM Changes_When_Casemapped
6582 CWKCF Changes_When_NFKC_Casefolded
6583 CWL Changes_When_Lowercased
6584 CWT Changes_When_Titlecased
6585 CWU Changes_When_Uppercased
6586 Dash [\-], U+058A, U+05BE, U+1400, U+1806,
6587 U+2010..2015 ...
6588 Decomposition_Mapping (Short: dm)
6589 Decomposition_Type (Short: dt)
6590 Default_Ignorable_Code_Point (Short: DI). [\xad], U+034F, U+061C,
6591 U+115F..1160, U+17B4..17B5, U+180B..180E
6592 ...
6593 Dep Deprecated
6594 Deprecated (Short: Dep). U+0149, U+0673, U+0F77,
6595 U+0F79, U+17A3..17A4, U+206A..206F ...
6596 DI Default_Ignorable_Code_Point
6597 Dia Diacritic
6598 Diacritic (Short: Dia). [\^`\xa8\xaf\xb4\xb7-\xb8],
6599 U+02B0..034E, U+0350..0357, U+035D..0362,
6600 U+0374..0375, U+037A ...
6601 Digit XPosixDigit (=General_Category=
6602 Decimal_Number). (Perl extension)
6603 Dm Decomposition_Mapping
6604 Dt Decomposition_Type
6605 Ea East_Asian_Width
6606 East_Asian_Width (Short: ea)
6607 Ext Extender
6608 Extender (Short: Ext). [\xb7], U+02D0..02D1,
6609 U+0640, U+07FA, U+0E46, U+0EC6 ...
6610 Full_Composition_Exclusion (Short: Comp_Ex). U+0340..0341,
6611 U+0343..0344, U+0374, U+037E, U+0387,
6612 U+0958..095F ...
6613 Gc General_Category
6614 GCB Grapheme_Cluster_Break
6615 General_Category (Short: gc)
6616 Gr_Base Grapheme_Base
6617 Gr_Ext Grapheme_Extend
6618 Graph XPosixGraph. (Perl extension)
6619 Grapheme_Base (Short: Gr_Base). [\x20-\x7e\xa0-
6620 \xac\xae-\xff], U+0100..02FF,
6621 U+0370..0377, U+037A..037F, U+0384..038A,
6622 U+038C ...
6623 Grapheme_Cluster_Break (Short: GCB)
6624 Grapheme_Extend (Short: Gr_Ext). U+0300..036F,
6625 U+0483..0489, U+0591..05BD, U+05BF,
6626 U+05C1..05C2, U+05C4..05C5 ...
6627 Hangul_Syllable_Type (Short: hst)
6628 Hex Hex_Digit
6629 Hex_Digit (Short: Hex). [0-9A-Fa-f], U+FF10..FF19,
6630 U+FF21..FF26, U+FF41..FF46
6631 HorizSpace XPosixBlank. (Perl extension)
6632 Hst Hangul_Syllable_Type
6633 D Hyphen [\-\xad], U+058A, U+1806, U+2010..2011,
6634 U+2E17, U+30FB ... Supplanted by
6635 Line_Break property values; see
6636 www.unicode.org/reports/tr14
6637 ID_Continue (Short: IDC). [0-9A-Z_a-
6638 z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
6639 \xff], U+0100..02C1, U+02C6..02D1,
6640 U+02E0..02E4, U+02EC, U+02EE ...
6641 ID_Start (Short: IDS). [A-Za-z\xaa\xb5\xba\xc0-
6642 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6643 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6644 ...
6645 IDC ID_Continue
6646 Ideo Ideographic
6647 Ideographic (Short: Ideo). U+3006..3007,
6648 U+3021..3029, U+3038..303A, U+3400..4DB5,
6649 U+4E00..9FEA, U+F900..FA6D ...
6650 IDS ID_Start
6651 IDS_Binary_Operator (Short: IDSB). U+2FF0..2FF1, U+2FF4..2FFB
6652 IDS_Trinary_Operator (Short: IDST). U+2FF2..2FF3
6653 IDSB IDS_Binary_Operator
6654 IDST IDS_Trinary_Operator
6655 In Present_In. (Perl extension)
6656 Indic_Positional_Category (Short: InPC)
6657 Indic_Syllabic_Category (Short: InSC)
6658 InPC Indic_Positional_Category
6659 InSC Indic_Syllabic_Category
6660 Isc ISO_Comment; NOT 'isc' meaning
6661 'General_Category=Other'
6662 ISO_Comment (Short: isc)
6663 Jg Joining_Group
6664 Join_C Join_Control
6665 Join_Control (Short: Join_C). U+200C..200D
6666 Joining_Group (Short: jg)
6667 Joining_Type (Short: jt)
6668 Jt Joining_Type
6669 Lb Line_Break
6670 Lc Lowercase_Mapping; NOT 'lc' meaning
6671 'General_Category=Cased_Letter'
6672 Line_Break (Short: lb)
6673 LOE Logical_Order_Exception
6674 Logical_Order_Exception (Short: LOE). U+0E40..0E44, U+0EC0..0EC4,
6675 U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
6676 ...
6677 Lower Lowercase
6678 Lowercase (Short: Lower). [a-z\xaa\xb5\xba\xdf-
6679 \xf6\xf8-\xff], U+0101, U+0103, U+0105,
6680 U+0107, U+0109 ...
6681 Lowercase_Mapping (Short: lc)
6682 Math [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
6683 U+03D5, U+03F0..03F1, U+03F4..03F6,
6684 U+0606..0608 ...
6685 Na Name
6686 Na1 Unicode_1_Name
6687 Name (Short: na)
6688 Name_Alias
6689 NChar Noncharacter_Code_Point
6690 NFC_QC NFC_Quick_Check
6691 NFC_Quick_Check (Short: NFC_QC)
6692 NFD_QC NFD_Quick_Check
6693 NFD_Quick_Check (Short: NFD_QC)
6694 NFKC_Casefold (Short: NFKC_CF)
6695 NFKC_CF NFKC_Casefold
6696 NFKC_QC NFKC_Quick_Check
6697 NFKC_Quick_Check (Short: NFKC_QC)
6698 NFKD_QC NFKD_Quick_Check
6699 NFKD_Quick_Check (Short: NFKD_QC)
6700 Noncharacter_Code_Point (Short: NChar). U+FDD0..FDEF,
6701 U+FFFE..FFFF, U+1FFFE..1FFFF,
6702 U+2FFFE..2FFFF, U+3FFFE..3FFFF,
6703 U+4FFFE..4FFFF ...
6704 Nt Numeric_Type
6705 Numeric_Type (Short: nt)
6706 Numeric_Value (Short: nv)
6707 Nv Numeric_Value
6708 Pat_Syn Pattern_Syntax
6709 Pat_WS Pattern_White_Space
6710 Pattern_Syntax (Short: Pat_Syn).
6711 [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6712 >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
6713 \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
6714 U+2010..2027, U+2030..203E, U+2041..2053,
6715 U+2055..205E, U+2190..245F ...
6716 Pattern_White_Space (Short: Pat_WS). [\t\n\cK\f\r\x20\x85],
6717 U+200E..200F, U+2028..2029
6718 PCM Prepended_Concatenation_Mark
6719 Perl_Decimal_Digit (Perl extension)
6720 PerlSpace PosixSpace. (Perl extension)
6721 PerlWord PosixWord. (Perl extension)
6722 PosixAlnum (Perl extension). [0-9A-Za-z]
6723 PosixAlpha (Perl extension). [A-Za-z]
6724 PosixBlank (Perl extension). [\t\x20]
6725 PosixCntrl (Perl extension). ASCII control
6726 characters. ACK, BEL, BS, CAN, CR, DC1,
6727 DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
6728 ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
6729 NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
6730 PosixDigit (Perl extension). [0-9]
6731 PosixGraph (Perl extension).
6732 [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6733 Z\[\\\]\^_`a-z\{\|\}~]
6734 PosixLower (Perl extension). [a-z]
6735 PosixPrint (Perl extension). [\x20-\x7e]
6736 PosixPunct (Perl extension).
6737 [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6738 >?\@\[\\\]\^_`\{\|\}~]
6739 PosixSpace (Perl extension). [\t\n\cK\f\r\x20]
6740 PosixUpper (Perl extension). [A-Z]
6741 PosixWord (Perl extension). \w, restricted to
6742 ASCII. [0-9A-Z_a-z]
6743 PosixXDigit ASCII_Hex_Digit. (Perl extension).
6744 [0-9A-Fa-f]
6745 Prepended_Concatenation_Mark (Short: PCM). U+0600..0605, U+06DD,
6746 U+070F, U+08E2, U+110BD
6747 Present_In (Short: In). (Perl extension)
6748 Print XPosixPrint. (Perl extension)
6749 Punct General_Category=Punctuation. (Perl
6750 extension).
[!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_
\{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
6753 U+037E, U+0387, U+055A..055F,
6754 U+0589..058A, U+05BE ...
6755 QMark Quotation_Mark
6756 Quotation_Mark (Short: QMark). [\"\'\xab\xbb],
6757 U+2018..201F, U+2039..203A, U+2E42,
6758 U+300C..300F, U+301D..301F ...
6759 Radical U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
6760 Regional_Indicator (Short: RI). U+1F1E6..1F1FF
6761 RI Regional_Indicator
6762 SB Sentence_Break
6763 Sc Script; NOT 'sc' meaning
6764 'General_Category=Currency_Symbol'
6765 Scf Simple_Case_Folding
6766 Script (Short: sc)
6767 Script_Extensions (Short: scx)
6768 Scx Script_Extensions
6769 SD Soft_Dotted
6770 Sentence_Break (Short: SB)
6771 Sentence_Terminal (Short: STerm). [!.?], U+0589, U+061F,
6772 U+06D4, U+0700..0702, U+07F9 ...
6773 Sfc Simple_Case_Folding
6774 Simple_Case_Folding (Short: scf)
6775 Simple_Lowercase_Mapping (Short: slc)
6776 Simple_Titlecase_Mapping (Short: stc)
6777 Simple_Uppercase_Mapping (Short: suc)
6778 Slc Simple_Lowercase_Mapping
6779 Soft_Dotted (Short: SD). [i-j], U+012F, U+0249,
6780 U+0268, U+029D, U+02B2 ...
6781 Space White_Space
6782 SpacePerl XPosixSpace. (Perl extension)
6783 Stc Simple_Titlecase_Mapping
6784 STerm Sentence_Terminal
6785 Suc Simple_Uppercase_Mapping
6786 Tc Titlecase_Mapping
6787 Term Terminal_Punctuation
6788 Terminal_Punctuation (Short: Term). [!,.:;?], U+037E, U+0387,
6789 U+0589, U+05C3, U+060C ...
6790 Title Titlecase. (Perl extension)
6791 Titlecase (Short: Title). (Perl extension). (=
6792 \p{Gc=Lt}). U+01C5, U+01C8, U+01CB,
6793 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
6794 Titlecase_Mapping (Short: tc)
6795 Uc Uppercase_Mapping
6796 UIdeo Unified_Ideograph
6797 Unicode Any. (Perl extension)
6798 Unicode_1_Name (Short: na1)
6799 Unified_Ideograph (Short: UIdeo). U+3400..4DB5,
6800 U+4E00..9FEA, U+FA0E..FA0F, U+FA11,
6801 U+FA13..FA14, U+FA1F ...
6802 Upper Uppercase
6803 Uppercase (Short: Upper). [A-Z\xc0-\xd6\xd8-\xde],
6804 U+0100, U+0102, U+0104, U+0106, U+0108 ...
6805 Uppercase_Mapping (Short: uc)
6806 Variation_Selector (Short: VS). U+180B..180D, U+FE00..FE0F,
6807 U+E0100..E01EF
6808 Vertical_Orientation (Short: vo)
6809 VertSpace (Perl extension). \v. [\n\cK\f\r\x85],
6810 U+2028..2029
6811 Vo Vertical_Orientation
6812 VS Variation_Selector
6813 WB Word_Break
6814 White_Space (Short: WSpace).
6815 [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6816 U+2000..200A, U+2028..2029, U+202F, U+205F
6817 ...
6818 Word XPosixWord. (Perl extension)
6819 Word_Break (Short: WB)
6820 WSpace White_Space
6821 XDigit XPosixXDigit (=Hex_Digit). (Perl
6822 extension)
6823 XID_Continue (Short: XIDC). [0-9A-Z_a-
6824 z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
6825 \xff], U+0100..02C1, U+02C6..02D1,
6826 U+02E0..02E4, U+02EC, U+02EE ...
6827 XID_Start (Short: XIDS). [A-Za-z\xaa\xb5\xba\xc0-
6828 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6829 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6830 ...
6831 XIDC XID_Continue
6832 XIDS XID_Start
6833 XPerlSpace XPosixSpace. (Perl extension)
6834 XPosixAlnum (Short: Alnum). (Perl extension).
6835 Alphabetic and (decimal) Numeric. [0-9A-
6836 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6837 \xff], U+0100..02C1, U+02C6..02D1,
6838 U+02E0..02E4, U+02EC, U+02EE ...
6839 XPosixAlpha Alphabetic. (Perl extension). [A-Za-
6840 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6841 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
6842 U+02EC, U+02EE ...
6843 XPosixBlank (Short: Blank). (Perl extension). \h,
6844 Horizontal white space. [\t\x20\xa0],
6845 U+1680, U+2000..200A, U+202F, U+205F,
6846 U+3000
6847 XPosixCntrl General_Category=Control (Short: Cntrl).
6848 (Perl extension). Control characters.
6849 [\x00-\x1f\x7f-\x9f]
6850 XPosixDigit General_Category=Decimal_Number (Short:
6851 Digit). (Perl extension). [0-9] + all
6852 other decimal digits. [0-9],
6853 U+0660..0669, U+06F0..06F9, U+07C0..07C9,
6854 U+0966..096F, U+09E6..09EF ...
6855 XPosixGraph (Short: Graph). (Perl extension).
6856 Characters that are graphical.
6857 [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6858 Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
6859 U+0100..0377, U+037A..037F, U+0384..038A,
6860 U+038C, U+038E..03A1 ...
6861 XPosixLower Lowercase. (Perl extension). [a-
6862 z\xaa\xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
6863 U+0103, U+0105, U+0107, U+0109 ...
6864 XPosixPrint (Short: Print). (Perl extension).
6865 Characters that are graphical plus space
6866 characters (but no controls). [\x20-
6867 \x7e\xa0-\xff], U+0100..0377,
6868 U+037A..037F, U+0384..038A, U+038C,
6869 U+038E..03A1 ...
6870 XPosixPunct (Perl extension). \p{Punct} + ASCII-range
6871 \p{Symbol}. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6872 >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
6873 \xb7\xbb\xbf], U+037E, U+0387,
6874 U+055A..055F, U+0589..058A, U+05BE ...
6875 XPosixSpace (Perl extension). \s including beyond
6876 ASCII and vertical tab.
6877 [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6878 U+2000..200A, U+2028..2029, U+202F, U+205F
6879 ...
6880 XPosixUpper Uppercase. (Perl extension). [A-Z\xc0-
6881 \xd6\xd8-\xde], U+0100, U+0102, U+0104,
6882 U+0106, U+0108 ...
6883 XPosixWord (Short: Word). (Perl extension). \w,
6884 including beyond ASCII; = \p{Alnum} + \pM
6885 + \p{Pc} + \p{Join_Control}. [0-9A-Z_a-
6886 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6887 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
6888 U+02EC, U+02EE ...
6889 XPosixXDigit Hex_Digit (Short: XDigit). (Perl
6890 extension). [0-9A-Fa-f], U+FF10..FF19,
6891 U+FF21..FF26, U+FF41..FF46
6892
Properties accessible through other means
6894 Certain properties are accessible also via core function calls. These
6895 are:
6896
6897 Lowercase_Mapping lc() and lcfirst()
6898 Titlecase_Mapping ucfirst()
6899 Uppercase_Mapping uc()
6900
Also, Case_Folding is accessible through the "/i" modifier in regular
expressions, the "\F" case-folding escape in double-quotish strings,
and the "fc" operator.
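
A brief sketch of the core functions named above follows. It assumes Perl 5.16 or later, so that "use v5.16" enables both "fc" and Unicode string semantics; the helper "code_points" is hypothetical and only formats the results.

```perl
use v5.16;      # turns on strict, the "fc" feature and "unicode_strings"
use warnings;

# Hypothetical helper: show a string as a list of code points.
sub code_points {
    return join " ", map { sprintf "U+%04X", ord } split //, $_[0];
}

my $small_dz = "\x{01C6}";              # LATIN SMALL LETTER DZ WITH CARON
my $upper    = uc $small_dz;            # Uppercase_Mapping
my $title    = ucfirst $small_dz;       # Titlecase_Mapping
my $lower    = lc $upper;               # Lowercase_Mapping
my $folded   = fc "Stra\x{DF}e";        # Case_Folding

# Under /i, the two case variants of the digraph match each other.
my $matches_i = "\x{01C4}" =~ /\A\x{01C6}\z/i ? 1 : 0;

say "uc:      ", code_points($upper);
say "ucfirst: ", code_points($title);
say "lc:      ", code_points($lower);
say "fc:      ", $folded;
say "/i:      ", $matches_i;
```

The digraph is used because its uppercase and titlecase forms differ, which makes the distinction between "uc()" and "ucfirst()" visible.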
6903
And, the Name and Name_Alias properties are accessible through the
"\N{}" interpolation in double-quoted strings and regular expressions,
and through the functions "charnames::viacode()",
"charnames::vianame()", and "charnames::string_vianame()" (which
require a "use charnames ();" to be specified).
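
The name lookups can be sketched as follows. This example loads charnames with ":full" rather than with an empty import list, which is an assumption made here so that both the functions and "\N{}" are available on any Perl version that has these functions.

```perl
use strict;
use warnings;
use charnames ':full';

# Code point to name.
my $name = charnames::viacode(0x41);

# Name to code point, and name to string.
my $code_point = charnames::vianame("GREEK SMALL LETTER ALPHA");
my $string     = charnames::string_vianame("GREEK SMALL LETTER ALPHA");

# The same character through \N{} interpolation.
my $interpolated = "\N{GREEK SMALL LETTER ALPHA}";

printf "U+0041 is named %s\n", $name;
printf "GREEK SMALL LETTER ALPHA is U+%04X\n", $code_point;
printf "string_vianame and \\N{} agree: %s\n",
    $string eq $interpolated ? "yes" : "no";
```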
6909
6910 Finally, most properties related to decomposition are accessible via
6911 Unicode::Normalize.
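
For instance, a canonical decomposition can be obtained with "NFD()" and reversed with "NFC()". This is a minimal sketch; the helper "code_points" is hypothetical and only formats the output.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD NFC);

# Hypothetical helper: show a string as a list of code points.
sub code_points {
    return join " ", map { sprintf "U+%04X", ord } split //, $_[0];
}

# LATIN CAPITAL LETTER U WITH DIAERESIS AND MACRON
my $composed   = "\x{01D5}";
my $decomposed = NFD($composed);    # base letter plus combining marks
my $recomposed = NFC($decomposed);  # back to the precomposed form

print "NFD: ", code_points($decomposed), "\n";
print "NFC: ", code_points($recomposed), "\n";
```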
6912
Unicode character properties that are NOT accepted by Perl
6914 Perl will generate an error for a few character properties in Unicode
6915 when used in a regular expression. The non-Unihan ones are listed
6916 below, with the reasons they are not accepted, perhaps with work-
6917 arounds. The short names for the properties are listed enclosed in
6918 (parentheses). As described after the list, an installation can change
6919 the defaults and choose to accept any of these. The list is machine
6920 generated based on the choices made for the installation that generated
6921 this document.
6922
6923 Expands_On_NFC (XO_NFC)
6924 Expands_On_NFD (XO_NFD)
6925 Expands_On_NFKC (XO_NFKC)
6926 Expands_On_NFKD (XO_NFKD)
6927 Deprecated by Unicode. These are characters that expand to more
6928 than one character in the specified normalization form, but whether
6929 they actually take up more bytes or not depends on the encoding
6930 being used. For example, a UTF-8 encoded character may expand to a
6931 different number of bytes than a UTF-32 encoded character.
6932
6933 Grapheme_Link (Gr_Link)
6934 Deprecated by Unicode: Duplicates ccc=vr
6935 (Canonical_Combining_Class=Virama)
6936
6937 Jamo_Short_Name (JSN)
6938 Other_Alphabetic (OAlpha)
6939 Other_Default_Ignorable_Code_Point (ODI)
6940 Other_Grapheme_Extend (OGr_Ext)
6941 Other_ID_Continue (OIDC)
6942 Other_ID_Start (OIDS)
6943 Other_Lowercase (OLower)
6944 Other_Math (OMath)
6945 Other_Uppercase (OUpper)
6946 Used by Unicode internally for generating other properties and not
6947 intended to be used stand-alone
6948
6949 Script=Katakana_Or_Hiragana (sc=Hrkt)
6950 Obsolete. All code points previously matched by this have been
6951 moved to "Script=Common". Consider instead using
6952 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
6953 both)
6954
6955 Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
6956 All code points that would be matched by this are matched by either
6957 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
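
The Katakana/Hiragana alternative suggested in the entries above can be written as a single character class. The sketch below is one way to do it; the variable name "$kana_re" and the helper "is_kana" are hypothetical.

```perl
use strict;
use warnings;

# Matches a character whose Script_Extensions include Katakana or
# Hiragana (or both), in place of the unsupported Katakana_Or_Hiragana.
my $kana_re = qr/[\p{Script_Extensions=Katakana}\p{Script_Extensions=Hiragana}]/;

# Hypothetical helper: true if the single character given is kana.
sub is_kana {
    my ($char) = @_;
    return $char =~ /\A$kana_re\z/ ? 1 : 0;
}

for my $code_point (0x3042, 0x30A2, 0x30FC, 0x0041) {
    printf "U+%04X %s\n", $code_point,
        is_kana(chr $code_point) ? "kana" : "not kana";
}
```

U+30FC, the prolonged sound mark, is included in the loop because it is shared by both scripts and so is covered by either half of the class.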
6958
6959 An installation can choose to allow any of these to be matched by
6960 downloading the Unicode database from <http://www.unicode.org/Public/>
6961 to $Config{privlib}/unicore/ in the Perl source tree, changing the
6962 controlling lists contained in the program
6963 $Config{privlib}/unicore/mktables and then re-compiling and installing.
6964 (%Config is available from the Config module).
6965
6966 Also, perl can be recompiled to operate on an earlier version of the
6967 Unicode standard. Further information is at
6968 $Config{privlib}/unicore/README.perl.
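
To see which directory and which Unicode version a given installation uses, something like the following sketch may help. It assumes the installation ships its unicore directory in the usual place under $Config{privlib}.

```perl
use strict;
use warnings;
use Config;
use Unicode::UCD ();

# Directory holding this installation's Unicode tables.
my $unicore_dir = "$Config{privlib}/unicore";

# Version of the Unicode database this perl was built with.
my $unicode_version = Unicode::UCD::UnicodeVersion();

print "unicore directory: $unicore_dir",
    (-d $unicore_dir ? "" : " (not found)"), "\n";
print "Unicode version:   $unicode_version\n";
```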
6969
Other information in the Unicode data base
6971 The Unicode data base is delivered in two different formats. The XML
6972 version is valid for more modern Unicode releases. The other version
6973 is a collection of files. The two are intended to give equivalent
6974 information. Perl uses the older form; this allows you to recompile
6975 Perl to use early Unicode releases.
6976
The only non-character property that Perl currently supports is Named
Sequences, in which a sequence of code points is given a name and
generally treated as a single entity. Perl supports these via the
"\N{...}" double-quotish construct, "charnames::string_vianame(name)"
in charnames, and "namedseq()" in Unicode::UCD.
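
The three routes to a named sequence can be compared in a short sketch. It uses the sequence name "KATAKANA LETTER AINU P" as an example and loads charnames with ":full", which is an assumption made here for portability across Perl versions.

```perl
use strict;
use warnings;
use charnames ':full';
use Unicode::UCD qw(namedseq);

my $sequence_name = "KATAKANA LETTER AINU P";

# In list context namedseq() returns the code points of the sequence.
my @ordinals = namedseq($sequence_name);

# The same sequence as a string, by function and by interpolation.
my $via_function      = charnames::string_vianame($sequence_name);
my $via_interpolation = "\N{KATAKANA LETTER AINU P}";

print "code points: ", join(" ", map { sprintf "U+%04X", $_ } @ordinals), "\n";
print "length of string: ", length($via_function), "\n";
print "both strings agree: ",
    ($via_function eq $via_interpolation ? "yes" : "no"), "\n";
```

The resulting string holds more than one character, which is the practical difference between a named sequence and an ordinary character name.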
6982
6983 Below is a list of the files in the Unicode data base that Perl doesn't
6984 currently use, along with very brief descriptions of their purposes.
6985 Some of the names of the files have been shortened from those that
6986 Unicode uses, in order to allow them to be distinguishable from
6987 similarly named files on file systems for which only the first 8
6988 characters of a name are significant.
6989
6990 auxiliary/GraphemeBreakTest.html
6991 auxiliary/LineBreakTest.html
6992 auxiliary/SentenceBreakTest.html
6993 auxiliary/WordBreakTest.html
6994 Documentation of validation Tests
6995
6996 BidiCharacterTest.txt
6997 BidiTest.txt
6998 NormTest.txt
6999 Validation Tests
7000
7001 CJKRadicals.txt
7002 Maps the kRSUnicode property values to corresponding code points
7003
7004 EmojiSources.txt
7005 Maps certain Unicode code points to their legacy Japanese cell-
7006 phone values
7007
7008 extracted/DName.txt
7009 This file adds no new information not already present in other
7010 files
7011
7012 Index.txt
7013 Alphabetical index of Unicode characters
7014
7015 NamedSqProv.txt
7016 Named sequences proposed for inclusion in a later version of the
7017 Unicode Standard; if you need them now, you can append this file to
7018 NamedSequences.txt and recompile perl
7019
7020 NamesList.html
7021 Describes the format and contents of NamesList.txt
7022
7023 NamesList.txt
7024 Annotated list of characters
7025
7026 NormalizationCorrections.txt
7027 Documentation of corrections already incorporated into the Unicode
7028 data base
7029
7030 NushuSources.txt
7031 Specifies source material for Nushu characters
7032
7033 ReadMe.txt
7034 Documentation
7035
7036 StandardizedVariants.html
7037 Obsoleted as of Unicode 9.0, but previously provided a visual
7038 display of the standard variant sequences derived from
7039 StandardizedVariants.txt.
7040
7041 StandardizedVariants.txt
7042 Certain glyph variations for character display are standardized.
7043 This lists the non-Unihan ones; the Unihan ones are also not used
7044 by Perl, and are in a separate Unicode data base
7045 <http://www.unicode.org/ivd>
7046
7047 TangutSources.txt
7048 Specifies source mappings for Tangut ideographs and components.
7049 This data file also includes informative radical-stroke values that
7050 are used internally by Unicode
7051
7052 USourceData.txt
7053 Documentation of status and cross reference of proposals for
7054 encoding by Unicode of Unihan characters
7055
7056 USourceGlyphs.pdf
7057 Pictures of the characters in USourceData.txt
7058
SEE ALSO
7060 <http://www.unicode.org/reports/tr44/>
7061
7062 perlrecharclass
7063
7064 perlunicode
7065
7066
7067
7068perl v5.28.2 2019-04-23 PERLUNIPROPS(1)