1PERLUNIPROPS(1) Perl Programmers Reference Guide PERLUNIPROPS(1)
2
3
4
6 perluniprops - Index of Unicode Version 15.0.0 character properties in
7 Perl
8
10 This document provides information about the portion of the Unicode
11 database that deals with character properties, that is the portion that
12 is defined on single code points. ("Other information in the Unicode
13 data base" below briefly mentions other data that Unicode provides.)
14
15 Perl can provide access to all non-provisional Unicode character
16 properties, though not all are enabled by default. The omitted ones
17 are the Unihan properties and certain deprecated or Unicode-internal
18 properties. (An installation may choose to recompile Perl's tables to
19 change this. See "Unicode character properties that are NOT accepted
20 by Perl".)
21
22 For most purposes, access to Unicode properties from the Perl core is
23 through regular expression matches, as described in the next section.
24 For some special purposes, and to access the properties that are not
25 suitable for regular expression matching, all the Unicode character
26 properties that Perl handles are accessible via the standard
27 Unicode::UCD module, as described in the section "Properties accessible
28 through Unicode::UCD".
29
30 Perl also provides some additional extensions and short-cut synonyms
31 for Unicode properties.
32
33 This document merely lists all available properties and does not
34 attempt to explain what each property really means. There is a brief
35 description of each Perl extension; see "Other Properties" in
36 perlunicode for more information on these. There is some detail about
37 Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
38 to find out about the intricacies of the official Unicode properties,
39 refer to the Unicode standard. A good starting place is
40 <http://www.unicode.org/reports/tr44/>.
41
42 Note that you can define your own properties; see "User-Defined
43 Character Properties" in perlunicode.
44
46 The Perl regular expression "\p{}" and "\P{}" constructs give access to
47 most of the Unicode character properties. The table below shows all
48 these constructs, both single and compound forms.
49
50 Compound forms consist of two components, separated by an equals sign
51 or a colon. The first component is the property name, and the second
52 component is the particular value of the property to match against, for
53 example, "\p{Script_Extensions: Greek}" and
54 "\p{Script_Extensions=Greek}" both mean to match characters whose
55 Script_Extensions property value is Greek. ("Script_Extensions" is an
56 improved version of the "Script" property.)
57
58 Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
59 their equivalent compound forms. The table shows these equivalences.
60 (In our example, "\p{Greek}" is a just a shortcut for
61 "\p{Script_Extensions=Greek}"). There are also a few Perl-defined
62 single forms that are not shortcuts for a compound form. One such is
63 "\p{Word}". These are also listed in the table.
64
65 In parsing these constructs, Perl always ignores Upper/lower case
66 differences everywhere within the {braces}. Thus "\p{Greek}" means the
67 same thing as "\p{greek}". But note that changing the case of the "p"
68 or "P" before the left brace completely changes the meaning of the
69 construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
70 Casing in this document is for improved legibility.
71
72 Also, white space, hyphens, and underscores are normally ignored
73 everywhere between the {braces}, and hence can be freely added or
74 removed even if the "/x" modifier hasn't been specified on the regular
75 expression. But in the table below a 'T' at the beginning of an entry
76 means that tighter (stricter) rules are used for that entry:
77
78 Single form ("\p{name}") tighter rules:
79 White space, hyphens, and underscores ARE significant except
80 for:
81
82 • white space adjacent to a non-word character
83
84 • underscores separating digits in numbers
85
86 That means, for example, that you can freely add or remove
87 white space adjacent to (but within) the braces without
88 affecting the meaning.
89
90 Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
91 The tighter rules given above for the single form apply to
92 everything to the right of the colon or equals; the looser
93 rules still apply to everything to the left.
94
95 That means, for example, that you can freely add or remove
96 white space adjacent to (but within) the braces and the colon
97 or equal sign.
98
99 Some properties are considered obsolete by Unicode, but still
100 available. There are several varieties of obsolescence:
101
102 Stabilized
103 A property may be stabilized. Such a determination does not
104 indicate that the property should or should not be used;
105 instead it is a declaration that the property will not be
106 maintained nor extended for newly encoded characters. Such
107 properties are marked with an 'S' in the table.
108
109 Deprecated
110 A property may be deprecated, perhaps because its original
111 intent has been replaced by another property, or because its
112 specification was somehow defective. This means that its use
113 is strongly discouraged, so much so that a warning will be
114 issued if used, unless the regular expression is in the scope
115 of a "no warnings 'deprecated'" statement. A 'D' flags each
116 such entry in the table, and the entry there for the longest,
117 most descriptive version of the property will give the reason
118 it is deprecated, and perhaps advice. Perl may issue such a
119 warning, even for properties that aren't officially deprecated
120 by Unicode, when there used to be characters or code points
121 that were matched by them, but no longer. This is to warn you
122 that your program may not work like it did on earlier Unicode
123 releases.
124
125 A deprecated property may be made unavailable in a future Perl
126 version, so it is best to move away from them.
127
128 A deprecated property may also be stabilized, but this fact is
129 not shown.
130
131 Obsolete
132 Properties marked with an 'O' in the table are considered
133 (plain) obsolete. Generally this designation is given to
134 properties that Unicode once used for internal purposes (but
135 not any longer).
136
137 Discouraged
138 This is not actually a Unicode-specified obsolescence, but
139 applies to certain Perl extensions that are present for
140 backwards compatibility, but are discouraged from being used.
141 These are not obsolete, but their meanings are not stable.
142 Future Unicode versions could force any of these extensions to
143 be removed without warning, replaced by another property with
144 the same name that means something different. An 'X' flags
145 each such entry in the table. Use the equivalent shown
146 instead.
147
148 In particular, matches in the Block property have single forms
149 defined by Perl that begin with "In_", ""Is_", or even with no
150 prefix at all, Like all DISCOURAGED forms, these are not
151 stable. For example, "\p{Block=Deseret}" can currently be
152 written as "\p{In_Deseret}", "\p{Is_Deseret}", or
153 "\p{Deseret}". But, a new Unicode version may come along that
154 would force Perl to change the meaning of one or more of these,
155 and your program would no longer be correct. Currently there
156 are no such conflicts with the form that begins "In_", but
157 there are many with the other two shortcuts, and Unicode
158 continues to define new properties that begin with "In", so
159 it's quite possible that a conflict will occur in the future.
160 The compound form is guaranteed to not become obsolete, and its
161 meaning is clearer anyway. See "Blocks" in perlunicode for
162 more information about this.
163
164 User-defined properties must begin with "In" or "Is". These
165 override any Unicode property of the same name.
166
167 The table below has two columns. The left column contains the "\p{}"
168 constructs to look up, possibly preceded by the flags mentioned above;
169 and the right column contains information about them, like a
170 description, or synonyms. The table shows both the single and compound
171 forms for each property that has them. If the left column is a short
172 name for a property, the right column will give its longer, more
173 descriptive name; and if the left column is the longest name, the right
174 column will show any equivalent shortest name, in both single and
175 compound forms if applicable.
176
177 If braces are not needed to specify a property (e.g., "\pL"), the left
178 column contains both forms, with and without braces.
179
180 The right column will also caution you if a property means something
181 different than what might normally be expected.
182
183 All single forms are Perl extensions; a few compound forms are as well,
184 and are noted as such.
185
186 Numbers in (parentheses) indicate the total number of Unicode code
187 points matched by the property. For the entries that give the longest,
188 most descriptive version of the property, the count is followed by a
189 list of some of the code points matched by it. The list includes all
190 the matched characters in the 0-255 range, enclosed in the familiar
191 [brackets] the same as a regular expression bracketed character class.
192 Following that, the next few higher matching ranges are also given. To
193 avoid visual ambiguity, the SPACE character is represented as "\x20".
194
195 For emphasis, those properties that match no code points at all are
196 listed as well in a separate section following the table.
197
198 Most properties match the same code points regardless of whether "/i"
199 case-insensitive matching is specified or not. But a few properties
200 are affected. These are shown with the notation "(/i= other_property)"
201 in the second column. Under case-insensitive matching they match the
202 same code pode points as the property other_property.
203
204 There is no description given for most non-Perl defined properties (See
205 <http://www.unicode.org/reports/tr44/> for that).
206
207 For compactness, '*' is used as a wildcard instead of showing all
208 possible combinations. For example, entries like:
209
210 \p{Gc: *} \p{General_Category: *}
211
212 mean that 'Gc' is a synonym for 'General_Category', and anything that
213 is valid for the latter is also valid for the former. Similarly,
214
215 \p{Is_*} \p{*}
216
217 means that if and only if, for example, "\p{Foo}" exists, then
218 "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
219 thing. And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
220 and "\p{IsFoo=Bar}". "*" here is restricted to something not beginning
221 with an underscore.
222
223 Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
224 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table
225 shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
226 for the other possibilities. Note that not all properties which have
227 values 'Yes' and 'No' are binary, and they have all their values
228 spelled out without using this wild card, and a "NOT" clause in their
229 description that highlights their not being binary. These also require
230 the compound form to match them, whereas true binary properties have
231 both single and compound forms available.
232
233 Note that all non-essential underscores are removed in the display of
234 the short names below.
235
236 Legend summary:
237
238 * is a wild-card
239 (\d+) in the info column gives the number of Unicode code points
240 matched by this property.
241 D means this is deprecated.
242 O means this is obsolete.
243 S means this is stabilized.
244 T means tighter (stricter) name matching applies.
245 X means use of this form is discouraged, and may not be stable.
246
247 NAME INFO
248
249 \p{Adlam} \p{Script_Extensions=Adlam} (Short:
250 \p{Adlm}; NOT \p{Block=Adlam}) (90)
251 \p{Adlm} \p{Adlam} (= \p{Script_Extensions=Adlam})
252 (NOT \p{Block=Adlam}) (90)
253 X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64)
254 T \p{Age: 1.1} \p{Age=V1_1} (33_979)
255 \p{Age: V1_1} Code point's usage introduced in version
256 1.1 (33_979: U+0000..01F5, U+01FA..0217,
257 U+0250..02A8, U+02B0..02DE,
258 U+02E0..02E9, U+0300..0345 ...)
259 T \p{Age: 2.0} \p{Age=V2_0} (144_521)
260 \p{Age: V2_0} Code point's usage was introduced in
261 version 2.0; See also Property
262 'Present_In' (144_521: U+0591..05A1,
263 U+05A3..05AF, U+05C4, U+0F00..0F47,
264 U+0F49..0F69, U+0F71..0F8B ...)
265 T \p{Age: 2.1} \p{Age=V2_1} (2)
266 \p{Age: V2_1} Code point's usage was introduced in
267 version 2.1; See also Property
268 'Present_In' (2: U+20AC, U+FFFC)
269 T \p{Age: 3.0} \p{Age=V3_0} (10_307)
270 \p{Age: V3_0} Code point's usage was introduced in
271 version 3.0; See also Property
272 'Present_In' (10_307: U+01F6..01F9,
273 U+0218..021F, U+0222..0233,
274 U+02A9..02AD, U+02DF, U+02EA..02EE ...)
275 T \p{Age: 3.1} \p{Age=V3_1} (44_978)
276 \p{Age: V3_1} Code point's usage was introduced in
277 version 3.1; See also Property
278 'Present_In' (44_978: U+03F4..03F5,
279 U+FDD0..FDEF, U+10300..1031E,
280 U+10320..10323, U+10330..1034A,
281 U+10400..10425 ...)
282 T \p{Age: 3.2} \p{Age=V3_2} (1016)
283 \p{Age: V3_2} Code point's usage was introduced in
284 version 3.2; See also Property
285 'Present_In' (1016: U+0220, U+034F,
286 U+0363..036F, U+03D8..03D9, U+03F6,
287 U+048A..048B ...)
288 T \p{Age: 4.0} \p{Age=V4_0} (1226)
289 \p{Age: V4_0} Code point's usage was introduced in
290 version 4.0; See also Property
291 'Present_In' (1226: U+0221,
292 U+0234..0236, U+02AE..02AF,
293 U+02EF..02FF, U+0350..0357, U+035D..035F
294 ...)
295 T \p{Age: 4.1} \p{Age=V4_1} (1273)
296 \p{Age: V4_1} Code point's usage was introduced in
297 version 4.1; See also Property
298 'Present_In' (1273: U+0237..0241,
299 U+0358..035C, U+03FC..03FF,
300 U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
301 T \p{Age: 5.0} \p{Age=V5_0} (1369)
302 \p{Age: V5_0} Code point's usage was introduced in
303 version 5.0; See also Property
304 'Present_In' (1369: U+0242..024F,
305 U+037B..037D, U+04CF, U+04FA..04FF,
306 U+0510..0513, U+05BA ...)
307 T \p{Age: 5.1} \p{Age=V5_1} (1624)
308 \p{Age: V5_1} Code point's usage was introduced in
309 version 5.1; See also Property
310 'Present_In' (1624: U+0370..0373,
311 U+0376..0377, U+03CF, U+0487,
312 U+0514..0523, U+0606..060A ...)
313 T \p{Age: 5.2} \p{Age=V5_2} (6648)
314 \p{Age: V5_2} Code point's usage was introduced in
315 version 5.2; See also Property
316 'Present_In' (6648: U+0524..0525,
317 U+0800..082D, U+0830..083E, U+0900,
318 U+094E, U+0955 ...)
319 T \p{Age: 6.0} \p{Age=V6_0} (2088)
320 \p{Age: V6_0} Code point's usage was introduced in
321 version 6.0; See also Property
322 'Present_In' (2088: U+0526..0527,
323 U+0620, U+065F, U+0840..085B, U+085E,
324 U+093A..093B ...)
325 T \p{Age: 6.1} \p{Age=V6_1} (732)
326 \p{Age: V6_1} Code point's usage was introduced in
327 version 6.1; See also Property
328 'Present_In' (732: U+058F, U+0604,
329 U+08A0, U+08A2..08AC, U+08E4..08FE,
330 U+0AF0 ...)
331 T \p{Age: 6.2} \p{Age=V6_2} (1)
332 \p{Age: V6_2} Code point's usage was introduced in
333 version 6.2; See also Property
334 'Present_In' (1: U+20BA)
335 T \p{Age: 6.3} \p{Age=V6_3} (5)
336 \p{Age: V6_3} Code point's usage was introduced in
337 version 6.3; See also Property
338 'Present_In' (5: U+061C, U+2066..2069)
339 T \p{Age: 7.0} \p{Age=V7_0} (2834)
340 \p{Age: V7_0} Code point's usage was introduced in
341 version 7.0; See also Property
342 'Present_In' (2834: U+037F,
343 U+0528..052F, U+058D..058E, U+0605,
344 U+08A1, U+08AD..08B2 ...)
345 T \p{Age: 8.0} \p{Age=V8_0} (7716)
346 \p{Age: V8_0} Code point's usage was introduced in
347 version 8.0; See also Property
348 'Present_In' (7716: U+08B3..08B4,
349 U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
350 ...)
351 T \p{Age: 9.0} \p{Age=V9_0} (7500)
352 \p{Age: V9_0} Code point's usage was introduced in
353 version 9.0; See also Property
354 'Present_In' (7500: U+08B6..08BD,
355 U+08D4..08E2, U+0C80, U+0D4F,
356 U+0D54..0D56, U+0D58..0D5E ...)
357 T \p{Age: 10.0} \p{Age=V10_0} (8518)
358 \p{Age: V10_0} Code point's usage was introduced in
359 version 10.0; See also Property
360 'Present_In' (8518: U+0860..086A,
361 U+09FC..09FD, U+0AFA..0AFF, U+0D00,
362 U+0D3B..0D3C, U+1CF7 ...)
363 T \p{Age: 11.0} \p{Age=V11_0} (684)
364 \p{Age: V11_0} Code point's usage was introduced in
365 version 11.0; See also Property
366 'Present_In' (684: U+0560, U+0588,
367 U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
368 T \p{Age: 12.0} \p{Age=V12_0} (554)
369 \p{Age: V12_0} Code point's usage was introduced in
370 version 12.0; See also Property
371 'Present_In' (554: U+0C77, U+0E86,
372 U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
373 T \p{Age: 12.1} \p{Age=V12_1} (1)
374 \p{Age: V12_1} Code point's usage was introduced in
375 version 12.1; See also Property
376 'Present_In' (1: U+32FF)
377 T \p{Age: 13.0} \p{Age=V13_0} (5930)
378 \p{Age: V13_0} Code point's usage was introduced in
379 version 13.0; See also Property
380 'Present_In' (5930: U+08BE..08C7,
381 U+0B55, U+0D04, U+0D81, U+1ABF..1AC0,
382 U+2B97 ...)
383 T \p{Age: 14.0} \p{Age=V14_0} (838)
384 \p{Age: V14_0} Code point's usage was introduced in
385 version 14.0; See also Property
386 'Present_In' (838: U+061D, U+0870..088E,
387 U+0890..0891, U+0898..089F, U+08B5,
388 U+08C8..08D2 ...)
389 T \p{Age: 15.0} \p{Age=V15_0} (4489)
390 \p{Age: V15_0} Code point's usage was introduced in
391 version 15.0; See also Property
392 'Present_In' (4489: U+0CF3, U+0ECE,
393 U+10EFD..10EFF, U+1123F..11241,
394 U+11B00..11B09, U+11F00..11F10 ...)
395 \p{Age: NA} \p{Age=Unassigned} (825_279 plus all
396 above-Unicode code points)
397 \p{Age: Unassigned} Code point's usage has not been assigned
398 in any Unicode release thus far.
399 (Short: \p{Age=NA}) (825_279 plus all above-Unicode code points:
400 U+0378..0379, U+0380..0383, U+038B,
401 U+038D, U+03A2, U+0530 ...)
402 \p{Aghb} \p{Caucasian_Albanian} (=
403 \p{Script_Extensions=
404 Caucasian_Albanian}) (NOT \p{Block=
405 Caucasian_Albanian}) (53)
406 \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
407 (22)
408 \p{AHex: *} \p{ASCII_Hex_Digit: *}
409 \p{Ahom} \p{Script_Extensions=Ahom} (NOT \p{Block=
410 Ahom}) (65)
411 X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block=
412 Alchemical_Symbols}) (128)
413 X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
414 \p{InAlchemical}) (128)
415 \p{All} All code points, including those above
416 Unicode. Same as qr/./s (1_114_112 plus
417 all above-Unicode code points:
418 U+0000..infinity)
419 \p{Alnum} \p{XPosixAlnum} (138_445)
420 \p{Alpha} \p{XPosixAlpha} (= \p{Alphabetic=Y})
421 (137_765)
422 \p{Alpha: *} \p{Alphabetic: *}
423 \p{Alphabetic} \p{XPosixAlpha} (= \p{Alphabetic=Y})
424 (137_765)
425 \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (976_347
426 plus all above-Unicode code points:
427 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
428 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
429 \xb6-\xb9\xbb-\xbf\xd7\xf7],
430 U+02C2..02C5, U+02D2..02DF,
431 U+02E5..02EB, U+02ED, U+02EF..0344 ...)
432 \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (137_765:
433 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
434 \xf8-\xff], U+0100..02C1, U+02C6..02D1,
435 U+02E0..02E4, U+02EC, U+02EE ...)
436 X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (=
437 \p{Block=Alphabetic_Presentation_Forms})
438 (80)
439 X \p{Alphabetic_Presentation_Forms} \p{Block=
440 Alphabetic_Presentation_Forms} (Short:
441 \p{InAlphabeticPF}) (80)
442 \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
443 Anatolian_Hieroglyphs} (Short: \p{Hluw};
444 NOT \p{Block=Anatolian_Hieroglyphs})
445 (583)
446 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
447 \p{Block=
448 Ancient_Greek_Musical_Notation}) (80)
449 X \p{Ancient_Greek_Musical_Notation} \p{Block=
450 Ancient_Greek_Musical_Notation} (Short:
451 \p{InAncientGreekMusic}) (80)
452 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
453 X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64)
454 \p{Any} All Unicode code points (1_114_112:
455 U+0000..10FFFF)
456 \p{Arab} \p{Arabic} (= \p{Script_Extensions=
457 Arabic}) (NOT \p{Block=Arabic}) (1414)
458 \p{Arabic} \p{Script_Extensions=Arabic} (Short:
459 \p{Arab}; NOT \p{Block=Arabic}) (1414)
460 X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block=
461 Arabic_Extended_A}) (96)
462 X \p{Arabic_Ext_B} \p{Arabic_Extended_B} (= \p{Block=
463 Arabic_Extended_B}) (48)
464 X \p{Arabic_Ext_C} \p{Arabic_Extended_C} (= \p{Block=
465 Arabic_Extended_C}) (64)
466 X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short:
467 \p{InArabicExtA}) (96)
468 X \p{Arabic_Extended_B} \p{Block=Arabic_Extended_B} (Short:
469 \p{InArabicExtB}) (48)
470 X \p{Arabic_Extended_C} \p{Block=Arabic_Extended_C} (Short:
471 \p{InArabicExtC}) (64)
472 X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols}
473 (= \p{Block=
474 Arabic_Mathematical_Alphabetic_Symbols})
475 (256)
476 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
477 Arabic_Mathematical_Alphabetic_Symbols}
478 (Short: \p{InArabicMath}) (256)
479 X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (=
480 \p{Block=Arabic_Presentation_Forms_A})
481 (688)
482 X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (=
483 \p{Block=Arabic_Presentation_Forms_B})
484 (144)
485 X \p{Arabic_Presentation_Forms_A} \p{Block=
486 Arabic_Presentation_Forms_A} (Short:
487 \p{InArabicPFA}) (688)
488 X \p{Arabic_Presentation_Forms_B} \p{Block=
489 Arabic_Presentation_Forms_B} (Short:
490 \p{InArabicPFB}) (144)
491 X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block=
492 Arabic_Supplement}) (48)
493 X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short:
494 \p{InArabicSup}) (48)
495 \p{Armenian} \p{Script_Extensions=Armenian} (Short:
496 \p{Armn}; NOT \p{Block=Armenian}) (96)
497 \p{Armi} \p{Imperial_Aramaic} (=
498 \p{Script_Extensions=Imperial_Aramaic})
499 (NOT \p{Block=Imperial_Aramaic}) (31)
500 \p{Armn} \p{Armenian} (= \p{Script_Extensions=
501 Armenian}) (NOT \p{Block=Armenian}) (96)
502 X \p{Arrows} \p{Block=Arrows} (112)
503 \p{ASCII} \p{Block=Basic_Latin} (128)
504 \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
505 (22)
506 \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
507 plus all above-Unicode code points:
508 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
509 \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
510 U+0100..infinity)
511 \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
512 Fa-f])
513 \p{Assigned} All assigned code points (288_767:
514 U+0000..0377, U+037A..037F,
515 U+0384..038A, U+038C, U+038E..03A1,
516 U+03A3..052F ...)
517 \p{Avestan} \p{Script_Extensions=Avestan} (Short:
518 \p{Avst}; NOT \p{Block=Avestan}) (61)
519 \p{Avst} \p{Avestan} (= \p{Script_Extensions=
520 Avestan}) (NOT \p{Block=Avestan}) (61)
521 \p{Bali} \p{Balinese} (= \p{Script_Extensions=
522 Balinese}) (NOT \p{Block=Balinese}) (124)
523 \p{Balinese} \p{Script_Extensions=Balinese} (Short:
524 \p{Bali}; NOT \p{Block=Balinese}) (124)
525 \p{Bamu} \p{Bamum} (= \p{Script_Extensions=Bamum})
526 (NOT \p{Block=Bamum}) (657)
527 \p{Bamum} \p{Script_Extensions=Bamum} (Short:
528 \p{Bamu}; NOT \p{Block=Bamum}) (657)
529 X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block=
530 Bamum_Supplement}) (576)
531 X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short:
532 \p{InBamumSup}) (576)
533 X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128)
534 \p{Bass} \p{Bassa_Vah} (= \p{Script_Extensions=
535 Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
536 (36)
537 \p{Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
538 \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
539 \p{Batak} \p{Script_Extensions=Batak} (Short:
540 \p{Batk}; NOT \p{Block=Batak}) (56)
541 \p{Batk} \p{Batak} (= \p{Script_Extensions=Batak})
542 (NOT \p{Block=Batak}) (56)
543 \p{Bc: *} \p{Bidi_Class: *}
544 \p{Beng} \p{Bengali} (= \p{Script_Extensions=
545 Bengali}) (NOT \p{Block=Bengali}) (113)
546 \p{Bengali} \p{Script_Extensions=Bengali} (Short:
547 \p{Beng}; NOT \p{Block=Bengali}) (113)
548 \p{Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
549 \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
550 \p{Bhks} \p{Bhaiksuki} (= \p{Script_Extensions=
551 Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
552 (97)
553 \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y})
554 (12)
555 \p{Bidi_C: *} \p{Bidi_Control: *}
556 \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1769)
557 \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (63)
558 \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1769: U+0608,
559 U+060B, U+060D, U+061B..064A,
560 U+066D..066F, U+0671..06D5 ...)
561 \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (63:
562 U+0600..0605, U+0660..0669,
563 U+066B..066C, U+06DD, U+0890..0891,
564 U+08E2 ...)
565 \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7)
566 \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4016)
567 \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016: [^\t\n
568 \cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-\xff],
569 U+180E, U+200B..200D, U+2060..2065,
570 U+206A..206F, U+FDD0..FDEF ...)
571 \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15: [,.\/:
572 \xa0], U+060C, U+202F, U+2044, U+FE50,
573 U+FE52 ...)
574 \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15)
575 \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (168)
576 \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12)
577 \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (92)
578 \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (168: [0-9\xb2-
579 \xb3\xb9], U+06F0..06F9, U+2070,
580 U+2074..2079, U+2080..2089, U+2488..249B
581 ...)
582 \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
583 U+207A..207B, U+208A..208B, U+2212,
584 U+FB29, U+FE62..FE63 ...)
585 \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92: [#\$
586 \%\xa2-\xa5\xb0-\xb1], U+058F,
587 U+0609..060A, U+066A, U+09F2..09F3,
588 U+09FB ...)
589 \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
590 U+2068)
591 \p{Bidi_Class: FSI} \p{Bidi_Class=First_Strong_Isolate} (1)
592 \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_096_272
593 plus all above-Unicode code points)
594 \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_272 plus
595 all above-Unicode code points: [A-Za-z
596 \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
597 \xff], U+0100..02B8, U+02BB..02C1,
598 U+02D0..02D1, U+02E0..02E4, U+02EE ...)
599 \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
600 U+202A)
601 \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
602 U+2066)
603 \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
604 U+202D)
605 \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1)
606 \p{Bidi_Class: LRI} \p{Bidi_Class=Left_To_Right_Isolate} (1)
607 \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1)
608 \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1993:
609 U+0300..036F, U+0483..0489,
610 U+0591..05BD, U+05BF, U+05C1..05C2,
611 U+05C4..05C5 ...)
612 \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1993)
613 \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (6029)
614 \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (6029: [!\"&\'
615 \(\)*;<=>?\@\[\\\]\^_`\{\|\}~\xa1\xa6-
616 \xa9\xab-\xac\xae-\xaf\xb4\xb6-\xb8\xbb-
617 \xbf\xd7\xf7], U+02B9..02BA,
618 U+02C2..02CF, U+02D2..02DF,
619 U+02E5..02ED, U+02EF..02FF ...)
620 \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7: [\n\r
621 \x1c-\x1e\x85], U+2029)
622 \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1)
623 \p{Bidi_Class: PDI} \p{Bidi_Class=Pop_Directional_Isolate} (1)
624 \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
625 U+202C)
626 \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
627 U+2069)
628 \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (3647)
629 \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3647: U+0590,
630 U+05BE, U+05C0, U+05C3, U+05C6,
631 U+05C8..05FF ...)
632 \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
633 U+202B)
634 \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
635 U+2067)
636 \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
637 U+202E)
638 \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1)
639 \p{Bidi_Class: RLI} \p{Bidi_Class=Right_To_Left_Isolate} (1)
640 \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1)
641 \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3)
642 \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3: [\t\cK
643 \x1f])
644 \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
645 U+1680, U+2000..200A, U+2028, U+205F,
646 U+3000)
647 \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (17)
648 \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
649 \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
650 plus all above-Unicode code points:
651 U+0000..061B, U+061D..200D,
652 U+2010..2029, U+202F..2065,
653 U+206A..infinity)
654 \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (12:
655 U+061C, U+200E..200F, U+202A..202E,
656 U+2066..2069)
657 \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
658 (553)
659 \p{Bidi_M: *} \p{Bidi_Mirrored: *}
660 \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
661 (553)
662 \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_559
663 plus all above-Unicode code points:
664 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
665 Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
666 \xff], U+0100..0F39, U+0F3E..169A,
667 U+169D..2038, U+203B..2044, U+2047..207C
668 ...)
669 \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (553:
670 [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
671 U+169B..169C, U+2039..203A,
672 U+2045..2046, U+207D..207E ...)
673 \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
674 (64)
675 \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (64: [\)\]
676 \}], U+0F3B, U+0F3D, U+169C, U+2046,
677 U+207E ...)
678 \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
679 (1_113_984 plus all above-Unicode code
680 points)
681 \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_984
682 plus all above-Unicode code points:
683 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=>?
684 \@A-Z\\\^_`a-z\|~\x7f-\xff],
685 U+0100..0F39, U+0F3E..169A,
686 U+169D..2044, U+2047..207C, U+207F..208C
687 ...)
688 \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
689 (64)
690 \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (64:
691 [\(\[\{], U+0F3A, U+0F3C, U+169B,
692 U+2045, U+207D ...)
693 \p{Blank} \p{XPosixBlank} (18)
694 \p{Blk: *} \p{Block: *}
695 \p{Block: Adlam} (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
696 U+1E900..1E95F)
697 \p{Block: Aegean_Numbers} (64: U+10100..1013F)
698 \p{Block: Ahom} (NOT \p{Ahom} NOR \p{Is_Ahom}) (80:
699 U+11700..1174F)
700 \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128)
701 \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
702 U+1F700..1F77F)
703 \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
704 (80)
705 \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
706 AlphabeticPF}) (80: U+FB00..FB4F)
707 \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
708 NOR \p{Is_Anatolian_Hieroglyphs}) (640:
709 U+14400..1467F)
710 \p{Block: Ancient_Greek_Music} \p{Block=
711 Ancient_Greek_Musical_Notation} (80)
712 \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
713 AncientGreekMusic}) (80: U+1D200..1D24F)
714 \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
715 \p{Block: Ancient_Symbols} (64: U+10190..101CF)
716 \p{Block: Arabic} (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
717 U+0600..06FF)
718 \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
719 \p{Block: Arabic_Ext_B} \p{Block=Arabic_Extended_B} (48)
720 \p{Block: Arabic_Ext_C} \p{Block=Arabic_Extended_C} (64)
721 \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
722 U+08A0..08FF)
723 \p{Block: Arabic_Extended_B} (Short: \p{Blk=ArabicExtB}) (48:
724 U+0870..089F)
725 \p{Block: Arabic_Extended_C} (Short: \p{Blk=ArabicExtC}) (64:
726 U+10EC0..10EFF)
727 \p{Block: Arabic_Math} \p{Block=
728 Arabic_Mathematical_Alphabetic_Symbols}
729 (256)
730 \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
731 ArabicMath}) (256: U+1EE00..1EEFF)
732 \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688)
733 \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144)
734 \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
735 (688: U+FB50..FDFF)
736 \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
737 (144: U+FE70..FEFF)
738 \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48)
739 \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
740 U+0750..077F)
741 \p{Block: Armenian} (NOT \p{Armenian} NOR \p{Is_Armenian})
742 (96: U+0530..058F)
743 \p{Block: Arrows} (112: U+2190..21FF)
744 \p{Block: ASCII} \p{Block=Basic_Latin} (128)
745 \p{Block: Avestan} (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
746 U+10B00..10B3F)
747 \p{Block: Balinese} (NOT \p{Balinese} NOR \p{Is_Balinese})
748 (128: U+1B00..1B7F)
749 \p{Block: Bamum} (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
750 U+A6A0..A6FF)
751 \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576)
752 \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
753 U+16800..16A3F)
754 \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
755 \p{Block: Bassa_Vah} (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
756 (48: U+16AD0..16AFF)
757 \p{Block: Batak} (NOT \p{Batak} NOR \p{Is_Batak}) (64:
758 U+1BC0..1BFF)
759 \p{Block: Bengali} (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
760 U+0980..09FF)
761 \p{Block: Bhaiksuki} (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
762 (112: U+11C00..11C6F)
763 \p{Block: Block_Elements} (32: U+2580..259F)
764 \p{Block: Bopomofo} (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
765 (48: U+3100..312F)
766 \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
767 \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
768 U+31A0..31BF)
769 \p{Block: Box_Drawing} (128: U+2500..257F)
770 \p{Block: Brahmi} (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
771 U+11000..1107F)
772 \p{Block: Braille} \p{Block=Braille_Patterns} (256)
773 \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
774 U+2800..28FF)
775 \p{Block: Buginese} (NOT \p{Buginese} NOR \p{Is_Buginese})
776 (32: U+1A00..1A1F)
777 \p{Block: Buhid} (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
778 U+1740..175F)
779 \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
780 (256)
781 \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
782 ByzantineMusic}) (256: U+1D000..1D0FF)
783 \p{Block: Canadian_Syllabics} \p{Block=
784 Unified_Canadian_Aboriginal_Syllabics}
785 (640)
786 \p{Block: Carian} (NOT \p{Carian} NOR \p{Is_Carian}) (64:
787 U+102A0..102DF)
788 \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
789 \p{Is_Caucasian_Albanian}) (64:
790 U+10530..1056F)
791 \p{Block: Chakma} (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
792 U+11100..1114F)
793 \p{Block: Cham} (NOT \p{Cham} NOR \p{Is_Cham}) (96:
794 U+AA00..AA5F)
795 \p{Block: Cherokee} (NOT \p{Cherokee} NOR \p{Is_Cherokee})
796 (96: U+13A0..13FF)
797 \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
798 \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
799 U+AB70..ABBF)
800 \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
801 \p{Block: Chorasmian} (NOT \p{Chorasmian} NOR \p{Is_Chorasmian})
802 (48: U+10FB0..10FDF)
803 \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992)
804 \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256)
805 \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
806 \p{Block: CJK_Compat_Ideographs} \p{Block=
807 CJK_Compatibility_Ideographs} (512)
808 \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
809 CJK_Compatibility_Ideographs_Supplement}
810 (544)
811 \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
812 U+3300..33FF)
813 \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
814 (32: U+FE30..FE4F)
815 \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
816 CJKCompatIdeographs}) (512: U+F900..FAFF)
817 \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
818 CJKCompatIdeographsSup}) (544:
819 U+2F800..2FA1F)
820 \p{Block: CJK_Ext_A} \p{Block=
821 CJK_Unified_Ideographs_Extension_A}
822 (6592)
823 \p{Block: CJK_Ext_B} \p{Block=
824 CJK_Unified_Ideographs_Extension_B}
825 (42_720)
826 \p{Block: CJK_Ext_C} \p{Block=
827 CJK_Unified_Ideographs_Extension_C}
828 (4160)
829 \p{Block: CJK_Ext_D} \p{Block=
830 CJK_Unified_Ideographs_Extension_D} (224)
831 \p{Block: CJK_Ext_E} \p{Block=
832 CJK_Unified_Ideographs_Extension_E}
833 (5776)
834 \p{Block: CJK_Ext_F} \p{Block=
835 CJK_Unified_Ideographs_Extension_F}
836 (7488)
837 \p{Block: CJK_Ext_G} \p{Block=
838 CJK_Unified_Ideographs_Extension_G}
839 (4944)
840 \p{Block: CJK_Ext_H} \p{Block=
841 CJK_Unified_Ideographs_Extension_H}
842 (4192)
843 \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
844 \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
845 (128: U+2E80..2EFF)
846 \p{Block: CJK_Strokes} (48: U+31C0..31EF)
847 \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64)
848 \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
849 (64: U+3000..303F)
850 \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
851 U+4E00..9FFF)
852 \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
853 CJKExtA}) (6592: U+3400..4DBF)
854 \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
855 CJKExtB}) (42_720: U+20000..2A6DF)
856 \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
857 CJKExtC}) (4160: U+2A700..2B73F)
858 \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
859 CJKExtD}) (224: U+2B740..2B81F)
860 \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
861 CJKExtE}) (5776: U+2B820..2CEAF)
862 \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
863 CJKExtF}) (7488: U+2CEB0..2EBEF)
864 \p{Block: CJK_Unified_Ideographs_Extension_G} (Short: \p{Blk=
865 CJKExtG}) (4944: U+30000..3134F)
866 \p{Block: CJK_Unified_Ideographs_Extension_H} (Short: \p{Blk=
867 CJKExtH}) (4192: U+31350..323AF)
868 \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
869 Diacriticals}) (112: U+0300..036F)
870 \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
871 DiacriticalsExt}) (80: U+1AB0..1AFF)
872 \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
873 DiacriticalsForSymbols}) (48:
874 U+20D0..20FF)
875 \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
876 DiacriticalsSup}) (64: U+1DC0..1DFF)
877 \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
878 U+FE20..FE2F)
879 \p{Block: Combining_Marks_For_Symbols} \p{Block=
880 Combining_Diacritical_Marks_For_Symbols}
881 (48)
882 \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
883 IndicNumberForms}) (16: U+A830..A83F)
884 \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96)
885 \p{Block: Control_Pictures} (64: U+2400..243F)
886 \p{Block: Coptic} (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
887 U+2C80..2CFF)
888 \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
889 \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
890 \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
891 U+1D360..1D37F)
892 \p{Block: Cuneiform} (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
893 (1024: U+12000..123FF)
894 \p{Block: Cuneiform_Numbers} \p{Block=
895 Cuneiform_Numbers_And_Punctuation} (128)
896 \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
897 CuneiformNumbers}) (128: U+12400..1247F)
898 \p{Block: Currency_Symbols} (48: U+20A0..20CF)
899 \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
900 \p{Block: Cypro_Minoan} (NOT \p{Cypro_Minoan} NOR
901 \p{Is_Cypro_Minoan}) (112:
902 U+12F90..12FFF)
903 \p{Block: Cyrillic} (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
904 (256: U+0400..04FF)
905 \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
906 \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
907 \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
908 \p{Block: Cyrillic_Ext_D} \p{Block=Cyrillic_Extended_D} (96)
909 \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
910 U+2DE0..2DFF)
911 \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
912 U+A640..A69F)
913 \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
914 U+1C80..1C8F)
915 \p{Block: Cyrillic_Extended_D} (Short: \p{Blk=CyrillicExtD}) (96:
916 U+1E030..1E08F)
917 \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
918 \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
919 U+0500..052F)
920 \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
921 (48)
922 \p{Block: Deseret} (80: U+10400..1044F)
923 \p{Block: Devanagari} (NOT \p{Devanagari} NOR \p{Is_Devanagari})
924 (128: U+0900..097F)
925 \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
926 \p{Block: Devanagari_Ext_A} \p{Block=Devanagari_Extended_A} (96)
927 \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
928 U+A8E0..A8FF)
929 \p{Block: Devanagari_Extended_A} (Short: \p{Blk=DevanagariExtA})
930 (96: U+11B00..11B5F)
931 \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
932 \p{Block: Diacriticals_Ext} \p{Block=
933 Combining_Diacritical_Marks_Extended}
934 (80)
935 \p{Block: Diacriticals_For_Symbols} \p{Block=
936 Combining_Diacritical_Marks_For_Symbols}
937 (48)
938 \p{Block: Diacriticals_Sup} \p{Block=
939 Combining_Diacritical_Marks_Supplement}
940 (64)
941 \p{Block: Dingbats} (192: U+2700..27BF)
942 \p{Block: Dives_Akuru} (NOT \p{Dives_Akuru} NOR
943 \p{Is_Dives_Akuru}) (96: U+11900..1195F)
944 \p{Block: Dogra} (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
945 U+11800..1184F)
946 \p{Block: Domino} \p{Block=Domino_Tiles} (112)
947 \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
948 U+1F030..1F09F)
949 \p{Block: Duployan} (NOT \p{Duployan} NOR \p{Is_Duployan})
950 (160: U+1BC00..1BC9F)
951 \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
952 \p{Block: Egyptian_Hieroglyph_Format_Controls} (48: U+13430..1345F)
953 \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
954 \p{Is_Egyptian_Hieroglyphs}) (1072:
955 U+13000..1342F)
956 \p{Block: Elbasan} (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
957 U+10500..1052F)
958 \p{Block: Elymaic} (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
959 U+10FE0..10FFF)
960 \p{Block: Emoticons} (80: U+1F600..1F64F)
961 \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
962 \p{Block: Enclosed_Alphanum_Sup} \p{Block=
963 Enclosed_Alphanumeric_Supplement} (256)
964 \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
965 EnclosedAlphanumSup}) (256:
966 U+1F100..1F1FF)
967 \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
968 EnclosedAlphanum}) (160: U+2460..24FF)
969 \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
970 (256)
971 \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
972 EnclosedCJK}) (256: U+3200..32FF)
973 \p{Block: Enclosed_Ideographic_Sup} \p{Block=
974 Enclosed_Ideographic_Supplement} (256)
975 \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
976 EnclosedIdeographicSup}) (256:
977 U+1F200..1F2FF)
978 \p{Block: Ethiopic} (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
979 (384: U+1200..137F)
980 \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
981 \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
982 \p{Block: Ethiopic_Ext_B} \p{Block=Ethiopic_Extended_B} (32)
983 \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
984 U+2D80..2DDF)
985 \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
986 U+AB00..AB2F)
987 \p{Block: Ethiopic_Extended_B} (Short: \p{Blk=EthiopicExtB}) (32:
988 U+1E7E0..1E7FF)
989 \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
990 \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
991 U+1380..139F)
992 \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
993 \p{Punct} NOR \p{Is_Punctuation}) (112:
994 U+2000..206F)
995 \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
996 \p{Block: Geometric_Shapes_Ext} \p{Block=
997 Geometric_Shapes_Extended} (128)
998 \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
999 GeometricShapesExt}) (128:
1000 U+1F780..1F7FF)
1001 \p{Block: Georgian} (NOT \p{Georgian} NOR \p{Is_Georgian})
1002 (96: U+10A0..10FF)
1003 \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
1004 \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
1005 U+1C90..1CBF)
1006 \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
1007 \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
1008 U+2D00..2D2F)
1009 \p{Block: Glagolitic} (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
1010 (96: U+2C00..2C5F)
1011 \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
1012 \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
1013 (48: U+1E000..1E02F)
1014 \p{Block: Gothic} (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
1015 U+10330..1034F)
1016 \p{Block: Grantha} (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
1017 U+11300..1137F)
1018 \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek}
1019 NOR \p{Is_Greek}) (144)
1020 \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
1021 NOR \p{Is_Greek}) (144: U+0370..03FF)
1022 \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256)
1023 \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
1024 U+1F00..1FFF)
1025 \p{Block: Gujarati} (NOT \p{Gujarati} NOR \p{Is_Gujarati})
1026 (128: U+0A80..0AFF)
1027 \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
1028 \p{Is_Gunjala_Gondi}) (80:
1029 U+11D60..11DAF)
1030 \p{Block: Gurmukhi} (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
1031 (128: U+0A00..0A7F)
1032 \p{Block: Half_And_Full_Forms} \p{Block=
1033 Halfwidth_And_Fullwidth_Forms} (240)
1034 \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16)
1035 \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
1036 HalfAndFullForms}) (240: U+FF00..FFEF)
1037 \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
1038 NOR \p{Is_Hangul}) (11_184)
1039 \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
1040 (96: U+3130..318F)
1041 \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
1042 \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
1043 U+A960..A97F)
1044 \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
1045 U+D7B0..D7FF)
1046 \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
1047 NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
1048 \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
1049 \p{Is_Hanifi_Rohingya}) (64:
1050 U+10D00..10D3F)
1051 \p{Block: Hanunoo} (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
1052 U+1720..173F)
1053 \p{Block: Hatran} (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
1054 U+108E0..108FF)
1055 \p{Block: Hebrew} (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
1056 U+0590..05FF)
1057 \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
1058 HighPUSurrogates}) (128: U+DB80..DBFF)
1059 \p{Block: High_PU_Surrogates} \p{Block=
1060 High_Private_Use_Surrogates} (128)
1061 \p{Block: High_Surrogates} (896: U+D800..DB7F)
1062 \p{Block: Hiragana} (NOT \p{Hiragana} NOR \p{Is_Hiragana})
1063 (96: U+3040..309F)
1064 \p{Block: IDC} \p{Block=
1065 Ideographic_Description_Characters} (NOT
1066 \p{ID_Continue} NOR \p{Is_IDC}) (16)
1067 \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
1068 NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
1069 U+2FF0..2FFF)
1070 \p{Block: Ideographic_Symbols} \p{Block=
1071 Ideographic_Symbols_And_Punctuation} (32)
1072 \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
1073 IdeographicSymbols}) (32: U+16FE0..16FFF)
1074 \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
1075 \p{Is_Imperial_Aramaic}) (32:
1076 U+10840..1085F)
1077 \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1078 (16)
1079 \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
1080 \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
1081 NOR \p{Is_Inscriptional_Pahlavi}) (32:
1082 U+10B60..10B7F)
1083 \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
1084 NOR \p{Is_Inscriptional_Parthian}) (32:
1085 U+10B40..10B5F)
1086 \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
1087 \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
1088 U+0250..02AF)
1089 \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
1090 \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
1091 \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
1092 \p{Block: Javanese} (NOT \p{Javanese} NOR \p{Is_Javanese})
1093 (96: U+A980..A9DF)
1094 \p{Block: Kaithi} (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
1095 U+11080..110CF)
1096 \p{Block: Kaktovik_Numerals} (32: U+1D2C0..1D2DF)
1097 \p{Block: Kana_Ext_A} \p{Block=Kana_Extended_A} (48)
1098 \p{Block: Kana_Ext_B} \p{Block=Kana_Extended_B} (16)
1099 \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
1100 U+1B100..1B12F)
1101 \p{Block: Kana_Extended_B} (Short: \p{Blk=KanaExtB}) (16:
1102 U+1AFF0..1AFFF)
1103 \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
1104 \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
1105 U+1B000..1B0FF)
1106 \p{Block: Kanbun} (16: U+3190..319F)
1107 \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
1108 \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
1109 U+2F00..2FDF)
1110 \p{Block: Kannada} (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
1111 U+0C80..0CFF)
1112 \p{Block: Katakana} (NOT \p{Katakana} NOR \p{Is_Katakana})
1113 (96: U+30A0..30FF)
1114 \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
1115 \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
1116 KatakanaExt}) (16: U+31F0..31FF)
1117 \p{Block: Kawi} (NOT \p{Kawi} NOR \p{Is_Kawi}) (96:
1118 U+11F00..11F5F)
1119 \p{Block: Kayah_Li} (48: U+A900..A92F)
1120 \p{Block: Kharoshthi} (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
1121 (96: U+10A00..10A5F)
1122 \p{Block: Khitan_Small_Script} (NOT \p{Khitan_Small_Script} NOR
1123 \p{Is_Khitan_Small_Script}) (512:
1124 U+18B00..18CFF)
1125 \p{Block: Khmer} (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
1126 U+1780..17FF)
1127 \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
1128 \p{Block: Khojki} (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
1129 U+11200..1124F)
1130 \p{Block: Khudawadi} (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
1131 (80: U+112B0..112FF)
1132 \p{Block: Lao} (NOT \p{Lao} NOR \p{Is_Lao}) (128:
1133 U+0E80..0EFF)
1134 \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
1135 \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
1136 \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
1137 \xff])
1138 \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
1139 \p{Block: Latin_Ext_Additional} \p{Block=
1140 Latin_Extended_Additional} (256)
1141 \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
1142 \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
1143 \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
1144 \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64)
1145 \p{Block: Latin_Ext_F} \p{Block=Latin_Extended_F} (64)
1146 \p{Block: Latin_Ext_G} \p{Block=Latin_Extended_G} (256)
1147 \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
1148 U+0100..017F)
1149 \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
1150 LatinExtAdditional}) (256: U+1E00..1EFF)
1151 \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
1152 U+0180..024F)
1153 \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
1154 U+2C60..2C7F)
1155 \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
1156 U+A720..A7FF)
1157 \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
1158 U+AB30..AB6F)
1159 \p{Block: Latin_Extended_F} (Short: \p{Blk=LatinExtF}) (64:
1160 U+10780..107BF)
1161 \p{Block: Latin_Extended_G} (Short: \p{Blk=LatinExtG}) (256:
1162 U+1DF00..1DFFF)
1163 \p{Block: Lepcha} (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
1164 U+1C00..1C4F)
1165 \p{Block: Letterlike_Symbols} (80: U+2100..214F)
1166 \p{Block: Limbu} (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
1167 U+1900..194F)
1168 \p{Block: Linear_A} (NOT \p{Linear_A} NOR \p{Is_Linear_A})
1169 (384: U+10600..1077F)
1170 \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
1171 \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
1172 \p{Block: Lisu} (NOT \p{Lisu} NOR \p{Is_Lisu}) (48:
1173 U+A4D0..A4FF)
1174 \p{Block: Lisu_Sup} \p{Block=Lisu_Supplement} (16)
1175 \p{Block: Lisu_Supplement} (Short: \p{Blk=LisuSup}) (16:
1176 U+11FB0..11FBF)
1177 \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
1178 \p{Block: Lycian} (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
1179 U+10280..1029F)
1180 \p{Block: Lydian} (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
1181 U+10920..1093F)
1182 \p{Block: Mahajani} (NOT \p{Mahajani} NOR \p{Is_Mahajani})
1183 (48: U+11150..1117F)
1184 \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
1185 \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
1186 U+1F000..1F02F)
1187 \p{Block: Makasar} (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
1188 U+11EE0..11EFF)
1189 \p{Block: Malayalam} (NOT \p{Malayalam} NOR \p{Is_Malayalam})
1190 (128: U+0D00..0D7F)
1191 \p{Block: Mandaic} (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
1192 U+0840..085F)
1193 \p{Block: Manichaean} (NOT \p{Manichaean} NOR \p{Is_Manichaean})
1194 (64: U+10AC0..10AFF)
1195 \p{Block: Marchen} (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
1196 U+11C70..11CBF)
1197 \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
1198 \p{Is_Masaram_Gondi}) (96:
1199 U+11D00..11D5F)
1200 \p{Block: Math_Alphanum} \p{Block=
1201 Mathematical_Alphanumeric_Symbols} (1024)
1202 \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
1203 \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
1204 MathAlphanum}) (1024: U+1D400..1D7FF)
1205 \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
1206 (256: U+2200..22FF)
1207 \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF)
1208 \p{Block: Medefaidrin} (NOT \p{Medefaidrin} NOR
1209 \p{Is_Medefaidrin}) (96: U+16E40..16E9F)
1210 \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
1211 \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
1212 \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
1213 \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
1214 (32: U+AAE0..AAFF)
1215 \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
1216 \p{Is_Mende_Kikakui}) (224:
1217 U+1E800..1E8DF)
1218 \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
1219 \p{Is_Meroitic_Cursive}) (96:
1220 U+109A0..109FF)
1221 \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
1222 \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160:
1223 U+16F00..16F9F)
1224 \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
1225 (256)
1226 \p{Block: Misc_Math_Symbols_A} \p{Block=
1227 Miscellaneous_Mathematical_Symbols_A}
1228 (48)
1229 \p{Block: Misc_Math_Symbols_B} \p{Block=
1230 Miscellaneous_Mathematical_Symbols_B}
1231 (128)
1232 \p{Block: Misc_Pictographs} \p{Block=
1233 Miscellaneous_Symbols_And_Pictographs}
1234 (768)
1235 \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
1236 \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
1237 \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
1238 MiscMathSymbolsA}) (48: U+27C0..27EF)
1239 \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
1240 MiscMathSymbolsB}) (128: U+2980..29FF)
1241 \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
1242 (256: U+2600..26FF)
1243 \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
1244 MiscArrows}) (256: U+2B00..2BFF)
1245 \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
1246 MiscPictographs}) (768: U+1F300..1F5FF)
1247 \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
1248 (256: U+2300..23FF)
1249 \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96:
1250 U+11600..1165F)
1251 \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
1252 \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
1253 \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian})
1254 (176: U+1800..18AF)
1255 \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
1256 \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
1257 U+11660..1167F)
1258 \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48:
1259 U+16A40..16A6F)
1260 \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48:
1261 U+11280..112AF)
1262 \p{Block: Music} \p{Block=Musical_Symbols} (256)
1263 \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
1264 U+1D100..1D1FF)
1265 \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
1266 U+1000..109F)
1267 \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
1268 \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
1269 \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
1270 U+AA60..AA7F)
1271 \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
1272 U+A9E0..A9FF)
1273 \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean})
1274 (48: U+10880..108AF)
1275 \p{Block: Nag_Mundari} (NOT \p{Nag_Mundari} NOR
1276 \p{Is_Nag_Mundari}) (48: U+1E4D0..1E4FF)
1277 \p{Block: Nandinagari} (NOT \p{Nandinagari} NOR
1278 \p{Is_Nandinagari}) (96: U+119A0..119FF)
1279 \p{Block: NB} \p{Block=No_Block} (820_944 plus all
1280 above-Unicode code points)
1281 \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR
1282 \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
1283 \p{Block: Newa} (NOT \p{Newa} NOR \p{Is_Newa}) (128:
1284 U+11400..1147F)
1285 \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64:
1286 U+07C0..07FF)
1287 \p{Block: No_Block} (Short: \p{Blk=NB}) (820_944 plus all
1288 above-Unicode code points: U+2FE0..2FEF,
1289 U+10200..1027F, U+103E0..103FF,
1290 U+105C0..105FF, U+107C0..107FF,
1291 U+108B0..108DF ...)
1292 \p{Block: Number_Forms} (64: U+2150..218F)
1293 \p{Block: Nushu} (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
1294 U+1B170..1B2FF)
1295 \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong}
1296 NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80:
1297 U+1E100..1E14F)
1298 \p{Block: OCR} \p{Block=Optical_Character_Recognition}
1299 (32)
1300 \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
1301 U+1680..169F)
1302 \p{Block: Ol_Chiki} (48: U+1C50..1C7F)
1303 \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
1304 \p{Is_Old_Hungarian}) (128:
1305 U+10C80..10CFF)
1306 \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
1307 (48: U+10300..1032F)
1308 \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
1309 \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
1310 (48: U+10350..1037F)
1311 \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR
1312 \p{Is_Old_Persian}) (64: U+103A0..103DF)
1313 \p{Block: Old_Sogdian} (NOT \p{Old_Sogdian} NOR
1314 \p{Is_Old_Sogdian}) (48: U+10F00..10F2F)
1315 \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
1316 \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
1317 (80: U+10C00..10C4F)
1318 \p{Block: Old_Uyghur} (NOT \p{Old_Uyghur} NOR \p{Is_Old_Uyghur})
1319 (64: U+10F70..10FAF)
1320 \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
1321 U+2440..245F)
1322 \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
1323 U+0B00..0B7F)
1324 \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
1325 \p{Block: Osage} (NOT \p{Osage} NOR \p{Is_Osage}) (80:
1326 U+104B0..104FF)
1327 \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
1328 U+10480..104AF)
1329 \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F)
1330 \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
1331 \p{Is_Pahawh_Hmong}) (144:
1332 U+16B00..16B8F)
1333 \p{Block: Palmyrene} (32: U+10860..1087F)
1334 \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} NOR
1335 \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
1336 \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
1337 (64: U+A840..A87F)
1338 \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
1339 \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
1340 U+101D0..101FF)
1341 \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician})
1342 (32: U+10900..1091F)
1343 \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
1344 \p{Block: Phonetic_Ext_Sup} \p{Block=
1345 Phonetic_Extensions_Supplement} (64)
1346 \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
1347 U+1D00..1D7F)
1348 \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
1349 PhoneticExtSup}) (64: U+1D80..1DBF)
1350 \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
1351 \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
1352 \p{Private_Use} NOR \p{Is_Private_Use})
1353 (6400)
1354 \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
1355 \p{Private_Use} NOR \p{Is_Private_Use})
1356 (6400: U+E000..F8FF)
1357 \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
1358 \p{Is_Psalter_Pahlavi}) (48:
1359 U+10B80..10BAF)
1360 \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
1361 \p{Private_Use} NOR \p{Is_Private_Use})
1362 (6400)
1363 \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
1364 \p{Punct} NOR \p{Is_Punctuation}) (112)
1365 \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
1366 U+A930..A95F)
1367 \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
1368 \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
1369 U+10E60..10E7F)
1370 \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96:
1371 U+16A0..16FF)
1372 \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan})
1373 (64: U+0800..083F)
1374 \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
1375 (96: U+A880..A8DF)
1376 \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
1377 U+11180..111DF)
1378 \p{Block: Shavian} (48: U+10450..1047F)
1379 \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
1380 \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
1381 U+11580..115FF)
1382 \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
1383 U+0D80..0DFF)
1384 \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
1385 \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
1386 U+FE50..FE6F)
1387 \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
1388 \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64)
1389 \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64:
1390 U+1B130..1B16F)
1391 \p{Block: Sogdian} (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64:
1392 U+10F30..10F6F)
1393 \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
1394 \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
1395 \p{Block: Soyombo} (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
1396 U+11A50..11AAF)
1397 \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
1398 ModifierLetters}) (80: U+02B0..02FF)
1399 \p{Block: Specials} (16: U+FFF0..FFFF)
1400 \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese})
1401 (64: U+1B80..1BBF)
1402 \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
1403 \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
1404 U+1CC0..1CCF)
1405 \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
1406 \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
1407 \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
1408 \p{Block: Sup_Math_Operators} \p{Block=
1409 Supplemental_Mathematical_Operators}
1410 (256)
1411 \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
1412 (65_536)
1413 \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
1414 (65_536)
1415 \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
1416 \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
1417 Supplemental_Symbols_And_Pictographs}
1418 (256)
1419 \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
1420 \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
1421 SuperAndSub}) (48: U+2070..209F)
1422 \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
1423 U+27F0..27FF)
1424 \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
1425 U+2900..297F)
1426 \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
1427 U+1F800..1F8FF)
1428 \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
1429 SupMathOperators}) (256: U+2A00..2AFF)
1430 \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
1431 SupPunctuation}) (128: U+2E00..2E7F)
1432 \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
1433 SupSymbolsAndPictographs}) (256:
1434 U+1F900..1F9FF)
1435 \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
1436 SupPUAA}) (65_536: U+F0000..FFFFF)
1437 \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
1438 SupPUAB}) (65_536: U+100000..10FFFF)
1439 \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
1440 \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
1441 \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
1442 \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block=
1443 Symbols_And_Pictographs_Extended_A} (144)
1444 \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk=
1445 SymbolsAndPictographsExtA}) (144:
1446 U+1FA70..1FAFF)
1447 \p{Block: Symbols_For_Legacy_Computing} (256: U+1FB00..1FBFF)
1448 \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
1449 U+0700..074F)
1450 \p{Block: Syriac_Sup} \p{Block=Syriac_Supplement} (16)
1451 \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
1452 U+0860..086F)
1453 \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
1454 U+1700..171F)
1455 \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
1456 (32: U+1760..177F)
1457 \p{Block: Tags} (128: U+E0000..E007F)
1458 \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
1459 U+1950..197F)
1460 \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
1461 (144: U+1A20..1AAF)
1462 \p{Block: Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
1463 (96: U+AA80..AADF)
1464 \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1465 \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
1466 U+1D300..1D35F)
1467 \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80:
1468 U+11680..116CF)
1469 \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
1470 U+0B80..0BFF)
1471 \p{Block: Tamil_Sup} \p{Block=Tamil_Supplement} (64)
1472 \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
1473 U+11FC0..11FFF)
1474 \p{Block: Tangsa} (NOT \p{Tangsa} NOR \p{Is_Tangsa}) (96:
1475 U+16A70..16ACF)
1476 \p{Block: Tangut} (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
1477 U+17000..187FF)
1478 \p{Block: Tangut_Components} (768: U+18800..18AFF)
1479 \p{Block: Tangut_Sup} \p{Block=Tangut_Supplement} (128)
1480 \p{Block: Tangut_Supplement} (Short: \p{Blk=TangutSup}) (128:
1481 U+18D00..18D7F)
1482 \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
1483 U+0C00..0C7F)
1484 \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
1485 U+0780..07BF)
1486 \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128:
1487 U+0E00..0E7F)
1488 \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
1489 U+0F00..0FFF)
1490 \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
1491 (80: U+2D30..2D7F)
1492 \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
1493 U+11480..114DF)
1494 \p{Block: Toto} (NOT \p{Toto} NOR \p{Is_Toto}) (48:
1495 U+1E290..1E2BF)
1496 \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1497 (128)
1498 \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1499 TransportAndMap}) (128: U+1F680..1F6FF)
1500 \p{Block: UCAS} \p{Block=
1501 Unified_Canadian_Aboriginal_Syllabics}
1502 (640)
1503 \p{Block: UCAS_Ext} \p{Block=
1504 Unified_Canadian_Aboriginal_Syllabics_-
1505 Extended} (80)
1506 \p{Block: UCAS_Ext_A} \p{Block=
1507 Unified_Canadian_Aboriginal_Syllabics_-
1508 Extended_A} (16)
1509 \p{Block: Ugaritic} (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
1510 (32: U+10380..1039F)
1511 \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1512 UCAS}) (640: U+1400..167F)
1513 \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1514 \p{Blk=UCASExt}) (80: U+18B0..18FF)
1515 \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended_A}
1516 (Short: \p{Blk=UCASExtA}) (16:
1517 U+11AB0..11ABF)
1518 \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320:
1519 U+A500..A63F)
1520 \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1521 \p{Variation_Selector} NOR \p{Is_VS})
1522 (16: U+FE00..FE0F)
1523 \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1524 (240: U+E0100..E01EF)
1525 \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
1526 \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
1527 U+1CD0..1CFF)
1528 \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
1529 \p{Block: Vithkuqi} (NOT \p{Vithkuqi} NOR \p{Is_Vithkuqi})
1530 (80: U+10570..105BF)
1531 \p{Block: VS} \p{Block=Variation_Selectors} (NOT
1532 \p{Variation_Selector} NOR \p{Is_VS})
1533 (16)
1534 \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
1535 (240)
1536 \p{Block: Wancho} (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
1537 U+1E2C0..1E2FF)
1538 \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR
1539 \p{Is_Warang_Citi}) (96: U+118A0..118FF)
1540 \p{Block: Yezidi} (NOT \p{Yezidi} NOR \p{Is_Yezidi}) (64:
1541 U+10E80..10EBF)
1542 \p{Block: Yi_Radicals} (64: U+A490..A4CF)
1543 \p{Block: Yi_Syllables} (1168: U+A000..A48F)
1544 \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
1545 \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
1546 U+4DC0..4DFF)
1547 \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
1548 \p{Is_Zanabazar_Square}) (80:
1549 U+11A00..11A4F)
1550 \p{Block: Znamenny_Music} \p{Block=Znamenny_Musical_Notation} (208)
1551 \p{Block: Znamenny_Musical_Notation} (Short: \p{Blk=
1552 ZnamennyMusic}) (208: U+1CF00..1CFCF)
1553 X \p{Block_Elements} \p{Block=Block_Elements} (32)
1554 \p{Bopo} \p{Bopomofo} (= \p{Script_Extensions=
1555 Bopomofo}) (NOT \p{Block=Bopomofo}) (117)
1556 \p{Bopomofo} \p{Script_Extensions=Bopomofo} (Short:
1557 \p{Bopo}; NOT \p{Block=Bopomofo}) (117)
1558 X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
1559 Bopomofo_Extended}) (32)
1560 X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
1561 \p{InBopomofoExt}) (32)
1562 X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
1563 \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
1564 \p{Brah} \p{Brahmi} (= \p{Script_Extensions=
1565 Brahmi}) (NOT \p{Block=Brahmi}) (115)
1566 \p{Brahmi} \p{Script_Extensions=Brahmi} (Short:
1567 \p{Brah}; NOT \p{Block=Brahmi}) (115)
1568 \p{Brai} \p{Braille} (= \p{Script_Extensions=
1569 Braille}) (256)
1570 \p{Braille} \p{Script_Extensions=Braille} (Short:
1571 \p{Brai}) (256)
1572 X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
1573 \p{InBraille}) (256)
1574 \p{Bugi} \p{Buginese} (= \p{Script_Extensions=
1575 Buginese}) (NOT \p{Block=Buginese}) (31)
1576 \p{Buginese} \p{Script_Extensions=Buginese} (Short:
1577 \p{Bugi}; NOT \p{Block=Buginese}) (31)
1578 \p{Buhd} \p{Buhid} (= \p{Script_Extensions=Buhid})
1579 (NOT \p{Block=Buhid}) (22)
1580 \p{Buhid} \p{Script_Extensions=Buhid} (Short:
1581 \p{Buhd}; NOT \p{Block=Buhid}) (22)
1582 X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
1583 Byzantine_Musical_Symbols}) (256)
1584 X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1585 (Short: \p{InByzantineMusic}) (256)
1586 \p{C} \pC \p{Other} (= \p{General_Category=Other})
1587 (965_096 plus all above-Unicode code
1588 points)
1589 \p{Cakm} \p{Chakma} (= \p{Script_Extensions=
1590 Chakma}) (NOT \p{Block=Chakma}) (91)
1591 \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1592 (Short: \p{Cans}) (726)
1593 X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
1594 (= \p{Block=
1595 Unified_Canadian_Aboriginal_Syllabics})
1596 (640)
1597 T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1598 Not_Reordered} (1_113_190 plus all
1599 above-Unicode code points)
1600 T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1601 Overlay} (32)
1602 T \p{Canonical_Combining_Class: 6} \p{Canonical_Combining_Class=
1603 Han_Reading} (2)
1604 T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1605 Nukta} (27)
1606 T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1607 Kana_Voicing} (2)
1608 T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1609 Virama} (65)
1610 T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1611 CCC10} (1)
1612 \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
1613 U+05B0)
1614 T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1615 CCC11} (1)
1616 \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
1617 U+05B1)
1618 T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1619 CCC12} (1)
1620 \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
1621 U+05B2)
1622 T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1623 CCC13} (1)
1624 \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
1625 U+05B3)
1626 T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1627 CCC14} (1)
1628 \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
1629 U+05B4)
1630 T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1631 CCC15} (1)
1632 \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
1633 U+05B5)
1634 T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1635 CCC16} (1)
1636 \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
1637 U+05B6)
1638 T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1639 CCC17} (1)
1640 \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
1641 U+05B7)
1642 T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1643 CCC18} (2)
1644 \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
1645 U+05B8, U+05C7)
1646 T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1647 CCC19} (2)
1648 \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
1649 U+05B9..05BA)
1650 T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1651 CCC20} (1)
1652 \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
1653 U+05BB)
1654 T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1655 CCC21} (1)
1656 \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
1657 U+05BC)
1658 T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1659 CCC22} (1)
1660 \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
1661 U+05BD)
1662 T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1663 CCC23} (1)
1664 \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
1665 U+05BF)
1666 T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1667 CCC24} (1)
1668 \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
1669 U+05C1)
1670 T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1671 CCC25} (1)
1672 \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
1673 U+05C2)
1674 T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1675 CCC26} (1)
1676 \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
1677 U+FB1E)
1678 T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1679 CCC27} (2)
1680 \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
1681 U+064B, U+08F0)
1682 T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1683 CCC28} (2)
1684 \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
1685 U+064C, U+08F1)
1686 T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1687 CCC29} (2)
1688 \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
1689 U+064D, U+08F2)
1690 T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1691 CCC30} (2)
1692 \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
1693 U+0618, U+064E)
1694 T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1695 CCC31} (2)
1696 \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
1697 U+0619, U+064F)
1698 T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1699 CCC32} (2)
1700 \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
1701 U+061A, U+0650)
1702 T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1703 CCC33} (1)
1704 \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
1705 U+0651)
1706 T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1707 CCC34} (1)
1708 \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
1709 U+0652)
1710 T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1711 CCC35} (1)
1712 \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
1713 U+0670)
1714 T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1715 CCC36} (1)
1716 \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
1717 U+0711)
1718 T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1719 CCC84} (1)
1720 \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
1721 U+0C55)
1722 T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1723 CCC91} (1)
1724 \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
1725 U+0C56)
1726 T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1727 CCC103} (2)
1728 \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
1729 U+0E38..0E39)
1730 T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1731 CCC107} (4)
1732 \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
1733 U+0E48..0E4B)
1734 T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1735 CCC118} (2)
1736 \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
1737 U+0EB8..0EB9)
1738 T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1739 CCC122} (4)
1740 \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
1741 U+0EC8..0ECB)
1742 T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1743 CCC129} (1)
1744 \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
1745 U+0F71)
1746 T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1747 CCC130} (6)
1748 \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
1749 U+0F72, U+0F7A..0F7D, U+0F80)
1750 T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1751 CCC132} (1)
1752 \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
1753 U+0F74)
1754 T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1755 CCC133} (0)
1756 \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1757 T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1758 Attached_Below_Left} (0)
1759 T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1760 Attached_Below} (5)
1761 T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1762 Attached_Above} (1)
1763 T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1764 Attached_Above_Right} (9)
1765 T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1766 Below_Left} (2)
1767 T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1768 Below} (181)
1769 T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1770 Below_Right} (4)
1771 T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1772 Left} (2)
1773 T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1774 Right} (1)
1775 T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1776 Above_Left} (5)
1777 T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1778 Above} (510)
1779 T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1780 Above_Right} (7)
1781 T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1782 Double_Below} (4)
1783 T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1784 Double_Above} (5)
1785 T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1786 Iota_Subscript} (1)
1787 \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1788 Above} (510)
1789 \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (510:
1790 U+0300..0314, U+033D..0344, U+0346,
1791 U+034A..034C, U+0350..0352, U+0357 ...)
1792 \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
1793 U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
1794 \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (7:
1795 U+0315, U+031A, U+0358, U+1DF6, U+302C,
1796 U+1E4EC..1E4ED)
1797 \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1798 Above_Left} (5)
1799 \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1800 Above_Right} (7)
1801 \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1802 Attached_Above} (1)
1803 \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1804 Attached_Above_Right} (9)
1805 \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1806 Attached_Below} (5)
1807 \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1808 Attached_Below_Left} (0)
1809 \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1810 (1: U+1DCE)
1811 \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1812 \p{Ccc=ATAR}) (9: U+031B, U+0F39,
1813 U+1D165..1D166, U+1D16E..1D172)
1814 \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1815 (5: U+0321..0322, U+0327..0328, U+1DD0)
1816 \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1817 ATBL}) (0)
1818 \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1819 Below} (181)
1820 \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (181:
1821 U+0316..0319, U+031C..0320,
1822 U+0323..0326, U+0329..0333,
1823 U+0339..033C, U+0347..0349 ...)
1824 \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (2:
1825 U+1DFA, U+302A)
1826 \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
1827 U+059A, U+05AD, U+1939, U+302D)
1828 \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1829 Below_Left} (2)
1830 \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1831 Below_Right} (4)
1832 \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1833 Double_Above} (5)
1834 \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1835 Double_Below} (4)
1836 \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
1837 (5: U+035D..035E, U+0360..0361, U+1DCD)
1838 \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
1839 (4: U+035C, U+035F, U+0362, U+1DFC)
1840 \p{Canonical_Combining_Class: Han_Reading} (Short: \p{Ccc=HANR})
1841 (2: U+16FF0..16FF1)
1842 \p{Canonical_Combining_Class: HANR} \p{Canonical_Combining_Class=
1843 Han_Reading} (2)
1844 \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1845 (1: U+0345)
1846 \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1847 Iota_Subscript} (1)
1848 \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
1849 (2: U+3099..309A)
1850 \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1851 Kana_Voicing} (2)
1852 \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1853 Left} (2)
1854 \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
1855 U+302E..302F)
1856 \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1857 Nukta} (27)
1858 \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1859 (1_113_190 plus all above-Unicode code
1860 points: U+0000..02FF, U+034F,
1861 U+0370..0482, U+0488..0590, U+05BE,
1862 U+05C0 ...)
1863 \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1864 Not_Reordered} (1_113_190 plus all
1865 above-Unicode code points)
1866 \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (27:
1867 U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
1868 U+0C3C ...)
1869 \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1870 Overlay} (32)
1871 \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
1872 U+0334..0338, U+1CD4, U+1CE2..1CE8,
1873 U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
1874 ...)
1875 \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1876 Right} (1)
1877 \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
1878 U+1D16D)
1879 \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (65:
1880 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
1881 U+0BCD ...)
1882 \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1883 Virama} (65)
1884 \p{Cans} \p{Canadian_Aboriginal} (=
1885 \p{Script_Extensions=
1886 Canadian_Aboriginal}) (726)
1887 \p{Cari} \p{Carian} (= \p{Script_Extensions=
1888 Carian}) (NOT \p{Block=Carian}) (49)
1889 \p{Carian} \p{Script_Extensions=Carian} (Short:
1890 \p{Cari}; NOT \p{Block=Carian}) (49)
1891 \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2707)
1892 \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_405 plus
1893 all above-Unicode code points: [\x00-
1894 \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-Z
1895 \[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-\xac\xae
1896 \xb0-\xb3\xb5-\xb6\xb9-\xff],
1897 U+0100..02AF, U+0370..0373,
1898 U+0376..0379, U+037B..0383, U+0386 ...)
1899 \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2707: [\'.:\^`
1900 \xa8\xad\xaf\xb4\xb7-\xb8],
1901 U+02B0..036F, U+0374..0375, U+037A,
1902 U+0384..0385, U+0387 ...)
1903 \p{Cased} \p{Cased=Y} (4526)
1904 \p{Cased: N*} (Single: \P{Cased}) (1_109_586 plus all
1905 above-Unicode code points: [\x00-\x20!
1906 \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
1907 \^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9
1908 \xbb-\xbf\xd7\xf7], U+01BB,
1909 U+01C0..01C3, U+0294, U+02B9..02BF,
1910 U+02C2..02DF ...)
1911 \p{Cased: Y*} (Single: \p{Cased}) (4526: [A-Za-z\xaa
1912 \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
1913 U+0100..01BA, U+01BC..01BF,
1914 U+01C4..0293, U+0295..02B8, U+02C0..02C1
1915 ...)
1916 \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
1917 \p{LC}) (4095)
1918 \p{Category: *} \p{General_Category: *}
1919 \p{Caucasian_Albanian} \p{Script_Extensions=Caucasian_Albanian}
1920 (Short: \p{Aghb}; NOT \p{Block=
1921 Caucasian_Albanian}) (53)
1922 \p{Cc} \p{XPosixCntrl} (= \p{General_Category=
1923 Control}) (65)
1924 \p{Ccc: *} \p{Canonical_Combining_Class: *}
1925 \p{CE} \p{Composition_Exclusion} (=
1926 \p{Composition_Exclusion=Y}) (81)
1927 \p{CE: *} \p{Composition_Exclusion: *}
1928 \p{Cf} \p{Format} (= \p{General_Category=Format})
1929 (170)
1930 \p{Chakma} \p{Script_Extensions=Chakma} (Short:
1931 \p{Cakm}; NOT \p{Block=Chakma}) (91)
1932 \p{Cham} \p{Script_Extensions=Cham} (NOT \p{Block=
1933 Cham}) (83)
1934 \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
1935 \p{CWCF}) (1506)
1936 \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
1937 (1_112_606 plus all above-Unicode code
1938 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1939 \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1940 \xb4\xb6-\xbf\xd7\xe0-\xff], U+0101,
1941 U+0103, U+0105, U+0107, U+0109 ...)
1942 \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
1943 (1506: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
1944 U+0100, U+0102, U+0104, U+0106, U+0108
1945 ...)
1946 \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
1947 \p{CWCM}) (2927)
1948 \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
1949 (1_111_185 plus all above-Unicode code
1950 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1951 \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xb4
1952 \xb6-\xbf\xd7\xf7], U+0138, U+018D,
1953 U+019B, U+01AA..01AB, U+01BA..01BB ...)
1954 \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
1955 (2927: [A-Za-z\xb5\xc0-\xd6\xd8-\xf6
1956 \xf8-\xff], U+0100..0137, U+0139..018C,
1957 U+018E..019A, U+019C..01A9, U+01AC..01B9
1958 ...)
1959 \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
1960 \p{CWL}) (1433)
1961 \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
1962 (1_112_679 plus all above-Unicode code
1963 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1964 \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1965 \xbf\xd7\xdf-\xff], U+0101, U+0103,
1966 U+0105, U+0107, U+0109 ...)
1967 \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1433:
1968 [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
1969 U+0104, U+0106, U+0108 ...)
1970 \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
1971 Y} (Short: \p{CWKCF}) (10_491)
1972 \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
1973 \P{CWKCF}) (1_103_621 plus all above-
1974 Unicode code points: [\x00-\x20!\"#\$
1975 \%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-
1976 z\{\|\}~\x7f-\x9f\xa1-\xa7\xa9\xab-\xac
1977 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xd7\xe0-
1978 \xff], U+0101, U+0103, U+0105, U+0107,
1979 U+0109 ...)
1980 \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
1981 \p{CWKCF}) (10_491: [A-Z\xa0\xa8\xaa
1982 \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
1983 \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
1984 U+0106, U+0108 ...)
1985 \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
1986 \p{CWT}) (1452)
1987 \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
1988 (1_112_660 plus all above-Unicode code
1989 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1990 \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
1991 \xb4\xb6-\xde\xf7], U+0100, U+0102,
1992 U+0104, U+0106, U+0108 ...)
1993 \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1452:
1994 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1995 U+0103, U+0105, U+0107, U+0109 ...)
1996 \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
1997 \p{CWU}) (1525)
1998 \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
1999 (1_112_587 plus all above-Unicode code
2000 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
2001 \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
2002 \xb4\xb6-\xde\xf7], U+0100, U+0102,
2003 U+0104, U+0106, U+0108 ...)
2004 \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1525:
2005 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
2006 U+0103, U+0105, U+0107, U+0109 ...)
2007 \p{Cher} \p{Cherokee} (= \p{Script_Extensions=
2008 Cherokee}) (NOT \p{Block=Cherokee}) (172)
2009 \p{Cherokee} \p{Script_Extensions=Cherokee} (Short:
2010 \p{Cher}; NOT \p{Block=Cherokee}) (172)
2011 X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block=
2012 Cherokee_Supplement}) (80)
2013 X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
2014 \p{InCherokeeSup}) (80)
2015 X \p{Chess_Symbols} \p{Block=Chess_Symbols} (112)
2016 \p{Chorasmian} \p{Script_Extensions=Chorasmian} (Short:
2017 \p{Chrs}; NOT \p{Block=Chorasmian}) (28)
2018 \p{Chrs} \p{Chorasmian} (= \p{Script_Extensions=
2019 Chorasmian}) (NOT \p{Block=Chorasmian})
2020 (28)
2021 \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
2022 Y}) (2707)
2023 \p{CI: *} \p{Case_Ignorable: *}
2024 X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
2025 CJK_Unified_Ideographs}) (20_992)
2026 X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
2027 CJK_Compatibility}) (256)
2028 X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
2029 CJK_Compatibility_Forms}) (32)
2030 X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
2031 \p{Block=CJK_Compatibility_Ideographs})
2032 (512)
2033 X \p{CJK_Compat_Ideographs_Sup}
2034 \p{CJK_Compatibility_Ideographs_-
2035 Supplement} (= \p{Block=
2036 CJK_Compatibility_Ideographs_-
2037 Supplement}) (544)
2038 X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
2039 \p{InCJKCompat}) (256)
2040 X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
2041 (Short: \p{InCJKCompatForms}) (32)
2042 X \p{CJK_Compatibility_Ideographs} \p{Block=
2043 CJK_Compatibility_Ideographs} (Short:
2044 \p{InCJKCompatIdeographs}) (512)
2045 X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
2046 CJK_Compatibility_Ideographs_Supplement}
2047 (Short: \p{InCJKCompatIdeographsSup})
2048 (544)
2049 X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
2050 \p{Block=
2051 CJK_Unified_Ideographs_Extension_A})
2052 (6592)
2053 X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
2054 \p{Block=
2055 CJK_Unified_Ideographs_Extension_B})
2056 (42_720)
2057 X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
2058 \p{Block=
2059 CJK_Unified_Ideographs_Extension_C})
2060 (4160)
2061 X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
2062 \p{Block=
2063 CJK_Unified_Ideographs_Extension_D})
2064 (224)
2065 X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (=
2066 \p{Block=
2067 CJK_Unified_Ideographs_Extension_E})
2068 (5776)
2069 X \p{CJK_Ext_F} \p{CJK_Unified_Ideographs_Extension_F} (=
2070 \p{Block=
2071 CJK_Unified_Ideographs_Extension_F})
2072 (7488)
2073 X \p{CJK_Ext_G} \p{CJK_Unified_Ideographs_Extension_G} (=
2074 \p{Block=
2075 CJK_Unified_Ideographs_Extension_G})
2076 (4944)
2077 X \p{CJK_Ext_H} \p{CJK_Unified_Ideographs_Extension_H} (=
2078 \p{Block=
2079 CJK_Unified_Ideographs_Extension_H})
2080 (4192)
2081 X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
2082 CJK_Radicals_Supplement}) (128)
2083 X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
2084 (Short: \p{InCJKRadicalsSup}) (128)
2085 X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
2086 X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
2087 \p{Block=CJK_Symbols_And_Punctuation})
2088 (64)
2089 X \p{CJK_Symbols_And_Punctuation} \p{Block=
2090 CJK_Symbols_And_Punctuation} (Short:
2091 \p{InCJKSymbols}) (64)
2092 X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
2093 (Short: \p{InCJK}) (20_992)
2094 X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
2095 CJK_Unified_Ideographs_Extension_A}
2096 (Short: \p{InCJKExtA}) (6592)
2097 X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
2098 CJK_Unified_Ideographs_Extension_B}
2099 (Short: \p{InCJKExtB}) (42_720)
2100 X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
2101 CJK_Unified_Ideographs_Extension_C}
2102 (Short: \p{InCJKExtC}) (4160)
2103 X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
2104 CJK_Unified_Ideographs_Extension_D}
2105 (Short: \p{InCJKExtD}) (224)
2106 X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
2107 CJK_Unified_Ideographs_Extension_E}
2108 (Short: \p{InCJKExtE}) (5776)
2109 X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
2110 CJK_Unified_Ideographs_Extension_F}
2111 (Short: \p{InCJKExtF}) (7488)
2112 X \p{CJK_Unified_Ideographs_Extension_G} \p{Block=
2113 CJK_Unified_Ideographs_Extension_G}
2114 (Short: \p{InCJKExtG}) (4944)
2115 X \p{CJK_Unified_Ideographs_Extension_H} \p{Block=
2116 CJK_Unified_Ideographs_Extension_H}
2117 (Short: \p{InCJKExtH}) (4192)
2118 \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
2119 (Short: \p{Pe}) (77)
2120 \p{Cn} \p{Unassigned} (= \p{General_Category=
2121 Unassigned}) (825_345 plus all above-
2122 Unicode code points)
2123 \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
2124 Control}) (65)
2125 \p{Co} \p{Private_Use} (= \p{General_Category=
2126 Private_Use}) (NOT \p{Private_Use_Area})
2127 (137_468)
2128 X \p{Combining_Diacritical_Marks} \p{Block=
2129 Combining_Diacritical_Marks} (Short:
2130 \p{InDiacriticals}) (112)
2131 X \p{Combining_Diacritical_Marks_Extended} \p{Block=
2132 Combining_Diacritical_Marks_Extended}
2133 (Short: \p{InDiacriticalsExt}) (80)
2134 X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
2135 Combining_Diacritical_Marks_For_Symbols}
2136 (Short: \p{InDiacriticalsForSymbols})
2137 (48)
2138 X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
2139 Combining_Diacritical_Marks_Supplement}
2140 (Short: \p{InDiacriticalsSup}) (64)
2141 X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
2142 \p{InHalfMarks}) (16)
2143 \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
2144 (2450)
2145 X \p{Combining_Marks_For_Symbols}
2146 \p{Combining_Diacritical_Marks_For_-
2147 Symbols} (= \p{Block=
2148 Combining_Diacritical_Marks_For_-
2149 Symbols}) (48)
2150 \p{Common} \p{Script_Extensions=Common} (Short:
2151 \p{Zyyy}) (7873)
2152 X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
2153 (Short: \p{InIndicNumberForms}) (16)
2154 \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
2155 \p{Full_Composition_Exclusion=Y}) (1120)
2156 \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
2157 X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
2158 Hangul_Compatibility_Jamo}) (96)
2159 \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
2160 \p{CE}) (81)
2161 \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
2162 plus all above-Unicode code points:
2163 U+0000..0957, U+0960..09DB, U+09DE,
2164 U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
2165 ...)
2166 \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
2167 U+0958..095F, U+09DC..09DD, U+09DF,
2168 U+0A33, U+0A36, U+0A59..0A5B ...)
2169 \p{Connector_Punctuation} \p{General_Category=
2170 Connector_Punctuation} (Short: \p{Pc})
2171 (10)
2172 \p{Control} \p{XPosixCntrl} (= \p{General_Category=
2173 Control}) (65)
2174 X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
2175 \p{Copt} \p{Coptic} (= \p{Script_Extensions=
2176 Coptic}) (NOT \p{Block=Coptic}) (165)
2177 \p{Coptic} \p{Script_Extensions=Coptic} (Short:
2178 \p{Copt}; NOT \p{Block=Coptic}) (165)
2179 X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
2180 X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
2181 Counting_Rod_Numerals}) (32)
2182 X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
2183 \p{InCountingRod}) (32)
2184 \p{Cpmn} \p{Cypro_Minoan} (= \p{Script_Extensions=
2185 Cypro_Minoan}) (NOT \p{Block=
2186 Cypro_Minoan}) (101)
2187 \p{Cprt} \p{Cypriot} (= \p{Script_Extensions=
2188 Cypriot}) (112)
2189 \p{Cs} \p{Surrogate} (= \p{General_Category=
2190 Surrogate}) (2048)
2191 \p{Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
2192 \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
2193 X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
2194 \p{Block=
2195 Cuneiform_Numbers_And_Punctuation}) (128)
2196 X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
2197 Cuneiform_Numbers_And_Punctuation}
2198 (Short: \p{InCuneiformNumbers}) (128)
2199 \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
2200 (Short: \p{Sc}) (63)
2201 X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
2202 \p{CWCF} \p{Changes_When_Casefolded} (=
2203 \p{Changes_When_Casefolded=Y}) (1506)
2204 \p{CWCF: *} \p{Changes_When_Casefolded: *}
2205 \p{CWCM} \p{Changes_When_Casemapped} (=
2206 \p{Changes_When_Casemapped=Y}) (2927)
2207 \p{CWCM: *} \p{Changes_When_Casemapped: *}
2208 \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
2209 \p{Changes_When_NFKC_Casefolded=Y})
2210 (10_491)
2211 \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
2212 \p{CWL} \p{Changes_When_Lowercased} (=
2213 \p{Changes_When_Lowercased=Y}) (1433)
2214 \p{CWL: *} \p{Changes_When_Lowercased: *}
2215 \p{CWT} \p{Changes_When_Titlecased} (=
2216 \p{Changes_When_Titlecased=Y}) (1452)
2217 \p{CWT: *} \p{Changes_When_Titlecased: *}
2218 \p{CWU} \p{Changes_When_Uppercased} (=
2219 \p{Changes_When_Uppercased=Y}) (1525)
2220 \p{CWU: *} \p{Changes_When_Uppercased: *}
2221 \p{Cypriot} \p{Script_Extensions=Cypriot} (Short:
2222 \p{Cprt}) (112)
2223 X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
2224 \p{Cypro_Minoan} \p{Script_Extensions=Cypro_Minoan} (Short:
2225 \p{Cpmn}; NOT \p{Block=Cypro_Minoan})
2226 (101)
2227 \p{Cyrillic} \p{Script_Extensions=Cyrillic} (Short:
2228 \p{Cyrl}; NOT \p{Block=Cyrillic}) (510)
2229 X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
2230 Cyrillic_Extended_A}) (32)
2231 X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
2232 Cyrillic_Extended_B}) (96)
2233 X \p{Cyrillic_Ext_C} \p{Cyrillic_Extended_C} (= \p{Block=
2234 Cyrillic_Extended_C}) (16)
2235 X \p{Cyrillic_Ext_D} \p{Cyrillic_Extended_D} (= \p{Block=
2236 Cyrillic_Extended_D}) (96)
2237 X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
2238 \p{InCyrillicExtA}) (32)
2239 X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
2240 \p{InCyrillicExtB}) (96)
2241 X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
2242 \p{InCyrillicExtC}) (16)
2243 X \p{Cyrillic_Extended_D} \p{Block=Cyrillic_Extended_D} (Short:
2244 \p{InCyrillicExtD}) (96)
2245 X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
2246 Cyrillic_Supplement}) (48)
2247 X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
2248 \p{InCyrillicSup}) (48)
2249 X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
2250 Cyrillic_Supplement}) (48)
2251 \p{Cyrl} \p{Cyrillic} (= \p{Script_Extensions=
2252 Cyrillic}) (NOT \p{Block=Cyrillic}) (510)
2253 \p{Dash} \p{Dash=Y} (30)
2254 \p{Dash: N*} (Single: \P{Dash}) (1_114_082 plus all
2255 above-Unicode code points: [\x00-\x20!
2256 \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
2257 \[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2258 U+0100..0589, U+058B..05BD,
2259 U+05BF..13FF, U+1401..1805, U+1807..200F
2260 ...)
2261 \p{Dash: Y*} (Single: \p{Dash}) (30: [\-], U+058A,
2262 U+05BE, U+1400, U+1806, U+2010..2015 ...)
2263 \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
2264 (Short: \p{Pd}) (26)
2265 \p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category=
2266 Decimal_Number}) (680)
2267 \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
2268 (13_233)
2269 \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_233:
2270 [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd
2271 \xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-\xfd
2272 \xff], U+0100..010F, U+0112..0125,
2273 U+0128..0130, U+0134..0137, U+0139..013E
2274 ...)
2275 \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
2276 U+2460..2473, U+24B6..24EA,
2277 U+3244..3247, U+3251..327E,
2278 U+3280..32BF, U+32D0..32FE ...)
2279 \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
2280 \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720: [\xa8
2281 \xaf\xb4-\xb5\xb8], U+0132..0133,
2282 U+013F..0140, U+0149, U+017F,
2283 U+01C4..01CC ...)
2284 \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
2285 \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
2286 \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
2287 U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
2288 ...)
2289 \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1194: U+2102,
2290 U+210A..2113, U+2115, U+2119..211D,
2291 U+2124, U+2128 ...)
2292 \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
2293 \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
2294 \xbe], U+2150..215F, U+2189)
2295 \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
2296 \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
2297 U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
2298 ...)
2299 \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
2300 \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
2301 U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
2302 ...)
2303 \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
2304 \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
2305 U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
2306 ...)
2307 \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
2308 \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
2309 U+FF61..FFBE, U+FFC2..FFC7,
2310 U+FFCA..FFCF, U+FFD2..FFD7,
2311 U+FFDA..FFDC, U+FFE8..FFEE)
2312 \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
2313 \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
2314 U+0F0C, U+2007, U+2011, U+202F)
2315 \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
2316 Non_Canonical} (Perl extension) (3796)
2317 \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
2318 decompositions (Short: \p{Dt=NonCanon})
2319 (Perl extension) (3796: [\xa0\xa8\xaa
2320 \xaf\xb2-\xb5\xb8-\xba\xbc-\xbe],
2321 U+0132..0133, U+013F..0140, U+0149,
2322 U+017F, U+01C4..01CC ...)
2323 \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_083 plus
2324 all above-Unicode code points: [\x00-
2325 \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
2326 \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-\xdf
2327 \xe6\xf0\xf7-\xf8\xfe], U+0110..0111,
2328 U+0126..0127, U+0131, U+0138,
2329 U+0141..0142 ...)
2330 \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
2331 U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
2332 \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
2333 \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
2334 \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
2335 U+32CC..32CF, U+32FF..3357,
2336 U+3371..33DF, U+33FF, U+1F130..1F14F ...)
2337 \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (64: U+1D62..1D6A,
2338 U+2080..208E, U+2090..209C, U+2C7C,
2339 U+1E051..1E06A)
2340 \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (249)
2341 \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (249: [\xaa\xb2-
2342 \xb3\xb9-\xba], U+02B0..02B8,
2343 U+02E0..02E4, U+10FC, U+1D2C..1D2E,
2344 U+1D30..1D3A ...)
2345 \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
2346 \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
2347 U+30FF, U+FE10..FE19, U+FE30..FE44,
2348 U+FE47..FE48)
2349 \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
2350 U+FF01..FF60, U+FFE0..FFE6)
2351 \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
2352 Y} (Short: \p{DI}) (4174)
2353 \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
2354 (1_109_938 plus all above-Unicode code
2355 points: [\x00-\xac\xae-\xff],
2356 U+0100..034E, U+0350..061B,
2357 U+061D..115E, U+1161..17B3, U+17B6..180A
2358 ...)
2359 \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
2360 (4174: [\xad], U+034F, U+061C,
2361 U+115F..1160, U+17B4..17B5, U+180B..180F
2362 ...)
2363 \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (15)
2364 \p{Dep: *} \p{Deprecated: *}
2365 \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (15)
2366 \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_097
2367 plus all above-Unicode code points:
2368 U+0000..0148, U+014A..0672,
2369 U+0674..0F76, U+0F78, U+0F7A..17A2,
2370 U+17A5..2069 ...)
2371 \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
2372 U+0673, U+0F77, U+0F79, U+17A3..17A4,
2373 U+206A..206F ...)
2374 \p{Deseret} \p{Script_Extensions=Deseret} (Short:
2375 \p{Dsrt}) (80)
2376 \p{Deva} \p{Devanagari} (= \p{Script_Extensions=
2377 Devanagari}) (NOT \p{Block=Devanagari})
2378 (220)
2379 \p{Devanagari} \p{Script_Extensions=Devanagari} (Short:
2380 \p{Deva}; NOT \p{Block=Devanagari}) (220)
2381 X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
2382 Devanagari_Extended}) (32)
2383 X \p{Devanagari_Ext_A} \p{Devanagari_Extended_A} (= \p{Block=
2384 Devanagari_Extended_A}) (96)
2385 X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
2386 \p{InDevanagariExt}) (32)
2387 X \p{Devanagari_Extended_A} \p{Block=Devanagari_Extended_A} (Short:
2388 \p{InDevanagariExtA}) (96)
2389 \p{DI} \p{Default_Ignorable_Code_Point} (=
2390 \p{Default_Ignorable_Code_Point=Y})
2391 (4174)
2392 \p{DI: *} \p{Default_Ignorable_Code_Point: *}
2393 \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (1144)
2394 \p{Dia: *} \p{Diacritic: *}
2395 \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (1144)
2396 \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_112_968
2397 plus all above-Unicode code points:
2398 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2399 >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
2400 \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
2401 U+0100..02AF, U+034F, U+0358..035C,
2402 U+0363..0373, U+0376..0379 ...)
2403 \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (1144: [\^`
2404 \xa8\xaf\xb4\xb7-\xb8], U+02B0..034E,
2405 U+0350..0357, U+035D..0362,
2406 U+0374..0375, U+037A ...)
2407 X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
2408 \p{Block=Combining_Diacritical_Marks})
2409 (112)
2410 X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended}
2411 (= \p{Block=
2412 Combining_Diacritical_Marks_Extended})
2413 (80)
2414 X \p{Diacriticals_For_Symbols}
2415 \p{Combining_Diacritical_Marks_For_-
2416 Symbols} (= \p{Block=
2417 Combining_Diacritical_Marks_For_-
2418 Symbols}) (48)
2419 X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
2420 (= \p{Block=
2421 Combining_Diacritical_Marks_Supplement})
2422 (64)
2423 \p{Diak} \p{Dives_Akuru} (= \p{Script_Extensions=
2424 Dives_Akuru}) (NOT \p{Block=
2425 Dives_Akuru}) (72)
2426 \p{Digit} \p{XPosixDigit} (= \p{General_Category=
2427 Decimal_Number}) (680)
2428 X \p{Dingbats} \p{Block=Dingbats} (192)
2429 \p{Dives_Akuru} \p{Script_Extensions=Dives_Akuru} (Short:
2430 \p{Diak}; NOT \p{Block=Dives_Akuru}) (72)
2431 \p{Dogr} \p{Dogra} (= \p{Script_Extensions=Dogra})
2432 (NOT \p{Block=Dogra}) (82)
2433 \p{Dogra} \p{Script_Extensions=Dogra} (Short:
2434 \p{Dogr}; NOT \p{Block=Dogra}) (82)
2435 X \p{Domino} \p{Domino_Tiles} (= \p{Block=
2436 Domino_Tiles}) (112)
2437 X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
2438 \p{InDomino}) (112)
2439 \p{Dsrt} \p{Deseret} (= \p{Script_Extensions=
2440 Deseret}) (80)
2441 \p{Dt: *} \p{Decomposition_Type: *}
2442 \p{Dupl} \p{Duployan} (= \p{Script_Extensions=
2443 Duployan}) (NOT \p{Block=Duployan}) (147)
2444 \p{Duployan} \p{Script_Extensions=Duployan} (Short:
2445 \p{Dupl}; NOT \p{Block=Duployan}) (147)
2446 \p{Ea: *} \p{East_Asian_Width: *}
2447 X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
2448 (208)
2449 \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
2450 \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739: [\xa1
2451 \xa4\xa7-\xa8\xaa\xad-\xae\xb0-\xb4\xb6-
2452 \xba\xbc-\xbf\xc6\xd0\xd7-\xd8\xde-\xe1
2453 \xe6\xe8-\xea\xec-\xed\xf0\xf2-\xf3\xf7-
2454 \xfa\xfc\xfe], U+0101, U+0111, U+0113,
2455 U+011B, U+0126..0127 ...)
2456 \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
2457 \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
2458 U+FF01..FF60, U+FFE0..FFE6)
2459 \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
2460 \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
2461 U+FF61..FFBE, U+FFC2..FFC7,
2462 U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
2463 ...)
2464 \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (792_623 plus
2465 all above-Unicode code points)
2466 \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
2467 \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-\x7e
2468 \xa2-\xa3\xa5-\xa6\xac\xaf],
2469 U+27E6..27ED, U+2985..2986)
2470 \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (792_623 plus all
2471 above-Unicode code points: [\x00-\x1f
2472 \x7f-\xa0\xa9\xab\xb5\xbb\xc0-\xc5\xc7-
2473 \xcf\xd1-\xd6\xd9-\xdd\xe2-\xe5\xe7\xeb
2474 \xee-\xef\xf1\xf4-\xf6\xfb\xfd\xff],
2475 U+00FF..0100, U+0102..0110, U+0112,
2476 U+0114..011A, U+011C..0125 ...)
2477 \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (182_412)
2478 \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (182_412:
2479 U+1100..115F, U+231A..231B,
2480 U+2329..232A, U+23E9..23EC, U+23F0,
2481 U+23F3 ...)
2482 \p{EBase} \p{Emoji_Modifier_Base} (=
2483 \p{Emoji_Modifier_Base=Y}) (134)
2484 \p{EBase: *} \p{Emoji_Modifier_Base: *}
2485 \p{EComp} \p{Emoji_Component} (= \p{Emoji_Component=
2486 Y}) (146)
2487 \p{EComp: *} \p{Emoji_Component: *}
2488 \p{Egyp} \p{Egyptian_Hieroglyphs} (=
2489 \p{Script_Extensions=
2490 Egyptian_Hieroglyphs}) (NOT \p{Block=
2491 Egyptian_Hieroglyphs}) (1110)
2492 X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
2493 Egyptian_Hieroglyph_Format_Controls} (48)
2494 \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
2495 Egyptian_Hieroglyphs} (Short: \p{Egyp};
2496 NOT \p{Block=Egyptian_Hieroglyphs})
2497 (1110)
2498 \p{Elba} \p{Elbasan} (= \p{Script_Extensions=
2499 Elbasan}) (NOT \p{Block=Elbasan}) (40)
2500 \p{Elbasan} \p{Script_Extensions=Elbasan} (Short:
2501 \p{Elba}; NOT \p{Block=Elbasan}) (40)
2502 \p{Elym} \p{Elymaic} (= \p{Script_Extensions=
2503 Elymaic}) (NOT \p{Block=Elymaic}) (23)
2504 \p{Elymaic} \p{Script_Extensions=Elymaic} (Short:
2505 \p{Elym}; NOT \p{Block=Elymaic}) (23)
2506 \p{EMod} \p{Emoji_Modifier} (= \p{Emoji_Modifier=
2507 Y}) (5)
2508 \p{EMod: *} \p{Emoji_Modifier: *}
2509 \p{Emoji} \p{Emoji=Y} (1424)
2510 \p{Emoji: N*} (Single: \P{Emoji}) (1_112_688 plus all
2511 above-Unicode code points: [\x00-\x20!
2512 \"\$\%&\'\(\)+,\-.\/:;<=>?\@A-Z\[\\\]
2513 \^_`a-z\{\|\}~\x7f-\xa8\xaa-\xad\xaf-
2514 \xff], U+0100..203B, U+203D..2048,
2515 U+204A..2121, U+2123..2138, U+213A..2193
2516 ...)
2517 \p{Emoji: Y*} (Single: \p{Emoji}) (1424: [#*0-9\xa9
2518 \xae], U+203C, U+2049, U+2122, U+2139,
2519 U+2194..2199 ...)
2520 \p{Emoji_Component} \p{Emoji_Component=Y} (Short: \p{EComp})
2521 (146)
2522 \p{Emoji_Component: N*} (Short: \p{EComp=N}, \P{EComp}) (1_113_966
2523 plus all above-Unicode code points:
2524 [\x00-\x20!\"\$\%&\'\(\)+,\-.\/:;<=>?
2525 \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2526 U+0100..200C, U+200E..20E2,
2527 U+20E4..FE0E, U+FE10..1F1E5,
2528 U+1F200..1F3FA ...)
2529 \p{Emoji_Component: Y*} (Short: \p{EComp=Y}, \p{EComp}) (146:
2530 [#*0-9], U+200D, U+20E3, U+FE0F,
2531 U+1F1E6..1F1FF, U+1F3FB..1F3FF ...)
2532 \p{Emoji_Modifier} \p{Emoji_Modifier=Y} (Short: \p{EMod}) (5)
2533 \p{Emoji_Modifier: N*} (Short: \p{EMod=N}, \P{EMod}) (1_114_107
2534 plus all above-Unicode code points:
2535 U+0000..1F3FA, U+1F400..infinity)
2536 \p{Emoji_Modifier: Y*} (Short: \p{EMod=Y}, \p{EMod}) (5:
2537 U+1F3FB..1F3FF)
2538 \p{Emoji_Modifier_Base} \p{Emoji_Modifier_Base=Y} (Short:
2539 \p{EBase}) (134)
2540 \p{Emoji_Modifier_Base: N*} (Short: \p{EBase=N}, \P{EBase})
2541 (1_113_978 plus all above-Unicode code
2542 points: U+0000..261C, U+261E..26F8,
2543 U+26FA..2709, U+270E..1F384,
2544 U+1F386..1F3C1, U+1F3C5..1F3C6 ...)
2545 \p{Emoji_Modifier_Base: Y*} (Short: \p{EBase=Y}, \p{EBase}) (134:
2546 U+261D, U+26F9, U+270A..270D, U+1F385,
2547 U+1F3C2..1F3C4, U+1F3C7 ...)
2548 \p{Emoji_Presentation} \p{Emoji_Presentation=Y} (Short:
2549 \p{EPres}) (1205)
2550 \p{Emoji_Presentation: N*} (Short: \p{EPres=N}, \P{EPres})
2551 (1_112_907 plus all above-Unicode code
2552 points: U+0000..2319, U+231C..23E8,
2553 U+23ED..23EF, U+23F1..23F2,
2554 U+23F4..25FC, U+25FF..2613 ...)
2555 \p{Emoji_Presentation: Y*} (Short: \p{EPres=Y}, \p{EPres}) (1205:
2556 U+231A..231B, U+23E9..23EC, U+23F0,
2557 U+23F3, U+25FD..25FE, U+2614..2615 ...)
2558 X \p{Emoticons} \p{Block=Emoticons} (80)
2559 X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
2560 Enclosed_Alphanumerics}) (160)
2561 X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
2562 \p{Block=
2563 Enclosed_Alphanumeric_Supplement}) (256)
2564 X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
2565 Enclosed_Alphanumeric_Supplement}
2566 (Short: \p{InEnclosedAlphanumSup}) (256)
2567 X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
2568 (Short: \p{InEnclosedAlphanum}) (160)
2569 X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
2570 \p{Block=
2571 Enclosed_CJK_Letters_And_Months}) (256)
2572 X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
2573 Enclosed_CJK_Letters_And_Months} (Short:
2574 \p{InEnclosedCJK}) (256)
2575 X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
2576 (= \p{Block=
2577 Enclosed_Ideographic_Supplement}) (256)
2578 X \p{Enclosed_Ideographic_Supplement} \p{Block=
2579 Enclosed_Ideographic_Supplement} (Short:
2580 \p{InEnclosedIdeographicSup}) (256)
2581 \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
2582 (Short: \p{Me}) (13)
2583 \p{EPres} \p{Emoji_Presentation} (=
2584 \p{Emoji_Presentation=Y}) (1205)
2585 \p{EPres: *} \p{Emoji_Presentation: *}
2586 \p{Ethi} \p{Ethiopic} (= \p{Script_Extensions=
2587 Ethiopic}) (NOT \p{Block=Ethiopic}) (523)
2588 \p{Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
2589 \p{Ethi}; NOT \p{Block=Ethiopic}) (523)
2590 X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
2591 Ethiopic_Extended}) (96)
2592 X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
2593 Ethiopic_Extended_A}) (48)
2594 X \p{Ethiopic_Ext_B} \p{Ethiopic_Extended_B} (= \p{Block=
2595 Ethiopic_Extended_B}) (32)
2596 X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
2597 \p{InEthiopicExt}) (96)
2598 X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
2599 \p{InEthiopicExtA}) (48)
2600 X \p{Ethiopic_Extended_B} \p{Block=Ethiopic_Extended_B} (Short:
2601 \p{InEthiopicExtB}) (32)
2602 X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
2603 Ethiopic_Supplement}) (32)
2604 X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
2605 \p{InEthiopicSup}) (32)
2606 \p{Ext} \p{Extender} (= \p{Extender=Y}) (50)
2607 \p{Ext: *} \p{Extender: *}
2608 \p{Extended_Pictographic} \p{Extended_Pictographic=Y} (Short:
2609 \p{ExtPict}) (3537)
2610 \p{Extended_Pictographic: N*} (Short: \p{ExtPict=N}, \P{ExtPict})
2611 (1_110_575 plus all above-Unicode code
2612 points: [\x00-\xa8\xaa-\xad\xaf-\xff],
2613 U+0100..203B, U+203D..2048,
2614 U+204A..2121, U+2123..2138, U+213A..2193
2615 ...)
2616 \p{Extended_Pictographic: Y*} (Short: \p{ExtPict=Y}, \p{ExtPict})
2617 (3537: [\xa9\xae], U+203C, U+2049,
2618 U+2122, U+2139, U+2194..2199 ...)
2619 \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (50)
2620 \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_062
2621 plus all above-Unicode code points:
2622 [\x00-\xb6\xb8-\xff], U+0100..02CF,
2623 U+02D2..063F, U+0641..07F9,
2624 U+07FB..0B54, U+0B56..0E45 ...)
2625 \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (50: [\xb7],
2626 U+02D0..02D1, U+0640, U+07FA, U+0B55,
2627 U+0E46 ...)
2628 \p{ExtPict} \p{Extended_Pictographic} (=
2629 \p{Extended_Pictographic=Y}) (3537)
2630 \p{ExtPict: *} \p{Extended_Pictographic: *}
2631 \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
2632 (Short: \p{Pf}) (10)
2633 \p{Format} \p{General_Category=Format} (Short:
2634 \p{Cf}) (170)
2635 \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
2636 (Short: \p{CompEx}) (1120)
2637 \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
2638 \P{CompEx}) (1_112_992 plus all above-
2639 Unicode code points: U+0000..033F,
2640 U+0342, U+0345..0373, U+0375..037D,
2641 U+037F..0386, U+0388..0957 ...)
2642 \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
2643 \p{CompEx}) (1120: U+0340..0341,
2644 U+0343..0344, U+0374, U+037E, U+0387,
2645 U+0958..095F ...)
2646 \p{Gc: *} \p{General_Category: *}
2647 \p{GCB: *} \p{Grapheme_Cluster_Break: *}
2648 \p{General_Category: C} \p{General_Category=Other} (965_096 plus
2649 all above-Unicode code points)
2650 \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
2651 \p{Gc=LC}, \p{LC}) (4095: [A-Za-z\xb5
2652 \xc0-\xd6\xd8-\xf6\xf8-\xff],
2653 U+0100..01BA, U+01BC..01BF,
2654 U+01C4..0293, U+0295..02AF, U+0370..0373
2655 ...)
2656 \p{General_Category: Cc} \p{General_Category=Control} (65)
2657 \p{General_Category: Cf} \p{General_Category=Format} (170)
2658 \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
2659 (77: [\)\]\}], U+0F3B, U+0F3D, U+169C,
2660 U+2046, U+207E ...)
2661 \p{General_Category: Cn} \p{General_Category=Unassigned} (825_345
2662 plus all above-Unicode code points)
2663 \p{General_Category: Cntrl} \p{General_Category=Control} (65)
2664 \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
2665 \p{General_Category: Combining_Mark} \p{General_Category=Mark}
2666 (2450)
2667 \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
2668 \p{Pc}) (10: [_], U+203F..2040, U+2054,
2669 U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
2670 \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
2671 [\x00-\x1f\x7f-\x9f])
2672 \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
2673 \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
2674 (63: [\$\xa2-\xa5], U+058F, U+060B,
2675 U+07FE..07FF, U+09F2..09F3, U+09FB ...)
2676 \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
2677 (26: [\-], U+058A, U+05BE, U+1400,
2678 U+1806, U+2010..2015 ...)
2679 \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
2680 (680: [0-9], U+0660..0669, U+06F0..06F9,
2681 U+07C0..07C9, U+0966..096F, U+09E6..09EF
2682 ...)
2683 \p{General_Category: Digit} \p{General_Category=Decimal_Number}
2684 (680)
2685 \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
2686 (13: U+0488..0489, U+1ABE, U+20DD..20E0,
2687 U+20E2..20E4, U+A670..A672)
2688 \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
2689 (10: [\xbb], U+2019, U+201D, U+203A,
2690 U+2E03, U+2E05 ...)
2691 \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (170:
2692 [\xad], U+0600..0605, U+061C, U+06DD,
2693 U+070F, U+0890..0891 ...)
2694 \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
2695 \p{Pi}) (12: [\xab], U+2018,
2696 U+201B..201C, U+201F, U+2039, U+2E02 ...)
2697 \p{General_Category: L} \p{General_Category=Letter} (136_104)
2698 X \p{General_Category: L&} \p{General_Category=Cased_Letter} (4095)
2699 X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
2700 the trailing '_' matters in spite of
2701 loose matching rules. (4095)
2702 \p{General_Category: LC} \p{General_Category=Cased_Letter} (4095)
2703 \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (136_104:
2704 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
2705 \xf8-\xff], U+0100..02C1, U+02C6..02D1,
2706 U+02E0..02E4, U+02EC, U+02EE ...)
2707 \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
2708 (236: U+16EE..16F0, U+2160..2182,
2709 U+2185..2188, U+3007, U+3021..3029,
2710 U+3038..303A ...)
2711 \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
2712 (1: U+2028)
2713 \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
2714 (/i= General_Category=Cased_Letter)
2715 (2233)
2716 \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (397)
2717 \p{General_Category: Lo} \p{General_Category=Other_Letter}
2718 (131_612)
2719 \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
2720 /i= General_Category=Cased_Letter)
2721 (2233: [a-z\xb5\xdf-\xf6\xf8-\xff],
2722 U+0101, U+0103, U+0105, U+0107, U+0109
2723 ...)
2724 \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
2725 (/i= General_Category=Cased_Letter) (31)
2726 \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
2727 (/i= General_Category=Cased_Letter)
2728 (1831)
2729 \p{General_Category: M} \p{General_Category=Mark} (2450)
2730 \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2450:
2731 U+0300..036F, U+0483..0489,
2732 U+0591..05BD, U+05BF, U+05C1..05C2,
2733 U+05C4..05C5 ...)
2734 \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
2735 [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
2736 U+0606..0608, U+2044, U+2052,
2737 U+207A..207C ...)
2738 \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (452)
2739 \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
2740 \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
2741 (1985)
2742 \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
2743 (397: U+02B0..02C1, U+02C6..02D1,
2744 U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
2745 \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
2746 (125: [\^`\xa8\xaf\xb4\xb8],
2747 U+02C2..02C5, U+02D2..02DF,
2748 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2749 \p{General_Category: N} \p{General_Category=Number} (1831)
2750 \p{General_Category: Nd} \p{General_Category=Decimal_Number} (680)
2751 \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
2752 \p{General_Category: No} \p{General_Category=Other_Number} (915)
2753 \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
2754 (1985: U+0300..036F, U+0483..0487,
2755 U+0591..05BD, U+05BF, U+05C1..05C2,
2756 U+05C4..05C5 ...)
2757 \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1831: [0-9
2758 \xb2-\xb3\xb9\xbc-\xbe], U+0660..0669,
2759 U+06F0..06F9, U+07C0..07C9,
2760 U+0966..096F, U+09E6..09EF ...)
2761 \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
2762 (79: [\(\[\{], U+0F3A, U+0F3C, U+169B,
2763 U+201A, U+201E ...)
2764 \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (965_096 plus
2765 all above-Unicode code points: [\x00-
2766 \x1f\x7f-\x9f\xad], U+0378..0379,
2767 U+0380..0383, U+038B, U+038D, U+03A2 ...)
2768 \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
2769 (131_612: [\xaa\xba], U+01BB,
2770 U+01C0..01C3, U+0294, U+05D0..05EA,
2771 U+05EF..05F2 ...)
2772 \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
2773 (915: [\xb2-\xb3\xb9\xbc-\xbe],
2774 U+09F4..09F9, U+0B72..0B77,
2775 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
2776 ...)
2777 \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
2778 (628: [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7
2779 \xb6-\xb7\xbf], U+037E, U+0387,
2780 U+055A..055F, U+0589, U+05C0 ...)
2781 \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
2782 (6634: [\xa6\xa9\xae\xb0], U+0482,
2783 U+058D..058E, U+060E..060F, U+06DE,
2784 U+06E9 ...)
2785 \p{General_Category: P} \p{General_Category=Punctuation} (842)
2786 \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
2787 \p{Zp}) (1: U+2029)
2788 \p{General_Category: Pc} \p{General_Category=
2789 Connector_Punctuation} (10)
2790 \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (26)
2791 \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
2792 (77)
2793 \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
2794 (10)
2795 \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
2796 (12)
2797 \p{General_Category: Po} \p{General_Category=Other_Punctuation}
2798 (628)
2799 \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
2800 (137_468: U+E000..F8FF, U+F0000..FFFFD,
2801 U+100000..10FFFD)
2802 \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (79)
2803 \p{General_Category: Punct} \p{General_Category=Punctuation} (842)
2804 \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (842:
2805 [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_\{\}
2806 \xa1\xa7\xab\xb6-\xb7\xbb\xbf], U+037E,
2807 U+0387, U+055A..055F, U+0589..058A,
2808 U+05BE ...)
2809 \p{General_Category: S} \p{General_Category=Symbol} (7770)
2810 \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (63)
2811 \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
2812 [\x20\xa0], U+1680, U+2000..200A,
2813 U+2028..2029, U+202F, U+205F ...)
2814 \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (125)
2815 \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
2816 \p{General_Category: So} \p{General_Category=Other_Symbol} (6634)
2817 \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
2818 (17: [\x20\xa0], U+1680, U+2000..200A,
2819 U+202F, U+205F, U+3000)
2820 \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
2821 (452: U+0903, U+093B, U+093E..0940,
2822 U+0949..094C, U+094E..094F, U+0982..0983
2823 ...)
2824 \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
2825 U+D800..DFFF)
2826 \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7770:
2827 [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
2828 \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
2829 U+02D2..02DF, U+02E5..02EB, U+02ED,
2830 U+02EF..02FF ...)
2831 \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
2832 /i= General_Category=Cased_Letter) (31:
2833 U+01C5, U+01C8, U+01CB, U+01F2,
2834 U+1F88..1F8F, U+1F98..1F9F ...)
2835 \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
2836 (825_345 plus all above-Unicode code
2837 points: U+0378..0379, U+0380..0383,
2838 U+038B, U+038D, U+03A2, U+0530 ...)
2839 \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
2840 /i= General_Category=Cased_Letter)
2841 (1831: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
2842 U+0102, U+0104, U+0106, U+0108 ...)
2843 \p{General_Category: Z} \p{General_Category=Separator} (19)
2844 \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
2845 \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
2846 (1)
2847 \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
2848 X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
2849 \p{InPunctuation}) (112)
2850 X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
2851 X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
2852 \p{Block=Geometric_Shapes_Extended})
2853 (128)
2854 X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
2855 (Short: \p{InGeometricShapesExt}) (128)
2856 \p{Geor} \p{Georgian} (= \p{Script_Extensions=
2857 Georgian}) (NOT \p{Block=Georgian}) (174)
2858 \p{Georgian} \p{Script_Extensions=Georgian} (Short:
2859 \p{Geor}; NOT \p{Block=Georgian}) (174)
2860 X \p{Georgian_Ext} \p{Georgian_Extended} (= \p{Block=
2861 Georgian_Extended}) (48)
2862 X \p{Georgian_Extended} \p{Block=Georgian_Extended} (Short:
2863 \p{InGeorgianExt}) (48)
2864 X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
2865 Georgian_Supplement}) (48)
2866 X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
2867 \p{InGeorgianSup}) (48)
2868 \p{Glag} \p{Glagolitic} (= \p{Script_Extensions=
2869 Glagolitic}) (NOT \p{Block=Glagolitic})
2870 (138)
2871 \p{Glagolitic} \p{Script_Extensions=Glagolitic} (Short:
2872 \p{Glag}; NOT \p{Block=Glagolitic}) (138)
2873 X \p{Glagolitic_Sup} \p{Glagolitic_Supplement} (= \p{Block=
2874 Glagolitic_Supplement}) (48)
2875 X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
2876 \p{InGlagoliticSup}) (48)
2877 \p{Gong} \p{Gunjala_Gondi} (= \p{Script_Extensions=
2878 Gunjala_Gondi}) (NOT \p{Block=
2879 Gunjala_Gondi}) (65)
2880 \p{Gonm} \p{Masaram_Gondi} (= \p{Script_Extensions=
2881 Masaram_Gondi}) (NOT \p{Block=
2882 Masaram_Gondi}) (77)
2883 \p{Goth} \p{Gothic} (= \p{Script_Extensions=
2884 Gothic}) (NOT \p{Block=Gothic}) (27)
2885 \p{Gothic} \p{Script_Extensions=Gothic} (Short:
2886 \p{Goth}; NOT \p{Block=Gothic}) (27)
2887 \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
2888 (146_986)
2889 \p{Gr_Base: *} \p{Grapheme_Base: *}
2890 \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
2891 Y}) (2125)
2892 \p{Gr_Ext: *} \p{Grapheme_Extend: *}
2893 \p{Gran} \p{Grantha} (= \p{Script_Extensions=
2894 Grantha}) (NOT \p{Block=Grantha}) (116)
2895 \p{Grantha} \p{Script_Extensions=Grantha} (Short:
2896 \p{Gran}; NOT \p{Block=Grantha}) (116)
2897 \p{Graph} \p{XPosixGraph} (286_635)
2898 \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
2899 (146_986)
2900 \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase}) (967_126
2901 plus all above-Unicode code points:
2902 [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
2903 U+0378..0379, U+0380..0383, U+038B,
2904 U+038D ...)
2905 \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase})
2906 (146_986: [\x20-\x7e\xa0-\xac\xae-\xff],
2907 U+0100..02FF, U+0370..0377,
2908 U+037A..037F, U+0384..038A, U+038C ...)
2909 \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
2910 (3893)
2911 \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3893: [^
2912 \n\r\x20-\x7e\xa0-\xac\xae-\xff],
2913 U+061C, U+180E, U+200B, U+200E..200F,
2914 U+2028..202E ...)
2915 \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
2916 \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
2917 \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
2918 \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
2919 \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
2920 (0)
2921 \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
2922 E_Base_GAZ} (0)
2923 \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
2924 E_Modifier} (0)
2925 \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
2926 (2130)
2927 \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (2130:
2928 U+0300..036F, U+0483..0489,
2929 U+0591..05BD, U+05BF, U+05C1..05C2,
2930 U+05C4..05C5 ...)
2931 \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
2932 Glue_After_Zwj} (0)
2933 \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
2934 \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
2935 U+1100..115F, U+A960..A97C)
2936 \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
2937 \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
2938 U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
2939 ...)
2940 \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
2941 U+AC01..AC1B, U+AC1D..AC37,
2942 U+AC39..AC53, U+AC55..AC6F,
2943 U+AC71..AC8B, U+AC8D..ACA7 ...)
2944 \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_109
2945 plus all above-Unicode code points:
2946 [\x20-\x7e\xa0-\xac\xae-\xff],
2947 U+0100..02FF, U+0370..0482,
2948 U+048A..0590, U+05BE, U+05C0 ...)
2949 \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
2950 (27)
2951 \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (27:
2952 U+0600..0605, U+06DD, U+070F,
2953 U+0890..0891, U+08E2, U+0D4E ...)
2954 \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
2955 (26: U+1F1E6..1F1FF)
2956 \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
2957 Regional_Indicator} (26)
2958 \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
2959 SpacingMark} (395)
2960 \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (395:
2961 U+0903, U+093B, U+093E..0940,
2962 U+0949..094C, U+094E..094F, U+0982..0983
2963 ...)
2964 \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
2965 U+11A8..11FF, U+D7CB..D7FB)
2966 \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
2967 U+1160..11A7, U+D7B0..D7C6)
2968 \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
2969 (1_096_109 plus all above-Unicode code
2970 points)
2971 \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
2972 \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
2973 (2125)
2974 \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_111_987
2975 plus all above-Unicode code points:
2976 U+0000..02FF, U+0370..0482,
2977 U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
2978 \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (2125:
2979 U+0300..036F, U+0483..0489,
2980 U+0591..05BD, U+05BF, U+05C1..05C2,
2981 U+05C4..05C5 ...)
2982 \p{Greek} \p{Script_Extensions=Greek} (Short:
2983 \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
2984 X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
2985 \p{InGreek}) (144)
2986 X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
2987 Greek_Extended}) (256)
2988 X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
2989 \p{InGreekExt}) (256)
2990 \p{Grek} \p{Greek} (= \p{Script_Extensions=Greek})
2991 (NOT \p{Greek_And_Coptic}) (522)
2992 \p{Gujarati} \p{Script_Extensions=Gujarati} (Short:
2993 \p{Gujr}; NOT \p{Block=Gujarati}) (105)
2994 \p{Gujr} \p{Gujarati} (= \p{Script_Extensions=
2995 Gujarati}) (NOT \p{Block=Gujarati}) (105)
2996 \p{Gunjala_Gondi} \p{Script_Extensions=Gunjala_Gondi}
2997 (Short: \p{Gong}; NOT \p{Block=
2998 Gunjala_Gondi}) (65)
2999 \p{Gurmukhi} \p{Script_Extensions=Gurmukhi} (Short:
3000 \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
3001 \p{Guru} \p{Gurmukhi} (= \p{Script_Extensions=
3002 Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
3003 X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
3004 \p{Block=Halfwidth_And_Fullwidth_Forms})
3005 (240)
3006 X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
3007 Combining_Half_Marks}) (16)
3008 X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
3009 Halfwidth_And_Fullwidth_Forms} (Short:
3010 \p{InHalfAndFullForms}) (240)
3011 \p{Han} \p{Script_Extensions=Han} (98_696)
3012 \p{Hang} \p{Hangul} (= \p{Script_Extensions=
3013 Hangul}) (NOT \p{Hangul_Syllables})
3014 (11_775)
3015 \p{Hangul} \p{Script_Extensions=Hangul} (Short:
3016 \p{Hang}; NOT \p{Hangul_Syllables})
3017 (11_775)
3018 X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
3019 (Short: \p{InCompatJamo}) (96)
3020 X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
3021 (256)
3022 X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
3023 (Short: \p{InJamoExtA}) (32)
3024 X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
3025 (Short: \p{InJamoExtB}) (80)
3026 \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
3027 (125)
3028 \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
3029 U+1100..115F, U+A960..A97C)
3030 \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
3031 (399)
3032 \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
3033 U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
3034 U+AC8C ...)
3035 \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
3036 LVT_Syllable} (10_773)
3037 \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
3038 (10_773: U+AC01..AC1B, U+AC1D..AC37,
3039 U+AC39..AC53, U+AC55..AC6F,
3040 U+AC71..AC8B, U+AC8D..ACA7 ...)
3041 \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
3042 Not_Applicable} (1_102_583 plus all
3043 above-Unicode code points)
3044 \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
3045 (1_102_583 plus all above-Unicode code
3046 points: U+0000..10FF, U+1200..A95F,
3047 U+A97D..ABFF, U+D7A4..D7AF,
3048 U+D7C7..D7CA, U+D7FC..infinity)
3049 \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
3050 (137)
3051 \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
3052 U+11A8..11FF, U+D7CB..D7FB)
3053 \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
3054 (95)
3055 \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
3056 U+1160..11A7, U+D7B0..D7C6)
3057 X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
3058 \p{InHangul}) (11_184)
3059 \p{Hani} \p{Han} (= \p{Script_Extensions=Han})
3060 (98_696)
3061 \p{Hanifi_Rohingya} \p{Script_Extensions=Hanifi_Rohingya}
3062 (Short: \p{Rohg}; NOT \p{Block=
3063 Hanifi_Rohingya}) (55)
3064 \p{Hano} \p{Hanunoo} (= \p{Script_Extensions=
3065 Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
3066 \p{Hanunoo} \p{Script_Extensions=Hanunoo} (Short:
3067 \p{Hano}; NOT \p{Block=Hanunoo}) (23)
3068 \p{Hatr} \p{Hatran} (= \p{Script_Extensions=
3069 Hatran}) (NOT \p{Block=Hatran}) (26)
3070 \p{Hatran} \p{Script_Extensions=Hatran} (Short:
3071 \p{Hatr}; NOT \p{Block=Hatran}) (26)
3072 \p{Hebr} \p{Hebrew} (= \p{Script_Extensions=
3073 Hebrew}) (NOT \p{Block=Hebrew}) (134)
3074 \p{Hebrew} \p{Script_Extensions=Hebrew} (Short:
3075 \p{Hebr}; NOT \p{Block=Hebrew}) (134)
3076 \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
3077 \p{Hex: *} \p{Hex_Digit: *}
3078 \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
3079 \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
3080 plus all above-Unicode code points:
3081 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
3082 \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
3083 U+0100..FF0F, U+FF1A..FF20,
3084 U+FF27..FF40, U+FF47..infinity)
3085 \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
3086 f], U+FF10..FF19, U+FF21..FF26,
3087 U+FF41..FF46)
3088 X \p{High_Private_Use_Surrogates} \p{Block=
3089 High_Private_Use_Surrogates} (Short:
3090 \p{InHighPUSurrogates}) (128)
3091 X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
3092 \p{Block=High_Private_Use_Surrogates})
3093 (128)
3094 X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
3095 \p{Hira} \p{Hiragana} (= \p{Script_Extensions=
3096 Hiragana}) (NOT \p{Block=Hiragana}) (433)
3097 \p{Hiragana} \p{Script_Extensions=Hiragana} (Short:
3098 \p{Hira}; NOT \p{Block=Hiragana}) (433)
3099 \p{Hluw} \p{Anatolian_Hieroglyphs} (=
3100 \p{Script_Extensions=
3101 Anatolian_Hieroglyphs}) (NOT \p{Block=
3102 Anatolian_Hieroglyphs}) (583)
3103 \p{Hmng} \p{Pahawh_Hmong} (= \p{Script_Extensions=
3104 Pahawh_Hmong}) (NOT \p{Block=
3105 Pahawh_Hmong}) (127)
3106 \p{Hmnp} \p{Nyiakeng_Puachue_Hmong} (=
3107 \p{Script_Extensions=
3108 Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
3109 Nyiakeng_Puachue_Hmong}) (71)
3110 \p{HorizSpace} \p{XPosixBlank} (18)
3111 \p{Hst: *} \p{Hangul_Syllable_Type: *}
3112 \p{Hung} \p{Old_Hungarian} (= \p{Script_Extensions=
3113 Old_Hungarian}) (NOT \p{Block=
3114 Old_Hungarian}) (108)
3115 D \p{Hyphen} \p{Hyphen=Y} (11)
3116 D \p{Hyphen: N*} Supplanted by Line_Break property values;
3117 see www.unicode.org/reports/tr14
3118 (Single: \P{Hyphen}) (1_114_101 plus all
3119 above-Unicode code points: [\x00-\x20!
3120 \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
3121 \[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-\xff],
3122 U+0100..0589, U+058B..1805,
3123 U+1807..200F, U+2012..2E16, U+2E18..30FA
3124 ...)
3125 D \p{Hyphen: Y*} Supplanted by Line_Break property values;
3126 see www.unicode.org/reports/tr14
3127 (Single: \p{Hyphen}) (11: [\-\xad],
3128 U+058A, U+1806, U+2010..2011, U+2E17,
3129 U+30FB ...)
3130 \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
3131 \p{Ideographic_Description_Characters})
3132 (139_482)
3133 \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (974_630 plus
3134 all above-Unicode code points: [\x00-
3135 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
3136 \[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4\xb6
3137 \xb8-\xb9\xbb-\xbf\xd7\xf7],
3138 U+02C2..02C5, U+02D2..02DF,
3139 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
3140 \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (139_482:
3141 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
3142 \xd8-\xf6\xf8-\xff], U+0100..02C1,
3143 U+02C6..02D1, U+02E0..02E4, U+02EC,
3144 U+02EE ...)
3145 \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (136_345)
3146 \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (977_767 plus
3147 all above-Unicode code points: [\x00-
3148 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@
3149 \[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-
3150 \xb9\xbb-\xbf\xd7\xf7], U+02C2..02C5,
3151 U+02D2..02DF, U+02E5..02EB, U+02ED,
3152 U+02EF..036F ...)
3153 \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (136_345: [A-
3154 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
3155 \xff], U+0100..02C1, U+02C6..02D1,
3156 U+02E0..02E4, U+02EC, U+02EE ...)
3157 \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
3158 \p{Ideographic_Description_Characters})
3159 (139_482)
3160 \p{IDC: *} \p{ID_Continue: *}
3161 \p{Identifier_Status: Allowed} (112_159: [\'\-.0-9:A-Z_a-z\xb7
3162 \xc0-\xd6\xd8-\xf6\xf8-\xff],
3163 U+0100..0131, U+0134..013E,
3164 U+0141..0148, U+014A..017E, U+018F ...)
3165 \p{Identifier_Status: Restricted} (1_001_953 plus all above-
3166 Unicode code points: [\x00-\x20!\"#\$
3167 \%&\(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x7f-
3168 \xb6\xb8-\xbf\xd7\xf7], U+0132..0133,
3169 U+013F..0140, U+0149, U+017F..018E,
3170 U+0190..019F ...)
3171 \p{Identifier_Type: Default_Ignorable} (398: [\xad], U+034F,
3172 U+061C, U+115F..1160, U+17B4..17B5,
3173 U+180B..180F ...)
3174 \p{Identifier_Type: Deprecated} (15: U+0149, U+0673, U+0F77,
3175 U+0F79, U+17A3..17A4, U+206A..206F ...)
3176 \p{Identifier_Type: Exclusion} (17_241: U+03E2..03EF,
3177 U+0800..082D, U+0830..083E,
3178 U+1680..169C, U+16A0..16EA, U+16EE..16F8
3179 ...)
3180 \p{Identifier_Type: Inclusion} (17: [\'\-.:\xb7], U+0375, U+058A,
3181 U+05F3..05F4, U+06FD..06FE, U+0F0B ...)
3182 \p{Identifier_Type: Limited_Use} (5268: U+0700..070D,
3183 U+070F..074A, U+074D..074F,
3184 U+07C0..07FA, U+07FD..07FF, U+0840..085B
3185 ...)
3186 \p{Identifier_Type: Not_Character} (964_920 plus all above-Unicode
3187 code points: [^\t\n\cK\f\r\x20-\x7e\x85
3188 \xa0-\xff], U+0378..0379, U+0380..0383,
3189 U+038B, U+038D, U+03A2 ...)
3190 \p{Identifier_Type: Not_NFKC} (4921: [\xa0\xa8\xaa\xaf\xb2-\xb5
3191 \xb8-\xba\xbc-\xbe], U+0132..0133,
3192 U+013F..0140, U+017F, U+01C4..01CC,
3193 U+01F1..01F3 ...)
3194 \p{Identifier_Type: Not_XID} (8277: [\t\n\cK\f\r\x20!\"#\$\%&
3195 \(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x85
3196 \xa1-\xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6
3197 \xbb\xbf\xd7\xf7], U+02C2..02C5,
3198 U+02D2..02D7, U+02DE..02DF,
3199 U+02E5..02EB, U+02ED ...)
3200 \p{Identifier_Type: Obsolete} (1627: U+018D, U+01AA..01AB,
3201 U+01B9..01BB, U+01BE..01BF,
3202 U+01F6..01F7, U+021C..021D ...)
3203 \p{Identifier_Type: Recommended} (112_142: [0-9A-Z_a-z\xc0-\xd6
3204 \xd8-\xf6\xf8-\xff], U+0100..0131,
3205 U+0134..013E, U+0141..0148,
3206 U+014A..017E, U+018F ...)
3207 \p{Identifier_Type: Technical} (1660: U+0180, U+018D,
3208 U+01AA..01AB, U+01BA..01BB, U+01BE,
3209 U+01C0..01C3 ...)
3210 \p{Identifier_Type: Uncommon_Use} (396: U+0181..018C, U+018E,
3211 U+0190..019F, U+01A2..01A9,
3212 U+01AC..01AE, U+01B1..01B8 ...)
3213 \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
3214 (105_854)
3215 \p{Ideo: *} \p{Ideographic: *}
3216 \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
3217 (105_854)
3218 \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_008_258
3219 plus all above-Unicode code points:
3220 U+0000..3005, U+3008..3020,
3221 U+302A..3037, U+303B..33FF,
3222 U+4DC0..4DFF, U+A000..F8FF ...)
3223 \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (105_854:
3224 U+3006..3007, U+3021..3029,
3225 U+3038..303A, U+3400..4DBF,
3226 U+4E00..9FFF, U+F900..FA6D ...)
3227 X \p{Ideographic_Description_Characters} \p{Block=
3228 Ideographic_Description_Characters}
3229 (Short: \p{InIDC}) (16)
3230 X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
3231 \p{Block=
3232 Ideographic_Symbols_And_Punctuation})
3233 (32)
3234 X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
3235 Ideographic_Symbols_And_Punctuation}
3236 (Short: \p{InIdeographicSymbols}) (32)
3237 \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (136_345)
3238 \p{IDS: *} \p{ID_Start: *}
3239 \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
3240 \p{IDSB}) (10)
3241 \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
3242 (1_114_102 plus all above-Unicode code
3243 points: U+0000..2FEF, U+2FF2..2FF3,
3244 U+2FFC..infinity)
3245 \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
3246 U+2FF0..2FF1, U+2FF4..2FFB)
3247 \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
3248 \p{IDST}) (2)
3249 \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
3250 (1_114_110 plus all above-Unicode code
3251 points: U+0000..2FF1, U+2FF4..infinity)
3252 \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
3253 U+2FF2..2FF3)
3254 \p{IDSB} \p{IDS_Binary_Operator} (=
3255 \p{IDS_Binary_Operator=Y}) (10)
3256 \p{IDSB: *} \p{IDS_Binary_Operator: *}
3257 \p{IDST} \p{IDS_Trinary_Operator} (=
3258 \p{IDS_Trinary_Operator=Y}) (2)
3259 \p{IDST: *} \p{IDS_Trinary_Operator: *}
3260 \p{Imperial_Aramaic} \p{Script_Extensions=Imperial_Aramaic}
3261 (Short: \p{Armi}; NOT \p{Block=
3262 Imperial_Aramaic}) (31)
3263 \p{In: *} \p{Present_In: *} (Perl extension)
3264 X \p{In_*} \p{Block: *}
3265 X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
3266 Common_Indic_Number_Forms}) (16)
3267 \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
3268 (356: U+093C, U+0941..0944, U+094D,
3269 U+0952, U+0956..0957, U+0962..0963 ...)
3270 \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
3271 BottomAndLeft}) (1: U+A9BF)
3272 \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
3273 BottomAndRight}) (4: U+1B3B, U+A9BE,
3274 U+A9C0, U+11942)
3275 \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (66:
3276 U+093F, U+094E, U+09BF, U+09C7..09C8,
3277 U+0A3F, U+0ABF ...)
3278 \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
3279 LeftAndRight}) (22: U+09CB..09CC,
3280 U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
3281 U+0DDC, U+0DDE ...)
3282 \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_875
3283 plus all above-Unicode code points:
3284 U+0000..08FF, U+0904..0939, U+093D,
3285 U+0950, U+0953..0954, U+0958..0961 ...)
3286 \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
3287 Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
3288 U+10A01, U+10A06)
3289 \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (295:
3290 U+0903, U+093B, U+093E, U+0940,
3291 U+0949..094C, U+094F ...)
3292 \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (428:
3293 U+0900..0902, U+093A, U+0945..0948,
3294 U+0951, U+0955, U+0981 ...)
3295 \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
3296 TopAndBottom}) (10: U+0C48, U+0F73,
3297 U+0F76..0F79, U+0F81, U+1B3C,
3298 U+1112E..1112F)
3299 \p{Indic_Positional_Category: Top_And_Bottom_And_Left} (Short:
3300 \p{InPC=TopAndBottomAndLeft}) (2:
3301 U+103C, U+1171E)
3302 \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
3303 \p{InPC=TopAndBottomAndRight}) (1:
3304 U+1B3D)
3305 \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
3306 TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
3307 U+1C29, U+114BB, U+115B9)
3308 \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
3309 \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
3310 U+0DDD, U+17BF, U+115BB)
3311 \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
3312 TopAndRight}) (13: U+0AC9, U+0B57,
3313 U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
3314 U+1925..1926 ...)
3315 \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
3316 VisualOrderLeft}) (19: U+0E40..0E44,
3317 U+0EC0..0EC4, U+19B5..19B7, U+19BA,
3318 U+AAB5..AAB6, U+AAB9 ...)
3319 X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
3320 \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
3321 (17: U+093D, U+09BD, U+0ABD, U+0B3D,
3322 U+0C3D, U+0CBD ...)
3323 \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (94:
3324 U+0900..0902, U+0981..0982, U+09FC,
3325 U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
3326 \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
3327 \p{InSC=BrahmiJoiningNumber}) (20:
3328 U+11052..11065)
3329 \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
3330 CantillationMark}) (58: U+0951..0952,
3331 U+0A51, U+0AFA, U+0AFC, U+1CD0..1CD2,
3332 U+1CD4..1CE1 ...)
3333 \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
3334 (2241: U+0915..0939, U+0958..095F,
3335 U+0978..097F, U+0995..09A8,
3336 U+09AA..09B0, U+09B2 ...)
3337 \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
3338 ConsonantDead}) (14: U+09CE, U+0C5D,
3339 U+0CDD, U+0D54..0D56, U+0D7A..0D7F,
3340 U+1CF2..1CF3)
3341 \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
3342 ConsonantFinal}) (70: U+1930..1931,
3343 U+1933..1939, U+19C1..19C7,
3344 U+1A58..1A59, U+1B03, U+1B81 ...)
3345 \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
3346 \p{InSC=ConsonantHeadLetter}) (5:
3347 U+0F88..0F8C)
3348 \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
3349 \p{InSC=ConsonantInitialPostfixed}) (1:
3350 U+1A5A)
3351 \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
3352 ConsonantKiller}) (2: U+0E4C, U+17CD)
3353 \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
3354 ConsonantMedial}) (31: U+0A75,
3355 U+0EBC..0EBD, U+103B..103E,
3356 U+105E..1060, U+1082, U+1A55..1A56 ...)
3357 \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
3358 \p{InSC=ConsonantPlaceholder}) (22: [\-
3359 \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
3360 U+104E, U+1900 ...)
3361 \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
3362 \p{InSC=ConsonantPrecedingRepha}) (4:
3363 U+0D4E, U+11941, U+11D46, U+11F02)
3364 \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
3365 ConsonantPrefixed}) (10: U+111C2..111C3,
3366 U+1193F, U+11A3A, U+11A84..11A89)
3367 \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
3368 ConsonantSubjoined}) (94: U+0F8D..0F97,
3369 U+0F99..0FBC, U+1929..192B, U+1A57,
3370 U+1A5B..1A5E, U+1BA1..1BA3 ...)
3371 \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
3372 \p{InSC=ConsonantSucceedingRepha}) (1:
3373 U+17CC)
3374 \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
3375 \p{InSC=ConsonantWithStacker}) (8:
3376 U+0CF1..0CF2, U+1CF5..1CF6,
3377 U+11003..11004, U+11460..11461)
3378 \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
3379 GeminationMark}) (4: U+0A71, U+0AFB,
3380 U+11237, U+11A98)
3381 \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
3382 InvisibleStacker}) (13: U+1039, U+17D2,
3383 U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
3384 \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
3385 U+200D)
3386 \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
3387 ModifyingLetter}) (1: U+0B83)
3388 \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
3389 NonJoiner}) (1: U+200C)
3390 \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (32:
3391 U+093C, U+09BC, U+0A3C, U+0ABC,
3392 U+0AFD..0AFF, U+0B3C ...)
3393 \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (501:
3394 [0-9], U+0966..096F, U+09E6..09EF,
3395 U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
3396 ...)
3397 \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
3398 NumberJoiner}) (1: U+1107F)
3399 \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
3400 (1_109_473 plus all above-Unicode code
3401 points: [\x00-\x20!\"#\$\%&\'\(\)*+,.
3402 \/:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
3403 \x9f\xa1-\xb1\xb4-\xd6\xd8-\xff],
3404 U+0100..08FF, U+0950, U+0953..0954,
3405 U+0964..0965, U+0970..0971 ...)
3406 \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
3407 PureKiller}) (26: U+0D3B..0D3C, U+0E3A,
3408 U+0E4E, U+0EBA, U+0F84, U+103A ...)
3409 \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
3410 RegisterShifter}) (2: U+17C9..17CA)
3411 \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
3412 SyllableModifier}) (26: [\xb2-\xb3],
3413 U+09FE, U+0ECE, U+0F35, U+0F37, U+0FC6
3414 ...)
3415 \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
3416 ToneLetter}) (7: U+1970..1974, U+AAC0,
3417 U+AAC2)
3418 \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
3419 (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
3420 U+1063..1064, U+1069..106D, U+1087..108D
3421 ...)
3422 \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
3423 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3424 U+0BCD ...)
3425 \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3426 (36: U+0903, U+0983, U+0A03, U+0A83,
3427 U+0B03, U+0C03 ...)
3428 \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3429 U+1963..196D, U+A85E..A861, U+A866,
3430 U+A922..A92A, U+11150..11154)
3431 \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3432 VowelDependent}) (697: U+093A..093B,
3433 U+093E..094C, U+094E..094F,
3434 U+0955..0957, U+0962..0963, U+09BE..09C4
3435 ...)
3436 \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3437 VowelIndependent}) (500: U+0904..0914,
3438 U+0960..0961, U+0972..0977,
3439 U+0985..098C, U+098F..0990, U+0993..0994
3440 ...)
3441 \p{Inherited} \p{Script_Extensions=Inherited} (Short:
3442 \p{Zinh}) (586)
3443 \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
3444 (Short: \p{Pi}) (12)
3445 \p{InPC: *} \p{Indic_Positional_Category: *}
3446 \p{InSC: *} \p{Indic_Syllabic_Category: *}
3447 \p{Inscriptional_Pahlavi} \p{Script_Extensions=
3448 Inscriptional_Pahlavi} (Short: \p{Phli};
3449 NOT \p{Block=Inscriptional_Pahlavi}) (27)
3450 \p{Inscriptional_Parthian} \p{Script_Extensions=
3451 Inscriptional_Parthian} (Short:
3452 \p{Prti}; NOT \p{Block=
3453 Inscriptional_Parthian}) (30)
3454 X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
3455 IPA_Extensions}) (96)
3456 X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
3457 \p{InIPAExt}) (96)
3458 \p{Is_*} \p{*} (Any exceptions are individually
3459 noted beginning with the word NOT.) If
3460 an entry has flag(s) at its beginning,
3461 like "D", the "Is_" form has the same
3462 flag(s)
3463 \p{Ital} \p{Old_Italic} (= \p{Script_Extensions=
3464 Old_Italic}) (NOT \p{Block=Old_Italic})
3465 (39)
3466 X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
3467 (256)
3468 X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
3469 Hangul_Jamo_Extended_A}) (32)
3470 X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
3471 Hangul_Jamo_Extended_B}) (80)
3472 \p{Java} \p{Javanese} (= \p{Script_Extensions=
3473 Javanese}) (NOT \p{Block=Javanese}) (91)
3474 \p{Javanese} \p{Script_Extensions=Javanese} (Short:
3475 \p{Java}; NOT \p{Block=Javanese}) (91)
3476 \p{Jg: *} \p{Joining_Group: *}
3477 \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
3478 \p{Join_C: *} \p{Join_Control: *}
3479 \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
3480 \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
3481 plus all above-Unicode code points:
3482 U+0000..200B, U+200E..infinity)
3483 \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2:
3484 U+200C..200D)
3485 \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
3486 U+08BB)
3487 \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
3488 U+08BD)
3489 \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (2:
3490 U+08BC, U+08C4)
3491 \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (9: U+0639..063A,
3492 U+06A0, U+06FC, U+075D..075F, U+08B3,
3493 U+08C3)
3494 \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
3495 \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (29: U+0622..0623,
3496 U+0625, U+0627, U+0671..0673, U+0675,
3497 U+0773..0774 ...)
3498 \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (27: U+0628,
3499 U+062A..062B, U+066E, U+0679..0680,
3500 U+0750..0756, U+08A0..08A1 ...)
3501 \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
3502 \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
3503 BurushaskiYehBarree}) (2: U+077A..077B)
3504 \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15: U+062F..0630,
3505 U+0688..0690, U+06EE, U+0759..075A,
3506 U+08AE)
3507 \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
3508 U+0715..0716, U+072A, U+072F)
3509 \p{Joining_Group: E} (Short: \p{Jg=E}) (1: U+0725)
3510 \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
3511 U+063D..063F, U+06CC, U+06CE,
3512 U+0775..0776)
3513 \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1: U+074F)
3514 \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10: U+0641,
3515 U+06A1..06A6, U+0760..0761, U+08A4)
3516 \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
3517 U+0724)
3518 \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (17: U+063B..063C,
3519 U+06A9, U+06AB, U+06AF..06B4,
3520 U+0762..0764, U+088D ...)
3521 \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
3522 U+072E)
3523 \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (22: U+062C..062E,
3524 U+0681..0687, U+06BF, U+0757..0758,
3525 U+076E..076F, U+0772 ...)
3526 \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
3527 HamzaOnHehGoal}) (1: U+06C3)
3528 \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
3529 HanifiRohingyaKinnaYa}) (4: U+10D19,
3530 U+10D1E, U+10D20, U+10D23)
3531 \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
3532 HanifiRohingyaPa}) (3: U+10D02, U+10D09,
3533 U+10D1C)
3534 \p{Joining_Group: He} (Short: \p{Jg=He}) (1: U+0717)
3535 \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1: U+0647)
3536 \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
3537 U+06C1..06C2)
3538 \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
3539 \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6: U+0643,
3540 U+06AC..06AE, U+077F, U+08B4)
3541 \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
3542 \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
3543 \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
3544 U+06BE, U+06FF)
3545 \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (8: U+0644,
3546 U+06B5..06B8, U+076A, U+08A6, U+08C7)
3547 \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
3548 \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
3549 U+0866)
3550 \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
3551 U+0861)
3552 \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
3553 U+0868)
3554 \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
3555 (1: U+0869)
3556 \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
3557 U+0860)
3558 \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
3559 U+0864)
3560 \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
3561 (1: U+0865)
3562 \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
3563 U+0862)
3564 \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
3565 U+0867)
3566 \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
3567 U+086A)
3568 \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
3569 U+0863)
3570 \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
3571 ManichaeanAleph}) (1: U+10AC0)
3572 \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
3573 (2: U+10AD9..10ADA)
3574 \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
3575 (2: U+10AC1..10AC2)
3576 \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
3577 ManichaeanDaleth}) (1: U+10AC5)
3578 \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
3579 ManichaeanDhamedh}) (1: U+10AD4)
3580 \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
3581 (1: U+10AEC)
3582 \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
3583 ManichaeanGimel}) (2: U+10AC3..10AC4)
3584 \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
3585 (1: U+10ACD)
3586 \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
3587 ManichaeanHundred}) (1: U+10AEF)
3588 \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
3589 (3: U+10AD0..10AD2)
3590 \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
3591 ManichaeanLamedh}) (1: U+10AD3)
3592 \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
3593 (1: U+10AD6)
3594 \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
3595 (1: U+10AD7)
3596 \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
3597 (1: U+10AEB)
3598 \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
3599 U+10ADB..10ADC)
3600 \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
3601 (3: U+10ADE..10AE0)
3602 \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
3603 (1: U+10AE1)
3604 \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
3605 ManichaeanSadhe}) (1: U+10ADD)
3606 \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
3607 ManichaeanSamekh}) (1: U+10AD8)
3608 \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
3609 (1: U+10AE4)
3610 \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
3611 (1: U+10AED)
3612 \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
3613 (1: U+10ACE)
3614 \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
3615 ManichaeanThamedh}) (1: U+10AD5)
3616 \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
3617 ManichaeanTwenty}) (1: U+10AEE)
3618 \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
3619 (1: U+10AC7)
3620 \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
3621 (1: U+10ACF)
3622 \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
3623 ManichaeanZayin}) (2: U+10AC9..10ACA)
3624 \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
3625 U+0765..0766, U+08A7)
3626 \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1: U+0721)
3627 \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
3628 (1_113_762 plus all above-Unicode code
3629 points: U+0000..061F, U+0621, U+0640,
3630 U+064B..066D, U+0670, U+0674 ...)
3631 \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (9: U+0646,
3632 U+06B9..06BC, U+0767..0769, U+0889)
3633 \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1: U+0722)
3634 \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1: U+06BD)
3635 \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1: U+0726)
3636 \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (6: U+0642, U+066F,
3637 U+06A7..06A8, U+08A5, U+08B5)
3638 \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
3639 \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (19: U+0631..0632,
3640 U+0691..0699, U+06EF, U+075B,
3641 U+076B..076C, U+0771 ...)
3642 \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
3643 U+0727)
3644 \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
3645 U+08AC)
3646 \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6: U+0635..0636,
3647 U+069D..069E, U+06FB, U+08AF)
3648 \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
3649 \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
3650 U+069A..069C, U+06FA, U+075C, U+076D,
3651 U+0770 ...)
3652 \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
3653 \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
3654 \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
3655 U+08B1)
3656 \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
3657 \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
3658 \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (6: U+0637..0638,
3659 U+069F, U+088B..088C, U+08A3)
3660 \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1: U+072C)
3661 \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
3662 U+0629, U+06C0, U+06D5)
3663 \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
3664 Hamza_On_Heh_Goal} (1)
3665 \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
3666 \p{Joining_Group: Thin_Yeh} (Short: \p{Jg=ThinYeh}) (1: U+0886)
3667 \p{Joining_Group: Vertical_Tail} (Short: \p{Jg=VerticalTail}) (1:
3668 U+088E)
3669 \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
3670 U+0676..0677, U+06C4..06CB, U+06CF,
3671 U+0778..0779 ...)
3672 \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
3673 U+0649..064A, U+0678, U+06D0..06D1,
3674 U+0777 ...)
3675 \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
3676 U+06D2..06D3)
3677 \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
3678 U+06CD)
3679 \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
3680 \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
3681 \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
3682 \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
3683 \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (7)
3684 \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (610)
3685 \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (610: U+0620,
3686 U+0626, U+0628, U+062A..062E,
3687 U+0633..063F, U+0641..0647 ...)
3688 \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (7: U+0640,
3689 U+07FA, U+0883..0885, U+180A, U+200D)
3690 \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (5)
3691 \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (5: U+A872,
3692 U+10ACD, U+10AD7, U+10D00, U+10FCB)
3693 \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_188 plus
3694 all above-Unicode code points: [\x00-
3695 \xac\xae-\xff], U+0100..02FF,
3696 U+0370..0482, U+048A..0590, U+05BE,
3697 U+05C0 ...)
3698 \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (152)
3699 \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (152:
3700 U+0622..0625, U+0627, U+0629,
3701 U+062F..0632, U+0648, U+0671..0673 ...)
3702 \p{Joining_Type: T} \p{Joining_Type=Transparent} (2150)
3703 \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (2150: [\xad],
3704 U+0300..036F, U+0483..0489,
3705 U+0591..05BD, U+05BF, U+05C1..05C2 ...)
3706 \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_188
3707 plus all above-Unicode code points)
3708 \p{Jt: *} \p{Joining_Type: *}
3709 \p{Kaithi} \p{Script_Extensions=Kaithi} (Short:
3710 \p{Kthi}; NOT \p{Block=Kaithi}) (88)
3711 X \p{Kaktovik_Numerals} \p{Block=Kaktovik_Numerals} (32)
3712 \p{Kali} \p{Kayah_Li} (= \p{Script_Extensions=
3713 Kayah_Li}) (48)
3714 \p{Kana} \p{Katakana} (= \p{Script_Extensions=
3715 Katakana}) (NOT \p{Block=Katakana}) (373)
3716 X \p{Kana_Ext_A} \p{Kana_Extended_A} (= \p{Block=
3717 Kana_Extended_A}) (48)
3718 X \p{Kana_Ext_B} \p{Kana_Extended_B} (= \p{Block=
3719 Kana_Extended_B}) (16)
3720 X \p{Kana_Extended_A} \p{Block=Kana_Extended_A} (Short:
3721 \p{InKanaExtA}) (48)
3722 X \p{Kana_Extended_B} \p{Block=Kana_Extended_B} (Short:
3723 \p{InKanaExtB}) (16)
3724 X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
3725 Kana_Supplement}) (256)
3726 X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
3727 \p{InKanaSup}) (256)
3728 X \p{Kanbun} \p{Block=Kanbun} (16)
3729 X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
3730 Kangxi_Radicals}) (224)
3731 X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
3732 \p{InKangxi}) (224)
3733 \p{Kannada} \p{Script_Extensions=Kannada} (Short:
3734 \p{Knda}; NOT \p{Block=Kannada}) (106)
3735 \p{Katakana} \p{Script_Extensions=Katakana} (Short:
3736 \p{Kana}; NOT \p{Block=Katakana}) (373)
3737 X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
3738 \p{Block=Katakana_Phonetic_Extensions})
3739 (16)
3740 X \p{Katakana_Phonetic_Extensions} \p{Block=
3741 Katakana_Phonetic_Extensions} (Short:
3742 \p{InKatakanaExt}) (16)
3743 \p{Kawi} \p{Script_Extensions=Kawi} (NOT \p{Block=
3744 Kawi}) (86)
3745 \p{Kayah_Li} \p{Script_Extensions=Kayah_Li} (Short:
3746 \p{Kali}) (48)
3747 \p{Khar} \p{Kharoshthi} (= \p{Script_Extensions=
3748 Kharoshthi}) (NOT \p{Block=Kharoshthi})
3749 (68)
3750 \p{Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
3751 \p{Khar}; NOT \p{Block=Kharoshthi}) (68)
3752 \p{Khitan_Small_Script} \p{Script_Extensions=Khitan_Small_Script}
3753 (Short: \p{Kits}; NOT \p{Block=
3754 Khitan_Small_Script}) (471)
3755 \p{Khmer} \p{Script_Extensions=Khmer} (Short:
3756 \p{Khmr}; NOT \p{Block=Khmer}) (146)
3757 X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
3758 \p{Khmr} \p{Khmer} (= \p{Script_Extensions=Khmer})
3759 (NOT \p{Block=Khmer}) (146)
3760 \p{Khoj} \p{Khojki} (= \p{Script_Extensions=
3761 Khojki}) (NOT \p{Block=Khojki}) (85)
3762 \p{Khojki} \p{Script_Extensions=Khojki} (Short:
3763 \p{Khoj}; NOT \p{Block=Khojki}) (85)
3764 \p{Khudawadi} \p{Script_Extensions=Khudawadi} (Short:
3765 \p{Sind}; NOT \p{Block=Khudawadi}) (81)
3766 \p{Kits} \p{Khitan_Small_Script} (=
3767 \p{Script_Extensions=
3768 Khitan_Small_Script}) (NOT \p{Block=
3769 Khitan_Small_Script}) (471)
3770 \p{Knda} \p{Kannada} (= \p{Script_Extensions=
3771 Kannada}) (NOT \p{Block=Kannada}) (106)
3772 \p{Kthi} \p{Kaithi} (= \p{Script_Extensions=
3773 Kaithi}) (NOT \p{Block=Kaithi}) (88)
3774 \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
3775 (136_104)
3776 X \p{L&} \p{Cased_Letter} (= \p{General_Category=
3777 Cased_Letter}) (4095)
3778 X \p{L_} \p{Cased_Letter} (= \p{General_Category=
3779 Cased_Letter}) Note the trailing '_'
3780 matters in spite of loose matching
3781 rules. (4095)
3782 \p{Lana} \p{Tai_Tham} (= \p{Script_Extensions=
3783 Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
3784 \p{Lao} \p{Script_Extensions=Lao} (NOT \p{Block=
3785 Lao}) (83)
3786 \p{Laoo} \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
3787 \p{Block=Lao}) (83)
3788 \p{Latin} \p{Script_Extensions=Latin} (Short:
3789 \p{Latn}) (1510)
3790 X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
3791 Latin_1_Supplement}) (128)
3792 X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
3793 Latin_1_Supplement}) (128)
3794 X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
3795 \p{InLatin1}) (128)
3796 X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
3797 Latin_Extended_A}) (128)
3798 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
3799 \p{Block=Latin_Extended_Additional})
3800 (256)
3801 X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
3802 Latin_Extended_B}) (208)
3803 X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
3804 Latin_Extended_C}) (32)
3805 X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
3806 Latin_Extended_D}) (224)
3807 X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block=
3808 Latin_Extended_E}) (64)
3809 X \p{Latin_Ext_F} \p{Latin_Extended_F} (= \p{Block=
3810 Latin_Extended_F}) (64)
3811 X \p{Latin_Ext_G} \p{Latin_Extended_G} (= \p{Block=
3812 Latin_Extended_G}) (256)
3813 X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
3814 \p{InLatinExtA}) (128)
3815 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
3816 (Short: \p{InLatinExtAdditional}) (256)
3817 X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
3818 \p{InLatinExtB}) (208)
3819 X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
3820 \p{InLatinExtC}) (32)
3821 X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
3822 \p{InLatinExtD}) (224)
3823 X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short:
3824 \p{InLatinExtE}) (64)
3825 X \p{Latin_Extended_F} \p{Block=Latin_Extended_F} (Short:
3826 \p{InLatinExtF}) (64)
3827 X \p{Latin_Extended_G} \p{Block=Latin_Extended_G} (Short:
3828 \p{InLatinExtG}) (256)
3829 \p{Latn} \p{Latin} (= \p{Script_Extensions=Latin})
3830 (1510)
3831 \p{Lb: *} \p{Line_Break: *}
3832 \p{LC} \p{Cased_Letter} (= \p{General_Category=
3833 Cased_Letter}) (4095)
3834 \p{Lepc} \p{Lepcha} (= \p{Script_Extensions=
3835 Lepcha}) (NOT \p{Block=Lepcha}) (74)
3836 \p{Lepcha} \p{Script_Extensions=Lepcha} (Short:
3837 \p{Lepc}; NOT \p{Block=Lepcha}) (74)
3838 \p{Letter} \p{General_Category=Letter} (Short: \p{L})
3839 (136_104)
3840 \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
3841 \p{Nl}) (236)
3842 X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
3843 \p{Limb} \p{Limbu} (= \p{Script_Extensions=Limbu})
3844 (NOT \p{Block=Limbu}) (69)
3845 \p{Limbu} \p{Script_Extensions=Limbu} (Short:
3846 \p{Limb}; NOT \p{Block=Limbu}) (69)
3847 \p{Lina} \p{Linear_A} (= \p{Script_Extensions=
3848 Linear_A}) (NOT \p{Block=Linear_A}) (386)
3849 \p{Linb} \p{Linear_B} (= \p{Script_Extensions=
3850 Linear_B}) (268)
3851 \p{Line_Break: AI} \p{Line_Break=Ambiguous} (707)
3852 \p{Line_Break: AL} \p{Line_Break=Alphabetic} (22_215)
3853 \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (22_215: [#&*<=>\@A-
3854 Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
3855 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
3856 U+02CE..02CF, U+02D1..02D7, U+02DC,
3857 U+02DE ...)
3858 \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-\xa8\xaa
3859 \xb2-\xb3\xb6-\xba\xbc-\xbe\xd7\xf7],
3860 U+02C7, U+02C9..02CB, U+02CD, U+02D0,
3861 U+02D8..02DB ...)
3862 \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
3863 \p{Line_Break: BA} \p{Line_Break=Break_After} (249)
3864 \p{Line_Break: BB} \p{Line_Break=Break_Before} (55)
3865 \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
3866 \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (249: [\t\|\xad],
3867 U+058A, U+05BE, U+0964..0965,
3868 U+0E5A..0E5B, U+0F0B ...)
3869 \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (55: [\xb4],
3870 U+02C8, U+02CC, U+02DF, U+0C77, U+0C84
3871 ...)
3872 \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
3873 U+2E3A..2E3B)
3874 \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
3875 \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
3876 \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
3877 \p{Line_Break: CJ} \p{Line_Break=
3878 Conditional_Japanese_Starter} (60)
3879 \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (97)
3880 \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
3881 \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (97: [\}],
3882 U+0F3B, U+0F3D, U+169C, U+2046, U+207E
3883 ...)
3884 \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (2438)
3885 \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2438: [^\t\n
3886 \cK\f\r\x20-\x7e\x85\xa0-\xff],
3887 U+0300..034E, U+0350..035B,
3888 U+0363..036F, U+0483..0489, U+0591..05BD
3889 ...)
3890 \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (758:
3891 U+0E01..0E3A, U+0E40..0E4E,
3892 U+0E81..0E82, U+0E84, U+0E86..0E8A,
3893 U+0E8C..0EA3 ...)
3894 \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
3895 (60: U+3041, U+3043, U+3045, U+3047,
3896 U+3049, U+3063 ...)
3897 \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
3898 \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
3899 \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
3900 \p{Line_Break: E_Base} (Short: \p{Lb=EB}) (134: U+261D, U+26F9,
3901 U+270A..270D, U+1F385, U+1F3C2..1F3C4,
3902 U+1F3C7 ...)
3903 \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
3904 \p{Line_Break: EB} \p{Line_Break=E_Base} (134)
3905 \p{Line_Break: EM} \p{Line_Break=E_Modifier} (5)
3906 \p{Line_Break: EX} \p{Line_Break=Exclamation} (40)
3907 \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (40: [!?], U+05C6,
3908 U+061B, U+061D..061F, U+06D4, U+07F9 ...)
3909 \p{Line_Break: GL} \p{Line_Break=Glue} (31)
3910 \p{Line_Break: Glue} (Short: \p{Lb=GL}) (31: [\xa0], U+034F,
3911 U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
3912 \p{Line_Break: H2} (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
3913 U+AC38, U+AC54, U+AC70, U+AC8C ...)
3914 \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
3915 U+AC1D..AC37, U+AC39..AC53,
3916 U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
3917 ...)
3918 \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75:
3919 U+05D0..05EA, U+05EF..05F2, U+FB1D,
3920 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
3921 ...)
3922 \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (75)
3923 \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
3924 \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1: [\-])
3925 \p{Line_Break: ID} \p{Line_Break=Ideographic} (172_465)
3926 \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_465:
3927 U+231A..231B, U+23F0..23F3,
3928 U+2600..2603, U+2614..2615, U+2618,
3929 U+261A..261C ...)
3930 \p{Line_Break: IN} \p{Line_Break=Inseparable} (6)
3931 \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
3932 U+037E, U+0589, U+060C..060D, U+07F8,
3933 U+2044 ...)
3934 \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
3935 U+22EF, U+FE19, U+10AF6)
3936 \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
3937 \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
3938 \p{Line_Break: JL} (Short: \p{Lb=JL}) (125: U+1100..115F,
3939 U+A960..A97C)
3940 \p{Line_Break: JT} (Short: \p{Lb=JT}) (137: U+11A8..11FF,
3941 U+D7CB..D7FB)
3942 \p{Line_Break: JV} (Short: \p{Lb=JV}) (95: U+1160..11A7,
3943 U+D7B0..D7C6)
3944 \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
3945 \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
3946 \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
3947 U+2028..2029)
3948 \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
3949 \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
3950 \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6,
3951 U+203C..203D, U+2047..2049, U+3005,
3952 U+301C, U+303B..303C ...)
3953 \p{Line_Break: NS} \p{Line_Break=Nonstarter} (33)
3954 \p{Line_Break: NU} \p{Line_Break=Numeric} (672)
3955 \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (672: [0-9],
3956 U+0660..0669, U+066B..066C,
3957 U+06F0..06F9, U+07C0..07C9, U+0966..096F
3958 ...)
3959 \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (94)
3960 \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (94: [\(\[\{
3961 \xa1\xbf], U+0F3A, U+0F3C, U+169B,
3962 U+201A, U+201E ...)
3963 \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (38)
3964 \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (38: [\%\xa2
3965 \xb0], U+0609..060B, U+066A,
3966 U+09F2..09F3, U+09F9, U+0D79 ...)
3967 \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (67)
3968 \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (67: [\$+\\\xa3-
3969 \xa5\xb1], U+058F, U+07FE..07FF, U+09FB,
3970 U+0AF1, U+0BF9 ...)
3971 \p{Line_Break: QU} \p{Line_Break=Quotation} (39)
3972 \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
3973 U+2018..2019, U+201B..201D, U+201F,
3974 U+2039..203A, U+275B..2760 ...)
3975 \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
3976 U+1F1E6..1F1FF)
3977 \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
3978 \p{Line_Break: SA} \p{Line_Break=Complex_Context} (758)
3979 D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
3980 \p{Line_Break: SP} \p{Line_Break=Space} (1)
3981 \p{Line_Break: Space} (Short: \p{Lb=SP}) (1: [\x20])
3982 D \p{Line_Break: Surrogate} Surrogates should never appear in well-
3983 formed text, and therefore shouldn't be
3984 the basis for line breaking (Short:
3985 \p{Lb=SG}) (2048: U+D800..DFFF)
3986 \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
3987 \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (900_198 plus all
3988 above-Unicode code points: U+0378..0379,
3989 U+0380..0383, U+038B, U+038D, U+03A2,
3990 U+0530 ...)
3991 \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
3992 \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
3993 \p{Line_Break: XX} \p{Line_Break=Unknown} (900_198 plus all
3994 above-Unicode code points)
3995 \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
3996 \p{Line_Break: ZWJ} (Short: \p{Lb=ZWJ}) (1: U+200D)
3997 \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
3998 \p{Line_Separator} \p{General_Category=Line_Separator}
3999 (Short: \p{Zl}) (1)
4000 \p{Linear_A} \p{Script_Extensions=Linear_A} (Short:
4001 \p{Lina}; NOT \p{Block=Linear_A}) (386)
4002 \p{Linear_B} \p{Script_Extensions=Linear_B} (Short:
4003 \p{Linb}) (268)
4004 X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
4005 X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
4006 \p{Lisu} \p{Script_Extensions=Lisu} (NOT \p{Block=
4007 Lisu}) (49)
4008 X \p{Lisu_Sup} \p{Lisu_Supplement} (= \p{Block=
4009 Lisu_Supplement}) (16)
4010 X \p{Lisu_Supplement} \p{Block=Lisu_Supplement} (Short:
4011 \p{InLisuSup}) (16)
4012 \p{Ll} \p{Lowercase_Letter} (=
4013 \p{General_Category=Lowercase_Letter})
4014 (/i= General_Category=Cased_Letter)
4015 (2233)
4016 \p{Lm} \p{Modifier_Letter} (=
4017 \p{General_Category=Modifier_Letter})
4018 (397)
4019 \p{Lo} \p{Other_Letter} (= \p{General_Category=
4020 Other_Letter}) (131_612)
4021 \p{LOE} \p{Logical_Order_Exception} (=
4022 \p{Logical_Order_Exception=Y}) (19)
4023 \p{LOE: *} \p{Logical_Order_Exception: *}
4024 \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
4025 \p{LOE}) (19)
4026 \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
4027 (1_114_093 plus all above-Unicode code
4028 points: U+0000..0E3F, U+0E45..0EBF,
4029 U+0EC5..19B4, U+19B8..19B9,
4030 U+19BB..AAB4, U+AAB7..AAB8 ...)
4031 \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
4032 U+0E40..0E44, U+0EC0..0EC4,
4033 U+19B5..19B7, U+19BA, U+AAB5..AAB6,
4034 U+AAB9 ...)
4035 X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
4036 \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
4037 Cased=Yes) (2544)
4038 \p{Lower: *} \p{Lowercase: *}
4039 \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
4040 Cased=Yes) (2544)
4041 \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
4042 No) (1_111_568 plus all above-Unicode
4043 code points: [\x00-\x20!\"#\$\%&\'
4044 \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`\{
4045 \|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9\xbb-
4046 \xde\xf7], U+0100, U+0102, U+0104,
4047 U+0106, U+0108 ...)
4048 \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
4049 Yes) (2544: [a-z\xaa\xb5\xba\xdf-\xf6
4050 \xf8-\xff], U+0101, U+0103, U+0105,
4051 U+0107, U+0109 ...)
4052 \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
4053 (Short: \p{Ll}; /i= General_Category=
4054 Cased_Letter) (2233)
4055 \p{Lt} \p{Titlecase_Letter} (=
4056 \p{General_Category=Titlecase_Letter})
4057 (/i= General_Category=Cased_Letter) (31)
4058 \p{Lu} \p{Uppercase_Letter} (=
4059 \p{General_Category=Uppercase_Letter})
4060 (/i= General_Category=Cased_Letter)
4061 (1831)
4062 \p{Lyci} \p{Lycian} (= \p{Script_Extensions=
4063 Lycian}) (NOT \p{Block=Lycian}) (29)
4064 \p{Lycian} \p{Script_Extensions=Lycian} (Short:
4065 \p{Lyci}; NOT \p{Block=Lycian}) (29)
4066 \p{Lydi} \p{Lydian} (= \p{Script_Extensions=
4067 Lydian}) (NOT \p{Block=Lydian}) (27)
4068 \p{Lydian} \p{Script_Extensions=Lydian} (Short:
4069 \p{Lydi}; NOT \p{Block=Lydian}) (27)
4070 \p{M} \pM \p{Mark} (= \p{General_Category=Mark})
4071 (2450)
4072 \p{Mahajani} \p{Script_Extensions=Mahajani} (Short:
4073 \p{Mahj}; NOT \p{Block=Mahajani}) (61)
4074 \p{Mahj} \p{Mahajani} (= \p{Script_Extensions=
4075 Mahajani}) (NOT \p{Block=Mahajani}) (61)
4076 X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
4077 Mahjong_Tiles}) (48)
4078 X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
4079 \p{InMahjong}) (48)
4080 \p{Maka} \p{Makasar} (= \p{Script_Extensions=
4081 Makasar}) (NOT \p{Block=Makasar}) (25)
4082 \p{Makasar} \p{Script_Extensions=Makasar} (Short:
4083 \p{Maka}; NOT \p{Block=Makasar}) (25)
4084 \p{Malayalam} \p{Script_Extensions=Malayalam} (Short:
4085 \p{Mlym}; NOT \p{Block=Malayalam}) (126)
4086 \p{Mand} \p{Mandaic} (= \p{Script_Extensions=
4087 Mandaic}) (NOT \p{Block=Mandaic}) (30)
4088 \p{Mandaic} \p{Script_Extensions=Mandaic} (Short:
4089 \p{Mand}; NOT \p{Block=Mandaic}) (30)
4090 \p{Mani} \p{Manichaean} (= \p{Script_Extensions=
4091 Manichaean}) (NOT \p{Block=Manichaean})
4092 (52)
4093 \p{Manichaean} \p{Script_Extensions=Manichaean} (Short:
4094 \p{Mani}; NOT \p{Block=Manichaean}) (52)
4095 \p{Marc} \p{Marchen} (= \p{Script_Extensions=
4096 Marchen}) (NOT \p{Block=Marchen}) (68)
4097 \p{Marchen} \p{Script_Extensions=Marchen} (Short:
4098 \p{Marc}; NOT \p{Block=Marchen}) (68)
4099 \p{Mark} \p{General_Category=Mark} (Short: \p{M})
4100 (2450)
4101 \p{Masaram_Gondi} \p{Script_Extensions=Masaram_Gondi}
4102 (Short: \p{Gonm}; NOT \p{Block=
4103 Masaram_Gondi}) (77)
4104 \p{Math} \p{Math=Y} (2310)
4105 \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all
4106 above-Unicode code points: [\x00-\x20!
4107 \"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-Z
4108 \[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
4109 \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
4110 U+03D3..03D4, U+03D6..03EF,
4111 U+03F2..03F3, U+03F7..0605 ...)
4112 \p{Math: Y*} (Single: \p{Math}) (2310: [+<=>\^\|~\xac
4113 \xb1\xd7\xf7], U+03D0..03D2, U+03D5,
4114 U+03F0..03F1, U+03F4..03F6, U+0606..0608
4115 ...)
4116 X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
4117 \p{Block=
4118 Mathematical_Alphanumeric_Symbols})
4119 (1024)
4120 X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
4121 Mathematical_Operators}) (256)
4122 \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
4123 \p{Sm}) (948)
4124 X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
4125 Mathematical_Alphanumeric_Symbols}
4126 (Short: \p{InMathAlphanum}) (1024)
4127 X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
4128 (Short: \p{InMathOperators}) (256)
4129 X \p{Mayan_Numerals} \p{Block=Mayan_Numerals} (32)
4130 \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
4131 Spacing_Mark}) (452)
4132 \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
4133 Enclosing_Mark}) (13)
4134 \p{Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
4135 \p{Medf}; NOT \p{Block=Medefaidrin}) (91)
4136 \p{Medf} \p{Medefaidrin} (= \p{Script_Extensions=
4137 Medefaidrin}) (NOT \p{Block=
4138 Medefaidrin}) (91)
4139 \p{Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} (Short:
4140 \p{Mtei}; NOT \p{Block=Meetei_Mayek})
4141 (79)
4142 X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
4143 Meetei_Mayek_Extensions}) (32)
4144 X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
4145 (Short: \p{InMeeteiMayekExt}) (32)
4146 \p{Mend} \p{Mende_Kikakui} (= \p{Script_Extensions=
4147 Mende_Kikakui}) (NOT \p{Block=
4148 Mende_Kikakui}) (213)
4149 \p{Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
4150 (Short: \p{Mend}; NOT \p{Block=
4151 Mende_Kikakui}) (213)
4152 \p{Merc} \p{Meroitic_Cursive} (=
4153 \p{Script_Extensions=Meroitic_Cursive})
4154 (NOT \p{Block=Meroitic_Cursive}) (90)
4155 \p{Mero} \p{Meroitic_Hieroglyphs} (=
4156 \p{Script_Extensions=
4157 Meroitic_Hieroglyphs}) (32)
4158 \p{Meroitic_Cursive} \p{Script_Extensions=Meroitic_Cursive}
4159 (Short: \p{Merc}; NOT \p{Block=
4160 Meroitic_Cursive}) (90)
4161 \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
4162 Meroitic_Hieroglyphs} (Short: \p{Mero})
4163 (32)
4164 \p{Miao} \p{Script_Extensions=Miao} (NOT \p{Block=
4165 Miao}) (149)
4166 X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
4167 \p{Block=
4168 Miscellaneous_Symbols_And_Arrows}) (256)
4169 X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
4170 (= \p{Block=
4171 Miscellaneous_Mathematical_Symbols_A})
4172 (48)
4173 X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
4174 (= \p{Block=
4175 Miscellaneous_Mathematical_Symbols_B})
4176 (128)
4177 X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
4178 (= \p{Block=
4179 Miscellaneous_Symbols_And_Pictographs})
4180 (768)
4181 X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
4182 Miscellaneous_Symbols}) (256)
4183 X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
4184 Miscellaneous_Technical}) (256)
4185 X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
4186 Miscellaneous_Mathematical_Symbols_A}
4187 (Short: \p{InMiscMathSymbolsA}) (48)
4188 X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
4189 Miscellaneous_Mathematical_Symbols_B}
4190 (Short: \p{InMiscMathSymbolsB}) (128)
4191 X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
4192 \p{InMiscSymbols}) (256)
4193 X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
4194 Miscellaneous_Symbols_And_Arrows}
4195 (Short: \p{InMiscArrows}) (256)
4196 X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
4197 Miscellaneous_Symbols_And_Pictographs}
4198 (Short: \p{InMiscPictographs}) (768)
4199 X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
4200 (Short: \p{InMiscTechnical}) (256)
4201 \p{Mlym} \p{Malayalam} (= \p{Script_Extensions=
4202 Malayalam}) (NOT \p{Block=Malayalam})
4203 (126)
4204 \p{Mn} \p{Nonspacing_Mark} (=
4205 \p{General_Category=Nonspacing_Mark})
4206 (1985)
4207 \p{Modi} \p{Script_Extensions=Modi} (NOT \p{Block=
4208 Modi}) (89)
4209 \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
4210 (Short: \p{Lm}) (397)
4211 X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
4212 Spacing_Modifier_Letters}) (80)
4213 \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
4214 (Short: \p{Sk}) (125)
4215 X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
4216 \p{Mong} \p{Mongolian} (= \p{Script_Extensions=
4217 Mongolian}) (NOT \p{Block=Mongolian})
4218 (172)
4219 \p{Mongolian} \p{Script_Extensions=Mongolian} (Short:
4220 \p{Mong}; NOT \p{Block=Mongolian}) (172)
4221 X \p{Mongolian_Sup} \p{Mongolian_Supplement} (= \p{Block=
4222 Mongolian_Supplement}) (32)
4223 X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
4224 \p{InMongolianSup}) (32)
4225 \p{Mro} \p{Script_Extensions=Mro} (NOT \p{Block=
4226 Mro}) (43)
4227 \p{Mroo} \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
4228 \p{Block=Mro}) (43)
4229 \p{Mtei} \p{Meetei_Mayek} (= \p{Script_Extensions=
4230 Meetei_Mayek}) (NOT \p{Block=
4231 Meetei_Mayek}) (79)
4232 \p{Mult} \p{Multani} (= \p{Script_Extensions=
4233 Multani}) (NOT \p{Block=Multani}) (48)
4234 \p{Multani} \p{Script_Extensions=Multani} (Short:
4235 \p{Mult}; NOT \p{Block=Multani}) (48)
4236 X \p{Music} \p{Musical_Symbols} (= \p{Block=
4237 Musical_Symbols}) (256)
4238 X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
4239 \p{InMusic}) (256)
4240 \p{Myanmar} \p{Script_Extensions=Myanmar} (Short:
4241 \p{Mymr}; NOT \p{Block=Myanmar}) (224)
4242 X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
4243 Myanmar_Extended_A}) (32)
4244 X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block=
4245 Myanmar_Extended_B}) (32)
4246 X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
4247 \p{InMyanmarExtA}) (32)
4248 X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short:
4249 \p{InMyanmarExtB}) (32)
4250 \p{Mymr} \p{Myanmar} (= \p{Script_Extensions=
4251 Myanmar}) (NOT \p{Block=Myanmar}) (224)
4252 \p{N} \pN \p{Number} (= \p{General_Category=Number})
4253 (1831)
4254 \p{Na=*} \p{Name=*}
4255 \p{Nabataean} \p{Script_Extensions=Nabataean} (Short:
4256 \p{Nbat}; NOT \p{Block=Nabataean}) (40)
4257 \p{Nag_Mundari} \p{Script_Extensions=Nag_Mundari} (Short:
4258 \p{Nagm}; NOT \p{Block=Nag_Mundari}) (42)
4259 \p{Nagm} \p{Nag_Mundari} (= \p{Script_Extensions=
4260 Nag_Mundari}) (NOT \p{Block=
4261 Nag_Mundari}) (42)
4262 \p{Name=*} Combination of Name and Name_Alias
4263 properties; has special loose matching
4264 rules, for which see Unicode UAX #44
4265 \p{Nand} \p{Nandinagari} (= \p{Script_Extensions=
4266 Nandinagari}) (NOT \p{Block=
4267 Nandinagari}) (86)
4268 \p{Nandinagari} \p{Script_Extensions=Nandinagari} (Short:
4269 \p{Nand}; NOT \p{Block=Nandinagari}) (86)
4270 \p{Narb} \p{Old_North_Arabian} (=
4271 \p{Script_Extensions=Old_North_Arabian})
4272 (32)
4273 X \p{NB} \p{No_Block} (= \p{Block=No_Block})
4274 (820_944 plus all above-Unicode code
4275 points)
4276 \p{Nbat} \p{Nabataean} (= \p{Script_Extensions=
4277 Nabataean}) (NOT \p{Block=Nabataean})
4278 (40)
4279 \p{NChar} \p{Noncharacter_Code_Point} (=
4280 \p{Noncharacter_Code_Point=Y}) (66)
4281 \p{NChar: *} \p{Noncharacter_Code_Point: *}
4282 \p{Nd} \p{XPosixDigit} (= \p{General_Category=
4283 Decimal_Number}) (680)
4284 \p{New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
4285 \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
4286 \p{Newa} \p{Script_Extensions=Newa} (NOT \p{Block=
4287 Newa}) (97)
4288 \p{NFC_QC: *} \p{NFC_Quick_Check: *}
4289 \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (111)
4290 \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (111:
4291 U+0300..0304, U+0306..030C, U+030F,
4292 U+0311, U+0313..0314, U+031B ...)
4293 \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
4294 \P{NFC_Quick_Check} NOR \P{NFC_QC})
4295 (1120)
4296 \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
4297 \P{NFC_Quick_Check} NOR \P{NFC_QC})
4298 (1120: U+0340..0341, U+0343..0344,
4299 U+0374, U+037E, U+0387, U+0958..095F ...)
4300 \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
4301 \p{NFC_Quick_Check} NOR \p{NFC_QC})
4302 (1_112_881 plus all above-Unicode code
4303 points)
4304 \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
4305 \p{NFC_Quick_Check} NOR \p{NFC_QC})
4306 (1_112_881 plus all above-Unicode code
4307 points: U+0000..02FF, U+0305,
4308 U+030D..030E, U+0310, U+0312,
4309 U+0315..031A ...)
4310 \p{NFD_QC: *} \p{NFD_Quick_Check: *}
4311 \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
4312 \P{NFD_Quick_Check} NOR \P{NFD_QC})
4313 (13_233)
4314 \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
4315 \P{NFD_Quick_Check} NOR \P{NFD_QC})
4316 (13_233: [\xc0-\xc5\xc7-\xcf\xd1-\xd6
4317 \xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-\xf6
4318 \xf9-\xfd\xff], U+0100..010F,
4319 U+0112..0125, U+0128..0130,
4320 U+0134..0137, U+0139..013E ...)
4321 \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
4322 \p{NFD_Quick_Check} NOR \p{NFD_QC})
4323 (1_100_879 plus all above-Unicode code
4324 points)
4325 \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
4326 \p{NFD_Quick_Check} NOR \p{NFD_QC})
4327 (1_100_879 plus all above-Unicode code
4328 points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
4329 \xdf\xe6\xf0\xf7-\xf8\xfe],
4330 U+0110..0111, U+0126..0127,
4331 U+0131..0133, U+0138, U+013F..0142 ...)
4332 \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
4333 \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (111)
4334 \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (111:
4335 U+0300..0304, U+0306..030C, U+030F,
4336 U+0311, U+0313..0314, U+031B ...)
4337 \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
4338 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4339 (4928)
4340 \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
4341 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4342 (4928: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4343 \xba\xbc-\xbe], U+0132..0133,
4344 U+013F..0140, U+0149, U+017F,
4345 U+01C4..01CC ...)
4346 \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
4347 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4348 (1_109_073 plus all above-Unicode code
4349 points)
4350 \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
4351 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4352 (1_109_073 plus all above-Unicode code
4353 points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4354 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
4355 U+0100..0131, U+0134..013E,
4356 U+0141..0148, U+014A..017E, U+0180..01C3
4357 ...)
4358 \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
4359 \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
4360 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4361 (17_029)
4362 \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
4363 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4364 (17_029: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4365 \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
4366 \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
4367 \xf6\xf9-\xfd\xff], U+0100..010F,
4368 U+0112..0125, U+0128..0130,
4369 U+0132..0137, U+0139..0140 ...)
4370 \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
4371 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4372 (1_097_083 plus all above-Unicode code
4373 points)
4374 \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
4375 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4376 (1_097_083 plus all above-Unicode code
4377 points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4378 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xc6\xd0
4379 \xd7-\xd8\xde-\xdf\xe6\xf0\xf7-\xf8
4380 \xfe], U+0110..0111, U+0126..0127,
4381 U+0131, U+0138, U+0141..0142 ...)
4382 \p{Nko} \p{Script_Extensions=Nko} (NOT \p{Block=
4383 NKo}) (67)
4384 \p{Nkoo} \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
4385 \p{Block=NKo}) (67)
4386 \p{Nl} \p{Letter_Number} (= \p{General_Category=
4387 Letter_Number}) (236)
4388 \p{No} \p{Other_Number} (= \p{General_Category=
4389 Other_Number}) (915)
4390 X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
4391 (820_944 plus all above-Unicode code
4392 points)
4393 \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
4394 \p{NChar}) (66)
4395 \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
4396 (1_114_046 plus all above-Unicode code
4397 points: U+0000..FDCF, U+FDF0..FFFD,
4398 U+10000..1FFFD, U+20000..2FFFD,
4399 U+30000..3FFFD, U+40000..4FFFD ...)
4400 \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
4401 (66: U+FDD0..FDEF, U+FFFE..FFFF,
4402 U+1FFFE..1FFFF, U+2FFFE..2FFFF,
4403 U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
4404 \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
4405 (Short: \p{Mn}) (1985)
4406 \p{Nshu} \p{Nushu} (= \p{Script_Extensions=Nushu})
4407 (NOT \p{Block=Nushu}) (397)
4408 \p{Nt: *} \p{Numeric_Type: *}
4409 \p{Number} \p{General_Category=Number} (Short: \p{N})
4410 (1831)
4411 X \p{Number_Forms} \p{Block=Number_Forms} (64)
4412 \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (680)
4413 \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (680: [0-9],
4414 U+0660..0669, U+06F0..06F9,
4415 U+07C0..07C9, U+0966..096F, U+09E6..09EF
4416 ...)
4417 \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
4418 \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
4419 U+1369..1371, U+19DA, U+2070,
4420 U+2074..2079, U+2080..2089 ...)
4421 \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_200 plus all
4422 above-Unicode code points: [\x00-\x20!
4423 \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4424 \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4425 \xbb\xbf-\xff], U+0100..065F,
4426 U+066A..06EF, U+06FA..07BF,
4427 U+07CA..0965, U+0970..09E5 ...)
4428 \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (1104)
4429 \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1104: [\xbc-\xbe],
4430 U+09F4..09F9, U+0B72..0B77,
4431 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
4432 ...)
4433 T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
4434 T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (87: [0], U+0660,
4435 U+06F0, U+07C0, U+0966, U+09E6 ...)
4436 T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
4437 U+11FD4)
4438 T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
4439 T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
4440 T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
4441 T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
4442 T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
4443 T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
4444 T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
4445 T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
4446 T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
4447 U+0D76, U+A833, U+11FC9..11FCA)
4448 T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
4449 T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
4450 U+11FCB)
4451 T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1: U+2151)
4452 T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
4453 U+0D77, U+215B, U+A834, U+11FCC ...)
4454 T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1: U+2150)
4455 T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
4456 T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
4457 U+12461, U+1ED3D)
4458 T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
4459 U+0D78, U+A835, U+11FCE)
4460 T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
4461 U+11FCF)
4462 T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
4463 U+0B72, U+0D73, U+A830, U+10140 ...)
4464 T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
4465 U+10E7D, U+1245A, U+1245D, U+12465)
4466 T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1: U+215C)
4467 T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1: U+2156)
4468 T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
4469 T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
4470 U+0D74, U+0F2A, U+2CFD, U+A831 ...)
4471 T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
4472 T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1: U+2157)
4473 T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1: U+215D)
4474 T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
4475 U+109FD, U+10E7E, U+1245B, U+1245E ...)
4476 T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
4477 U+0B74, U+0D75, U+A832, U+10178 ...)
4478 T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1: U+2158)
4479 T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
4480 U+1245C)
4481 T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1: U+215E)
4482 T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
4483 T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (144: [1\xb9], U+0661,
4484 U+06F1, U+07C1, U+0967, U+09E7 ...)
4485 T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1: U+0F2B)
4486 T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (143: [2\xb2], U+0662,
4487 U+06F2, U+07C2, U+0968, U+09E8 ...)
4488 T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1: U+0F2C)
4489 T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (144: [3\xb3], U+0663,
4490 U+06F3, U+07C3, U+0969, U+09E9 ...)
4491 T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1: U+0F2D)
4492 T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (135: [4], U+0664,
4493 U+06F4, U+07C4, U+096A, U+09EA ...)
4494 T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1: U+0F2E)
4495 T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (133: [5], U+0665,
4496 U+06F5, U+07C5, U+096B, U+09EB ...)
4497 T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
4498 T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (117: [6], U+0666,
4499 U+06F6, U+07C6, U+096C, U+09EC ...)
4500 T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
4501 T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (116: [7], U+0667,
4502 U+06F7, U+07C7, U+096D, U+09ED ...)
4503 T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
4504 T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (112: [8], U+0668,
4505 U+06F8, U+07C8, U+096E, U+09EE ...)
4506 T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
4507 T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (116: [9], U+0669,
4508 U+06F9, U+07C9, U+096F, U+09EF ...)
4509 T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (63: U+0BF0, U+0D70,
4510 U+1372, U+2169, U+2179, U+2469 ...)
4511 T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (9: U+216A, U+217A,
4512 U+246A, U+247E, U+2492, U+24EB ...)
4513 T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (9: U+216B, U+217B,
4514 U+246B, U+247F, U+2493, U+24EC ...)
4515 T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (7: U+246C, U+2480,
4516 U+2494, U+24ED, U+16E8D, U+1D2CD ...)
4517 T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (7: U+246D, U+2481,
4518 U+2495, U+24EE, U+16E8E, U+1D2CE ...)
4519 T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (7: U+246E, U+2482,
4520 U+2496, U+24EF, U+16E8F, U+1D2CF ...)
4521 T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (8: U+09F9, U+246F,
4522 U+2483, U+2497, U+24F0, U+16E90 ...)
4523 T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (8: U+16EE, U+2470,
4524 U+2484, U+2498, U+24F1, U+16E91 ...)
4525 T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (8: U+16EF, U+2471,
4526 U+2485, U+2499, U+24F2, U+16E92 ...)
4527 T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (8: U+16F0, U+2472,
4528 U+2486, U+249A, U+24F3, U+16E93 ...)
4529 T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (36: U+1373, U+2473,
4530 U+2487, U+249B, U+24F4, U+3039 ...)
4531 T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1: U+3251)
4532 T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1: U+3252)
4533 T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1: U+3253)
4534 T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1: U+3254)
4535 T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1: U+3255)
4536 T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1: U+3256)
4537 T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1: U+3257)
4538 T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1: U+3258)
4539 T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1: U+3259)
4540 T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (19: U+1374, U+303A,
4541 U+324A, U+325A, U+5345, U+10112 ...)
4542 T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1: U+325B)
4543 T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1: U+325C)
4544 T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1: U+325D)
4545 T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1: U+325E)
4546 T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1: U+325F)
4547 T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1: U+32B1)
4548 T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1: U+32B2)
4549 T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1: U+32B3)
4550 T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1: U+32B4)
4551 T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (18: U+1375, U+324B,
4552 U+32B5, U+534C, U+10113, U+102ED ...)
4553 T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1: U+32B6)
4554 T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1: U+32B7)
4555 T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1: U+32B8)
4556 T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1: U+32B9)
4557 T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1: U+32BA)
4558 T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1: U+32BB)
4559 T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1: U+32BC)
4560 T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1: U+32BD)
4561 T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1: U+32BE)
4562 T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (29: U+1376, U+216C,
4563 U+217C, U+2186, U+324C, U+32BF ...)
4564 T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (13: U+1377, U+324D,
4565 U+10115, U+102EF, U+109CE, U+10E6E ...)
4566 T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (13: U+1378, U+324E,
4567 U+10116, U+102F0, U+109CF, U+10E6F ...)
4568 T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (12: U+1379, U+324F,
4569 U+10117, U+102F1, U+10E70, U+11062 ...)
4570 T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (12: U+137A, U+10118,
4571 U+102F2, U+10341, U+10E71, U+11063 ...)
4572 T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (35: U+0BF1, U+0D71,
4573 U+137B, U+216D, U+217D, U+4F70 ...)
4574 T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
4575 U+109D3, U+10E73, U+1EC84, U+1ED14)
4576 T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
4577 U+102F5, U+109D4, U+10E74, U+1EC85 ...)
4578 T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
4579 U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
4580 T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (16: U+216E, U+217E,
4581 U+1011D, U+10145, U+1014C, U+10153 ...)
4582 T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
4583 U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
4584 T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
4585 U+109D8, U+10E78, U+1EC89, U+1ED19)
4586 T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (6: U+10120, U+102FA,
4587 U+109D9, U+10E79, U+1EC8A, U+1ED1A)
4588 T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (7: U+10121, U+102FB,
4589 U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
4590 T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
4591 U+216F, U+217F..2180, U+4EDF, U+5343 ...)
4592 T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
4593 U+1EC8D, U+1ED1D, U+1ED3A)
4594 T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
4595 U+1EC8E, U+1ED1E)
4596 T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
4597 U+1EC8F, U+1ED1F)
4598 T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
4599 U+10146, U+1014E, U+10172, U+109DF ...)
4600 T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
4601 U+1EC91, U+1ED21)
4602 T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
4603 U+1EC92, U+1ED22)
4604 T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
4605 U+1EC93, U+1ED23)
4606 T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
4607 U+1EC94, U+1ED24)
4608 T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
4609 U+137C, U+2182, U+4E07, U+842C, U+1012B,
4610 U+10155 ...)
4611 T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
4612 U+1012C, U+109E5, U+1EC96, U+1ED26)
4613 T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
4614 U+1012D, U+109E6, U+1EC97, U+1ED27)
4615 T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
4616 U+1012E, U+109E7, U+1EC98, U+1ED28)
4617 T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
4618 U+2187, U+1012F, U+10147, U+10156,
4619 U+109E8, U+1EC99 ...)
4620 T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
4621 U+10130, U+109E9, U+1EC9A, U+1ED2A)
4622 T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
4623 U+10131, U+109EA, U+1EC9B, U+1ED2B)
4624 T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
4625 U+10132, U+109EB, U+1EC9C, U+1ED2C)
4626 T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
4627 U+10133, U+109EC, U+1EC9D, U+1ED2D)
4628 T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
4629 U+2188, U+109ED, U+1EC9E, U+1ECA0,
4630 U+1ECB4)
4631 T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
4632 U+109EE, U+1EC9F)
4633 T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
4634 U+12432)
4635 T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
4636 U+109EF)
4637 T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
4638 U+109F0)
4639 T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
4640 U+12433)
4641 T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
4642 U+109F1)
4643 T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
4644 U+109F2)
4645 T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
4646 U+109F3)
4647 T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
4648 U+109F4)
4649 T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
4650 U+109F5)
4651 T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
4652 U+16B5E)
4653 T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
4654 (1: U+1ECA1)
4655 T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
4656 (1: U+1ECA2)
4657 T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
4658 (3: U+4EBF, U+5104, U+16B5F)
4659 T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
4660 10000000000}) (1: U+16B60)
4661 T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
4662 1000000000000}) (2: U+5146, U+16B61)
4663 \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_200 plus all
4664 above-Unicode code points: [\x00-\x20!
4665 \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4666 \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4667 \xbb\xbf-\xff], U+0100..065F,
4668 U+066A..06EF, U+06FA..07BF,
4669 U+07CA..0965, U+0970..09E5 ...)
4670 \p{Nushu} \p{Script_Extensions=Nushu} (Short:
4671 \p{Nshu}; NOT \p{Block=Nushu}) (397)
4672 \p{Nv: *} \p{Numeric_Value: *}
4673 \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
4674 Nyiakeng_Puachue_Hmong} (Short:
4675 \p{Hmnp}; NOT \p{Block=
4676 Nyiakeng_Puachue_Hmong}) (71)
4677 X \p{OCR} \p{Optical_Character_Recognition} (=
4678 \p{Block=Optical_Character_Recognition})
4679 (32)
4680 \p{Ogam} \p{Ogham} (= \p{Script_Extensions=Ogham})
4681 (NOT \p{Block=Ogham}) (29)
4682 \p{Ogham} \p{Script_Extensions=Ogham} (Short:
4683 \p{Ogam}; NOT \p{Block=Ogham}) (29)
4684 \p{Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
4685 \p{Olck}) (48)
4686 \p{Olck} \p{Ol_Chiki} (= \p{Script_Extensions=
4687 Ol_Chiki}) (48)
4688 \p{Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
4689 (Short: \p{Hung}; NOT \p{Block=
4690 Old_Hungarian}) (108)
4691 \p{Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
4692 \p{Ital}; NOT \p{Block=Old_Italic}) (39)
4693 \p{Old_North_Arabian} \p{Script_Extensions=Old_North_Arabian}
4694 (Short: \p{Narb}) (32)
4695 \p{Old_Permic} \p{Script_Extensions=Old_Permic} (Short:
4696 \p{Perm}; NOT \p{Block=Old_Permic}) (44)
4697 \p{Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
4698 \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
4699 \p{Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
4700 \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
4701 \p{Old_South_Arabian} \p{Script_Extensions=Old_South_Arabian}
4702 (Short: \p{Sarb}) (32)
4703 \p{Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
4704 \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
4705 \p{Old_Uyghur} \p{Script_Extensions=Old_Uyghur} (Short:
4706 \p{Ougr}; NOT \p{Block=Old_Uyghur}) (28)
4707 \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
4708 (Short: \p{Ps}) (79)
4709 X \p{Optical_Character_Recognition} \p{Block=
4710 Optical_Character_Recognition} (Short:
4711 \p{InOCR}) (32)
4712 \p{Oriya} \p{Script_Extensions=Oriya} (Short:
4713 \p{Orya}; NOT \p{Block=Oriya}) (97)
4714 \p{Orkh} \p{Old_Turkic} (= \p{Script_Extensions=
4715 Old_Turkic}) (NOT \p{Block=Old_Turkic})
4716 (73)
4717 X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
4718 \p{Orya} \p{Oriya} (= \p{Script_Extensions=Oriya})
4719 (NOT \p{Block=Oriya}) (97)
4720 \p{Osage} \p{Script_Extensions=Osage} (Short:
4721 \p{Osge}; NOT \p{Block=Osage}) (72)
4722 \p{Osge} \p{Osage} (= \p{Script_Extensions=Osage})
4723 (NOT \p{Block=Osage}) (72)
4724 \p{Osma} \p{Osmanya} (= \p{Script_Extensions=
4725 Osmanya}) (NOT \p{Block=Osmanya}) (40)
4726 \p{Osmanya} \p{Script_Extensions=Osmanya} (Short:
4727 \p{Osma}; NOT \p{Block=Osmanya}) (40)
4728 \p{Other} \p{General_Category=Other} (Short: \p{C})
4729 (965_096 plus all above-Unicode code
4730 points)
4731 \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
4732 \p{Lo}) (131_612)
4733 \p{Other_Number} \p{General_Category=Other_Number} (Short:
4734 \p{No}) (915)
4735 \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
4736 (Short: \p{Po}) (628)
4737 \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
4738 \p{So}) (6634)
4739 X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
4740 \p{Ougr} \p{Old_Uyghur} (= \p{Script_Extensions=
4741 Old_Uyghur}) (NOT \p{Block=Old_Uyghur})
4742 (28)
4743 \p{P} \pP \p{Punct} (= \p{General_Category=
4744 Punctuation}) (NOT
4745 \p{General_Punctuation}) (842)
4746 \p{Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} (Short:
4747 \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
4748 (127)
4749 \p{Palm} \p{Palmyrene} (= \p{Script_Extensions=
4750 Palmyrene}) (32)
4751 \p{Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
4752 \p{Palm}) (32)
4753 \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
4754 (Short: \p{Zp}) (1)
4755 \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
4756 Y}) (2760)
4757 \p{Pat_Syn: *} \p{Pattern_Syntax: *}
4758 \p{Pat_WS} \p{Pattern_White_Space} (=
4759 \p{Pattern_White_Space=Y}) (11)
4760 \p{Pat_WS: *} \p{Pattern_White_Space: *}
4761 \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
4762 (2760)
4763 \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
4764 (1_111_352 plus all above-Unicode code
4765 points: [\x00-\x200-9A-Z_a-z\x7f-\xa0
4766 \xa8\xaa\xad\xaf\xb2-\xb5\xb7-\xba\xbc-
4767 \xbe\xc0-\xd6\xd8-\xf6\xf8-\xff],
4768 U+0100..200F, U+2028..202F,
4769 U+203F..2040, U+2054, U+205F..218F ...)
4770 \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
4771 [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
4772 \^`\{\|\}~\xa1-\xa7\xa9\xab-\xac\xae
4773 \xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
4774 U+2010..2027, U+2030..203E,
4775 U+2041..2053, U+2055..205E, U+2190..245F
4776 ...)
4777 \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
4778 \p{PatWS}) (11)
4779 \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
4780 (1_114_101 plus all above-Unicode code
4781 points: [^\t\n\cK\f\r\x20\x85],
4782 U+0100..200D, U+2010..2027,
4783 U+202A..infinity)
4784 \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
4785 [\t\n\cK\f\r\x20\x85], U+200E..200F,
4786 U+2028..2029)
4787 \p{Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
4788 \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
4789 \p{Pauc} \p{Pau_Cin_Hau} (= \p{Script_Extensions=
4790 Pau_Cin_Hau}) (NOT \p{Block=
4791 Pau_Cin_Hau}) (57)
4792 \p{Pc} \p{Connector_Punctuation} (=
4793 \p{General_Category=
4794 Connector_Punctuation}) (10)
4795 \p{PCM} \p{Prepended_Concatenation_Mark} (=
4796 \p{Prepended_Concatenation_Mark=Y}) (13)
4797 \p{PCM: *} \p{Prepended_Concatenation_Mark: *}
4798 \p{Pd} \p{Dash_Punctuation} (=
4799 \p{General_Category=Dash_Punctuation})
4800 (26)
4801 \p{Pe} \p{Close_Punctuation} (=
4802 \p{General_Category=Close_Punctuation})
4803 (77)
4804 \p{PerlSpace} \p{PosixSpace} (6)
4805 \p{PerlWord} \p{PosixWord} (63)
4806 \p{Perm} \p{Old_Permic} (= \p{Script_Extensions=
4807 Old_Permic}) (NOT \p{Block=Old_Permic})
4808 (44)
4809 \p{Pf} \p{Final_Punctuation} (=
4810 \p{General_Category=Final_Punctuation})
4811 (10)
4812 \p{Phag} \p{Phags_Pa} (= \p{Script_Extensions=
4813 Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
4814 \p{Phags_Pa} \p{Script_Extensions=Phags_Pa} (Short:
4815 \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
4816 X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
4817 Phaistos_Disc}) (48)
4818 X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
4819 \p{InPhaistos}) (48)
4820 \p{Phli} \p{Inscriptional_Pahlavi} (=
4821 \p{Script_Extensions=
4822 Inscriptional_Pahlavi}) (NOT \p{Block=
4823 Inscriptional_Pahlavi}) (27)
4824 \p{Phlp} \p{Psalter_Pahlavi} (=
4825 \p{Script_Extensions=Psalter_Pahlavi})
4826 (NOT \p{Block=Psalter_Pahlavi}) (30)
4827 \p{Phnx} \p{Phoenician} (= \p{Script_Extensions=
4828 Phoenician}) (NOT \p{Block=Phoenician})
4829 (29)
4830 \p{Phoenician} \p{Script_Extensions=Phoenician} (Short:
4831 \p{Phnx}; NOT \p{Block=Phoenician}) (29)
4832 X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
4833 Phonetic_Extensions}) (128)
4834 X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
4835 \p{Block=
4836 Phonetic_Extensions_Supplement}) (64)
4837 X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
4838 \p{InPhoneticExt}) (128)
4839 X \p{Phonetic_Extensions_Supplement} \p{Block=
4840 Phonetic_Extensions_Supplement} (Short:
4841 \p{InPhoneticExtSup}) (64)
4842 \p{Pi} \p{Initial_Punctuation} (=
4843 \p{General_Category=
4844 Initial_Punctuation}) (12)
4845 X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
4846 \p{Plrd} \p{Miao} (= \p{Script_Extensions=Miao})
4847 (NOT \p{Block=Miao}) (149)
4848 \p{Po} \p{Other_Punctuation} (=
4849 \p{General_Category=Other_Punctuation})
4850 (628)
4851 \p{PosixAlnum} (62: [0-9A-Za-z])
4852 \p{PosixAlpha} (52: [A-Za-z])
4853 \p{PosixBlank} (2: [\t\x20])
4854 \p{PosixCntrl} ASCII control characters (33: ACK, BEL,
4855 BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
4856 DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
4857 FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
4858 SOH, STX, SUB, SYN, US, VT)
4859 \p{PosixDigit} (10: [0-9])
4860 \p{PosixGraph} (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
4861 Z\[\\\]\^_`a-z\{\|\}~])
4862 \p{PosixLower} (/i= PosixAlpha) (26: [a-z])
4863 \p{PosixPrint} (95: [\x20-\x7e])
4864 \p{PosixPunct} (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
4865 \[\\\]\^_`\{\|\}~])
4866 \p{PosixSpace} (Short: \p{PerlSpace}) (6: [\t\n\cK\f\r
4867 \x20])
4868 \p{PosixUpper} (/i= PosixAlpha) (26: [A-Z])
4869 \p{PosixWord} \w, restricted to ASCII (Short:
4870 \p{PerlWord}) (63: [0-9A-Z_a-z])
4871 \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
4872 (22)
4873 \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
4874 Y} (Short: \p{PCM}) (13)
4875 \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
4876 (1_114_099 plus all above-Unicode code
4877 points: U+0000..05FF, U+0606..06DC,
4878 U+06DE..070E, U+0710..088F,
4879 U+0892..08E1, U+08E3..110BC ...)
4880 \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
4881 (13: U+0600..0605, U+06DD, U+070F,
4882 U+0890..0891, U+08E2, U+110BD ...)
4883 T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
4884 extension) (33_979)
4885 \p{Present_In: V1_1} \p{Present_In=1.1} (= \p{Age=V1_1}) (Perl
4886 extension) (33_979)
4887 T \p{Present_In: 2.0} Code point's usage introduced in version
4888 2.0 or earlier (Short: \p{In=2.0}) (Perl
4889 extension) (178_500: U+0000..01F5,
4890 U+01FA..0217, U+0250..02A8,
4891 U+02B0..02DE, U+02E0..02E9, U+0300..0345
4892 ...)
4893 \p{Present_In: V2_0} \p{Present_In=2.0} (Perl extension)
4894 (178_500)
4895 T \p{Present_In: 2.1} Code point's usage introduced in version
4896 2.1 or earlier (Short: \p{In=2.1}) (Perl
4897 extension) (178_502: U+0000..01F5,
4898 U+01FA..0217, U+0250..02A8,
4899 U+02B0..02DE, U+02E0..02E9, U+0300..0345
4900 ...)
4901 \p{Present_In: V2_1} \p{Present_In=2.1} (Perl extension)
4902 (178_502)
4903 T \p{Present_In: 3.0} Code point's usage introduced in version
4904 3.0 or earlier (Short: \p{In=3.0}) (Perl
4905 extension) (188_809: U+0000..021F,
4906 U+0222..0233, U+0250..02AD,
4907 U+02B0..02EE, U+0300..034E, U+0360..0362
4908 ...)
4909 \p{Present_In: V3_0} \p{Present_In=3.0} (Perl extension)
4910 (188_809)
4911 T \p{Present_In: 3.1} Code point's usage introduced in version
4912 3.1 or earlier (Short: \p{In=3.1}) (Perl
4913 extension) (233_787: U+0000..021F,
4914 U+0222..0233, U+0250..02AD,
4915 U+02B0..02EE, U+0300..034E, U+0360..0362
4916 ...)
4917 \p{Present_In: V3_1} \p{Present_In=3.1} (Perl extension)
4918 (233_787)
4919 T \p{Present_In: 3.2} Code point's usage introduced in version
4920 3.2 or earlier (Short: \p{In=3.2}) (Perl
4921 extension) (234_803: U+0000..0220,
4922 U+0222..0233, U+0250..02AD,
4923 U+02B0..02EE, U+0300..034F, U+0360..036F
4924 ...)
4925 \p{Present_In: V3_2} \p{Present_In=3.2} (Perl extension)
4926 (234_803)
4927 T \p{Present_In: 4.0} Code point's usage introduced in version
4928 4.0 or earlier (Short: \p{In=4.0}) (Perl
4929 extension) (236_029: U+0000..0236,
4930 U+0250..0357, U+035D..036F,
4931 U+0374..0375, U+037A, U+037E ...)
4932 \p{Present_In: V4_0} \p{Present_In=4.0} (Perl extension)
4933 (236_029)
4934 T \p{Present_In: 4.1} Code point's usage introduced in version
4935 4.1 or earlier (Short: \p{In=4.1}) (Perl
4936 extension) (237_302: U+0000..0241,
4937 U+0250..036F, U+0374..0375, U+037A,
4938 U+037E, U+0384..038A ...)
4939 \p{Present_In: V4_1} \p{Present_In=4.1} (Perl extension)
4940 (237_302)
4941 T \p{Present_In: 5.0} Code point's usage introduced in version
4942 5.0 or earlier (Short: \p{In=5.0}) (Perl
4943 extension) (238_671: U+0000..036F,
4944 U+0374..0375, U+037A..037E,
4945 U+0384..038A, U+038C, U+038E..03A1 ...)
4946 \p{Present_In: V5_0} \p{Present_In=5.0} (Perl extension)
4947 (238_671)
4948 T \p{Present_In: 5.1} Code point's usage introduced in version
4949 5.1 or earlier (Short: \p{In=5.1}) (Perl
4950 extension) (240_295: U+0000..0377,
4951 U+037A..037E, U+0384..038A, U+038C,
4952 U+038E..03A1, U+03A3..0523 ...)
4953 \p{Present_In: V5_1} \p{Present_In=5.1} (Perl extension)
4954 (240_295)
4955 T \p{Present_In: 5.2} Code point's usage introduced in version
4956 5.2 or earlier (Short: \p{In=5.2}) (Perl
4957 extension) (246_943: U+0000..0377,
4958 U+037A..037E, U+0384..038A, U+038C,
4959 U+038E..03A1, U+03A3..0525 ...)
4960 \p{Present_In: V5_2} \p{Present_In=5.2} (Perl extension)
4961 (246_943)
4962 T \p{Present_In: 6.0} Code point's usage introduced in version
4963 6.0 or earlier (Short: \p{In=6.0}) (Perl
4964 extension) (249_031: U+0000..0377,
4965 U+037A..037E, U+0384..038A, U+038C,
4966 U+038E..03A1, U+03A3..0527 ...)
4967 \p{Present_In: V6_0} \p{Present_In=6.0} (Perl extension)
4968 (249_031)
4969 T \p{Present_In: 6.1} Code point's usage introduced in version
4970 6.1 or earlier (Short: \p{In=6.1}) (Perl
4971 extension) (249_763: U+0000..0377,
4972 U+037A..037E, U+0384..038A, U+038C,
4973 U+038E..03A1, U+03A3..0527 ...)
4974 \p{Present_In: V6_1} \p{Present_In=6.1} (Perl extension)
4975 (249_763)
4976 T \p{Present_In: 6.2} Code point's usage introduced in version
4977 6.2 or earlier (Short: \p{In=6.2}) (Perl
4978 extension) (249_764: U+0000..0377,
4979 U+037A..037E, U+0384..038A, U+038C,
4980 U+038E..03A1, U+03A3..0527 ...)
4981 \p{Present_In: V6_2} \p{Present_In=6.2} (Perl extension)
4982 (249_764)
4983 T \p{Present_In: 6.3} Code point's usage introduced in version
4984 6.3 or earlier (Short: \p{In=6.3}) (Perl
4985 extension) (249_769: U+0000..0377,
4986 U+037A..037E, U+0384..038A, U+038C,
4987 U+038E..03A1, U+03A3..0527 ...)
4988 \p{Present_In: V6_3} \p{Present_In=6.3} (Perl extension)
4989 (249_769)
4990 T \p{Present_In: 7.0} Code point's usage introduced in version
4991 7.0 or earlier (Short: \p{In=7.0}) (Perl
4992 extension) (252_603: U+0000..0377,
4993 U+037A..037F, U+0384..038A, U+038C,
4994 U+038E..03A1, U+03A3..052F ...)
4995 \p{Present_In: V7_0} \p{Present_In=7.0} (Perl extension)
4996 (252_603)
4997 T \p{Present_In: 8.0} Code point's usage introduced in version
4998 8.0 or earlier (Short: \p{In=8.0}) (Perl
4999 extension) (260_319: U+0000..0377,
5000 U+037A..037F, U+0384..038A, U+038C,
5001 U+038E..03A1, U+03A3..052F ...)
5002 \p{Present_In: V8_0} \p{Present_In=8.0} (Perl extension)
5003 (260_319)
5004 T \p{Present_In: 9.0} Code point's usage introduced in version
5005 9.0 or earlier (Short: \p{In=9.0}) (Perl
5006 extension) (267_819: U+0000..0377,
5007 U+037A..037F, U+0384..038A, U+038C,
5008 U+038E..03A1, U+03A3..052F ...)
5009 \p{Present_In: V9_0} \p{Present_In=9.0} (Perl extension)
5010 (267_819)
5011 T \p{Present_In: 10.0} Code point's usage introduced in version
5012 10.0 or earlier (Short: \p{In=10.0})
5013 (Perl extension) (276_337: U+0000..0377,
5014 U+037A..037F, U+0384..038A, U+038C,
5015 U+038E..03A1, U+03A3..052F ...)
5016 \p{Present_In: V10_0} \p{Present_In=10.0} (Perl extension)
5017 (276_337)
5018 T \p{Present_In: 11.0} Code point's usage introduced in version
5019 11.0 or earlier (Short: \p{In=11.0})
5020 (Perl extension) (277_021: U+0000..0377,
5021 U+037A..037F, U+0384..038A, U+038C,
5022 U+038E..03A1, U+03A3..052F ...)
5023 \p{Present_In: V11_0} \p{Present_In=11.0} (Perl extension)
5024 (277_021)
5025 T \p{Present_In: 12.0} Code point's usage introduced in version
5026 12.0 or earlier (Short: \p{In=12.0})
5027 (Perl extension) (277_575: U+0000..0377,
5028 U+037A..037F, U+0384..038A, U+038C,
5029 U+038E..03A1, U+03A3..052F ...)
5030 \p{Present_In: V12_0} \p{Present_In=12.0} (Perl extension)
5031 (277_575)
5032 T \p{Present_In: 12.1} Code point's usage introduced in version
5033 12.1 or earlier (Short: \p{In=12.1})
5034 (Perl extension) (277_576: U+0000..0377,
5035 U+037A..037F, U+0384..038A, U+038C,
5036 U+038E..03A1, U+03A3..052F ...)
5037 \p{Present_In: V12_1} \p{Present_In=12.1} (Perl extension)
5038 (277_576)
5039 T \p{Present_In: 13.0} Code point's usage introduced in version
5040 13.0 or earlier (Short: \p{In=13.0})
5041 (Perl extension) (283_506: U+0000..0377,
5042 U+037A..037F, U+0384..038A, U+038C,
5043 U+038E..03A1, U+03A3..052F ...)
5044 \p{Present_In: V13_0} \p{Present_In=13.0} (Perl extension)
5045 (283_506)
5046 T \p{Present_In: 14.0} Code point's usage introduced in version
5047 14.0 or earlier (Short: \p{In=14.0})
5048 (Perl extension) (284_344: U+0000..0377,
5049 U+037A..037F, U+0384..038A, U+038C,
5050 U+038E..03A1, U+03A3..052F ...)
5051 \p{Present_In: V14_0} \p{Present_In=14.0} (Perl extension)
5052 (284_344)
5053 T \p{Present_In: 15.0} Code point's usage introduced in version
5054 15.0 or earlier (Short: \p{In=15.0})
5055 (Perl extension) (288_833: U+0000..0377,
5056 U+037A..037F, U+0384..038A, U+038C,
5057 U+038E..03A1, U+03A3..052F ...)
5058 \p{Present_In: V15_0} \p{Present_In=15.0} (Perl extension)
5059 (288_833)
5060 \p{Present_In: NA} \p{Present_In=Unassigned} (= \p{Age=
5061 Unassigned}) (Perl extension) (825_279
5062 plus all above-Unicode code points)
5063 \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=NA})
5064 (Perl extension) (825_279 plus all
5065 above-Unicode code points)
5066 \p{Print} \p{XPosixPrint} (286_652)
5067 \p{Private_Use} \p{General_Category=Private_Use} (Short:
5068 \p{Co}; NOT \p{Private_Use_Area})
5069 (137_468)
5070 X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
5071 \p{InPUA}) (6400)
5072 \p{Prti} \p{Inscriptional_Parthian} (=
5073 \p{Script_Extensions=
5074 Inscriptional_Parthian}) (NOT \p{Block=
5075 Inscriptional_Parthian}) (30)
5076 \p{Ps} \p{Open_Punctuation} (=
5077 \p{General_Category=Open_Punctuation})
5078 (79)
5079 \p{Psalter_Pahlavi} \p{Script_Extensions=Psalter_Pahlavi}
5080 (Short: \p{Phlp}; NOT \p{Block=
5081 Psalter_Pahlavi}) (30)
5082 X \p{PUA} \p{Private_Use_Area} (= \p{Block=
5083 Private_Use_Area}) (6400)
5084 \p{Punct} \p{General_Category=Punctuation} (Short:
5085 \p{P}; NOT \p{General_Punctuation}) (842)
5086 \p{Punctuation} \p{Punct} (= \p{General_Category=
5087 Punctuation}) (NOT
5088 \p{General_Punctuation}) (842)
5089 \p{Qaac} \p{Coptic} (= \p{Script_Extensions=
5090 Coptic}) (NOT \p{Block=Coptic}) (165)
5091 \p{Qaai} \p{Inherited} (= \p{Script_Extensions=
5092 Inherited}) (586)
5093 \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
5094 Y}) (30)
5095 \p{QMark: *} \p{Quotation_Mark: *}
5096 \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
5097 (30)
5098 \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_082
5099 plus all above-Unicode code points:
5100 [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=>?
5101 \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xaa\xac-
5102 \xba\xbc-\xff], U+0100..2017,
5103 U+2020..2038, U+203B..2E41,
5104 U+2E43..300B, U+3010..301C ...)
5105 \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30: [\"
5106 \'\xab\xbb], U+2018..201F, U+2039..203A,
5107 U+2E42, U+300C..300F, U+301D..301F ...)
5108 \p{Radical} \p{Radical=Y} (329)
5109 \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all
5110 above-Unicode code points: U+0000..2E7F,
5111 U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
5112 \p{Radical: Y*} (Single: \p{Radical}) (329: U+2E80..2E99,
5113 U+2E9B..2EF3, U+2F00..2FD5)
5114 \p{Regional_Indicator} \p{Regional_Indicator=Y} (Short: \p{RI})
5115 (26)
5116 \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
5117 plus all above-Unicode code points:
5118 U+0000..1F1E5, U+1F200..infinity)
5119 \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
5120 U+1F1E6..1F1FF)
5121 \p{Rejang} \p{Script_Extensions=Rejang} (Short:
5122 \p{Rjng}; NOT \p{Block=Rejang}) (37)
5123 \p{RI} \p{Regional_Indicator} (=
5124 \p{Regional_Indicator=Y}) (26)
5125 \p{RI: *} \p{Regional_Indicator: *}
5126 \p{Rjng} \p{Rejang} (= \p{Script_Extensions=
5127 Rejang}) (NOT \p{Block=Rejang}) (37)
5128 \p{Rohg} \p{Hanifi_Rohingya} (=
5129 \p{Script_Extensions=Hanifi_Rohingya})
5130 (NOT \p{Block=Hanifi_Rohingya}) (55)
5131 X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
5132 Rumi_Numeral_Symbols}) (32)
5133 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
5134 \p{InRumi}) (32)
5135 \p{Runic} \p{Script_Extensions=Runic} (Short:
5136 \p{Runr}; NOT \p{Block=Runic}) (86)
5137 \p{Runr} \p{Runic} (= \p{Script_Extensions=Runic})
5138 (NOT \p{Block=Runic}) (86)
5139 \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol})
5140 (7770)
5141 \p{Samaritan} \p{Script_Extensions=Samaritan} (Short:
5142 \p{Samr}; NOT \p{Block=Samaritan}) (61)
5143 \p{Samr} \p{Samaritan} (= \p{Script_Extensions=
5144 Samaritan}) (NOT \p{Block=Samaritan})
5145 (61)
5146 \p{Sarb} \p{Old_South_Arabian} (=
5147 \p{Script_Extensions=Old_South_Arabian})
5148 (32)
5149 \p{Saur} \p{Saurashtra} (= \p{Script_Extensions=
5150 Saurashtra}) (NOT \p{Block=Saurashtra})
5151 (82)
5152 \p{Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
5153 \p{Saur}; NOT \p{Block=Saurashtra}) (82)
5154 \p{SB: *} \p{Sentence_Break: *}
5155 \p{Sc} \p{Currency_Symbol} (=
5156 \p{General_Category=Currency_Symbol})
5157 (63)
5158 \p{Sc: *} \p{Script: *}
5159 \p{Script: Adlam} (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B,
5160 U+1E950..1E959, U+1E95E..1E95F)
5161 \p{Script: Adlm} \p{Script=Adlam} (88)
5162 \p{Script: Aghb} \p{Script=Caucasian_Albanian} (=
5163 \p{Script_Extensions=
5164 Caucasian_Albanian}) (53)
5165 \p{Script: Ahom} \p{Script_Extensions=Ahom} (Short: \p{Sc=
5166 Ahom}, \p{Ahom}) (65)
5167 \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
5168 Anatolian_Hieroglyphs} (Short: \p{Sc=
5169 Hluw}, \p{Hluw}) (583)
5170 \p{Script: Arab} \p{Script=Arabic} (1368)
5171 \p{Script: Arabic} (Short: \p{Sc=Arab}) (1368: U+0600..0604,
5172 U+0606..060B, U+060D..061A,
5173 U+061C..061E, U+0620..063F, U+0641..064A
5174 ...)
5175 \p{Script: Armenian} \p{Script_Extensions=Armenian} (Short:
5176 \p{Sc=Armn}, \p{Armn}) (96)
5177 \p{Script: Armi} \p{Script=Imperial_Aramaic} (=
5178 \p{Script_Extensions=Imperial_Aramaic})
5179 (31)
5180 \p{Script: Armn} \p{Script=Armenian} (=
5181 \p{Script_Extensions=Armenian}) (96)
5182 \p{Script: Avestan} \p{Script_Extensions=Avestan} (Short:
5183 \p{Sc=Avst}, \p{Avst}) (61)
5184 \p{Script: Avst} \p{Script=Avestan} (=
5185 \p{Script_Extensions=Avestan}) (61)
5186 \p{Script: Bali} \p{Script=Balinese} (=
5187 \p{Script_Extensions=Balinese}) (124)
5188 \p{Script: Balinese} \p{Script_Extensions=Balinese} (Short:
5189 \p{Sc=Bali}, \p{Bali}) (124)
5190 \p{Script: Bamu} \p{Script=Bamum} (= \p{Script_Extensions=
5191 Bamum}) (657)
5192 \p{Script: Bamum} \p{Script_Extensions=Bamum} (Short: \p{Sc=
5193 Bamu}, \p{Bamu}) (657)
5194 \p{Script: Bass} \p{Script=Bassa_Vah} (=
5195 \p{Script_Extensions=Bassa_Vah}) (36)
5196 \p{Script: Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
5197 \p{Sc=Bass}, \p{Bass}) (36)
5198 \p{Script: Batak} \p{Script_Extensions=Batak} (Short: \p{Sc=
5199 Batk}, \p{Batk}) (56)
5200 \p{Script: Batk} \p{Script=Batak} (= \p{Script_Extensions=
5201 Batak}) (56)
5202 \p{Script: Beng} \p{Script=Bengali} (96)
5203 \p{Script: Bengali} (Short: \p{Sc=Beng}) (96: U+0980..0983,
5204 U+0985..098C, U+098F..0990,
5205 U+0993..09A8, U+09AA..09B0, U+09B2 ...)
5206 \p{Script: Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
5207 \p{Sc=Bhks}, \p{Bhks}) (97)
5208 \p{Script: Bhks} \p{Script=Bhaiksuki} (=
5209 \p{Script_Extensions=Bhaiksuki}) (97)
5210 \p{Script: Bopo} \p{Script=Bopomofo} (77)
5211 \p{Script: Bopomofo} (Short: \p{Sc=Bopo}) (77: U+02EA..02EB,
5212 U+3105..312F, U+31A0..31BF)
5213 \p{Script: Brah} \p{Script=Brahmi} (= \p{Script_Extensions=
5214 Brahmi}) (115)
5215 \p{Script: Brahmi} \p{Script_Extensions=Brahmi} (Short:
5216 \p{Sc=Brah}, \p{Brah}) (115)
5217 \p{Script: Brai} \p{Script=Braille} (=
5218 \p{Script_Extensions=Braille}) (256)
5219 \p{Script: Braille} \p{Script_Extensions=Braille} (Short:
5220 \p{Sc=Brai}, \p{Brai}) (256)
5221 \p{Script: Bugi} \p{Script=Buginese} (30)
5222 \p{Script: Buginese} (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
5223 U+1A1E..1A1F)
5224 \p{Script: Buhd} \p{Script=Buhid} (20)
5225 \p{Script: Buhid} (Short: \p{Sc=Buhd}) (20: U+1740..1753)
5226 \p{Script: Cakm} \p{Script=Chakma} (71)
5227 \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
5228 Canadian_Aboriginal} (Short: \p{Sc=
5229 Cans}, \p{Cans}) (726)
5230 \p{Script: Cans} \p{Script=Canadian_Aboriginal} (=
5231 \p{Script_Extensions=
5232 Canadian_Aboriginal}) (726)
5233 \p{Script: Cari} \p{Script=Carian} (= \p{Script_Extensions=
5234 Carian}) (49)
5235 \p{Script: Carian} \p{Script_Extensions=Carian} (Short:
5236 \p{Sc=Cari}, \p{Cari}) (49)
5237 \p{Script: Caucasian_Albanian} \p{Script_Extensions=
5238 Caucasian_Albanian} (Short: \p{Sc=Aghb},
5239 \p{Aghb}) (53)
5240 \p{Script: Chakma} (Short: \p{Sc=Cakm}) (71: U+11100..11134,
5241 U+11136..11147)
5242 \p{Script: Cham} \p{Script_Extensions=Cham} (Short: \p{Sc=
5243 Cham}, \p{Cham}) (83)
5244 \p{Script: Cher} \p{Script=Cherokee} (=
5245 \p{Script_Extensions=Cherokee}) (172)
5246 \p{Script: Cherokee} \p{Script_Extensions=Cherokee} (Short:
5247 \p{Sc=Cher}, \p{Cher}) (172)
5248 \p{Script: Chorasmian} \p{Script_Extensions=Chorasmian} (Short:
5249 \p{Sc=Chrs}, \p{Chrs}) (28)
5250 \p{Script: Chrs} \p{Script=Chorasmian} (=
5251 \p{Script_Extensions=Chorasmian}) (28)
5252 \p{Script: Common} (Short: \p{Sc=Zyyy}) (8301: [\x00-\x20!
5253 \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
5254 \^_`\{\|\}~\x7f-\xa9\xab-\xb9\xbb-\xbf
5255 \xd7\xf7], U+02B9..02DF, U+02E5..02E9,
5256 U+02EC..02FF, U+0374, U+037E ...)
5257 \p{Script: Copt} \p{Script=Coptic} (137)
5258 \p{Script: Coptic} (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
5259 U+2C80..2CF3, U+2CF9..2CFF)
5260 \p{Script: Cpmn} \p{Script=Cypro_Minoan} (99)
5261 \p{Script: Cprt} \p{Script=Cypriot} (55)
5262 \p{Script: Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
5263 \p{Sc=Xsux}, \p{Xsux}) (1234)
5264 \p{Script: Cypriot} (Short: \p{Sc=Cprt}) (55: U+10800..10805,
5265 U+10808, U+1080A..10835, U+10837..10838,
5266 U+1083C, U+1083F)
5267 \p{Script: Cypro_Minoan} (Short: \p{Sc=Cpmn}) (99: U+12F90..12FF2)
5268 \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}) (506: U+0400..0484,
5269 U+0487..052F, U+1C80..1C88, U+1D2B,
5270 U+1D78, U+2DE0..2DFF ...)
5271 \p{Script: Cyrl} \p{Script=Cyrillic} (506)
5272 \p{Script: Deseret} \p{Script_Extensions=Deseret} (Short:
5273 \p{Sc=Dsrt}, \p{Dsrt}) (80)
5274 \p{Script: Deva} \p{Script=Devanagari} (164)
5275 \p{Script: Devanagari} (Short: \p{Sc=Deva}) (164: U+0900..0950,
5276 U+0955..0963, U+0966..097F,
5277 U+A8E0..A8FF, U+11B00..11B09)
5278 \p{Script: Diak} \p{Script=Dives_Akuru} (=
5279 \p{Script_Extensions=Dives_Akuru}) (72)
5280 \p{Script: Dives_Akuru} \p{Script_Extensions=Dives_Akuru} (Short:
5281 \p{Sc=Diak}, \p{Diak}) (72)
5282 \p{Script: Dogr} \p{Script=Dogra} (60)
5283 \p{Script: Dogra} (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
5284 \p{Script: Dsrt} \p{Script=Deseret} (=
5285 \p{Script_Extensions=Deseret}) (80)
5286 \p{Script: Dupl} \p{Script=Duployan} (143)
5287 \p{Script: Duployan} (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
5288 U+1BC70..1BC7C, U+1BC80..1BC88,
5289 U+1BC90..1BC99, U+1BC9C..1BC9F)
5290 \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (=
5291 \p{Script_Extensions=
5292 Egyptian_Hieroglyphs}) (1110)
5293 \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
5294 Egyptian_Hieroglyphs} (Short: \p{Sc=
5295 Egyp}, \p{Egyp}) (1110)
5296 \p{Script: Elba} \p{Script=Elbasan} (=
5297 \p{Script_Extensions=Elbasan}) (40)
5298 \p{Script: Elbasan} \p{Script_Extensions=Elbasan} (Short:
5299 \p{Sc=Elba}, \p{Elba}) (40)
5300 \p{Script: Elym} \p{Script=Elymaic} (=
5301 \p{Script_Extensions=Elymaic}) (23)
5302 \p{Script: Elymaic} \p{Script_Extensions=Elymaic} (Short:
5303 \p{Sc=Elym}, \p{Elym}) (23)
5304 \p{Script: Ethi} \p{Script=Ethiopic} (=
5305 \p{Script_Extensions=Ethiopic}) (523)
5306 \p{Script: Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
5307 \p{Sc=Ethi}, \p{Ethi}) (523)
5308 \p{Script: Geor} \p{Script=Georgian} (173)
5309 \p{Script: Georgian} (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
5310 U+10C7, U+10CD, U+10D0..10FA,
5311 U+10FC..10FF, U+1C90..1CBA ...)
5312 \p{Script: Glag} \p{Script=Glagolitic} (134)
5313 \p{Script: Glagolitic} (Short: \p{Sc=Glag}) (134: U+2C00..2C5F,
5314 U+1E000..1E006, U+1E008..1E018,
5315 U+1E01B..1E021, U+1E023..1E024,
5316 U+1E026..1E02A)
5317 \p{Script: Gong} \p{Script=Gunjala_Gondi} (63)
5318 \p{Script: Gonm} \p{Script=Masaram_Gondi} (75)
5319 \p{Script: Goth} \p{Script=Gothic} (= \p{Script_Extensions=
5320 Gothic}) (27)
5321 \p{Script: Gothic} \p{Script_Extensions=Gothic} (Short:
5322 \p{Sc=Goth}, \p{Goth}) (27)
5323 \p{Script: Gran} \p{Script=Grantha} (85)
5324 \p{Script: Grantha} (Short: \p{Sc=Gran}) (85: U+11300..11303,
5325 U+11305..1130C, U+1130F..11310,
5326 U+11313..11328, U+1132A..11330,
5327 U+11332..11333 ...)
5328 \p{Script: Greek} (Short: \p{Sc=Grek}) (518: U+0370..0373,
5329 U+0375..0377, U+037A..037D, U+037F,
5330 U+0384, U+0386 ...)
5331 \p{Script: Grek} \p{Script=Greek} (518)
5332 \p{Script: Gujarati} (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
5333 U+0A85..0A8D, U+0A8F..0A91,
5334 U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
5335 ...)
5336 \p{Script: Gujr} \p{Script=Gujarati} (91)
5337 \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
5338 U+11D60..11D65, U+11D67..11D68,
5339 U+11D6A..11D8E, U+11D90..11D91,
5340 U+11D93..11D98, U+11DA0..11DA9)
5341 \p{Script: Gurmukhi} (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
5342 U+0A05..0A0A, U+0A0F..0A10,
5343 U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
5344 ...)
5345 \p{Script: Guru} \p{Script=Gurmukhi} (80)
5346 \p{Script: Han} (Short: \p{Sc=Han}) (98_408: U+2E80..2E99,
5347 U+2E9B..2EF3, U+2F00..2FD5, U+3005,
5348 U+3007, U+3021..3029 ...)
5349 \p{Script: Hang} \p{Script=Hangul} (11_739)
5350 \p{Script: Hangul} (Short: \p{Sc=Hang}) (11_739:
5351 U+1100..11FF, U+302E..302F,
5352 U+3131..318E, U+3200..321E,
5353 U+3260..327E, U+A960..A97C ...)
5354 \p{Script: Hani} \p{Script=Han} (98_408)
5355 \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
5356 U+10D00..10D27, U+10D30..10D39)
5357 \p{Script: Hano} \p{Script=Hanunoo} (21)
5358 \p{Script: Hanunoo} (Short: \p{Sc=Hano}) (21: U+1720..1734)
5359 \p{Script: Hatr} \p{Script=Hatran} (= \p{Script_Extensions=
5360 Hatran}) (26)
5361 \p{Script: Hatran} \p{Script_Extensions=Hatran} (Short:
5362 \p{Sc=Hatr}, \p{Hatr}) (26)
5363 \p{Script: Hebr} \p{Script=Hebrew} (= \p{Script_Extensions=
5364 Hebrew}) (134)
5365 \p{Script: Hebrew} \p{Script_Extensions=Hebrew} (Short:
5366 \p{Sc=Hebr}, \p{Hebr}) (134)
5367 \p{Script: Hira} \p{Script=Hiragana} (381)
5368 \p{Script: Hiragana} (Short: \p{Sc=Hira}) (381: U+3041..3096,
5369 U+309D..309F, U+1B001..1B11F, U+1B132,
5370 U+1B150..1B152, U+1F200)
5371 \p{Script: Hluw} \p{Script=Anatolian_Hieroglyphs} (=
5372 \p{Script_Extensions=
5373 Anatolian_Hieroglyphs}) (583)
5374 \p{Script: Hmng} \p{Script=Pahawh_Hmong} (=
5375 \p{Script_Extensions=Pahawh_Hmong}) (127)
5376 \p{Script: Hmnp} \p{Script=Nyiakeng_Puachue_Hmong} (=
5377 \p{Script_Extensions=
5378 Nyiakeng_Puachue_Hmong}) (71)
5379 \p{Script: Hung} \p{Script=Old_Hungarian} (=
5380 \p{Script_Extensions=Old_Hungarian})
5381 (108)
5382 \p{Script: Imperial_Aramaic} \p{Script_Extensions=
5383 Imperial_Aramaic} (Short: \p{Sc=Armi},
5384 \p{Armi}) (31)
5385 \p{Script: Inherited} (Short: \p{Sc=Zinh}) (657: U+0300..036F,
5386 U+0485..0486, U+064B..0655, U+0670,
5387 U+0951..0954, U+1AB0..1ACE ...)
5388 \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
5389 Inscriptional_Pahlavi} (Short: \p{Sc=
5390 Phli}, \p{Phli}) (27)
5391 \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
5392 Inscriptional_Parthian} (Short: \p{Sc=
5393 Prti}, \p{Prti}) (30)
5394 \p{Script: Ital} \p{Script=Old_Italic} (=
5395 \p{Script_Extensions=Old_Italic}) (39)
5396 \p{Script: Java} \p{Script=Javanese} (90)
5397 \p{Script: Javanese} (Short: \p{Sc=Java}) (90: U+A980..A9CD,
5398 U+A9D0..A9D9, U+A9DE..A9DF)
5399 \p{Script: Kaithi} (Short: \p{Sc=Kthi}) (68: U+11080..110C2,
5400 U+110CD)
5401 \p{Script: Kali} \p{Script=Kayah_Li} (47)
5402 \p{Script: Kana} \p{Script=Katakana} (321)
5403 \p{Script: Kannada} (Short: \p{Sc=Knda}) (91: U+0C80..0C8C,
5404 U+0C8E..0C90, U+0C92..0CA8,
5405 U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
5406 ...)
5407 \p{Script: Katakana} (Short: \p{Sc=Kana}) (321: U+30A1..30FA,
5408 U+30FD..30FF, U+31F0..31FF,
5409 U+32D0..32FE, U+3300..3357, U+FF66..FF6F
5410 ...)
5411 \p{Script: Kawi} \p{Script_Extensions=Kawi} (Short: \p{Sc=
5412 Kawi}, \p{Kawi}) (86)
5413 \p{Script: Kayah_Li} (Short: \p{Sc=Kali}) (47: U+A900..A92D,
5414 U+A92F)
5415 \p{Script: Khar} \p{Script=Kharoshthi} (=
5416 \p{Script_Extensions=Kharoshthi}) (68)
5417 \p{Script: Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
5418 \p{Sc=Khar}, \p{Khar}) (68)
5419 \p{Script: Khitan_Small_Script} \p{Script_Extensions=
5420 Khitan_Small_Script} (Short: \p{Sc=
5421 Kits}, \p{Kits}) (471)
5422 \p{Script: Khmer} \p{Script_Extensions=Khmer} (Short: \p{Sc=
5423 Khmr}, \p{Khmr}) (146)
5424 \p{Script: Khmr} \p{Script=Khmer} (= \p{Script_Extensions=
5425 Khmer}) (146)
5426 \p{Script: Khoj} \p{Script=Khojki} (65)
5427 \p{Script: Khojki} (Short: \p{Sc=Khoj}) (65: U+11200..11211,
5428 U+11213..11241)
5429 \p{Script: Khudawadi} (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
5430 U+112F0..112F9)
5431 \p{Script: Kits} \p{Script=Khitan_Small_Script} (=
5432 \p{Script_Extensions=
5433 Khitan_Small_Script}) (471)
5434 \p{Script: Knda} \p{Script=Kannada} (91)
5435 \p{Script: Kthi} \p{Script=Kaithi} (68)
5436 \p{Script: Lana} \p{Script=Tai_Tham} (=
5437 \p{Script_Extensions=Tai_Tham}) (127)
5438 \p{Script: Lao} \p{Script_Extensions=Lao} (Short: \p{Sc=
5439 Lao}, \p{Lao}) (83)
5440 \p{Script: Laoo} \p{Script=Lao} (= \p{Script_Extensions=
5441 Lao}) (83)
5442 \p{Script: Latin} (Short: \p{Sc=Latn}) (1481: [A-Za-z\xaa
5443 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
5444 U+0100..02B8, U+02E0..02E4,
5445 U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
5446 ...)
5447 \p{Script: Latn} \p{Script=Latin} (1481)
5448 \p{Script: Lepc} \p{Script=Lepcha} (= \p{Script_Extensions=
5449 Lepcha}) (74)
5450 \p{Script: Lepcha} \p{Script_Extensions=Lepcha} (Short:
5451 \p{Sc=Lepc}, \p{Lepc}) (74)
5452 \p{Script: Limb} \p{Script=Limbu} (68)
5453 \p{Script: Limbu} (Short: \p{Sc=Limb}) (68: U+1900..191E,
5454 U+1920..192B, U+1930..193B, U+1940,
5455 U+1944..194F)
5456 \p{Script: Lina} \p{Script=Linear_A} (341)
5457 \p{Script: Linb} \p{Script=Linear_B} (211)
5458 \p{Script: Linear_A} (Short: \p{Sc=Lina}) (341: U+10600..10736,
5459 U+10740..10755, U+10760..10767)
5460 \p{Script: Linear_B} (Short: \p{Sc=Linb}) (211: U+10000..1000B,
5461 U+1000D..10026, U+10028..1003A,
5462 U+1003C..1003D, U+1003F..1004D,
5463 U+10050..1005D ...)
5464 \p{Script: Lisu} \p{Script_Extensions=Lisu} (Short: \p{Sc=
5465 Lisu}, \p{Lisu}) (49)
5466 \p{Script: Lyci} \p{Script=Lycian} (= \p{Script_Extensions=
5467 Lycian}) (29)
5468 \p{Script: Lycian} \p{Script_Extensions=Lycian} (Short:
5469 \p{Sc=Lyci}, \p{Lyci}) (29)
5470 \p{Script: Lydi} \p{Script=Lydian} (= \p{Script_Extensions=
5471 Lydian}) (27)
5472 \p{Script: Lydian} \p{Script_Extensions=Lydian} (Short:
5473 \p{Sc=Lydi}, \p{Lydi}) (27)
5474 \p{Script: Mahajani} (Short: \p{Sc=Mahj}) (39: U+11150..11176)
5475 \p{Script: Mahj} \p{Script=Mahajani} (39)
5476 \p{Script: Maka} \p{Script=Makasar} (=
5477 \p{Script_Extensions=Makasar}) (25)
5478 \p{Script: Makasar} \p{Script_Extensions=Makasar} (Short:
5479 \p{Sc=Maka}, \p{Maka}) (25)
5480 \p{Script: Malayalam} (Short: \p{Sc=Mlym}) (118: U+0D00..0D0C,
5481 U+0D0E..0D10, U+0D12..0D44,
5482 U+0D46..0D48, U+0D4A..0D4F, U+0D54..0D63
5483 ...)
5484 \p{Script: Mand} \p{Script=Mandaic} (29)
5485 \p{Script: Mandaic} (Short: \p{Sc=Mand}) (29: U+0840..085B,
5486 U+085E)
5487 \p{Script: Mani} \p{Script=Manichaean} (51)
5488 \p{Script: Manichaean} (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
5489 U+10AEB..10AF6)
5490 \p{Script: Marc} \p{Script=Marchen} (=
5491 \p{Script_Extensions=Marchen}) (68)
5492 \p{Script: Marchen} \p{Script_Extensions=Marchen} (Short:
5493 \p{Sc=Marc}, \p{Marc}) (68)
5494 \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
5495 U+11D00..11D06, U+11D08..11D09,
5496 U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
5497 U+11D3F..11D47 ...)
5498 \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
5499 \p{Sc=Medf}, \p{Medf}) (91)
5500 \p{Script: Medf} \p{Script=Medefaidrin} (=
5501 \p{Script_Extensions=Medefaidrin}) (91)
5502 \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
5503 (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
5504 \p{Script: Mend} \p{Script=Mende_Kikakui} (=
5505 \p{Script_Extensions=Mende_Kikakui})
5506 (213)
5507 \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
5508 (Short: \p{Sc=Mend}, \p{Mend}) (213)
5509 \p{Script: Merc} \p{Script=Meroitic_Cursive} (=
5510 \p{Script_Extensions=Meroitic_Cursive})
5511 (90)
5512 \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (=
5513 \p{Script_Extensions=
5514 Meroitic_Hieroglyphs}) (32)
5515 \p{Script: Meroitic_Cursive} \p{Script_Extensions=
5516 Meroitic_Cursive} (Short: \p{Sc=Merc},
5517 \p{Merc}) (90)
5518 \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
5519 Meroitic_Hieroglyphs} (Short: \p{Sc=
5520 Mero}, \p{Mero}) (32)
5521 \p{Script: Miao} \p{Script_Extensions=Miao} (Short: \p{Sc=
5522 Miao}, \p{Miao}) (149)
5523 \p{Script: Mlym} \p{Script=Malayalam} (118)
5524 \p{Script: Modi} (Short: \p{Sc=Modi}) (79: U+11600..11644,
5525 U+11650..11659)
5526 \p{Script: Mong} \p{Script=Mongolian} (168)
5527 \p{Script: Mongolian} (Short: \p{Sc=Mong}) (168: U+1800..1801,
5528 U+1804, U+1806..1819, U+1820..1878,
5529 U+1880..18AA, U+11660..1166C)
5530 \p{Script: Mro} \p{Script_Extensions=Mro} (Short: \p{Sc=
5531 Mro}, \p{Mro}) (43)
5532 \p{Script: Mroo} \p{Script=Mro} (= \p{Script_Extensions=
5533 Mro}) (43)
5534 \p{Script: Mtei} \p{Script=Meetei_Mayek} (=
5535 \p{Script_Extensions=Meetei_Mayek}) (79)
5536 \p{Script: Mult} \p{Script=Multani} (38)
5537 \p{Script: Multani} (Short: \p{Sc=Mult}) (38: U+11280..11286,
5538 U+11288, U+1128A..1128D, U+1128F..1129D,
5539 U+1129F..112A9)
5540 \p{Script: Myanmar} (Short: \p{Sc=Mymr}) (223: U+1000..109F,
5541 U+A9E0..A9FE, U+AA60..AA7F)
5542 \p{Script: Mymr} \p{Script=Myanmar} (223)
5543 \p{Script: Nabataean} \p{Script_Extensions=Nabataean} (Short:
5544 \p{Sc=Nbat}, \p{Nbat}) (40)
5545 \p{Script: Nag_Mundari} \p{Script_Extensions=Nag_Mundari} (Short:
5546 \p{Sc=Nagm}, \p{Nagm}) (42)
5547 \p{Script: Nagm} \p{Script=Nag_Mundari} (=
5548 \p{Script_Extensions=Nag_Mundari}) (42)
5549 \p{Script: Nand} \p{Script=Nandinagari} (65)
5550 \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7,
5551 U+119AA..119D7, U+119DA..119E4)
5552 \p{Script: Narb} \p{Script=Old_North_Arabian} (=
5553 \p{Script_Extensions=Old_North_Arabian})
5554 (32)
5555 \p{Script: Nbat} \p{Script=Nabataean} (=
5556 \p{Script_Extensions=Nabataean}) (40)
5557 \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
5558 \p{Sc=Talu}, \p{Talu}) (83)
5559 \p{Script: Newa} \p{Script_Extensions=Newa} (Short: \p{Sc=
5560 Newa}, \p{Newa}) (97)
5561 \p{Script: Nko} (Short: \p{Sc=Nko}) (62: U+07C0..07FA,
5562 U+07FD..07FF)
5563 \p{Script: Nkoo} \p{Script=Nko} (62)
5564 \p{Script: Nshu} \p{Script=Nushu} (= \p{Script_Extensions=
5565 Nushu}) (397)
5566 \p{Script: Nushu} \p{Script_Extensions=Nushu} (Short: \p{Sc=
5567 Nshu}, \p{Nshu}) (397)
5568 \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
5569 Nyiakeng_Puachue_Hmong} (Short: \p{Sc=
5570 Hmnp}, \p{Hmnp}) (71)
5571 \p{Script: Ogam} \p{Script=Ogham} (= \p{Script_Extensions=
5572 Ogham}) (29)
5573 \p{Script: Ogham} \p{Script_Extensions=Ogham} (Short: \p{Sc=
5574 Ogam}, \p{Ogam}) (29)
5575 \p{Script: Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
5576 \p{Sc=Olck}, \p{Olck}) (48)
5577 \p{Script: Olck} \p{Script=Ol_Chiki} (=
5578 \p{Script_Extensions=Ol_Chiki}) (48)
5579 \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
5580 (Short: \p{Sc=Hung}, \p{Hung}) (108)
5581 \p{Script: Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
5582 \p{Sc=Ital}, \p{Ital}) (39)
5583 \p{Script: Old_North_Arabian} \p{Script_Extensions=
5584 Old_North_Arabian} (Short: \p{Sc=Narb},
5585 \p{Narb}) (32)
5586 \p{Script: Old_Permic} (Short: \p{Sc=Perm}) (43: U+10350..1037A)
5587 \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
5588 \p{Sc=Xpeo}, \p{Xpeo}) (50)
5589 \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
5590 \p{Sc=Sogo}, \p{Sogo}) (40)
5591 \p{Script: Old_South_Arabian} \p{Script_Extensions=
5592 Old_South_Arabian} (Short: \p{Sc=Sarb},
5593 \p{Sarb}) (32)
5594 \p{Script: Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
5595 \p{Sc=Orkh}, \p{Orkh}) (73)
5596 \p{Script: Old_Uyghur} (Short: \p{Sc=Ougr}) (26: U+10F70..10F89)
5597 \p{Script: Oriya} (Short: \p{Sc=Orya}) (91: U+0B01..0B03,
5598 U+0B05..0B0C, U+0B0F..0B10,
5599 U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
5600 ...)
5601 \p{Script: Orkh} \p{Script=Old_Turkic} (=
5602 \p{Script_Extensions=Old_Turkic}) (73)
5603 \p{Script: Orya} \p{Script=Oriya} (91)
5604 \p{Script: Osage} \p{Script_Extensions=Osage} (Short: \p{Sc=
5605 Osge}, \p{Osge}) (72)
5606 \p{Script: Osge} \p{Script=Osage} (= \p{Script_Extensions=
5607 Osage}) (72)
5608 \p{Script: Osma} \p{Script=Osmanya} (=
5609 \p{Script_Extensions=Osmanya}) (40)
5610 \p{Script: Osmanya} \p{Script_Extensions=Osmanya} (Short:
5611 \p{Sc=Osma}, \p{Osma}) (40)
5612 \p{Script: Ougr} \p{Script=Old_Uyghur} (26)
5613 \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
5614 (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
5615 \p{Script: Palm} \p{Script=Palmyrene} (=
5616 \p{Script_Extensions=Palmyrene}) (32)
5617 \p{Script: Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
5618 \p{Sc=Palm}, \p{Palm}) (32)
5619 \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
5620 \p{Sc=Pauc}, \p{Pauc}) (57)
5621 \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (=
5622 \p{Script_Extensions=Pau_Cin_Hau}) (57)
5623 \p{Script: Perm} \p{Script=Old_Permic} (43)
5624 \p{Script: Phag} \p{Script=Phags_Pa} (56)
5625 \p{Script: Phags_Pa} (Short: \p{Sc=Phag}) (56: U+A840..A877)
5626 \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (=
5627 \p{Script_Extensions=
5628 Inscriptional_Pahlavi}) (27)
5629 \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29)
5630 \p{Script: Phnx} \p{Script=Phoenician} (=
5631 \p{Script_Extensions=Phoenician}) (29)
5632 \p{Script: Phoenician} \p{Script_Extensions=Phoenician} (Short:
5633 \p{Sc=Phnx}, \p{Phnx}) (29)
5634 \p{Script: Plrd} \p{Script=Miao} (= \p{Script_Extensions=
5635 Miao}) (149)
5636 \p{Script: Prti} \p{Script=Inscriptional_Parthian} (=
5637 \p{Script_Extensions=
5638 Inscriptional_Parthian}) (30)
5639 \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
5640 U+10B80..10B91, U+10B99..10B9C,
5641 U+10BA9..10BAF)
5642 \p{Script: Qaac} \p{Script=Coptic} (137)
5643 \p{Script: Qaai} \p{Script=Inherited} (657)
5644 \p{Script: Rejang} \p{Script_Extensions=Rejang} (Short:
5645 \p{Sc=Rjng}, \p{Rjng}) (37)
5646 \p{Script: Rjng} \p{Script=Rejang} (= \p{Script_Extensions=
5647 Rejang}) (37)
5648 \p{Script: Rohg} \p{Script=Hanifi_Rohingya} (50)
5649 \p{Script: Runic} \p{Script_Extensions=Runic} (Short: \p{Sc=
5650 Runr}, \p{Runr}) (86)
5651 \p{Script: Runr} \p{Script=Runic} (= \p{Script_Extensions=
5652 Runic}) (86)
5653 \p{Script: Samaritan} \p{Script_Extensions=Samaritan} (Short:
5654 \p{Sc=Samr}, \p{Samr}) (61)
5655 \p{Script: Samr} \p{Script=Samaritan} (=
5656 \p{Script_Extensions=Samaritan}) (61)
5657 \p{Script: Sarb} \p{Script=Old_South_Arabian} (=
5658 \p{Script_Extensions=Old_South_Arabian})
5659 (32)
5660 \p{Script: Saur} \p{Script=Saurashtra} (=
5661 \p{Script_Extensions=Saurashtra}) (82)
5662 \p{Script: Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
5663 \p{Sc=Saur}, \p{Saur}) (82)
5664 \p{Script: Sgnw} \p{Script=SignWriting} (=
5665 \p{Script_Extensions=SignWriting}) (672)
5666 \p{Script: Sharada} (Short: \p{Sc=Shrd}) (96: U+11180..111DF)
5667 \p{Script: Shavian} \p{Script_Extensions=Shavian} (Short:
5668 \p{Sc=Shaw}, \p{Shaw}) (48)
5669 \p{Script: Shaw} \p{Script=Shavian} (=
5670 \p{Script_Extensions=Shavian}) (48)
5671 \p{Script: Shrd} \p{Script=Sharada} (96)
5672 \p{Script: Sidd} \p{Script=Siddham} (=
5673 \p{Script_Extensions=Siddham}) (92)
5674 \p{Script: Siddham} \p{Script_Extensions=Siddham} (Short:
5675 \p{Sc=Sidd}, \p{Sidd}) (92)
5676 \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
5677 \p{Sc=Sgnw}, \p{Sgnw}) (672)
5678 \p{Script: Sind} \p{Script=Khudawadi} (69)
5679 \p{Script: Sinh} \p{Script=Sinhala} (111)
5680 \p{Script: Sinhala} (Short: \p{Sc=Sinh}) (111: U+0D81..0D83,
5681 U+0D85..0D96, U+0D9A..0DB1,
5682 U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
5683 \p{Script: Sogd} \p{Script=Sogdian} (42)
5684 \p{Script: Sogdian} (Short: \p{Sc=Sogd}) (42: U+10F30..10F59)
5685 \p{Script: Sogo} \p{Script=Old_Sogdian} (=
5686 \p{Script_Extensions=Old_Sogdian}) (40)
5687 \p{Script: Sora} \p{Script=Sora_Sompeng} (=
5688 \p{Script_Extensions=Sora_Sompeng}) (35)
5689 \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
5690 (Short: \p{Sc=Sora}, \p{Sora}) (35)
5691 \p{Script: Soyo} \p{Script=Soyombo} (=
5692 \p{Script_Extensions=Soyombo}) (83)
5693 \p{Script: Soyombo} \p{Script_Extensions=Soyombo} (Short:
5694 \p{Sc=Soyo}, \p{Soyo}) (83)
5695 \p{Script: Sund} \p{Script=Sundanese} (=
5696 \p{Script_Extensions=Sundanese}) (72)
5697 \p{Script: Sundanese} \p{Script_Extensions=Sundanese} (Short:
5698 \p{Sc=Sund}, \p{Sund}) (72)
5699 \p{Script: Sylo} \p{Script=Syloti_Nagri} (45)
5700 \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (45: U+A800..A82C)
5701 \p{Script: Syrc} \p{Script=Syriac} (88)
5702 \p{Script: Syriac} (Short: \p{Sc=Syrc}) (88: U+0700..070D,
5703 U+070F..074A, U+074D..074F, U+0860..086A)
5704 \p{Script: Tagalog} (Short: \p{Sc=Tglg}) (23: U+1700..1715,
5705 U+171F)
5706 \p{Script: Tagb} \p{Script=Tagbanwa} (18)
5707 \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}) (18: U+1760..176C,
5708 U+176E..1770, U+1772..1773)
5709 \p{Script: Tai_Le} (Short: \p{Sc=Tale}) (35: U+1950..196D,
5710 U+1970..1974)
5711 \p{Script: Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
5712 \p{Sc=Lana}, \p{Lana}) (127)
5713 \p{Script: Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
5714 \p{Sc=Tavt}, \p{Tavt}) (72)
5715 \p{Script: Takr} \p{Script=Takri} (68)
5716 \p{Script: Takri} (Short: \p{Sc=Takr}) (68: U+11680..116B9,
5717 U+116C0..116C9)
5718 \p{Script: Tale} \p{Script=Tai_Le} (35)
5719 \p{Script: Talu} \p{Script=New_Tai_Lue} (=
5720 \p{Script_Extensions=New_Tai_Lue}) (83)
5721 \p{Script: Tamil} (Short: \p{Sc=Taml}) (123: U+0B82..0B83,
5722 U+0B85..0B8A, U+0B8E..0B90,
5723 U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
5724 \p{Script: Taml} \p{Script=Tamil} (123)
5725 \p{Script: Tang} \p{Script=Tangut} (= \p{Script_Extensions=
5726 Tangut}) (6914)
5727 \p{Script: Tangsa} \p{Script_Extensions=Tangsa} (Short:
5728 \p{Sc=Tnsa}, \p{Tnsa}) (89)
5729 \p{Script: Tangut} \p{Script_Extensions=Tangut} (Short:
5730 \p{Sc=Tang}, \p{Tang}) (6914)
5731 \p{Script: Tavt} \p{Script=Tai_Viet} (=
5732 \p{Script_Extensions=Tai_Viet}) (72)
5733 \p{Script: Telu} \p{Script=Telugu} (100)
5734 \p{Script: Telugu} (Short: \p{Sc=Telu}) (100: U+0C00..0C0C,
5735 U+0C0E..0C10, U+0C12..0C28,
5736 U+0C2A..0C39, U+0C3C..0C44, U+0C46..0C48
5737 ...)
5738 \p{Script: Tfng} \p{Script=Tifinagh} (=
5739 \p{Script_Extensions=Tifinagh}) (59)
5740 \p{Script: Tglg} \p{Script=Tagalog} (23)
5741 \p{Script: Thaa} \p{Script=Thaana} (50)
5742 \p{Script: Thaana} (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
5743 \p{Script: Thai} \p{Script_Extensions=Thai} (Short: \p{Sc=
5744 Thai}, \p{Thai}) (86)
5745 \p{Script: Tibetan} \p{Script_Extensions=Tibetan} (Short:
5746 \p{Sc=Tibt}, \p{Tibt}) (207)
5747 \p{Script: Tibt} \p{Script=Tibetan} (=
5748 \p{Script_Extensions=Tibetan}) (207)
5749 \p{Script: Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
5750 \p{Sc=Tfng}, \p{Tfng}) (59)
5751 \p{Script: Tirh} \p{Script=Tirhuta} (82)
5752 \p{Script: Tirhuta} (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
5753 U+114D0..114D9)
5754 \p{Script: Tnsa} \p{Script=Tangsa} (= \p{Script_Extensions=
5755 Tangsa}) (89)
5756 \p{Script: Toto} \p{Script_Extensions=Toto} (Short: \p{Sc=
5757 Toto}, \p{Toto}) (31)
5758 \p{Script: Ugar} \p{Script=Ugaritic} (=
5759 \p{Script_Extensions=Ugaritic}) (31)
5760 \p{Script: Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
5761 \p{Sc=Ugar}, \p{Ugar}) (31)
5762 \p{Script: Unknown} \p{Script_Extensions=Unknown} (Short:
5763 \p{Sc=Zzzz}, \p{Zzzz}) (964_861 plus all
5764 above-Unicode code points)
5765 \p{Script: Vai} \p{Script_Extensions=Vai} (Short: \p{Sc=
5766 Vai}, \p{Vai}) (300)
5767 \p{Script: Vaii} \p{Script=Vai} (= \p{Script_Extensions=
5768 Vai}) (300)
5769 \p{Script: Vith} \p{Script=Vithkuqi} (=
5770 \p{Script_Extensions=Vithkuqi}) (70)
5771 \p{Script: Vithkuqi} \p{Script_Extensions=Vithkuqi} (Short:
5772 \p{Sc=Vith}, \p{Vith}) (70)
5773 \p{Script: Wancho} \p{Script_Extensions=Wancho} (Short:
5774 \p{Sc=Wcho}, \p{Wcho}) (59)
5775 \p{Script: Wara} \p{Script=Warang_Citi} (=
5776 \p{Script_Extensions=Warang_Citi}) (84)
5777 \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
5778 \p{Sc=Wara}, \p{Wara}) (84)
5779 \p{Script: Wcho} \p{Script=Wancho} (= \p{Script_Extensions=
5780 Wancho}) (59)
5781 \p{Script: Xpeo} \p{Script=Old_Persian} (=
5782 \p{Script_Extensions=Old_Persian}) (50)
5783 \p{Script: Xsux} \p{Script=Cuneiform} (=
5784 \p{Script_Extensions=Cuneiform}) (1234)
5785 \p{Script: Yezi} \p{Script=Yezidi} (47)
5786 \p{Script: Yezidi} (Short: \p{Sc=Yezi}) (47: U+10E80..10EA9,
5787 U+10EAB..10EAD, U+10EB0..10EB1)
5788 \p{Script: Yi} (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
5789 U+A490..A4C6)
5790 \p{Script: Yiii} \p{Script=Yi} (1220)
5791 \p{Script: Zanabazar_Square} \p{Script_Extensions=
5792 Zanabazar_Square} (Short: \p{Sc=Zanb},
5793 \p{Zanb}) (72)
5794 \p{Script: Zanb} \p{Script=Zanabazar_Square} (=
5795 \p{Script_Extensions=Zanabazar_Square})
5796 (72)
5797 \p{Script: Zinh} \p{Script=Inherited} (657)
5798 \p{Script: Zyyy} \p{Script=Common} (8301)
5799 \p{Script: Zzzz} \p{Script=Unknown} (=
5800 \p{Script_Extensions=Unknown}) (964_861
5801 plus all above-Unicode code points)
5802 \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (90:
5803 U+061F, U+0640, U+1E900..1E94B,
5804 U+1E950..1E959, U+1E95E..1E95F)
5805 \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (90)
5806 \p{Script_Extensions: Aghb} \p{Script_Extensions=
5807 Caucasian_Albanian} (53)
5808 \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (65:
5809 U+11700..1171A, U+1171D..1172B,
5810 U+11730..11746)
5811 \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
5812 \p{Hluw}) (583: U+14400..14646)
5813 \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1414)
5814 \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
5815 (1414: U+0600..0604, U+0606..06DC,
5816 U+06DE..06FF, U+0750..077F,
5817 U+0870..088E, U+0890..0891 ...)
5818 \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
5819 (96: U+0531..0556, U+0559..058A,
5820 U+058D..058F, U+FB13..FB17)
5821 \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
5822 (31)
5823 \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
5824 \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
5825 (61: U+10B00..10B35, U+10B39..10B3F)
5826 \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
5827 \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (124)
5828 \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
5829 (124: U+1B00..1B4C, U+1B50..1B7E)
5830 \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
5831 \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
5832 U+A6A0..A6F7, U+16800..16A38)
5833 \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
5834 \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
5835 (36: U+16AD0..16AED, U+16AF0..16AF5)
5836 \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
5837 U+1BC0..1BF3, U+1BFC..1BFF)
5838 \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
5839 \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
5840 \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
5841 (113: U+0951..0952, U+0964..0965,
5842 U+0980..0983, U+0985..098C,
5843 U+098F..0990, U+0993..09A8 ...)
5844 \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
5845 (97: U+11C00..11C08, U+11C0A..11C36,
5846 U+11C38..11C45, U+11C50..11C6C)
5847 \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
5848 \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (117)
5849 \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
5850 (117: U+02EA..02EB, U+3001..3003,
5851 U+3008..3011, U+3013..301F,
5852 U+302A..302D, U+3030 ...)
5853 \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (115)
5854 \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
5855 (115: U+11000..1104D, U+11052..11075,
5856 U+1107F)
5857 \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
5858 \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
5859 (256: U+2800..28FF)
5860 \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
5861 \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
5862 (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
5863 \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
5864 \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
5865 U+1735..1736, U+1740..1753)
5866 \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (91)
5867 \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
5868 \p{Cans}) (726: U+1400..167F,
5869 U+18B0..18F5, U+11AB0..11ABF)
5870 \p{Script_Extensions: Cans} \p{Script_Extensions=
5871 Canadian_Aboriginal} (726)
5872 \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
5873 \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
5874 U+102A0..102D0)
5875 \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
5876 \p{Aghb}) (53: U+10530..10563, U+1056F)
5877 \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (91:
5878 U+09E6..09EF, U+1040..1049,
5879 U+11100..11134, U+11136..11147)
5880 \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
5881 U+AA00..AA36, U+AA40..AA4D,
5882 U+AA50..AA59, U+AA5C..AA5F)
5883 \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
5884 \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
5885 (172: U+13A0..13F5, U+13F8..13FD,
5886 U+AB70..ABBF)
5887 \p{Script_Extensions: Chorasmian} (Short: \p{Scx=Chrs}, \p{Chrs})
5888 (28: U+10FB0..10FCB)
5889 \p{Script_Extensions: Chrs} \p{Script_Extensions=Chorasmian} (28)
5890 \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
5891 (7873: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
5892 \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xa9
5893 \xab-\xb9\xbb-\xbf\xd7\xf7],
5894 U+02B9..02DF, U+02E5..02E9,
5895 U+02EC..02FF, U+0374, U+037E ...)
5896 \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
5897 \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
5898 (165: U+03E2..03EF, U+2C80..2CF3,
5899 U+2CF9..2CFF, U+102E0..102FB)
5900 \p{Script_Extensions: Cpmn} \p{Script_Extensions=Cypro_Minoan}
5901 (101)
5902 \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
5903 \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
5904 (1234: U+12000..12399, U+12400..1246E,
5905 U+12470..12474, U+12480..12543)
5906 \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
5907 (112: U+10100..10102, U+10107..10133,
5908 U+10137..1013F, U+10800..10805, U+10808,
5909 U+1080A..10835 ...)
5910 \p{Script_Extensions: Cypro_Minoan} (Short: \p{Scx=Cpmn},
5911 \p{Cpmn}) (101: U+10100..10101,
5912 U+12F90..12FF2)
5913 \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
5914 (510: U+0400..052F, U+1C80..1C88,
5915 U+1D2B, U+1D78, U+1DF8, U+2DE0..2DFF ...)
5916 \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (510)
5917 \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
5918 (80: U+10400..1044F)
5919 \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (220)
5920 \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
5921 (220: U+0900..0952, U+0955..097F,
5922 U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
5923 U+A830..A839 ...)
5924 \p{Script_Extensions: Diak} \p{Script_Extensions=Dives_Akuru} (72)
5925 \p{Script_Extensions: Dives_Akuru} (Short: \p{Scx=Diak}, \p{Diak})
5926 (72: U+11900..11906, U+11909,
5927 U+1190C..11913, U+11915..11916,
5928 U+11918..11935, U+11937..11938 ...)
5929 \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
5930 \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
5931 U+0964..096F, U+A830..A839,
5932 U+11800..1183B)
5933 \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
5934 \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
5935 \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
5936 (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
5937 U+1BC80..1BC88, U+1BC90..1BC99,
5938 U+1BC9C..1BCA3)
5939 \p{Script_Extensions: Egyp} \p{Script_Extensions=
5940 Egyptian_Hieroglyphs} (1110)
5941 \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
5942 \p{Egyp}) (1110: U+13000..13455)
5943 \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
5944 \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
5945 (40: U+10500..10527)
5946 \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
5947 \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
5948 (23: U+10FE0..10FF6)
5949 \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (523)
5950 \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
5951 (523: U+1200..1248, U+124A..124D,
5952 U+1250..1256, U+1258, U+125A..125D,
5953 U+1260..1288 ...)
5954 \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (174)
5955 \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
5956 (174: U+10A0..10C5, U+10C7, U+10CD,
5957 U+10D0..10FF, U+1C90..1CBA, U+1CBD..1CBF
5958 ...)
5959 \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (138)
5960 \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
5961 (138: U+0484, U+0487, U+2C00..2C5F,
5962 U+2E43, U+A66F, U+1E000..1E006 ...)
5963 \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
5964 (65)
5965 \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
5966 (77)
5967 \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
5968 \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
5969 U+10330..1034A)
5970 \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
5971 \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
5972 (116: U+0951..0952, U+0964..0965,
5973 U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
5974 U+1CF2..1CF4 ...)
5975 \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
5976 U+0342, U+0345, U+0370..0373,
5977 U+0375..0377, U+037A..037D, U+037F ...)
5978 \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
5979 \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
5980 (105: U+0951..0952, U+0964..0965,
5981 U+0A81..0A83, U+0A85..0A8D,
5982 U+0A8F..0A91, U+0A93..0AA8 ...)
5983 \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
5984 \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
5985 \p{Gong}) (65: U+0964..0965,
5986 U+11D60..11D65, U+11D67..11D68,
5987 U+11D6A..11D8E, U+11D90..11D91,
5988 U+11D93..11D98 ...)
5989 \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
5990 (94: U+0951..0952, U+0964..0965,
5991 U+0A01..0A03, U+0A05..0A0A,
5992 U+0A0F..0A10, U+0A13..0A28 ...)
5993 \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
5994 \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (98_696:
5995 U+2E80..2E99, U+2E9B..2EF3,
5996 U+2F00..2FD5, U+3001..3003,
5997 U+3005..3011, U+3013..301F ...)
5998 \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
5999 \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
6000 (11_775: U+1100..11FF, U+3001..3003,
6001 U+3008..3011, U+3013..301F,
6002 U+302E..3030, U+3037 ...)
6003 \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (98_696)
6004 \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
6005 \p{Rohg}) (55: U+060C, U+061B, U+061F,
6006 U+0640, U+06D4, U+10D00..10D27 ...)
6007 \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
6008 \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
6009 (23: U+1720..1736)
6010 \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
6011 \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
6012 U+108E0..108F2, U+108F4..108F5,
6013 U+108FB..108FF)
6014 \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
6015 \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
6016 (134: U+0591..05C7, U+05D0..05EA,
6017 U+05EF..05F4, U+FB1D..FB36,
6018 U+FB38..FB3C, U+FB3E ...)
6019 \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (433)
6020 \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
6021 (433: U+3001..3003, U+3008..3011,
6022 U+3013..301F, U+3030..3035, U+3037,
6023 U+303C..303D ...)
6024 \p{Script_Extensions: Hluw} \p{Script_Extensions=
6025 Anatolian_Hieroglyphs} (583)
6026 \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
6027 (127)
6028 \p{Script_Extensions: Hmnp} \p{Script_Extensions=
6029 Nyiakeng_Puachue_Hmong} (71)
6030 \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
6031 (108)
6032 \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
6033 \p{Armi}) (31: U+10840..10855,
6034 U+10857..1085F)
6035 \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
6036 (586: U+0300..0341, U+0343..0344,
6037 U+0346..0362, U+0953..0954,
6038 U+1AB0..1ACE, U+1DC2..1DF7 ...)
6039 \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
6040 \p{Phli}) (27: U+10B60..10B72,
6041 U+10B78..10B7F)
6042 \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
6043 Prti}, \p{Prti}) (30: U+10B40..10B55,
6044 U+10B58..10B5F)
6045 \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
6046 \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
6047 \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
6048 (91: U+A980..A9CD, U+A9CF..A9D9,
6049 U+A9DE..A9DF)
6050 \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (88:
6051 U+0966..096F, U+A830..A839,
6052 U+11080..110C2, U+110CD)
6053 \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
6054 \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (373)
6055 \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
6056 (106: U+0951..0952, U+0964..0965,
6057 U+0C80..0C8C, U+0C8E..0C90,
6058 U+0C92..0CA8, U+0CAA..0CB3 ...)
6059 \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
6060 (373: U+3001..3003, U+3008..3011,
6061 U+3013..301F, U+3030..3035, U+3037,
6062 U+303C..303D ...)
6063 \p{Script_Extensions: Kawi} (Short: \p{Scx=Kawi}, \p{Kawi}) (86:
6064 U+11F00..11F10, U+11F12..11F3A,
6065 U+11F3E..11F59)
6066 \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
6067 (48: U+A900..A92F)
6068 \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
6069 \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
6070 (68: U+10A00..10A03, U+10A05..10A06,
6071 U+10A0C..10A13, U+10A15..10A17,
6072 U+10A19..10A35, U+10A38..10A3A ...)
6073 \p{Script_Extensions: Khitan_Small_Script} (Short: \p{Scx=Kits},
6074 \p{Kits}) (471: U+16FE4, U+18B00..18CD5)
6075 \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
6076 U+1780..17DD, U+17E0..17E9,
6077 U+17F0..17F9, U+19E0..19FF)
6078 \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
6079 \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (85)
6080 \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (85:
6081 U+0AE6..0AEF, U+A830..A839,
6082 U+11200..11211, U+11213..11241)
6083 \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
6084 (81: U+0964..0965, U+A830..A839,
6085 U+112B0..112EA, U+112F0..112F9)
6086 \p{Script_Extensions: Kits} \p{Script_Extensions=
6087 Khitan_Small_Script} (471)
6088 \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (106)
6089 \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (88)
6090 \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
6091 \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (83:
6092 U+0E81..0E82, U+0E84, U+0E86..0E8A,
6093 U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
6094 \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (83)
6095 \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
6096 (1510: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
6097 \xf6\xf8-\xff], U+0100..02B8,
6098 U+02E0..02E4, U+0363..036F,
6099 U+0485..0486, U+0951..0952 ...)
6100 \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1510)
6101 \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
6102 \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
6103 U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
6104 \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
6105 \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
6106 U+0965, U+1900..191E, U+1920..192B,
6107 U+1930..193B, U+1940, U+1944..194F)
6108 \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
6109 \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
6110 \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
6111 (386: U+10107..10133, U+10600..10736,
6112 U+10740..10755, U+10760..10767)
6113 \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
6114 (268: U+10000..1000B, U+1000D..10026,
6115 U+10028..1003A, U+1003C..1003D,
6116 U+1003F..1004D, U+10050..1005D ...)
6117 \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (49:
6118 U+A4D0..A4FF, U+11FB0)
6119 \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
6120 \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
6121 U+10280..1029C)
6122 \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
6123 \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
6124 U+10920..10939, U+1093F)
6125 \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
6126 (61: U+0964..096F, U+A830..A839,
6127 U+11150..11176)
6128 \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
6129 \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
6130 \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
6131 (25: U+11EE0..11EF8)
6132 \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
6133 (126: U+0951..0952, U+0964..0965,
6134 U+0D00..0D0C, U+0D0E..0D10,
6135 U+0D12..0D44, U+0D46..0D48 ...)
6136 \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
6137 \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
6138 (30: U+0640, U+0840..085B, U+085E)
6139 \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
6140 \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
6141 (52: U+0640, U+10AC0..10AE6,
6142 U+10AEB..10AF6)
6143 \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
6144 \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
6145 (68: U+11C70..11C8F, U+11C92..11CA7,
6146 U+11CA9..11CB6)
6147 \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
6148 \p{Gonm}) (77: U+0964..0965,
6149 U+11D00..11D06, U+11D08..11D09,
6150 U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
6151 ...)
6152 \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
6153 (91: U+16E40..16E9A)
6154 \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
6155 \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
6156 \p{Mtei}) (79: U+AAE0..AAF6,
6157 U+ABC0..ABED, U+ABF0..ABF9)
6158 \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
6159 (213)
6160 \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
6161 \p{Mend}) (213: U+1E800..1E8C4,
6162 U+1E8C7..1E8D6)
6163 \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
6164 (90)
6165 \p{Script_Extensions: Mero} \p{Script_Extensions=
6166 Meroitic_Hieroglyphs} (32)
6167 \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
6168 \p{Merc}) (90: U+109A0..109B7,
6169 U+109BC..109CF, U+109D2..109FF)
6170 \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
6171 \p{Mero}) (32: U+10980..1099F)
6172 \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
6173 U+16F00..16F4A, U+16F4F..16F87,
6174 U+16F8F..16F9F)
6175 \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (126)
6176 \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
6177 U+A830..A839, U+11600..11644,
6178 U+11650..11659)
6179 \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (172)
6180 \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
6181 (172: U+1800..1819, U+1820..1878,
6182 U+1880..18AA, U+202F, U+11660..1166C)
6183 \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
6184 U+16A40..16A5E, U+16A60..16A69,
6185 U+16A6E..16A6F)
6186 \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
6187 \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
6188 \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
6189 \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
6190 (48: U+0A66..0A6F, U+11280..11286,
6191 U+11288, U+1128A..1128D, U+1128F..1129D,
6192 U+1129F..112A9)
6193 \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
6194 (224: U+1000..109F, U+A92E,
6195 U+A9E0..A9FE, U+AA60..AA7F)
6196 \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
6197 \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
6198 (40: U+10880..1089E, U+108A7..108AF)
6199 \p{Script_Extensions: Nag_Mundari} (Short: \p{Scx=Nagm}, \p{Nagm})
6200 (42: U+1E4D0..1E4F9)
6201 \p{Script_Extensions: Nagm} \p{Script_Extensions=Nag_Mundari} (42)
6202 \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
6203 \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
6204 (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
6205 U+1CF2, U+1CFA, U+A830..A835 ...)
6206 \p{Script_Extensions: Narb} \p{Script_Extensions=
6207 Old_North_Arabian} (32)
6208 \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
6209 \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
6210 (83: U+1980..19AB, U+19B0..19C9,
6211 U+19D0..19DA, U+19DE..19DF)
6212 \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (97:
6213 U+11400..1145B, U+1145D..11461)
6214 \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (67:
6215 U+060C, U+061B, U+061F, U+07C0..07FA,
6216 U+07FD..07FF, U+FD3E..FD3F)
6217 \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (67)
6218 \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
6219 \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
6220 U+16FE1, U+1B170..1B2FB)
6221 \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
6222 Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
6223 U+1E130..1E13D, U+1E140..1E149,
6224 U+1E14E..1E14F)
6225 \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
6226 \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
6227 U+1680..169C)
6228 \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
6229 (48: U+1C50..1C7F)
6230 \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
6231 \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
6232 \p{Hung}) (108: U+10C80..10CB2,
6233 U+10CC0..10CF2, U+10CFA..10CFF)
6234 \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
6235 (39: U+10300..10323, U+1032D..1032F)
6236 \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
6237 \p{Narb}) (32: U+10A80..10A9F)
6238 \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
6239 (44: U+0483, U+10350..1037A)
6240 \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
6241 (50: U+103A0..103C3, U+103C8..103D5)
6242 \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
6243 (40: U+10F00..10F27)
6244 \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
6245 \p{Sarb}) (32: U+10A60..10A7F)
6246 \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
6247 (73: U+10C00..10C48)
6248 \p{Script_Extensions: Old_Uyghur} (Short: \p{Scx=Ougr}, \p{Ougr})
6249 (28: U+0640, U+10AF2, U+10F70..10F89)
6250 \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (97:
6251 U+0951..0952, U+0964..0965,
6252 U+0B01..0B03, U+0B05..0B0C,
6253 U+0B0F..0B10, U+0B13..0B28 ...)
6254 \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
6255 \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (97)
6256 \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
6257 U+104B0..104D3, U+104D8..104FB)
6258 \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
6259 \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
6260 \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
6261 (40: U+10480..1049D, U+104A0..104A9)
6262 \p{Script_Extensions: Ougr} \p{Script_Extensions=Old_Uyghur} (28)
6263 \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
6264 \p{Hmng}) (127: U+16B00..16B45,
6265 U+16B50..16B59, U+16B5B..16B61,
6266 U+16B63..16B77, U+16B7D..16B8F)
6267 \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
6268 \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
6269 (32: U+10860..1087F)
6270 \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
6271 (57: U+11AC0..11AF8)
6272 \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
6273 \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
6274 \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
6275 \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
6276 (59: U+1802..1803, U+1805, U+A840..A877)
6277 \p{Script_Extensions: Phli} \p{Script_Extensions=
6278 Inscriptional_Pahlavi} (27)
6279 \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
6280 (30)
6281 \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
6282 \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
6283 (29: U+10900..1091B, U+1091F)
6284 \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
6285 \p{Script_Extensions: Prti} \p{Script_Extensions=
6286 Inscriptional_Parthian} (30)
6287 \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
6288 \p{Phlp}) (30: U+0640, U+10B80..10B91,
6289 U+10B99..10B9C, U+10BA9..10BAF)
6290 \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
6291 \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (586)
6292 \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
6293 U+A930..A953, U+A95F)
6294 \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
6295 \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
6296 (55)
6297 \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
6298 U+16A0..16EA, U+16EE..16F8)
6299 \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
6300 \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
6301 (61: U+0800..082D, U+0830..083E)
6302 \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
6303 \p{Script_Extensions: Sarb} \p{Script_Extensions=
6304 Old_South_Arabian} (32)
6305 \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
6306 \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
6307 (82: U+A880..A8C5, U+A8CE..A8D9)
6308 \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
6309 \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
6310 (102: U+0951, U+1CD7, U+1CD9,
6311 U+1CDC..1CDD, U+1CE0, U+11180..111DF)
6312 \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
6313 (48: U+10450..1047F)
6314 \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
6315 \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (102)
6316 \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
6317 \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
6318 (92: U+11580..115B5, U+115B8..115DD)
6319 \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
6320 (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
6321 U+1DAA1..1DAAF)
6322 \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
6323 \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (113)
6324 \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
6325 (113: U+0964..0965, U+0D81..0D83,
6326 U+0D85..0D96, U+0D9A..0DB1,
6327 U+0DB3..0DBB, U+0DBD ...)
6328 \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
6329 \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
6330 (43: U+0640, U+10F30..10F59)
6331 \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
6332 \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
6333 \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
6334 \p{Sora}) (35: U+110D0..110E8,
6335 U+110F0..110F9)
6336 \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
6337 \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
6338 (83: U+11A50..11AA2)
6339 \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
6340 \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
6341 (72: U+1B80..1BBF, U+1CC0..1CC7)
6342 \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (57)
6343 \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
6344 \p{Sylo}) (57: U+0964..0965,
6345 U+09E6..09EF, U+A800..A82C)
6346 \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (107)
6347 \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
6348 (107: U+060C, U+061B..061C, U+061F,
6349 U+0640, U+064B..0655, U+0670 ...)
6350 \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
6351 (25: U+1700..1715, U+171F, U+1735..1736)
6352 \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
6353 \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
6354 (20: U+1735..1736, U+1760..176C,
6355 U+176E..1770, U+1772..1773)
6356 \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
6357 U+1040..1049, U+1950..196D, U+1970..1974)
6358 \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
6359 (127: U+1A20..1A5E, U+1A60..1A7C,
6360 U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
6361 \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
6362 (72: U+AA80..AAC2, U+AADB..AADF)
6363 \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (80)
6364 \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (80:
6365 U+0964..0965, U+A830..A839,
6366 U+11680..116B9, U+116C0..116C9)
6367 \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
6368 \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
6369 \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
6370 U+0951..0952, U+0964..0965,
6371 U+0B82..0B83, U+0B85..0B8A,
6372 U+0B8E..0B90, U+0B92..0B95 ...)
6373 \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
6374 \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6914)
6375 \p{Script_Extensions: Tangsa} (Short: \p{Scx=Tnsa}, \p{Tnsa}) (89:
6376 U+16A70..16ABE, U+16AC0..16AC9)
6377 \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
6378 (6914: U+16FE0, U+17000..187F7,
6379 U+18800..18AFF, U+18D00..18D08)
6380 \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
6381 \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (106)
6382 \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
6383 (106: U+0951..0952, U+0964..0965,
6384 U+0C00..0C0C, U+0C0E..0C10,
6385 U+0C12..0C28, U+0C2A..0C39 ...)
6386 \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
6387 \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (25)
6388 \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
6389 \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
6390 U+060C, U+061B..061C, U+061F,
6391 U+0660..0669, U+0780..07B1, U+FDF2 ...)
6392 \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
6393 U+0E01..0E3A, U+0E40..0E5B)
6394 \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
6395 (207: U+0F00..0F47, U+0F49..0F6C,
6396 U+0F71..0F97, U+0F99..0FBC,
6397 U+0FBE..0FCC, U+0FCE..0FD4 ...)
6398 \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
6399 \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
6400 (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
6401 \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
6402 \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
6403 (97: U+0951..0952, U+0964..0965, U+1CF2,
6404 U+A830..A839, U+11480..114C7,
6405 U+114D0..114D9)
6406 \p{Script_Extensions: Tnsa} \p{Script_Extensions=Tangsa} (89)
6407 \p{Script_Extensions: Toto} (Short: \p{Scx=Toto}, \p{Toto}) (31:
6408 U+1E290..1E2AE)
6409 \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
6410 \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
6411 (31: U+10380..1039D, U+1039F)
6412 \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
6413 (964_861 plus all above-Unicode code
6414 points: U+0378..0379, U+0380..0383,
6415 U+038B, U+038D, U+03A2, U+0530 ...)
6416 \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
6417 U+A500..A62B)
6418 \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
6419 \p{Script_Extensions: Vith} \p{Script_Extensions=Vithkuqi} (70)
6420 \p{Script_Extensions: Vithkuqi} (Short: \p{Scx=Vith}, \p{Vith})
6421 (70: U+10570..1057A, U+1057C..1058A,
6422 U+1058C..10592, U+10594..10595,
6423 U+10597..105A1, U+105A3..105B1 ...)
6424 \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59:
6425 U+1E2C0..1E2F9, U+1E2FF)
6426 \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
6427 \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
6428 (84: U+118A0..118F2, U+118FF)
6429 \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59)
6430 \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
6431 \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
6432 \p{Script_Extensions: Yezi} \p{Script_Extensions=Yezidi} (60)
6433 \p{Script_Extensions: Yezidi} (Short: \p{Scx=Yezi}, \p{Yezi}) (60:
6434 U+060C, U+061B, U+061F, U+0660..0669,
6435 U+10E80..10EA9, U+10EAB..10EAD ...)
6436 \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
6437 U+3001..3002, U+3008..3011,
6438 U+3014..301B, U+30FB, U+A000..A48C,
6439 U+A490..A4C6 ...)
6440 \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
6441 \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
6442 \p{Zanb}) (72: U+11A00..11A47)
6443 \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
6444 (72)
6445 \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (586)
6446 \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7873)
6447 \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (964_861
6448 plus all above-Unicode code points)
6449 \p{Scx: *} \p{Script_Extensions: *}
6450 \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (50)
6451 \p{SD: *} \p{Soft_Dotted: *}
6452 \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
6453 \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
6454 U+FE52, U+FF0E)
6455 \p{Sentence_Break: CL} \p{Sentence_Break=Close} (195)
6456 \p{Sentence_Break: Close} (Short: \p{SB=CL}) (195: [\"\'\(\)\[\]
6457 \{\}\xab\xbb], U+0F3A..0F3D,
6458 U+169B..169C, U+2018..201F,
6459 U+2039..203A, U+2045..2046 ...)
6460 \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1: [\r])
6461 \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (2550)
6462 \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2550: U+0300..036F,
6463 U+0483..0489, U+0591..05BD, U+05BF,
6464 U+05C1..05C2, U+05C4..05C5 ...)
6465 \p{Sentence_Break: FO} \p{Sentence_Break=Format} (72)
6466 \p{Sentence_Break: Format} (Short: \p{SB=FO}) (72: [\xad],
6467 U+0600..0605, U+061C, U+06DD, U+070F,
6468 U+0890..0891 ...)
6469 \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (132_036)
6470 \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1: [\n])
6471 \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2497)
6472 \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2497: [a-z\xaa\xb5
6473 \xba\xdf-\xf6\xf8-\xff], U+0101, U+0103,
6474 U+0105, U+0107, U+0109 ...)
6475 \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (682)
6476 \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (682: [0-9],
6477 U+0660..0669, U+066B..066C,
6478 U+06F0..06F9, U+07C0..07C9, U+0966..096F
6479 ...)
6480 \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (132_036: U+01BB,
6481 U+01C0..01C3, U+0294, U+02B9..02BF,
6482 U+02C6..02D1, U+02EC ...)
6483 \p{Sentence_Break: Other} (Short: \p{SB=XX}) (973_938 plus all
6484 above-Unicode code points: [^\t\n\cK\f
6485 \r\x20!\"\'\(\),\-.0-9:?A-Z\[\]a-z\{\}
6486 \x85\xa0\xaa-\xab\xad\xb5\xba-\xbb\xc0-
6487 \xd6\xd8-\xf6\xf8-\xff], U+02C2..02C5,
6488 U+02D2..02DF, U+02E5..02EB, U+02ED,
6489 U+02EF..02FF ...)
6490 \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
6491 \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
6492 U+055D, U+060C..060D, U+07F8, U+1802,
6493 U+1808 ...)
6494 \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
6495 \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
6496 U+2028..2029)
6497 \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
6498 U+1680, U+2000..200A, U+202F, U+205F,
6499 U+3000)
6500 \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (151)
6501 \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (151: [!?], U+0589,
6502 U+061D..061F, U+06D4, U+0700..0702,
6503 U+07F9 ...)
6504 \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1936)
6505 \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1936: [A-Z\xc0-\xd6
6506 \xd8-\xde], U+0100, U+0102, U+0104,
6507 U+0106, U+0108 ...)
6508 \p{Sentence_Break: XX} \p{Sentence_Break=Other} (973_938 plus all
6509 above-Unicode code points)
6510 \p{Sentence_Terminal} \p{Sentence_Terminal=Y} (Short: \p{STerm})
6511 (154)
6512 \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
6513 (1_113_958 plus all above-Unicode code
6514 points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
6515 \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
6516 \xff], U+0100..0588, U+058A..061C,
6517 U+0620..06D3, U+06D5..06FF, U+0703..07F8
6518 ...)
6519 \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (154:
6520 [!.?], U+0589, U+061D..061F, U+06D4,
6521 U+0700..0702, U+07F9 ...)
6522 \p{Separator} \p{General_Category=Separator} (Short:
6523 \p{Z}) (19)
6524 \p{Sgnw} \p{SignWriting} (= \p{Script_Extensions=
6525 SignWriting}) (672)
6526 \p{Sharada} \p{Script_Extensions=Sharada} (Short:
6527 \p{Shrd}; NOT \p{Block=Sharada}) (102)
6528 \p{Shavian} \p{Script_Extensions=Shavian} (Short:
6529 \p{Shaw}) (48)
6530 \p{Shaw} \p{Shavian} (= \p{Script_Extensions=
6531 Shavian}) (48)
6532 X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
6533 (16)
6534 \p{Shrd} \p{Sharada} (= \p{Script_Extensions=
6535 Sharada}) (NOT \p{Block=Sharada}) (102)
6536 \p{Sidd} \p{Siddham} (= \p{Script_Extensions=
6537 Siddham}) (NOT \p{Block=Siddham}) (92)
6538 \p{Siddham} \p{Script_Extensions=Siddham} (Short:
6539 \p{Sidd}; NOT \p{Block=Siddham}) (92)
6540 \p{SignWriting} \p{Script_Extensions=SignWriting} (Short:
6541 \p{Sgnw}) (672)
6542 \p{Sind} \p{Khudawadi} (= \p{Script_Extensions=
6543 Khudawadi}) (NOT \p{Block=Khudawadi})
6544 (81)
6545 \p{Sinh} \p{Sinhala} (= \p{Script_Extensions=
6546 Sinhala}) (NOT \p{Block=Sinhala}) (113)
6547 \p{Sinhala} \p{Script_Extensions=Sinhala} (Short:
6548 \p{Sinh}; NOT \p{Block=Sinhala}) (113)
6549 X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
6550 \p{Sk} \p{Modifier_Symbol} (=
6551 \p{General_Category=Modifier_Symbol})
6552 (125)
6553 \p{Sm} \p{Math_Symbol} (= \p{General_Category=
6554 Math_Symbol}) (948)
6555 X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
6556 \p{InSmallForms}) (32)
6557 X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
6558 Small_Form_Variants}) (32)
6559 X \p{Small_Kana_Ext} \p{Small_Kana_Extension} (= \p{Block=
6560 Small_Kana_Extension}) (64)
6561 X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short:
6562 \p{InSmallKanaExt}) (64)
6563 \p{So} \p{Other_Symbol} (= \p{General_Category=
6564 Other_Symbol}) (6634)
6565 \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (50)
6566 \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_062 plus
6567 all above-Unicode code points: [\x00-
6568 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6569 Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
6570 U+0100..012E, U+0130..0248,
6571 U+024A..0267, U+0269..029C, U+029E..02B1
6572 ...)
6573 \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (50: [i-j],
6574 U+012F, U+0249, U+0268, U+029D, U+02B2
6575 ...)
6576 \p{Sogd} \p{Sogdian} (= \p{Script_Extensions=
6577 Sogdian}) (NOT \p{Block=Sogdian}) (43)
6578 \p{Sogdian} \p{Script_Extensions=Sogdian} (Short:
6579 \p{Sogd}; NOT \p{Block=Sogdian}) (43)
6580 \p{Sogo} \p{Old_Sogdian} (= \p{Script_Extensions=
6581 Old_Sogdian}) (NOT \p{Block=
6582 Old_Sogdian}) (40)
6583 \p{Sora} \p{Sora_Sompeng} (= \p{Script_Extensions=
6584 Sora_Sompeng}) (NOT \p{Block=
6585 Sora_Sompeng}) (35)
6586 \p{Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} (Short:
6587 \p{Sora}; NOT \p{Block=Sora_Sompeng})
6588 (35)
6589 \p{Soyo} \p{Soyombo} (= \p{Script_Extensions=
6590 Soyombo}) (NOT \p{Block=Soyombo}) (83)
6591 \p{Soyombo} \p{Script_Extensions=Soyombo} (Short:
6592 \p{Soyo}; NOT \p{Block=Soyombo}) (83)
6593 \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25)
6594 \p{Space: *} \p{White_Space: *}
6595 \p{Space_Separator} \p{General_Category=Space_Separator}
6596 (Short: \p{Zs}) (17)
6597 \p{SpacePerl} \p{XPosixSpace} (25)
6598 \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
6599 \p{Mc}) (452)
6600 X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
6601 (Short: \p{InModifierLetters}) (80)
6602 X \p{Specials} \p{Block=Specials} (16)
6603 \p{STerm} \p{Sentence_Terminal} (=
6604 \p{Sentence_Terminal=Y}) (154)
6605 \p{STerm: *} \p{Sentence_Terminal: *}
6606 \p{Sund} \p{Sundanese} (= \p{Script_Extensions=
6607 Sundanese}) (NOT \p{Block=Sundanese})
6608 (72)
6609 \p{Sundanese} \p{Script_Extensions=Sundanese} (Short:
6610 \p{Sund}; NOT \p{Block=Sundanese}) (72)
6611 X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
6612 Sundanese_Supplement}) (16)
6613 X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
6614 \p{InSundaneseSup}) (16)
6615 X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
6616 Supplemental_Arrows_A}) (16)
6617 X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
6618 Supplemental_Arrows_B}) (128)
6619 X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block=
6620 Supplemental_Arrows_C}) (256)
6621 X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
6622 \p{Block=
6623 Supplemental_Mathematical_Operators})
6624 (256)
6625 X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
6626 \p{Block=
6627 Supplementary_Private_Use_Area_A})
6628 (65_536)
6629 X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
6630 \p{Block=
6631 Supplementary_Private_Use_Area_B})
6632 (65_536)
6633 X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
6634 Supplemental_Punctuation}) (128)
6635 X \p{Sup_Symbols_And_Pictographs}
6636 \p{Supplemental_Symbols_And_Pictographs}
6637 (= \p{Block=
6638 Supplemental_Symbols_And_Pictographs})
6639 (256)
6640 X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
6641 \p{Block=Superscripts_And_Subscripts})
6642 (48)
6643 X \p{Superscripts_And_Subscripts} \p{Block=
6644 Superscripts_And_Subscripts} (Short:
6645 \p{InSuperAndSub}) (48)
6646 X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
6647 \p{InSupArrowsA}) (16)
6648 X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
6649 \p{InSupArrowsB}) (128)
6650 X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
6651 \p{InSupArrowsC}) (256)
6652 X \p{Supplemental_Mathematical_Operators} \p{Block=
6653 Supplemental_Mathematical_Operators}
6654 (Short: \p{InSupMathOperators}) (256)
6655 X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
6656 (Short: \p{InSupPunctuation}) (128)
6657 X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
6658 Supplemental_Symbols_And_Pictographs}
6659 (Short: \p{InSupSymbolsAndPictographs})
6660 (256)
6661 X \p{Supplementary_Private_Use_Area_A} \p{Block=
6662 Supplementary_Private_Use_Area_A}
6663 (Short: \p{InSupPUAA}) (65_536)
6664 X \p{Supplementary_Private_Use_Area_B} \p{Block=
6665 Supplementary_Private_Use_Area_B}
6666 (Short: \p{InSupPUAB}) (65_536)
6667 \p{Surrogate} \p{General_Category=Surrogate} (Short:
6668 \p{Cs}) (2048)
6669 X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688)
6670 \p{Sylo} \p{Syloti_Nagri} (= \p{Script_Extensions=
6671 Syloti_Nagri}) (NOT \p{Block=
6672 Syloti_Nagri}) (57)
6673 \p{Syloti_Nagri} \p{Script_Extensions=Syloti_Nagri} (Short:
6674 \p{Sylo}; NOT \p{Block=Syloti_Nagri})
6675 (57)
6676 \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
6677 (7770)
6678 X \p{Symbols_And_Pictographs_Ext_A}
6679 \p{Symbols_And_Pictographs_Extended_A}
6680 (= \p{Block=
6681 Symbols_And_Pictographs_Extended_A})
6682 (144)
6683 X \p{Symbols_And_Pictographs_Extended_A} \p{Block=
6684 Symbols_And_Pictographs_Extended_A} (144)
6685 X \p{Symbols_For_Legacy_Computing} \p{Block=
6686 Symbols_For_Legacy_Computing} (256)
6687 \p{Syrc} \p{Syriac} (= \p{Script_Extensions=
6688 Syriac}) (NOT \p{Block=Syriac}) (107)
6689 \p{Syriac} \p{Script_Extensions=Syriac} (Short:
6690 \p{Syrc}; NOT \p{Block=Syriac}) (107)
6691 X \p{Syriac_Sup} \p{Syriac_Supplement} (= \p{Block=
6692 Syriac_Supplement}) (16)
6693 X \p{Syriac_Supplement} \p{Block=Syriac_Supplement} (Short:
6694 \p{InSyriacSup}) (16)
6695 \p{Tagalog} \p{Script_Extensions=Tagalog} (Short:
6696 \p{Tglg}; NOT \p{Block=Tagalog}) (25)
6697 \p{Tagb} \p{Tagbanwa} (= \p{Script_Extensions=
6698 Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
6699 \p{Tagbanwa} \p{Script_Extensions=Tagbanwa} (Short:
6700 \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
6701 X \p{Tags} \p{Block=Tags} (128)
6702 \p{Tai_Le} \p{Script_Extensions=Tai_Le} (Short:
6703 \p{Tale}; NOT \p{Block=Tai_Le}) (45)
6704 \p{Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
6705 \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
6706 \p{Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
6707 \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
6708 X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
6709 Tai_Xuan_Jing_Symbols}) (96)
6710 X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
6711 \p{InTaiXuanJing}) (96)
6712 \p{Takr} \p{Takri} (= \p{Script_Extensions=Takri})
6713 (NOT \p{Block=Takri}) (80)
6714 \p{Takri} \p{Script_Extensions=Takri} (Short:
6715 \p{Takr}; NOT \p{Block=Takri}) (80)
6716 \p{Tale} \p{Tai_Le} (= \p{Script_Extensions=
6717 Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
6718 \p{Talu} \p{New_Tai_Lue} (= \p{Script_Extensions=
6719 New_Tai_Lue}) (NOT \p{Block=
6720 New_Tai_Lue}) (83)
6721 \p{Tamil} \p{Script_Extensions=Tamil} (Short:
6722 \p{Taml}; NOT \p{Block=Tamil}) (133)
6723 X \p{Tamil_Sup} \p{Tamil_Supplement} (= \p{Block=
6724 Tamil_Supplement}) (64)
6725 X \p{Tamil_Supplement} \p{Block=Tamil_Supplement} (Short:
6726 \p{InTamilSup}) (64)
6727 \p{Taml} \p{Tamil} (= \p{Script_Extensions=Tamil})
6728 (NOT \p{Block=Tamil}) (133)
6729 \p{Tang} \p{Tangut} (= \p{Script_Extensions=
6730 Tangut}) (NOT \p{Block=Tangut}) (6914)
6731 \p{Tangsa} \p{Script_Extensions=Tangsa} (Short:
6732 \p{Tnsa}; NOT \p{Block=Tangsa}) (89)
6733 \p{Tangut} \p{Script_Extensions=Tangut} (Short:
6734 \p{Tang}; NOT \p{Block=Tangut}) (6914)
6735 X \p{Tangut_Components} \p{Block=Tangut_Components} (768)
6736 X \p{Tangut_Sup} \p{Tangut_Supplement} (= \p{Block=
6737 Tangut_Supplement}) (128)
6738 X \p{Tangut_Supplement} \p{Block=Tangut_Supplement} (Short:
6739 \p{InTangutSup}) (128)
6740 \p{Tavt} \p{Tai_Viet} (= \p{Script_Extensions=
6741 Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
6742 \p{Telu} \p{Telugu} (= \p{Script_Extensions=
6743 Telugu}) (NOT \p{Block=Telugu}) (106)
6744 \p{Telugu} \p{Script_Extensions=Telugu} (Short:
6745 \p{Telu}; NOT \p{Block=Telugu}) (106)
6746 \p{Term} \p{Terminal_Punctuation} (=
6747 \p{Terminal_Punctuation=Y}) (278)
6748 \p{Term: *} \p{Terminal_Punctuation: *}
6749 \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
6750 \p{Term}) (278)
6751 \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
6752 (1_113_834 plus all above-Unicode code
6753 points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
6754 9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
6755 U+0100..037D, U+037F..0386,
6756 U+0388..0588, U+058A..05C2, U+05C4..060B
6757 ...)
6758 \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (278:
6759 [!,.:;?], U+037E, U+0387, U+0589,
6760 U+05C3, U+060C ...)
6761 \p{Tfng} \p{Tifinagh} (= \p{Script_Extensions=
6762 Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
6763 \p{Tglg} \p{Tagalog} (= \p{Script_Extensions=
6764 Tagalog}) (NOT \p{Block=Tagalog}) (25)
6765 \p{Thaa} \p{Thaana} (= \p{Script_Extensions=
6766 Thaana}) (NOT \p{Block=Thaana}) (66)
6767 \p{Thaana} \p{Script_Extensions=Thaana} (Short:
6768 \p{Thaa}; NOT \p{Block=Thaana}) (66)
6769 \p{Thai} \p{Script_Extensions=Thai} (NOT \p{Block=
6770 Thai}) (86)
6771 \p{Tibetan} \p{Script_Extensions=Tibetan} (Short:
6772 \p{Tibt}; NOT \p{Block=Tibetan}) (207)
6773 \p{Tibt} \p{Tibetan} (= \p{Script_Extensions=
6774 Tibetan}) (NOT \p{Block=Tibetan}) (207)
6775 \p{Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
6776 \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
6777 \p{Tirh} \p{Tirhuta} (= \p{Script_Extensions=
6778 Tirhuta}) (NOT \p{Block=Tirhuta}) (97)
6779 \p{Tirhuta} \p{Script_Extensions=Tirhuta} (Short:
6780 \p{Tirh}; NOT \p{Block=Tirhuta}) (97)
6781 \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
6782 \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
6783 Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
6784 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
6785 \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
6786 (Short: \p{Lt}; /i= General_Category=
6787 Cased_Letter) (31)
6788 \p{Tnsa} \p{Tangsa} (= \p{Script_Extensions=
6789 Tangsa}) (NOT \p{Block=Tangsa}) (89)
6790 \p{Toto} \p{Script_Extensions=Toto} (NOT \p{Block=
6791 Toto}) (31)
6792 X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
6793 Transport_And_Map_Symbols}) (128)
6794 X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
6795 (Short: \p{InTransportAndMap}) (128)
6796 X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
6797 (= \p{Block=
6798 Unified_Canadian_Aboriginal_Syllabics})
6799 (640)
6800 X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
6801 Extended} (= \p{Block=
6802 Unified_Canadian_Aboriginal_Syllabics_-
6803 Extended}) (80)
6804 X \p{UCAS_Ext_A} \p{Unified_Canadian_Aboriginal_Syllabics_-
6805 Extended_A} (= \p{Block=
6806 Unified_Canadian_Aboriginal_Syllabics_-
6807 Extended_A}) (16)
6808 \p{Ugar} \p{Ugaritic} (= \p{Script_Extensions=
6809 Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
6810 \p{Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
6811 \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
6812 \p{UIdeo} \p{Unified_Ideograph} (=
6813 \p{Unified_Ideograph=Y}) (97_058)
6814 \p{UIdeo: *} \p{Unified_Ideograph: *}
6815 \p{Unassigned} \p{General_Category=Unassigned} (Short:
6816 \p{Cn}) (825_345 plus all above-Unicode
6817 code points)
6818 \p{Unicode} \p{Any} (1_114_112)
6819 X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
6820 Unified_Canadian_Aboriginal_Syllabics}
6821 (Short: \p{InUCAS}) (640)
6822 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
6823 Unified_Canadian_Aboriginal_Syllabics_-
6824 Extended} (Short: \p{InUCASExt}) (80)
6825 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended_A} \p{Block=
6826 Unified_Canadian_Aboriginal_Syllabics_-
6827 Extended_A} (Short: \p{InUCASExtA}) (16)
6828 \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
6829 (97_058)
6830 \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
6831 (1_017_054 plus all above-Unicode code
6832 points: U+0000..33FF, U+4DC0..4DFF,
6833 U+A000..FA0D, U+FA10, U+FA12,
6834 U+FA15..FA1E ...)
6835 \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (97_058:
6836 U+3400..4DBF, U+4E00..9FFF,
6837 U+FA0E..FA0F, U+FA11, U+FA13..FA14,
6838 U+FA1F ...)
6839 \p{Unknown} \p{Script_Extensions=Unknown} (Short:
6840 \p{Zzzz}) (964_861 plus all above-
6841 Unicode code points)
6842 \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6843 Cased=Yes) (1951)
6844 \p{Upper: *} \p{Uppercase: *}
6845 \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6846 Cased=Yes) (1951)
6847 \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
6848 No) (1_112_161 plus all above-Unicode
6849 code points: [\x00-\x20!\"#\$\%&\'
6850 \(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-z\{
6851 \|\}~\x7f-\xbf\xd7\xdf-\xff], U+0101,
6852 U+0103, U+0105, U+0107, U+0109 ...)
6853 \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
6854 Yes) (1951: [A-Z\xc0-\xd6\xd8-\xde],
6855 U+0100, U+0102, U+0104, U+0106, U+0108
6856 ...)
6857 \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
6858 (Short: \p{Lu}; /i= General_Category=
6859 Cased_Letter) (1831)
6860 \p{Vai} \p{Script_Extensions=Vai} (NOT \p{Block=
6861 Vai}) (300)
6862 \p{Vaii} \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
6863 \p{Block=Vai}) (300)
6864 \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
6865 NOT \p{Variation_Selectors}) (260)
6866 \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_852
6867 plus all above-Unicode code points:
6868 U+0000..180A, U+180E, U+1810..FDFF,
6869 U+FE10..E00FF, U+E01F0..infinity)
6870 \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (260:
6871 U+180B..180D, U+180F, U+FE00..FE0F,
6872 U+E0100..E01EF)
6873 X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
6874 \p{InVS}) (16)
6875 X \p{Variation_Selectors_Supplement} \p{Block=
6876 Variation_Selectors_Supplement} (Short:
6877 \p{InVSSup}) (240)
6878 X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
6879 Vedic_Extensions}) (48)
6880 X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
6881 \p{InVedicExt}) (48)
6882 X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
6883 \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
6884 (786_609 plus all above-Unicode code
6885 points)
6886 \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (786_609 plus
6887 all above-Unicode code points: [\x00-
6888 \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
6889 \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
6890 U+02EC..10FF, U+1200..1400,
6891 U+1680..18AF, U+1900..2015 ...)
6892 \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
6893 Transformed_Rotated} (47)
6894 \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
6895 (47: U+2329..232A, U+3008..3011,
6896 U+3014..301F, U+3030, U+30A0, U+30FC ...)
6897 \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
6898 (148: U+3001..3002, U+3041, U+3043,
6899 U+3045, U+3047, U+3049 ...)
6900 \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
6901 Transformed_Upright} (148)
6902 \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
6903 (327_308)
6904 \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (327_308:
6905 [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
6906 U+02EA..02EB, U+1100..11FF,
6907 U+1401..167F, U+18B0..18FF, U+2016 ...)
6908 \p{VertSpace} \v (7: [\n\cK\f\r\x85], U+2028..2029)
6909 \p{Vith} \p{Vithkuqi} (= \p{Script_Extensions=
6910 Vithkuqi}) (NOT \p{Block=Vithkuqi}) (70)
6911 \p{Vithkuqi} \p{Script_Extensions=Vithkuqi} (Short:
6912 \p{Vith}; NOT \p{Block=Vithkuqi}) (70)
6913 \p{Vo: *} \p{Vertical_Orientation: *}
6914 \p{VS} \p{Variation_Selector} (=
6915 \p{Variation_Selector=Y}) (NOT
6916 \p{Variation_Selectors}) (260)
6917 \p{VS: *} \p{Variation_Selector: *}
6918 X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
6919 \p{Block=
6920 Variation_Selectors_Supplement}) (240)
6921 \p{Wancho} \p{Script_Extensions=Wancho} (Short:
6922 \p{Wcho}; NOT \p{Block=Wancho}) (59)
6923 \p{Wara} \p{Warang_Citi} (= \p{Script_Extensions=
6924 Warang_Citi}) (NOT \p{Block=
6925 Warang_Citi}) (84)
6926 \p{Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
6927 \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
6928 \p{WB: *} \p{Word_Break: *}
6929 \p{Wcho} \p{Wancho} (= \p{Script_Extensions=
6930 Wancho}) (NOT \p{Block=Wancho}) (59)
6931 \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25)
6932 \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087
6933 plus all above-Unicode code points: [^
6934 \t\n\cK\f\r\x20\x85\xa0], U+0100..167F,
6935 U+1681..1FFF, U+200B..2027,
6936 U+202A..202E, U+2030..205E ...)
6937 \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25: [\t
6938 \n\cK\f\r\x20\x85\xa0], U+1680,
6939 U+2000..200A, U+2028..2029, U+202F,
6940 U+205F ...)
6941 \p{Word} \p{XPosixWord} (139_612)
6942 \p{Word_Break: ALetter} (Short: \p{WB=LE}) (29_489: [A-Za-z\xaa
6943 \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6944 U+0100..02D7, U+02DE..02FF,
6945 U+0370..0374, U+0376..0377, U+037A..037D
6946 ...)
6947 \p{Word_Break: CR} (Short: \p{WB=CR}) (1: [\r])
6948 \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
6949 \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
6950 \p{Word_Break: E_Base} (Short: \p{WB=EB}) (0)
6951 \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0)
6952 \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0)
6953 \p{Word_Break: EB} \p{Word_Break=E_Base} (0)
6954 \p{Word_Break: EBG} \p{Word_Break=E_Base_GAZ} (0)
6955 \p{Word_Break: EM} \p{Word_Break=E_Modifier} (0)
6956 \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (11)
6957 \p{Word_Break: Extend} (Short: \p{WB=Extend}) (2554:
6958 U+0300..036F, U+0483..0489,
6959 U+0591..05BD, U+05BF, U+05C1..05C2,
6960 U+05C4..05C5 ...)
6961 \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
6962 U+203F..2040, U+2054, U+FE33..FE34,
6963 U+FE4D..FE4F ...)
6964 \p{Word_Break: FO} \p{Word_Break=Format} (71)
6965 \p{Word_Break: Format} (Short: \p{WB=FO}) (71: [\xad],
6966 U+0600..0605, U+061C, U+06DD, U+070F,
6967 U+0890..0891 ...)
6968 \p{Word_Break: GAZ} \p{Word_Break=Glue_After_Zwj} (0)
6969 \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0)
6970 \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75:
6971 U+05D0..05EA, U+05EF..05F2, U+FB1D,
6972 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
6973 ...)
6974 \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (75)
6975 \p{Word_Break: KA} \p{Word_Break=Katakana} (331)
6976 \p{Word_Break: Katakana} (Short: \p{WB=KA}) (331: U+3031..3035,
6977 U+309B..309C, U+30A0..30FA,
6978 U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
6979 ...)
6980 \p{Word_Break: LE} \p{Word_Break=ALetter} (29_489)
6981 \p{Word_Break: LF} (Short: \p{WB=LF}) (1: [\n])
6982 \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
6983 \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9: [:\xb7], U+0387,
6984 U+055F, U+05F4, U+2027, U+FE13 ...)
6985 \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15: [,;], U+037E,
6986 U+0589, U+060C..060D, U+066C, U+07F8 ...)
6987 \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
6988 U+2018..2019, U+2024, U+FE52, U+FF07,
6989 U+FF0E)
6990 \p{Word_Break: ML} \p{Word_Break=MidLetter} (9)
6991 \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
6992 \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
6993 U+2028..2029)
6994 \p{Word_Break: NL} \p{Word_Break=Newline} (5)
6995 \p{Word_Break: NU} \p{Word_Break=Numeric} (681)
6996 \p{Word_Break: Numeric} (Short: \p{WB=NU}) (681: [0-9],
6997 U+0660..0669, U+066B, U+06F0..06F9,
6998 U+07C0..07C9, U+0966..096F ...)
6999 \p{Word_Break: Other} (Short: \p{WB=XX}) (1_080_819 plus all
7000 above-Unicode code points: [^\n\cK\f\r
7001 \x20\"\',.0-9:;A-Z_a-z\x85\xaa\xad\xb5
7002 \xb7\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7003 U+02D8..02DD, U+0375, U+0378..0379,
7004 U+0380..0385, U+038B ...)
7005 \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
7006 U+1F1E6..1F1FF)
7007 \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
7008 \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
7009 \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
7010 \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20],
7011 U+1680, U+2000..2006, U+2008..200A,
7012 U+205F, U+3000)
7013 \p{Word_Break: XX} \p{Word_Break=Other} (1_080_819 plus all
7014 above-Unicode code points)
7015 \p{Word_Break: ZWJ} (Short: \p{WB=ZWJ}) (1: U+200D)
7016 \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
7017 \p{WSpace: *} \p{White_Space: *}
7018 \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
7019 \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
7020 (139_463)
7021 \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (974_649
7022 plus all above-Unicode code points:
7023 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
7024 \@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4
7025 \xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
7026 U+02C2..02C5, U+02D2..02DF,
7027 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
7028 \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (139_463:
7029 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
7030 \xd8-\xf6\xf8-\xff], U+0100..02C1,
7031 U+02C6..02D1, U+02E0..02E4, U+02EC,
7032 U+02EE ...)
7033 \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (136_322)
7034 \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (977_790
7035 plus all above-Unicode code points:
7036 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
7037 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
7038 \xb6-\xb9\xbb-\xbf\xd7\xf7],
7039 U+02C2..02C5, U+02D2..02DF,
7040 U+02E5..02EB, U+02ED, U+02EF..036F ...)
7041 \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (136_322:
7042 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
7043 \xf8-\xff], U+0100..02C1, U+02C6..02D1,
7044 U+02E0..02E4, U+02EC, U+02EE ...)
7045 \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
7046 (139_463)
7047 \p{XIDC: *} \p{XID_Continue: *}
7048 \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (136_322)
7049 \p{XIDS: *} \p{XID_Start: *}
7050 \p{Xpeo} \p{Old_Persian} (= \p{Script_Extensions=
7051 Old_Persian}) (NOT \p{Block=
7052 Old_Persian}) (50)
7053 \p{XPerlSpace} \p{XPosixSpace} (25)
7054 \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short:
7055 \p{Alnum}) (138_445: [0-9A-Za-z\xaa\xb5
7056 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7057 U+0100..02C1, U+02C6..02D1,
7058 U+02E0..02E4, U+02EC, U+02EE ...)
7059 \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha})
7060 (137_765)
7061 \p{XPosixBlank} \h, Horizontal white space (Short:
7062 \p{Blank}) (18: [\t\x20\xa0], U+1680,
7063 U+2000..200A, U+202F, U+205F, U+3000)
7064 \p{XPosixCntrl} \p{General_Category=Control} Control
7065 characters (Short: \p{Cc}) (65)
7066 \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9]
7067 + all other decimal digits (Short:
7068 \p{Nd}) (680)
7069 \p{XPosixGraph} Characters that are graphical (Short:
7070 \p{Graph}) (286_635: [!\"#\$\%&\'
7071 \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`a-z
7072 \{\|\}~\xa1-\xff], U+0100..0377,
7073 U+037A..037F, U+0384..038A, U+038C,
7074 U+038E..03A1 ...)
7075 \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i=
7076 Cased=Yes) (2544)
7077 \p{XPosixPrint} Characters that are graphical plus space
7078 characters (but no controls) (Short:
7079 \p{Print}) (286_652: [\x20-\x7e\xa0-
7080 \xff], U+0100..0377, U+037A..037F,
7081 U+0384..038A, U+038C, U+038E..03A1 ...)
7082 \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (851:
7083 [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
7084 \^_`\{\|\}~\xa1\xa7\xab\xb6-\xb7\xbb
7085 \xbf], U+037E, U+0387, U+055A..055F,
7086 U+0589..058A, U+05BE ...)
7087 \p{XPosixSpace} \s including beyond ASCII and vertical tab
7088 (Short: \p{SpacePerl}) (25: [\t\n\cK\f
7089 \r\x20\x85\xa0], U+1680, U+2000..200A,
7090 U+2028..2029, U+202F, U+205F ...)
7091 \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i=
7092 Cased=Yes) (1951)
7093 \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} +
7094 \pM + \p{Pc} + \p{Join_Control} (Short:
7095 \p{Word}) (139_612: [0-9A-Z_a-z\xaa\xb5
7096 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7097 U+0100..02C1, U+02C6..02D1,
7098 U+02E0..02E4, U+02EC, U+02EE ...)
7099 \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
7100 \p{Xsux} \p{Cuneiform} (= \p{Script_Extensions=
7101 Cuneiform}) (NOT \p{Block=Cuneiform})
7102 (1234)
7103 \p{Yezi} \p{Yezidi} (= \p{Script_Extensions=
7104 Yezidi}) (NOT \p{Block=Yezidi}) (60)
7105 \p{Yezidi} \p{Script_Extensions=Yezidi} (Short:
7106 \p{Yezi}; NOT \p{Block=Yezidi}) (60)
7107 \p{Yi} \p{Script_Extensions=Yi} (1246)
7108 X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
7109 X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
7110 \p{Yiii} \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
7111 X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
7112 Yijing_Hexagram_Symbols}) (64)
7113 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
7114 (Short: \p{InYijing}) (64)
7115 \p{Z} \pZ \p{Separator} (= \p{General_Category=
7116 Separator}) (19)
7117 \p{Zanabazar_Square} \p{Script_Extensions=Zanabazar_Square}
7118 (Short: \p{Zanb}; NOT \p{Block=
7119 Zanabazar_Square}) (72)
7120 \p{Zanb} \p{Zanabazar_Square} (=
7121 \p{Script_Extensions=Zanabazar_Square})
7122 (NOT \p{Block=Zanabazar_Square}) (72)
7123 \p{Zinh} \p{Inherited} (= \p{Script_Extensions=
7124 Inherited}) (586)
7125 \p{Zl} \p{Line_Separator} (= \p{General_Category=
7126 Line_Separator}) (1)
7127 X \p{Znamenny_Music} \p{Znamenny_Musical_Notation} (= \p{Block=
7128 Znamenny_Musical_Notation}) (208)
7129 X \p{Znamenny_Musical_Notation} \p{Block=Znamenny_Musical_Notation}
7130 (Short: \p{InZnamennyMusic}) (208)
7131 \p{Zp} \p{Paragraph_Separator} (=
7132 \p{General_Category=
7133 Paragraph_Separator}) (1)
7134 \p{Zs} \p{Space_Separator} (=
7135 \p{General_Category=Space_Separator})
7136 (17)
7137 \p{Zyyy} \p{Common} (= \p{Script_Extensions=
7138 Common}) (7873)
7139 \p{Zzzz} \p{Unknown} (= \p{Script_Extensions=
7140 Unknown}) (964_861 plus all above-
7141 Unicode code points)
7142
7143 Legal "\p{}" and "\P{}" constructs that match no characters
7144 Unicode has some property-value pairs that currently don't match
7145 anything. This happens generally either because they are obsolete, or
7146 they exist for symmetry with other forms, but no language has yet been
7147 encoded that uses them. In this version of Unicode, the following
7148 match zero code points:
7149
7150 \p{Canonical_Combining_Class=Attached_Below_Left}
7151 \p{Canonical_Combining_Class=CCC133}
7152 \p{Grapheme_Cluster_Break=E_Base}
7153 \p{Grapheme_Cluster_Break=E_Base_GAZ}
7154 \p{Grapheme_Cluster_Break=E_Modifier}
7155 \p{Grapheme_Cluster_Break=Glue_After_Zwj}
7156 \p{Word_Break=E_Base}
7157 \p{Word_Break=E_Base_GAZ}
7158 \p{Word_Break=E_Modifier}
7159 \p{Word_Break=Glue_After_Zwj}
7160
7162 The value of any Unicode (not including Perl extensions) character
7163 property mentioned above for any single code point is available through
7164 "charprop()" in Unicode::UCD. "charprops_all()" in Unicode::UCD
7165 returns the values of all the Unicode properties for a given code
7166 point.
7167
7168 Besides these, all the Unicode character properties mentioned above
7169 (except for those marked as for internal use by Perl) are also
7170 accessible by "prop_invlist()" in Unicode::UCD.
7171
7172 Due to their nature, not all Unicode character properties are suitable
7173 for regular expression matches, nor prop_invlist(). The remaining non-
7174 provisional, non-internal ones are accessible via "prop_invmap()" in
7175 Unicode::UCD (except for those that this Perl installation hasn't
7176 included; see below for which those are).
7177
7178 For compatibility with other parts of Perl, all the single forms given
7179 in the table in the section above are recognized. BUT, there are some
7180 ambiguities between some Perl extensions and the Unicode properties,
7181 all of which are silently resolved in favor of the official Unicode
7182 property. To avoid surprises, you should only use prop_invmap() for
7183 forms listed in the table below, which omits the non-recommended ones.
7184 The affected forms are the Perl single form equivalents of Unicode
7185 properties, such as "\p{sc}" being a single-form equivalent of
7186 "\p{gc=sc}", which is treated by prop_invmap() as the "Script"
7187 property, whose short name is "sc". The table indicates the current
7188 ambiguities in the INFO column, beginning with the word "NOT".
7189
7190 The standard Unicode properties listed below are documented in
7191 <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
7192 documented in "prop_invmap()" in Unicode::UCD. The other Perl
7193 extensions are in "Other Properties" in perlunicode;
7194
7195 The first column in the table is a name for the property; the second
7196 column is an alternative name, if any, plus possibly some annotations.
7197 The alternative name is the property's full name, unless that would
7198 simply repeat the first column, in which case the second column
7199 indicates the property's short name (if different). The annotations
7200 are given only in the entry for the full name. The annotations for
7201 binary properties include a list of the first few ranges that the
7202 property matches. To avoid any ambiguity, the SPACE character is
7203 represented as "\x20".
7204
7205 If a property is obsolete, etc, the entry will be flagged with the same
7206 characters used in the table in the section above, like D or S.
7207
7208 NAME INFO
7209
7210 Age
7211 AHex ASCII_Hex_Digit
7212 All (Perl extension). All code points,
7213 including those above Unicode. Same as
7214 qr/./s. U+0000..infinity
7215 Alnum XPosixAlnum. (Perl extension)
7216 Alpha Alphabetic
7217 Alphabetic (Short: Alpha). [A-Za-z\xaa\xb5\xba\xc0-
7218 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7219 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7220 ...
7221 Any (Perl extension). All Unicode code
7222 points. U+0000..10FFFF
7223 ASCII Block=Basic_Latin. (Perl extension).
7224 [\x00-\x7f]
7225 ASCII_Hex_Digit (Short: AHex). [0-9A-Fa-f]
7226 Assigned (Perl extension). All assigned code
7227 points. U+0000..0377, U+037A..037F,
7228 U+0384..038A, U+038C, U+038E..03A1,
7229 U+03A3..052F ...
7230 Bc Bidi_Class
7231 Bidi_C Bidi_Control
7232 Bidi_Class (Short: bc)
7233 Bidi_Control (Short: Bidi_C). U+061C, U+200E..200F,
7234 U+202A..202E, U+2066..2069
7235 Bidi_M Bidi_Mirrored
7236 Bidi_Mirrored (Short: Bidi_M). [\(\)<>\[\]\{\}\xab
7237 \xbb], U+0F3A..0F3D, U+169B..169C,
7238 U+2039..203A, U+2045..2046, U+207D..207E
7239 ...
7240 Bidi_Mirroring_Glyph (Short: bmg)
7241 Bidi_Paired_Bracket (Short: bpb)
7242 Bidi_Paired_Bracket_Type (Short: bpt)
7243 Blank XPosixBlank. (Perl extension)
7244 Blk Block
7245 Block (Short: blk)
7246 Bmg Bidi_Mirroring_Glyph
7247 Bpb Bidi_Paired_Bracket
7248 Bpt Bidi_Paired_Bracket_Type
7249 Canonical_Combining_Class (Short: ccc)
7250 Case_Folding (Short: cf)
7251 Case_Ignorable (Short: CI). [\'.:\^`\xa8\xad\xaf\xb4
7252 \xb7-\xb8], U+02B0..036F, U+0374..0375,
7253 U+037A, U+0384..0385, U+0387 ...
7254 Cased [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7255 \xff], U+0100..01BA, U+01BC..01BF,
7256 U+01C4..0293, U+0295..02B8, U+02C0..02C1
7257 ...
7258 Category General_Category
7259 Ccc Canonical_Combining_Class
7260 CE Composition_Exclusion
7261 Cf Case_Folding; NOT 'cf' meaning
7262 'General_Category=Format'
7263 Changes_When_Casefolded (Short: CWCF). [A-Z\xb5\xc0-\xd6\xd8-
7264 \xdf], U+0100, U+0102, U+0104, U+0106,
7265 U+0108 ...
7266 Changes_When_Casemapped (Short: CWCM). [A-Za-z\xb5\xc0-\xd6\xd8-
7267 \xf6\xf8-\xff], U+0100..0137,
7268 U+0139..018C, U+018E..019A, U+019C..01A9,
7269 U+01AC..01B9 ...
7270 Changes_When_Lowercased (Short: CWL). [A-Z\xc0-\xd6\xd8-\xde],
7271 U+0100, U+0102, U+0104, U+0106, U+0108 ...
7272 Changes_When_NFKC_Casefolded (Short: CWKCF). [A-Z\xa0\xa8\xaa
7273 \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
7274 \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
7275 U+0106, U+0108 ...
7276 Changes_When_Titlecased (Short: CWT). [a-z\xb5\xdf-\xf6\xf8-
7277 \xff], U+0101, U+0103, U+0105, U+0107,
7278 U+0109 ...
7279 Changes_When_Uppercased (Short: CWU). [a-z\xb5\xdf-\xf6\xf8-
7280 \xff], U+0101, U+0103, U+0105, U+0107,
7281 U+0109 ...
7282 CI Case_Ignorable
7283 Cntrl XPosixCntrl (=General_Category=Control).
7284 (Perl extension)
7285 Comp_Ex Full_Composition_Exclusion
7286 Composition_Exclusion (Short: CE). U+0958..095F, U+09DC..09DD,
7287 U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
7288 CWCF Changes_When_Casefolded
7289 CWCM Changes_When_Casemapped
7290 CWKCF Changes_When_NFKC_Casefolded
7291 CWL Changes_When_Lowercased
7292 CWT Changes_When_Titlecased
7293 CWU Changes_When_Uppercased
7294 Dash [\-], U+058A, U+05BE, U+1400, U+1806,
7295 U+2010..2015 ...
7296 Decomposition_Mapping (Short: dm)
7297 Decomposition_Type (Short: dt)
7298 Default_Ignorable_Code_Point (Short: DI). [\xad], U+034F, U+061C,
7299 U+115F..1160, U+17B4..17B5, U+180B..180F
7300 ...
7301 Dep Deprecated
7302 Deprecated (Short: Dep). U+0149, U+0673, U+0F77,
7303 U+0F79, U+17A3..17A4, U+206A..206F ...
7304 DI Default_Ignorable_Code_Point
7305 Dia Diacritic
7306 Diacritic (Short: Dia). [\^`\xa8\xaf\xb4\xb7-\xb8],
7307 U+02B0..034E, U+0350..0357, U+035D..0362,
7308 U+0374..0375, U+037A ...
7309 Digit XPosixDigit (=General_Category=
7310 Decimal_Number). (Perl extension)
7311 Dm Decomposition_Mapping
7312 Dt Decomposition_Type
7313 Ea East_Asian_Width
7314 East_Asian_Width (Short: ea)
7315 EBase Emoji_Modifier_Base
7316 EComp Emoji_Component
7317 EMod Emoji_Modifier
7318 Emoji [#*0-9\xa9\xae], U+203C, U+2049, U+2122,
7319 U+2139, U+2194..2199 ...
7320 Emoji_Component (Short: EComp). [#*0-9], U+200D, U+20E3,
7321 U+FE0F, U+1F1E6..1F1FF, U+1F3FB..1F3FF ...
7322 Emoji_Modifier (Short: EMod). U+1F3FB..1F3FF
7323 Emoji_Modifier_Base (Short: EBase). U+261D, U+26F9,
7324 U+270A..270D, U+1F385, U+1F3C2..1F3C4,
7325 U+1F3C7 ...
7326 Emoji_Presentation (Short: EPres). U+231A..231B,
7327 U+23E9..23EC, U+23F0, U+23F3,
7328 U+25FD..25FE, U+2614..2615 ...
7329 EPres Emoji_Presentation
7330 EqUIdeo Equivalent_Unified_Ideograph
7331 Equivalent_Unified_Ideograph (Short: EqUIdeo)
7332 Ext Extender
7333 Extended_Pictographic (Short: ExtPict). [\xa9\xae], U+203C,
7334 U+2049, U+2122, U+2139, U+2194..2199 ...
7335 Extender (Short: Ext). [\xb7], U+02D0..02D1,
7336 U+0640, U+07FA, U+0B55, U+0E46 ...
7337 ExtPict Extended_Pictographic
7338 Full_Composition_Exclusion (Short: Comp_Ex). U+0340..0341,
7339 U+0343..0344, U+0374, U+037E, U+0387,
7340 U+0958..095F ...
7341 Gc General_Category
7342 GCB Grapheme_Cluster_Break
7343 General_Category (Short: gc)
7344 Gr_Base Grapheme_Base
7345 Gr_Ext Grapheme_Extend
7346 Graph XPosixGraph. (Perl extension)
7347 Grapheme_Base (Short: Gr_Base). [\x20-\x7e\xa0-\xac
7348 \xae-\xff], U+0100..02FF, U+0370..0377,
7349 U+037A..037F, U+0384..038A, U+038C ...
7350 Grapheme_Cluster_Break (Short: GCB)
7351 Grapheme_Extend (Short: Gr_Ext). U+0300..036F,
7352 U+0483..0489, U+0591..05BD, U+05BF,
7353 U+05C1..05C2, U+05C4..05C5 ...
7354 Hangul_Syllable_Type (Short: hst)
7355 Hex Hex_Digit
7356 Hex_Digit (Short: Hex). [0-9A-Fa-f], U+FF10..FF19,
7357 U+FF21..FF26, U+FF41..FF46
7358 HorizSpace XPosixBlank. (Perl extension)
7359 Hst Hangul_Syllable_Type
7360 D Hyphen [\-\xad], U+058A, U+1806, U+2010..2011,
7361 U+2E17, U+30FB ... Supplanted by
7362 Line_Break property values; see
7363 www.unicode.org/reports/tr14
7364 ID_Continue (Short: IDC). [0-9A-Z_a-z\xaa\xb5\xb7
7365 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7366 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7367 U+02EC, U+02EE ...
7368 ID_Start (Short: IDS). [A-Za-z\xaa\xb5\xba\xc0-
7369 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7370 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7371 ...
7372 IDC ID_Continue
7373 Identifier_Status
7374 Identifier_Type
7375 Ideo Ideographic
7376 Ideographic (Short: Ideo). U+3006..3007,
7377 U+3021..3029, U+3038..303A, U+3400..4DBF,
7378 U+4E00..9FFF, U+F900..FA6D ...
7379 IDS ID_Start
7380 IDS_Binary_Operator (Short: IDSB). U+2FF0..2FF1, U+2FF4..2FFB
7381 IDS_Trinary_Operator (Short: IDST). U+2FF2..2FF3
7382 IDSB IDS_Binary_Operator
7383 IDST IDS_Trinary_Operator
7384 In Present_In. (Perl extension)
7385 Indic_Positional_Category (Short: InPC)
7386 Indic_Syllabic_Category (Short: InSC)
7387 InPC Indic_Positional_Category
7388 InSC Indic_Syllabic_Category
7389 Isc ISO_Comment; NOT 'isc' meaning
7390 'General_Category=Other'
7391 ISO_Comment (Short: isc)
7392 Jg Joining_Group
7393 Join_C Join_Control
7394 Join_Control (Short: Join_C). U+200C..200D
7395 Joining_Group (Short: jg)
7396 Joining_Type (Short: jt)
7397 Jt Joining_Type
7398 Lb Line_Break
7399 Lc Lowercase_Mapping; NOT 'lc' meaning
7400 'General_Category=Cased_Letter'
7401 Line_Break (Short: lb)
7402 LOE Logical_Order_Exception
7403 Logical_Order_Exception (Short: LOE). U+0E40..0E44, U+0EC0..0EC4,
7404 U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
7405 ...
7406 Lower Lowercase
7407 Lowercase (Short: Lower). [a-z\xaa\xb5\xba\xdf-
7408 \xf6\xf8-\xff], U+0101, U+0103, U+0105,
7409 U+0107, U+0109 ...
7410 Lowercase_Mapping (Short: lc)
7411 Math [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
7412 U+03D5, U+03F0..03F1, U+03F4..03F6,
7413 U+0606..0608 ...
7414 Na Name
7415 Na1 Unicode_1_Name
7416 Name (Short: na)
7417 Name_Alias
7418 NChar Noncharacter_Code_Point
7419 NFC_QC NFC_Quick_Check
7420 NFC_Quick_Check (Short: NFC_QC)
7421 NFD_QC NFD_Quick_Check
7422 NFD_Quick_Check (Short: NFD_QC)
7423 NFKC_Casefold (Short: NFKC_CF)
7424 NFKC_CF NFKC_Casefold
7425 NFKC_QC NFKC_Quick_Check
7426 NFKC_Quick_Check (Short: NFKC_QC)
7427 NFKD_QC NFKD_Quick_Check
7428 NFKD_Quick_Check (Short: NFKD_QC)
7429 Noncharacter_Code_Point (Short: NChar). U+FDD0..FDEF,
7430 U+FFFE..FFFF, U+1FFFE..1FFFF,
7431 U+2FFFE..2FFFF, U+3FFFE..3FFFF,
7432 U+4FFFE..4FFFF ...
7433 Nt Numeric_Type
7434 Numeric_Type (Short: nt)
7435 Numeric_Value (Short: nv)
7436 Nv Numeric_Value
7437 Pat_Syn Pattern_Syntax
7438 Pat_WS Pattern_White_Space
7439 Pattern_Syntax (Short: Pat_Syn). [!\"#\$\%&\'\(\)*+,\-.
7440 \/:;<=>?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9
7441 \xab-\xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7
7442 \xf7], U+2010..2027, U+2030..203E,
7443 U+2041..2053, U+2055..205E, U+2190..245F
7444 ...
7445 Pattern_White_Space (Short: Pat_WS). [\t\n\cK\f\r\x20\x85],
7446 U+200E..200F, U+2028..2029
7447 PCM Prepended_Concatenation_Mark
7448 Perl_Decimal_Digit (Perl extension)
7449 PerlSpace PosixSpace. (Perl extension)
7450 PerlWord PosixWord. (Perl extension)
7451 PosixAlnum (Perl extension). [0-9A-Za-z]
7452 PosixAlpha (Perl extension). [A-Za-z]
7453 PosixBlank (Perl extension). [\t\x20]
7454 PosixCntrl (Perl extension). ASCII control
7455 characters. ACK, BEL, BS, CAN, CR, DC1,
7456 DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
7457 ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
7458 NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
7459 PosixDigit (Perl extension). [0-9]
7460 PosixGraph (Perl extension). [!\"#\$\%&\'\(\)*+,\-.
7461 \/0-9:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~]
7462 PosixLower (Perl extension). [a-z]
7463 PosixPrint (Perl extension). [\x20-\x7e]
7464 PosixPunct (Perl extension). [!\"#\$\%&\'\(\)*+,\-.
7465 \/:;<=>?\@\[\\\]\^_`\{\|\}~]
7466 PosixSpace (Perl extension). [\t\n\cK\f\r\x20]
7467 PosixUpper (Perl extension). [A-Z]
7468 PosixWord (Perl extension). \w, restricted to
7469 ASCII. [0-9A-Z_a-z]
7470 PosixXDigit ASCII_Hex_Digit. (Perl extension).
7471 [0-9A-Fa-f]
7472 Prepended_Concatenation_Mark (Short: PCM). U+0600..0605, U+06DD,
7473 U+070F, U+0890..0891, U+08E2, U+110BD ...
7474 Present_In (Short: In). (Perl extension)
7475 Print XPosixPrint. (Perl extension)
7476 Punct General_Category=Punctuation. (Perl
7477 extension). [!\"#\%&\'\(\)*,\-.\/:;?\@
7478 \[\\\]_\{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
7479 U+037E, U+0387, U+055A..055F,
7480 U+0589..058A, U+05BE ...
7481 QMark Quotation_Mark
7482 Quotation_Mark (Short: QMark). [\"\'\xab\xbb],
7483 U+2018..201F, U+2039..203A, U+2E42,
7484 U+300C..300F, U+301D..301F ...
7485 Radical U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
7486 Regional_Indicator (Short: RI). U+1F1E6..1F1FF
7487 RI Regional_Indicator
7488 SB Sentence_Break
7489 Sc Script; NOT 'sc' meaning
7490 'General_Category=Currency_Symbol'
7491 Scf Simple_Case_Folding
7492 Script (Short: sc)
7493 Script_Extensions (Short: scx)
7494 Scx Script_Extensions
7495 SD Soft_Dotted
7496 Sentence_Break (Short: SB)
7497 Sentence_Terminal (Short: STerm). [!.?], U+0589,
7498 U+061D..061F, U+06D4, U+0700..0702, U+07F9
7499 ...
7500 Sfc Simple_Case_Folding
7501 Simple_Case_Folding (Short: scf)
7502 Simple_Lowercase_Mapping (Short: slc)
7503 Simple_Titlecase_Mapping (Short: stc)
7504 Simple_Uppercase_Mapping (Short: suc)
7505 Slc Simple_Lowercase_Mapping
7506 Soft_Dotted (Short: SD). [i-j], U+012F, U+0249,
7507 U+0268, U+029D, U+02B2 ...
7508 Space White_Space
7509 SpacePerl XPosixSpace. (Perl extension)
7510 Stc Simple_Titlecase_Mapping
7511 STerm Sentence_Terminal
7512 Suc Simple_Uppercase_Mapping
7513 Tc Titlecase_Mapping
7514 Term Terminal_Punctuation
7515 Terminal_Punctuation (Short: Term). [!,.:;?], U+037E, U+0387,
7516 U+0589, U+05C3, U+060C ...
7517 Title Titlecase. (Perl extension)
7518 Titlecase (Short: Title). (Perl extension). (=
7519 \p{Gc=Lt}). U+01C5, U+01C8, U+01CB,
7520 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
7521 Titlecase_Mapping (Short: tc)
7522 Uc Uppercase_Mapping
7523 UIdeo Unified_Ideograph
7524 Unicode Any. (Perl extension)
7525 Unicode_1_Name (Short: na1)
7526 Unified_Ideograph (Short: UIdeo). U+3400..4DBF,
7527 U+4E00..9FFF, U+FA0E..FA0F, U+FA11,
7528 U+FA13..FA14, U+FA1F ...
7529 Upper Uppercase
7530 Uppercase (Short: Upper). [A-Z\xc0-\xd6\xd8-\xde],
7531 U+0100, U+0102, U+0104, U+0106, U+0108 ...
7532 Uppercase_Mapping (Short: uc)
7533 Variation_Selector (Short: VS). U+180B..180D, U+180F,
7534 U+FE00..FE0F, U+E0100..E01EF
7535 Vertical_Orientation (Short: vo)
7536 VertSpace (Perl extension). \v. [\n\cK\f\r\x85],
7537 U+2028..2029
7538 Vo Vertical_Orientation
7539 VS Variation_Selector
7540 WB Word_Break
7541 White_Space (Short: WSpace). [\t\n\cK\f\r\x20\x85
7542 \xa0], U+1680, U+2000..200A, U+2028..2029,
7543 U+202F, U+205F ...
7544 Word XPosixWord. (Perl extension)
7545 Word_Break (Short: WB)
7546 WSpace White_Space
7547 XDigit XPosixXDigit (=Hex_Digit). (Perl
7548 extension)
7549 XID_Continue (Short: XIDC). [0-9A-Z_a-z\xaa\xb5\xb7
7550 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7551 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7552 U+02EC, U+02EE ...
7553 XID_Start (Short: XIDS). [A-Za-z\xaa\xb5\xba\xc0-
7554 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7555 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7556 ...
7557 XIDC XID_Continue
7558 XIDS XID_Start
7559 XPerlSpace XPosixSpace. (Perl extension)
7560 XPosixAlnum (Short: Alnum). (Perl extension).
7561 Alphabetic and (decimal) Numeric. [0-9A-
7562 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7563 \xff], U+0100..02C1, U+02C6..02D1,
7564 U+02E0..02E4, U+02EC, U+02EE ...
7565 XPosixAlpha Alphabetic. (Perl extension). [A-Za-z
7566 \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7567 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7568 U+02EC, U+02EE ...
7569 XPosixBlank (Short: Blank). (Perl extension). \h,
7570 Horizontal white space. [\t\x20\xa0],
7571 U+1680, U+2000..200A, U+202F, U+205F,
7572 U+3000
7573 XPosixCntrl General_Category=Control (Short: Cntrl).
7574 (Perl extension). Control characters.
7575 [\x00-\x1f\x7f-\x9f]
7576 XPosixDigit General_Category=Decimal_Number (Short:
7577 Digit). (Perl extension). [0-9] + all
7578 other decimal digits. [0-9],
7579 U+0660..0669, U+06F0..06F9, U+07C0..07C9,
7580 U+0966..096F, U+09E6..09EF ...
7581 XPosixGraph (Short: Graph). (Perl extension).
7582 Characters that are graphical. [!\"#\$
7583 \%&\'\(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]
7584 \^_`a-z\{\|\}~\xa1-\xff], U+0100..0377,
7585 U+037A..037F, U+0384..038A, U+038C,
7586 U+038E..03A1 ...
7587 XPosixLower Lowercase. (Perl extension). [a-z\xaa
7588 \xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
7589 U+0103, U+0105, U+0107, U+0109 ...
7590 XPosixPrint (Short: Print). (Perl extension).
7591 Characters that are graphical plus space
7592 characters (but no controls). [\x20-\x7e
7593 \xa0-\xff], U+0100..0377, U+037A..037F,
7594 U+0384..038A, U+038C, U+038E..03A1 ...
7595 XPosixPunct (Perl extension). \p{Punct} + ASCII-range
7596 \p{Symbol}. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7597 >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
7598 \xb7\xbb\xbf], U+037E, U+0387,
7599 U+055A..055F, U+0589..058A, U+05BE ...
7600 XPosixSpace (Perl extension). \s including beyond
7601 ASCII and vertical tab. [\t\n\cK\f\r\x20
7602 \x85\xa0], U+1680, U+2000..200A,
7603 U+2028..2029, U+202F, U+205F ...
7604 XPosixUpper Uppercase. (Perl extension). [A-Z\xc0-
7605 \xd6\xd8-\xde], U+0100, U+0102, U+0104,
7606 U+0106, U+0108 ...
7607 XPosixWord (Short: Word). (Perl extension). \w,
7608 including beyond ASCII; = \p{Alnum} + \pM
7609 + \p{Pc} + \p{Join_Control}. [0-9A-Z_a-z
7610 \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7611 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7612 U+02EC, U+02EE ...
7613 XPosixXDigit Hex_Digit (Short: XDigit). (Perl
7614 extension). [0-9A-Fa-f], U+FF10..FF19,
7615 U+FF21..FF26, U+FF41..FF46
7616
7618 Certain properties are accessible also via core function calls. These
7619 are:
7620
7621 Lowercase_Mapping lc() and lcfirst()
7622 Titlecase_Mapping ucfirst()
7623 Uppercase_Mapping uc()
7624
7625 Also, Case_Folding is accessible through the "/i" modifier in regular
7626 expressions, the "\F" transliteration escape, and the "fc" operator.
7627
7628 Besides being able to say "\p{Name=...}", the Name and Name_Aliases
7629 properties are accessible through the "\N{}" interpolation in double-
7630 quoted strings and regular expressions; and functions
7631 charnames::viacode(), charnames::vianame(), and
7632 charnames::string_vianame() (which require a "use charnames ();" to be
7633 specified.
7634
7635 Finally, most properties related to decomposition are accessible via
7636 Unicode::Normalize.
7637
7639 Perl will generate an error for a few character properties in Unicode
7640 when used in a regular expression. The non-Unihan ones are listed
7641 below, with the reasons they are not accepted, perhaps with work-
7642 arounds. The short names for the properties are listed enclosed in
7643 (parentheses). As described after the list, an installation can change
7644 the defaults and choose to accept any of these. The list is machine
7645 generated based on the choices made for the installation that generated
7646 this document.
7647
7648 Expands_On_NFC (XO_NFC)
7649 Expands_On_NFD (XO_NFD)
7650 Expands_On_NFKC (XO_NFKC)
7651 Expands_On_NFKD (XO_NFKD)
7652 Deprecated by Unicode. These are characters that expand to more
7653 than one character in the specified normalization form, but whether
7654 they actually take up more bytes or not depends on the encoding
7655 being used. For example, a UTF-8 encoded character may expand to a
7656 different number of bytes than a UTF-32 encoded character.
7657
7658 Grapheme_Link (Gr_Link)
7659 Duplicates ccc=vr (Canonical_Combining_Class=Virama)
7660
7661 Jamo_Short_Name (JSN)
7662 Other_Alphabetic (OAlpha)
7663 Other_Default_Ignorable_Code_Point (ODI)
7664 Other_Grapheme_Extend (OGr_Ext)
7665 Other_ID_Continue (OIDC)
7666 Other_ID_Start (OIDS)
7667 Other_Lowercase (OLower)
7668 Other_Math (OMath)
7669 Other_Uppercase (OUpper)
7670 Used by Unicode internally for generating other properties and not
7671 intended to be used stand-alone
7672
7673 Script=Katakana_Or_Hiragana (sc=Hrkt)
7674 Obsolete. All code points previously matched by this have been
7675 moved to "Script=Common". Consider instead using
7676 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
7677 both)
7678
7679 Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
7680 All code points that would be matched by this are matched by either
7681 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
7682
7683 An installation can choose to allow any of these to be matched by
7684 downloading the Unicode database from <http://www.unicode.org/Public/>
7685 to $Config{privlib}/unicore/ in the Perl source tree, changing the
7686 controlling lists contained in the program
7687 $Config{privlib}/unicore/mktables and then re-compiling and installing.
7688 (%Config is available from the Config module).
7689
7690 Also, perl can be recompiled to operate on an earlier version of the
7691 Unicode standard. Further information is at
7692 $Config{privlib}/unicore/README.perl.
7693
7695 The Unicode data base is delivered in two different formats. The XML
7696 version is valid for more modern Unicode releases. The other version
7697 is a collection of files. The two are intended to give equivalent
7698 information. Perl uses the older form; this allows you to recompile
7699 Perl to use early Unicode releases.
7700
7701 The only non-character property that Perl currently supports is Named
7702 Sequences, in which a sequence of code points is given a name and
7703 generally treated as a single entity. (Perl supports these via the
7704 "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
7705 in charnames, and "namedseq()" in Unicode::UCD.
7706
7707 Below is a list of the files in the Unicode data base that Perl doesn't
7708 currently use, along with very brief descriptions of their purposes.
7709 Some of the names of the files have been shortened from those that
7710 Unicode uses, in order to allow them to be distinguishable from
7711 similarly named files on file systems for which only the first 8
7712 characters of a name are significant.
7713
7714 auxiliary/GraphemeBreakTest.html
7715 auxiliary/LineBreakTest.html
7716 auxiliary/SentenceBreakTest.html
7717 auxiliary/WordBreakTest.html
7718 Documentation of validation Tests
7719
7720 BidiCharacterTest.txt
7721 BidiTest.txt
7722 Validation Tests
7723
7724 CJKRadicals.txt
7725 Maps the kRSUnicode property values to corresponding code points
7726
7727 confusables.txt
7728 confusablesSummary.txt
7729 intentional.txt
7730 Currently unused by Perl
7731
7732 emoji/ReadMe.txt
7733 ReadMe.txt
7734 Documentation
7735
7736 EmojiSources.txt
7737 Maps certain Unicode code points to their legacy Japanese cell-
7738 phone values
7739
7740 extracted/DName.txt
7741 This file adds no new information not already present in other
7742 files
7743
7744 Index.txt
7745 Alphabetical index of Unicode characters
7746
7747 NamedSqProv.txt
7748 Named sequences proposed for inclusion in a later version of the
7749 Unicode Standard; if you need them now, you can append this file to
7750 NamedSequences.txt and recompile perl
7751
7752 NamesList.html
7753 Describes the format and contents of NamesList.txt
7754
7755 NamesList.txt
7756 Annotated list of characters
7757
7758 NormalizationCorrections.txt
7759 Documentation of corrections already incorporated into the Unicode
7760 data base
7761
7762 NushuSources.txt
7763 Specifies source material for Nushu characters
7764
7765 StandardizedVariants.html
7766 Obsoleted as of Unicode 9.0, but previously provided a visual
7767 display of the standard variant sequences derived from
7768 StandardizedVariants.txt.
7769
7770 StandardizedVariants.txt
7771 Certain glyph variations for character display are standardized.
7772 This lists the non-Unihan ones; the Unihan ones are also not used
7773 by Perl, and are in a separate Unicode data base
7774 <http://www.unicode.org/ivd>
7775
7776 TangutSources.txt
7777 Specifies source mappings for Tangut ideographs and components.
7778 This data file also includes informative radical-stroke values that
7779 are used internally by Unicode
7780
7781 USourceData.txt
7782 Documentation of status and cross reference of proposals for
7783 encoding by Unicode of Unihan characters
7784
7785 USourceGlyphs.pdf
7786 Pictures of the characters in USourceData.txt
7787
7789 <http://www.unicode.org/reports/tr44/>
7790
7791 perlrecharclass
7792
7793 perlunicode
7794
7795
7796
7797perl v5.38.2 2023-11-30 PERLUNIPROPS(1)