1PERLUNIPROPS(1) Perl Programmers Reference Guide PERLUNIPROPS(1)
2
3
4
6 perluniprops - Index of Unicode Version 13.0.0 character properties in
7 Perl
8
10 This document provides information about the portion of the Unicode
11 database that deals with character properties, that is the portion that
12 is defined on single code points. ("Other information in the Unicode
13 data base" below briefly mentions other data that Unicode provides.)
14
15 Perl can provide access to all non-provisional Unicode character
16 properties, though not all are enabled by default. The omitted ones
17 are the Unihan properties (accessible via the CPAN module
18 Unicode::Unihan) and certain deprecated or Unicode-internal properties.
19 (An installation may choose to recompile Perl's tables to change this.
20 See "Unicode character properties that are NOT accepted by Perl".)
21
22 For most purposes, access to Unicode properties from the Perl core is
23 through regular expression matches, as described in the next section.
24 For some special purposes, and to access the properties that are not
25 suitable for regular expression matching, all the Unicode character
26 properties that Perl handles are accessible via the standard
27 Unicode::UCD module, as described in the section "Properties accessible
28 through Unicode::UCD".
29
30 Perl also provides some additional extensions and short-cut synonyms
31 for Unicode properties.
32
33 This document merely lists all available properties and does not
34 attempt to explain what each property really means. There is a brief
35 description of each Perl extension; see "Other Properties" in
36 perlunicode for more information on these. There is some detail about
37 Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
38 to find out about the intricacies of the official Unicode properties,
39 refer to the Unicode standard. A good starting place is
40 <http://www.unicode.org/reports/tr44/>.
41
42 Note that you can define your own properties; see "User-Defined
43 Character Properties" in perlunicode.
44
46 The Perl regular expression "\p{}" and "\P{}" constructs give access to
47 most of the Unicode character properties. The table below shows all
48 these constructs, both single and compound forms.
49
50 Compound forms consist of two components, separated by an equals sign
51 or a colon. The first component is the property name, and the second
52 component is the particular value of the property to match against, for
53 example, "\p{Script_Extensions: Greek}" and
54 "\p{Script_Extensions=Greek}" both mean to match characters whose
55 Script_Extensions property value is Greek. ("Script_Extensions" is an
56 improved version of the "Script" property.)
57
58 Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
59 their equivalent compound forms. The table shows these equivalences.
60 (In our example, "\p{Greek}" is a just a shortcut for
61 "\p{Script_Extensions=Greek}"). There are also a few Perl-defined
62 single forms that are not shortcuts for a compound form. One such is
63 "\p{Word}". These are also listed in the table.
64
65 In parsing these constructs, Perl always ignores Upper/lower case
66 differences everywhere within the {braces}. Thus "\p{Greek}" means the
67 same thing as "\p{greek}". But note that changing the case of the "p"
68 or "P" before the left brace completely changes the meaning of the
69 construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
70 Casing in this document is for improved legibility.
71
72 Also, white space, hyphens, and underscores are normally ignored
73 everywhere between the {braces}, and hence can be freely added or
74 removed even if the "/x" modifier hasn't been specified on the regular
75 expression. But in the table below a 'T' at the beginning of an entry
76 means that tighter (stricter) rules are used for that entry:
77
78 Single form ("\p{name}") tighter rules:
79 White space, hyphens, and underscores ARE significant except
80 for:
81
82 • white space adjacent to a non-word character
83
84 • underscores separating digits in numbers
85
86 That means, for example, that you can freely add or remove
87 white space adjacent to (but within) the braces without
88 affecting the meaning.
89
90 Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
91 The tighter rules given above for the single form apply to
92 everything to the right of the colon or equals; the looser
93 rules still apply to everything to the left.
94
95 That means, for example, that you can freely add or remove
96 white space adjacent to (but within) the braces and the colon
97 or equal sign.
98
99 Some properties are considered obsolete by Unicode, but still
100 available. There are several varieties of obsolescence:
101
102 Stabilized
103 A property may be stabilized. Such a determination does not
104 indicate that the property should or should not be used;
105 instead it is a declaration that the property will not be
106 maintained nor extended for newly encoded characters. Such
107 properties are marked with an 'S' in the table.
108
109 Deprecated
110 A property may be deprecated, perhaps because its original
111 intent has been replaced by another property, or because its
112 specification was somehow defective. This means that its use
113 is strongly discouraged, so much so that a warning will be
114 issued if used, unless the regular expression is in the scope
115 of a "no warnings 'deprecated'" statement. A 'D' flags each
116 such entry in the table, and the entry there for the longest,
117 most descriptive version of the property will give the reason
118 it is deprecated, and perhaps advice. Perl may issue such a
119 warning, even for properties that aren't officially deprecated
120 by Unicode, when there used to be characters or code points
121 that were matched by them, but no longer. This is to warn you
122 that your program may not work like it did on earlier Unicode
123 releases.
124
125 A deprecated property may be made unavailable in a future Perl
126 version, so it is best to move away from them.
127
128 A deprecated property may also be stabilized, but this fact is
129 not shown.
130
131 Obsolete
132 Properties marked with an 'O' in the table are considered
133 (plain) obsolete. Generally this designation is given to
134 properties that Unicode once used for internal purposes (but
135 not any longer).
136
137 Discouraged
138 This is not actually a Unicode-specified obsolescence, but
139 applies to certain Perl extensions that are present for
140 backwards compatibility, but are discouraged from being used.
141 These are not obsolete, but their meanings are not stable.
142 Future Unicode versions could force any of these extensions to
143 be removed without warning, replaced by another property with
144 the same name that means something different. An 'X' flags
145 each such entry in the table. Use the equivalent shown
146 instead.
147
148 In particular, matches in the Block property have single forms
149 defined by Perl that begin with "In_", ""Is_", or even with no
150 prefix at all, Like all DISCOURAGED forms, these are not
151 stable. For example, "\p{Block=Deseret}" can currently be
152 written as "\p{In_Deseret}", "\p{Is_Deseret}", or
153 "\p{Deseret}". But, a new Unicode version may come along that
154 would force Perl to change the meaning of one or more of these,
155 and your program would no longer be correct. Currently there
156 are no such conflicts with the form that begins "In_", but
157 there are many with the other two shortcuts, and Unicode
158 continues to define new properties that begin with "In", so
159 it's quite possible that a conflict will occur in the future.
160 The compound form is guaranteed to not become obsolete, and its
161 meaning is clearer anyway. See "Blocks" in perlunicode for
162 more information about this.
163
164 User-defined properties must begin with "In" or "Is". These
165 override any Unicode property of the same name.
166
167 The table below has two columns. The left column contains the "\p{}"
168 constructs to look up, possibly preceded by the flags mentioned above;
169 and the right column contains information about them, like a
170 description, or synonyms. The table shows both the single and compound
171 forms for each property that has them. If the left column is a short
172 name for a property, the right column will give its longer, more
173 descriptive name; and if the left column is the longest name, the right
174 column will show any equivalent shortest name, in both single and
175 compound forms if applicable.
176
177 If braces are not needed to specify a property (e.g., "\pL"), the left
178 column contains both forms, with and without braces.
179
180 The right column will also caution you if a property means something
181 different than what might normally be expected.
182
183 All single forms are Perl extensions; a few compound forms are as well,
184 and are noted as such.
185
186 Numbers in (parentheses) indicate the total number of Unicode code
187 points matched by the property. For the entries that give the longest,
188 most descriptive version of the property, the count is followed by a
189 list of some of the code points matched by it. The list includes all
190 the matched characters in the 0-255 range, enclosed in the familiar
191 [brackets] the same as a regular expression bracketed character class.
192 Following that, the next few higher matching ranges are also given. To
193 avoid visual ambiguity, the SPACE character is represented as "\x20".
194
195 For emphasis, those properties that match no code points at all are
196 listed as well in a separate section following the table.
197
198 Most properties match the same code points regardless of whether "/i"
199 case-insensitive matching is specified or not. But a few properties
200 are affected. These are shown with the notation "(/i= other_property)"
201 in the second column. Under case-insensitive matching they match the
202 same code pode points as the property other_property.
203
204 There is no description given for most non-Perl defined properties (See
205 <http://www.unicode.org/reports/tr44/> for that).
206
207 For compactness, '*' is used as a wildcard instead of showing all
208 possible combinations. For example, entries like:
209
210 \p{Gc: *} \p{General_Category: *}
211
212 mean that 'Gc' is a synonym for 'General_Category', and anything that
213 is valid for the latter is also valid for the former. Similarly,
214
215 \p{Is_*} \p{*}
216
217 means that if and only if, for example, "\p{Foo}" exists, then
218 "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
219 thing. And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
220 and "\p{IsFoo=Bar}". "*" here is restricted to something not beginning
221 with an underscore.
222
223 Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
224 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table
225 shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
226 for the other possibilities. Note that not all properties which have
227 values 'Yes' and 'No' are binary, and they have all their values
228 spelled out without using this wild card, and a "NOT" clause in their
229 description that highlights their not being binary. These also require
230 the compound form to match them, whereas true binary properties have
231 both single and compound forms available.
232
233 Note that all non-essential underscores are removed in the display of
234 the short names below.
235
236 Legend summary:
237
238 * is a wild-card
239 (\d+) in the info column gives the number of Unicode code points
240 matched by this property.
241 D means this is deprecated.
242 O means this is obsolete.
243 S means this is stabilized.
244 T means tighter (stricter) name matching applies.
245 X means use of this form is discouraged, and may not be stable.
246
247 NAME INFO
248
249 \p{Adlam} \p{Script_Extensions=Adlam} (Short:
250 \p{Adlm}; NOT \p{Block=Adlam}) (89)
251 \p{Adlm} \p{Adlam} (= \p{Script_Extensions=Adlam})
252 (NOT \p{Block=Adlam}) (89)
253 X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64)
254 T \p{Age: 1.1} \p{Age=V1_1} (33_979)
255 \p{Age: V1_1} Code point's usage introduced in version
256 1.1 (33_979: U+0000..01F5, U+01FA..0217,
257 U+0250..02A8, U+02B0..02DE,
258 U+02E0..02E9, U+0300..0345 ...)
259 T \p{Age: 2.0} \p{Age=V2_0} (144_521)
260 \p{Age: V2_0} Code point's usage was introduced in
261 version 2.0; See also Property
262 'Present_In' (144_521: U+0591..05A1,
263 U+05A3..05AF, U+05C4, U+0F00..0F47,
264 U+0F49..0F69, U+0F71..0F8B ...)
265 T \p{Age: 2.1} \p{Age=V2_1} (2)
266 \p{Age: V2_1} Code point's usage was introduced in
267 version 2.1; See also Property
268 'Present_In' (2: U+20AC, U+FFFC)
269 T \p{Age: 3.0} \p{Age=V3_0} (10_307)
270 \p{Age: V3_0} Code point's usage was introduced in
271 version 3.0; See also Property
272 'Present_In' (10_307: U+01F6..01F9,
273 U+0218..021F, U+0222..0233,
274 U+02A9..02AD, U+02DF, U+02EA..02EE ...)
275 T \p{Age: 3.1} \p{Age=V3_1} (44_978)
276 \p{Age: V3_1} Code point's usage was introduced in
277 version 3.1; See also Property
278 'Present_In' (44_978: U+03F4..03F5,
279 U+FDD0..FDEF, U+10300..1031E,
280 U+10320..10323, U+10330..1034A,
281 U+10400..10425 ...)
282 T \p{Age: 3.2} \p{Age=V3_2} (1016)
283 \p{Age: V3_2} Code point's usage was introduced in
284 version 3.2; See also Property
285 'Present_In' (1016: U+0220, U+034F,
286 U+0363..036F, U+03D8..03D9, U+03F6,
287 U+048A..048B ...)
288 T \p{Age: 4.0} \p{Age=V4_0} (1226)
289 \p{Age: V4_0} Code point's usage was introduced in
290 version 4.0; See also Property
291 'Present_In' (1226: U+0221,
292 U+0234..0236, U+02AE..02AF,
293 U+02EF..02FF, U+0350..0357, U+035D..035F
294 ...)
295 T \p{Age: 4.1} \p{Age=V4_1} (1273)
296 \p{Age: V4_1} Code point's usage was introduced in
297 version 4.1; See also Property
298 'Present_In' (1273: U+0237..0241,
299 U+0358..035C, U+03FC..03FF,
300 U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
301 T \p{Age: 5.0} \p{Age=V5_0} (1369)
302 \p{Age: V5_0} Code point's usage was introduced in
303 version 5.0; See also Property
304 'Present_In' (1369: U+0242..024F,
305 U+037B..037D, U+04CF, U+04FA..04FF,
306 U+0510..0513, U+05BA ...)
307 T \p{Age: 5.1} \p{Age=V5_1} (1624)
308 \p{Age: V5_1} Code point's usage was introduced in
309 version 5.1; See also Property
310 'Present_In' (1624: U+0370..0373,
311 U+0376..0377, U+03CF, U+0487,
312 U+0514..0523, U+0606..060A ...)
313 T \p{Age: 5.2} \p{Age=V5_2} (6648)
314 \p{Age: V5_2} Code point's usage was introduced in
315 version 5.2; See also Property
316 'Present_In' (6648: U+0524..0525,
317 U+0800..082D, U+0830..083E, U+0900,
318 U+094E, U+0955 ...)
319 T \p{Age: 6.0} \p{Age=V6_0} (2088)
320 \p{Age: V6_0} Code point's usage was introduced in
321 version 6.0; See also Property
322 'Present_In' (2088: U+0526..0527,
323 U+0620, U+065F, U+0840..085B, U+085E,
324 U+093A..093B ...)
325 T \p{Age: 6.1} \p{Age=V6_1} (732)
326 \p{Age: V6_1} Code point's usage was introduced in
327 version 6.1; See also Property
328 'Present_In' (732: U+058F, U+0604,
329 U+08A0, U+08A2..08AC, U+08E4..08FE,
330 U+0AF0 ...)
331 T \p{Age: 6.2} \p{Age=V6_2} (1)
332 \p{Age: V6_2} Code point's usage was introduced in
333 version 6.2; See also Property
334 'Present_In' (1: U+20BA)
335 T \p{Age: 6.3} \p{Age=V6_3} (5)
336 \p{Age: V6_3} Code point's usage was introduced in
337 version 6.3; See also Property
338 'Present_In' (5: U+061C, U+2066..2069)
339 T \p{Age: 7.0} \p{Age=V7_0} (2834)
340 \p{Age: V7_0} Code point's usage was introduced in
341 version 7.0; See also Property
342 'Present_In' (2834: U+037F,
343 U+0528..052F, U+058D..058E, U+0605,
344 U+08A1, U+08AD..08B2 ...)
345 T \p{Age: 8.0} \p{Age=V8_0} (7716)
346 \p{Age: V8_0} Code point's usage was introduced in
347 version 8.0; See also Property
348 'Present_In' (7716: U+08B3..08B4,
349 U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
350 ...)
351 T \p{Age: 9.0} \p{Age=V9_0} (7500)
352 \p{Age: V9_0} Code point's usage was introduced in
353 version 9.0; See also Property
354 'Present_In' (7500: U+08B6..08BD,
355 U+08D4..08E2, U+0C80, U+0D4F,
356 U+0D54..0D56, U+0D58..0D5E ...)
357 T \p{Age: 10.0} \p{Age=V10_0} (8518)
358 \p{Age: V10_0} Code point's usage was introduced in
359 version 10.0; See also Property
360 'Present_In' (8518: U+0860..086A,
361 U+09FC..09FD, U+0AFA..0AFF, U+0D00,
362 U+0D3B..0D3C, U+1CF7 ...)
363 T \p{Age: 11.0} \p{Age=V11_0} (684)
364 \p{Age: V11_0} Code point's usage was introduced in
365 version 11.0; See also Property
366 'Present_In' (684: U+0560, U+0588,
367 U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
368 T \p{Age: 12.0} \p{Age=V12_0} (554)
369 \p{Age: V12_0} Code point's usage was introduced in
370 version 12.0; See also Property
371 'Present_In' (554: U+0C77, U+0E86,
372 U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
373 T \p{Age: 12.1} \p{Age=V12_1} (1)
374 \p{Age: V12_1} Code point's usage was introduced in
375 version 12.1; See also Property
376 'Present_In' (1: U+32FF)
377 T \p{Age: 13.0} \p{Age=V13_0} (5930)
378 \p{Age: V13_0} Code point's usage was introduced in
379 version 13.0; See also Property
380 'Present_In' (5930: U+08BE..08C7,
381 U+0B55, U+0D04, U+0D81, U+1ABF..1AC0,
382 U+2B97 ...)
383 \p{Age: NA} \p{Age=Unassigned} (830_606 plus all
384 above-Unicode code points)
385 \p{Age: Unassigned} Code point's usage has not been assigned
386 in any Unicode release thus far.
387 (Short: \p{Age=NA}) (830_606 plus all above-Unicode code points:
388 U+0378..0379, U+0380..0383, U+038B,
389 U+038D, U+03A2, U+0530 ...)
390 \p{Aghb} \p{Caucasian_Albanian} (=
391 \p{Script_Extensions=
392 Caucasian_Albanian}) (NOT \p{Block=
393 Caucasian_Albanian}) (53)
394 \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
395 (22)
396 \p{AHex: *} \p{ASCII_Hex_Digit: *}
397 \p{Ahom} \p{Script_Extensions=Ahom} (NOT \p{Block=
398 Ahom}) (58)
399 X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block=
400 Alchemical_Symbols}) (128)
401 X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
402 \p{InAlchemical}) (128)
403 \p{All} All code points, including those above
404 Unicode. Same as qr/./s (1_114_112 plus
405 all above-Unicode code points:
406 U+0000..infinity)
407 \p{Alnum} \p{XPosixAlnum} (133_525)
408 \p{Alpha} \p{XPosixAlpha} (= \p{Alphabetic=Y})
409 (132_875)
410 \p{Alpha: *} \p{Alphabetic: *}
411 \p{Alphabetic} \p{XPosixAlpha} (= \p{Alphabetic=Y})
412 (132_875)
413 \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (981_237
414 plus all above-Unicode code points:
415 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
416 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
417 \xb6-\xb9\xbb-\xbf\xd7\xf7],
418 U+02C2..02C5, U+02D2..02DF,
419 U+02E5..02EB, U+02ED, U+02EF..0344 ...)
420 \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (132_875:
421 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
422 \xf8-\xff], U+0100..02C1, U+02C6..02D1,
423 U+02E0..02E4, U+02EC, U+02EE ...)
424 X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (=
425 \p{Block=Alphabetic_Presentation_Forms})
426 (80)
427 X \p{Alphabetic_Presentation_Forms} \p{Block=
428 Alphabetic_Presentation_Forms} (Short:
429 \p{InAlphabeticPF}) (80)
430 \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
431 Anatolian_Hieroglyphs} (Short: \p{Hluw};
432 NOT \p{Block=Anatolian_Hieroglyphs})
433 (583)
434 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
435 \p{Block=
436 Ancient_Greek_Musical_Notation}) (80)
437 X \p{Ancient_Greek_Musical_Notation} \p{Block=
438 Ancient_Greek_Musical_Notation} (Short:
439 \p{InAncientGreekMusic}) (80)
440 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
441 X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64)
442 \p{Any} All Unicode code points (1_114_112:
443 U+0000..10FFFF)
444 \p{Arab} \p{Arabic} (= \p{Script_Extensions=
445 Arabic}) (NOT \p{Block=Arabic}) (1335)
446 \p{Arabic} \p{Script_Extensions=Arabic} (Short:
447 \p{Arab}; NOT \p{Block=Arabic}) (1335)
448 X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block=
449 Arabic_Extended_A}) (96)
450 X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short:
451 \p{InArabicExtA}) (96)
452 X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols}
453 (= \p{Block=
454 Arabic_Mathematical_Alphabetic_Symbols})
455 (256)
456 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
457 Arabic_Mathematical_Alphabetic_Symbols}
458 (Short: \p{InArabicMath}) (256)
459 X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (=
460 \p{Block=Arabic_Presentation_Forms_A})
461 (688)
462 X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (=
463 \p{Block=Arabic_Presentation_Forms_B})
464 (144)
465 X \p{Arabic_Presentation_Forms_A} \p{Block=
466 Arabic_Presentation_Forms_A} (Short:
467 \p{InArabicPFA}) (688)
468 X \p{Arabic_Presentation_Forms_B} \p{Block=
469 Arabic_Presentation_Forms_B} (Short:
470 \p{InArabicPFB}) (144)
471 X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block=
472 Arabic_Supplement}) (48)
473 X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short:
474 \p{InArabicSup}) (48)
475 \p{Armenian} \p{Script_Extensions=Armenian} (Short:
476 \p{Armn}; NOT \p{Block=Armenian}) (96)
477 \p{Armi} \p{Imperial_Aramaic} (=
478 \p{Script_Extensions=Imperial_Aramaic})
479 (NOT \p{Block=Imperial_Aramaic}) (31)
480 \p{Armn} \p{Armenian} (= \p{Script_Extensions=
481 Armenian}) (NOT \p{Block=Armenian}) (96)
482 X \p{Arrows} \p{Block=Arrows} (112)
483 \p{ASCII} \p{Block=Basic_Latin} (128)
484 \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
485 (22)
486 \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
487 plus all above-Unicode code points:
488 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
489 \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
490 U+0100..infinity)
491 \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
492 Fa-f])
493 \p{Assigned} All assigned code points (283_440:
494 U+0000..0377, U+037A..037F,
495 U+0384..038A, U+038C, U+038E..03A1,
496 U+03A3..052F ...)
497 \p{Avestan} \p{Script_Extensions=Avestan} (Short:
498 \p{Avst}; NOT \p{Block=Avestan}) (61)
499 \p{Avst} \p{Avestan} (= \p{Script_Extensions=
500 Avestan}) (NOT \p{Block=Avestan}) (61)
501 \p{Bali} \p{Balinese} (= \p{Script_Extensions=
502 Balinese}) (NOT \p{Block=Balinese}) (121)
503 \p{Balinese} \p{Script_Extensions=Balinese} (Short:
504 \p{Bali}; NOT \p{Block=Balinese}) (121)
505 \p{Bamu} \p{Bamum} (= \p{Script_Extensions=Bamum})
506 (NOT \p{Block=Bamum}) (657)
507 \p{Bamum} \p{Script_Extensions=Bamum} (Short:
508 \p{Bamu}; NOT \p{Block=Bamum}) (657)
509 X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block=
510 Bamum_Supplement}) (576)
511 X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short:
512 \p{InBamumSup}) (576)
513 X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128)
514 \p{Bass} \p{Bassa_Vah} (= \p{Script_Extensions=
515 Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
516 (36)
517 \p{Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
518 \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
519 \p{Batak} \p{Script_Extensions=Batak} (Short:
520 \p{Batk}; NOT \p{Block=Batak}) (56)
521 \p{Batk} \p{Batak} (= \p{Script_Extensions=Batak})
522 (NOT \p{Block=Batak}) (56)
523 \p{Bc: *} \p{Bidi_Class: *}
524 \p{Beng} \p{Bengali} (= \p{Script_Extensions=
525 Bengali}) (NOT \p{Block=Bengali}) (113)
526 \p{Bengali} \p{Script_Extensions=Bengali} (Short:
527 \p{Beng}; NOT \p{Block=Bengali}) (113)
528 \p{Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
529 \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
530 \p{Bhks} \p{Bhaiksuki} (= \p{Script_Extensions=
531 Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
532 (97)
533 \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y})
534 (12)
535 \p{Bidi_C: *} \p{Bidi_Control: *}
536 \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1698)
537 \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (61)
538 \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1698: U+0608,
539 U+060B, U+060D, U+061B..064A,
540 U+066D..066F, U+0671..06D5 ...)
541 \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (61:
542 U+0600..0605, U+0660..0669,
543 U+066B..066C, U+06DD, U+08E2,
544 U+10D30..10D39 ...)
545 \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7)
546 \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4016)
547 \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016: [^\t\n
548 \cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-\xff],
549 U+180E, U+200B..200D, U+2060..2065,
550 U+206A..206F, U+FDD0..FDEF ...)
551 \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15: [,.\/:
552 \xa0], U+060C, U+202F, U+2044, U+FE50,
553 U+FE52 ...)
554 \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15)
555 \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (168)
556 \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12)
557 \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (92)
558 \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (168: [0-9\xb2-
559 \xb3\xb9], U+06F0..06F9, U+2070,
560 U+2074..2079, U+2080..2089, U+2488..249B
561 ...)
562 \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
563 U+207A..207B, U+208A..208B, U+2212,
564 U+FB29, U+FE62..FE63 ...)
565 \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92: [#\$
566 \%\xa2-\xa5\xb0-\xb1], U+058F,
567 U+0609..060A, U+066A, U+09F2..09F3,
568 U+09FB ...)
569 \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
570 U+2068)
571 \p{Bidi_Class: FSI} \p{Bidi_Class=First_Strong_Isolate} (1)
572 \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_096_473
573 plus all above-Unicode code points)
574 \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_473 plus
575 all above-Unicode code points: [A-Za-z
576 \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
577 \xff], U+0100..02B8, U+02BB..02C1,
578 U+02D0..02D1, U+02E0..02E4, U+02EE ...)
579 \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
580 U+202A)
581 \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
582 U+2066)
583 \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
584 U+202D)
585 \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1)
586 \p{Bidi_Class: LRI} \p{Bidi_Class=Left_To_Right_Isolate} (1)
587 \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1)
588 \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1847:
589 U+0300..036F, U+0483..0489,
590 U+0591..05BD, U+05BF, U+05C1..05C2,
591 U+05C4..05C5 ...)
592 \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1847)
593 \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (5931)
594 \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5931: [!\"&\'
595 \(\)*;<=>?\@\[\\\]\^_`\{\|\}~\xa1\xa6-
596 \xa9\xab-\xac\xae-\xaf\xb4\xb6-\xb8\xbb-
597 \xbf\xd7\xf7], U+02B9..02BA,
598 U+02C2..02CF, U+02D2..02DF,
599 U+02E5..02ED, U+02EF..02FF ...)
600 \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7: [\n\r
601 \x1c-\x1e\x85], U+2029)
602 \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1)
603 \p{Bidi_Class: PDI} \p{Bidi_Class=Pop_Directional_Isolate} (1)
604 \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
605 U+202C)
606 \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
607 U+2069)
608 \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (3763)
609 \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3763: U+0590,
610 U+05BE, U+05C0, U+05C3, U+05C6,
611 U+05C8..05FF ...)
612 \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
613 U+202B)
614 \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
615 U+2067)
616 \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
617 U+202E)
618 \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1)
619 \p{Bidi_Class: RLI} \p{Bidi_Class=Right_To_Left_Isolate} (1)
620 \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1)
621 \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3)
622 \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3: [\t\cK
623 \x1f])
624 \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
625 U+1680, U+2000..200A, U+2028, U+205F,
626 U+3000)
627 \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (17)
628 \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
629 \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
630 plus all above-Unicode code points:
631 U+0000..061B, U+061D..200D,
632 U+2010..2029, U+202F..2065,
633 U+206A..infinity)
634 \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (12:
635 U+061C, U+200E..200F, U+202A..202E,
636 U+2066..2069)
637 \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
638 (545)
639 \p{Bidi_M: *} \p{Bidi_Mirrored: *}
640 \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
641 (545)
642 \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
643 plus all above-Unicode code points:
644 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
645 Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
646 \xff], U+0100..0F39, U+0F3E..169A,
647 U+169D..2038, U+203B..2044, U+2047..207C
648 ...)
649 \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (545:
650 [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
651 U+169B..169C, U+2039..203A,
652 U+2045..2046, U+207D..207E ...)
653 \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
654 (60)
655 \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60: [\)\]
656 \}], U+0F3B, U+0F3D, U+169C, U+2046,
657 U+207E ...)
658 \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
659 (1_113_992 plus all above-Unicode code
660 points)
661 \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
662 plus all above-Unicode code points:
663 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=>?
664 \@A-Z\\\^_`a-z\|~\x7f-\xff],
665 U+0100..0F39, U+0F3E..169A,
666 U+169D..2044, U+2047..207C, U+207F..208C
667 ...)
668 \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
669 (60)
670 \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60:
671 [\(\[\{], U+0F3A, U+0F3C, U+169B,
672 U+2045, U+207D ...)
673 \p{Blank} \p{XPosixBlank} (18)
674 \p{Blk: *} \p{Block: *}
675 \p{Block: Adlam} (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
676 U+1E900..1E95F)
677 \p{Block: Aegean_Numbers} (64: U+10100..1013F)
678 \p{Block: Ahom} (NOT \p{Ahom} NOR \p{Is_Ahom}) (64:
679 U+11700..1173F)
680 \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128)
681 \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
682 U+1F700..1F77F)
683 \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
684 (80)
685 \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
686 AlphabeticPF}) (80: U+FB00..FB4F)
687 \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
688 NOR \p{Is_Anatolian_Hieroglyphs}) (640:
689 U+14400..1467F)
690 \p{Block: Ancient_Greek_Music} \p{Block=
691 Ancient_Greek_Musical_Notation} (80)
692 \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
693 AncientGreekMusic}) (80: U+1D200..1D24F)
694 \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
695 \p{Block: Ancient_Symbols} (64: U+10190..101CF)
696 \p{Block: Arabic} (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
697 U+0600..06FF)
698 \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
699 \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
700 U+08A0..08FF)
701 \p{Block: Arabic_Math} \p{Block=
702 Arabic_Mathematical_Alphabetic_Symbols}
703 (256)
704 \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
705 ArabicMath}) (256: U+1EE00..1EEFF)
706 \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688)
707 \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144)
708 \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
709 (688: U+FB50..FDFF)
710 \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
711 (144: U+FE70..FEFF)
712 \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48)
713 \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
714 U+0750..077F)
715 \p{Block: Armenian} (NOT \p{Armenian} NOR \p{Is_Armenian})
716 (96: U+0530..058F)
717 \p{Block: Arrows} (112: U+2190..21FF)
718 \p{Block: ASCII} \p{Block=Basic_Latin} (128)
719 \p{Block: Avestan} (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
720 U+10B00..10B3F)
721 \p{Block: Balinese} (NOT \p{Balinese} NOR \p{Is_Balinese})
722 (128: U+1B00..1B7F)
723 \p{Block: Bamum} (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
724 U+A6A0..A6FF)
725 \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576)
726 \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
727 U+16800..16A3F)
728 \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
729 \p{Block: Bassa_Vah} (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
730 (48: U+16AD0..16AFF)
731 \p{Block: Batak} (NOT \p{Batak} NOR \p{Is_Batak}) (64:
732 U+1BC0..1BFF)
733 \p{Block: Bengali} (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
734 U+0980..09FF)
735 \p{Block: Bhaiksuki} (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
736 (112: U+11C00..11C6F)
737 \p{Block: Block_Elements} (32: U+2580..259F)
738 \p{Block: Bopomofo} (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
739 (48: U+3100..312F)
740 \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
741 \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
742 U+31A0..31BF)
743 \p{Block: Box_Drawing} (128: U+2500..257F)
744 \p{Block: Brahmi} (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
745 U+11000..1107F)
746 \p{Block: Braille} \p{Block=Braille_Patterns} (256)
747 \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
748 U+2800..28FF)
749 \p{Block: Buginese} (NOT \p{Buginese} NOR \p{Is_Buginese})
750 (32: U+1A00..1A1F)
751 \p{Block: Buhid} (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
752 U+1740..175F)
753 \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
754 (256)
755 \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
756 ByzantineMusic}) (256: U+1D000..1D0FF)
757 \p{Block: Canadian_Syllabics} \p{Block=
758 Unified_Canadian_Aboriginal_Syllabics}
759 (640)
760 \p{Block: Carian} (NOT \p{Carian} NOR \p{Is_Carian}) (64:
761 U+102A0..102DF)
762 \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
763 \p{Is_Caucasian_Albanian}) (64:
764 U+10530..1056F)
765 \p{Block: Chakma} (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
766 U+11100..1114F)
767 \p{Block: Cham} (NOT \p{Cham} NOR \p{Is_Cham}) (96:
768 U+AA00..AA5F)
769 \p{Block: Cherokee} (NOT \p{Cherokee} NOR \p{Is_Cherokee})
770 (96: U+13A0..13FF)
771 \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
772 \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
773 U+AB70..ABBF)
774 \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
775 \p{Block: Chorasmian} (NOT \p{Chorasmian} NOR \p{Is_Chorasmian})
776 (48: U+10FB0..10FDF)
777 \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992)
778 \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256)
779 \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
780 \p{Block: CJK_Compat_Ideographs} \p{Block=
781 CJK_Compatibility_Ideographs} (512)
782 \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
783 CJK_Compatibility_Ideographs_Supplement}
784 (544)
785 \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
786 U+3300..33FF)
787 \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
788 (32: U+FE30..FE4F)
789 \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
790 CJKCompatIdeographs}) (512: U+F900..FAFF)
791 \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
792 CJKCompatIdeographsSup}) (544:
793 U+2F800..2FA1F)
794 \p{Block: CJK_Ext_A} \p{Block=
795 CJK_Unified_Ideographs_Extension_A}
796 (6592)
797 \p{Block: CJK_Ext_B} \p{Block=
798 CJK_Unified_Ideographs_Extension_B}
799 (42_720)
800 \p{Block: CJK_Ext_C} \p{Block=
801 CJK_Unified_Ideographs_Extension_C}
802 (4160)
803 \p{Block: CJK_Ext_D} \p{Block=
804 CJK_Unified_Ideographs_Extension_D} (224)
805 \p{Block: CJK_Ext_E} \p{Block=
806 CJK_Unified_Ideographs_Extension_E}
807 (5776)
808 \p{Block: CJK_Ext_F} \p{Block=
809 CJK_Unified_Ideographs_Extension_F}
810 (7488)
811 \p{Block: CJK_Ext_G} \p{Block=
812 CJK_Unified_Ideographs_Extension_G}
813 (4944)
814 \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
815 \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
816 (128: U+2E80..2EFF)
817 \p{Block: CJK_Strokes} (48: U+31C0..31EF)
818 \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64)
819 \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
820 (64: U+3000..303F)
821 \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
822 U+4E00..9FFF)
823 \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
824 CJKExtA}) (6592: U+3400..4DBF)
825 \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
826 CJKExtB}) (42_720: U+20000..2A6DF)
827 \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
828 CJKExtC}) (4160: U+2A700..2B73F)
829 \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
830 CJKExtD}) (224: U+2B740..2B81F)
831 \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
832 CJKExtE}) (5776: U+2B820..2CEAF)
833 \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
834 CJKExtF}) (7488: U+2CEB0..2EBEF)
835 \p{Block: CJK_Unified_Ideographs_Extension_G} (Short: \p{Blk=
836 CJKExtG}) (4944: U+30000..3134F)
837 \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
838 Diacriticals}) (112: U+0300..036F)
839 \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
840 DiacriticalsExt}) (80: U+1AB0..1AFF)
841 \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
842 DiacriticalsForSymbols}) (48:
843 U+20D0..20FF)
844 \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
845 DiacriticalsSup}) (64: U+1DC0..1DFF)
846 \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
847 U+FE20..FE2F)
848 \p{Block: Combining_Marks_For_Symbols} \p{Block=
849 Combining_Diacritical_Marks_For_Symbols}
850 (48)
851 \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
852 IndicNumberForms}) (16: U+A830..A83F)
853 \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96)
854 \p{Block: Control_Pictures} (64: U+2400..243F)
855 \p{Block: Coptic} (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
856 U+2C80..2CFF)
857 \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
858 \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
859 \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
860 U+1D360..1D37F)
861 \p{Block: Cuneiform} (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
862 (1024: U+12000..123FF)
863 \p{Block: Cuneiform_Numbers} \p{Block=
864 Cuneiform_Numbers_And_Punctuation} (128)
865 \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
866 CuneiformNumbers}) (128: U+12400..1247F)
867 \p{Block: Currency_Symbols} (48: U+20A0..20CF)
868 \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
869 \p{Block: Cyrillic} (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
870 (256: U+0400..04FF)
871 \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
872 \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
873 \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
874 \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
875 U+2DE0..2DFF)
876 \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
877 U+A640..A69F)
878 \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
879 U+1C80..1C8F)
880 \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
881 \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
882 U+0500..052F)
883 \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
884 (48)
885 \p{Block: Deseret} (80: U+10400..1044F)
886 \p{Block: Devanagari} (NOT \p{Devanagari} NOR \p{Is_Devanagari})
887 (128: U+0900..097F)
888 \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
889 \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
890 U+A8E0..A8FF)
891 \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
892 \p{Block: Diacriticals_Ext} \p{Block=
893 Combining_Diacritical_Marks_Extended}
894 (80)
895 \p{Block: Diacriticals_For_Symbols} \p{Block=
896 Combining_Diacritical_Marks_For_Symbols}
897 (48)
898 \p{Block: Diacriticals_Sup} \p{Block=
899 Combining_Diacritical_Marks_Supplement}
900 (64)
901 \p{Block: Dingbats} (192: U+2700..27BF)
902 \p{Block: Dives_Akuru} (NOT \p{Dives_Akuru} NOR
903 \p{Is_Dives_Akuru}) (96: U+11900..1195F)
904 \p{Block: Dogra} (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
905 U+11800..1184F)
906 \p{Block: Domino} \p{Block=Domino_Tiles} (112)
907 \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
908 U+1F030..1F09F)
909 \p{Block: Duployan} (NOT \p{Duployan} NOR \p{Is_Duployan})
910 (160: U+1BC00..1BC9F)
911 \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
912 \p{Block: Egyptian_Hieroglyph_Format_Controls} (16: U+13430..1343F)
913 \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
914 \p{Is_Egyptian_Hieroglyphs}) (1072:
915 U+13000..1342F)
916 \p{Block: Elbasan} (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
917 U+10500..1052F)
918 \p{Block: Elymaic} (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
919 U+10FE0..10FFF)
920 \p{Block: Emoticons} (80: U+1F600..1F64F)
921 \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
922 \p{Block: Enclosed_Alphanum_Sup} \p{Block=
923 Enclosed_Alphanumeric_Supplement} (256)
924 \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
925 EnclosedAlphanumSup}) (256:
926 U+1F100..1F1FF)
927 \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
928 EnclosedAlphanum}) (160: U+2460..24FF)
929 \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
930 (256)
931 \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
932 EnclosedCJK}) (256: U+3200..32FF)
933 \p{Block: Enclosed_Ideographic_Sup} \p{Block=
934 Enclosed_Ideographic_Supplement} (256)
935 \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
936 EnclosedIdeographicSup}) (256:
937 U+1F200..1F2FF)
938 \p{Block: Ethiopic} (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
939 (384: U+1200..137F)
940 \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
941 \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
942 \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
943 U+2D80..2DDF)
944 \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
945 U+AB00..AB2F)
946 \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
947 \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
948 U+1380..139F)
949 \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
950 \p{Punct} NOR \p{Is_Punctuation}) (112:
951 U+2000..206F)
952 \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
953 \p{Block: Geometric_Shapes_Ext} \p{Block=
954 Geometric_Shapes_Extended} (128)
955 \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
956 GeometricShapesExt}) (128:
957 U+1F780..1F7FF)
958 \p{Block: Georgian} (NOT \p{Georgian} NOR \p{Is_Georgian})
959 (96: U+10A0..10FF)
960 \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
961 \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
962 U+1C90..1CBF)
963 \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
964 \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
965 U+2D00..2D2F)
966 \p{Block: Glagolitic} (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
967 (96: U+2C00..2C5F)
968 \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
969 \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
970 (48: U+1E000..1E02F)
971 \p{Block: Gothic} (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
972 U+10330..1034F)
973 \p{Block: Grantha} (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
974 U+11300..1137F)
975 \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek}
976 NOR \p{Is_Greek}) (144)
977 \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
978 NOR \p{Is_Greek}) (144: U+0370..03FF)
979 \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256)
980 \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
981 U+1F00..1FFF)
982 \p{Block: Gujarati} (NOT \p{Gujarati} NOR \p{Is_Gujarati})
983 (128: U+0A80..0AFF)
984 \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
985 \p{Is_Gunjala_Gondi}) (80:
986 U+11D60..11DAF)
987 \p{Block: Gurmukhi} (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
988 (128: U+0A00..0A7F)
989 \p{Block: Half_And_Full_Forms} \p{Block=
990 Halfwidth_And_Fullwidth_Forms} (240)
991 \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16)
992 \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
993 HalfAndFullForms}) (240: U+FF00..FFEF)
994 \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
995 NOR \p{Is_Hangul}) (11_184)
996 \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
997 (96: U+3130..318F)
998 \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
999 \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
1000 U+A960..A97F)
1001 \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
1002 U+D7B0..D7FF)
1003 \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
1004 NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
1005 \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
1006 \p{Is_Hanifi_Rohingya}) (64:
1007 U+10D00..10D3F)
1008 \p{Block: Hanunoo} (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
1009 U+1720..173F)
1010 \p{Block: Hatran} (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
1011 U+108E0..108FF)
1012 \p{Block: Hebrew} (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
1013 U+0590..05FF)
1014 \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
1015 HighPUSurrogates}) (128: U+DB80..DBFF)
1016 \p{Block: High_PU_Surrogates} \p{Block=
1017 High_Private_Use_Surrogates} (128)
1018 \p{Block: High_Surrogates} (896: U+D800..DB7F)
1019 \p{Block: Hiragana} (NOT \p{Hiragana} NOR \p{Is_Hiragana})
1020 (96: U+3040..309F)
1021 \p{Block: IDC} \p{Block=
1022 Ideographic_Description_Characters} (NOT
1023 \p{ID_Continue} NOR \p{Is_IDC}) (16)
1024 \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
1025 NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
1026 U+2FF0..2FFF)
1027 \p{Block: Ideographic_Symbols} \p{Block=
1028 Ideographic_Symbols_And_Punctuation} (32)
1029 \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
1030 IdeographicSymbols}) (32: U+16FE0..16FFF)
1031 \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
1032 \p{Is_Imperial_Aramaic}) (32:
1033 U+10840..1085F)
1034 \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1035 (16)
1036 \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
1037 \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
1038 NOR \p{Is_Inscriptional_Pahlavi}) (32:
1039 U+10B60..10B7F)
1040 \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
1041 NOR \p{Is_Inscriptional_Parthian}) (32:
1042 U+10B40..10B5F)
1043 \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
1044 \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
1045 U+0250..02AF)
1046 \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
1047 \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
1048 \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
1049 \p{Block: Javanese} (NOT \p{Javanese} NOR \p{Is_Javanese})
1050 (96: U+A980..A9DF)
1051 \p{Block: Kaithi} (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
1052 U+11080..110CF)
1053 \p{Block: Kana_Ext_A} \p{Block=Kana_Extended_A} (48)
1054 \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
1055 U+1B100..1B12F)
1056 \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
1057 \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
1058 U+1B000..1B0FF)
1059 \p{Block: Kanbun} (16: U+3190..319F)
1060 \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
1061 \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
1062 U+2F00..2FDF)
1063 \p{Block: Kannada} (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
1064 U+0C80..0CFF)
1065 \p{Block: Katakana} (NOT \p{Katakana} NOR \p{Is_Katakana})
1066 (96: U+30A0..30FF)
1067 \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
1068 \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
1069 KatakanaExt}) (16: U+31F0..31FF)
1070 \p{Block: Kayah_Li} (48: U+A900..A92F)
1071 \p{Block: Kharoshthi} (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
1072 (96: U+10A00..10A5F)
1073 \p{Block: Khitan_Small_Script} (NOT \p{Khitan_Small_Script} NOR
1074 \p{Is_Khitan_Small_Script}) (512:
1075 U+18B00..18CFF)
1076 \p{Block: Khmer} (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
1077 U+1780..17FF)
1078 \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
1079 \p{Block: Khojki} (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
1080 U+11200..1124F)
1081 \p{Block: Khudawadi} (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
1082 (80: U+112B0..112FF)
1083 \p{Block: Lao} (NOT \p{Lao} NOR \p{Is_Lao}) (128:
1084 U+0E80..0EFF)
1085 \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
1086 \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
1087 \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
1088 \xff])
1089 \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
1090 \p{Block: Latin_Ext_Additional} \p{Block=
1091 Latin_Extended_Additional} (256)
1092 \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
1093 \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
1094 \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
1095 \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64)
1096 \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
1097 U+0100..017F)
1098 \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
1099 LatinExtAdditional}) (256: U+1E00..1EFF)
1100 \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
1101 U+0180..024F)
1102 \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
1103 U+2C60..2C7F)
1104 \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
1105 U+A720..A7FF)
1106 \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
1107 U+AB30..AB6F)
1108 \p{Block: Lepcha} (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
1109 U+1C00..1C4F)
1110 \p{Block: Letterlike_Symbols} (80: U+2100..214F)
1111 \p{Block: Limbu} (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
1112 U+1900..194F)
1113 \p{Block: Linear_A} (NOT \p{Linear_A} NOR \p{Is_Linear_A})
1114 (384: U+10600..1077F)
1115 \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
1116 \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
1117 \p{Block: Lisu} (NOT \p{Lisu} NOR \p{Is_Lisu}) (48:
1118 U+A4D0..A4FF)
1119 \p{Block: Lisu_Sup} \p{Block=Lisu_Supplement} (16)
1120 \p{Block: Lisu_Supplement} (Short: \p{Blk=LisuSup}) (16:
1121 U+11FB0..11FBF)
1122 \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
1123 \p{Block: Lycian} (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
1124 U+10280..1029F)
1125 \p{Block: Lydian} (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
1126 U+10920..1093F)
1127 \p{Block: Mahajani} (NOT \p{Mahajani} NOR \p{Is_Mahajani})
1128 (48: U+11150..1117F)
1129 \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
1130 \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
1131 U+1F000..1F02F)
1132 \p{Block: Makasar} (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
1133 U+11EE0..11EFF)
1134 \p{Block: Malayalam} (NOT \p{Malayalam} NOR \p{Is_Malayalam})
1135 (128: U+0D00..0D7F)
1136 \p{Block: Mandaic} (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
1137 U+0840..085F)
1138 \p{Block: Manichaean} (NOT \p{Manichaean} NOR \p{Is_Manichaean})
1139 (64: U+10AC0..10AFF)
1140 \p{Block: Marchen} (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
1141 U+11C70..11CBF)
1142 \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
1143 \p{Is_Masaram_Gondi}) (96:
1144 U+11D00..11D5F)
1145 \p{Block: Math_Alphanum} \p{Block=
1146 Mathematical_Alphanumeric_Symbols} (1024)
1147 \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
1148 \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
1149 MathAlphanum}) (1024: U+1D400..1D7FF)
1150 \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
1151 (256: U+2200..22FF)
1152 \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF)
1153 \p{Block: Medefaidrin} (NOT \p{Medefaidrin} NOR
1154 \p{Is_Medefaidrin}) (96: U+16E40..16E9F)
1155 \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
1156 \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
1157 \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
1158 \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
1159 (32: U+AAE0..AAFF)
1160 \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
1161 \p{Is_Mende_Kikakui}) (224:
1162 U+1E800..1E8DF)
1163 \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
1164 \p{Is_Meroitic_Cursive}) (96:
1165 U+109A0..109FF)
1166 \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
1167 \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160:
1168 U+16F00..16F9F)
1169 \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
1170 (256)
1171 \p{Block: Misc_Math_Symbols_A} \p{Block=
1172 Miscellaneous_Mathematical_Symbols_A}
1173 (48)
1174 \p{Block: Misc_Math_Symbols_B} \p{Block=
1175 Miscellaneous_Mathematical_Symbols_B}
1176 (128)
1177 \p{Block: Misc_Pictographs} \p{Block=
1178 Miscellaneous_Symbols_And_Pictographs}
1179 (768)
1180 \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
1181 \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
1182 \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
1183 MiscMathSymbolsA}) (48: U+27C0..27EF)
1184 \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
1185 MiscMathSymbolsB}) (128: U+2980..29FF)
1186 \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
1187 (256: U+2600..26FF)
1188 \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
1189 MiscArrows}) (256: U+2B00..2BFF)
1190 \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
1191 MiscPictographs}) (768: U+1F300..1F5FF)
1192 \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
1193 (256: U+2300..23FF)
1194 \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96:
1195 U+11600..1165F)
1196 \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
1197 \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
1198 \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian})
1199 (176: U+1800..18AF)
1200 \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
1201 \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
1202 U+11660..1167F)
1203 \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48:
1204 U+16A40..16A6F)
1205 \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48:
1206 U+11280..112AF)
1207 \p{Block: Music} \p{Block=Musical_Symbols} (256)
1208 \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
1209 U+1D100..1D1FF)
1210 \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
1211 U+1000..109F)
1212 \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
1213 \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
1214 \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
1215 U+AA60..AA7F)
1216 \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
1217 U+A9E0..A9FF)
1218 \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean})
1219 (48: U+10880..108AF)
1220 \p{Block: Nandinagari} (NOT \p{Nandinagari} NOR
1221 \p{Is_Nandinagari}) (96: U+119A0..119FF)
1222 \p{Block: NB} \p{Block=No_Block} (826_640 plus all
1223 above-Unicode code points)
1224 \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR
1225 \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
1226 \p{Block: Newa} (NOT \p{Newa} NOR \p{Is_Newa}) (128:
1227 U+11400..1147F)
1228 \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64:
1229 U+07C0..07FF)
1230 \p{Block: No_Block} (Short: \p{Blk=NB}) (826_640 plus all
1231 above-Unicode code points: U+0870..089F,
1232 U+2FE0..2FEF, U+10200..1027F,
1233 U+103E0..103FF, U+10570..105FF,
1234 U+10780..107FF ...)
1235 \p{Block: Number_Forms} (64: U+2150..218F)
1236 \p{Block: Nushu} (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
1237 U+1B170..1B2FF)
1238 \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong}
1239 NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80:
1240 U+1E100..1E14F)
1241 \p{Block: OCR} \p{Block=Optical_Character_Recognition}
1242 (32)
1243 \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
1244 U+1680..169F)
1245 \p{Block: Ol_Chiki} (48: U+1C50..1C7F)
1246 \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
1247 \p{Is_Old_Hungarian}) (128:
1248 U+10C80..10CFF)
1249 \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
1250 (48: U+10300..1032F)
1251 \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
1252 \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
1253 (48: U+10350..1037F)
1254 \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR
1255 \p{Is_Old_Persian}) (64: U+103A0..103DF)
1256 \p{Block: Old_Sogdian} (NOT \p{Old_Sogdian} NOR
1257 \p{Is_Old_Sogdian}) (48: U+10F00..10F2F)
1258 \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
1259 \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
1260 (80: U+10C00..10C4F)
1261 \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
1262 U+2440..245F)
1263 \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
1264 U+0B00..0B7F)
1265 \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
1266 \p{Block: Osage} (NOT \p{Osage} NOR \p{Is_Osage}) (80:
1267 U+104B0..104FF)
1268 \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
1269 U+10480..104AF)
1270 \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F)
1271 \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
1272 \p{Is_Pahawh_Hmong}) (144:
1273 U+16B00..16B8F)
1274 \p{Block: Palmyrene} (32: U+10860..1087F)
1275 \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} NOR
1276 \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
1277 \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
1278 (64: U+A840..A87F)
1279 \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
1280 \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
1281 U+101D0..101FF)
1282 \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician})
1283 (32: U+10900..1091F)
1284 \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
1285 \p{Block: Phonetic_Ext_Sup} \p{Block=
1286 Phonetic_Extensions_Supplement} (64)
1287 \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
1288 U+1D00..1D7F)
1289 \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
1290 PhoneticExtSup}) (64: U+1D80..1DBF)
1291 \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
1292 \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
1293 \p{Private_Use} NOR \p{Is_Private_Use})
1294 (6400)
1295 \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
1296 \p{Private_Use} NOR \p{Is_Private_Use})
1297 (6400: U+E000..F8FF)
1298 \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
1299 \p{Is_Psalter_Pahlavi}) (48:
1300 U+10B80..10BAF)
1301 \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
1302 \p{Private_Use} NOR \p{Is_Private_Use})
1303 (6400)
1304 \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
1305 \p{Punct} NOR \p{Is_Punctuation}) (112)
1306 \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
1307 U+A930..A95F)
1308 \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
1309 \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
1310 U+10E60..10E7F)
1311 \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96:
1312 U+16A0..16FF)
1313 \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan})
1314 (64: U+0800..083F)
1315 \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
1316 (96: U+A880..A8DF)
1317 \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
1318 U+11180..111DF)
1319 \p{Block: Shavian} (48: U+10450..1047F)
1320 \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
1321 \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
1322 U+11580..115FF)
1323 \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
1324 U+0D80..0DFF)
1325 \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
1326 \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
1327 U+FE50..FE6F)
1328 \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
1329 \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64)
1330 \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64:
1331 U+1B130..1B16F)
1332 \p{Block: Sogdian} (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64:
1333 U+10F30..10F6F)
1334 \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
1335 \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
1336 \p{Block: Soyombo} (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
1337 U+11A50..11AAF)
1338 \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
1339 ModifierLetters}) (80: U+02B0..02FF)
1340 \p{Block: Specials} (16: U+FFF0..FFFF)
1341 \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese})
1342 (64: U+1B80..1BBF)
1343 \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
1344 \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
1345 U+1CC0..1CCF)
1346 \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
1347 \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
1348 \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
1349 \p{Block: Sup_Math_Operators} \p{Block=
1350 Supplemental_Mathematical_Operators}
1351 (256)
1352 \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
1353 (65_536)
1354 \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
1355 (65_536)
1356 \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
1357 \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
1358 Supplemental_Symbols_And_Pictographs}
1359 (256)
1360 \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
1361 \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
1362 SuperAndSub}) (48: U+2070..209F)
1363 \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
1364 U+27F0..27FF)
1365 \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
1366 U+2900..297F)
1367 \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
1368 U+1F800..1F8FF)
1369 \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
1370 SupMathOperators}) (256: U+2A00..2AFF)
1371 \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
1372 SupPunctuation}) (128: U+2E00..2E7F)
1373 \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
1374 SupSymbolsAndPictographs}) (256:
1375 U+1F900..1F9FF)
1376 \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
1377 SupPUAA}) (65_536: U+F0000..FFFFF)
1378 \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
1379 SupPUAB}) (65_536: U+100000..10FFFF)
1380 \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
1381 \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
1382 \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
1383 \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block=
1384 Symbols_And_Pictographs_Extended_A} (144)
1385 \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk=
1386 SymbolsAndPictographsExtA}) (144:
1387 U+1FA70..1FAFF)
1388 \p{Block: Symbols_For_Legacy_Computing} (256: U+1FB00..1FBFF)
1389 \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
1390 U+0700..074F)
1391 \p{Block: Syriac_Sup} \p{Block=Syriac_Supplement} (16)
1392 \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
1393 U+0860..086F)
1394 \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
1395 U+1700..171F)
1396 \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
1397 (32: U+1760..177F)
1398 \p{Block: Tags} (128: U+E0000..E007F)
1399 \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
1400 U+1950..197F)
1401 \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
1402 (144: U+1A20..1AAF)
1403 \p{Block: Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
1404 (96: U+AA80..AADF)
1405 \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1406 \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
1407 U+1D300..1D35F)
1408 \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80:
1409 U+11680..116CF)
1410 \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
1411 U+0B80..0BFF)
1412 \p{Block: Tamil_Sup} \p{Block=Tamil_Supplement} (64)
1413 \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
1414 U+11FC0..11FFF)
1415 \p{Block: Tangut} (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
1416 U+17000..187FF)
1417 \p{Block: Tangut_Components} (768: U+18800..18AFF)
1418 \p{Block: Tangut_Sup} \p{Block=Tangut_Supplement} (144)
1419 \p{Block: Tangut_Supplement} (Short: \p{Blk=TangutSup}) (144:
1420 U+18D00..18D8F)
1421 \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
1422 U+0C00..0C7F)
1423 \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
1424 U+0780..07BF)
1425 \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128:
1426 U+0E00..0E7F)
1427 \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
1428 U+0F00..0FFF)
1429 \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
1430 (80: U+2D30..2D7F)
1431 \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
1432 U+11480..114DF)
1433 \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1434 (128)
1435 \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1436 TransportAndMap}) (128: U+1F680..1F6FF)
1437 \p{Block: UCAS} \p{Block=
1438 Unified_Canadian_Aboriginal_Syllabics}
1439 (640)
1440 \p{Block: UCAS_Ext} \p{Block=
1441 Unified_Canadian_Aboriginal_Syllabics_-
1442 Extended} (80)
1443 \p{Block: Ugaritic} (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
1444 (32: U+10380..1039F)
1445 \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1446 UCAS}) (640: U+1400..167F)
1447 \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1448 \p{Blk=UCASExt}) (80: U+18B0..18FF)
1449 \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320:
1450 U+A500..A63F)
1451 \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1452 \p{Variation_Selector} NOR \p{Is_VS})
1453 (16: U+FE00..FE0F)
1454 \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1455 (240: U+E0100..E01EF)
1456 \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
1457 \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
1458 U+1CD0..1CFF)
1459 \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
1460 \p{Block: VS} \p{Block=Variation_Selectors} (NOT
1461 \p{Variation_Selector} NOR \p{Is_VS})
1462 (16)
1463 \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
1464 (240)
1465 \p{Block: Wancho} (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
1466 U+1E2C0..1E2FF)
1467 \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR
1468 \p{Is_Warang_Citi}) (96: U+118A0..118FF)
1469 \p{Block: Yezidi} (NOT \p{Yezidi} NOR \p{Is_Yezidi}) (64:
1470 U+10E80..10EBF)
1471 \p{Block: Yi_Radicals} (64: U+A490..A4CF)
1472 \p{Block: Yi_Syllables} (1168: U+A000..A48F)
1473 \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
1474 \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
1475 U+4DC0..4DFF)
1476 \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
1477 \p{Is_Zanabazar_Square}) (80:
1478 U+11A00..11A4F)
1479 X \p{Block_Elements} \p{Block=Block_Elements} (32)
1480 \p{Bopo} \p{Bopomofo} (= \p{Script_Extensions=
1481 Bopomofo}) (NOT \p{Block=Bopomofo}) (117)
1482 \p{Bopomofo} \p{Script_Extensions=Bopomofo} (Short:
1483 \p{Bopo}; NOT \p{Block=Bopomofo}) (117)
1484 X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
1485 Bopomofo_Extended}) (32)
1486 X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
1487 \p{InBopomofoExt}) (32)
1488 X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
1489 \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
1490 \p{Brah} \p{Brahmi} (= \p{Script_Extensions=
1491 Brahmi}) (NOT \p{Block=Brahmi}) (109)
1492 \p{Brahmi} \p{Script_Extensions=Brahmi} (Short:
1493 \p{Brah}; NOT \p{Block=Brahmi}) (109)
1494 \p{Brai} \p{Braille} (= \p{Script_Extensions=
1495 Braille}) (256)
1496 \p{Braille} \p{Script_Extensions=Braille} (Short:
1497 \p{Brai}) (256)
1498 X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
1499 \p{InBraille}) (256)
1500 \p{Bugi} \p{Buginese} (= \p{Script_Extensions=
1501 Buginese}) (NOT \p{Block=Buginese}) (31)
1502 \p{Buginese} \p{Script_Extensions=Buginese} (Short:
1503 \p{Bugi}; NOT \p{Block=Buginese}) (31)
1504 \p{Buhd} \p{Buhid} (= \p{Script_Extensions=Buhid})
1505 (NOT \p{Block=Buhid}) (22)
1506 \p{Buhid} \p{Script_Extensions=Buhid} (Short:
1507 \p{Buhd}; NOT \p{Block=Buhid}) (22)
1508 X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
1509 Byzantine_Musical_Symbols}) (256)
1510 X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1511 (Short: \p{InByzantineMusic}) (256)
1512 \p{C} \pC \p{Other} (= \p{General_Category=Other})
1513 (970_414 plus all above-Unicode code
1514 points)
1515 \p{Cakm} \p{Chakma} (= \p{Script_Extensions=
1516 Chakma}) (NOT \p{Block=Chakma}) (91)
1517 \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1518 (Short: \p{Cans}) (710)
1519 X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
1520 (= \p{Block=
1521 Unified_Canadian_Aboriginal_Syllabics})
1522 (640)
1523 T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1524 Not_Reordered} (1_113_240 plus all
1525 above-Unicode code points)
1526 T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1527 Overlay} (32)
1528 T \p{Canonical_Combining_Class: 6} \p{Canonical_Combining_Class=
1529 Han_Reading} (2)
1530 T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1531 Nukta} (26)
1532 T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1533 Kana_Voicing} (2)
1534 T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1535 Virama} (61)
1536 T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1537 CCC10} (1)
1538 \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
1539 U+05B0)
1540 T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1541 CCC11} (1)
1542 \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
1543 U+05B1)
1544 T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1545 CCC12} (1)
1546 \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
1547 U+05B2)
1548 T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1549 CCC13} (1)
1550 \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
1551 U+05B3)
1552 T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1553 CCC14} (1)
1554 \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
1555 U+05B4)
1556 T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1557 CCC15} (1)
1558 \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
1559 U+05B5)
1560 T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1561 CCC16} (1)
1562 \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
1563 U+05B6)
1564 T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1565 CCC17} (1)
1566 \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
1567 U+05B7)
1568 T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1569 CCC18} (2)
1570 \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
1571 U+05B8, U+05C7)
1572 T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1573 CCC19} (2)
1574 \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
1575 U+05B9..05BA)
1576 T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1577 CCC20} (1)
1578 \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
1579 U+05BB)
1580 T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1581 CCC21} (1)
1582 \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
1583 U+05BC)
1584 T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1585 CCC22} (1)
1586 \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
1587 U+05BD)
1588 T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1589 CCC23} (1)
1590 \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
1591 U+05BF)
1592 T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1593 CCC24} (1)
1594 \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
1595 U+05C1)
1596 T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1597 CCC25} (1)
1598 \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
1599 U+05C2)
1600 T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1601 CCC26} (1)
1602 \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
1603 U+FB1E)
1604 T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1605 CCC27} (2)
1606 \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
1607 U+064B, U+08F0)
1608 T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1609 CCC28} (2)
1610 \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
1611 U+064C, U+08F1)
1612 T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1613 CCC29} (2)
1614 \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
1615 U+064D, U+08F2)
1616 T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1617 CCC30} (2)
1618 \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
1619 U+0618, U+064E)
1620 T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1621 CCC31} (2)
1622 \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
1623 U+0619, U+064F)
1624 T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1625 CCC32} (2)
1626 \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
1627 U+061A, U+0650)
1628 T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1629 CCC33} (1)
1630 \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
1631 U+0651)
1632 T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1633 CCC34} (1)
1634 \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
1635 U+0652)
1636 T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1637 CCC35} (1)
1638 \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
1639 U+0670)
1640 T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1641 CCC36} (1)
1642 \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
1643 U+0711)
1644 T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1645 CCC84} (1)
1646 \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
1647 U+0C55)
1648 T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1649 CCC91} (1)
1650 \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
1651 U+0C56)
1652 T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1653 CCC103} (2)
1654 \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
1655 U+0E38..0E39)
1656 T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1657 CCC107} (4)
1658 \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
1659 U+0E48..0E4B)
1660 T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1661 CCC118} (2)
1662 \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
1663 U+0EB8..0EB9)
1664 T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1665 CCC122} (4)
1666 \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
1667 U+0EC8..0ECB)
1668 T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1669 CCC129} (1)
1670 \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
1671 U+0F71)
1672 T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1673 CCC130} (6)
1674 \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
1675 U+0F72, U+0F7A..0F7D, U+0F80)
1676 T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1677 CCC132} (1)
1678 \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
1679 U+0F74)
1680 T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1681 CCC133} (0)
1682 \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1683 T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1684 Attached_Below_Left} (0)
1685 T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1686 Attached_Below} (5)
1687 T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1688 Attached_Above} (1)
1689 T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1690 Attached_Above_Right} (9)
1691 T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1692 Below_Left} (1)
1693 T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1694 Below} (165)
1695 T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1696 Below_Right} (4)
1697 T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1698 Left} (2)
1699 T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1700 Right} (1)
1701 T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1702 Above_Left} (5)
1703 T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1704 Above} (484)
1705 T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1706 Above_Right} (5)
1707 T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1708 Double_Below} (4)
1709 T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1710 Double_Above} (5)
1711 T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1712 Iota_Subscript} (1)
1713 \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1714 Above} (484)
1715 \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (484:
1716 U+0300..0314, U+033D..0344, U+0346,
1717 U+034A..034C, U+0350..0352, U+0357 ...)
1718 \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
1719 U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
1720 \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (5:
1721 U+0315, U+031A, U+0358, U+1DF6, U+302C)
1722 \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1723 Above_Left} (5)
1724 \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1725 Above_Right} (5)
1726 \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1727 Attached_Above} (1)
1728 \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1729 Attached_Above_Right} (9)
1730 \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1731 Attached_Below} (5)
1732 \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1733 Attached_Below_Left} (0)
1734 \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1735 (1: U+1DCE)
1736 \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1737 \p{Ccc=ATAR}) (9: U+031B, U+0F39,
1738 U+1D165..1D166, U+1D16E..1D172)
1739 \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1740 (5: U+0321..0322, U+0327..0328, U+1DD0)
1741 \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1742 ATBL}) (0)
1743 \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1744 Below} (165)
1745 \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (165:
1746 U+0316..0319, U+031C..0320,
1747 U+0323..0326, U+0329..0333,
1748 U+0339..033C, U+0347..0349 ...)
1749 \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1:
1750 U+302A)
1751 \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
1752 U+059A, U+05AD, U+1939, U+302D)
1753 \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1754 Below_Left} (1)
1755 \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1756 Below_Right} (4)
1757 \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1758 Double_Above} (5)
1759 \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1760 Double_Below} (4)
1761 \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
1762 (5: U+035D..035E, U+0360..0361, U+1DCD)
1763 \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
1764 (4: U+035C, U+035F, U+0362, U+1DFC)
1765 \p{Canonical_Combining_Class: Han_Reading} (Short: \p{Ccc=HANR})
1766 (2: U+16FF0..16FF1)
1767 \p{Canonical_Combining_Class: HANR} \p{Canonical_Combining_Class=
1768 Han_Reading} (2)
1769 \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1770 (1: U+0345)
1771 \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1772 Iota_Subscript} (1)
1773 \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
1774 (2: U+3099..309A)
1775 \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1776 Kana_Voicing} (2)
1777 \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1778 Left} (2)
1779 \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
1780 U+302E..302F)
1781 \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1782 Nukta} (26)
1783 \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1784 (1_113_240 plus all above-Unicode code
1785 points: U+0000..02FF, U+034F,
1786 U+0370..0482, U+0488..0590, U+05BE,
1787 U+05C0 ...)
1788 \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1789 Not_Reordered} (1_113_240 plus all
1790 above-Unicode code points)
1791 \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (26:
1792 U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
1793 U+0CBC ...)
1794 \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1795 Overlay} (32)
1796 \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
1797 U+0334..0338, U+1CD4, U+1CE2..1CE8,
1798 U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
1799 ...)
1800 \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1801 Right} (1)
1802 \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
1803 U+1D16D)
1804 \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (61:
1805 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
1806 U+0BCD ...)
1807 \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1808 Virama} (61)
1809 \p{Cans} \p{Canadian_Aboriginal} (=
1810 \p{Script_Extensions=
1811 Canadian_Aboriginal}) (710)
1812 \p{Cari} \p{Carian} (= \p{Script_Extensions=
1813 Carian}) (NOT \p{Block=Carian}) (49)
1814 \p{Carian} \p{Script_Extensions=Carian} (Short:
1815 \p{Cari}; NOT \p{Block=Carian}) (49)
1816 \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2413)
1817 \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_699 plus
1818 all above-Unicode code points: [\x00-
1819 \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-Z
1820 \[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-\xac\xae
1821 \xb0-\xb3\xb5-\xb6\xb9-\xff],
1822 U+0100..02AF, U+0370..0373,
1823 U+0376..0379, U+037B..0383, U+0386 ...)
1824 \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2413: [\'.:\^`
1825 \xa8\xad\xaf\xb4\xb7-\xb8],
1826 U+02B0..036F, U+0374..0375, U+037A,
1827 U+0384..0385, U+0387 ...)
1828 \p{Cased} \p{Cased=Y} (4286)
1829 \p{Cased: N*} (Single: \P{Cased}) (1_109_826 plus all
1830 above-Unicode code points: [\x00-\x20!
1831 \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
1832 \^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9
1833 \xbb-\xbf\xd7\xf7], U+01BB,
1834 U+01C0..01C3, U+0294, U+02B9..02BF,
1835 U+02C2..02DF ...)
1836 \p{Cased: Y*} (Single: \p{Cased}) (4286: [A-Za-z\xaa
1837 \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
1838 U+0100..01BA, U+01BC..01BF,
1839 U+01C4..0293, U+0295..02B8, U+02C0..02C1
1840 ...)
1841 \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
1842 \p{LC}) (3977)
1843 \p{Category: *} \p{General_Category: *}
1844 \p{Caucasian_Albanian} \p{Script_Extensions=Caucasian_Albanian}
1845 (Short: \p{Aghb}; NOT \p{Block=
1846 Caucasian_Albanian}) (53)
1847 \p{Cc} \p{XPosixCntrl} (= \p{General_Category=
1848 Control}) (65)
1849 \p{Ccc: *} \p{Canonical_Combining_Class: *}
1850 \p{CE} \p{Composition_Exclusion} (=
1851 \p{Composition_Exclusion=Y}) (81)
1852 \p{CE: *} \p{Composition_Exclusion: *}
1853 \p{Cf} \p{Format} (= \p{General_Category=Format})
1854 (161)
1855 \p{Chakma} \p{Script_Extensions=Chakma} (Short:
1856 \p{Cakm}; NOT \p{Block=Chakma}) (91)
1857 \p{Cham} \p{Script_Extensions=Cham} (NOT \p{Block=
1858 Cham}) (83)
1859 \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
1860 \p{CWCF}) (1466)
1861 \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
1862 (1_112_646 plus all above-Unicode code
1863 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1864 \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1865 \xb4\xb6-\xbf\xd7\xe0-\xff], U+0101,
1866 U+0103, U+0105, U+0107, U+0109 ...)
1867 \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
1868 (1466: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
1869 U+0100, U+0102, U+0104, U+0106, U+0108
1870 ...)
1871 \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
1872 \p{CWCM}) (2847)
1873 \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
1874 (1_111_265 plus all above-Unicode code
1875 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1876 \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xb4
1877 \xb6-\xbf\xd7\xf7], U+0138, U+018D,
1878 U+019B, U+01AA..01AB, U+01BA..01BB ...)
1879 \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
1880 (2847: [A-Za-z\xb5\xc0-\xd6\xd8-\xf6
1881 \xf8-\xff], U+0100..0137, U+0139..018C,
1882 U+018E..019A, U+019C..01A9, U+01AC..01B9
1883 ...)
1884 \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
1885 \p{CWL}) (1393)
1886 \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
1887 (1_112_719 plus all above-Unicode code
1888 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1889 \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1890 \xbf\xd7\xdf-\xff], U+0101, U+0103,
1891 U+0105, U+0107, U+0109 ...)
1892 \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1393:
1893 [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
1894 U+0104, U+0106, U+0108 ...)
1895 \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
1896 Y} (Short: \p{CWKCF}) (10_329)
1897 \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
1898 \P{CWKCF}) (1_103_783 plus all above-
1899 Unicode code points: [\x00-\x20!\"#\$
1900 \%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-
1901 z\{\|\}~\x7f-\x9f\xa1-\xa7\xa9\xab-\xac
1902 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xd7\xe0-
1903 \xff], U+0101, U+0103, U+0105, U+0107,
1904 U+0109 ...)
1905 \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
1906 \p{CWKCF}) (10_329: [A-Z\xa0\xa8\xaa
1907 \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
1908 \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
1909 U+0106, U+0108 ...)
1910 \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
1911 \p{CWT}) (1412)
1912 \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
1913 (1_112_700 plus all above-Unicode code
1914 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1915 \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
1916 \xb4\xb6-\xde\xf7], U+0100, U+0102,
1917 U+0104, U+0106, U+0108 ...)
1918 \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1412:
1919 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1920 U+0103, U+0105, U+0107, U+0109 ...)
1921 \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
1922 \p{CWU}) (1485)
1923 \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
1924 (1_112_627 plus all above-Unicode code
1925 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1926 \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
1927 \xb4\xb6-\xde\xf7], U+0100, U+0102,
1928 U+0104, U+0106, U+0108 ...)
1929 \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1485:
1930 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1931 U+0103, U+0105, U+0107, U+0109 ...)
1932 \p{Cher} \p{Cherokee} (= \p{Script_Extensions=
1933 Cherokee}) (NOT \p{Block=Cherokee}) (172)
1934 \p{Cherokee} \p{Script_Extensions=Cherokee} (Short:
1935 \p{Cher}; NOT \p{Block=Cherokee}) (172)
1936 X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block=
1937 Cherokee_Supplement}) (80)
1938 X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
1939 \p{InCherokeeSup}) (80)
1940 X \p{Chess_Symbols} \p{Block=Chess_Symbols} (112)
1941 \p{Chorasmian} \p{Script_Extensions=Chorasmian} (Short:
1942 \p{Chrs}; NOT \p{Block=Chorasmian}) (28)
1943 \p{Chrs} \p{Chorasmian} (= \p{Script_Extensions=
1944 Chorasmian}) (NOT \p{Block=Chorasmian})
1945 (28)
1946 \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
1947 Y}) (2413)
1948 \p{CI: *} \p{Case_Ignorable: *}
1949 X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
1950 CJK_Unified_Ideographs}) (20_992)
1951 X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
1952 CJK_Compatibility}) (256)
1953 X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
1954 CJK_Compatibility_Forms}) (32)
1955 X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
1956 \p{Block=CJK_Compatibility_Ideographs})
1957 (512)
1958 X \p{CJK_Compat_Ideographs_Sup}
1959 \p{CJK_Compatibility_Ideographs_-
1960 Supplement} (= \p{Block=
1961 CJK_Compatibility_Ideographs_-
1962 Supplement}) (544)
1963 X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
1964 \p{InCJKCompat}) (256)
1965 X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
1966 (Short: \p{InCJKCompatForms}) (32)
1967 X \p{CJK_Compatibility_Ideographs} \p{Block=
1968 CJK_Compatibility_Ideographs} (Short:
1969 \p{InCJKCompatIdeographs}) (512)
1970 X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
1971 CJK_Compatibility_Ideographs_Supplement}
1972 (Short: \p{InCJKCompatIdeographsSup})
1973 (544)
1974 X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
1975 \p{Block=
1976 CJK_Unified_Ideographs_Extension_A})
1977 (6592)
1978 X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
1979 \p{Block=
1980 CJK_Unified_Ideographs_Extension_B})
1981 (42_720)
1982 X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
1983 \p{Block=
1984 CJK_Unified_Ideographs_Extension_C})
1985 (4160)
1986 X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
1987 \p{Block=
1988 CJK_Unified_Ideographs_Extension_D})
1989 (224)
1990 X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (=
1991 \p{Block=
1992 CJK_Unified_Ideographs_Extension_E})
1993 (5776)
1994 X \p{CJK_Ext_F} \p{CJK_Unified_Ideographs_Extension_F} (=
1995 \p{Block=
1996 CJK_Unified_Ideographs_Extension_F})
1997 (7488)
1998 X \p{CJK_Ext_G} \p{CJK_Unified_Ideographs_Extension_G} (=
1999 \p{Block=
2000 CJK_Unified_Ideographs_Extension_G})
2001 (4944)
2002 X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
2003 CJK_Radicals_Supplement}) (128)
2004 X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
2005 (Short: \p{InCJKRadicalsSup}) (128)
2006 X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
2007 X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
2008 \p{Block=CJK_Symbols_And_Punctuation})
2009 (64)
2010 X \p{CJK_Symbols_And_Punctuation} \p{Block=
2011 CJK_Symbols_And_Punctuation} (Short:
2012 \p{InCJKSymbols}) (64)
2013 X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
2014 (Short: \p{InCJK}) (20_992)
2015 X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
2016 CJK_Unified_Ideographs_Extension_A}
2017 (Short: \p{InCJKExtA}) (6592)
2018 X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
2019 CJK_Unified_Ideographs_Extension_B}
2020 (Short: \p{InCJKExtB}) (42_720)
2021 X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
2022 CJK_Unified_Ideographs_Extension_C}
2023 (Short: \p{InCJKExtC}) (4160)
2024 X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
2025 CJK_Unified_Ideographs_Extension_D}
2026 (Short: \p{InCJKExtD}) (224)
2027 X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
2028 CJK_Unified_Ideographs_Extension_E}
2029 (Short: \p{InCJKExtE}) (5776)
2030 X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
2031 CJK_Unified_Ideographs_Extension_F}
2032 (Short: \p{InCJKExtF}) (7488)
2033 X \p{CJK_Unified_Ideographs_Extension_G} \p{Block=
2034 CJK_Unified_Ideographs_Extension_G}
2035 (Short: \p{InCJKExtG}) (4944)
2036 \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
2037 (Short: \p{Pe}) (73)
2038 \p{Cn} \p{Unassigned} (= \p{General_Category=
2039 Unassigned}) (830_672 plus all above-
2040 Unicode code points)
2041 \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
2042 Control}) (65)
2043 \p{Co} \p{Private_Use} (= \p{General_Category=
2044 Private_Use}) (NOT \p{Private_Use_Area})
2045 (137_468)
2046 X \p{Combining_Diacritical_Marks} \p{Block=
2047 Combining_Diacritical_Marks} (Short:
2048 \p{InDiacriticals}) (112)
2049 X \p{Combining_Diacritical_Marks_Extended} \p{Block=
2050 Combining_Diacritical_Marks_Extended}
2051 (Short: \p{InDiacriticalsExt}) (80)
2052 X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
2053 Combining_Diacritical_Marks_For_Symbols}
2054 (Short: \p{InDiacriticalsForSymbols})
2055 (48)
2056 X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
2057 Combining_Diacritical_Marks_Supplement}
2058 (Short: \p{InDiacriticalsSup}) (64)
2059 X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
2060 \p{InHalfMarks}) (16)
2061 \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
2062 (2295)
2063 X \p{Combining_Marks_For_Symbols}
2064 \p{Combining_Diacritical_Marks_For_-
2065 Symbols} (= \p{Block=
2066 Combining_Diacritical_Marks_For_-
2067 Symbols}) (48)
2068 \p{Common} \p{Script_Extensions=Common} (Short:
2069 \p{Zyyy}) (7661)
2070 X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
2071 (Short: \p{InIndicNumberForms}) (16)
2072 \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
2073 \p{Full_Composition_Exclusion=Y}) (1120)
2074 \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
2075 X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
2076 Hangul_Compatibility_Jamo}) (96)
2077 \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
2078 \p{CE}) (81)
2079 \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
2080 plus all above-Unicode code points:
2081 U+0000..0957, U+0960..09DB, U+09DE,
2082 U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
2083 ...)
2084 \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
2085 U+0958..095F, U+09DC..09DD, U+09DF,
2086 U+0A33, U+0A36, U+0A59..0A5B ...)
2087 \p{Connector_Punctuation} \p{General_Category=
2088 Connector_Punctuation} (Short: \p{Pc})
2089 (10)
2090 \p{Control} \p{XPosixCntrl} (= \p{General_Category=
2091 Control}) (65)
2092 X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
2093 \p{Copt} \p{Coptic} (= \p{Script_Extensions=
2094 Coptic}) (NOT \p{Block=Coptic}) (165)
2095 \p{Coptic} \p{Script_Extensions=Coptic} (Short:
2096 \p{Copt}; NOT \p{Block=Coptic}) (165)
2097 X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
2098 X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
2099 Counting_Rod_Numerals}) (32)
2100 X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
2101 \p{InCountingRod}) (32)
2102 \p{Cprt} \p{Cypriot} (= \p{Script_Extensions=
2103 Cypriot}) (112)
2104 \p{Cs} \p{Surrogate} (= \p{General_Category=
2105 Surrogate}) (2048)
2106 \p{Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
2107 \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
2108 X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
2109 \p{Block=
2110 Cuneiform_Numbers_And_Punctuation}) (128)
2111 X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
2112 Cuneiform_Numbers_And_Punctuation}
2113 (Short: \p{InCuneiformNumbers}) (128)
2114 \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
2115 (Short: \p{Sc}) (62)
2116 X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
2117 \p{CWCF} \p{Changes_When_Casefolded} (=
2118 \p{Changes_When_Casefolded=Y}) (1466)
2119 \p{CWCF: *} \p{Changes_When_Casefolded: *}
2120 \p{CWCM} \p{Changes_When_Casemapped} (=
2121 \p{Changes_When_Casemapped=Y}) (2847)
2122 \p{CWCM: *} \p{Changes_When_Casemapped: *}
2123 \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
2124 \p{Changes_When_NFKC_Casefolded=Y})
2125 (10_329)
2126 \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
2127 \p{CWL} \p{Changes_When_Lowercased} (=
2128 \p{Changes_When_Lowercased=Y}) (1393)
2129 \p{CWL: *} \p{Changes_When_Lowercased: *}
2130 \p{CWT} \p{Changes_When_Titlecased} (=
2131 \p{Changes_When_Titlecased=Y}) (1412)
2132 \p{CWT: *} \p{Changes_When_Titlecased: *}
2133 \p{CWU} \p{Changes_When_Uppercased} (=
2134 \p{Changes_When_Uppercased=Y}) (1485)
2135 \p{CWU: *} \p{Changes_When_Uppercased: *}
2136 \p{Cypriot} \p{Script_Extensions=Cypriot} (Short:
2137 \p{Cprt}) (112)
2138 X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
2139 \p{Cyrillic} \p{Script_Extensions=Cyrillic} (Short:
2140 \p{Cyrl}; NOT \p{Block=Cyrillic}) (447)
2141 X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
2142 Cyrillic_Extended_A}) (32)
2143 X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
2144 Cyrillic_Extended_B}) (96)
2145 X \p{Cyrillic_Ext_C} \p{Cyrillic_Extended_C} (= \p{Block=
2146 Cyrillic_Extended_C}) (16)
2147 X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
2148 \p{InCyrillicExtA}) (32)
2149 X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
2150 \p{InCyrillicExtB}) (96)
2151 X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
2152 \p{InCyrillicExtC}) (16)
2153 X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
2154 Cyrillic_Supplement}) (48)
2155 X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
2156 \p{InCyrillicSup}) (48)
2157 X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
2158 Cyrillic_Supplement}) (48)
2159 \p{Cyrl} \p{Cyrillic} (= \p{Script_Extensions=
2160 Cyrillic}) (NOT \p{Block=Cyrillic}) (447)
2161 \p{Dash} \p{Dash=Y} (29)
2162 \p{Dash: N*} (Single: \P{Dash}) (1_114_083 plus all
2163 above-Unicode code points: [\x00-\x20!
2164 \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
2165 \[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2166 U+0100..0589, U+058B..05BD,
2167 U+05BF..13FF, U+1401..1805, U+1807..200F
2168 ...)
2169 \p{Dash: Y*} (Single: \p{Dash}) (29: [\-], U+058A,
2170 U+05BE, U+1400, U+1806, U+2010..2015 ...)
2171 \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
2172 (Short: \p{Pd}) (25)
2173 \p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category=
2174 Decimal_Number}) (650)
2175 \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
2176 (13_233)
2177 \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_233:
2178 [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd
2179 \xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-\xfd
2180 \xff], U+0100..010F, U+0112..0125,
2181 U+0128..0130, U+0134..0137, U+0139..013E
2182 ...)
2183 \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
2184 U+2460..2473, U+24B6..24EA,
2185 U+3244..3247, U+3251..327E,
2186 U+3280..32BF, U+32D0..32FE ...)
2187 \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
2188 \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720: [\xa8
2189 \xaf\xb4-\xb5\xb8], U+0132..0133,
2190 U+013F..0140, U+0149, U+017F,
2191 U+01C4..01CC ...)
2192 \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
2193 \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
2194 \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
2195 U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
2196 ...)
2197 \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1194: U+2102,
2198 U+210A..2113, U+2115, U+2119..211D,
2199 U+2124, U+2128 ...)
2200 \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
2201 \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
2202 \xbe], U+2150..215F, U+2189)
2203 \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
2204 \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
2205 U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
2206 ...)
2207 \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
2208 \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
2209 U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
2210 ...)
2211 \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
2212 \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
2213 U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
2214 ...)
2215 \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
2216 \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
2217 U+FF61..FFBE, U+FFC2..FFC7,
2218 U+FFCA..FFCF, U+FFD2..FFD7,
2219 U+FFDA..FFDC, U+FFE8..FFEE)
2220 \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
2221 \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
2222 U+0F0C, U+2007, U+2011, U+202F)
2223 \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
2224 Non_Canonical} (Perl extension) (3675)
2225 \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
2226 decompositions (Short: \p{Dt=NonCanon})
2227 (Perl extension) (3675: [\xa0\xa8\xaa
2228 \xaf\xb2-\xb5\xb8-\xba\xbc-\xbe],
2229 U+0132..0133, U+013F..0140, U+0149,
2230 U+017F, U+01C4..01CC ...)
2231 \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_204 plus
2232 all above-Unicode code points: [\x00-
2233 \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
2234 \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-\xdf
2235 \xe6\xf0\xf7-\xf8\xfe], U+0110..0111,
2236 U+0126..0127, U+0131, U+0138,
2237 U+0141..0142 ...)
2238 \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
2239 U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
2240 \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
2241 \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
2242 \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
2243 U+32CC..32CF, U+32FF..3357,
2244 U+3371..33DF, U+33FF, U+1F130..1F14F ...)
2245 \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38: U+1D62..1D6A,
2246 U+2080..208E, U+2090..209C, U+2C7C)
2247 \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (154)
2248 \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (154: [\xaa\xb2-
2249 \xb3\xb9-\xba], U+02B0..02B8,
2250 U+02E0..02E4, U+10FC, U+1D2C..1D2E,
2251 U+1D30..1D3A ...)
2252 \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
2253 \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
2254 U+30FF, U+FE10..FE19, U+FE30..FE44,
2255 U+FE47..FE48)
2256 \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
2257 U+FF01..FF60, U+FFE0..FFE6)
2258 \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
2259 Y} (Short: \p{DI}) (4173)
2260 \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
2261 (1_109_939 plus all above-Unicode code
2262 points: [\x00-\xac\xae-\xff],
2263 U+0100..034E, U+0350..061B,
2264 U+061D..115E, U+1161..17B3, U+17B6..180A
2265 ...)
2266 \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
2267 (4173: [\xad], U+034F, U+061C,
2268 U+115F..1160, U+17B4..17B5, U+180B..180E
2269 ...)
2270 \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (15)
2271 \p{Dep: *} \p{Deprecated: *}
2272 \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (15)
2273 \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_097
2274 plus all above-Unicode code points:
2275 U+0000..0148, U+014A..0672,
2276 U+0674..0F76, U+0F78, U+0F7A..17A2,
2277 U+17A5..2069 ...)
2278 \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
2279 U+0673, U+0F77, U+0F79, U+17A3..17A4,
2280 U+206A..206F ...)
2281 \p{Deseret} \p{Script_Extensions=Deseret} (Short:
2282 \p{Dsrt}) (80)
2283 \p{Deva} \p{Devanagari} (= \p{Script_Extensions=
2284 Devanagari}) (NOT \p{Block=Devanagari})
2285 (210)
2286 \p{Devanagari} \p{Script_Extensions=Devanagari} (Short:
2287 \p{Deva}; NOT \p{Block=Devanagari}) (210)
2288 X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
2289 Devanagari_Extended}) (32)
2290 X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
2291 \p{InDevanagariExt}) (32)
2292 \p{DI} \p{Default_Ignorable_Code_Point} (=
2293 \p{Default_Ignorable_Code_Point=Y})
2294 (4173)
2295 \p{DI: *} \p{Default_Ignorable_Code_Point: *}
2296 \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (882)
2297 \p{Dia: *} \p{Diacritic: *}
2298 \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (882)
2299 \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_230
2300 plus all above-Unicode code points:
2301 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2302 >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
2303 \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
2304 U+0100..02AF, U+034F, U+0358..035C,
2305 U+0363..0373, U+0376..0379 ...)
2306 \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (882: [\^`
2307 \xa8\xaf\xb4\xb7-\xb8], U+02B0..034E,
2308 U+0350..0357, U+035D..0362,
2309 U+0374..0375, U+037A ...)
2310 X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
2311 \p{Block=Combining_Diacritical_Marks})
2312 (112)
2313 X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended}
2314 (= \p{Block=
2315 Combining_Diacritical_Marks_Extended})
2316 (80)
2317 X \p{Diacriticals_For_Symbols}
2318 \p{Combining_Diacritical_Marks_For_-
2319 Symbols} (= \p{Block=
2320 Combining_Diacritical_Marks_For_-
2321 Symbols}) (48)
2322 X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
2323 (= \p{Block=
2324 Combining_Diacritical_Marks_Supplement})
2325 (64)
2326 \p{Diak} \p{Dives_Akuru} (= \p{Script_Extensions=
2327 Dives_Akuru}) (NOT \p{Block=
2328 Dives_Akuru}) (72)
2329 \p{Digit} \p{XPosixDigit} (= \p{General_Category=
2330 Decimal_Number}) (650)
2331 X \p{Dingbats} \p{Block=Dingbats} (192)
2332 \p{Dives_Akuru} \p{Script_Extensions=Dives_Akuru} (Short:
2333 \p{Diak}; NOT \p{Block=Dives_Akuru}) (72)
2334 \p{Dogr} \p{Dogra} (= \p{Script_Extensions=Dogra})
2335 (NOT \p{Block=Dogra}) (82)
2336 \p{Dogra} \p{Script_Extensions=Dogra} (Short:
2337 \p{Dogr}; NOT \p{Block=Dogra}) (82)
2338 X \p{Domino} \p{Domino_Tiles} (= \p{Block=
2339 Domino_Tiles}) (112)
2340 X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
2341 \p{InDomino}) (112)
2342 \p{Dsrt} \p{Deseret} (= \p{Script_Extensions=
2343 Deseret}) (80)
2344 \p{Dt: *} \p{Decomposition_Type: *}
2345 \p{Dupl} \p{Duployan} (= \p{Script_Extensions=
2346 Duployan}) (NOT \p{Block=Duployan}) (147)
2347 \p{Duployan} \p{Script_Extensions=Duployan} (Short:
2348 \p{Dupl}; NOT \p{Block=Duployan}) (147)
2349 \p{Ea: *} \p{East_Asian_Width: *}
2350 X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
2351 (208)
2352 \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
2353 \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739: [\xa1
2354 \xa4\xa7-\xa8\xaa\xad-\xae\xb0-\xb4\xb6-
2355 \xba\xbc-\xbf\xc6\xd0\xd7-\xd8\xde-\xe1
2356 \xe6\xe8-\xea\xec-\xed\xf0\xf2-\xf3\xf7-
2357 \xfa\xfc\xfe], U+0101, U+0111, U+0113,
2358 U+011B, U+0126..0127 ...)
2359 \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
2360 \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
2361 U+FF01..FF60, U+FFE0..FFE6)
2362 \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
2363 \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
2364 U+FF61..FFBE, U+FFC2..FFC7,
2365 U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
2366 ...)
2367 \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (792_699 plus
2368 all above-Unicode code points)
2369 \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
2370 \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-\x7e
2371 \xa2-\xa3\xa5-\xa6\xac\xaf],
2372 U+27E6..27ED, U+2985..2986)
2373 \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (792_699 plus all
2374 above-Unicode code points: [\x00-\x1f
2375 \x7f-\xa0\xa9\xab\xb5\xbb\xc0-\xc5\xc7-
2376 \xcf\xd1-\xd6\xd9-\xdd\xe2-\xe5\xe7\xeb
2377 \xee-\xef\xf1\xf4-\xf6\xfb\xfd\xff],
2378 U+00FF..0100, U+0102..0110, U+0112,
2379 U+0114..011A, U+011C..0125 ...)
2380 \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (182_336)
2381 \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (182_336:
2382 U+1100..115F, U+231A..231B,
2383 U+2329..232A, U+23E9..23EC, U+23F0,
2384 U+23F3 ...)
2385 \p{EBase} \p{Emoji_Modifier_Base} (=
2386 \p{Emoji_Modifier_Base=Y}) (122)
2387 \p{EBase: *} \p{Emoji_Modifier_Base: *}
2388 \p{EComp} \p{Emoji_Component} (= \p{Emoji_Component=
2389 Y}) (146)
2390 \p{EComp: *} \p{Emoji_Component: *}
2391 \p{Egyp} \p{Egyptian_Hieroglyphs} (=
2392 \p{Script_Extensions=
2393 Egyptian_Hieroglyphs}) (NOT \p{Block=
2394 Egyptian_Hieroglyphs}) (1080)
2395 X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
2396 Egyptian_Hieroglyph_Format_Controls} (16)
2397 \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
2398 Egyptian_Hieroglyphs} (Short: \p{Egyp};
2399 NOT \p{Block=Egyptian_Hieroglyphs})
2400 (1080)
2401 \p{Elba} \p{Elbasan} (= \p{Script_Extensions=
2402 Elbasan}) (NOT \p{Block=Elbasan}) (40)
2403 \p{Elbasan} \p{Script_Extensions=Elbasan} (Short:
2404 \p{Elba}; NOT \p{Block=Elbasan}) (40)
2405 \p{Elym} \p{Elymaic} (= \p{Script_Extensions=
2406 Elymaic}) (NOT \p{Block=Elymaic}) (23)
2407 \p{Elymaic} \p{Script_Extensions=Elymaic} (Short:
2408 \p{Elym}; NOT \p{Block=Elymaic}) (23)
2409 \p{EMod} \p{Emoji_Modifier} (= \p{Emoji_Modifier=
2410 Y}) (5)
2411 \p{EMod: *} \p{Emoji_Modifier: *}
2412 \p{Emoji} \p{Emoji=Y} (1367)
2413 \p{Emoji: N*} (Single: \P{Emoji}) (1_112_745 plus all
2414 above-Unicode code points: [\x00-\x20!
2415 \"\$\%&\'\(\)+,\-.\/:;<=>?\@A-Z\[\\\]
2416 \^_`a-z\{\|\}~\x7f-\xa8\xaa-\xad\xaf-
2417 \xff], U+0100..203B, U+203D..2048,
2418 U+204A..2121, U+2123..2138, U+213A..2193
2419 ...)
2420 \p{Emoji: Y*} (Single: \p{Emoji}) (1367: [#*0-9\xa9
2421 \xae], U+203C, U+2049, U+2122, U+2139,
2422 U+2194..2199 ...)
2423 \p{Emoji_Component} \p{Emoji_Component=Y} (Short: \p{EComp})
2424 (146)
2425 \p{Emoji_Component: N*} (Short: \p{EComp=N}, \P{EComp}) (1_113_966
2426 plus all above-Unicode code points:
2427 [\x00-\x20!\"\$\%&\'\(\)+,\-.\/:;<=>?
2428 \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2429 U+0100..200C, U+200E..20E2,
2430 U+20E4..FE0E, U+FE10..1F1E5,
2431 U+1F200..1F3FA ...)
2432 \p{Emoji_Component: Y*} (Short: \p{EComp=Y}, \p{EComp}) (146:
2433 [#*0-9], U+200D, U+20E3, U+FE0F,
2434 U+1F1E6..1F1FF, U+1F3FB..1F3FF ...)
2435 \p{Emoji_Modifier} \p{Emoji_Modifier=Y} (Short: \p{EMod}) (5)
2436 \p{Emoji_Modifier: N*} (Short: \p{EMod=N}, \P{EMod}) (1_114_107
2437 plus all above-Unicode code points:
2438 U+0000..1F3FA, U+1F400..infinity)
2439 \p{Emoji_Modifier: Y*} (Short: \p{EMod=Y}, \p{EMod}) (5:
2440 U+1F3FB..1F3FF)
2441 \p{Emoji_Modifier_Base} \p{Emoji_Modifier_Base=Y} (Short:
2442 \p{EBase}) (122)
2443 \p{Emoji_Modifier_Base: N*} (Short: \p{EBase=N}, \P{EBase})
2444 (1_113_990 plus all above-Unicode code
2445 points: U+0000..261C, U+261E..26F8,
2446 U+26FA..2709, U+270E..1F384,
2447 U+1F386..1F3C1, U+1F3C5..1F3C6 ...)
2448 \p{Emoji_Modifier_Base: Y*} (Short: \p{EBase=Y}, \p{EBase}) (122:
2449 U+261D, U+26F9, U+270A..270D, U+1F385,
2450 U+1F3C2..1F3C4, U+1F3C7 ...)
2451 \p{Emoji_Presentation} \p{Emoji_Presentation=Y} (Short:
2452 \p{EPres}) (1148)
2453 \p{Emoji_Presentation: N*} (Short: \p{EPres=N}, \P{EPres})
2454 (1_112_964 plus all above-Unicode code
2455 points: U+0000..2319, U+231C..23E8,
2456 U+23ED..23EF, U+23F1..23F2,
2457 U+23F4..25FC, U+25FF..2613 ...)
2458 \p{Emoji_Presentation: Y*} (Short: \p{EPres=Y}, \p{EPres}) (1148:
2459 U+231A..231B, U+23E9..23EC, U+23F0,
2460 U+23F3, U+25FD..25FE, U+2614..2615 ...)
2461 X \p{Emoticons} \p{Block=Emoticons} (80)
2462 X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
2463 Enclosed_Alphanumerics}) (160)
2464 X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
2465 \p{Block=
2466 Enclosed_Alphanumeric_Supplement}) (256)
2467 X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
2468 Enclosed_Alphanumeric_Supplement}
2469 (Short: \p{InEnclosedAlphanumSup}) (256)
2470 X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
2471 (Short: \p{InEnclosedAlphanum}) (160)
2472 X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
2473 \p{Block=
2474 Enclosed_CJK_Letters_And_Months}) (256)
2475 X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
2476 Enclosed_CJK_Letters_And_Months} (Short:
2477 \p{InEnclosedCJK}) (256)
2478 X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
2479 (= \p{Block=
2480 Enclosed_Ideographic_Supplement}) (256)
2481 X \p{Enclosed_Ideographic_Supplement} \p{Block=
2482 Enclosed_Ideographic_Supplement} (Short:
2483 \p{InEnclosedIdeographicSup}) (256)
2484 \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
2485 (Short: \p{Me}) (13)
2486 \p{EPres} \p{Emoji_Presentation} (=
2487 \p{Emoji_Presentation=Y}) (1148)
2488 \p{EPres: *} \p{Emoji_Presentation: *}
2489 \p{Ethi} \p{Ethiopic} (= \p{Script_Extensions=
2490 Ethiopic}) (NOT \p{Block=Ethiopic}) (495)
2491 \p{Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
2492 \p{Ethi}; NOT \p{Block=Ethiopic}) (495)
2493 X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
2494 Ethiopic_Extended}) (96)
2495 X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
2496 Ethiopic_Extended_A}) (48)
2497 X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
2498 \p{InEthiopicExt}) (96)
2499 X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
2500 \p{InEthiopicExtA}) (48)
2501 X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
2502 Ethiopic_Supplement}) (32)
2503 X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
2504 \p{InEthiopicSup}) (32)
2505 \p{Ext} \p{Extender} (= \p{Extender=Y}) (48)
2506 \p{Ext: *} \p{Extender: *}
2507 \p{Extended_Pictographic} \p{Extended_Pictographic=Y} (Short:
2508 \p{ExtPict}) (3537)
2509 \p{Extended_Pictographic: N*} (Short: \p{ExtPict=N}, \P{ExtPict})
2510 (1_110_575 plus all above-Unicode code
2511 points: [\x00-\xa8\xaa-\xad\xaf-\xff],
2512 U+0100..203B, U+203D..2048,
2513 U+204A..2121, U+2123..2138, U+213A..2193
2514 ...)
2515 \p{Extended_Pictographic: Y*} (Short: \p{ExtPict=Y}, \p{ExtPict})
2516 (3537: [\xa9\xae], U+203C, U+2049,
2517 U+2122, U+2139, U+2194..2199 ...)
2518 \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (48)
2519 \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_064
2520 plus all above-Unicode code points:
2521 [\x00-\xb6\xb8-\xff], U+0100..02CF,
2522 U+02D2..063F, U+0641..07F9,
2523 U+07FB..0B54, U+0B56..0E45 ...)
2524 \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (48: [\xb7],
2525 U+02D0..02D1, U+0640, U+07FA, U+0B55,
2526 U+0E46 ...)
2527 \p{ExtPict} \p{Extended_Pictographic} (=
2528 \p{Extended_Pictographic=Y}) (3537)
2529 \p{ExtPict: *} \p{Extended_Pictographic: *}
2530 \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
2531 (Short: \p{Pf}) (10)
2532 \p{Format} \p{General_Category=Format} (Short:
2533 \p{Cf}) (161)
2534 \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
2535 (Short: \p{CompEx}) (1120)
2536 \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
2537 \P{CompEx}) (1_112_992 plus all above-
2538 Unicode code points: U+0000..033F,
2539 U+0342, U+0345..0373, U+0375..037D,
2540 U+037F..0386, U+0388..0957 ...)
2541 \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
2542 \p{CompEx}) (1120: U+0340..0341,
2543 U+0343..0344, U+0374, U+037E, U+0387,
2544 U+0958..095F ...)
2545 \p{Gc: *} \p{General_Category: *}
2546 \p{GCB: *} \p{Grapheme_Cluster_Break: *}
2547 \p{General_Category: C} \p{General_Category=Other} (970_414 plus
2548 all above-Unicode code points)
2549 \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
2550 \p{Gc=LC}, \p{LC}) (3977: [A-Za-z\xb5
2551 \xc0-\xd6\xd8-\xf6\xf8-\xff],
2552 U+0100..01BA, U+01BC..01BF,
2553 U+01C4..0293, U+0295..02AF, U+0370..0373
2554 ...)
2555 \p{General_Category: Cc} \p{General_Category=Control} (65)
2556 \p{General_Category: Cf} \p{General_Category=Format} (161)
2557 \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
2558 (73: [\)\]\}], U+0F3B, U+0F3D, U+169C,
2559 U+2046, U+207E ...)
2560 \p{General_Category: Cn} \p{General_Category=Unassigned} (830_672
2561 plus all above-Unicode code points)
2562 \p{General_Category: Cntrl} \p{General_Category=Control} (65)
2563 \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
2564 \p{General_Category: Combining_Mark} \p{General_Category=Mark}
2565 (2295)
2566 \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
2567 \p{Pc}) (10: [_], U+203F..2040, U+2054,
2568 U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
2569 \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
2570 [\x00-\x1f\x7f-\x9f])
2571 \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
2572 \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
2573 (62: [\$\xa2-\xa5], U+058F, U+060B,
2574 U+07FE..07FF, U+09F2..09F3, U+09FB ...)
2575 \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
2576 (25: [\-], U+058A, U+05BE, U+1400,
2577 U+1806, U+2010..2015 ...)
2578 \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
2579 (650: [0-9], U+0660..0669, U+06F0..06F9,
2580 U+07C0..07C9, U+0966..096F, U+09E6..09EF
2581 ...)
2582 \p{General_Category: Digit} \p{General_Category=Decimal_Number}
2583 (650)
2584 \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
2585 (13: U+0488..0489, U+1ABE, U+20DD..20E0,
2586 U+20E2..20E4, U+A670..A672)
2587 \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
2588 (10: [\xbb], U+2019, U+201D, U+203A,
2589 U+2E03, U+2E05 ...)
2590 \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (161:
2591 [\xad], U+0600..0605, U+061C, U+06DD,
2592 U+070F, U+08E2 ...)
2593 \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
2594 \p{Pi}) (12: [\xab], U+2018,
2595 U+201B..201C, U+201F, U+2039, U+2E02 ...)
2596 \p{General_Category: L} \p{General_Category=Letter} (131_241)
2597 X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3977)
2598 X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
2599 the trailing '_' matters in spite of
2600 loose matching rules. (3977)
2601 \p{General_Category: LC} \p{General_Category=Cased_Letter} (3977)
2602 \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (131_241:
2603 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
2604 \xf8-\xff], U+0100..02C1, U+02C6..02D1,
2605 U+02E0..02E4, U+02EC, U+02EE ...)
2606 \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
2607 (236: U+16EE..16F0, U+2160..2182,
2608 U+2185..2188, U+3007, U+3021..3029,
2609 U+3038..303A ...)
2610 \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
2611 (1: U+2028)
2612 \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
2613 (/i= General_Category=Cased_Letter)
2614 (2155)
2615 \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (260)
2616 \p{General_Category: Lo} \p{General_Category=Other_Letter}
2617 (127_004)
2618 \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
2619 /i= General_Category=Cased_Letter)
2620 (2155: [a-z\xb5\xdf-\xf6\xf8-\xff],
2621 U+0101, U+0103, U+0105, U+0107, U+0109
2622 ...)
2623 \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
2624 (/i= General_Category=Cased_Letter) (31)
2625 \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
2626 (/i= General_Category=Cased_Letter)
2627 (1791)
2628 \p{General_Category: M} \p{General_Category=Mark} (2295)
2629 \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2295:
2630 U+0300..036F, U+0483..0489,
2631 U+0591..05BD, U+05BF, U+05C1..05C2,
2632 U+05C4..05C5 ...)
2633 \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
2634 [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
2635 U+0606..0608, U+2044, U+2052,
2636 U+207A..207C ...)
2637 \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (443)
2638 \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
2639 \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
2640 (1839)
2641 \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
2642 (260: U+02B0..02C1, U+02C6..02D1,
2643 U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
2644 \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
2645 (123: [\^`\xa8\xaf\xb4\xb8],
2646 U+02C2..02C5, U+02D2..02DF,
2647 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2648 \p{General_Category: N} \p{General_Category=Number} (1781)
2649 \p{General_Category: Nd} \p{General_Category=Decimal_Number} (650)
2650 \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
2651 \p{General_Category: No} \p{General_Category=Other_Number} (895)
2652 \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
2653 (1839: U+0300..036F, U+0483..0487,
2654 U+0591..05BD, U+05BF, U+05C1..05C2,
2655 U+05C4..05C5 ...)
2656 \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1781: [0-9
2657 \xb2-\xb3\xb9\xbc-\xbe], U+0660..0669,
2658 U+06F0..06F9, U+07C0..07C9,
2659 U+0966..096F, U+09E6..09EF ...)
2660 \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
2661 (75: [\(\[\{], U+0F3A, U+0F3C, U+169B,
2662 U+201A, U+201E ...)
2663 \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (970_414 plus
2664 all above-Unicode code points: [\x00-
2665 \x1f\x7f-\x9f\xad], U+0378..0379,
2666 U+0380..0383, U+038B, U+038D, U+03A2 ...)
2667 \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
2668 (127_004: [\xaa\xba], U+01BB,
2669 U+01C0..01C3, U+0294, U+05D0..05EA,
2670 U+05EF..05F2 ...)
2671 \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
2672 (895: [\xb2-\xb3\xb9\xbc-\xbe],
2673 U+09F4..09F9, U+0B72..0B77,
2674 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
2675 ...)
2676 \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
2677 (593: [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7
2678 \xb6-\xb7\xbf], U+037E, U+0387,
2679 U+055A..055F, U+0589, U+05C0 ...)
2680 \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
2681 (6431: [\xa6\xa9\xae\xb0], U+0482,
2682 U+058D..058E, U+060E..060F, U+06DE,
2683 U+06E9 ...)
2684 \p{General_Category: P} \p{General_Category=Punctuation} (798)
2685 \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
2686 \p{Zp}) (1: U+2029)
2687 \p{General_Category: Pc} \p{General_Category=
2688 Connector_Punctuation} (10)
2689 \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (25)
2690 \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
2691 (73)
2692 \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
2693 (10)
2694 \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
2695 (12)
2696 \p{General_Category: Po} \p{General_Category=Other_Punctuation}
2697 (593)
2698 \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
2699 (137_468: U+E000..F8FF, U+F0000..FFFFD,
2700 U+100000..10FFFD)
2701 \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
2702 \p{General_Category: Punct} \p{General_Category=Punctuation} (798)
2703 \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (798:
2704 [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_\{\}
2705 \xa1\xa7\xab\xb6-\xb7\xbb\xbf], U+037E,
2706 U+0387, U+055A..055F, U+0589..058A,
2707 U+05BE ...)
2708 \p{General_Category: S} \p{General_Category=Symbol} (7564)
2709 \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (62)
2710 \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
2711 [\x20\xa0], U+1680, U+2000..200A,
2712 U+2028..2029, U+202F, U+205F ...)
2713 \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (123)
2714 \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
2715 \p{General_Category: So} \p{General_Category=Other_Symbol} (6431)
2716 \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
2717 (17: [\x20\xa0], U+1680, U+2000..200A,
2718 U+202F, U+205F, U+3000)
2719 \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
2720 (443: U+0903, U+093B, U+093E..0940,
2721 U+0949..094C, U+094E..094F, U+0982..0983
2722 ...)
2723 \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
2724 U+D800..DFFF)
2725 \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7564:
2726 [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
2727 \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
2728 U+02D2..02DF, U+02E5..02EB, U+02ED,
2729 U+02EF..02FF ...)
2730 \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
2731 /i= General_Category=Cased_Letter) (31:
2732 U+01C5, U+01C8, U+01CB, U+01F2,
2733 U+1F88..1F8F, U+1F98..1F9F ...)
2734 \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
2735 (830_672 plus all above-Unicode code
2736 points: U+0378..0379, U+0380..0383,
2737 U+038B, U+038D, U+03A2, U+0530 ...)
2738 \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
2739 /i= General_Category=Cased_Letter)
2740 (1791: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
2741 U+0102, U+0104, U+0106, U+0108 ...)
2742 \p{General_Category: Z} \p{General_Category=Separator} (19)
2743 \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
2744 \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
2745 (1)
2746 \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
2747 X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
2748 \p{InPunctuation}) (112)
2749 X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
2750 X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
2751 \p{Block=Geometric_Shapes_Extended})
2752 (128)
2753 X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
2754 (Short: \p{InGeometricShapesExt}) (128)
2755 \p{Geor} \p{Georgian} (= \p{Script_Extensions=
2756 Georgian}) (NOT \p{Block=Georgian}) (174)
2757 \p{Georgian} \p{Script_Extensions=Georgian} (Short:
2758 \p{Geor}; NOT \p{Block=Georgian}) (174)
2759 X \p{Georgian_Ext} \p{Georgian_Extended} (= \p{Block=
2760 Georgian_Extended}) (48)
2761 X \p{Georgian_Extended} \p{Block=Georgian_Extended} (Short:
2762 \p{InGeorgianExt}) (48)
2763 X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
2764 Georgian_Supplement}) (48)
2765 X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
2766 \p{InGeorgianSup}) (48)
2767 \p{Glag} \p{Glagolitic} (= \p{Script_Extensions=
2768 Glagolitic}) (NOT \p{Block=Glagolitic})
2769 (136)
2770 \p{Glagolitic} \p{Script_Extensions=Glagolitic} (Short:
2771 \p{Glag}; NOT \p{Block=Glagolitic}) (136)
2772 X \p{Glagolitic_Sup} \p{Glagolitic_Supplement} (= \p{Block=
2773 Glagolitic_Supplement}) (48)
2774 X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
2775 \p{InGlagoliticSup}) (48)
2776 \p{Gong} \p{Gunjala_Gondi} (= \p{Script_Extensions=
2777 Gunjala_Gondi}) (NOT \p{Block=
2778 Gunjala_Gondi}) (65)
2779 \p{Gonm} \p{Masaram_Gondi} (= \p{Script_Extensions=
2780 Masaram_Gondi}) (NOT \p{Block=
2781 Masaram_Gondi}) (77)
2782 \p{Goth} \p{Gothic} (= \p{Script_Extensions=
2783 Gothic}) (NOT \p{Block=Gothic}) (27)
2784 \p{Gothic} \p{Script_Extensions=Gothic} (Short:
2785 \p{Goth}; NOT \p{Block=Gothic}) (27)
2786 \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
2787 (141_814)
2788 \p{Gr_Base: *} \p{Grapheme_Base: *}
2789 \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
2790 Y}) (1979)
2791 \p{Gr_Ext: *} \p{Grapheme_Extend: *}
2792 \p{Gran} \p{Grantha} (= \p{Script_Extensions=
2793 Grantha}) (NOT \p{Block=Grantha}) (116)
2794 \p{Grantha} \p{Script_Extensions=Grantha} (Short:
2795 \p{Gran}; NOT \p{Block=Grantha}) (116)
2796 \p{Graph} \p{XPosixGraph} (281_308)
2797 \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
2798 (141_814)
2799 \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase}) (972_298
2800 plus all above-Unicode code points:
2801 [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
2802 U+0378..0379, U+0380..0383, U+038B,
2803 U+038D ...)
2804 \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase})
2805 (141_814: [\x20-\x7e\xa0-\xac\xae-\xff],
2806 U+0100..02FF, U+0370..0377,
2807 U+037A..037F, U+0384..038A, U+038C ...)
2808 \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
2809 (3886)
2810 \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3886: [^
2811 \n\r\x20-\x7e\xa0-\xac\xae-\xff],
2812 U+061C, U+180E, U+200B, U+200E..200F,
2813 U+2028..202E ...)
2814 \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
2815 \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
2816 \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
2817 \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
2818 \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
2819 (0)
2820 \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
2821 E_Base_GAZ} (0)
2822 \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
2823 E_Modifier} (0)
2824 \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
2825 (1984)
2826 \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1984:
2827 U+0300..036F, U+0483..0489,
2828 U+0591..05BD, U+05BF, U+05C1..05C2,
2829 U+05C4..05C5 ...)
2830 \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
2831 Glue_After_Zwj} (0)
2832 \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
2833 \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
2834 U+1100..115F, U+A960..A97C)
2835 \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
2836 \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
2837 U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
2838 ...)
2839 \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
2840 U+AC01..AC1B, U+AC1D..AC37,
2841 U+AC39..AC53, U+AC55..AC6F,
2842 U+AC71..AC8B, U+AC8D..ACA7 ...)
2843 \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_272
2844 plus all above-Unicode code points:
2845 [\x20-\x7e\xa0-\xac\xae-\xff],
2846 U+0100..02FF, U+0370..0482,
2847 U+048A..0590, U+05BE, U+05C0 ...)
2848 \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
2849 (24)
2850 \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (24:
2851 U+0600..0605, U+06DD, U+070F, U+08E2,
2852 U+0D4E, U+110BD ...)
2853 \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
2854 (26: U+1F1E6..1F1FF)
2855 \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
2856 Regional_Indicator} (26)
2857 \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
2858 SpacingMark} (388)
2859 \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (388:
2860 U+0903, U+093B, U+093E..0940,
2861 U+0949..094C, U+094E..094F, U+0982..0983
2862 ...)
2863 \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
2864 U+11A8..11FF, U+D7CB..D7FB)
2865 \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
2866 U+1160..11A7, U+D7B0..D7C6)
2867 \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
2868 (1_096_272 plus all above-Unicode code
2869 points)
2870 \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
2871 \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
2872 (1979)
2873 \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_133
2874 plus all above-Unicode code points:
2875 U+0000..02FF, U+0370..0482,
2876 U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
2877 \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1979:
2878 U+0300..036F, U+0483..0489,
2879 U+0591..05BD, U+05BF, U+05C1..05C2,
2880 U+05C4..05C5 ...)
2881 \p{Greek} \p{Script_Extensions=Greek} (Short:
2882 \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
2883 X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
2884 \p{InGreek}) (144)
2885 X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
2886 Greek_Extended}) (256)
2887 X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
2888 \p{InGreekExt}) (256)
2889 \p{Grek} \p{Greek} (= \p{Script_Extensions=Greek})
2890 (NOT \p{Greek_And_Coptic}) (522)
2891 \p{Gujarati} \p{Script_Extensions=Gujarati} (Short:
2892 \p{Gujr}; NOT \p{Block=Gujarati}) (105)
2893 \p{Gujr} \p{Gujarati} (= \p{Script_Extensions=
2894 Gujarati}) (NOT \p{Block=Gujarati}) (105)
2895 \p{Gunjala_Gondi} \p{Script_Extensions=Gunjala_Gondi}
2896 (Short: \p{Gong}; NOT \p{Block=
2897 Gunjala_Gondi}) (65)
2898 \p{Gurmukhi} \p{Script_Extensions=Gurmukhi} (Short:
2899 \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
2900 \p{Guru} \p{Gurmukhi} (= \p{Script_Extensions=
2901 Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
2902 X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
2903 \p{Block=Halfwidth_And_Fullwidth_Forms})
2904 (240)
2905 X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
2906 Combining_Half_Marks}) (16)
2907 X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
2908 Halfwidth_And_Fullwidth_Forms} (Short:
2909 \p{InHalfAndFullForms}) (240)
2910 \p{Han} \p{Script_Extensions=Han} (94_492)
2911 \p{Hang} \p{Hangul} (= \p{Script_Extensions=
2912 Hangul}) (NOT \p{Hangul_Syllables})
2913 (11_775)
2914 \p{Hangul} \p{Script_Extensions=Hangul} (Short:
2915 \p{Hang}; NOT \p{Hangul_Syllables})
2916 (11_775)
2917 X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
2918 (Short: \p{InCompatJamo}) (96)
2919 X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
2920 (256)
2921 X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
2922 (Short: \p{InJamoExtA}) (32)
2923 X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
2924 (Short: \p{InJamoExtB}) (80)
2925 \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
2926 (125)
2927 \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
2928 U+1100..115F, U+A960..A97C)
2929 \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
2930 (399)
2931 \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
2932 U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
2933 U+AC8C ...)
2934 \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
2935 LVT_Syllable} (10_773)
2936 \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
2937 (10_773: U+AC01..AC1B, U+AC1D..AC37,
2938 U+AC39..AC53, U+AC55..AC6F,
2939 U+AC71..AC8B, U+AC8D..ACA7 ...)
2940 \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
2941 Not_Applicable} (1_102_583 plus all
2942 above-Unicode code points)
2943 \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
2944 (1_102_583 plus all above-Unicode code
2945 points: U+0000..10FF, U+1200..A95F,
2946 U+A97D..ABFF, U+D7A4..D7AF,
2947 U+D7C7..D7CA, U+D7FC..infinity)
2948 \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
2949 (137)
2950 \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
2951 U+11A8..11FF, U+D7CB..D7FB)
2952 \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
2953 (95)
2954 \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
2955 U+1160..11A7, U+D7B0..D7C6)
2956 X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
2957 \p{InHangul}) (11_184)
2958 \p{Hani} \p{Han} (= \p{Script_Extensions=Han})
2959 (94_492)
2960 \p{Hanifi_Rohingya} \p{Script_Extensions=Hanifi_Rohingya}
2961 (Short: \p{Rohg}; NOT \p{Block=
2962 Hanifi_Rohingya}) (55)
2963 \p{Hano} \p{Hanunoo} (= \p{Script_Extensions=
2964 Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
2965 \p{Hanunoo} \p{Script_Extensions=Hanunoo} (Short:
2966 \p{Hano}; NOT \p{Block=Hanunoo}) (23)
2967 \p{Hatr} \p{Hatran} (= \p{Script_Extensions=
2968 Hatran}) (NOT \p{Block=Hatran}) (26)
2969 \p{Hatran} \p{Script_Extensions=Hatran} (Short:
2970 \p{Hatr}; NOT \p{Block=Hatran}) (26)
2971 \p{Hebr} \p{Hebrew} (= \p{Script_Extensions=
2972 Hebrew}) (NOT \p{Block=Hebrew}) (134)
2973 \p{Hebrew} \p{Script_Extensions=Hebrew} (Short:
2974 \p{Hebr}; NOT \p{Block=Hebrew}) (134)
2975 \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2976 \p{Hex: *} \p{Hex_Digit: *}
2977 \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2978 \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
2979 plus all above-Unicode code points:
2980 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
2981 \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
2982 U+0100..FF0F, U+FF1A..FF20,
2983 U+FF27..FF40, U+FF47..infinity)
2984 \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
2985 f], U+FF10..FF19, U+FF21..FF26,
2986 U+FF41..FF46)
2987 X \p{High_Private_Use_Surrogates} \p{Block=
2988 High_Private_Use_Surrogates} (Short:
2989 \p{InHighPUSurrogates}) (128)
2990 X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
2991 \p{Block=High_Private_Use_Surrogates})
2992 (128)
2993 X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
2994 \p{Hira} \p{Hiragana} (= \p{Script_Extensions=
2995 Hiragana}) (NOT \p{Block=Hiragana}) (431)
2996 \p{Hiragana} \p{Script_Extensions=Hiragana} (Short:
2997 \p{Hira}; NOT \p{Block=Hiragana}) (431)
2998 \p{Hluw} \p{Anatolian_Hieroglyphs} (=
2999 \p{Script_Extensions=
3000 Anatolian_Hieroglyphs}) (NOT \p{Block=
3001 Anatolian_Hieroglyphs}) (583)
3002 \p{Hmng} \p{Pahawh_Hmong} (= \p{Script_Extensions=
3003 Pahawh_Hmong}) (NOT \p{Block=
3004 Pahawh_Hmong}) (127)
3005 \p{Hmnp} \p{Nyiakeng_Puachue_Hmong} (=
3006 \p{Script_Extensions=
3007 Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
3008 Nyiakeng_Puachue_Hmong}) (71)
3009 \p{HorizSpace} \p{XPosixBlank} (18)
3010 \p{Hst: *} \p{Hangul_Syllable_Type: *}
3011 \p{Hung} \p{Old_Hungarian} (= \p{Script_Extensions=
3012 Old_Hungarian}) (NOT \p{Block=
3013 Old_Hungarian}) (108)
3014 D \p{Hyphen} \p{Hyphen=Y} (11)
3015 D \p{Hyphen: N*} Supplanted by Line_Break property values;
3016 see www.unicode.org/reports/tr14
3017 (Single: \P{Hyphen}) (1_114_101 plus all
3018 above-Unicode code points: [\x00-\x20!
3019 \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
3020 \[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-\xff],
3021 U+0100..0589, U+058B..1805,
3022 U+1807..200F, U+2012..2E16, U+2E18..30FA
3023 ...)
3024 D \p{Hyphen: Y*} Supplanted by Line_Break property values;
3025 see www.unicode.org/reports/tr14
3026 (Single: \p{Hyphen}) (11: [\-\xad],
3027 U+058A, U+1806, U+2010..2011, U+2E17,
3028 U+30FB ...)
3029 \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
3030 \p{Ideographic_Description_Characters})
3031 (134_434)
3032 \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (979_678 plus
3033 all above-Unicode code points: [\x00-
3034 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
3035 \[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4\xb6
3036 \xb8-\xb9\xbb-\xbf\xd7\xf7],
3037 U+02C2..02C5, U+02D2..02DF,
3038 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
3039 \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (134_434:
3040 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
3041 \xd8-\xf6\xf8-\xff], U+0100..02C1,
3042 U+02C6..02D1, U+02E0..02E4, U+02EC,
3043 U+02EE ...)
3044 \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (131_482)
3045 \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (982_630 plus
3046 all above-Unicode code points: [\x00-
3047 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@
3048 \[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-
3049 \xb9\xbb-\xbf\xd7\xf7], U+02C2..02C5,
3050 U+02D2..02DF, U+02E5..02EB, U+02ED,
3051 U+02EF..036F ...)
3052 \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (131_482: [A-
3053 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
3054 \xff], U+0100..02C1, U+02C6..02D1,
3055 U+02E0..02E4, U+02EC, U+02EE ...)
3056 \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
3057 \p{Ideographic_Description_Characters})
3058 (134_434)
3059 \p{IDC: *} \p{ID_Continue: *}
3060 \p{Identifier_Status: Allowed} (107_835: [\'\-.0-9:A-Z_a-z\xb7
3061 \xc0-\xd6\xd8-\xf6\xf8-\xff],
3062 U+0100..0131, U+0134..013E,
3063 U+0141..0148, U+014A..017E, U+018F ...)
3064 \p{Identifier_Status: Restricted} (1_006_277 plus all above-
3065 Unicode code points: [\x00-\x20!\"#\$
3066 \%&\(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x7f-
3067 \xb6\xb8-\xbf\xd7\xf7], U+0132..0133,
3068 U+013F..0140, U+0149, U+017F..018E,
3069 U+0190..019F ...)
3070 \p{Identifier_Type: Default_Ignorable} (395: [\xad], U+034F,
3071 U+061C, U+115F..1160, U+17B4..17B5,
3072 U+180B..180E ...)
3073 \p{Identifier_Type: Deprecated} (15: U+0149, U+0673, U+0F77,
3074 U+0F79, U+17A3..17A4, U+206A..206F ...)
3075 \p{Identifier_Type: Exclusion} (16_745: U+03E2..03EF,
3076 U+0800..082D, U+0830..083E,
3077 U+1680..169C, U+16A0..16EA, U+16EE..16F8
3078 ...)
3079 \p{Identifier_Type: Inclusion} (19: [\'\-.:\xb7], U+0375, U+058A,
3080 U+05F3..05F4, U+06FD..06FE, U+0F0B ...)
3081 \p{Identifier_Type: Limited_Use} (5248: U+0700..070D,
3082 U+070F..074A, U+074D..074F,
3083 U+07C0..07FA, U+07FD..07FF, U+0840..085B
3084 ...)
3085 \p{Identifier_Type: Not_Character} (970_247 plus all above-Unicode
3086 code points: [^\t\n\cK\f\r\x20-\x7e\x85
3087 \xa0-\xff], U+0378..0379, U+0380..0383,
3088 U+038B, U+038D, U+03A2 ...)
3089 \p{Identifier_Type: Not_NFKC} (4800: [\xa0\xa8\xaa\xaf\xb2-\xb5
3090 \xb8-\xba\xbc-\xbe], U+0132..0133,
3091 U+013F..0140, U+017F, U+01C4..01CC,
3092 U+01F1..01F3 ...)
3093 \p{Identifier_Type: Not_XID} (7998: [\t\n\cK\f\r\x20!\"#\$\%&
3094 \(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x85
3095 \xa1-\xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6
3096 \xbb\xbf\xd7\xf7], U+02C2..02C5,
3097 U+02D2..02D7, U+02DE..02DF,
3098 U+02E5..02EB, U+02ED ...)
3099 \p{Identifier_Type: Obsolete} (1611: U+018D, U+01AA..01AB,
3100 U+01B9..01BB, U+01BE..01BF,
3101 U+01F6..01F7, U+021C..021D ...)
3102 \p{Identifier_Type: Recommended} (107_816: [0-9A-Z_a-z\xc0-\xd6
3103 \xd8-\xf6\xf8-\xff], U+0100..0131,
3104 U+0134..013E, U+0141..0148,
3105 U+014A..017E, U+018F ...)
3106 \p{Identifier_Type: Technical} (1463: U+0180, U+018D,
3107 U+01AA..01AB, U+01BA..01BB, U+01BE,
3108 U+01C0..01C3 ...)
3109 \p{Identifier_Type: Uncommon_Use} (348: U+0181..018C, U+018E,
3110 U+0190..019F, U+01A2..01A9,
3111 U+01AC..01AE, U+01B1..01B8 ...)
3112 \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
3113 (101_652)
3114 \p{Ideo: *} \p{Ideographic: *}
3115 \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
3116 (101_652)
3117 \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_012_460
3118 plus all above-Unicode code points:
3119 U+0000..3005, U+3008..3020,
3120 U+302A..3037, U+303B..33FF,
3121 U+4DC0..4DFF, U+9FFD..F8FF ...)
3122 \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (101_652:
3123 U+3006..3007, U+3021..3029,
3124 U+3038..303A, U+3400..4DBF,
3125 U+4E00..9FFC, U+F900..FA6D ...)
3126 X \p{Ideographic_Description_Characters} \p{Block=
3127 Ideographic_Description_Characters}
3128 (Short: \p{InIDC}) (16)
3129 X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
3130 \p{Block=
3131 Ideographic_Symbols_And_Punctuation})
3132 (32)
3133 X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
3134 Ideographic_Symbols_And_Punctuation}
3135 (Short: \p{InIdeographicSymbols}) (32)
3136 \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (131_482)
3137 \p{IDS: *} \p{ID_Start: *}
3138 \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
3139 \p{IDSB}) (10)
3140 \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
3141 (1_114_102 plus all above-Unicode code
3142 points: U+0000..2FEF, U+2FF2..2FF3,
3143 U+2FFC..infinity)
3144 \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
3145 U+2FF0..2FF1, U+2FF4..2FFB)
3146 \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
3147 \p{IDST}) (2)
3148 \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
3149 (1_114_110 plus all above-Unicode code
3150 points: U+0000..2FF1, U+2FF4..infinity)
3151 \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
3152 U+2FF2..2FF3)
3153 \p{IDSB} \p{IDS_Binary_Operator} (=
3154 \p{IDS_Binary_Operator=Y}) (10)
3155 \p{IDSB: *} \p{IDS_Binary_Operator: *}
3156 \p{IDST} \p{IDS_Trinary_Operator} (=
3157 \p{IDS_Trinary_Operator=Y}) (2)
3158 \p{IDST: *} \p{IDS_Trinary_Operator: *}
3159 \p{Imperial_Aramaic} \p{Script_Extensions=Imperial_Aramaic}
3160 (Short: \p{Armi}; NOT \p{Block=
3161 Imperial_Aramaic}) (31)
3162 \p{In: *} \p{Present_In: *} (Perl extension)
3163 X \p{In_*} \p{Block: *}
3164 X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
3165 Common_Indic_Number_Forms}) (16)
3166 \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
3167 (351: U+093C, U+0941..0944, U+094D,
3168 U+0952, U+0956..0957, U+0962..0963 ...)
3169 \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
3170 BottomAndLeft}) (1: U+A9BF)
3171 \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
3172 BottomAndRight}) (4: U+1B3B, U+A9BE,
3173 U+A9C0, U+11942)
3174 \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (64:
3175 U+093F, U+094E, U+09BF, U+09C7..09C8,
3176 U+0A3F, U+0ABF ...)
3177 \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
3178 LeftAndRight}) (22: U+09CB..09CC,
3179 U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
3180 U+0DDC, U+0DDE ...)
3181 \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_902
3182 plus all above-Unicode code points:
3183 U+0000..08FF, U+0904..0939, U+093D,
3184 U+0950, U+0958..0961, U+0964..0980 ...)
3185 \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
3186 Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
3187 U+10A01, U+10A06)
3188 \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (288:
3189 U+0903, U+093B, U+093E, U+0940,
3190 U+0949..094C, U+094F ...)
3191 \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (415:
3192 U+0900..0902, U+093A, U+0945..0948,
3193 U+0951, U+0953..0955, U+0981 ...)
3194 \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
3195 TopAndBottom}) (10: U+0C48, U+0F73,
3196 U+0F76..0F79, U+0F81, U+1B3C,
3197 U+1112E..1112F)
3198 \p{Indic_Positional_Category: Top_And_Bottom_And_Left} (Short:
3199 \p{InPC=TopAndBottomAndLeft}) (2:
3200 U+103C, U+1171E)
3201 \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
3202 \p{InPC=TopAndBottomAndRight}) (1:
3203 U+1B3D)
3204 \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
3205 TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
3206 U+1C29, U+114BB, U+115B9)
3207 \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
3208 \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
3209 U+0DDD, U+17BF, U+115BB)
3210 \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
3211 TopAndRight}) (13: U+0AC9, U+0B57,
3212 U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
3213 U+1925..1926 ...)
3214 \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
3215 VisualOrderLeft}) (19: U+0E40..0E44,
3216 U+0EC0..0EC4, U+19B5..19B7, U+19BA,
3217 U+AAB5..AAB6, U+AAB9 ...)
3218 X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
3219 \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
3220 (17: U+093D, U+09BD, U+0ABD, U+0B3D,
3221 U+0C3D, U+0CBD ...)
3222 \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (91:
3223 U+0900..0902, U+0981..0982, U+09FC,
3224 U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
3225 \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
3226 \p{InSC=BrahmiJoiningNumber}) (20:
3227 U+11052..11065)
3228 \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
3229 CantillationMark}) (59: U+0951..0952,
3230 U+0A51, U+0AFA..0AFC, U+1CD0..1CD2,
3231 U+1CD4..1CE1, U+1CF4 ...)
3232 \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
3233 (2195: U+0915..0939, U+0958..095F,
3234 U+0978..097F, U+0995..09A8,
3235 U+09AA..09B0, U+09B2 ...)
3236 \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
3237 ConsonantDead}) (12: U+09CE,
3238 U+0D54..0D56, U+0D7A..0D7F, U+1CF2..1CF3)
3239 \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
3240 ConsonantFinal}) (67: U+1930..1931,
3241 U+1933..1939, U+19C1..19C7,
3242 U+1A58..1A59, U+1BBE..1BBF, U+1BF0..1BF1
3243 ...)
3244 \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
3245 \p{InSC=ConsonantHeadLetter}) (5:
3246 U+0F88..0F8C)
3247 \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
3248 \p{InSC=ConsonantInitialPostfixed}) (1:
3249 U+1A5A)
3250 \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
3251 ConsonantKiller}) (2: U+0E4C, U+17CD)
3252 \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
3253 ConsonantMedial}) (31: U+0A75,
3254 U+0EBC..0EBD, U+103B..103E,
3255 U+105E..1060, U+1082, U+1A55..1A56 ...)
3256 \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
3257 \p{InSC=ConsonantPlaceholder}) (22: [\-
3258 \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
3259 U+104E, U+1900 ...)
3260 \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
3261 \p{InSC=ConsonantPrecedingRepha}) (3:
3262 U+0D4E, U+11941, U+11D46)
3263 \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
3264 ConsonantPrefixed}) (10: U+111C2..111C3,
3265 U+1193F, U+11A3A, U+11A84..11A89)
3266 \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
3267 ConsonantSubjoined}) (94: U+0F8D..0F97,
3268 U+0F99..0FBC, U+1929..192B, U+1A57,
3269 U+1A5B..1A5E, U+1BA1..1BA3 ...)
3270 \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
3271 \p{InSC=ConsonantSucceedingRepha}) (4:
3272 U+17CC, U+1B03, U+1B81, U+A982)
3273 \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
3274 \p{InSC=ConsonantWithStacker}) (8:
3275 U+0CF1..0CF2, U+1CF5..1CF6,
3276 U+11003..11004, U+11460..11461)
3277 \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
3278 GeminationMark}) (3: U+0A71, U+11237,
3279 U+11A98)
3280 \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
3281 InvisibleStacker}) (12: U+1039, U+17D2,
3282 U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
3283 \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
3284 U+200D)
3285 \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
3286 ModifyingLetter}) (1: U+0B83)
3287 \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
3288 NonJoiner}) (1: U+200C)
3289 \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (31:
3290 U+093C, U+09BC, U+0A3C, U+0ABC,
3291 U+0AFD..0AFF, U+0B3C ...)
3292 \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (491:
3293 [0-9], U+0966..096F, U+09E6..09EF,
3294 U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
3295 ...)
3296 \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
3297 NumberJoiner}) (1: U+1107F)
3298 \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
3299 (1_109_572 plus all above-Unicode code
3300 points: [\x00-\x20!\"#\$\%&\'\(\)*+,.
3301 \/:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
3302 \x9f\xa1-\xb1\xb4-\xd6\xd8-\xff],
3303 U+0100..08FF, U+0950, U+0953..0954,
3304 U+0964..0965, U+0970..0971 ...)
3305 \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
3306 PureKiller}) (23: U+0D3B..0D3C, U+0E3A,
3307 U+0E4E, U+0EBA, U+0F84, U+103A ...)
3308 \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
3309 RegisterShifter}) (2: U+17C9..17CA)
3310 \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
3311 SyllableModifier}) (25: [\xb2-\xb3],
3312 U+09FE, U+0F35, U+0F37, U+0FC6, U+17CB
3313 ...)
3314 \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
3315 ToneLetter}) (7: U+1970..1974, U+AAC0,
3316 U+AAC2)
3317 \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
3318 (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
3319 U+1063..1064, U+1069..106D, U+1087..108D
3320 ...)
3321 \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
3322 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3323 U+0BCD ...)
3324 \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3325 (35: U+0903, U+0983, U+0A03, U+0A83,
3326 U+0B03, U+0C03 ...)
3327 \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3328 U+1963..196D, U+A85E..A861, U+A866,
3329 U+A922..A92A, U+11150..11154)
3330 \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3331 VowelDependent}) (683: U+093A..093B,
3332 U+093E..094C, U+094E..094F,
3333 U+0955..0957, U+0962..0963, U+09BE..09C4
3334 ...)
3335 \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3336 VowelIndependent}) (484: U+0904..0914,
3337 U+0960..0961, U+0972..0977,
3338 U+0985..098C, U+098F..0990, U+0993..0994
3339 ...)
3340 \p{Inherited} \p{Script_Extensions=Inherited} (Short:
3341 \p{Zinh}) (503)
3342 \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
3343 (Short: \p{Pi}) (12)
3344 \p{InPC: *} \p{Indic_Positional_Category: *}
3345 \p{InSC: *} \p{Indic_Syllabic_Category: *}
3346 \p{Inscriptional_Pahlavi} \p{Script_Extensions=
3347 Inscriptional_Pahlavi} (Short: \p{Phli};
3348 NOT \p{Block=Inscriptional_Pahlavi}) (27)
3349 \p{Inscriptional_Parthian} \p{Script_Extensions=
3350 Inscriptional_Parthian} (Short:
3351 \p{Prti}; NOT \p{Block=
3352 Inscriptional_Parthian}) (30)
3353 X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
3354 IPA_Extensions}) (96)
3355 X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
3356 \p{InIPAExt}) (96)
3357 \p{Is_*} \p{*} (Any exceptions are individually
3358 noted beginning with the word NOT.) If
3359 an entry has flag(s) at its beginning,
3360 like "D", the "Is_" form has the same
3361 flag(s)
3362 \p{Ital} \p{Old_Italic} (= \p{Script_Extensions=
3363 Old_Italic}) (NOT \p{Block=Old_Italic})
3364 (39)
3365 X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
3366 (256)
3367 X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
3368 Hangul_Jamo_Extended_A}) (32)
3369 X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
3370 Hangul_Jamo_Extended_B}) (80)
3371 \p{Java} \p{Javanese} (= \p{Script_Extensions=
3372 Javanese}) (NOT \p{Block=Javanese}) (91)
3373 \p{Javanese} \p{Script_Extensions=Javanese} (Short:
3374 \p{Java}; NOT \p{Block=Javanese}) (91)
3375 \p{Jg: *} \p{Joining_Group: *}
3376 \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
3377 \p{Join_C: *} \p{Join_Control: *}
3378 \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
3379 \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
3380 plus all above-Unicode code points:
3381 U+0000..200B, U+200E..infinity)
3382 \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2:
3383 U+200C..200D)
3384 \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
3385 U+08BB)
3386 \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
3387 U+08BD)
3388 \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (2:
3389 U+08BC, U+08C4)
3390 \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (9: U+0639..063A,
3391 U+06A0, U+06FC, U+075D..075F, U+08B3,
3392 U+08C3)
3393 \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
3394 \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10: U+0622..0623,
3395 U+0625, U+0627, U+0671..0673, U+0675,
3396 U+0773..0774)
3397 \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (27: U+0628,
3398 U+062A..062B, U+066E, U+0679..0680,
3399 U+0750..0756, U+08A0..08A1 ...)
3400 \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
3401 \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
3402 BurushaskiYehBarree}) (2: U+077A..077B)
3403 \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15: U+062F..0630,
3404 U+0688..0690, U+06EE, U+0759..075A,
3405 U+08AE)
3406 \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
3407 U+0715..0716, U+072A, U+072F)
3408 \p{Joining_Group: E} (Short: \p{Jg=E}) (1: U+0725)
3409 \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
3410 U+063D..063F, U+06CC, U+06CE,
3411 U+0775..0776)
3412 \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1: U+074F)
3413 \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10: U+0641,
3414 U+06A1..06A6, U+0760..0761, U+08A4)
3415 \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
3416 U+0724)
3417 \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (15: U+063B..063C,
3418 U+06A9, U+06AB, U+06AF..06B4,
3419 U+0762..0764, U+08B0 ...)
3420 \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
3421 U+072E)
3422 \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (21: U+062C..062E,
3423 U+0681..0687, U+06BF, U+0757..0758,
3424 U+076E..076F, U+0772 ...)
3425 \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
3426 HamzaOnHehGoal}) (1: U+06C3)
3427 \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
3428 HanifiRohingyaKinnaYa}) (4: U+10D19,
3429 U+10D1E, U+10D20, U+10D23)
3430 \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
3431 HanifiRohingyaPa}) (3: U+10D02, U+10D09,
3432 U+10D1C)
3433 \p{Joining_Group: He} (Short: \p{Jg=He}) (1: U+0717)
3434 \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1: U+0647)
3435 \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
3436 U+06C1..06C2)
3437 \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
3438 \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6: U+0643,
3439 U+06AC..06AE, U+077F, U+08B4)
3440 \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
3441 \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
3442 \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
3443 U+06BE, U+06FF)
3444 \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (8: U+0644,
3445 U+06B5..06B8, U+076A, U+08A6, U+08C7)
3446 \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
3447 \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
3448 U+0866)
3449 \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
3450 U+0861)
3451 \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
3452 U+0868)
3453 \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
3454 (1: U+0869)
3455 \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
3456 U+0860)
3457 \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
3458 U+0864)
3459 \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
3460 (1: U+0865)
3461 \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
3462 U+0862)
3463 \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
3464 U+0867)
3465 \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
3466 U+086A)
3467 \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
3468 U+0863)
3469 \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
3470 ManichaeanAleph}) (1: U+10AC0)
3471 \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
3472 (2: U+10AD9..10ADA)
3473 \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
3474 (2: U+10AC1..10AC2)
3475 \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
3476 ManichaeanDaleth}) (1: U+10AC5)
3477 \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
3478 ManichaeanDhamedh}) (1: U+10AD4)
3479 \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
3480 (1: U+10AEC)
3481 \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
3482 ManichaeanGimel}) (2: U+10AC3..10AC4)
3483 \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
3484 (1: U+10ACD)
3485 \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
3486 ManichaeanHundred}) (1: U+10AEF)
3487 \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
3488 (3: U+10AD0..10AD2)
3489 \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
3490 ManichaeanLamedh}) (1: U+10AD3)
3491 \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
3492 (1: U+10AD6)
3493 \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
3494 (1: U+10AD7)
3495 \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
3496 (1: U+10AEB)
3497 \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
3498 U+10ADB..10ADC)
3499 \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
3500 (3: U+10ADE..10AE0)
3501 \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
3502 (1: U+10AE1)
3503 \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
3504 ManichaeanSadhe}) (1: U+10ADD)
3505 \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
3506 ManichaeanSamekh}) (1: U+10AD8)
3507 \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
3508 (1: U+10AE4)
3509 \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
3510 (1: U+10AED)
3511 \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
3512 (1: U+10ACE)
3513 \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
3514 ManichaeanThamedh}) (1: U+10AD5)
3515 \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
3516 ManichaeanTwenty}) (1: U+10AEE)
3517 \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
3518 (1: U+10AC7)
3519 \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
3520 (1: U+10ACF)
3521 \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
3522 ManichaeanZayin}) (2: U+10AC9..10ACA)
3523 \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
3524 U+0765..0766, U+08A7)
3525 \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1: U+0721)
3526 \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
3527 (1_113_790 plus all above-Unicode code
3528 points: U+0000..061F, U+0621, U+0640,
3529 U+064B..066D, U+0670, U+0674 ...)
3530 \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8: U+0646,
3531 U+06B9..06BC, U+0767..0769)
3532 \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1: U+0722)
3533 \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1: U+06BD)
3534 \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1: U+0726)
3535 \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5: U+0642, U+066F,
3536 U+06A7..06A8, U+08A5)
3537 \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
3538 \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (19: U+0631..0632,
3539 U+0691..0699, U+06EF, U+075B,
3540 U+076B..076C, U+0771 ...)
3541 \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
3542 U+0727)
3543 \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
3544 U+08AC)
3545 \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6: U+0635..0636,
3546 U+069D..069E, U+06FB, U+08AF)
3547 \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
3548 \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
3549 U+069A..069C, U+06FA, U+075C, U+076D,
3550 U+0770 ...)
3551 \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
3552 \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
3553 \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
3554 U+08B1)
3555 \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
3556 \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
3557 \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4: U+0637..0638,
3558 U+069F, U+08A3)
3559 \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1: U+072C)
3560 \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
3561 U+0629, U+06C0, U+06D5)
3562 \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
3563 Hamza_On_Heh_Goal} (1)
3564 \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
3565 \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
3566 U+0676..0677, U+06C4..06CB, U+06CF,
3567 U+0778..0779 ...)
3568 \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
3569 U+0649..064A, U+0678, U+06D0..06D1,
3570 U+0777 ...)
3571 \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
3572 U+06D2..06D3)
3573 \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
3574 U+06CD)
3575 \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
3576 \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
3577 \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
3578 \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
3579 \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (4)
3580 \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (586)
3581 \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (586: U+0620,
3582 U+0626, U+0628, U+062A..062E,
3583 U+0633..063F, U+0641..0647 ...)
3584 \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4: U+0640,
3585 U+07FA, U+180A, U+200D)
3586 \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (5)
3587 \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (5: U+A872,
3588 U+10ACD, U+10AD7, U+10D00, U+10FCB)
3589 \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_390 plus
3590 all above-Unicode code points: [\x00-
3591 \xac\xae-\xff], U+0100..02FF,
3592 U+0370..0482, U+048A..0590, U+05BE,
3593 U+05C0 ...)
3594 \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (130)
3595 \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (130:
3596 U+0622..0625, U+0627, U+0629,
3597 U+062F..0632, U+0648, U+0671..0673 ...)
3598 \p{Joining_Type: T} \p{Joining_Type=Transparent} (1997)
3599 \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1997: [\xad],
3600 U+0300..036F, U+0483..0489,
3601 U+0591..05BD, U+05BF, U+05C1..05C2 ...)
3602 \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_390
3603 plus all above-Unicode code points)
3604 \p{Jt: *} \p{Joining_Type: *}
3605 \p{Kaithi} \p{Script_Extensions=Kaithi} (Short:
3606 \p{Kthi}; NOT \p{Block=Kaithi}) (87)
3607 \p{Kali} \p{Kayah_Li} (= \p{Script_Extensions=
3608 Kayah_Li}) (48)
3609 \p{Kana} \p{Katakana} (= \p{Script_Extensions=
3610 Katakana}) (NOT \p{Block=Katakana}) (356)
3611 X \p{Kana_Ext_A} \p{Kana_Extended_A} (= \p{Block=
3612 Kana_Extended_A}) (48)
3613 X \p{Kana_Extended_A} \p{Block=Kana_Extended_A} (Short:
3614 \p{InKanaExtA}) (48)
3615 X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
3616 Kana_Supplement}) (256)
3617 X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
3618 \p{InKanaSup}) (256)
3619 X \p{Kanbun} \p{Block=Kanbun} (16)
3620 X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
3621 Kangxi_Radicals}) (224)
3622 X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
3623 \p{InKangxi}) (224)
3624 \p{Kannada} \p{Script_Extensions=Kannada} (Short:
3625 \p{Knda}; NOT \p{Block=Kannada}) (104)
3626 \p{Katakana} \p{Script_Extensions=Katakana} (Short:
3627 \p{Kana}; NOT \p{Block=Katakana}) (356)
3628 X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
3629 \p{Block=Katakana_Phonetic_Extensions})
3630 (16)
3631 X \p{Katakana_Phonetic_Extensions} \p{Block=
3632 Katakana_Phonetic_Extensions} (Short:
3633 \p{InKatakanaExt}) (16)
3634 \p{Kayah_Li} \p{Script_Extensions=Kayah_Li} (Short:
3635 \p{Kali}) (48)
3636 \p{Khar} \p{Kharoshthi} (= \p{Script_Extensions=
3637 Kharoshthi}) (NOT \p{Block=Kharoshthi})
3638 (68)
3639 \p{Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
3640 \p{Khar}; NOT \p{Block=Kharoshthi}) (68)
3641 \p{Khitan_Small_Script} \p{Script_Extensions=Khitan_Small_Script}
3642 (Short: \p{Kits}; NOT \p{Block=
3643 Khitan_Small_Script}) (471)
3644 \p{Khmer} \p{Script_Extensions=Khmer} (Short:
3645 \p{Khmr}; NOT \p{Block=Khmer}) (146)
3646 X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
3647 \p{Khmr} \p{Khmer} (= \p{Script_Extensions=Khmer})
3648 (NOT \p{Block=Khmer}) (146)
3649 \p{Khoj} \p{Khojki} (= \p{Script_Extensions=
3650 Khojki}) (NOT \p{Block=Khojki}) (82)
3651 \p{Khojki} \p{Script_Extensions=Khojki} (Short:
3652 \p{Khoj}; NOT \p{Block=Khojki}) (82)
3653 \p{Khudawadi} \p{Script_Extensions=Khudawadi} (Short:
3654 \p{Sind}; NOT \p{Block=Khudawadi}) (81)
3655 \p{Kits} \p{Khitan_Small_Script} (=
3656 \p{Script_Extensions=
3657 Khitan_Small_Script}) (NOT \p{Block=
3658 Khitan_Small_Script}) (471)
3659 \p{Knda} \p{Kannada} (= \p{Script_Extensions=
3660 Kannada}) (NOT \p{Block=Kannada}) (104)
3661 \p{Kthi} \p{Kaithi} (= \p{Script_Extensions=
3662 Kaithi}) (NOT \p{Block=Kaithi}) (87)
3663 \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
3664 (131_241)
3665 X \p{L&} \p{Cased_Letter} (= \p{General_Category=
3666 Cased_Letter}) (3977)
3667 X \p{L_} \p{Cased_Letter} (= \p{General_Category=
3668 Cased_Letter}) Note the trailing '_'
3669 matters in spite of loose matching
3670 rules. (3977)
3671 \p{Lana} \p{Tai_Tham} (= \p{Script_Extensions=
3672 Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
3673 \p{Lao} \p{Script_Extensions=Lao} (NOT \p{Block=
3674 Lao}) (82)
3675 \p{Laoo} \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
3676 \p{Block=Lao}) (82)
3677 \p{Latin} \p{Script_Extensions=Latin} (Short:
3678 \p{Latn}) (1403)
3679 X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
3680 Latin_1_Supplement}) (128)
3681 X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
3682 Latin_1_Supplement}) (128)
3683 X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
3684 \p{InLatin1}) (128)
3685 X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
3686 Latin_Extended_A}) (128)
3687 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
3688 \p{Block=Latin_Extended_Additional})
3689 (256)
3690 X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
3691 Latin_Extended_B}) (208)
3692 X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
3693 Latin_Extended_C}) (32)
3694 X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
3695 Latin_Extended_D}) (224)
3696 X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block=
3697 Latin_Extended_E}) (64)
3698 X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
3699 \p{InLatinExtA}) (128)
3700 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
3701 (Short: \p{InLatinExtAdditional}) (256)
3702 X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
3703 \p{InLatinExtB}) (208)
3704 X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
3705 \p{InLatinExtC}) (32)
3706 X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
3707 \p{InLatinExtD}) (224)
3708 X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short:
3709 \p{InLatinExtE}) (64)
3710 \p{Latn} \p{Latin} (= \p{Script_Extensions=Latin})
3711 (1403)
3712 \p{Lb: *} \p{Line_Break: *}
3713 \p{LC} \p{Cased_Letter} (= \p{General_Category=
3714 Cased_Letter}) (3977)
3715 \p{Lepc} \p{Lepcha} (= \p{Script_Extensions=
3716 Lepcha}) (NOT \p{Block=Lepcha}) (74)
3717 \p{Lepcha} \p{Script_Extensions=Lepcha} (Short:
3718 \p{Lepc}; NOT \p{Block=Lepcha}) (74)
3719 \p{Letter} \p{General_Category=Letter} (Short: \p{L})
3720 (131_241)
3721 \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
3722 \p{Nl}) (236)
3723 X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
3724 \p{Limb} \p{Limbu} (= \p{Script_Extensions=Limbu})
3725 (NOT \p{Block=Limbu}) (69)
3726 \p{Limbu} \p{Script_Extensions=Limbu} (Short:
3727 \p{Limb}; NOT \p{Block=Limbu}) (69)
3728 \p{Lina} \p{Linear_A} (= \p{Script_Extensions=
3729 Linear_A}) (NOT \p{Block=Linear_A}) (386)
3730 \p{Linb} \p{Linear_B} (= \p{Script_Extensions=
3731 Linear_B}) (268)
3732 \p{Line_Break: AI} \p{Line_Break=Ambiguous} (707)
3733 \p{Line_Break: AL} \p{Line_Break=Alphabetic} (21_400)
3734 \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (21_400: [#&*<=>\@A-
3735 Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
3736 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
3737 U+02CE..02CF, U+02D1..02D7, U+02DC,
3738 U+02DE ...)
3739 \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-\xa8\xaa
3740 \xb2-\xb3\xb6-\xba\xbc-\xbe\xd7\xf7],
3741 U+02C7, U+02C9..02CB, U+02CD, U+02D0,
3742 U+02D8..02DB ...)
3743 \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
3744 \p{Line_Break: BA} \p{Line_Break=Break_After} (244)
3745 \p{Line_Break: BB} \p{Line_Break=Break_Before} (45)
3746 \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
3747 \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (244: [\t\|\xad],
3748 U+058A, U+05BE, U+0964..0965,
3749 U+0E5A..0E5B, U+0F0B ...)
3750 \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (45: [\xb4],
3751 U+02C8, U+02CC, U+02DF, U+0C77, U+0C84
3752 ...)
3753 \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
3754 U+2E3A..2E3B)
3755 \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
3756 \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
3757 \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
3758 \p{Line_Break: CJ} \p{Line_Break=
3759 Conditional_Japanese_Starter} (58)
3760 \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (91)
3761 \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
3762 \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (91: [\}],
3763 U+0F3B, U+0F3D, U+169C, U+2046, U+207E
3764 ...)
3765 \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (2286)
3766 \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2286: [^\t\n
3767 \cK\f\r\x20-\x7e\x85\xa0-\xff],
3768 U+0300..034E, U+0350..035B,
3769 U+0363..036F, U+0483..0489, U+0591..05BD
3770 ...)
3771 \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (750:
3772 U+0E01..0E3A, U+0E40..0E4E,
3773 U+0E81..0E82, U+0E84, U+0E86..0E8A,
3774 U+0E8C..0EA3 ...)
3775 \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
3776 (58: U+3041, U+3043, U+3045, U+3047,
3777 U+3049, U+3063 ...)
3778 \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
3779 \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
3780 \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
3781 \p{Line_Break: E_Base} (Short: \p{Lb=EB}) (122: U+261D, U+26F9,
3782 U+270A..270D, U+1F385, U+1F3C2..1F3C4,
3783 U+1F3C7 ...)
3784 \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
3785 \p{Line_Break: EB} \p{Line_Break=E_Base} (122)
3786 \p{Line_Break: EM} \p{Line_Break=E_Modifier} (5)
3787 \p{Line_Break: EX} \p{Line_Break=Exclamation} (37)
3788 \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37: [!?], U+05C6,
3789 U+061B, U+061E..061F, U+06D4, U+07F9 ...)
3790 \p{Line_Break: GL} \p{Line_Break=Glue} (26)
3791 \p{Line_Break: Glue} (Short: \p{Lb=GL}) (26: [\xa0], U+034F,
3792 U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
3793 \p{Line_Break: H2} (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
3794 U+AC38, U+AC54, U+AC70, U+AC8C ...)
3795 \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
3796 U+AC1D..AC37, U+AC39..AC53,
3797 U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
3798 ...)
3799 \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75:
3800 U+05D0..05EA, U+05EF..05F2, U+FB1D,
3801 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
3802 ...)
3803 \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (75)
3804 \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
3805 \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1: [\-])
3806 \p{Line_Break: ID} \p{Line_Break=Ideographic} (172_462)
3807 \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_462:
3808 U+231A..231B, U+23F0..23F3,
3809 U+2600..2603, U+2614..2615, U+2618,
3810 U+261A..261C ...)
3811 \p{Line_Break: IN} \p{Line_Break=Inseparable} (6)
3812 \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
3813 U+037E, U+0589, U+060C..060D, U+07F8,
3814 U+2044 ...)
3815 \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
3816 U+22EF, U+FE19, U+10AF6)
3817 \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
3818 \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
3819 \p{Line_Break: JL} (Short: \p{Lb=JL}) (125: U+1100..115F,
3820 U+A960..A97C)
3821 \p{Line_Break: JT} (Short: \p{Lb=JT}) (137: U+11A8..11FF,
3822 U+D7CB..D7FB)
3823 \p{Line_Break: JV} (Short: \p{Lb=JV}) (95: U+1160..11A7,
3824 U+D7B0..D7C6)
3825 \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
3826 \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
3827 \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
3828 U+2028..2029)
3829 \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
3830 \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
3831 \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6,
3832 U+203C..203D, U+2047..2049, U+3005,
3833 U+301C, U+303B..303C ...)
3834 \p{Line_Break: NS} \p{Line_Break=Nonstarter} (33)
3835 \p{Line_Break: NU} \p{Line_Break=Numeric} (642)
3836 \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (642: [0-9],
3837 U+0660..0669, U+066B..066C,
3838 U+06F0..06F9, U+07C0..07C9, U+0966..096F
3839 ...)
3840 \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (88)
3841 \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (88: [\(\[\{
3842 \xa1\xbf], U+0F3A, U+0F3C, U+169B,
3843 U+201A, U+201E ...)
3844 \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (36)
3845 \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (36: [\%\xa2
3846 \xb0], U+0609..060B, U+066A,
3847 U+09F2..09F3, U+09F9, U+0D79 ...)
3848 \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (68)
3849 \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (68: [\$+\\\xa3-
3850 \xa5\xb1], U+058F, U+07FE..07FF, U+09FB,
3851 U+0AF1, U+0BF9 ...)
3852 \p{Line_Break: QU} \p{Line_Break=Quotation} (39)
3853 \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
3854 U+2018..2019, U+201B..201D, U+201F,
3855 U+2039..203A, U+275B..2760 ...)
3856 \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
3857 U+1F1E6..1F1FF)
3858 \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
3859 \p{Line_Break: SA} \p{Line_Break=Complex_Context} (750)
3860 D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
3861 \p{Line_Break: SP} \p{Line_Break=Space} (1)
3862 \p{Line_Break: Space} (Short: \p{Lb=SP}) (1: [\x20])
3863 D \p{Line_Break: Surrogate} Surrogates should never appear in well-
3864 formed text, and therefore shouldn't be
3865 the basis for line breaking (Short:
3866 \p{Lb=SG}) (2048: U+D800..DFFF)
3867 \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
3868 \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (901_256 plus all
3869 above-Unicode code points: U+0378..0379,
3870 U+0380..0383, U+038B, U+038D, U+03A2,
3871 U+0530 ...)
3872 \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
3873 \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
3874 \p{Line_Break: XX} \p{Line_Break=Unknown} (901_256 plus all
3875 above-Unicode code points)
3876 \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
3877 \p{Line_Break: ZWJ} (Short: \p{Lb=ZWJ}) (1: U+200D)
3878 \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
3879 \p{Line_Separator} \p{General_Category=Line_Separator}
3880 (Short: \p{Zl}) (1)
3881 \p{Linear_A} \p{Script_Extensions=Linear_A} (Short:
3882 \p{Lina}; NOT \p{Block=Linear_A}) (386)
3883 \p{Linear_B} \p{Script_Extensions=Linear_B} (Short:
3884 \p{Linb}) (268)
3885 X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
3886 X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
3887 \p{Lisu} \p{Script_Extensions=Lisu} (NOT \p{Block=
3888 Lisu}) (49)
3889 X \p{Lisu_Sup} \p{Lisu_Supplement} (= \p{Block=
3890 Lisu_Supplement}) (16)
3891 X \p{Lisu_Supplement} \p{Block=Lisu_Supplement} (Short:
3892 \p{InLisuSup}) (16)
3893 \p{Ll} \p{Lowercase_Letter} (=
3894 \p{General_Category=Lowercase_Letter})
3895 (/i= General_Category=Cased_Letter)
3896 (2155)
3897 \p{Lm} \p{Modifier_Letter} (=
3898 \p{General_Category=Modifier_Letter})
3899 (260)
3900 \p{Lo} \p{Other_Letter} (= \p{General_Category=
3901 Other_Letter}) (127_004)
3902 \p{LOE} \p{Logical_Order_Exception} (=
3903 \p{Logical_Order_Exception=Y}) (19)
3904 \p{LOE: *} \p{Logical_Order_Exception: *}
3905 \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
3906 \p{LOE}) (19)
3907 \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
3908 (1_114_093 plus all above-Unicode code
3909 points: U+0000..0E3F, U+0E45..0EBF,
3910 U+0EC5..19B4, U+19B8..19B9,
3911 U+19BB..AAB4, U+AAB7..AAB8 ...)
3912 \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
3913 U+0E40..0E44, U+0EC0..0EC4,
3914 U+19B5..19B7, U+19BA, U+AAB5..AAB6,
3915 U+AAB9 ...)
3916 X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
3917 \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3918 Cased=Yes) (2344)
3919 \p{Lower: *} \p{Lowercase: *}
3920 \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3921 Cased=Yes) (2344)
3922 \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
3923 No) (1_111_768 plus all above-Unicode
3924 code points: [\x00-\x20!\"#\$\%&\'
3925 \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`\{
3926 \|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9\xbb-
3927 \xde\xf7], U+0100, U+0102, U+0104,
3928 U+0106, U+0108 ...)
3929 \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
3930 Yes) (2344: [a-z\xaa\xb5\xba\xdf-\xf6
3931 \xf8-\xff], U+0101, U+0103, U+0105,
3932 U+0107, U+0109 ...)
3933 \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
3934 (Short: \p{Ll}; /i= General_Category=
3935 Cased_Letter) (2155)
3936 \p{Lt} \p{Titlecase_Letter} (=
3937 \p{General_Category=Titlecase_Letter})
3938 (/i= General_Category=Cased_Letter) (31)
3939 \p{Lu} \p{Uppercase_Letter} (=
3940 \p{General_Category=Uppercase_Letter})
3941 (/i= General_Category=Cased_Letter)
3942 (1791)
3943 \p{Lyci} \p{Lycian} (= \p{Script_Extensions=
3944 Lycian}) (NOT \p{Block=Lycian}) (29)
3945 \p{Lycian} \p{Script_Extensions=Lycian} (Short:
3946 \p{Lyci}; NOT \p{Block=Lycian}) (29)
3947 \p{Lydi} \p{Lydian} (= \p{Script_Extensions=
3948 Lydian}) (NOT \p{Block=Lydian}) (27)
3949 \p{Lydian} \p{Script_Extensions=Lydian} (Short:
3950 \p{Lydi}; NOT \p{Block=Lydian}) (27)
3951 \p{M} \pM \p{Mark} (= \p{General_Category=Mark})
3952 (2295)
3953 \p{Mahajani} \p{Script_Extensions=Mahajani} (Short:
3954 \p{Mahj}; NOT \p{Block=Mahajani}) (61)
3955 \p{Mahj} \p{Mahajani} (= \p{Script_Extensions=
3956 Mahajani}) (NOT \p{Block=Mahajani}) (61)
3957 X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
3958 Mahjong_Tiles}) (48)
3959 X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
3960 \p{InMahjong}) (48)
3961 \p{Maka} \p{Makasar} (= \p{Script_Extensions=
3962 Makasar}) (NOT \p{Block=Makasar}) (25)
3963 \p{Makasar} \p{Script_Extensions=Makasar} (Short:
3964 \p{Maka}; NOT \p{Block=Makasar}) (25)
3965 \p{Malayalam} \p{Script_Extensions=Malayalam} (Short:
3966 \p{Mlym}; NOT \p{Block=Malayalam}) (126)
3967 \p{Mand} \p{Mandaic} (= \p{Script_Extensions=
3968 Mandaic}) (NOT \p{Block=Mandaic}) (30)
3969 \p{Mandaic} \p{Script_Extensions=Mandaic} (Short:
3970 \p{Mand}; NOT \p{Block=Mandaic}) (30)
3971 \p{Mani} \p{Manichaean} (= \p{Script_Extensions=
3972 Manichaean}) (NOT \p{Block=Manichaean})
3973 (52)
3974 \p{Manichaean} \p{Script_Extensions=Manichaean} (Short:
3975 \p{Mani}; NOT \p{Block=Manichaean}) (52)
3976 \p{Marc} \p{Marchen} (= \p{Script_Extensions=
3977 Marchen}) (NOT \p{Block=Marchen}) (68)
3978 \p{Marchen} \p{Script_Extensions=Marchen} (Short:
3979 \p{Marc}; NOT \p{Block=Marchen}) (68)
3980 \p{Mark} \p{General_Category=Mark} (Short: \p{M})
3981 (2295)
3982 \p{Masaram_Gondi} \p{Script_Extensions=Masaram_Gondi}
3983 (Short: \p{Gonm}; NOT \p{Block=
3984 Masaram_Gondi}) (77)
3985 \p{Math} \p{Math=Y} (2310)
3986 \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all
3987 above-Unicode code points: [\x00-\x20!
3988 \"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-Z
3989 \[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
3990 \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
3991 U+03D3..03D4, U+03D6..03EF,
3992 U+03F2..03F3, U+03F7..0605 ...)
3993 \p{Math: Y*} (Single: \p{Math}) (2310: [+<=>\^\|~\xac
3994 \xb1\xd7\xf7], U+03D0..03D2, U+03D5,
3995 U+03F0..03F1, U+03F4..03F6, U+0606..0608
3996 ...)
3997 X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
3998 \p{Block=
3999 Mathematical_Alphanumeric_Symbols})
4000 (1024)
4001 X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
4002 Mathematical_Operators}) (256)
4003 \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
4004 \p{Sm}) (948)
4005 X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
4006 Mathematical_Alphanumeric_Symbols}
4007 (Short: \p{InMathAlphanum}) (1024)
4008 X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
4009 (Short: \p{InMathOperators}) (256)
4010 X \p{Mayan_Numerals} \p{Block=Mayan_Numerals} (32)
4011 \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
4012 Spacing_Mark}) (443)
4013 \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
4014 Enclosing_Mark}) (13)
4015 \p{Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
4016 \p{Medf}; NOT \p{Block=Medefaidrin}) (91)
4017 \p{Medf} \p{Medefaidrin} (= \p{Script_Extensions=
4018 Medefaidrin}) (NOT \p{Block=
4019 Medefaidrin}) (91)
4020 \p{Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} (Short:
4021 \p{Mtei}; NOT \p{Block=Meetei_Mayek})
4022 (79)
4023 X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
4024 Meetei_Mayek_Extensions}) (32)
4025 X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
4026 (Short: \p{InMeeteiMayekExt}) (32)
4027 \p{Mend} \p{Mende_Kikakui} (= \p{Script_Extensions=
4028 Mende_Kikakui}) (NOT \p{Block=
4029 Mende_Kikakui}) (213)
4030 \p{Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
4031 (Short: \p{Mend}; NOT \p{Block=
4032 Mende_Kikakui}) (213)
4033 \p{Merc} \p{Meroitic_Cursive} (=
4034 \p{Script_Extensions=Meroitic_Cursive})
4035 (NOT \p{Block=Meroitic_Cursive}) (90)
4036 \p{Mero} \p{Meroitic_Hieroglyphs} (=
4037 \p{Script_Extensions=
4038 Meroitic_Hieroglyphs}) (32)
4039 \p{Meroitic_Cursive} \p{Script_Extensions=Meroitic_Cursive}
4040 (Short: \p{Merc}; NOT \p{Block=
4041 Meroitic_Cursive}) (90)
4042 \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
4043 Meroitic_Hieroglyphs} (Short: \p{Mero})
4044 (32)
4045 \p{Miao} \p{Script_Extensions=Miao} (NOT \p{Block=
4046 Miao}) (149)
4047 X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
4048 \p{Block=
4049 Miscellaneous_Symbols_And_Arrows}) (256)
4050 X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
4051 (= \p{Block=
4052 Miscellaneous_Mathematical_Symbols_A})
4053 (48)
4054 X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
4055 (= \p{Block=
4056 Miscellaneous_Mathematical_Symbols_B})
4057 (128)
4058 X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
4059 (= \p{Block=
4060 Miscellaneous_Symbols_And_Pictographs})
4061 (768)
4062 X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
4063 Miscellaneous_Symbols}) (256)
4064 X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
4065 Miscellaneous_Technical}) (256)
4066 X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
4067 Miscellaneous_Mathematical_Symbols_A}
4068 (Short: \p{InMiscMathSymbolsA}) (48)
4069 X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
4070 Miscellaneous_Mathematical_Symbols_B}
4071 (Short: \p{InMiscMathSymbolsB}) (128)
4072 X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
4073 \p{InMiscSymbols}) (256)
4074 X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
4075 Miscellaneous_Symbols_And_Arrows}
4076 (Short: \p{InMiscArrows}) (256)
4077 X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
4078 Miscellaneous_Symbols_And_Pictographs}
4079 (Short: \p{InMiscPictographs}) (768)
4080 X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
4081 (Short: \p{InMiscTechnical}) (256)
4082 \p{Mlym} \p{Malayalam} (= \p{Script_Extensions=
4083 Malayalam}) (NOT \p{Block=Malayalam})
4084 (126)
4085 \p{Mn} \p{Nonspacing_Mark} (=
4086 \p{General_Category=Nonspacing_Mark})
4087 (1839)
4088 \p{Modi} \p{Script_Extensions=Modi} (NOT \p{Block=
4089 Modi}) (89)
4090 \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
4091 (Short: \p{Lm}) (260)
4092 X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
4093 Spacing_Modifier_Letters}) (80)
4094 \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
4095 (Short: \p{Sk}) (123)
4096 X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
4097 \p{Mong} \p{Mongolian} (= \p{Script_Extensions=
4098 Mongolian}) (NOT \p{Block=Mongolian})
4099 (171)
4100 \p{Mongolian} \p{Script_Extensions=Mongolian} (Short:
4101 \p{Mong}; NOT \p{Block=Mongolian}) (171)
4102 X \p{Mongolian_Sup} \p{Mongolian_Supplement} (= \p{Block=
4103 Mongolian_Supplement}) (32)
4104 X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
4105 \p{InMongolianSup}) (32)
4106 \p{Mro} \p{Script_Extensions=Mro} (NOT \p{Block=
4107 Mro}) (43)
4108 \p{Mroo} \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
4109 \p{Block=Mro}) (43)
4110 \p{Mtei} \p{Meetei_Mayek} (= \p{Script_Extensions=
4111 Meetei_Mayek}) (NOT \p{Block=
4112 Meetei_Mayek}) (79)
4113 \p{Mult} \p{Multani} (= \p{Script_Extensions=
4114 Multani}) (NOT \p{Block=Multani}) (48)
4115 \p{Multani} \p{Script_Extensions=Multani} (Short:
4116 \p{Mult}; NOT \p{Block=Multani}) (48)
4117 X \p{Music} \p{Musical_Symbols} (= \p{Block=
4118 Musical_Symbols}) (256)
4119 X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
4120 \p{InMusic}) (256)
4121 \p{Myanmar} \p{Script_Extensions=Myanmar} (Short:
4122 \p{Mymr}; NOT \p{Block=Myanmar}) (224)
4123 X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
4124 Myanmar_Extended_A}) (32)
4125 X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block=
4126 Myanmar_Extended_B}) (32)
4127 X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
4128 \p{InMyanmarExtA}) (32)
4129 X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short:
4130 \p{InMyanmarExtB}) (32)
4131 \p{Mymr} \p{Myanmar} (= \p{Script_Extensions=
4132 Myanmar}) (NOT \p{Block=Myanmar}) (224)
4133 \p{N} \pN \p{Number} (= \p{General_Category=Number})
4134 (1781)
4135 \p{Na=*} \p{Name=*}
4136 \p{Nabataean} \p{Script_Extensions=Nabataean} (Short:
4137 \p{Nbat}; NOT \p{Block=Nabataean}) (40)
4138 \p{Name=*} Combination of Name and Name_Alias
4139 properties; has special loose matching
4140 rules, for which see Unicode UAX #44
4141 \p{Nand} \p{Nandinagari} (= \p{Script_Extensions=
4142 Nandinagari}) (NOT \p{Block=
4143 Nandinagari}) (86)
4144 \p{Nandinagari} \p{Script_Extensions=Nandinagari} (Short:
4145 \p{Nand}; NOT \p{Block=Nandinagari}) (86)
4146 \p{Narb} \p{Old_North_Arabian} (=
4147 \p{Script_Extensions=Old_North_Arabian})
4148 (32)
4149 X \p{NB} \p{No_Block} (= \p{Block=No_Block})
4150 (826_640 plus all above-Unicode code
4151 points)
4152 \p{Nbat} \p{Nabataean} (= \p{Script_Extensions=
4153 Nabataean}) (NOT \p{Block=Nabataean})
4154 (40)
4155 \p{NChar} \p{Noncharacter_Code_Point} (=
4156 \p{Noncharacter_Code_Point=Y}) (66)
4157 \p{NChar: *} \p{Noncharacter_Code_Point: *}
4158 \p{Nd} \p{XPosixDigit} (= \p{General_Category=
4159 Decimal_Number}) (650)
4160 \p{New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
4161 \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
4162 \p{Newa} \p{Script_Extensions=Newa} (NOT \p{Block=
4163 Newa}) (97)
4164 \p{NFC_QC: *} \p{NFC_Quick_Check: *}
4165 \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (111)
4166 \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (111:
4167 U+0300..0304, U+0306..030C, U+030F,
4168 U+0311, U+0313..0314, U+031B ...)
4169 \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
4170 \P{NFC_Quick_Check} NOR \P{NFC_QC})
4171 (1120)
4172 \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
4173 \P{NFC_Quick_Check} NOR \P{NFC_QC})
4174 (1120: U+0340..0341, U+0343..0344,
4175 U+0374, U+037E, U+0387, U+0958..095F ...)
4176 \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
4177 \p{NFC_Quick_Check} NOR \p{NFC_QC})
4178 (1_112_881 plus all above-Unicode code
4179 points)
4180 \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
4181 \p{NFC_Quick_Check} NOR \p{NFC_QC})
4182 (1_112_881 plus all above-Unicode code
4183 points: U+0000..02FF, U+0305,
4184 U+030D..030E, U+0310, U+0312,
4185 U+0315..031A ...)
4186 \p{NFD_QC: *} \p{NFD_Quick_Check: *}
4187 \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
4188 \P{NFD_Quick_Check} NOR \P{NFD_QC})
4189 (13_233)
4190 \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
4191 \P{NFD_Quick_Check} NOR \P{NFD_QC})
4192 (13_233: [\xc0-\xc5\xc7-\xcf\xd1-\xd6
4193 \xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-\xf6
4194 \xf9-\xfd\xff], U+0100..010F,
4195 U+0112..0125, U+0128..0130,
4196 U+0134..0137, U+0139..013E ...)
4197 \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
4198 \p{NFD_Quick_Check} NOR \p{NFD_QC})
4199 (1_100_879 plus all above-Unicode code
4200 points)
4201 \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
4202 \p{NFD_Quick_Check} NOR \p{NFD_QC})
4203 (1_100_879 plus all above-Unicode code
4204 points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
4205 \xdf\xe6\xf0\xf7-\xf8\xfe],
4206 U+0110..0111, U+0126..0127,
4207 U+0131..0133, U+0138, U+013F..0142 ...)
4208 \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
4209 \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (111)
4210 \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (111:
4211 U+0300..0304, U+0306..030C, U+030F,
4212 U+0311, U+0313..0314, U+031B ...)
4213 \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
4214 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4215 (4807)
4216 \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
4217 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4218 (4807: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4219 \xba\xbc-\xbe], U+0132..0133,
4220 U+013F..0140, U+0149, U+017F,
4221 U+01C4..01CC ...)
4222 \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
4223 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4224 (1_109_194 plus all above-Unicode code
4225 points)
4226 \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
4227 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4228 (1_109_194 plus all above-Unicode code
4229 points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4230 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
4231 U+0100..0131, U+0134..013E,
4232 U+0141..0148, U+014A..017E, U+0180..01C3
4233 ...)
4234 \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
4235 \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
4236 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4237 (16_908)
4238 \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
4239 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4240 (16_908: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4241 \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
4242 \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
4243 \xf6\xf9-\xfd\xff], U+0100..010F,
4244 U+0112..0125, U+0128..0130,
4245 U+0132..0137, U+0139..0140 ...)
4246 \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
4247 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4248 (1_097_204 plus all above-Unicode code
4249 points)
4250 \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
4251 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4252 (1_097_204 plus all above-Unicode code
4253 points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4254 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xc6\xd0
4255 \xd7-\xd8\xde-\xdf\xe6\xf0\xf7-\xf8
4256 \xfe], U+0110..0111, U+0126..0127,
4257 U+0131, U+0138, U+0141..0142 ...)
4258 \p{Nko} \p{Script_Extensions=Nko} (NOT \p{Block=
4259 NKo}) (62)
4260 \p{Nkoo} \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
4261 \p{Block=NKo}) (62)
4262 \p{Nl} \p{Letter_Number} (= \p{General_Category=
4263 Letter_Number}) (236)
4264 \p{No} \p{Other_Number} (= \p{General_Category=
4265 Other_Number}) (895)
4266 X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
4267 (826_640 plus all above-Unicode code
4268 points)
4269 \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
4270 \p{NChar}) (66)
4271 \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
4272 (1_114_046 plus all above-Unicode code
4273 points: U+0000..FDCF, U+FDF0..FFFD,
4274 U+10000..1FFFD, U+20000..2FFFD,
4275 U+30000..3FFFD, U+40000..4FFFD ...)
4276 \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
4277 (66: U+FDD0..FDEF, U+FFFE..FFFF,
4278 U+1FFFE..1FFFF, U+2FFFE..2FFFF,
4279 U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
4280 \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
4281 (Short: \p{Mn}) (1839)
4282 \p{Nshu} \p{Nushu} (= \p{Script_Extensions=Nushu})
4283 (NOT \p{Block=Nushu}) (397)
4284 \p{Nt: *} \p{Numeric_Type: *}
4285 \p{Number} \p{General_Category=Number} (Short: \p{N})
4286 (1781)
4287 X \p{Number_Forms} \p{Block=Number_Forms} (64)
4288 \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (650)
4289 \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (650: [0-9],
4290 U+0660..0669, U+06F0..06F9,
4291 U+07C0..07C9, U+0966..096F, U+09E6..09EF
4292 ...)
4293 \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
4294 \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
4295 U+1369..1371, U+19DA, U+2070,
4296 U+2074..2079, U+2080..2089 ...)
4297 \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_250 plus all
4298 above-Unicode code points: [\x00-\x20!
4299 \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4300 \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4301 \xbb\xbf-\xff], U+0100..065F,
4302 U+066A..06EF, U+06FA..07BF,
4303 U+07CA..0965, U+0970..09E5 ...)
4304 \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (1084)
4305 \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1084: [\xbc-\xbe],
4306 U+09F4..09F9, U+0B72..0B77,
4307 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
4308 ...)
4309 T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
4310 T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (83: [0], U+0660,
4311 U+06F0, U+07C0, U+0966, U+09E6 ...)
4312 T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
4313 U+11FD4)
4314 T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
4315 T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
4316 T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
4317 T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
4318 T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
4319 T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
4320 T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
4321 T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
4322 T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
4323 U+0D76, U+A833, U+11FC9..11FCA)
4324 T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
4325 T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
4326 U+11FCB)
4327 T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1: U+2151)
4328 T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
4329 U+0D77, U+215B, U+A834, U+11FCC ...)
4330 T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1: U+2150)
4331 T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
4332 T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
4333 U+12461, U+1ED3D)
4334 T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
4335 U+0D78, U+A835, U+11FCE)
4336 T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
4337 U+11FCF)
4338 T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
4339 U+0B72, U+0D73, U+A830, U+10140 ...)
4340 T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
4341 U+10E7D, U+1245A, U+1245D, U+12465)
4342 T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1: U+215C)
4343 T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1: U+2156)
4344 T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
4345 T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
4346 U+0D74, U+0F2A, U+2CFD, U+A831 ...)
4347 T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
4348 T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1: U+2157)
4349 T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1: U+215D)
4350 T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
4351 U+109FD, U+10E7E, U+1245B, U+1245E ...)
4352 T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
4353 U+0B74, U+0D75, U+A832, U+10178 ...)
4354 T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1: U+2158)
4355 T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
4356 U+1245C)
4357 T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1: U+215E)
4358 T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
4359 T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (140: [1\xb9], U+0661,
4360 U+06F1, U+07C1, U+0967, U+09E7 ...)
4361 T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1: U+0F2B)
4362 T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (139: [2\xb2], U+0662,
4363 U+06F2, U+07C2, U+0968, U+09E8 ...)
4364 T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1: U+0F2C)
4365 T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (140: [3\xb3], U+0663,
4366 U+06F3, U+07C3, U+0969, U+09E9 ...)
4367 T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1: U+0F2D)
4368 T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (131: [4], U+0664,
4369 U+06F4, U+07C4, U+096A, U+09EA ...)
4370 T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1: U+0F2E)
4371 T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (129: [5], U+0665,
4372 U+06F5, U+07C5, U+096B, U+09EB ...)
4373 T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
4374 T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (113: [6], U+0666,
4375 U+06F6, U+07C6, U+096C, U+09EC ...)
4376 T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
4377 T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (112: [7], U+0667,
4378 U+06F7, U+07C7, U+096D, U+09ED ...)
4379 T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
4380 T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (108: [8], U+0668,
4381 U+06F8, U+07C8, U+096E, U+09EE ...)
4382 T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
4383 T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (112: [9], U+0669,
4384 U+06F9, U+07C9, U+096F, U+09EF ...)
4385 T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (62: U+0BF0, U+0D70,
4386 U+1372, U+2169, U+2179, U+2469 ...)
4387 T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (8: U+216A, U+217A,
4388 U+246A, U+247E, U+2492, U+24EB ...)
4389 T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (8: U+216B, U+217B,
4390 U+246B, U+247F, U+2493, U+24EC ...)
4391 T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (6: U+246C, U+2480,
4392 U+2494, U+24ED, U+16E8D, U+1D2ED)
4393 T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (6: U+246D, U+2481,
4394 U+2495, U+24EE, U+16E8E, U+1D2EE)
4395 T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (6: U+246E, U+2482,
4396 U+2496, U+24EF, U+16E8F, U+1D2EF)
4397 T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (7: U+09F9, U+246F,
4398 U+2483, U+2497, U+24F0, U+16E90 ...)
4399 T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (7: U+16EE, U+2470,
4400 U+2484, U+2498, U+24F1, U+16E91 ...)
4401 T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (7: U+16EF, U+2471,
4402 U+2485, U+2499, U+24F2, U+16E92 ...)
4403 T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (7: U+16F0, U+2472,
4404 U+2486, U+249A, U+24F3, U+16E93 ...)
4405 T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (36: U+1373, U+2473,
4406 U+2487, U+249B, U+24F4, U+3039 ...)
4407 T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1: U+3251)
4408 T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1: U+3252)
4409 T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1: U+3253)
4410 T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1: U+3254)
4411 T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1: U+3255)
4412 T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1: U+3256)
4413 T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1: U+3257)
4414 T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1: U+3258)
4415 T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1: U+3259)
4416 T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (19: U+1374, U+303A,
4417 U+324A, U+325A, U+5345, U+10112 ...)
4418 T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1: U+325B)
4419 T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1: U+325C)
4420 T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1: U+325D)
4421 T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1: U+325E)
4422 T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1: U+325F)
4423 T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1: U+32B1)
4424 T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1: U+32B2)
4425 T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1: U+32B3)
4426 T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1: U+32B4)
4427 T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (18: U+1375, U+324B,
4428 U+32B5, U+534C, U+10113, U+102ED ...)
4429 T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1: U+32B6)
4430 T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1: U+32B7)
4431 T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1: U+32B8)
4432 T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1: U+32B9)
4433 T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1: U+32BA)
4434 T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1: U+32BB)
4435 T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1: U+32BC)
4436 T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1: U+32BD)
4437 T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1: U+32BE)
4438 T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (29: U+1376, U+216C,
4439 U+217C, U+2186, U+324C, U+32BF ...)
4440 T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (13: U+1377, U+324D,
4441 U+10115, U+102EF, U+109CE, U+10E6E ...)
4442 T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (13: U+1378, U+324E,
4443 U+10116, U+102F0, U+109CF, U+10E6F ...)
4444 T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (12: U+1379, U+324F,
4445 U+10117, U+102F1, U+10E70, U+11062 ...)
4446 T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (12: U+137A, U+10118,
4447 U+102F2, U+10341, U+10E71, U+11063 ...)
4448 T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (35: U+0BF1, U+0D71,
4449 U+137B, U+216D, U+217D, U+4F70 ...)
4450 T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
4451 U+109D3, U+10E73, U+1EC84, U+1ED14)
4452 T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
4453 U+102F5, U+109D4, U+10E74, U+1EC85 ...)
4454 T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
4455 U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
4456 T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (16: U+216E, U+217E,
4457 U+1011D, U+10145, U+1014C, U+10153 ...)
4458 T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
4459 U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
4460 T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
4461 U+109D8, U+10E78, U+1EC89, U+1ED19)
4462 T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (6: U+10120, U+102FA,
4463 U+109D9, U+10E79, U+1EC8A, U+1ED1A)
4464 T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (7: U+10121, U+102FB,
4465 U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
4466 T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
4467 U+216F, U+217F..2180, U+4EDF, U+5343 ...)
4468 T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
4469 U+1EC8D, U+1ED1D, U+1ED3A)
4470 T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
4471 U+1EC8E, U+1ED1E)
4472 T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
4473 U+1EC8F, U+1ED1F)
4474 T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
4475 U+10146, U+1014E, U+10172, U+109DF ...)
4476 T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
4477 U+1EC91, U+1ED21)
4478 T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
4479 U+1EC92, U+1ED22)
4480 T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
4481 U+1EC93, U+1ED23)
4482 T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
4483 U+1EC94, U+1ED24)
4484 T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
4485 U+137C, U+2182, U+4E07, U+842C, U+1012B,
4486 U+10155 ...)
4487 T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
4488 U+1012C, U+109E5, U+1EC96, U+1ED26)
4489 T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
4490 U+1012D, U+109E6, U+1EC97, U+1ED27)
4491 T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
4492 U+1012E, U+109E7, U+1EC98, U+1ED28)
4493 T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
4494 U+2187, U+1012F, U+10147, U+10156,
4495 U+109E8, U+1EC99 ...)
4496 T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
4497 U+10130, U+109E9, U+1EC9A, U+1ED2A)
4498 T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
4499 U+10131, U+109EA, U+1EC9B, U+1ED2B)
4500 T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
4501 U+10132, U+109EB, U+1EC9C, U+1ED2C)
4502 T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
4503 U+10133, U+109EC, U+1EC9D, U+1ED2D)
4504 T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
4505 U+2188, U+109ED, U+1EC9E, U+1ECA0,
4506 U+1ECB4)
4507 T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
4508 U+109EE, U+1EC9F)
4509 T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
4510 U+12432)
4511 T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
4512 U+109EF)
4513 T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
4514 U+109F0)
4515 T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
4516 U+12433)
4517 T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
4518 U+109F1)
4519 T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
4520 U+109F2)
4521 T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
4522 U+109F3)
4523 T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
4524 U+109F4)
4525 T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
4526 U+109F5)
4527 T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
4528 U+16B5E)
4529 T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
4530 (1: U+1ECA1)
4531 T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
4532 (1: U+1ECA2)
4533 T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
4534 (3: U+4EBF, U+5104, U+16B5F)
4535 T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
4536 10000000000}) (1: U+16B60)
4537 T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
4538 1000000000000}) (2: U+5146, U+16B61)
4539 \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_250 plus all
4540 above-Unicode code points: [\x00-\x20!
4541 \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4542 \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4543 \xbb\xbf-\xff], U+0100..065F,
4544 U+066A..06EF, U+06FA..07BF,
4545 U+07CA..0965, U+0970..09E5 ...)
4546 \p{Nushu} \p{Script_Extensions=Nushu} (Short:
4547 \p{Nshu}; NOT \p{Block=Nushu}) (397)
4548 \p{Nv: *} \p{Numeric_Value: *}
4549 \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
4550 Nyiakeng_Puachue_Hmong} (Short:
4551 \p{Hmnp}; NOT \p{Block=
4552 Nyiakeng_Puachue_Hmong}) (71)
4553 X \p{OCR} \p{Optical_Character_Recognition} (=
4554 \p{Block=Optical_Character_Recognition})
4555 (32)
4556 \p{Ogam} \p{Ogham} (= \p{Script_Extensions=Ogham})
4557 (NOT \p{Block=Ogham}) (29)
4558 \p{Ogham} \p{Script_Extensions=Ogham} (Short:
4559 \p{Ogam}; NOT \p{Block=Ogham}) (29)
4560 \p{Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
4561 \p{Olck}) (48)
4562 \p{Olck} \p{Ol_Chiki} (= \p{Script_Extensions=
4563 Ol_Chiki}) (48)
4564 \p{Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
4565 (Short: \p{Hung}; NOT \p{Block=
4566 Old_Hungarian}) (108)
4567 \p{Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
4568 \p{Ital}; NOT \p{Block=Old_Italic}) (39)
4569 \p{Old_North_Arabian} \p{Script_Extensions=Old_North_Arabian}
4570 (Short: \p{Narb}) (32)
4571 \p{Old_Permic} \p{Script_Extensions=Old_Permic} (Short:
4572 \p{Perm}; NOT \p{Block=Old_Permic}) (44)
4573 \p{Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
4574 \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
4575 \p{Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
4576 \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
4577 \p{Old_South_Arabian} \p{Script_Extensions=Old_South_Arabian}
4578 (Short: \p{Sarb}) (32)
4579 \p{Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
4580 \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
4581 \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
4582 (Short: \p{Ps}) (75)
4583 X \p{Optical_Character_Recognition} \p{Block=
4584 Optical_Character_Recognition} (Short:
4585 \p{InOCR}) (32)
4586 \p{Oriya} \p{Script_Extensions=Oriya} (Short:
4587 \p{Orya}; NOT \p{Block=Oriya}) (97)
4588 \p{Orkh} \p{Old_Turkic} (= \p{Script_Extensions=
4589 Old_Turkic}) (NOT \p{Block=Old_Turkic})
4590 (73)
4591 X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
4592 \p{Orya} \p{Oriya} (= \p{Script_Extensions=Oriya})
4593 (NOT \p{Block=Oriya}) (97)
4594 \p{Osage} \p{Script_Extensions=Osage} (Short:
4595 \p{Osge}; NOT \p{Block=Osage}) (72)
4596 \p{Osge} \p{Osage} (= \p{Script_Extensions=Osage})
4597 (NOT \p{Block=Osage}) (72)
4598 \p{Osma} \p{Osmanya} (= \p{Script_Extensions=
4599 Osmanya}) (NOT \p{Block=Osmanya}) (40)
4600 \p{Osmanya} \p{Script_Extensions=Osmanya} (Short:
4601 \p{Osma}; NOT \p{Block=Osmanya}) (40)
4602 \p{Other} \p{General_Category=Other} (Short: \p{C})
4603 (970_414 plus all above-Unicode code
4604 points)
4605 \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
4606 \p{Lo}) (127_004)
4607 \p{Other_Number} \p{General_Category=Other_Number} (Short:
4608 \p{No}) (895)
4609 \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
4610 (Short: \p{Po}) (593)
4611 \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
4612 \p{So}) (6431)
4613 X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
4614 \p{P} \pP \p{Punct} (= \p{General_Category=
4615 Punctuation}) (NOT
4616 \p{General_Punctuation}) (798)
4617 \p{Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} (Short:
4618 \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
4619 (127)
4620 \p{Palm} \p{Palmyrene} (= \p{Script_Extensions=
4621 Palmyrene}) (32)
4622 \p{Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
4623 \p{Palm}) (32)
4624 \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
4625 (Short: \p{Zp}) (1)
4626 \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
4627 Y}) (2760)
4628 \p{Pat_Syn: *} \p{Pattern_Syntax: *}
4629 \p{Pat_WS} \p{Pattern_White_Space} (=
4630 \p{Pattern_White_Space=Y}) (11)
4631 \p{Pat_WS: *} \p{Pattern_White_Space: *}
4632 \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
4633 (2760)
4634 \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
4635 (1_111_352 plus all above-Unicode code
4636 points: [\x00-\x200-9A-Z_a-z\x7f-\xa0
4637 \xa8\xaa\xad\xaf\xb2-\xb5\xb7-\xba\xbc-
4638 \xbe\xc0-\xd6\xd8-\xf6\xf8-\xff],
4639 U+0100..200F, U+2028..202F,
4640 U+203F..2040, U+2054, U+205F..218F ...)
4641 \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
4642 [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
4643 \^`\{\|\}~\xa1-\xa7\xa9\xab-\xac\xae
4644 \xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
4645 U+2010..2027, U+2030..203E,
4646 U+2041..2053, U+2055..205E, U+2190..245F
4647 ...)
4648 \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
4649 \p{PatWS}) (11)
4650 \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
4651 (1_114_101 plus all above-Unicode code
4652 points: [^\t\n\cK\f\r\x20\x85],
4653 U+0100..200D, U+2010..2027,
4654 U+202A..infinity)
4655 \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
4656 [\t\n\cK\f\r\x20\x85], U+200E..200F,
4657 U+2028..2029)
4658 \p{Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
4659 \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
4660 \p{Pauc} \p{Pau_Cin_Hau} (= \p{Script_Extensions=
4661 Pau_Cin_Hau}) (NOT \p{Block=
4662 Pau_Cin_Hau}) (57)
4663 \p{Pc} \p{Connector_Punctuation} (=
4664 \p{General_Category=
4665 Connector_Punctuation}) (10)
4666 \p{PCM} \p{Prepended_Concatenation_Mark} (=
4667 \p{Prepended_Concatenation_Mark=Y}) (11)
4668 \p{PCM: *} \p{Prepended_Concatenation_Mark: *}
4669 \p{Pd} \p{Dash_Punctuation} (=
4670 \p{General_Category=Dash_Punctuation})
4671 (25)
4672 \p{Pe} \p{Close_Punctuation} (=
4673 \p{General_Category=Close_Punctuation})
4674 (73)
4675 \p{PerlSpace} \p{PosixSpace} (6)
4676 \p{PerlWord} \p{PosixWord} (63)
4677 \p{Perm} \p{Old_Permic} (= \p{Script_Extensions=
4678 Old_Permic}) (NOT \p{Block=Old_Permic})
4679 (44)
4680 \p{Pf} \p{Final_Punctuation} (=
4681 \p{General_Category=Final_Punctuation})
4682 (10)
4683 \p{Phag} \p{Phags_Pa} (= \p{Script_Extensions=
4684 Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
4685 \p{Phags_Pa} \p{Script_Extensions=Phags_Pa} (Short:
4686 \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
4687 X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
4688 Phaistos_Disc}) (48)
4689 X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
4690 \p{InPhaistos}) (48)
4691 \p{Phli} \p{Inscriptional_Pahlavi} (=
4692 \p{Script_Extensions=
4693 Inscriptional_Pahlavi}) (NOT \p{Block=
4694 Inscriptional_Pahlavi}) (27)
4695 \p{Phlp} \p{Psalter_Pahlavi} (=
4696 \p{Script_Extensions=Psalter_Pahlavi})
4697 (NOT \p{Block=Psalter_Pahlavi}) (30)
4698 \p{Phnx} \p{Phoenician} (= \p{Script_Extensions=
4699 Phoenician}) (NOT \p{Block=Phoenician})
4700 (29)
4701 \p{Phoenician} \p{Script_Extensions=Phoenician} (Short:
4702 \p{Phnx}; NOT \p{Block=Phoenician}) (29)
4703 X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
4704 Phonetic_Extensions}) (128)
4705 X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
4706 \p{Block=
4707 Phonetic_Extensions_Supplement}) (64)
4708 X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
4709 \p{InPhoneticExt}) (128)
4710 X \p{Phonetic_Extensions_Supplement} \p{Block=
4711 Phonetic_Extensions_Supplement} (Short:
4712 \p{InPhoneticExtSup}) (64)
4713 \p{Pi} \p{Initial_Punctuation} (=
4714 \p{General_Category=
4715 Initial_Punctuation}) (12)
4716 X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
4717 \p{Plrd} \p{Miao} (= \p{Script_Extensions=Miao})
4718 (NOT \p{Block=Miao}) (149)
4719 \p{Po} \p{Other_Punctuation} (=
4720 \p{General_Category=Other_Punctuation})
4721 (593)
4722 \p{PosixAlnum} (62: [0-9A-Za-z])
4723 \p{PosixAlpha} (52: [A-Za-z])
4724 \p{PosixBlank} (2: [\t\x20])
4725 \p{PosixCntrl} ASCII control characters (33: ACK, BEL,
4726 BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
4727 DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
4728 FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
4729 SOH, STX, SUB, SYN, US, VT)
4730 \p{PosixDigit} (10: [0-9])
4731 \p{PosixGraph} (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
4732 Z\[\\\]\^_`a-z\{\|\}~])
4733 \p{PosixLower} (/i= PosixAlpha) (26: [a-z])
4734 \p{PosixPrint} (95: [\x20-\x7e])
4735 \p{PosixPunct} (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
4736 \[\\\]\^_`\{\|\}~])
4737 \p{PosixSpace} (Short: \p{PerlSpace}) (6: [\t\n\cK\f\r
4738 \x20])
4739 \p{PosixUpper} (/i= PosixAlpha) (26: [A-Z])
4740 \p{PosixWord} \w, restricted to ASCII (Short:
4741 \p{PerlWord}) (63: [0-9A-Z_a-z])
4742 \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
4743 (22)
4744 \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
4745 Y} (Short: \p{PCM}) (11)
4746 \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
4747 (1_114_101 plus all above-Unicode code
4748 points: U+0000..05FF, U+0606..06DC,
4749 U+06DE..070E, U+0710..08E1,
4750 U+08E3..110BC, U+110BE..110CC ...)
4751 \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
4752 (11: U+0600..0605, U+06DD, U+070F,
4753 U+08E2, U+110BD, U+110CD)
4754 T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
4755 extension) (33_979)
4756 T \p{Present_In: 2.0} Code point's usage introduced in version
4757 2.0 or earlier (Short: \p{In=2.0}) (Perl
4758 extension) (178_500: U+0000..01F5,
4759 U+01FA..0217, U+0250..02A8,
4760 U+02B0..02DE, U+02E0..02E9, U+0300..0345
4761 ...)
4762 \p{Present_In: V2_0} \p{Present_In=2.0} (Perl extension)
4763 (178_500)
4764 T \p{Present_In: 2.1} Code point's usage introduced in version
4765 2.1 or earlier (Short: \p{In=2.1}) (Perl
4766 extension) (178_502: U+0000..01F5,
4767 U+01FA..0217, U+0250..02A8,
4768 U+02B0..02DE, U+02E0..02E9, U+0300..0345
4769 ...)
4770 \p{Present_In: V2_1} \p{Present_In=2.1} (Perl extension)
4771 (178_502)
4772 T \p{Present_In: 3.0} Code point's usage introduced in version
4773 3.0 or earlier (Short: \p{In=3.0}) (Perl
4774 extension) (188_809: U+0000..021F,
4775 U+0222..0233, U+0250..02AD,
4776 U+02B0..02EE, U+0300..034E, U+0360..0362
4777 ...)
4778 \p{Present_In: V3_0} \p{Present_In=3.0} (Perl extension)
4779 (188_809)
4780 T \p{Present_In: 3.1} Code point's usage introduced in version
4781 3.1 or earlier (Short: \p{In=3.1}) (Perl
4782 extension) (233_787: U+0000..021F,
4783 U+0222..0233, U+0250..02AD,
4784 U+02B0..02EE, U+0300..034E, U+0360..0362
4785 ...)
4786 \p{Present_In: V3_1} \p{Present_In=3.1} (Perl extension)
4787 (233_787)
4788 T \p{Present_In: 3.2} Code point's usage introduced in version
4789 3.2 or earlier (Short: \p{In=3.2}) (Perl
4790 extension) (234_803: U+0000..0220,
4791 U+0222..0233, U+0250..02AD,
4792 U+02B0..02EE, U+0300..034F, U+0360..036F
4793 ...)
4794 \p{Present_In: V3_2} \p{Present_In=3.2} (Perl extension)
4795 (234_803)
4796 T \p{Present_In: 4.0} Code point's usage introduced in version
4797 4.0 or earlier (Short: \p{In=4.0}) (Perl
4798 extension) (236_029: U+0000..0236,
4799 U+0250..0357, U+035D..036F,
4800 U+0374..0375, U+037A, U+037E ...)
4801 \p{Present_In: V4_0} \p{Present_In=4.0} (Perl extension)
4802 (236_029)
4803 T \p{Present_In: 4.1} Code point's usage introduced in version
4804 4.1 or earlier (Short: \p{In=4.1}) (Perl
4805 extension) (237_302: U+0000..0241,
4806 U+0250..036F, U+0374..0375, U+037A,
4807 U+037E, U+0384..038A ...)
4808 \p{Present_In: V4_1} \p{Present_In=4.1} (Perl extension)
4809 (237_302)
4810 T \p{Present_In: 5.0} Code point's usage introduced in version
4811 5.0 or earlier (Short: \p{In=5.0}) (Perl
4812 extension) (238_671: U+0000..036F,
4813 U+0374..0375, U+037A..037E,
4814 U+0384..038A, U+038C, U+038E..03A1 ...)
4815 \p{Present_In: V5_0} \p{Present_In=5.0} (Perl extension)
4816 (238_671)
4817 T \p{Present_In: 5.1} Code point's usage introduced in version
4818 5.1 or earlier (Short: \p{In=5.1}) (Perl
4819 extension) (240_295: U+0000..0377,
4820 U+037A..037E, U+0384..038A, U+038C,
4821 U+038E..03A1, U+03A3..0523 ...)
4822 \p{Present_In: V5_1} \p{Present_In=5.1} (Perl extension)
4823 (240_295)
4824 T \p{Present_In: 5.2} Code point's usage introduced in version
4825 5.2 or earlier (Short: \p{In=5.2}) (Perl
4826 extension) (246_943: U+0000..0377,
4827 U+037A..037E, U+0384..038A, U+038C,
4828 U+038E..03A1, U+03A3..0525 ...)
4829 \p{Present_In: V5_2} \p{Present_In=5.2} (Perl extension)
4830 (246_943)
4831 T \p{Present_In: 6.0} Code point's usage introduced in version
4832 6.0 or earlier (Short: \p{In=6.0}) (Perl
4833 extension) (249_031: U+0000..0377,
4834 U+037A..037E, U+0384..038A, U+038C,
4835 U+038E..03A1, U+03A3..0527 ...)
4836 \p{Present_In: V6_0} \p{Present_In=6.0} (Perl extension)
4837 (249_031)
4838 T \p{Present_In: 6.1} Code point's usage introduced in version
4839 6.1 or earlier (Short: \p{In=6.1}) (Perl
4840 extension) (249_763: U+0000..0377,
4841 U+037A..037E, U+0384..038A, U+038C,
4842 U+038E..03A1, U+03A3..0527 ...)
4843 \p{Present_In: V6_1} \p{Present_In=6.1} (Perl extension)
4844 (249_763)
4845 T \p{Present_In: 6.2} Code point's usage introduced in version
4846 6.2 or earlier (Short: \p{In=6.2}) (Perl
4847 extension) (249_764: U+0000..0377,
4848 U+037A..037E, U+0384..038A, U+038C,
4849 U+038E..03A1, U+03A3..0527 ...)
4850 \p{Present_In: V6_2} \p{Present_In=6.2} (Perl extension)
4851 (249_764)
4852 T \p{Present_In: 6.3} Code point's usage introduced in version
4853 6.3 or earlier (Short: \p{In=6.3}) (Perl
4854 extension) (249_769: U+0000..0377,
4855 U+037A..037E, U+0384..038A, U+038C,
4856 U+038E..03A1, U+03A3..0527 ...)
4857 \p{Present_In: V6_3} \p{Present_In=6.3} (Perl extension)
4858 (249_769)
4859 T \p{Present_In: 7.0} Code point's usage introduced in version
4860 7.0 or earlier (Short: \p{In=7.0}) (Perl
4861 extension) (252_603: U+0000..0377,
4862 U+037A..037F, U+0384..038A, U+038C,
4863 U+038E..03A1, U+03A3..052F ...)
4864 \p{Present_In: V7_0} \p{Present_In=7.0} (Perl extension)
4865 (252_603)
4866 T \p{Present_In: 8.0} Code point's usage introduced in version
4867 8.0 or earlier (Short: \p{In=8.0}) (Perl
4868 extension) (260_319: U+0000..0377,
4869 U+037A..037F, U+0384..038A, U+038C,
4870 U+038E..03A1, U+03A3..052F ...)
4871 \p{Present_In: V8_0} \p{Present_In=8.0} (Perl extension)
4872 (260_319)
4873 T \p{Present_In: 9.0} Code point's usage introduced in version
4874 9.0 or earlier (Short: \p{In=9.0}) (Perl
4875 extension) (267_819: U+0000..0377,
4876 U+037A..037F, U+0384..038A, U+038C,
4877 U+038E..03A1, U+03A3..052F ...)
4878 \p{Present_In: V9_0} \p{Present_In=9.0} (Perl extension)
4879 (267_819)
4880 T \p{Present_In: 10.0} Code point's usage introduced in version
4881 10.0 or earlier (Short: \p{In=10.0})
4882 (Perl extension) (276_337: U+0000..0377,
4883 U+037A..037F, U+0384..038A, U+038C,
4884 U+038E..03A1, U+03A3..052F ...)
4885 \p{Present_In: V10_0} \p{Present_In=10.0} (Perl extension)
4886 (276_337)
4887 T \p{Present_In: 11.0} Code point's usage introduced in version
4888 11.0 or earlier (Short: \p{In=11.0})
4889 (Perl extension) (277_021: U+0000..0377,
4890 U+037A..037F, U+0384..038A, U+038C,
4891 U+038E..03A1, U+03A3..052F ...)
4892 \p{Present_In: V11_0} \p{Present_In=11.0} (Perl extension)
4893 (277_021)
4894 T \p{Present_In: 12.0} Code point's usage introduced in version
4895 12.0 or earlier (Short: \p{In=12.0})
4896 (Perl extension) (277_575: U+0000..0377,
4897 U+037A..037F, U+0384..038A, U+038C,
4898 U+038E..03A1, U+03A3..052F ...)
4899 \p{Present_In: V12_0} \p{Present_In=12.0} (Perl extension)
4900 (277_575)
4901 T \p{Present_In: 12.1} Code point's usage introduced in version
4902 12.1 or earlier (Short: \p{In=12.1})
4903 (Perl extension) (277_576: U+0000..0377,
4904 U+037A..037F, U+0384..038A, U+038C,
4905 U+038E..03A1, U+03A3..052F ...)
4906 \p{Present_In: V12_1} \p{Present_In=12.1} (Perl extension)
4907 (277_576)
4908 T \p{Present_In: 13.0} Code point's usage introduced in version
4909 13.0 or earlier (Short: \p{In=13.0})
4910 (Perl extension) (283_506: U+0000..0377,
4911 U+037A..037F, U+0384..038A, U+038C,
4912 U+038E..03A1, U+03A3..052F ...)
4913 \p{Present_In: V13_0} \p{Present_In=13.0} (Perl extension)
4914 (283_506)
4915 \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
4916 Unassigned}) (Perl extension) (830_606
4917 plus all above-Unicode code points)
4918 \p{Print} \p{XPosixPrint} (281_325)
4919 \p{Private_Use} \p{General_Category=Private_Use} (Short:
4920 \p{Co}; NOT \p{Private_Use_Area})
4921 (137_468)
4922 X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
4923 \p{InPUA}) (6400)
4924 \p{Prti} \p{Inscriptional_Parthian} (=
4925 \p{Script_Extensions=
4926 Inscriptional_Parthian}) (NOT \p{Block=
4927 Inscriptional_Parthian}) (30)
4928 \p{Ps} \p{Open_Punctuation} (=
4929 \p{General_Category=Open_Punctuation})
4930 (75)
4931 \p{Psalter_Pahlavi} \p{Script_Extensions=Psalter_Pahlavi}
4932 (Short: \p{Phlp}; NOT \p{Block=
4933 Psalter_Pahlavi}) (30)
4934 X \p{PUA} \p{Private_Use_Area} (= \p{Block=
4935 Private_Use_Area}) (6400)
4936 \p{Punct} \p{General_Category=Punctuation} (Short:
4937 \p{P}; NOT \p{General_Punctuation}) (798)
4938 \p{Punctuation} \p{Punct} (= \p{General_Category=
4939 Punctuation}) (NOT
4940 \p{General_Punctuation}) (798)
4941 \p{Qaac} \p{Coptic} (= \p{Script_Extensions=
4942 Coptic}) (NOT \p{Block=Coptic}) (165)
4943 \p{Qaai} \p{Inherited} (= \p{Script_Extensions=
4944 Inherited}) (503)
4945 \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
4946 Y}) (30)
4947 \p{QMark: *} \p{Quotation_Mark: *}
4948 \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
4949 (30)
4950 \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_082
4951 plus all above-Unicode code points:
4952 [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=>?
4953 \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xaa\xac-
4954 \xba\xbc-\xff], U+0100..2017,
4955 U+2020..2038, U+203B..2E41,
4956 U+2E43..300B, U+3010..301C ...)
4957 \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30: [\"
4958 \'\xab\xbb], U+2018..201F, U+2039..203A,
4959 U+2E42, U+300C..300F, U+301D..301F ...)
4960 \p{Radical} \p{Radical=Y} (329)
4961 \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all
4962 above-Unicode code points: U+0000..2E7F,
4963 U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
4964 \p{Radical: Y*} (Single: \p{Radical}) (329: U+2E80..2E99,
4965 U+2E9B..2EF3, U+2F00..2FD5)
4966 \p{Regional_Indicator} \p{Regional_Indicator=Y} (Short: \p{RI})
4967 (26)
4968 \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
4969 plus all above-Unicode code points:
4970 U+0000..1F1E5, U+1F200..infinity)
4971 \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
4972 U+1F1E6..1F1FF)
4973 \p{Rejang} \p{Script_Extensions=Rejang} (Short:
4974 \p{Rjng}; NOT \p{Block=Rejang}) (37)
4975 \p{RI} \p{Regional_Indicator} (=
4976 \p{Regional_Indicator=Y}) (26)
4977 \p{RI: *} \p{Regional_Indicator: *}
4978 \p{Rjng} \p{Rejang} (= \p{Script_Extensions=
4979 Rejang}) (NOT \p{Block=Rejang}) (37)
4980 \p{Rohg} \p{Hanifi_Rohingya} (=
4981 \p{Script_Extensions=Hanifi_Rohingya})
4982 (NOT \p{Block=Hanifi_Rohingya}) (55)
4983 X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
4984 Rumi_Numeral_Symbols}) (32)
4985 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
4986 \p{InRumi}) (32)
4987 \p{Runic} \p{Script_Extensions=Runic} (Short:
4988 \p{Runr}; NOT \p{Block=Runic}) (86)
4989 \p{Runr} \p{Runic} (= \p{Script_Extensions=Runic})
4990 (NOT \p{Block=Runic}) (86)
4991 \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol})
4992 (7564)
4993 \p{Samaritan} \p{Script_Extensions=Samaritan} (Short:
4994 \p{Samr}; NOT \p{Block=Samaritan}) (61)
4995 \p{Samr} \p{Samaritan} (= \p{Script_Extensions=
4996 Samaritan}) (NOT \p{Block=Samaritan})
4997 (61)
4998 \p{Sarb} \p{Old_South_Arabian} (=
4999 \p{Script_Extensions=Old_South_Arabian})
5000 (32)
5001 \p{Saur} \p{Saurashtra} (= \p{Script_Extensions=
5002 Saurashtra}) (NOT \p{Block=Saurashtra})
5003 (82)
5004 \p{Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
5005 \p{Saur}; NOT \p{Block=Saurashtra}) (82)
5006 \p{SB: *} \p{Sentence_Break: *}
5007 \p{Sc} \p{Currency_Symbol} (=
5008 \p{General_Category=Currency_Symbol})
5009 (62)
5010 \p{Sc: *} \p{Script: *}
5011 \p{Script: Adlam} (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B,
5012 U+1E950..1E959, U+1E95E..1E95F)
5013 \p{Script: Adlm} \p{Script=Adlam} (88)
5014 \p{Script: Aghb} \p{Script=Caucasian_Albanian} (=
5015 \p{Script_Extensions=
5016 Caucasian_Albanian}) (53)
5017 \p{Script: Ahom} \p{Script_Extensions=Ahom} (Short: \p{Sc=
5018 Ahom}, \p{Ahom}) (58)
5019 \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
5020 Anatolian_Hieroglyphs} (Short: \p{Sc=
5021 Hluw}, \p{Hluw}) (583)
5022 \p{Script: Arab} \p{Script=Arabic} (1291)
5023 \p{Script: Arabic} (Short: \p{Sc=Arab}) (1291: U+0600..0604,
5024 U+0606..060B, U+060D..061A, U+061C,
5025 U+061E, U+0620..063F ...)
5026 \p{Script: Armenian} \p{Script_Extensions=Armenian} (Short:
5027 \p{Sc=Armn}, \p{Armn}) (96)
5028 \p{Script: Armi} \p{Script=Imperial_Aramaic} (=
5029 \p{Script_Extensions=Imperial_Aramaic})
5030 (31)
5031 \p{Script: Armn} \p{Script=Armenian} (=
5032 \p{Script_Extensions=Armenian}) (96)
5033 \p{Script: Avestan} \p{Script_Extensions=Avestan} (Short:
5034 \p{Sc=Avst}, \p{Avst}) (61)
5035 \p{Script: Avst} \p{Script=Avestan} (=
5036 \p{Script_Extensions=Avestan}) (61)
5037 \p{Script: Bali} \p{Script=Balinese} (=
5038 \p{Script_Extensions=Balinese}) (121)
5039 \p{Script: Balinese} \p{Script_Extensions=Balinese} (Short:
5040 \p{Sc=Bali}, \p{Bali}) (121)
5041 \p{Script: Bamu} \p{Script=Bamum} (= \p{Script_Extensions=
5042 Bamum}) (657)
5043 \p{Script: Bamum} \p{Script_Extensions=Bamum} (Short: \p{Sc=
5044 Bamu}, \p{Bamu}) (657)
5045 \p{Script: Bass} \p{Script=Bassa_Vah} (=
5046 \p{Script_Extensions=Bassa_Vah}) (36)
5047 \p{Script: Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
5048 \p{Sc=Bass}, \p{Bass}) (36)
5049 \p{Script: Batak} \p{Script_Extensions=Batak} (Short: \p{Sc=
5050 Batk}, \p{Batk}) (56)
5051 \p{Script: Batk} \p{Script=Batak} (= \p{Script_Extensions=
5052 Batak}) (56)
5053 \p{Script: Beng} \p{Script=Bengali} (96)
5054 \p{Script: Bengali} (Short: \p{Sc=Beng}) (96: U+0980..0983,
5055 U+0985..098C, U+098F..0990,
5056 U+0993..09A8, U+09AA..09B0, U+09B2 ...)
5057 \p{Script: Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
5058 \p{Sc=Bhks}, \p{Bhks}) (97)
5059 \p{Script: Bhks} \p{Script=Bhaiksuki} (=
5060 \p{Script_Extensions=Bhaiksuki}) (97)
5061 \p{Script: Bopo} \p{Script=Bopomofo} (77)
5062 \p{Script: Bopomofo} (Short: \p{Sc=Bopo}) (77: U+02EA..02EB,
5063 U+3105..312F, U+31A0..31BF)
5064 \p{Script: Brah} \p{Script=Brahmi} (= \p{Script_Extensions=
5065 Brahmi}) (109)
5066 \p{Script: Brahmi} \p{Script_Extensions=Brahmi} (Short:
5067 \p{Sc=Brah}, \p{Brah}) (109)
5068 \p{Script: Brai} \p{Script=Braille} (=
5069 \p{Script_Extensions=Braille}) (256)
5070 \p{Script: Braille} \p{Script_Extensions=Braille} (Short:
5071 \p{Sc=Brai}, \p{Brai}) (256)
5072 \p{Script: Bugi} \p{Script=Buginese} (30)
5073 \p{Script: Buginese} (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
5074 U+1A1E..1A1F)
5075 \p{Script: Buhd} \p{Script=Buhid} (20)
5076 \p{Script: Buhid} (Short: \p{Sc=Buhd}) (20: U+1740..1753)
5077 \p{Script: Cakm} \p{Script=Chakma} (71)
5078 \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
5079 Canadian_Aboriginal} (Short: \p{Sc=
5080 Cans}, \p{Cans}) (710)
5081 \p{Script: Cans} \p{Script=Canadian_Aboriginal} (=
5082 \p{Script_Extensions=
5083 Canadian_Aboriginal}) (710)
5084 \p{Script: Cari} \p{Script=Carian} (= \p{Script_Extensions=
5085 Carian}) (49)
5086 \p{Script: Carian} \p{Script_Extensions=Carian} (Short:
5087 \p{Sc=Cari}, \p{Cari}) (49)
5088 \p{Script: Caucasian_Albanian} \p{Script_Extensions=
5089 Caucasian_Albanian} (Short: \p{Sc=Aghb},
5090 \p{Aghb}) (53)
5091 \p{Script: Chakma} (Short: \p{Sc=Cakm}) (71: U+11100..11134,
5092 U+11136..11147)
5093 \p{Script: Cham} \p{Script_Extensions=Cham} (Short: \p{Sc=
5094 Cham}, \p{Cham}) (83)
5095 \p{Script: Cher} \p{Script=Cherokee} (=
5096 \p{Script_Extensions=Cherokee}) (172)
5097 \p{Script: Cherokee} \p{Script_Extensions=Cherokee} (Short:
5098 \p{Sc=Cher}, \p{Cher}) (172)
5099 \p{Script: Chorasmian} \p{Script_Extensions=Chorasmian} (Short:
5100 \p{Sc=Chrs}, \p{Chrs}) (28)
5101 \p{Script: Chrs} \p{Script=Chorasmian} (=
5102 \p{Script_Extensions=Chorasmian}) (28)
5103 \p{Script: Common} (Short: \p{Sc=Zyyy}) (8087: [\x00-\x20!
5104 \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
5105 \^_`\{\|\}~\x7f-\xa9\xab-\xb9\xbb-\xbf
5106 \xd7\xf7], U+02B9..02DF, U+02E5..02E9,
5107 U+02EC..02FF, U+0374, U+037E ...)
5108 \p{Script: Copt} \p{Script=Coptic} (137)
5109 \p{Script: Coptic} (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
5110 U+2C80..2CF3, U+2CF9..2CFF)
5111 \p{Script: Cprt} \p{Script=Cypriot} (55)
5112 \p{Script: Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
5113 \p{Sc=Xsux}, \p{Xsux}) (1234)
5114 \p{Script: Cypriot} (Short: \p{Sc=Cprt}) (55: U+10800..10805,
5115 U+10808, U+1080A..10835, U+10837..10838,
5116 U+1083C, U+1083F)
5117 \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}) (443: U+0400..0484,
5118 U+0487..052F, U+1C80..1C88, U+1D2B,
5119 U+1D78, U+2DE0..2DFF ...)
5120 \p{Script: Cyrl} \p{Script=Cyrillic} (443)
5121 \p{Script: Deseret} \p{Script_Extensions=Deseret} (Short:
5122 \p{Sc=Dsrt}, \p{Dsrt}) (80)
5123 \p{Script: Deva} \p{Script=Devanagari} (154)
5124 \p{Script: Devanagari} (Short: \p{Sc=Deva}) (154: U+0900..0950,
5125 U+0955..0963, U+0966..097F, U+A8E0..A8FF)
5126 \p{Script: Diak} \p{Script=Dives_Akuru} (=
5127 \p{Script_Extensions=Dives_Akuru}) (72)
5128 \p{Script: Dives_Akuru} \p{Script_Extensions=Dives_Akuru} (Short:
5129 \p{Sc=Diak}, \p{Diak}) (72)
5130 \p{Script: Dogr} \p{Script=Dogra} (60)
5131 \p{Script: Dogra} (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
5132 \p{Script: Dsrt} \p{Script=Deseret} (=
5133 \p{Script_Extensions=Deseret}) (80)
5134 \p{Script: Dupl} \p{Script=Duployan} (143)
5135 \p{Script: Duployan} (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
5136 U+1BC70..1BC7C, U+1BC80..1BC88,
5137 U+1BC90..1BC99, U+1BC9C..1BC9F)
5138 \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (=
5139 \p{Script_Extensions=
5140 Egyptian_Hieroglyphs}) (1080)
5141 \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
5142 Egyptian_Hieroglyphs} (Short: \p{Sc=
5143 Egyp}, \p{Egyp}) (1080)
5144 \p{Script: Elba} \p{Script=Elbasan} (=
5145 \p{Script_Extensions=Elbasan}) (40)
5146 \p{Script: Elbasan} \p{Script_Extensions=Elbasan} (Short:
5147 \p{Sc=Elba}, \p{Elba}) (40)
5148 \p{Script: Elym} \p{Script=Elymaic} (=
5149 \p{Script_Extensions=Elymaic}) (23)
5150 \p{Script: Elymaic} \p{Script_Extensions=Elymaic} (Short:
5151 \p{Sc=Elym}, \p{Elym}) (23)
5152 \p{Script: Ethi} \p{Script=Ethiopic} (=
5153 \p{Script_Extensions=Ethiopic}) (495)
5154 \p{Script: Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
5155 \p{Sc=Ethi}, \p{Ethi}) (495)
5156 \p{Script: Geor} \p{Script=Georgian} (173)
5157 \p{Script: Georgian} (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
5158 U+10C7, U+10CD, U+10D0..10FA,
5159 U+10FC..10FF, U+1C90..1CBA ...)
5160 \p{Script: Glag} \p{Script=Glagolitic} (132)
5161 \p{Script: Glagolitic} (Short: \p{Sc=Glag}) (132: U+2C00..2C2E,
5162 U+2C30..2C5E, U+1E000..1E006,
5163 U+1E008..1E018, U+1E01B..1E021,
5164 U+1E023..1E024 ...)
5165 \p{Script: Gong} \p{Script=Gunjala_Gondi} (63)
5166 \p{Script: Gonm} \p{Script=Masaram_Gondi} (75)
5167 \p{Script: Goth} \p{Script=Gothic} (= \p{Script_Extensions=
5168 Gothic}) (27)
5169 \p{Script: Gothic} \p{Script_Extensions=Gothic} (Short:
5170 \p{Sc=Goth}, \p{Goth}) (27)
5171 \p{Script: Gran} \p{Script=Grantha} (85)
5172 \p{Script: Grantha} (Short: \p{Sc=Gran}) (85: U+11300..11303,
5173 U+11305..1130C, U+1130F..11310,
5174 U+11313..11328, U+1132A..11330,
5175 U+11332..11333 ...)
5176 \p{Script: Greek} (Short: \p{Sc=Grek}) (518: U+0370..0373,
5177 U+0375..0377, U+037A..037D, U+037F,
5178 U+0384, U+0386 ...)
5179 \p{Script: Grek} \p{Script=Greek} (518)
5180 \p{Script: Gujarati} (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
5181 U+0A85..0A8D, U+0A8F..0A91,
5182 U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
5183 ...)
5184 \p{Script: Gujr} \p{Script=Gujarati} (91)
5185 \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
5186 U+11D60..11D65, U+11D67..11D68,
5187 U+11D6A..11D8E, U+11D90..11D91,
5188 U+11D93..11D98, U+11DA0..11DA9)
5189 \p{Script: Gurmukhi} (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
5190 U+0A05..0A0A, U+0A0F..0A10,
5191 U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
5192 ...)
5193 \p{Script: Guru} \p{Script=Gurmukhi} (80)
5194 \p{Script: Han} (Short: \p{Sc=Han}) (94_204: U+2E80..2E99,
5195 U+2E9B..2EF3, U+2F00..2FD5, U+3005,
5196 U+3007, U+3021..3029 ...)
5197 \p{Script: Hang} \p{Script=Hangul} (11_739)
5198 \p{Script: Hangul} (Short: \p{Sc=Hang}) (11_739:
5199 U+1100..11FF, U+302E..302F,
5200 U+3131..318E, U+3200..321E,
5201 U+3260..327E, U+A960..A97C ...)
5202 \p{Script: Hani} \p{Script=Han} (94_204)
5203 \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
5204 U+10D00..10D27, U+10D30..10D39)
5205 \p{Script: Hano} \p{Script=Hanunoo} (21)
5206 \p{Script: Hanunoo} (Short: \p{Sc=Hano}) (21: U+1720..1734)
5207 \p{Script: Hatr} \p{Script=Hatran} (= \p{Script_Extensions=
5208 Hatran}) (26)
5209 \p{Script: Hatran} \p{Script_Extensions=Hatran} (Short:
5210 \p{Sc=Hatr}, \p{Hatr}) (26)
5211 \p{Script: Hebr} \p{Script=Hebrew} (= \p{Script_Extensions=
5212 Hebrew}) (134)
5213 \p{Script: Hebrew} \p{Script_Extensions=Hebrew} (Short:
5214 \p{Sc=Hebr}, \p{Hebr}) (134)
5215 \p{Script: Hira} \p{Script=Hiragana} (379)
5216 \p{Script: Hiragana} (Short: \p{Sc=Hira}) (379: U+3041..3096,
5217 U+309D..309F, U+1B001..1B11E,
5218 U+1B150..1B152, U+1F200)
5219 \p{Script: Hluw} \p{Script=Anatolian_Hieroglyphs} (=
5220 \p{Script_Extensions=
5221 Anatolian_Hieroglyphs}) (583)
5222 \p{Script: Hmng} \p{Script=Pahawh_Hmong} (=
5223 \p{Script_Extensions=Pahawh_Hmong}) (127)
5224 \p{Script: Hmnp} \p{Script=Nyiakeng_Puachue_Hmong} (=
5225 \p{Script_Extensions=
5226 Nyiakeng_Puachue_Hmong}) (71)
5227 \p{Script: Hung} \p{Script=Old_Hungarian} (=
5228 \p{Script_Extensions=Old_Hungarian})
5229 (108)
5230 \p{Script: Imperial_Aramaic} \p{Script_Extensions=
5231 Imperial_Aramaic} (Short: \p{Sc=Armi},
5232 \p{Armi}) (31)
5233 \p{Script: Inherited} (Short: \p{Sc=Zinh}) (573: U+0300..036F,
5234 U+0485..0486, U+064B..0655, U+0670,
5235 U+0951..0954, U+1AB0..1AC0 ...)
5236 \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
5237 Inscriptional_Pahlavi} (Short: \p{Sc=
5238 Phli}, \p{Phli}) (27)
5239 \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
5240 Inscriptional_Parthian} (Short: \p{Sc=
5241 Prti}, \p{Prti}) (30)
5242 \p{Script: Ital} \p{Script=Old_Italic} (=
5243 \p{Script_Extensions=Old_Italic}) (39)
5244 \p{Script: Java} \p{Script=Javanese} (90)
5245 \p{Script: Javanese} (Short: \p{Sc=Java}) (90: U+A980..A9CD,
5246 U+A9D0..A9D9, U+A9DE..A9DF)
5247 \p{Script: Kaithi} (Short: \p{Sc=Kthi}) (67: U+11080..110C1,
5248 U+110CD)
5249 \p{Script: Kali} \p{Script=Kayah_Li} (47)
5250 \p{Script: Kana} \p{Script=Katakana} (304)
5251 \p{Script: Kannada} (Short: \p{Sc=Knda}) (89: U+0C80..0C8C,
5252 U+0C8E..0C90, U+0C92..0CA8,
5253 U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
5254 ...)
5255 \p{Script: Katakana} (Short: \p{Sc=Kana}) (304: U+30A1..30FA,
5256 U+30FD..30FF, U+31F0..31FF,
5257 U+32D0..32FE, U+3300..3357, U+FF66..FF6F
5258 ...)
5259 \p{Script: Kayah_Li} (Short: \p{Sc=Kali}) (47: U+A900..A92D,
5260 U+A92F)
5261 \p{Script: Khar} \p{Script=Kharoshthi} (=
5262 \p{Script_Extensions=Kharoshthi}) (68)
5263 \p{Script: Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
5264 \p{Sc=Khar}, \p{Khar}) (68)
5265 \p{Script: Khitan_Small_Script} \p{Script_Extensions=
5266 Khitan_Small_Script} (Short: \p{Sc=
5267 Kits}, \p{Kits}) (471)
5268 \p{Script: Khmer} \p{Script_Extensions=Khmer} (Short: \p{Sc=
5269 Khmr}, \p{Khmr}) (146)
5270 \p{Script: Khmr} \p{Script=Khmer} (= \p{Script_Extensions=
5271 Khmer}) (146)
5272 \p{Script: Khoj} \p{Script=Khojki} (62)
5273 \p{Script: Khojki} (Short: \p{Sc=Khoj}) (62: U+11200..11211,
5274 U+11213..1123E)
5275 \p{Script: Khudawadi} (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
5276 U+112F0..112F9)
5277 \p{Script: Kits} \p{Script=Khitan_Small_Script} (=
5278 \p{Script_Extensions=
5279 Khitan_Small_Script}) (471)
5280 \p{Script: Knda} \p{Script=Kannada} (89)
5281 \p{Script: Kthi} \p{Script=Kaithi} (67)
5282 \p{Script: Lana} \p{Script=Tai_Tham} (=
5283 \p{Script_Extensions=Tai_Tham}) (127)
5284 \p{Script: Lao} \p{Script_Extensions=Lao} (Short: \p{Sc=
5285 Lao}, \p{Lao}) (82)
5286 \p{Script: Laoo} \p{Script=Lao} (= \p{Script_Extensions=
5287 Lao}) (82)
5288 \p{Script: Latin} (Short: \p{Sc=Latn}) (1374: [A-Za-z\xaa
5289 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
5290 U+0100..02B8, U+02E0..02E4,
5291 U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
5292 ...)
5293 \p{Script: Latn} \p{Script=Latin} (1374)
5294 \p{Script: Lepc} \p{Script=Lepcha} (= \p{Script_Extensions=
5295 Lepcha}) (74)
5296 \p{Script: Lepcha} \p{Script_Extensions=Lepcha} (Short:
5297 \p{Sc=Lepc}, \p{Lepc}) (74)
5298 \p{Script: Limb} \p{Script=Limbu} (68)
5299 \p{Script: Limbu} (Short: \p{Sc=Limb}) (68: U+1900..191E,
5300 U+1920..192B, U+1930..193B, U+1940,
5301 U+1944..194F)
5302 \p{Script: Lina} \p{Script=Linear_A} (341)
5303 \p{Script: Linb} \p{Script=Linear_B} (211)
5304 \p{Script: Linear_A} (Short: \p{Sc=Lina}) (341: U+10600..10736,
5305 U+10740..10755, U+10760..10767)
5306 \p{Script: Linear_B} (Short: \p{Sc=Linb}) (211: U+10000..1000B,
5307 U+1000D..10026, U+10028..1003A,
5308 U+1003C..1003D, U+1003F..1004D,
5309 U+10050..1005D ...)
5310 \p{Script: Lisu} \p{Script_Extensions=Lisu} (Short: \p{Sc=
5311 Lisu}, \p{Lisu}) (49)
5312 \p{Script: Lyci} \p{Script=Lycian} (= \p{Script_Extensions=
5313 Lycian}) (29)
5314 \p{Script: Lycian} \p{Script_Extensions=Lycian} (Short:
5315 \p{Sc=Lyci}, \p{Lyci}) (29)
5316 \p{Script: Lydi} \p{Script=Lydian} (= \p{Script_Extensions=
5317 Lydian}) (27)
5318 \p{Script: Lydian} \p{Script_Extensions=Lydian} (Short:
5319 \p{Sc=Lydi}, \p{Lydi}) (27)
5320 \p{Script: Mahajani} (Short: \p{Sc=Mahj}) (39: U+11150..11176)
5321 \p{Script: Mahj} \p{Script=Mahajani} (39)
5322 \p{Script: Maka} \p{Script=Makasar} (=
5323 \p{Script_Extensions=Makasar}) (25)
5324 \p{Script: Makasar} \p{Script_Extensions=Makasar} (Short:
5325 \p{Sc=Maka}, \p{Maka}) (25)
5326 \p{Script: Malayalam} (Short: \p{Sc=Mlym}) (118: U+0D00..0D0C,
5327 U+0D0E..0D10, U+0D12..0D44,
5328 U+0D46..0D48, U+0D4A..0D4F, U+0D54..0D63
5329 ...)
5330 \p{Script: Mand} \p{Script=Mandaic} (29)
5331 \p{Script: Mandaic} (Short: \p{Sc=Mand}) (29: U+0840..085B,
5332 U+085E)
5333 \p{Script: Mani} \p{Script=Manichaean} (51)
5334 \p{Script: Manichaean} (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
5335 U+10AEB..10AF6)
5336 \p{Script: Marc} \p{Script=Marchen} (=
5337 \p{Script_Extensions=Marchen}) (68)
5338 \p{Script: Marchen} \p{Script_Extensions=Marchen} (Short:
5339 \p{Sc=Marc}, \p{Marc}) (68)
5340 \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
5341 U+11D00..11D06, U+11D08..11D09,
5342 U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
5343 U+11D3F..11D47 ...)
5344 \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
5345 \p{Sc=Medf}, \p{Medf}) (91)
5346 \p{Script: Medf} \p{Script=Medefaidrin} (=
5347 \p{Script_Extensions=Medefaidrin}) (91)
5348 \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
5349 (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
5350 \p{Script: Mend} \p{Script=Mende_Kikakui} (=
5351 \p{Script_Extensions=Mende_Kikakui})
5352 (213)
5353 \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
5354 (Short: \p{Sc=Mend}, \p{Mend}) (213)
5355 \p{Script: Merc} \p{Script=Meroitic_Cursive} (=
5356 \p{Script_Extensions=Meroitic_Cursive})
5357 (90)
5358 \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (=
5359 \p{Script_Extensions=
5360 Meroitic_Hieroglyphs}) (32)
5361 \p{Script: Meroitic_Cursive} \p{Script_Extensions=
5362 Meroitic_Cursive} (Short: \p{Sc=Merc},
5363 \p{Merc}) (90)
5364 \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
5365 Meroitic_Hieroglyphs} (Short: \p{Sc=
5366 Mero}, \p{Mero}) (32)
5367 \p{Script: Miao} \p{Script_Extensions=Miao} (Short: \p{Sc=
5368 Miao}, \p{Miao}) (149)
5369 \p{Script: Mlym} \p{Script=Malayalam} (118)
5370 \p{Script: Modi} (Short: \p{Sc=Modi}) (79: U+11600..11644,
5371 U+11650..11659)
5372 \p{Script: Mong} \p{Script=Mongolian} (167)
5373 \p{Script: Mongolian} (Short: \p{Sc=Mong}) (167: U+1800..1801,
5374 U+1804, U+1806..180E, U+1810..1819,
5375 U+1820..1878, U+1880..18AA ...)
5376 \p{Script: Mro} \p{Script_Extensions=Mro} (Short: \p{Sc=
5377 Mro}, \p{Mro}) (43)
5378 \p{Script: Mroo} \p{Script=Mro} (= \p{Script_Extensions=
5379 Mro}) (43)
5380 \p{Script: Mtei} \p{Script=Meetei_Mayek} (=
5381 \p{Script_Extensions=Meetei_Mayek}) (79)
5382 \p{Script: Mult} \p{Script=Multani} (38)
5383 \p{Script: Multani} (Short: \p{Sc=Mult}) (38: U+11280..11286,
5384 U+11288, U+1128A..1128D, U+1128F..1129D,
5385 U+1129F..112A9)
5386 \p{Script: Myanmar} (Short: \p{Sc=Mymr}) (223: U+1000..109F,
5387 U+A9E0..A9FE, U+AA60..AA7F)
5388 \p{Script: Mymr} \p{Script=Myanmar} (223)
5389 \p{Script: Nabataean} \p{Script_Extensions=Nabataean} (Short:
5390 \p{Sc=Nbat}, \p{Nbat}) (40)
5391 \p{Script: Nand} \p{Script=Nandinagari} (65)
5392 \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7,
5393 U+119AA..119D7, U+119DA..119E4)
5394 \p{Script: Narb} \p{Script=Old_North_Arabian} (=
5395 \p{Script_Extensions=Old_North_Arabian})
5396 (32)
5397 \p{Script: Nbat} \p{Script=Nabataean} (=
5398 \p{Script_Extensions=Nabataean}) (40)
5399 \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
5400 \p{Sc=Talu}, \p{Talu}) (83)
5401 \p{Script: Newa} \p{Script_Extensions=Newa} (Short: \p{Sc=
5402 Newa}, \p{Newa}) (97)
5403 \p{Script: Nko} \p{Script_Extensions=Nko} (Short: \p{Sc=
5404 Nko}, \p{Nko}) (62)
5405 \p{Script: Nkoo} \p{Script=Nko} (= \p{Script_Extensions=
5406 Nko}) (62)
5407 \p{Script: Nshu} \p{Script=Nushu} (= \p{Script_Extensions=
5408 Nushu}) (397)
5409 \p{Script: Nushu} \p{Script_Extensions=Nushu} (Short: \p{Sc=
5410 Nshu}, \p{Nshu}) (397)
5411 \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
5412 Nyiakeng_Puachue_Hmong} (Short: \p{Sc=
5413 Hmnp}, \p{Hmnp}) (71)
5414 \p{Script: Ogam} \p{Script=Ogham} (= \p{Script_Extensions=
5415 Ogham}) (29)
5416 \p{Script: Ogham} \p{Script_Extensions=Ogham} (Short: \p{Sc=
5417 Ogam}, \p{Ogam}) (29)
5418 \p{Script: Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
5419 \p{Sc=Olck}, \p{Olck}) (48)
5420 \p{Script: Olck} \p{Script=Ol_Chiki} (=
5421 \p{Script_Extensions=Ol_Chiki}) (48)
5422 \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
5423 (Short: \p{Sc=Hung}, \p{Hung}) (108)
5424 \p{Script: Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
5425 \p{Sc=Ital}, \p{Ital}) (39)
5426 \p{Script: Old_North_Arabian} \p{Script_Extensions=
5427 Old_North_Arabian} (Short: \p{Sc=Narb},
5428 \p{Narb}) (32)
5429 \p{Script: Old_Permic} (Short: \p{Sc=Perm}) (43: U+10350..1037A)
5430 \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
5431 \p{Sc=Xpeo}, \p{Xpeo}) (50)
5432 \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
5433 \p{Sc=Sogo}, \p{Sogo}) (40)
5434 \p{Script: Old_South_Arabian} \p{Script_Extensions=
5435 Old_South_Arabian} (Short: \p{Sc=Sarb},
5436 \p{Sarb}) (32)
5437 \p{Script: Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
5438 \p{Sc=Orkh}, \p{Orkh}) (73)
5439 \p{Script: Oriya} (Short: \p{Sc=Orya}) (91: U+0B01..0B03,
5440 U+0B05..0B0C, U+0B0F..0B10,
5441 U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
5442 ...)
5443 \p{Script: Orkh} \p{Script=Old_Turkic} (=
5444 \p{Script_Extensions=Old_Turkic}) (73)
5445 \p{Script: Orya} \p{Script=Oriya} (91)
5446 \p{Script: Osage} \p{Script_Extensions=Osage} (Short: \p{Sc=
5447 Osge}, \p{Osge}) (72)
5448 \p{Script: Osge} \p{Script=Osage} (= \p{Script_Extensions=
5449 Osage}) (72)
5450 \p{Script: Osma} \p{Script=Osmanya} (=
5451 \p{Script_Extensions=Osmanya}) (40)
5452 \p{Script: Osmanya} \p{Script_Extensions=Osmanya} (Short:
5453 \p{Sc=Osma}, \p{Osma}) (40)
5454 \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
5455 (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
5456 \p{Script: Palm} \p{Script=Palmyrene} (=
5457 \p{Script_Extensions=Palmyrene}) (32)
5458 \p{Script: Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
5459 \p{Sc=Palm}, \p{Palm}) (32)
5460 \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
5461 \p{Sc=Pauc}, \p{Pauc}) (57)
5462 \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (=
5463 \p{Script_Extensions=Pau_Cin_Hau}) (57)
5464 \p{Script: Perm} \p{Script=Old_Permic} (43)
5465 \p{Script: Phag} \p{Script=Phags_Pa} (56)
5466 \p{Script: Phags_Pa} (Short: \p{Sc=Phag}) (56: U+A840..A877)
5467 \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (=
5468 \p{Script_Extensions=
5469 Inscriptional_Pahlavi}) (27)
5470 \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29)
5471 \p{Script: Phnx} \p{Script=Phoenician} (=
5472 \p{Script_Extensions=Phoenician}) (29)
5473 \p{Script: Phoenician} \p{Script_Extensions=Phoenician} (Short:
5474 \p{Sc=Phnx}, \p{Phnx}) (29)
5475 \p{Script: Plrd} \p{Script=Miao} (= \p{Script_Extensions=
5476 Miao}) (149)
5477 \p{Script: Prti} \p{Script=Inscriptional_Parthian} (=
5478 \p{Script_Extensions=
5479 Inscriptional_Parthian}) (30)
5480 \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
5481 U+10B80..10B91, U+10B99..10B9C,
5482 U+10BA9..10BAF)
5483 \p{Script: Qaac} \p{Script=Coptic} (137)
5484 \p{Script: Qaai} \p{Script=Inherited} (573)
5485 \p{Script: Rejang} \p{Script_Extensions=Rejang} (Short:
5486 \p{Sc=Rjng}, \p{Rjng}) (37)
5487 \p{Script: Rjng} \p{Script=Rejang} (= \p{Script_Extensions=
5488 Rejang}) (37)
5489 \p{Script: Rohg} \p{Script=Hanifi_Rohingya} (50)
5490 \p{Script: Runic} \p{Script_Extensions=Runic} (Short: \p{Sc=
5491 Runr}, \p{Runr}) (86)
5492 \p{Script: Runr} \p{Script=Runic} (= \p{Script_Extensions=
5493 Runic}) (86)
5494 \p{Script: Samaritan} \p{Script_Extensions=Samaritan} (Short:
5495 \p{Sc=Samr}, \p{Samr}) (61)
5496 \p{Script: Samr} \p{Script=Samaritan} (=
5497 \p{Script_Extensions=Samaritan}) (61)
5498 \p{Script: Sarb} \p{Script=Old_South_Arabian} (=
5499 \p{Script_Extensions=Old_South_Arabian})
5500 (32)
5501 \p{Script: Saur} \p{Script=Saurashtra} (=
5502 \p{Script_Extensions=Saurashtra}) (82)
5503 \p{Script: Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
5504 \p{Sc=Saur}, \p{Saur}) (82)
5505 \p{Script: Sgnw} \p{Script=SignWriting} (=
5506 \p{Script_Extensions=SignWriting}) (672)
5507 \p{Script: Sharada} (Short: \p{Sc=Shrd}) (96: U+11180..111DF)
5508 \p{Script: Shavian} \p{Script_Extensions=Shavian} (Short:
5509 \p{Sc=Shaw}, \p{Shaw}) (48)
5510 \p{Script: Shaw} \p{Script=Shavian} (=
5511 \p{Script_Extensions=Shavian}) (48)
5512 \p{Script: Shrd} \p{Script=Sharada} (96)
5513 \p{Script: Sidd} \p{Script=Siddham} (=
5514 \p{Script_Extensions=Siddham}) (92)
5515 \p{Script: Siddham} \p{Script_Extensions=Siddham} (Short:
5516 \p{Sc=Sidd}, \p{Sidd}) (92)
5517 \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
5518 \p{Sc=Sgnw}, \p{Sgnw}) (672)
5519 \p{Script: Sind} \p{Script=Khudawadi} (69)
5520 \p{Script: Sinh} \p{Script=Sinhala} (111)
5521 \p{Script: Sinhala} (Short: \p{Sc=Sinh}) (111: U+0D81..0D83,
5522 U+0D85..0D96, U+0D9A..0DB1,
5523 U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
5524 \p{Script: Sogd} \p{Script=Sogdian} (42)
5525 \p{Script: Sogdian} (Short: \p{Sc=Sogd}) (42: U+10F30..10F59)
5526 \p{Script: Sogo} \p{Script=Old_Sogdian} (=
5527 \p{Script_Extensions=Old_Sogdian}) (40)
5528 \p{Script: Sora} \p{Script=Sora_Sompeng} (=
5529 \p{Script_Extensions=Sora_Sompeng}) (35)
5530 \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
5531 (Short: \p{Sc=Sora}, \p{Sora}) (35)
5532 \p{Script: Soyo} \p{Script=Soyombo} (=
5533 \p{Script_Extensions=Soyombo}) (83)
5534 \p{Script: Soyombo} \p{Script_Extensions=Soyombo} (Short:
5535 \p{Sc=Soyo}, \p{Soyo}) (83)
5536 \p{Script: Sund} \p{Script=Sundanese} (=
5537 \p{Script_Extensions=Sundanese}) (72)
5538 \p{Script: Sundanese} \p{Script_Extensions=Sundanese} (Short:
5539 \p{Sc=Sund}, \p{Sund}) (72)
5540 \p{Script: Sylo} \p{Script=Syloti_Nagri} (45)
5541 \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (45: U+A800..A82C)
5542 \p{Script: Syrc} \p{Script=Syriac} (88)
5543 \p{Script: Syriac} (Short: \p{Sc=Syrc}) (88: U+0700..070D,
5544 U+070F..074A, U+074D..074F, U+0860..086A)
5545 \p{Script: Tagalog} (Short: \p{Sc=Tglg}) (20: U+1700..170C,
5546 U+170E..1714)
5547 \p{Script: Tagb} \p{Script=Tagbanwa} (18)
5548 \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}) (18: U+1760..176C,
5549 U+176E..1770, U+1772..1773)
5550 \p{Script: Tai_Le} (Short: \p{Sc=Tale}) (35: U+1950..196D,
5551 U+1970..1974)
5552 \p{Script: Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
5553 \p{Sc=Lana}, \p{Lana}) (127)
5554 \p{Script: Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
5555 \p{Sc=Tavt}, \p{Tavt}) (72)
5556 \p{Script: Takr} \p{Script=Takri} (67)
5557 \p{Script: Takri} (Short: \p{Sc=Takr}) (67: U+11680..116B8,
5558 U+116C0..116C9)
5559 \p{Script: Tale} \p{Script=Tai_Le} (35)
5560 \p{Script: Talu} \p{Script=New_Tai_Lue} (=
5561 \p{Script_Extensions=New_Tai_Lue}) (83)
5562 \p{Script: Tamil} (Short: \p{Sc=Taml}) (123: U+0B82..0B83,
5563 U+0B85..0B8A, U+0B8E..0B90,
5564 U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
5565 \p{Script: Taml} \p{Script=Tamil} (123)
5566 \p{Script: Tang} \p{Script=Tangut} (= \p{Script_Extensions=
5567 Tangut}) (6914)
5568 \p{Script: Tangut} \p{Script_Extensions=Tangut} (Short:
5569 \p{Sc=Tang}, \p{Tang}) (6914)
5570 \p{Script: Tavt} \p{Script=Tai_Viet} (=
5571 \p{Script_Extensions=Tai_Viet}) (72)
5572 \p{Script: Telu} \p{Script=Telugu} (98)
5573 \p{Script: Telugu} (Short: \p{Sc=Telu}) (98: U+0C00..0C0C,
5574 U+0C0E..0C10, U+0C12..0C28,
5575 U+0C2A..0C39, U+0C3D..0C44, U+0C46..0C48
5576 ...)
5577 \p{Script: Tfng} \p{Script=Tifinagh} (=
5578 \p{Script_Extensions=Tifinagh}) (59)
5579 \p{Script: Tglg} \p{Script=Tagalog} (20)
5580 \p{Script: Thaa} \p{Script=Thaana} (50)
5581 \p{Script: Thaana} (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
5582 \p{Script: Thai} \p{Script_Extensions=Thai} (Short: \p{Sc=
5583 Thai}, \p{Thai}) (86)
5584 \p{Script: Tibetan} \p{Script_Extensions=Tibetan} (Short:
5585 \p{Sc=Tibt}, \p{Tibt}) (207)
5586 \p{Script: Tibt} \p{Script=Tibetan} (=
5587 \p{Script_Extensions=Tibetan}) (207)
5588 \p{Script: Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
5589 \p{Sc=Tfng}, \p{Tfng}) (59)
5590 \p{Script: Tirh} \p{Script=Tirhuta} (82)
5591 \p{Script: Tirhuta} (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
5592 U+114D0..114D9)
5593 \p{Script: Ugar} \p{Script=Ugaritic} (=
5594 \p{Script_Extensions=Ugaritic}) (31)
5595 \p{Script: Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
5596 \p{Sc=Ugar}, \p{Ugar}) (31)
5597 \p{Script: Unknown} \p{Script_Extensions=Unknown} (Short:
5598 \p{Sc=Zzzz}, \p{Zzzz}) (970_188 plus all
5599 above-Unicode code points)
5600 \p{Script: Vai} \p{Script_Extensions=Vai} (Short: \p{Sc=
5601 Vai}, \p{Vai}) (300)
5602 \p{Script: Vaii} \p{Script=Vai} (= \p{Script_Extensions=
5603 Vai}) (300)
5604 \p{Script: Wancho} \p{Script_Extensions=Wancho} (Short:
5605 \p{Sc=Wcho}, \p{Wcho}) (59)
5606 \p{Script: Wara} \p{Script=Warang_Citi} (=
5607 \p{Script_Extensions=Warang_Citi}) (84)
5608 \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
5609 \p{Sc=Wara}, \p{Wara}) (84)
5610 \p{Script: Wcho} \p{Script=Wancho} (= \p{Script_Extensions=
5611 Wancho}) (59)
5612 \p{Script: Xpeo} \p{Script=Old_Persian} (=
5613 \p{Script_Extensions=Old_Persian}) (50)
5614 \p{Script: Xsux} \p{Script=Cuneiform} (=
5615 \p{Script_Extensions=Cuneiform}) (1234)
5616 \p{Script: Yezi} \p{Script=Yezidi} (47)
5617 \p{Script: Yezidi} (Short: \p{Sc=Yezi}) (47: U+10E80..10EA9,
5618 U+10EAB..10EAD, U+10EB0..10EB1)
5619 \p{Script: Yi} (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
5620 U+A490..A4C6)
5621 \p{Script: Yiii} \p{Script=Yi} (1220)
5622 \p{Script: Zanabazar_Square} \p{Script_Extensions=
5623 Zanabazar_Square} (Short: \p{Sc=Zanb},
5624 \p{Zanb}) (72)
5625 \p{Script: Zanb} \p{Script=Zanabazar_Square} (=
5626 \p{Script_Extensions=Zanabazar_Square})
5627 (72)
5628 \p{Script: Zinh} \p{Script=Inherited} (573)
5629 \p{Script: Zyyy} \p{Script=Common} (8087)
5630 \p{Script: Zzzz} \p{Script=Unknown} (=
5631 \p{Script_Extensions=Unknown}) (970_188
5632 plus all above-Unicode code points)
5633 \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (89:
5634 U+0640, U+1E900..1E94B, U+1E950..1E959,
5635 U+1E95E..1E95F)
5636 \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (89)
5637 \p{Script_Extensions: Aghb} \p{Script_Extensions=
5638 Caucasian_Albanian} (53)
5639 \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (58:
5640 U+11700..1171A, U+1171D..1172B,
5641 U+11730..1173F)
5642 \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
5643 \p{Hluw}) (583: U+14400..14646)
5644 \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1335)
5645 \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
5646 (1335: U+0600..0604, U+0606..061C,
5647 U+061E..06DC, U+06DE..06FF,
5648 U+0750..077F, U+08A0..08B4 ...)
5649 \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
5650 (96: U+0531..0556, U+0559..058A,
5651 U+058D..058F, U+FB13..FB17)
5652 \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
5653 (31)
5654 \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
5655 \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
5656 (61: U+10B00..10B35, U+10B39..10B3F)
5657 \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
5658 \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
5659 \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
5660 (121: U+1B00..1B4B, U+1B50..1B7C)
5661 \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
5662 \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
5663 U+A6A0..A6F7, U+16800..16A38)
5664 \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
5665 \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
5666 (36: U+16AD0..16AED, U+16AF0..16AF5)
5667 \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
5668 U+1BC0..1BF3, U+1BFC..1BFF)
5669 \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
5670 \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
5671 \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
5672 (113: U+0951..0952, U+0964..0965,
5673 U+0980..0983, U+0985..098C,
5674 U+098F..0990, U+0993..09A8 ...)
5675 \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
5676 (97: U+11C00..11C08, U+11C0A..11C36,
5677 U+11C38..11C45, U+11C50..11C6C)
5678 \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
5679 \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (117)
5680 \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
5681 (117: U+02EA..02EB, U+3001..3003,
5682 U+3008..3011, U+3013..301F,
5683 U+302A..302D, U+3030 ...)
5684 \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
5685 \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
5686 (109: U+11000..1104D, U+11052..1106F,
5687 U+1107F)
5688 \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
5689 \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
5690 (256: U+2800..28FF)
5691 \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
5692 \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
5693 (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
5694 \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
5695 \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
5696 U+1735..1736, U+1740..1753)
5697 \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (91)
5698 \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
5699 \p{Cans}) (710: U+1400..167F,
5700 U+18B0..18F5)
5701 \p{Script_Extensions: Cans} \p{Script_Extensions=
5702 Canadian_Aboriginal} (710)
5703 \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
5704 \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
5705 U+102A0..102D0)
5706 \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
5707 \p{Aghb}) (53: U+10530..10563, U+1056F)
5708 \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (91:
5709 U+09E6..09EF, U+1040..1049,
5710 U+11100..11134, U+11136..11147)
5711 \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
5712 U+AA00..AA36, U+AA40..AA4D,
5713 U+AA50..AA59, U+AA5C..AA5F)
5714 \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
5715 \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
5716 (172: U+13A0..13F5, U+13F8..13FD,
5717 U+AB70..ABBF)
5718 \p{Script_Extensions: Chorasmian} (Short: \p{Scx=Chrs}, \p{Chrs})
5719 (28: U+10FB0..10FCB)
5720 \p{Script_Extensions: Chrs} \p{Script_Extensions=Chorasmian} (28)
5721 \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
5722 (7661: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
5723 \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xa9
5724 \xab-\xb9\xbb-\xbf\xd7\xf7],
5725 U+02B9..02DF, U+02E5..02E9,
5726 U+02EC..02FF, U+0374, U+037E ...)
5727 \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
5728 \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
5729 (165: U+03E2..03EF, U+2C80..2CF3,
5730 U+2CF9..2CFF, U+102E0..102FB)
5731 \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
5732 \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
5733 (1234: U+12000..12399, U+12400..1246E,
5734 U+12470..12474, U+12480..12543)
5735 \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
5736 (112: U+10100..10102, U+10107..10133,
5737 U+10137..1013F, U+10800..10805, U+10808,
5738 U+1080A..10835 ...)
5739 \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
5740 (447: U+0400..052F, U+1C80..1C88,
5741 U+1D2B, U+1D78, U+1DF8, U+2DE0..2DFF ...)
5742 \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (447)
5743 \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
5744 (80: U+10400..1044F)
5745 \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
5746 \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
5747 (210: U+0900..0952, U+0955..097F,
5748 U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
5749 U+A830..A839 ...)
5750 \p{Script_Extensions: Diak} \p{Script_Extensions=Dives_Akuru} (72)
5751 \p{Script_Extensions: Dives_Akuru} (Short: \p{Scx=Diak}, \p{Diak})
5752 (72: U+11900..11906, U+11909,
5753 U+1190C..11913, U+11915..11916,
5754 U+11918..11935, U+11937..11938 ...)
5755 \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
5756 \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
5757 U+0964..096F, U+A830..A839,
5758 U+11800..1183B)
5759 \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
5760 \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
5761 \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
5762 (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
5763 U+1BC80..1BC88, U+1BC90..1BC99,
5764 U+1BC9C..1BCA3)
5765 \p{Script_Extensions: Egyp} \p{Script_Extensions=
5766 Egyptian_Hieroglyphs} (1080)
5767 \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
5768 \p{Egyp}) (1080: U+13000..1342E,
5769 U+13430..13438)
5770 \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
5771 \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
5772 (40: U+10500..10527)
5773 \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
5774 \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
5775 (23: U+10FE0..10FF6)
5776 \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
5777 \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
5778 (495: U+1200..1248, U+124A..124D,
5779 U+1250..1256, U+1258, U+125A..125D,
5780 U+1260..1288 ...)
5781 \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (174)
5782 \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
5783 (174: U+10A0..10C5, U+10C7, U+10CD,
5784 U+10D0..10FF, U+1C90..1CBA, U+1CBD..1CBF
5785 ...)
5786 \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136)
5787 \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
5788 (136: U+0484, U+0487, U+2C00..2C2E,
5789 U+2C30..2C5E, U+2E43, U+A66F ...)
5790 \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
5791 (65)
5792 \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
5793 (77)
5794 \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
5795 \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
5796 U+10330..1034A)
5797 \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
5798 \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
5799 (116: U+0951..0952, U+0964..0965,
5800 U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
5801 U+1CF2..1CF4 ...)
5802 \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
5803 U+0342, U+0345, U+0370..0373,
5804 U+0375..0377, U+037A..037D, U+037F ...)
5805 \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
5806 \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
5807 (105: U+0951..0952, U+0964..0965,
5808 U+0A81..0A83, U+0A85..0A8D,
5809 U+0A8F..0A91, U+0A93..0AA8 ...)
5810 \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
5811 \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
5812 \p{Gong}) (65: U+0964..0965,
5813 U+11D60..11D65, U+11D67..11D68,
5814 U+11D6A..11D8E, U+11D90..11D91,
5815 U+11D93..11D98 ...)
5816 \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
5817 (94: U+0951..0952, U+0964..0965,
5818 U+0A01..0A03, U+0A05..0A0A,
5819 U+0A0F..0A10, U+0A13..0A28 ...)
5820 \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
5821 \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (94_492:
5822 U+2E80..2E99, U+2E9B..2EF3,
5823 U+2F00..2FD5, U+3001..3003,
5824 U+3005..3011, U+3013..301F ...)
5825 \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
5826 \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
5827 (11_775: U+1100..11FF, U+3001..3003,
5828 U+3008..3011, U+3013..301F,
5829 U+302E..3030, U+3037 ...)
5830 \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (94_492)
5831 \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
5832 \p{Rohg}) (55: U+060C, U+061B, U+061F,
5833 U+0640, U+06D4, U+10D00..10D27 ...)
5834 \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
5835 \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
5836 (23: U+1720..1736)
5837 \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
5838 \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
5839 U+108E0..108F2, U+108F4..108F5,
5840 U+108FB..108FF)
5841 \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
5842 \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
5843 (134: U+0591..05C7, U+05D0..05EA,
5844 U+05EF..05F4, U+FB1D..FB36,
5845 U+FB38..FB3C, U+FB3E ...)
5846 \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (431)
5847 \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
5848 (431: U+3001..3003, U+3008..3011,
5849 U+3013..301F, U+3030..3035, U+3037,
5850 U+303C..303D ...)
5851 \p{Script_Extensions: Hluw} \p{Script_Extensions=
5852 Anatolian_Hieroglyphs} (583)
5853 \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
5854 (127)
5855 \p{Script_Extensions: Hmnp} \p{Script_Extensions=
5856 Nyiakeng_Puachue_Hmong} (71)
5857 \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
5858 (108)
5859 \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
5860 \p{Armi}) (31: U+10840..10855,
5861 U+10857..1085F)
5862 \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
5863 (503: U+0300..0341, U+0343..0344,
5864 U+0346..0362, U+0953..0954,
5865 U+1AB0..1AC0, U+1DC2..1DF7 ...)
5866 \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
5867 \p{Phli}) (27: U+10B60..10B72,
5868 U+10B78..10B7F)
5869 \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
5870 Prti}, \p{Prti}) (30: U+10B40..10B55,
5871 U+10B58..10B5F)
5872 \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
5873 \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
5874 \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
5875 (91: U+A980..A9CD, U+A9CF..A9D9,
5876 U+A9DE..A9DF)
5877 \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (87:
5878 U+0966..096F, U+A830..A839,
5879 U+11080..110C1, U+110CD)
5880 \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
5881 \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (356)
5882 \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
5883 (104: U+0951..0952, U+0964..0965,
5884 U+0C80..0C8C, U+0C8E..0C90,
5885 U+0C92..0CA8, U+0CAA..0CB3 ...)
5886 \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
5887 (356: U+3001..3003, U+3008..3011,
5888 U+3013..301F, U+3030..3035, U+3037,
5889 U+303C..303D ...)
5890 \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
5891 (48: U+A900..A92F)
5892 \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
5893 \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
5894 (68: U+10A00..10A03, U+10A05..10A06,
5895 U+10A0C..10A13, U+10A15..10A17,
5896 U+10A19..10A35, U+10A38..10A3A ...)
5897 \p{Script_Extensions: Khitan_Small_Script} (Short: \p{Scx=Kits},
5898 \p{Kits}) (471: U+16FE4, U+18B00..18CD5)
5899 \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
5900 U+1780..17DD, U+17E0..17E9,
5901 U+17F0..17F9, U+19E0..19FF)
5902 \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
5903 \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (82)
5904 \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (82:
5905 U+0AE6..0AEF, U+A830..A839,
5906 U+11200..11211, U+11213..1123E)
5907 \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
5908 (81: U+0964..0965, U+A830..A839,
5909 U+112B0..112EA, U+112F0..112F9)
5910 \p{Script_Extensions: Kits} \p{Script_Extensions=
5911 Khitan_Small_Script} (471)
5912 \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (104)
5913 \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (87)
5914 \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
5915 \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (82:
5916 U+0E81..0E82, U+0E84, U+0E86..0E8A,
5917 U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
5918 \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (82)
5919 \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
5920 (1403: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
5921 \xf6\xf8-\xff], U+0100..02B8,
5922 U+02E0..02E4, U+0363..036F,
5923 U+0485..0486, U+0951..0952 ...)
5924 \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1403)
5925 \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
5926 \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
5927 U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
5928 \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
5929 \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
5930 U+0965, U+1900..191E, U+1920..192B,
5931 U+1930..193B, U+1940, U+1944..194F)
5932 \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
5933 \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
5934 \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
5935 (386: U+10107..10133, U+10600..10736,
5936 U+10740..10755, U+10760..10767)
5937 \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
5938 (268: U+10000..1000B, U+1000D..10026,
5939 U+10028..1003A, U+1003C..1003D,
5940 U+1003F..1004D, U+10050..1005D ...)
5941 \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (49:
5942 U+A4D0..A4FF, U+11FB0)
5943 \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
5944 \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
5945 U+10280..1029C)
5946 \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
5947 \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
5948 U+10920..10939, U+1093F)
5949 \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
5950 (61: U+0964..096F, U+A830..A839,
5951 U+11150..11176)
5952 \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
5953 \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
5954 \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
5955 (25: U+11EE0..11EF8)
5956 \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
5957 (126: U+0951..0952, U+0964..0965,
5958 U+0D00..0D0C, U+0D0E..0D10,
5959 U+0D12..0D44, U+0D46..0D48 ...)
5960 \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
5961 \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
5962 (30: U+0640, U+0840..085B, U+085E)
5963 \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
5964 \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
5965 (52: U+0640, U+10AC0..10AE6,
5966 U+10AEB..10AF6)
5967 \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
5968 \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
5969 (68: U+11C70..11C8F, U+11C92..11CA7,
5970 U+11CA9..11CB6)
5971 \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
5972 \p{Gonm}) (77: U+0964..0965,
5973 U+11D00..11D06, U+11D08..11D09,
5974 U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
5975 ...)
5976 \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
5977 (91: U+16E40..16E9A)
5978 \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
5979 \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
5980 \p{Mtei}) (79: U+AAE0..AAF6,
5981 U+ABC0..ABED, U+ABF0..ABF9)
5982 \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
5983 (213)
5984 \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
5985 \p{Mend}) (213: U+1E800..1E8C4,
5986 U+1E8C7..1E8D6)
5987 \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
5988 (90)
5989 \p{Script_Extensions: Mero} \p{Script_Extensions=
5990 Meroitic_Hieroglyphs} (32)
5991 \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
5992 \p{Merc}) (90: U+109A0..109B7,
5993 U+109BC..109CF, U+109D2..109FF)
5994 \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
5995 \p{Mero}) (32: U+10980..1099F)
5996 \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
5997 U+16F00..16F4A, U+16F4F..16F87,
5998 U+16F8F..16F9F)
5999 \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (126)
6000 \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
6001 U+A830..A839, U+11600..11644,
6002 U+11650..11659)
6003 \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (171)
6004 \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
6005 (171: U+1800..180E, U+1810..1819,
6006 U+1820..1878, U+1880..18AA, U+202F,
6007 U+11660..1166C)
6008 \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
6009 U+16A40..16A5E, U+16A60..16A69,
6010 U+16A6E..16A6F)
6011 \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
6012 \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
6013 \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
6014 \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
6015 (48: U+0A66..0A6F, U+11280..11286,
6016 U+11288, U+1128A..1128D, U+1128F..1129D,
6017 U+1129F..112A9)
6018 \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
6019 (224: U+1000..109F, U+A92E,
6020 U+A9E0..A9FE, U+AA60..AA7F)
6021 \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
6022 \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
6023 (40: U+10880..1089E, U+108A7..108AF)
6024 \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
6025 \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
6026 (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
6027 U+1CF2, U+1CFA, U+A830..A835 ...)
6028 \p{Script_Extensions: Narb} \p{Script_Extensions=
6029 Old_North_Arabian} (32)
6030 \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
6031 \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
6032 (83: U+1980..19AB, U+19B0..19C9,
6033 U+19D0..19DA, U+19DE..19DF)
6034 \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (97:
6035 U+11400..1145B, U+1145D..11461)
6036 \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (62:
6037 U+07C0..07FA, U+07FD..07FF)
6038 \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (62)
6039 \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
6040 \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
6041 U+16FE1, U+1B170..1B2FB)
6042 \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
6043 Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
6044 U+1E130..1E13D, U+1E140..1E149,
6045 U+1E14E..1E14F)
6046 \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
6047 \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
6048 U+1680..169C)
6049 \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
6050 (48: U+1C50..1C7F)
6051 \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
6052 \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
6053 \p{Hung}) (108: U+10C80..10CB2,
6054 U+10CC0..10CF2, U+10CFA..10CFF)
6055 \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
6056 (39: U+10300..10323, U+1032D..1032F)
6057 \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
6058 \p{Narb}) (32: U+10A80..10A9F)
6059 \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
6060 (44: U+0483, U+10350..1037A)
6061 \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
6062 (50: U+103A0..103C3, U+103C8..103D5)
6063 \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
6064 (40: U+10F00..10F27)
6065 \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
6066 \p{Sarb}) (32: U+10A60..10A7F)
6067 \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
6068 (73: U+10C00..10C48)
6069 \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (97:
6070 U+0951..0952, U+0964..0965,
6071 U+0B01..0B03, U+0B05..0B0C,
6072 U+0B0F..0B10, U+0B13..0B28 ...)
6073 \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
6074 \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (97)
6075 \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
6076 U+104B0..104D3, U+104D8..104FB)
6077 \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
6078 \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
6079 \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
6080 (40: U+10480..1049D, U+104A0..104A9)
6081 \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
6082 \p{Hmng}) (127: U+16B00..16B45,
6083 U+16B50..16B59, U+16B5B..16B61,
6084 U+16B63..16B77, U+16B7D..16B8F)
6085 \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
6086 \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
6087 (32: U+10860..1087F)
6088 \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
6089 (57: U+11AC0..11AF8)
6090 \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
6091 \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
6092 \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
6093 \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
6094 (59: U+1802..1803, U+1805, U+A840..A877)
6095 \p{Script_Extensions: Phli} \p{Script_Extensions=
6096 Inscriptional_Pahlavi} (27)
6097 \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
6098 (30)
6099 \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
6100 \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
6101 (29: U+10900..1091B, U+1091F)
6102 \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
6103 \p{Script_Extensions: Prti} \p{Script_Extensions=
6104 Inscriptional_Parthian} (30)
6105 \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
6106 \p{Phlp}) (30: U+0640, U+10B80..10B91,
6107 U+10B99..10B9C, U+10BA9..10BAF)
6108 \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
6109 \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (503)
6110 \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
6111 U+A930..A953, U+A95F)
6112 \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
6113 \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
6114 (55)
6115 \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
6116 U+16A0..16EA, U+16EE..16F8)
6117 \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
6118 \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
6119 (61: U+0800..082D, U+0830..083E)
6120 \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
6121 \p{Script_Extensions: Sarb} \p{Script_Extensions=
6122 Old_South_Arabian} (32)
6123 \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
6124 \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
6125 (82: U+A880..A8C5, U+A8CE..A8D9)
6126 \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
6127 \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
6128 (102: U+0951, U+1CD7, U+1CD9,
6129 U+1CDC..1CDD, U+1CE0, U+11180..111DF)
6130 \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
6131 (48: U+10450..1047F)
6132 \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
6133 \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (102)
6134 \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
6135 \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
6136 (92: U+11580..115B5, U+115B8..115DD)
6137 \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
6138 (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
6139 U+1DAA1..1DAAF)
6140 \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
6141 \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (113)
6142 \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
6143 (113: U+0964..0965, U+0D81..0D83,
6144 U+0D85..0D96, U+0D9A..0DB1,
6145 U+0DB3..0DBB, U+0DBD ...)
6146 \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
6147 \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
6148 (43: U+0640, U+10F30..10F59)
6149 \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
6150 \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
6151 \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
6152 \p{Sora}) (35: U+110D0..110E8,
6153 U+110F0..110F9)
6154 \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
6155 \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
6156 (83: U+11A50..11AA2)
6157 \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
6158 \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
6159 (72: U+1B80..1BBF, U+1CC0..1CC7)
6160 \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (57)
6161 \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
6162 \p{Sylo}) (57: U+0964..0965,
6163 U+09E6..09EF, U+A800..A82C)
6164 \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (106)
6165 \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
6166 (106: U+060C, U+061B..061C, U+061F,
6167 U+0640, U+064B..0655, U+0670 ...)
6168 \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
6169 (22: U+1700..170C, U+170E..1714,
6170 U+1735..1736)
6171 \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
6172 \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
6173 (20: U+1735..1736, U+1760..176C,
6174 U+176E..1770, U+1772..1773)
6175 \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
6176 U+1040..1049, U+1950..196D, U+1970..1974)
6177 \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
6178 (127: U+1A20..1A5E, U+1A60..1A7C,
6179 U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
6180 \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
6181 (72: U+AA80..AAC2, U+AADB..AADF)
6182 \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (79)
6183 \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (79:
6184 U+0964..0965, U+A830..A839,
6185 U+11680..116B8, U+116C0..116C9)
6186 \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
6187 \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
6188 \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
6189 U+0951..0952, U+0964..0965,
6190 U+0B82..0B83, U+0B85..0B8A,
6191 U+0B8E..0B90, U+0B92..0B95 ...)
6192 \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
6193 \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6914)
6194 \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
6195 (6914: U+16FE0, U+17000..187F7,
6196 U+18800..18AFF, U+18D00..18D08)
6197 \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
6198 \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (104)
6199 \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
6200 (104: U+0951..0952, U+0964..0965,
6201 U+0C00..0C0C, U+0C0E..0C10,
6202 U+0C12..0C28, U+0C2A..0C39 ...)
6203 \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
6204 \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
6205 \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
6206 \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
6207 U+060C, U+061B..061C, U+061F,
6208 U+0660..0669, U+0780..07B1, U+FDF2 ...)
6209 \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
6210 U+0E01..0E3A, U+0E40..0E5B)
6211 \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
6212 (207: U+0F00..0F47, U+0F49..0F6C,
6213 U+0F71..0F97, U+0F99..0FBC,
6214 U+0FBE..0FCC, U+0FCE..0FD4 ...)
6215 \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
6216 \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
6217 (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
6218 \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
6219 \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
6220 (97: U+0951..0952, U+0964..0965, U+1CF2,
6221 U+A830..A839, U+11480..114C7,
6222 U+114D0..114D9)
6223 \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
6224 \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
6225 (31: U+10380..1039D, U+1039F)
6226 \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
6227 (970_188 plus all above-Unicode code
6228 points: U+0378..0379, U+0380..0383,
6229 U+038B, U+038D, U+03A2, U+0530 ...)
6230 \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
6231 U+A500..A62B)
6232 \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
6233 \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59:
6234 U+1E2C0..1E2F9, U+1E2FF)
6235 \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
6236 \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
6237 (84: U+118A0..118F2, U+118FF)
6238 \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59)
6239 \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
6240 \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
6241 \p{Script_Extensions: Yezi} \p{Script_Extensions=Yezidi} (60)
6242 \p{Script_Extensions: Yezidi} (Short: \p{Scx=Yezi}, \p{Yezi}) (60:
6243 U+060C, U+061B, U+061F, U+0660..0669,
6244 U+10E80..10EA9, U+10EAB..10EAD ...)
6245 \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
6246 U+3001..3002, U+3008..3011,
6247 U+3014..301B, U+30FB, U+A000..A48C,
6248 U+A490..A4C6 ...)
6249 \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
6250 \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
6251 \p{Zanb}) (72: U+11A00..11A47)
6252 \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
6253 (72)
6254 \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (503)
6255 \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7661)
6256 \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (970_188
6257 plus all above-Unicode code points)
6258 \p{Scx: *} \p{Script_Extensions: *}
6259 \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
6260 \p{SD: *} \p{Soft_Dotted: *}
6261 \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
6262 \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
6263 U+FE52, U+FF0E)
6264 \p{Sentence_Break: CL} \p{Sentence_Break=Close} (187)
6265 \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187: [\"\'\(\)\[\]
6266 \{\}\xab\xbb], U+0F3A..0F3D,
6267 U+169B..169C, U+2018..201F,
6268 U+2039..203A, U+2045..2046 ...)
6269 \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1: [\r])
6270 \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (2395)
6271 \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2395: U+0300..036F,
6272 U+0483..0489, U+0591..05BD, U+05BF,
6273 U+05C1..05C2, U+05C4..05C5 ...)
6274 \p{Sentence_Break: FO} \p{Sentence_Break=Format} (63)
6275 \p{Sentence_Break: Format} (Short: \p{SB=FO}) (63: [\xad],
6276 U+0600..0605, U+061C, U+06DD, U+070F,
6277 U+08E2 ...)
6278 \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (127_413)
6279 \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1: [\n])
6280 \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2297)
6281 \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2297: [a-z\xaa\xb5
6282 \xba\xdf-\xf6\xf8-\xff], U+0101, U+0103,
6283 U+0105, U+0107, U+0109 ...)
6284 \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (652)
6285 \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (652: [0-9],
6286 U+0660..0669, U+066B..066C,
6287 U+06F0..06F9, U+07C0..07C9, U+0966..096F
6288 ...)
6289 \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (127_413: U+01BB,
6290 U+01C0..01C3, U+0294, U+02B9..02BF,
6291 U+02C6..02D1, U+02EC ...)
6292 \p{Sentence_Break: Other} (Short: \p{SB=XX}) (979_014 plus all
6293 above-Unicode code points: [^\t\n\cK\f
6294 \r\x20!\"\'\(\),\-.0-9:?A-Z\[\]a-z\{\}
6295 \x85\xa0\xaa-\xab\xad\xb5\xba-\xbb\xc0-
6296 \xd6\xd8-\xf6\xf8-\xff], U+02C2..02C5,
6297 U+02D2..02DF, U+02E5..02EB, U+02ED,
6298 U+02EF..02FF ...)
6299 \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
6300 \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
6301 U+055D, U+060C..060D, U+07F8, U+1802,
6302 U+1808 ...)
6303 \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
6304 \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
6305 U+2028..2029)
6306 \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
6307 U+1680, U+2000..200A, U+202F, U+205F,
6308 U+3000)
6309 \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (140)
6310 \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (140: [!?], U+0589,
6311 U+061E..061F, U+06D4, U+0700..0702,
6312 U+07F9 ...)
6313 \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1896)
6314 \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1896: [A-Z\xc0-\xd6
6315 \xd8-\xde], U+0100, U+0102, U+0104,
6316 U+0106, U+0108 ...)
6317 \p{Sentence_Break: XX} \p{Sentence_Break=Other} (979_014 plus all
6318 above-Unicode code points)
6319 \p{Sentence_Terminal} \p{Sentence_Terminal=Y} (Short: \p{STerm})
6320 (143)
6321 \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
6322 (1_113_969 plus all above-Unicode code
6323 points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
6324 \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
6325 \xff], U+0100..0588, U+058A..061D,
6326 U+0620..06D3, U+06D5..06FF, U+0703..07F8
6327 ...)
6328 \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (143:
6329 [!.?], U+0589, U+061E..061F, U+06D4,
6330 U+0700..0702, U+07F9 ...)
6331 \p{Separator} \p{General_Category=Separator} (Short:
6332 \p{Z}) (19)
6333 \p{Sgnw} \p{SignWriting} (= \p{Script_Extensions=
6334 SignWriting}) (672)
6335 \p{Sharada} \p{Script_Extensions=Sharada} (Short:
6336 \p{Shrd}; NOT \p{Block=Sharada}) (102)
6337 \p{Shavian} \p{Script_Extensions=Shavian} (Short:
6338 \p{Shaw}) (48)
6339 \p{Shaw} \p{Shavian} (= \p{Script_Extensions=
6340 Shavian}) (48)
6341 X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
6342 (16)
6343 \p{Shrd} \p{Sharada} (= \p{Script_Extensions=
6344 Sharada}) (NOT \p{Block=Sharada}) (102)
6345 \p{Sidd} \p{Siddham} (= \p{Script_Extensions=
6346 Siddham}) (NOT \p{Block=Siddham}) (92)
6347 \p{Siddham} \p{Script_Extensions=Siddham} (Short:
6348 \p{Sidd}; NOT \p{Block=Siddham}) (92)
6349 \p{SignWriting} \p{Script_Extensions=SignWriting} (Short:
6350 \p{Sgnw}) (672)
6351 \p{Sind} \p{Khudawadi} (= \p{Script_Extensions=
6352 Khudawadi}) (NOT \p{Block=Khudawadi})
6353 (81)
6354 \p{Sinh} \p{Sinhala} (= \p{Script_Extensions=
6355 Sinhala}) (NOT \p{Block=Sinhala}) (113)
6356 \p{Sinhala} \p{Script_Extensions=Sinhala} (Short:
6357 \p{Sinh}; NOT \p{Block=Sinhala}) (113)
6358 X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
6359 \p{Sk} \p{Modifier_Symbol} (=
6360 \p{General_Category=Modifier_Symbol})
6361 (123)
6362 \p{Sm} \p{Math_Symbol} (= \p{General_Category=
6363 Math_Symbol}) (948)
6364 X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
6365 \p{InSmallForms}) (32)
6366 X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
6367 Small_Form_Variants}) (32)
6368 X \p{Small_Kana_Ext} \p{Small_Kana_Extension} (= \p{Block=
6369 Small_Kana_Extension}) (64)
6370 X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short:
6371 \p{InSmallKanaExt}) (64)
6372 \p{So} \p{Other_Symbol} (= \p{General_Category=
6373 Other_Symbol}) (6431)
6374 \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
6375 \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
6376 all above-Unicode code points: [\x00-
6377 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6378 Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
6379 U+0100..012E, U+0130..0248,
6380 U+024A..0267, U+0269..029C, U+029E..02B1
6381 ...)
6382 \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46: [i-j],
6383 U+012F, U+0249, U+0268, U+029D, U+02B2
6384 ...)
6385 \p{Sogd} \p{Sogdian} (= \p{Script_Extensions=
6386 Sogdian}) (NOT \p{Block=Sogdian}) (43)
6387 \p{Sogdian} \p{Script_Extensions=Sogdian} (Short:
6388 \p{Sogd}; NOT \p{Block=Sogdian}) (43)
6389 \p{Sogo} \p{Old_Sogdian} (= \p{Script_Extensions=
6390 Old_Sogdian}) (NOT \p{Block=
6391 Old_Sogdian}) (40)
6392 \p{Sora} \p{Sora_Sompeng} (= \p{Script_Extensions=
6393 Sora_Sompeng}) (NOT \p{Block=
6394 Sora_Sompeng}) (35)
6395 \p{Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} (Short:
6396 \p{Sora}; NOT \p{Block=Sora_Sompeng})
6397 (35)
6398 \p{Soyo} \p{Soyombo} (= \p{Script_Extensions=
6399 Soyombo}) (NOT \p{Block=Soyombo}) (83)
6400 \p{Soyombo} \p{Script_Extensions=Soyombo} (Short:
6401 \p{Soyo}; NOT \p{Block=Soyombo}) (83)
6402 \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25)
6403 \p{Space: *} \p{White_Space: *}
6404 \p{Space_Separator} \p{General_Category=Space_Separator}
6405 (Short: \p{Zs}) (17)
6406 \p{SpacePerl} \p{XPosixSpace} (25)
6407 \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
6408 \p{Mc}) (443)
6409 X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
6410 (Short: \p{InModifierLetters}) (80)
6411 X \p{Specials} \p{Block=Specials} (16)
6412 \p{STerm} \p{Sentence_Terminal} (=
6413 \p{Sentence_Terminal=Y}) (143)
6414 \p{STerm: *} \p{Sentence_Terminal: *}
6415 \p{Sund} \p{Sundanese} (= \p{Script_Extensions=
6416 Sundanese}) (NOT \p{Block=Sundanese})
6417 (72)
6418 \p{Sundanese} \p{Script_Extensions=Sundanese} (Short:
6419 \p{Sund}; NOT \p{Block=Sundanese}) (72)
6420 X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
6421 Sundanese_Supplement}) (16)
6422 X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
6423 \p{InSundaneseSup}) (16)
6424 X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
6425 Supplemental_Arrows_A}) (16)
6426 X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
6427 Supplemental_Arrows_B}) (128)
6428 X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block=
6429 Supplemental_Arrows_C}) (256)
6430 X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
6431 \p{Block=
6432 Supplemental_Mathematical_Operators})
6433 (256)
6434 X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
6435 \p{Block=
6436 Supplementary_Private_Use_Area_A})
6437 (65_536)
6438 X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
6439 \p{Block=
6440 Supplementary_Private_Use_Area_B})
6441 (65_536)
6442 X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
6443 Supplemental_Punctuation}) (128)
6444 X \p{Sup_Symbols_And_Pictographs}
6445 \p{Supplemental_Symbols_And_Pictographs}
6446 (= \p{Block=
6447 Supplemental_Symbols_And_Pictographs})
6448 (256)
6449 X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
6450 \p{Block=Superscripts_And_Subscripts})
6451 (48)
6452 X \p{Superscripts_And_Subscripts} \p{Block=
6453 Superscripts_And_Subscripts} (Short:
6454 \p{InSuperAndSub}) (48)
6455 X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
6456 \p{InSupArrowsA}) (16)
6457 X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
6458 \p{InSupArrowsB}) (128)
6459 X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
6460 \p{InSupArrowsC}) (256)
6461 X \p{Supplemental_Mathematical_Operators} \p{Block=
6462 Supplemental_Mathematical_Operators}
6463 (Short: \p{InSupMathOperators}) (256)
6464 X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
6465 (Short: \p{InSupPunctuation}) (128)
6466 X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
6467 Supplemental_Symbols_And_Pictographs}
6468 (Short: \p{InSupSymbolsAndPictographs})
6469 (256)
6470 X \p{Supplementary_Private_Use_Area_A} \p{Block=
6471 Supplementary_Private_Use_Area_A}
6472 (Short: \p{InSupPUAA}) (65_536)
6473 X \p{Supplementary_Private_Use_Area_B} \p{Block=
6474 Supplementary_Private_Use_Area_B}
6475 (Short: \p{InSupPUAB}) (65_536)
6476 \p{Surrogate} \p{General_Category=Surrogate} (Short:
6477 \p{Cs}) (2048)
6478 X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688)
6479 \p{Sylo} \p{Syloti_Nagri} (= \p{Script_Extensions=
6480 Syloti_Nagri}) (NOT \p{Block=
6481 Syloti_Nagri}) (57)
6482 \p{Syloti_Nagri} \p{Script_Extensions=Syloti_Nagri} (Short:
6483 \p{Sylo}; NOT \p{Block=Syloti_Nagri})
6484 (57)
6485 \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
6486 (7564)
6487 X \p{Symbols_And_Pictographs_Ext_A}
6488 \p{Symbols_And_Pictographs_Extended_A}
6489 (= \p{Block=
6490 Symbols_And_Pictographs_Extended_A})
6491 (144)
6492 X \p{Symbols_And_Pictographs_Extended_A} \p{Block=
6493 Symbols_And_Pictographs_Extended_A} (144)
6494 X \p{Symbols_For_Legacy_Computing} \p{Block=
6495 Symbols_For_Legacy_Computing} (256)
6496 \p{Syrc} \p{Syriac} (= \p{Script_Extensions=
6497 Syriac}) (NOT \p{Block=Syriac}) (106)
6498 \p{Syriac} \p{Script_Extensions=Syriac} (Short:
6499 \p{Syrc}; NOT \p{Block=Syriac}) (106)
6500 X \p{Syriac_Sup} \p{Syriac_Supplement} (= \p{Block=
6501 Syriac_Supplement}) (16)
6502 X \p{Syriac_Supplement} \p{Block=Syriac_Supplement} (Short:
6503 \p{InSyriacSup}) (16)
6504 \p{Tagalog} \p{Script_Extensions=Tagalog} (Short:
6505 \p{Tglg}; NOT \p{Block=Tagalog}) (22)
6506 \p{Tagb} \p{Tagbanwa} (= \p{Script_Extensions=
6507 Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
6508 \p{Tagbanwa} \p{Script_Extensions=Tagbanwa} (Short:
6509 \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
6510 X \p{Tags} \p{Block=Tags} (128)
6511 \p{Tai_Le} \p{Script_Extensions=Tai_Le} (Short:
6512 \p{Tale}; NOT \p{Block=Tai_Le}) (45)
6513 \p{Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
6514 \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
6515 \p{Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
6516 \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
6517 X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
6518 Tai_Xuan_Jing_Symbols}) (96)
6519 X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
6520 \p{InTaiXuanJing}) (96)
6521 \p{Takr} \p{Takri} (= \p{Script_Extensions=Takri})
6522 (NOT \p{Block=Takri}) (79)
6523 \p{Takri} \p{Script_Extensions=Takri} (Short:
6524 \p{Takr}; NOT \p{Block=Takri}) (79)
6525 \p{Tale} \p{Tai_Le} (= \p{Script_Extensions=
6526 Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
6527 \p{Talu} \p{New_Tai_Lue} (= \p{Script_Extensions=
6528 New_Tai_Lue}) (NOT \p{Block=
6529 New_Tai_Lue}) (83)
6530 \p{Tamil} \p{Script_Extensions=Tamil} (Short:
6531 \p{Taml}; NOT \p{Block=Tamil}) (133)
6532 X \p{Tamil_Sup} \p{Tamil_Supplement} (= \p{Block=
6533 Tamil_Supplement}) (64)
6534 X \p{Tamil_Supplement} \p{Block=Tamil_Supplement} (Short:
6535 \p{InTamilSup}) (64)
6536 \p{Taml} \p{Tamil} (= \p{Script_Extensions=Tamil})
6537 (NOT \p{Block=Tamil}) (133)
6538 \p{Tang} \p{Tangut} (= \p{Script_Extensions=
6539 Tangut}) (NOT \p{Block=Tangut}) (6914)
6540 \p{Tangut} \p{Script_Extensions=Tangut} (Short:
6541 \p{Tang}; NOT \p{Block=Tangut}) (6914)
6542 X \p{Tangut_Components} \p{Block=Tangut_Components} (768)
6543 X \p{Tangut_Sup} \p{Tangut_Supplement} (= \p{Block=
6544 Tangut_Supplement}) (144)
6545 X \p{Tangut_Supplement} \p{Block=Tangut_Supplement} (Short:
6546 \p{InTangutSup}) (144)
6547 \p{Tavt} \p{Tai_Viet} (= \p{Script_Extensions=
6548 Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
6549 \p{Telu} \p{Telugu} (= \p{Script_Extensions=
6550 Telugu}) (NOT \p{Block=Telugu}) (104)
6551 \p{Telugu} \p{Script_Extensions=Telugu} (Short:
6552 \p{Telu}; NOT \p{Block=Telugu}) (104)
6553 \p{Term} \p{Terminal_Punctuation} (=
6554 \p{Terminal_Punctuation=Y}) (267)
6555 \p{Term: *} \p{Terminal_Punctuation: *}
6556 \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
6557 \p{Term}) (267)
6558 \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
6559 (1_113_845 plus all above-Unicode code
6560 points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
6561 9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
6562 U+0100..037D, U+037F..0386,
6563 U+0388..0588, U+058A..05C2, U+05C4..060B
6564 ...)
6565 \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (267:
6566 [!,.:;?], U+037E, U+0387, U+0589,
6567 U+05C3, U+060C ...)
6568 \p{Tfng} \p{Tifinagh} (= \p{Script_Extensions=
6569 Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
6570 \p{Tglg} \p{Tagalog} (= \p{Script_Extensions=
6571 Tagalog}) (NOT \p{Block=Tagalog}) (22)
6572 \p{Thaa} \p{Thaana} (= \p{Script_Extensions=
6573 Thaana}) (NOT \p{Block=Thaana}) (66)
6574 \p{Thaana} \p{Script_Extensions=Thaana} (Short:
6575 \p{Thaa}; NOT \p{Block=Thaana}) (66)
6576 \p{Thai} \p{Script_Extensions=Thai} (NOT \p{Block=
6577 Thai}) (86)
6578 \p{Tibetan} \p{Script_Extensions=Tibetan} (Short:
6579 \p{Tibt}; NOT \p{Block=Tibetan}) (207)
6580 \p{Tibt} \p{Tibetan} (= \p{Script_Extensions=
6581 Tibetan}) (NOT \p{Block=Tibetan}) (207)
6582 \p{Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
6583 \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
6584 \p{Tirh} \p{Tirhuta} (= \p{Script_Extensions=
6585 Tirhuta}) (NOT \p{Block=Tirhuta}) (97)
6586 \p{Tirhuta} \p{Script_Extensions=Tirhuta} (Short:
6587 \p{Tirh}; NOT \p{Block=Tirhuta}) (97)
6588 \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
6589 \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
6590 Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
6591 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
6592 \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
6593 (Short: \p{Lt}; /i= General_Category=
6594 Cased_Letter) (31)
6595 X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
6596 Transport_And_Map_Symbols}) (128)
6597 X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
6598 (Short: \p{InTransportAndMap}) (128)
6599 X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
6600 (= \p{Block=
6601 Unified_Canadian_Aboriginal_Syllabics})
6602 (640)
6603 X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
6604 Extended} (= \p{Block=
6605 Unified_Canadian_Aboriginal_Syllabics_-
6606 Extended}) (80)
6607 \p{Ugar} \p{Ugaritic} (= \p{Script_Extensions=
6608 Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
6609 \p{Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
6610 \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
6611 \p{UIdeo} \p{Unified_Ideograph} (=
6612 \p{Unified_Ideograph=Y}) (92_856)
6613 \p{UIdeo: *} \p{Unified_Ideograph: *}
6614 \p{Unassigned} \p{General_Category=Unassigned} (Short:
6615 \p{Cn}) (830_672 plus all above-Unicode
6616 code points)
6617 \p{Unicode} \p{Any} (1_114_112)
6618 X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
6619 Unified_Canadian_Aboriginal_Syllabics}
6620 (Short: \p{InUCAS}) (640)
6621 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
6622 Unified_Canadian_Aboriginal_Syllabics_-
6623 Extended} (Short: \p{InUCASExt}) (80)
6624 \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
6625 (92_856)
6626 \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
6627 (1_021_256 plus all above-Unicode code
6628 points: U+0000..33FF, U+4DC0..4DFF,
6629 U+9FFD..FA0D, U+FA10, U+FA12,
6630 U+FA15..FA1E ...)
6631 \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (92_856:
6632 U+3400..4DBF, U+4E00..9FFC,
6633 U+FA0E..FA0F, U+FA11, U+FA13..FA14,
6634 U+FA1F ...)
6635 \p{Unknown} \p{Script_Extensions=Unknown} (Short:
6636 \p{Zzzz}) (970_188 plus all above-
6637 Unicode code points)
6638 \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6639 Cased=Yes) (1911)
6640 \p{Upper: *} \p{Uppercase: *}
6641 \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6642 Cased=Yes) (1911)
6643 \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
6644 No) (1_112_201 plus all above-Unicode
6645 code points: [\x00-\x20!\"#\$\%&\'
6646 \(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-z\{
6647 \|\}~\x7f-\xbf\xd7\xdf-\xff], U+0101,
6648 U+0103, U+0105, U+0107, U+0109 ...)
6649 \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
6650 Yes) (1911: [A-Z\xc0-\xd6\xd8-\xde],
6651 U+0100, U+0102, U+0104, U+0106, U+0108
6652 ...)
6653 \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
6654 (Short: \p{Lu}; /i= General_Category=
6655 Cased_Letter) (1791)
6656 \p{Vai} \p{Script_Extensions=Vai} (NOT \p{Block=
6657 Vai}) (300)
6658 \p{Vaii} \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
6659 \p{Block=Vai}) (300)
6660 \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
6661 NOT \p{Variation_Selectors}) (259)
6662 \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
6663 plus all above-Unicode code points:
6664 U+0000..180A, U+180E..FDFF,
6665 U+FE10..E00FF, U+E01F0..infinity)
6666 \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259:
6667 U+180B..180D, U+FE00..FE0F,
6668 U+E0100..E01EF)
6669 X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
6670 \p{InVS}) (16)
6671 X \p{Variation_Selectors_Supplement} \p{Block=
6672 Variation_Selectors_Supplement} (Short:
6673 \p{InVSSup}) (240)
6674 X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
6675 Vedic_Extensions}) (48)
6676 X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
6677 \p{InVedicExt}) (48)
6678 X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
6679 \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
6680 (786_865 plus all above-Unicode code
6681 points)
6682 \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (786_865 plus
6683 all above-Unicode code points: [\x00-
6684 \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
6685 \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
6686 U+02EC..10FF, U+1200..1400,
6687 U+1680..18AF, U+1900..2015 ...)
6688 \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
6689 Transformed_Rotated} (47)
6690 \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
6691 (47: U+2329..232A, U+3008..3011,
6692 U+3014..301F, U+3030, U+30A0, U+30FC ...)
6693 \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
6694 (148: U+3001..3002, U+3041, U+3043,
6695 U+3045, U+3047, U+3049 ...)
6696 \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
6697 Transformed_Upright} (148)
6698 \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
6699 (327_052)
6700 \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (327_052:
6701 [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
6702 U+02EA..02EB, U+1100..11FF,
6703 U+1401..167F, U+18B0..18FF, U+2016 ...)
6704 \p{VertSpace} \v (7: [\n\cK\f\r\x85], U+2028..2029)
6705 \p{Vo: *} \p{Vertical_Orientation: *}
6706 \p{VS} \p{Variation_Selector} (=
6707 \p{Variation_Selector=Y}) (NOT
6708 \p{Variation_Selectors}) (259)
6709 \p{VS: *} \p{Variation_Selector: *}
6710 X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
6711 \p{Block=
6712 Variation_Selectors_Supplement}) (240)
6713 \p{Wancho} \p{Script_Extensions=Wancho} (Short:
6714 \p{Wcho}; NOT \p{Block=Wancho}) (59)
6715 \p{Wara} \p{Warang_Citi} (= \p{Script_Extensions=
6716 Warang_Citi}) (NOT \p{Block=
6717 Warang_Citi}) (84)
6718 \p{Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
6719 \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
6720 \p{WB: *} \p{Word_Break: *}
6721 \p{Wcho} \p{Wancho} (= \p{Script_Extensions=
6722 Wancho}) (NOT \p{Block=Wancho}) (59)
6723 \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25)
6724 \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087
6725 plus all above-Unicode code points: [^
6726 \t\n\cK\f\r\x20\x85\xa0], U+0100..167F,
6727 U+1681..1FFF, U+200B..2027,
6728 U+202A..202E, U+2030..205E ...)
6729 \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25: [\t
6730 \n\cK\f\r\x20\x85\xa0], U+1680,
6731 U+2000..200A, U+2028..2029, U+202F,
6732 U+205F ...)
6733 \p{Word} \p{XPosixWord} (134_564)
6734 \p{Word_Break: ALetter} (Short: \p{WB=LE}) (28_854: [A-Za-z\xaa
6735 \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6736 U+0100..02D7, U+02DE..02FF,
6737 U+0370..0374, U+0376..0377, U+037A..037D
6738 ...)
6739 \p{Word_Break: CR} (Short: \p{WB=CR}) (1: [\r])
6740 \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
6741 \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
6742 \p{Word_Break: E_Base} (Short: \p{WB=EB}) (0)
6743 \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0)
6744 \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0)
6745 \p{Word_Break: EB} \p{Word_Break=E_Base} (0)
6746 \p{Word_Break: EBG} \p{Word_Break=E_Base_GAZ} (0)
6747 \p{Word_Break: EM} \p{Word_Break=E_Modifier} (0)
6748 \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (11)
6749 \p{Word_Break: Extend} (Short: \p{WB=Extend}) (2399:
6750 U+0300..036F, U+0483..0489,
6751 U+0591..05BD, U+05BF, U+05C1..05C2,
6752 U+05C4..05C5 ...)
6753 \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
6754 U+203F..2040, U+2054, U+FE33..FE34,
6755 U+FE4D..FE4F ...)
6756 \p{Word_Break: FO} \p{Word_Break=Format} (62)
6757 \p{Word_Break: Format} (Short: \p{WB=FO}) (62: [\xad],
6758 U+0600..0605, U+061C, U+06DD, U+070F,
6759 U+08E2 ...)
6760 \p{Word_Break: GAZ} \p{Word_Break=Glue_After_Zwj} (0)
6761 \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0)
6762 \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75:
6763 U+05D0..05EA, U+05EF..05F2, U+FB1D,
6764 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
6765 ...)
6766 \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (75)
6767 \p{Word_Break: KA} \p{Word_Break=Katakana} (314)
6768 \p{Word_Break: Katakana} (Short: \p{WB=KA}) (314: U+3031..3035,
6769 U+309B..309C, U+30A0..30FA,
6770 U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
6771 ...)
6772 \p{Word_Break: LE} \p{Word_Break=ALetter} (28_854)
6773 \p{Word_Break: LF} (Short: \p{WB=LF}) (1: [\n])
6774 \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
6775 \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9: [:\xb7], U+0387,
6776 U+055F, U+05F4, U+2027, U+FE13 ...)
6777 \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15: [,;], U+037E,
6778 U+0589, U+060C..060D, U+066C, U+07F8 ...)
6779 \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
6780 U+2018..2019, U+2024, U+FE52, U+FF07,
6781 U+FF0E)
6782 \p{Word_Break: ML} \p{Word_Break=MidLetter} (9)
6783 \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
6784 \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
6785 U+2028..2029)
6786 \p{Word_Break: NL} \p{Word_Break=Newline} (5)
6787 \p{Word_Break: NU} \p{Word_Break=Numeric} (651)
6788 \p{Word_Break: Numeric} (Short: \p{WB=NU}) (651: [0-9],
6789 U+0660..0669, U+066B, U+06F0..06F9,
6790 U+07C0..07C9, U+0966..096F ...)
6791 \p{Word_Break: Other} (Short: \p{WB=XX}) (1_081_665 plus all
6792 above-Unicode code points: [^\n\cK\f\r
6793 \x20\"\',.0-9:;A-Z_a-z\x85\xaa\xad\xb5
6794 \xb7\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6795 U+02D8..02DD, U+0375, U+0378..0379,
6796 U+0380..0385, U+038B ...)
6797 \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
6798 U+1F1E6..1F1FF)
6799 \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
6800 \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
6801 \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
6802 \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20],
6803 U+1680, U+2000..2006, U+2008..200A,
6804 U+205F, U+3000)
6805 \p{Word_Break: XX} \p{Word_Break=Other} (1_081_665 plus all
6806 above-Unicode code points)
6807 \p{Word_Break: ZWJ} (Short: \p{WB=ZWJ}) (1: U+200D)
6808 \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
6809 \p{WSpace: *} \p{White_Space: *}
6810 \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
6811 \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
6812 (134_415)
6813 \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (979_697
6814 plus all above-Unicode code points:
6815 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
6816 \@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4
6817 \xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
6818 U+02C2..02C5, U+02D2..02DF,
6819 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
6820 \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (134_415:
6821 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
6822 \xd8-\xf6\xf8-\xff], U+0100..02C1,
6823 U+02C6..02D1, U+02E0..02E4, U+02EC,
6824 U+02EE ...)
6825 \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (131_459)
6826 \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (982_653
6827 plus all above-Unicode code points:
6828 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6829 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
6830 \xb6-\xb9\xbb-\xbf\xd7\xf7],
6831 U+02C2..02C5, U+02D2..02DF,
6832 U+02E5..02EB, U+02ED, U+02EF..036F ...)
6833 \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (131_459:
6834 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
6835 \xf8-\xff], U+0100..02C1, U+02C6..02D1,
6836 U+02E0..02E4, U+02EC, U+02EE ...)
6837 \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
6838 (134_415)
6839 \p{XIDC: *} \p{XID_Continue: *}
6840 \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (131_459)
6841 \p{XIDS: *} \p{XID_Start: *}
6842 \p{Xpeo} \p{Old_Persian} (= \p{Script_Extensions=
6843 Old_Persian}) (NOT \p{Block=
6844 Old_Persian}) (50)
6845 \p{XPerlSpace} \p{XPosixSpace} (25)
6846 \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short:
6847 \p{Alnum}) (133_525: [0-9A-Za-z\xaa\xb5
6848 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6849 U+0100..02C1, U+02C6..02D1,
6850 U+02E0..02E4, U+02EC, U+02EE ...)
6851 \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha})
6852 (132_875)
6853 \p{XPosixBlank} \h, Horizontal white space (Short:
6854 \p{Blank}) (18: [\t\x20\xa0], U+1680,
6855 U+2000..200A, U+202F, U+205F, U+3000)
6856 \p{XPosixCntrl} \p{General_Category=Control} Control
6857 characters (Short: \p{Cc}) (65)
6858 \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9]
6859 + all other decimal digits (Short:
6860 \p{Nd}) (650)
6861 \p{XPosixGraph} Characters that are graphical (Short:
6862 \p{Graph}) (281_308: [!\"#\$\%&\'
6863 \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`a-z
6864 \{\|\}~\xa1-\xff], U+0100..0377,
6865 U+037A..037F, U+0384..038A, U+038C,
6866 U+038E..03A1 ...)
6867 \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i=
6868 Cased=Yes) (2344)
6869 \p{XPosixPrint} Characters that are graphical plus space
6870 characters (but no controls) (Short:
6871 \p{Print}) (281_325: [\x20-\x7e\xa0-
6872 \xff], U+0100..0377, U+037A..037F,
6873 U+0384..038A, U+038C, U+038E..03A1 ...)
6874 \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (807:
6875 [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
6876 \^_`\{\|\}~\xa1\xa7\xab\xb6-\xb7\xbb
6877 \xbf], U+037E, U+0387, U+055A..055F,
6878 U+0589..058A, U+05BE ...)
6879 \p{XPosixSpace} \s including beyond ASCII and vertical tab
6880 (Short: \p{SpacePerl}) (25: [\t\n\cK\f
6881 \r\x20\x85\xa0], U+1680, U+2000..200A,
6882 U+2028..2029, U+202F, U+205F ...)
6883 \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i=
6884 Cased=Yes) (1911)
6885 \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} +
6886 \pM + \p{Pc} + \p{Join_Control} (Short:
6887 \p{Word}) (134_564: [0-9A-Z_a-z\xaa\xb5
6888 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6889 U+0100..02C1, U+02C6..02D1,
6890 U+02E0..02E4, U+02EC, U+02EE ...)
6891 \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
6892 \p{Xsux} \p{Cuneiform} (= \p{Script_Extensions=
6893 Cuneiform}) (NOT \p{Block=Cuneiform})
6894 (1234)
6895 \p{Yezi} \p{Yezidi} (= \p{Script_Extensions=
6896 Yezidi}) (NOT \p{Block=Yezidi}) (60)
6897 \p{Yezidi} \p{Script_Extensions=Yezidi} (Short:
6898 \p{Yezi}; NOT \p{Block=Yezidi}) (60)
6899 \p{Yi} \p{Script_Extensions=Yi} (1246)
6900 X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
6901 X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
6902 \p{Yiii} \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
6903 X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
6904 Yijing_Hexagram_Symbols}) (64)
6905 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
6906 (Short: \p{InYijing}) (64)
6907 \p{Z} \pZ \p{Separator} (= \p{General_Category=
6908 Separator}) (19)
6909 \p{Zanabazar_Square} \p{Script_Extensions=Zanabazar_Square}
6910 (Short: \p{Zanb}; NOT \p{Block=
6911 Zanabazar_Square}) (72)
6912 \p{Zanb} \p{Zanabazar_Square} (=
6913 \p{Script_Extensions=Zanabazar_Square})
6914 (NOT \p{Block=Zanabazar_Square}) (72)
6915 \p{Zinh} \p{Inherited} (= \p{Script_Extensions=
6916 Inherited}) (503)
6917 \p{Zl} \p{Line_Separator} (= \p{General_Category=
6918 Line_Separator}) (1)
6919 \p{Zp} \p{Paragraph_Separator} (=
6920 \p{General_Category=
6921 Paragraph_Separator}) (1)
6922 \p{Zs} \p{Space_Separator} (=
6923 \p{General_Category=Space_Separator})
6924 (17)
6925 \p{Zyyy} \p{Common} (= \p{Script_Extensions=
6926 Common}) (7661)
6927 \p{Zzzz} \p{Unknown} (= \p{Script_Extensions=
6928 Unknown}) (970_188 plus all above-
6929 Unicode code points)
6930
6931 Legal "\p{}" and "\P{}" constructs that match no characters
6932 Unicode has some property-value pairs that currently don't match
6933 anything. This happens generally either because they are obsolete, or
6934 they exist for symmetry with other forms, but no language has yet been
6935 encoded that uses them. In this version of Unicode, the following
6936 match zero code points:
6937
6938 \p{Canonical_Combining_Class=Attached_Below_Left}
6939 \p{Canonical_Combining_Class=CCC133}
6940 \p{Grapheme_Cluster_Break=E_Base}
6941 \p{Grapheme_Cluster_Break=E_Base_GAZ}
6942 \p{Grapheme_Cluster_Break=E_Modifier}
6943 \p{Grapheme_Cluster_Break=Glue_After_Zwj}
6944 \p{Word_Break=E_Base}
6945 \p{Word_Break=E_Base_GAZ}
6946 \p{Word_Break=E_Modifier}
6947 \p{Word_Break=Glue_After_Zwj}
6948
6950 The value of any Unicode (not including Perl extensions) character
6951 property mentioned above for any single code point is available through
6952 "charprop()" in Unicode::UCD. "charprops_all()" in Unicode::UCD
6953 returns the values of all the Unicode properties for a given code
6954 point.
6955
6956 Besides these, all the Unicode character properties mentioned above
6957 (except for those marked as for internal use by Perl) are also
6958 accessible by "prop_invlist()" in Unicode::UCD.
6959
6960 Due to their nature, not all Unicode character properties are suitable
6961 for regular expression matches, nor "prop_invlist()". The remaining
6962 non-provisional, non-internal ones are accessible via "prop_invmap()"
6963 in Unicode::UCD (except for those that this Perl installation hasn't
6964 included; see below for which those are).
6965
6966 For compatibility with other parts of Perl, all the single forms given
6967 in the table in the section above are recognized. BUT, there are some
6968 ambiguities between some Perl extensions and the Unicode properties,
6969 all of which are silently resolved in favor of the official Unicode
6970 property. To avoid surprises, you should only use "prop_invmap()" for
6971 forms listed in the table below, which omits the non-recommended ones.
6972 The affected forms are the Perl single form equivalents of Unicode
6973 properties, such as "\p{sc}" being a single-form equivalent of
6974 "\p{gc=sc}", which is treated by "prop_invmap()" as the "Script"
6975 property, whose short name is "sc". The table indicates the current
6976 ambiguities in the INFO column, beginning with the word "NOT".
6977
6978 The standard Unicode properties listed below are documented in
6979 <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
6980 documented in "prop_invmap()" in Unicode::UCD. The other Perl
6981 extensions are in "Other Properties" in perlunicode;
6982
6983 The first column in the table is a name for the property; the second
6984 column is an alternative name, if any, plus possibly some annotations.
6985 The alternative name is the property's full name, unless that would
6986 simply repeat the first column, in which case the second column
6987 indicates the property's short name (if different). The annotations
6988 are given only in the entry for the full name. The annotations for
6989 binary properties include a list of the first few ranges that the
6990 property matches. To avoid any ambiguity, the SPACE character is
6991 represented as "\x20".
6992
6993 If a property is obsolete, etc, the entry will be flagged with the same
6994 characters used in the table in the section above, like D or S.
6995
6996 NAME INFO
6997
6998 Age
6999 AHex ASCII_Hex_Digit
7000 All (Perl extension). All code points,
7001 including those above Unicode. Same as
7002 qr/./s. U+0000..infinity
7003 Alnum XPosixAlnum. (Perl extension)
7004 Alpha Alphabetic
7005 Alphabetic (Short: Alpha). [A-Za-z\xaa\xb5\xba\xc0-
7006 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7007 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7008 ...
7009 Any (Perl extension). All Unicode code
7010 points. U+0000..10FFFF
7011 ASCII Block=Basic_Latin. (Perl extension).
7012 [\x00-\x7f]
7013 ASCII_Hex_Digit (Short: AHex). [0-9A-Fa-f]
7014 Assigned (Perl extension). All assigned code
7015 points. U+0000..0377, U+037A..037F,
7016 U+0384..038A, U+038C, U+038E..03A1,
7017 U+03A3..052F ...
7018 Bc Bidi_Class
7019 Bidi_C Bidi_Control
7020 Bidi_Class (Short: bc)
7021 Bidi_Control (Short: Bidi_C). U+061C, U+200E..200F,
7022 U+202A..202E, U+2066..2069
7023 Bidi_M Bidi_Mirrored
7024 Bidi_Mirrored (Short: Bidi_M). [\(\)<>\[\]\{\}\xab
7025 \xbb], U+0F3A..0F3D, U+169B..169C,
7026 U+2039..203A, U+2045..2046, U+207D..207E
7027 ...
7028 Bidi_Mirroring_Glyph (Short: bmg)
7029 Bidi_Paired_Bracket (Short: bpb)
7030 Bidi_Paired_Bracket_Type (Short: bpt)
7031 Blank XPosixBlank. (Perl extension)
7032 Blk Block
7033 Block (Short: blk)
7034 Bmg Bidi_Mirroring_Glyph
7035 Bpb Bidi_Paired_Bracket
7036 Bpt Bidi_Paired_Bracket_Type
7037 Canonical_Combining_Class (Short: ccc)
7038 Case_Folding (Short: cf)
7039 Case_Ignorable (Short: CI). [\'.:\^`\xa8\xad\xaf\xb4
7040 \xb7-\xb8], U+02B0..036F, U+0374..0375,
7041 U+037A, U+0384..0385, U+0387 ...
7042 Cased [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7043 \xff], U+0100..01BA, U+01BC..01BF,
7044 U+01C4..0293, U+0295..02B8, U+02C0..02C1
7045 ...
7046 Category General_Category
7047 Ccc Canonical_Combining_Class
7048 CE Composition_Exclusion
7049 Cf Case_Folding; NOT 'cf' meaning
7050 'General_Category=Format'
7051 Changes_When_Casefolded (Short: CWCF). [A-Z\xb5\xc0-\xd6\xd8-
7052 \xdf], U+0100, U+0102, U+0104, U+0106,
7053 U+0108 ...
7054 Changes_When_Casemapped (Short: CWCM). [A-Za-z\xb5\xc0-\xd6\xd8-
7055 \xf6\xf8-\xff], U+0100..0137,
7056 U+0139..018C, U+018E..019A, U+019C..01A9,
7057 U+01AC..01B9 ...
7058 Changes_When_Lowercased (Short: CWL). [A-Z\xc0-\xd6\xd8-\xde],
7059 U+0100, U+0102, U+0104, U+0106, U+0108 ...
7060 Changes_When_NFKC_Casefolded (Short: CWKCF). [A-Z\xa0\xa8\xaa
7061 \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
7062 \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
7063 U+0106, U+0108 ...
7064 Changes_When_Titlecased (Short: CWT). [a-z\xb5\xdf-\xf6\xf8-
7065 \xff], U+0101, U+0103, U+0105, U+0107,
7066 U+0109 ...
7067 Changes_When_Uppercased (Short: CWU). [a-z\xb5\xdf-\xf6\xf8-
7068 \xff], U+0101, U+0103, U+0105, U+0107,
7069 U+0109 ...
7070 CI Case_Ignorable
7071 Cntrl XPosixCntrl (=General_Category=Control).
7072 (Perl extension)
7073 Comp_Ex Full_Composition_Exclusion
7074 Composition_Exclusion (Short: CE). U+0958..095F, U+09DC..09DD,
7075 U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
7076 CWCF Changes_When_Casefolded
7077 CWCM Changes_When_Casemapped
7078 CWKCF Changes_When_NFKC_Casefolded
7079 CWL Changes_When_Lowercased
7080 CWT Changes_When_Titlecased
7081 CWU Changes_When_Uppercased
7082 Dash [\-], U+058A, U+05BE, U+1400, U+1806,
7083 U+2010..2015 ...
7084 Decomposition_Mapping (Short: dm)
7085 Decomposition_Type (Short: dt)
7086 Default_Ignorable_Code_Point (Short: DI). [\xad], U+034F, U+061C,
7087 U+115F..1160, U+17B4..17B5, U+180B..180E
7088 ...
7089 Dep Deprecated
7090 Deprecated (Short: Dep). U+0149, U+0673, U+0F77,
7091 U+0F79, U+17A3..17A4, U+206A..206F ...
7092 DI Default_Ignorable_Code_Point
7093 Dia Diacritic
7094 Diacritic (Short: Dia). [\^`\xa8\xaf\xb4\xb7-\xb8],
7095 U+02B0..034E, U+0350..0357, U+035D..0362,
7096 U+0374..0375, U+037A ...
7097 Digit XPosixDigit (=General_Category=
7098 Decimal_Number). (Perl extension)
7099 Dm Decomposition_Mapping
7100 Dt Decomposition_Type
7101 Ea East_Asian_Width
7102 East_Asian_Width (Short: ea)
7103 EBase Emoji_Modifier_Base
7104 EComp Emoji_Component
7105 EMod Emoji_Modifier
7106 Emoji [#*0-9\xa9\xae], U+203C, U+2049, U+2122,
7107 U+2139, U+2194..2199 ...
7108 Emoji_Component (Short: EComp). [#*0-9], U+200D, U+20E3,
7109 U+FE0F, U+1F1E6..1F1FF, U+1F3FB..1F3FF ...
7110 Emoji_Modifier (Short: EMod). U+1F3FB..1F3FF
7111 Emoji_Modifier_Base (Short: EBase). U+261D, U+26F9,
7112 U+270A..270D, U+1F385, U+1F3C2..1F3C4,
7113 U+1F3C7 ...
7114 Emoji_Presentation (Short: EPres). U+231A..231B,
7115 U+23E9..23EC, U+23F0, U+23F3,
7116 U+25FD..25FE, U+2614..2615 ...
7117 EPres Emoji_Presentation
7118 EqUIdeo Equivalent_Unified_Ideograph
7119 Equivalent_Unified_Ideograph (Short: EqUIdeo)
7120 Ext Extender
7121 Extended_Pictographic (Short: ExtPict). [\xa9\xae], U+203C,
7122 U+2049, U+2122, U+2139, U+2194..2199 ...
7123 Extender (Short: Ext). [\xb7], U+02D0..02D1,
7124 U+0640, U+07FA, U+0B55, U+0E46 ...
7125 ExtPict Extended_Pictographic
7126 Full_Composition_Exclusion (Short: Comp_Ex). U+0340..0341,
7127 U+0343..0344, U+0374, U+037E, U+0387,
7128 U+0958..095F ...
7129 Gc General_Category
7130 GCB Grapheme_Cluster_Break
7131 General_Category (Short: gc)
7132 Gr_Base Grapheme_Base
7133 Gr_Ext Grapheme_Extend
7134 Graph XPosixGraph. (Perl extension)
7135 Grapheme_Base (Short: Gr_Base). [\x20-\x7e\xa0-\xac
7136 \xae-\xff], U+0100..02FF, U+0370..0377,
7137 U+037A..037F, U+0384..038A, U+038C ...
7138 Grapheme_Cluster_Break (Short: GCB)
7139 Grapheme_Extend (Short: Gr_Ext). U+0300..036F,
7140 U+0483..0489, U+0591..05BD, U+05BF,
7141 U+05C1..05C2, U+05C4..05C5 ...
7142 Hangul_Syllable_Type (Short: hst)
7143 Hex Hex_Digit
7144 Hex_Digit (Short: Hex). [0-9A-Fa-f], U+FF10..FF19,
7145 U+FF21..FF26, U+FF41..FF46
7146 HorizSpace XPosixBlank. (Perl extension)
7147 Hst Hangul_Syllable_Type
7148 D Hyphen [\-\xad], U+058A, U+1806, U+2010..2011,
7149 U+2E17, U+30FB ... Supplanted by
7150 Line_Break property values; see
7151 www.unicode.org/reports/tr14
7152 ID_Continue (Short: IDC). [0-9A-Z_a-z\xaa\xb5\xb7
7153 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7154 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7155 U+02EC, U+02EE ...
7156 ID_Start (Short: IDS). [A-Za-z\xaa\xb5\xba\xc0-
7157 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7158 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7159 ...
7160 IDC ID_Continue
7161 Identifier_Status
7162 Identifier_Type
7163 Ideo Ideographic
7164 Ideographic (Short: Ideo). U+3006..3007,
7165 U+3021..3029, U+3038..303A, U+3400..4DBF,
7166 U+4E00..9FFC, U+F900..FA6D ...
7167 IDS ID_Start
7168 IDS_Binary_Operator (Short: IDSB). U+2FF0..2FF1, U+2FF4..2FFB
7169 IDS_Trinary_Operator (Short: IDST). U+2FF2..2FF3
7170 IDSB IDS_Binary_Operator
7171 IDST IDS_Trinary_Operator
7172 In Present_In. (Perl extension)
7173 Indic_Positional_Category (Short: InPC)
7174 Indic_Syllabic_Category (Short: InSC)
7175 InPC Indic_Positional_Category
7176 InSC Indic_Syllabic_Category
7177 Isc ISO_Comment; NOT 'isc' meaning
7178 'General_Category=Other'
7179 ISO_Comment (Short: isc)
7180 Jg Joining_Group
7181 Join_C Join_Control
7182 Join_Control (Short: Join_C). U+200C..200D
7183 Joining_Group (Short: jg)
7184 Joining_Type (Short: jt)
7185 Jt Joining_Type
7186 Lb Line_Break
7187 Lc Lowercase_Mapping; NOT 'lc' meaning
7188 'General_Category=Cased_Letter'
7189 Line_Break (Short: lb)
7190 LOE Logical_Order_Exception
7191 Logical_Order_Exception (Short: LOE). U+0E40..0E44, U+0EC0..0EC4,
7192 U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
7193 ...
7194 Lower Lowercase
7195 Lowercase (Short: Lower). [a-z\xaa\xb5\xba\xdf-
7196 \xf6\xf8-\xff], U+0101, U+0103, U+0105,
7197 U+0107, U+0109 ...
7198 Lowercase_Mapping (Short: lc)
7199 Math [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
7200 U+03D5, U+03F0..03F1, U+03F4..03F6,
7201 U+0606..0608 ...
7202 Na Name
7203 Na1 Unicode_1_Name
7204 Name (Short: na)
7205 Name_Alias
7206 NChar Noncharacter_Code_Point
7207 NFC_QC NFC_Quick_Check
7208 NFC_Quick_Check (Short: NFC_QC)
7209 NFD_QC NFD_Quick_Check
7210 NFD_Quick_Check (Short: NFD_QC)
7211 NFKC_Casefold (Short: NFKC_CF)
7212 NFKC_CF NFKC_Casefold
7213 NFKC_QC NFKC_Quick_Check
7214 NFKC_Quick_Check (Short: NFKC_QC)
7215 NFKD_QC NFKD_Quick_Check
7216 NFKD_Quick_Check (Short: NFKD_QC)
7217 Noncharacter_Code_Point (Short: NChar). U+FDD0..FDEF,
7218 U+FFFE..FFFF, U+1FFFE..1FFFF,
7219 U+2FFFE..2FFFF, U+3FFFE..3FFFF,
7220 U+4FFFE..4FFFF ...
7221 Nt Numeric_Type
7222 Numeric_Type (Short: nt)
7223 Numeric_Value (Short: nv)
7224 Nv Numeric_Value
7225 Pat_Syn Pattern_Syntax
7226 Pat_WS Pattern_White_Space
7227 Pattern_Syntax (Short: Pat_Syn). [!\"#\$\%&\'\(\)*+,\-.
7228 \/:;<=>?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9
7229 \xab-\xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7
7230 \xf7], U+2010..2027, U+2030..203E,
7231 U+2041..2053, U+2055..205E, U+2190..245F
7232 ...
7233 Pattern_White_Space (Short: Pat_WS). [\t\n\cK\f\r\x20\x85],
7234 U+200E..200F, U+2028..2029
7235 PCM Prepended_Concatenation_Mark
7236 Perl_Decimal_Digit (Perl extension)
7237 PerlSpace PosixSpace. (Perl extension)
7238 PerlWord PosixWord. (Perl extension)
7239 PosixAlnum (Perl extension). [0-9A-Za-z]
7240 PosixAlpha (Perl extension). [A-Za-z]
7241 PosixBlank (Perl extension). [\t\x20]
7242 PosixCntrl (Perl extension). ASCII control
7243 characters. ACK, BEL, BS, CAN, CR, DC1,
7244 DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
7245 ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
7246 NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
7247 PosixDigit (Perl extension). [0-9]
7248 PosixGraph (Perl extension). [!\"#\$\%&\'\(\)*+,\-.
7249 \/0-9:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~]
7250 PosixLower (Perl extension). [a-z]
7251 PosixPrint (Perl extension). [\x20-\x7e]
7252 PosixPunct (Perl extension). [!\"#\$\%&\'\(\)*+,\-.
7253 \/:;<=>?\@\[\\\]\^_`\{\|\}~]
7254 PosixSpace (Perl extension). [\t\n\cK\f\r\x20]
7255 PosixUpper (Perl extension). [A-Z]
7256 PosixWord (Perl extension). \w, restricted to
7257 ASCII. [0-9A-Z_a-z]
7258 PosixXDigit ASCII_Hex_Digit. (Perl extension).
7259 [0-9A-Fa-f]
7260 Prepended_Concatenation_Mark (Short: PCM). U+0600..0605, U+06DD,
7261 U+070F, U+08E2, U+110BD, U+110CD
7262 Present_In (Short: In). (Perl extension)
7263 Print XPosixPrint. (Perl extension)
7264 Punct General_Category=Punctuation. (Perl
7265 extension). [!\"#\%&\'\(\)*,\-.\/:;?\@
7266 \[\\\]_\{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
7267 U+037E, U+0387, U+055A..055F,
7268 U+0589..058A, U+05BE ...
7269 QMark Quotation_Mark
7270 Quotation_Mark (Short: QMark). [\"\'\xab\xbb],
7271 U+2018..201F, U+2039..203A, U+2E42,
7272 U+300C..300F, U+301D..301F ...
7273 Radical U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
7274 Regional_Indicator (Short: RI). U+1F1E6..1F1FF
7275 RI Regional_Indicator
7276 SB Sentence_Break
7277 Sc Script; NOT 'sc' meaning
7278 'General_Category=Currency_Symbol'
7279 Scf Simple_Case_Folding
7280 Script (Short: sc)
7281 Script_Extensions (Short: scx)
7282 Scx Script_Extensions
7283 SD Soft_Dotted
7284 Sentence_Break (Short: SB)
7285 Sentence_Terminal (Short: STerm). [!.?], U+0589,
7286 U+061E..061F, U+06D4, U+0700..0702, U+07F9
7287 ...
7288 Sfc Simple_Case_Folding
7289 Simple_Case_Folding (Short: scf)
7290 Simple_Lowercase_Mapping (Short: slc)
7291 Simple_Titlecase_Mapping (Short: stc)
7292 Simple_Uppercase_Mapping (Short: suc)
7293 Slc Simple_Lowercase_Mapping
7294 Soft_Dotted (Short: SD). [i-j], U+012F, U+0249,
7295 U+0268, U+029D, U+02B2 ...
7296 Space White_Space
7297 SpacePerl XPosixSpace. (Perl extension)
7298 Stc Simple_Titlecase_Mapping
7299 STerm Sentence_Terminal
7300 Suc Simple_Uppercase_Mapping
7301 Tc Titlecase_Mapping
7302 Term Terminal_Punctuation
7303 Terminal_Punctuation (Short: Term). [!,.:;?], U+037E, U+0387,
7304 U+0589, U+05C3, U+060C ...
7305 Title Titlecase. (Perl extension)
7306 Titlecase (Short: Title). (Perl extension). (=
7307 \p{Gc=Lt}). U+01C5, U+01C8, U+01CB,
7308 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
7309 Titlecase_Mapping (Short: tc)
7310 Uc Uppercase_Mapping
7311 UIdeo Unified_Ideograph
7312 Unicode Any. (Perl extension)
7313 Unicode_1_Name (Short: na1)
7314 Unified_Ideograph (Short: UIdeo). U+3400..4DBF,
7315 U+4E00..9FFC, U+FA0E..FA0F, U+FA11,
7316 U+FA13..FA14, U+FA1F ...
7317 Upper Uppercase
7318 Uppercase (Short: Upper). [A-Z\xc0-\xd6\xd8-\xde],
7319 U+0100, U+0102, U+0104, U+0106, U+0108 ...
7320 Uppercase_Mapping (Short: uc)
7321 Variation_Selector (Short: VS). U+180B..180D, U+FE00..FE0F,
7322 U+E0100..E01EF
7323 Vertical_Orientation (Short: vo)
7324 VertSpace (Perl extension). \v. [\n\cK\f\r\x85],
7325 U+2028..2029
7326 Vo Vertical_Orientation
7327 VS Variation_Selector
7328 WB Word_Break
7329 White_Space (Short: WSpace). [\t\n\cK\f\r\x20\x85
7330 \xa0], U+1680, U+2000..200A, U+2028..2029,
7331 U+202F, U+205F ...
7332 Word XPosixWord. (Perl extension)
7333 Word_Break (Short: WB)
7334 WSpace White_Space
7335 XDigit XPosixXDigit (=Hex_Digit). (Perl
7336 extension)
7337 XID_Continue (Short: XIDC). [0-9A-Z_a-z\xaa\xb5\xb7
7338 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7339 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7340 U+02EC, U+02EE ...
7341 XID_Start (Short: XIDS). [A-Za-z\xaa\xb5\xba\xc0-
7342 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7343 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7344 ...
7345 XIDC XID_Continue
7346 XIDS XID_Start
7347 XPerlSpace XPosixSpace. (Perl extension)
7348 XPosixAlnum (Short: Alnum). (Perl extension).
7349 Alphabetic and (decimal) Numeric. [0-9A-
7350 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7351 \xff], U+0100..02C1, U+02C6..02D1,
7352 U+02E0..02E4, U+02EC, U+02EE ...
7353 XPosixAlpha Alphabetic. (Perl extension). [A-Za-z
7354 \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7355 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7356 U+02EC, U+02EE ...
7357 XPosixBlank (Short: Blank). (Perl extension). \h,
7358 Horizontal white space. [\t\x20\xa0],
7359 U+1680, U+2000..200A, U+202F, U+205F,
7360 U+3000
7361 XPosixCntrl General_Category=Control (Short: Cntrl).
7362 (Perl extension). Control characters.
7363 [\x00-\x1f\x7f-\x9f]
7364 XPosixDigit General_Category=Decimal_Number (Short:
7365 Digit). (Perl extension). [0-9] + all
7366 other decimal digits. [0-9],
7367 U+0660..0669, U+06F0..06F9, U+07C0..07C9,
7368 U+0966..096F, U+09E6..09EF ...
7369 XPosixGraph (Short: Graph). (Perl extension).
7370 Characters that are graphical. [!\"#\$
7371 \%&\'\(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]
7372 \^_`a-z\{\|\}~\xa1-\xff], U+0100..0377,
7373 U+037A..037F, U+0384..038A, U+038C,
7374 U+038E..03A1 ...
7375 XPosixLower Lowercase. (Perl extension). [a-z\xaa
7376 \xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
7377 U+0103, U+0105, U+0107, U+0109 ...
7378 XPosixPrint (Short: Print). (Perl extension).
7379 Characters that are graphical plus space
7380 characters (but no controls). [\x20-\x7e
7381 \xa0-\xff], U+0100..0377, U+037A..037F,
7382 U+0384..038A, U+038C, U+038E..03A1 ...
7383 XPosixPunct (Perl extension). \p{Punct} + ASCII-range
7384 \p{Symbol}. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7385 >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
7386 \xb7\xbb\xbf], U+037E, U+0387,
7387 U+055A..055F, U+0589..058A, U+05BE ...
7388 XPosixSpace (Perl extension). \s including beyond
7389 ASCII and vertical tab. [\t\n\cK\f\r\x20
7390 \x85\xa0], U+1680, U+2000..200A,
7391 U+2028..2029, U+202F, U+205F ...
7392 XPosixUpper Uppercase. (Perl extension). [A-Z\xc0-
7393 \xd6\xd8-\xde], U+0100, U+0102, U+0104,
7394 U+0106, U+0108 ...
7395 XPosixWord (Short: Word). (Perl extension). \w,
7396 including beyond ASCII; = \p{Alnum} + \pM
7397 + \p{Pc} + \p{Join_Control}. [0-9A-Z_a-z
7398 \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7399 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7400 U+02EC, U+02EE ...
7401 XPosixXDigit Hex_Digit (Short: XDigit). (Perl
7402 extension). [0-9A-Fa-f], U+FF10..FF19,
7403 U+FF21..FF26, U+FF41..FF46
7404
7406 Certain properties are accessible also via core function calls. These
7407 are:
7408
7409 Lowercase_Mapping lc() and lcfirst()
7410 Titlecase_Mapping ucfirst()
7411 Uppercase_Mapping uc()
7412
7413 Also, Case_Folding is accessible through the "/i" modifier in regular
7414 expressions, the "\F" transliteration escape, and the "fc" operator.
7415
7416 Besides being able to say "\p{Name=...}", the Name and Name_Aliases
7417 properties are accessible through the "\N{}" interpolation in double-
7418 quoted strings and regular expressions; and functions
7419 "charnames::viacode()", "charnames::vianame()", and
7420 "charnames::string_vianame()" (which require a "use charnames ();" to
7421 be specified.
7422
7423 Finally, most properties related to decomposition are accessible via
7424 Unicode::Normalize.
7425
7427 Perl will generate an error for a few character properties in Unicode
7428 when used in a regular expression. The non-Unihan ones are listed
7429 below, with the reasons they are not accepted, perhaps with work-
7430 arounds. The short names for the properties are listed enclosed in
7431 (parentheses). As described after the list, an installation can change
7432 the defaults and choose to accept any of these. The list is machine
7433 generated based on the choices made for the installation that generated
7434 this document.
7435
7436 Expands_On_NFC (XO_NFC)
7437 Expands_On_NFD (XO_NFD)
7438 Expands_On_NFKC (XO_NFKC)
7439 Expands_On_NFKD (XO_NFKD)
7440 Deprecated by Unicode. These are characters that expand to more
7441 than one character in the specified normalization form, but whether
7442 they actually take up more bytes or not depends on the encoding
7443 being used. For example, a UTF-8 encoded character may expand to a
7444 different number of bytes than a UTF-32 encoded character.
7445
7446 Grapheme_Link (Gr_Link)
7447 Duplicates ccc=vr (Canonical_Combining_Class=Virama)
7448
7449 Jamo_Short_Name (JSN)
7450 Other_Alphabetic (OAlpha)
7451 Other_Default_Ignorable_Code_Point (ODI)
7452 Other_Grapheme_Extend (OGr_Ext)
7453 Other_ID_Continue (OIDC)
7454 Other_ID_Start (OIDS)
7455 Other_Lowercase (OLower)
7456 Other_Math (OMath)
7457 Other_Uppercase (OUpper)
7458 Used by Unicode internally for generating other properties and not
7459 intended to be used stand-alone
7460
7461 Script=Katakana_Or_Hiragana (sc=Hrkt)
7462 Obsolete. All code points previously matched by this have been
7463 moved to "Script=Common". Consider instead using
7464 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
7465 both)
7466
7467 Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
7468 All code points that would be matched by this are matched by either
7469 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
7470
7471 An installation can choose to allow any of these to be matched by
7472 downloading the Unicode database from <http://www.unicode.org/Public/>
7473 to $Config{privlib}/unicore/ in the Perl source tree, changing the
7474 controlling lists contained in the program
7475 $Config{privlib}/unicore/mktables and then re-compiling and installing.
7476 (%Config is available from the Config module).
7477
7478 Also, perl can be recompiled to operate on an earlier version of the
7479 Unicode standard. Further information is at
7480 $Config{privlib}/unicore/README.perl.
7481
7483 The Unicode data base is delivered in two different formats. The XML
7484 version is valid for more modern Unicode releases. The other version
7485 is a collection of files. The two are intended to give equivalent
7486 information. Perl uses the older form; this allows you to recompile
7487 Perl to use early Unicode releases.
7488
7489 The only non-character property that Perl currently supports is Named
7490 Sequences, in which a sequence of code points is given a name and
7491 generally treated as a single entity. (Perl supports these via the
7492 "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
7493 in charnames, and "namedseq()" in Unicode::UCD.
7494
7495 Below is a list of the files in the Unicode data base that Perl doesn't
7496 currently use, along with very brief descriptions of their purposes.
7497 Some of the names of the files have been shortened from those that
7498 Unicode uses, in order to allow them to be distinguishable from
7499 similarly named files on file systems for which only the first 8
7500 characters of a name are significant.
7501
7502 auxiliary/GraphemeBreakTest.html
7503 auxiliary/LineBreakTest.html
7504 auxiliary/SentenceBreakTest.html
7505 auxiliary/WordBreakTest.html
7506 Documentation of validation Tests
7507
7508 BidiCharacterTest.txt
7509 BidiTest.txt
7510 NormTest.txt
7511 Validation Tests
7512
7513 CJKRadicals.txt
7514 Maps the kRSUnicode property values to corresponding code points
7515
7516 emoji/ReadMe.txt
7517 ReadMe.txt
7518 Documentation
7519
7520 EmojiSources.txt
7521 Maps certain Unicode code points to their legacy Japanese cell-
7522 phone values
7523
7524 extracted/DName.txt
7525 This file adds no new information not already present in other
7526 files
7527
7528 Index.txt
7529 Alphabetical index of Unicode characters
7530
7531 NamedSqProv.txt
7532 Named sequences proposed for inclusion in a later version of the
7533 Unicode Standard; if you need them now, you can append this file to
7534 NamedSequences.txt and recompile perl
7535
7536 NamesList.html
7537 Describes the format and contents of NamesList.txt
7538
7539 NamesList.txt
7540 Annotated list of characters
7541
7542 NormalizationCorrections.txt
7543 Documentation of corrections already incorporated into the Unicode
7544 data base
7545
7546 NushuSources.txt
7547 Specifies source material for Nushu characters
7548
7549 StandardizedVariants.html
7550 Obsoleted as of Unicode 9.0, but previously provided a visual
7551 display of the standard variant sequences derived from
7552 StandardizedVariants.txt.
7553
7554 StandardizedVariants.txt
7555 Certain glyph variations for character display are standardized.
7556 This lists the non-Unihan ones; the Unihan ones are also not used
7557 by Perl, and are in a separate Unicode data base
7558 <http://www.unicode.org/ivd>
7559
7560 TangutSources.txt
7561 Specifies source mappings for Tangut ideographs and components.
7562 This data file also includes informative radical-stroke values that
7563 are used internally by Unicode
7564
7565 USourceData.txt
7566 Documentation of status and cross reference of proposals for
7567 encoding by Unicode of Unihan characters
7568
7569 USourceGlyphs.pdf
7570 Pictures of the characters in USourceData.txt
7571
7573 <http://www.unicode.org/reports/tr44/>
7574
7575 perlrecharclass
7576
7577 perlunicode
7578
7579
7580
7581perl v5.32.1 2021-05-31 PERLUNIPROPS(1)