1PERLUNIPROPS(1) Perl Programmers Reference Guide PERLUNIPROPS(1)
2
3
4
6 perluniprops - Index of Unicode Version 14.0.0 character properties in
7 Perl
8
10 This document provides information about the portion of the Unicode
11 database that deals with character properties, that is the portion that
12 is defined on single code points. ("Other information in the Unicode
13 data base" below briefly mentions other data that Unicode provides.)
14
15 Perl can provide access to all non-provisional Unicode character
16 properties, though not all are enabled by default. The omitted ones
17 are the Unihan properties and certain deprecated or Unicode-internal
18 properties. (An installation may choose to recompile Perl's tables to
19 change this. See "Unicode character properties that are NOT accepted
20 by Perl".)
21
22 For most purposes, access to Unicode properties from the Perl core is
23 through regular expression matches, as described in the next section.
24 For some special purposes, and to access the properties that are not
25 suitable for regular expression matching, all the Unicode character
26 properties that Perl handles are accessible via the standard
27 Unicode::UCD module, as described in the section "Properties accessible
28 through Unicode::UCD".
29
30 Perl also provides some additional extensions and short-cut synonyms
31 for Unicode properties.
32
33 This document merely lists all available properties and does not
34 attempt to explain what each property really means. There is a brief
35 description of each Perl extension; see "Other Properties" in
36 perlunicode for more information on these. There is some detail about
37 Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
38 to find out about the intricacies of the official Unicode properties,
39 refer to the Unicode standard. A good starting place is
40 <http://www.unicode.org/reports/tr44/>.
41
42 Note that you can define your own properties; see "User-Defined
43 Character Properties" in perlunicode.
44
46 The Perl regular expression "\p{}" and "\P{}" constructs give access to
47 most of the Unicode character properties. The table below shows all
48 these constructs, both single and compound forms.
49
50 Compound forms consist of two components, separated by an equals sign
51 or a colon. The first component is the property name, and the second
52 component is the particular value of the property to match against, for
53 example, "\p{Script_Extensions: Greek}" and
54 "\p{Script_Extensions=Greek}" both mean to match characters whose
55 Script_Extensions property value is Greek. ("Script_Extensions" is an
56 improved version of the "Script" property.)
57
58 Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
59 their equivalent compound forms. The table shows these equivalences.
60 (In our example, "\p{Greek}" is a just a shortcut for
61 "\p{Script_Extensions=Greek}"). There are also a few Perl-defined
62 single forms that are not shortcuts for a compound form. One such is
63 "\p{Word}". These are also listed in the table.
64
65 In parsing these constructs, Perl always ignores Upper/lower case
66 differences everywhere within the {braces}. Thus "\p{Greek}" means the
67 same thing as "\p{greek}". But note that changing the case of the "p"
68 or "P" before the left brace completely changes the meaning of the
69 construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
70 Casing in this document is for improved legibility.
71
72 Also, white space, hyphens, and underscores are normally ignored
73 everywhere between the {braces}, and hence can be freely added or
74 removed even if the "/x" modifier hasn't been specified on the regular
75 expression. But in the table below a 'T' at the beginning of an entry
76 means that tighter (stricter) rules are used for that entry:
77
78 Single form ("\p{name}") tighter rules:
79 White space, hyphens, and underscores ARE significant except
80 for:
81
82 • white space adjacent to a non-word character
83
84 • underscores separating digits in numbers
85
86 That means, for example, that you can freely add or remove
87 white space adjacent to (but within) the braces without
88 affecting the meaning.
89
90 Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
91 The tighter rules given above for the single form apply to
92 everything to the right of the colon or equals; the looser
93 rules still apply to everything to the left.
94
95 That means, for example, that you can freely add or remove
96 white space adjacent to (but within) the braces and the colon
97 or equal sign.
98
99 Some properties are considered obsolete by Unicode, but still
100 available. There are several varieties of obsolescence:
101
102 Stabilized
103 A property may be stabilized. Such a determination does not
104 indicate that the property should or should not be used;
105 instead it is a declaration that the property will not be
106 maintained nor extended for newly encoded characters. Such
107 properties are marked with an 'S' in the table.
108
109 Deprecated
110 A property may be deprecated, perhaps because its original
111 intent has been replaced by another property, or because its
112 specification was somehow defective. This means that its use
113 is strongly discouraged, so much so that a warning will be
114 issued if used, unless the regular expression is in the scope
115 of a "no warnings 'deprecated'" statement. A 'D' flags each
116 such entry in the table, and the entry there for the longest,
117 most descriptive version of the property will give the reason
118 it is deprecated, and perhaps advice. Perl may issue such a
119 warning, even for properties that aren't officially deprecated
120 by Unicode, when there used to be characters or code points
121 that were matched by them, but no longer. This is to warn you
122 that your program may not work like it did on earlier Unicode
123 releases.
124
125 A deprecated property may be made unavailable in a future Perl
126 version, so it is best to move away from them.
127
128 A deprecated property may also be stabilized, but this fact is
129 not shown.
130
131 Obsolete
132 Properties marked with an 'O' in the table are considered
133 (plain) obsolete. Generally this designation is given to
134 properties that Unicode once used for internal purposes (but
135 not any longer).
136
137 Discouraged
138 This is not actually a Unicode-specified obsolescence, but
139 applies to certain Perl extensions that are present for
140 backwards compatibility, but are discouraged from being used.
141 These are not obsolete, but their meanings are not stable.
142 Future Unicode versions could force any of these extensions to
143 be removed without warning, replaced by another property with
144 the same name that means something different. An 'X' flags
145 each such entry in the table. Use the equivalent shown
146 instead.
147
148 In particular, matches in the Block property have single forms
149 defined by Perl that begin with "In_", ""Is_", or even with no
150 prefix at all, Like all DISCOURAGED forms, these are not
151 stable. For example, "\p{Block=Deseret}" can currently be
152 written as "\p{In_Deseret}", "\p{Is_Deseret}", or
153 "\p{Deseret}". But, a new Unicode version may come along that
154 would force Perl to change the meaning of one or more of these,
155 and your program would no longer be correct. Currently there
156 are no such conflicts with the form that begins "In_", but
157 there are many with the other two shortcuts, and Unicode
158 continues to define new properties that begin with "In", so
159 it's quite possible that a conflict will occur in the future.
160 The compound form is guaranteed to not become obsolete, and its
161 meaning is clearer anyway. See "Blocks" in perlunicode for
162 more information about this.
163
164 User-defined properties must begin with "In" or "Is". These
165 override any Unicode property of the same name.
166
167 The table below has two columns. The left column contains the "\p{}"
168 constructs to look up, possibly preceded by the flags mentioned above;
169 and the right column contains information about them, like a
170 description, or synonyms. The table shows both the single and compound
171 forms for each property that has them. If the left column is a short
172 name for a property, the right column will give its longer, more
173 descriptive name; and if the left column is the longest name, the right
174 column will show any equivalent shortest name, in both single and
175 compound forms if applicable.
176
177 If braces are not needed to specify a property (e.g., "\pL"), the left
178 column contains both forms, with and without braces.
179
180 The right column will also caution you if a property means something
181 different than what might normally be expected.
182
183 All single forms are Perl extensions; a few compound forms are as well,
184 and are noted as such.
185
186 Numbers in (parentheses) indicate the total number of Unicode code
187 points matched by the property. For the entries that give the longest,
188 most descriptive version of the property, the count is followed by a
189 list of some of the code points matched by it. The list includes all
190 the matched characters in the 0-255 range, enclosed in the familiar
191 [brackets] the same as a regular expression bracketed character class.
192 Following that, the next few higher matching ranges are also given. To
193 avoid visual ambiguity, the SPACE character is represented as "\x20".
194
195 For emphasis, those properties that match no code points at all are
196 listed as well in a separate section following the table.
197
198 Most properties match the same code points regardless of whether "/i"
199 case-insensitive matching is specified or not. But a few properties
200 are affected. These are shown with the notation "(/i= other_property)"
201 in the second column. Under case-insensitive matching they match the
202 same code pode points as the property other_property.
203
204 There is no description given for most non-Perl defined properties (See
205 <http://www.unicode.org/reports/tr44/> for that).
206
207 For compactness, '*' is used as a wildcard instead of showing all
208 possible combinations. For example, entries like:
209
210 \p{Gc: *} \p{General_Category: *}
211
212 mean that 'Gc' is a synonym for 'General_Category', and anything that
213 is valid for the latter is also valid for the former. Similarly,
214
215 \p{Is_*} \p{*}
216
217 means that if and only if, for example, "\p{Foo}" exists, then
218 "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
219 thing. And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
220 and "\p{IsFoo=Bar}". "*" here is restricted to something not beginning
221 with an underscore.
222
223 Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
224 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table
225 shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
226 for the other possibilities. Note that not all properties which have
227 values 'Yes' and 'No' are binary, and they have all their values
228 spelled out without using this wild card, and a "NOT" clause in their
229 description that highlights their not being binary. These also require
230 the compound form to match them, whereas true binary properties have
231 both single and compound forms available.
232
233 Note that all non-essential underscores are removed in the display of
234 the short names below.
235
236 Legend summary:
237
238 * is a wild-card
239 (\d+) in the info column gives the number of Unicode code points
240 matched by this property.
241 D means this is deprecated.
242 O means this is obsolete.
243 S means this is stabilized.
244 T means tighter (stricter) name matching applies.
245 X means use of this form is discouraged, and may not be stable.
246
247 NAME INFO
248
249 \p{Adlam} \p{Script_Extensions=Adlam} (Short:
250 \p{Adlm}; NOT \p{Block=Adlam}) (90)
251 \p{Adlm} \p{Adlam} (= \p{Script_Extensions=Adlam})
252 (NOT \p{Block=Adlam}) (90)
253 X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64)
254 T \p{Age: 1.1} \p{Age=V1_1} (33_979)
255 \p{Age: V1_1} Code point's usage introduced in version
256 1.1 (33_979: U+0000..01F5, U+01FA..0217,
257 U+0250..02A8, U+02B0..02DE,
258 U+02E0..02E9, U+0300..0345 ...)
259 T \p{Age: 2.0} \p{Age=V2_0} (144_521)
260 \p{Age: V2_0} Code point's usage was introduced in
261 version 2.0; See also Property
262 'Present_In' (144_521: U+0591..05A1,
263 U+05A3..05AF, U+05C4, U+0F00..0F47,
264 U+0F49..0F69, U+0F71..0F8B ...)
265 T \p{Age: 2.1} \p{Age=V2_1} (2)
266 \p{Age: V2_1} Code point's usage was introduced in
267 version 2.1; See also Property
268 'Present_In' (2: U+20AC, U+FFFC)
269 T \p{Age: 3.0} \p{Age=V3_0} (10_307)
270 \p{Age: V3_0} Code point's usage was introduced in
271 version 3.0; See also Property
272 'Present_In' (10_307: U+01F6..01F9,
273 U+0218..021F, U+0222..0233,
274 U+02A9..02AD, U+02DF, U+02EA..02EE ...)
275 T \p{Age: 3.1} \p{Age=V3_1} (44_978)
276 \p{Age: V3_1} Code point's usage was introduced in
277 version 3.1; See also Property
278 'Present_In' (44_978: U+03F4..03F5,
279 U+FDD0..FDEF, U+10300..1031E,
280 U+10320..10323, U+10330..1034A,
281 U+10400..10425 ...)
282 T \p{Age: 3.2} \p{Age=V3_2} (1016)
283 \p{Age: V3_2} Code point's usage was introduced in
284 version 3.2; See also Property
285 'Present_In' (1016: U+0220, U+034F,
286 U+0363..036F, U+03D8..03D9, U+03F6,
287 U+048A..048B ...)
288 T \p{Age: 4.0} \p{Age=V4_0} (1226)
289 \p{Age: V4_0} Code point's usage was introduced in
290 version 4.0; See also Property
291 'Present_In' (1226: U+0221,
292 U+0234..0236, U+02AE..02AF,
293 U+02EF..02FF, U+0350..0357, U+035D..035F
294 ...)
295 T \p{Age: 4.1} \p{Age=V4_1} (1273)
296 \p{Age: V4_1} Code point's usage was introduced in
297 version 4.1; See also Property
298 'Present_In' (1273: U+0237..0241,
299 U+0358..035C, U+03FC..03FF,
300 U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
301 T \p{Age: 5.0} \p{Age=V5_0} (1369)
302 \p{Age: V5_0} Code point's usage was introduced in
303 version 5.0; See also Property
304 'Present_In' (1369: U+0242..024F,
305 U+037B..037D, U+04CF, U+04FA..04FF,
306 U+0510..0513, U+05BA ...)
307 T \p{Age: 5.1} \p{Age=V5_1} (1624)
308 \p{Age: V5_1} Code point's usage was introduced in
309 version 5.1; See also Property
310 'Present_In' (1624: U+0370..0373,
311 U+0376..0377, U+03CF, U+0487,
312 U+0514..0523, U+0606..060A ...)
313 T \p{Age: 5.2} \p{Age=V5_2} (6648)
314 \p{Age: V5_2} Code point's usage was introduced in
315 version 5.2; See also Property
316 'Present_In' (6648: U+0524..0525,
317 U+0800..082D, U+0830..083E, U+0900,
318 U+094E, U+0955 ...)
319 T \p{Age: 6.0} \p{Age=V6_0} (2088)
320 \p{Age: V6_0} Code point's usage was introduced in
321 version 6.0; See also Property
322 'Present_In' (2088: U+0526..0527,
323 U+0620, U+065F, U+0840..085B, U+085E,
324 U+093A..093B ...)
325 T \p{Age: 6.1} \p{Age=V6_1} (732)
326 \p{Age: V6_1} Code point's usage was introduced in
327 version 6.1; See also Property
328 'Present_In' (732: U+058F, U+0604,
329 U+08A0, U+08A2..08AC, U+08E4..08FE,
330 U+0AF0 ...)
331 T \p{Age: 6.2} \p{Age=V6_2} (1)
332 \p{Age: V6_2} Code point's usage was introduced in
333 version 6.2; See also Property
334 'Present_In' (1: U+20BA)
335 T \p{Age: 6.3} \p{Age=V6_3} (5)
336 \p{Age: V6_3} Code point's usage was introduced in
337 version 6.3; See also Property
338 'Present_In' (5: U+061C, U+2066..2069)
339 T \p{Age: 7.0} \p{Age=V7_0} (2834)
340 \p{Age: V7_0} Code point's usage was introduced in
341 version 7.0; See also Property
342 'Present_In' (2834: U+037F,
343 U+0528..052F, U+058D..058E, U+0605,
344 U+08A1, U+08AD..08B2 ...)
345 T \p{Age: 8.0} \p{Age=V8_0} (7716)
346 \p{Age: V8_0} Code point's usage was introduced in
347 version 8.0; See also Property
348 'Present_In' (7716: U+08B3..08B4,
349 U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
350 ...)
351 T \p{Age: 9.0} \p{Age=V9_0} (7500)
352 \p{Age: V9_0} Code point's usage was introduced in
353 version 9.0; See also Property
354 'Present_In' (7500: U+08B6..08BD,
355 U+08D4..08E2, U+0C80, U+0D4F,
356 U+0D54..0D56, U+0D58..0D5E ...)
357 T \p{Age: 10.0} \p{Age=V10_0} (8518)
358 \p{Age: V10_0} Code point's usage was introduced in
359 version 10.0; See also Property
360 'Present_In' (8518: U+0860..086A,
361 U+09FC..09FD, U+0AFA..0AFF, U+0D00,
362 U+0D3B..0D3C, U+1CF7 ...)
363 T \p{Age: 11.0} \p{Age=V11_0} (684)
364 \p{Age: V11_0} Code point's usage was introduced in
365 version 11.0; See also Property
366 'Present_In' (684: U+0560, U+0588,
367 U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
368 T \p{Age: 12.0} \p{Age=V12_0} (554)
369 \p{Age: V12_0} Code point's usage was introduced in
370 version 12.0; See also Property
371 'Present_In' (554: U+0C77, U+0E86,
372 U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
373 T \p{Age: 12.1} \p{Age=V12_1} (1)
374 \p{Age: V12_1} Code point's usage was introduced in
375 version 12.1; See also Property
376 'Present_In' (1: U+32FF)
377 T \p{Age: 13.0} \p{Age=V13_0} (5930)
378 \p{Age: V13_0} Code point's usage was introduced in
379 version 13.0; See also Property
380 'Present_In' (5930: U+08BE..08C7,
381 U+0B55, U+0D04, U+0D81, U+1ABF..1AC0,
382 U+2B97 ...)
383 T \p{Age: 14.0} \p{Age=V14_0} (838)
384 \p{Age: V14_0} Code point's usage was introduced in
385 version 14.0; See also Property
386 'Present_In' (838: U+061D, U+0870..088E,
387 U+0890..0891, U+0898..089F, U+08B5,
388 U+08C8..08D2 ...)
389 \p{Age: NA} \p{Age=Unassigned} (829_768 plus all
390 above-Unicode code points)
391 \p{Age: Unassigned} Code point's usage has not been assigned
392 in any Unicode release thus far.
393 (Short: \p{Age=NA}) (829_768 plus all above-Unicode code points:
394 U+0378..0379, U+0380..0383, U+038B,
395 U+038D, U+03A2, U+0530 ...)
396 \p{Aghb} \p{Caucasian_Albanian} (=
397 \p{Script_Extensions=
398 Caucasian_Albanian}) (NOT \p{Block=
399 Caucasian_Albanian}) (53)
400 \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
401 (22)
402 \p{AHex: *} \p{ASCII_Hex_Digit: *}
403 \p{Ahom} \p{Script_Extensions=Ahom} (NOT \p{Block=
404 Ahom}) (65)
405 X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block=
406 Alchemical_Symbols}) (128)
407 X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
408 \p{InAlchemical}) (128)
409 \p{All} All code points, including those above
410 Unicode. Same as qr/./s (1_114_112 plus
411 all above-Unicode code points:
412 U+0000..infinity)
413 \p{Alnum} \p{XPosixAlnum} (134_056)
414 \p{Alpha} \p{XPosixAlpha} (= \p{Alphabetic=Y})
415 (133_396)
416 \p{Alpha: *} \p{Alphabetic: *}
417 \p{Alphabetic} \p{XPosixAlpha} (= \p{Alphabetic=Y})
418 (133_396)
419 \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (980_716
420 plus all above-Unicode code points:
421 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
422 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
423 \xb6-\xb9\xbb-\xbf\xd7\xf7],
424 U+02C2..02C5, U+02D2..02DF,
425 U+02E5..02EB, U+02ED, U+02EF..0344 ...)
426 \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (133_396:
427 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
428 \xf8-\xff], U+0100..02C1, U+02C6..02D1,
429 U+02E0..02E4, U+02EC, U+02EE ...)
430 X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (=
431 \p{Block=Alphabetic_Presentation_Forms})
432 (80)
433 X \p{Alphabetic_Presentation_Forms} \p{Block=
434 Alphabetic_Presentation_Forms} (Short:
435 \p{InAlphabeticPF}) (80)
436 \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
437 Anatolian_Hieroglyphs} (Short: \p{Hluw};
438 NOT \p{Block=Anatolian_Hieroglyphs})
439 (583)
440 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
441 \p{Block=
442 Ancient_Greek_Musical_Notation}) (80)
443 X \p{Ancient_Greek_Musical_Notation} \p{Block=
444 Ancient_Greek_Musical_Notation} (Short:
445 \p{InAncientGreekMusic}) (80)
446 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
447 X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64)
448 \p{Any} All Unicode code points (1_114_112:
449 U+0000..10FFFF)
450 \p{Arab} \p{Arabic} (= \p{Script_Extensions=
451 Arabic}) (NOT \p{Block=Arabic}) (1411)
452 \p{Arabic} \p{Script_Extensions=Arabic} (Short:
453 \p{Arab}; NOT \p{Block=Arabic}) (1411)
454 X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block=
455 Arabic_Extended_A}) (96)
456 X \p{Arabic_Ext_B} \p{Arabic_Extended_B} (= \p{Block=
457 Arabic_Extended_B}) (48)
458 X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short:
459 \p{InArabicExtA}) (96)
460 X \p{Arabic_Extended_B} \p{Block=Arabic_Extended_B} (Short:
461 \p{InArabicExtB}) (48)
462 X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols}
463 (= \p{Block=
464 Arabic_Mathematical_Alphabetic_Symbols})
465 (256)
466 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
467 Arabic_Mathematical_Alphabetic_Symbols}
468 (Short: \p{InArabicMath}) (256)
469 X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (=
470 \p{Block=Arabic_Presentation_Forms_A})
471 (688)
472 X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (=
473 \p{Block=Arabic_Presentation_Forms_B})
474 (144)
475 X \p{Arabic_Presentation_Forms_A} \p{Block=
476 Arabic_Presentation_Forms_A} (Short:
477 \p{InArabicPFA}) (688)
478 X \p{Arabic_Presentation_Forms_B} \p{Block=
479 Arabic_Presentation_Forms_B} (Short:
480 \p{InArabicPFB}) (144)
481 X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block=
482 Arabic_Supplement}) (48)
483 X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short:
484 \p{InArabicSup}) (48)
485 \p{Armenian} \p{Script_Extensions=Armenian} (Short:
486 \p{Armn}; NOT \p{Block=Armenian}) (96)
487 \p{Armi} \p{Imperial_Aramaic} (=
488 \p{Script_Extensions=Imperial_Aramaic})
489 (NOT \p{Block=Imperial_Aramaic}) (31)
490 \p{Armn} \p{Armenian} (= \p{Script_Extensions=
491 Armenian}) (NOT \p{Block=Armenian}) (96)
492 X \p{Arrows} \p{Block=Arrows} (112)
493 \p{ASCII} \p{Block=Basic_Latin} (128)
494 \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
495 (22)
496 \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
497 plus all above-Unicode code points:
498 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
499 \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
500 U+0100..infinity)
501 \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
502 Fa-f])
503 \p{Assigned} All assigned code points (284_278:
504 U+0000..0377, U+037A..037F,
505 U+0384..038A, U+038C, U+038E..03A1,
506 U+03A3..052F ...)
507 \p{Avestan} \p{Script_Extensions=Avestan} (Short:
508 \p{Avst}; NOT \p{Block=Avestan}) (61)
509 \p{Avst} \p{Avestan} (= \p{Script_Extensions=
510 Avestan}) (NOT \p{Block=Avestan}) (61)
511 \p{Bali} \p{Balinese} (= \p{Script_Extensions=
512 Balinese}) (NOT \p{Block=Balinese}) (124)
513 \p{Balinese} \p{Script_Extensions=Balinese} (Short:
514 \p{Bali}; NOT \p{Block=Balinese}) (124)
515 \p{Bamu} \p{Bamum} (= \p{Script_Extensions=Bamum})
516 (NOT \p{Block=Bamum}) (657)
517 \p{Bamum} \p{Script_Extensions=Bamum} (Short:
518 \p{Bamu}; NOT \p{Block=Bamum}) (657)
519 X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block=
520 Bamum_Supplement}) (576)
521 X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short:
522 \p{InBamumSup}) (576)
523 X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128)
524 \p{Bass} \p{Bassa_Vah} (= \p{Script_Extensions=
525 Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
526 (36)
527 \p{Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
528 \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
529 \p{Batak} \p{Script_Extensions=Batak} (Short:
530 \p{Batk}; NOT \p{Block=Batak}) (56)
531 \p{Batk} \p{Batak} (= \p{Script_Extensions=Batak})
532 (NOT \p{Block=Batak}) (56)
533 \p{Bc: *} \p{Bidi_Class: *}
534 \p{Beng} \p{Bengali} (= \p{Script_Extensions=
535 Bengali}) (NOT \p{Block=Bengali}) (113)
536 \p{Bengali} \p{Script_Extensions=Bengali} (Short:
537 \p{Beng}; NOT \p{Block=Bengali}) (113)
538 \p{Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
539 \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
540 \p{Bhks} \p{Bhaiksuki} (= \p{Script_Extensions=
541 Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
542 (97)
543 \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y})
544 (12)
545 \p{Bidi_C: *} \p{Bidi_Control: *}
546 \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1708)
547 \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (63)
548 \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1708: U+0608,
549 U+060B, U+060D, U+061B..064A,
550 U+066D..066F, U+0671..06D5 ...)
551 \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (63:
552 U+0600..0605, U+0660..0669,
553 U+066B..066C, U+06DD, U+0890..0891,
554 U+08E2 ...)
555 \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7)
556 \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4016)
557 \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016: [^\t\n
558 \cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-\xff],
559 U+180E, U+200B..200D, U+2060..2065,
560 U+206A..206F, U+FDD0..FDEF ...)
561 \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15: [,.\/:
562 \xa0], U+060C, U+202F, U+2044, U+FE50,
563 U+FE52 ...)
564 \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15)
565 \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (168)
566 \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12)
567 \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (92)
568 \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (168: [0-9\xb2-
569 \xb3\xb9], U+06F0..06F9, U+2070,
570 U+2074..2079, U+2080..2089, U+2488..249B
571 ...)
572 \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
573 U+207A..207B, U+208A..208B, U+2212,
574 U+FB29, U+FE62..FE63 ...)
575 \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92: [#\$
576 \%\xa2-\xa5\xb0-\xb1], U+058F,
577 U+0609..060A, U+066A, U+09F2..09F3,
578 U+09FB ...)
579 \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
580 U+2068)
581 \p{Bidi_Class: FSI} \p{Bidi_Class=First_Strong_Isolate} (1)
582 \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_096_333
583 plus all above-Unicode code points)
584 \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_333 plus
585 all above-Unicode code points: [A-Za-z
586 \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
587 \xff], U+0100..02B8, U+02BB..02C1,
588 U+02D0..02D1, U+02E0..02E4, U+02EE ...)
589 \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
590 U+202A)
591 \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
592 U+2066)
593 \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
594 U+202D)
595 \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1)
596 \p{Bidi_Class: LRI} \p{Bidi_Class=Left_To_Right_Isolate} (1)
597 \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1)
598 \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1958:
599 U+0300..036F, U+0483..0489,
600 U+0591..05BD, U+05BF, U+05C1..05C2,
601 U+05C4..05C5 ...)
602 \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1958)
603 \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (6000)
604 \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (6000: [!\"&\'
605 \(\)*;<=>?\@\[\\\]\^_`\{\|\}~\xa1\xa6-
606 \xa9\xab-\xac\xae-\xaf\xb4\xb6-\xb8\xbb-
607 \xbf\xd7\xf7], U+02B9..02BA,
608 U+02C2..02CF, U+02D2..02DF,
609 U+02E5..02ED, U+02EF..02FF ...)
610 \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7: [\n\r
611 \x1c-\x1e\x85], U+2029)
612 \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1)
613 \p{Bidi_Class: PDI} \p{Bidi_Class=Pop_Directional_Isolate} (1)
614 \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
615 U+202C)
616 \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
617 U+2069)
618 \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (3711)
619 \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3711: U+0590,
620 U+05BE, U+05C0, U+05C3, U+05C6,
621 U+05C8..05FF ...)
622 \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
623 U+202B)
624 \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
625 U+2067)
626 \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
627 U+202E)
628 \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1)
629 \p{Bidi_Class: RLI} \p{Bidi_Class=Right_To_Left_Isolate} (1)
630 \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1)
631 \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3)
632 \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3: [\t\cK
633 \x1f])
634 \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
635 U+1680, U+2000..200A, U+2028, U+205F,
636 U+3000)
637 \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (17)
638 \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
639 \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
640 plus all above-Unicode code points:
641 U+0000..061B, U+061D..200D,
642 U+2010..2029, U+202F..2065,
643 U+206A..infinity)
644 \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (12:
645 U+061C, U+200E..200F, U+202A..202E,
646 U+2066..2069)
647 \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
648 (553)
649 \p{Bidi_M: *} \p{Bidi_Mirrored: *}
650 \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
651 (553)
652 \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_559
653 plus all above-Unicode code points:
654 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
655 Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
656 \xff], U+0100..0F39, U+0F3E..169A,
657 U+169D..2038, U+203B..2044, U+2047..207C
658 ...)
659 \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (553:
660 [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
661 U+169B..169C, U+2039..203A,
662 U+2045..2046, U+207D..207E ...)
663 \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
664 (64)
665 \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (64: [\)\]
666 \}], U+0F3B, U+0F3D, U+169C, U+2046,
667 U+207E ...)
668 \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
669 (1_113_984 plus all above-Unicode code
670 points)
671 \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_984
672 plus all above-Unicode code points:
673 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=>?
674 \@A-Z\\\^_`a-z\|~\x7f-\xff],
675 U+0100..0F39, U+0F3E..169A,
676 U+169D..2044, U+2047..207C, U+207F..208C
677 ...)
678 \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
679 (64)
680 \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (64:
681 [\(\[\{], U+0F3A, U+0F3C, U+169B,
682 U+2045, U+207D ...)
683 \p{Blank} \p{XPosixBlank} (18)
684 \p{Blk: *} \p{Block: *}
685 \p{Block: Adlam} (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
686 U+1E900..1E95F)
687 \p{Block: Aegean_Numbers} (64: U+10100..1013F)
688 \p{Block: Ahom} (NOT \p{Ahom} NOR \p{Is_Ahom}) (80:
689 U+11700..1174F)
690 \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128)
691 \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
692 U+1F700..1F77F)
693 \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
694 (80)
695 \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
696 AlphabeticPF}) (80: U+FB00..FB4F)
697 \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
698 NOR \p{Is_Anatolian_Hieroglyphs}) (640:
699 U+14400..1467F)
700 \p{Block: Ancient_Greek_Music} \p{Block=
701 Ancient_Greek_Musical_Notation} (80)
702 \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
703 AncientGreekMusic}) (80: U+1D200..1D24F)
704 \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
705 \p{Block: Ancient_Symbols} (64: U+10190..101CF)
706 \p{Block: Arabic} (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
707 U+0600..06FF)
708 \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
709 \p{Block: Arabic_Ext_B} \p{Block=Arabic_Extended_B} (48)
710 \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
711 U+08A0..08FF)
712 \p{Block: Arabic_Extended_B} (Short: \p{Blk=ArabicExtB}) (48:
713 U+0870..089F)
714 \p{Block: Arabic_Math} \p{Block=
715 Arabic_Mathematical_Alphabetic_Symbols}
716 (256)
717 \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
718 ArabicMath}) (256: U+1EE00..1EEFF)
719 \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688)
720 \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144)
721 \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
722 (688: U+FB50..FDFF)
723 \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
724 (144: U+FE70..FEFF)
725 \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48)
726 \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
727 U+0750..077F)
728 \p{Block: Armenian} (NOT \p{Armenian} NOR \p{Is_Armenian})
729 (96: U+0530..058F)
730 \p{Block: Arrows} (112: U+2190..21FF)
731 \p{Block: ASCII} \p{Block=Basic_Latin} (128)
732 \p{Block: Avestan} (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
733 U+10B00..10B3F)
734 \p{Block: Balinese} (NOT \p{Balinese} NOR \p{Is_Balinese})
735 (128: U+1B00..1B7F)
736 \p{Block: Bamum} (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
737 U+A6A0..A6FF)
738 \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576)
739 \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
740 U+16800..16A3F)
741 \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
742 \p{Block: Bassa_Vah} (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
743 (48: U+16AD0..16AFF)
744 \p{Block: Batak} (NOT \p{Batak} NOR \p{Is_Batak}) (64:
745 U+1BC0..1BFF)
746 \p{Block: Bengali} (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
747 U+0980..09FF)
748 \p{Block: Bhaiksuki} (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
749 (112: U+11C00..11C6F)
750 \p{Block: Block_Elements} (32: U+2580..259F)
751 \p{Block: Bopomofo} (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
752 (48: U+3100..312F)
753 \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
754 \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
755 U+31A0..31BF)
756 \p{Block: Box_Drawing} (128: U+2500..257F)
757 \p{Block: Brahmi} (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
758 U+11000..1107F)
759 \p{Block: Braille} \p{Block=Braille_Patterns} (256)
760 \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
761 U+2800..28FF)
762 \p{Block: Buginese} (NOT \p{Buginese} NOR \p{Is_Buginese})
763 (32: U+1A00..1A1F)
764 \p{Block: Buhid} (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
765 U+1740..175F)
766 \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
767 (256)
768 \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
769 ByzantineMusic}) (256: U+1D000..1D0FF)
770 \p{Block: Canadian_Syllabics} \p{Block=
771 Unified_Canadian_Aboriginal_Syllabics}
772 (640)
773 \p{Block: Carian} (NOT \p{Carian} NOR \p{Is_Carian}) (64:
774 U+102A0..102DF)
775 \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
776 \p{Is_Caucasian_Albanian}) (64:
777 U+10530..1056F)
778 \p{Block: Chakma} (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
779 U+11100..1114F)
780 \p{Block: Cham} (NOT \p{Cham} NOR \p{Is_Cham}) (96:
781 U+AA00..AA5F)
782 \p{Block: Cherokee} (NOT \p{Cherokee} NOR \p{Is_Cherokee})
783 (96: U+13A0..13FF)
784 \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
785 \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
786 U+AB70..ABBF)
787 \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
788 \p{Block: Chorasmian} (NOT \p{Chorasmian} NOR \p{Is_Chorasmian})
789 (48: U+10FB0..10FDF)
790 \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992)
791 \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256)
792 \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
793 \p{Block: CJK_Compat_Ideographs} \p{Block=
794 CJK_Compatibility_Ideographs} (512)
795 \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
796 CJK_Compatibility_Ideographs_Supplement}
797 (544)
798 \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
799 U+3300..33FF)
800 \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
801 (32: U+FE30..FE4F)
802 \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
803 CJKCompatIdeographs}) (512: U+F900..FAFF)
804 \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
805 CJKCompatIdeographsSup}) (544:
806 U+2F800..2FA1F)
807 \p{Block: CJK_Ext_A} \p{Block=
808 CJK_Unified_Ideographs_Extension_A}
809 (6592)
810 \p{Block: CJK_Ext_B} \p{Block=
811 CJK_Unified_Ideographs_Extension_B}
812 (42_720)
813 \p{Block: CJK_Ext_C} \p{Block=
814 CJK_Unified_Ideographs_Extension_C}
815 (4160)
816 \p{Block: CJK_Ext_D} \p{Block=
817 CJK_Unified_Ideographs_Extension_D} (224)
818 \p{Block: CJK_Ext_E} \p{Block=
819 CJK_Unified_Ideographs_Extension_E}
820 (5776)
821 \p{Block: CJK_Ext_F} \p{Block=
822 CJK_Unified_Ideographs_Extension_F}
823 (7488)
824 \p{Block: CJK_Ext_G} \p{Block=
825 CJK_Unified_Ideographs_Extension_G}
826 (4944)
827 \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
828 \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
829 (128: U+2E80..2EFF)
830 \p{Block: CJK_Strokes} (48: U+31C0..31EF)
831 \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64)
832 \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
833 (64: U+3000..303F)
834 \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
835 U+4E00..9FFF)
836 \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
837 CJKExtA}) (6592: U+3400..4DBF)
838 \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
839 CJKExtB}) (42_720: U+20000..2A6DF)
840 \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
841 CJKExtC}) (4160: U+2A700..2B73F)
842 \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
843 CJKExtD}) (224: U+2B740..2B81F)
844 \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
845 CJKExtE}) (5776: U+2B820..2CEAF)
846 \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
847 CJKExtF}) (7488: U+2CEB0..2EBEF)
848 \p{Block: CJK_Unified_Ideographs_Extension_G} (Short: \p{Blk=
849 CJKExtG}) (4944: U+30000..3134F)
850 \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
851 Diacriticals}) (112: U+0300..036F)
852 \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
853 DiacriticalsExt}) (80: U+1AB0..1AFF)
854 \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
855 DiacriticalsForSymbols}) (48:
856 U+20D0..20FF)
857 \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
858 DiacriticalsSup}) (64: U+1DC0..1DFF)
859 \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
860 U+FE20..FE2F)
861 \p{Block: Combining_Marks_For_Symbols} \p{Block=
862 Combining_Diacritical_Marks_For_Symbols}
863 (48)
864 \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
865 IndicNumberForms}) (16: U+A830..A83F)
866 \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96)
867 \p{Block: Control_Pictures} (64: U+2400..243F)
868 \p{Block: Coptic} (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
869 U+2C80..2CFF)
870 \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
871 \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
872 \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
873 U+1D360..1D37F)
874 \p{Block: Cuneiform} (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
875 (1024: U+12000..123FF)
876 \p{Block: Cuneiform_Numbers} \p{Block=
877 Cuneiform_Numbers_And_Punctuation} (128)
878 \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
879 CuneiformNumbers}) (128: U+12400..1247F)
880 \p{Block: Currency_Symbols} (48: U+20A0..20CF)
881 \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
882 \p{Block: Cypro_Minoan} (NOT \p{Cypro_Minoan} NOR
883 \p{Is_Cypro_Minoan}) (112:
884 U+12F90..12FFF)
885 \p{Block: Cyrillic} (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
886 (256: U+0400..04FF)
887 \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
888 \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
889 \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
890 \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
891 U+2DE0..2DFF)
892 \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
893 U+A640..A69F)
894 \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
895 U+1C80..1C8F)
896 \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
897 \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
898 U+0500..052F)
899 \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
900 (48)
901 \p{Block: Deseret} (80: U+10400..1044F)
902 \p{Block: Devanagari} (NOT \p{Devanagari} NOR \p{Is_Devanagari})
903 (128: U+0900..097F)
904 \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
905 \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
906 U+A8E0..A8FF)
907 \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
908 \p{Block: Diacriticals_Ext} \p{Block=
909 Combining_Diacritical_Marks_Extended}
910 (80)
911 \p{Block: Diacriticals_For_Symbols} \p{Block=
912 Combining_Diacritical_Marks_For_Symbols}
913 (48)
914 \p{Block: Diacriticals_Sup} \p{Block=
915 Combining_Diacritical_Marks_Supplement}
916 (64)
917 \p{Block: Dingbats} (192: U+2700..27BF)
918 \p{Block: Dives_Akuru} (NOT \p{Dives_Akuru} NOR
919 \p{Is_Dives_Akuru}) (96: U+11900..1195F)
920 \p{Block: Dogra} (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
921 U+11800..1184F)
922 \p{Block: Domino} \p{Block=Domino_Tiles} (112)
923 \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
924 U+1F030..1F09F)
925 \p{Block: Duployan} (NOT \p{Duployan} NOR \p{Is_Duployan})
926 (160: U+1BC00..1BC9F)
927 \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
928 \p{Block: Egyptian_Hieroglyph_Format_Controls} (16: U+13430..1343F)
929 \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
930 \p{Is_Egyptian_Hieroglyphs}) (1072:
931 U+13000..1342F)
932 \p{Block: Elbasan} (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
933 U+10500..1052F)
934 \p{Block: Elymaic} (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
935 U+10FE0..10FFF)
936 \p{Block: Emoticons} (80: U+1F600..1F64F)
937 \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
938 \p{Block: Enclosed_Alphanum_Sup} \p{Block=
939 Enclosed_Alphanumeric_Supplement} (256)
940 \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
941 EnclosedAlphanumSup}) (256:
942 U+1F100..1F1FF)
943 \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
944 EnclosedAlphanum}) (160: U+2460..24FF)
945 \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
946 (256)
947 \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
948 EnclosedCJK}) (256: U+3200..32FF)
949 \p{Block: Enclosed_Ideographic_Sup} \p{Block=
950 Enclosed_Ideographic_Supplement} (256)
951 \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
952 EnclosedIdeographicSup}) (256:
953 U+1F200..1F2FF)
954 \p{Block: Ethiopic} (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
955 (384: U+1200..137F)
956 \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
957 \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
958 \p{Block: Ethiopic_Ext_B} \p{Block=Ethiopic_Extended_B} (32)
959 \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
960 U+2D80..2DDF)
961 \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
962 U+AB00..AB2F)
963 \p{Block: Ethiopic_Extended_B} (Short: \p{Blk=EthiopicExtB}) (32:
964 U+1E7E0..1E7FF)
965 \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
966 \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
967 U+1380..139F)
968 \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
969 \p{Punct} NOR \p{Is_Punctuation}) (112:
970 U+2000..206F)
971 \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
972 \p{Block: Geometric_Shapes_Ext} \p{Block=
973 Geometric_Shapes_Extended} (128)
974 \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
975 GeometricShapesExt}) (128:
976 U+1F780..1F7FF)
977 \p{Block: Georgian} (NOT \p{Georgian} NOR \p{Is_Georgian})
978 (96: U+10A0..10FF)
979 \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
980 \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
981 U+1C90..1CBF)
982 \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
983 \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
984 U+2D00..2D2F)
985 \p{Block: Glagolitic} (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
986 (96: U+2C00..2C5F)
987 \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
988 \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
989 (48: U+1E000..1E02F)
990 \p{Block: Gothic} (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
991 U+10330..1034F)
992 \p{Block: Grantha} (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
993 U+11300..1137F)
994 \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek}
995 NOR \p{Is_Greek}) (144)
996 \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
997 NOR \p{Is_Greek}) (144: U+0370..03FF)
998 \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256)
999 \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
1000 U+1F00..1FFF)
1001 \p{Block: Gujarati} (NOT \p{Gujarati} NOR \p{Is_Gujarati})
1002 (128: U+0A80..0AFF)
1003 \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
1004 \p{Is_Gunjala_Gondi}) (80:
1005 U+11D60..11DAF)
1006 \p{Block: Gurmukhi} (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
1007 (128: U+0A00..0A7F)
1008 \p{Block: Half_And_Full_Forms} \p{Block=
1009 Halfwidth_And_Fullwidth_Forms} (240)
1010 \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16)
1011 \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
1012 HalfAndFullForms}) (240: U+FF00..FFEF)
1013 \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
1014 NOR \p{Is_Hangul}) (11_184)
1015 \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
1016 (96: U+3130..318F)
1017 \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
1018 \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
1019 U+A960..A97F)
1020 \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
1021 U+D7B0..D7FF)
1022 \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
1023 NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
1024 \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
1025 \p{Is_Hanifi_Rohingya}) (64:
1026 U+10D00..10D3F)
1027 \p{Block: Hanunoo} (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
1028 U+1720..173F)
1029 \p{Block: Hatran} (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
1030 U+108E0..108FF)
1031 \p{Block: Hebrew} (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
1032 U+0590..05FF)
1033 \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
1034 HighPUSurrogates}) (128: U+DB80..DBFF)
1035 \p{Block: High_PU_Surrogates} \p{Block=
1036 High_Private_Use_Surrogates} (128)
1037 \p{Block: High_Surrogates} (896: U+D800..DB7F)
1038 \p{Block: Hiragana} (NOT \p{Hiragana} NOR \p{Is_Hiragana})
1039 (96: U+3040..309F)
1040 \p{Block: IDC} \p{Block=
1041 Ideographic_Description_Characters} (NOT
1042 \p{ID_Continue} NOR \p{Is_IDC}) (16)
1043 \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
1044 NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
1045 U+2FF0..2FFF)
1046 \p{Block: Ideographic_Symbols} \p{Block=
1047 Ideographic_Symbols_And_Punctuation} (32)
1048 \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
1049 IdeographicSymbols}) (32: U+16FE0..16FFF)
1050 \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
1051 \p{Is_Imperial_Aramaic}) (32:
1052 U+10840..1085F)
1053 \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1054 (16)
1055 \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
1056 \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
1057 NOR \p{Is_Inscriptional_Pahlavi}) (32:
1058 U+10B60..10B7F)
1059 \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
1060 NOR \p{Is_Inscriptional_Parthian}) (32:
1061 U+10B40..10B5F)
1062 \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
1063 \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
1064 U+0250..02AF)
1065 \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
1066 \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
1067 \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
1068 \p{Block: Javanese} (NOT \p{Javanese} NOR \p{Is_Javanese})
1069 (96: U+A980..A9DF)
1070 \p{Block: Kaithi} (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
1071 U+11080..110CF)
1072 \p{Block: Kana_Ext_A} \p{Block=Kana_Extended_A} (48)
1073 \p{Block: Kana_Ext_B} \p{Block=Kana_Extended_B} (16)
1074 \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
1075 U+1B100..1B12F)
1076 \p{Block: Kana_Extended_B} (Short: \p{Blk=KanaExtB}) (16:
1077 U+1AFF0..1AFFF)
1078 \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
1079 \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
1080 U+1B000..1B0FF)
1081 \p{Block: Kanbun} (16: U+3190..319F)
1082 \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
1083 \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
1084 U+2F00..2FDF)
1085 \p{Block: Kannada} (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
1086 U+0C80..0CFF)
1087 \p{Block: Katakana} (NOT \p{Katakana} NOR \p{Is_Katakana})
1088 (96: U+30A0..30FF)
1089 \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
1090 \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
1091 KatakanaExt}) (16: U+31F0..31FF)
1092 \p{Block: Kayah_Li} (48: U+A900..A92F)
1093 \p{Block: Kharoshthi} (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
1094 (96: U+10A00..10A5F)
1095 \p{Block: Khitan_Small_Script} (NOT \p{Khitan_Small_Script} NOR
1096 \p{Is_Khitan_Small_Script}) (512:
1097 U+18B00..18CFF)
1098 \p{Block: Khmer} (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
1099 U+1780..17FF)
1100 \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
1101 \p{Block: Khojki} (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
1102 U+11200..1124F)
1103 \p{Block: Khudawadi} (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
1104 (80: U+112B0..112FF)
1105 \p{Block: Lao} (NOT \p{Lao} NOR \p{Is_Lao}) (128:
1106 U+0E80..0EFF)
1107 \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
1108 \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
1109 \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
1110 \xff])
1111 \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
1112 \p{Block: Latin_Ext_Additional} \p{Block=
1113 Latin_Extended_Additional} (256)
1114 \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
1115 \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
1116 \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
1117 \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64)
1118 \p{Block: Latin_Ext_F} \p{Block=Latin_Extended_F} (64)
1119 \p{Block: Latin_Ext_G} \p{Block=Latin_Extended_G} (256)
1120 \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
1121 U+0100..017F)
1122 \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
1123 LatinExtAdditional}) (256: U+1E00..1EFF)
1124 \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
1125 U+0180..024F)
1126 \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
1127 U+2C60..2C7F)
1128 \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
1129 U+A720..A7FF)
1130 \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
1131 U+AB30..AB6F)
1132 \p{Block: Latin_Extended_F} (Short: \p{Blk=LatinExtF}) (64:
1133 U+10780..107BF)
1134 \p{Block: Latin_Extended_G} (Short: \p{Blk=LatinExtG}) (256:
1135 U+1DF00..1DFFF)
1136 \p{Block: Lepcha} (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
1137 U+1C00..1C4F)
1138 \p{Block: Letterlike_Symbols} (80: U+2100..214F)
1139 \p{Block: Limbu} (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
1140 U+1900..194F)
1141 \p{Block: Linear_A} (NOT \p{Linear_A} NOR \p{Is_Linear_A})
1142 (384: U+10600..1077F)
1143 \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
1144 \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
1145 \p{Block: Lisu} (NOT \p{Lisu} NOR \p{Is_Lisu}) (48:
1146 U+A4D0..A4FF)
1147 \p{Block: Lisu_Sup} \p{Block=Lisu_Supplement} (16)
1148 \p{Block: Lisu_Supplement} (Short: \p{Blk=LisuSup}) (16:
1149 U+11FB0..11FBF)
1150 \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
1151 \p{Block: Lycian} (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
1152 U+10280..1029F)
1153 \p{Block: Lydian} (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
1154 U+10920..1093F)
1155 \p{Block: Mahajani} (NOT \p{Mahajani} NOR \p{Is_Mahajani})
1156 (48: U+11150..1117F)
1157 \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
1158 \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
1159 U+1F000..1F02F)
1160 \p{Block: Makasar} (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
1161 U+11EE0..11EFF)
1162 \p{Block: Malayalam} (NOT \p{Malayalam} NOR \p{Is_Malayalam})
1163 (128: U+0D00..0D7F)
1164 \p{Block: Mandaic} (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
1165 U+0840..085F)
1166 \p{Block: Manichaean} (NOT \p{Manichaean} NOR \p{Is_Manichaean})
1167 (64: U+10AC0..10AFF)
1168 \p{Block: Marchen} (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
1169 U+11C70..11CBF)
1170 \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
1171 \p{Is_Masaram_Gondi}) (96:
1172 U+11D00..11D5F)
1173 \p{Block: Math_Alphanum} \p{Block=
1174 Mathematical_Alphanumeric_Symbols} (1024)
1175 \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
1176 \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
1177 MathAlphanum}) (1024: U+1D400..1D7FF)
1178 \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
1179 (256: U+2200..22FF)
1180 \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF)
1181 \p{Block: Medefaidrin} (NOT \p{Medefaidrin} NOR
1182 \p{Is_Medefaidrin}) (96: U+16E40..16E9F)
1183 \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
1184 \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
1185 \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
1186 \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
1187 (32: U+AAE0..AAFF)
1188 \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
1189 \p{Is_Mende_Kikakui}) (224:
1190 U+1E800..1E8DF)
1191 \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
1192 \p{Is_Meroitic_Cursive}) (96:
1193 U+109A0..109FF)
1194 \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
1195 \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160:
1196 U+16F00..16F9F)
1197 \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
1198 (256)
1199 \p{Block: Misc_Math_Symbols_A} \p{Block=
1200 Miscellaneous_Mathematical_Symbols_A}
1201 (48)
1202 \p{Block: Misc_Math_Symbols_B} \p{Block=
1203 Miscellaneous_Mathematical_Symbols_B}
1204 (128)
1205 \p{Block: Misc_Pictographs} \p{Block=
1206 Miscellaneous_Symbols_And_Pictographs}
1207 (768)
1208 \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
1209 \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
1210 \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
1211 MiscMathSymbolsA}) (48: U+27C0..27EF)
1212 \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
1213 MiscMathSymbolsB}) (128: U+2980..29FF)
1214 \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
1215 (256: U+2600..26FF)
1216 \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
1217 MiscArrows}) (256: U+2B00..2BFF)
1218 \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
1219 MiscPictographs}) (768: U+1F300..1F5FF)
1220 \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
1221 (256: U+2300..23FF)
1222 \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96:
1223 U+11600..1165F)
1224 \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
1225 \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
1226 \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian})
1227 (176: U+1800..18AF)
1228 \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
1229 \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
1230 U+11660..1167F)
1231 \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48:
1232 U+16A40..16A6F)
1233 \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48:
1234 U+11280..112AF)
1235 \p{Block: Music} \p{Block=Musical_Symbols} (256)
1236 \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
1237 U+1D100..1D1FF)
1238 \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
1239 U+1000..109F)
1240 \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
1241 \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
1242 \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
1243 U+AA60..AA7F)
1244 \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
1245 U+A9E0..A9FF)
1246 \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean})
1247 (48: U+10880..108AF)
1248 \p{Block: Nandinagari} (NOT \p{Nandinagari} NOR
1249 \p{Is_Nandinagari}) (96: U+119A0..119FF)
1250 \p{Block: NB} \p{Block=No_Block} (825_600 plus all
1251 above-Unicode code points)
1252 \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR
1253 \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
1254 \p{Block: Newa} (NOT \p{Newa} NOR \p{Is_Newa}) (128:
1255 U+11400..1147F)
1256 \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64:
1257 U+07C0..07FF)
1258 \p{Block: No_Block} (Short: \p{Blk=NB}) (825_600 plus all
1259 above-Unicode code points: U+2FE0..2FEF,
1260 U+10200..1027F, U+103E0..103FF,
1261 U+105C0..105FF, U+107C0..107FF,
1262 U+108B0..108DF ...)
1263 \p{Block: Number_Forms} (64: U+2150..218F)
1264 \p{Block: Nushu} (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
1265 U+1B170..1B2FF)
1266 \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong}
1267 NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80:
1268 U+1E100..1E14F)
1269 \p{Block: OCR} \p{Block=Optical_Character_Recognition}
1270 (32)
1271 \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
1272 U+1680..169F)
1273 \p{Block: Ol_Chiki} (48: U+1C50..1C7F)
1274 \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
1275 \p{Is_Old_Hungarian}) (128:
1276 U+10C80..10CFF)
1277 \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
1278 (48: U+10300..1032F)
1279 \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
1280 \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
1281 (48: U+10350..1037F)
1282 \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR
1283 \p{Is_Old_Persian}) (64: U+103A0..103DF)
1284 \p{Block: Old_Sogdian} (NOT \p{Old_Sogdian} NOR
1285 \p{Is_Old_Sogdian}) (48: U+10F00..10F2F)
1286 \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
1287 \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
1288 (80: U+10C00..10C4F)
1289 \p{Block: Old_Uyghur} (NOT \p{Old_Uyghur} NOR \p{Is_Old_Uyghur})
1290 (64: U+10F70..10FAF)
1291 \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
1292 U+2440..245F)
1293 \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
1294 U+0B00..0B7F)
1295 \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
1296 \p{Block: Osage} (NOT \p{Osage} NOR \p{Is_Osage}) (80:
1297 U+104B0..104FF)
1298 \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
1299 U+10480..104AF)
1300 \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F)
1301 \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
1302 \p{Is_Pahawh_Hmong}) (144:
1303 U+16B00..16B8F)
1304 \p{Block: Palmyrene} (32: U+10860..1087F)
1305 \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} NOR
1306 \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
1307 \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
1308 (64: U+A840..A87F)
1309 \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
1310 \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
1311 U+101D0..101FF)
1312 \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician})
1313 (32: U+10900..1091F)
1314 \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
1315 \p{Block: Phonetic_Ext_Sup} \p{Block=
1316 Phonetic_Extensions_Supplement} (64)
1317 \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
1318 U+1D00..1D7F)
1319 \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
1320 PhoneticExtSup}) (64: U+1D80..1DBF)
1321 \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
1322 \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
1323 \p{Private_Use} NOR \p{Is_Private_Use})
1324 (6400)
1325 \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
1326 \p{Private_Use} NOR \p{Is_Private_Use})
1327 (6400: U+E000..F8FF)
1328 \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
1329 \p{Is_Psalter_Pahlavi}) (48:
1330 U+10B80..10BAF)
1331 \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
1332 \p{Private_Use} NOR \p{Is_Private_Use})
1333 (6400)
1334 \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
1335 \p{Punct} NOR \p{Is_Punctuation}) (112)
1336 \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
1337 U+A930..A95F)
1338 \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
1339 \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
1340 U+10E60..10E7F)
1341 \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96:
1342 U+16A0..16FF)
1343 \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan})
1344 (64: U+0800..083F)
1345 \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
1346 (96: U+A880..A8DF)
1347 \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
1348 U+11180..111DF)
1349 \p{Block: Shavian} (48: U+10450..1047F)
1350 \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
1351 \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
1352 U+11580..115FF)
1353 \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
1354 U+0D80..0DFF)
1355 \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
1356 \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
1357 U+FE50..FE6F)
1358 \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
1359 \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64)
1360 \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64:
1361 U+1B130..1B16F)
1362 \p{Block: Sogdian} (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64:
1363 U+10F30..10F6F)
1364 \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
1365 \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
1366 \p{Block: Soyombo} (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
1367 U+11A50..11AAF)
1368 \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
1369 ModifierLetters}) (80: U+02B0..02FF)
1370 \p{Block: Specials} (16: U+FFF0..FFFF)
1371 \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese})
1372 (64: U+1B80..1BBF)
1373 \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
1374 \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
1375 U+1CC0..1CCF)
1376 \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
1377 \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
1378 \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
1379 \p{Block: Sup_Math_Operators} \p{Block=
1380 Supplemental_Mathematical_Operators}
1381 (256)
1382 \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
1383 (65_536)
1384 \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
1385 (65_536)
1386 \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
1387 \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
1388 Supplemental_Symbols_And_Pictographs}
1389 (256)
1390 \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
1391 \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
1392 SuperAndSub}) (48: U+2070..209F)
1393 \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
1394 U+27F0..27FF)
1395 \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
1396 U+2900..297F)
1397 \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
1398 U+1F800..1F8FF)
1399 \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
1400 SupMathOperators}) (256: U+2A00..2AFF)
1401 \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
1402 SupPunctuation}) (128: U+2E00..2E7F)
1403 \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
1404 SupSymbolsAndPictographs}) (256:
1405 U+1F900..1F9FF)
1406 \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
1407 SupPUAA}) (65_536: U+F0000..FFFFF)
1408 \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
1409 SupPUAB}) (65_536: U+100000..10FFFF)
1410 \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
1411 \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
1412 \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
1413 \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block=
1414 Symbols_And_Pictographs_Extended_A} (144)
1415 \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk=
1416 SymbolsAndPictographsExtA}) (144:
1417 U+1FA70..1FAFF)
1418 \p{Block: Symbols_For_Legacy_Computing} (256: U+1FB00..1FBFF)
1419 \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
1420 U+0700..074F)
1421 \p{Block: Syriac_Sup} \p{Block=Syriac_Supplement} (16)
1422 \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
1423 U+0860..086F)
1424 \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
1425 U+1700..171F)
1426 \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
1427 (32: U+1760..177F)
1428 \p{Block: Tags} (128: U+E0000..E007F)
1429 \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
1430 U+1950..197F)
1431 \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
1432 (144: U+1A20..1AAF)
1433 \p{Block: Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
1434 (96: U+AA80..AADF)
1435 \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1436 \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
1437 U+1D300..1D35F)
1438 \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80:
1439 U+11680..116CF)
1440 \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
1441 U+0B80..0BFF)
1442 \p{Block: Tamil_Sup} \p{Block=Tamil_Supplement} (64)
1443 \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
1444 U+11FC0..11FFF)
1445 \p{Block: Tangsa} (NOT \p{Tangsa} NOR \p{Is_Tangsa}) (96:
1446 U+16A70..16ACF)
1447 \p{Block: Tangut} (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
1448 U+17000..187FF)
1449 \p{Block: Tangut_Components} (768: U+18800..18AFF)
1450 \p{Block: Tangut_Sup} \p{Block=Tangut_Supplement} (128)
1451 \p{Block: Tangut_Supplement} (Short: \p{Blk=TangutSup}) (128:
1452 U+18D00..18D7F)
1453 \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
1454 U+0C00..0C7F)
1455 \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
1456 U+0780..07BF)
1457 \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128:
1458 U+0E00..0E7F)
1459 \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
1460 U+0F00..0FFF)
1461 \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
1462 (80: U+2D30..2D7F)
1463 \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
1464 U+11480..114DF)
1465 \p{Block: Toto} (NOT \p{Toto} NOR \p{Is_Toto}) (48:
1466 U+1E290..1E2BF)
1467 \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1468 (128)
1469 \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1470 TransportAndMap}) (128: U+1F680..1F6FF)
1471 \p{Block: UCAS} \p{Block=
1472 Unified_Canadian_Aboriginal_Syllabics}
1473 (640)
1474 \p{Block: UCAS_Ext} \p{Block=
1475 Unified_Canadian_Aboriginal_Syllabics_-
1476 Extended} (80)
1477 \p{Block: UCAS_Ext_A} \p{Block=
1478 Unified_Canadian_Aboriginal_Syllabics_-
1479 Extended_A} (16)
1480 \p{Block: Ugaritic} (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
1481 (32: U+10380..1039F)
1482 \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1483 UCAS}) (640: U+1400..167F)
1484 \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1485 \p{Blk=UCASExt}) (80: U+18B0..18FF)
1486 \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended_A}
1487 (Short: \p{Blk=UCASExtA}) (16:
1488 U+11AB0..11ABF)
1489 \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320:
1490 U+A500..A63F)
1491 \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1492 \p{Variation_Selector} NOR \p{Is_VS})
1493 (16: U+FE00..FE0F)
1494 \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1495 (240: U+E0100..E01EF)
1496 \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
1497 \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
1498 U+1CD0..1CFF)
1499 \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
1500 \p{Block: Vithkuqi} (NOT \p{Vithkuqi} NOR \p{Is_Vithkuqi})
1501 (80: U+10570..105BF)
1502 \p{Block: VS} \p{Block=Variation_Selectors} (NOT
1503 \p{Variation_Selector} NOR \p{Is_VS})
1504 (16)
1505 \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
1506 (240)
1507 \p{Block: Wancho} (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
1508 U+1E2C0..1E2FF)
1509 \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR
1510 \p{Is_Warang_Citi}) (96: U+118A0..118FF)
1511 \p{Block: Yezidi} (NOT \p{Yezidi} NOR \p{Is_Yezidi}) (64:
1512 U+10E80..10EBF)
1513 \p{Block: Yi_Radicals} (64: U+A490..A4CF)
1514 \p{Block: Yi_Syllables} (1168: U+A000..A48F)
1515 \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
1516 \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
1517 U+4DC0..4DFF)
1518 \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
1519 \p{Is_Zanabazar_Square}) (80:
1520 U+11A00..11A4F)
1521 \p{Block: Znamenny_Music} \p{Block=Znamenny_Musical_Notation} (208)
1522 \p{Block: Znamenny_Musical_Notation} (Short: \p{Blk=
1523 ZnamennyMusic}) (208: U+1CF00..1CFCF)
1524 X \p{Block_Elements} \p{Block=Block_Elements} (32)
1525 \p{Bopo} \p{Bopomofo} (= \p{Script_Extensions=
1526 Bopomofo}) (NOT \p{Block=Bopomofo}) (117)
1527 \p{Bopomofo} \p{Script_Extensions=Bopomofo} (Short:
1528 \p{Bopo}; NOT \p{Block=Bopomofo}) (117)
1529 X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
1530 Bopomofo_Extended}) (32)
1531 X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
1532 \p{InBopomofoExt}) (32)
1533 X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
1534 \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
1535 \p{Brah} \p{Brahmi} (= \p{Script_Extensions=
1536 Brahmi}) (NOT \p{Block=Brahmi}) (115)
1537 \p{Brahmi} \p{Script_Extensions=Brahmi} (Short:
1538 \p{Brah}; NOT \p{Block=Brahmi}) (115)
1539 \p{Brai} \p{Braille} (= \p{Script_Extensions=
1540 Braille}) (256)
1541 \p{Braille} \p{Script_Extensions=Braille} (Short:
1542 \p{Brai}) (256)
1543 X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
1544 \p{InBraille}) (256)
1545 \p{Bugi} \p{Buginese} (= \p{Script_Extensions=
1546 Buginese}) (NOT \p{Block=Buginese}) (31)
1547 \p{Buginese} \p{Script_Extensions=Buginese} (Short:
1548 \p{Bugi}; NOT \p{Block=Buginese}) (31)
1549 \p{Buhd} \p{Buhid} (= \p{Script_Extensions=Buhid})
1550 (NOT \p{Block=Buhid}) (22)
1551 \p{Buhid} \p{Script_Extensions=Buhid} (Short:
1552 \p{Buhd}; NOT \p{Block=Buhid}) (22)
1553 X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
1554 Byzantine_Musical_Symbols}) (256)
1555 X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1556 (Short: \p{InByzantineMusic}) (256)
1557 \p{C} \pC \p{Other} (= \p{General_Category=Other})
1558 (969_578 plus all above-Unicode code
1559 points)
1560 \p{Cakm} \p{Chakma} (= \p{Script_Extensions=
1561 Chakma}) (NOT \p{Block=Chakma}) (91)
1562 \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1563 (Short: \p{Cans}) (726)
1564 X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
1565 (= \p{Block=
1566 Unified_Canadian_Aboriginal_Syllabics})
1567 (640)
1568 T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1569 Not_Reordered} (1_113_200 plus all
1570 above-Unicode code points)
1571 T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1572 Overlay} (32)
1573 T \p{Canonical_Combining_Class: 6} \p{Canonical_Combining_Class=
1574 Han_Reading} (2)
1575 T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1576 Nukta} (27)
1577 T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1578 Kana_Voicing} (2)
1579 T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1580 Virama} (63)
1581 T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1582 CCC10} (1)
1583 \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
1584 U+05B0)
1585 T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1586 CCC11} (1)
1587 \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
1588 U+05B1)
1589 T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1590 CCC12} (1)
1591 \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
1592 U+05B2)
1593 T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1594 CCC13} (1)
1595 \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
1596 U+05B3)
1597 T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1598 CCC14} (1)
1599 \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
1600 U+05B4)
1601 T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1602 CCC15} (1)
1603 \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
1604 U+05B5)
1605 T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1606 CCC16} (1)
1607 \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
1608 U+05B6)
1609 T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1610 CCC17} (1)
1611 \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
1612 U+05B7)
1613 T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1614 CCC18} (2)
1615 \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
1616 U+05B8, U+05C7)
1617 T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1618 CCC19} (2)
1619 \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
1620 U+05B9..05BA)
1621 T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1622 CCC20} (1)
1623 \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
1624 U+05BB)
1625 T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1626 CCC21} (1)
1627 \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
1628 U+05BC)
1629 T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1630 CCC22} (1)
1631 \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
1632 U+05BD)
1633 T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1634 CCC23} (1)
1635 \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
1636 U+05BF)
1637 T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1638 CCC24} (1)
1639 \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
1640 U+05C1)
1641 T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1642 CCC25} (1)
1643 \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
1644 U+05C2)
1645 T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1646 CCC26} (1)
1647 \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
1648 U+FB1E)
1649 T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1650 CCC27} (2)
1651 \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
1652 U+064B, U+08F0)
1653 T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1654 CCC28} (2)
1655 \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
1656 U+064C, U+08F1)
1657 T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1658 CCC29} (2)
1659 \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
1660 U+064D, U+08F2)
1661 T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1662 CCC30} (2)
1663 \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
1664 U+0618, U+064E)
1665 T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1666 CCC31} (2)
1667 \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
1668 U+0619, U+064F)
1669 T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1670 CCC32} (2)
1671 \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
1672 U+061A, U+0650)
1673 T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1674 CCC33} (1)
1675 \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
1676 U+0651)
1677 T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1678 CCC34} (1)
1679 \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
1680 U+0652)
1681 T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1682 CCC35} (1)
1683 \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
1684 U+0670)
1685 T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1686 CCC36} (1)
1687 \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
1688 U+0711)
1689 T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1690 CCC84} (1)
1691 \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
1692 U+0C55)
1693 T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1694 CCC91} (1)
1695 \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
1696 U+0C56)
1697 T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1698 CCC103} (2)
1699 \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
1700 U+0E38..0E39)
1701 T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1702 CCC107} (4)
1703 \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
1704 U+0E48..0E4B)
1705 T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1706 CCC118} (2)
1707 \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
1708 U+0EB8..0EB9)
1709 T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1710 CCC122} (4)
1711 \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
1712 U+0EC8..0ECB)
1713 T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1714 CCC129} (1)
1715 \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
1716 U+0F71)
1717 T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1718 CCC130} (6)
1719 \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
1720 U+0F72, U+0F7A..0F7D, U+0F80)
1721 T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1722 CCC132} (1)
1723 \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
1724 U+0F74)
1725 T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1726 CCC133} (0)
1727 \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1728 T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1729 Attached_Below_Left} (0)
1730 T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1731 Attached_Below} (5)
1732 T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1733 Attached_Above} (1)
1734 T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1735 Attached_Above_Right} (9)
1736 T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1737 Below_Left} (2)
1738 T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1739 Below} (177)
1740 T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1741 Below_Right} (4)
1742 T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1743 Left} (2)
1744 T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1745 Right} (1)
1746 T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1747 Above_Left} (5)
1748 T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1749 Above} (508)
1750 T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1751 Above_Right} (5)
1752 T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1753 Double_Below} (4)
1754 T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1755 Double_Above} (5)
1756 T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1757 Iota_Subscript} (1)
1758 \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1759 Above} (508)
1760 \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (508:
1761 U+0300..0314, U+033D..0344, U+0346,
1762 U+034A..034C, U+0350..0352, U+0357 ...)
1763 \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
1764 U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
1765 \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (5:
1766 U+0315, U+031A, U+0358, U+1DF6, U+302C)
1767 \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1768 Above_Left} (5)
1769 \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1770 Above_Right} (5)
1771 \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1772 Attached_Above} (1)
1773 \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1774 Attached_Above_Right} (9)
1775 \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1776 Attached_Below} (5)
1777 \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1778 Attached_Below_Left} (0)
1779 \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1780 (1: U+1DCE)
1781 \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1782 \p{Ccc=ATAR}) (9: U+031B, U+0F39,
1783 U+1D165..1D166, U+1D16E..1D172)
1784 \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1785 (5: U+0321..0322, U+0327..0328, U+1DD0)
1786 \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1787 ATBL}) (0)
1788 \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1789 Below} (177)
1790 \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (177:
1791 U+0316..0319, U+031C..0320,
1792 U+0323..0326, U+0329..0333,
1793 U+0339..033C, U+0347..0349 ...)
1794 \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (2:
1795 U+1DFA, U+302A)
1796 \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
1797 U+059A, U+05AD, U+1939, U+302D)
1798 \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1799 Below_Left} (2)
1800 \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1801 Below_Right} (4)
1802 \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1803 Double_Above} (5)
1804 \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1805 Double_Below} (4)
1806 \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
1807 (5: U+035D..035E, U+0360..0361, U+1DCD)
1808 \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
1809 (4: U+035C, U+035F, U+0362, U+1DFC)
1810 \p{Canonical_Combining_Class: Han_Reading} (Short: \p{Ccc=HANR})
1811 (2: U+16FF0..16FF1)
1812 \p{Canonical_Combining_Class: HANR} \p{Canonical_Combining_Class=
1813 Han_Reading} (2)
1814 \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1815 (1: U+0345)
1816 \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1817 Iota_Subscript} (1)
1818 \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
1819 (2: U+3099..309A)
1820 \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1821 Kana_Voicing} (2)
1822 \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1823 Left} (2)
1824 \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
1825 U+302E..302F)
1826 \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1827 Nukta} (27)
1828 \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1829 (1_113_200 plus all above-Unicode code
1830 points: U+0000..02FF, U+034F,
1831 U+0370..0482, U+0488..0590, U+05BE,
1832 U+05C0 ...)
1833 \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1834 Not_Reordered} (1_113_200 plus all
1835 above-Unicode code points)
1836 \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (27:
1837 U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
1838 U+0C3C ...)
1839 \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1840 Overlay} (32)
1841 \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
1842 U+0334..0338, U+1CD4, U+1CE2..1CE8,
1843 U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
1844 ...)
1845 \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1846 Right} (1)
1847 \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
1848 U+1D16D)
1849 \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (63:
1850 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
1851 U+0BCD ...)
1852 \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1853 Virama} (63)
1854 \p{Cans} \p{Canadian_Aboriginal} (=
1855 \p{Script_Extensions=
1856 Canadian_Aboriginal}) (726)
1857 \p{Cari} \p{Carian} (= \p{Script_Extensions=
1858 Carian}) (NOT \p{Block=Carian}) (49)
1859 \p{Carian} \p{Script_Extensions=Carian} (Short:
1860 \p{Cari}; NOT \p{Block=Carian}) (49)
1861 \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2602)
1862 \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_510 plus
1863 all above-Unicode code points: [\x00-
1864 \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-Z
1865 \[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-\xac\xae
1866 \xb0-\xb3\xb5-\xb6\xb9-\xff],
1867 U+0100..02AF, U+0370..0373,
1868 U+0376..0379, U+037B..0383, U+0386 ...)
1869 \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2602: [\'.:\^`
1870 \xa8\xad\xaf\xb4\xb7-\xb8],
1871 U+02B0..036F, U+0374..0375, U+037A,
1872 U+0384..0385, U+0387 ...)
1873 \p{Cased} \p{Cased=Y} (4453)
1874 \p{Cased: N*} (Single: \P{Cased}) (1_109_659 plus all
1875 above-Unicode code points: [\x00-\x20!
1876 \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
1877 \^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9
1878 \xbb-\xbf\xd7\xf7], U+01BB,
1879 U+01C0..01C3, U+0294, U+02B9..02BF,
1880 U+02C2..02DF ...)
1881 \p{Cased: Y*} (Single: \p{Cased}) (4453: [A-Za-z\xaa
1882 \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
1883 U+0100..01BA, U+01BC..01BF,
1884 U+01C4..0293, U+0295..02B8, U+02C0..02C1
1885 ...)
1886 \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
1887 \p{LC}) (4089)
1888 \p{Category: *} \p{General_Category: *}
1889 \p{Caucasian_Albanian} \p{Script_Extensions=Caucasian_Albanian}
1890 (Short: \p{Aghb}; NOT \p{Block=
1891 Caucasian_Albanian}) (53)
1892 \p{Cc} \p{XPosixCntrl} (= \p{General_Category=
1893 Control}) (65)
1894 \p{Ccc: *} \p{Canonical_Combining_Class: *}
1895 \p{CE} \p{Composition_Exclusion} (=
1896 \p{Composition_Exclusion=Y}) (81)
1897 \p{CE: *} \p{Composition_Exclusion: *}
1898 \p{Cf} \p{Format} (= \p{General_Category=Format})
1899 (163)
1900 \p{Chakma} \p{Script_Extensions=Chakma} (Short:
1901 \p{Cakm}; NOT \p{Block=Chakma}) (91)
1902 \p{Cham} \p{Script_Extensions=Cham} (NOT \p{Block=
1903 Cham}) (83)
1904 \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
1905 \p{CWCF}) (1506)
1906 \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
1907 (1_112_606 plus all above-Unicode code
1908 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1909 \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1910 \xb4\xb6-\xbf\xd7\xe0-\xff], U+0101,
1911 U+0103, U+0105, U+0107, U+0109 ...)
1912 \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
1913 (1506: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
1914 U+0100, U+0102, U+0104, U+0106, U+0108
1915 ...)
1916 \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
1917 \p{CWCM}) (2927)
1918 \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
1919 (1_111_185 plus all above-Unicode code
1920 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1921 \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xb4
1922 \xb6-\xbf\xd7\xf7], U+0138, U+018D,
1923 U+019B, U+01AA..01AB, U+01BA..01BB ...)
1924 \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
1925 (2927: [A-Za-z\xb5\xc0-\xd6\xd8-\xf6
1926 \xf8-\xff], U+0100..0137, U+0139..018C,
1927 U+018E..019A, U+019C..01A9, U+01AC..01B9
1928 ...)
1929 \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
1930 \p{CWL}) (1433)
1931 \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
1932 (1_112_679 plus all above-Unicode code
1933 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1934 \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1935 \xbf\xd7\xdf-\xff], U+0101, U+0103,
1936 U+0105, U+0107, U+0109 ...)
1937 \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1433:
1938 [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
1939 U+0104, U+0106, U+0108 ...)
1940 \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
1941 Y} (Short: \p{CWKCF}) (10_429)
1942 \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
1943 \P{CWKCF}) (1_103_683 plus all above-
1944 Unicode code points: [\x00-\x20!\"#\$
1945 \%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-
1946 z\{\|\}~\x7f-\x9f\xa1-\xa7\xa9\xab-\xac
1947 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xd7\xe0-
1948 \xff], U+0101, U+0103, U+0105, U+0107,
1949 U+0109 ...)
1950 \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
1951 \p{CWKCF}) (10_429: [A-Z\xa0\xa8\xaa
1952 \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
1953 \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
1954 U+0106, U+0108 ...)
1955 \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
1956 \p{CWT}) (1452)
1957 \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
1958 (1_112_660 plus all above-Unicode code
1959 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1960 \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
1961 \xb4\xb6-\xde\xf7], U+0100, U+0102,
1962 U+0104, U+0106, U+0108 ...)
1963 \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1452:
1964 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1965 U+0103, U+0105, U+0107, U+0109 ...)
1966 \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
1967 \p{CWU}) (1525)
1968 \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
1969 (1_112_587 plus all above-Unicode code
1970 points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1971 \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
1972 \xb4\xb6-\xde\xf7], U+0100, U+0102,
1973 U+0104, U+0106, U+0108 ...)
1974 \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1525:
1975 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1976 U+0103, U+0105, U+0107, U+0109 ...)
1977 \p{Cher} \p{Cherokee} (= \p{Script_Extensions=
1978 Cherokee}) (NOT \p{Block=Cherokee}) (172)
1979 \p{Cherokee} \p{Script_Extensions=Cherokee} (Short:
1980 \p{Cher}; NOT \p{Block=Cherokee}) (172)
1981 X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block=
1982 Cherokee_Supplement}) (80)
1983 X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
1984 \p{InCherokeeSup}) (80)
1985 X \p{Chess_Symbols} \p{Block=Chess_Symbols} (112)
1986 \p{Chorasmian} \p{Script_Extensions=Chorasmian} (Short:
1987 \p{Chrs}; NOT \p{Block=Chorasmian}) (28)
1988 \p{Chrs} \p{Chorasmian} (= \p{Script_Extensions=
1989 Chorasmian}) (NOT \p{Block=Chorasmian})
1990 (28)
1991 \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
1992 Y}) (2602)
1993 \p{CI: *} \p{Case_Ignorable: *}
1994 X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
1995 CJK_Unified_Ideographs}) (20_992)
1996 X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
1997 CJK_Compatibility}) (256)
1998 X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
1999 CJK_Compatibility_Forms}) (32)
2000 X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
2001 \p{Block=CJK_Compatibility_Ideographs})
2002 (512)
2003 X \p{CJK_Compat_Ideographs_Sup}
2004 \p{CJK_Compatibility_Ideographs_-
2005 Supplement} (= \p{Block=
2006 CJK_Compatibility_Ideographs_-
2007 Supplement}) (544)
2008 X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
2009 \p{InCJKCompat}) (256)
2010 X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
2011 (Short: \p{InCJKCompatForms}) (32)
2012 X \p{CJK_Compatibility_Ideographs} \p{Block=
2013 CJK_Compatibility_Ideographs} (Short:
2014 \p{InCJKCompatIdeographs}) (512)
2015 X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
2016 CJK_Compatibility_Ideographs_Supplement}
2017 (Short: \p{InCJKCompatIdeographsSup})
2018 (544)
2019 X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
2020 \p{Block=
2021 CJK_Unified_Ideographs_Extension_A})
2022 (6592)
2023 X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
2024 \p{Block=
2025 CJK_Unified_Ideographs_Extension_B})
2026 (42_720)
2027 X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
2028 \p{Block=
2029 CJK_Unified_Ideographs_Extension_C})
2030 (4160)
2031 X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
2032 \p{Block=
2033 CJK_Unified_Ideographs_Extension_D})
2034 (224)
2035 X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (=
2036 \p{Block=
2037 CJK_Unified_Ideographs_Extension_E})
2038 (5776)
2039 X \p{CJK_Ext_F} \p{CJK_Unified_Ideographs_Extension_F} (=
2040 \p{Block=
2041 CJK_Unified_Ideographs_Extension_F})
2042 (7488)
2043 X \p{CJK_Ext_G} \p{CJK_Unified_Ideographs_Extension_G} (=
2044 \p{Block=
2045 CJK_Unified_Ideographs_Extension_G})
2046 (4944)
2047 X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
2048 CJK_Radicals_Supplement}) (128)
2049 X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
2050 (Short: \p{InCJKRadicalsSup}) (128)
2051 X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
2052 X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
2053 \p{Block=CJK_Symbols_And_Punctuation})
2054 (64)
2055 X \p{CJK_Symbols_And_Punctuation} \p{Block=
2056 CJK_Symbols_And_Punctuation} (Short:
2057 \p{InCJKSymbols}) (64)
2058 X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
2059 (Short: \p{InCJK}) (20_992)
2060 X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
2061 CJK_Unified_Ideographs_Extension_A}
2062 (Short: \p{InCJKExtA}) (6592)
2063 X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
2064 CJK_Unified_Ideographs_Extension_B}
2065 (Short: \p{InCJKExtB}) (42_720)
2066 X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
2067 CJK_Unified_Ideographs_Extension_C}
2068 (Short: \p{InCJKExtC}) (4160)
2069 X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
2070 CJK_Unified_Ideographs_Extension_D}
2071 (Short: \p{InCJKExtD}) (224)
2072 X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
2073 CJK_Unified_Ideographs_Extension_E}
2074 (Short: \p{InCJKExtE}) (5776)
2075 X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
2076 CJK_Unified_Ideographs_Extension_F}
2077 (Short: \p{InCJKExtF}) (7488)
2078 X \p{CJK_Unified_Ideographs_Extension_G} \p{Block=
2079 CJK_Unified_Ideographs_Extension_G}
2080 (Short: \p{InCJKExtG}) (4944)
2081 \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
2082 (Short: \p{Pe}) (77)
2083 \p{Cn} \p{Unassigned} (= \p{General_Category=
2084 Unassigned}) (829_834 plus all above-
2085 Unicode code points)
2086 \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
2087 Control}) (65)
2088 \p{Co} \p{Private_Use} (= \p{General_Category=
2089 Private_Use}) (NOT \p{Private_Use_Area})
2090 (137_468)
2091 X \p{Combining_Diacritical_Marks} \p{Block=
2092 Combining_Diacritical_Marks} (Short:
2093 \p{InDiacriticals}) (112)
2094 X \p{Combining_Diacritical_Marks_Extended} \p{Block=
2095 Combining_Diacritical_Marks_Extended}
2096 (Short: \p{InDiacriticalsExt}) (80)
2097 X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
2098 Combining_Diacritical_Marks_For_Symbols}
2099 (Short: \p{InDiacriticalsForSymbols})
2100 (48)
2101 X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
2102 Combining_Diacritical_Marks_Supplement}
2103 (Short: \p{InDiacriticalsSup}) (64)
2104 X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
2105 \p{InHalfMarks}) (16)
2106 \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
2107 (2408)
2108 X \p{Combining_Marks_For_Symbols}
2109 \p{Combining_Diacritical_Marks_For_-
2110 Symbols} (= \p{Block=
2111 Combining_Diacritical_Marks_For_-
2112 Symbols}) (48)
2113 \p{Common} \p{Script_Extensions=Common} (Short:
2114 \p{Zyyy}) (7824)
2115 X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
2116 (Short: \p{InIndicNumberForms}) (16)
2117 \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
2118 \p{Full_Composition_Exclusion=Y}) (1120)
2119 \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
2120 X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
2121 Hangul_Compatibility_Jamo}) (96)
2122 \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
2123 \p{CE}) (81)
2124 \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
2125 plus all above-Unicode code points:
2126 U+0000..0957, U+0960..09DB, U+09DE,
2127 U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
2128 ...)
2129 \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
2130 U+0958..095F, U+09DC..09DD, U+09DF,
2131 U+0A33, U+0A36, U+0A59..0A5B ...)
2132 \p{Connector_Punctuation} \p{General_Category=
2133 Connector_Punctuation} (Short: \p{Pc})
2134 (10)
2135 \p{Control} \p{XPosixCntrl} (= \p{General_Category=
2136 Control}) (65)
2137 X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
2138 \p{Copt} \p{Coptic} (= \p{Script_Extensions=
2139 Coptic}) (NOT \p{Block=Coptic}) (165)
2140 \p{Coptic} \p{Script_Extensions=Coptic} (Short:
2141 \p{Copt}; NOT \p{Block=Coptic}) (165)
2142 X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
2143 X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
2144 Counting_Rod_Numerals}) (32)
2145 X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
2146 \p{InCountingRod}) (32)
2147 \p{Cpmn} \p{Cypro_Minoan} (= \p{Script_Extensions=
2148 Cypro_Minoan}) (NOT \p{Block=
2149 Cypro_Minoan}) (101)
2150 \p{Cprt} \p{Cypriot} (= \p{Script_Extensions=
2151 Cypriot}) (112)
2152 \p{Cs} \p{Surrogate} (= \p{General_Category=
2153 Surrogate}) (2048)
2154 \p{Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
2155 \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
2156 X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
2157 \p{Block=
2158 Cuneiform_Numbers_And_Punctuation}) (128)
2159 X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
2160 Cuneiform_Numbers_And_Punctuation}
2161 (Short: \p{InCuneiformNumbers}) (128)
2162 \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
2163 (Short: \p{Sc}) (63)
2164 X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
2165 \p{CWCF} \p{Changes_When_Casefolded} (=
2166 \p{Changes_When_Casefolded=Y}) (1506)
2167 \p{CWCF: *} \p{Changes_When_Casefolded: *}
2168 \p{CWCM} \p{Changes_When_Casemapped} (=
2169 \p{Changes_When_Casemapped=Y}) (2927)
2170 \p{CWCM: *} \p{Changes_When_Casemapped: *}
2171 \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
2172 \p{Changes_When_NFKC_Casefolded=Y})
2173 (10_429)
2174 \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
2175 \p{CWL} \p{Changes_When_Lowercased} (=
2176 \p{Changes_When_Lowercased=Y}) (1433)
2177 \p{CWL: *} \p{Changes_When_Lowercased: *}
2178 \p{CWT} \p{Changes_When_Titlecased} (=
2179 \p{Changes_When_Titlecased=Y}) (1452)
2180 \p{CWT: *} \p{Changes_When_Titlecased: *}
2181 \p{CWU} \p{Changes_When_Uppercased} (=
2182 \p{Changes_When_Uppercased=Y}) (1525)
2183 \p{CWU: *} \p{Changes_When_Uppercased: *}
2184 \p{Cypriot} \p{Script_Extensions=Cypriot} (Short:
2185 \p{Cprt}) (112)
2186 X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
2187 \p{Cypro_Minoan} \p{Script_Extensions=Cypro_Minoan} (Short:
2188 \p{Cpmn}; NOT \p{Block=Cypro_Minoan})
2189 (101)
2190 \p{Cyrillic} \p{Script_Extensions=Cyrillic} (Short:
2191 \p{Cyrl}; NOT \p{Block=Cyrillic}) (447)
2192 X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
2193 Cyrillic_Extended_A}) (32)
2194 X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
2195 Cyrillic_Extended_B}) (96)
2196 X \p{Cyrillic_Ext_C} \p{Cyrillic_Extended_C} (= \p{Block=
2197 Cyrillic_Extended_C}) (16)
2198 X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
2199 \p{InCyrillicExtA}) (32)
2200 X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
2201 \p{InCyrillicExtB}) (96)
2202 X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
2203 \p{InCyrillicExtC}) (16)
2204 X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
2205 Cyrillic_Supplement}) (48)
2206 X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
2207 \p{InCyrillicSup}) (48)
2208 X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
2209 Cyrillic_Supplement}) (48)
2210 \p{Cyrl} \p{Cyrillic} (= \p{Script_Extensions=
2211 Cyrillic}) (NOT \p{Block=Cyrillic}) (447)
2212 \p{Dash} \p{Dash=Y} (30)
2213 \p{Dash: N*} (Single: \P{Dash}) (1_114_082 plus all
2214 above-Unicode code points: [\x00-\x20!
2215 \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
2216 \[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2217 U+0100..0589, U+058B..05BD,
2218 U+05BF..13FF, U+1401..1805, U+1807..200F
2219 ...)
2220 \p{Dash: Y*} (Single: \p{Dash}) (30: [\-], U+058A,
2221 U+05BE, U+1400, U+1806, U+2010..2015 ...)
2222 \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
2223 (Short: \p{Pd}) (26)
2224 \p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category=
2225 Decimal_Number}) (660)
2226 \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
2227 (13_233)
2228 \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_233:
2229 [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd
2230 \xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-\xfd
2231 \xff], U+0100..010F, U+0112..0125,
2232 U+0128..0130, U+0134..0137, U+0139..013E
2233 ...)
2234 \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
2235 U+2460..2473, U+24B6..24EA,
2236 U+3244..3247, U+3251..327E,
2237 U+3280..32BF, U+32D0..32FE ...)
2238 \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
2239 \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720: [\xa8
2240 \xaf\xb4-\xb5\xb8], U+0132..0133,
2241 U+013F..0140, U+0149, U+017F,
2242 U+01C4..01CC ...)
2243 \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
2244 \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
2245 \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
2246 U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
2247 ...)
2248 \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1194: U+2102,
2249 U+210A..2113, U+2115, U+2119..211D,
2250 U+2124, U+2128 ...)
2251 \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
2252 \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
2253 \xbe], U+2150..215F, U+2189)
2254 \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
2255 \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
2256 U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
2257 ...)
2258 \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
2259 \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
2260 U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
2261 ...)
2262 \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
2263 \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
2264 U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
2265 ...)
2266 \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
2267 \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
2268 U+FF61..FFBE, U+FFC2..FFC7,
2269 U+FFCA..FFCF, U+FFD2..FFD7,
2270 U+FFDA..FFDC, U+FFE8..FFEE)
2271 \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
2272 \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
2273 U+0F0C, U+2007, U+2011, U+202F)
2274 \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
2275 Non_Canonical} (Perl extension) (3734)
2276 \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
2277 decompositions (Short: \p{Dt=NonCanon})
2278 (Perl extension) (3734: [\xa0\xa8\xaa
2279 \xaf\xb2-\xb5\xb8-\xba\xbc-\xbe],
2280 U+0132..0133, U+013F..0140, U+0149,
2281 U+017F, U+01C4..01CC ...)
2282 \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_145 plus
2283 all above-Unicode code points: [\x00-
2284 \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
2285 \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-\xdf
2286 \xe6\xf0\xf7-\xf8\xfe], U+0110..0111,
2287 U+0126..0127, U+0131, U+0138,
2288 U+0141..0142 ...)
2289 \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
2290 U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
2291 \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
2292 \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
2293 \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
2294 U+32CC..32CF, U+32FF..3357,
2295 U+3371..33DF, U+33FF, U+1F130..1F14F ...)
2296 \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38: U+1D62..1D6A,
2297 U+2080..208E, U+2090..209C, U+2C7C)
2298 \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (213)
2299 \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (213: [\xaa\xb2-
2300 \xb3\xb9-\xba], U+02B0..02B8,
2301 U+02E0..02E4, U+10FC, U+1D2C..1D2E,
2302 U+1D30..1D3A ...)
2303 \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
2304 \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
2305 U+30FF, U+FE10..FE19, U+FE30..FE44,
2306 U+FE47..FE48)
2307 \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
2308 U+FF01..FF60, U+FFE0..FFE6)
2309 \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
2310 Y} (Short: \p{DI}) (4174)
2311 \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
2312 (1_109_938 plus all above-Unicode code
2313 points: [\x00-\xac\xae-\xff],
2314 U+0100..034E, U+0350..061B,
2315 U+061D..115E, U+1161..17B3, U+17B6..180A
2316 ...)
2317 \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
2318 (4174: [\xad], U+034F, U+061C,
2319 U+115F..1160, U+17B4..17B5, U+180B..180F
2320 ...)
2321 \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (15)
2322 \p{Dep: *} \p{Deprecated: *}
2323 \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (15)
2324 \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_097
2325 plus all above-Unicode code points:
2326 U+0000..0148, U+014A..0672,
2327 U+0674..0F76, U+0F78, U+0F7A..17A2,
2328 U+17A5..2069 ...)
2329 \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
2330 U+0673, U+0F77, U+0F79, U+17A3..17A4,
2331 U+206A..206F ...)
2332 \p{Deseret} \p{Script_Extensions=Deseret} (Short:
2333 \p{Dsrt}) (80)
2334 \p{Deva} \p{Devanagari} (= \p{Script_Extensions=
2335 Devanagari}) (NOT \p{Block=Devanagari})
2336 (210)
2337 \p{Devanagari} \p{Script_Extensions=Devanagari} (Short:
2338 \p{Deva}; NOT \p{Block=Devanagari}) (210)
2339 X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
2340 Devanagari_Extended}) (32)
2341 X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
2342 \p{InDevanagariExt}) (32)
2343 \p{DI} \p{Default_Ignorable_Code_Point} (=
2344 \p{Default_Ignorable_Code_Point=Y})
2345 (4174)
2346 \p{DI: *} \p{Default_Ignorable_Code_Point: *}
2347 \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (1064)
2348 \p{Dia: *} \p{Diacritic: *}
2349 \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (1064)
2350 \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_048
2351 plus all above-Unicode code points:
2352 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2353 >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
2354 \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
2355 U+0100..02AF, U+034F, U+0358..035C,
2356 U+0363..0373, U+0376..0379 ...)
2357 \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (1064: [\^`
2358 \xa8\xaf\xb4\xb7-\xb8], U+02B0..034E,
2359 U+0350..0357, U+035D..0362,
2360 U+0374..0375, U+037A ...)
2361 X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
2362 \p{Block=Combining_Diacritical_Marks})
2363 (112)
2364 X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended}
2365 (= \p{Block=
2366 Combining_Diacritical_Marks_Extended})
2367 (80)
2368 X \p{Diacriticals_For_Symbols}
2369 \p{Combining_Diacritical_Marks_For_-
2370 Symbols} (= \p{Block=
2371 Combining_Diacritical_Marks_For_-
2372 Symbols}) (48)
2373 X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
2374 (= \p{Block=
2375 Combining_Diacritical_Marks_Supplement})
2376 (64)
2377 \p{Diak} \p{Dives_Akuru} (= \p{Script_Extensions=
2378 Dives_Akuru}) (NOT \p{Block=
2379 Dives_Akuru}) (72)
2380 \p{Digit} \p{XPosixDigit} (= \p{General_Category=
2381 Decimal_Number}) (660)
2382 X \p{Dingbats} \p{Block=Dingbats} (192)
2383 \p{Dives_Akuru} \p{Script_Extensions=Dives_Akuru} (Short:
2384 \p{Diak}; NOT \p{Block=Dives_Akuru}) (72)
2385 \p{Dogr} \p{Dogra} (= \p{Script_Extensions=Dogra})
2386 (NOT \p{Block=Dogra}) (82)
2387 \p{Dogra} \p{Script_Extensions=Dogra} (Short:
2388 \p{Dogr}; NOT \p{Block=Dogra}) (82)
2389 X \p{Domino} \p{Domino_Tiles} (= \p{Block=
2390 Domino_Tiles}) (112)
2391 X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
2392 \p{InDomino}) (112)
2393 \p{Dsrt} \p{Deseret} (= \p{Script_Extensions=
2394 Deseret}) (80)
2395 \p{Dt: *} \p{Decomposition_Type: *}
2396 \p{Dupl} \p{Duployan} (= \p{Script_Extensions=
2397 Duployan}) (NOT \p{Block=Duployan}) (147)
2398 \p{Duployan} \p{Script_Extensions=Duployan} (Short:
2399 \p{Dupl}; NOT \p{Block=Duployan}) (147)
2400 \p{Ea: *} \p{East_Asian_Width: *}
2401 X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
2402 (208)
2403 \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
2404 \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739: [\xa1
2405 \xa4\xa7-\xa8\xaa\xad-\xae\xb0-\xb4\xb6-
2406 \xba\xbc-\xbf\xc6\xd0\xd7-\xd8\xde-\xe1
2407 \xe6\xe8-\xea\xec-\xed\xf0\xf2-\xf3\xf7-
2408 \xfa\xfc\xfe], U+0101, U+0111, U+0113,
2409 U+011B, U+0126..0127 ...)
2410 \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
2411 \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
2412 U+FF01..FF60, U+FFE0..FFE6)
2413 \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
2414 \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
2415 U+FF61..FFBE, U+FFC2..FFC7,
2416 U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
2417 ...)
2418 \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (792_645 plus
2419 all above-Unicode code points)
2420 \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
2421 \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-\x7e
2422 \xa2-\xa3\xa5-\xa6\xac\xaf],
2423 U+27E6..27ED, U+2985..2986)
2424 \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (792_645 plus all
2425 above-Unicode code points: [\x00-\x1f
2426 \x7f-\xa0\xa9\xab\xb5\xbb\xc0-\xc5\xc7-
2427 \xcf\xd1-\xd6\xd9-\xdd\xe2-\xe5\xe7\xeb
2428 \xee-\xef\xf1\xf4-\xf6\xfb\xfd\xff],
2429 U+00FF..0100, U+0102..0110, U+0112,
2430 U+0114..011A, U+011C..0125 ...)
2431 \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (182_390)
2432 \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (182_390:
2433 U+1100..115F, U+231A..231B,
2434 U+2329..232A, U+23E9..23EC, U+23F0,
2435 U+23F3 ...)
2436 \p{EBase} \p{Emoji_Modifier_Base} (=
2437 \p{Emoji_Modifier_Base=Y}) (132)
2438 \p{EBase: *} \p{Emoji_Modifier_Base: *}
2439 \p{EComp} \p{Emoji_Component} (= \p{Emoji_Component=
2440 Y}) (146)
2441 \p{EComp: *} \p{Emoji_Component: *}
2442 \p{Egyp} \p{Egyptian_Hieroglyphs} (=
2443 \p{Script_Extensions=
2444 Egyptian_Hieroglyphs}) (NOT \p{Block=
2445 Egyptian_Hieroglyphs}) (1080)
2446 X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
2447 Egyptian_Hieroglyph_Format_Controls} (16)
2448 \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
2449 Egyptian_Hieroglyphs} (Short: \p{Egyp};
2450 NOT \p{Block=Egyptian_Hieroglyphs})
2451 (1080)
2452 \p{Elba} \p{Elbasan} (= \p{Script_Extensions=
2453 Elbasan}) (NOT \p{Block=Elbasan}) (40)
2454 \p{Elbasan} \p{Script_Extensions=Elbasan} (Short:
2455 \p{Elba}; NOT \p{Block=Elbasan}) (40)
2456 \p{Elym} \p{Elymaic} (= \p{Script_Extensions=
2457 Elymaic}) (NOT \p{Block=Elymaic}) (23)
2458 \p{Elymaic} \p{Script_Extensions=Elymaic} (Short:
2459 \p{Elym}; NOT \p{Block=Elymaic}) (23)
2460 \p{EMod} \p{Emoji_Modifier} (= \p{Emoji_Modifier=
2461 Y}) (5)
2462 \p{EMod: *} \p{Emoji_Modifier: *}
2463 \p{Emoji} \p{Emoji=Y} (1404)
2464 \p{Emoji: N*} (Single: \P{Emoji}) (1_112_708 plus all
2465 above-Unicode code points: [\x00-\x20!
2466 \"\$\%&\'\(\)+,\-.\/:;<=>?\@A-Z\[\\\]
2467 \^_`a-z\{\|\}~\x7f-\xa8\xaa-\xad\xaf-
2468 \xff], U+0100..203B, U+203D..2048,
2469 U+204A..2121, U+2123..2138, U+213A..2193
2470 ...)
2471 \p{Emoji: Y*} (Single: \p{Emoji}) (1404: [#*0-9\xa9
2472 \xae], U+203C, U+2049, U+2122, U+2139,
2473 U+2194..2199 ...)
2474 \p{Emoji_Component} \p{Emoji_Component=Y} (Short: \p{EComp})
2475 (146)
2476 \p{Emoji_Component: N*} (Short: \p{EComp=N}, \P{EComp}) (1_113_966
2477 plus all above-Unicode code points:
2478 [\x00-\x20!\"\$\%&\'\(\)+,\-.\/:;<=>?
2479 \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2480 U+0100..200C, U+200E..20E2,
2481 U+20E4..FE0E, U+FE10..1F1E5,
2482 U+1F200..1F3FA ...)
2483 \p{Emoji_Component: Y*} (Short: \p{EComp=Y}, \p{EComp}) (146:
2484 [#*0-9], U+200D, U+20E3, U+FE0F,
2485 U+1F1E6..1F1FF, U+1F3FB..1F3FF ...)
2486 \p{Emoji_Modifier} \p{Emoji_Modifier=Y} (Short: \p{EMod}) (5)
2487 \p{Emoji_Modifier: N*} (Short: \p{EMod=N}, \P{EMod}) (1_114_107
2488 plus all above-Unicode code points:
2489 U+0000..1F3FA, U+1F400..infinity)
2490 \p{Emoji_Modifier: Y*} (Short: \p{EMod=Y}, \p{EMod}) (5:
2491 U+1F3FB..1F3FF)
2492 \p{Emoji_Modifier_Base} \p{Emoji_Modifier_Base=Y} (Short:
2493 \p{EBase}) (132)
2494 \p{Emoji_Modifier_Base: N*} (Short: \p{EBase=N}, \P{EBase})
2495 (1_113_980 plus all above-Unicode code
2496 points: U+0000..261C, U+261E..26F8,
2497 U+26FA..2709, U+270E..1F384,
2498 U+1F386..1F3C1, U+1F3C5..1F3C6 ...)
2499 \p{Emoji_Modifier_Base: Y*} (Short: \p{EBase=Y}, \p{EBase}) (132:
2500 U+261D, U+26F9, U+270A..270D, U+1F385,
2501 U+1F3C2..1F3C4, U+1F3C7 ...)
2502 \p{Emoji_Presentation} \p{Emoji_Presentation=Y} (Short:
2503 \p{EPres}) (1185)
2504 \p{Emoji_Presentation: N*} (Short: \p{EPres=N}, \P{EPres})
2505 (1_112_927 plus all above-Unicode code
2506 points: U+0000..2319, U+231C..23E8,
2507 U+23ED..23EF, U+23F1..23F2,
2508 U+23F4..25FC, U+25FF..2613 ...)
2509 \p{Emoji_Presentation: Y*} (Short: \p{EPres=Y}, \p{EPres}) (1185:
2510 U+231A..231B, U+23E9..23EC, U+23F0,
2511 U+23F3, U+25FD..25FE, U+2614..2615 ...)
2512 X \p{Emoticons} \p{Block=Emoticons} (80)
2513 X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
2514 Enclosed_Alphanumerics}) (160)
2515 X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
2516 \p{Block=
2517 Enclosed_Alphanumeric_Supplement}) (256)
2518 X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
2519 Enclosed_Alphanumeric_Supplement}
2520 (Short: \p{InEnclosedAlphanumSup}) (256)
2521 X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
2522 (Short: \p{InEnclosedAlphanum}) (160)
2523 X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
2524 \p{Block=
2525 Enclosed_CJK_Letters_And_Months}) (256)
2526 X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
2527 Enclosed_CJK_Letters_And_Months} (Short:
2528 \p{InEnclosedCJK}) (256)
2529 X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
2530 (= \p{Block=
2531 Enclosed_Ideographic_Supplement}) (256)
2532 X \p{Enclosed_Ideographic_Supplement} \p{Block=
2533 Enclosed_Ideographic_Supplement} (Short:
2534 \p{InEnclosedIdeographicSup}) (256)
2535 \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
2536 (Short: \p{Me}) (13)
2537 \p{EPres} \p{Emoji_Presentation} (=
2538 \p{Emoji_Presentation=Y}) (1185)
2539 \p{EPres: *} \p{Emoji_Presentation: *}
2540 \p{Ethi} \p{Ethiopic} (= \p{Script_Extensions=
2541 Ethiopic}) (NOT \p{Block=Ethiopic}) (523)
2542 \p{Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
2543 \p{Ethi}; NOT \p{Block=Ethiopic}) (523)
2544 X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
2545 Ethiopic_Extended}) (96)
2546 X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
2547 Ethiopic_Extended_A}) (48)
2548 X \p{Ethiopic_Ext_B} \p{Ethiopic_Extended_B} (= \p{Block=
2549 Ethiopic_Extended_B}) (32)
2550 X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
2551 \p{InEthiopicExt}) (96)
2552 X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
2553 \p{InEthiopicExtA}) (48)
2554 X \p{Ethiopic_Extended_B} \p{Block=Ethiopic_Extended_B} (Short:
2555 \p{InEthiopicExtB}) (32)
2556 X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
2557 Ethiopic_Supplement}) (32)
2558 X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
2559 \p{InEthiopicSup}) (32)
2560 \p{Ext} \p{Extender} (= \p{Extender=Y}) (50)
2561 \p{Ext: *} \p{Extender: *}
2562 \p{Extended_Pictographic} \p{Extended_Pictographic=Y} (Short:
2563 \p{ExtPict}) (3537)
2564 \p{Extended_Pictographic: N*} (Short: \p{ExtPict=N}, \P{ExtPict})
2565 (1_110_575 plus all above-Unicode code
2566 points: [\x00-\xa8\xaa-\xad\xaf-\xff],
2567 U+0100..203B, U+203D..2048,
2568 U+204A..2121, U+2123..2138, U+213A..2193
2569 ...)
2570 \p{Extended_Pictographic: Y*} (Short: \p{ExtPict=Y}, \p{ExtPict})
2571 (3537: [\xa9\xae], U+203C, U+2049,
2572 U+2122, U+2139, U+2194..2199 ...)
2573 \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (50)
2574 \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_062
2575 plus all above-Unicode code points:
2576 [\x00-\xb6\xb8-\xff], U+0100..02CF,
2577 U+02D2..063F, U+0641..07F9,
2578 U+07FB..0B54, U+0B56..0E45 ...)
2579 \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (50: [\xb7],
2580 U+02D0..02D1, U+0640, U+07FA, U+0B55,
2581 U+0E46 ...)
2582 \p{ExtPict} \p{Extended_Pictographic} (=
2583 \p{Extended_Pictographic=Y}) (3537)
2584 \p{ExtPict: *} \p{Extended_Pictographic: *}
2585 \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
2586 (Short: \p{Pf}) (10)
2587 \p{Format} \p{General_Category=Format} (Short:
2588 \p{Cf}) (163)
2589 \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
2590 (Short: \p{CompEx}) (1120)
2591 \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
2592 \P{CompEx}) (1_112_992 plus all above-
2593 Unicode code points: U+0000..033F,
2594 U+0342, U+0345..0373, U+0375..037D,
2595 U+037F..0386, U+0388..0957 ...)
2596 \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
2597 \p{CompEx}) (1120: U+0340..0341,
2598 U+0343..0344, U+0374, U+037E, U+0387,
2599 U+0958..095F ...)
2600 \p{Gc: *} \p{General_Category: *}
2601 \p{GCB: *} \p{Grapheme_Cluster_Break: *}
2602 \p{General_Category: C} \p{General_Category=Other} (969_578 plus
2603 all above-Unicode code points)
2604 \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
2605 \p{Gc=LC}, \p{LC}) (4089: [A-Za-z\xb5
2606 \xc0-\xd6\xd8-\xf6\xf8-\xff],
2607 U+0100..01BA, U+01BC..01BF,
2608 U+01C4..0293, U+0295..02AF, U+0370..0373
2609 ...)
2610 \p{General_Category: Cc} \p{General_Category=Control} (65)
2611 \p{General_Category: Cf} \p{General_Category=Format} (163)
2612 \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
2613 (77: [\)\]\}], U+0F3B, U+0F3D, U+169C,
2614 U+2046, U+207E ...)
2615 \p{General_Category: Cn} \p{General_Category=Unassigned} (829_834
2616 plus all above-Unicode code points)
2617 \p{General_Category: Cntrl} \p{General_Category=Control} (65)
2618 \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
2619 \p{General_Category: Combining_Mark} \p{General_Category=Mark}
2620 (2408)
2621 \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
2622 \p{Pc}) (10: [_], U+203F..2040, U+2054,
2623 U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
2624 \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
2625 [\x00-\x1f\x7f-\x9f])
2626 \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
2627 \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
2628 (63: [\$\xa2-\xa5], U+058F, U+060B,
2629 U+07FE..07FF, U+09F2..09F3, U+09FB ...)
2630 \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
2631 (26: [\-], U+058A, U+05BE, U+1400,
2632 U+1806, U+2010..2015 ...)
2633 \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
2634 (660: [0-9], U+0660..0669, U+06F0..06F9,
2635 U+07C0..07C9, U+0966..096F, U+09E6..09EF
2636 ...)
2637 \p{General_Category: Digit} \p{General_Category=Decimal_Number}
2638 (660)
2639 \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
2640 (13: U+0488..0489, U+1ABE, U+20DD..20E0,
2641 U+20E2..20E4, U+A670..A672)
2642 \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
2643 (10: [\xbb], U+2019, U+201D, U+203A,
2644 U+2E03, U+2E05 ...)
2645 \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (163:
2646 [\xad], U+0600..0605, U+061C, U+06DD,
2647 U+070F, U+0890..0891 ...)
2648 \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
2649 \p{Pi}) (12: [\xab], U+2018,
2650 U+201B..201C, U+201F, U+2039, U+2E02 ...)
2651 \p{General_Category: L} \p{General_Category=Letter} (131_756)
2652 X \p{General_Category: L&} \p{General_Category=Cased_Letter} (4089)
2653 X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
2654 the trailing '_' matters in spite of
2655 loose matching rules. (4089)
2656 \p{General_Category: LC} \p{General_Category=Cased_Letter} (4089)
2657 \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (131_756:
2658 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
2659 \xf8-\xff], U+0100..02C1, U+02C6..02D1,
2660 U+02E0..02E4, U+02EC, U+02EE ...)
2661 \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
2662 (236: U+16EE..16F0, U+2160..2182,
2663 U+2185..2188, U+3007, U+3021..3029,
2664 U+3038..303A ...)
2665 \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
2666 (1: U+2028)
2667 \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
2668 (/i= General_Category=Cased_Letter)
2669 (2227)
2670 \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (334)
2671 \p{General_Category: Lo} \p{General_Category=Other_Letter}
2672 (127_333)
2673 \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
2674 /i= General_Category=Cased_Letter)
2675 (2227: [a-z\xb5\xdf-\xf6\xf8-\xff],
2676 U+0101, U+0103, U+0105, U+0107, U+0109
2677 ...)
2678 \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
2679 (/i= General_Category=Cased_Letter) (31)
2680 \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
2681 (/i= General_Category=Cased_Letter)
2682 (1831)
2683 \p{General_Category: M} \p{General_Category=Mark} (2408)
2684 \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2408:
2685 U+0300..036F, U+0483..0489,
2686 U+0591..05BD, U+05BF, U+05C1..05C2,
2687 U+05C4..05C5 ...)
2688 \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
2689 [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
2690 U+0606..0608, U+2044, U+2052,
2691 U+207A..207C ...)
2692 \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (445)
2693 \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
2694 \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
2695 (1950)
2696 \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
2697 (334: U+02B0..02C1, U+02C6..02D1,
2698 U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
2699 \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
2700 (125: [\^`\xa8\xaf\xb4\xb8],
2701 U+02C2..02C5, U+02D2..02DF,
2702 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2703 \p{General_Category: N} \p{General_Category=Number} (1791)
2704 \p{General_Category: Nd} \p{General_Category=Decimal_Number} (660)
2705 \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
2706 \p{General_Category: No} \p{General_Category=Other_Number} (895)
2707 \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
2708 (1950: U+0300..036F, U+0483..0487,
2709 U+0591..05BD, U+05BF, U+05C1..05C2,
2710 U+05C4..05C5 ...)
2711 \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1791: [0-9
2712 \xb2-\xb3\xb9\xbc-\xbe], U+0660..0669,
2713 U+06F0..06F9, U+07C0..07C9,
2714 U+0966..096F, U+09E6..09EF ...)
2715 \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
2716 (79: [\(\[\{], U+0F3A, U+0F3C, U+169B,
2717 U+201A, U+201E ...)
2718 \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (969_578 plus
2719 all above-Unicode code points: [\x00-
2720 \x1f\x7f-\x9f\xad], U+0378..0379,
2721 U+0380..0383, U+038B, U+038D, U+03A2 ...)
2722 \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
2723 (127_333: [\xaa\xba], U+01BB,
2724 U+01C0..01C3, U+0294, U+05D0..05EA,
2725 U+05EF..05F2 ...)
2726 \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
2727 (895: [\xb2-\xb3\xb9\xbc-\xbe],
2728 U+09F4..09F9, U+0B72..0B77,
2729 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
2730 ...)
2731 \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
2732 (605: [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7
2733 \xb6-\xb7\xbf], U+037E, U+0387,
2734 U+055A..055F, U+0589, U+05C0 ...)
2735 \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
2736 (6605: [\xa6\xa9\xae\xb0], U+0482,
2737 U+058D..058E, U+060E..060F, U+06DE,
2738 U+06E9 ...)
2739 \p{General_Category: P} \p{General_Category=Punctuation} (819)
2740 \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
2741 \p{Zp}) (1: U+2029)
2742 \p{General_Category: Pc} \p{General_Category=
2743 Connector_Punctuation} (10)
2744 \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (26)
2745 \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
2746 (77)
2747 \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
2748 (10)
2749 \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
2750 (12)
2751 \p{General_Category: Po} \p{General_Category=Other_Punctuation}
2752 (605)
2753 \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
2754 (137_468: U+E000..F8FF, U+F0000..FFFFD,
2755 U+100000..10FFFD)
2756 \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (79)
2757 \p{General_Category: Punct} \p{General_Category=Punctuation} (819)
2758 \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (819:
2759 [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_\{\}
2760 \xa1\xa7\xab\xb6-\xb7\xbb\xbf], U+037E,
2761 U+0387, U+055A..055F, U+0589..058A,
2762 U+05BE ...)
2763 \p{General_Category: S} \p{General_Category=Symbol} (7741)
2764 \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (63)
2765 \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
2766 [\x20\xa0], U+1680, U+2000..200A,
2767 U+2028..2029, U+202F, U+205F ...)
2768 \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (125)
2769 \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
2770 \p{General_Category: So} \p{General_Category=Other_Symbol} (6605)
2771 \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
2772 (17: [\x20\xa0], U+1680, U+2000..200A,
2773 U+202F, U+205F, U+3000)
2774 \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
2775 (445: U+0903, U+093B, U+093E..0940,
2776 U+0949..094C, U+094E..094F, U+0982..0983
2777 ...)
2778 \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
2779 U+D800..DFFF)
2780 \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7741:
2781 [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
2782 \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
2783 U+02D2..02DF, U+02E5..02EB, U+02ED,
2784 U+02EF..02FF ...)
2785 \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
2786 /i= General_Category=Cased_Letter) (31:
2787 U+01C5, U+01C8, U+01CB, U+01F2,
2788 U+1F88..1F8F, U+1F98..1F9F ...)
2789 \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
2790 (829_834 plus all above-Unicode code
2791 points: U+0378..0379, U+0380..0383,
2792 U+038B, U+038D, U+03A2, U+0530 ...)
2793 \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
2794 /i= General_Category=Cased_Letter)
2795 (1831: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
2796 U+0102, U+0104, U+0106, U+0108 ...)
2797 \p{General_Category: Z} \p{General_Category=Separator} (19)
2798 \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
2799 \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
2800 (1)
2801 \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
2802 X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
2803 \p{InPunctuation}) (112)
2804 X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
2805 X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
2806 \p{Block=Geometric_Shapes_Extended})
2807 (128)
2808 X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
2809 (Short: \p{InGeometricShapesExt}) (128)
2810 \p{Geor} \p{Georgian} (= \p{Script_Extensions=
2811 Georgian}) (NOT \p{Block=Georgian}) (174)
2812 \p{Georgian} \p{Script_Extensions=Georgian} (Short:
2813 \p{Geor}; NOT \p{Block=Georgian}) (174)
2814 X \p{Georgian_Ext} \p{Georgian_Extended} (= \p{Block=
2815 Georgian_Extended}) (48)
2816 X \p{Georgian_Extended} \p{Block=Georgian_Extended} (Short:
2817 \p{InGeorgianExt}) (48)
2818 X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
2819 Georgian_Supplement}) (48)
2820 X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
2821 \p{InGeorgianSup}) (48)
2822 \p{Glag} \p{Glagolitic} (= \p{Script_Extensions=
2823 Glagolitic}) (NOT \p{Block=Glagolitic})
2824 (138)
2825 \p{Glagolitic} \p{Script_Extensions=Glagolitic} (Short:
2826 \p{Glag}; NOT \p{Block=Glagolitic}) (138)
2827 X \p{Glagolitic_Sup} \p{Glagolitic_Supplement} (= \p{Block=
2828 Glagolitic_Supplement}) (48)
2829 X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
2830 \p{InGlagoliticSup}) (48)
2831 \p{Gong} \p{Gunjala_Gondi} (= \p{Script_Extensions=
2832 Gunjala_Gondi}) (NOT \p{Block=
2833 Gunjala_Gondi}) (65)
2834 \p{Gonm} \p{Masaram_Gondi} (= \p{Script_Extensions=
2835 Masaram_Gondi}) (NOT \p{Block=
2836 Masaram_Gondi}) (77)
2837 \p{Goth} \p{Gothic} (= \p{Script_Extensions=
2838 Gothic}) (NOT \p{Block=Gothic}) (27)
2839 \p{Gothic} \p{Script_Extensions=Gothic} (Short:
2840 \p{Goth}; NOT \p{Block=Gothic}) (27)
2841 \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
2842 (142_539)
2843 \p{Gr_Base: *} \p{Grapheme_Base: *}
2844 \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
2845 Y}) (2090)
2846 \p{Gr_Ext: *} \p{Grapheme_Extend: *}
2847 \p{Gran} \p{Grantha} (= \p{Script_Extensions=
2848 Grantha}) (NOT \p{Block=Grantha}) (116)
2849 \p{Grantha} \p{Script_Extensions=Grantha} (Short:
2850 \p{Gran}; NOT \p{Block=Grantha}) (116)
2851 \p{Graph} \p{XPosixGraph} (282_146)
2852 \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
2853 (142_539)
2854 \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase}) (971_573
2855 plus all above-Unicode code points:
2856 [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
2857 U+0378..0379, U+0380..0383, U+038B,
2858 U+038D ...)
2859 \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase})
2860 (142_539: [\x20-\x7e\xa0-\xac\xae-\xff],
2861 U+0100..02FF, U+0370..0377,
2862 U+037A..037F, U+0384..038A, U+038C ...)
2863 \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
2864 (3886)
2865 \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3886: [^
2866 \n\r\x20-\x7e\xa0-\xac\xae-\xff],
2867 U+061C, U+180E, U+200B, U+200E..200F,
2868 U+2028..202E ...)
2869 \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
2870 \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
2871 \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
2872 \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
2873 \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
2874 (0)
2875 \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
2876 E_Base_GAZ} (0)
2877 \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
2878 E_Modifier} (0)
2879 \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
2880 (2095)
2881 \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (2095:
2882 U+0300..036F, U+0483..0489,
2883 U+0591..05BD, U+05BF, U+05C1..05C2,
2884 U+05C4..05C5 ...)
2885 \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
2886 Glue_After_Zwj} (0)
2887 \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
2888 \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
2889 U+1100..115F, U+A960..A97C)
2890 \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
2891 \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
2892 U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
2893 ...)
2894 \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
2895 U+AC01..AC1B, U+AC1D..AC37,
2896 U+AC39..AC53, U+AC55..AC6F,
2897 U+AC71..AC8B, U+AC8D..ACA7 ...)
2898 \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_159
2899 plus all above-Unicode code points:
2900 [\x20-\x7e\xa0-\xac\xae-\xff],
2901 U+0100..02FF, U+0370..0482,
2902 U+048A..0590, U+05BE, U+05C0 ...)
2903 \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
2904 (26)
2905 \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (26:
2906 U+0600..0605, U+06DD, U+070F,
2907 U+0890..0891, U+08E2, U+0D4E ...)
2908 \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
2909 (26: U+1F1E6..1F1FF)
2910 \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
2911 Regional_Indicator} (26)
2912 \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
2913 SpacingMark} (388)
2914 \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (388:
2915 U+0903, U+093B, U+093E..0940,
2916 U+0949..094C, U+094E..094F, U+0982..0983
2917 ...)
2918 \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
2919 U+11A8..11FF, U+D7CB..D7FB)
2920 \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
2921 U+1160..11A7, U+D7B0..D7C6)
2922 \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
2923 (1_096_159 plus all above-Unicode code
2924 points)
2925 \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
2926 \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
2927 (2090)
2928 \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_022
2929 plus all above-Unicode code points:
2930 U+0000..02FF, U+0370..0482,
2931 U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
2932 \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (2090:
2933 U+0300..036F, U+0483..0489,
2934 U+0591..05BD, U+05BF, U+05C1..05C2,
2935 U+05C4..05C5 ...)
2936 \p{Greek} \p{Script_Extensions=Greek} (Short:
2937 \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
2938 X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
2939 \p{InGreek}) (144)
2940 X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
2941 Greek_Extended}) (256)
2942 X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
2943 \p{InGreekExt}) (256)
2944 \p{Grek} \p{Greek} (= \p{Script_Extensions=Greek})
2945 (NOT \p{Greek_And_Coptic}) (522)
2946 \p{Gujarati} \p{Script_Extensions=Gujarati} (Short:
2947 \p{Gujr}; NOT \p{Block=Gujarati}) (105)
2948 \p{Gujr} \p{Gujarati} (= \p{Script_Extensions=
2949 Gujarati}) (NOT \p{Block=Gujarati}) (105)
2950 \p{Gunjala_Gondi} \p{Script_Extensions=Gunjala_Gondi}
2951 (Short: \p{Gong}; NOT \p{Block=
2952 Gunjala_Gondi}) (65)
2953 \p{Gurmukhi} \p{Script_Extensions=Gurmukhi} (Short:
2954 \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
2955 \p{Guru} \p{Gurmukhi} (= \p{Script_Extensions=
2956 Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
2957 X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
2958 \p{Block=Halfwidth_And_Fullwidth_Forms})
2959 (240)
2960 X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
2961 Combining_Half_Marks}) (16)
2962 X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
2963 Halfwidth_And_Fullwidth_Forms} (Short:
2964 \p{InHalfAndFullForms}) (240)
2965 \p{Han} \p{Script_Extensions=Han} (94_503)
2966 \p{Hang} \p{Hangul} (= \p{Script_Extensions=
2967 Hangul}) (NOT \p{Hangul_Syllables})
2968 (11_775)
2969 \p{Hangul} \p{Script_Extensions=Hangul} (Short:
2970 \p{Hang}; NOT \p{Hangul_Syllables})
2971 (11_775)
2972 X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
2973 (Short: \p{InCompatJamo}) (96)
2974 X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
2975 (256)
2976 X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
2977 (Short: \p{InJamoExtA}) (32)
2978 X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
2979 (Short: \p{InJamoExtB}) (80)
2980 \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
2981 (125)
2982 \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
2983 U+1100..115F, U+A960..A97C)
2984 \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
2985 (399)
2986 \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
2987 U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
2988 U+AC8C ...)
2989 \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
2990 LVT_Syllable} (10_773)
2991 \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
2992 (10_773: U+AC01..AC1B, U+AC1D..AC37,
2993 U+AC39..AC53, U+AC55..AC6F,
2994 U+AC71..AC8B, U+AC8D..ACA7 ...)
2995 \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
2996 Not_Applicable} (1_102_583 plus all
2997 above-Unicode code points)
2998 \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
2999 (1_102_583 plus all above-Unicode code
3000 points: U+0000..10FF, U+1200..A95F,
3001 U+A97D..ABFF, U+D7A4..D7AF,
3002 U+D7C7..D7CA, U+D7FC..infinity)
3003 \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
3004 (137)
3005 \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
3006 U+11A8..11FF, U+D7CB..D7FB)
3007 \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
3008 (95)
3009 \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
3010 U+1160..11A7, U+D7B0..D7C6)
3011 X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
3012 \p{InHangul}) (11_184)
3013 \p{Hani} \p{Han} (= \p{Script_Extensions=Han})
3014 (94_503)
3015 \p{Hanifi_Rohingya} \p{Script_Extensions=Hanifi_Rohingya}
3016 (Short: \p{Rohg}; NOT \p{Block=
3017 Hanifi_Rohingya}) (55)
3018 \p{Hano} \p{Hanunoo} (= \p{Script_Extensions=
3019 Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
3020 \p{Hanunoo} \p{Script_Extensions=Hanunoo} (Short:
3021 \p{Hano}; NOT \p{Block=Hanunoo}) (23)
3022 \p{Hatr} \p{Hatran} (= \p{Script_Extensions=
3023 Hatran}) (NOT \p{Block=Hatran}) (26)
3024 \p{Hatran} \p{Script_Extensions=Hatran} (Short:
3025 \p{Hatr}; NOT \p{Block=Hatran}) (26)
3026 \p{Hebr} \p{Hebrew} (= \p{Script_Extensions=
3027 Hebrew}) (NOT \p{Block=Hebrew}) (134)
3028 \p{Hebrew} \p{Script_Extensions=Hebrew} (Short:
3029 \p{Hebr}; NOT \p{Block=Hebrew}) (134)
3030 \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
3031 \p{Hex: *} \p{Hex_Digit: *}
3032 \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
3033 \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
3034 plus all above-Unicode code points:
3035 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
3036 \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
3037 U+0100..FF0F, U+FF1A..FF20,
3038 U+FF27..FF40, U+FF47..infinity)
3039 \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
3040 f], U+FF10..FF19, U+FF21..FF26,
3041 U+FF41..FF46)
3042 X \p{High_Private_Use_Surrogates} \p{Block=
3043 High_Private_Use_Surrogates} (Short:
3044 \p{InHighPUSurrogates}) (128)
3045 X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
3046 \p{Block=High_Private_Use_Surrogates})
3047 (128)
3048 X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
3049 \p{Hira} \p{Hiragana} (= \p{Script_Extensions=
3050 Hiragana}) (NOT \p{Block=Hiragana}) (432)
3051 \p{Hiragana} \p{Script_Extensions=Hiragana} (Short:
3052 \p{Hira}; NOT \p{Block=Hiragana}) (432)
3053 \p{Hluw} \p{Anatolian_Hieroglyphs} (=
3054 \p{Script_Extensions=
3055 Anatolian_Hieroglyphs}) (NOT \p{Block=
3056 Anatolian_Hieroglyphs}) (583)
3057 \p{Hmng} \p{Pahawh_Hmong} (= \p{Script_Extensions=
3058 Pahawh_Hmong}) (NOT \p{Block=
3059 Pahawh_Hmong}) (127)
3060 \p{Hmnp} \p{Nyiakeng_Puachue_Hmong} (=
3061 \p{Script_Extensions=
3062 Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
3063 Nyiakeng_Puachue_Hmong}) (71)
3064 \p{HorizSpace} \p{XPosixBlank} (18)
3065 \p{Hst: *} \p{Hangul_Syllable_Type: *}
3066 \p{Hung} \p{Old_Hungarian} (= \p{Script_Extensions=
3067 Old_Hungarian}) (NOT \p{Block=
3068 Old_Hungarian}) (108)
3069 D \p{Hyphen} \p{Hyphen=Y} (11)
3070 D \p{Hyphen: N*} Supplanted by Line_Break property values;
3071 see www.unicode.org/reports/tr14
3072 (Single: \P{Hyphen}) (1_114_101 plus all
3073 above-Unicode code points: [\x00-\x20!
3074 \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
3075 \[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-\xff],
3076 U+0100..0589, U+058B..1805,
3077 U+1807..200F, U+2012..2E16, U+2E18..30FA
3078 ...)
3079 D \p{Hyphen: Y*} Supplanted by Line_Break property values;
3080 see www.unicode.org/reports/tr14
3081 (Single: \p{Hyphen}) (11: [\-\xad],
3082 U+058A, U+1806, U+2010..2011, U+2E17,
3083 U+30FB ...)
3084 \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
3085 \p{Ideographic_Description_Characters})
3086 (135_072)
3087 \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (979_040 plus
3088 all above-Unicode code points: [\x00-
3089 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
3090 \[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4\xb6
3091 \xb8-\xb9\xbb-\xbf\xd7\xf7],
3092 U+02C2..02C5, U+02D2..02DF,
3093 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
3094 \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (135_072:
3095 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
3096 \xd8-\xf6\xf8-\xff], U+0100..02C1,
3097 U+02C6..02D1, U+02E0..02E4, U+02EC,
3098 U+02EE ...)
3099 \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (131_997)
3100 \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (982_115 plus
3101 all above-Unicode code points: [\x00-
3102 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@
3103 \[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-
3104 \xb9\xbb-\xbf\xd7\xf7], U+02C2..02C5,
3105 U+02D2..02DF, U+02E5..02EB, U+02ED,
3106 U+02EF..036F ...)
3107 \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (131_997: [A-
3108 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
3109 \xff], U+0100..02C1, U+02C6..02D1,
3110 U+02E0..02E4, U+02EC, U+02EE ...)
3111 \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
3112 \p{Ideographic_Description_Characters})
3113 (135_072)
3114 \p{IDC: *} \p{ID_Continue: *}
3115 \p{Identifier_Status: Allowed} (107_957: [\'\-.0-9:A-Z_a-z\xb7
3116 \xc0-\xd6\xd8-\xf6\xf8-\xff],
3117 U+0100..0131, U+0134..013E,
3118 U+0141..0148, U+014A..017E, U+018F ...)
3119 \p{Identifier_Status: Restricted} (1_006_155 plus all above-
3120 Unicode code points: [\x00-\x20!\"#\$
3121 \%&\(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x7f-
3122 \xb6\xb8-\xbf\xd7\xf7], U+0132..0133,
3123 U+013F..0140, U+0149, U+017F..018E,
3124 U+0190..019F ...)
3125 \p{Identifier_Type: Default_Ignorable} (396: [\xad], U+034F,
3126 U+061C, U+115F..1160, U+17B4..17B5,
3127 U+180B..180F ...)
3128 \p{Identifier_Type: Deprecated} (15: U+0149, U+0673, U+0F77,
3129 U+0F79, U+17A3..17A4, U+206A..206F ...)
3130 \p{Identifier_Type: Exclusion} (17_080: U+03E2..03EF,
3131 U+0800..082D, U+0830..083E,
3132 U+1680..169C, U+16A0..16EA, U+16EE..16F8
3133 ...)
3134 \p{Identifier_Type: Inclusion} (19: [\'\-.:\xb7], U+0375, U+058A,
3135 U+05F3..05F4, U+06FD..06FE, U+0F0B ...)
3136 \p{Identifier_Type: Limited_Use} (5268: U+0700..070D,
3137 U+070F..074A, U+074D..074F,
3138 U+07C0..07FA, U+07FD..07FF, U+0840..085B
3139 ...)
3140 \p{Identifier_Type: Not_Character} (969_409 plus all above-Unicode
3141 code points: [^\t\n\cK\f\r\x20-\x7e\x85
3142 \xa0-\xff], U+0378..0379, U+0380..0383,
3143 U+038B, U+038D, U+03A2 ...)
3144 \p{Identifier_Type: Not_NFKC} (4859: [\xa0\xa8\xaa\xaf\xb2-\xb5
3145 \xb8-\xba\xbc-\xbe], U+0132..0133,
3146 U+013F..0140, U+017F, U+01C4..01CC,
3147 U+01F1..01F3 ...)
3148 \p{Identifier_Type: Not_XID} (8198: [\t\n\cK\f\r\x20!\"#\$\%&
3149 \(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x85
3150 \xa1-\xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6
3151 \xbb\xbf\xd7\xf7], U+02C2..02C5,
3152 U+02D2..02D7, U+02DE..02DF,
3153 U+02E5..02EB, U+02ED ...)
3154 \p{Identifier_Type: Obsolete} (1627: U+018D, U+01AA..01AB,
3155 U+01B9..01BB, U+01BE..01BF,
3156 U+01F6..01F7, U+021C..021D ...)
3157 \p{Identifier_Type: Recommended} (107_938: [0-9A-Z_a-z\xc0-\xd6
3158 \xd8-\xf6\xf8-\xff], U+0100..0131,
3159 U+0134..013E, U+0141..0148,
3160 U+014A..017E, U+018F ...)
3161 \p{Identifier_Type: Technical} (1660: U+0180, U+018D,
3162 U+01AA..01AB, U+01BA..01BB, U+01BE,
3163 U+01C0..01C3 ...)
3164 \p{Identifier_Type: Uncommon_Use} (393: U+0181..018C, U+018E,
3165 U+0190..019F, U+01A2..01A9,
3166 U+01AC..01AE, U+01B1..01B8 ...)
3167 \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
3168 (101_661)
3169 \p{Ideo: *} \p{Ideographic: *}
3170 \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
3171 (101_661)
3172 \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_012_451
3173 plus all above-Unicode code points:
3174 U+0000..3005, U+3008..3020,
3175 U+302A..3037, U+303B..33FF,
3176 U+4DC0..4DFF, U+A000..F8FF ...)
3177 \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (101_661:
3178 U+3006..3007, U+3021..3029,
3179 U+3038..303A, U+3400..4DBF,
3180 U+4E00..9FFF, U+F900..FA6D ...)
3181 X \p{Ideographic_Description_Characters} \p{Block=
3182 Ideographic_Description_Characters}
3183 (Short: \p{InIDC}) (16)
3184 X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
3185 \p{Block=
3186 Ideographic_Symbols_And_Punctuation})
3187 (32)
3188 X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
3189 Ideographic_Symbols_And_Punctuation}
3190 (Short: \p{InIdeographicSymbols}) (32)
3191 \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (131_997)
3192 \p{IDS: *} \p{ID_Start: *}
3193 \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
3194 \p{IDSB}) (10)
3195 \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
3196 (1_114_102 plus all above-Unicode code
3197 points: U+0000..2FEF, U+2FF2..2FF3,
3198 U+2FFC..infinity)
3199 \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
3200 U+2FF0..2FF1, U+2FF4..2FFB)
3201 \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
3202 \p{IDST}) (2)
3203 \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
3204 (1_114_110 plus all above-Unicode code
3205 points: U+0000..2FF1, U+2FF4..infinity)
3206 \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
3207 U+2FF2..2FF3)
3208 \p{IDSB} \p{IDS_Binary_Operator} (=
3209 \p{IDS_Binary_Operator=Y}) (10)
3210 \p{IDSB: *} \p{IDS_Binary_Operator: *}
3211 \p{IDST} \p{IDS_Trinary_Operator} (=
3212 \p{IDS_Trinary_Operator=Y}) (2)
3213 \p{IDST: *} \p{IDS_Trinary_Operator: *}
3214 \p{Imperial_Aramaic} \p{Script_Extensions=Imperial_Aramaic}
3215 (Short: \p{Armi}; NOT \p{Block=
3216 Imperial_Aramaic}) (31)
3217 \p{In: *} \p{Present_In: *} (Perl extension)
3218 X \p{In_*} \p{Block: *}
3219 X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
3220 Common_Indic_Number_Forms}) (16)
3221 \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
3222 (352: U+093C, U+0941..0944, U+094D,
3223 U+0952, U+0956..0957, U+0962..0963 ...)
3224 \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
3225 BottomAndLeft}) (1: U+A9BF)
3226 \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
3227 BottomAndRight}) (4: U+1B3B, U+A9BE,
3228 U+A9C0, U+11942)
3229 \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (64:
3230 U+093F, U+094E, U+09BF, U+09C7..09C8,
3231 U+0A3F, U+0ABF ...)
3232 \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
3233 LeftAndRight}) (22: U+09CB..09CC,
3234 U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
3235 U+0DDC, U+0DDE ...)
3236 \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_896
3237 plus all above-Unicode code points:
3238 U+0000..08FF, U+0904..0939, U+093D,
3239 U+0950, U+0958..0961, U+0964..0980 ...)
3240 \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
3241 Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
3242 U+10A01, U+10A06)
3243 \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (290:
3244 U+0903, U+093B, U+093E, U+0940,
3245 U+0949..094C, U+094F ...)
3246 \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (418:
3247 U+0900..0902, U+093A, U+0945..0948,
3248 U+0951, U+0953..0955, U+0981 ...)
3249 \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
3250 TopAndBottom}) (10: U+0C48, U+0F73,
3251 U+0F76..0F79, U+0F81, U+1B3C,
3252 U+1112E..1112F)
3253 \p{Indic_Positional_Category: Top_And_Bottom_And_Left} (Short:
3254 \p{InPC=TopAndBottomAndLeft}) (2:
3255 U+103C, U+1171E)
3256 \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
3257 \p{InPC=TopAndBottomAndRight}) (1:
3258 U+1B3D)
3259 \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
3260 TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
3261 U+1C29, U+114BB, U+115B9)
3262 \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
3263 \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
3264 U+0DDD, U+17BF, U+115BB)
3265 \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
3266 TopAndRight}) (13: U+0AC9, U+0B57,
3267 U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
3268 U+1925..1926 ...)
3269 \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
3270 VisualOrderLeft}) (19: U+0E40..0E44,
3271 U+0EC0..0EC4, U+19B5..19B7, U+19BA,
3272 U+AAB5..AAB6, U+AAB9 ...)
3273 X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
3274 \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
3275 (17: U+093D, U+09BD, U+0ABD, U+0B3D,
3276 U+0C3D, U+0CBD ...)
3277 \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (91:
3278 U+0900..0902, U+0981..0982, U+09FC,
3279 U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
3280 \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
3281 \p{InSC=BrahmiJoiningNumber}) (20:
3282 U+11052..11065)
3283 \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
3284 CantillationMark}) (59: U+0951..0952,
3285 U+0A51, U+0AFA..0AFC, U+1CD0..1CD2,
3286 U+1CD4..1CE1, U+1CF4 ...)
3287 \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
3288 (2206: U+0915..0939, U+0958..095F,
3289 U+0978..097F, U+0995..09A8,
3290 U+09AA..09B0, U+09B2 ...)
3291 \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
3292 ConsonantDead}) (14: U+09CE, U+0C5D,
3293 U+0CDD, U+0D54..0D56, U+0D7A..0D7F,
3294 U+1CF2..1CF3)
3295 \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
3296 ConsonantFinal}) (70: U+1930..1931,
3297 U+1933..1939, U+19C1..19C7,
3298 U+1A58..1A59, U+1B03, U+1B81 ...)
3299 \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
3300 \p{InSC=ConsonantHeadLetter}) (5:
3301 U+0F88..0F8C)
3302 \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
3303 \p{InSC=ConsonantInitialPostfixed}) (1:
3304 U+1A5A)
3305 \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
3306 ConsonantKiller}) (2: U+0E4C, U+17CD)
3307 \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
3308 ConsonantMedial}) (31: U+0A75,
3309 U+0EBC..0EBD, U+103B..103E,
3310 U+105E..1060, U+1082, U+1A55..1A56 ...)
3311 \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
3312 \p{InSC=ConsonantPlaceholder}) (22: [\-
3313 \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
3314 U+104E, U+1900 ...)
3315 \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
3316 \p{InSC=ConsonantPrecedingRepha}) (3:
3317 U+0D4E, U+11941, U+11D46)
3318 \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
3319 ConsonantPrefixed}) (10: U+111C2..111C3,
3320 U+1193F, U+11A3A, U+11A84..11A89)
3321 \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
3322 ConsonantSubjoined}) (94: U+0F8D..0F97,
3323 U+0F99..0FBC, U+1929..192B, U+1A57,
3324 U+1A5B..1A5E, U+1BA1..1BA3 ...)
3325 \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
3326 \p{InSC=ConsonantSucceedingRepha}) (1:
3327 U+17CC)
3328 \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
3329 \p{InSC=ConsonantWithStacker}) (8:
3330 U+0CF1..0CF2, U+1CF5..1CF6,
3331 U+11003..11004, U+11460..11461)
3332 \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
3333 GeminationMark}) (3: U+0A71, U+11237,
3334 U+11A98)
3335 \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
3336 InvisibleStacker}) (12: U+1039, U+17D2,
3337 U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
3338 \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
3339 U+200D)
3340 \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
3341 ModifyingLetter}) (1: U+0B83)
3342 \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
3343 NonJoiner}) (1: U+200C)
3344 \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (32:
3345 U+093C, U+09BC, U+0A3C, U+0ABC,
3346 U+0AFD..0AFF, U+0B3C ...)
3347 \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (491:
3348 [0-9], U+0966..096F, U+09E6..09EF,
3349 U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
3350 ...)
3351 \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
3352 NumberJoiner}) (1: U+1107F)
3353 \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
3354 (1_109_551 plus all above-Unicode code
3355 points: [\x00-\x20!\"#\$\%&\'\(\)*+,.
3356 \/:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
3357 \x9f\xa1-\xb1\xb4-\xd6\xd8-\xff],
3358 U+0100..08FF, U+0950, U+0953..0954,
3359 U+0964..0965, U+0970..0971 ...)
3360 \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
3361 PureKiller}) (25: U+0D3B..0D3C, U+0E3A,
3362 U+0E4E, U+0EBA, U+0F84, U+103A ...)
3363 \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
3364 RegisterShifter}) (2: U+17C9..17CA)
3365 \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
3366 SyllableModifier}) (25: [\xb2-\xb3],
3367 U+09FE, U+0F35, U+0F37, U+0FC6, U+17CB
3368 ...)
3369 \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
3370 ToneLetter}) (7: U+1970..1974, U+AAC0,
3371 U+AAC2)
3372 \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
3373 (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
3374 U+1063..1064, U+1069..106D, U+1087..108D
3375 ...)
3376 \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
3377 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3378 U+0BCD ...)
3379 \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3380 (35: U+0903, U+0983, U+0A03, U+0A83,
3381 U+0B03, U+0C03 ...)
3382 \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3383 U+1963..196D, U+A85E..A861, U+A866,
3384 U+A922..A92A, U+11150..11154)
3385 \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3386 VowelDependent}) (686: U+093A..093B,
3387 U+093E..094C, U+094E..094F,
3388 U+0955..0957, U+0962..0963, U+09BE..09C4
3389 ...)
3390 \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3391 VowelIndependent}) (486: U+0904..0914,
3392 U+0960..0961, U+0972..0977,
3393 U+0985..098C, U+098F..0990, U+0993..0994
3394 ...)
3395 \p{Inherited} \p{Script_Extensions=Inherited} (Short:
3396 \p{Zinh}) (586)
3397 \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
3398 (Short: \p{Pi}) (12)
3399 \p{InPC: *} \p{Indic_Positional_Category: *}
3400 \p{InSC: *} \p{Indic_Syllabic_Category: *}
3401 \p{Inscriptional_Pahlavi} \p{Script_Extensions=
3402 Inscriptional_Pahlavi} (Short: \p{Phli};
3403 NOT \p{Block=Inscriptional_Pahlavi}) (27)
3404 \p{Inscriptional_Parthian} \p{Script_Extensions=
3405 Inscriptional_Parthian} (Short:
3406 \p{Prti}; NOT \p{Block=
3407 Inscriptional_Parthian}) (30)
3408 X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
3409 IPA_Extensions}) (96)
3410 X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
3411 \p{InIPAExt}) (96)
3412 \p{Is_*} \p{*} (Any exceptions are individually
3413 noted beginning with the word NOT.) If
3414 an entry has flag(s) at its beginning,
3415 like "D", the "Is_" form has the same
3416 flag(s)
3417 \p{Ital} \p{Old_Italic} (= \p{Script_Extensions=
3418 Old_Italic}) (NOT \p{Block=Old_Italic})
3419 (39)
3420 X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
3421 (256)
3422 X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
3423 Hangul_Jamo_Extended_A}) (32)
3424 X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
3425 Hangul_Jamo_Extended_B}) (80)
3426 \p{Java} \p{Javanese} (= \p{Script_Extensions=
3427 Javanese}) (NOT \p{Block=Javanese}) (91)
3428 \p{Javanese} \p{Script_Extensions=Javanese} (Short:
3429 \p{Java}; NOT \p{Block=Javanese}) (91)
3430 \p{Jg: *} \p{Joining_Group: *}
3431 \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
3432 \p{Join_C: *} \p{Join_Control: *}
3433 \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
3434 \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
3435 plus all above-Unicode code points:
3436 U+0000..200B, U+200E..infinity)
3437 \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2:
3438 U+200C..200D)
3439 \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
3440 U+08BB)
3441 \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
3442 U+08BD)
3443 \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (2:
3444 U+08BC, U+08C4)
3445 \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (9: U+0639..063A,
3446 U+06A0, U+06FC, U+075D..075F, U+08B3,
3447 U+08C3)
3448 \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
3449 \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (29: U+0622..0623,
3450 U+0625, U+0627, U+0671..0673, U+0675,
3451 U+0773..0774 ...)
3452 \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (27: U+0628,
3453 U+062A..062B, U+066E, U+0679..0680,
3454 U+0750..0756, U+08A0..08A1 ...)
3455 \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
3456 \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
3457 BurushaskiYehBarree}) (2: U+077A..077B)
3458 \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15: U+062F..0630,
3459 U+0688..0690, U+06EE, U+0759..075A,
3460 U+08AE)
3461 \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
3462 U+0715..0716, U+072A, U+072F)
3463 \p{Joining_Group: E} (Short: \p{Jg=E}) (1: U+0725)
3464 \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
3465 U+063D..063F, U+06CC, U+06CE,
3466 U+0775..0776)
3467 \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1: U+074F)
3468 \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10: U+0641,
3469 U+06A1..06A6, U+0760..0761, U+08A4)
3470 \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
3471 U+0724)
3472 \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (17: U+063B..063C,
3473 U+06A9, U+06AB, U+06AF..06B4,
3474 U+0762..0764, U+088D ...)
3475 \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
3476 U+072E)
3477 \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (22: U+062C..062E,
3478 U+0681..0687, U+06BF, U+0757..0758,
3479 U+076E..076F, U+0772 ...)
3480 \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
3481 HamzaOnHehGoal}) (1: U+06C3)
3482 \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
3483 HanifiRohingyaKinnaYa}) (4: U+10D19,
3484 U+10D1E, U+10D20, U+10D23)
3485 \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
3486 HanifiRohingyaPa}) (3: U+10D02, U+10D09,
3487 U+10D1C)
3488 \p{Joining_Group: He} (Short: \p{Jg=He}) (1: U+0717)
3489 \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1: U+0647)
3490 \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
3491 U+06C1..06C2)
3492 \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
3493 \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6: U+0643,
3494 U+06AC..06AE, U+077F, U+08B4)
3495 \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
3496 \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
3497 \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
3498 U+06BE, U+06FF)
3499 \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (8: U+0644,
3500 U+06B5..06B8, U+076A, U+08A6, U+08C7)
3501 \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
3502 \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
3503 U+0866)
3504 \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
3505 U+0861)
3506 \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
3507 U+0868)
3508 \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
3509 (1: U+0869)
3510 \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
3511 U+0860)
3512 \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
3513 U+0864)
3514 \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
3515 (1: U+0865)
3516 \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
3517 U+0862)
3518 \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
3519 U+0867)
3520 \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
3521 U+086A)
3522 \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
3523 U+0863)
3524 \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
3525 ManichaeanAleph}) (1: U+10AC0)
3526 \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
3527 (2: U+10AD9..10ADA)
3528 \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
3529 (2: U+10AC1..10AC2)
3530 \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
3531 ManichaeanDaleth}) (1: U+10AC5)
3532 \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
3533 ManichaeanDhamedh}) (1: U+10AD4)
3534 \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
3535 (1: U+10AEC)
3536 \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
3537 ManichaeanGimel}) (2: U+10AC3..10AC4)
3538 \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
3539 (1: U+10ACD)
3540 \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
3541 ManichaeanHundred}) (1: U+10AEF)
3542 \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
3543 (3: U+10AD0..10AD2)
3544 \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
3545 ManichaeanLamedh}) (1: U+10AD3)
3546 \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
3547 (1: U+10AD6)
3548 \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
3549 (1: U+10AD7)
3550 \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
3551 (1: U+10AEB)
3552 \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
3553 U+10ADB..10ADC)
3554 \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
3555 (3: U+10ADE..10AE0)
3556 \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
3557 (1: U+10AE1)
3558 \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
3559 ManichaeanSadhe}) (1: U+10ADD)
3560 \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
3561 ManichaeanSamekh}) (1: U+10AD8)
3562 \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
3563 (1: U+10AE4)
3564 \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
3565 (1: U+10AED)
3566 \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
3567 (1: U+10ACE)
3568 \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
3569 ManichaeanThamedh}) (1: U+10AD5)
3570 \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
3571 ManichaeanTwenty}) (1: U+10AEE)
3572 \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
3573 (1: U+10AC7)
3574 \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
3575 (1: U+10ACF)
3576 \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
3577 ManichaeanZayin}) (2: U+10AC9..10ACA)
3578 \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
3579 U+0765..0766, U+08A7)
3580 \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1: U+0721)
3581 \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
3582 (1_113_762 plus all above-Unicode code
3583 points: U+0000..061F, U+0621, U+0640,
3584 U+064B..066D, U+0670, U+0674 ...)
3585 \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (9: U+0646,
3586 U+06B9..06BC, U+0767..0769, U+0889)
3587 \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1: U+0722)
3588 \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1: U+06BD)
3589 \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1: U+0726)
3590 \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (6: U+0642, U+066F,
3591 U+06A7..06A8, U+08A5, U+08B5)
3592 \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
3593 \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (19: U+0631..0632,
3594 U+0691..0699, U+06EF, U+075B,
3595 U+076B..076C, U+0771 ...)
3596 \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
3597 U+0727)
3598 \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
3599 U+08AC)
3600 \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6: U+0635..0636,
3601 U+069D..069E, U+06FB, U+08AF)
3602 \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
3603 \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
3604 U+069A..069C, U+06FA, U+075C, U+076D,
3605 U+0770 ...)
3606 \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
3607 \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
3608 \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
3609 U+08B1)
3610 \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
3611 \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
3612 \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (6: U+0637..0638,
3613 U+069F, U+088B..088C, U+08A3)
3614 \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1: U+072C)
3615 \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
3616 U+0629, U+06C0, U+06D5)
3617 \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
3618 Hamza_On_Heh_Goal} (1)
3619 \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
3620 \p{Joining_Group: Thin_Yeh} (Short: \p{Jg=ThinYeh}) (1: U+0886)
3621 \p{Joining_Group: Vertical_Tail} (Short: \p{Jg=VerticalTail}) (1:
3622 U+088E)
3623 \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
3624 U+0676..0677, U+06C4..06CB, U+06CF,
3625 U+0778..0779 ...)
3626 \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
3627 U+0649..064A, U+0678, U+06D0..06D1,
3628 U+0777 ...)
3629 \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
3630 U+06D2..06D3)
3631 \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
3632 U+06CD)
3633 \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
3634 \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
3635 \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
3636 \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
3637 \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (7)
3638 \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (610)
3639 \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (610: U+0620,
3640 U+0626, U+0628, U+062A..062E,
3641 U+0633..063F, U+0641..0647 ...)
3642 \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (7: U+0640,
3643 U+07FA, U+0883..0885, U+180A, U+200D)
3644 \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (5)
3645 \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (5: U+A872,
3646 U+10ACD, U+10AD7, U+10D00, U+10FCB)
3647 \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_230 plus
3648 all above-Unicode code points: [\x00-
3649 \xac\xae-\xff], U+0100..02FF,
3650 U+0370..0482, U+048A..0590, U+05BE,
3651 U+05C0 ...)
3652 \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (152)
3653 \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (152:
3654 U+0622..0625, U+0627, U+0629,
3655 U+062F..0632, U+0648, U+0671..0673 ...)
3656 \p{Joining_Type: T} \p{Joining_Type=Transparent} (2108)
3657 \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (2108: [\xad],
3658 U+0300..036F, U+0483..0489,
3659 U+0591..05BD, U+05BF, U+05C1..05C2 ...)
3660 \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_230
3661 plus all above-Unicode code points)
3662 \p{Jt: *} \p{Joining_Type: *}
3663 \p{Kaithi} \p{Script_Extensions=Kaithi} (Short:
3664 \p{Kthi}; NOT \p{Block=Kaithi}) (88)
3665 \p{Kali} \p{Kayah_Li} (= \p{Script_Extensions=
3666 Kayah_Li}) (48)
3667 \p{Kana} \p{Katakana} (= \p{Script_Extensions=
3668 Katakana}) (NOT \p{Block=Katakana}) (372)
3669 X \p{Kana_Ext_A} \p{Kana_Extended_A} (= \p{Block=
3670 Kana_Extended_A}) (48)
3671 X \p{Kana_Ext_B} \p{Kana_Extended_B} (= \p{Block=
3672 Kana_Extended_B}) (16)
3673 X \p{Kana_Extended_A} \p{Block=Kana_Extended_A} (Short:
3674 \p{InKanaExtA}) (48)
3675 X \p{Kana_Extended_B} \p{Block=Kana_Extended_B} (Short:
3676 \p{InKanaExtB}) (16)
3677 X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
3678 Kana_Supplement}) (256)
3679 X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
3680 \p{InKanaSup}) (256)
3681 X \p{Kanbun} \p{Block=Kanbun} (16)
3682 X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
3683 Kangxi_Radicals}) (224)
3684 X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
3685 \p{InKangxi}) (224)
3686 \p{Kannada} \p{Script_Extensions=Kannada} (Short:
3687 \p{Knda}; NOT \p{Block=Kannada}) (105)
3688 \p{Katakana} \p{Script_Extensions=Katakana} (Short:
3689 \p{Kana}; NOT \p{Block=Katakana}) (372)
3690 X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
3691 \p{Block=Katakana_Phonetic_Extensions})
3692 (16)
3693 X \p{Katakana_Phonetic_Extensions} \p{Block=
3694 Katakana_Phonetic_Extensions} (Short:
3695 \p{InKatakanaExt}) (16)
3696 \p{Kayah_Li} \p{Script_Extensions=Kayah_Li} (Short:
3697 \p{Kali}) (48)
3698 \p{Khar} \p{Kharoshthi} (= \p{Script_Extensions=
3699 Kharoshthi}) (NOT \p{Block=Kharoshthi})
3700 (68)
3701 \p{Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
3702 \p{Khar}; NOT \p{Block=Kharoshthi}) (68)
3703 \p{Khitan_Small_Script} \p{Script_Extensions=Khitan_Small_Script}
3704 (Short: \p{Kits}; NOT \p{Block=
3705 Khitan_Small_Script}) (471)
3706 \p{Khmer} \p{Script_Extensions=Khmer} (Short:
3707 \p{Khmr}; NOT \p{Block=Khmer}) (146)
3708 X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
3709 \p{Khmr} \p{Khmer} (= \p{Script_Extensions=Khmer})
3710 (NOT \p{Block=Khmer}) (146)
3711 \p{Khoj} \p{Khojki} (= \p{Script_Extensions=
3712 Khojki}) (NOT \p{Block=Khojki}) (82)
3713 \p{Khojki} \p{Script_Extensions=Khojki} (Short:
3714 \p{Khoj}; NOT \p{Block=Khojki}) (82)
3715 \p{Khudawadi} \p{Script_Extensions=Khudawadi} (Short:
3716 \p{Sind}; NOT \p{Block=Khudawadi}) (81)
3717 \p{Kits} \p{Khitan_Small_Script} (=
3718 \p{Script_Extensions=
3719 Khitan_Small_Script}) (NOT \p{Block=
3720 Khitan_Small_Script}) (471)
3721 \p{Knda} \p{Kannada} (= \p{Script_Extensions=
3722 Kannada}) (NOT \p{Block=Kannada}) (105)
3723 \p{Kthi} \p{Kaithi} (= \p{Script_Extensions=
3724 Kaithi}) (NOT \p{Block=Kaithi}) (88)
3725 \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
3726 (131_756)
3727 X \p{L&} \p{Cased_Letter} (= \p{General_Category=
3728 Cased_Letter}) (4089)
3729 X \p{L_} \p{Cased_Letter} (= \p{General_Category=
3730 Cased_Letter}) Note the trailing '_'
3731 matters in spite of loose matching
3732 rules. (4089)
3733 \p{Lana} \p{Tai_Tham} (= \p{Script_Extensions=
3734 Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
3735 \p{Lao} \p{Script_Extensions=Lao} (NOT \p{Block=
3736 Lao}) (82)
3737 \p{Laoo} \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
3738 \p{Block=Lao}) (82)
3739 \p{Latin} \p{Script_Extensions=Latin} (Short:
3740 \p{Latn}) (1504)
3741 X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
3742 Latin_1_Supplement}) (128)
3743 X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
3744 Latin_1_Supplement}) (128)
3745 X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
3746 \p{InLatin1}) (128)
3747 X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
3748 Latin_Extended_A}) (128)
3749 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
3750 \p{Block=Latin_Extended_Additional})
3751 (256)
3752 X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
3753 Latin_Extended_B}) (208)
3754 X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
3755 Latin_Extended_C}) (32)
3756 X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
3757 Latin_Extended_D}) (224)
3758 X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block=
3759 Latin_Extended_E}) (64)
3760 X \p{Latin_Ext_F} \p{Latin_Extended_F} (= \p{Block=
3761 Latin_Extended_F}) (64)
3762 X \p{Latin_Ext_G} \p{Latin_Extended_G} (= \p{Block=
3763 Latin_Extended_G}) (256)
3764 X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
3765 \p{InLatinExtA}) (128)
3766 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
3767 (Short: \p{InLatinExtAdditional}) (256)
3768 X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
3769 \p{InLatinExtB}) (208)
3770 X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
3771 \p{InLatinExtC}) (32)
3772 X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
3773 \p{InLatinExtD}) (224)
3774 X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short:
3775 \p{InLatinExtE}) (64)
3776 X \p{Latin_Extended_F} \p{Block=Latin_Extended_F} (Short:
3777 \p{InLatinExtF}) (64)
3778 X \p{Latin_Extended_G} \p{Block=Latin_Extended_G} (Short:
3779 \p{InLatinExtG}) (256)
3780 \p{Latn} \p{Latin} (= \p{Script_Extensions=Latin})
3781 (1504)
3782 \p{Lb: *} \p{Line_Break: *}
3783 \p{LC} \p{Cased_Letter} (= \p{General_Category=
3784 Cased_Letter}) (4089)
3785 \p{Lepc} \p{Lepcha} (= \p{Script_Extensions=
3786 Lepcha}) (NOT \p{Block=Lepcha}) (74)
3787 \p{Lepcha} \p{Script_Extensions=Lepcha} (Short:
3788 \p{Lepc}; NOT \p{Block=Lepcha}) (74)
3789 \p{Letter} \p{General_Category=Letter} (Short: \p{L})
3790 (131_756)
3791 \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
3792 \p{Nl}) (236)
3793 X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
3794 \p{Limb} \p{Limbu} (= \p{Script_Extensions=Limbu})
3795 (NOT \p{Block=Limbu}) (69)
3796 \p{Limbu} \p{Script_Extensions=Limbu} (Short:
3797 \p{Limb}; NOT \p{Block=Limbu}) (69)
3798 \p{Lina} \p{Linear_A} (= \p{Script_Extensions=
3799 Linear_A}) (NOT \p{Block=Linear_A}) (386)
3800 \p{Linb} \p{Linear_B} (= \p{Script_Extensions=
3801 Linear_B}) (268)
3802 \p{Line_Break: AI} \p{Line_Break=Ambiguous} (707)
3803 \p{Line_Break: AL} \p{Line_Break=Alphabetic} (22_043)
3804 \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (22_043: [#&*<=>\@A-
3805 Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
3806 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
3807 U+02CE..02CF, U+02D1..02D7, U+02DC,
3808 U+02DE ...)
3809 \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-\xa8\xaa
3810 \xb2-\xb3\xb6-\xba\xbc-\xbe\xd7\xf7],
3811 U+02C7, U+02C9..02CB, U+02CD, U+02D0,
3812 U+02D8..02DB ...)
3813 \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
3814 \p{Line_Break: BA} \p{Line_Break=Break_After} (247)
3815 \p{Line_Break: BB} \p{Line_Break=Break_Before} (45)
3816 \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
3817 \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (247: [\t\|\xad],
3818 U+058A, U+05BE, U+0964..0965,
3819 U+0E5A..0E5B, U+0F0B ...)
3820 \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (45: [\xb4],
3821 U+02C8, U+02CC, U+02DF, U+0C77, U+0C84
3822 ...)
3823 \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
3824 U+2E3A..2E3B)
3825 \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
3826 \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
3827 \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
3828 \p{Line_Break: CJ} \p{Line_Break=
3829 Conditional_Japanese_Starter} (58)
3830 \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (95)
3831 \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
3832 \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (95: [\}],
3833 U+0F3B, U+0F3D, U+169C, U+2046, U+207E
3834 ...)
3835 \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (2399)
3836 \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2399: [^\t\n
3837 \cK\f\r\x20-\x7e\x85\xa0-\xff],
3838 U+0300..034E, U+0350..035B,
3839 U+0363..036F, U+0483..0489, U+0591..05BD
3840 ...)
3841 \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (757:
3842 U+0E01..0E3A, U+0E40..0E4E,
3843 U+0E81..0E82, U+0E84, U+0E86..0E8A,
3844 U+0E8C..0EA3 ...)
3845 \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
3846 (58: U+3041, U+3043, U+3045, U+3047,
3847 U+3049, U+3063 ...)
3848 \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
3849 \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
3850 \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
3851 \p{Line_Break: E_Base} (Short: \p{Lb=EB}) (132: U+261D, U+26F9,
3852 U+270A..270D, U+1F385, U+1F3C2..1F3C4,
3853 U+1F3C7 ...)
3854 \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
3855 \p{Line_Break: EB} \p{Line_Break=E_Base} (132)
3856 \p{Line_Break: EM} \p{Line_Break=E_Modifier} (5)
3857 \p{Line_Break: EX} \p{Line_Break=Exclamation} (40)
3858 \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (40: [!?], U+05C6,
3859 U+061B, U+061D..061F, U+06D4, U+07F9 ...)
3860 \p{Line_Break: GL} \p{Line_Break=Glue} (26)
3861 \p{Line_Break: Glue} (Short: \p{Lb=GL}) (26: [\xa0], U+034F,
3862 U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
3863 \p{Line_Break: H2} (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
3864 U+AC38, U+AC54, U+AC70, U+AC8C ...)
3865 \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
3866 U+AC1D..AC37, U+AC39..AC53,
3867 U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
3868 ...)
3869 \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75:
3870 U+05D0..05EA, U+05EF..05F2, U+FB1D,
3871 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
3872 ...)
3873 \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (75)
3874 \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
3875 \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1: [\-])
3876 \p{Line_Break: ID} \p{Line_Break=Ideographic} (172_456)
3877 \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_456:
3878 U+231A..231B, U+23F0..23F3,
3879 U+2600..2603, U+2614..2615, U+2618,
3880 U+261A..261C ...)
3881 \p{Line_Break: IN} \p{Line_Break=Inseparable} (6)
3882 \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
3883 U+037E, U+0589, U+060C..060D, U+07F8,
3884 U+2044 ...)
3885 \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
3886 U+22EF, U+FE19, U+10AF6)
3887 \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
3888 \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
3889 \p{Line_Break: JL} (Short: \p{Lb=JL}) (125: U+1100..115F,
3890 U+A960..A97C)
3891 \p{Line_Break: JT} (Short: \p{Lb=JT}) (137: U+11A8..11FF,
3892 U+D7CB..D7FB)
3893 \p{Line_Break: JV} (Short: \p{Lb=JV}) (95: U+1160..11A7,
3894 U+D7B0..D7C6)
3895 \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
3896 \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
3897 \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
3898 U+2028..2029)
3899 \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
3900 \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
3901 \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6,
3902 U+203C..203D, U+2047..2049, U+3005,
3903 U+301C, U+303B..303C ...)
3904 \p{Line_Break: NS} \p{Line_Break=Nonstarter} (33)
3905 \p{Line_Break: NU} \p{Line_Break=Numeric} (652)
3906 \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (652: [0-9],
3907 U+0660..0669, U+066B..066C,
3908 U+06F0..06F9, U+07C0..07C9, U+0966..096F
3909 ...)
3910 \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (92)
3911 \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (92: [\(\[\{
3912 \xa1\xbf], U+0F3A, U+0F3C, U+169B,
3913 U+201A, U+201E ...)
3914 \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (37)
3915 \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (37: [\%\xa2
3916 \xb0], U+0609..060B, U+066A,
3917 U+09F2..09F3, U+09F9, U+0D79 ...)
3918 \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (67)
3919 \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (67: [\$+\\\xa3-
3920 \xa5\xb1], U+058F, U+07FE..07FF, U+09FB,
3921 U+0AF1, U+0BF9 ...)
3922 \p{Line_Break: QU} \p{Line_Break=Quotation} (39)
3923 \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
3924 U+2018..2019, U+201B..201D, U+201F,
3925 U+2039..203A, U+275B..2760 ...)
3926 \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
3927 U+1F1E6..1F1FF)
3928 \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
3929 \p{Line_Break: SA} \p{Line_Break=Complex_Context} (757)
3930 D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
3931 \p{Line_Break: SP} \p{Line_Break=Space} (1)
3932 \p{Line_Break: Space} (Short: \p{Lb=SP}) (1: [\x20])
3933 D \p{Line_Break: Surrogate} Surrogates should never appear in well-
3934 formed text, and therefore shouldn't be
3935 the basis for line breaking (Short:
3936 \p{Lb=SG}) (2048: U+D800..DFFF)
3937 \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
3938 \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (900_465 plus all
3939 above-Unicode code points: U+0378..0379,
3940 U+0380..0383, U+038B, U+038D, U+03A2,
3941 U+0530 ...)
3942 \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
3943 \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
3944 \p{Line_Break: XX} \p{Line_Break=Unknown} (900_465 plus all
3945 above-Unicode code points)
3946 \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
3947 \p{Line_Break: ZWJ} (Short: \p{Lb=ZWJ}) (1: U+200D)
3948 \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
3949 \p{Line_Separator} \p{General_Category=Line_Separator}
3950 (Short: \p{Zl}) (1)
3951 \p{Linear_A} \p{Script_Extensions=Linear_A} (Short:
3952 \p{Lina}; NOT \p{Block=Linear_A}) (386)
3953 \p{Linear_B} \p{Script_Extensions=Linear_B} (Short:
3954 \p{Linb}) (268)
3955 X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
3956 X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
3957 \p{Lisu} \p{Script_Extensions=Lisu} (NOT \p{Block=
3958 Lisu}) (49)
3959 X \p{Lisu_Sup} \p{Lisu_Supplement} (= \p{Block=
3960 Lisu_Supplement}) (16)
3961 X \p{Lisu_Supplement} \p{Block=Lisu_Supplement} (Short:
3962 \p{InLisuSup}) (16)
3963 \p{Ll} \p{Lowercase_Letter} (=
3964 \p{General_Category=Lowercase_Letter})
3965 (/i= General_Category=Cased_Letter)
3966 (2227)
3967 \p{Lm} \p{Modifier_Letter} (=
3968 \p{General_Category=Modifier_Letter})
3969 (334)
3970 \p{Lo} \p{Other_Letter} (= \p{General_Category=
3971 Other_Letter}) (127_333)
3972 \p{LOE} \p{Logical_Order_Exception} (=
3973 \p{Logical_Order_Exception=Y}) (19)
3974 \p{LOE: *} \p{Logical_Order_Exception: *}
3975 \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
3976 \p{LOE}) (19)
3977 \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
3978 (1_114_093 plus all above-Unicode code
3979 points: U+0000..0E3F, U+0E45..0EBF,
3980 U+0EC5..19B4, U+19B8..19B9,
3981 U+19BB..AAB4, U+AAB7..AAB8 ...)
3982 \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
3983 U+0E40..0E44, U+0EC0..0EC4,
3984 U+19B5..19B7, U+19BA, U+AAB5..AAB6,
3985 U+AAB9 ...)
3986 X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
3987 \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3988 Cased=Yes) (2471)
3989 \p{Lower: *} \p{Lowercase: *}
3990 \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3991 Cased=Yes) (2471)
3992 \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
3993 No) (1_111_641 plus all above-Unicode
3994 code points: [\x00-\x20!\"#\$\%&\'
3995 \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`\{
3996 \|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9\xbb-
3997 \xde\xf7], U+0100, U+0102, U+0104,
3998 U+0106, U+0108 ...)
3999 \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
4000 Yes) (2471: [a-z\xaa\xb5\xba\xdf-\xf6
4001 \xf8-\xff], U+0101, U+0103, U+0105,
4002 U+0107, U+0109 ...)
4003 \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
4004 (Short: \p{Ll}; /i= General_Category=
4005 Cased_Letter) (2227)
4006 \p{Lt} \p{Titlecase_Letter} (=
4007 \p{General_Category=Titlecase_Letter})
4008 (/i= General_Category=Cased_Letter) (31)
4009 \p{Lu} \p{Uppercase_Letter} (=
4010 \p{General_Category=Uppercase_Letter})
4011 (/i= General_Category=Cased_Letter)
4012 (1831)
4013 \p{Lyci} \p{Lycian} (= \p{Script_Extensions=
4014 Lycian}) (NOT \p{Block=Lycian}) (29)
4015 \p{Lycian} \p{Script_Extensions=Lycian} (Short:
4016 \p{Lyci}; NOT \p{Block=Lycian}) (29)
4017 \p{Lydi} \p{Lydian} (= \p{Script_Extensions=
4018 Lydian}) (NOT \p{Block=Lydian}) (27)
4019 \p{Lydian} \p{Script_Extensions=Lydian} (Short:
4020 \p{Lydi}; NOT \p{Block=Lydian}) (27)
4021 \p{M} \pM \p{Mark} (= \p{General_Category=Mark})
4022 (2408)
4023 \p{Mahajani} \p{Script_Extensions=Mahajani} (Short:
4024 \p{Mahj}; NOT \p{Block=Mahajani}) (61)
4025 \p{Mahj} \p{Mahajani} (= \p{Script_Extensions=
4026 Mahajani}) (NOT \p{Block=Mahajani}) (61)
4027 X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
4028 Mahjong_Tiles}) (48)
4029 X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
4030 \p{InMahjong}) (48)
4031 \p{Maka} \p{Makasar} (= \p{Script_Extensions=
4032 Makasar}) (NOT \p{Block=Makasar}) (25)
4033 \p{Makasar} \p{Script_Extensions=Makasar} (Short:
4034 \p{Maka}; NOT \p{Block=Makasar}) (25)
4035 \p{Malayalam} \p{Script_Extensions=Malayalam} (Short:
4036 \p{Mlym}; NOT \p{Block=Malayalam}) (126)
4037 \p{Mand} \p{Mandaic} (= \p{Script_Extensions=
4038 Mandaic}) (NOT \p{Block=Mandaic}) (30)
4039 \p{Mandaic} \p{Script_Extensions=Mandaic} (Short:
4040 \p{Mand}; NOT \p{Block=Mandaic}) (30)
4041 \p{Mani} \p{Manichaean} (= \p{Script_Extensions=
4042 Manichaean}) (NOT \p{Block=Manichaean})
4043 (52)
4044 \p{Manichaean} \p{Script_Extensions=Manichaean} (Short:
4045 \p{Mani}; NOT \p{Block=Manichaean}) (52)
4046 \p{Marc} \p{Marchen} (= \p{Script_Extensions=
4047 Marchen}) (NOT \p{Block=Marchen}) (68)
4048 \p{Marchen} \p{Script_Extensions=Marchen} (Short:
4049 \p{Marc}; NOT \p{Block=Marchen}) (68)
4050 \p{Mark} \p{General_Category=Mark} (Short: \p{M})
4051 (2408)
4052 \p{Masaram_Gondi} \p{Script_Extensions=Masaram_Gondi}
4053 (Short: \p{Gonm}; NOT \p{Block=
4054 Masaram_Gondi}) (77)
4055 \p{Math} \p{Math=Y} (2310)
4056 \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all
4057 above-Unicode code points: [\x00-\x20!
4058 \"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-Z
4059 \[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
4060 \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
4061 U+03D3..03D4, U+03D6..03EF,
4062 U+03F2..03F3, U+03F7..0605 ...)
4063 \p{Math: Y*} (Single: \p{Math}) (2310: [+<=>\^\|~\xac
4064 \xb1\xd7\xf7], U+03D0..03D2, U+03D5,
4065 U+03F0..03F1, U+03F4..03F6, U+0606..0608
4066 ...)
4067 X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
4068 \p{Block=
4069 Mathematical_Alphanumeric_Symbols})
4070 (1024)
4071 X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
4072 Mathematical_Operators}) (256)
4073 \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
4074 \p{Sm}) (948)
4075 X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
4076 Mathematical_Alphanumeric_Symbols}
4077 (Short: \p{InMathAlphanum}) (1024)
4078 X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
4079 (Short: \p{InMathOperators}) (256)
4080 X \p{Mayan_Numerals} \p{Block=Mayan_Numerals} (32)
4081 \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
4082 Spacing_Mark}) (445)
4083 \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
4084 Enclosing_Mark}) (13)
4085 \p{Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
4086 \p{Medf}; NOT \p{Block=Medefaidrin}) (91)
4087 \p{Medf} \p{Medefaidrin} (= \p{Script_Extensions=
4088 Medefaidrin}) (NOT \p{Block=
4089 Medefaidrin}) (91)
4090 \p{Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} (Short:
4091 \p{Mtei}; NOT \p{Block=Meetei_Mayek})
4092 (79)
4093 X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
4094 Meetei_Mayek_Extensions}) (32)
4095 X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
4096 (Short: \p{InMeeteiMayekExt}) (32)
4097 \p{Mend} \p{Mende_Kikakui} (= \p{Script_Extensions=
4098 Mende_Kikakui}) (NOT \p{Block=
4099 Mende_Kikakui}) (213)
4100 \p{Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
4101 (Short: \p{Mend}; NOT \p{Block=
4102 Mende_Kikakui}) (213)
4103 \p{Merc} \p{Meroitic_Cursive} (=
4104 \p{Script_Extensions=Meroitic_Cursive})
4105 (NOT \p{Block=Meroitic_Cursive}) (90)
4106 \p{Mero} \p{Meroitic_Hieroglyphs} (=
4107 \p{Script_Extensions=
4108 Meroitic_Hieroglyphs}) (32)
4109 \p{Meroitic_Cursive} \p{Script_Extensions=Meroitic_Cursive}
4110 (Short: \p{Merc}; NOT \p{Block=
4111 Meroitic_Cursive}) (90)
4112 \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
4113 Meroitic_Hieroglyphs} (Short: \p{Mero})
4114 (32)
4115 \p{Miao} \p{Script_Extensions=Miao} (NOT \p{Block=
4116 Miao}) (149)
4117 X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
4118 \p{Block=
4119 Miscellaneous_Symbols_And_Arrows}) (256)
4120 X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
4121 (= \p{Block=
4122 Miscellaneous_Mathematical_Symbols_A})
4123 (48)
4124 X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
4125 (= \p{Block=
4126 Miscellaneous_Mathematical_Symbols_B})
4127 (128)
4128 X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
4129 (= \p{Block=
4130 Miscellaneous_Symbols_And_Pictographs})
4131 (768)
4132 X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
4133 Miscellaneous_Symbols}) (256)
4134 X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
4135 Miscellaneous_Technical}) (256)
4136 X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
4137 Miscellaneous_Mathematical_Symbols_A}
4138 (Short: \p{InMiscMathSymbolsA}) (48)
4139 X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
4140 Miscellaneous_Mathematical_Symbols_B}
4141 (Short: \p{InMiscMathSymbolsB}) (128)
4142 X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
4143 \p{InMiscSymbols}) (256)
4144 X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
4145 Miscellaneous_Symbols_And_Arrows}
4146 (Short: \p{InMiscArrows}) (256)
4147 X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
4148 Miscellaneous_Symbols_And_Pictographs}
4149 (Short: \p{InMiscPictographs}) (768)
4150 X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
4151 (Short: \p{InMiscTechnical}) (256)
4152 \p{Mlym} \p{Malayalam} (= \p{Script_Extensions=
4153 Malayalam}) (NOT \p{Block=Malayalam})
4154 (126)
4155 \p{Mn} \p{Nonspacing_Mark} (=
4156 \p{General_Category=Nonspacing_Mark})
4157 (1950)
4158 \p{Modi} \p{Script_Extensions=Modi} (NOT \p{Block=
4159 Modi}) (89)
4160 \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
4161 (Short: \p{Lm}) (334)
4162 X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
4163 Spacing_Modifier_Letters}) (80)
4164 \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
4165 (Short: \p{Sk}) (125)
4166 X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
4167 \p{Mong} \p{Mongolian} (= \p{Script_Extensions=
4168 Mongolian}) (NOT \p{Block=Mongolian})
4169 (172)
4170 \p{Mongolian} \p{Script_Extensions=Mongolian} (Short:
4171 \p{Mong}; NOT \p{Block=Mongolian}) (172)
4172 X \p{Mongolian_Sup} \p{Mongolian_Supplement} (= \p{Block=
4173 Mongolian_Supplement}) (32)
4174 X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
4175 \p{InMongolianSup}) (32)
4176 \p{Mro} \p{Script_Extensions=Mro} (NOT \p{Block=
4177 Mro}) (43)
4178 \p{Mroo} \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
4179 \p{Block=Mro}) (43)
4180 \p{Mtei} \p{Meetei_Mayek} (= \p{Script_Extensions=
4181 Meetei_Mayek}) (NOT \p{Block=
4182 Meetei_Mayek}) (79)
4183 \p{Mult} \p{Multani} (= \p{Script_Extensions=
4184 Multani}) (NOT \p{Block=Multani}) (48)
4185 \p{Multani} \p{Script_Extensions=Multani} (Short:
4186 \p{Mult}; NOT \p{Block=Multani}) (48)
4187 X \p{Music} \p{Musical_Symbols} (= \p{Block=
4188 Musical_Symbols}) (256)
4189 X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
4190 \p{InMusic}) (256)
4191 \p{Myanmar} \p{Script_Extensions=Myanmar} (Short:
4192 \p{Mymr}; NOT \p{Block=Myanmar}) (224)
4193 X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
4194 Myanmar_Extended_A}) (32)
4195 X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block=
4196 Myanmar_Extended_B}) (32)
4197 X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
4198 \p{InMyanmarExtA}) (32)
4199 X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short:
4200 \p{InMyanmarExtB}) (32)
4201 \p{Mymr} \p{Myanmar} (= \p{Script_Extensions=
4202 Myanmar}) (NOT \p{Block=Myanmar}) (224)
4203 \p{N} \pN \p{Number} (= \p{General_Category=Number})
4204 (1791)
4205 \p{Na=*} \p{Name=*}
4206 \p{Nabataean} \p{Script_Extensions=Nabataean} (Short:
4207 \p{Nbat}; NOT \p{Block=Nabataean}) (40)
4208 \p{Name=*} Combination of Name and Name_Alias
4209 properties; has special loose matching
4210 rules, for which see Unicode UAX #44
4211 \p{Nand} \p{Nandinagari} (= \p{Script_Extensions=
4212 Nandinagari}) (NOT \p{Block=
4213 Nandinagari}) (86)
4214 \p{Nandinagari} \p{Script_Extensions=Nandinagari} (Short:
4215 \p{Nand}; NOT \p{Block=Nandinagari}) (86)
4216 \p{Narb} \p{Old_North_Arabian} (=
4217 \p{Script_Extensions=Old_North_Arabian})
4218 (32)
4219 X \p{NB} \p{No_Block} (= \p{Block=No_Block})
4220 (825_600 plus all above-Unicode code
4221 points)
4222 \p{Nbat} \p{Nabataean} (= \p{Script_Extensions=
4223 Nabataean}) (NOT \p{Block=Nabataean})
4224 (40)
4225 \p{NChar} \p{Noncharacter_Code_Point} (=
4226 \p{Noncharacter_Code_Point=Y}) (66)
4227 \p{NChar: *} \p{Noncharacter_Code_Point: *}
4228 \p{Nd} \p{XPosixDigit} (= \p{General_Category=
4229 Decimal_Number}) (660)
4230 \p{New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
4231 \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
4232 \p{Newa} \p{Script_Extensions=Newa} (NOT \p{Block=
4233 Newa}) (97)
4234 \p{NFC_QC: *} \p{NFC_Quick_Check: *}
4235 \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (111)
4236 \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (111:
4237 U+0300..0304, U+0306..030C, U+030F,
4238 U+0311, U+0313..0314, U+031B ...)
4239 \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
4240 \P{NFC_Quick_Check} NOR \P{NFC_QC})
4241 (1120)
4242 \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
4243 \P{NFC_Quick_Check} NOR \P{NFC_QC})
4244 (1120: U+0340..0341, U+0343..0344,
4245 U+0374, U+037E, U+0387, U+0958..095F ...)
4246 \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
4247 \p{NFC_Quick_Check} NOR \p{NFC_QC})
4248 (1_112_881 plus all above-Unicode code
4249 points)
4250 \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
4251 \p{NFC_Quick_Check} NOR \p{NFC_QC})
4252 (1_112_881 plus all above-Unicode code
4253 points: U+0000..02FF, U+0305,
4254 U+030D..030E, U+0310, U+0312,
4255 U+0315..031A ...)
4256 \p{NFD_QC: *} \p{NFD_Quick_Check: *}
4257 \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
4258 \P{NFD_Quick_Check} NOR \P{NFD_QC})
4259 (13_233)
4260 \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
4261 \P{NFD_Quick_Check} NOR \P{NFD_QC})
4262 (13_233: [\xc0-\xc5\xc7-\xcf\xd1-\xd6
4263 \xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-\xf6
4264 \xf9-\xfd\xff], U+0100..010F,
4265 U+0112..0125, U+0128..0130,
4266 U+0134..0137, U+0139..013E ...)
4267 \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
4268 \p{NFD_Quick_Check} NOR \p{NFD_QC})
4269 (1_100_879 plus all above-Unicode code
4270 points)
4271 \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
4272 \p{NFD_Quick_Check} NOR \p{NFD_QC})
4273 (1_100_879 plus all above-Unicode code
4274 points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
4275 \xdf\xe6\xf0\xf7-\xf8\xfe],
4276 U+0110..0111, U+0126..0127,
4277 U+0131..0133, U+0138, U+013F..0142 ...)
4278 \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
4279 \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (111)
4280 \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (111:
4281 U+0300..0304, U+0306..030C, U+030F,
4282 U+0311, U+0313..0314, U+031B ...)
4283 \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
4284 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4285 (4866)
4286 \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
4287 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4288 (4866: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4289 \xba\xbc-\xbe], U+0132..0133,
4290 U+013F..0140, U+0149, U+017F,
4291 U+01C4..01CC ...)
4292 \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
4293 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4294 (1_109_135 plus all above-Unicode code
4295 points)
4296 \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
4297 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4298 (1_109_135 plus all above-Unicode code
4299 points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4300 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
4301 U+0100..0131, U+0134..013E,
4302 U+0141..0148, U+014A..017E, U+0180..01C3
4303 ...)
4304 \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
4305 \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
4306 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4307 (16_967)
4308 \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
4309 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4310 (16_967: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4311 \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
4312 \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
4313 \xf6\xf9-\xfd\xff], U+0100..010F,
4314 U+0112..0125, U+0128..0130,
4315 U+0132..0137, U+0139..0140 ...)
4316 \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
4317 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4318 (1_097_145 plus all above-Unicode code
4319 points)
4320 \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
4321 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4322 (1_097_145 plus all above-Unicode code
4323 points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4324 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xc6\xd0
4325 \xd7-\xd8\xde-\xdf\xe6\xf0\xf7-\xf8
4326 \xfe], U+0110..0111, U+0126..0127,
4327 U+0131, U+0138, U+0141..0142 ...)
4328 \p{Nko} \p{Script_Extensions=Nko} (NOT \p{Block=
4329 NKo}) (67)
4330 \p{Nkoo} \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
4331 \p{Block=NKo}) (67)
4332 \p{Nl} \p{Letter_Number} (= \p{General_Category=
4333 Letter_Number}) (236)
4334 \p{No} \p{Other_Number} (= \p{General_Category=
4335 Other_Number}) (895)
4336 X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
4337 (825_600 plus all above-Unicode code
4338 points)
4339 \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
4340 \p{NChar}) (66)
4341 \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
4342 (1_114_046 plus all above-Unicode code
4343 points: U+0000..FDCF, U+FDF0..FFFD,
4344 U+10000..1FFFD, U+20000..2FFFD,
4345 U+30000..3FFFD, U+40000..4FFFD ...)
4346 \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
4347 (66: U+FDD0..FDEF, U+FFFE..FFFF,
4348 U+1FFFE..1FFFF, U+2FFFE..2FFFF,
4349 U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
4350 \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
4351 (Short: \p{Mn}) (1950)
4352 \p{Nshu} \p{Nushu} (= \p{Script_Extensions=Nushu})
4353 (NOT \p{Block=Nushu}) (397)
4354 \p{Nt: *} \p{Numeric_Type: *}
4355 \p{Number} \p{General_Category=Number} (Short: \p{N})
4356 (1791)
4357 X \p{Number_Forms} \p{Block=Number_Forms} (64)
4358 \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (660)
4359 \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (660: [0-9],
4360 U+0660..0669, U+06F0..06F9,
4361 U+07C0..07C9, U+0966..096F, U+09E6..09EF
4362 ...)
4363 \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
4364 \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
4365 U+1369..1371, U+19DA, U+2070,
4366 U+2074..2079, U+2080..2089 ...)
4367 \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_240 plus all
4368 above-Unicode code points: [\x00-\x20!
4369 \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4370 \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4371 \xbb\xbf-\xff], U+0100..065F,
4372 U+066A..06EF, U+06FA..07BF,
4373 U+07CA..0965, U+0970..09E5 ...)
4374 \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (1084)
4375 \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1084: [\xbc-\xbe],
4376 U+09F4..09F9, U+0B72..0B77,
4377 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
4378 ...)
4379 T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
4380 T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (84: [0], U+0660,
4381 U+06F0, U+07C0, U+0966, U+09E6 ...)
4382 T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
4383 U+11FD4)
4384 T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
4385 T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
4386 T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
4387 T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
4388 T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
4389 T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
4390 T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
4391 T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
4392 T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
4393 U+0D76, U+A833, U+11FC9..11FCA)
4394 T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
4395 T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
4396 U+11FCB)
4397 T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1: U+2151)
4398 T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
4399 U+0D77, U+215B, U+A834, U+11FCC ...)
4400 T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1: U+2150)
4401 T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
4402 T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
4403 U+12461, U+1ED3D)
4404 T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
4405 U+0D78, U+A835, U+11FCE)
4406 T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
4407 U+11FCF)
4408 T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
4409 U+0B72, U+0D73, U+A830, U+10140 ...)
4410 T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
4411 U+10E7D, U+1245A, U+1245D, U+12465)
4412 T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1: U+215C)
4413 T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1: U+2156)
4414 T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
4415 T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
4416 U+0D74, U+0F2A, U+2CFD, U+A831 ...)
4417 T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
4418 T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1: U+2157)
4419 T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1: U+215D)
4420 T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
4421 U+109FD, U+10E7E, U+1245B, U+1245E ...)
4422 T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
4423 U+0B74, U+0D75, U+A832, U+10178 ...)
4424 T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1: U+2158)
4425 T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
4426 U+1245C)
4427 T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1: U+215E)
4428 T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
4429 T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (141: [1\xb9], U+0661,
4430 U+06F1, U+07C1, U+0967, U+09E7 ...)
4431 T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1: U+0F2B)
4432 T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (140: [2\xb2], U+0662,
4433 U+06F2, U+07C2, U+0968, U+09E8 ...)
4434 T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1: U+0F2C)
4435 T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (141: [3\xb3], U+0663,
4436 U+06F3, U+07C3, U+0969, U+09E9 ...)
4437 T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1: U+0F2D)
4438 T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (132: [4], U+0664,
4439 U+06F4, U+07C4, U+096A, U+09EA ...)
4440 T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1: U+0F2E)
4441 T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (130: [5], U+0665,
4442 U+06F5, U+07C5, U+096B, U+09EB ...)
4443 T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
4444 T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (114: [6], U+0666,
4445 U+06F6, U+07C6, U+096C, U+09EC ...)
4446 T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
4447 T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (113: [7], U+0667,
4448 U+06F7, U+07C7, U+096D, U+09ED ...)
4449 T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
4450 T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (109: [8], U+0668,
4451 U+06F8, U+07C8, U+096E, U+09EE ...)
4452 T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
4453 T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (113: [9], U+0669,
4454 U+06F9, U+07C9, U+096F, U+09EF ...)
4455 T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (62: U+0BF0, U+0D70,
4456 U+1372, U+2169, U+2179, U+2469 ...)
4457 T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (8: U+216A, U+217A,
4458 U+246A, U+247E, U+2492, U+24EB ...)
4459 T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (8: U+216B, U+217B,
4460 U+246B, U+247F, U+2493, U+24EC ...)
4461 T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (6: U+246C, U+2480,
4462 U+2494, U+24ED, U+16E8D, U+1D2ED)
4463 T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (6: U+246D, U+2481,
4464 U+2495, U+24EE, U+16E8E, U+1D2EE)
4465 T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (6: U+246E, U+2482,
4466 U+2496, U+24EF, U+16E8F, U+1D2EF)
4467 T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (7: U+09F9, U+246F,
4468 U+2483, U+2497, U+24F0, U+16E90 ...)
4469 T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (7: U+16EE, U+2470,
4470 U+2484, U+2498, U+24F1, U+16E91 ...)
4471 T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (7: U+16EF, U+2471,
4472 U+2485, U+2499, U+24F2, U+16E92 ...)
4473 T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (7: U+16F0, U+2472,
4474 U+2486, U+249A, U+24F3, U+16E93 ...)
4475 T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (36: U+1373, U+2473,
4476 U+2487, U+249B, U+24F4, U+3039 ...)
4477 T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1: U+3251)
4478 T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1: U+3252)
4479 T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1: U+3253)
4480 T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1: U+3254)
4481 T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1: U+3255)
4482 T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1: U+3256)
4483 T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1: U+3257)
4484 T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1: U+3258)
4485 T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1: U+3259)
4486 T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (19: U+1374, U+303A,
4487 U+324A, U+325A, U+5345, U+10112 ...)
4488 T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1: U+325B)
4489 T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1: U+325C)
4490 T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1: U+325D)
4491 T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1: U+325E)
4492 T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1: U+325F)
4493 T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1: U+32B1)
4494 T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1: U+32B2)
4495 T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1: U+32B3)
4496 T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1: U+32B4)
4497 T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (18: U+1375, U+324B,
4498 U+32B5, U+534C, U+10113, U+102ED ...)
4499 T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1: U+32B6)
4500 T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1: U+32B7)
4501 T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1: U+32B8)
4502 T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1: U+32B9)
4503 T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1: U+32BA)
4504 T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1: U+32BB)
4505 T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1: U+32BC)
4506 T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1: U+32BD)
4507 T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1: U+32BE)
4508 T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (29: U+1376, U+216C,
4509 U+217C, U+2186, U+324C, U+32BF ...)
4510 T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (13: U+1377, U+324D,
4511 U+10115, U+102EF, U+109CE, U+10E6E ...)
4512 T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (13: U+1378, U+324E,
4513 U+10116, U+102F0, U+109CF, U+10E6F ...)
4514 T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (12: U+1379, U+324F,
4515 U+10117, U+102F1, U+10E70, U+11062 ...)
4516 T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (12: U+137A, U+10118,
4517 U+102F2, U+10341, U+10E71, U+11063 ...)
4518 T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (35: U+0BF1, U+0D71,
4519 U+137B, U+216D, U+217D, U+4F70 ...)
4520 T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
4521 U+109D3, U+10E73, U+1EC84, U+1ED14)
4522 T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
4523 U+102F5, U+109D4, U+10E74, U+1EC85 ...)
4524 T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
4525 U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
4526 T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (16: U+216E, U+217E,
4527 U+1011D, U+10145, U+1014C, U+10153 ...)
4528 T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
4529 U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
4530 T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
4531 U+109D8, U+10E78, U+1EC89, U+1ED19)
4532 T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (6: U+10120, U+102FA,
4533 U+109D9, U+10E79, U+1EC8A, U+1ED1A)
4534 T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (7: U+10121, U+102FB,
4535 U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
4536 T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
4537 U+216F, U+217F..2180, U+4EDF, U+5343 ...)
4538 T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
4539 U+1EC8D, U+1ED1D, U+1ED3A)
4540 T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
4541 U+1EC8E, U+1ED1E)
4542 T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
4543 U+1EC8F, U+1ED1F)
4544 T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
4545 U+10146, U+1014E, U+10172, U+109DF ...)
4546 T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
4547 U+1EC91, U+1ED21)
4548 T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
4549 U+1EC92, U+1ED22)
4550 T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
4551 U+1EC93, U+1ED23)
4552 T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
4553 U+1EC94, U+1ED24)
4554 T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
4555 U+137C, U+2182, U+4E07, U+842C, U+1012B,
4556 U+10155 ...)
4557 T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
4558 U+1012C, U+109E5, U+1EC96, U+1ED26)
4559 T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
4560 U+1012D, U+109E6, U+1EC97, U+1ED27)
4561 T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
4562 U+1012E, U+109E7, U+1EC98, U+1ED28)
4563 T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
4564 U+2187, U+1012F, U+10147, U+10156,
4565 U+109E8, U+1EC99 ...)
4566 T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
4567 U+10130, U+109E9, U+1EC9A, U+1ED2A)
4568 T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
4569 U+10131, U+109EA, U+1EC9B, U+1ED2B)
4570 T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
4571 U+10132, U+109EB, U+1EC9C, U+1ED2C)
4572 T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
4573 U+10133, U+109EC, U+1EC9D, U+1ED2D)
4574 T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
4575 U+2188, U+109ED, U+1EC9E, U+1ECA0,
4576 U+1ECB4)
4577 T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
4578 U+109EE, U+1EC9F)
4579 T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
4580 U+12432)
4581 T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
4582 U+109EF)
4583 T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
4584 U+109F0)
4585 T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
4586 U+12433)
4587 T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
4588 U+109F1)
4589 T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
4590 U+109F2)
4591 T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
4592 U+109F3)
4593 T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
4594 U+109F4)
4595 T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
4596 U+109F5)
4597 T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
4598 U+16B5E)
4599 T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
4600 (1: U+1ECA1)
4601 T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
4602 (1: U+1ECA2)
4603 T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
4604 (3: U+4EBF, U+5104, U+16B5F)
4605 T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
4606 10000000000}) (1: U+16B60)
4607 T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
4608 1000000000000}) (2: U+5146, U+16B61)
4609 \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_240 plus all
4610 above-Unicode code points: [\x00-\x20!
4611 \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4612 \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4613 \xbb\xbf-\xff], U+0100..065F,
4614 U+066A..06EF, U+06FA..07BF,
4615 U+07CA..0965, U+0970..09E5 ...)
4616 \p{Nushu} \p{Script_Extensions=Nushu} (Short:
4617 \p{Nshu}; NOT \p{Block=Nushu}) (397)
4618 \p{Nv: *} \p{Numeric_Value: *}
4619 \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
4620 Nyiakeng_Puachue_Hmong} (Short:
4621 \p{Hmnp}; NOT \p{Block=
4622 Nyiakeng_Puachue_Hmong}) (71)
4623 X \p{OCR} \p{Optical_Character_Recognition} (=
4624 \p{Block=Optical_Character_Recognition})
4625 (32)
4626 \p{Ogam} \p{Ogham} (= \p{Script_Extensions=Ogham})
4627 (NOT \p{Block=Ogham}) (29)
4628 \p{Ogham} \p{Script_Extensions=Ogham} (Short:
4629 \p{Ogam}; NOT \p{Block=Ogham}) (29)
4630 \p{Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
4631 \p{Olck}) (48)
4632 \p{Olck} \p{Ol_Chiki} (= \p{Script_Extensions=
4633 Ol_Chiki}) (48)
4634 \p{Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
4635 (Short: \p{Hung}; NOT \p{Block=
4636 Old_Hungarian}) (108)
4637 \p{Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
4638 \p{Ital}; NOT \p{Block=Old_Italic}) (39)
4639 \p{Old_North_Arabian} \p{Script_Extensions=Old_North_Arabian}
4640 (Short: \p{Narb}) (32)
4641 \p{Old_Permic} \p{Script_Extensions=Old_Permic} (Short:
4642 \p{Perm}; NOT \p{Block=Old_Permic}) (44)
4643 \p{Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
4644 \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
4645 \p{Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
4646 \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
4647 \p{Old_South_Arabian} \p{Script_Extensions=Old_South_Arabian}
4648 (Short: \p{Sarb}) (32)
4649 \p{Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
4650 \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
4651 \p{Old_Uyghur} \p{Script_Extensions=Old_Uyghur} (Short:
4652 \p{Ougr}; NOT \p{Block=Old_Uyghur}) (28)
4653 \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
4654 (Short: \p{Ps}) (79)
4655 X \p{Optical_Character_Recognition} \p{Block=
4656 Optical_Character_Recognition} (Short:
4657 \p{InOCR}) (32)
4658 \p{Oriya} \p{Script_Extensions=Oriya} (Short:
4659 \p{Orya}; NOT \p{Block=Oriya}) (97)
4660 \p{Orkh} \p{Old_Turkic} (= \p{Script_Extensions=
4661 Old_Turkic}) (NOT \p{Block=Old_Turkic})
4662 (73)
4663 X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
4664 \p{Orya} \p{Oriya} (= \p{Script_Extensions=Oriya})
4665 (NOT \p{Block=Oriya}) (97)
4666 \p{Osage} \p{Script_Extensions=Osage} (Short:
4667 \p{Osge}; NOT \p{Block=Osage}) (72)
4668 \p{Osge} \p{Osage} (= \p{Script_Extensions=Osage})
4669 (NOT \p{Block=Osage}) (72)
4670 \p{Osma} \p{Osmanya} (= \p{Script_Extensions=
4671 Osmanya}) (NOT \p{Block=Osmanya}) (40)
4672 \p{Osmanya} \p{Script_Extensions=Osmanya} (Short:
4673 \p{Osma}; NOT \p{Block=Osmanya}) (40)
4674 \p{Other} \p{General_Category=Other} (Short: \p{C})
4675 (969_578 plus all above-Unicode code
4676 points)
4677 \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
4678 \p{Lo}) (127_333)
4679 \p{Other_Number} \p{General_Category=Other_Number} (Short:
4680 \p{No}) (895)
4681 \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
4682 (Short: \p{Po}) (605)
4683 \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
4684 \p{So}) (6605)
4685 X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
4686 \p{Ougr} \p{Old_Uyghur} (= \p{Script_Extensions=
4687 Old_Uyghur}) (NOT \p{Block=Old_Uyghur})
4688 (28)
4689 \p{P} \pP \p{Punct} (= \p{General_Category=
4690 Punctuation}) (NOT
4691 \p{General_Punctuation}) (819)
4692 \p{Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} (Short:
4693 \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
4694 (127)
4695 \p{Palm} \p{Palmyrene} (= \p{Script_Extensions=
4696 Palmyrene}) (32)
4697 \p{Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
4698 \p{Palm}) (32)
4699 \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
4700 (Short: \p{Zp}) (1)
4701 \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
4702 Y}) (2760)
4703 \p{Pat_Syn: *} \p{Pattern_Syntax: *}
4704 \p{Pat_WS} \p{Pattern_White_Space} (=
4705 \p{Pattern_White_Space=Y}) (11)
4706 \p{Pat_WS: *} \p{Pattern_White_Space: *}
4707 \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
4708 (2760)
4709 \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
4710 (1_111_352 plus all above-Unicode code
4711 points: [\x00-\x200-9A-Z_a-z\x7f-\xa0
4712 \xa8\xaa\xad\xaf\xb2-\xb5\xb7-\xba\xbc-
4713 \xbe\xc0-\xd6\xd8-\xf6\xf8-\xff],
4714 U+0100..200F, U+2028..202F,
4715 U+203F..2040, U+2054, U+205F..218F ...)
4716 \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
4717 [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
4718 \^`\{\|\}~\xa1-\xa7\xa9\xab-\xac\xae
4719 \xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
4720 U+2010..2027, U+2030..203E,
4721 U+2041..2053, U+2055..205E, U+2190..245F
4722 ...)
4723 \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
4724 \p{PatWS}) (11)
4725 \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
4726 (1_114_101 plus all above-Unicode code
4727 points: [^\t\n\cK\f\r\x20\x85],
4728 U+0100..200D, U+2010..2027,
4729 U+202A..infinity)
4730 \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
4731 [\t\n\cK\f\r\x20\x85], U+200E..200F,
4732 U+2028..2029)
4733 \p{Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
4734 \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
4735 \p{Pauc} \p{Pau_Cin_Hau} (= \p{Script_Extensions=
4736 Pau_Cin_Hau}) (NOT \p{Block=
4737 Pau_Cin_Hau}) (57)
4738 \p{Pc} \p{Connector_Punctuation} (=
4739 \p{General_Category=
4740 Connector_Punctuation}) (10)
4741 \p{PCM} \p{Prepended_Concatenation_Mark} (=
4742 \p{Prepended_Concatenation_Mark=Y}) (13)
4743 \p{PCM: *} \p{Prepended_Concatenation_Mark: *}
4744 \p{Pd} \p{Dash_Punctuation} (=
4745 \p{General_Category=Dash_Punctuation})
4746 (26)
4747 \p{Pe} \p{Close_Punctuation} (=
4748 \p{General_Category=Close_Punctuation})
4749 (77)
4750 \p{PerlSpace} \p{PosixSpace} (6)
4751 \p{PerlWord} \p{PosixWord} (63)
4752 \p{Perm} \p{Old_Permic} (= \p{Script_Extensions=
4753 Old_Permic}) (NOT \p{Block=Old_Permic})
4754 (44)
4755 \p{Pf} \p{Final_Punctuation} (=
4756 \p{General_Category=Final_Punctuation})
4757 (10)
4758 \p{Phag} \p{Phags_Pa} (= \p{Script_Extensions=
4759 Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
4760 \p{Phags_Pa} \p{Script_Extensions=Phags_Pa} (Short:
4761 \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
4762 X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
4763 Phaistos_Disc}) (48)
4764 X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
4765 \p{InPhaistos}) (48)
4766 \p{Phli} \p{Inscriptional_Pahlavi} (=
4767 \p{Script_Extensions=
4768 Inscriptional_Pahlavi}) (NOT \p{Block=
4769 Inscriptional_Pahlavi}) (27)
4770 \p{Phlp} \p{Psalter_Pahlavi} (=
4771 \p{Script_Extensions=Psalter_Pahlavi})
4772 (NOT \p{Block=Psalter_Pahlavi}) (30)
4773 \p{Phnx} \p{Phoenician} (= \p{Script_Extensions=
4774 Phoenician}) (NOT \p{Block=Phoenician})
4775 (29)
4776 \p{Phoenician} \p{Script_Extensions=Phoenician} (Short:
4777 \p{Phnx}; NOT \p{Block=Phoenician}) (29)
4778 X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
4779 Phonetic_Extensions}) (128)
4780 X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
4781 \p{Block=
4782 Phonetic_Extensions_Supplement}) (64)
4783 X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
4784 \p{InPhoneticExt}) (128)
4785 X \p{Phonetic_Extensions_Supplement} \p{Block=
4786 Phonetic_Extensions_Supplement} (Short:
4787 \p{InPhoneticExtSup}) (64)
4788 \p{Pi} \p{Initial_Punctuation} (=
4789 \p{General_Category=
4790 Initial_Punctuation}) (12)
4791 X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
4792 \p{Plrd} \p{Miao} (= \p{Script_Extensions=Miao})
4793 (NOT \p{Block=Miao}) (149)
4794 \p{Po} \p{Other_Punctuation} (=
4795 \p{General_Category=Other_Punctuation})
4796 (605)
4797 \p{PosixAlnum} (62: [0-9A-Za-z])
4798 \p{PosixAlpha} (52: [A-Za-z])
4799 \p{PosixBlank} (2: [\t\x20])
4800 \p{PosixCntrl} ASCII control characters (33: ACK, BEL,
4801 BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
4802 DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
4803 FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
4804 SOH, STX, SUB, SYN, US, VT)
4805 \p{PosixDigit} (10: [0-9])
4806 \p{PosixGraph} (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
4807 Z\[\\\]\^_`a-z\{\|\}~])
4808 \p{PosixLower} (/i= PosixAlpha) (26: [a-z])
4809 \p{PosixPrint} (95: [\x20-\x7e])
4810 \p{PosixPunct} (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
4811 \[\\\]\^_`\{\|\}~])
4812 \p{PosixSpace} (Short: \p{PerlSpace}) (6: [\t\n\cK\f\r
4813 \x20])
4814 \p{PosixUpper} (/i= PosixAlpha) (26: [A-Z])
4815 \p{PosixWord} \w, restricted to ASCII (Short:
4816 \p{PerlWord}) (63: [0-9A-Z_a-z])
4817 \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
4818 (22)
4819 \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
4820 Y} (Short: \p{PCM}) (13)
4821 \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
4822 (1_114_099 plus all above-Unicode code
4823 points: U+0000..05FF, U+0606..06DC,
4824 U+06DE..070E, U+0710..088F,
4825 U+0892..08E1, U+08E3..110BC ...)
4826 \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
4827 (13: U+0600..0605, U+06DD, U+070F,
4828 U+0890..0891, U+08E2, U+110BD ...)
4829 T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
4830 extension) (33_979)
4831 \p{Present_In: V1_1} \p{Present_In=1.1} (= \p{Age=V1_1}) (Perl
4832 extension) (33_979)
4833 T \p{Present_In: 2.0} Code point's usage introduced in version
4834 2.0 or earlier (Short: \p{In=2.0}) (Perl
4835 extension) (178_500: U+0000..01F5,
4836 U+01FA..0217, U+0250..02A8,
4837 U+02B0..02DE, U+02E0..02E9, U+0300..0345
4838 ...)
4839 \p{Present_In: V2_0} \p{Present_In=2.0} (Perl extension)
4840 (178_500)
4841 T \p{Present_In: 2.1} Code point's usage introduced in version
4842 2.1 or earlier (Short: \p{In=2.1}) (Perl
4843 extension) (178_502: U+0000..01F5,
4844 U+01FA..0217, U+0250..02A8,
4845 U+02B0..02DE, U+02E0..02E9, U+0300..0345
4846 ...)
4847 \p{Present_In: V2_1} \p{Present_In=2.1} (Perl extension)
4848 (178_502)
4849 T \p{Present_In: 3.0} Code point's usage introduced in version
4850 3.0 or earlier (Short: \p{In=3.0}) (Perl
4851 extension) (188_809: U+0000..021F,
4852 U+0222..0233, U+0250..02AD,
4853 U+02B0..02EE, U+0300..034E, U+0360..0362
4854 ...)
4855 \p{Present_In: V3_0} \p{Present_In=3.0} (Perl extension)
4856 (188_809)
4857 T \p{Present_In: 3.1} Code point's usage introduced in version
4858 3.1 or earlier (Short: \p{In=3.1}) (Perl
4859 extension) (233_787: U+0000..021F,
4860 U+0222..0233, U+0250..02AD,
4861 U+02B0..02EE, U+0300..034E, U+0360..0362
4862 ...)
4863 \p{Present_In: V3_1} \p{Present_In=3.1} (Perl extension)
4864 (233_787)
4865 T \p{Present_In: 3.2} Code point's usage introduced in version
4866 3.2 or earlier (Short: \p{In=3.2}) (Perl
4867 extension) (234_803: U+0000..0220,
4868 U+0222..0233, U+0250..02AD,
4869 U+02B0..02EE, U+0300..034F, U+0360..036F
4870 ...)
4871 \p{Present_In: V3_2} \p{Present_In=3.2} (Perl extension)
4872 (234_803)
4873 T \p{Present_In: 4.0} Code point's usage introduced in version
4874 4.0 or earlier (Short: \p{In=4.0}) (Perl
4875 extension) (236_029: U+0000..0236,
4876 U+0250..0357, U+035D..036F,
4877 U+0374..0375, U+037A, U+037E ...)
4878 \p{Present_In: V4_0} \p{Present_In=4.0} (Perl extension)
4879 (236_029)
4880 T \p{Present_In: 4.1} Code point's usage introduced in version
4881 4.1 or earlier (Short: \p{In=4.1}) (Perl
4882 extension) (237_302: U+0000..0241,
4883 U+0250..036F, U+0374..0375, U+037A,
4884 U+037E, U+0384..038A ...)
4885 \p{Present_In: V4_1} \p{Present_In=4.1} (Perl extension)
4886 (237_302)
4887 T \p{Present_In: 5.0} Code point's usage introduced in version
4888 5.0 or earlier (Short: \p{In=5.0}) (Perl
4889 extension) (238_671: U+0000..036F,
4890 U+0374..0375, U+037A..037E,
4891 U+0384..038A, U+038C, U+038E..03A1 ...)
4892 \p{Present_In: V5_0} \p{Present_In=5.0} (Perl extension)
4893 (238_671)
4894 T \p{Present_In: 5.1} Code point's usage introduced in version
4895 5.1 or earlier (Short: \p{In=5.1}) (Perl
4896 extension) (240_295: U+0000..0377,
4897 U+037A..037E, U+0384..038A, U+038C,
4898 U+038E..03A1, U+03A3..0523 ...)
4899 \p{Present_In: V5_1} \p{Present_In=5.1} (Perl extension)
4900 (240_295)
4901 T \p{Present_In: 5.2} Code point's usage introduced in version
4902 5.2 or earlier (Short: \p{In=5.2}) (Perl
4903 extension) (246_943: U+0000..0377,
4904 U+037A..037E, U+0384..038A, U+038C,
4905 U+038E..03A1, U+03A3..0525 ...)
4906 \p{Present_In: V5_2} \p{Present_In=5.2} (Perl extension)
4907 (246_943)
4908 T \p{Present_In: 6.0} Code point's usage introduced in version
4909 6.0 or earlier (Short: \p{In=6.0}) (Perl
4910 extension) (249_031: U+0000..0377,
4911 U+037A..037E, U+0384..038A, U+038C,
4912 U+038E..03A1, U+03A3..0527 ...)
4913 \p{Present_In: V6_0} \p{Present_In=6.0} (Perl extension)
4914 (249_031)
4915 T \p{Present_In: 6.1} Code point's usage introduced in version
4916 6.1 or earlier (Short: \p{In=6.1}) (Perl
4917 extension) (249_763: U+0000..0377,
4918 U+037A..037E, U+0384..038A, U+038C,
4919 U+038E..03A1, U+03A3..0527 ...)
4920 \p{Present_In: V6_1} \p{Present_In=6.1} (Perl extension)
4921 (249_763)
4922 T \p{Present_In: 6.2} Code point's usage introduced in version
4923 6.2 or earlier (Short: \p{In=6.2}) (Perl
4924 extension) (249_764: U+0000..0377,
4925 U+037A..037E, U+0384..038A, U+038C,
4926 U+038E..03A1, U+03A3..0527 ...)
4927 \p{Present_In: V6_2} \p{Present_In=6.2} (Perl extension)
4928 (249_764)
4929 T \p{Present_In: 6.3} Code point's usage introduced in version
4930 6.3 or earlier (Short: \p{In=6.3}) (Perl
4931 extension) (249_769: U+0000..0377,
4932 U+037A..037E, U+0384..038A, U+038C,
4933 U+038E..03A1, U+03A3..0527 ...)
4934 \p{Present_In: V6_3} \p{Present_In=6.3} (Perl extension)
4935 (249_769)
4936 T \p{Present_In: 7.0} Code point's usage introduced in version
4937 7.0 or earlier (Short: \p{In=7.0}) (Perl
4938 extension) (252_603: U+0000..0377,
4939 U+037A..037F, U+0384..038A, U+038C,
4940 U+038E..03A1, U+03A3..052F ...)
4941 \p{Present_In: V7_0} \p{Present_In=7.0} (Perl extension)
4942 (252_603)
4943 T \p{Present_In: 8.0} Code point's usage introduced in version
4944 8.0 or earlier (Short: \p{In=8.0}) (Perl
4945 extension) (260_319: U+0000..0377,
4946 U+037A..037F, U+0384..038A, U+038C,
4947 U+038E..03A1, U+03A3..052F ...)
4948 \p{Present_In: V8_0} \p{Present_In=8.0} (Perl extension)
4949 (260_319)
4950 T \p{Present_In: 9.0} Code point's usage introduced in version
4951 9.0 or earlier (Short: \p{In=9.0}) (Perl
4952 extension) (267_819: U+0000..0377,
4953 U+037A..037F, U+0384..038A, U+038C,
4954 U+038E..03A1, U+03A3..052F ...)
4955 \p{Present_In: V9_0} \p{Present_In=9.0} (Perl extension)
4956 (267_819)
4957 T \p{Present_In: 10.0} Code point's usage introduced in version
4958 10.0 or earlier (Short: \p{In=10.0})
4959 (Perl extension) (276_337: U+0000..0377,
4960 U+037A..037F, U+0384..038A, U+038C,
4961 U+038E..03A1, U+03A3..052F ...)
4962 \p{Present_In: V10_0} \p{Present_In=10.0} (Perl extension)
4963 (276_337)
4964 T \p{Present_In: 11.0} Code point's usage introduced in version
4965 11.0 or earlier (Short: \p{In=11.0})
4966 (Perl extension) (277_021: U+0000..0377,
4967 U+037A..037F, U+0384..038A, U+038C,
4968 U+038E..03A1, U+03A3..052F ...)
4969 \p{Present_In: V11_0} \p{Present_In=11.0} (Perl extension)
4970 (277_021)
4971 T \p{Present_In: 12.0} Code point's usage introduced in version
4972 12.0 or earlier (Short: \p{In=12.0})
4973 (Perl extension) (277_575: U+0000..0377,
4974 U+037A..037F, U+0384..038A, U+038C,
4975 U+038E..03A1, U+03A3..052F ...)
4976 \p{Present_In: V12_0} \p{Present_In=12.0} (Perl extension)
4977 (277_575)
4978 T \p{Present_In: 12.1} Code point's usage introduced in version
4979 12.1 or earlier (Short: \p{In=12.1})
4980 (Perl extension) (277_576: U+0000..0377,
4981 U+037A..037F, U+0384..038A, U+038C,
4982 U+038E..03A1, U+03A3..052F ...)
4983 \p{Present_In: V12_1} \p{Present_In=12.1} (Perl extension)
4984 (277_576)
4985 T \p{Present_In: 13.0} Code point's usage introduced in version
4986 13.0 or earlier (Short: \p{In=13.0})
4987 (Perl extension) (283_506: U+0000..0377,
4988 U+037A..037F, U+0384..038A, U+038C,
4989 U+038E..03A1, U+03A3..052F ...)
4990 \p{Present_In: V13_0} \p{Present_In=13.0} (Perl extension)
4991 (283_506)
4992 T \p{Present_In: 14.0} Code point's usage introduced in version
4993 14.0 or earlier (Short: \p{In=14.0})
4994 (Perl extension) (284_344: U+0000..0377,
4995 U+037A..037F, U+0384..038A, U+038C,
4996 U+038E..03A1, U+03A3..052F ...)
4997 \p{Present_In: V14_0} \p{Present_In=14.0} (Perl extension)
4998 (284_344)
4999 \p{Present_In: NA} \p{Present_In=Unassigned} (= \p{Age=
5000 Unassigned}) (Perl extension) (829_768
5001 plus all above-Unicode code points)
5002 \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=NA})
5003 (Perl extension) (829_768 plus all
5004 above-Unicode code points)
5005 \p{Print} \p{XPosixPrint} (282_163)
5006 \p{Private_Use} \p{General_Category=Private_Use} (Short:
5007 \p{Co}; NOT \p{Private_Use_Area})
5008 (137_468)
5009 X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
5010 \p{InPUA}) (6400)
5011 \p{Prti} \p{Inscriptional_Parthian} (=
5012 \p{Script_Extensions=
5013 Inscriptional_Parthian}) (NOT \p{Block=
5014 Inscriptional_Parthian}) (30)
5015 \p{Ps} \p{Open_Punctuation} (=
5016 \p{General_Category=Open_Punctuation})
5017 (79)
5018 \p{Psalter_Pahlavi} \p{Script_Extensions=Psalter_Pahlavi}
5019 (Short: \p{Phlp}; NOT \p{Block=
5020 Psalter_Pahlavi}) (30)
5021 X \p{PUA} \p{Private_Use_Area} (= \p{Block=
5022 Private_Use_Area}) (6400)
5023 \p{Punct} \p{General_Category=Punctuation} (Short:
5024 \p{P}; NOT \p{General_Punctuation}) (819)
5025 \p{Punctuation} \p{Punct} (= \p{General_Category=
5026 Punctuation}) (NOT
5027 \p{General_Punctuation}) (819)
5028 \p{Qaac} \p{Coptic} (= \p{Script_Extensions=
5029 Coptic}) (NOT \p{Block=Coptic}) (165)
5030 \p{Qaai} \p{Inherited} (= \p{Script_Extensions=
5031 Inherited}) (586)
5032 \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
5033 Y}) (30)
5034 \p{QMark: *} \p{Quotation_Mark: *}
5035 \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
5036 (30)
5037 \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_082
5038 plus all above-Unicode code points:
5039 [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=>?
5040 \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xaa\xac-
5041 \xba\xbc-\xff], U+0100..2017,
5042 U+2020..2038, U+203B..2E41,
5043 U+2E43..300B, U+3010..301C ...)
5044 \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30: [\"
5045 \'\xab\xbb], U+2018..201F, U+2039..203A,
5046 U+2E42, U+300C..300F, U+301D..301F ...)
5047 \p{Radical} \p{Radical=Y} (329)
5048 \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all
5049 above-Unicode code points: U+0000..2E7F,
5050 U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
5051 \p{Radical: Y*} (Single: \p{Radical}) (329: U+2E80..2E99,
5052 U+2E9B..2EF3, U+2F00..2FD5)
5053 \p{Regional_Indicator} \p{Regional_Indicator=Y} (Short: \p{RI})
5054 (26)
5055 \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
5056 plus all above-Unicode code points:
5057 U+0000..1F1E5, U+1F200..infinity)
5058 \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
5059 U+1F1E6..1F1FF)
5060 \p{Rejang} \p{Script_Extensions=Rejang} (Short:
5061 \p{Rjng}; NOT \p{Block=Rejang}) (37)
5062 \p{RI} \p{Regional_Indicator} (=
5063 \p{Regional_Indicator=Y}) (26)
5064 \p{RI: *} \p{Regional_Indicator: *}
5065 \p{Rjng} \p{Rejang} (= \p{Script_Extensions=
5066 Rejang}) (NOT \p{Block=Rejang}) (37)
5067 \p{Rohg} \p{Hanifi_Rohingya} (=
5068 \p{Script_Extensions=Hanifi_Rohingya})
5069 (NOT \p{Block=Hanifi_Rohingya}) (55)
5070 X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
5071 Rumi_Numeral_Symbols}) (32)
5072 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
5073 \p{InRumi}) (32)
5074 \p{Runic} \p{Script_Extensions=Runic} (Short:
5075 \p{Runr}; NOT \p{Block=Runic}) (86)
5076 \p{Runr} \p{Runic} (= \p{Script_Extensions=Runic})
5077 (NOT \p{Block=Runic}) (86)
5078 \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol})
5079 (7741)
5080 \p{Samaritan} \p{Script_Extensions=Samaritan} (Short:
5081 \p{Samr}; NOT \p{Block=Samaritan}) (61)
5082 \p{Samr} \p{Samaritan} (= \p{Script_Extensions=
5083 Samaritan}) (NOT \p{Block=Samaritan})
5084 (61)
5085 \p{Sarb} \p{Old_South_Arabian} (=
5086 \p{Script_Extensions=Old_South_Arabian})
5087 (32)
5088 \p{Saur} \p{Saurashtra} (= \p{Script_Extensions=
5089 Saurashtra}) (NOT \p{Block=Saurashtra})
5090 (82)
5091 \p{Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
5092 \p{Saur}; NOT \p{Block=Saurashtra}) (82)
5093 \p{SB: *} \p{Sentence_Break: *}
5094 \p{Sc} \p{Currency_Symbol} (=
5095 \p{General_Category=Currency_Symbol})
5096 (63)
5097 \p{Sc: *} \p{Script: *}
5098 \p{Script: Adlam} (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B,
5099 U+1E950..1E959, U+1E95E..1E95F)
5100 \p{Script: Adlm} \p{Script=Adlam} (88)
5101 \p{Script: Aghb} \p{Script=Caucasian_Albanian} (=
5102 \p{Script_Extensions=
5103 Caucasian_Albanian}) (53)
5104 \p{Script: Ahom} \p{Script_Extensions=Ahom} (Short: \p{Sc=
5105 Ahom}, \p{Ahom}) (65)
5106 \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
5107 Anatolian_Hieroglyphs} (Short: \p{Sc=
5108 Hluw}, \p{Hluw}) (583)
5109 \p{Script: Arab} \p{Script=Arabic} (1365)
5110 \p{Script: Arabic} (Short: \p{Sc=Arab}) (1365: U+0600..0604,
5111 U+0606..060B, U+060D..061A,
5112 U+061C..061E, U+0620..063F, U+0641..064A
5113 ...)
5114 \p{Script: Armenian} \p{Script_Extensions=Armenian} (Short:
5115 \p{Sc=Armn}, \p{Armn}) (96)
5116 \p{Script: Armi} \p{Script=Imperial_Aramaic} (=
5117 \p{Script_Extensions=Imperial_Aramaic})
5118 (31)
5119 \p{Script: Armn} \p{Script=Armenian} (=
5120 \p{Script_Extensions=Armenian}) (96)
5121 \p{Script: Avestan} \p{Script_Extensions=Avestan} (Short:
5122 \p{Sc=Avst}, \p{Avst}) (61)
5123 \p{Script: Avst} \p{Script=Avestan} (=
5124 \p{Script_Extensions=Avestan}) (61)
5125 \p{Script: Bali} \p{Script=Balinese} (=
5126 \p{Script_Extensions=Balinese}) (124)
5127 \p{Script: Balinese} \p{Script_Extensions=Balinese} (Short:
5128 \p{Sc=Bali}, \p{Bali}) (124)
5129 \p{Script: Bamu} \p{Script=Bamum} (= \p{Script_Extensions=
5130 Bamum}) (657)
5131 \p{Script: Bamum} \p{Script_Extensions=Bamum} (Short: \p{Sc=
5132 Bamu}, \p{Bamu}) (657)
5133 \p{Script: Bass} \p{Script=Bassa_Vah} (=
5134 \p{Script_Extensions=Bassa_Vah}) (36)
5135 \p{Script: Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
5136 \p{Sc=Bass}, \p{Bass}) (36)
5137 \p{Script: Batak} \p{Script_Extensions=Batak} (Short: \p{Sc=
5138 Batk}, \p{Batk}) (56)
5139 \p{Script: Batk} \p{Script=Batak} (= \p{Script_Extensions=
5140 Batak}) (56)
5141 \p{Script: Beng} \p{Script=Bengali} (96)
5142 \p{Script: Bengali} (Short: \p{Sc=Beng}) (96: U+0980..0983,
5143 U+0985..098C, U+098F..0990,
5144 U+0993..09A8, U+09AA..09B0, U+09B2 ...)
5145 \p{Script: Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
5146 \p{Sc=Bhks}, \p{Bhks}) (97)
5147 \p{Script: Bhks} \p{Script=Bhaiksuki} (=
5148 \p{Script_Extensions=Bhaiksuki}) (97)
5149 \p{Script: Bopo} \p{Script=Bopomofo} (77)
5150 \p{Script: Bopomofo} (Short: \p{Sc=Bopo}) (77: U+02EA..02EB,
5151 U+3105..312F, U+31A0..31BF)
5152 \p{Script: Brah} \p{Script=Brahmi} (= \p{Script_Extensions=
5153 Brahmi}) (115)
5154 \p{Script: Brahmi} \p{Script_Extensions=Brahmi} (Short:
5155 \p{Sc=Brah}, \p{Brah}) (115)
5156 \p{Script: Brai} \p{Script=Braille} (=
5157 \p{Script_Extensions=Braille}) (256)
5158 \p{Script: Braille} \p{Script_Extensions=Braille} (Short:
5159 \p{Sc=Brai}, \p{Brai}) (256)
5160 \p{Script: Bugi} \p{Script=Buginese} (30)
5161 \p{Script: Buginese} (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
5162 U+1A1E..1A1F)
5163 \p{Script: Buhd} \p{Script=Buhid} (20)
5164 \p{Script: Buhid} (Short: \p{Sc=Buhd}) (20: U+1740..1753)
5165 \p{Script: Cakm} \p{Script=Chakma} (71)
5166 \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
5167 Canadian_Aboriginal} (Short: \p{Sc=
5168 Cans}, \p{Cans}) (726)
5169 \p{Script: Cans} \p{Script=Canadian_Aboriginal} (=
5170 \p{Script_Extensions=
5171 Canadian_Aboriginal}) (726)
5172 \p{Script: Cari} \p{Script=Carian} (= \p{Script_Extensions=
5173 Carian}) (49)
5174 \p{Script: Carian} \p{Script_Extensions=Carian} (Short:
5175 \p{Sc=Cari}, \p{Cari}) (49)
5176 \p{Script: Caucasian_Albanian} \p{Script_Extensions=
5177 Caucasian_Albanian} (Short: \p{Sc=Aghb},
5178 \p{Aghb}) (53)
5179 \p{Script: Chakma} (Short: \p{Sc=Cakm}) (71: U+11100..11134,
5180 U+11136..11147)
5181 \p{Script: Cham} \p{Script_Extensions=Cham} (Short: \p{Sc=
5182 Cham}, \p{Cham}) (83)
5183 \p{Script: Cher} \p{Script=Cherokee} (=
5184 \p{Script_Extensions=Cherokee}) (172)
5185 \p{Script: Cherokee} \p{Script_Extensions=Cherokee} (Short:
5186 \p{Sc=Cher}, \p{Cher}) (172)
5187 \p{Script: Chorasmian} \p{Script_Extensions=Chorasmian} (Short:
5188 \p{Sc=Chrs}, \p{Chrs}) (28)
5189 \p{Script: Chrs} \p{Script=Chorasmian} (=
5190 \p{Script_Extensions=Chorasmian}) (28)
5191 \p{Script: Common} (Short: \p{Sc=Zyyy}) (8252: [\x00-\x20!
5192 \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
5193 \^_`\{\|\}~\x7f-\xa9\xab-\xb9\xbb-\xbf
5194 \xd7\xf7], U+02B9..02DF, U+02E5..02E9,
5195 U+02EC..02FF, U+0374, U+037E ...)
5196 \p{Script: Copt} \p{Script=Coptic} (137)
5197 \p{Script: Coptic} (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
5198 U+2C80..2CF3, U+2CF9..2CFF)
5199 \p{Script: Cpmn} \p{Script=Cypro_Minoan} (99)
5200 \p{Script: Cprt} \p{Script=Cypriot} (55)
5201 \p{Script: Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
5202 \p{Sc=Xsux}, \p{Xsux}) (1234)
5203 \p{Script: Cypriot} (Short: \p{Sc=Cprt}) (55: U+10800..10805,
5204 U+10808, U+1080A..10835, U+10837..10838,
5205 U+1083C, U+1083F)
5206 \p{Script: Cypro_Minoan} (Short: \p{Sc=Cpmn}) (99: U+12F90..12FF2)
5207 \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}) (443: U+0400..0484,
5208 U+0487..052F, U+1C80..1C88, U+1D2B,
5209 U+1D78, U+2DE0..2DFF ...)
5210 \p{Script: Cyrl} \p{Script=Cyrillic} (443)
5211 \p{Script: Deseret} \p{Script_Extensions=Deseret} (Short:
5212 \p{Sc=Dsrt}, \p{Dsrt}) (80)
5213 \p{Script: Deva} \p{Script=Devanagari} (154)
5214 \p{Script: Devanagari} (Short: \p{Sc=Deva}) (154: U+0900..0950,
5215 U+0955..0963, U+0966..097F, U+A8E0..A8FF)
5216 \p{Script: Diak} \p{Script=Dives_Akuru} (=
5217 \p{Script_Extensions=Dives_Akuru}) (72)
5218 \p{Script: Dives_Akuru} \p{Script_Extensions=Dives_Akuru} (Short:
5219 \p{Sc=Diak}, \p{Diak}) (72)
5220 \p{Script: Dogr} \p{Script=Dogra} (60)
5221 \p{Script: Dogra} (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
5222 \p{Script: Dsrt} \p{Script=Deseret} (=
5223 \p{Script_Extensions=Deseret}) (80)
5224 \p{Script: Dupl} \p{Script=Duployan} (143)
5225 \p{Script: Duployan} (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
5226 U+1BC70..1BC7C, U+1BC80..1BC88,
5227 U+1BC90..1BC99, U+1BC9C..1BC9F)
5228 \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (=
5229 \p{Script_Extensions=
5230 Egyptian_Hieroglyphs}) (1080)
5231 \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
5232 Egyptian_Hieroglyphs} (Short: \p{Sc=
5233 Egyp}, \p{Egyp}) (1080)
5234 \p{Script: Elba} \p{Script=Elbasan} (=
5235 \p{Script_Extensions=Elbasan}) (40)
5236 \p{Script: Elbasan} \p{Script_Extensions=Elbasan} (Short:
5237 \p{Sc=Elba}, \p{Elba}) (40)
5238 \p{Script: Elym} \p{Script=Elymaic} (=
5239 \p{Script_Extensions=Elymaic}) (23)
5240 \p{Script: Elymaic} \p{Script_Extensions=Elymaic} (Short:
5241 \p{Sc=Elym}, \p{Elym}) (23)
5242 \p{Script: Ethi} \p{Script=Ethiopic} (=
5243 \p{Script_Extensions=Ethiopic}) (523)
5244 \p{Script: Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
5245 \p{Sc=Ethi}, \p{Ethi}) (523)
5246 \p{Script: Geor} \p{Script=Georgian} (173)
5247 \p{Script: Georgian} (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
5248 U+10C7, U+10CD, U+10D0..10FA,
5249 U+10FC..10FF, U+1C90..1CBA ...)
5250 \p{Script: Glag} \p{Script=Glagolitic} (134)
5251 \p{Script: Glagolitic} (Short: \p{Sc=Glag}) (134: U+2C00..2C5F,
5252 U+1E000..1E006, U+1E008..1E018,
5253 U+1E01B..1E021, U+1E023..1E024,
5254 U+1E026..1E02A)
5255 \p{Script: Gong} \p{Script=Gunjala_Gondi} (63)
5256 \p{Script: Gonm} \p{Script=Masaram_Gondi} (75)
5257 \p{Script: Goth} \p{Script=Gothic} (= \p{Script_Extensions=
5258 Gothic}) (27)
5259 \p{Script: Gothic} \p{Script_Extensions=Gothic} (Short:
5260 \p{Sc=Goth}, \p{Goth}) (27)
5261 \p{Script: Gran} \p{Script=Grantha} (85)
5262 \p{Script: Grantha} (Short: \p{Sc=Gran}) (85: U+11300..11303,
5263 U+11305..1130C, U+1130F..11310,
5264 U+11313..11328, U+1132A..11330,
5265 U+11332..11333 ...)
5266 \p{Script: Greek} (Short: \p{Sc=Grek}) (518: U+0370..0373,
5267 U+0375..0377, U+037A..037D, U+037F,
5268 U+0384, U+0386 ...)
5269 \p{Script: Grek} \p{Script=Greek} (518)
5270 \p{Script: Gujarati} (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
5271 U+0A85..0A8D, U+0A8F..0A91,
5272 U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
5273 ...)
5274 \p{Script: Gujr} \p{Script=Gujarati} (91)
5275 \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
5276 U+11D60..11D65, U+11D67..11D68,
5277 U+11D6A..11D8E, U+11D90..11D91,
5278 U+11D93..11D98, U+11DA0..11DA9)
5279 \p{Script: Gurmukhi} (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
5280 U+0A05..0A0A, U+0A0F..0A10,
5281 U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
5282 ...)
5283 \p{Script: Guru} \p{Script=Gurmukhi} (80)
5284 \p{Script: Han} (Short: \p{Sc=Han}) (94_215: U+2E80..2E99,
5285 U+2E9B..2EF3, U+2F00..2FD5, U+3005,
5286 U+3007, U+3021..3029 ...)
5287 \p{Script: Hang} \p{Script=Hangul} (11_739)
5288 \p{Script: Hangul} (Short: \p{Sc=Hang}) (11_739:
5289 U+1100..11FF, U+302E..302F,
5290 U+3131..318E, U+3200..321E,
5291 U+3260..327E, U+A960..A97C ...)
5292 \p{Script: Hani} \p{Script=Han} (94_215)
5293 \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
5294 U+10D00..10D27, U+10D30..10D39)
5295 \p{Script: Hano} \p{Script=Hanunoo} (21)
5296 \p{Script: Hanunoo} (Short: \p{Sc=Hano}) (21: U+1720..1734)
5297 \p{Script: Hatr} \p{Script=Hatran} (= \p{Script_Extensions=
5298 Hatran}) (26)
5299 \p{Script: Hatran} \p{Script_Extensions=Hatran} (Short:
5300 \p{Sc=Hatr}, \p{Hatr}) (26)
5301 \p{Script: Hebr} \p{Script=Hebrew} (= \p{Script_Extensions=
5302 Hebrew}) (134)
5303 \p{Script: Hebrew} \p{Script_Extensions=Hebrew} (Short:
5304 \p{Sc=Hebr}, \p{Hebr}) (134)
5305 \p{Script: Hira} \p{Script=Hiragana} (380)
5306 \p{Script: Hiragana} (Short: \p{Sc=Hira}) (380: U+3041..3096,
5307 U+309D..309F, U+1B001..1B11F,
5308 U+1B150..1B152, U+1F200)
5309 \p{Script: Hluw} \p{Script=Anatolian_Hieroglyphs} (=
5310 \p{Script_Extensions=
5311 Anatolian_Hieroglyphs}) (583)
5312 \p{Script: Hmng} \p{Script=Pahawh_Hmong} (=
5313 \p{Script_Extensions=Pahawh_Hmong}) (127)
5314 \p{Script: Hmnp} \p{Script=Nyiakeng_Puachue_Hmong} (=
5315 \p{Script_Extensions=
5316 Nyiakeng_Puachue_Hmong}) (71)
5317 \p{Script: Hung} \p{Script=Old_Hungarian} (=
5318 \p{Script_Extensions=Old_Hungarian})
5319 (108)
5320 \p{Script: Imperial_Aramaic} \p{Script_Extensions=
5321 Imperial_Aramaic} (Short: \p{Sc=Armi},
5322 \p{Armi}) (31)
5323 \p{Script: Inherited} (Short: \p{Sc=Zinh}) (657: U+0300..036F,
5324 U+0485..0486, U+064B..0655, U+0670,
5325 U+0951..0954, U+1AB0..1ACE ...)
5326 \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
5327 Inscriptional_Pahlavi} (Short: \p{Sc=
5328 Phli}, \p{Phli}) (27)
5329 \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
5330 Inscriptional_Parthian} (Short: \p{Sc=
5331 Prti}, \p{Prti}) (30)
5332 \p{Script: Ital} \p{Script=Old_Italic} (=
5333 \p{Script_Extensions=Old_Italic}) (39)
5334 \p{Script: Java} \p{Script=Javanese} (90)
5335 \p{Script: Javanese} (Short: \p{Sc=Java}) (90: U+A980..A9CD,
5336 U+A9D0..A9D9, U+A9DE..A9DF)
5337 \p{Script: Kaithi} (Short: \p{Sc=Kthi}) (68: U+11080..110C2,
5338 U+110CD)
5339 \p{Script: Kali} \p{Script=Kayah_Li} (47)
5340 \p{Script: Kana} \p{Script=Katakana} (320)
5341 \p{Script: Kannada} (Short: \p{Sc=Knda}) (90: U+0C80..0C8C,
5342 U+0C8E..0C90, U+0C92..0CA8,
5343 U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
5344 ...)
5345 \p{Script: Katakana} (Short: \p{Sc=Kana}) (320: U+30A1..30FA,
5346 U+30FD..30FF, U+31F0..31FF,
5347 U+32D0..32FE, U+3300..3357, U+FF66..FF6F
5348 ...)
5349 \p{Script: Kayah_Li} (Short: \p{Sc=Kali}) (47: U+A900..A92D,
5350 U+A92F)
5351 \p{Script: Khar} \p{Script=Kharoshthi} (=
5352 \p{Script_Extensions=Kharoshthi}) (68)
5353 \p{Script: Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
5354 \p{Sc=Khar}, \p{Khar}) (68)
5355 \p{Script: Khitan_Small_Script} \p{Script_Extensions=
5356 Khitan_Small_Script} (Short: \p{Sc=
5357 Kits}, \p{Kits}) (471)
5358 \p{Script: Khmer} \p{Script_Extensions=Khmer} (Short: \p{Sc=
5359 Khmr}, \p{Khmr}) (146)
5360 \p{Script: Khmr} \p{Script=Khmer} (= \p{Script_Extensions=
5361 Khmer}) (146)
5362 \p{Script: Khoj} \p{Script=Khojki} (62)
5363 \p{Script: Khojki} (Short: \p{Sc=Khoj}) (62: U+11200..11211,
5364 U+11213..1123E)
5365 \p{Script: Khudawadi} (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
5366 U+112F0..112F9)
5367 \p{Script: Kits} \p{Script=Khitan_Small_Script} (=
5368 \p{Script_Extensions=
5369 Khitan_Small_Script}) (471)
5370 \p{Script: Knda} \p{Script=Kannada} (90)
5371 \p{Script: Kthi} \p{Script=Kaithi} (68)
5372 \p{Script: Lana} \p{Script=Tai_Tham} (=
5373 \p{Script_Extensions=Tai_Tham}) (127)
5374 \p{Script: Lao} \p{Script_Extensions=Lao} (Short: \p{Sc=
5375 Lao}, \p{Lao}) (82)
5376 \p{Script: Laoo} \p{Script=Lao} (= \p{Script_Extensions=
5377 Lao}) (82)
5378 \p{Script: Latin} (Short: \p{Sc=Latn}) (1475: [A-Za-z\xaa
5379 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
5380 U+0100..02B8, U+02E0..02E4,
5381 U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
5382 ...)
5383 \p{Script: Latn} \p{Script=Latin} (1475)
5384 \p{Script: Lepc} \p{Script=Lepcha} (= \p{Script_Extensions=
5385 Lepcha}) (74)
5386 \p{Script: Lepcha} \p{Script_Extensions=Lepcha} (Short:
5387 \p{Sc=Lepc}, \p{Lepc}) (74)
5388 \p{Script: Limb} \p{Script=Limbu} (68)
5389 \p{Script: Limbu} (Short: \p{Sc=Limb}) (68: U+1900..191E,
5390 U+1920..192B, U+1930..193B, U+1940,
5391 U+1944..194F)
5392 \p{Script: Lina} \p{Script=Linear_A} (341)
5393 \p{Script: Linb} \p{Script=Linear_B} (211)
5394 \p{Script: Linear_A} (Short: \p{Sc=Lina}) (341: U+10600..10736,
5395 U+10740..10755, U+10760..10767)
5396 \p{Script: Linear_B} (Short: \p{Sc=Linb}) (211: U+10000..1000B,
5397 U+1000D..10026, U+10028..1003A,
5398 U+1003C..1003D, U+1003F..1004D,
5399 U+10050..1005D ...)
5400 \p{Script: Lisu} \p{Script_Extensions=Lisu} (Short: \p{Sc=
5401 Lisu}, \p{Lisu}) (49)
5402 \p{Script: Lyci} \p{Script=Lycian} (= \p{Script_Extensions=
5403 Lycian}) (29)
5404 \p{Script: Lycian} \p{Script_Extensions=Lycian} (Short:
5405 \p{Sc=Lyci}, \p{Lyci}) (29)
5406 \p{Script: Lydi} \p{Script=Lydian} (= \p{Script_Extensions=
5407 Lydian}) (27)
5408 \p{Script: Lydian} \p{Script_Extensions=Lydian} (Short:
5409 \p{Sc=Lydi}, \p{Lydi}) (27)
5410 \p{Script: Mahajani} (Short: \p{Sc=Mahj}) (39: U+11150..11176)
5411 \p{Script: Mahj} \p{Script=Mahajani} (39)
5412 \p{Script: Maka} \p{Script=Makasar} (=
5413 \p{Script_Extensions=Makasar}) (25)
5414 \p{Script: Makasar} \p{Script_Extensions=Makasar} (Short:
5415 \p{Sc=Maka}, \p{Maka}) (25)
5416 \p{Script: Malayalam} (Short: \p{Sc=Mlym}) (118: U+0D00..0D0C,
5417 U+0D0E..0D10, U+0D12..0D44,
5418 U+0D46..0D48, U+0D4A..0D4F, U+0D54..0D63
5419 ...)
5420 \p{Script: Mand} \p{Script=Mandaic} (29)
5421 \p{Script: Mandaic} (Short: \p{Sc=Mand}) (29: U+0840..085B,
5422 U+085E)
5423 \p{Script: Mani} \p{Script=Manichaean} (51)
5424 \p{Script: Manichaean} (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
5425 U+10AEB..10AF6)
5426 \p{Script: Marc} \p{Script=Marchen} (=
5427 \p{Script_Extensions=Marchen}) (68)
5428 \p{Script: Marchen} \p{Script_Extensions=Marchen} (Short:
5429 \p{Sc=Marc}, \p{Marc}) (68)
5430 \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
5431 U+11D00..11D06, U+11D08..11D09,
5432 U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
5433 U+11D3F..11D47 ...)
5434 \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
5435 \p{Sc=Medf}, \p{Medf}) (91)
5436 \p{Script: Medf} \p{Script=Medefaidrin} (=
5437 \p{Script_Extensions=Medefaidrin}) (91)
5438 \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
5439 (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
5440 \p{Script: Mend} \p{Script=Mende_Kikakui} (=
5441 \p{Script_Extensions=Mende_Kikakui})
5442 (213)
5443 \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
5444 (Short: \p{Sc=Mend}, \p{Mend}) (213)
5445 \p{Script: Merc} \p{Script=Meroitic_Cursive} (=
5446 \p{Script_Extensions=Meroitic_Cursive})
5447 (90)
5448 \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (=
5449 \p{Script_Extensions=
5450 Meroitic_Hieroglyphs}) (32)
5451 \p{Script: Meroitic_Cursive} \p{Script_Extensions=
5452 Meroitic_Cursive} (Short: \p{Sc=Merc},
5453 \p{Merc}) (90)
5454 \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
5455 Meroitic_Hieroglyphs} (Short: \p{Sc=
5456 Mero}, \p{Mero}) (32)
5457 \p{Script: Miao} \p{Script_Extensions=Miao} (Short: \p{Sc=
5458 Miao}, \p{Miao}) (149)
5459 \p{Script: Mlym} \p{Script=Malayalam} (118)
5460 \p{Script: Modi} (Short: \p{Sc=Modi}) (79: U+11600..11644,
5461 U+11650..11659)
5462 \p{Script: Mong} \p{Script=Mongolian} (168)
5463 \p{Script: Mongolian} (Short: \p{Sc=Mong}) (168: U+1800..1801,
5464 U+1804, U+1806..1819, U+1820..1878,
5465 U+1880..18AA, U+11660..1166C)
5466 \p{Script: Mro} \p{Script_Extensions=Mro} (Short: \p{Sc=
5467 Mro}, \p{Mro}) (43)
5468 \p{Script: Mroo} \p{Script=Mro} (= \p{Script_Extensions=
5469 Mro}) (43)
5470 \p{Script: Mtei} \p{Script=Meetei_Mayek} (=
5471 \p{Script_Extensions=Meetei_Mayek}) (79)
5472 \p{Script: Mult} \p{Script=Multani} (38)
5473 \p{Script: Multani} (Short: \p{Sc=Mult}) (38: U+11280..11286,
5474 U+11288, U+1128A..1128D, U+1128F..1129D,
5475 U+1129F..112A9)
5476 \p{Script: Myanmar} (Short: \p{Sc=Mymr}) (223: U+1000..109F,
5477 U+A9E0..A9FE, U+AA60..AA7F)
5478 \p{Script: Mymr} \p{Script=Myanmar} (223)
5479 \p{Script: Nabataean} \p{Script_Extensions=Nabataean} (Short:
5480 \p{Sc=Nbat}, \p{Nbat}) (40)
5481 \p{Script: Nand} \p{Script=Nandinagari} (65)
5482 \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7,
5483 U+119AA..119D7, U+119DA..119E4)
5484 \p{Script: Narb} \p{Script=Old_North_Arabian} (=
5485 \p{Script_Extensions=Old_North_Arabian})
5486 (32)
5487 \p{Script: Nbat} \p{Script=Nabataean} (=
5488 \p{Script_Extensions=Nabataean}) (40)
5489 \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
5490 \p{Sc=Talu}, \p{Talu}) (83)
5491 \p{Script: Newa} \p{Script_Extensions=Newa} (Short: \p{Sc=
5492 Newa}, \p{Newa}) (97)
5493 \p{Script: Nko} (Short: \p{Sc=Nko}) (62: U+07C0..07FA,
5494 U+07FD..07FF)
5495 \p{Script: Nkoo} \p{Script=Nko} (62)
5496 \p{Script: Nshu} \p{Script=Nushu} (= \p{Script_Extensions=
5497 Nushu}) (397)
5498 \p{Script: Nushu} \p{Script_Extensions=Nushu} (Short: \p{Sc=
5499 Nshu}, \p{Nshu}) (397)
5500 \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
5501 Nyiakeng_Puachue_Hmong} (Short: \p{Sc=
5502 Hmnp}, \p{Hmnp}) (71)
5503 \p{Script: Ogam} \p{Script=Ogham} (= \p{Script_Extensions=
5504 Ogham}) (29)
5505 \p{Script: Ogham} \p{Script_Extensions=Ogham} (Short: \p{Sc=
5506 Ogam}, \p{Ogam}) (29)
5507 \p{Script: Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
5508 \p{Sc=Olck}, \p{Olck}) (48)
5509 \p{Script: Olck} \p{Script=Ol_Chiki} (=
5510 \p{Script_Extensions=Ol_Chiki}) (48)
5511 \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
5512 (Short: \p{Sc=Hung}, \p{Hung}) (108)
5513 \p{Script: Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
5514 \p{Sc=Ital}, \p{Ital}) (39)
5515 \p{Script: Old_North_Arabian} \p{Script_Extensions=
5516 Old_North_Arabian} (Short: \p{Sc=Narb},
5517 \p{Narb}) (32)
5518 \p{Script: Old_Permic} (Short: \p{Sc=Perm}) (43: U+10350..1037A)
5519 \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
5520 \p{Sc=Xpeo}, \p{Xpeo}) (50)
5521 \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
5522 \p{Sc=Sogo}, \p{Sogo}) (40)
5523 \p{Script: Old_South_Arabian} \p{Script_Extensions=
5524 Old_South_Arabian} (Short: \p{Sc=Sarb},
5525 \p{Sarb}) (32)
5526 \p{Script: Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
5527 \p{Sc=Orkh}, \p{Orkh}) (73)
5528 \p{Script: Old_Uyghur} (Short: \p{Sc=Ougr}) (26: U+10F70..10F89)
5529 \p{Script: Oriya} (Short: \p{Sc=Orya}) (91: U+0B01..0B03,
5530 U+0B05..0B0C, U+0B0F..0B10,
5531 U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
5532 ...)
5533 \p{Script: Orkh} \p{Script=Old_Turkic} (=
5534 \p{Script_Extensions=Old_Turkic}) (73)
5535 \p{Script: Orya} \p{Script=Oriya} (91)
5536 \p{Script: Osage} \p{Script_Extensions=Osage} (Short: \p{Sc=
5537 Osge}, \p{Osge}) (72)
5538 \p{Script: Osge} \p{Script=Osage} (= \p{Script_Extensions=
5539 Osage}) (72)
5540 \p{Script: Osma} \p{Script=Osmanya} (=
5541 \p{Script_Extensions=Osmanya}) (40)
5542 \p{Script: Osmanya} \p{Script_Extensions=Osmanya} (Short:
5543 \p{Sc=Osma}, \p{Osma}) (40)
5544 \p{Script: Ougr} \p{Script=Old_Uyghur} (26)
5545 \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
5546 (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
5547 \p{Script: Palm} \p{Script=Palmyrene} (=
5548 \p{Script_Extensions=Palmyrene}) (32)
5549 \p{Script: Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
5550 \p{Sc=Palm}, \p{Palm}) (32)
5551 \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
5552 \p{Sc=Pauc}, \p{Pauc}) (57)
5553 \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (=
5554 \p{Script_Extensions=Pau_Cin_Hau}) (57)
5555 \p{Script: Perm} \p{Script=Old_Permic} (43)
5556 \p{Script: Phag} \p{Script=Phags_Pa} (56)
5557 \p{Script: Phags_Pa} (Short: \p{Sc=Phag}) (56: U+A840..A877)
5558 \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (=
5559 \p{Script_Extensions=
5560 Inscriptional_Pahlavi}) (27)
5561 \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29)
5562 \p{Script: Phnx} \p{Script=Phoenician} (=
5563 \p{Script_Extensions=Phoenician}) (29)
5564 \p{Script: Phoenician} \p{Script_Extensions=Phoenician} (Short:
5565 \p{Sc=Phnx}, \p{Phnx}) (29)
5566 \p{Script: Plrd} \p{Script=Miao} (= \p{Script_Extensions=
5567 Miao}) (149)
5568 \p{Script: Prti} \p{Script=Inscriptional_Parthian} (=
5569 \p{Script_Extensions=
5570 Inscriptional_Parthian}) (30)
5571 \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
5572 U+10B80..10B91, U+10B99..10B9C,
5573 U+10BA9..10BAF)
5574 \p{Script: Qaac} \p{Script=Coptic} (137)
5575 \p{Script: Qaai} \p{Script=Inherited} (657)
5576 \p{Script: Rejang} \p{Script_Extensions=Rejang} (Short:
5577 \p{Sc=Rjng}, \p{Rjng}) (37)
5578 \p{Script: Rjng} \p{Script=Rejang} (= \p{Script_Extensions=
5579 Rejang}) (37)
5580 \p{Script: Rohg} \p{Script=Hanifi_Rohingya} (50)
5581 \p{Script: Runic} \p{Script_Extensions=Runic} (Short: \p{Sc=
5582 Runr}, \p{Runr}) (86)
5583 \p{Script: Runr} \p{Script=Runic} (= \p{Script_Extensions=
5584 Runic}) (86)
5585 \p{Script: Samaritan} \p{Script_Extensions=Samaritan} (Short:
5586 \p{Sc=Samr}, \p{Samr}) (61)
5587 \p{Script: Samr} \p{Script=Samaritan} (=
5588 \p{Script_Extensions=Samaritan}) (61)
5589 \p{Script: Sarb} \p{Script=Old_South_Arabian} (=
5590 \p{Script_Extensions=Old_South_Arabian})
5591 (32)
5592 \p{Script: Saur} \p{Script=Saurashtra} (=
5593 \p{Script_Extensions=Saurashtra}) (82)
5594 \p{Script: Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
5595 \p{Sc=Saur}, \p{Saur}) (82)
5596 \p{Script: Sgnw} \p{Script=SignWriting} (=
5597 \p{Script_Extensions=SignWriting}) (672)
5598 \p{Script: Sharada} (Short: \p{Sc=Shrd}) (96: U+11180..111DF)
5599 \p{Script: Shavian} \p{Script_Extensions=Shavian} (Short:
5600 \p{Sc=Shaw}, \p{Shaw}) (48)
5601 \p{Script: Shaw} \p{Script=Shavian} (=
5602 \p{Script_Extensions=Shavian}) (48)
5603 \p{Script: Shrd} \p{Script=Sharada} (96)
5604 \p{Script: Sidd} \p{Script=Siddham} (=
5605 \p{Script_Extensions=Siddham}) (92)
5606 \p{Script: Siddham} \p{Script_Extensions=Siddham} (Short:
5607 \p{Sc=Sidd}, \p{Sidd}) (92)
5608 \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
5609 \p{Sc=Sgnw}, \p{Sgnw}) (672)
5610 \p{Script: Sind} \p{Script=Khudawadi} (69)
5611 \p{Script: Sinh} \p{Script=Sinhala} (111)
5612 \p{Script: Sinhala} (Short: \p{Sc=Sinh}) (111: U+0D81..0D83,
5613 U+0D85..0D96, U+0D9A..0DB1,
5614 U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
5615 \p{Script: Sogd} \p{Script=Sogdian} (42)
5616 \p{Script: Sogdian} (Short: \p{Sc=Sogd}) (42: U+10F30..10F59)
5617 \p{Script: Sogo} \p{Script=Old_Sogdian} (=
5618 \p{Script_Extensions=Old_Sogdian}) (40)
5619 \p{Script: Sora} \p{Script=Sora_Sompeng} (=
5620 \p{Script_Extensions=Sora_Sompeng}) (35)
5621 \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
5622 (Short: \p{Sc=Sora}, \p{Sora}) (35)
5623 \p{Script: Soyo} \p{Script=Soyombo} (=
5624 \p{Script_Extensions=Soyombo}) (83)
5625 \p{Script: Soyombo} \p{Script_Extensions=Soyombo} (Short:
5626 \p{Sc=Soyo}, \p{Soyo}) (83)
5627 \p{Script: Sund} \p{Script=Sundanese} (=
5628 \p{Script_Extensions=Sundanese}) (72)
5629 \p{Script: Sundanese} \p{Script_Extensions=Sundanese} (Short:
5630 \p{Sc=Sund}, \p{Sund}) (72)
5631 \p{Script: Sylo} \p{Script=Syloti_Nagri} (45)
5632 \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (45: U+A800..A82C)
5633 \p{Script: Syrc} \p{Script=Syriac} (88)
5634 \p{Script: Syriac} (Short: \p{Sc=Syrc}) (88: U+0700..070D,
5635 U+070F..074A, U+074D..074F, U+0860..086A)
5636 \p{Script: Tagalog} (Short: \p{Sc=Tglg}) (23: U+1700..1715,
5637 U+171F)
5638 \p{Script: Tagb} \p{Script=Tagbanwa} (18)
5639 \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}) (18: U+1760..176C,
5640 U+176E..1770, U+1772..1773)
5641 \p{Script: Tai_Le} (Short: \p{Sc=Tale}) (35: U+1950..196D,
5642 U+1970..1974)
5643 \p{Script: Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
5644 \p{Sc=Lana}, \p{Lana}) (127)
5645 \p{Script: Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
5646 \p{Sc=Tavt}, \p{Tavt}) (72)
5647 \p{Script: Takr} \p{Script=Takri} (68)
5648 \p{Script: Takri} (Short: \p{Sc=Takr}) (68: U+11680..116B9,
5649 U+116C0..116C9)
5650 \p{Script: Tale} \p{Script=Tai_Le} (35)
5651 \p{Script: Talu} \p{Script=New_Tai_Lue} (=
5652 \p{Script_Extensions=New_Tai_Lue}) (83)
5653 \p{Script: Tamil} (Short: \p{Sc=Taml}) (123: U+0B82..0B83,
5654 U+0B85..0B8A, U+0B8E..0B90,
5655 U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
5656 \p{Script: Taml} \p{Script=Tamil} (123)
5657 \p{Script: Tang} \p{Script=Tangut} (= \p{Script_Extensions=
5658 Tangut}) (6914)
5659 \p{Script: Tangsa} \p{Script_Extensions=Tangsa} (Short:
5660 \p{Sc=Tnsa}, \p{Tnsa}) (89)
5661 \p{Script: Tangut} \p{Script_Extensions=Tangut} (Short:
5662 \p{Sc=Tang}, \p{Tang}) (6914)
5663 \p{Script: Tavt} \p{Script=Tai_Viet} (=
5664 \p{Script_Extensions=Tai_Viet}) (72)
5665 \p{Script: Telu} \p{Script=Telugu} (100)
5666 \p{Script: Telugu} (Short: \p{Sc=Telu}) (100: U+0C00..0C0C,
5667 U+0C0E..0C10, U+0C12..0C28,
5668 U+0C2A..0C39, U+0C3C..0C44, U+0C46..0C48
5669 ...)
5670 \p{Script: Tfng} \p{Script=Tifinagh} (=
5671 \p{Script_Extensions=Tifinagh}) (59)
5672 \p{Script: Tglg} \p{Script=Tagalog} (23)
5673 \p{Script: Thaa} \p{Script=Thaana} (50)
5674 \p{Script: Thaana} (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
5675 \p{Script: Thai} \p{Script_Extensions=Thai} (Short: \p{Sc=
5676 Thai}, \p{Thai}) (86)
5677 \p{Script: Tibetan} \p{Script_Extensions=Tibetan} (Short:
5678 \p{Sc=Tibt}, \p{Tibt}) (207)
5679 \p{Script: Tibt} \p{Script=Tibetan} (=
5680 \p{Script_Extensions=Tibetan}) (207)
5681 \p{Script: Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
5682 \p{Sc=Tfng}, \p{Tfng}) (59)
5683 \p{Script: Tirh} \p{Script=Tirhuta} (82)
5684 \p{Script: Tirhuta} (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
5685 U+114D0..114D9)
5686 \p{Script: Tnsa} \p{Script=Tangsa} (= \p{Script_Extensions=
5687 Tangsa}) (89)
5688 \p{Script: Toto} \p{Script_Extensions=Toto} (Short: \p{Sc=
5689 Toto}, \p{Toto}) (31)
5690 \p{Script: Ugar} \p{Script=Ugaritic} (=
5691 \p{Script_Extensions=Ugaritic}) (31)
5692 \p{Script: Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
5693 \p{Sc=Ugar}, \p{Ugar}) (31)
5694 \p{Script: Unknown} \p{Script_Extensions=Unknown} (Short:
5695 \p{Sc=Zzzz}, \p{Zzzz}) (969_350 plus all
5696 above-Unicode code points)
5697 \p{Script: Vai} \p{Script_Extensions=Vai} (Short: \p{Sc=
5698 Vai}, \p{Vai}) (300)
5699 \p{Script: Vaii} \p{Script=Vai} (= \p{Script_Extensions=
5700 Vai}) (300)
5701 \p{Script: Vith} \p{Script=Vithkuqi} (=
5702 \p{Script_Extensions=Vithkuqi}) (70)
5703 \p{Script: Vithkuqi} \p{Script_Extensions=Vithkuqi} (Short:
5704 \p{Sc=Vith}, \p{Vith}) (70)
5705 \p{Script: Wancho} \p{Script_Extensions=Wancho} (Short:
5706 \p{Sc=Wcho}, \p{Wcho}) (59)
5707 \p{Script: Wara} \p{Script=Warang_Citi} (=
5708 \p{Script_Extensions=Warang_Citi}) (84)
5709 \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
5710 \p{Sc=Wara}, \p{Wara}) (84)
5711 \p{Script: Wcho} \p{Script=Wancho} (= \p{Script_Extensions=
5712 Wancho}) (59)
5713 \p{Script: Xpeo} \p{Script=Old_Persian} (=
5714 \p{Script_Extensions=Old_Persian}) (50)
5715 \p{Script: Xsux} \p{Script=Cuneiform} (=
5716 \p{Script_Extensions=Cuneiform}) (1234)
5717 \p{Script: Yezi} \p{Script=Yezidi} (47)
5718 \p{Script: Yezidi} (Short: \p{Sc=Yezi}) (47: U+10E80..10EA9,
5719 U+10EAB..10EAD, U+10EB0..10EB1)
5720 \p{Script: Yi} (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
5721 U+A490..A4C6)
5722 \p{Script: Yiii} \p{Script=Yi} (1220)
5723 \p{Script: Zanabazar_Square} \p{Script_Extensions=
5724 Zanabazar_Square} (Short: \p{Sc=Zanb},
5725 \p{Zanb}) (72)
5726 \p{Script: Zanb} \p{Script=Zanabazar_Square} (=
5727 \p{Script_Extensions=Zanabazar_Square})
5728 (72)
5729 \p{Script: Zinh} \p{Script=Inherited} (657)
5730 \p{Script: Zyyy} \p{Script=Common} (8252)
5731 \p{Script: Zzzz} \p{Script=Unknown} (=
5732 \p{Script_Extensions=Unknown}) (969_350
5733 plus all above-Unicode code points)
5734 \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (90:
5735 U+061F, U+0640, U+1E900..1E94B,
5736 U+1E950..1E959, U+1E95E..1E95F)
5737 \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (90)
5738 \p{Script_Extensions: Aghb} \p{Script_Extensions=
5739 Caucasian_Albanian} (53)
5740 \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (65:
5741 U+11700..1171A, U+1171D..1172B,
5742 U+11730..11746)
5743 \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
5744 \p{Hluw}) (583: U+14400..14646)
5745 \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1411)
5746 \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
5747 (1411: U+0600..0604, U+0606..06DC,
5748 U+06DE..06FF, U+0750..077F,
5749 U+0870..088E, U+0890..0891 ...)
5750 \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
5751 (96: U+0531..0556, U+0559..058A,
5752 U+058D..058F, U+FB13..FB17)
5753 \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
5754 (31)
5755 \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
5756 \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
5757 (61: U+10B00..10B35, U+10B39..10B3F)
5758 \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
5759 \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (124)
5760 \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
5761 (124: U+1B00..1B4C, U+1B50..1B7E)
5762 \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
5763 \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
5764 U+A6A0..A6F7, U+16800..16A38)
5765 \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
5766 \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
5767 (36: U+16AD0..16AED, U+16AF0..16AF5)
5768 \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
5769 U+1BC0..1BF3, U+1BFC..1BFF)
5770 \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
5771 \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
5772 \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
5773 (113: U+0951..0952, U+0964..0965,
5774 U+0980..0983, U+0985..098C,
5775 U+098F..0990, U+0993..09A8 ...)
5776 \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
5777 (97: U+11C00..11C08, U+11C0A..11C36,
5778 U+11C38..11C45, U+11C50..11C6C)
5779 \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
5780 \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (117)
5781 \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
5782 (117: U+02EA..02EB, U+3001..3003,
5783 U+3008..3011, U+3013..301F,
5784 U+302A..302D, U+3030 ...)
5785 \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (115)
5786 \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
5787 (115: U+11000..1104D, U+11052..11075,
5788 U+1107F)
5789 \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
5790 \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
5791 (256: U+2800..28FF)
5792 \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
5793 \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
5794 (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
5795 \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
5796 \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
5797 U+1735..1736, U+1740..1753)
5798 \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (91)
5799 \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
5800 \p{Cans}) (726: U+1400..167F,
5801 U+18B0..18F5, U+11AB0..11ABF)
5802 \p{Script_Extensions: Cans} \p{Script_Extensions=
5803 Canadian_Aboriginal} (726)
5804 \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
5805 \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
5806 U+102A0..102D0)
5807 \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
5808 \p{Aghb}) (53: U+10530..10563, U+1056F)
5809 \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (91:
5810 U+09E6..09EF, U+1040..1049,
5811 U+11100..11134, U+11136..11147)
5812 \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
5813 U+AA00..AA36, U+AA40..AA4D,
5814 U+AA50..AA59, U+AA5C..AA5F)
5815 \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
5816 \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
5817 (172: U+13A0..13F5, U+13F8..13FD,
5818 U+AB70..ABBF)
5819 \p{Script_Extensions: Chorasmian} (Short: \p{Scx=Chrs}, \p{Chrs})
5820 (28: U+10FB0..10FCB)
5821 \p{Script_Extensions: Chrs} \p{Script_Extensions=Chorasmian} (28)
5822 \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
5823 (7824: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
5824 \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xa9
5825 \xab-\xb9\xbb-\xbf\xd7\xf7],
5826 U+02B9..02DF, U+02E5..02E9,
5827 U+02EC..02FF, U+0374, U+037E ...)
5828 \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
5829 \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
5830 (165: U+03E2..03EF, U+2C80..2CF3,
5831 U+2CF9..2CFF, U+102E0..102FB)
5832 \p{Script_Extensions: Cpmn} \p{Script_Extensions=Cypro_Minoan}
5833 (101)
5834 \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
5835 \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
5836 (1234: U+12000..12399, U+12400..1246E,
5837 U+12470..12474, U+12480..12543)
5838 \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
5839 (112: U+10100..10102, U+10107..10133,
5840 U+10137..1013F, U+10800..10805, U+10808,
5841 U+1080A..10835 ...)
5842 \p{Script_Extensions: Cypro_Minoan} (Short: \p{Scx=Cpmn},
5843 \p{Cpmn}) (101: U+10100..10101,
5844 U+12F90..12FF2)
5845 \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
5846 (447: U+0400..052F, U+1C80..1C88,
5847 U+1D2B, U+1D78, U+1DF8, U+2DE0..2DFF ...)
5848 \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (447)
5849 \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
5850 (80: U+10400..1044F)
5851 \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
5852 \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
5853 (210: U+0900..0952, U+0955..097F,
5854 U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
5855 U+A830..A839 ...)
5856 \p{Script_Extensions: Diak} \p{Script_Extensions=Dives_Akuru} (72)
5857 \p{Script_Extensions: Dives_Akuru} (Short: \p{Scx=Diak}, \p{Diak})
5858 (72: U+11900..11906, U+11909,
5859 U+1190C..11913, U+11915..11916,
5860 U+11918..11935, U+11937..11938 ...)
5861 \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
5862 \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
5863 U+0964..096F, U+A830..A839,
5864 U+11800..1183B)
5865 \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
5866 \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
5867 \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
5868 (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
5869 U+1BC80..1BC88, U+1BC90..1BC99,
5870 U+1BC9C..1BCA3)
5871 \p{Script_Extensions: Egyp} \p{Script_Extensions=
5872 Egyptian_Hieroglyphs} (1080)
5873 \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
5874 \p{Egyp}) (1080: U+13000..1342E,
5875 U+13430..13438)
5876 \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
5877 \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
5878 (40: U+10500..10527)
5879 \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
5880 \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
5881 (23: U+10FE0..10FF6)
5882 \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (523)
5883 \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
5884 (523: U+1200..1248, U+124A..124D,
5885 U+1250..1256, U+1258, U+125A..125D,
5886 U+1260..1288 ...)
5887 \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (174)
5888 \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
5889 (174: U+10A0..10C5, U+10C7, U+10CD,
5890 U+10D0..10FF, U+1C90..1CBA, U+1CBD..1CBF
5891 ...)
5892 \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (138)
5893 \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
5894 (138: U+0484, U+0487, U+2C00..2C5F,
5895 U+2E43, U+A66F, U+1E000..1E006 ...)
5896 \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
5897 (65)
5898 \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
5899 (77)
5900 \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
5901 \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
5902 U+10330..1034A)
5903 \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
5904 \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
5905 (116: U+0951..0952, U+0964..0965,
5906 U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
5907 U+1CF2..1CF4 ...)
5908 \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
5909 U+0342, U+0345, U+0370..0373,
5910 U+0375..0377, U+037A..037D, U+037F ...)
5911 \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
5912 \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
5913 (105: U+0951..0952, U+0964..0965,
5914 U+0A81..0A83, U+0A85..0A8D,
5915 U+0A8F..0A91, U+0A93..0AA8 ...)
5916 \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
5917 \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
5918 \p{Gong}) (65: U+0964..0965,
5919 U+11D60..11D65, U+11D67..11D68,
5920 U+11D6A..11D8E, U+11D90..11D91,
5921 U+11D93..11D98 ...)
5922 \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
5923 (94: U+0951..0952, U+0964..0965,
5924 U+0A01..0A03, U+0A05..0A0A,
5925 U+0A0F..0A10, U+0A13..0A28 ...)
5926 \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
5927 \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (94_503:
5928 U+2E80..2E99, U+2E9B..2EF3,
5929 U+2F00..2FD5, U+3001..3003,
5930 U+3005..3011, U+3013..301F ...)
5931 \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
5932 \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
5933 (11_775: U+1100..11FF, U+3001..3003,
5934 U+3008..3011, U+3013..301F,
5935 U+302E..3030, U+3037 ...)
5936 \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (94_503)
5937 \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
5938 \p{Rohg}) (55: U+060C, U+061B, U+061F,
5939 U+0640, U+06D4, U+10D00..10D27 ...)
5940 \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
5941 \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
5942 (23: U+1720..1736)
5943 \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
5944 \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
5945 U+108E0..108F2, U+108F4..108F5,
5946 U+108FB..108FF)
5947 \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
5948 \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
5949 (134: U+0591..05C7, U+05D0..05EA,
5950 U+05EF..05F4, U+FB1D..FB36,
5951 U+FB38..FB3C, U+FB3E ...)
5952 \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (432)
5953 \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
5954 (432: U+3001..3003, U+3008..3011,
5955 U+3013..301F, U+3030..3035, U+3037,
5956 U+303C..303D ...)
5957 \p{Script_Extensions: Hluw} \p{Script_Extensions=
5958 Anatolian_Hieroglyphs} (583)
5959 \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
5960 (127)
5961 \p{Script_Extensions: Hmnp} \p{Script_Extensions=
5962 Nyiakeng_Puachue_Hmong} (71)
5963 \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
5964 (108)
5965 \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
5966 \p{Armi}) (31: U+10840..10855,
5967 U+10857..1085F)
5968 \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
5969 (586: U+0300..0341, U+0343..0344,
5970 U+0346..0362, U+0953..0954,
5971 U+1AB0..1ACE, U+1DC2..1DF7 ...)
5972 \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
5973 \p{Phli}) (27: U+10B60..10B72,
5974 U+10B78..10B7F)
5975 \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
5976 Prti}, \p{Prti}) (30: U+10B40..10B55,
5977 U+10B58..10B5F)
5978 \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
5979 \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
5980 \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
5981 (91: U+A980..A9CD, U+A9CF..A9D9,
5982 U+A9DE..A9DF)
5983 \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (88:
5984 U+0966..096F, U+A830..A839,
5985 U+11080..110C2, U+110CD)
5986 \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
5987 \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (372)
5988 \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
5989 (105: U+0951..0952, U+0964..0965,
5990 U+0C80..0C8C, U+0C8E..0C90,
5991 U+0C92..0CA8, U+0CAA..0CB3 ...)
5992 \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
5993 (372: U+3001..3003, U+3008..3011,
5994 U+3013..301F, U+3030..3035, U+3037,
5995 U+303C..303D ...)
5996 \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
5997 (48: U+A900..A92F)
5998 \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
5999 \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
6000 (68: U+10A00..10A03, U+10A05..10A06,
6001 U+10A0C..10A13, U+10A15..10A17,
6002 U+10A19..10A35, U+10A38..10A3A ...)
6003 \p{Script_Extensions: Khitan_Small_Script} (Short: \p{Scx=Kits},
6004 \p{Kits}) (471: U+16FE4, U+18B00..18CD5)
6005 \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
6006 U+1780..17DD, U+17E0..17E9,
6007 U+17F0..17F9, U+19E0..19FF)
6008 \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
6009 \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (82)
6010 \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (82:
6011 U+0AE6..0AEF, U+A830..A839,
6012 U+11200..11211, U+11213..1123E)
6013 \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
6014 (81: U+0964..0965, U+A830..A839,
6015 U+112B0..112EA, U+112F0..112F9)
6016 \p{Script_Extensions: Kits} \p{Script_Extensions=
6017 Khitan_Small_Script} (471)
6018 \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (105)
6019 \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (88)
6020 \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
6021 \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (82:
6022 U+0E81..0E82, U+0E84, U+0E86..0E8A,
6023 U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
6024 \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (82)
6025 \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
6026 (1504: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
6027 \xf6\xf8-\xff], U+0100..02B8,
6028 U+02E0..02E4, U+0363..036F,
6029 U+0485..0486, U+0951..0952 ...)
6030 \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1504)
6031 \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
6032 \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
6033 U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
6034 \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
6035 \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
6036 U+0965, U+1900..191E, U+1920..192B,
6037 U+1930..193B, U+1940, U+1944..194F)
6038 \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
6039 \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
6040 \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
6041 (386: U+10107..10133, U+10600..10736,
6042 U+10740..10755, U+10760..10767)
6043 \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
6044 (268: U+10000..1000B, U+1000D..10026,
6045 U+10028..1003A, U+1003C..1003D,
6046 U+1003F..1004D, U+10050..1005D ...)
6047 \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (49:
6048 U+A4D0..A4FF, U+11FB0)
6049 \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
6050 \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
6051 U+10280..1029C)
6052 \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
6053 \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
6054 U+10920..10939, U+1093F)
6055 \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
6056 (61: U+0964..096F, U+A830..A839,
6057 U+11150..11176)
6058 \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
6059 \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
6060 \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
6061 (25: U+11EE0..11EF8)
6062 \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
6063 (126: U+0951..0952, U+0964..0965,
6064 U+0D00..0D0C, U+0D0E..0D10,
6065 U+0D12..0D44, U+0D46..0D48 ...)
6066 \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
6067 \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
6068 (30: U+0640, U+0840..085B, U+085E)
6069 \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
6070 \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
6071 (52: U+0640, U+10AC0..10AE6,
6072 U+10AEB..10AF6)
6073 \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
6074 \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
6075 (68: U+11C70..11C8F, U+11C92..11CA7,
6076 U+11CA9..11CB6)
6077 \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
6078 \p{Gonm}) (77: U+0964..0965,
6079 U+11D00..11D06, U+11D08..11D09,
6080 U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
6081 ...)
6082 \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
6083 (91: U+16E40..16E9A)
6084 \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
6085 \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
6086 \p{Mtei}) (79: U+AAE0..AAF6,
6087 U+ABC0..ABED, U+ABF0..ABF9)
6088 \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
6089 (213)
6090 \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
6091 \p{Mend}) (213: U+1E800..1E8C4,
6092 U+1E8C7..1E8D6)
6093 \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
6094 (90)
6095 \p{Script_Extensions: Mero} \p{Script_Extensions=
6096 Meroitic_Hieroglyphs} (32)
6097 \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
6098 \p{Merc}) (90: U+109A0..109B7,
6099 U+109BC..109CF, U+109D2..109FF)
6100 \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
6101 \p{Mero}) (32: U+10980..1099F)
6102 \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
6103 U+16F00..16F4A, U+16F4F..16F87,
6104 U+16F8F..16F9F)
6105 \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (126)
6106 \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
6107 U+A830..A839, U+11600..11644,
6108 U+11650..11659)
6109 \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (172)
6110 \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
6111 (172: U+1800..1819, U+1820..1878,
6112 U+1880..18AA, U+202F, U+11660..1166C)
6113 \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
6114 U+16A40..16A5E, U+16A60..16A69,
6115 U+16A6E..16A6F)
6116 \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
6117 \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
6118 \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
6119 \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
6120 (48: U+0A66..0A6F, U+11280..11286,
6121 U+11288, U+1128A..1128D, U+1128F..1129D,
6122 U+1129F..112A9)
6123 \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
6124 (224: U+1000..109F, U+A92E,
6125 U+A9E0..A9FE, U+AA60..AA7F)
6126 \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
6127 \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
6128 (40: U+10880..1089E, U+108A7..108AF)
6129 \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
6130 \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
6131 (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
6132 U+1CF2, U+1CFA, U+A830..A835 ...)
6133 \p{Script_Extensions: Narb} \p{Script_Extensions=
6134 Old_North_Arabian} (32)
6135 \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
6136 \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
6137 (83: U+1980..19AB, U+19B0..19C9,
6138 U+19D0..19DA, U+19DE..19DF)
6139 \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (97:
6140 U+11400..1145B, U+1145D..11461)
6141 \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (67:
6142 U+060C, U+061B, U+061F, U+07C0..07FA,
6143 U+07FD..07FF, U+FD3E..FD3F)
6144 \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (67)
6145 \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
6146 \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
6147 U+16FE1, U+1B170..1B2FB)
6148 \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
6149 Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
6150 U+1E130..1E13D, U+1E140..1E149,
6151 U+1E14E..1E14F)
6152 \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
6153 \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
6154 U+1680..169C)
6155 \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
6156 (48: U+1C50..1C7F)
6157 \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
6158 \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
6159 \p{Hung}) (108: U+10C80..10CB2,
6160 U+10CC0..10CF2, U+10CFA..10CFF)
6161 \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
6162 (39: U+10300..10323, U+1032D..1032F)
6163 \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
6164 \p{Narb}) (32: U+10A80..10A9F)
6165 \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
6166 (44: U+0483, U+10350..1037A)
6167 \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
6168 (50: U+103A0..103C3, U+103C8..103D5)
6169 \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
6170 (40: U+10F00..10F27)
6171 \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
6172 \p{Sarb}) (32: U+10A60..10A7F)
6173 \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
6174 (73: U+10C00..10C48)
6175 \p{Script_Extensions: Old_Uyghur} (Short: \p{Scx=Ougr}, \p{Ougr})
6176 (28: U+0640, U+10AF2, U+10F70..10F89)
6177 \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (97:
6178 U+0951..0952, U+0964..0965,
6179 U+0B01..0B03, U+0B05..0B0C,
6180 U+0B0F..0B10, U+0B13..0B28 ...)
6181 \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
6182 \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (97)
6183 \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
6184 U+104B0..104D3, U+104D8..104FB)
6185 \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
6186 \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
6187 \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
6188 (40: U+10480..1049D, U+104A0..104A9)
6189 \p{Script_Extensions: Ougr} \p{Script_Extensions=Old_Uyghur} (28)
6190 \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
6191 \p{Hmng}) (127: U+16B00..16B45,
6192 U+16B50..16B59, U+16B5B..16B61,
6193 U+16B63..16B77, U+16B7D..16B8F)
6194 \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
6195 \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
6196 (32: U+10860..1087F)
6197 \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
6198 (57: U+11AC0..11AF8)
6199 \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
6200 \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
6201 \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
6202 \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
6203 (59: U+1802..1803, U+1805, U+A840..A877)
6204 \p{Script_Extensions: Phli} \p{Script_Extensions=
6205 Inscriptional_Pahlavi} (27)
6206 \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
6207 (30)
6208 \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
6209 \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
6210 (29: U+10900..1091B, U+1091F)
6211 \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
6212 \p{Script_Extensions: Prti} \p{Script_Extensions=
6213 Inscriptional_Parthian} (30)
6214 \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
6215 \p{Phlp}) (30: U+0640, U+10B80..10B91,
6216 U+10B99..10B9C, U+10BA9..10BAF)
6217 \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
6218 \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (586)
6219 \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
6220 U+A930..A953, U+A95F)
6221 \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
6222 \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
6223 (55)
6224 \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
6225 U+16A0..16EA, U+16EE..16F8)
6226 \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
6227 \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
6228 (61: U+0800..082D, U+0830..083E)
6229 \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
6230 \p{Script_Extensions: Sarb} \p{Script_Extensions=
6231 Old_South_Arabian} (32)
6232 \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
6233 \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
6234 (82: U+A880..A8C5, U+A8CE..A8D9)
6235 \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
6236 \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
6237 (102: U+0951, U+1CD7, U+1CD9,
6238 U+1CDC..1CDD, U+1CE0, U+11180..111DF)
6239 \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
6240 (48: U+10450..1047F)
6241 \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
6242 \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (102)
6243 \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
6244 \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
6245 (92: U+11580..115B5, U+115B8..115DD)
6246 \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
6247 (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
6248 U+1DAA1..1DAAF)
6249 \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
6250 \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (113)
6251 \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
6252 (113: U+0964..0965, U+0D81..0D83,
6253 U+0D85..0D96, U+0D9A..0DB1,
6254 U+0DB3..0DBB, U+0DBD ...)
6255 \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
6256 \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
6257 (43: U+0640, U+10F30..10F59)
6258 \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
6259 \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
6260 \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
6261 \p{Sora}) (35: U+110D0..110E8,
6262 U+110F0..110F9)
6263 \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
6264 \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
6265 (83: U+11A50..11AA2)
6266 \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
6267 \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
6268 (72: U+1B80..1BBF, U+1CC0..1CC7)
6269 \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (57)
6270 \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
6271 \p{Sylo}) (57: U+0964..0965,
6272 U+09E6..09EF, U+A800..A82C)
6273 \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (107)
6274 \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
6275 (107: U+060C, U+061B..061C, U+061F,
6276 U+0640, U+064B..0655, U+0670 ...)
6277 \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
6278 (25: U+1700..1715, U+171F, U+1735..1736)
6279 \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
6280 \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
6281 (20: U+1735..1736, U+1760..176C,
6282 U+176E..1770, U+1772..1773)
6283 \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
6284 U+1040..1049, U+1950..196D, U+1970..1974)
6285 \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
6286 (127: U+1A20..1A5E, U+1A60..1A7C,
6287 U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
6288 \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
6289 (72: U+AA80..AAC2, U+AADB..AADF)
6290 \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (80)
6291 \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (80:
6292 U+0964..0965, U+A830..A839,
6293 U+11680..116B9, U+116C0..116C9)
6294 \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
6295 \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
6296 \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
6297 U+0951..0952, U+0964..0965,
6298 U+0B82..0B83, U+0B85..0B8A,
6299 U+0B8E..0B90, U+0B92..0B95 ...)
6300 \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
6301 \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6914)
6302 \p{Script_Extensions: Tangsa} (Short: \p{Scx=Tnsa}, \p{Tnsa}) (89:
6303 U+16A70..16ABE, U+16AC0..16AC9)
6304 \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
6305 (6914: U+16FE0, U+17000..187F7,
6306 U+18800..18AFF, U+18D00..18D08)
6307 \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
6308 \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (106)
6309 \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
6310 (106: U+0951..0952, U+0964..0965,
6311 U+0C00..0C0C, U+0C0E..0C10,
6312 U+0C12..0C28, U+0C2A..0C39 ...)
6313 \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
6314 \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (25)
6315 \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
6316 \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
6317 U+060C, U+061B..061C, U+061F,
6318 U+0660..0669, U+0780..07B1, U+FDF2 ...)
6319 \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
6320 U+0E01..0E3A, U+0E40..0E5B)
6321 \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
6322 (207: U+0F00..0F47, U+0F49..0F6C,
6323 U+0F71..0F97, U+0F99..0FBC,
6324 U+0FBE..0FCC, U+0FCE..0FD4 ...)
6325 \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
6326 \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
6327 (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
6328 \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
6329 \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
6330 (97: U+0951..0952, U+0964..0965, U+1CF2,
6331 U+A830..A839, U+11480..114C7,
6332 U+114D0..114D9)
6333 \p{Script_Extensions: Tnsa} \p{Script_Extensions=Tangsa} (89)
6334 \p{Script_Extensions: Toto} (Short: \p{Scx=Toto}, \p{Toto}) (31:
6335 U+1E290..1E2AE)
6336 \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
6337 \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
6338 (31: U+10380..1039D, U+1039F)
6339 \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
6340 (969_350 plus all above-Unicode code
6341 points: U+0378..0379, U+0380..0383,
6342 U+038B, U+038D, U+03A2, U+0530 ...)
6343 \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
6344 U+A500..A62B)
6345 \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
6346 \p{Script_Extensions: Vith} \p{Script_Extensions=Vithkuqi} (70)
6347 \p{Script_Extensions: Vithkuqi} (Short: \p{Scx=Vith}, \p{Vith})
6348 (70: U+10570..1057A, U+1057C..1058A,
6349 U+1058C..10592, U+10594..10595,
6350 U+10597..105A1, U+105A3..105B1 ...)
6351 \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59:
6352 U+1E2C0..1E2F9, U+1E2FF)
6353 \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
6354 \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
6355 (84: U+118A0..118F2, U+118FF)
6356 \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59)
6357 \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
6358 \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
6359 \p{Script_Extensions: Yezi} \p{Script_Extensions=Yezidi} (60)
6360 \p{Script_Extensions: Yezidi} (Short: \p{Scx=Yezi}, \p{Yezi}) (60:
6361 U+060C, U+061B, U+061F, U+0660..0669,
6362 U+10E80..10EA9, U+10EAB..10EAD ...)
6363 \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
6364 U+3001..3002, U+3008..3011,
6365 U+3014..301B, U+30FB, U+A000..A48C,
6366 U+A490..A4C6 ...)
6367 \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
6368 \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
6369 \p{Zanb}) (72: U+11A00..11A47)
6370 \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
6371 (72)
6372 \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (586)
6373 \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7824)
6374 \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (969_350
6375 plus all above-Unicode code points)
6376 \p{Scx: *} \p{Script_Extensions: *}
6377 \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (47)
6378 \p{SD: *} \p{Soft_Dotted: *}
6379 \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
6380 \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
6381 U+FE52, U+FF0E)
6382 \p{Sentence_Break: CL} \p{Sentence_Break=Close} (195)
6383 \p{Sentence_Break: Close} (Short: \p{SB=CL}) (195: [\"\'\(\)\[\]
6384 \{\}\xab\xbb], U+0F3A..0F3D,
6385 U+169B..169C, U+2018..201F,
6386 U+2039..203A, U+2045..2046 ...)
6387 \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1: [\r])
6388 \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (2508)
6389 \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2508: U+0300..036F,
6390 U+0483..0489, U+0591..05BD, U+05BF,
6391 U+05C1..05C2, U+05C4..05C5 ...)
6392 \p{Sentence_Break: FO} \p{Sentence_Break=Format} (65)
6393 \p{Sentence_Break: Format} (Short: \p{SB=FO}) (65: [\xad],
6394 U+0600..0605, U+061C, U+06DD, U+070F,
6395 U+0890..0891 ...)
6396 \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (127_761)
6397 \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1: [\n])
6398 \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2424)
6399 \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2424: [a-z\xaa\xb5
6400 \xba\xdf-\xf6\xf8-\xff], U+0101, U+0103,
6401 U+0105, U+0107, U+0109 ...)
6402 \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (662)
6403 \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (662: [0-9],
6404 U+0660..0669, U+066B..066C,
6405 U+06F0..06F9, U+07C0..07C9, U+0966..096F
6406 ...)
6407 \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (127_761: U+01BB,
6408 U+01C0..01C3, U+0294, U+02B9..02BF,
6409 U+02C6..02D1, U+02EC ...)
6410 \p{Sentence_Break: Other} (Short: \p{SB=XX}) (978_357 plus all
6411 above-Unicode code points: [^\t\n\cK\f
6412 \r\x20!\"\'\(\),\-.0-9:?A-Z\[\]a-z\{\}
6413 \x85\xa0\xaa-\xab\xad\xb5\xba-\xbb\xc0-
6414 \xd6\xd8-\xf6\xf8-\xff], U+02C2..02C5,
6415 U+02D2..02DF, U+02E5..02EB, U+02ED,
6416 U+02EF..02FF ...)
6417 \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
6418 \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
6419 U+055D, U+060C..060D, U+07F8, U+1802,
6420 U+1808 ...)
6421 \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
6422 \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
6423 U+2028..2029)
6424 \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
6425 U+1680, U+2000..200A, U+202F, U+205F,
6426 U+3000)
6427 \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (149)
6428 \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (149: [!?], U+0589,
6429 U+061D..061F, U+06D4, U+0700..0702,
6430 U+07F9 ...)
6431 \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1936)
6432 \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1936: [A-Z\xc0-\xd6
6433 \xd8-\xde], U+0100, U+0102, U+0104,
6434 U+0106, U+0108 ...)
6435 \p{Sentence_Break: XX} \p{Sentence_Break=Other} (978_357 plus all
6436 above-Unicode code points)
6437 \p{Sentence_Terminal} \p{Sentence_Terminal=Y} (Short: \p{STerm})
6438 (152)
6439 \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
6440 (1_113_960 plus all above-Unicode code
6441 points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
6442 \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
6443 \xff], U+0100..0588, U+058A..061C,
6444 U+0620..06D3, U+06D5..06FF, U+0703..07F8
6445 ...)
6446 \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (152:
6447 [!.?], U+0589, U+061D..061F, U+06D4,
6448 U+0700..0702, U+07F9 ...)
6449 \p{Separator} \p{General_Category=Separator} (Short:
6450 \p{Z}) (19)
6451 \p{Sgnw} \p{SignWriting} (= \p{Script_Extensions=
6452 SignWriting}) (672)
6453 \p{Sharada} \p{Script_Extensions=Sharada} (Short:
6454 \p{Shrd}; NOT \p{Block=Sharada}) (102)
6455 \p{Shavian} \p{Script_Extensions=Shavian} (Short:
6456 \p{Shaw}) (48)
6457 \p{Shaw} \p{Shavian} (= \p{Script_Extensions=
6458 Shavian}) (48)
6459 X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
6460 (16)
6461 \p{Shrd} \p{Sharada} (= \p{Script_Extensions=
6462 Sharada}) (NOT \p{Block=Sharada}) (102)
6463 \p{Sidd} \p{Siddham} (= \p{Script_Extensions=
6464 Siddham}) (NOT \p{Block=Siddham}) (92)
6465 \p{Siddham} \p{Script_Extensions=Siddham} (Short:
6466 \p{Sidd}; NOT \p{Block=Siddham}) (92)
6467 \p{SignWriting} \p{Script_Extensions=SignWriting} (Short:
6468 \p{Sgnw}) (672)
6469 \p{Sind} \p{Khudawadi} (= \p{Script_Extensions=
6470 Khudawadi}) (NOT \p{Block=Khudawadi})
6471 (81)
6472 \p{Sinh} \p{Sinhala} (= \p{Script_Extensions=
6473 Sinhala}) (NOT \p{Block=Sinhala}) (113)
6474 \p{Sinhala} \p{Script_Extensions=Sinhala} (Short:
6475 \p{Sinh}; NOT \p{Block=Sinhala}) (113)
6476 X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
6477 \p{Sk} \p{Modifier_Symbol} (=
6478 \p{General_Category=Modifier_Symbol})
6479 (125)
6480 \p{Sm} \p{Math_Symbol} (= \p{General_Category=
6481 Math_Symbol}) (948)
6482 X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
6483 \p{InSmallForms}) (32)
6484 X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
6485 Small_Form_Variants}) (32)
6486 X \p{Small_Kana_Ext} \p{Small_Kana_Extension} (= \p{Block=
6487 Small_Kana_Extension}) (64)
6488 X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short:
6489 \p{InSmallKanaExt}) (64)
6490 \p{So} \p{Other_Symbol} (= \p{General_Category=
6491 Other_Symbol}) (6605)
6492 \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (47)
6493 \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_065 plus
6494 all above-Unicode code points: [\x00-
6495 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6496 Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
6497 U+0100..012E, U+0130..0248,
6498 U+024A..0267, U+0269..029C, U+029E..02B1
6499 ...)
6500 \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (47: [i-j],
6501 U+012F, U+0249, U+0268, U+029D, U+02B2
6502 ...)
6503 \p{Sogd} \p{Sogdian} (= \p{Script_Extensions=
6504 Sogdian}) (NOT \p{Block=Sogdian}) (43)
6505 \p{Sogdian} \p{Script_Extensions=Sogdian} (Short:
6506 \p{Sogd}; NOT \p{Block=Sogdian}) (43)
6507 \p{Sogo} \p{Old_Sogdian} (= \p{Script_Extensions=
6508 Old_Sogdian}) (NOT \p{Block=
6509 Old_Sogdian}) (40)
6510 \p{Sora} \p{Sora_Sompeng} (= \p{Script_Extensions=
6511 Sora_Sompeng}) (NOT \p{Block=
6512 Sora_Sompeng}) (35)
6513 \p{Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} (Short:
6514 \p{Sora}; NOT \p{Block=Sora_Sompeng})
6515 (35)
6516 \p{Soyo} \p{Soyombo} (= \p{Script_Extensions=
6517 Soyombo}) (NOT \p{Block=Soyombo}) (83)
6518 \p{Soyombo} \p{Script_Extensions=Soyombo} (Short:
6519 \p{Soyo}; NOT \p{Block=Soyombo}) (83)
6520 \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25)
6521 \p{Space: *} \p{White_Space: *}
6522 \p{Space_Separator} \p{General_Category=Space_Separator}
6523 (Short: \p{Zs}) (17)
6524 \p{SpacePerl} \p{XPosixSpace} (25)
6525 \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
6526 \p{Mc}) (445)
6527 X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
6528 (Short: \p{InModifierLetters}) (80)
6529 X \p{Specials} \p{Block=Specials} (16)
6530 \p{STerm} \p{Sentence_Terminal} (=
6531 \p{Sentence_Terminal=Y}) (152)
6532 \p{STerm: *} \p{Sentence_Terminal: *}
6533 \p{Sund} \p{Sundanese} (= \p{Script_Extensions=
6534 Sundanese}) (NOT \p{Block=Sundanese})
6535 (72)
6536 \p{Sundanese} \p{Script_Extensions=Sundanese} (Short:
6537 \p{Sund}; NOT \p{Block=Sundanese}) (72)
6538 X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
6539 Sundanese_Supplement}) (16)
6540 X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
6541 \p{InSundaneseSup}) (16)
6542 X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
6543 Supplemental_Arrows_A}) (16)
6544 X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
6545 Supplemental_Arrows_B}) (128)
6546 X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block=
6547 Supplemental_Arrows_C}) (256)
6548 X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
6549 \p{Block=
6550 Supplemental_Mathematical_Operators})
6551 (256)
6552 X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
6553 \p{Block=
6554 Supplementary_Private_Use_Area_A})
6555 (65_536)
6556 X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
6557 \p{Block=
6558 Supplementary_Private_Use_Area_B})
6559 (65_536)
6560 X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
6561 Supplemental_Punctuation}) (128)
6562 X \p{Sup_Symbols_And_Pictographs}
6563 \p{Supplemental_Symbols_And_Pictographs}
6564 (= \p{Block=
6565 Supplemental_Symbols_And_Pictographs})
6566 (256)
6567 X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
6568 \p{Block=Superscripts_And_Subscripts})
6569 (48)
6570 X \p{Superscripts_And_Subscripts} \p{Block=
6571 Superscripts_And_Subscripts} (Short:
6572 \p{InSuperAndSub}) (48)
6573 X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
6574 \p{InSupArrowsA}) (16)
6575 X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
6576 \p{InSupArrowsB}) (128)
6577 X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
6578 \p{InSupArrowsC}) (256)
6579 X \p{Supplemental_Mathematical_Operators} \p{Block=
6580 Supplemental_Mathematical_Operators}
6581 (Short: \p{InSupMathOperators}) (256)
6582 X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
6583 (Short: \p{InSupPunctuation}) (128)
6584 X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
6585 Supplemental_Symbols_And_Pictographs}
6586 (Short: \p{InSupSymbolsAndPictographs})
6587 (256)
6588 X \p{Supplementary_Private_Use_Area_A} \p{Block=
6589 Supplementary_Private_Use_Area_A}
6590 (Short: \p{InSupPUAA}) (65_536)
6591 X \p{Supplementary_Private_Use_Area_B} \p{Block=
6592 Supplementary_Private_Use_Area_B}
6593 (Short: \p{InSupPUAB}) (65_536)
6594 \p{Surrogate} \p{General_Category=Surrogate} (Short:
6595 \p{Cs}) (2048)
6596 X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688)
6597 \p{Sylo} \p{Syloti_Nagri} (= \p{Script_Extensions=
6598 Syloti_Nagri}) (NOT \p{Block=
6599 Syloti_Nagri}) (57)
6600 \p{Syloti_Nagri} \p{Script_Extensions=Syloti_Nagri} (Short:
6601 \p{Sylo}; NOT \p{Block=Syloti_Nagri})
6602 (57)
6603 \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
6604 (7741)
6605 X \p{Symbols_And_Pictographs_Ext_A}
6606 \p{Symbols_And_Pictographs_Extended_A}
6607 (= \p{Block=
6608 Symbols_And_Pictographs_Extended_A})
6609 (144)
6610 X \p{Symbols_And_Pictographs_Extended_A} \p{Block=
6611 Symbols_And_Pictographs_Extended_A} (144)
6612 X \p{Symbols_For_Legacy_Computing} \p{Block=
6613 Symbols_For_Legacy_Computing} (256)
6614 \p{Syrc} \p{Syriac} (= \p{Script_Extensions=
6615 Syriac}) (NOT \p{Block=Syriac}) (107)
6616 \p{Syriac} \p{Script_Extensions=Syriac} (Short:
6617 \p{Syrc}; NOT \p{Block=Syriac}) (107)
6618 X \p{Syriac_Sup} \p{Syriac_Supplement} (= \p{Block=
6619 Syriac_Supplement}) (16)
6620 X \p{Syriac_Supplement} \p{Block=Syriac_Supplement} (Short:
6621 \p{InSyriacSup}) (16)
6622 \p{Tagalog} \p{Script_Extensions=Tagalog} (Short:
6623 \p{Tglg}; NOT \p{Block=Tagalog}) (25)
6624 \p{Tagb} \p{Tagbanwa} (= \p{Script_Extensions=
6625 Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
6626 \p{Tagbanwa} \p{Script_Extensions=Tagbanwa} (Short:
6627 \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
6628 X \p{Tags} \p{Block=Tags} (128)
6629 \p{Tai_Le} \p{Script_Extensions=Tai_Le} (Short:
6630 \p{Tale}; NOT \p{Block=Tai_Le}) (45)
6631 \p{Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
6632 \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
6633 \p{Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
6634 \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
6635 X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
6636 Tai_Xuan_Jing_Symbols}) (96)
6637 X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
6638 \p{InTaiXuanJing}) (96)
6639 \p{Takr} \p{Takri} (= \p{Script_Extensions=Takri})
6640 (NOT \p{Block=Takri}) (80)
6641 \p{Takri} \p{Script_Extensions=Takri} (Short:
6642 \p{Takr}; NOT \p{Block=Takri}) (80)
6643 \p{Tale} \p{Tai_Le} (= \p{Script_Extensions=
6644 Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
6645 \p{Talu} \p{New_Tai_Lue} (= \p{Script_Extensions=
6646 New_Tai_Lue}) (NOT \p{Block=
6647 New_Tai_Lue}) (83)
6648 \p{Tamil} \p{Script_Extensions=Tamil} (Short:
6649 \p{Taml}; NOT \p{Block=Tamil}) (133)
6650 X \p{Tamil_Sup} \p{Tamil_Supplement} (= \p{Block=
6651 Tamil_Supplement}) (64)
6652 X \p{Tamil_Supplement} \p{Block=Tamil_Supplement} (Short:
6653 \p{InTamilSup}) (64)
6654 \p{Taml} \p{Tamil} (= \p{Script_Extensions=Tamil})
6655 (NOT \p{Block=Tamil}) (133)
6656 \p{Tang} \p{Tangut} (= \p{Script_Extensions=
6657 Tangut}) (NOT \p{Block=Tangut}) (6914)
6658 \p{Tangsa} \p{Script_Extensions=Tangsa} (Short:
6659 \p{Tnsa}; NOT \p{Block=Tangsa}) (89)
6660 \p{Tangut} \p{Script_Extensions=Tangut} (Short:
6661 \p{Tang}; NOT \p{Block=Tangut}) (6914)
6662 X \p{Tangut_Components} \p{Block=Tangut_Components} (768)
6663 X \p{Tangut_Sup} \p{Tangut_Supplement} (= \p{Block=
6664 Tangut_Supplement}) (128)
6665 X \p{Tangut_Supplement} \p{Block=Tangut_Supplement} (Short:
6666 \p{InTangutSup}) (128)
6667 \p{Tavt} \p{Tai_Viet} (= \p{Script_Extensions=
6668 Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
6669 \p{Telu} \p{Telugu} (= \p{Script_Extensions=
6670 Telugu}) (NOT \p{Block=Telugu}) (106)
6671 \p{Telugu} \p{Script_Extensions=Telugu} (Short:
6672 \p{Telu}; NOT \p{Block=Telugu}) (106)
6673 \p{Term} \p{Terminal_Punctuation} (=
6674 \p{Terminal_Punctuation=Y}) (276)
6675 \p{Term: *} \p{Terminal_Punctuation: *}
6676 \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
6677 \p{Term}) (276)
6678 \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
6679 (1_113_836 plus all above-Unicode code
6680 points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
6681 9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
6682 U+0100..037D, U+037F..0386,
6683 U+0388..0588, U+058A..05C2, U+05C4..060B
6684 ...)
6685 \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (276:
6686 [!,.:;?], U+037E, U+0387, U+0589,
6687 U+05C3, U+060C ...)
6688 \p{Tfng} \p{Tifinagh} (= \p{Script_Extensions=
6689 Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
6690 \p{Tglg} \p{Tagalog} (= \p{Script_Extensions=
6691 Tagalog}) (NOT \p{Block=Tagalog}) (25)
6692 \p{Thaa} \p{Thaana} (= \p{Script_Extensions=
6693 Thaana}) (NOT \p{Block=Thaana}) (66)
6694 \p{Thaana} \p{Script_Extensions=Thaana} (Short:
6695 \p{Thaa}; NOT \p{Block=Thaana}) (66)
6696 \p{Thai} \p{Script_Extensions=Thai} (NOT \p{Block=
6697 Thai}) (86)
6698 \p{Tibetan} \p{Script_Extensions=Tibetan} (Short:
6699 \p{Tibt}; NOT \p{Block=Tibetan}) (207)
6700 \p{Tibt} \p{Tibetan} (= \p{Script_Extensions=
6701 Tibetan}) (NOT \p{Block=Tibetan}) (207)
6702 \p{Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
6703 \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
6704 \p{Tirh} \p{Tirhuta} (= \p{Script_Extensions=
6705 Tirhuta}) (NOT \p{Block=Tirhuta}) (97)
6706 \p{Tirhuta} \p{Script_Extensions=Tirhuta} (Short:
6707 \p{Tirh}; NOT \p{Block=Tirhuta}) (97)
6708 \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
6709 \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
6710 Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
6711 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
6712 \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
6713 (Short: \p{Lt}; /i= General_Category=
6714 Cased_Letter) (31)
6715 \p{Tnsa} \p{Tangsa} (= \p{Script_Extensions=
6716 Tangsa}) (NOT \p{Block=Tangsa}) (89)
6717 \p{Toto} \p{Script_Extensions=Toto} (NOT \p{Block=
6718 Toto}) (31)
6719 X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
6720 Transport_And_Map_Symbols}) (128)
6721 X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
6722 (Short: \p{InTransportAndMap}) (128)
6723 X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
6724 (= \p{Block=
6725 Unified_Canadian_Aboriginal_Syllabics})
6726 (640)
6727 X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
6728 Extended} (= \p{Block=
6729 Unified_Canadian_Aboriginal_Syllabics_-
6730 Extended}) (80)
6731 X \p{UCAS_Ext_A} \p{Unified_Canadian_Aboriginal_Syllabics_-
6732 Extended_A} (= \p{Block=
6733 Unified_Canadian_Aboriginal_Syllabics_-
6734 Extended_A}) (16)
6735 \p{Ugar} \p{Ugaritic} (= \p{Script_Extensions=
6736 Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
6737 \p{Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
6738 \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
6739 \p{UIdeo} \p{Unified_Ideograph} (=
6740 \p{Unified_Ideograph=Y}) (92_865)
6741 \p{UIdeo: *} \p{Unified_Ideograph: *}
6742 \p{Unassigned} \p{General_Category=Unassigned} (Short:
6743 \p{Cn}) (829_834 plus all above-Unicode
6744 code points)
6745 \p{Unicode} \p{Any} (1_114_112)
6746 X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
6747 Unified_Canadian_Aboriginal_Syllabics}
6748 (Short: \p{InUCAS}) (640)
6749 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
6750 Unified_Canadian_Aboriginal_Syllabics_-
6751 Extended} (Short: \p{InUCASExt}) (80)
6752 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended_A} \p{Block=
6753 Unified_Canadian_Aboriginal_Syllabics_-
6754 Extended_A} (Short: \p{InUCASExtA}) (16)
6755 \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
6756 (92_865)
6757 \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
6758 (1_021_247 plus all above-Unicode code
6759 points: U+0000..33FF, U+4DC0..4DFF,
6760 U+A000..FA0D, U+FA10, U+FA12,
6761 U+FA15..FA1E ...)
6762 \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (92_865:
6763 U+3400..4DBF, U+4E00..9FFF,
6764 U+FA0E..FA0F, U+FA11, U+FA13..FA14,
6765 U+FA1F ...)
6766 \p{Unknown} \p{Script_Extensions=Unknown} (Short:
6767 \p{Zzzz}) (969_350 plus all above-
6768 Unicode code points)
6769 \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6770 Cased=Yes) (1951)
6771 \p{Upper: *} \p{Uppercase: *}
6772 \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6773 Cased=Yes) (1951)
6774 \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
6775 No) (1_112_161 plus all above-Unicode
6776 code points: [\x00-\x20!\"#\$\%&\'
6777 \(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-z\{
6778 \|\}~\x7f-\xbf\xd7\xdf-\xff], U+0101,
6779 U+0103, U+0105, U+0107, U+0109 ...)
6780 \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
6781 Yes) (1951: [A-Z\xc0-\xd6\xd8-\xde],
6782 U+0100, U+0102, U+0104, U+0106, U+0108
6783 ...)
6784 \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
6785 (Short: \p{Lu}; /i= General_Category=
6786 Cased_Letter) (1831)
6787 \p{Vai} \p{Script_Extensions=Vai} (NOT \p{Block=
6788 Vai}) (300)
6789 \p{Vaii} \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
6790 \p{Block=Vai}) (300)
6791 \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
6792 NOT \p{Variation_Selectors}) (260)
6793 \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_852
6794 plus all above-Unicode code points:
6795 U+0000..180A, U+180E, U+1810..FDFF,
6796 U+FE10..E00FF, U+E01F0..infinity)
6797 \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (260:
6798 U+180B..180D, U+180F, U+FE00..FE0F,
6799 U+E0100..E01EF)
6800 X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
6801 \p{InVS}) (16)
6802 X \p{Variation_Selectors_Supplement} \p{Block=
6803 Variation_Selectors_Supplement} (Short:
6804 \p{InVSSup}) (240)
6805 X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
6806 Vedic_Extensions}) (48)
6807 X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
6808 \p{InVedicExt}) (48)
6809 X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
6810 \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
6811 (786_641 plus all above-Unicode code
6812 points)
6813 \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (786_641 plus
6814 all above-Unicode code points: [\x00-
6815 \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
6816 \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
6817 U+02EC..10FF, U+1200..1400,
6818 U+1680..18AF, U+1900..2015 ...)
6819 \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
6820 Transformed_Rotated} (47)
6821 \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
6822 (47: U+2329..232A, U+3008..3011,
6823 U+3014..301F, U+3030, U+30A0, U+30FC ...)
6824 \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
6825 (148: U+3001..3002, U+3041, U+3043,
6826 U+3045, U+3047, U+3049 ...)
6827 \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
6828 Transformed_Upright} (148)
6829 \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
6830 (327_276)
6831 \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (327_276:
6832 [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
6833 U+02EA..02EB, U+1100..11FF,
6834 U+1401..167F, U+18B0..18FF, U+2016 ...)
6835 \p{VertSpace} \v (7: [\n\cK\f\r\x85], U+2028..2029)
6836 \p{Vith} \p{Vithkuqi} (= \p{Script_Extensions=
6837 Vithkuqi}) (NOT \p{Block=Vithkuqi}) (70)
6838 \p{Vithkuqi} \p{Script_Extensions=Vithkuqi} (Short:
6839 \p{Vith}; NOT \p{Block=Vithkuqi}) (70)
6840 \p{Vo: *} \p{Vertical_Orientation: *}
6841 \p{VS} \p{Variation_Selector} (=
6842 \p{Variation_Selector=Y}) (NOT
6843 \p{Variation_Selectors}) (260)
6844 \p{VS: *} \p{Variation_Selector: *}
6845 X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
6846 \p{Block=
6847 Variation_Selectors_Supplement}) (240)
6848 \p{Wancho} \p{Script_Extensions=Wancho} (Short:
6849 \p{Wcho}; NOT \p{Block=Wancho}) (59)
6850 \p{Wara} \p{Warang_Citi} (= \p{Script_Extensions=
6851 Warang_Citi}) (NOT \p{Block=
6852 Warang_Citi}) (84)
6853 \p{Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
6854 \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
6855 \p{WB: *} \p{Word_Break: *}
6856 \p{Wcho} \p{Wancho} (= \p{Script_Extensions=
6857 Wancho}) (NOT \p{Block=Wancho}) (59)
6858 \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25)
6859 \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087
6860 plus all above-Unicode code points: [^
6861 \t\n\cK\f\r\x20\x85\xa0], U+0100..167F,
6862 U+1681..1FFF, U+200B..2027,
6863 U+202A..202E, U+2030..205E ...)
6864 \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25: [\t
6865 \n\cK\f\r\x20\x85\xa0], U+1680,
6866 U+2000..200A, U+2028..2029, U+202F,
6867 U+205F ...)
6868 \p{Word} \p{XPosixWord} (135_202)
6869 \p{Word_Break: ALetter} (Short: \p{WB=LE}) (29_336: [A-Za-z\xaa
6870 \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6871 U+0100..02D7, U+02DE..02FF,
6872 U+0370..0374, U+0376..0377, U+037A..037D
6873 ...)
6874 \p{Word_Break: CR} (Short: \p{WB=CR}) (1: [\r])
6875 \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
6876 \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
6877 \p{Word_Break: E_Base} (Short: \p{WB=EB}) (0)
6878 \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0)
6879 \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0)
6880 \p{Word_Break: EB} \p{Word_Break=E_Base} (0)
6881 \p{Word_Break: EBG} \p{Word_Break=E_Base_GAZ} (0)
6882 \p{Word_Break: EM} \p{Word_Break=E_Modifier} (0)
6883 \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (11)
6884 \p{Word_Break: Extend} (Short: \p{WB=Extend}) (2512:
6885 U+0300..036F, U+0483..0489,
6886 U+0591..05BD, U+05BF, U+05C1..05C2,
6887 U+05C4..05C5 ...)
6888 \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
6889 U+203F..2040, U+2054, U+FE33..FE34,
6890 U+FE4D..FE4F ...)
6891 \p{Word_Break: FO} \p{Word_Break=Format} (64)
6892 \p{Word_Break: Format} (Short: \p{WB=FO}) (64: [\xad],
6893 U+0600..0605, U+061C, U+06DD, U+070F,
6894 U+0890..0891 ...)
6895 \p{Word_Break: GAZ} \p{Word_Break=Glue_After_Zwj} (0)
6896 \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0)
6897 \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75:
6898 U+05D0..05EA, U+05EF..05F2, U+FB1D,
6899 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
6900 ...)
6901 \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (75)
6902 \p{Word_Break: KA} \p{Word_Break=Katakana} (330)
6903 \p{Word_Break: Katakana} (Short: \p{WB=KA}) (330: U+3031..3035,
6904 U+309B..309C, U+30A0..30FA,
6905 U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
6906 ...)
6907 \p{Word_Break: LE} \p{Word_Break=ALetter} (29_336)
6908 \p{Word_Break: LF} (Short: \p{WB=LF}) (1: [\n])
6909 \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
6910 \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9: [:\xb7], U+0387,
6911 U+055F, U+05F4, U+2027, U+FE13 ...)
6912 \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15: [,;], U+037E,
6913 U+0589, U+060C..060D, U+066C, U+07F8 ...)
6914 \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
6915 U+2018..2019, U+2024, U+FE52, U+FF07,
6916 U+FF0E)
6917 \p{Word_Break: ML} \p{Word_Break=MidLetter} (9)
6918 \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
6919 \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
6920 U+2028..2029)
6921 \p{Word_Break: NL} \p{Word_Break=Newline} (5)
6922 \p{Word_Break: NU} \p{Word_Break=Numeric} (661)
6923 \p{Word_Break: Numeric} (Short: \p{WB=NU}) (661: [0-9],
6924 U+0660..0669, U+066B, U+06F0..06F9,
6925 U+07C0..07C9, U+0966..096F ...)
6926 \p{Word_Break: Other} (Short: \p{WB=XX}) (1_081_042 plus all
6927 above-Unicode code points: [^\n\cK\f\r
6928 \x20\"\',.0-9:;A-Z_a-z\x85\xaa\xad\xb5
6929 \xb7\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6930 U+02D8..02DD, U+0375, U+0378..0379,
6931 U+0380..0385, U+038B ...)
6932 \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
6933 U+1F1E6..1F1FF)
6934 \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
6935 \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
6936 \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
6937 \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20],
6938 U+1680, U+2000..2006, U+2008..200A,
6939 U+205F, U+3000)
6940 \p{Word_Break: XX} \p{Word_Break=Other} (1_081_042 plus all
6941 above-Unicode code points)
6942 \p{Word_Break: ZWJ} (Short: \p{WB=ZWJ}) (1: U+200D)
6943 \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
6944 \p{WSpace: *} \p{White_Space: *}
6945 \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
6946 \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
6947 (135_053)
6948 \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (979_059
6949 plus all above-Unicode code points:
6950 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
6951 \@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4
6952 \xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
6953 U+02C2..02C5, U+02D2..02DF,
6954 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
6955 \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (135_053:
6956 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
6957 \xd8-\xf6\xf8-\xff], U+0100..02C1,
6958 U+02C6..02D1, U+02E0..02E4, U+02EC,
6959 U+02EE ...)
6960 \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (131_974)
6961 \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (982_138
6962 plus all above-Unicode code points:
6963 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6964 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
6965 \xb6-\xb9\xbb-\xbf\xd7\xf7],
6966 U+02C2..02C5, U+02D2..02DF,
6967 U+02E5..02EB, U+02ED, U+02EF..036F ...)
6968 \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (131_974:
6969 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
6970 \xf8-\xff], U+0100..02C1, U+02C6..02D1,
6971 U+02E0..02E4, U+02EC, U+02EE ...)
6972 \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
6973 (135_053)
6974 \p{XIDC: *} \p{XID_Continue: *}
6975 \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (131_974)
6976 \p{XIDS: *} \p{XID_Start: *}
6977 \p{Xpeo} \p{Old_Persian} (= \p{Script_Extensions=
6978 Old_Persian}) (NOT \p{Block=
6979 Old_Persian}) (50)
6980 \p{XPerlSpace} \p{XPosixSpace} (25)
6981 \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short:
6982 \p{Alnum}) (134_056: [0-9A-Za-z\xaa\xb5
6983 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6984 U+0100..02C1, U+02C6..02D1,
6985 U+02E0..02E4, U+02EC, U+02EE ...)
6986 \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha})
6987 (133_396)
6988 \p{XPosixBlank} \h, Horizontal white space (Short:
6989 \p{Blank}) (18: [\t\x20\xa0], U+1680,
6990 U+2000..200A, U+202F, U+205F, U+3000)
6991 \p{XPosixCntrl} \p{General_Category=Control} Control
6992 characters (Short: \p{Cc}) (65)
6993 \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9]
6994 + all other decimal digits (Short:
6995 \p{Nd}) (660)
6996 \p{XPosixGraph} Characters that are graphical (Short:
6997 \p{Graph}) (282_146: [!\"#\$\%&\'
6998 \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`a-z
6999 \{\|\}~\xa1-\xff], U+0100..0377,
7000 U+037A..037F, U+0384..038A, U+038C,
7001 U+038E..03A1 ...)
7002 \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i=
7003 Cased=Yes) (2471)
7004 \p{XPosixPrint} Characters that are graphical plus space
7005 characters (but no controls) (Short:
7006 \p{Print}) (282_163: [\x20-\x7e\xa0-
7007 \xff], U+0100..0377, U+037A..037F,
7008 U+0384..038A, U+038C, U+038E..03A1 ...)
7009 \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (828:
7010 [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
7011 \^_`\{\|\}~\xa1\xa7\xab\xb6-\xb7\xbb
7012 \xbf], U+037E, U+0387, U+055A..055F,
7013 U+0589..058A, U+05BE ...)
7014 \p{XPosixSpace} \s including beyond ASCII and vertical tab
7015 (Short: \p{SpacePerl}) (25: [\t\n\cK\f
7016 \r\x20\x85\xa0], U+1680, U+2000..200A,
7017 U+2028..2029, U+202F, U+205F ...)
7018 \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i=
7019 Cased=Yes) (1951)
7020 \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} +
7021 \pM + \p{Pc} + \p{Join_Control} (Short:
7022 \p{Word}) (135_202: [0-9A-Z_a-z\xaa\xb5
7023 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7024 U+0100..02C1, U+02C6..02D1,
7025 U+02E0..02E4, U+02EC, U+02EE ...)
7026 \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
7027 \p{Xsux} \p{Cuneiform} (= \p{Script_Extensions=
7028 Cuneiform}) (NOT \p{Block=Cuneiform})
7029 (1234)
7030 \p{Yezi} \p{Yezidi} (= \p{Script_Extensions=
7031 Yezidi}) (NOT \p{Block=Yezidi}) (60)
7032 \p{Yezidi} \p{Script_Extensions=Yezidi} (Short:
7033 \p{Yezi}; NOT \p{Block=Yezidi}) (60)
7034 \p{Yi} \p{Script_Extensions=Yi} (1246)
7035 X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
7036 X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
7037 \p{Yiii} \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
7038 X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
7039 Yijing_Hexagram_Symbols}) (64)
7040 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
7041 (Short: \p{InYijing}) (64)
7042 \p{Z} \pZ \p{Separator} (= \p{General_Category=
7043 Separator}) (19)
7044 \p{Zanabazar_Square} \p{Script_Extensions=Zanabazar_Square}
7045 (Short: \p{Zanb}; NOT \p{Block=
7046 Zanabazar_Square}) (72)
7047 \p{Zanb} \p{Zanabazar_Square} (=
7048 \p{Script_Extensions=Zanabazar_Square})
7049 (NOT \p{Block=Zanabazar_Square}) (72)
7050 \p{Zinh} \p{Inherited} (= \p{Script_Extensions=
7051 Inherited}) (586)
7052 \p{Zl} \p{Line_Separator} (= \p{General_Category=
7053 Line_Separator}) (1)
7054 X \p{Znamenny_Music} \p{Znamenny_Musical_Notation} (= \p{Block=
7055 Znamenny_Musical_Notation}) (208)
7056 X \p{Znamenny_Musical_Notation} \p{Block=Znamenny_Musical_Notation}
7057 (Short: \p{InZnamennyMusic}) (208)
7058 \p{Zp} \p{Paragraph_Separator} (=
7059 \p{General_Category=
7060 Paragraph_Separator}) (1)
7061 \p{Zs} \p{Space_Separator} (=
7062 \p{General_Category=Space_Separator})
7063 (17)
7064 \p{Zyyy} \p{Common} (= \p{Script_Extensions=
7065 Common}) (7824)
7066 \p{Zzzz} \p{Unknown} (= \p{Script_Extensions=
7067 Unknown}) (969_350 plus all above-
7068 Unicode code points)
7069
7070 Legal "\p{}" and "\P{}" constructs that match no characters
7071 Unicode has some property-value pairs that currently don't match
7072 anything. This happens generally either because they are obsolete, or
7073 they exist for symmetry with other forms, but no language has yet been
7074 encoded that uses them. In this version of Unicode, the following
7075 match zero code points:
7076
7077 \p{Canonical_Combining_Class=Attached_Below_Left}
7078 \p{Canonical_Combining_Class=CCC133}
7079 \p{Grapheme_Cluster_Break=E_Base}
7080 \p{Grapheme_Cluster_Break=E_Base_GAZ}
7081 \p{Grapheme_Cluster_Break=E_Modifier}
7082 \p{Grapheme_Cluster_Break=Glue_After_Zwj}
7083 \p{Word_Break=E_Base}
7084 \p{Word_Break=E_Base_GAZ}
7085 \p{Word_Break=E_Modifier}
7086 \p{Word_Break=Glue_After_Zwj}
7087
7089 The value of any Unicode (not including Perl extensions) character
7090 property mentioned above for any single code point is available through
7091 "charprop()" in Unicode::UCD. "charprops_all()" in Unicode::UCD
7092 returns the values of all the Unicode properties for a given code
7093 point.
7094
7095 Besides these, all the Unicode character properties mentioned above
7096 (except for those marked as for internal use by Perl) are also
7097 accessible by "prop_invlist()" in Unicode::UCD.
7098
7099 Due to their nature, not all Unicode character properties are suitable
7100 for regular expression matches, nor "prop_invlist()". The remaining
7101 non-provisional, non-internal ones are accessible via "prop_invmap()"
7102 in Unicode::UCD (except for those that this Perl installation hasn't
7103 included; see below for which those are).
7104
7105 For compatibility with other parts of Perl, all the single forms given
7106 in the table in the section above are recognized. BUT, there are some
7107 ambiguities between some Perl extensions and the Unicode properties,
7108 all of which are silently resolved in favor of the official Unicode
7109 property. To avoid surprises, you should only use "prop_invmap()" for
7110 forms listed in the table below, which omits the non-recommended ones.
7111 The affected forms are the Perl single form equivalents of Unicode
7112 properties, such as "\p{sc}" being a single-form equivalent of
7113 "\p{gc=sc}", which is treated by "prop_invmap()" as the "Script"
7114 property, whose short name is "sc". The table indicates the current
7115 ambiguities in the INFO column, beginning with the word "NOT".
7116
7117 The standard Unicode properties listed below are documented in
7118 <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
7119 documented in "prop_invmap()" in Unicode::UCD. The other Perl
7120 extensions are in "Other Properties" in perlunicode;
7121
7122 The first column in the table is a name for the property; the second
7123 column is an alternative name, if any, plus possibly some annotations.
7124 The alternative name is the property's full name, unless that would
7125 simply repeat the first column, in which case the second column
7126 indicates the property's short name (if different). The annotations
7127 are given only in the entry for the full name. The annotations for
7128 binary properties include a list of the first few ranges that the
7129 property matches. To avoid any ambiguity, the SPACE character is
7130 represented as "\x20".
7131
7132 If a property is obsolete, etc, the entry will be flagged with the same
7133 characters used in the table in the section above, like D or S.
7134
7135 NAME INFO
7136
7137 Age
7138 AHex ASCII_Hex_Digit
7139 All (Perl extension). All code points,
7140 including those above Unicode. Same as
7141 qr/./s. U+0000..infinity
7142 Alnum XPosixAlnum. (Perl extension)
7143 Alpha Alphabetic
7144 Alphabetic (Short: Alpha). [A-Za-z\xaa\xb5\xba\xc0-
7145 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7146 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7147 ...
7148 Any (Perl extension). All Unicode code
7149 points. U+0000..10FFFF
7150 ASCII Block=Basic_Latin. (Perl extension).
7151 [\x00-\x7f]
7152 ASCII_Hex_Digit (Short: AHex). [0-9A-Fa-f]
7153 Assigned (Perl extension). All assigned code
7154 points. U+0000..0377, U+037A..037F,
7155 U+0384..038A, U+038C, U+038E..03A1,
7156 U+03A3..052F ...
7157 Bc Bidi_Class
7158 Bidi_C Bidi_Control
7159 Bidi_Class (Short: bc)
7160 Bidi_Control (Short: Bidi_C). U+061C, U+200E..200F,
7161 U+202A..202E, U+2066..2069
7162 Bidi_M Bidi_Mirrored
7163 Bidi_Mirrored (Short: Bidi_M). [\(\)<>\[\]\{\}\xab
7164 \xbb], U+0F3A..0F3D, U+169B..169C,
7165 U+2039..203A, U+2045..2046, U+207D..207E
7166 ...
7167 Bidi_Mirroring_Glyph (Short: bmg)
7168 Bidi_Paired_Bracket (Short: bpb)
7169 Bidi_Paired_Bracket_Type (Short: bpt)
7170 Blank XPosixBlank. (Perl extension)
7171 Blk Block
7172 Block (Short: blk)
7173 Bmg Bidi_Mirroring_Glyph
7174 Bpb Bidi_Paired_Bracket
7175 Bpt Bidi_Paired_Bracket_Type
7176 Canonical_Combining_Class (Short: ccc)
7177 Case_Folding (Short: cf)
7178 Case_Ignorable (Short: CI). [\'.:\^`\xa8\xad\xaf\xb4
7179 \xb7-\xb8], U+02B0..036F, U+0374..0375,
7180 U+037A, U+0384..0385, U+0387 ...
7181 Cased [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7182 \xff], U+0100..01BA, U+01BC..01BF,
7183 U+01C4..0293, U+0295..02B8, U+02C0..02C1
7184 ...
7185 Category General_Category
7186 Ccc Canonical_Combining_Class
7187 CE Composition_Exclusion
7188 Cf Case_Folding; NOT 'cf' meaning
7189 'General_Category=Format'
7190 Changes_When_Casefolded (Short: CWCF). [A-Z\xb5\xc0-\xd6\xd8-
7191 \xdf], U+0100, U+0102, U+0104, U+0106,
7192 U+0108 ...
7193 Changes_When_Casemapped (Short: CWCM). [A-Za-z\xb5\xc0-\xd6\xd8-
7194 \xf6\xf8-\xff], U+0100..0137,
7195 U+0139..018C, U+018E..019A, U+019C..01A9,
7196 U+01AC..01B9 ...
7197 Changes_When_Lowercased (Short: CWL). [A-Z\xc0-\xd6\xd8-\xde],
7198 U+0100, U+0102, U+0104, U+0106, U+0108 ...
7199 Changes_When_NFKC_Casefolded (Short: CWKCF). [A-Z\xa0\xa8\xaa
7200 \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
7201 \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
7202 U+0106, U+0108 ...
7203 Changes_When_Titlecased (Short: CWT). [a-z\xb5\xdf-\xf6\xf8-
7204 \xff], U+0101, U+0103, U+0105, U+0107,
7205 U+0109 ...
7206 Changes_When_Uppercased (Short: CWU). [a-z\xb5\xdf-\xf6\xf8-
7207 \xff], U+0101, U+0103, U+0105, U+0107,
7208 U+0109 ...
7209 CI Case_Ignorable
7210 Cntrl XPosixCntrl (=General_Category=Control).
7211 (Perl extension)
7212 Comp_Ex Full_Composition_Exclusion
7213 Composition_Exclusion (Short: CE). U+0958..095F, U+09DC..09DD,
7214 U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
7215 CWCF Changes_When_Casefolded
7216 CWCM Changes_When_Casemapped
7217 CWKCF Changes_When_NFKC_Casefolded
7218 CWL Changes_When_Lowercased
7219 CWT Changes_When_Titlecased
7220 CWU Changes_When_Uppercased
7221 Dash [\-], U+058A, U+05BE, U+1400, U+1806,
7222 U+2010..2015 ...
7223 Decomposition_Mapping (Short: dm)
7224 Decomposition_Type (Short: dt)
7225 Default_Ignorable_Code_Point (Short: DI). [\xad], U+034F, U+061C,
7226 U+115F..1160, U+17B4..17B5, U+180B..180F
7227 ...
7228 Dep Deprecated
7229 Deprecated (Short: Dep). U+0149, U+0673, U+0F77,
7230 U+0F79, U+17A3..17A4, U+206A..206F ...
7231 DI Default_Ignorable_Code_Point
7232 Dia Diacritic
7233 Diacritic (Short: Dia). [\^`\xa8\xaf\xb4\xb7-\xb8],
7234 U+02B0..034E, U+0350..0357, U+035D..0362,
7235 U+0374..0375, U+037A ...
7236 Digit XPosixDigit (=General_Category=
7237 Decimal_Number). (Perl extension)
7238 Dm Decomposition_Mapping
7239 Dt Decomposition_Type
7240 Ea East_Asian_Width
7241 East_Asian_Width (Short: ea)
7242 EBase Emoji_Modifier_Base
7243 EComp Emoji_Component
7244 EMod Emoji_Modifier
7245 Emoji [#*0-9\xa9\xae], U+203C, U+2049, U+2122,
7246 U+2139, U+2194..2199 ...
7247 Emoji_Component (Short: EComp). [#*0-9], U+200D, U+20E3,
7248 U+FE0F, U+1F1E6..1F1FF, U+1F3FB..1F3FF ...
7249 Emoji_Modifier (Short: EMod). U+1F3FB..1F3FF
7250 Emoji_Modifier_Base (Short: EBase). U+261D, U+26F9,
7251 U+270A..270D, U+1F385, U+1F3C2..1F3C4,
7252 U+1F3C7 ...
7253 Emoji_Presentation (Short: EPres). U+231A..231B,
7254 U+23E9..23EC, U+23F0, U+23F3,
7255 U+25FD..25FE, U+2614..2615 ...
7256 EPres Emoji_Presentation
7257 EqUIdeo Equivalent_Unified_Ideograph
7258 Equivalent_Unified_Ideograph (Short: EqUIdeo)
7259 Ext Extender
7260 Extended_Pictographic (Short: ExtPict). [\xa9\xae], U+203C,
7261 U+2049, U+2122, U+2139, U+2194..2199 ...
7262 Extender (Short: Ext). [\xb7], U+02D0..02D1,
7263 U+0640, U+07FA, U+0B55, U+0E46 ...
7264 ExtPict Extended_Pictographic
7265 Full_Composition_Exclusion (Short: Comp_Ex). U+0340..0341,
7266 U+0343..0344, U+0374, U+037E, U+0387,
7267 U+0958..095F ...
7268 Gc General_Category
7269 GCB Grapheme_Cluster_Break
7270 General_Category (Short: gc)
7271 Gr_Base Grapheme_Base
7272 Gr_Ext Grapheme_Extend
7273 Graph XPosixGraph. (Perl extension)
7274 Grapheme_Base (Short: Gr_Base). [\x20-\x7e\xa0-\xac
7275 \xae-\xff], U+0100..02FF, U+0370..0377,
7276 U+037A..037F, U+0384..038A, U+038C ...
7277 Grapheme_Cluster_Break (Short: GCB)
7278 Grapheme_Extend (Short: Gr_Ext). U+0300..036F,
7279 U+0483..0489, U+0591..05BD, U+05BF,
7280 U+05C1..05C2, U+05C4..05C5 ...
7281 Hangul_Syllable_Type (Short: hst)
7282 Hex Hex_Digit
7283 Hex_Digit (Short: Hex). [0-9A-Fa-f], U+FF10..FF19,
7284 U+FF21..FF26, U+FF41..FF46
7285 HorizSpace XPosixBlank. (Perl extension)
7286 Hst Hangul_Syllable_Type
7287 D Hyphen [\-\xad], U+058A, U+1806, U+2010..2011,
7288 U+2E17, U+30FB ... Supplanted by
7289 Line_Break property values; see
7290 www.unicode.org/reports/tr14
7291 ID_Continue (Short: IDC). [0-9A-Z_a-z\xaa\xb5\xb7
7292 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7293 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7294 U+02EC, U+02EE ...
7295 ID_Start (Short: IDS). [A-Za-z\xaa\xb5\xba\xc0-
7296 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7297 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7298 ...
7299 IDC ID_Continue
7300 Identifier_Status
7301 Identifier_Type
7302 Ideo Ideographic
7303 Ideographic (Short: Ideo). U+3006..3007,
7304 U+3021..3029, U+3038..303A, U+3400..4DBF,
7305 U+4E00..9FFF, U+F900..FA6D ...
7306 IDS ID_Start
7307 IDS_Binary_Operator (Short: IDSB). U+2FF0..2FF1, U+2FF4..2FFB
7308 IDS_Trinary_Operator (Short: IDST). U+2FF2..2FF3
7309 IDSB IDS_Binary_Operator
7310 IDST IDS_Trinary_Operator
7311 In Present_In. (Perl extension)
7312 Indic_Positional_Category (Short: InPC)
7313 Indic_Syllabic_Category (Short: InSC)
7314 InPC Indic_Positional_Category
7315 InSC Indic_Syllabic_Category
7316 Isc ISO_Comment; NOT 'isc' meaning
7317 'General_Category=Other'
7318 ISO_Comment (Short: isc)
7319 Jg Joining_Group
7320 Join_C Join_Control
7321 Join_Control (Short: Join_C). U+200C..200D
7322 Joining_Group (Short: jg)
7323 Joining_Type (Short: jt)
7324 Jt Joining_Type
7325 Lb Line_Break
7326 Lc Lowercase_Mapping; NOT 'lc' meaning
7327 'General_Category=Cased_Letter'
7328 Line_Break (Short: lb)
7329 LOE Logical_Order_Exception
7330 Logical_Order_Exception (Short: LOE). U+0E40..0E44, U+0EC0..0EC4,
7331 U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
7332 ...
7333 Lower Lowercase
7334 Lowercase (Short: Lower). [a-z\xaa\xb5\xba\xdf-
7335 \xf6\xf8-\xff], U+0101, U+0103, U+0105,
7336 U+0107, U+0109 ...
7337 Lowercase_Mapping (Short: lc)
7338 Math [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
7339 U+03D5, U+03F0..03F1, U+03F4..03F6,
7340 U+0606..0608 ...
7341 Na Name
7342 Na1 Unicode_1_Name
7343 Name (Short: na)
7344 Name_Alias
7345 NChar Noncharacter_Code_Point
7346 NFC_QC NFC_Quick_Check
7347 NFC_Quick_Check (Short: NFC_QC)
7348 NFD_QC NFD_Quick_Check
7349 NFD_Quick_Check (Short: NFD_QC)
7350 NFKC_Casefold (Short: NFKC_CF)
7351 NFKC_CF NFKC_Casefold
7352 NFKC_QC NFKC_Quick_Check
7353 NFKC_Quick_Check (Short: NFKC_QC)
7354 NFKD_QC NFKD_Quick_Check
7355 NFKD_Quick_Check (Short: NFKD_QC)
7356 Noncharacter_Code_Point (Short: NChar). U+FDD0..FDEF,
7357 U+FFFE..FFFF, U+1FFFE..1FFFF,
7358 U+2FFFE..2FFFF, U+3FFFE..3FFFF,
7359 U+4FFFE..4FFFF ...
7360 Nt Numeric_Type
7361 Numeric_Type (Short: nt)
7362 Numeric_Value (Short: nv)
7363 Nv Numeric_Value
7364 Pat_Syn Pattern_Syntax
7365 Pat_WS Pattern_White_Space
7366 Pattern_Syntax (Short: Pat_Syn). [!\"#\$\%&\'\(\)*+,\-.
7367 \/:;<=>?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9
7368 \xab-\xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7
7369 \xf7], U+2010..2027, U+2030..203E,
7370 U+2041..2053, U+2055..205E, U+2190..245F
7371 ...
7372 Pattern_White_Space (Short: Pat_WS). [\t\n\cK\f\r\x20\x85],
7373 U+200E..200F, U+2028..2029
7374 PCM Prepended_Concatenation_Mark
7375 Perl_Decimal_Digit (Perl extension)
7376 PerlSpace PosixSpace. (Perl extension)
7377 PerlWord PosixWord. (Perl extension)
7378 PosixAlnum (Perl extension). [0-9A-Za-z]
7379 PosixAlpha (Perl extension). [A-Za-z]
7380 PosixBlank (Perl extension). [\t\x20]
7381 PosixCntrl (Perl extension). ASCII control
7382 characters. ACK, BEL, BS, CAN, CR, DC1,
7383 DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
7384 ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
7385 NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
7386 PosixDigit (Perl extension). [0-9]
7387 PosixGraph (Perl extension). [!\"#\$\%&\'\(\)*+,\-.
7388 \/0-9:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~]
7389 PosixLower (Perl extension). [a-z]
7390 PosixPrint (Perl extension). [\x20-\x7e]
7391 PosixPunct (Perl extension). [!\"#\$\%&\'\(\)*+,\-.
7392 \/:;<=>?\@\[\\\]\^_`\{\|\}~]
7393 PosixSpace (Perl extension). [\t\n\cK\f\r\x20]
7394 PosixUpper (Perl extension). [A-Z]
7395 PosixWord (Perl extension). \w, restricted to
7396 ASCII. [0-9A-Z_a-z]
7397 PosixXDigit ASCII_Hex_Digit. (Perl extension).
7398 [0-9A-Fa-f]
7399 Prepended_Concatenation_Mark (Short: PCM). U+0600..0605, U+06DD,
7400 U+070F, U+0890..0891, U+08E2, U+110BD ...
7401 Present_In (Short: In). (Perl extension)
7402 Print XPosixPrint. (Perl extension)
7403 Punct General_Category=Punctuation. (Perl
7404 extension). [!\"#\%&\'\(\)*,\-.\/:;?\@
7405 \[\\\]_\{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
7406 U+037E, U+0387, U+055A..055F,
7407 U+0589..058A, U+05BE ...
7408 QMark Quotation_Mark
7409 Quotation_Mark (Short: QMark). [\"\'\xab\xbb],
7410 U+2018..201F, U+2039..203A, U+2E42,
7411 U+300C..300F, U+301D..301F ...
7412 Radical U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
7413 Regional_Indicator (Short: RI). U+1F1E6..1F1FF
7414 RI Regional_Indicator
7415 SB Sentence_Break
7416 Sc Script; NOT 'sc' meaning
7417 'General_Category=Currency_Symbol'
7418 Scf Simple_Case_Folding
7419 Script (Short: sc)
7420 Script_Extensions (Short: scx)
7421 Scx Script_Extensions
7422 SD Soft_Dotted
7423 Sentence_Break (Short: SB)
7424 Sentence_Terminal (Short: STerm). [!.?], U+0589,
7425 U+061D..061F, U+06D4, U+0700..0702, U+07F9
7426 ...
7427 Sfc Simple_Case_Folding
7428 Simple_Case_Folding (Short: scf)
7429 Simple_Lowercase_Mapping (Short: slc)
7430 Simple_Titlecase_Mapping (Short: stc)
7431 Simple_Uppercase_Mapping (Short: suc)
7432 Slc Simple_Lowercase_Mapping
7433 Soft_Dotted (Short: SD). [i-j], U+012F, U+0249,
7434 U+0268, U+029D, U+02B2 ...
7435 Space White_Space
7436 SpacePerl XPosixSpace. (Perl extension)
7437 Stc Simple_Titlecase_Mapping
7438 STerm Sentence_Terminal
7439 Suc Simple_Uppercase_Mapping
7440 Tc Titlecase_Mapping
7441 Term Terminal_Punctuation
7442 Terminal_Punctuation (Short: Term). [!,.:;?], U+037E, U+0387,
7443 U+0589, U+05C3, U+060C ...
7444 Title Titlecase. (Perl extension)
7445 Titlecase (Short: Title). (Perl extension). (=
7446 \p{Gc=Lt}). U+01C5, U+01C8, U+01CB,
7447 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
7448 Titlecase_Mapping (Short: tc)
7449 Uc Uppercase_Mapping
7450 UIdeo Unified_Ideograph
7451 Unicode Any. (Perl extension)
7452 Unicode_1_Name (Short: na1)
7453 Unified_Ideograph (Short: UIdeo). U+3400..4DBF,
7454 U+4E00..9FFF, U+FA0E..FA0F, U+FA11,
7455 U+FA13..FA14, U+FA1F ...
7456 Upper Uppercase
7457 Uppercase (Short: Upper). [A-Z\xc0-\xd6\xd8-\xde],
7458 U+0100, U+0102, U+0104, U+0106, U+0108 ...
7459 Uppercase_Mapping (Short: uc)
7460 Variation_Selector (Short: VS). U+180B..180D, U+180F,
7461 U+FE00..FE0F, U+E0100..E01EF
7462 Vertical_Orientation (Short: vo)
7463 VertSpace (Perl extension). \v. [\n\cK\f\r\x85],
7464 U+2028..2029
7465 Vo Vertical_Orientation
7466 VS Variation_Selector
7467 WB Word_Break
7468 White_Space (Short: WSpace). [\t\n\cK\f\r\x20\x85
7469 \xa0], U+1680, U+2000..200A, U+2028..2029,
7470 U+202F, U+205F ...
7471 Word XPosixWord. (Perl extension)
7472 Word_Break (Short: WB)
7473 WSpace White_Space
7474 XDigit XPosixXDigit (=Hex_Digit). (Perl
7475 extension)
7476 XID_Continue (Short: XIDC). [0-9A-Z_a-z\xaa\xb5\xb7
7477 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7478 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7479 U+02EC, U+02EE ...
7480 XID_Start (Short: XIDS). [A-Za-z\xaa\xb5\xba\xc0-
7481 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7482 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7483 ...
7484 XIDC XID_Continue
7485 XIDS XID_Start
7486 XPerlSpace XPosixSpace. (Perl extension)
7487 XPosixAlnum (Short: Alnum). (Perl extension).
7488 Alphabetic and (decimal) Numeric. [0-9A-
7489 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7490 \xff], U+0100..02C1, U+02C6..02D1,
7491 U+02E0..02E4, U+02EC, U+02EE ...
7492 XPosixAlpha Alphabetic. (Perl extension). [A-Za-z
7493 \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7494 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7495 U+02EC, U+02EE ...
7496 XPosixBlank (Short: Blank). (Perl extension). \h,
7497 Horizontal white space. [\t\x20\xa0],
7498 U+1680, U+2000..200A, U+202F, U+205F,
7499 U+3000
7500 XPosixCntrl General_Category=Control (Short: Cntrl).
7501 (Perl extension). Control characters.
7502 [\x00-\x1f\x7f-\x9f]
7503 XPosixDigit General_Category=Decimal_Number (Short:
7504 Digit). (Perl extension). [0-9] + all
7505 other decimal digits. [0-9],
7506 U+0660..0669, U+06F0..06F9, U+07C0..07C9,
7507 U+0966..096F, U+09E6..09EF ...
7508 XPosixGraph (Short: Graph). (Perl extension).
7509 Characters that are graphical. [!\"#\$
7510 \%&\'\(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]
7511 \^_`a-z\{\|\}~\xa1-\xff], U+0100..0377,
7512 U+037A..037F, U+0384..038A, U+038C,
7513 U+038E..03A1 ...
7514 XPosixLower Lowercase. (Perl extension). [a-z\xaa
7515 \xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
7516 U+0103, U+0105, U+0107, U+0109 ...
7517 XPosixPrint (Short: Print). (Perl extension).
7518 Characters that are graphical plus space
7519 characters (but no controls). [\x20-\x7e
7520 \xa0-\xff], U+0100..0377, U+037A..037F,
7521 U+0384..038A, U+038C, U+038E..03A1 ...
7522 XPosixPunct (Perl extension). \p{Punct} + ASCII-range
7523 \p{Symbol}. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7524 >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
7525 \xb7\xbb\xbf], U+037E, U+0387,
7526 U+055A..055F, U+0589..058A, U+05BE ...
7527 XPosixSpace (Perl extension). \s including beyond
7528 ASCII and vertical tab. [\t\n\cK\f\r\x20
7529 \x85\xa0], U+1680, U+2000..200A,
7530 U+2028..2029, U+202F, U+205F ...
7531 XPosixUpper Uppercase. (Perl extension). [A-Z\xc0-
7532 \xd6\xd8-\xde], U+0100, U+0102, U+0104,
7533 U+0106, U+0108 ...
7534 XPosixWord (Short: Word). (Perl extension). \w,
7535 including beyond ASCII; = \p{Alnum} + \pM
7536 + \p{Pc} + \p{Join_Control}. [0-9A-Z_a-z
7537 \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7538 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7539 U+02EC, U+02EE ...
7540 XPosixXDigit Hex_Digit (Short: XDigit). (Perl
7541 extension). [0-9A-Fa-f], U+FF10..FF19,
7542 U+FF21..FF26, U+FF41..FF46
7543
7545 Certain properties are accessible also via core function calls. These
7546 are:
7547
7548 Lowercase_Mapping lc() and lcfirst()
7549 Titlecase_Mapping ucfirst()
7550 Uppercase_Mapping uc()
7551
7552 Also, Case_Folding is accessible through the "/i" modifier in regular
7553 expressions, the "\F" transliteration escape, and the "fc" operator.
7554
7555 Besides being able to say "\p{Name=...}", the Name and Name_Aliases
7556 properties are accessible through the "\N{}" interpolation in double-
7557 quoted strings and regular expressions; and functions
7558 "charnames::viacode()", "charnames::vianame()", and
7559 "charnames::string_vianame()" (which require a "use charnames ();" to
7560 be specified.
7561
7562 Finally, most properties related to decomposition are accessible via
7563 Unicode::Normalize.
7564
7566 Perl will generate an error for a few character properties in Unicode
7567 when used in a regular expression. The non-Unihan ones are listed
7568 below, with the reasons they are not accepted, perhaps with work-
7569 arounds. The short names for the properties are listed enclosed in
7570 (parentheses). As described after the list, an installation can change
7571 the defaults and choose to accept any of these. The list is machine
7572 generated based on the choices made for the installation that generated
7573 this document.
7574
7575 Expands_On_NFC (XO_NFC)
7576 Expands_On_NFD (XO_NFD)
7577 Expands_On_NFKC (XO_NFKC)
7578 Expands_On_NFKD (XO_NFKD)
7579 Deprecated by Unicode. These are characters that expand to more
7580 than one character in the specified normalization form, but whether
7581 they actually take up more bytes or not depends on the encoding
7582 being used. For example, a UTF-8 encoded character may expand to a
7583 different number of bytes than a UTF-32 encoded character.
7584
7585 Grapheme_Link (Gr_Link)
7586 Duplicates ccc=vr (Canonical_Combining_Class=Virama)
7587
7588 Jamo_Short_Name (JSN)
7589 Other_Alphabetic (OAlpha)
7590 Other_Default_Ignorable_Code_Point (ODI)
7591 Other_Grapheme_Extend (OGr_Ext)
7592 Other_ID_Continue (OIDC)
7593 Other_ID_Start (OIDS)
7594 Other_Lowercase (OLower)
7595 Other_Math (OMath)
7596 Other_Uppercase (OUpper)
7597 Used by Unicode internally for generating other properties and not
7598 intended to be used stand-alone
7599
7600 Script=Katakana_Or_Hiragana (sc=Hrkt)
7601 Obsolete. All code points previously matched by this have been
7602 moved to "Script=Common". Consider instead using
7603 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
7604 both)
7605
7606 Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
7607 All code points that would be matched by this are matched by either
7608 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
7609
7610 An installation can choose to allow any of these to be matched by
7611 downloading the Unicode database from <http://www.unicode.org/Public/>
7612 to $Config{privlib}/unicore/ in the Perl source tree, changing the
7613 controlling lists contained in the program
7614 $Config{privlib}/unicore/mktables and then re-compiling and installing.
7615 (%Config is available from the Config module).
7616
7617 Also, perl can be recompiled to operate on an earlier version of the
7618 Unicode standard. Further information is at
7619 $Config{privlib}/unicore/README.perl.
7620
7622 The Unicode data base is delivered in two different formats. The XML
7623 version is valid for more modern Unicode releases. The other version
7624 is a collection of files. The two are intended to give equivalent
7625 information. Perl uses the older form; this allows you to recompile
7626 Perl to use early Unicode releases.
7627
7628 The only non-character property that Perl currently supports is Named
7629 Sequences, in which a sequence of code points is given a name and
7630 generally treated as a single entity. (Perl supports these via the
7631 "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
7632 in charnames, and "namedseq()" in Unicode::UCD.
7633
7634 Below is a list of the files in the Unicode data base that Perl doesn't
7635 currently use, along with very brief descriptions of their purposes.
7636 Some of the names of the files have been shortened from those that
7637 Unicode uses, in order to allow them to be distinguishable from
7638 similarly named files on file systems for which only the first 8
7639 characters of a name are significant.
7640
7641 auxiliary/GraphemeBreakTest.html
7642 auxiliary/LineBreakTest.html
7643 auxiliary/SentenceBreakTest.html
7644 auxiliary/WordBreakTest.html
7645 Documentation of validation Tests
7646
7647 BidiCharacterTest.txt
7648 BidiTest.txt
7649 NormTest.txt
7650 Validation Tests
7651
7652 CJKRadicals.txt
7653 Maps the kRSUnicode property values to corresponding code points
7654
7655 emoji/ReadMe.txt
7656 ReadMe.txt
7657 Documentation
7658
7659 EmojiSources.txt
7660 Maps certain Unicode code points to their legacy Japanese cell-
7661 phone values
7662
7663 extracted/DName.txt
7664 This file adds no new information not already present in other
7665 files
7666
7667 Index.txt
7668 Alphabetical index of Unicode characters
7669
7670 NamedSqProv.txt
7671 Named sequences proposed for inclusion in a later version of the
7672 Unicode Standard; if you need them now, you can append this file to
7673 NamedSequences.txt and recompile perl
7674
7675 NamesList.html
7676 Describes the format and contents of NamesList.txt
7677
7678 NamesList.txt
7679 Annotated list of characters
7680
7681 NormalizationCorrections.txt
7682 Documentation of corrections already incorporated into the Unicode
7683 data base
7684
7685 NushuSources.txt
7686 Specifies source material for Nushu characters
7687
7688 StandardizedVariants.html
7689 Obsoleted as of Unicode 9.0, but previously provided a visual
7690 display of the standard variant sequences derived from
7691 StandardizedVariants.txt.
7692
7693 StandardizedVariants.txt
7694 Certain glyph variations for character display are standardized.
7695 This lists the non-Unihan ones; the Unihan ones are also not used
7696 by Perl, and are in a separate Unicode data base
7697 <http://www.unicode.org/ivd>
7698
7699 TangutSources.txt
7700 Specifies source mappings for Tangut ideographs and components.
7701 This data file also includes informative radical-stroke values that
7702 are used internally by Unicode
7703
7704 USourceData.txt
7705 Documentation of status and cross reference of proposals for
7706 encoding by Unicode of Unihan characters
7707
7708 USourceGlyphs.pdf
7709 Pictures of the characters in USourceData.txt
7710
7712 <http://www.unicode.org/reports/tr44/>
7713
7714 perlrecharclass
7715
7716 perlunicode
7717
7718
7719
7720perl v5.36.3 2023-11-30 PERLUNIPROPS(1)