1PERLUNIPROPS(1) Perl Programmers Reference Guide PERLUNIPROPS(1)
2
3
4
6 perluniprops - Index of Unicode Version 12.1.0 character properties in
7 Perl
8
10 This document provides information about the portion of the Unicode
11 database that deals with character properties, that is the portion that
12 is defined on single code points. ("Other information in the Unicode
13 data base" below briefly mentions other data that Unicode provides.)
14
15 Perl can provide access to all non-provisional Unicode character
16 properties, though not all are enabled by default. The omitted ones
17 are the Unihan properties (accessible via the CPAN module
18 Unicode::Unihan) and certain deprecated or Unicode-internal properties.
19 (An installation may choose to recompile Perl's tables to change this.
20 See "Unicode character properties that are NOT accepted by Perl".)
21
22 For most purposes, access to Unicode properties from the Perl core is
23 through regular expression matches, as described in the next section.
24 For some special purposes, and to access the properties that are not
25 suitable for regular expression matching, all the Unicode character
26 properties that Perl handles are accessible via the standard
27 Unicode::UCD module, as described in the section "Properties accessible
28 through Unicode::UCD".
29
30 Perl also provides some additional extensions and short-cut synonyms
31 for Unicode properties.
32
33 This document merely lists all available properties and does not
34 attempt to explain what each property really means. There is a brief
35 description of each Perl extension; see "Other Properties" in
36 perlunicode for more information on these. There is some detail about
37 Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
38 to find out about the intricacies of the official Unicode properties,
39 refer to the Unicode standard. A good starting place is
40 <http://www.unicode.org/reports/tr44/>.
41
42 Note that you can define your own properties; see "User-Defined
43 Character Properties" in perlunicode.
44
46 The Perl regular expression "\p{}" and "\P{}" constructs give access to
47 most of the Unicode character properties. The table below shows all
48 these constructs, both single and compound forms.
49
50 Compound forms consist of two components, separated by an equals sign
51 or a colon. The first component is the property name, and the second
52 component is the particular value of the property to match against, for
53 example, "\p{Script_Extensions: Greek}" and
54 "\p{Script_Extensions=Greek}" both mean to match characters whose
55 Script_Extensions property value is Greek. ("Script_Extensions" is an
56 improved version of the "Script" property.)
57
58 Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
59 their equivalent compound forms. The table shows these equivalences.
60 (In our example, "\p{Greek}" is a just a shortcut for
61 "\p{Script_Extensions=Greek}"). There are also a few Perl-defined
62 single forms that are not shortcuts for a compound form. One such is
63 "\p{Word}". These are also listed in the table.
64
65 In parsing these constructs, Perl always ignores Upper/lower case
66 differences everywhere within the {braces}. Thus "\p{Greek}" means the
67 same thing as "\p{greek}". But note that changing the case of the "p"
68 or "P" before the left brace completely changes the meaning of the
69 construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
70 Casing in this document is for improved legibility.
71
72 Also, white space, hyphens, and underscores are normally ignored
73 everywhere between the {braces}, and hence can be freely added or
74 removed even if the "/x" modifier hasn't been specified on the regular
75 expression. But in the table below a 'T' at the beginning of an entry
76 means that tighter (stricter) rules are used for that entry:
77
78 Single form ("\p{name}") tighter rules:
79 White space, hyphens, and underscores ARE significant except
80 for:
81
82 · white space adjacent to a non-word character
83
84 · underscores separating digits in numbers
85
86 That means, for example, that you can freely add or remove
87 white space adjacent to (but within) the braces without
88 affecting the meaning.
89
90 Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
91 The tighter rules given above for the single form apply to
92 everything to the right of the colon or equals; the looser
93 rules still apply to everything to the left.
94
95 That means, for example, that you can freely add or remove
96 white space adjacent to (but within) the braces and the colon
97 or equal sign.
98
99 Some properties are considered obsolete by Unicode, but still
100 available. There are several varieties of obsolescence:
101
102 Stabilized
103 A property may be stabilized. Such a determination does not
104 indicate that the property should or should not be used;
105 instead it is a declaration that the property will not be
106 maintained nor extended for newly encoded characters. Such
107 properties are marked with an 'S' in the table.
108
109 Deprecated
110 A property may be deprecated, perhaps because its original
111 intent has been replaced by another property, or because its
112 specification was somehow defective. This means that its use
113 is strongly discouraged, so much so that a warning will be
114 issued if used, unless the regular expression is in the scope
115 of a "no warnings 'deprecated'" statement. A 'D' flags each
116 such entry in the table, and the entry there for the longest,
117 most descriptive version of the property will give the reason
118 it is deprecated, and perhaps advice. Perl may issue such a
119 warning, even for properties that aren't officially deprecated
120 by Unicode, when there used to be characters or code points
121 that were matched by them, but no longer. This is to warn you
122 that your program may not work like it did on earlier Unicode
123 releases.
124
125 A deprecated property may be made unavailable in a future Perl
126 version, so it is best to move away from them.
127
128 A deprecated property may also be stabilized, but this fact is
129 not shown.
130
131 Obsolete
132 Properties marked with an 'O' in the table are considered
133 (plain) obsolete. Generally this designation is given to
134 properties that Unicode once used for internal purposes (but
135 not any longer).
136
137 Discouraged
138 This is not actually a Unicode-specified obsolescence, but
139 applies to certain Perl extensions that are present for
140 backwards compatibility, but are discouraged from being used.
141 These are not obsolete, but their meanings are not stable.
142 Future Unicode versions could force any of these extensions to
143 be removed without warning, replaced by another property with
144 the same name that means something different. An 'X' flags
145 each such entry in the table. Use the equivalent shown
146 instead.
147
148 In particular, matches in the Block property have single forms
149 defined by Perl that begin with "In_", ""Is_", or even with no
150 prefix at all, Like all DISCOURAGED forms, these are not
151 stable. For example, "\p{Block=Deseret}" can currently be
152 written as "\p{In_Deseret}", "\p{Is_Deseret}", or
153 "\p{Deseret}". But, a new Unicode version may come along that
154 would force Perl to change the meaning of one or more of these,
155 and your program would no longer be correct. Currently there
156 are no such conflicts with the form that begins "In_", but
157 there are many with the other two shortcuts, and Unicode
158 continues to define new properties that begin with "In", so
159 it's quite possible that a conflict will occur in the future.
160 The compound form is guaranteed to not become obsolete, and its
161 meaning is clearer anyway. See "Blocks" in perlunicode for
162 more information about this.
163
164 The table below has two columns. The left column contains the "\p{}"
165 constructs to look up, possibly preceded by the flags mentioned above;
166 and the right column contains information about them, like a
167 description, or synonyms. The table shows both the single and compound
168 forms for each property that has them. If the left column is a short
169 name for a property, the right column will give its longer, more
170 descriptive name; and if the left column is the longest name, the right
171 column will show any equivalent shortest name, in both single and
172 compound forms if applicable.
173
174 If braces are not needed to specify a property (e.g., "\pL"), the left
175 column contains both forms, with and without braces.
176
177 The right column will also caution you if a property means something
178 different than what might normally be expected.
179
180 All single forms are Perl extensions; a few compound forms are as well,
181 and are noted as such.
182
183 Numbers in (parentheses) indicate the total number of Unicode code
184 points matched by the property. For the entries that give the longest,
185 most descriptive version of the property, the count is followed by a
186 list of some of the code points matched by it. The list includes all
187 the matched characters in the 0-255 range, enclosed in the familiar
188 [brackets] the same as a regular expression bracketed character class.
189 Following that, the next few higher matching ranges are also given. To
190 avoid visual ambiguity, the SPACE character is represented as "\x20".
191
192 For emphasis, those properties that match no code points at all are
193 listed as well in a separate section following the table.
194
195 Most properties match the same code points regardless of whether "/i"
196 case-insensitive matching is specified or not. But a few properties
197 are affected. These are shown with the notation "(/i= other_property)"
198 in the second column. Under case-insensitive matching they match the
199 same code pode points as the property other_property.
200
201 There is no description given for most non-Perl defined properties (See
202 <http://www.unicode.org/reports/tr44/> for that).
203
204 For compactness, '*' is used as a wildcard instead of showing all
205 possible combinations. For example, entries like:
206
207 \p{Gc: *} \p{General_Category: *}
208
209 mean that 'Gc' is a synonym for 'General_Category', and anything that
210 is valid for the latter is also valid for the former. Similarly,
211
212 \p{Is_*} \p{*}
213
214 means that if and only if, for example, "\p{Foo}" exists, then
215 "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
216 thing. And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
217 and "\p{IsFoo=Bar}". "*" here is restricted to something not beginning
218 with an underscore.
219
220 Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
221 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table
222 shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
223 for the other possibilities. Note that not all properties which have
224 values 'Yes' and 'No' are binary, and they have all their values
225 spelled out without using this wild card, and a "NOT" clause in their
226 description that highlights their not being binary. These also require
227 the compound form to match them, whereas true binary properties have
228 both single and compound forms available.
229
230 Note that all non-essential underscores are removed in the display of
231 the short names below.
232
233 Legend summary:
234
235 * is a wild-card
236 (\d+) in the info column gives the number of Unicode code points
237 matched by this property.
238 D means this is deprecated.
239 O means this is obsolete.
240 S means this is stabilized.
241 T means tighter (stricter) name matching applies.
242 X means use of this form is discouraged, and may not be stable.
243
244 NAME INFO
245
246 \p{Adlam} \p{Script_Extensions=Adlam} (Short:
247 \p{Adlm}; NOT \p{Block=Adlam}) (89)
248 \p{Adlm} \p{Adlam} (= \p{Script_Extensions=Adlam})
249 (NOT \p{Block=Adlam}) (89)
250 X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64)
251 T \p{Age: 1.1} \p{Age=V1_1} (33_979)
252 \p{Age: V1_1} Code point's usage introduced in version
253 1.1 (33_979: U+0000..01F5, U+01FA..0217,
254 U+0250..02A8, U+02B0..02DE,
255 U+02E0..02E9, U+0300..0345 ...)
256 T \p{Age: 2.0} \p{Age=V2_0} (144_521)
257 \p{Age: V2_0} Code point's usage was introduced in
258 version 2.0; See also Property
259 'Present_In' (144_521: U+0591..05A1,
260 U+05A3..05AF, U+05C4, U+0F00..0F47,
261 U+0F49..0F69, U+0F71..0F8B ...)
262 T \p{Age: 2.1} \p{Age=V2_1} (2)
263 \p{Age: V2_1} Code point's usage was introduced in
264 version 2.1; See also Property
265 'Present_In' (2: U+20AC, U+FFFC)
266 T \p{Age: 3.0} \p{Age=V3_0} (10_307)
267 \p{Age: V3_0} Code point's usage was introduced in
268 version 3.0; See also Property
269 'Present_In' (10_307: U+01F6..01F9,
270 U+0218..021F, U+0222..0233,
271 U+02A9..02AD, U+02DF, U+02EA..02EE ...)
272 T \p{Age: 3.1} \p{Age=V3_1} (44_978)
273 \p{Age: V3_1} Code point's usage was introduced in
274 version 3.1; See also Property
275 'Present_In' (44_978: U+03F4..03F5,
276 U+FDD0..FDEF, U+10300..1031E,
277 U+10320..10323, U+10330..1034A,
278 U+10400..10425 ...)
279 T \p{Age: 3.2} \p{Age=V3_2} (1016)
280 \p{Age: V3_2} Code point's usage was introduced in
281 version 3.2; See also Property
282 'Present_In' (1016: U+0220, U+034F,
283 U+0363..036F, U+03D8..03D9, U+03F6,
284 U+048A..048B ...)
285 T \p{Age: 4.0} \p{Age=V4_0} (1226)
286 \p{Age: V4_0} Code point's usage was introduced in
287 version 4.0; See also Property
288 'Present_In' (1226: U+0221,
289 U+0234..0236, U+02AE..02AF,
290 U+02EF..02FF, U+0350..0357, U+035D..035F
291 ...)
292 T \p{Age: 4.1} \p{Age=V4_1} (1273)
293 \p{Age: V4_1} Code point's usage was introduced in
294 version 4.1; See also Property
295 'Present_In' (1273: U+0237..0241,
296 U+0358..035C, U+03FC..03FF,
297 U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
298 T \p{Age: 5.0} \p{Age=V5_0} (1369)
299 \p{Age: V5_0} Code point's usage was introduced in
300 version 5.0; See also Property
301 'Present_In' (1369: U+0242..024F,
302 U+037B..037D, U+04CF, U+04FA..04FF,
303 U+0510..0513, U+05BA ...)
304 T \p{Age: 5.1} \p{Age=V5_1} (1624)
305 \p{Age: V5_1} Code point's usage was introduced in
306 version 5.1; See also Property
307 'Present_In' (1624: U+0370..0373,
308 U+0376..0377, U+03CF, U+0487,
309 U+0514..0523, U+0606..060A ...)
310 T \p{Age: 5.2} \p{Age=V5_2} (6648)
311 \p{Age: V5_2} Code point's usage was introduced in
312 version 5.2; See also Property
313 'Present_In' (6648: U+0524..0525,
314 U+0800..082D, U+0830..083E, U+0900,
315 U+094E, U+0955 ...)
316 T \p{Age: 6.0} \p{Age=V6_0} (2088)
317 \p{Age: V6_0} Code point's usage was introduced in
318 version 6.0; See also Property
319 'Present_In' (2088: U+0526..0527,
320 U+0620, U+065F, U+0840..085B, U+085E,
321 U+093A..093B ...)
322 T \p{Age: 6.1} \p{Age=V6_1} (732)
323 \p{Age: V6_1} Code point's usage was introduced in
324 version 6.1; See also Property
325 'Present_In' (732: U+058F, U+0604,
326 U+08A0, U+08A2..08AC, U+08E4..08FE,
327 U+0AF0 ...)
328 T \p{Age: 6.2} \p{Age=V6_2} (1)
329 \p{Age: V6_2} Code point's usage was introduced in
330 version 6.2; See also Property
331 'Present_In' (1: U+20BA)
332 T \p{Age: 6.3} \p{Age=V6_3} (5)
333 \p{Age: V6_3} Code point's usage was introduced in
334 version 6.3; See also Property
335 'Present_In' (5: U+061C, U+2066..2069)
336 T \p{Age: 7.0} \p{Age=V7_0} (2834)
337 \p{Age: V7_0} Code point's usage was introduced in
338 version 7.0; See also Property
339 'Present_In' (2834: U+037F,
340 U+0528..052F, U+058D..058E, U+0605,
341 U+08A1, U+08AD..08B2 ...)
342 T \p{Age: 8.0} \p{Age=V8_0} (7716)
343 \p{Age: V8_0} Code point's usage was introduced in
344 version 8.0; See also Property
345 'Present_In' (7716: U+08B3..08B4,
346 U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
347 ...)
348 T \p{Age: 9.0} \p{Age=V9_0} (7500)
349 \p{Age: V9_0} Code point's usage was introduced in
350 version 9.0; See also Property
351 'Present_In' (7500: U+08B6..08BD,
352 U+08D4..08E2, U+0C80, U+0D4F,
353 U+0D54..0D56, U+0D58..0D5E ...)
354 T \p{Age: 10.0} \p{Age=V10_0} (8518)
355 \p{Age: V10_0} Code point's usage was introduced in
356 version 10.0; See also Property
357 'Present_In' (8518: U+0860..086A,
358 U+09FC..09FD, U+0AFA..0AFF, U+0D00,
359 U+0D3B..0D3C, U+1CF7 ...)
360 T \p{Age: 11.0} \p{Age=V11_0} (684)
361 \p{Age: V11_0} Code point's usage was introduced in
362 version 11.0; See also Property
363 'Present_In' (684: U+0560, U+0588,
364 U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
365 T \p{Age: 12.0} \p{Age=V12_0} (554)
366 \p{Age: V12_0} Code point's usage was introduced in
367 version 12.0; See also Property
368 'Present_In' (554: U+0C77, U+0E86,
369 U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
370 T \p{Age: 12.1} \p{Age=V12_1} (1)
371 \p{Age: V12_1} Code point's usage was introduced in
372 version 12.1; See also Property
373 'Present_In' (1: U+32FF)
374 \p{Age: NA} \p{Age=Unassigned} (836_536 plus all
375 above-Unicode code points)
376 \p{Age: Unassigned} Code point's usage has not been assigned
377 in any Unicode release thus far. (Short:
378 \p{Age=NA}) (836_536 plus all above-
379 Unicode code points: U+0378..0379,
380 U+0380..0383, U+038B, U+038D, U+03A2,
381 U+0530 ...)
382 \p{Aghb} \p{Caucasian_Albanian} (=
383 \p{Script_Extensions=
384 Caucasian_Albanian}) (NOT \p{Block=
385 Caucasian_Albanian}) (53)
386 \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
387 (22)
388 \p{AHex: *} \p{ASCII_Hex_Digit: *}
389 \p{Ahom} \p{Script_Extensions=Ahom} (NOT \p{Block=
390 Ahom}) (58)
391 X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block=
392 Alchemical_Symbols}) (128)
393 X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
394 \p{InAlchemical}) (128)
395 \p{All} All code points, including those above
396 Unicode. Same as qr/./s (1_114_112 plus
397 all above-Unicode code points:
398 U+0000..infinity)
399 \p{Alnum} \p{XPosixAlnum} (127_886)
400 \p{Alpha} \p{XPosixAlpha} (= \p{Alphabetic=Y})
401 (127_256)
402 \p{Alpha: *} \p{Alphabetic: *}
403 \p{Alphabetic} \p{XPosixAlpha} (= \p{Alphabetic=Y})
404 (127_256)
405 \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (986_856
406 plus all above-Unicode code points:
407 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
408 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
409 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
410 U+02C2..02C5, U+02D2..02DF,
411 U+02E5..02EB, U+02ED, U+02EF..0344 ...)
412 \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (127_256:
413 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
414 \xf6\xf8-\xff], U+0100..02C1,
415 U+02C6..02D1, U+02E0..02E4, U+02EC,
416 U+02EE ...)
417 X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (=
418 \p{Block=Alphabetic_Presentation_Forms})
419 (80)
420 X \p{Alphabetic_Presentation_Forms} \p{Block=
421 Alphabetic_Presentation_Forms} (Short:
422 \p{InAlphabeticPF}) (80)
423 \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
424 Anatolian_Hieroglyphs} (Short: \p{Hluw};
425 NOT \p{Block=Anatolian_Hieroglyphs})
426 (583)
427 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
428 \p{Block=
429 Ancient_Greek_Musical_Notation}) (80)
430 X \p{Ancient_Greek_Musical_Notation} \p{Block=
431 Ancient_Greek_Musical_Notation} (Short:
432 \p{InAncientGreekMusic}) (80)
433 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
434 X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64)
435 \p{Any} All Unicode code points (1_114_112:
436 U+0000..10FFFF)
437 \p{Arab} \p{Arabic} (= \p{Script_Extensions=
438 Arabic}) (NOT \p{Block=Arabic}) (1325)
439 \p{Arabic} \p{Script_Extensions=Arabic} (Short:
440 \p{Arab}; NOT \p{Block=Arabic}) (1325)
441 X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block=
442 Arabic_Extended_A}) (96)
443 X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short:
444 \p{InArabicExtA}) (96)
445 X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols}
446 (= \p{Block=
447 Arabic_Mathematical_Alphabetic_Symbols})
448 (256)
449 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
450 Arabic_Mathematical_Alphabetic_Symbols}
451 (Short: \p{InArabicMath}) (256)
452 X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (=
453 \p{Block=Arabic_Presentation_Forms_A})
454 (688)
455 X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (=
456 \p{Block=Arabic_Presentation_Forms_B})
457 (144)
458 X \p{Arabic_Presentation_Forms_A} \p{Block=
459 Arabic_Presentation_Forms_A} (Short:
460 \p{InArabicPFA}) (688)
461 X \p{Arabic_Presentation_Forms_B} \p{Block=
462 Arabic_Presentation_Forms_B} (Short:
463 \p{InArabicPFB}) (144)
464 X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block=
465 Arabic_Supplement}) (48)
466 X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short:
467 \p{InArabicSup}) (48)
468 \p{Armenian} \p{Script_Extensions=Armenian} (Short:
469 \p{Armn}; NOT \p{Block=Armenian}) (96)
470 \p{Armi} \p{Imperial_Aramaic} (=
471 \p{Script_Extensions=Imperial_Aramaic})
472 (NOT \p{Block=Imperial_Aramaic}) (31)
473 \p{Armn} \p{Armenian} (= \p{Script_Extensions=
474 Armenian}) (NOT \p{Block=Armenian}) (96)
475 X \p{Arrows} \p{Block=Arrows} (112)
476 \p{ASCII} \p{Block=Basic_Latin} (128)
477 \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
478 (22)
479 \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
480 plus all above-Unicode code points:
481 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
482 >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
483 U+0100..infinity)
484 \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
485 Fa-f])
486 \p{Assigned} All assigned code points (277_510:
487 U+0000..0377, U+037A..037F,
488 U+0384..038A, U+038C, U+038E..03A1,
489 U+03A3..052F ...)
490 \p{Avestan} \p{Script_Extensions=Avestan} (Short:
491 \p{Avst}; NOT \p{Block=Avestan}) (61)
492 \p{Avst} \p{Avestan} (= \p{Script_Extensions=
493 Avestan}) (NOT \p{Block=Avestan}) (61)
494 \p{Bali} \p{Balinese} (= \p{Script_Extensions=
495 Balinese}) (NOT \p{Block=Balinese}) (121)
496 \p{Balinese} \p{Script_Extensions=Balinese} (Short:
497 \p{Bali}; NOT \p{Block=Balinese}) (121)
498 \p{Bamu} \p{Bamum} (= \p{Script_Extensions=Bamum})
499 (NOT \p{Block=Bamum}) (657)
500 \p{Bamum} \p{Script_Extensions=Bamum} (Short:
501 \p{Bamu}; NOT \p{Block=Bamum}) (657)
502 X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block=
503 Bamum_Supplement}) (576)
504 X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short:
505 \p{InBamumSup}) (576)
506 X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128)
507 \p{Bass} \p{Bassa_Vah} (= \p{Script_Extensions=
508 Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
509 (36)
510 \p{Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
511 \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
512 \p{Batak} \p{Script_Extensions=Batak} (Short:
513 \p{Batk}; NOT \p{Block=Batak}) (56)
514 \p{Batk} \p{Batak} (= \p{Script_Extensions=Batak})
515 (NOT \p{Block=Batak}) (56)
516 \p{Bc: *} \p{Bidi_Class: *}
517 \p{Beng} \p{Bengali} (= \p{Script_Extensions=
518 Bengali}) (NOT \p{Block=Bengali}) (113)
519 \p{Bengali} \p{Script_Extensions=Bengali} (Short:
520 \p{Beng}; NOT \p{Block=Bengali}) (113)
521 \p{Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
522 \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
523 \p{Bhks} \p{Bhaiksuki} (= \p{Script_Extensions=
524 Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
525 (97)
526 \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y})
527 (12)
528 \p{Bidi_C: *} \p{Bidi_Control: *}
529 \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1698)
530 \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (61)
531 \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1698: U+0608,
532 U+060B, U+060D, U+061B..064A,
533 U+066D..066F, U+0671..06D5 ...)
534 \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (61:
535 U+0600..0605, U+0660..0669,
536 U+066B..066C, U+06DD, U+08E2,
537 U+10D30..10D39 ...)
538 \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7)
539 \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4016)
540 \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016:
541 [^\t\n\cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-
542 \xff], U+180E, U+200B..200D,
543 U+2060..2065, U+206A..206F, U+FDD0..FDEF
544 ...)
545 \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15:
546 [,.\/:\xa0], U+060C, U+202F, U+2044,
547 U+FE50, U+FE52 ...)
548 \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15)
549 \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (158)
550 \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12)
551 \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (92)
552 \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (158: [0-9\xb2-
553 \xb3\xb9], U+06F0..06F9, U+2070,
554 U+2074..2079, U+2080..2089, U+2488..249B
555 ...)
556 \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
557 U+207A..207B, U+208A..208B, U+2212,
558 U+FB29, U+FE62..FE63 ...)
559 \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92:
560 [#\$\%\xa2-\xa5\xb0-\xb1], U+058F,
561 U+0609..060A, U+066A, U+09F2..09F3,
562 U+09FB ...)
563 \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
564 U+2068)
565 \p{Bidi_Class: FSI} \p{Bidi_Class=First_Strong_Isolate} (1)
566 \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_096_767
567 plus all above-Unicode code points)
568 \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_767 plus
569 all above-Unicode code points: [A-Za-
570 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
571 \xff], U+0100..02B8, U+02BB..02C1,
572 U+02D0..02D1, U+02E0..02E4, U+02EE ...)
573 \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
574 U+202A)
575 \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
576 U+2066)
577 \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
578 U+202D)
579 \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1)
580 \p{Bidi_Class: LRI} \p{Bidi_Class=Left_To_Right_Isolate} (1)
581 \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1)
582 \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1834:
583 U+0300..036F, U+0483..0489,
584 U+0591..05BD, U+05BF, U+05C1..05C2,
585 U+05C4..05C5 ...)
586 \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1834)
587 \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (5658)
588 \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5658:
589 [!\"&\'\(\)*;<=
590 >?\@\[\\\]\^_`\{\|\}~\xa1\xa6-\xa9\xab-
591 \xac\xae-\xaf\xb4\xb6-\xb8\xbb-
592 \xbf\xd7\xf7], U+02B9..02BA,
593 U+02C2..02CF, U+02D2..02DF,
594 U+02E5..02ED, U+02EF..02FF ...)
595 \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7:
596 [\n\r\x1c-\x1e\x85], U+2029)
597 \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1)
598 \p{Bidi_Class: PDI} \p{Bidi_Class=Pop_Directional_Isolate} (1)
599 \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
600 U+202C)
601 \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
602 U+2069)
603 \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (3765)
604 \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3765: U+0590,
605 U+05BE, U+05C0, U+05C3, U+05C6,
606 U+05C8..05FF ...)
607 \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
608 U+202B)
609 \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
610 U+2067)
611 \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
612 U+202E)
613 \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1)
614 \p{Bidi_Class: RLI} \p{Bidi_Class=Right_To_Left_Isolate} (1)
615 \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1)
616 \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3)
617 \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3:
618 [\t\cK\x1f])
619 \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
620 U+1680, U+2000..200A, U+2028, U+205F,
621 U+3000)
622 \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (17)
623 \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
624 \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
625 plus all above-Unicode code points:
626 U+0000..061B, U+061D..200D,
627 U+2010..2029, U+202F..2065,
628 U+206A..infinity)
629 \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (12:
630 U+061C, U+200E..200F, U+202A..202E,
631 U+2066..2069)
632 \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
633 (545)
634 \p{Bidi_M: *} \p{Bidi_Mirrored: *}
635 \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
636 (545)
637 \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
638 plus all above-Unicode code points:
639 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
640 Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
641 \xff], U+0100..0F39, U+0F3E..169A,
642 U+169D..2038, U+203B..2044, U+2047..207C
643 ...)
644 \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (545:
645 [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
646 U+169B..169C, U+2039..203A,
647 U+2045..2046, U+207D..207E ...)
648 \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
649 (60)
650 \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60:
651 [\)\]\}], U+0F3B, U+0F3D, U+169C,
652 U+2046, U+207E ...)
653 \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
654 (1_113_992 plus all above-Unicode code
655 points)
656 \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
657 plus all above-Unicode code points:
658 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=
659 >?\@A-Z\\\^_`a-z\|~\x7f-\xff],
660 U+0100..0F39, U+0F3E..169A,
661 U+169D..2044, U+2047..207C, U+207F..208C
662 ...)
663 \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
664 (60)
665 \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60:
666 [\(\[\{], U+0F3A, U+0F3C, U+169B,
667 U+2045, U+207D ...)
668 \p{Blank} \p{XPosixBlank} (18)
669 \p{Blk: *} \p{Block: *}
670 \p{Block: Adlam} (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
671 U+1E900..1E95F)
672 \p{Block: Aegean_Numbers} (64: U+10100..1013F)
673 \p{Block: Ahom} (NOT \p{Ahom} NOR \p{Is_Ahom}) (64:
674 U+11700..1173F)
675 \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128)
676 \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
677 U+1F700..1F77F)
678 \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
679 (80)
680 \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
681 AlphabeticPF}) (80: U+FB00..FB4F)
682 \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
683 NOR \p{Is_Anatolian_Hieroglyphs}) (640:
684 U+14400..1467F)
685 \p{Block: Ancient_Greek_Music} \p{Block=
686 Ancient_Greek_Musical_Notation} (80)
687 \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
688 AncientGreekMusic}) (80: U+1D200..1D24F)
689 \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
690 \p{Block: Ancient_Symbols} (64: U+10190..101CF)
691 \p{Block: Arabic} (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
692 U+0600..06FF)
693 \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
694 \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
695 U+08A0..08FF)
696 \p{Block: Arabic_Math} \p{Block=
697 Arabic_Mathematical_Alphabetic_Symbols}
698 (256)
699 \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
700 ArabicMath}) (256: U+1EE00..1EEFF)
701 \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688)
702 \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144)
703 \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
704 (688: U+FB50..FDFF)
705 \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
706 (144: U+FE70..FEFF)
707 \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48)
708 \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
709 U+0750..077F)
710 \p{Block: Armenian} (NOT \p{Armenian} NOR \p{Is_Armenian})
711 (96: U+0530..058F)
712 \p{Block: Arrows} (112: U+2190..21FF)
713 \p{Block: ASCII} \p{Block=Basic_Latin} (128)
714 \p{Block: Avestan} (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
715 U+10B00..10B3F)
716 \p{Block: Balinese} (NOT \p{Balinese} NOR \p{Is_Balinese})
717 (128: U+1B00..1B7F)
718 \p{Block: Bamum} (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
719 U+A6A0..A6FF)
720 \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576)
721 \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
722 U+16800..16A3F)
723 \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
724 \p{Block: Bassa_Vah} (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
725 (48: U+16AD0..16AFF)
726 \p{Block: Batak} (NOT \p{Batak} NOR \p{Is_Batak}) (64:
727 U+1BC0..1BFF)
728 \p{Block: Bengali} (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
729 U+0980..09FF)
730 \p{Block: Bhaiksuki} (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
731 (112: U+11C00..11C6F)
732 \p{Block: Block_Elements} (32: U+2580..259F)
733 \p{Block: Bopomofo} (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
734 (48: U+3100..312F)
735 \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
736 \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
737 U+31A0..31BF)
738 \p{Block: Box_Drawing} (128: U+2500..257F)
739 \p{Block: Brahmi} (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
740 U+11000..1107F)
741 \p{Block: Braille} \p{Block=Braille_Patterns} (256)
742 \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
743 U+2800..28FF)
744 \p{Block: Buginese} (NOT \p{Buginese} NOR \p{Is_Buginese})
745 (32: U+1A00..1A1F)
746 \p{Block: Buhid} (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
747 U+1740..175F)
748 \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
749 (256)
750 \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
751 ByzantineMusic}) (256: U+1D000..1D0FF)
752 \p{Block: Canadian_Syllabics} \p{Block=
753 Unified_Canadian_Aboriginal_Syllabics}
754 (640)
755 \p{Block: Carian} (NOT \p{Carian} NOR \p{Is_Carian}) (64:
756 U+102A0..102DF)
757 \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
758 \p{Is_Caucasian_Albanian}) (64:
759 U+10530..1056F)
760 \p{Block: Chakma} (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
761 U+11100..1114F)
762 \p{Block: Cham} (NOT \p{Cham} NOR \p{Is_Cham}) (96:
763 U+AA00..AA5F)
764 \p{Block: Cherokee} (NOT \p{Cherokee} NOR \p{Is_Cherokee})
765 (96: U+13A0..13FF)
766 \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
767 \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
768 U+AB70..ABBF)
769 \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
770 \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992)
771 \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256)
772 \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
773 \p{Block: CJK_Compat_Ideographs} \p{Block=
774 CJK_Compatibility_Ideographs} (512)
775 \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
776 CJK_Compatibility_Ideographs_Supplement}
777 (544)
778 \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
779 U+3300..33FF)
780 \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
781 (32: U+FE30..FE4F)
782 \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
783 CJKCompatIdeographs}) (512: U+F900..FAFF)
784 \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
785 CJKCompatIdeographsSup}) (544:
786 U+2F800..2FA1F)
787 \p{Block: CJK_Ext_A} \p{Block=
788 CJK_Unified_Ideographs_Extension_A}
789 (6592)
790 \p{Block: CJK_Ext_B} \p{Block=
791 CJK_Unified_Ideographs_Extension_B}
792 (42_720)
793 \p{Block: CJK_Ext_C} \p{Block=
794 CJK_Unified_Ideographs_Extension_C}
795 (4160)
796 \p{Block: CJK_Ext_D} \p{Block=
797 CJK_Unified_Ideographs_Extension_D} (224)
798 \p{Block: CJK_Ext_E} \p{Block=
799 CJK_Unified_Ideographs_Extension_E}
800 (5776)
801 \p{Block: CJK_Ext_F} \p{Block=
802 CJK_Unified_Ideographs_Extension_F}
803 (7488)
804 \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
805 \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
806 (128: U+2E80..2EFF)
807 \p{Block: CJK_Strokes} (48: U+31C0..31EF)
808 \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64)
809 \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
810 (64: U+3000..303F)
811 \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
812 U+4E00..9FFF)
813 \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
814 CJKExtA}) (6592: U+3400..4DBF)
815 \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
816 CJKExtB}) (42_720: U+20000..2A6DF)
817 \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
818 CJKExtC}) (4160: U+2A700..2B73F)
819 \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
820 CJKExtD}) (224: U+2B740..2B81F)
821 \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
822 CJKExtE}) (5776: U+2B820..2CEAF)
823 \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
824 CJKExtF}) (7488: U+2CEB0..2EBEF)
825 \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
826 Diacriticals}) (112: U+0300..036F)
827 \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
828 DiacriticalsExt}) (80: U+1AB0..1AFF)
829 \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
830 DiacriticalsForSymbols}) (48:
831 U+20D0..20FF)
832 \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
833 DiacriticalsSup}) (64: U+1DC0..1DFF)
834 \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
835 U+FE20..FE2F)
836 \p{Block: Combining_Marks_For_Symbols} \p{Block=
837 Combining_Diacritical_Marks_For_Symbols}
838 (48)
839 \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
840 IndicNumberForms}) (16: U+A830..A83F)
841 \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96)
842 \p{Block: Control_Pictures} (64: U+2400..243F)
843 \p{Block: Coptic} (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
844 U+2C80..2CFF)
845 \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
846 \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
847 \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
848 U+1D360..1D37F)
849 \p{Block: Cuneiform} (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
850 (1024: U+12000..123FF)
851 \p{Block: Cuneiform_Numbers} \p{Block=
852 Cuneiform_Numbers_And_Punctuation} (128)
853 \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
854 CuneiformNumbers}) (128: U+12400..1247F)
855 \p{Block: Currency_Symbols} (48: U+20A0..20CF)
856 \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
857 \p{Block: Cyrillic} (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
858 (256: U+0400..04FF)
859 \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
860 \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
861 \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
862 \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
863 U+2DE0..2DFF)
864 \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
865 U+A640..A69F)
866 \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
867 U+1C80..1C8F)
868 \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
869 \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
870 U+0500..052F)
871 \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
872 (48)
873 \p{Block: Deseret} (80: U+10400..1044F)
874 \p{Block: Devanagari} (NOT \p{Devanagari} NOR \p{Is_Devanagari})
875 (128: U+0900..097F)
876 \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
877 \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
878 U+A8E0..A8FF)
879 \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
880 \p{Block: Diacriticals_Ext} \p{Block=
881 Combining_Diacritical_Marks_Extended}
882 (80)
883 \p{Block: Diacriticals_For_Symbols} \p{Block=
884 Combining_Diacritical_Marks_For_Symbols}
885 (48)
886 \p{Block: Diacriticals_Sup} \p{Block=
887 Combining_Diacritical_Marks_Supplement}
888 (64)
889 \p{Block: Dingbats} (192: U+2700..27BF)
890 \p{Block: Dogra} (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
891 U+11800..1184F)
892 \p{Block: Domino} \p{Block=Domino_Tiles} (112)
893 \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
894 U+1F030..1F09F)
895 \p{Block: Duployan} (NOT \p{Duployan} NOR \p{Is_Duployan})
896 (160: U+1BC00..1BC9F)
897 \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
898 \p{Block: Egyptian_Hieroglyph_Format_Controls} (16: U+13430..1343F)
899 \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
900 \p{Is_Egyptian_Hieroglyphs}) (1072:
901 U+13000..1342F)
902 \p{Block: Elbasan} (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
903 U+10500..1052F)
904 \p{Block: Elymaic} (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
905 U+10FE0..10FFF)
906 \p{Block: Emoticons} (80: U+1F600..1F64F)
907 \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
908 \p{Block: Enclosed_Alphanum_Sup} \p{Block=
909 Enclosed_Alphanumeric_Supplement} (256)
910 \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
911 EnclosedAlphanumSup}) (256:
912 U+1F100..1F1FF)
913 \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
914 EnclosedAlphanum}) (160: U+2460..24FF)
915 \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
916 (256)
917 \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
918 EnclosedCJK}) (256: U+3200..32FF)
919 \p{Block: Enclosed_Ideographic_Sup} \p{Block=
920 Enclosed_Ideographic_Supplement} (256)
921 \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
922 EnclosedIdeographicSup}) (256:
923 U+1F200..1F2FF)
924 \p{Block: Ethiopic} (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
925 (384: U+1200..137F)
926 \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
927 \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
928 \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
929 U+2D80..2DDF)
930 \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
931 U+AB00..AB2F)
932 \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
933 \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
934 U+1380..139F)
935 \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
936 \p{Punct} NOR \p{Is_Punctuation}) (112:
937 U+2000..206F)
938 \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
939 \p{Block: Geometric_Shapes_Ext} \p{Block=
940 Geometric_Shapes_Extended} (128)
941 \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
942 GeometricShapesExt}) (128:
943 U+1F780..1F7FF)
944 \p{Block: Georgian} (NOT \p{Georgian} NOR \p{Is_Georgian})
945 (96: U+10A0..10FF)
946 \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
947 \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
948 U+1C90..1CBF)
949 \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
950 \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
951 U+2D00..2D2F)
952 \p{Block: Glagolitic} (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
953 (96: U+2C00..2C5F)
954 \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
955 \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
956 (48: U+1E000..1E02F)
957 \p{Block: Gothic} (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
958 U+10330..1034F)
959 \p{Block: Grantha} (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
960 U+11300..1137F)
961 \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek}
962 NOR \p{Is_Greek}) (144)
963 \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
964 NOR \p{Is_Greek}) (144: U+0370..03FF)
965 \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256)
966 \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
967 U+1F00..1FFF)
968 \p{Block: Gujarati} (NOT \p{Gujarati} NOR \p{Is_Gujarati})
969 (128: U+0A80..0AFF)
970 \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
971 \p{Is_Gunjala_Gondi}) (80:
972 U+11D60..11DAF)
973 \p{Block: Gurmukhi} (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
974 (128: U+0A00..0A7F)
975 \p{Block: Half_And_Full_Forms} \p{Block=
976 Halfwidth_And_Fullwidth_Forms} (240)
977 \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16)
978 \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
979 HalfAndFullForms}) (240: U+FF00..FFEF)
980 \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
981 NOR \p{Is_Hangul}) (11_184)
982 \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
983 (96: U+3130..318F)
984 \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
985 \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
986 U+A960..A97F)
987 \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
988 U+D7B0..D7FF)
989 \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
990 NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
991 \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
992 \p{Is_Hanifi_Rohingya}) (64:
993 U+10D00..10D3F)
994 \p{Block: Hanunoo} (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
995 U+1720..173F)
996 \p{Block: Hatran} (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
997 U+108E0..108FF)
998 \p{Block: Hebrew} (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
999 U+0590..05FF)
1000 \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
1001 HighPUSurrogates}) (128: U+DB80..DBFF)
1002 \p{Block: High_PU_Surrogates} \p{Block=
1003 High_Private_Use_Surrogates} (128)
1004 \p{Block: High_Surrogates} (896: U+D800..DB7F)
1005 \p{Block: Hiragana} (NOT \p{Hiragana} NOR \p{Is_Hiragana})
1006 (96: U+3040..309F)
1007 \p{Block: IDC} \p{Block=
1008 Ideographic_Description_Characters} (NOT
1009 \p{ID_Continue} NOR \p{Is_IDC}) (16)
1010 \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
1011 NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
1012 U+2FF0..2FFF)
1013 \p{Block: Ideographic_Symbols} \p{Block=
1014 Ideographic_Symbols_And_Punctuation} (32)
1015 \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
1016 IdeographicSymbols}) (32: U+16FE0..16FFF)
1017 \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
1018 \p{Is_Imperial_Aramaic}) (32:
1019 U+10840..1085F)
1020 \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1021 (16)
1022 \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
1023 \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
1024 NOR \p{Is_Inscriptional_Pahlavi}) (32:
1025 U+10B60..10B7F)
1026 \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
1027 NOR \p{Is_Inscriptional_Parthian}) (32:
1028 U+10B40..10B5F)
1029 \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
1030 \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
1031 U+0250..02AF)
1032 \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
1033 \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
1034 \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
1035 \p{Block: Javanese} (NOT \p{Javanese} NOR \p{Is_Javanese})
1036 (96: U+A980..A9DF)
1037 \p{Block: Kaithi} (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
1038 U+11080..110CF)
1039 \p{Block: Kana_Ext_A} \p{Block=Kana_Extended_A} (48)
1040 \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
1041 U+1B100..1B12F)
1042 \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
1043 \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
1044 U+1B000..1B0FF)
1045 \p{Block: Kanbun} (16: U+3190..319F)
1046 \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
1047 \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
1048 U+2F00..2FDF)
1049 \p{Block: Kannada} (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
1050 U+0C80..0CFF)
1051 \p{Block: Katakana} (NOT \p{Katakana} NOR \p{Is_Katakana})
1052 (96: U+30A0..30FF)
1053 \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
1054 \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
1055 KatakanaExt}) (16: U+31F0..31FF)
1056 \p{Block: Kayah_Li} (48: U+A900..A92F)
1057 \p{Block: Kharoshthi} (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
1058 (96: U+10A00..10A5F)
1059 \p{Block: Khmer} (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
1060 U+1780..17FF)
1061 \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
1062 \p{Block: Khojki} (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
1063 U+11200..1124F)
1064 \p{Block: Khudawadi} (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
1065 (80: U+112B0..112FF)
1066 \p{Block: Lao} (NOT \p{Lao} NOR \p{Is_Lao}) (128:
1067 U+0E80..0EFF)
1068 \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
1069 \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
1070 \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
1071 \xff])
1072 \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
1073 \p{Block: Latin_Ext_Additional} \p{Block=
1074 Latin_Extended_Additional} (256)
1075 \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
1076 \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
1077 \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
1078 \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64)
1079 \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
1080 U+0100..017F)
1081 \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
1082 LatinExtAdditional}) (256: U+1E00..1EFF)
1083 \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
1084 U+0180..024F)
1085 \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
1086 U+2C60..2C7F)
1087 \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
1088 U+A720..A7FF)
1089 \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
1090 U+AB30..AB6F)
1091 \p{Block: Lepcha} (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
1092 U+1C00..1C4F)
1093 \p{Block: Letterlike_Symbols} (80: U+2100..214F)
1094 \p{Block: Limbu} (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
1095 U+1900..194F)
1096 \p{Block: Linear_A} (NOT \p{Linear_A} NOR \p{Is_Linear_A})
1097 (384: U+10600..1077F)
1098 \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
1099 \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
1100 \p{Block: Lisu} (48: U+A4D0..A4FF)
1101 \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
1102 \p{Block: Lycian} (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
1103 U+10280..1029F)
1104 \p{Block: Lydian} (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
1105 U+10920..1093F)
1106 \p{Block: Mahajani} (NOT \p{Mahajani} NOR \p{Is_Mahajani})
1107 (48: U+11150..1117F)
1108 \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
1109 \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
1110 U+1F000..1F02F)
1111 \p{Block: Makasar} (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
1112 U+11EE0..11EFF)
1113 \p{Block: Malayalam} (NOT \p{Malayalam} NOR \p{Is_Malayalam})
1114 (128: U+0D00..0D7F)
1115 \p{Block: Mandaic} (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
1116 U+0840..085F)
1117 \p{Block: Manichaean} (NOT \p{Manichaean} NOR \p{Is_Manichaean})
1118 (64: U+10AC0..10AFF)
1119 \p{Block: Marchen} (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
1120 U+11C70..11CBF)
1121 \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
1122 \p{Is_Masaram_Gondi}) (96:
1123 U+11D00..11D5F)
1124 \p{Block: Math_Alphanum} \p{Block=
1125 Mathematical_Alphanumeric_Symbols} (1024)
1126 \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
1127 \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
1128 MathAlphanum}) (1024: U+1D400..1D7FF)
1129 \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
1130 (256: U+2200..22FF)
1131 \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF)
1132 \p{Block: Medefaidrin} (NOT \p{Medefaidrin} NOR
1133 \p{Is_Medefaidrin}) (96: U+16E40..16E9F)
1134 \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
1135 \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
1136 \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
1137 \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
1138 (32: U+AAE0..AAFF)
1139 \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
1140 \p{Is_Mende_Kikakui}) (224:
1141 U+1E800..1E8DF)
1142 \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
1143 \p{Is_Meroitic_Cursive}) (96:
1144 U+109A0..109FF)
1145 \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
1146 \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160:
1147 U+16F00..16F9F)
1148 \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
1149 (256)
1150 \p{Block: Misc_Math_Symbols_A} \p{Block=
1151 Miscellaneous_Mathematical_Symbols_A}
1152 (48)
1153 \p{Block: Misc_Math_Symbols_B} \p{Block=
1154 Miscellaneous_Mathematical_Symbols_B}
1155 (128)
1156 \p{Block: Misc_Pictographs} \p{Block=
1157 Miscellaneous_Symbols_And_Pictographs}
1158 (768)
1159 \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
1160 \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
1161 \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
1162 MiscMathSymbolsA}) (48: U+27C0..27EF)
1163 \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
1164 MiscMathSymbolsB}) (128: U+2980..29FF)
1165 \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
1166 (256: U+2600..26FF)
1167 \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
1168 MiscArrows}) (256: U+2B00..2BFF)
1169 \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
1170 MiscPictographs}) (768: U+1F300..1F5FF)
1171 \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
1172 (256: U+2300..23FF)
1173 \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96:
1174 U+11600..1165F)
1175 \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
1176 \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
1177 \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian})
1178 (176: U+1800..18AF)
1179 \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
1180 \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
1181 U+11660..1167F)
1182 \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48:
1183 U+16A40..16A6F)
1184 \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48:
1185 U+11280..112AF)
1186 \p{Block: Music} \p{Block=Musical_Symbols} (256)
1187 \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
1188 U+1D100..1D1FF)
1189 \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
1190 U+1000..109F)
1191 \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
1192 \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
1193 \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
1194 U+AA60..AA7F)
1195 \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
1196 U+A9E0..A9FF)
1197 \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean})
1198 (48: U+10880..108AF)
1199 \p{Block: Nandinagari} (NOT \p{Nandinagari} NOR
1200 \p{Is_Nandinagari}) (96: U+119A0..119FF)
1201 \p{Block: NB} \p{Block=No_Block} (832_720 plus all
1202 above-Unicode code points)
1203 \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR
1204 \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
1205 \p{Block: Newa} (NOT \p{Newa} NOR \p{Is_Newa}) (128:
1206 U+11400..1147F)
1207 \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64:
1208 U+07C0..07FF)
1209 \p{Block: No_Block} (Short: \p{Blk=NB}) (832_720 plus all
1210 above-Unicode code points: U+0870..089F,
1211 U+2FE0..2FEF, U+10200..1027F,
1212 U+103E0..103FF, U+10570..105FF,
1213 U+10780..107FF ...)
1214 \p{Block: Number_Forms} (64: U+2150..218F)
1215 \p{Block: Nushu} (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
1216 U+1B170..1B2FF)
1217 \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong}
1218 NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80:
1219 U+1E100..1E14F)
1220 \p{Block: OCR} \p{Block=Optical_Character_Recognition}
1221 (32)
1222 \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
1223 U+1680..169F)
1224 \p{Block: Ol_Chiki} (48: U+1C50..1C7F)
1225 \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
1226 \p{Is_Old_Hungarian}) (128:
1227 U+10C80..10CFF)
1228 \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
1229 (48: U+10300..1032F)
1230 \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
1231 \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
1232 (48: U+10350..1037F)
1233 \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR
1234 \p{Is_Old_Persian}) (64: U+103A0..103DF)
1235 \p{Block: Old_Sogdian} (NOT \p{Old_Sogdian} NOR
1236 \p{Is_Old_Sogdian}) (48: U+10F00..10F2F)
1237 \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
1238 \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
1239 (80: U+10C00..10C4F)
1240 \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
1241 U+2440..245F)
1242 \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
1243 U+0B00..0B7F)
1244 \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
1245 \p{Block: Osage} (NOT \p{Osage} NOR \p{Is_Osage}) (80:
1246 U+104B0..104FF)
1247 \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
1248 U+10480..104AF)
1249 \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F)
1250 \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
1251 \p{Is_Pahawh_Hmong}) (144:
1252 U+16B00..16B8F)
1253 \p{Block: Palmyrene} (32: U+10860..1087F)
1254 \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} NOR
1255 \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
1256 \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
1257 (64: U+A840..A87F)
1258 \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
1259 \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
1260 U+101D0..101FF)
1261 \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician})
1262 (32: U+10900..1091F)
1263 \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
1264 \p{Block: Phonetic_Ext_Sup} \p{Block=
1265 Phonetic_Extensions_Supplement} (64)
1266 \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
1267 U+1D00..1D7F)
1268 \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
1269 PhoneticExtSup}) (64: U+1D80..1DBF)
1270 \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
1271 \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
1272 \p{Private_Use} NOR \p{Is_Private_Use})
1273 (6400)
1274 \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
1275 \p{Private_Use} NOR \p{Is_Private_Use})
1276 (6400: U+E000..F8FF)
1277 \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
1278 \p{Is_Psalter_Pahlavi}) (48:
1279 U+10B80..10BAF)
1280 \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
1281 \p{Private_Use} NOR \p{Is_Private_Use})
1282 (6400)
1283 \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
1284 \p{Punct} NOR \p{Is_Punctuation}) (112)
1285 \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
1286 U+A930..A95F)
1287 \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
1288 \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
1289 U+10E60..10E7F)
1290 \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96:
1291 U+16A0..16FF)
1292 \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan})
1293 (64: U+0800..083F)
1294 \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
1295 (96: U+A880..A8DF)
1296 \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
1297 U+11180..111DF)
1298 \p{Block: Shavian} (48: U+10450..1047F)
1299 \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
1300 \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
1301 U+11580..115FF)
1302 \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
1303 U+0D80..0DFF)
1304 \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
1305 \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
1306 U+FE50..FE6F)
1307 \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
1308 \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64)
1309 \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64:
1310 U+1B130..1B16F)
1311 \p{Block: Sogdian} (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64:
1312 U+10F30..10F6F)
1313 \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
1314 \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
1315 \p{Block: Soyombo} (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
1316 U+11A50..11AAF)
1317 \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
1318 ModifierLetters}) (80: U+02B0..02FF)
1319 \p{Block: Specials} (16: U+FFF0..FFFF)
1320 \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese})
1321 (64: U+1B80..1BBF)
1322 \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
1323 \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
1324 U+1CC0..1CCF)
1325 \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
1326 \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
1327 \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
1328 \p{Block: Sup_Math_Operators} \p{Block=
1329 Supplemental_Mathematical_Operators}
1330 (256)
1331 \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
1332 (65_536)
1333 \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
1334 (65_536)
1335 \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
1336 \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
1337 Supplemental_Symbols_And_Pictographs}
1338 (256)
1339 \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
1340 \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
1341 SuperAndSub}) (48: U+2070..209F)
1342 \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
1343 U+27F0..27FF)
1344 \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
1345 U+2900..297F)
1346 \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
1347 U+1F800..1F8FF)
1348 \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
1349 SupMathOperators}) (256: U+2A00..2AFF)
1350 \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
1351 SupPunctuation}) (128: U+2E00..2E7F)
1352 \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
1353 SupSymbolsAndPictographs}) (256:
1354 U+1F900..1F9FF)
1355 \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
1356 SupPUAA}) (65_536: U+F0000..FFFFF)
1357 \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
1358 SupPUAB}) (65_536: U+100000..10FFFF)
1359 \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
1360 \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
1361 \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
1362 \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block=
1363 Symbols_And_Pictographs_Extended_A} (144)
1364 \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk=
1365 SymbolsAndPictographsExtA}) (144:
1366 U+1FA70..1FAFF)
1367 \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
1368 U+0700..074F)
1369 \p{Block: Syriac_Sup} \p{Block=Syriac_Supplement} (16)
1370 \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
1371 U+0860..086F)
1372 \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
1373 U+1700..171F)
1374 \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
1375 (32: U+1760..177F)
1376 \p{Block: Tags} (128: U+E0000..E007F)
1377 \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
1378 U+1950..197F)
1379 \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
1380 (144: U+1A20..1AAF)
1381 \p{Block: Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
1382 (96: U+AA80..AADF)
1383 \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1384 \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
1385 U+1D300..1D35F)
1386 \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80:
1387 U+11680..116CF)
1388 \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
1389 U+0B80..0BFF)
1390 \p{Block: Tamil_Sup} \p{Block=Tamil_Supplement} (64)
1391 \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
1392 U+11FC0..11FFF)
1393 \p{Block: Tangut} (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
1394 U+17000..187FF)
1395 \p{Block: Tangut_Components} (768: U+18800..18AFF)
1396 \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
1397 U+0C00..0C7F)
1398 \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
1399 U+0780..07BF)
1400 \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128:
1401 U+0E00..0E7F)
1402 \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
1403 U+0F00..0FFF)
1404 \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
1405 (80: U+2D30..2D7F)
1406 \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
1407 U+11480..114DF)
1408 \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1409 (128)
1410 \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1411 TransportAndMap}) (128: U+1F680..1F6FF)
1412 \p{Block: UCAS} \p{Block=
1413 Unified_Canadian_Aboriginal_Syllabics}
1414 (640)
1415 \p{Block: UCAS_Ext} \p{Block=
1416 Unified_Canadian_Aboriginal_Syllabics_-
1417 Extended} (80)
1418 \p{Block: Ugaritic} (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
1419 (32: U+10380..1039F)
1420 \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1421 UCAS}) (640: U+1400..167F)
1422 \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1423 \p{Blk=UCASExt}) (80: U+18B0..18FF)
1424 \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320:
1425 U+A500..A63F)
1426 \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1427 \p{Variation_Selector} NOR \p{Is_VS})
1428 (16: U+FE00..FE0F)
1429 \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1430 (240: U+E0100..E01EF)
1431 \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
1432 \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
1433 U+1CD0..1CFF)
1434 \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
1435 \p{Block: VS} \p{Block=Variation_Selectors} (NOT
1436 \p{Variation_Selector} NOR \p{Is_VS})
1437 (16)
1438 \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
1439 (240)
1440 \p{Block: Wancho} (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
1441 U+1E2C0..1E2FF)
1442 \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR
1443 \p{Is_Warang_Citi}) (96: U+118A0..118FF)
1444 \p{Block: Yi_Radicals} (64: U+A490..A4CF)
1445 \p{Block: Yi_Syllables} (1168: U+A000..A48F)
1446 \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
1447 \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
1448 U+4DC0..4DFF)
1449 \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
1450 \p{Is_Zanabazar_Square}) (80:
1451 U+11A00..11A4F)
1452 X \p{Block_Elements} \p{Block=Block_Elements} (32)
1453 \p{Bopo} \p{Bopomofo} (= \p{Script_Extensions=
1454 Bopomofo}) (NOT \p{Block=Bopomofo}) (112)
1455 \p{Bopomofo} \p{Script_Extensions=Bopomofo} (Short:
1456 \p{Bopo}; NOT \p{Block=Bopomofo}) (112)
1457 X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
1458 Bopomofo_Extended}) (32)
1459 X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
1460 \p{InBopomofoExt}) (32)
1461 X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
1462 \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
1463 \p{Brah} \p{Brahmi} (= \p{Script_Extensions=
1464 Brahmi}) (NOT \p{Block=Brahmi}) (109)
1465 \p{Brahmi} \p{Script_Extensions=Brahmi} (Short:
1466 \p{Brah}; NOT \p{Block=Brahmi}) (109)
1467 \p{Brai} \p{Braille} (= \p{Script_Extensions=
1468 Braille}) (256)
1469 \p{Braille} \p{Script_Extensions=Braille} (Short:
1470 \p{Brai}) (256)
1471 X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
1472 \p{InBraille}) (256)
1473 \p{Bugi} \p{Buginese} (= \p{Script_Extensions=
1474 Buginese}) (NOT \p{Block=Buginese}) (31)
1475 \p{Buginese} \p{Script_Extensions=Buginese} (Short:
1476 \p{Bugi}; NOT \p{Block=Buginese}) (31)
1477 \p{Buhd} \p{Buhid} (= \p{Script_Extensions=Buhid})
1478 (NOT \p{Block=Buhid}) (22)
1479 \p{Buhid} \p{Script_Extensions=Buhid} (Short:
1480 \p{Buhd}; NOT \p{Block=Buhid}) (22)
1481 X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
1482 Byzantine_Musical_Symbols}) (256)
1483 X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1484 (Short: \p{InByzantineMusic}) (256)
1485 \p{C} \pC \p{Other} (= \p{General_Category=Other})
1486 (976_344 plus all above-Unicode code
1487 points)
1488 \p{Cakm} \p{Chakma} (= \p{Script_Extensions=
1489 Chakma}) (NOT \p{Block=Chakma}) (90)
1490 \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1491 (Short: \p{Cans}) (710)
1492 X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
1493 (= \p{Block=
1494 Unified_Canadian_Aboriginal_Syllabics})
1495 (640)
1496 T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1497 Not_Reordered} (1_113_250 plus all
1498 above-Unicode code points)
1499 T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1500 Overlay} (32)
1501 T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1502 Nukta} (25)
1503 T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1504 Kana_Voicing} (2)
1505 T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1506 Virama} (58)
1507 T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1508 CCC10} (1)
1509 \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
1510 U+05B0)
1511 T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1512 CCC11} (1)
1513 \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
1514 U+05B1)
1515 T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1516 CCC12} (1)
1517 \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
1518 U+05B2)
1519 T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1520 CCC13} (1)
1521 \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
1522 U+05B3)
1523 T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1524 CCC14} (1)
1525 \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
1526 U+05B4)
1527 T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1528 CCC15} (1)
1529 \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
1530 U+05B5)
1531 T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1532 CCC16} (1)
1533 \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
1534 U+05B6)
1535 T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1536 CCC17} (1)
1537 \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
1538 U+05B7)
1539 T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1540 CCC18} (2)
1541 \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
1542 U+05B8, U+05C7)
1543 T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1544 CCC19} (2)
1545 \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
1546 U+05B9..05BA)
1547 T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1548 CCC20} (1)
1549 \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
1550 U+05BB)
1551 T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1552 CCC21} (1)
1553 \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
1554 U+05BC)
1555 T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1556 CCC22} (1)
1557 \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
1558 U+05BD)
1559 T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1560 CCC23} (1)
1561 \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
1562 U+05BF)
1563 T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1564 CCC24} (1)
1565 \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
1566 U+05C1)
1567 T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1568 CCC25} (1)
1569 \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
1570 U+05C2)
1571 T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1572 CCC26} (1)
1573 \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
1574 U+FB1E)
1575 T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1576 CCC27} (2)
1577 \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
1578 U+064B, U+08F0)
1579 T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1580 CCC28} (2)
1581 \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
1582 U+064C, U+08F1)
1583 T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1584 CCC29} (2)
1585 \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
1586 U+064D, U+08F2)
1587 T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1588 CCC30} (2)
1589 \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
1590 U+0618, U+064E)
1591 T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1592 CCC31} (2)
1593 \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
1594 U+0619, U+064F)
1595 T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1596 CCC32} (2)
1597 \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
1598 U+061A, U+0650)
1599 T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1600 CCC33} (1)
1601 \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
1602 U+0651)
1603 T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1604 CCC34} (1)
1605 \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
1606 U+0652)
1607 T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1608 CCC35} (1)
1609 \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
1610 U+0670)
1611 T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1612 CCC36} (1)
1613 \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
1614 U+0711)
1615 T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1616 CCC84} (1)
1617 \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
1618 U+0C55)
1619 T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1620 CCC91} (1)
1621 \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
1622 U+0C56)
1623 T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1624 CCC103} (2)
1625 \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
1626 U+0E38..0E39)
1627 T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1628 CCC107} (4)
1629 \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
1630 U+0E48..0E4B)
1631 T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1632 CCC118} (2)
1633 \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
1634 U+0EB8..0EB9)
1635 T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1636 CCC122} (4)
1637 \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
1638 U+0EC8..0ECB)
1639 T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1640 CCC129} (1)
1641 \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
1642 U+0F71)
1643 T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1644 CCC130} (6)
1645 \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
1646 U+0F72, U+0F7A..0F7D, U+0F80)
1647 T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1648 CCC132} (1)
1649 \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
1650 U+0F74)
1651 T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1652 CCC133} (0)
1653 \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1654 T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1655 Attached_Below_Left} (0)
1656 T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1657 Attached_Below} (5)
1658 T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1659 Attached_Above} (1)
1660 T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1661 Attached_Above_Right} (9)
1662 T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1663 Below_Left} (1)
1664 T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1665 Below} (163)
1666 T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1667 Below_Right} (4)
1668 T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1669 Left} (2)
1670 T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1671 Right} (1)
1672 T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1673 Above_Left} (5)
1674 T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1675 Above} (482)
1676 T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1677 Above_Right} (5)
1678 T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1679 Double_Below} (4)
1680 T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1681 Double_Above} (5)
1682 T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1683 Iota_Subscript} (1)
1684 \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1685 Above} (482)
1686 \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (482:
1687 U+0300..0314, U+033D..0344, U+0346,
1688 U+034A..034C, U+0350..0352, U+0357 ...)
1689 \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
1690 U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
1691 \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (5:
1692 U+0315, U+031A, U+0358, U+1DF6, U+302C)
1693 \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1694 Above_Left} (5)
1695 \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1696 Above_Right} (5)
1697 \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1698 Attached_Above} (1)
1699 \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1700 Attached_Above_Right} (9)
1701 \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1702 Attached_Below} (5)
1703 \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1704 Attached_Below_Left} (0)
1705 \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1706 (1: U+1DCE)
1707 \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1708 \p{Ccc=ATAR}) (9: U+031B, U+0F39,
1709 U+1D165..1D166, U+1D16E..1D172)
1710 \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1711 (5: U+0321..0322, U+0327..0328, U+1DD0)
1712 \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1713 ATBL}) (0)
1714 \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1715 Below} (163)
1716 \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (163:
1717 U+0316..0319, U+031C..0320,
1718 U+0323..0326, U+0329..0333,
1719 U+0339..033C, U+0347..0349 ...)
1720 \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1:
1721 U+302A)
1722 \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
1723 U+059A, U+05AD, U+1939, U+302D)
1724 \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1725 Below_Left} (1)
1726 \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1727 Below_Right} (4)
1728 \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1729 Double_Above} (5)
1730 \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1731 Double_Below} (4)
1732 \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
1733 (5: U+035D..035E, U+0360..0361, U+1DCD)
1734 \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
1735 (4: U+035C, U+035F, U+0362, U+1DFC)
1736 \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1737 (1: U+0345)
1738 \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1739 Iota_Subscript} (1)
1740 \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
1741 (2: U+3099..309A)
1742 \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1743 Kana_Voicing} (2)
1744 \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1745 Left} (2)
1746 \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
1747 U+302E..302F)
1748 \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1749 Nukta} (25)
1750 \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1751 (1_113_250 plus all above-Unicode code
1752 points: U+0000..02FF, U+034F,
1753 U+0370..0482, U+0488..0590, U+05BE,
1754 U+05C0 ...)
1755 \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1756 Not_Reordered} (1_113_250 plus all
1757 above-Unicode code points)
1758 \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (25:
1759 U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
1760 U+0CBC ...)
1761 \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1762 Overlay} (32)
1763 \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
1764 U+0334..0338, U+1CD4, U+1CE2..1CE8,
1765 U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
1766 ...)
1767 \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1768 Right} (1)
1769 \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
1770 U+1D16D)
1771 \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (58:
1772 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
1773 U+0BCD ...)
1774 \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1775 Virama} (58)
1776 \p{Cans} \p{Canadian_Aboriginal} (=
1777 \p{Script_Extensions=
1778 Canadian_Aboriginal}) (710)
1779 \p{Cari} \p{Carian} (= \p{Script_Extensions=
1780 Carian}) (NOT \p{Block=Carian}) (49)
1781 \p{Carian} \p{Script_Extensions=Carian} (Short:
1782 \p{Cari}; NOT \p{Block=Carian}) (49)
1783 \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2396)
1784 \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_716 plus
1785 all above-Unicode code points: [\x00-
1786 \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-
1787 Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
1788 \xac\xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
1789 U+0100..02AF, U+0370..0373,
1790 U+0376..0379, U+037B..0383, U+0386 ...)
1791 \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2396:
1792 [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
1793 U+02B0..036F, U+0374..0375, U+037A,
1794 U+0384..0385, U+0387 ...)
1795 \p{Cased} \p{Cased=Y} (4279)
1796 \p{Cased: N*} (Single: \P{Cased}) (1_109_833 plus all
1797 above-Unicode code points: [\x00-
1798 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1799 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
1800 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7], U+01BB,
1801 U+01C0..01C3, U+0294, U+02B9..02BF,
1802 U+02C2..02DF ...)
1803 \p{Cased: Y*} (Single: \p{Cased}) (4279: [A-Za-
1804 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
1805 \xff], U+0100..01BA, U+01BC..01BF,
1806 U+01C4..0293, U+0295..02B8, U+02C0..02C1
1807 ...)
1808 \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
1809 \p{LC}) (3970)
1810 \p{Category: *} \p{General_Category: *}
1811 \p{Caucasian_Albanian} \p{Script_Extensions=Caucasian_Albanian}
1812 (Short: \p{Aghb}; NOT \p{Block=
1813 Caucasian_Albanian}) (53)
1814 \p{Cc} \p{XPosixCntrl} (= \p{General_Category=
1815 Control}) (65)
1816 \p{Ccc: *} \p{Canonical_Combining_Class: *}
1817 \p{CE} \p{Composition_Exclusion} (=
1818 \p{Composition_Exclusion=Y}) (81)
1819 \p{CE: *} \p{Composition_Exclusion: *}
1820 \p{Cf} \p{Format} (= \p{General_Category=Format})
1821 (161)
1822 \p{Chakma} \p{Script_Extensions=Chakma} (Short:
1823 \p{Cakm}; NOT \p{Block=Chakma}) (90)
1824 \p{Cham} \p{Script_Extensions=Cham} (NOT \p{Block=
1825 Cham}) (83)
1826 \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
1827 \p{CWCF}) (1463)
1828 \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
1829 (1_112_649 plus all above-Unicode code
1830 points: [\x00-
1831 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1832 >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\xb4\xb6-
1833 \xbf\xd7\xe0-\xff], U+0101, U+0103,
1834 U+0105, U+0107, U+0109 ...)
1835 \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
1836 (1463: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
1837 U+0100, U+0102, U+0104, U+0106, U+0108
1838 ...)
1839 \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
1840 \p{CWCM}) (2841)
1841 \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
1842 (1_111_271 plus all above-Unicode code
1843 points: [\x00-
1844 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1845 >?\@\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1846 \xbf\xd7\xf7], U+0138, U+018D, U+019B,
1847 U+01AA..01AB, U+01BA..01BB ...)
1848 \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
1849 (2841: [A-Za-z\xb5\xc0-\xd6\xd8-
1850 \xf6\xf8-\xff], U+0100..0137,
1851 U+0139..018C, U+018E..019A,
1852 U+019C..01A9, U+01AC..01B9 ...)
1853 \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
1854 \p{CWL}) (1390)
1855 \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
1856 (1_112_722 plus all above-Unicode code
1857 points: [\x00-
1858 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1859 >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1860 \xbf\xd7\xdf-\xff], U+0101, U+0103,
1861 U+0105, U+0107, U+0109 ...)
1862 \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1390:
1863 [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
1864 U+0104, U+0106, U+0108 ...)
1865 \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
1866 Y} (Short: \p{CWKCF}) (10_315)
1867 \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
1868 \P{CWKCF}) (1_103_797 plus all above-
1869 Unicode code points: [\x00-
1870 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
1871 >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
1872 \xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6-
1873 \xb7\xbb\xbf\xd7\xe0-\xff], U+0101,
1874 U+0103, U+0105, U+0107, U+0109 ...)
1875 \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
1876 \p{CWKCF}) (10_315: [A-
1877 Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
1878 \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf],
1879 U+0100, U+0102, U+0104, U+0106, U+0108
1880 ...)
1881 \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
1882 \p{CWT}) (1409)
1883 \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
1884 (1_112_703 plus all above-Unicode code
1885 points: [\x00-
1886 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
1887 Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1888 \xde\xf7], U+0100, U+0102, U+0104,
1889 U+0106, U+0108 ...)
1890 \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1409:
1891 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1892 U+0103, U+0105, U+0107, U+0109 ...)
1893 \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
1894 \p{CWU}) (1482)
1895 \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
1896 (1_112_630 plus all above-Unicode code
1897 points: [\x00-
1898 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
1899 Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
1900 \xde\xf7], U+0100, U+0102, U+0104,
1901 U+0106, U+0108 ...)
1902 \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1482:
1903 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
1904 U+0103, U+0105, U+0107, U+0109 ...)
1905 \p{Cher} \p{Cherokee} (= \p{Script_Extensions=
1906 Cherokee}) (NOT \p{Block=Cherokee}) (172)
1907 \p{Cherokee} \p{Script_Extensions=Cherokee} (Short:
1908 \p{Cher}; NOT \p{Block=Cherokee}) (172)
1909 X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block=
1910 Cherokee_Supplement}) (80)
1911 X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
1912 \p{InCherokeeSup}) (80)
1913 X \p{Chess_Symbols} \p{Block=Chess_Symbols} (112)
1914 \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
1915 Y}) (2396)
1916 \p{CI: *} \p{Case_Ignorable: *}
1917 X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
1918 CJK_Unified_Ideographs}) (20_992)
1919 X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
1920 CJK_Compatibility}) (256)
1921 X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
1922 CJK_Compatibility_Forms}) (32)
1923 X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
1924 \p{Block=CJK_Compatibility_Ideographs})
1925 (512)
1926 X \p{CJK_Compat_Ideographs_Sup}
1927 \p{CJK_Compatibility_Ideographs_-
1928 Supplement} (= \p{Block=
1929 CJK_Compatibility_Ideographs_-
1930 Supplement}) (544)
1931 X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
1932 \p{InCJKCompat}) (256)
1933 X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
1934 (Short: \p{InCJKCompatForms}) (32)
1935 X \p{CJK_Compatibility_Ideographs} \p{Block=
1936 CJK_Compatibility_Ideographs} (Short:
1937 \p{InCJKCompatIdeographs}) (512)
1938 X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
1939 CJK_Compatibility_Ideographs_Supplement}
1940 (Short: \p{InCJKCompatIdeographsSup})
1941 (544)
1942 X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
1943 \p{Block=
1944 CJK_Unified_Ideographs_Extension_A})
1945 (6592)
1946 X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
1947 \p{Block=
1948 CJK_Unified_Ideographs_Extension_B})
1949 (42_720)
1950 X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
1951 \p{Block=
1952 CJK_Unified_Ideographs_Extension_C})
1953 (4160)
1954 X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
1955 \p{Block=
1956 CJK_Unified_Ideographs_Extension_D})
1957 (224)
1958 X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (=
1959 \p{Block=
1960 CJK_Unified_Ideographs_Extension_E})
1961 (5776)
1962 X \p{CJK_Ext_F} \p{CJK_Unified_Ideographs_Extension_F} (=
1963 \p{Block=
1964 CJK_Unified_Ideographs_Extension_F})
1965 (7488)
1966 X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
1967 CJK_Radicals_Supplement}) (128)
1968 X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
1969 (Short: \p{InCJKRadicalsSup}) (128)
1970 X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
1971 X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
1972 \p{Block=CJK_Symbols_And_Punctuation})
1973 (64)
1974 X \p{CJK_Symbols_And_Punctuation} \p{Block=
1975 CJK_Symbols_And_Punctuation} (Short:
1976 \p{InCJKSymbols}) (64)
1977 X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
1978 (Short: \p{InCJK}) (20_992)
1979 X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
1980 CJK_Unified_Ideographs_Extension_A}
1981 (Short: \p{InCJKExtA}) (6592)
1982 X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
1983 CJK_Unified_Ideographs_Extension_B}
1984 (Short: \p{InCJKExtB}) (42_720)
1985 X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
1986 CJK_Unified_Ideographs_Extension_C}
1987 (Short: \p{InCJKExtC}) (4160)
1988 X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
1989 CJK_Unified_Ideographs_Extension_D}
1990 (Short: \p{InCJKExtD}) (224)
1991 X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
1992 CJK_Unified_Ideographs_Extension_E}
1993 (Short: \p{InCJKExtE}) (5776)
1994 X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
1995 CJK_Unified_Ideographs_Extension_F}
1996 (Short: \p{InCJKExtF}) (7488)
1997 \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
1998 (Short: \p{Pe}) (73)
1999 \p{Cn} \p{Unassigned} (= \p{General_Category=
2000 Unassigned}) (836_602 plus all above-
2001 Unicode code points)
2002 \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
2003 Control}) (65)
2004 \p{Co} \p{Private_Use} (= \p{General_Category=
2005 Private_Use}) (NOT \p{Private_Use_Area})
2006 (137_468)
2007 X \p{Combining_Diacritical_Marks} \p{Block=
2008 Combining_Diacritical_Marks} (Short:
2009 \p{InDiacriticals}) (112)
2010 X \p{Combining_Diacritical_Marks_Extended} \p{Block=
2011 Combining_Diacritical_Marks_Extended}
2012 (Short: \p{InDiacriticalsExt}) (80)
2013 X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
2014 Combining_Diacritical_Marks_For_Symbols}
2015 (Short: \p{InDiacriticalsForSymbols})
2016 (48)
2017 X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
2018 Combining_Diacritical_Marks_Supplement}
2019 (Short: \p{InDiacriticalsSup}) (64)
2020 X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
2021 \p{InHalfMarks}) (16)
2022 \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
2023 (2268)
2024 X \p{Combining_Marks_For_Symbols}
2025 \p{Combining_Diacritical_Marks_For_-
2026 Symbols} (= \p{Block=
2027 Combining_Diacritical_Marks_For_-
2028 Symbols}) (48)
2029 \p{Common} \p{Script_Extensions=Common} (Short:
2030 \p{Zyyy}) (7386)
2031 X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
2032 (Short: \p{InIndicNumberForms}) (16)
2033 \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
2034 \p{Full_Composition_Exclusion=Y}) (1120)
2035 \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
2036 X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
2037 Hangul_Compatibility_Jamo}) (96)
2038 \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
2039 \p{CE}) (81)
2040 \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
2041 plus all above-Unicode code points:
2042 U+0000..0957, U+0960..09DB, U+09DE,
2043 U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
2044 ...)
2045 \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
2046 U+0958..095F, U+09DC..09DD, U+09DF,
2047 U+0A33, U+0A36, U+0A59..0A5B ...)
2048 \p{Connector_Punctuation} \p{General_Category=
2049 Connector_Punctuation} (Short: \p{Pc})
2050 (10)
2051 \p{Control} \p{XPosixCntrl} (= \p{General_Category=
2052 Control}) (65)
2053 X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
2054 \p{Copt} \p{Coptic} (= \p{Script_Extensions=
2055 Coptic}) (NOT \p{Block=Coptic}) (165)
2056 \p{Coptic} \p{Script_Extensions=Coptic} (Short:
2057 \p{Copt}; NOT \p{Block=Coptic}) (165)
2058 X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
2059 X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
2060 Counting_Rod_Numerals}) (32)
2061 X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
2062 \p{InCountingRod}) (32)
2063 \p{Cprt} \p{Cypriot} (= \p{Script_Extensions=
2064 Cypriot}) (112)
2065 \p{Cs} \p{Surrogate} (= \p{General_Category=
2066 Surrogate}) (2048)
2067 \p{Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
2068 \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
2069 X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
2070 \p{Block=
2071 Cuneiform_Numbers_And_Punctuation}) (128)
2072 X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
2073 Cuneiform_Numbers_And_Punctuation}
2074 (Short: \p{InCuneiformNumbers}) (128)
2075 \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
2076 (Short: \p{Sc}) (62)
2077 X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
2078 \p{CWCF} \p{Changes_When_Casefolded} (=
2079 \p{Changes_When_Casefolded=Y}) (1463)
2080 \p{CWCF: *} \p{Changes_When_Casefolded: *}
2081 \p{CWCM} \p{Changes_When_Casemapped} (=
2082 \p{Changes_When_Casemapped=Y}) (2841)
2083 \p{CWCM: *} \p{Changes_When_Casemapped: *}
2084 \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
2085 \p{Changes_When_NFKC_Casefolded=Y})
2086 (10_315)
2087 \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
2088 \p{CWL} \p{Changes_When_Lowercased} (=
2089 \p{Changes_When_Lowercased=Y}) (1390)
2090 \p{CWL: *} \p{Changes_When_Lowercased: *}
2091 \p{CWT} \p{Changes_When_Titlecased} (=
2092 \p{Changes_When_Titlecased=Y}) (1409)
2093 \p{CWT: *} \p{Changes_When_Titlecased: *}
2094 \p{CWU} \p{Changes_When_Uppercased} (=
2095 \p{Changes_When_Uppercased=Y}) (1482)
2096 \p{CWU: *} \p{Changes_When_Uppercased: *}
2097 \p{Cypriot} \p{Script_Extensions=Cypriot} (Short:
2098 \p{Cprt}) (112)
2099 X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
2100 \p{Cyrillic} \p{Script_Extensions=Cyrillic} (Short:
2101 \p{Cyrl}; NOT \p{Block=Cyrillic}) (446)
2102 X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
2103 Cyrillic_Extended_A}) (32)
2104 X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
2105 Cyrillic_Extended_B}) (96)
2106 X \p{Cyrillic_Ext_C} \p{Cyrillic_Extended_C} (= \p{Block=
2107 Cyrillic_Extended_C}) (16)
2108 X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
2109 \p{InCyrillicExtA}) (32)
2110 X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
2111 \p{InCyrillicExtB}) (96)
2112 X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
2113 \p{InCyrillicExtC}) (16)
2114 X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
2115 Cyrillic_Supplement}) (48)
2116 X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
2117 \p{InCyrillicSup}) (48)
2118 X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
2119 Cyrillic_Supplement}) (48)
2120 \p{Cyrl} \p{Cyrillic} (= \p{Script_Extensions=
2121 Cyrillic}) (NOT \p{Block=Cyrillic}) (446)
2122 \p{Dash} \p{Dash=Y} (28)
2123 \p{Dash: N*} (Single: \P{Dash}) (1_114_084 plus all
2124 above-Unicode code points: [\x00-
2125 \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
2126 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2127 U+0100..0589, U+058B..05BD,
2128 U+05BF..13FF, U+1401..1805, U+1807..200F
2129 ...)
2130 \p{Dash: Y*} (Single: \p{Dash}) (28: [\-], U+058A,
2131 U+05BE, U+1400, U+1806, U+2010..2015 ...)
2132 \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
2133 (Short: \p{Pd}) (24)
2134 \p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category=
2135 Decimal_Number}) (630)
2136 \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
2137 (13_232)
2138 \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232:
2139 [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-
2140 \xdd\xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-
2141 \xfd\xff], U+0100..010F, U+0112..0125,
2142 U+0128..0130, U+0134..0137, U+0139..013E
2143 ...)
2144 \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
2145 U+2460..2473, U+24B6..24EA,
2146 U+3244..3247, U+3251..327E,
2147 U+3280..32BF, U+32D0..32FE ...)
2148 \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
2149 \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720:
2150 [\xa8\xaf\xb4-\xb5\xb8], U+0132..0133,
2151 U+013F..0140, U+0149, U+017F,
2152 U+01C4..01CC ...)
2153 \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
2154 \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
2155 \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
2156 U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
2157 ...)
2158 \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184: U+2102,
2159 U+210A..2113, U+2115, U+2119..211D,
2160 U+2124, U+2128 ...)
2161 \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
2162 \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
2163 \xbe], U+2150..215F, U+2189)
2164 \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
2165 \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
2166 U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
2167 ...)
2168 \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
2169 \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
2170 U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
2171 ...)
2172 \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
2173 \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
2174 U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
2175 ...)
2176 \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
2177 \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
2178 U+FF61..FFBE, U+FFC2..FFC7,
2179 U+FFCA..FFCF, U+FFD2..FFD7,
2180 U+FFDA..FFDC, U+FFE8..FFEE)
2181 \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
2182 \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
2183 U+0F0C, U+2007, U+2011, U+202F)
2184 \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
2185 Non_Canonical} (Perl extension) (3664)
2186 \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
2187 decompositions (Short: \p{Dt=NonCanon})
2188 (Perl extension) (3664:
2189 [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-\xba\xbc-
2190 \xbe], U+0132..0133, U+013F..0140,
2191 U+0149, U+017F, U+01C4..01CC ...)
2192 \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_216 plus
2193 all above-Unicode code points: [\x00-
2194 \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
2195 \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
2196 \xdf\xe6\xf0\xf7-\xf8\xfe],
2197 U+0110..0111, U+0126..0127, U+0131,
2198 U+0138, U+0141..0142 ...)
2199 \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
2200 U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
2201 \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
2202 \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
2203 \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
2204 U+32CC..32CF, U+32FF..3357,
2205 U+3371..33DF, U+33FF, U+1F130..1F14F ...)
2206 \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38: U+1D62..1D6A,
2207 U+2080..208E, U+2090..209C, U+2C7C)
2208 \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (153)
2209 \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (153: [\xaa\xb2-
2210 \xb3\xb9-\xba], U+02B0..02B8,
2211 U+02E0..02E4, U+10FC, U+1D2C..1D2E,
2212 U+1D30..1D3A ...)
2213 \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
2214 \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
2215 U+30FF, U+FE10..FE19, U+FE30..FE44,
2216 U+FE47..FE48)
2217 \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
2218 U+FF01..FF60, U+FFE0..FFE6)
2219 \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
2220 Y} (Short: \p{DI}) (4173)
2221 \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
2222 (1_109_939 plus all above-Unicode code
2223 points: [\x00-\xac\xae-\xff],
2224 U+0100..034E, U+0350..061B,
2225 U+061D..115E, U+1161..17B3, U+17B6..180A
2226 ...)
2227 \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
2228 (4173: [\xad], U+034F, U+061C,
2229 U+115F..1160, U+17B4..17B5, U+180B..180E
2230 ...)
2231 \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (15)
2232 \p{Dep: *} \p{Deprecated: *}
2233 \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (15)
2234 \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_097
2235 plus all above-Unicode code points:
2236 U+0000..0148, U+014A..0672,
2237 U+0674..0F76, U+0F78, U+0F7A..17A2,
2238 U+17A5..2069 ...)
2239 \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
2240 U+0673, U+0F77, U+0F79, U+17A3..17A4,
2241 U+206A..206F ...)
2242 \p{Deseret} \p{Script_Extensions=Deseret} (Short:
2243 \p{Dsrt}) (80)
2244 \p{Deva} \p{Devanagari} (= \p{Script_Extensions=
2245 Devanagari}) (NOT \p{Block=Devanagari})
2246 (210)
2247 \p{Devanagari} \p{Script_Extensions=Devanagari} (Short:
2248 \p{Deva}; NOT \p{Block=Devanagari}) (210)
2249 X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
2250 Devanagari_Extended}) (32)
2251 X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
2252 \p{InDevanagariExt}) (32)
2253 \p{DI} \p{Default_Ignorable_Code_Point} (=
2254 \p{Default_Ignorable_Code_Point=Y})
2255 (4173)
2256 \p{DI: *} \p{Default_Ignorable_Code_Point: *}
2257 \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (873)
2258 \p{Dia: *} \p{Diacritic: *}
2259 \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (873)
2260 \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_239
2261 plus all above-Unicode code points:
2262 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2263 >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
2264 \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
2265 U+0100..02AF, U+034F, U+0358..035C,
2266 U+0363..0373, U+0376..0379 ...)
2267 \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (873:
2268 [\^`\xa8\xaf\xb4\xb7-\xb8],
2269 U+02B0..034E, U+0350..0357,
2270 U+035D..0362, U+0374..0375, U+037A ...)
2271 X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
2272 \p{Block=Combining_Diacritical_Marks})
2273 (112)
2274 X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended}
2275 (= \p{Block=
2276 Combining_Diacritical_Marks_Extended})
2277 (80)
2278 X \p{Diacriticals_For_Symbols}
2279 \p{Combining_Diacritical_Marks_For_-
2280 Symbols} (= \p{Block=
2281 Combining_Diacritical_Marks_For_-
2282 Symbols}) (48)
2283 X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
2284 (= \p{Block=
2285 Combining_Diacritical_Marks_Supplement})
2286 (64)
2287 \p{Digit} \p{XPosixDigit} (= \p{General_Category=
2288 Decimal_Number}) (630)
2289 X \p{Dingbats} \p{Block=Dingbats} (192)
2290 \p{Dogr} \p{Dogra} (= \p{Script_Extensions=Dogra})
2291 (NOT \p{Block=Dogra}) (82)
2292 \p{Dogra} \p{Script_Extensions=Dogra} (Short:
2293 \p{Dogr}; NOT \p{Block=Dogra}) (82)
2294 X \p{Domino} \p{Domino_Tiles} (= \p{Block=
2295 Domino_Tiles}) (112)
2296 X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
2297 \p{InDomino}) (112)
2298 \p{Dsrt} \p{Deseret} (= \p{Script_Extensions=
2299 Deseret}) (80)
2300 \p{Dt: *} \p{Decomposition_Type: *}
2301 \p{Dupl} \p{Duployan} (= \p{Script_Extensions=
2302 Duployan}) (NOT \p{Block=Duployan}) (147)
2303 \p{Duployan} \p{Script_Extensions=Duployan} (Short:
2304 \p{Dupl}; NOT \p{Block=Duployan}) (147)
2305 \p{Ea: *} \p{East_Asian_Width: *}
2306 X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
2307 (208)
2308 \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
2309 \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739:
2310 [\xa1\xa4\xa7-\xa8\xaa\xad-\xae\xb0-
2311 \xb4\xb6-\xba\xbc-\xbf\xc6\xd0\xd7-
2312 \xd8\xde-\xe1\xe6\xe8-\xea\xec-
2313 \xed\xf0\xf2-\xf3\xf7-\xfa\xfc\xfe],
2314 U+0101, U+0111, U+0113, U+011B,
2315 U+0126..0127 ...)
2316 \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
2317 \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
2318 U+FF01..FF60, U+FFE0..FFE6)
2319 \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
2320 \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
2321 U+FF61..FFBE, U+FFC2..FFC7,
2322 U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
2323 ...)
2324 \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (793_252 plus
2325 all above-Unicode code points)
2326 \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
2327 \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-
2328 \x7e\xa2-\xa3\xa5-\xa6\xac\xaf],
2329 U+27E6..27ED, U+2985..2986)
2330 \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (793_252 plus all
2331 above-Unicode code points: [\x00-
2332 \x1f\x7f-\xa0\xa9\xab\xb5\xbb\xc0-
2333 \xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd\xe2-
2334 \xe5\xe7\xeb\xee-\xef\xf1\xf4-
2335 \xf6\xfb\xfd\xff], U+00FF..0100,
2336 U+0102..0110, U+0112, U+0114..011A,
2337 U+011C..0125 ...)
2338 \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (181_783)
2339 \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (181_783:
2340 U+1100..115F, U+231A..231B,
2341 U+2329..232A, U+23E9..23EC, U+23F0,
2342 U+23F3 ...)
2343 \p{Egyp} \p{Egyptian_Hieroglyphs} (=
2344 \p{Script_Extensions=
2345 Egyptian_Hieroglyphs}) (NOT \p{Block=
2346 Egyptian_Hieroglyphs}) (1080)
2347 X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
2348 Egyptian_Hieroglyph_Format_Controls} (16)
2349 \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
2350 Egyptian_Hieroglyphs} (Short: \p{Egyp};
2351 NOT \p{Block=Egyptian_Hieroglyphs})
2352 (1080)
2353 \p{Elba} \p{Elbasan} (= \p{Script_Extensions=
2354 Elbasan}) (NOT \p{Block=Elbasan}) (40)
2355 \p{Elbasan} \p{Script_Extensions=Elbasan} (Short:
2356 \p{Elba}; NOT \p{Block=Elbasan}) (40)
2357 \p{Elym} \p{Elymaic} (= \p{Script_Extensions=
2358 Elymaic}) (NOT \p{Block=Elymaic}) (23)
2359 \p{Elymaic} \p{Script_Extensions=Elymaic} (Short:
2360 \p{Elym}; NOT \p{Block=Elymaic}) (23)
2361 X \p{Emoticons} \p{Block=Emoticons} (80)
2362 X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
2363 Enclosed_Alphanumerics}) (160)
2364 X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
2365 \p{Block=
2366 Enclosed_Alphanumeric_Supplement}) (256)
2367 X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
2368 Enclosed_Alphanumeric_Supplement}
2369 (Short: \p{InEnclosedAlphanumSup}) (256)
2370 X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
2371 (Short: \p{InEnclosedAlphanum}) (160)
2372 X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
2373 \p{Block=
2374 Enclosed_CJK_Letters_And_Months}) (256)
2375 X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
2376 Enclosed_CJK_Letters_And_Months} (Short:
2377 \p{InEnclosedCJK}) (256)
2378 X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
2379 (= \p{Block=
2380 Enclosed_Ideographic_Supplement}) (256)
2381 X \p{Enclosed_Ideographic_Supplement} \p{Block=
2382 Enclosed_Ideographic_Supplement} (Short:
2383 \p{InEnclosedIdeographicSup}) (256)
2384 \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
2385 (Short: \p{Me}) (13)
2386 \p{Ethi} \p{Ethiopic} (= \p{Script_Extensions=
2387 Ethiopic}) (NOT \p{Block=Ethiopic}) (495)
2388 \p{Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
2389 \p{Ethi}; NOT \p{Block=Ethiopic}) (495)
2390 X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
2391 Ethiopic_Extended}) (96)
2392 X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
2393 Ethiopic_Extended_A}) (48)
2394 X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
2395 \p{InEthiopicExt}) (96)
2396 X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
2397 \p{InEthiopicExtA}) (48)
2398 X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
2399 Ethiopic_Supplement}) (32)
2400 X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
2401 \p{InEthiopicSup}) (32)
2402 \p{Ext} \p{Extender} (= \p{Extender=Y}) (47)
2403 \p{Ext: *} \p{Extender: *}
2404 \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (47)
2405 \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_065
2406 plus all above-Unicode code points:
2407 [\x00-\xb6\xb8-\xff], U+0100..02CF,
2408 U+02D2..063F, U+0641..07F9,
2409 U+07FB..0E45, U+0E47..0EC5 ...)
2410 \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (47: [\xb7],
2411 U+02D0..02D1, U+0640, U+07FA, U+0E46,
2412 U+0EC6 ...)
2413 \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
2414 (Short: \p{Pf}) (10)
2415 \p{Format} \p{General_Category=Format} (Short:
2416 \p{Cf}) (161)
2417 \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
2418 (Short: \p{CompEx}) (1120)
2419 \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
2420 \P{CompEx}) (1_112_992 plus all above-
2421 Unicode code points: U+0000..033F,
2422 U+0342, U+0345..0373, U+0375..037D,
2423 U+037F..0386, U+0388..0957 ...)
2424 \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
2425 \p{CompEx}) (1120: U+0340..0341,
2426 U+0343..0344, U+0374, U+037E, U+0387,
2427 U+0958..095F ...)
2428 \p{Gc: *} \p{General_Category: *}
2429 \p{GCB: *} \p{Grapheme_Cluster_Break: *}
2430 \p{General_Category: C} \p{General_Category=Other} (976_344 plus
2431 all above-Unicode code points)
2432 \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
2433 \p{Gc=LC}, \p{LC}) (3970: [A-Za-
2434 z\xb5\xc0-\xd6\xd8-\xf6\xf8-\xff],
2435 U+0100..01BA, U+01BC..01BF,
2436 U+01C4..0293, U+0295..02AF, U+0370..0373
2437 ...)
2438 \p{General_Category: Cc} \p{General_Category=Control} (65)
2439 \p{General_Category: Cf} \p{General_Category=Format} (161)
2440 \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
2441 (73: [\)\]\}], U+0F3B, U+0F3D, U+169C,
2442 U+2046, U+207E ...)
2443 \p{General_Category: Cn} \p{General_Category=Unassigned} (836_602
2444 plus all above-Unicode code points)
2445 \p{General_Category: Cntrl} \p{General_Category=Control} (65)
2446 \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
2447 \p{General_Category: Combining_Mark} \p{General_Category=Mark}
2448 (2268)
2449 \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
2450 \p{Pc}) (10: [_], U+203F..2040, U+2054,
2451 U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
2452 \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
2453 [\x00-\x1f\x7f-\x9f])
2454 \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
2455 \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
2456 (62: [\$\xa2-\xa5], U+058F, U+060B,
2457 U+07FE..07FF, U+09F2..09F3, U+09FB ...)
2458 \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
2459 (24: [\-], U+058A, U+05BE, U+1400,
2460 U+1806, U+2010..2015 ...)
2461 \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
2462 (630: [0-9], U+0660..0669, U+06F0..06F9,
2463 U+07C0..07C9, U+0966..096F, U+09E6..09EF
2464 ...)
2465 \p{General_Category: Digit} \p{General_Category=Decimal_Number}
2466 (630)
2467 \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
2468 (13: U+0488..0489, U+1ABE, U+20DD..20E0,
2469 U+20E2..20E4, U+A670..A672)
2470 \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
2471 (10: [\xbb], U+2019, U+201D, U+203A,
2472 U+2E03, U+2E05 ...)
2473 \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (161:
2474 [\xad], U+0600..0605, U+061C, U+06DD,
2475 U+070F, U+08E2 ...)
2476 \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
2477 \p{Pi}) (12: [\xab], U+2018,
2478 U+201B..201C, U+201F, U+2039, U+2E02 ...)
2479 \p{General_Category: L} \p{General_Category=Letter} (125_643)
2480 X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3970)
2481 X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
2482 the trailing '_' matters in spite of
2483 loose matching rules. (3970)
2484 \p{General_Category: LC} \p{General_Category=Cased_Letter} (3970)
2485 \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (125_643:
2486 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
2487 \xf6\xf8-\xff], U+0100..02C1,
2488 U+02C6..02D1, U+02E0..02E4, U+02EC,
2489 U+02EE ...)
2490 \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
2491 (236: U+16EE..16F0, U+2160..2182,
2492 U+2185..2188, U+3007, U+3021..3029,
2493 U+3038..303A ...)
2494 \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
2495 (1: U+2028)
2496 \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
2497 (/i= General_Category=Cased_Letter)
2498 (2151)
2499 \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (259)
2500 \p{General_Category: Lo} \p{General_Category=Other_Letter}
2501 (121_414)
2502 \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
2503 /i= General_Category=Cased_Letter)
2504 (2151: [a-z\xb5\xdf-\xf6\xf8-\xff],
2505 U+0101, U+0103, U+0105, U+0107, U+0109
2506 ...)
2507 \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
2508 (/i= General_Category=Cased_Letter) (31)
2509 \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
2510 (/i= General_Category=Cased_Letter)
2511 (1788)
2512 \p{General_Category: M} \p{General_Category=Mark} (2268)
2513 \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2268:
2514 U+0300..036F, U+0483..0489,
2515 U+0591..05BD, U+05BF, U+05C1..05C2,
2516 U+05C4..05C5 ...)
2517 \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
2518 [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
2519 U+0606..0608, U+2044, U+2052,
2520 U+207A..207C ...)
2521 \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (429)
2522 \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
2523 \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
2524 (1826)
2525 \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
2526 (259: U+02B0..02C1, U+02C6..02D1,
2527 U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
2528 \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
2529 (121: [\^`\xa8\xaf\xb4\xb8],
2530 U+02C2..02C5, U+02D2..02DF,
2531 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2532 \p{General_Category: N} \p{General_Category=Number} (1754)
2533 \p{General_Category: Nd} \p{General_Category=Decimal_Number} (630)
2534 \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
2535 \p{General_Category: No} \p{General_Category=Other_Number} (888)
2536 \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
2537 (1826: U+0300..036F, U+0483..0487,
2538 U+0591..05BD, U+05BF, U+05C1..05C2,
2539 U+05C4..05C5 ...)
2540 \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1754:
2541 [0-9\xb2-\xb3\xb9\xbc-\xbe],
2542 U+0660..0669, U+06F0..06F9,
2543 U+07C0..07C9, U+0966..096F, U+09E6..09EF
2544 ...)
2545 \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
2546 (75: [\(\[\{], U+0F3A, U+0F3C, U+169B,
2547 U+201A, U+201E ...)
2548 \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (976_344 plus
2549 all above-Unicode code points: [\x00-
2550 \x1f\x7f-\x9f\xad], U+0378..0379,
2551 U+0380..0383, U+038B, U+038D, U+03A2 ...)
2552 \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
2553 (121_414: [\xaa\xba], U+01BB,
2554 U+01C0..01C3, U+0294, U+05D0..05EA,
2555 U+05EF..05F2 ...)
2556 \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
2557 (888: [\xb2-\xb3\xb9\xbc-\xbe],
2558 U+09F4..09F9, U+0B72..0B77,
2559 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
2560 ...)
2561 \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
2562 (588:
2563 [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7\xb6-
2564 \xb7\xbf], U+037E, U+0387, U+055A..055F,
2565 U+0589, U+05C0 ...)
2566 \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
2567 (6161: [\xa6\xa9\xae\xb0], U+0482,
2568 U+058D..058E, U+060E..060F, U+06DE,
2569 U+06E9 ...)
2570 \p{General_Category: P} \p{General_Category=Punctuation} (792)
2571 \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
2572 \p{Zp}) (1: U+2029)
2573 \p{General_Category: Pc} \p{General_Category=
2574 Connector_Punctuation} (10)
2575 \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
2576 \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
2577 (73)
2578 \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
2579 (10)
2580 \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
2581 (12)
2582 \p{General_Category: Po} \p{General_Category=Other_Punctuation}
2583 (588)
2584 \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
2585 (137_468: U+E000..F8FF, U+F0000..FFFFD,
2586 U+100000..10FFFD)
2587 \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
2588 \p{General_Category: Punct} \p{General_Category=Punctuation} (792)
2589 \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (792:
2590 [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_-
2591 \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
2592 U+037E, U+0387, U+055A..055F,
2593 U+0589..058A, U+05BE ...)
2594 \p{General_Category: S} \p{General_Category=Symbol} (7292)
2595 \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (62)
2596 \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
2597 [\x20\xa0], U+1680, U+2000..200A,
2598 U+2028..2029, U+202F, U+205F ...)
2599 \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121)
2600 \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
2601 \p{General_Category: So} \p{General_Category=Other_Symbol} (6161)
2602 \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
2603 (17: [\x20\xa0], U+1680, U+2000..200A,
2604 U+202F, U+205F, U+3000)
2605 \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
2606 (429: U+0903, U+093B, U+093E..0940,
2607 U+0949..094C, U+094E..094F, U+0982..0983
2608 ...)
2609 \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
2610 U+D800..DFFF)
2611 \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7292:
2612 [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
2613 \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
2614 U+02D2..02DF, U+02E5..02EB, U+02ED,
2615 U+02EF..02FF ...)
2616 \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
2617 /i= General_Category=Cased_Letter) (31:
2618 U+01C5, U+01C8, U+01CB, U+01F2,
2619 U+1F88..1F8F, U+1F98..1F9F ...)
2620 \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
2621 (836_602 plus all above-Unicode code
2622 points: U+0378..0379, U+0380..0383,
2623 U+038B, U+038D, U+03A2, U+0530 ...)
2624 \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
2625 /i= General_Category=Cased_Letter)
2626 (1788: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
2627 U+0102, U+0104, U+0106, U+0108 ...)
2628 \p{General_Category: Z} \p{General_Category=Separator} (19)
2629 \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
2630 \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
2631 (1)
2632 \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
2633 X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
2634 \p{InPunctuation}) (112)
2635 X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
2636 X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
2637 \p{Block=Geometric_Shapes_Extended})
2638 (128)
2639 X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
2640 (Short: \p{InGeometricShapesExt}) (128)
2641 \p{Geor} \p{Georgian} (= \p{Script_Extensions=
2642 Georgian}) (NOT \p{Block=Georgian}) (175)
2643 \p{Georgian} \p{Script_Extensions=Georgian} (Short:
2644 \p{Geor}; NOT \p{Block=Georgian}) (175)
2645 X \p{Georgian_Ext} \p{Georgian_Extended} (= \p{Block=
2646 Georgian_Extended}) (48)
2647 X \p{Georgian_Extended} \p{Block=Georgian_Extended} (Short:
2648 \p{InGeorgianExt}) (48)
2649 X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
2650 Georgian_Supplement}) (48)
2651 X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
2652 \p{InGeorgianSup}) (48)
2653 \p{Glag} \p{Glagolitic} (= \p{Script_Extensions=
2654 Glagolitic}) (NOT \p{Block=Glagolitic})
2655 (136)
2656 \p{Glagolitic} \p{Script_Extensions=Glagolitic} (Short:
2657 \p{Glag}; NOT \p{Block=Glagolitic}) (136)
2658 X \p{Glagolitic_Sup} \p{Glagolitic_Supplement} (= \p{Block=
2659 Glagolitic_Supplement}) (48)
2660 X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
2661 \p{InGlagoliticSup}) (48)
2662 \p{Gong} \p{Gunjala_Gondi} (= \p{Script_Extensions=
2663 Gunjala_Gondi}) (NOT \p{Block=
2664 Gunjala_Gondi}) (65)
2665 \p{Gonm} \p{Masaram_Gondi} (= \p{Script_Extensions=
2666 Masaram_Gondi}) (NOT \p{Block=
2667 Masaram_Gondi}) (77)
2668 \p{Goth} \p{Gothic} (= \p{Script_Extensions=
2669 Gothic}) (NOT \p{Block=Gothic}) (27)
2670 \p{Gothic} \p{Script_Extensions=Gothic} (Short:
2671 \p{Goth}; NOT \p{Block=Gothic}) (27)
2672 \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
2673 (135_898)
2674 \p{Gr_Base: *} \p{Grapheme_Base: *}
2675 \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
2676 Y}) (1965)
2677 \p{Gr_Ext: *} \p{Grapheme_Extend: *}
2678 \p{Gran} \p{Grantha} (= \p{Script_Extensions=
2679 Grantha}) (NOT \p{Block=Grantha}) (116)
2680 \p{Grantha} \p{Script_Extensions=Grantha} (Short:
2681 \p{Gran}; NOT \p{Block=Grantha}) (116)
2682 \p{Graph} \p{XPosixGraph} (275_378)
2683 \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
2684 (135_898)
2685 \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase}) (978_214
2686 plus all above-Unicode code points:
2687 [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
2688 U+0378..0379, U+0380..0383, U+038B,
2689 U+038D ...)
2690 \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase})
2691 (135_898: [\x20-\x7e\xa0-\xac\xae-\xff],
2692 U+0100..02FF, U+0370..0377,
2693 U+037A..037F, U+0384..038A, U+038C ...)
2694 \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
2695 (3886)
2696 \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3886:
2697 [^\n\r\x20-\x7e\xa0-\xac\xae-\xff],
2698 U+061C, U+180E, U+200B, U+200E..200F,
2699 U+2028..202E ...)
2700 \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
2701 \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
2702 \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
2703 \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
2704 \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
2705 (0)
2706 \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
2707 E_Base_GAZ} (0)
2708 \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
2709 E_Modifier} (0)
2710 \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
2711 (1970)
2712 \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1970:
2713 U+0300..036F, U+0483..0489,
2714 U+0591..05BD, U+05BF, U+05C1..05C2,
2715 U+05C4..05C5 ...)
2716 \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
2717 Glue_After_Zwj} (0)
2718 \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
2719 \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
2720 U+1100..115F, U+A960..A97C)
2721 \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
2722 \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
2723 U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
2724 ...)
2725 \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
2726 U+AC01..AC1B, U+AC1D..AC37,
2727 U+AC39..AC53, U+AC55..AC6F,
2728 U+AC71..AC8B, U+AC8D..ACA7 ...)
2729 \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_301
2730 plus all above-Unicode code points:
2731 [\x20-\x7e\xa0-\xac\xae-\xff],
2732 U+0100..02FF, U+0370..0482,
2733 U+048A..0590, U+05BE, U+05C0 ...)
2734 \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
2735 (22)
2736 \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (22:
2737 U+0600..0605, U+06DD, U+070F, U+08E2,
2738 U+0D4E, U+110BD ...)
2739 \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
2740 (26: U+1F1E6..1F1FF)
2741 \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
2742 Regional_Indicator} (26)
2743 \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
2744 SpacingMark} (375)
2745 \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (375:
2746 U+0903, U+093B, U+093E..0940,
2747 U+0949..094C, U+094E..094F, U+0982..0983
2748 ...)
2749 \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
2750 U+11A8..11FF, U+D7CB..D7FB)
2751 \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
2752 U+1160..11A7, U+D7B0..D7C6)
2753 \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
2754 (1_096_301 plus all above-Unicode code
2755 points)
2756 \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
2757 \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
2758 (1965)
2759 \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_147
2760 plus all above-Unicode code points:
2761 U+0000..02FF, U+0370..0482,
2762 U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
2763 \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1965:
2764 U+0300..036F, U+0483..0489,
2765 U+0591..05BD, U+05BF, U+05C1..05C2,
2766 U+05C4..05C5 ...)
2767 \p{Greek} \p{Script_Extensions=Greek} (Short:
2768 \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
2769 X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
2770 \p{InGreek}) (144)
2771 X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
2772 Greek_Extended}) (256)
2773 X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
2774 \p{InGreekExt}) (256)
2775 \p{Grek} \p{Greek} (= \p{Script_Extensions=Greek})
2776 (NOT \p{Greek_And_Coptic}) (522)
2777 \p{Gujarati} \p{Script_Extensions=Gujarati} (Short:
2778 \p{Gujr}; NOT \p{Block=Gujarati}) (105)
2779 \p{Gujr} \p{Gujarati} (= \p{Script_Extensions=
2780 Gujarati}) (NOT \p{Block=Gujarati}) (105)
2781 \p{Gunjala_Gondi} \p{Script_Extensions=Gunjala_Gondi}
2782 (Short: \p{Gong}; NOT \p{Block=
2783 Gunjala_Gondi}) (65)
2784 \p{Gurmukhi} \p{Script_Extensions=Gurmukhi} (Short:
2785 \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
2786 \p{Guru} \p{Gurmukhi} (= \p{Script_Extensions=
2787 Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
2788 X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
2789 \p{Block=Halfwidth_And_Fullwidth_Forms})
2790 (240)
2791 X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
2792 Combining_Half_Marks}) (16)
2793 X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
2794 Halfwidth_And_Fullwidth_Forms} (Short:
2795 \p{InHalfAndFullForms}) (240)
2796 \p{Han} \p{Script_Extensions=Han} (89_513)
2797 \p{Hang} \p{Hangul} (= \p{Script_Extensions=
2798 Hangul}) (NOT \p{Hangul_Syllables})
2799 (11_775)
2800 \p{Hangul} \p{Script_Extensions=Hangul} (Short:
2801 \p{Hang}; NOT \p{Hangul_Syllables})
2802 (11_775)
2803 X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
2804 (Short: \p{InCompatJamo}) (96)
2805 X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
2806 (256)
2807 X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
2808 (Short: \p{InJamoExtA}) (32)
2809 X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
2810 (Short: \p{InJamoExtB}) (80)
2811 \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
2812 (125)
2813 \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
2814 U+1100..115F, U+A960..A97C)
2815 \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
2816 (399)
2817 \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
2818 U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
2819 U+AC8C ...)
2820 \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
2821 LVT_Syllable} (10_773)
2822 \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
2823 (10_773: U+AC01..AC1B, U+AC1D..AC37,
2824 U+AC39..AC53, U+AC55..AC6F,
2825 U+AC71..AC8B, U+AC8D..ACA7 ...)
2826 \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
2827 Not_Applicable} (1_102_583 plus all
2828 above-Unicode code points)
2829 \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
2830 (1_102_583 plus all above-Unicode code
2831 points: U+0000..10FF, U+1200..A95F,
2832 U+A97D..ABFF, U+D7A4..D7AF,
2833 U+D7C7..D7CA, U+D7FC..infinity)
2834 \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
2835 (137)
2836 \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
2837 U+11A8..11FF, U+D7CB..D7FB)
2838 \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
2839 (95)
2840 \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
2841 U+1160..11A7, U+D7B0..D7C6)
2842 X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
2843 \p{InHangul}) (11_184)
2844 \p{Hani} \p{Han} (= \p{Script_Extensions=Han})
2845 (89_513)
2846 \p{Hanifi_Rohingya} \p{Script_Extensions=Hanifi_Rohingya}
2847 (Short: \p{Rohg}; NOT \p{Block=
2848 Hanifi_Rohingya}) (55)
2849 \p{Hano} \p{Hanunoo} (= \p{Script_Extensions=
2850 Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
2851 \p{Hanunoo} \p{Script_Extensions=Hanunoo} (Short:
2852 \p{Hano}; NOT \p{Block=Hanunoo}) (23)
2853 \p{Hatr} \p{Hatran} (= \p{Script_Extensions=
2854 Hatran}) (NOT \p{Block=Hatran}) (26)
2855 \p{Hatran} \p{Script_Extensions=Hatran} (Short:
2856 \p{Hatr}; NOT \p{Block=Hatran}) (26)
2857 \p{Hebr} \p{Hebrew} (= \p{Script_Extensions=
2858 Hebrew}) (NOT \p{Block=Hebrew}) (134)
2859 \p{Hebrew} \p{Script_Extensions=Hebrew} (Short:
2860 \p{Hebr}; NOT \p{Block=Hebrew}) (134)
2861 \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2862 \p{Hex: *} \p{Hex_Digit: *}
2863 \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2864 \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
2865 plus all above-Unicode code points:
2866 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
2867 >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
2868 U+0100..FF0F, U+FF1A..FF20,
2869 U+FF27..FF40, U+FF47..infinity)
2870 \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
2871 f], U+FF10..FF19, U+FF21..FF26,
2872 U+FF41..FF46)
2873 X \p{High_Private_Use_Surrogates} \p{Block=
2874 High_Private_Use_Surrogates} (Short:
2875 \p{InHighPUSurrogates}) (128)
2876 X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
2877 \p{Block=High_Private_Use_Surrogates})
2878 (128)
2879 X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
2880 \p{Hira} \p{Hiragana} (= \p{Script_Extensions=
2881 Hiragana}) (NOT \p{Block=Hiragana}) (431)
2882 \p{Hiragana} \p{Script_Extensions=Hiragana} (Short:
2883 \p{Hira}; NOT \p{Block=Hiragana}) (431)
2884 \p{Hluw} \p{Anatolian_Hieroglyphs} (=
2885 \p{Script_Extensions=
2886 Anatolian_Hieroglyphs}) (NOT \p{Block=
2887 Anatolian_Hieroglyphs}) (583)
2888 \p{Hmng} \p{Pahawh_Hmong} (= \p{Script_Extensions=
2889 Pahawh_Hmong}) (NOT \p{Block=
2890 Pahawh_Hmong}) (127)
2891 \p{Hmnp} \p{Nyiakeng_Puachue_Hmong} (=
2892 \p{Script_Extensions=
2893 Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
2894 Nyiakeng_Puachue_Hmong}) (71)
2895 \p{HorizSpace} \p{XPosixBlank} (18)
2896 \p{Hst: *} \p{Hangul_Syllable_Type: *}
2897 \p{Hung} \p{Old_Hungarian} (= \p{Script_Extensions=
2898 Old_Hungarian}) (NOT \p{Block=
2899 Old_Hungarian}) (108)
2900 D \p{Hyphen} \p{Hyphen=Y} (11)
2901 D \p{Hyphen: N*} Supplanted by Line_Break property values;
2902 see www.unicode.org/reports/tr14
2903 (Single: \P{Hyphen}) (1_114_101 plus all
2904 above-Unicode code points: [\x00-
2905 \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
2906 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-
2907 \xff], U+0100..0589, U+058B..1805,
2908 U+1807..200F, U+2012..2E16, U+2E18..30FA
2909 ...)
2910 D \p{Hyphen: Y*} Supplanted by Line_Break property values;
2911 see www.unicode.org/reports/tr14
2912 (Single: \p{Hyphen}) (11: [\-\xad],
2913 U+058A, U+1806, U+2010..2011, U+2E17,
2914 U+30FB ...)
2915 \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
2916 \p{Ideographic_Description_Characters})
2917 (128_789)
2918 \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (985_323 plus
2919 all above-Unicode code points: [\x00-
2920 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
2921 >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
2922 \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
2923 U+02C2..02C5, U+02D2..02DF,
2924 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2925 \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (128_789:
2926 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
2927 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
2928 U+02C6..02D1, U+02E0..02E4, U+02EC,
2929 U+02EE ...)
2930 \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (125_884)
2931 \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (988_228 plus
2932 all above-Unicode code points: [\x00-
2933 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2934 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
2935 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
2936 U+02C2..02C5, U+02D2..02DF,
2937 U+02E5..02EB, U+02ED, U+02EF..036F ...)
2938 \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (125_884: [A-
2939 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
2940 \xff], U+0100..02C1, U+02C6..02D1,
2941 U+02E0..02E4, U+02EC, U+02EE ...)
2942 \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
2943 \p{Ideographic_Description_Characters})
2944 (128_789)
2945 \p{IDC: *} \p{ID_Continue: *}
2946 \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
2947 (96_190)
2948 \p{Ideo: *} \p{Ideographic: *}
2949 \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
2950 (96_190)
2951 \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_017_922
2952 plus all above-Unicode code points:
2953 U+0000..3005, U+3008..3020,
2954 U+302A..3037, U+303B..33FF,
2955 U+4DB6..4DFF, U+9FF0..F8FF ...)
2956 \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (96_190:
2957 U+3006..3007, U+3021..3029,
2958 U+3038..303A, U+3400..4DB5,
2959 U+4E00..9FEF, U+F900..FA6D ...)
2960 X \p{Ideographic_Description_Characters} \p{Block=
2961 Ideographic_Description_Characters}
2962 (Short: \p{InIDC}) (16)
2963 X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
2964 \p{Block=
2965 Ideographic_Symbols_And_Punctuation})
2966 (32)
2967 X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
2968 Ideographic_Symbols_And_Punctuation}
2969 (Short: \p{InIdeographicSymbols}) (32)
2970 \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (125_884)
2971 \p{IDS: *} \p{ID_Start: *}
2972 \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
2973 \p{IDSB}) (10)
2974 \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
2975 (1_114_102 plus all above-Unicode code
2976 points: U+0000..2FEF, U+2FF2..2FF3,
2977 U+2FFC..infinity)
2978 \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
2979 U+2FF0..2FF1, U+2FF4..2FFB)
2980 \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
2981 \p{IDST}) (2)
2982 \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
2983 (1_114_110 plus all above-Unicode code
2984 points: U+0000..2FF1, U+2FF4..infinity)
2985 \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
2986 U+2FF2..2FF3)
2987 \p{IDSB} \p{IDS_Binary_Operator} (=
2988 \p{IDS_Binary_Operator=Y}) (10)
2989 \p{IDSB: *} \p{IDS_Binary_Operator: *}
2990 \p{IDST} \p{IDS_Trinary_Operator} (=
2991 \p{IDS_Trinary_Operator=Y}) (2)
2992 \p{IDST: *} \p{IDS_Trinary_Operator: *}
2993 \p{Imperial_Aramaic} \p{Script_Extensions=Imperial_Aramaic}
2994 (Short: \p{Armi}; NOT \p{Block=
2995 Imperial_Aramaic}) (31)
2996 \p{In: *} \p{Present_In: *} (Perl extension)
2997 X \p{In_*} \p{Block: *}
2998 X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
2999 Common_Indic_Number_Forms}) (16)
3000 \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
3001 (349: U+093C, U+0941..0944, U+094D,
3002 U+0952, U+0956..0957, U+0962..0963 ...)
3003 \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
3004 BottomAndLeft}) (1: U+A9BF)
3005 \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
3006 BottomAndRight}) (2: U+1B3B, U+A9C0)
3007 \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (61:
3008 U+093F, U+094E, U+09BF, U+09C7..09C8,
3009 U+0A3F, U+0ABF ...)
3010 \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
3011 LeftAndRight}) (21: U+09CB..09CC,
3012 U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
3013 U+0DDC, U+0DDE ...)
3014 \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_936
3015 plus all above-Unicode code points:
3016 U+0000..08FF, U+0904..0939, U+093D,
3017 U+0950, U+0958..0961, U+0964..0980 ...)
3018 \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
3019 Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
3020 U+10A01, U+10A06)
3021 \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (281:
3022 U+0903, U+093B, U+093E, U+0940,
3023 U+0949..094C, U+094F ...)
3024 \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (398:
3025 U+0900..0902, U+093A, U+0945..0948,
3026 U+0951, U+0953..0955, U+0981 ...)
3027 \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
3028 TopAndBottom}) (10: U+0C48, U+0F73,
3029 U+0F76..0F79, U+0F81, U+1B3C,
3030 U+1112E..1112F)
3031 \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
3032 \p{InPC=TopAndBottomAndRight}) (1:
3033 U+1B3D)
3034 \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
3035 TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
3036 U+1C29, U+114BB, U+115B9)
3037 \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
3038 \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
3039 U+0DDD, U+17BF, U+115BB)
3040 \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
3041 TopAndRight}) (13: U+0AC9, U+0B57,
3042 U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
3043 U+1925..1926 ...)
3044 \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
3045 VisualOrderLeft}) (19: U+0E40..0E44,
3046 U+0EC0..0EC4, U+19B5..19B7, U+19BA,
3047 U+AAB5..AAB6, U+AAB9 ...)
3048 X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
3049 \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
3050 (17: U+093D, U+09BD, U+0ABD, U+0B3D,
3051 U+0C3D, U+0CBD ...)
3052 \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (86:
3053 U+0900..0902, U+0981..0982, U+09FC,
3054 U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
3055 \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
3056 \p{InSC=BrahmiJoiningNumber}) (20:
3057 U+11052..11065)
3058 \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
3059 CantillationMark}) (59: U+0951..0952,
3060 U+0A51, U+0AFA..0AFC, U+1CD0..1CD2,
3061 U+1CD4..1CE1, U+1CF4 ...)
3062 \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
3063 (2160: U+0915..0939, U+0958..095F,
3064 U+0978..097F, U+0995..09A8,
3065 U+09AA..09B0, U+09B2 ...)
3066 \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
3067 ConsonantDead}) (12: U+09CE,
3068 U+0D54..0D56, U+0D7A..0D7F, U+1CF2..1CF3)
3069 \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
3070 ConsonantFinal}) (67: U+1930..1931,
3071 U+1933..1939, U+19C1..19C7,
3072 U+1A58..1A59, U+1BBE..1BBF, U+1BF0..1BF1
3073 ...)
3074 \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
3075 \p{InSC=ConsonantHeadLetter}) (5:
3076 U+0F88..0F8C)
3077 \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
3078 \p{InSC=ConsonantInitialPostfixed}) (1:
3079 U+1A5A)
3080 \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
3081 ConsonantKiller}) (2: U+0E4C, U+17CD)
3082 \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
3083 ConsonantMedial}) (29: U+0A75,
3084 U+0EBC..0EBD, U+103B..103E,
3085 U+105E..1060, U+1082, U+1A55..1A56 ...)
3086 \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
3087 \p{InSC=ConsonantPlaceholder}) (22: [\-
3088 \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
3089 U+104E, U+1900 ...)
3090 \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
3091 \p{InSC=ConsonantPrecedingRepha}) (2:
3092 U+0D4E, U+11D46)
3093 \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
3094 ConsonantPrefixed}) (9: U+111C2..111C3,
3095 U+11A3A, U+11A84..11A89)
3096 \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
3097 ConsonantSubjoined}) (94: U+0F8D..0F97,
3098 U+0F99..0FBC, U+1929..192B, U+1A57,
3099 U+1A5B..1A5E, U+1BA1..1BA3 ...)
3100 \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
3101 \p{InSC=ConsonantSucceedingRepha}) (4:
3102 U+17CC, U+1B03, U+1B81, U+A982)
3103 \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
3104 \p{InSC=ConsonantWithStacker}) (6:
3105 U+0CF1..0CF2, U+1CF5..1CF6,
3106 U+11003..11004)
3107 \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
3108 GeminationMark}) (3: U+0A71, U+11237,
3109 U+11A98)
3110 \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
3111 InvisibleStacker}) (11: U+1039, U+17D2,
3112 U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
3113 \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
3114 U+200D)
3115 \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
3116 ModifyingLetter}) (1: U+0B83)
3117 \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
3118 NonJoiner}) (1: U+200C)
3119 \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (30:
3120 U+093C, U+09BC, U+0A3C, U+0ABC,
3121 U+0AFD..0AFF, U+0B3C ...)
3122 \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (481:
3123 [0-9], U+0966..096F, U+09E6..09EF,
3124 U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
3125 ...)
3126 \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
3127 NumberJoiner}) (1: U+1107F)
3128 \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
3129 (1_109_650 plus all above-Unicode code
3130 points: [\x00-
3131 \x20!\"#\$\%&\'\(\)*+,.\/:;<=>?\@A-
3132 Z\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
3133 \xb1\xb4-\xd6\xd8-\xff], U+0100..08FF,
3134 U+0950, U+0953..0954, U+0964..0965,
3135 U+0970..0971 ...)
3136 \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
3137 PureKiller}) (21: U+0D3B..0D3C, U+0E3A,
3138 U+0E4E, U+0EBA, U+0F84, U+103A ...)
3139 \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
3140 RegisterShifter}) (2: U+17C9..17CA)
3141 \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
3142 SyllableModifier}) (25: [\xb2-\xb3],
3143 U+09FE, U+0F35, U+0F37, U+0FC6, U+17CB
3144 ...)
3145 \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
3146 ToneLetter}) (7: U+1970..1974, U+AAC0,
3147 U+AAC2)
3148 \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
3149 (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
3150 U+1063..1064, U+1069..106D, U+1087..108D
3151 ...)
3152 \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
3153 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3154 U+0BCD ...)
3155 \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3156 (35: U+0903, U+0983, U+0A03, U+0A83,
3157 U+0B03, U+0C03 ...)
3158 \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3159 U+1963..196D, U+A85E..A861, U+A866,
3160 U+A922..A92A, U+11150..11154)
3161 \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3162 VowelDependent}) (673: U+093A..093B,
3163 U+093E..094C, U+094E..094F,
3164 U+0955..0957, U+0962..0963, U+09BE..09C4
3165 ...)
3166 \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3167 VowelIndependent}) (476: U+0904..0914,
3168 U+0960..0961, U+0972..0977,
3169 U+0985..098C, U+098F..0990, U+0993..0994
3170 ...)
3171 \p{Inherited} \p{Script_Extensions=Inherited} (Short:
3172 \p{Zinh}) (502)
3173 \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
3174 (Short: \p{Pi}) (12)
3175 \p{InPC: *} \p{Indic_Positional_Category: *}
3176 \p{InSC: *} \p{Indic_Syllabic_Category: *}
3177 \p{Inscriptional_Pahlavi} \p{Script_Extensions=
3178 Inscriptional_Pahlavi} (Short: \p{Phli};
3179 NOT \p{Block=Inscriptional_Pahlavi}) (27)
3180 \p{Inscriptional_Parthian} \p{Script_Extensions=
3181 Inscriptional_Parthian} (Short:
3182 \p{Prti}; NOT \p{Block=
3183 Inscriptional_Parthian}) (30)
3184 X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
3185 IPA_Extensions}) (96)
3186 X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
3187 \p{InIPAExt}) (96)
3188 \p{Is_*} \p{*} (Any exceptions are individually
3189 noted beginning with the word NOT.) If
3190 an entry has flag(s) at its beginning,
3191 like "D", the "Is_" form has the same
3192 flag(s)
3193 \p{Ital} \p{Old_Italic} (= \p{Script_Extensions=
3194 Old_Italic}) (NOT \p{Block=Old_Italic})
3195 (39)
3196 X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
3197 (256)
3198 X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
3199 Hangul_Jamo_Extended_A}) (32)
3200 X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
3201 Hangul_Jamo_Extended_B}) (80)
3202 \p{Java} \p{Javanese} (= \p{Script_Extensions=
3203 Javanese}) (NOT \p{Block=Javanese}) (91)
3204 \p{Javanese} \p{Script_Extensions=Javanese} (Short:
3205 \p{Java}; NOT \p{Block=Javanese}) (91)
3206 \p{Jg: *} \p{Joining_Group: *}
3207 \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
3208 \p{Join_C: *} \p{Join_Control: *}
3209 \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
3210 \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
3211 plus all above-Unicode code points:
3212 U+0000..200B, U+200E..infinity)
3213 \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2:
3214 U+200C..200D)
3215 \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
3216 U+08BB)
3217 \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
3218 U+08BD)
3219 \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (1:
3220 U+08BC)
3221 \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (8: U+0639..063A,
3222 U+06A0, U+06FC, U+075D..075F, U+08B3)
3223 \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
3224 \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10: U+0622..0623,
3225 U+0625, U+0627, U+0671..0673, U+0675,
3226 U+0773..0774)
3227 \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (24: U+0628,
3228 U+062A..062B, U+066E, U+0679..0680,
3229 U+0750..0756, U+08A0..08A1 ...)
3230 \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
3231 \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
3232 BurushaskiYehBarree}) (2: U+077A..077B)
3233 \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15: U+062F..0630,
3234 U+0688..0690, U+06EE, U+0759..075A,
3235 U+08AE)
3236 \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
3237 U+0715..0716, U+072A, U+072F)
3238 \p{Joining_Group: E} (Short: \p{Jg=E}) (1: U+0725)
3239 \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
3240 U+063D..063F, U+06CC, U+06CE,
3241 U+0775..0776)
3242 \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1: U+074F)
3243 \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10: U+0641,
3244 U+06A1..06A6, U+0760..0761, U+08A4)
3245 \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
3246 U+0724)
3247 \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (14: U+063B..063C,
3248 U+06A9, U+06AB, U+06AF..06B4,
3249 U+0762..0764, U+08B0)
3250 \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
3251 U+072E)
3252 \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (18: U+062C..062E,
3253 U+0681..0687, U+06BF, U+0757..0758,
3254 U+076E..076F, U+0772 ...)
3255 \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
3256 HamzaOnHehGoal}) (1: U+06C3)
3257 \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
3258 HanifiRohingyaKinnaYa}) (4: U+10D19,
3259 U+10D1E, U+10D20, U+10D23)
3260 \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
3261 HanifiRohingyaPa}) (3: U+10D02, U+10D09,
3262 U+10D1C)
3263 \p{Joining_Group: He} (Short: \p{Jg=He}) (1: U+0717)
3264 \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1: U+0647)
3265 \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
3266 U+06C1..06C2)
3267 \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
3268 \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6: U+0643,
3269 U+06AC..06AE, U+077F, U+08B4)
3270 \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
3271 \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
3272 \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
3273 U+06BE, U+06FF)
3274 \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (7: U+0644,
3275 U+06B5..06B8, U+076A, U+08A6)
3276 \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
3277 \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
3278 U+0866)
3279 \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
3280 U+0861)
3281 \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
3282 U+0868)
3283 \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
3284 (1: U+0869)
3285 \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
3286 U+0860)
3287 \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
3288 U+0864)
3289 \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
3290 (1: U+0865)
3291 \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
3292 U+0862)
3293 \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
3294 U+0867)
3295 \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
3296 U+086A)
3297 \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
3298 U+0863)
3299 \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
3300 ManichaeanAleph}) (1: U+10AC0)
3301 \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
3302 (2: U+10AD9..10ADA)
3303 \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
3304 (2: U+10AC1..10AC2)
3305 \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
3306 ManichaeanDaleth}) (1: U+10AC5)
3307 \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
3308 ManichaeanDhamedh}) (1: U+10AD4)
3309 \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
3310 (1: U+10AEC)
3311 \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
3312 ManichaeanGimel}) (2: U+10AC3..10AC4)
3313 \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
3314 (1: U+10ACD)
3315 \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
3316 ManichaeanHundred}) (1: U+10AEF)
3317 \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
3318 (3: U+10AD0..10AD2)
3319 \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
3320 ManichaeanLamedh}) (1: U+10AD3)
3321 \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
3322 (1: U+10AD6)
3323 \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
3324 (1: U+10AD7)
3325 \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
3326 (1: U+10AEB)
3327 \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
3328 U+10ADB..10ADC)
3329 \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
3330 (3: U+10ADE..10AE0)
3331 \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
3332 (1: U+10AE1)
3333 \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
3334 ManichaeanSadhe}) (1: U+10ADD)
3335 \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
3336 ManichaeanSamekh}) (1: U+10AD8)
3337 \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
3338 (1: U+10AE4)
3339 \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
3340 (1: U+10AED)
3341 \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
3342 (1: U+10ACE)
3343 \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
3344 ManichaeanThamedh}) (1: U+10AD5)
3345 \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
3346 ManichaeanTwenty}) (1: U+10AEE)
3347 \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
3348 (1: U+10AC7)
3349 \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
3350 (1: U+10ACF)
3351 \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
3352 ManichaeanZayin}) (2: U+10AC9..10ACA)
3353 \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
3354 U+0765..0766, U+08A7)
3355 \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1: U+0721)
3356 \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
3357 (1_113_800 plus all above-Unicode code
3358 points: U+0000..061F, U+0621, U+0640,
3359 U+064B..066D, U+0670, U+0674 ...)
3360 \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8: U+0646,
3361 U+06B9..06BC, U+0767..0769)
3362 \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1: U+0722)
3363 \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1: U+06BD)
3364 \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1: U+0726)
3365 \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5: U+0642, U+066F,
3366 U+06A7..06A8, U+08A5)
3367 \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
3368 \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (19: U+0631..0632,
3369 U+0691..0699, U+06EF, U+075B,
3370 U+076B..076C, U+0771 ...)
3371 \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
3372 U+0727)
3373 \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
3374 U+08AC)
3375 \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6: U+0635..0636,
3376 U+069D..069E, U+06FB, U+08AF)
3377 \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
3378 \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
3379 U+069A..069C, U+06FA, U+075C, U+076D,
3380 U+0770 ...)
3381 \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
3382 \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
3383 \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
3384 U+08B1)
3385 \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
3386 \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
3387 \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4: U+0637..0638,
3388 U+069F, U+08A3)
3389 \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1: U+072C)
3390 \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
3391 U+0629, U+06C0, U+06D5)
3392 \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
3393 Hamza_On_Heh_Goal} (1)
3394 \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
3395 \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
3396 U+0676..0677, U+06C4..06CB, U+06CF,
3397 U+0778..0779 ...)
3398 \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
3399 U+0649..064A, U+0678, U+06D0..06D1,
3400 U+0777 ...)
3401 \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
3402 U+06D2..06D3)
3403 \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
3404 U+06CD)
3405 \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
3406 \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
3407 \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
3408 \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
3409 \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (4)
3410 \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (565)
3411 \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (565: U+0620,
3412 U+0626, U+0628, U+062A..062E,
3413 U+0633..063F, U+0641..0647 ...)
3414 \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4: U+0640,
3415 U+07FA, U+180A, U+200D)
3416 \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (4)
3417 \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (4: U+A872,
3418 U+10ACD, U+10AD7, U+10D00)
3419 \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_437 plus
3420 all above-Unicode code points: [\x00-
3421 \xac\xae-\xff], U+0100..02FF,
3422 U+0370..0482, U+048A..0590, U+05BE,
3423 U+05C0 ...)
3424 \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (118)
3425 \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (118:
3426 U+0622..0625, U+0627, U+0629,
3427 U+062F..0632, U+0648, U+0671..0673 ...)
3428 \p{Joining_Type: T} \p{Joining_Type=Transparent} (1984)
3429 \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1984: [\xad],
3430 U+0300..036F, U+0483..0489,
3431 U+0591..05BD, U+05BF, U+05C1..05C2 ...)
3432 \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_437
3433 plus all above-Unicode code points)
3434 \p{Jt: *} \p{Joining_Type: *}
3435 \p{Kaithi} \p{Script_Extensions=Kaithi} (Short:
3436 \p{Kthi}; NOT \p{Block=Kaithi}) (87)
3437 \p{Kali} \p{Kayah_Li} (= \p{Script_Extensions=
3438 Kayah_Li}) (48)
3439 \p{Kana} \p{Katakana} (= \p{Script_Extensions=
3440 Katakana}) (NOT \p{Block=Katakana}) (356)
3441 X \p{Kana_Ext_A} \p{Kana_Extended_A} (= \p{Block=
3442 Kana_Extended_A}) (48)
3443 X \p{Kana_Extended_A} \p{Block=Kana_Extended_A} (Short:
3444 \p{InKanaExtA}) (48)
3445 X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
3446 Kana_Supplement}) (256)
3447 X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
3448 \p{InKanaSup}) (256)
3449 X \p{Kanbun} \p{Block=Kanbun} (16)
3450 X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
3451 Kangxi_Radicals}) (224)
3452 X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
3453 \p{InKangxi}) (224)
3454 \p{Kannada} \p{Script_Extensions=Kannada} (Short:
3455 \p{Knda}; NOT \p{Block=Kannada}) (104)
3456 \p{Katakana} \p{Script_Extensions=Katakana} (Short:
3457 \p{Kana}; NOT \p{Block=Katakana}) (356)
3458 X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
3459 \p{Block=Katakana_Phonetic_Extensions})
3460 (16)
3461 X \p{Katakana_Phonetic_Extensions} \p{Block=
3462 Katakana_Phonetic_Extensions} (Short:
3463 \p{InKatakanaExt}) (16)
3464 \p{Kayah_Li} \p{Script_Extensions=Kayah_Li} (Short:
3465 \p{Kali}) (48)
3466 \p{Khar} \p{Kharoshthi} (= \p{Script_Extensions=
3467 Kharoshthi}) (NOT \p{Block=Kharoshthi})
3468 (68)
3469 \p{Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
3470 \p{Khar}; NOT \p{Block=Kharoshthi}) (68)
3471 \p{Khmer} \p{Script_Extensions=Khmer} (Short:
3472 \p{Khmr}; NOT \p{Block=Khmer}) (146)
3473 X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
3474 \p{Khmr} \p{Khmer} (= \p{Script_Extensions=Khmer})
3475 (NOT \p{Block=Khmer}) (146)
3476 \p{Khoj} \p{Khojki} (= \p{Script_Extensions=
3477 Khojki}) (NOT \p{Block=Khojki}) (82)
3478 \p{Khojki} \p{Script_Extensions=Khojki} (Short:
3479 \p{Khoj}; NOT \p{Block=Khojki}) (82)
3480 \p{Khudawadi} \p{Script_Extensions=Khudawadi} (Short:
3481 \p{Sind}; NOT \p{Block=Khudawadi}) (81)
3482 \p{Knda} \p{Kannada} (= \p{Script_Extensions=
3483 Kannada}) (NOT \p{Block=Kannada}) (104)
3484 \p{Kthi} \p{Kaithi} (= \p{Script_Extensions=
3485 Kaithi}) (NOT \p{Block=Kaithi}) (87)
3486 \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
3487 (125_643)
3488 X \p{L&} \p{Cased_Letter} (= \p{General_Category=
3489 Cased_Letter}) (3970)
3490 X \p{L_} \p{Cased_Letter} (= \p{General_Category=
3491 Cased_Letter}) Note the trailing '_'
3492 matters in spite of loose matching
3493 rules. (3970)
3494 \p{Lana} \p{Tai_Tham} (= \p{Script_Extensions=
3495 Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
3496 \p{Lao} \p{Script_Extensions=Lao} (NOT \p{Block=
3497 Lao}) (82)
3498 \p{Laoo} \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
3499 \p{Block=Lao}) (82)
3500 \p{Latin} \p{Script_Extensions=Latin} (Short:
3501 \p{Latn}) (1387)
3502 X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
3503 Latin_1_Supplement}) (128)
3504 X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
3505 Latin_1_Supplement}) (128)
3506 X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
3507 \p{InLatin1}) (128)
3508 X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
3509 Latin_Extended_A}) (128)
3510 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
3511 \p{Block=Latin_Extended_Additional})
3512 (256)
3513 X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
3514 Latin_Extended_B}) (208)
3515 X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
3516 Latin_Extended_C}) (32)
3517 X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
3518 Latin_Extended_D}) (224)
3519 X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block=
3520 Latin_Extended_E}) (64)
3521 X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
3522 \p{InLatinExtA}) (128)
3523 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
3524 (Short: \p{InLatinExtAdditional}) (256)
3525 X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
3526 \p{InLatinExtB}) (208)
3527 X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
3528 \p{InLatinExtC}) (32)
3529 X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
3530 \p{InLatinExtD}) (224)
3531 X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short:
3532 \p{InLatinExtE}) (64)
3533 \p{Latn} \p{Latin} (= \p{Script_Extensions=Latin})
3534 (1387)
3535 \p{Lb: *} \p{Line_Break: *}
3536 \p{LC} \p{Cased_Letter} (= \p{General_Category=
3537 Cased_Letter}) (3970)
3538 \p{Lepc} \p{Lepcha} (= \p{Script_Extensions=
3539 Lepcha}) (NOT \p{Block=Lepcha}) (74)
3540 \p{Lepcha} \p{Script_Extensions=Lepcha} (Short:
3541 \p{Lepc}; NOT \p{Block=Lepcha}) (74)
3542 \p{Letter} \p{General_Category=Letter} (Short: \p{L})
3543 (125_643)
3544 \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
3545 \p{Nl}) (236)
3546 X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
3547 \p{Limb} \p{Limbu} (= \p{Script_Extensions=Limbu})
3548 (NOT \p{Block=Limbu}) (69)
3549 \p{Limbu} \p{Script_Extensions=Limbu} (Short:
3550 \p{Limb}; NOT \p{Block=Limbu}) (69)
3551 \p{Lina} \p{Linear_A} (= \p{Script_Extensions=
3552 Linear_A}) (NOT \p{Block=Linear_A}) (386)
3553 \p{Linb} \p{Linear_B} (= \p{Script_Extensions=
3554 Linear_B}) (268)
3555 \p{Line_Break: AI} \p{Line_Break=Ambiguous} (707)
3556 \p{Line_Break: AL} \p{Line_Break=Alphabetic} (20_582)
3557 \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (20_582: [#&*<=>\@A-
3558 Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
3559 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
3560 U+02CE..02CF, U+02D1..02D7, U+02DC,
3561 U+02DE ...)
3562 \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-
3563 \xa8\xaa\xb2-\xb3\xb6-\xba\xbc-
3564 \xbe\xd7\xf7], U+02C7, U+02C9..02CB,
3565 U+02CD, U+02D0, U+02D8..02DB ...)
3566 \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
3567 \p{Line_Break: BA} \p{Line_Break=Break_After} (239)
3568 \p{Line_Break: BB} \p{Line_Break=Break_Before} (45)
3569 \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
3570 \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (239: [\t\|\xad],
3571 U+058A, U+05BE, U+0964..0965,
3572 U+0E5A..0E5B, U+0F0B ...)
3573 \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (45: [\xb4],
3574 U+02C8, U+02CC, U+02DF, U+0C77, U+0C84
3575 ...)
3576 \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
3577 U+2E3A..2E3B)
3578 \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
3579 \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
3580 \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
3581 \p{Line_Break: CJ} \p{Line_Break=
3582 Conditional_Japanese_Starter} (58)
3583 \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (91)
3584 \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
3585 \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (91: [\}],
3586 U+0F3B, U+0F3D, U+169C, U+2046, U+207E
3587 ...)
3588 \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (2260)
3589 \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2260:
3590 [^\t\n\cK\f\r\x20-\x7e\x85\xa0-\xff],
3591 U+0300..034E, U+0350..035B,
3592 U+0363..036F, U+0483..0489, U+0591..05BD
3593 ...)
3594 \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (750:
3595 U+0E01..0E3A, U+0E40..0E4E,
3596 U+0E81..0E82, U+0E84, U+0E86..0E8A,
3597 U+0E8C..0EA3 ...)
3598 \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
3599 (58: U+3041, U+3043, U+3045, U+3047,
3600 U+3049, U+3063 ...)
3601 \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
3602 \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
3603 \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
3604 \p{Line_Break: E_Base} (Short: \p{Lb=EB}) (120: U+261D, U+26F9,
3605 U+270A..270D, U+1F385, U+1F3C2..1F3C4,
3606 U+1F3C7 ...)
3607 \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
3608 \p{Line_Break: EB} \p{Line_Break=E_Base} (120)
3609 \p{Line_Break: EM} \p{Line_Break=E_Modifier} (5)
3610 \p{Line_Break: EX} \p{Line_Break=Exclamation} (37)
3611 \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37: [!?], U+05C6,
3612 U+061B, U+061E..061F, U+06D4, U+07F9 ...)
3613 \p{Line_Break: GL} \p{Line_Break=Glue} (25)
3614 \p{Line_Break: Glue} (Short: \p{Lb=GL}) (25: [\xa0], U+034F,
3615 U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
3616 \p{Line_Break: H2} (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
3617 U+AC38, U+AC54, U+AC70, U+AC8C ...)
3618 \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
3619 U+AC1D..AC37, U+AC39..AC53,
3620 U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
3621 ...)
3622 \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75:
3623 U+05D0..05EA, U+05EF..05F2, U+FB1D,
3624 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
3625 ...)
3626 \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (75)
3627 \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
3628 \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1: [\-])
3629 \p{Line_Break: ID} \p{Line_Break=Ideographic} (172_693)
3630 \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_693:
3631 U+231A..231B, U+23F0..23F3,
3632 U+2600..2603, U+2614..2615, U+2618,
3633 U+261A..261C ...)
3634 \p{Line_Break: IN} \p{Line_Break=Inseparable} (6)
3635 \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
3636 U+037E, U+0589, U+060C..060D, U+07F8,
3637 U+2044 ...)
3638 \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
3639 U+22EF, U+FE19, U+10AF6)
3640 \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
3641 \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
3642 \p{Line_Break: JL} (Short: \p{Lb=JL}) (125: U+1100..115F,
3643 U+A960..A97C)
3644 \p{Line_Break: JT} (Short: \p{Lb=JT}) (137: U+11A8..11FF,
3645 U+D7CB..D7FB)
3646 \p{Line_Break: JV} (Short: \p{Lb=JV}) (95: U+1160..11A7,
3647 U+D7B0..D7C6)
3648 \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
3649 \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
3650 \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
3651 U+2028..2029)
3652 \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
3653 \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
3654 \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6,
3655 U+203C..203D, U+2047..2049, U+3005,
3656 U+301C, U+303B..303C ...)
3657 \p{Line_Break: NS} \p{Line_Break=Nonstarter} (33)
3658 \p{Line_Break: NU} \p{Line_Break=Numeric} (622)
3659 \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (622: [0-9],
3660 U+0660..0669, U+066B..066C,
3661 U+06F0..06F9, U+07C0..07C9, U+0966..096F
3662 ...)
3663 \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (88)
3664 \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (88:
3665 [\(\[\{\xa1\xbf], U+0F3A, U+0F3C,
3666 U+169B, U+201A, U+201E ...)
3667 \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (36)
3668 \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (36:
3669 [\%\xa2\xb0], U+0609..060B, U+066A,
3670 U+09F2..09F3, U+09F9, U+0D79 ...)
3671 \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (68)
3672 \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (68: [\$+\\\xa3-
3673 \xa5\xb1], U+058F, U+07FE..07FF, U+09FB,
3674 U+0AF1, U+0BF9 ...)
3675 \p{Line_Break: QU} \p{Line_Break=Quotation} (39)
3676 \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
3677 U+2018..2019, U+201B..201D, U+201F,
3678 U+2039..203A, U+275B..2760 ...)
3679 \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
3680 U+1F1E6..1F1FF)
3681 \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
3682 \p{Line_Break: SA} \p{Line_Break=Complex_Context} (750)
3683 D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
3684 \p{Line_Break: SP} \p{Line_Break=Space} (1)
3685 \p{Line_Break: Space} (Short: \p{Lb=SP}) (1: [\x20])
3686 D \p{Line_Break: Surrogate} Surrogates should never appear in well-
3687 formed text, and therefore shouldn't be
3688 the basis for line breaking (Short:
3689 \p{Lb=SG}) (2048: U+D800..DFFF)
3690 \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
3691 \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (901_897 plus all
3692 above-Unicode code points: U+0378..0379,
3693 U+0380..0383, U+038B, U+038D, U+03A2,
3694 U+0530 ...)
3695 \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
3696 \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
3697 \p{Line_Break: XX} \p{Line_Break=Unknown} (901_897 plus all
3698 above-Unicode code points)
3699 \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
3700 \p{Line_Break: ZWJ} (Short: \p{Lb=ZWJ}) (1: U+200D)
3701 \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
3702 \p{Line_Separator} \p{General_Category=Line_Separator}
3703 (Short: \p{Zl}) (1)
3704 \p{Linear_A} \p{Script_Extensions=Linear_A} (Short:
3705 \p{Lina}; NOT \p{Block=Linear_A}) (386)
3706 \p{Linear_B} \p{Script_Extensions=Linear_B} (Short:
3707 \p{Linb}) (268)
3708 X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
3709 X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
3710 \p{Lisu} \p{Script_Extensions=Lisu} (48)
3711 \p{Ll} \p{Lowercase_Letter} (=
3712 \p{General_Category=Lowercase_Letter})
3713 (/i= General_Category=Cased_Letter)
3714 (2151)
3715 \p{Lm} \p{Modifier_Letter} (=
3716 \p{General_Category=Modifier_Letter})
3717 (259)
3718 \p{Lo} \p{Other_Letter} (= \p{General_Category=
3719 Other_Letter}) (121_414)
3720 \p{LOE} \p{Logical_Order_Exception} (=
3721 \p{Logical_Order_Exception=Y}) (19)
3722 \p{LOE: *} \p{Logical_Order_Exception: *}
3723 \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
3724 \p{LOE}) (19)
3725 \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
3726 (1_114_093 plus all above-Unicode code
3727 points: U+0000..0E3F, U+0E45..0EBF,
3728 U+0EC5..19B4, U+19B8..19B9,
3729 U+19BB..AAB4, U+AAB7..AAB8 ...)
3730 \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
3731 U+0E40..0E44, U+0EC0..0EC4,
3732 U+19B5..19B7, U+19BA, U+AAB5..AAB6,
3733 U+AAB9 ...)
3734 X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
3735 \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3736 Cased=Yes) (2340)
3737 \p{Lower: *} \p{Lowercase: *}
3738 \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
3739 Cased=Yes) (2340)
3740 \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
3741 No) (1_111_772 plus all above-Unicode
3742 code points: [\x00-
3743 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
3744 Z\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
3745 \xb4\xb6-\xb9\xbb-\xde\xf7], U+0100,
3746 U+0102, U+0104, U+0106, U+0108 ...)
3747 \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
3748 Yes) (2340: [a-z\xaa\xb5\xba\xdf-
3749 \xf6\xf8-\xff], U+0101, U+0103, U+0105,
3750 U+0107, U+0109 ...)
3751 \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
3752 (Short: \p{Ll}; /i= General_Category=
3753 Cased_Letter) (2151)
3754 \p{Lt} \p{Titlecase_Letter} (=
3755 \p{General_Category=Titlecase_Letter})
3756 (/i= General_Category=Cased_Letter) (31)
3757 \p{Lu} \p{Uppercase_Letter} (=
3758 \p{General_Category=Uppercase_Letter})
3759 (/i= General_Category=Cased_Letter)
3760 (1788)
3761 \p{Lyci} \p{Lycian} (= \p{Script_Extensions=
3762 Lycian}) (NOT \p{Block=Lycian}) (29)
3763 \p{Lycian} \p{Script_Extensions=Lycian} (Short:
3764 \p{Lyci}; NOT \p{Block=Lycian}) (29)
3765 \p{Lydi} \p{Lydian} (= \p{Script_Extensions=
3766 Lydian}) (NOT \p{Block=Lydian}) (27)
3767 \p{Lydian} \p{Script_Extensions=Lydian} (Short:
3768 \p{Lydi}; NOT \p{Block=Lydian}) (27)
3769 \p{M} \pM \p{Mark} (= \p{General_Category=Mark})
3770 (2268)
3771 \p{Mahajani} \p{Script_Extensions=Mahajani} (Short:
3772 \p{Mahj}; NOT \p{Block=Mahajani}) (61)
3773 \p{Mahj} \p{Mahajani} (= \p{Script_Extensions=
3774 Mahajani}) (NOT \p{Block=Mahajani}) (61)
3775 X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
3776 Mahjong_Tiles}) (48)
3777 X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
3778 \p{InMahjong}) (48)
3779 \p{Maka} \p{Makasar} (= \p{Script_Extensions=
3780 Makasar}) (NOT \p{Block=Makasar}) (25)
3781 \p{Makasar} \p{Script_Extensions=Makasar} (Short:
3782 \p{Maka}; NOT \p{Block=Makasar}) (25)
3783 \p{Malayalam} \p{Script_Extensions=Malayalam} (Short:
3784 \p{Mlym}; NOT \p{Block=Malayalam}) (125)
3785 \p{Mand} \p{Mandaic} (= \p{Script_Extensions=
3786 Mandaic}) (NOT \p{Block=Mandaic}) (30)
3787 \p{Mandaic} \p{Script_Extensions=Mandaic} (Short:
3788 \p{Mand}; NOT \p{Block=Mandaic}) (30)
3789 \p{Mani} \p{Manichaean} (= \p{Script_Extensions=
3790 Manichaean}) (NOT \p{Block=Manichaean})
3791 (52)
3792 \p{Manichaean} \p{Script_Extensions=Manichaean} (Short:
3793 \p{Mani}; NOT \p{Block=Manichaean}) (52)
3794 \p{Marc} \p{Marchen} (= \p{Script_Extensions=
3795 Marchen}) (NOT \p{Block=Marchen}) (68)
3796 \p{Marchen} \p{Script_Extensions=Marchen} (Short:
3797 \p{Marc}; NOT \p{Block=Marchen}) (68)
3798 \p{Mark} \p{General_Category=Mark} (Short: \p{M})
3799 (2268)
3800 \p{Masaram_Gondi} \p{Script_Extensions=Masaram_Gondi}
3801 (Short: \p{Gonm}; NOT \p{Block=
3802 Masaram_Gondi}) (77)
3803 \p{Math} \p{Math=Y} (2310)
3804 \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all
3805 above-Unicode code points: [\x00-
3806 \x20!\"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-
3807 Z\[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
3808 \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
3809 U+03D3..03D4, U+03D6..03EF,
3810 U+03F2..03F3, U+03F7..0605 ...)
3811 \p{Math: Y*} (Single: \p{Math}) (2310: [+<=
3812 >\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
3813 U+03D5, U+03F0..03F1, U+03F4..03F6,
3814 U+0606..0608 ...)
3815 X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
3816 \p{Block=
3817 Mathematical_Alphanumeric_Symbols})
3818 (1024)
3819 X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
3820 Mathematical_Operators}) (256)
3821 \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
3822 \p{Sm}) (948)
3823 X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
3824 Mathematical_Alphanumeric_Symbols}
3825 (Short: \p{InMathAlphanum}) (1024)
3826 X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
3827 (Short: \p{InMathOperators}) (256)
3828 X \p{Mayan_Numerals} \p{Block=Mayan_Numerals} (32)
3829 \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
3830 Spacing_Mark}) (429)
3831 \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
3832 Enclosing_Mark}) (13)
3833 \p{Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
3834 \p{Medf}; NOT \p{Block=Medefaidrin}) (91)
3835 \p{Medf} \p{Medefaidrin} (= \p{Script_Extensions=
3836 Medefaidrin}) (NOT \p{Block=
3837 Medefaidrin}) (91)
3838 \p{Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} (Short:
3839 \p{Mtei}; NOT \p{Block=Meetei_Mayek})
3840 (79)
3841 X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
3842 Meetei_Mayek_Extensions}) (32)
3843 X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
3844 (Short: \p{InMeeteiMayekExt}) (32)
3845 \p{Mend} \p{Mende_Kikakui} (= \p{Script_Extensions=
3846 Mende_Kikakui}) (NOT \p{Block=
3847 Mende_Kikakui}) (213)
3848 \p{Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
3849 (Short: \p{Mend}; NOT \p{Block=
3850 Mende_Kikakui}) (213)
3851 \p{Merc} \p{Meroitic_Cursive} (=
3852 \p{Script_Extensions=Meroitic_Cursive})
3853 (NOT \p{Block=Meroitic_Cursive}) (90)
3854 \p{Mero} \p{Meroitic_Hieroglyphs} (=
3855 \p{Script_Extensions=
3856 Meroitic_Hieroglyphs}) (32)
3857 \p{Meroitic_Cursive} \p{Script_Extensions=Meroitic_Cursive}
3858 (Short: \p{Merc}; NOT \p{Block=
3859 Meroitic_Cursive}) (90)
3860 \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
3861 Meroitic_Hieroglyphs} (Short: \p{Mero})
3862 (32)
3863 \p{Miao} \p{Script_Extensions=Miao} (NOT \p{Block=
3864 Miao}) (149)
3865 X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
3866 \p{Block=
3867 Miscellaneous_Symbols_And_Arrows}) (256)
3868 X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
3869 (= \p{Block=
3870 Miscellaneous_Mathematical_Symbols_A})
3871 (48)
3872 X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
3873 (= \p{Block=
3874 Miscellaneous_Mathematical_Symbols_B})
3875 (128)
3876 X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
3877 (= \p{Block=
3878 Miscellaneous_Symbols_And_Pictographs})
3879 (768)
3880 X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
3881 Miscellaneous_Symbols}) (256)
3882 X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
3883 Miscellaneous_Technical}) (256)
3884 X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
3885 Miscellaneous_Mathematical_Symbols_A}
3886 (Short: \p{InMiscMathSymbolsA}) (48)
3887 X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
3888 Miscellaneous_Mathematical_Symbols_B}
3889 (Short: \p{InMiscMathSymbolsB}) (128)
3890 X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
3891 \p{InMiscSymbols}) (256)
3892 X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
3893 Miscellaneous_Symbols_And_Arrows}
3894 (Short: \p{InMiscArrows}) (256)
3895 X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
3896 Miscellaneous_Symbols_And_Pictographs}
3897 (Short: \p{InMiscPictographs}) (768)
3898 X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
3899 (Short: \p{InMiscTechnical}) (256)
3900 \p{Mlym} \p{Malayalam} (= \p{Script_Extensions=
3901 Malayalam}) (NOT \p{Block=Malayalam})
3902 (125)
3903 \p{Mn} \p{Nonspacing_Mark} (=
3904 \p{General_Category=Nonspacing_Mark})
3905 (1826)
3906 \p{Modi} \p{Script_Extensions=Modi} (NOT \p{Block=
3907 Modi}) (89)
3908 \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
3909 (Short: \p{Lm}) (259)
3910 X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
3911 Spacing_Modifier_Letters}) (80)
3912 \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
3913 (Short: \p{Sk}) (121)
3914 X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
3915 \p{Mong} \p{Mongolian} (= \p{Script_Extensions=
3916 Mongolian}) (NOT \p{Block=Mongolian})
3917 (171)
3918 \p{Mongolian} \p{Script_Extensions=Mongolian} (Short:
3919 \p{Mong}; NOT \p{Block=Mongolian}) (171)
3920 X \p{Mongolian_Sup} \p{Mongolian_Supplement} (= \p{Block=
3921 Mongolian_Supplement}) (32)
3922 X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
3923 \p{InMongolianSup}) (32)
3924 \p{Mro} \p{Script_Extensions=Mro} (NOT \p{Block=
3925 Mro}) (43)
3926 \p{Mroo} \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
3927 \p{Block=Mro}) (43)
3928 \p{Mtei} \p{Meetei_Mayek} (= \p{Script_Extensions=
3929 Meetei_Mayek}) (NOT \p{Block=
3930 Meetei_Mayek}) (79)
3931 \p{Mult} \p{Multani} (= \p{Script_Extensions=
3932 Multani}) (NOT \p{Block=Multani}) (48)
3933 \p{Multani} \p{Script_Extensions=Multani} (Short:
3934 \p{Mult}; NOT \p{Block=Multani}) (48)
3935 X \p{Music} \p{Musical_Symbols} (= \p{Block=
3936 Musical_Symbols}) (256)
3937 X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
3938 \p{InMusic}) (256)
3939 \p{Myanmar} \p{Script_Extensions=Myanmar} (Short:
3940 \p{Mymr}; NOT \p{Block=Myanmar}) (224)
3941 X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
3942 Myanmar_Extended_A}) (32)
3943 X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block=
3944 Myanmar_Extended_B}) (32)
3945 X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
3946 \p{InMyanmarExtA}) (32)
3947 X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short:
3948 \p{InMyanmarExtB}) (32)
3949 \p{Mymr} \p{Myanmar} (= \p{Script_Extensions=
3950 Myanmar}) (NOT \p{Block=Myanmar}) (224)
3951 \p{N} \pN \p{Number} (= \p{General_Category=Number})
3952 (1754)
3953 \p{Nabataean} \p{Script_Extensions=Nabataean} (Short:
3954 \p{Nbat}; NOT \p{Block=Nabataean}) (40)
3955 \p{Nand} \p{Nandinagari} (= \p{Script_Extensions=
3956 Nandinagari}) (NOT \p{Block=
3957 Nandinagari}) (86)
3958 \p{Nandinagari} \p{Script_Extensions=Nandinagari} (Short:
3959 \p{Nand}; NOT \p{Block=Nandinagari}) (86)
3960 \p{Narb} \p{Old_North_Arabian} (=
3961 \p{Script_Extensions=Old_North_Arabian})
3962 (32)
3963 X \p{NB} \p{No_Block} (= \p{Block=No_Block})
3964 (832_720 plus all above-Unicode code
3965 points)
3966 \p{Nbat} \p{Nabataean} (= \p{Script_Extensions=
3967 Nabataean}) (NOT \p{Block=Nabataean})
3968 (40)
3969 \p{NChar} \p{Noncharacter_Code_Point} (=
3970 \p{Noncharacter_Code_Point=Y}) (66)
3971 \p{NChar: *} \p{Noncharacter_Code_Point: *}
3972 \p{Nd} \p{XPosixDigit} (= \p{General_Category=
3973 Decimal_Number}) (630)
3974 \p{New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
3975 \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
3976 \p{Newa} \p{Script_Extensions=Newa} (NOT \p{Block=
3977 Newa}) (94)
3978 \p{NFC_QC: *} \p{NFC_Quick_Check: *}
3979 \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (110)
3980 \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110:
3981 U+0300..0304, U+0306..030C, U+030F,
3982 U+0311, U+0313..0314, U+031B ...)
3983 \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
3984 \P{NFC_Quick_Check} NOR \P{NFC_QC})
3985 (1120)
3986 \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
3987 \P{NFC_Quick_Check} NOR \P{NFC_QC})
3988 (1120: U+0340..0341, U+0343..0344,
3989 U+0374, U+037E, U+0387, U+0958..095F ...)
3990 \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
3991 \p{NFC_Quick_Check} NOR \p{NFC_QC})
3992 (1_112_882 plus all above-Unicode code
3993 points)
3994 \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
3995 \p{NFC_Quick_Check} NOR \p{NFC_QC})
3996 (1_112_882 plus all above-Unicode code
3997 points: U+0000..02FF, U+0305,
3998 U+030D..030E, U+0310, U+0312,
3999 U+0315..031A ...)
4000 \p{NFD_QC: *} \p{NFD_Quick_Check: *}
4001 \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
4002 \P{NFD_Quick_Check} NOR \P{NFD_QC})
4003 (13_232)
4004 \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
4005 \P{NFD_Quick_Check} NOR \P{NFD_QC})
4006 (13_232: [\xc0-\xc5\xc7-\xcf\xd1-
4007 \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
4008 \xf6\xf9-\xfd\xff], U+0100..010F,
4009 U+0112..0125, U+0128..0130,
4010 U+0134..0137, U+0139..013E ...)
4011 \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
4012 \p{NFD_Quick_Check} NOR \p{NFD_QC})
4013 (1_100_880 plus all above-Unicode code
4014 points)
4015 \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
4016 \p{NFD_Quick_Check} NOR \p{NFD_QC})
4017 (1_100_880 plus all above-Unicode code
4018 points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
4019 \xdf\xe6\xf0\xf7-\xf8\xfe],
4020 U+0110..0111, U+0126..0127,
4021 U+0131..0133, U+0138, U+013F..0142 ...)
4022 \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
4023 \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
4024 \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110:
4025 U+0300..0304, U+0306..030C, U+030F,
4026 U+0311, U+0313..0314, U+031B ...)
4027 \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
4028 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4029 (4796)
4030 \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
4031 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4032 (4796: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4033 \xba\xbc-\xbe], U+0132..0133,
4034 U+013F..0140, U+0149, U+017F,
4035 U+01C4..01CC ...)
4036 \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
4037 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4038 (1_109_206 plus all above-Unicode code
4039 points)
4040 \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
4041 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4042 (1_109_206 plus all above-Unicode code
4043 points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4044 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
4045 U+0100..0131, U+0134..013E,
4046 U+0141..0148, U+014A..017E, U+0180..01C3
4047 ...)
4048 \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
4049 \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
4050 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4051 (16_896)
4052 \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
4053 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4054 (16_896: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4055 \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
4056 \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
4057 \xf6\xf9-\xfd\xff], U+0100..010F,
4058 U+0112..0125, U+0128..0130,
4059 U+0132..0137, U+0139..0140 ...)
4060 \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
4061 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4062 (1_097_216 plus all above-Unicode code
4063 points)
4064 \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
4065 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4066 (1_097_216 plus all above-Unicode code
4067 points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4068 \xae\xb0-\xb1\xb6-
4069 \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
4070 \xdf\xe6\xf0\xf7-\xf8\xfe],
4071 U+0110..0111, U+0126..0127, U+0131,
4072 U+0138, U+0141..0142 ...)
4073 \p{Nko} \p{Script_Extensions=Nko} (NOT \p{Block=
4074 NKo}) (62)
4075 \p{Nkoo} \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
4076 \p{Block=NKo}) (62)
4077 \p{Nl} \p{Letter_Number} (= \p{General_Category=
4078 Letter_Number}) (236)
4079 \p{No} \p{Other_Number} (= \p{General_Category=
4080 Other_Number}) (888)
4081 X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
4082 (832_720 plus all above-Unicode code
4083 points)
4084 \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
4085 \p{NChar}) (66)
4086 \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
4087 (1_114_046 plus all above-Unicode code
4088 points: U+0000..FDCF, U+FDF0..FFFD,
4089 U+10000..1FFFD, U+20000..2FFFD,
4090 U+30000..3FFFD, U+40000..4FFFD ...)
4091 \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
4092 (66: U+FDD0..FDEF, U+FFFE..FFFF,
4093 U+1FFFE..1FFFF, U+2FFFE..2FFFF,
4094 U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
4095 \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
4096 (Short: \p{Mn}) (1826)
4097 \p{Nshu} \p{Nushu} (= \p{Script_Extensions=Nushu})
4098 (NOT \p{Block=Nushu}) (397)
4099 \p{Nt: *} \p{Numeric_Type: *}
4100 \p{Number} \p{General_Category=Number} (Short: \p{N})
4101 (1754)
4102 X \p{Number_Forms} \p{Block=Number_Forms} (64)
4103 \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (630)
4104 \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (630: [0-9],
4105 U+0660..0669, U+06F0..06F9,
4106 U+07C0..07C9, U+0966..096F, U+09E6..09EF
4107 ...)
4108 \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
4109 \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
4110 U+1369..1371, U+19DA, U+2070,
4111 U+2074..2079, U+2080..2089 ...)
4112 \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_277 plus all
4113 above-Unicode code points: [\x00-
4114 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
4115 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
4116 \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
4117 U+066A..06EF, U+06FA..07BF,
4118 U+07CA..0965, U+0970..09E5 ...)
4119 \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (1077)
4120 \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1077: [\xbc-\xbe],
4121 U+09F4..09F9, U+0B72..0B77,
4122 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
4123 ...)
4124 T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
4125 T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (81: [0], U+0660,
4126 U+06F0, U+07C0, U+0966, U+09E6 ...)
4127 T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
4128 U+11FD4)
4129 T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
4130 T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
4131 T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
4132 T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
4133 T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
4134 T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
4135 T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
4136 T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
4137 T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
4138 U+0D76, U+A833, U+11FC9..11FCA)
4139 T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
4140 T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
4141 U+11FCB)
4142 T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1: U+2151)
4143 T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
4144 U+0D77, U+215B, U+A834, U+11FCC ...)
4145 T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1: U+2150)
4146 T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
4147 T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
4148 U+12461, U+1ED3D)
4149 T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
4150 U+0D78, U+A835, U+11FCE)
4151 T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
4152 U+11FCF)
4153 T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
4154 U+0B72, U+0D73, U+A830, U+10140 ...)
4155 T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
4156 U+10E7D, U+1245A, U+1245D, U+12465)
4157 T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1: U+215C)
4158 T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1: U+2156)
4159 T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
4160 T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
4161 U+0D74, U+0F2A, U+2CFD, U+A831 ...)
4162 T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
4163 T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1: U+2157)
4164 T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1: U+215D)
4165 T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
4166 U+109FD, U+10E7E, U+1245B, U+1245E ...)
4167 T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
4168 U+0B74, U+0D75, U+A832, U+10178 ...)
4169 T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1: U+2158)
4170 T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
4171 U+1245C)
4172 T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1: U+215E)
4173 T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
4174 T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (137: [1\xb9], U+0661,
4175 U+06F1, U+07C1, U+0967, U+09E7 ...)
4176 T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1: U+0F2B)
4177 T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (136: [2\xb2], U+0662,
4178 U+06F2, U+07C2, U+0968, U+09E8 ...)
4179 T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1: U+0F2C)
4180 T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (137: [3\xb3], U+0663,
4181 U+06F3, U+07C3, U+0969, U+09E9 ...)
4182 T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1: U+0F2D)
4183 T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (128: [4], U+0664,
4184 U+06F4, U+07C4, U+096A, U+09EA ...)
4185 T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1: U+0F2E)
4186 T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (127: [5], U+0665,
4187 U+06F5, U+07C5, U+096B, U+09EB ...)
4188 T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
4189 T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (111: [6], U+0666,
4190 U+06F6, U+07C6, U+096C, U+09EC ...)
4191 T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
4192 T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (110: [7], U+0667,
4193 U+06F7, U+07C7, U+096D, U+09ED ...)
4194 T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
4195 T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (106: [8], U+0668,
4196 U+06F8, U+07C8, U+096E, U+09EE ...)
4197 T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
4198 T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (110: [9], U+0669,
4199 U+06F9, U+07C9, U+096F, U+09EF ...)
4200 T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (61: U+0BF0, U+0D70,
4201 U+1372, U+2169, U+2179, U+2469 ...)
4202 T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (8: U+216A, U+217A,
4203 U+246A, U+247E, U+2492, U+24EB ...)
4204 T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (8: U+216B, U+217B,
4205 U+246B, U+247F, U+2493, U+24EC ...)
4206 T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (6: U+246C, U+2480,
4207 U+2494, U+24ED, U+16E8D, U+1D2ED)
4208 T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (6: U+246D, U+2481,
4209 U+2495, U+24EE, U+16E8E, U+1D2EE)
4210 T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (6: U+246E, U+2482,
4211 U+2496, U+24EF, U+16E8F, U+1D2EF)
4212 T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (7: U+09F9, U+246F,
4213 U+2483, U+2497, U+24F0, U+16E90 ...)
4214 T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (7: U+16EE, U+2470,
4215 U+2484, U+2498, U+24F1, U+16E91 ...)
4216 T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (7: U+16EF, U+2471,
4217 U+2485, U+2499, U+24F2, U+16E92 ...)
4218 T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (7: U+16F0, U+2472,
4219 U+2486, U+249A, U+24F3, U+16E93 ...)
4220 T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (35: U+1373, U+2473,
4221 U+2487, U+249B, U+24F4, U+3039 ...)
4222 T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1: U+3251)
4223 T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1: U+3252)
4224 T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1: U+3253)
4225 T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1: U+3254)
4226 T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1: U+3255)
4227 T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1: U+3256)
4228 T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1: U+3257)
4229 T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1: U+3258)
4230 T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1: U+3259)
4231 T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (19: U+1374, U+303A,
4232 U+324A, U+325A, U+5345, U+10112 ...)
4233 T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1: U+325B)
4234 T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1: U+325C)
4235 T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1: U+325D)
4236 T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1: U+325E)
4237 T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1: U+325F)
4238 T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1: U+32B1)
4239 T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1: U+32B2)
4240 T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1: U+32B3)
4241 T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1: U+32B4)
4242 T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (18: U+1375, U+324B,
4243 U+32B5, U+534C, U+10113, U+102ED ...)
4244 T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1: U+32B6)
4245 T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1: U+32B7)
4246 T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1: U+32B8)
4247 T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1: U+32B9)
4248 T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1: U+32BA)
4249 T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1: U+32BB)
4250 T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1: U+32BC)
4251 T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1: U+32BD)
4252 T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1: U+32BE)
4253 T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (29: U+1376, U+216C,
4254 U+217C, U+2186, U+324C, U+32BF ...)
4255 T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (13: U+1377, U+324D,
4256 U+10115, U+102EF, U+109CE, U+10E6E ...)
4257 T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (13: U+1378, U+324E,
4258 U+10116, U+102F0, U+109CF, U+10E6F ...)
4259 T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (12: U+1379, U+324F,
4260 U+10117, U+102F1, U+10E70, U+11062 ...)
4261 T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (12: U+137A, U+10118,
4262 U+102F2, U+10341, U+10E71, U+11063 ...)
4263 T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (34: U+0BF1, U+0D71,
4264 U+137B, U+216D, U+217D, U+4F70 ...)
4265 T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
4266 U+109D3, U+10E73, U+1EC84, U+1ED14)
4267 T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
4268 U+102F5, U+109D4, U+10E74, U+1EC85 ...)
4269 T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
4270 U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
4271 T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (16: U+216E, U+217E,
4272 U+1011D, U+10145, U+1014C, U+10153 ...)
4273 T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
4274 U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
4275 T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
4276 U+109D8, U+10E78, U+1EC89, U+1ED19)
4277 T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (6: U+10120, U+102FA,
4278 U+109D9, U+10E79, U+1EC8A, U+1ED1A)
4279 T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (7: U+10121, U+102FB,
4280 U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
4281 T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
4282 U+216F, U+217F..2180, U+4EDF, U+5343 ...)
4283 T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
4284 U+1EC8D, U+1ED1D, U+1ED3A)
4285 T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
4286 U+1EC8E, U+1ED1E)
4287 T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
4288 U+1EC8F, U+1ED1F)
4289 T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
4290 U+10146, U+1014E, U+10172, U+109DF ...)
4291 T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
4292 U+1EC91, U+1ED21)
4293 T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
4294 U+1EC92, U+1ED22)
4295 T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
4296 U+1EC93, U+1ED23)
4297 T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
4298 U+1EC94, U+1ED24)
4299 T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
4300 U+137C, U+2182, U+4E07, U+842C, U+1012B,
4301 U+10155 ...)
4302 T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
4303 U+1012C, U+109E5, U+1EC96, U+1ED26)
4304 T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
4305 U+1012D, U+109E6, U+1EC97, U+1ED27)
4306 T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
4307 U+1012E, U+109E7, U+1EC98, U+1ED28)
4308 T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
4309 U+2187, U+1012F, U+10147, U+10156,
4310 U+109E8, U+1EC99 ...)
4311 T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
4312 U+10130, U+109E9, U+1EC9A, U+1ED2A)
4313 T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
4314 U+10131, U+109EA, U+1EC9B, U+1ED2B)
4315 T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
4316 U+10132, U+109EB, U+1EC9C, U+1ED2C)
4317 T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
4318 U+10133, U+109EC, U+1EC9D, U+1ED2D)
4319 T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
4320 U+2188, U+109ED, U+1EC9E, U+1ECA0,
4321 U+1ECB4)
4322 T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
4323 U+109EE, U+1EC9F)
4324 T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
4325 U+12432)
4326 T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
4327 U+109EF)
4328 T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
4329 U+109F0)
4330 T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
4331 U+12433)
4332 T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
4333 U+109F1)
4334 T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
4335 U+109F2)
4336 T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
4337 U+109F3)
4338 T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
4339 U+109F4)
4340 T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
4341 U+109F5)
4342 T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
4343 U+16B5E)
4344 T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
4345 (1: U+1ECA1)
4346 T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
4347 (1: U+1ECA2)
4348 T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
4349 (3: U+4EBF, U+5104, U+16B5F)
4350 T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
4351 10000000000}) (1: U+16B60)
4352 T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
4353 1000000000000}) (2: U+5146, U+16B61)
4354 \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_277 plus all
4355 above-Unicode code points: [\x00-
4356 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
4357 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
4358 \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
4359 U+066A..06EF, U+06FA..07BF,
4360 U+07CA..0965, U+0970..09E5 ...)
4361 \p{Nushu} \p{Script_Extensions=Nushu} (Short:
4362 \p{Nshu}; NOT \p{Block=Nushu}) (397)
4363 \p{Nv: *} \p{Numeric_Value: *}
4364 \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
4365 Nyiakeng_Puachue_Hmong} (Short:
4366 \p{Hmnp}; NOT \p{Block=
4367 Nyiakeng_Puachue_Hmong}) (71)
4368 X \p{OCR} \p{Optical_Character_Recognition} (=
4369 \p{Block=Optical_Character_Recognition})
4370 (32)
4371 \p{Ogam} \p{Ogham} (= \p{Script_Extensions=Ogham})
4372 (NOT \p{Block=Ogham}) (29)
4373 \p{Ogham} \p{Script_Extensions=Ogham} (Short:
4374 \p{Ogam}; NOT \p{Block=Ogham}) (29)
4375 \p{Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
4376 \p{Olck}) (48)
4377 \p{Olck} \p{Ol_Chiki} (= \p{Script_Extensions=
4378 Ol_Chiki}) (48)
4379 \p{Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
4380 (Short: \p{Hung}; NOT \p{Block=
4381 Old_Hungarian}) (108)
4382 \p{Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
4383 \p{Ital}; NOT \p{Block=Old_Italic}) (39)
4384 \p{Old_North_Arabian} \p{Script_Extensions=Old_North_Arabian}
4385 (Short: \p{Narb}) (32)
4386 \p{Old_Permic} \p{Script_Extensions=Old_Permic} (Short:
4387 \p{Perm}; NOT \p{Block=Old_Permic}) (44)
4388 \p{Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
4389 \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
4390 \p{Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
4391 \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
4392 \p{Old_South_Arabian} \p{Script_Extensions=Old_South_Arabian}
4393 (Short: \p{Sarb}) (32)
4394 \p{Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
4395 \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
4396 \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
4397 (Short: \p{Ps}) (75)
4398 X \p{Optical_Character_Recognition} \p{Block=
4399 Optical_Character_Recognition} (Short:
4400 \p{InOCR}) (32)
4401 \p{Oriya} \p{Script_Extensions=Oriya} (Short:
4402 \p{Orya}; NOT \p{Block=Oriya}) (96)
4403 \p{Orkh} \p{Old_Turkic} (= \p{Script_Extensions=
4404 Old_Turkic}) (NOT \p{Block=Old_Turkic})
4405 (73)
4406 X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
4407 \p{Orya} \p{Oriya} (= \p{Script_Extensions=Oriya})
4408 (NOT \p{Block=Oriya}) (96)
4409 \p{Osage} \p{Script_Extensions=Osage} (Short:
4410 \p{Osge}; NOT \p{Block=Osage}) (72)
4411 \p{Osge} \p{Osage} (= \p{Script_Extensions=Osage})
4412 (NOT \p{Block=Osage}) (72)
4413 \p{Osma} \p{Osmanya} (= \p{Script_Extensions=
4414 Osmanya}) (NOT \p{Block=Osmanya}) (40)
4415 \p{Osmanya} \p{Script_Extensions=Osmanya} (Short:
4416 \p{Osma}; NOT \p{Block=Osmanya}) (40)
4417 \p{Other} \p{General_Category=Other} (Short: \p{C})
4418 (976_344 plus all above-Unicode code
4419 points)
4420 \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
4421 \p{Lo}) (121_414)
4422 \p{Other_Number} \p{General_Category=Other_Number} (Short:
4423 \p{No}) (888)
4424 \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
4425 (Short: \p{Po}) (588)
4426 \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
4427 \p{So}) (6161)
4428 X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
4429 \p{P} \pP \p{Punct} (= \p{General_Category=
4430 Punctuation}) (NOT
4431 \p{General_Punctuation}) (792)
4432 \p{Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} (Short:
4433 \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
4434 (127)
4435 \p{Palm} \p{Palmyrene} (= \p{Script_Extensions=
4436 Palmyrene}) (32)
4437 \p{Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
4438 \p{Palm}) (32)
4439 \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
4440 (Short: \p{Zp}) (1)
4441 \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
4442 Y}) (2760)
4443 \p{Pat_Syn: *} \p{Pattern_Syntax: *}
4444 \p{Pat_WS} \p{Pattern_White_Space} (=
4445 \p{Pattern_White_Space=Y}) (11)
4446 \p{Pat_WS: *} \p{Pattern_White_Space: *}
4447 \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
4448 (2760)
4449 \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
4450 (1_111_352 plus all above-Unicode code
4451 points: [\x00-\x200-9A-Z_a-z\x7f-
4452 \xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb7-
4453 \xba\xbc-\xbe\xc0-\xd6\xd8-\xf6\xf8-
4454 \xff], U+0100..200F, U+2028..202F,
4455 U+203F..2040, U+2054, U+205F..218F ...)
4456 \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
4457 [!\"#\$\%&\'\(\)*+,\-.\/:;<=
4458 >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
4459 \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
4460 U+2010..2027, U+2030..203E,
4461 U+2041..2053, U+2055..205E, U+2190..245F
4462 ...)
4463 \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
4464 \p{PatWS}) (11)
4465 \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
4466 (1_114_101 plus all above-Unicode code
4467 points: [^\t\n\cK\f\r\x20\x85],
4468 U+0100..200D, U+2010..2027,
4469 U+202A..infinity)
4470 \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
4471 [\t\n\cK\f\r\x20\x85], U+200E..200F,
4472 U+2028..2029)
4473 \p{Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
4474 \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
4475 \p{Pauc} \p{Pau_Cin_Hau} (= \p{Script_Extensions=
4476 Pau_Cin_Hau}) (NOT \p{Block=
4477 Pau_Cin_Hau}) (57)
4478 \p{Pc} \p{Connector_Punctuation} (=
4479 \p{General_Category=
4480 Connector_Punctuation}) (10)
4481 \p{PCM} \p{Prepended_Concatenation_Mark} (=
4482 \p{Prepended_Concatenation_Mark=Y}) (11)
4483 \p{PCM: *} \p{Prepended_Concatenation_Mark: *}
4484 \p{Pd} \p{Dash_Punctuation} (=
4485 \p{General_Category=Dash_Punctuation})
4486 (24)
4487 \p{Pe} \p{Close_Punctuation} (=
4488 \p{General_Category=Close_Punctuation})
4489 (73)
4490 \p{PerlSpace} \p{PosixSpace} (6)
4491 \p{PerlWord} \p{PosixWord} (63)
4492 \p{Perm} \p{Old_Permic} (= \p{Script_Extensions=
4493 Old_Permic}) (NOT \p{Block=Old_Permic})
4494 (44)
4495 \p{Pf} \p{Final_Punctuation} (=
4496 \p{General_Category=Final_Punctuation})
4497 (10)
4498 \p{Phag} \p{Phags_Pa} (= \p{Script_Extensions=
4499 Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
4500 \p{Phags_Pa} \p{Script_Extensions=Phags_Pa} (Short:
4501 \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
4502 X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
4503 Phaistos_Disc}) (48)
4504 X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
4505 \p{InPhaistos}) (48)
4506 \p{Phli} \p{Inscriptional_Pahlavi} (=
4507 \p{Script_Extensions=
4508 Inscriptional_Pahlavi}) (NOT \p{Block=
4509 Inscriptional_Pahlavi}) (27)
4510 \p{Phlp} \p{Psalter_Pahlavi} (=
4511 \p{Script_Extensions=Psalter_Pahlavi})
4512 (NOT \p{Block=Psalter_Pahlavi}) (30)
4513 \p{Phnx} \p{Phoenician} (= \p{Script_Extensions=
4514 Phoenician}) (NOT \p{Block=Phoenician})
4515 (29)
4516 \p{Phoenician} \p{Script_Extensions=Phoenician} (Short:
4517 \p{Phnx}; NOT \p{Block=Phoenician}) (29)
4518 X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
4519 Phonetic_Extensions}) (128)
4520 X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
4521 \p{Block=
4522 Phonetic_Extensions_Supplement}) (64)
4523 X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
4524 \p{InPhoneticExt}) (128)
4525 X \p{Phonetic_Extensions_Supplement} \p{Block=
4526 Phonetic_Extensions_Supplement} (Short:
4527 \p{InPhoneticExtSup}) (64)
4528 \p{Pi} \p{Initial_Punctuation} (=
4529 \p{General_Category=
4530 Initial_Punctuation}) (12)
4531 X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
4532 \p{Plrd} \p{Miao} (= \p{Script_Extensions=Miao})
4533 (NOT \p{Block=Miao}) (149)
4534 \p{Po} \p{Other_Punctuation} (=
4535 \p{General_Category=Other_Punctuation})
4536 (588)
4537 \p{PosixAlnum} (62: [0-9A-Za-z])
4538 \p{PosixAlpha} (52: [A-Za-z])
4539 \p{PosixBlank} (2: [\t\x20])
4540 \p{PosixCntrl} ASCII control characters (33: ACK, BEL,
4541 BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
4542 DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
4543 FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
4544 SOH, STX, SUB, SYN, US, VT)
4545 \p{PosixDigit} (10: [0-9])
4546 \p{PosixGraph} (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
4547 Z\[\\\]\^_`a-z\{\|\}~])
4548 \p{PosixLower} (/i= PosixAlpha) (26: [a-z])
4549 \p{PosixPrint} (95: [\x20-\x7e])
4550 \p{PosixPunct} (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=
4551 >?\@\[\\\]\^_`\{\|\}~])
4552 \p{PosixSpace} (Short: \p{PerlSpace}) (6:
4553 [\t\n\cK\f\r\x20])
4554 \p{PosixUpper} (/i= PosixAlpha) (26: [A-Z])
4555 \p{PosixWord} \w, restricted to ASCII (Short:
4556 \p{PerlWord}) (63: [0-9A-Z_a-z])
4557 \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
4558 (22)
4559 \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
4560 Y} (Short: \p{PCM}) (11)
4561 \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
4562 (1_114_101 plus all above-Unicode code
4563 points: U+0000..05FF, U+0606..06DC,
4564 U+06DE..070E, U+0710..08E1,
4565 U+08E3..110BC, U+110BE..110CC ...)
4566 \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
4567 (11: U+0600..0605, U+06DD, U+070F,
4568 U+08E2, U+110BD, U+110CD)
4569 T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
4570 extension) (33_979)
4571 T \p{Present_In: 2.0} Code point's usage introduced in version
4572 2.0 or earlier (Short: \p{In=2.0}) (Perl
4573 extension) (178_500: U+0000..01F5,
4574 U+01FA..0217, U+0250..02A8,
4575 U+02B0..02DE, U+02E0..02E9, U+0300..0345
4576 ...)
4577 \p{Present_In: V2_0} \p{Present_In=2.0} (Perl extension)
4578 (178_500)
4579 T \p{Present_In: 2.1} Code point's usage introduced in version
4580 2.1 or earlier (Short: \p{In=2.1}) (Perl
4581 extension) (178_502: U+0000..01F5,
4582 U+01FA..0217, U+0250..02A8,
4583 U+02B0..02DE, U+02E0..02E9, U+0300..0345
4584 ...)
4585 \p{Present_In: V2_1} \p{Present_In=2.1} (Perl extension)
4586 (178_502)
4587 T \p{Present_In: 3.0} Code point's usage introduced in version
4588 3.0 or earlier (Short: \p{In=3.0}) (Perl
4589 extension) (188_809: U+0000..021F,
4590 U+0222..0233, U+0250..02AD,
4591 U+02B0..02EE, U+0300..034E, U+0360..0362
4592 ...)
4593 \p{Present_In: V3_0} \p{Present_In=3.0} (Perl extension)
4594 (188_809)
4595 T \p{Present_In: 3.1} Code point's usage introduced in version
4596 3.1 or earlier (Short: \p{In=3.1}) (Perl
4597 extension) (233_787: U+0000..021F,
4598 U+0222..0233, U+0250..02AD,
4599 U+02B0..02EE, U+0300..034E, U+0360..0362
4600 ...)
4601 \p{Present_In: V3_1} \p{Present_In=3.1} (Perl extension)
4602 (233_787)
4603 T \p{Present_In: 3.2} Code point's usage introduced in version
4604 3.2 or earlier (Short: \p{In=3.2}) (Perl
4605 extension) (234_803: U+0000..0220,
4606 U+0222..0233, U+0250..02AD,
4607 U+02B0..02EE, U+0300..034F, U+0360..036F
4608 ...)
4609 \p{Present_In: V3_2} \p{Present_In=3.2} (Perl extension)
4610 (234_803)
4611 T \p{Present_In: 4.0} Code point's usage introduced in version
4612 4.0 or earlier (Short: \p{In=4.0}) (Perl
4613 extension) (236_029: U+0000..0236,
4614 U+0250..0357, U+035D..036F,
4615 U+0374..0375, U+037A, U+037E ...)
4616 \p{Present_In: V4_0} \p{Present_In=4.0} (Perl extension)
4617 (236_029)
4618 T \p{Present_In: 4.1} Code point's usage introduced in version
4619 4.1 or earlier (Short: \p{In=4.1}) (Perl
4620 extension) (237_302: U+0000..0241,
4621 U+0250..036F, U+0374..0375, U+037A,
4622 U+037E, U+0384..038A ...)
4623 \p{Present_In: V4_1} \p{Present_In=4.1} (Perl extension)
4624 (237_302)
4625 T \p{Present_In: 5.0} Code point's usage introduced in version
4626 5.0 or earlier (Short: \p{In=5.0}) (Perl
4627 extension) (238_671: U+0000..036F,
4628 U+0374..0375, U+037A..037E,
4629 U+0384..038A, U+038C, U+038E..03A1 ...)
4630 \p{Present_In: V5_0} \p{Present_In=5.0} (Perl extension)
4631 (238_671)
4632 T \p{Present_In: 5.1} Code point's usage introduced in version
4633 5.1 or earlier (Short: \p{In=5.1}) (Perl
4634 extension) (240_295: U+0000..0377,
4635 U+037A..037E, U+0384..038A, U+038C,
4636 U+038E..03A1, U+03A3..0523 ...)
4637 \p{Present_In: V5_1} \p{Present_In=5.1} (Perl extension)
4638 (240_295)
4639 T \p{Present_In: 5.2} Code point's usage introduced in version
4640 5.2 or earlier (Short: \p{In=5.2}) (Perl
4641 extension) (246_943: U+0000..0377,
4642 U+037A..037E, U+0384..038A, U+038C,
4643 U+038E..03A1, U+03A3..0525 ...)
4644 \p{Present_In: V5_2} \p{Present_In=5.2} (Perl extension)
4645 (246_943)
4646 T \p{Present_In: 6.0} Code point's usage introduced in version
4647 6.0 or earlier (Short: \p{In=6.0}) (Perl
4648 extension) (249_031: U+0000..0377,
4649 U+037A..037E, U+0384..038A, U+038C,
4650 U+038E..03A1, U+03A3..0527 ...)
4651 \p{Present_In: V6_0} \p{Present_In=6.0} (Perl extension)
4652 (249_031)
4653 T \p{Present_In: 6.1} Code point's usage introduced in version
4654 6.1 or earlier (Short: \p{In=6.1}) (Perl
4655 extension) (249_763: U+0000..0377,
4656 U+037A..037E, U+0384..038A, U+038C,
4657 U+038E..03A1, U+03A3..0527 ...)
4658 \p{Present_In: V6_1} \p{Present_In=6.1} (Perl extension)
4659 (249_763)
4660 T \p{Present_In: 6.2} Code point's usage introduced in version
4661 6.2 or earlier (Short: \p{In=6.2}) (Perl
4662 extension) (249_764: U+0000..0377,
4663 U+037A..037E, U+0384..038A, U+038C,
4664 U+038E..03A1, U+03A3..0527 ...)
4665 \p{Present_In: V6_2} \p{Present_In=6.2} (Perl extension)
4666 (249_764)
4667 T \p{Present_In: 6.3} Code point's usage introduced in version
4668 6.3 or earlier (Short: \p{In=6.3}) (Perl
4669 extension) (249_769: U+0000..0377,
4670 U+037A..037E, U+0384..038A, U+038C,
4671 U+038E..03A1, U+03A3..0527 ...)
4672 \p{Present_In: V6_3} \p{Present_In=6.3} (Perl extension)
4673 (249_769)
4674 T \p{Present_In: 7.0} Code point's usage introduced in version
4675 7.0 or earlier (Short: \p{In=7.0}) (Perl
4676 extension) (252_603: U+0000..0377,
4677 U+037A..037F, U+0384..038A, U+038C,
4678 U+038E..03A1, U+03A3..052F ...)
4679 \p{Present_In: V7_0} \p{Present_In=7.0} (Perl extension)
4680 (252_603)
4681 T \p{Present_In: 8.0} Code point's usage introduced in version
4682 8.0 or earlier (Short: \p{In=8.0}) (Perl
4683 extension) (260_319: U+0000..0377,
4684 U+037A..037F, U+0384..038A, U+038C,
4685 U+038E..03A1, U+03A3..052F ...)
4686 \p{Present_In: V8_0} \p{Present_In=8.0} (Perl extension)
4687 (260_319)
4688 T \p{Present_In: 9.0} Code point's usage introduced in version
4689 9.0 or earlier (Short: \p{In=9.0}) (Perl
4690 extension) (267_819: U+0000..0377,
4691 U+037A..037F, U+0384..038A, U+038C,
4692 U+038E..03A1, U+03A3..052F ...)
4693 \p{Present_In: V9_0} \p{Present_In=9.0} (Perl extension)
4694 (267_819)
4695 T \p{Present_In: 10.0} Code point's usage introduced in version
4696 10.0 or earlier (Short: \p{In=10.0})
4697 (Perl extension) (276_337: U+0000..0377,
4698 U+037A..037F, U+0384..038A, U+038C,
4699 U+038E..03A1, U+03A3..052F ...)
4700 \p{Present_In: V10_0} \p{Present_In=10.0} (Perl extension)
4701 (276_337)
4702 T \p{Present_In: 11.0} Code point's usage introduced in version
4703 11.0 or earlier (Short: \p{In=11.0})
4704 (Perl extension) (277_021: U+0000..0377,
4705 U+037A..037F, U+0384..038A, U+038C,
4706 U+038E..03A1, U+03A3..052F ...)
4707 \p{Present_In: V11_0} \p{Present_In=11.0} (Perl extension)
4708 (277_021)
4709 T \p{Present_In: 12.0} Code point's usage introduced in version
4710 12.0 or earlier (Short: \p{In=12.0})
4711 (Perl extension) (277_575: U+0000..0377,
4712 U+037A..037F, U+0384..038A, U+038C,
4713 U+038E..03A1, U+03A3..052F ...)
4714 \p{Present_In: V12_0} \p{Present_In=12.0} (Perl extension)
4715 (277_575)
4716 T \p{Present_In: 12.1} Code point's usage introduced in version
4717 12.1 or earlier (Short: \p{In=12.1})
4718 (Perl extension) (277_576: U+0000..0377,
4719 U+037A..037F, U+0384..038A, U+038C,
4720 U+038E..03A1, U+03A3..052F ...)
4721 \p{Present_In: V12_1} \p{Present_In=12.1} (Perl extension)
4722 (277_576)
4723 \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
4724 Unassigned}) (Perl extension) (836_536
4725 plus all above-Unicode code points)
4726 \p{Print} \p{XPosixPrint} (275_395)
4727 \p{Private_Use} \p{General_Category=Private_Use} (Short:
4728 \p{Co}; NOT \p{Private_Use_Area})
4729 (137_468)
4730 X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
4731 \p{InPUA}) (6400)
4732 \p{Prti} \p{Inscriptional_Parthian} (=
4733 \p{Script_Extensions=
4734 Inscriptional_Parthian}) (NOT \p{Block=
4735 Inscriptional_Parthian}) (30)
4736 \p{Ps} \p{Open_Punctuation} (=
4737 \p{General_Category=Open_Punctuation})
4738 (75)
4739 \p{Psalter_Pahlavi} \p{Script_Extensions=Psalter_Pahlavi}
4740 (Short: \p{Phlp}; NOT \p{Block=
4741 Psalter_Pahlavi}) (30)
4742 X \p{PUA} \p{Private_Use_Area} (= \p{Block=
4743 Private_Use_Area}) (6400)
4744 \p{Punct} \p{General_Category=Punctuation} (Short:
4745 \p{P}; NOT \p{General_Punctuation}) (792)
4746 \p{Punctuation} \p{Punct} (= \p{General_Category=
4747 Punctuation}) (NOT
4748 \p{General_Punctuation}) (792)
4749 \p{Qaac} \p{Coptic} (= \p{Script_Extensions=
4750 Coptic}) (NOT \p{Block=Coptic}) (165)
4751 \p{Qaai} \p{Inherited} (= \p{Script_Extensions=
4752 Inherited}) (502)
4753 \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
4754 Y}) (30)
4755 \p{QMark: *} \p{Quotation_Mark: *}
4756 \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
4757 (30)
4758 \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_082
4759 plus all above-Unicode code points:
4760 [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=
4761 >?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
4762 \xaa\xac-\xba\xbc-\xff], U+0100..2017,
4763 U+2020..2038, U+203B..2E41,
4764 U+2E43..300B, U+3010..301C ...)
4765 \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30:
4766 [\"\'\xab\xbb], U+2018..201F,
4767 U+2039..203A, U+2E42, U+300C..300F,
4768 U+301D..301F ...)
4769 \p{Radical} \p{Radical=Y} (329)
4770 \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all
4771 above-Unicode code points: U+0000..2E7F,
4772 U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
4773 \p{Radical: Y*} (Single: \p{Radical}) (329: U+2E80..2E99,
4774 U+2E9B..2EF3, U+2F00..2FD5)
4775 \p{Regional_Indicator} \p{Regional_Indicator=Y} (Short: \p{RI})
4776 (26)
4777 \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
4778 plus all above-Unicode code points:
4779 U+0000..1F1E5, U+1F200..infinity)
4780 \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
4781 U+1F1E6..1F1FF)
4782 \p{Rejang} \p{Script_Extensions=Rejang} (Short:
4783 \p{Rjng}; NOT \p{Block=Rejang}) (37)
4784 \p{RI} \p{Regional_Indicator} (=
4785 \p{Regional_Indicator=Y}) (26)
4786 \p{RI: *} \p{Regional_Indicator: *}
4787 \p{Rjng} \p{Rejang} (= \p{Script_Extensions=
4788 Rejang}) (NOT \p{Block=Rejang}) (37)
4789 \p{Rohg} \p{Hanifi_Rohingya} (=
4790 \p{Script_Extensions=Hanifi_Rohingya})
4791 (NOT \p{Block=Hanifi_Rohingya}) (55)
4792 X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
4793 Rumi_Numeral_Symbols}) (32)
4794 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
4795 \p{InRumi}) (32)
4796 \p{Runic} \p{Script_Extensions=Runic} (Short:
4797 \p{Runr}; NOT \p{Block=Runic}) (86)
4798 \p{Runr} \p{Runic} (= \p{Script_Extensions=Runic})
4799 (NOT \p{Block=Runic}) (86)
4800 \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol})
4801 (7292)
4802 \p{Samaritan} \p{Script_Extensions=Samaritan} (Short:
4803 \p{Samr}; NOT \p{Block=Samaritan}) (61)
4804 \p{Samr} \p{Samaritan} (= \p{Script_Extensions=
4805 Samaritan}) (NOT \p{Block=Samaritan})
4806 (61)
4807 \p{Sarb} \p{Old_South_Arabian} (=
4808 \p{Script_Extensions=Old_South_Arabian})
4809 (32)
4810 \p{Saur} \p{Saurashtra} (= \p{Script_Extensions=
4811 Saurashtra}) (NOT \p{Block=Saurashtra})
4812 (82)
4813 \p{Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
4814 \p{Saur}; NOT \p{Block=Saurashtra}) (82)
4815 \p{SB: *} \p{Sentence_Break: *}
4816 \p{Sc} \p{Currency_Symbol} (=
4817 \p{General_Category=Currency_Symbol})
4818 (62)
4819 \p{Sc: *} \p{Script: *}
4820 \p{Script: Adlam} (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B,
4821 U+1E950..1E959, U+1E95E..1E95F)
4822 \p{Script: Adlm} \p{Script=Adlam} (88)
4823 \p{Script: Aghb} \p{Script=Caucasian_Albanian} (=
4824 \p{Script_Extensions=
4825 Caucasian_Albanian}) (53)
4826 \p{Script: Ahom} \p{Script_Extensions=Ahom} (Short: \p{Sc=
4827 Ahom}, \p{Ahom}) (58)
4828 \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
4829 Anatolian_Hieroglyphs} (Short: \p{Sc=
4830 Hluw}, \p{Hluw}) (583)
4831 \p{Script: Arab} \p{Script=Arabic} (1281)
4832 \p{Script: Arabic} (Short: \p{Sc=Arab}) (1281: U+0600..0604,
4833 U+0606..060B, U+060D..061A, U+061C,
4834 U+061E, U+0620..063F ...)
4835 \p{Script: Armenian} (Short: \p{Sc=Armn}) (95: U+0531..0556,
4836 U+0559..0588, U+058A, U+058D..058F,
4837 U+FB13..FB17)
4838 \p{Script: Armi} \p{Script=Imperial_Aramaic} (=
4839 \p{Script_Extensions=Imperial_Aramaic})
4840 (31)
4841 \p{Script: Armn} \p{Script=Armenian} (95)
4842 \p{Script: Avestan} \p{Script_Extensions=Avestan} (Short:
4843 \p{Sc=Avst}, \p{Avst}) (61)
4844 \p{Script: Avst} \p{Script=Avestan} (=
4845 \p{Script_Extensions=Avestan}) (61)
4846 \p{Script: Bali} \p{Script=Balinese} (=
4847 \p{Script_Extensions=Balinese}) (121)
4848 \p{Script: Balinese} \p{Script_Extensions=Balinese} (Short:
4849 \p{Sc=Bali}, \p{Bali}) (121)
4850 \p{Script: Bamu} \p{Script=Bamum} (= \p{Script_Extensions=
4851 Bamum}) (657)
4852 \p{Script: Bamum} \p{Script_Extensions=Bamum} (Short: \p{Sc=
4853 Bamu}, \p{Bamu}) (657)
4854 \p{Script: Bass} \p{Script=Bassa_Vah} (=
4855 \p{Script_Extensions=Bassa_Vah}) (36)
4856 \p{Script: Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
4857 \p{Sc=Bass}, \p{Bass}) (36)
4858 \p{Script: Batak} \p{Script_Extensions=Batak} (Short: \p{Sc=
4859 Batk}, \p{Batk}) (56)
4860 \p{Script: Batk} \p{Script=Batak} (= \p{Script_Extensions=
4861 Batak}) (56)
4862 \p{Script: Beng} \p{Script=Bengali} (96)
4863 \p{Script: Bengali} (Short: \p{Sc=Beng}) (96: U+0980..0983,
4864 U+0985..098C, U+098F..0990,
4865 U+0993..09A8, U+09AA..09B0, U+09B2 ...)
4866 \p{Script: Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
4867 \p{Sc=Bhks}, \p{Bhks}) (97)
4868 \p{Script: Bhks} \p{Script=Bhaiksuki} (=
4869 \p{Script_Extensions=Bhaiksuki}) (97)
4870 \p{Script: Bopo} \p{Script=Bopomofo} (72)
4871 \p{Script: Bopomofo} (Short: \p{Sc=Bopo}) (72: U+02EA..02EB,
4872 U+3105..312F, U+31A0..31BA)
4873 \p{Script: Brah} \p{Script=Brahmi} (= \p{Script_Extensions=
4874 Brahmi}) (109)
4875 \p{Script: Brahmi} \p{Script_Extensions=Brahmi} (Short:
4876 \p{Sc=Brah}, \p{Brah}) (109)
4877 \p{Script: Brai} \p{Script=Braille} (=
4878 \p{Script_Extensions=Braille}) (256)
4879 \p{Script: Braille} \p{Script_Extensions=Braille} (Short:
4880 \p{Sc=Brai}, \p{Brai}) (256)
4881 \p{Script: Bugi} \p{Script=Buginese} (30)
4882 \p{Script: Buginese} (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
4883 U+1A1E..1A1F)
4884 \p{Script: Buhd} \p{Script=Buhid} (20)
4885 \p{Script: Buhid} (Short: \p{Sc=Buhd}) (20: U+1740..1753)
4886 \p{Script: Cakm} \p{Script=Chakma} (70)
4887 \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
4888 Canadian_Aboriginal} (Short: \p{Sc=
4889 Cans}, \p{Cans}) (710)
4890 \p{Script: Cans} \p{Script=Canadian_Aboriginal} (=
4891 \p{Script_Extensions=
4892 Canadian_Aboriginal}) (710)
4893 \p{Script: Cari} \p{Script=Carian} (= \p{Script_Extensions=
4894 Carian}) (49)
4895 \p{Script: Carian} \p{Script_Extensions=Carian} (Short:
4896 \p{Sc=Cari}, \p{Cari}) (49)
4897 \p{Script: Caucasian_Albanian} \p{Script_Extensions=
4898 Caucasian_Albanian} (Short: \p{Sc=Aghb},
4899 \p{Aghb}) (53)
4900 \p{Script: Chakma} (Short: \p{Sc=Cakm}) (70: U+11100..11134,
4901 U+11136..11146)
4902 \p{Script: Cham} \p{Script_Extensions=Cham} (Short: \p{Sc=
4903 Cham}, \p{Cham}) (83)
4904 \p{Script: Cher} \p{Script=Cherokee} (=
4905 \p{Script_Extensions=Cherokee}) (172)
4906 \p{Script: Cherokee} \p{Script_Extensions=Cherokee} (Short:
4907 \p{Sc=Cher}, \p{Cher}) (172)
4908 \p{Script: Common} (Short: \p{Sc=Zyyy}) (7805: [\x00-
4909 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
4910 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
4911 \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
4912 U+02E5..02E9, U+02EC..02FF, U+0374,
4913 U+037E ...)
4914 \p{Script: Copt} \p{Script=Coptic} (137)
4915 \p{Script: Coptic} (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
4916 U+2C80..2CF3, U+2CF9..2CFF)
4917 \p{Script: Cprt} \p{Script=Cypriot} (55)
4918 \p{Script: Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
4919 \p{Sc=Xsux}, \p{Xsux}) (1234)
4920 \p{Script: Cypriot} (Short: \p{Sc=Cprt}) (55: U+10800..10805,
4921 U+10808, U+1080A..10835, U+10837..10838,
4922 U+1083C, U+1083F)
4923 \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}) (443: U+0400..0484,
4924 U+0487..052F, U+1C80..1C88, U+1D2B,
4925 U+1D78, U+2DE0..2DFF ...)
4926 \p{Script: Cyrl} \p{Script=Cyrillic} (443)
4927 \p{Script: Deseret} \p{Script_Extensions=Deseret} (Short:
4928 \p{Sc=Dsrt}, \p{Dsrt}) (80)
4929 \p{Script: Deva} \p{Script=Devanagari} (154)
4930 \p{Script: Devanagari} (Short: \p{Sc=Deva}) (154: U+0900..0950,
4931 U+0955..0963, U+0966..097F, U+A8E0..A8FF)
4932 \p{Script: Dogr} \p{Script=Dogra} (60)
4933 \p{Script: Dogra} (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
4934 \p{Script: Dsrt} \p{Script=Deseret} (=
4935 \p{Script_Extensions=Deseret}) (80)
4936 \p{Script: Dupl} \p{Script=Duployan} (143)
4937 \p{Script: Duployan} (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
4938 U+1BC70..1BC7C, U+1BC80..1BC88,
4939 U+1BC90..1BC99, U+1BC9C..1BC9F)
4940 \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (=
4941 \p{Script_Extensions=
4942 Egyptian_Hieroglyphs}) (1080)
4943 \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
4944 Egyptian_Hieroglyphs} (Short: \p{Sc=
4945 Egyp}, \p{Egyp}) (1080)
4946 \p{Script: Elba} \p{Script=Elbasan} (=
4947 \p{Script_Extensions=Elbasan}) (40)
4948 \p{Script: Elbasan} \p{Script_Extensions=Elbasan} (Short:
4949 \p{Sc=Elba}, \p{Elba}) (40)
4950 \p{Script: Elym} \p{Script=Elymaic} (=
4951 \p{Script_Extensions=Elymaic}) (23)
4952 \p{Script: Elymaic} \p{Script_Extensions=Elymaic} (Short:
4953 \p{Sc=Elym}, \p{Elym}) (23)
4954 \p{Script: Ethi} \p{Script=Ethiopic} (=
4955 \p{Script_Extensions=Ethiopic}) (495)
4956 \p{Script: Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
4957 \p{Sc=Ethi}, \p{Ethi}) (495)
4958 \p{Script: Geor} \p{Script=Georgian} (173)
4959 \p{Script: Georgian} (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
4960 U+10C7, U+10CD, U+10D0..10FA,
4961 U+10FC..10FF, U+1C90..1CBA ...)
4962 \p{Script: Glag} \p{Script=Glagolitic} (132)
4963 \p{Script: Glagolitic} (Short: \p{Sc=Glag}) (132: U+2C00..2C2E,
4964 U+2C30..2C5E, U+1E000..1E006,
4965 U+1E008..1E018, U+1E01B..1E021,
4966 U+1E023..1E024 ...)
4967 \p{Script: Gong} \p{Script=Gunjala_Gondi} (63)
4968 \p{Script: Gonm} \p{Script=Masaram_Gondi} (75)
4969 \p{Script: Goth} \p{Script=Gothic} (= \p{Script_Extensions=
4970 Gothic}) (27)
4971 \p{Script: Gothic} \p{Script_Extensions=Gothic} (Short:
4972 \p{Sc=Goth}, \p{Goth}) (27)
4973 \p{Script: Gran} \p{Script=Grantha} (85)
4974 \p{Script: Grantha} (Short: \p{Sc=Gran}) (85: U+11300..11303,
4975 U+11305..1130C, U+1130F..11310,
4976 U+11313..11328, U+1132A..11330,
4977 U+11332..11333 ...)
4978 \p{Script: Greek} (Short: \p{Sc=Grek}) (518: U+0370..0373,
4979 U+0375..0377, U+037A..037D, U+037F,
4980 U+0384, U+0386 ...)
4981 \p{Script: Grek} \p{Script=Greek} (518)
4982 \p{Script: Gujarati} (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
4983 U+0A85..0A8D, U+0A8F..0A91,
4984 U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
4985 ...)
4986 \p{Script: Gujr} \p{Script=Gujarati} (91)
4987 \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
4988 U+11D60..11D65, U+11D67..11D68,
4989 U+11D6A..11D8E, U+11D90..11D91,
4990 U+11D93..11D98, U+11DA0..11DA9)
4991 \p{Script: Gurmukhi} (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
4992 U+0A05..0A0A, U+0A0F..0A10,
4993 U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
4994 ...)
4995 \p{Script: Guru} \p{Script=Gurmukhi} (80)
4996 \p{Script: Han} (Short: \p{Sc=Han}) (89_233: U+2E80..2E99,
4997 U+2E9B..2EF3, U+2F00..2FD5, U+3005,
4998 U+3007, U+3021..3029 ...)
4999 \p{Script: Hang} \p{Script=Hangul} (11_739)
5000 \p{Script: Hangul} (Short: \p{Sc=Hang}) (11_739:
5001 U+1100..11FF, U+302E..302F,
5002 U+3131..318E, U+3200..321E,
5003 U+3260..327E, U+A960..A97C ...)
5004 \p{Script: Hani} \p{Script=Han} (89_233)
5005 \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
5006 U+10D00..10D27, U+10D30..10D39)
5007 \p{Script: Hano} \p{Script=Hanunoo} (21)
5008 \p{Script: Hanunoo} (Short: \p{Sc=Hano}) (21: U+1720..1734)
5009 \p{Script: Hatr} \p{Script=Hatran} (= \p{Script_Extensions=
5010 Hatran}) (26)
5011 \p{Script: Hatran} \p{Script_Extensions=Hatran} (Short:
5012 \p{Sc=Hatr}, \p{Hatr}) (26)
5013 \p{Script: Hebr} \p{Script=Hebrew} (= \p{Script_Extensions=
5014 Hebrew}) (134)
5015 \p{Script: Hebrew} \p{Script_Extensions=Hebrew} (Short:
5016 \p{Sc=Hebr}, \p{Hebr}) (134)
5017 \p{Script: Hira} \p{Script=Hiragana} (379)
5018 \p{Script: Hiragana} (Short: \p{Sc=Hira}) (379: U+3041..3096,
5019 U+309D..309F, U+1B001..1B11E,
5020 U+1B150..1B152, U+1F200)
5021 \p{Script: Hluw} \p{Script=Anatolian_Hieroglyphs} (=
5022 \p{Script_Extensions=
5023 Anatolian_Hieroglyphs}) (583)
5024 \p{Script: Hmng} \p{Script=Pahawh_Hmong} (=
5025 \p{Script_Extensions=Pahawh_Hmong}) (127)
5026 \p{Script: Hmnp} \p{Script=Nyiakeng_Puachue_Hmong} (=
5027 \p{Script_Extensions=
5028 Nyiakeng_Puachue_Hmong}) (71)
5029 \p{Script: Hung} \p{Script=Old_Hungarian} (=
5030 \p{Script_Extensions=Old_Hungarian})
5031 (108)
5032 \p{Script: Imperial_Aramaic} \p{Script_Extensions=
5033 Imperial_Aramaic} (Short: \p{Sc=Armi},
5034 \p{Armi}) (31)
5035 \p{Script: Inherited} (Short: \p{Sc=Zinh}) (571: U+0300..036F,
5036 U+0485..0486, U+064B..0655, U+0670,
5037 U+0951..0954, U+1AB0..1ABE ...)
5038 \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
5039 Inscriptional_Pahlavi} (Short: \p{Sc=
5040 Phli}, \p{Phli}) (27)
5041 \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
5042 Inscriptional_Parthian} (Short: \p{Sc=
5043 Prti}, \p{Prti}) (30)
5044 \p{Script: Ital} \p{Script=Old_Italic} (=
5045 \p{Script_Extensions=Old_Italic}) (39)
5046 \p{Script: Java} \p{Script=Javanese} (90)
5047 \p{Script: Javanese} (Short: \p{Sc=Java}) (90: U+A980..A9CD,
5048 U+A9D0..A9D9, U+A9DE..A9DF)
5049 \p{Script: Kaithi} (Short: \p{Sc=Kthi}) (67: U+11080..110C1,
5050 U+110CD)
5051 \p{Script: Kali} \p{Script=Kayah_Li} (47)
5052 \p{Script: Kana} \p{Script=Katakana} (304)
5053 \p{Script: Kannada} (Short: \p{Sc=Knda}) (89: U+0C80..0C8C,
5054 U+0C8E..0C90, U+0C92..0CA8,
5055 U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
5056 ...)
5057 \p{Script: Katakana} (Short: \p{Sc=Kana}) (304: U+30A1..30FA,
5058 U+30FD..30FF, U+31F0..31FF,
5059 U+32D0..32FE, U+3300..3357, U+FF66..FF6F
5060 ...)
5061 \p{Script: Kayah_Li} (Short: \p{Sc=Kali}) (47: U+A900..A92D,
5062 U+A92F)
5063 \p{Script: Khar} \p{Script=Kharoshthi} (=
5064 \p{Script_Extensions=Kharoshthi}) (68)
5065 \p{Script: Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
5066 \p{Sc=Khar}, \p{Khar}) (68)
5067 \p{Script: Khmer} \p{Script_Extensions=Khmer} (Short: \p{Sc=
5068 Khmr}, \p{Khmr}) (146)
5069 \p{Script: Khmr} \p{Script=Khmer} (= \p{Script_Extensions=
5070 Khmer}) (146)
5071 \p{Script: Khoj} \p{Script=Khojki} (62)
5072 \p{Script: Khojki} (Short: \p{Sc=Khoj}) (62: U+11200..11211,
5073 U+11213..1123E)
5074 \p{Script: Khudawadi} (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
5075 U+112F0..112F9)
5076 \p{Script: Knda} \p{Script=Kannada} (89)
5077 \p{Script: Kthi} \p{Script=Kaithi} (67)
5078 \p{Script: Lana} \p{Script=Tai_Tham} (=
5079 \p{Script_Extensions=Tai_Tham}) (127)
5080 \p{Script: Lao} \p{Script_Extensions=Lao} (Short: \p{Sc=
5081 Lao}, \p{Lao}) (82)
5082 \p{Script: Laoo} \p{Script=Lao} (= \p{Script_Extensions=
5083 Lao}) (82)
5084 \p{Script: Latin} (Short: \p{Sc=Latn}) (1366: [A-Za-
5085 z\xaa\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
5086 U+0100..02B8, U+02E0..02E4,
5087 U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
5088 ...)
5089 \p{Script: Latn} \p{Script=Latin} (1366)
5090 \p{Script: Lepc} \p{Script=Lepcha} (= \p{Script_Extensions=
5091 Lepcha}) (74)
5092 \p{Script: Lepcha} \p{Script_Extensions=Lepcha} (Short:
5093 \p{Sc=Lepc}, \p{Lepc}) (74)
5094 \p{Script: Limb} \p{Script=Limbu} (68)
5095 \p{Script: Limbu} (Short: \p{Sc=Limb}) (68: U+1900..191E,
5096 U+1920..192B, U+1930..193B, U+1940,
5097 U+1944..194F)
5098 \p{Script: Lina} \p{Script=Linear_A} (341)
5099 \p{Script: Linb} \p{Script=Linear_B} (211)
5100 \p{Script: Linear_A} (Short: \p{Sc=Lina}) (341: U+10600..10736,
5101 U+10740..10755, U+10760..10767)
5102 \p{Script: Linear_B} (Short: \p{Sc=Linb}) (211: U+10000..1000B,
5103 U+1000D..10026, U+10028..1003A,
5104 U+1003C..1003D, U+1003F..1004D,
5105 U+10050..1005D ...)
5106 \p{Script: Lisu} \p{Script_Extensions=Lisu} (Short: \p{Sc=
5107 Lisu}, \p{Lisu}) (48)
5108 \p{Script: Lyci} \p{Script=Lycian} (= \p{Script_Extensions=
5109 Lycian}) (29)
5110 \p{Script: Lycian} \p{Script_Extensions=Lycian} (Short:
5111 \p{Sc=Lyci}, \p{Lyci}) (29)
5112 \p{Script: Lydi} \p{Script=Lydian} (= \p{Script_Extensions=
5113 Lydian}) (27)
5114 \p{Script: Lydian} \p{Script_Extensions=Lydian} (Short:
5115 \p{Sc=Lydi}, \p{Lydi}) (27)
5116 \p{Script: Mahajani} (Short: \p{Sc=Mahj}) (39: U+11150..11176)
5117 \p{Script: Mahj} \p{Script=Mahajani} (39)
5118 \p{Script: Maka} \p{Script=Makasar} (=
5119 \p{Script_Extensions=Makasar}) (25)
5120 \p{Script: Makasar} \p{Script_Extensions=Makasar} (Short:
5121 \p{Sc=Maka}, \p{Maka}) (25)
5122 \p{Script: Malayalam} (Short: \p{Sc=Mlym}) (117: U+0D00..0D03,
5123 U+0D05..0D0C, U+0D0E..0D10,
5124 U+0D12..0D44, U+0D46..0D48, U+0D4A..0D4F
5125 ...)
5126 \p{Script: Mand} \p{Script=Mandaic} (29)
5127 \p{Script: Mandaic} (Short: \p{Sc=Mand}) (29: U+0840..085B,
5128 U+085E)
5129 \p{Script: Mani} \p{Script=Manichaean} (51)
5130 \p{Script: Manichaean} (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
5131 U+10AEB..10AF6)
5132 \p{Script: Marc} \p{Script=Marchen} (=
5133 \p{Script_Extensions=Marchen}) (68)
5134 \p{Script: Marchen} \p{Script_Extensions=Marchen} (Short:
5135 \p{Sc=Marc}, \p{Marc}) (68)
5136 \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
5137 U+11D00..11D06, U+11D08..11D09,
5138 U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
5139 U+11D3F..11D47 ...)
5140 \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
5141 \p{Sc=Medf}, \p{Medf}) (91)
5142 \p{Script: Medf} \p{Script=Medefaidrin} (=
5143 \p{Script_Extensions=Medefaidrin}) (91)
5144 \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
5145 (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
5146 \p{Script: Mend} \p{Script=Mende_Kikakui} (=
5147 \p{Script_Extensions=Mende_Kikakui})
5148 (213)
5149 \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
5150 (Short: \p{Sc=Mend}, \p{Mend}) (213)
5151 \p{Script: Merc} \p{Script=Meroitic_Cursive} (=
5152 \p{Script_Extensions=Meroitic_Cursive})
5153 (90)
5154 \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (=
5155 \p{Script_Extensions=
5156 Meroitic_Hieroglyphs}) (32)
5157 \p{Script: Meroitic_Cursive} \p{Script_Extensions=
5158 Meroitic_Cursive} (Short: \p{Sc=Merc},
5159 \p{Merc}) (90)
5160 \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
5161 Meroitic_Hieroglyphs} (Short: \p{Sc=
5162 Mero}, \p{Mero}) (32)
5163 \p{Script: Miao} \p{Script_Extensions=Miao} (Short: \p{Sc=
5164 Miao}, \p{Miao}) (149)
5165 \p{Script: Mlym} \p{Script=Malayalam} (117)
5166 \p{Script: Modi} (Short: \p{Sc=Modi}) (79: U+11600..11644,
5167 U+11650..11659)
5168 \p{Script: Mong} \p{Script=Mongolian} (167)
5169 \p{Script: Mongolian} (Short: \p{Sc=Mong}) (167: U+1800..1801,
5170 U+1804, U+1806..180E, U+1810..1819,
5171 U+1820..1878, U+1880..18AA ...)
5172 \p{Script: Mro} \p{Script_Extensions=Mro} (Short: \p{Sc=
5173 Mro}, \p{Mro}) (43)
5174 \p{Script: Mroo} \p{Script=Mro} (= \p{Script_Extensions=
5175 Mro}) (43)
5176 \p{Script: Mtei} \p{Script=Meetei_Mayek} (=
5177 \p{Script_Extensions=Meetei_Mayek}) (79)
5178 \p{Script: Mult} \p{Script=Multani} (38)
5179 \p{Script: Multani} (Short: \p{Sc=Mult}) (38: U+11280..11286,
5180 U+11288, U+1128A..1128D, U+1128F..1129D,
5181 U+1129F..112A9)
5182 \p{Script: Myanmar} (Short: \p{Sc=Mymr}) (223: U+1000..109F,
5183 U+A9E0..A9FE, U+AA60..AA7F)
5184 \p{Script: Mymr} \p{Script=Myanmar} (223)
5185 \p{Script: Nabataean} \p{Script_Extensions=Nabataean} (Short:
5186 \p{Sc=Nbat}, \p{Nbat}) (40)
5187 \p{Script: Nand} \p{Script=Nandinagari} (65)
5188 \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7,
5189 U+119AA..119D7, U+119DA..119E4)
5190 \p{Script: Narb} \p{Script=Old_North_Arabian} (=
5191 \p{Script_Extensions=Old_North_Arabian})
5192 (32)
5193 \p{Script: Nbat} \p{Script=Nabataean} (=
5194 \p{Script_Extensions=Nabataean}) (40)
5195 \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
5196 \p{Sc=Talu}, \p{Talu}) (83)
5197 \p{Script: Newa} \p{Script_Extensions=Newa} (Short: \p{Sc=
5198 Newa}, \p{Newa}) (94)
5199 \p{Script: Nko} \p{Script_Extensions=Nko} (Short: \p{Sc=
5200 Nko}, \p{Nko}) (62)
5201 \p{Script: Nkoo} \p{Script=Nko} (= \p{Script_Extensions=
5202 Nko}) (62)
5203 \p{Script: Nshu} \p{Script=Nushu} (= \p{Script_Extensions=
5204 Nushu}) (397)
5205 \p{Script: Nushu} \p{Script_Extensions=Nushu} (Short: \p{Sc=
5206 Nshu}, \p{Nshu}) (397)
5207 \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
5208 Nyiakeng_Puachue_Hmong} (Short: \p{Sc=
5209 Hmnp}, \p{Hmnp}) (71)
5210 \p{Script: Ogam} \p{Script=Ogham} (= \p{Script_Extensions=
5211 Ogham}) (29)
5212 \p{Script: Ogham} \p{Script_Extensions=Ogham} (Short: \p{Sc=
5213 Ogam}, \p{Ogam}) (29)
5214 \p{Script: Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
5215 \p{Sc=Olck}, \p{Olck}) (48)
5216 \p{Script: Olck} \p{Script=Ol_Chiki} (=
5217 \p{Script_Extensions=Ol_Chiki}) (48)
5218 \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
5219 (Short: \p{Sc=Hung}, \p{Hung}) (108)
5220 \p{Script: Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
5221 \p{Sc=Ital}, \p{Ital}) (39)
5222 \p{Script: Old_North_Arabian} \p{Script_Extensions=
5223 Old_North_Arabian} (Short: \p{Sc=Narb},
5224 \p{Narb}) (32)
5225 \p{Script: Old_Permic} (Short: \p{Sc=Perm}) (43: U+10350..1037A)
5226 \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
5227 \p{Sc=Xpeo}, \p{Xpeo}) (50)
5228 \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
5229 \p{Sc=Sogo}, \p{Sogo}) (40)
5230 \p{Script: Old_South_Arabian} \p{Script_Extensions=
5231 Old_South_Arabian} (Short: \p{Sc=Sarb},
5232 \p{Sarb}) (32)
5233 \p{Script: Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
5234 \p{Sc=Orkh}, \p{Orkh}) (73)
5235 \p{Script: Oriya} (Short: \p{Sc=Orya}) (90: U+0B01..0B03,
5236 U+0B05..0B0C, U+0B0F..0B10,
5237 U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
5238 ...)
5239 \p{Script: Orkh} \p{Script=Old_Turkic} (=
5240 \p{Script_Extensions=Old_Turkic}) (73)
5241 \p{Script: Orya} \p{Script=Oriya} (90)
5242 \p{Script: Osage} \p{Script_Extensions=Osage} (Short: \p{Sc=
5243 Osge}, \p{Osge}) (72)
5244 \p{Script: Osge} \p{Script=Osage} (= \p{Script_Extensions=
5245 Osage}) (72)
5246 \p{Script: Osma} \p{Script=Osmanya} (=
5247 \p{Script_Extensions=Osmanya}) (40)
5248 \p{Script: Osmanya} \p{Script_Extensions=Osmanya} (Short:
5249 \p{Sc=Osma}, \p{Osma}) (40)
5250 \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
5251 (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
5252 \p{Script: Palm} \p{Script=Palmyrene} (=
5253 \p{Script_Extensions=Palmyrene}) (32)
5254 \p{Script: Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
5255 \p{Sc=Palm}, \p{Palm}) (32)
5256 \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
5257 \p{Sc=Pauc}, \p{Pauc}) (57)
5258 \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (=
5259 \p{Script_Extensions=Pau_Cin_Hau}) (57)
5260 \p{Script: Perm} \p{Script=Old_Permic} (43)
5261 \p{Script: Phag} \p{Script=Phags_Pa} (56)
5262 \p{Script: Phags_Pa} (Short: \p{Sc=Phag}) (56: U+A840..A877)
5263 \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (=
5264 \p{Script_Extensions=
5265 Inscriptional_Pahlavi}) (27)
5266 \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29)
5267 \p{Script: Phnx} \p{Script=Phoenician} (=
5268 \p{Script_Extensions=Phoenician}) (29)
5269 \p{Script: Phoenician} \p{Script_Extensions=Phoenician} (Short:
5270 \p{Sc=Phnx}, \p{Phnx}) (29)
5271 \p{Script: Plrd} \p{Script=Miao} (= \p{Script_Extensions=
5272 Miao}) (149)
5273 \p{Script: Prti} \p{Script=Inscriptional_Parthian} (=
5274 \p{Script_Extensions=
5275 Inscriptional_Parthian}) (30)
5276 \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
5277 U+10B80..10B91, U+10B99..10B9C,
5278 U+10BA9..10BAF)
5279 \p{Script: Qaac} \p{Script=Coptic} (137)
5280 \p{Script: Qaai} \p{Script=Inherited} (571)
5281 \p{Script: Rejang} \p{Script_Extensions=Rejang} (Short:
5282 \p{Sc=Rjng}, \p{Rjng}) (37)
5283 \p{Script: Rjng} \p{Script=Rejang} (= \p{Script_Extensions=
5284 Rejang}) (37)
5285 \p{Script: Rohg} \p{Script=Hanifi_Rohingya} (50)
5286 \p{Script: Runic} \p{Script_Extensions=Runic} (Short: \p{Sc=
5287 Runr}, \p{Runr}) (86)
5288 \p{Script: Runr} \p{Script=Runic} (= \p{Script_Extensions=
5289 Runic}) (86)
5290 \p{Script: Samaritan} \p{Script_Extensions=Samaritan} (Short:
5291 \p{Sc=Samr}, \p{Samr}) (61)
5292 \p{Script: Samr} \p{Script=Samaritan} (=
5293 \p{Script_Extensions=Samaritan}) (61)
5294 \p{Script: Sarb} \p{Script=Old_South_Arabian} (=
5295 \p{Script_Extensions=Old_South_Arabian})
5296 (32)
5297 \p{Script: Saur} \p{Script=Saurashtra} (=
5298 \p{Script_Extensions=Saurashtra}) (82)
5299 \p{Script: Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
5300 \p{Sc=Saur}, \p{Saur}) (82)
5301 \p{Script: Sgnw} \p{Script=SignWriting} (=
5302 \p{Script_Extensions=SignWriting}) (672)
5303 \p{Script: Sharada} (Short: \p{Sc=Shrd}) (94: U+11180..111CD,
5304 U+111D0..111DF)
5305 \p{Script: Shavian} \p{Script_Extensions=Shavian} (Short:
5306 \p{Sc=Shaw}, \p{Shaw}) (48)
5307 \p{Script: Shaw} \p{Script=Shavian} (=
5308 \p{Script_Extensions=Shavian}) (48)
5309 \p{Script: Shrd} \p{Script=Sharada} (94)
5310 \p{Script: Sidd} \p{Script=Siddham} (=
5311 \p{Script_Extensions=Siddham}) (92)
5312 \p{Script: Siddham} \p{Script_Extensions=Siddham} (Short:
5313 \p{Sc=Sidd}, \p{Sidd}) (92)
5314 \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
5315 \p{Sc=Sgnw}, \p{Sgnw}) (672)
5316 \p{Script: Sind} \p{Script=Khudawadi} (69)
5317 \p{Script: Sinh} \p{Script=Sinhala} (110)
5318 \p{Script: Sinhala} (Short: \p{Sc=Sinh}) (110: U+0D82..0D83,
5319 U+0D85..0D96, U+0D9A..0DB1,
5320 U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
5321 \p{Script: Sogd} \p{Script=Sogdian} (42)
5322 \p{Script: Sogdian} (Short: \p{Sc=Sogd}) (42: U+10F30..10F59)
5323 \p{Script: Sogo} \p{Script=Old_Sogdian} (=
5324 \p{Script_Extensions=Old_Sogdian}) (40)
5325 \p{Script: Sora} \p{Script=Sora_Sompeng} (=
5326 \p{Script_Extensions=Sora_Sompeng}) (35)
5327 \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
5328 (Short: \p{Sc=Sora}, \p{Sora}) (35)
5329 \p{Script: Soyo} \p{Script=Soyombo} (=
5330 \p{Script_Extensions=Soyombo}) (83)
5331 \p{Script: Soyombo} \p{Script_Extensions=Soyombo} (Short:
5332 \p{Sc=Soyo}, \p{Soyo}) (83)
5333 \p{Script: Sund} \p{Script=Sundanese} (=
5334 \p{Script_Extensions=Sundanese}) (72)
5335 \p{Script: Sundanese} \p{Script_Extensions=Sundanese} (Short:
5336 \p{Sc=Sund}, \p{Sund}) (72)
5337 \p{Script: Sylo} \p{Script=Syloti_Nagri} (44)
5338 \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (44: U+A800..A82B)
5339 \p{Script: Syrc} \p{Script=Syriac} (88)
5340 \p{Script: Syriac} (Short: \p{Sc=Syrc}) (88: U+0700..070D,
5341 U+070F..074A, U+074D..074F, U+0860..086A)
5342 \p{Script: Tagalog} (Short: \p{Sc=Tglg}) (20: U+1700..170C,
5343 U+170E..1714)
5344 \p{Script: Tagb} \p{Script=Tagbanwa} (18)
5345 \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}) (18: U+1760..176C,
5346 U+176E..1770, U+1772..1773)
5347 \p{Script: Tai_Le} (Short: \p{Sc=Tale}) (35: U+1950..196D,
5348 U+1970..1974)
5349 \p{Script: Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
5350 \p{Sc=Lana}, \p{Lana}) (127)
5351 \p{Script: Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
5352 \p{Sc=Tavt}, \p{Tavt}) (72)
5353 \p{Script: Takr} \p{Script=Takri} (67)
5354 \p{Script: Takri} (Short: \p{Sc=Takr}) (67: U+11680..116B8,
5355 U+116C0..116C9)
5356 \p{Script: Tale} \p{Script=Tai_Le} (35)
5357 \p{Script: Talu} \p{Script=New_Tai_Lue} (=
5358 \p{Script_Extensions=New_Tai_Lue}) (83)
5359 \p{Script: Tamil} (Short: \p{Sc=Taml}) (123: U+0B82..0B83,
5360 U+0B85..0B8A, U+0B8E..0B90,
5361 U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
5362 \p{Script: Taml} \p{Script=Tamil} (123)
5363 \p{Script: Tang} \p{Script=Tangut} (= \p{Script_Extensions=
5364 Tangut}) (6892)
5365 \p{Script: Tangut} \p{Script_Extensions=Tangut} (Short:
5366 \p{Sc=Tang}, \p{Tang}) (6892)
5367 \p{Script: Tavt} \p{Script=Tai_Viet} (=
5368 \p{Script_Extensions=Tai_Viet}) (72)
5369 \p{Script: Telu} \p{Script=Telugu} (98)
5370 \p{Script: Telugu} (Short: \p{Sc=Telu}) (98: U+0C00..0C0C,
5371 U+0C0E..0C10, U+0C12..0C28,
5372 U+0C2A..0C39, U+0C3D..0C44, U+0C46..0C48
5373 ...)
5374 \p{Script: Tfng} \p{Script=Tifinagh} (=
5375 \p{Script_Extensions=Tifinagh}) (59)
5376 \p{Script: Tglg} \p{Script=Tagalog} (20)
5377 \p{Script: Thaa} \p{Script=Thaana} (50)
5378 \p{Script: Thaana} (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
5379 \p{Script: Thai} \p{Script_Extensions=Thai} (Short: \p{Sc=
5380 Thai}, \p{Thai}) (86)
5381 \p{Script: Tibetan} \p{Script_Extensions=Tibetan} (Short:
5382 \p{Sc=Tibt}, \p{Tibt}) (207)
5383 \p{Script: Tibt} \p{Script=Tibetan} (=
5384 \p{Script_Extensions=Tibetan}) (207)
5385 \p{Script: Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
5386 \p{Sc=Tfng}, \p{Tfng}) (59)
5387 \p{Script: Tirh} \p{Script=Tirhuta} (82)
5388 \p{Script: Tirhuta} (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
5389 U+114D0..114D9)
5390 \p{Script: Ugar} \p{Script=Ugaritic} (=
5391 \p{Script_Extensions=Ugaritic}) (31)
5392 \p{Script: Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
5393 \p{Sc=Ugar}, \p{Ugar}) (31)
5394 \p{Script: Unknown} \p{Script_Extensions=Unknown} (Short:
5395 \p{Sc=Zzzz}, \p{Zzzz}) (976_118 plus all
5396 above-Unicode code points)
5397 \p{Script: Vai} \p{Script_Extensions=Vai} (Short: \p{Sc=
5398 Vai}, \p{Vai}) (300)
5399 \p{Script: Vaii} \p{Script=Vai} (= \p{Script_Extensions=
5400 Vai}) (300)
5401 \p{Script: Wancho} \p{Script_Extensions=Wancho} (Short:
5402 \p{Sc=Wcho}, \p{Wcho}) (59)
5403 \p{Script: Wara} \p{Script=Warang_Citi} (=
5404 \p{Script_Extensions=Warang_Citi}) (84)
5405 \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
5406 \p{Sc=Wara}, \p{Wara}) (84)
5407 \p{Script: Wcho} \p{Script=Wancho} (= \p{Script_Extensions=
5408 Wancho}) (59)
5409 \p{Script: Xpeo} \p{Script=Old_Persian} (=
5410 \p{Script_Extensions=Old_Persian}) (50)
5411 \p{Script: Xsux} \p{Script=Cuneiform} (=
5412 \p{Script_Extensions=Cuneiform}) (1234)
5413 \p{Script: Yi} (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
5414 U+A490..A4C6)
5415 \p{Script: Yiii} \p{Script=Yi} (1220)
5416 \p{Script: Zanabazar_Square} \p{Script_Extensions=
5417 Zanabazar_Square} (Short: \p{Sc=Zanb},
5418 \p{Zanb}) (72)
5419 \p{Script: Zanb} \p{Script=Zanabazar_Square} (=
5420 \p{Script_Extensions=Zanabazar_Square})
5421 (72)
5422 \p{Script: Zinh} \p{Script=Inherited} (571)
5423 \p{Script: Zyyy} \p{Script=Common} (7805)
5424 \p{Script: Zzzz} \p{Script=Unknown} (=
5425 \p{Script_Extensions=Unknown}) (976_118
5426 plus all above-Unicode code points)
5427 \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (89:
5428 U+0640, U+1E900..1E94B, U+1E950..1E959,
5429 U+1E95E..1E95F)
5430 \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (89)
5431 \p{Script_Extensions: Aghb} \p{Script_Extensions=
5432 Caucasian_Albanian} (53)
5433 \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (58:
5434 U+11700..1171A, U+1171D..1172B,
5435 U+11730..1173F)
5436 \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
5437 \p{Hluw}) (583: U+14400..14646)
5438 \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1325)
5439 \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
5440 (1325: U+0600..0604, U+0606..061C,
5441 U+061E..06DC, U+06DE..06FF,
5442 U+0750..077F, U+08A0..08B4 ...)
5443 \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
5444 (96: U+0531..0556, U+0559..058A,
5445 U+058D..058F, U+FB13..FB17)
5446 \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
5447 (31)
5448 \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
5449 \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
5450 (61: U+10B00..10B35, U+10B39..10B3F)
5451 \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
5452 \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
5453 \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
5454 (121: U+1B00..1B4B, U+1B50..1B7C)
5455 \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
5456 \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
5457 U+A6A0..A6F7, U+16800..16A38)
5458 \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
5459 \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
5460 (36: U+16AD0..16AED, U+16AF0..16AF5)
5461 \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
5462 U+1BC0..1BF3, U+1BFC..1BFF)
5463 \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
5464 \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
5465 \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
5466 (113: U+0951..0952, U+0964..0965,
5467 U+0980..0983, U+0985..098C,
5468 U+098F..0990, U+0993..09A8 ...)
5469 \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
5470 (97: U+11C00..11C08, U+11C0A..11C36,
5471 U+11C38..11C45, U+11C50..11C6C)
5472 \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
5473 \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (112)
5474 \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
5475 (112: U+02EA..02EB, U+3001..3003,
5476 U+3008..3011, U+3013..301F,
5477 U+302A..302D, U+3030 ...)
5478 \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
5479 \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
5480 (109: U+11000..1104D, U+11052..1106F,
5481 U+1107F)
5482 \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
5483 \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
5484 (256: U+2800..28FF)
5485 \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
5486 \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
5487 (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
5488 \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
5489 \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
5490 U+1735..1736, U+1740..1753)
5491 \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (90)
5492 \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
5493 \p{Cans}) (710: U+1400..167F,
5494 U+18B0..18F5)
5495 \p{Script_Extensions: Cans} \p{Script_Extensions=
5496 Canadian_Aboriginal} (710)
5497 \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
5498 \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
5499 U+102A0..102D0)
5500 \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
5501 \p{Aghb}) (53: U+10530..10563, U+1056F)
5502 \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (90:
5503 U+09E6..09EF, U+1040..1049,
5504 U+11100..11134, U+11136..11146)
5505 \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
5506 U+AA00..AA36, U+AA40..AA4D,
5507 U+AA50..AA59, U+AA5C..AA5F)
5508 \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
5509 \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
5510 (172: U+13A0..13F5, U+13F8..13FD,
5511 U+AB70..ABBF)
5512 \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
5513 (7386: [\x00-
5514 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
5515 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
5516 \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
5517 U+02E5..02E9, U+02EC..02FF, U+0374,
5518 U+037E ...)
5519 \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
5520 \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
5521 (165: U+03E2..03EF, U+2C80..2CF3,
5522 U+2CF9..2CFF, U+102E0..102FB)
5523 \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
5524 \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
5525 (1234: U+12000..12399, U+12400..1246E,
5526 U+12470..12474, U+12480..12543)
5527 \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
5528 (112: U+10100..10102, U+10107..10133,
5529 U+10137..1013F, U+10800..10805, U+10808,
5530 U+1080A..10835 ...)
5531 \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
5532 (446: U+0400..052F, U+1C80..1C88,
5533 U+1D2B, U+1D78, U+2DE0..2DFF, U+2E43 ...)
5534 \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (446)
5535 \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
5536 (80: U+10400..1044F)
5537 \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
5538 \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
5539 (210: U+0900..0952, U+0955..097F,
5540 U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
5541 U+A830..A839 ...)
5542 \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
5543 \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
5544 U+0964..096F, U+A830..A839,
5545 U+11800..1183B)
5546 \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
5547 \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
5548 \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
5549 (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
5550 U+1BC80..1BC88, U+1BC90..1BC99,
5551 U+1BC9C..1BCA3)
5552 \p{Script_Extensions: Egyp} \p{Script_Extensions=
5553 Egyptian_Hieroglyphs} (1080)
5554 \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
5555 \p{Egyp}) (1080: U+13000..1342E,
5556 U+13430..13438)
5557 \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
5558 \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
5559 (40: U+10500..10527)
5560 \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
5561 \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
5562 (23: U+10FE0..10FF6)
5563 \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
5564 \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
5565 (495: U+1200..1248, U+124A..124D,
5566 U+1250..1256, U+1258, U+125A..125D,
5567 U+1260..1288 ...)
5568 \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (175)
5569 \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
5570 (175: U+0589, U+10A0..10C5, U+10C7,
5571 U+10CD, U+10D0..10FF, U+1C90..1CBA ...)
5572 \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136)
5573 \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
5574 (136: U+0484, U+0487, U+2C00..2C2E,
5575 U+2C30..2C5E, U+2E43, U+A66F ...)
5576 \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
5577 (65)
5578 \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
5579 (77)
5580 \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
5581 \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
5582 U+10330..1034A)
5583 \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
5584 \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
5585 (116: U+0951..0952, U+0964..0965,
5586 U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
5587 U+1CF2..1CF4 ...)
5588 \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
5589 U+0342, U+0345, U+0370..0373,
5590 U+0375..0377, U+037A..037D, U+037F ...)
5591 \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
5592 \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
5593 (105: U+0951..0952, U+0964..0965,
5594 U+0A81..0A83, U+0A85..0A8D,
5595 U+0A8F..0A91, U+0A93..0AA8 ...)
5596 \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
5597 \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
5598 \p{Gong}) (65: U+0964..0965,
5599 U+11D60..11D65, U+11D67..11D68,
5600 U+11D6A..11D8E, U+11D90..11D91,
5601 U+11D93..11D98 ...)
5602 \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
5603 (94: U+0951..0952, U+0964..0965,
5604 U+0A01..0A03, U+0A05..0A0A,
5605 U+0A0F..0A10, U+0A13..0A28 ...)
5606 \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
5607 \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (89_513:
5608 U+2E80..2E99, U+2E9B..2EF3,
5609 U+2F00..2FD5, U+3001..3003,
5610 U+3005..3011, U+3013..301F ...)
5611 \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
5612 \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
5613 (11_775: U+1100..11FF, U+3001..3003,
5614 U+3008..3011, U+3013..301F,
5615 U+302E..3030, U+3037 ...)
5616 \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (89_513)
5617 \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
5618 \p{Rohg}) (55: U+060C, U+061B, U+061F,
5619 U+0640, U+06D4, U+10D00..10D27 ...)
5620 \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
5621 \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
5622 (23: U+1720..1736)
5623 \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
5624 \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
5625 U+108E0..108F2, U+108F4..108F5,
5626 U+108FB..108FF)
5627 \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
5628 \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
5629 (134: U+0591..05C7, U+05D0..05EA,
5630 U+05EF..05F4, U+FB1D..FB36,
5631 U+FB38..FB3C, U+FB3E ...)
5632 \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (431)
5633 \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
5634 (431: U+3001..3003, U+3008..3011,
5635 U+3013..301F, U+3030..3035, U+3037,
5636 U+303C..303D ...)
5637 \p{Script_Extensions: Hluw} \p{Script_Extensions=
5638 Anatolian_Hieroglyphs} (583)
5639 \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
5640 (127)
5641 \p{Script_Extensions: Hmnp} \p{Script_Extensions=
5642 Nyiakeng_Puachue_Hmong} (71)
5643 \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
5644 (108)
5645 \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
5646 \p{Armi}) (31: U+10840..10855,
5647 U+10857..1085F)
5648 \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
5649 (502: U+0300..0341, U+0343..0344,
5650 U+0346..0362, U+0953..0954,
5651 U+1AB0..1ABE, U+1DC2..1DF9 ...)
5652 \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
5653 \p{Phli}) (27: U+10B60..10B72,
5654 U+10B78..10B7F)
5655 \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
5656 Prti}, \p{Prti}) (30: U+10B40..10B55,
5657 U+10B58..10B5F)
5658 \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
5659 \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
5660 \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
5661 (91: U+A980..A9CD, U+A9CF..A9D9,
5662 U+A9DE..A9DF)
5663 \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (87:
5664 U+0966..096F, U+A830..A839,
5665 U+11080..110C1, U+110CD)
5666 \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
5667 \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (356)
5668 \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
5669 (104: U+0951..0952, U+0964..0965,
5670 U+0C80..0C8C, U+0C8E..0C90,
5671 U+0C92..0CA8, U+0CAA..0CB3 ...)
5672 \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
5673 (356: U+3001..3003, U+3008..3011,
5674 U+3013..301F, U+3030..3035, U+3037,
5675 U+303C..303D ...)
5676 \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
5677 (48: U+A900..A92F)
5678 \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
5679 \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
5680 (68: U+10A00..10A03, U+10A05..10A06,
5681 U+10A0C..10A13, U+10A15..10A17,
5682 U+10A19..10A35, U+10A38..10A3A ...)
5683 \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
5684 U+1780..17DD, U+17E0..17E9,
5685 U+17F0..17F9, U+19E0..19FF)
5686 \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
5687 \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (82)
5688 \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (82:
5689 U+0AE6..0AEF, U+A830..A839,
5690 U+11200..11211, U+11213..1123E)
5691 \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
5692 (81: U+0964..0965, U+A830..A839,
5693 U+112B0..112EA, U+112F0..112F9)
5694 \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (104)
5695 \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (87)
5696 \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
5697 \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (82:
5698 U+0E81..0E82, U+0E84, U+0E86..0E8A,
5699 U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
5700 \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (82)
5701 \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
5702 (1387: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
5703 \xf6\xf8-\xff], U+0100..02B8,
5704 U+02E0..02E4, U+0363..036F,
5705 U+0485..0486, U+0951..0952 ...)
5706 \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1387)
5707 \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
5708 \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
5709 U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
5710 \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
5711 \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
5712 U+0965, U+1900..191E, U+1920..192B,
5713 U+1930..193B, U+1940, U+1944..194F)
5714 \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
5715 \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
5716 \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
5717 (386: U+10107..10133, U+10600..10736,
5718 U+10740..10755, U+10760..10767)
5719 \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
5720 (268: U+10000..1000B, U+1000D..10026,
5721 U+10028..1003A, U+1003C..1003D,
5722 U+1003F..1004D, U+10050..1005D ...)
5723 \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (48:
5724 U+A4D0..A4FF)
5725 \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
5726 \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
5727 U+10280..1029C)
5728 \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
5729 \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
5730 U+10920..10939, U+1093F)
5731 \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
5732 (61: U+0964..096F, U+A830..A839,
5733 U+11150..11176)
5734 \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
5735 \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
5736 \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
5737 (25: U+11EE0..11EF8)
5738 \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
5739 (125: U+0951..0952, U+0964..0965,
5740 U+0D00..0D03, U+0D05..0D0C,
5741 U+0D0E..0D10, U+0D12..0D44 ...)
5742 \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
5743 \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
5744 (30: U+0640, U+0840..085B, U+085E)
5745 \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
5746 \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
5747 (52: U+0640, U+10AC0..10AE6,
5748 U+10AEB..10AF6)
5749 \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
5750 \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
5751 (68: U+11C70..11C8F, U+11C92..11CA7,
5752 U+11CA9..11CB6)
5753 \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
5754 \p{Gonm}) (77: U+0964..0965,
5755 U+11D00..11D06, U+11D08..11D09,
5756 U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
5757 ...)
5758 \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
5759 (91: U+16E40..16E9A)
5760 \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
5761 \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
5762 \p{Mtei}) (79: U+AAE0..AAF6,
5763 U+ABC0..ABED, U+ABF0..ABF9)
5764 \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
5765 (213)
5766 \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
5767 \p{Mend}) (213: U+1E800..1E8C4,
5768 U+1E8C7..1E8D6)
5769 \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
5770 (90)
5771 \p{Script_Extensions: Mero} \p{Script_Extensions=
5772 Meroitic_Hieroglyphs} (32)
5773 \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
5774 \p{Merc}) (90: U+109A0..109B7,
5775 U+109BC..109CF, U+109D2..109FF)
5776 \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
5777 \p{Mero}) (32: U+10980..1099F)
5778 \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
5779 U+16F00..16F4A, U+16F4F..16F87,
5780 U+16F8F..16F9F)
5781 \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (125)
5782 \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
5783 U+A830..A839, U+11600..11644,
5784 U+11650..11659)
5785 \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (171)
5786 \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
5787 (171: U+1800..180E, U+1810..1819,
5788 U+1820..1878, U+1880..18AA, U+202F,
5789 U+11660..1166C)
5790 \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
5791 U+16A40..16A5E, U+16A60..16A69,
5792 U+16A6E..16A6F)
5793 \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
5794 \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
5795 \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
5796 \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
5797 (48: U+0A66..0A6F, U+11280..11286,
5798 U+11288, U+1128A..1128D, U+1128F..1129D,
5799 U+1129F..112A9)
5800 \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
5801 (224: U+1000..109F, U+A92E,
5802 U+A9E0..A9FE, U+AA60..AA7F)
5803 \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
5804 \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
5805 (40: U+10880..1089E, U+108A7..108AF)
5806 \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
5807 \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
5808 (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
5809 U+1CF2, U+1CFA, U+A830..A835 ...)
5810 \p{Script_Extensions: Narb} \p{Script_Extensions=
5811 Old_North_Arabian} (32)
5812 \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
5813 \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
5814 (83: U+1980..19AB, U+19B0..19C9,
5815 U+19D0..19DA, U+19DE..19DF)
5816 \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (94:
5817 U+11400..11459, U+1145B, U+1145D..1145F)
5818 \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (62:
5819 U+07C0..07FA, U+07FD..07FF)
5820 \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (62)
5821 \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
5822 \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
5823 U+16FE1, U+1B170..1B2FB)
5824 \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
5825 Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
5826 U+1E130..1E13D, U+1E140..1E149,
5827 U+1E14E..1E14F)
5828 \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
5829 \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
5830 U+1680..169C)
5831 \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
5832 (48: U+1C50..1C7F)
5833 \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
5834 \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
5835 \p{Hung}) (108: U+10C80..10CB2,
5836 U+10CC0..10CF2, U+10CFA..10CFF)
5837 \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
5838 (39: U+10300..10323, U+1032D..1032F)
5839 \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
5840 \p{Narb}) (32: U+10A80..10A9F)
5841 \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
5842 (44: U+0483, U+10350..1037A)
5843 \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
5844 (50: U+103A0..103C3, U+103C8..103D5)
5845 \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
5846 (40: U+10F00..10F27)
5847 \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
5848 \p{Sarb}) (32: U+10A60..10A7F)
5849 \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
5850 (73: U+10C00..10C48)
5851 \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (96:
5852 U+0951..0952, U+0964..0965,
5853 U+0B01..0B03, U+0B05..0B0C,
5854 U+0B0F..0B10, U+0B13..0B28 ...)
5855 \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
5856 \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (96)
5857 \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
5858 U+104B0..104D3, U+104D8..104FB)
5859 \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
5860 \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
5861 \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
5862 (40: U+10480..1049D, U+104A0..104A9)
5863 \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
5864 \p{Hmng}) (127: U+16B00..16B45,
5865 U+16B50..16B59, U+16B5B..16B61,
5866 U+16B63..16B77, U+16B7D..16B8F)
5867 \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
5868 \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
5869 (32: U+10860..1087F)
5870 \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
5871 (57: U+11AC0..11AF8)
5872 \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
5873 \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
5874 \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
5875 \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
5876 (59: U+1802..1803, U+1805, U+A840..A877)
5877 \p{Script_Extensions: Phli} \p{Script_Extensions=
5878 Inscriptional_Pahlavi} (27)
5879 \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
5880 (30)
5881 \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
5882 \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
5883 (29: U+10900..1091B, U+1091F)
5884 \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
5885 \p{Script_Extensions: Prti} \p{Script_Extensions=
5886 Inscriptional_Parthian} (30)
5887 \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
5888 \p{Phlp}) (30: U+0640, U+10B80..10B91,
5889 U+10B99..10B9C, U+10BA9..10BAF)
5890 \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
5891 \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (502)
5892 \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
5893 U+A930..A953, U+A95F)
5894 \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
5895 \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
5896 (55)
5897 \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
5898 U+16A0..16EA, U+16EE..16F8)
5899 \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
5900 \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
5901 (61: U+0800..082D, U+0830..083E)
5902 \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
5903 \p{Script_Extensions: Sarb} \p{Script_Extensions=
5904 Old_South_Arabian} (32)
5905 \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
5906 \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
5907 (82: U+A880..A8C5, U+A8CE..A8D9)
5908 \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
5909 \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
5910 (100: U+0951, U+1CD7, U+1CD9,
5911 U+1CDC..1CDD, U+1CE0, U+11180..111CD ...)
5912 \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
5913 (48: U+10450..1047F)
5914 \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
5915 \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100)
5916 \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
5917 \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
5918 (92: U+11580..115B5, U+115B8..115DD)
5919 \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
5920 (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
5921 U+1DAA1..1DAAF)
5922 \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
5923 \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112)
5924 \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
5925 (112: U+0964..0965, U+0D82..0D83,
5926 U+0D85..0D96, U+0D9A..0DB1,
5927 U+0DB3..0DBB, U+0DBD ...)
5928 \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
5929 \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
5930 (43: U+0640, U+10F30..10F59)
5931 \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
5932 \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
5933 \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
5934 \p{Sora}) (35: U+110D0..110E8,
5935 U+110F0..110F9)
5936 \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
5937 \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
5938 (83: U+11A50..11AA2)
5939 \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
5940 \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
5941 (72: U+1B80..1BBF, U+1CC0..1CC7)
5942 \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56)
5943 \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
5944 \p{Sylo}) (56: U+0964..0965,
5945 U+09E6..09EF, U+A800..A82B)
5946 \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (105)
5947 \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
5948 (105: U+060C, U+061B..061C, U+061F,
5949 U+0640, U+064B..0655, U+0670 ...)
5950 \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
5951 (22: U+1700..170C, U+170E..1714,
5952 U+1735..1736)
5953 \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
5954 \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
5955 (20: U+1735..1736, U+1760..176C,
5956 U+176E..1770, U+1772..1773)
5957 \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
5958 U+1040..1049, U+1950..196D, U+1970..1974)
5959 \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
5960 (127: U+1A20..1A5E, U+1A60..1A7C,
5961 U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
5962 \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
5963 (72: U+AA80..AAC2, U+AADB..AADF)
5964 \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (79)
5965 \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (79:
5966 U+0964..0965, U+A830..A839,
5967 U+11680..116B8, U+116C0..116C9)
5968 \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
5969 \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
5970 \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
5971 U+0951..0952, U+0964..0965,
5972 U+0B82..0B83, U+0B85..0B8A,
5973 U+0B8E..0B90, U+0B92..0B95 ...)
5974 \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
5975 \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6892)
5976 \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
5977 (6892: U+16FE0, U+17000..187F7,
5978 U+18800..18AF2)
5979 \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
5980 \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (104)
5981 \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
5982 (104: U+0951..0952, U+0964..0965,
5983 U+0C00..0C0C, U+0C0E..0C10,
5984 U+0C12..0C28, U+0C2A..0C39 ...)
5985 \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
5986 \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
5987 \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
5988 \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
5989 U+060C, U+061B..061C, U+061F,
5990 U+0660..0669, U+0780..07B1, U+FDF2 ...)
5991 \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
5992 U+0E01..0E3A, U+0E40..0E5B)
5993 \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
5994 (207: U+0F00..0F47, U+0F49..0F6C,
5995 U+0F71..0F97, U+0F99..0FBC,
5996 U+0FBE..0FCC, U+0FCE..0FD4 ...)
5997 \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
5998 \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
5999 (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
6000 \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
6001 \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
6002 (97: U+0951..0952, U+0964..0965, U+1CF2,
6003 U+A830..A839, U+11480..114C7,
6004 U+114D0..114D9)
6005 \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
6006 \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
6007 (31: U+10380..1039D, U+1039F)
6008 \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
6009 (976_118 plus all above-Unicode code
6010 points: U+0378..0379, U+0380..0383,
6011 U+038B, U+038D, U+03A2, U+0530 ...)
6012 \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
6013 U+A500..A62B)
6014 \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
6015 \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59:
6016 U+1E2C0..1E2F9, U+1E2FF)
6017 \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
6018 \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
6019 (84: U+118A0..118F2, U+118FF)
6020 \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59)
6021 \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
6022 \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
6023 \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
6024 U+3001..3002, U+3008..3011,
6025 U+3014..301B, U+30FB, U+A000..A48C,
6026 U+A490..A4C6 ...)
6027 \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
6028 \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
6029 \p{Zanb}) (72: U+11A00..11A47)
6030 \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
6031 (72)
6032 \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (502)
6033 \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7386)
6034 \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (976_118
6035 plus all above-Unicode code points)
6036 \p{Scx: *} \p{Script_Extensions: *}
6037 \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
6038 \p{SD: *} \p{Soft_Dotted: *}
6039 \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
6040 \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
6041 U+FE52, U+FF0E)
6042 \p{Sentence_Break: CL} \p{Sentence_Break=Close} (187)
6043 \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187:
6044 [\"\'\(\)\[\]\{\}\xab\xbb],
6045 U+0F3A..0F3D, U+169B..169C,
6046 U+2018..201F, U+2039..203A, U+2045..2046
6047 ...)
6048 \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1: [\r])
6049 \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (2368)
6050 \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2368: U+0300..036F,
6051 U+0483..0489, U+0591..05BD, U+05BF,
6052 U+05C1..05C2, U+05C4..05C5 ...)
6053 \p{Sentence_Break: FO} \p{Sentence_Break=Format} (63)
6054 \p{Sentence_Break: Format} (Short: \p{SB=FO}) (63: [\xad],
6055 U+0600..0605, U+061C, U+06DD, U+070F,
6056 U+08E2 ...)
6057 \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (121_822)
6058 \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1: [\n])
6059 \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2293)
6060 \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2293: [a-
6061 z\xaa\xb5\xba\xdf-\xf6\xf8-\xff],
6062 U+0101, U+0103, U+0105, U+0107, U+0109
6063 ...)
6064 \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (632)
6065 \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (632: [0-9],
6066 U+0660..0669, U+066B..066C,
6067 U+06F0..06F9, U+07C0..07C9, U+0966..096F
6068 ...)
6069 \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (121_822: U+01BB,
6070 U+01C0..01C3, U+0294, U+02B9..02BF,
6071 U+02C6..02D1, U+02EC ...)
6072 \p{Sentence_Break: Other} (Short: \p{SB=XX}) (984_661 plus all
6073 above-Unicode code points:
6074 [^\t\n\cK\f\r\x20!\"\'\(\),\-.0-9:?A-
6075 Z\[\]a-z\{\}\x85\xa0\xaa-
6076 \xab\xad\xb5\xba-\xbb\xc0-\xd6\xd8-
6077 \xf6\xf8-\xff], U+02C2..02C5,
6078 U+02D2..02DF, U+02E5..02EB, U+02ED,
6079 U+02EF..02FF ...)
6080 \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
6081 \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
6082 U+055D, U+060C..060D, U+07F8, U+1802,
6083 U+1808 ...)
6084 \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
6085 \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
6086 U+2028..2029)
6087 \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
6088 U+1680, U+2000..200A, U+202F, U+205F,
6089 U+3000)
6090 \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (138)
6091 \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (138: [!?], U+0589,
6092 U+061E..061F, U+06D4, U+0700..0702,
6093 U+07F9 ...)
6094 \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1893)
6095 \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1893: [A-Z\xc0-
6096 \xd6\xd8-\xde], U+0100, U+0102, U+0104,
6097 U+0106, U+0108 ...)
6098 \p{Sentence_Break: XX} \p{Sentence_Break=Other} (984_661 plus all
6099 above-Unicode code points)
6100 \p{Sentence_Terminal} \p{Sentence_Terminal=Y} (Short: \p{STerm})
6101 (141)
6102 \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
6103 (1_113_971 plus all above-Unicode code
6104 points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
6105 \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
6106 \xff], U+0100..0588, U+058A..061D,
6107 U+0620..06D3, U+06D5..06FF, U+0703..07F8
6108 ...)
6109 \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (141:
6110 [!.?], U+0589, U+061E..061F, U+06D4,
6111 U+0700..0702, U+07F9 ...)
6112 \p{Separator} \p{General_Category=Separator} (Short:
6113 \p{Z}) (19)
6114 \p{Sgnw} \p{SignWriting} (= \p{Script_Extensions=
6115 SignWriting}) (672)
6116 \p{Sharada} \p{Script_Extensions=Sharada} (Short:
6117 \p{Shrd}; NOT \p{Block=Sharada}) (100)
6118 \p{Shavian} \p{Script_Extensions=Shavian} (Short:
6119 \p{Shaw}) (48)
6120 \p{Shaw} \p{Shavian} (= \p{Script_Extensions=
6121 Shavian}) (48)
6122 X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
6123 (16)
6124 \p{Shrd} \p{Sharada} (= \p{Script_Extensions=
6125 Sharada}) (NOT \p{Block=Sharada}) (100)
6126 \p{Sidd} \p{Siddham} (= \p{Script_Extensions=
6127 Siddham}) (NOT \p{Block=Siddham}) (92)
6128 \p{Siddham} \p{Script_Extensions=Siddham} (Short:
6129 \p{Sidd}; NOT \p{Block=Siddham}) (92)
6130 \p{SignWriting} \p{Script_Extensions=SignWriting} (Short:
6131 \p{Sgnw}) (672)
6132 \p{Sind} \p{Khudawadi} (= \p{Script_Extensions=
6133 Khudawadi}) (NOT \p{Block=Khudawadi})
6134 (81)
6135 \p{Sinh} \p{Sinhala} (= \p{Script_Extensions=
6136 Sinhala}) (NOT \p{Block=Sinhala}) (112)
6137 \p{Sinhala} \p{Script_Extensions=Sinhala} (Short:
6138 \p{Sinh}; NOT \p{Block=Sinhala}) (112)
6139 X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
6140 \p{Sk} \p{Modifier_Symbol} (=
6141 \p{General_Category=Modifier_Symbol})
6142 (121)
6143 \p{Sm} \p{Math_Symbol} (= \p{General_Category=
6144 Math_Symbol}) (948)
6145 X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
6146 \p{InSmallForms}) (32)
6147 X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
6148 Small_Form_Variants}) (32)
6149 X \p{Small_Kana_Ext} \p{Small_Kana_Extension} (= \p{Block=
6150 Small_Kana_Extension}) (64)
6151 X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short:
6152 \p{InSmallKanaExt}) (64)
6153 \p{So} \p{Other_Symbol} (= \p{General_Category=
6154 Other_Symbol}) (6161)
6155 \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
6156 \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
6157 all above-Unicode code points: [\x00-
6158 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6159 Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
6160 U+0100..012E, U+0130..0248,
6161 U+024A..0267, U+0269..029C, U+029E..02B1
6162 ...)
6163 \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46: [i-j],
6164 U+012F, U+0249, U+0268, U+029D, U+02B2
6165 ...)
6166 \p{Sogd} \p{Sogdian} (= \p{Script_Extensions=
6167 Sogdian}) (NOT \p{Block=Sogdian}) (43)
6168 \p{Sogdian} \p{Script_Extensions=Sogdian} (Short:
6169 \p{Sogd}; NOT \p{Block=Sogdian}) (43)
6170 \p{Sogo} \p{Old_Sogdian} (= \p{Script_Extensions=
6171 Old_Sogdian}) (NOT \p{Block=
6172 Old_Sogdian}) (40)
6173 \p{Sora} \p{Sora_Sompeng} (= \p{Script_Extensions=
6174 Sora_Sompeng}) (NOT \p{Block=
6175 Sora_Sompeng}) (35)
6176 \p{Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} (Short:
6177 \p{Sora}; NOT \p{Block=Sora_Sompeng})
6178 (35)
6179 \p{Soyo} \p{Soyombo} (= \p{Script_Extensions=
6180 Soyombo}) (NOT \p{Block=Soyombo}) (83)
6181 \p{Soyombo} \p{Script_Extensions=Soyombo} (Short:
6182 \p{Soyo}; NOT \p{Block=Soyombo}) (83)
6183 \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25)
6184 \p{Space: *} \p{White_Space: *}
6185 \p{Space_Separator} \p{General_Category=Space_Separator}
6186 (Short: \p{Zs}) (17)
6187 \p{SpacePerl} \p{XPosixSpace} (25)
6188 \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
6189 \p{Mc}) (429)
6190 X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
6191 (Short: \p{InModifierLetters}) (80)
6192 X \p{Specials} \p{Block=Specials} (16)
6193 \p{STerm} \p{Sentence_Terminal} (=
6194 \p{Sentence_Terminal=Y}) (141)
6195 \p{STerm: *} \p{Sentence_Terminal: *}
6196 \p{Sund} \p{Sundanese} (= \p{Script_Extensions=
6197 Sundanese}) (NOT \p{Block=Sundanese})
6198 (72)
6199 \p{Sundanese} \p{Script_Extensions=Sundanese} (Short:
6200 \p{Sund}; NOT \p{Block=Sundanese}) (72)
6201 X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
6202 Sundanese_Supplement}) (16)
6203 X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
6204 \p{InSundaneseSup}) (16)
6205 X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
6206 Supplemental_Arrows_A}) (16)
6207 X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
6208 Supplemental_Arrows_B}) (128)
6209 X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block=
6210 Supplemental_Arrows_C}) (256)
6211 X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
6212 \p{Block=
6213 Supplemental_Mathematical_Operators})
6214 (256)
6215 X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
6216 \p{Block=
6217 Supplementary_Private_Use_Area_A})
6218 (65_536)
6219 X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
6220 \p{Block=
6221 Supplementary_Private_Use_Area_B})
6222 (65_536)
6223 X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
6224 Supplemental_Punctuation}) (128)
6225 X \p{Sup_Symbols_And_Pictographs}
6226 \p{Supplemental_Symbols_And_Pictographs}
6227 (= \p{Block=
6228 Supplemental_Symbols_And_Pictographs})
6229 (256)
6230 X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
6231 \p{Block=Superscripts_And_Subscripts})
6232 (48)
6233 X \p{Superscripts_And_Subscripts} \p{Block=
6234 Superscripts_And_Subscripts} (Short:
6235 \p{InSuperAndSub}) (48)
6236 X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
6237 \p{InSupArrowsA}) (16)
6238 X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
6239 \p{InSupArrowsB}) (128)
6240 X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
6241 \p{InSupArrowsC}) (256)
6242 X \p{Supplemental_Mathematical_Operators} \p{Block=
6243 Supplemental_Mathematical_Operators}
6244 (Short: \p{InSupMathOperators}) (256)
6245 X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
6246 (Short: \p{InSupPunctuation}) (128)
6247 X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
6248 Supplemental_Symbols_And_Pictographs}
6249 (Short: \p{InSupSymbolsAndPictographs})
6250 (256)
6251 X \p{Supplementary_Private_Use_Area_A} \p{Block=
6252 Supplementary_Private_Use_Area_A}
6253 (Short: \p{InSupPUAA}) (65_536)
6254 X \p{Supplementary_Private_Use_Area_B} \p{Block=
6255 Supplementary_Private_Use_Area_B}
6256 (Short: \p{InSupPUAB}) (65_536)
6257 \p{Surrogate} \p{General_Category=Surrogate} (Short:
6258 \p{Cs}) (2048)
6259 X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688)
6260 \p{Sylo} \p{Syloti_Nagri} (= \p{Script_Extensions=
6261 Syloti_Nagri}) (NOT \p{Block=
6262 Syloti_Nagri}) (56)
6263 \p{Syloti_Nagri} \p{Script_Extensions=Syloti_Nagri} (Short:
6264 \p{Sylo}; NOT \p{Block=Syloti_Nagri})
6265 (56)
6266 \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
6267 (7292)
6268 X \p{Symbols_And_Pictographs_Ext_A}
6269 \p{Symbols_And_Pictographs_Extended_A}
6270 (= \p{Block=
6271 Symbols_And_Pictographs_Extended_A})
6272 (144)
6273 X \p{Symbols_And_Pictographs_Extended_A} \p{Block=
6274 Symbols_And_Pictographs_Extended_A} (144)
6275 \p{Syrc} \p{Syriac} (= \p{Script_Extensions=
6276 Syriac}) (NOT \p{Block=Syriac}) (105)
6277 \p{Syriac} \p{Script_Extensions=Syriac} (Short:
6278 \p{Syrc}; NOT \p{Block=Syriac}) (105)
6279 X \p{Syriac_Sup} \p{Syriac_Supplement} (= \p{Block=
6280 Syriac_Supplement}) (16)
6281 X \p{Syriac_Supplement} \p{Block=Syriac_Supplement} (Short:
6282 \p{InSyriacSup}) (16)
6283 \p{Tagalog} \p{Script_Extensions=Tagalog} (Short:
6284 \p{Tglg}; NOT \p{Block=Tagalog}) (22)
6285 \p{Tagb} \p{Tagbanwa} (= \p{Script_Extensions=
6286 Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
6287 \p{Tagbanwa} \p{Script_Extensions=Tagbanwa} (Short:
6288 \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
6289 X \p{Tags} \p{Block=Tags} (128)
6290 \p{Tai_Le} \p{Script_Extensions=Tai_Le} (Short:
6291 \p{Tale}; NOT \p{Block=Tai_Le}) (45)
6292 \p{Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
6293 \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
6294 \p{Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
6295 \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
6296 X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
6297 Tai_Xuan_Jing_Symbols}) (96)
6298 X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
6299 \p{InTaiXuanJing}) (96)
6300 \p{Takr} \p{Takri} (= \p{Script_Extensions=Takri})
6301 (NOT \p{Block=Takri}) (79)
6302 \p{Takri} \p{Script_Extensions=Takri} (Short:
6303 \p{Takr}; NOT \p{Block=Takri}) (79)
6304 \p{Tale} \p{Tai_Le} (= \p{Script_Extensions=
6305 Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
6306 \p{Talu} \p{New_Tai_Lue} (= \p{Script_Extensions=
6307 New_Tai_Lue}) (NOT \p{Block=
6308 New_Tai_Lue}) (83)
6309 \p{Tamil} \p{Script_Extensions=Tamil} (Short:
6310 \p{Taml}; NOT \p{Block=Tamil}) (133)
6311 X \p{Tamil_Sup} \p{Tamil_Supplement} (= \p{Block=
6312 Tamil_Supplement}) (64)
6313 X \p{Tamil_Supplement} \p{Block=Tamil_Supplement} (Short:
6314 \p{InTamilSup}) (64)
6315 \p{Taml} \p{Tamil} (= \p{Script_Extensions=Tamil})
6316 (NOT \p{Block=Tamil}) (133)
6317 \p{Tang} \p{Tangut} (= \p{Script_Extensions=
6318 Tangut}) (NOT \p{Block=Tangut}) (6892)
6319 \p{Tangut} \p{Script_Extensions=Tangut} (Short:
6320 \p{Tang}; NOT \p{Block=Tangut}) (6892)
6321 X \p{Tangut_Components} \p{Block=Tangut_Components} (768)
6322 \p{Tavt} \p{Tai_Viet} (= \p{Script_Extensions=
6323 Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
6324 \p{Telu} \p{Telugu} (= \p{Script_Extensions=
6325 Telugu}) (NOT \p{Block=Telugu}) (104)
6326 \p{Telugu} \p{Script_Extensions=Telugu} (Short:
6327 \p{Telu}; NOT \p{Block=Telugu}) (104)
6328 \p{Term} \p{Terminal_Punctuation} (=
6329 \p{Terminal_Punctuation=Y}) (264)
6330 \p{Term: *} \p{Terminal_Punctuation: *}
6331 \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
6332 \p{Term}) (264)
6333 \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
6334 (1_113_848 plus all above-Unicode code
6335 points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
6336 9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
6337 U+0100..037D, U+037F..0386,
6338 U+0388..0588, U+058A..05C2, U+05C4..060B
6339 ...)
6340 \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (264:
6341 [!,.:;?], U+037E, U+0387, U+0589,
6342 U+05C3, U+060C ...)
6343 \p{Tfng} \p{Tifinagh} (= \p{Script_Extensions=
6344 Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
6345 \p{Tglg} \p{Tagalog} (= \p{Script_Extensions=
6346 Tagalog}) (NOT \p{Block=Tagalog}) (22)
6347 \p{Thaa} \p{Thaana} (= \p{Script_Extensions=
6348 Thaana}) (NOT \p{Block=Thaana}) (66)
6349 \p{Thaana} \p{Script_Extensions=Thaana} (Short:
6350 \p{Thaa}; NOT \p{Block=Thaana}) (66)
6351 \p{Thai} \p{Script_Extensions=Thai} (NOT \p{Block=
6352 Thai}) (86)
6353 \p{Tibetan} \p{Script_Extensions=Tibetan} (Short:
6354 \p{Tibt}; NOT \p{Block=Tibetan}) (207)
6355 \p{Tibt} \p{Tibetan} (= \p{Script_Extensions=
6356 Tibetan}) (NOT \p{Block=Tibetan}) (207)
6357 \p{Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
6358 \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
6359 \p{Tirh} \p{Tirhuta} (= \p{Script_Extensions=
6360 Tirhuta}) (NOT \p{Block=Tirhuta}) (97)
6361 \p{Tirhuta} \p{Script_Extensions=Tirhuta} (Short:
6362 \p{Tirh}; NOT \p{Block=Tirhuta}) (97)
6363 \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
6364 \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
6365 Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
6366 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
6367 \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
6368 (Short: \p{Lt}; /i= General_Category=
6369 Cased_Letter) (31)
6370 X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
6371 Transport_And_Map_Symbols}) (128)
6372 X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
6373 (Short: \p{InTransportAndMap}) (128)
6374 X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
6375 (= \p{Block=
6376 Unified_Canadian_Aboriginal_Syllabics})
6377 (640)
6378 X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
6379 Extended} (= \p{Block=
6380 Unified_Canadian_Aboriginal_Syllabics_-
6381 Extended}) (80)
6382 \p{Ugar} \p{Ugaritic} (= \p{Script_Extensions=
6383 Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
6384 \p{Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
6385 \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
6386 \p{UIdeo} \p{Unified_Ideograph} (=
6387 \p{Unified_Ideograph=Y}) (87_887)
6388 \p{UIdeo: *} \p{Unified_Ideograph: *}
6389 \p{Unassigned} \p{General_Category=Unassigned} (Short:
6390 \p{Cn}) (836_602 plus all above-Unicode
6391 code points)
6392 \p{Unicode} \p{Any} (1_114_112)
6393 X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
6394 Unified_Canadian_Aboriginal_Syllabics}
6395 (Short: \p{InUCAS}) (640)
6396 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
6397 Unified_Canadian_Aboriginal_Syllabics_-
6398 Extended} (Short: \p{InUCASExt}) (80)
6399 \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
6400 (87_887)
6401 \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
6402 (1_026_225 plus all above-Unicode code
6403 points: U+0000..33FF, U+4DB6..4DFF,
6404 U+9FF0..FA0D, U+FA10, U+FA12,
6405 U+FA15..FA1E ...)
6406 \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (87_887:
6407 U+3400..4DB5, U+4E00..9FEF,
6408 U+FA0E..FA0F, U+FA11, U+FA13..FA14,
6409 U+FA1F ...)
6410 \p{Unknown} \p{Script_Extensions=Unknown} (Short:
6411 \p{Zzzz}) (976_118 plus all above-
6412 Unicode code points)
6413 \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6414 Cased=Yes) (1908)
6415 \p{Upper: *} \p{Uppercase: *}
6416 \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6417 Cased=Yes) (1908)
6418 \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
6419 No) (1_112_204 plus all above-Unicode
6420 code points: [\x00-
6421 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6422 >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
6423 \xbf\xd7\xdf-\xff], U+0101, U+0103,
6424 U+0105, U+0107, U+0109 ...)
6425 \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
6426 Yes) (1908: [A-Z\xc0-\xd6\xd8-\xde],
6427 U+0100, U+0102, U+0104, U+0106, U+0108
6428 ...)
6429 \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
6430 (Short: \p{Lu}; /i= General_Category=
6431 Cased_Letter) (1788)
6432 \p{Vai} \p{Script_Extensions=Vai} (NOT \p{Block=
6433 Vai}) (300)
6434 \p{Vaii} \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
6435 \p{Block=Vai}) (300)
6436 \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
6437 NOT \p{Variation_Selectors}) (259)
6438 \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
6439 plus all above-Unicode code points:
6440 U+0000..180A, U+180E..FDFF,
6441 U+FE10..E00FF, U+E01F0..infinity)
6442 \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259:
6443 U+180B..180D, U+FE00..FE0F,
6444 U+E0100..E01EF)
6445 X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
6446 \p{InVS}) (16)
6447 X \p{Variation_Selectors_Supplement} \p{Block=
6448 Variation_Selectors_Supplement} (Short:
6449 \p{InVSSup}) (240)
6450 X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
6451 Vedic_Extensions}) (48)
6452 X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
6453 \p{InVedicExt}) (48)
6454 X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
6455 \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
6456 (787_620 plus all above-Unicode code
6457 points)
6458 \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (787_620 plus
6459 all above-Unicode code points: [\x00-
6460 \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
6461 \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
6462 U+02EC..10FF, U+1200..1400,
6463 U+1680..18AF, U+1900..2015 ...)
6464 \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
6465 Transformed_Rotated} (47)
6466 \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
6467 (47: U+2329..232A, U+3008..3011,
6468 U+3014..301F, U+3030, U+30A0, U+30FC ...)
6469 \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
6470 (148: U+3001..3002, U+3041, U+3043,
6471 U+3045, U+3047, U+3049 ...)
6472 \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
6473 Transformed_Upright} (148)
6474 \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
6475 (326_297)
6476 \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (326_297:
6477 [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
6478 U+02EA..02EB, U+1100..11FF,
6479 U+1401..167F, U+18B0..18FF, U+2016 ...)
6480 \p{VertSpace} \v (7: [\n\cK\f\r\x85], U+2028..2029)
6481 \p{Vo: *} \p{Vertical_Orientation: *}
6482 \p{VS} \p{Variation_Selector} (=
6483 \p{Variation_Selector=Y}) (NOT
6484 \p{Variation_Selectors}) (259)
6485 \p{VS: *} \p{Variation_Selector: *}
6486 X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
6487 \p{Block=
6488 Variation_Selectors_Supplement}) (240)
6489 \p{Wancho} \p{Script_Extensions=Wancho} (Short:
6490 \p{Wcho}; NOT \p{Block=Wancho}) (59)
6491 \p{Wara} \p{Warang_Citi} (= \p{Script_Extensions=
6492 Warang_Citi}) (NOT \p{Block=
6493 Warang_Citi}) (84)
6494 \p{Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
6495 \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
6496 \p{WB: *} \p{Word_Break: *}
6497 \p{Wcho} \p{Wancho} (= \p{Script_Extensions=
6498 Wancho}) (NOT \p{Block=Wancho}) (59)
6499 \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25)
6500 \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087
6501 plus all above-Unicode code points:
6502 [^\t\n\cK\f\r\x20\x85\xa0],
6503 U+0100..167F, U+1681..1FFF,
6504 U+200B..2027, U+202A..202E, U+2030..205E
6505 ...)
6506 \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25:
6507 [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6508 U+2000..200A, U+2028..2029, U+202F,
6509 U+205F ...)
6510 \p{Word} \p{XPosixWord} (128_919)
6511 \p{Word_Break: ALetter} (Short: \p{WB=LE}) (28_693: [A-Za-
6512 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6513 \xff], U+0100..02D7, U+02DE..02E4,
6514 U+02EC..02FF, U+0370..0374, U+0376..0377
6515 ...)
6516 \p{Word_Break: CR} (Short: \p{WB=CR}) (1: [\r])
6517 \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
6518 \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
6519 \p{Word_Break: E_Base} (Short: \p{WB=EB}) (0)
6520 \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0)
6521 \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0)
6522 \p{Word_Break: EB} \p{Word_Break=E_Base} (0)
6523 \p{Word_Break: EBG} \p{Word_Break=E_Base_GAZ} (0)
6524 \p{Word_Break: EM} \p{Word_Break=E_Modifier} (0)
6525 \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (11)
6526 \p{Word_Break: Extend} (Short: \p{WB=Extend}) (2372:
6527 U+0300..036F, U+0483..0489,
6528 U+0591..05BD, U+05BF, U+05C1..05C2,
6529 U+05C4..05C5 ...)
6530 \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
6531 U+203F..2040, U+2054, U+FE33..FE34,
6532 U+FE4D..FE4F ...)
6533 \p{Word_Break: FO} \p{Word_Break=Format} (62)
6534 \p{Word_Break: Format} (Short: \p{WB=FO}) (62: [\xad],
6535 U+0600..0605, U+061C, U+06DD, U+070F,
6536 U+08E2 ...)
6537 \p{Word_Break: GAZ} \p{Word_Break=Glue_After_Zwj} (0)
6538 \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0)
6539 \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75:
6540 U+05D0..05EA, U+05EF..05F2, U+FB1D,
6541 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
6542 ...)
6543 \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (75)
6544 \p{Word_Break: KA} \p{Word_Break=Katakana} (314)
6545 \p{Word_Break: Katakana} (Short: \p{WB=KA}) (314: U+3031..3035,
6546 U+309B..309C, U+30A0..30FA,
6547 U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
6548 ...)
6549 \p{Word_Break: LE} \p{Word_Break=ALetter} (28_693)
6550 \p{Word_Break: LF} (Short: \p{WB=LF}) (1: [\n])
6551 \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
6552 \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8: [:\xb7], U+0387,
6553 U+05F4, U+2027, U+FE13, U+FE55 ...)
6554 \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15: [,;], U+037E,
6555 U+0589, U+060C..060D, U+066C, U+07F8 ...)
6556 \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
6557 U+2018..2019, U+2024, U+FE52, U+FF07,
6558 U+FF0E)
6559 \p{Word_Break: ML} \p{Word_Break=MidLetter} (8)
6560 \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
6561 \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
6562 U+2028..2029)
6563 \p{Word_Break: NL} \p{Word_Break=Newline} (5)
6564 \p{Word_Break: NU} \p{Word_Break=Numeric} (631)
6565 \p{Word_Break: Numeric} (Short: \p{WB=NU}) (631: [0-9],
6566 U+0660..0669, U+066B, U+06F0..06F9,
6567 U+07C0..07C9, U+0966..096F ...)
6568 \p{Word_Break: Other} (Short: \p{WB=XX}) (1_081_874 plus all
6569 above-Unicode code points:
6570 [^\n\cK\f\r\x20\"\',.0-9:;A-Z_a-
6571 z\x85\xaa\xad\xb5\xb7\xba\xc0-\xd6\xd8-
6572 \xf6\xf8-\xff], U+02D8..02DD,
6573 U+02E5..02EB, U+0375, U+0378..0379,
6574 U+0380..0385 ...)
6575 \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
6576 U+1F1E6..1F1FF)
6577 \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
6578 \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
6579 \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
6580 \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20],
6581 U+1680, U+2000..2006, U+2008..200A,
6582 U+205F, U+3000)
6583 \p{Word_Break: XX} \p{Word_Break=Other} (1_081_874 plus all
6584 above-Unicode code points)
6585 \p{Word_Break: ZWJ} (Short: \p{WB=ZWJ}) (1: U+200D)
6586 \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
6587 \p{WSpace: *} \p{White_Space: *}
6588 \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
6589 \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
6590 (128_770)
6591 \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (985_342
6592 plus all above-Unicode code points:
6593 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
6594 >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
6595 \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
6596 U+02C2..02C5, U+02D2..02DF,
6597 U+02E5..02EB, U+02ED, U+02EF..02FF ...)
6598 \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (128_770:
6599 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
6600 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6601 U+02C6..02D1, U+02E0..02E4, U+02EC,
6602 U+02EE ...)
6603 \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (125_861)
6604 \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (988_251
6605 plus all above-Unicode code points:
6606 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
6607 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
6608 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
6609 U+02C2..02C5, U+02D2..02DF,
6610 U+02E5..02EB, U+02ED, U+02EF..036F ...)
6611 \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (125_861:
6612 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
6613 \xf6\xf8-\xff], U+0100..02C1,
6614 U+02C6..02D1, U+02E0..02E4, U+02EC,
6615 U+02EE ...)
6616 \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
6617 (128_770)
6618 \p{XIDC: *} \p{XID_Continue: *}
6619 \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (125_861)
6620 \p{XIDS: *} \p{XID_Start: *}
6621 \p{Xpeo} \p{Old_Persian} (= \p{Script_Extensions=
6622 Old_Persian}) (NOT \p{Block=
6623 Old_Persian}) (50)
6624 \p{XPerlSpace} \p{XPosixSpace} (25)
6625 \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short:
6626 \p{Alnum}) (127_886: [0-9A-Za-
6627 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6628 \xff], U+0100..02C1, U+02C6..02D1,
6629 U+02E0..02E4, U+02EC, U+02EE ...)
6630 \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha})
6631 (127_256)
6632 \p{XPosixBlank} \h, Horizontal white space (Short:
6633 \p{Blank}) (18: [\t\x20\xa0], U+1680,
6634 U+2000..200A, U+202F, U+205F, U+3000)
6635 \p{XPosixCntrl} \p{General_Category=Control} Control
6636 characters (Short: \p{Cc}) (65)
6637 \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9]
6638 + all other decimal digits (Short:
6639 \p{Nd}) (630)
6640 \p{XPosixGraph} Characters that are graphical (Short:
6641 \p{Graph}) (275_378:
6642 [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6643 Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
6644 U+0100..0377, U+037A..037F,
6645 U+0384..038A, U+038C, U+038E..03A1 ...)
6646 \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i=
6647 Cased=Yes) (2340)
6648 \p{XPosixPrint} Characters that are graphical plus space
6649 characters (but no controls) (Short:
6650 \p{Print}) (275_395: [\x20-\x7e\xa0-
6651 \xff], U+0100..0377, U+037A..037F,
6652 U+0384..038A, U+038C, U+038E..03A1 ...)
6653 \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (801:
6654 [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6655 >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
6656 \xb7\xbb\xbf], U+037E, U+0387,
6657 U+055A..055F, U+0589..058A, U+05BE ...)
6658 \p{XPosixSpace} \s including beyond ASCII and vertical tab
6659 (Short: \p{SpacePerl}) (25:
6660 [\t\n\cK\f\r\x20\x85\xa0], U+1680,
6661 U+2000..200A, U+2028..2029, U+202F,
6662 U+205F ...)
6663 \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i=
6664 Cased=Yes) (1908)
6665 \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} +
6666 \pM + \p{Pc} + \p{Join_Control} (Short:
6667 \p{Word}) (128_919: [0-9A-Z_a-
6668 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6669 \xff], U+0100..02C1, U+02C6..02D1,
6670 U+02E0..02E4, U+02EC, U+02EE ...)
6671 \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
6672 \p{Xsux} \p{Cuneiform} (= \p{Script_Extensions=
6673 Cuneiform}) (NOT \p{Block=Cuneiform})
6674 (1234)
6675 \p{Yi} \p{Script_Extensions=Yi} (1246)
6676 X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
6677 X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
6678 \p{Yiii} \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
6679 X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
6680 Yijing_Hexagram_Symbols}) (64)
6681 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
6682 (Short: \p{InYijing}) (64)
6683 \p{Z} \pZ \p{Separator} (= \p{General_Category=
6684 Separator}) (19)
6685 \p{Zanabazar_Square} \p{Script_Extensions=Zanabazar_Square}
6686 (Short: \p{Zanb}; NOT \p{Block=
6687 Zanabazar_Square}) (72)
6688 \p{Zanb} \p{Zanabazar_Square} (=
6689 \p{Script_Extensions=Zanabazar_Square})
6690 (NOT \p{Block=Zanabazar_Square}) (72)
6691 \p{Zinh} \p{Inherited} (= \p{Script_Extensions=
6692 Inherited}) (502)
6693 \p{Zl} \p{Line_Separator} (= \p{General_Category=
6694 Line_Separator}) (1)
6695 \p{Zp} \p{Paragraph_Separator} (=
6696 \p{General_Category=
6697 Paragraph_Separator}) (1)
6698 \p{Zs} \p{Space_Separator} (=
6699 \p{General_Category=Space_Separator})
6700 (17)
6701 \p{Zyyy} \p{Common} (= \p{Script_Extensions=
6702 Common}) (7386)
6703 \p{Zzzz} \p{Unknown} (= \p{Script_Extensions=
6704 Unknown}) (976_118 plus all above-
6705 Unicode code points)
6706
6707 Legal "\p{}" and "\P{}" constructs that match no characters
6708 Unicode has some property-value pairs that currently don't match
6709 anything. This happens generally either because they are obsolete, or
6710 they exist for symmetry with other forms, but no language has yet been
6711 encoded that uses them. In this version of Unicode, the following
6712 match zero code points:
6713
6714 \p{Canonical_Combining_Class=Attached_Below_Left}
6715 \p{Canonical_Combining_Class=CCC133}
6716 \p{Grapheme_Cluster_Break=E_Base}
6717 \p{Grapheme_Cluster_Break=E_Base_GAZ}
6718 \p{Grapheme_Cluster_Break=E_Modifier}
6719 \p{Grapheme_Cluster_Break=Glue_After_Zwj}
6720 \p{Word_Break=E_Base}
6721 \p{Word_Break=E_Base_GAZ}
6722 \p{Word_Break=E_Modifier}
6723 \p{Word_Break=Glue_After_Zwj}
6724
6726 The value of any Unicode (not including Perl extensions) character
6727 property mentioned above for any single code point is available through
6728 "charprop()" in Unicode::UCD. "charprops_all()" in Unicode::UCD
6729 returns the values of all the Unicode properties for a given code
6730 point.
6731
6732 Besides these, all the Unicode character properties mentioned above
6733 (except for those marked as for internal use by Perl) are also
6734 accessible by "prop_invlist()" in Unicode::UCD.
6735
6736 Due to their nature, not all Unicode character properties are suitable
6737 for regular expression matches, nor "prop_invlist()". The remaining
6738 non-provisional, non-internal ones are accessible via "prop_invmap()"
6739 in Unicode::UCD (except for those that this Perl installation hasn't
6740 included; see below for which those are).
6741
6742 For compatibility with other parts of Perl, all the single forms given
6743 in the table in the section above are recognized. BUT, there are some
6744 ambiguities between some Perl extensions and the Unicode properties,
6745 all of which are silently resolved in favor of the official Unicode
6746 property. To avoid surprises, you should only use "prop_invmap()" for
6747 forms listed in the table below, which omits the non-recommended ones.
6748 The affected forms are the Perl single form equivalents of Unicode
6749 properties, such as "\p{sc}" being a single-form equivalent of
6750 "\p{gc=sc}", which is treated by "prop_invmap()" as the "Script"
6751 property, whose short name is "sc". The table indicates the current
6752 ambiguities in the INFO column, beginning with the word "NOT".
6753
6754 The standard Unicode properties listed below are documented in
6755 <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
6756 documented in "prop_invmap()" in Unicode::UCD. The other Perl
6757 extensions are in "Other Properties" in perlunicode;
6758
6759 The first column in the table is a name for the property; the second
6760 column is an alternative name, if any, plus possibly some annotations.
6761 The alternative name is the property's full name, unless that would
6762 simply repeat the first column, in which case the second column
6763 indicates the property's short name (if different). The annotations
6764 are given only in the entry for the full name. The annotations for
6765 binary properties include a list of the first few ranges that the
6766 property matches. To avoid any ambiguity, the SPACE character is
6767 represented as "\x20".
6768
6769 If a property is obsolete, etc, the entry will be flagged with the same
6770 characters used in the table in the section above, like D or S.
6771
6772 NAME INFO
6773
6774 Age
6775 AHex ASCII_Hex_Digit
6776 All (Perl extension). All code points,
6777 including those above Unicode. Same as
6778 qr/./s. U+0000..infinity
6779 Alnum XPosixAlnum. (Perl extension)
6780 Alpha Alphabetic
6781 Alphabetic (Short: Alpha). [A-Za-z\xaa\xb5\xba\xc0-
6782 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6783 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6784 ...
6785 Any (Perl extension). All Unicode code
6786 points. U+0000..10FFFF
6787 ASCII Block=Basic_Latin. (Perl extension).
6788 [\x00-\x7f]
6789 ASCII_Hex_Digit (Short: AHex). [0-9A-Fa-f]
6790 Assigned (Perl extension). All assigned code
6791 points. U+0000..0377, U+037A..037F,
6792 U+0384..038A, U+038C, U+038E..03A1,
6793 U+03A3..052F ...
6794 Bc Bidi_Class
6795 Bidi_C Bidi_Control
6796 Bidi_Class (Short: bc)
6797 Bidi_Control (Short: Bidi_C). U+061C, U+200E..200F,
6798 U+202A..202E, U+2066..2069
6799 Bidi_M Bidi_Mirrored
6800 Bidi_Mirrored (Short: Bidi_M).
6801 [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
6802 U+169B..169C, U+2039..203A, U+2045..2046,
6803 U+207D..207E ...
6804 Bidi_Mirroring_Glyph (Short: bmg)
6805 Bidi_Paired_Bracket (Short: bpb)
6806 Bidi_Paired_Bracket_Type (Short: bpt)
6807 Blank XPosixBlank. (Perl extension)
6808 Blk Block
6809 Block (Short: blk)
6810 Bmg Bidi_Mirroring_Glyph
6811 Bpb Bidi_Paired_Bracket
6812 Bpt Bidi_Paired_Bracket_Type
6813 Canonical_Combining_Class (Short: ccc)
6814 Case_Folding (Short: cf)
6815 Case_Ignorable (Short: CI).
6816 [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
6817 U+02B0..036F, U+0374..0375, U+037A,
6818 U+0384..0385, U+0387 ...
6819 Cased [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6820 \xff], U+0100..01BA, U+01BC..01BF,
6821 U+01C4..0293, U+0295..02B8, U+02C0..02C1
6822 ...
6823 Category General_Category
6824 Ccc Canonical_Combining_Class
6825 CE Composition_Exclusion
6826 Cf Case_Folding; NOT 'cf' meaning
6827 'General_Category=Format'
6828 Changes_When_Casefolded (Short: CWCF). [A-Z\xb5\xc0-\xd6\xd8-
6829 \xdf], U+0100, U+0102, U+0104, U+0106,
6830 U+0108 ...
6831 Changes_When_Casemapped (Short: CWCM). [A-Za-z\xb5\xc0-\xd6\xd8-
6832 \xf6\xf8-\xff], U+0100..0137,
6833 U+0139..018C, U+018E..019A, U+019C..01A9,
6834 U+01AC..01B9 ...
6835 Changes_When_Lowercased (Short: CWL). [A-Z\xc0-\xd6\xd8-\xde],
6836 U+0100, U+0102, U+0104, U+0106, U+0108 ...
6837 Changes_When_NFKC_Casefolded (Short: CWKCF). [A-
6838 Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
6839 \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf], U+0100,
6840 U+0102, U+0104, U+0106, U+0108 ...
6841 Changes_When_Titlecased (Short: CWT). [a-z\xb5\xdf-\xf6\xf8-
6842 \xff], U+0101, U+0103, U+0105, U+0107,
6843 U+0109 ...
6844 Changes_When_Uppercased (Short: CWU). [a-z\xb5\xdf-\xf6\xf8-
6845 \xff], U+0101, U+0103, U+0105, U+0107,
6846 U+0109 ...
6847 CI Case_Ignorable
6848 Cntrl XPosixCntrl (=General_Category=Control).
6849 (Perl extension)
6850 Comp_Ex Full_Composition_Exclusion
6851 Composition_Exclusion (Short: CE). U+0958..095F, U+09DC..09DD,
6852 U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
6853 CWCF Changes_When_Casefolded
6854 CWCM Changes_When_Casemapped
6855 CWKCF Changes_When_NFKC_Casefolded
6856 CWL Changes_When_Lowercased
6857 CWT Changes_When_Titlecased
6858 CWU Changes_When_Uppercased
6859 Dash [\-], U+058A, U+05BE, U+1400, U+1806,
6860 U+2010..2015 ...
6861 Decomposition_Mapping (Short: dm)
6862 Decomposition_Type (Short: dt)
6863 Default_Ignorable_Code_Point (Short: DI). [\xad], U+034F, U+061C,
6864 U+115F..1160, U+17B4..17B5, U+180B..180E
6865 ...
6866 Dep Deprecated
6867 Deprecated (Short: Dep). U+0149, U+0673, U+0F77,
6868 U+0F79, U+17A3..17A4, U+206A..206F ...
6869 DI Default_Ignorable_Code_Point
6870 Dia Diacritic
6871 Diacritic (Short: Dia). [\^`\xa8\xaf\xb4\xb7-\xb8],
6872 U+02B0..034E, U+0350..0357, U+035D..0362,
6873 U+0374..0375, U+037A ...
6874 Digit XPosixDigit (=General_Category=
6875 Decimal_Number). (Perl extension)
6876 Dm Decomposition_Mapping
6877 Dt Decomposition_Type
6878 Ea East_Asian_Width
6879 East_Asian_Width (Short: ea)
6880 EqUIdeo Equivalent_Unified_Ideograph
6881 Equivalent_Unified_Ideograph (Short: EqUIdeo)
6882 Ext Extender
6883 Extender (Short: Ext). [\xb7], U+02D0..02D1,
6884 U+0640, U+07FA, U+0E46, U+0EC6 ...
6885 Full_Composition_Exclusion (Short: Comp_Ex). U+0340..0341,
6886 U+0343..0344, U+0374, U+037E, U+0387,
6887 U+0958..095F ...
6888 Gc General_Category
6889 GCB Grapheme_Cluster_Break
6890 General_Category (Short: gc)
6891 Gr_Base Grapheme_Base
6892 Gr_Ext Grapheme_Extend
6893 Graph XPosixGraph. (Perl extension)
6894 Grapheme_Base (Short: Gr_Base). [\x20-\x7e\xa0-
6895 \xac\xae-\xff], U+0100..02FF,
6896 U+0370..0377, U+037A..037F, U+0384..038A,
6897 U+038C ...
6898 Grapheme_Cluster_Break (Short: GCB)
6899 Grapheme_Extend (Short: Gr_Ext). U+0300..036F,
6900 U+0483..0489, U+0591..05BD, U+05BF,
6901 U+05C1..05C2, U+05C4..05C5 ...
6902 Hangul_Syllable_Type (Short: hst)
6903 Hex Hex_Digit
6904 Hex_Digit (Short: Hex). [0-9A-Fa-f], U+FF10..FF19,
6905 U+FF21..FF26, U+FF41..FF46
6906 HorizSpace XPosixBlank. (Perl extension)
6907 Hst Hangul_Syllable_Type
6908 D Hyphen [\-\xad], U+058A, U+1806, U+2010..2011,
6909 U+2E17, U+30FB ... Supplanted by
6910 Line_Break property values; see
6911 www.unicode.org/reports/tr14
6912 ID_Continue (Short: IDC). [0-9A-Z_a-
6913 z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
6914 \xff], U+0100..02C1, U+02C6..02D1,
6915 U+02E0..02E4, U+02EC, U+02EE ...
6916 ID_Start (Short: IDS). [A-Za-z\xaa\xb5\xba\xc0-
6917 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6918 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6919 ...
6920 IDC ID_Continue
6921 Ideo Ideographic
6922 Ideographic (Short: Ideo). U+3006..3007,
6923 U+3021..3029, U+3038..303A, U+3400..4DB5,
6924 U+4E00..9FEF, U+F900..FA6D ...
6925 IDS ID_Start
6926 IDS_Binary_Operator (Short: IDSB). U+2FF0..2FF1, U+2FF4..2FFB
6927 IDS_Trinary_Operator (Short: IDST). U+2FF2..2FF3
6928 IDSB IDS_Binary_Operator
6929 IDST IDS_Trinary_Operator
6930 In Present_In. (Perl extension)
6931 Indic_Positional_Category (Short: InPC)
6932 Indic_Syllabic_Category (Short: InSC)
6933 InPC Indic_Positional_Category
6934 InSC Indic_Syllabic_Category
6935 Isc ISO_Comment; NOT 'isc' meaning
6936 'General_Category=Other'
6937 ISO_Comment (Short: isc)
6938 Jg Joining_Group
6939 Join_C Join_Control
6940 Join_Control (Short: Join_C). U+200C..200D
6941 Joining_Group (Short: jg)
6942 Joining_Type (Short: jt)
6943 Jt Joining_Type
6944 Lb Line_Break
6945 Lc Lowercase_Mapping; NOT 'lc' meaning
6946 'General_Category=Cased_Letter'
6947 Line_Break (Short: lb)
6948 LOE Logical_Order_Exception
6949 Logical_Order_Exception (Short: LOE). U+0E40..0E44, U+0EC0..0EC4,
6950 U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
6951 ...
6952 Lower Lowercase
6953 Lowercase (Short: Lower). [a-z\xaa\xb5\xba\xdf-
6954 \xf6\xf8-\xff], U+0101, U+0103, U+0105,
6955 U+0107, U+0109 ...
6956 Lowercase_Mapping (Short: lc)
6957 Math [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
6958 U+03D5, U+03F0..03F1, U+03F4..03F6,
6959 U+0606..0608 ...
6960 Na Name
6961 Na1 Unicode_1_Name
6962 Name (Short: na)
6963 Name_Alias
6964 NChar Noncharacter_Code_Point
6965 NFC_QC NFC_Quick_Check
6966 NFC_Quick_Check (Short: NFC_QC)
6967 NFD_QC NFD_Quick_Check
6968 NFD_Quick_Check (Short: NFD_QC)
6969 NFKC_Casefold (Short: NFKC_CF)
6970 NFKC_CF NFKC_Casefold
6971 NFKC_QC NFKC_Quick_Check
6972 NFKC_Quick_Check (Short: NFKC_QC)
6973 NFKD_QC NFKD_Quick_Check
6974 NFKD_Quick_Check (Short: NFKD_QC)
6975 Noncharacter_Code_Point (Short: NChar). U+FDD0..FDEF,
6976 U+FFFE..FFFF, U+1FFFE..1FFFF,
6977 U+2FFFE..2FFFF, U+3FFFE..3FFFF,
6978 U+4FFFE..4FFFF ...
6979 Nt Numeric_Type
6980 Numeric_Type (Short: nt)
6981 Numeric_Value (Short: nv)
6982 Nv Numeric_Value
6983 Pat_Syn Pattern_Syntax
6984 Pat_WS Pattern_White_Space
6985 Pattern_Syntax (Short: Pat_Syn).
6986 [!\"#\$\%&\'\(\)*+,\-.\/:;<=
6987 >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
6988 \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
6989 U+2010..2027, U+2030..203E, U+2041..2053,
6990 U+2055..205E, U+2190..245F ...
6991 Pattern_White_Space (Short: Pat_WS). [\t\n\cK\f\r\x20\x85],
6992 U+200E..200F, U+2028..2029
6993 PCM Prepended_Concatenation_Mark
6994 Perl_Decimal_Digit (Perl extension)
6995 PerlSpace PosixSpace. (Perl extension)
6996 PerlWord PosixWord. (Perl extension)
6997 PosixAlnum (Perl extension). [0-9A-Za-z]
6998 PosixAlpha (Perl extension). [A-Za-z]
6999 PosixBlank (Perl extension). [\t\x20]
7000 PosixCntrl (Perl extension). ASCII control
7001 characters. ACK, BEL, BS, CAN, CR, DC1,
7002 DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
7003 ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
7004 NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
7005 PosixDigit (Perl extension). [0-9]
7006 PosixGraph (Perl extension).
7007 [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
7008 Z\[\\\]\^_`a-z\{\|\}~]
7009 PosixLower (Perl extension). [a-z]
7010 PosixPrint (Perl extension). [\x20-\x7e]
7011 PosixPunct (Perl extension).
7012 [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7013 >?\@\[\\\]\^_`\{\|\}~]
7014 PosixSpace (Perl extension). [\t\n\cK\f\r\x20]
7015 PosixUpper (Perl extension). [A-Z]
7016 PosixWord (Perl extension). \w, restricted to
7017 ASCII. [0-9A-Z_a-z]
7018 PosixXDigit ASCII_Hex_Digit. (Perl extension).
7019 [0-9A-Fa-f]
7020 Prepended_Concatenation_Mark (Short: PCM). U+0600..0605, U+06DD,
7021 U+070F, U+08E2, U+110BD, U+110CD
7022 Present_In (Short: In). (Perl extension)
7023 Print XPosixPrint. (Perl extension)
7024 Punct General_Category=Punctuation. (Perl
7025 extension).
7026 [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_-
7027 \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
7028 U+037E, U+0387, U+055A..055F,
7029 U+0589..058A, U+05BE ...
7030 QMark Quotation_Mark
7031 Quotation_Mark (Short: QMark). [\"\'\xab\xbb],
7032 U+2018..201F, U+2039..203A, U+2E42,
7033 U+300C..300F, U+301D..301F ...
7034 Radical U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
7035 Regional_Indicator (Short: RI). U+1F1E6..1F1FF
7036 RI Regional_Indicator
7037 SB Sentence_Break
7038 Sc Script; NOT 'sc' meaning
7039 'General_Category=Currency_Symbol'
7040 Scf Simple_Case_Folding
7041 Script (Short: sc)
7042 Script_Extensions (Short: scx)
7043 Scx Script_Extensions
7044 SD Soft_Dotted
7045 Sentence_Break (Short: SB)
7046 Sentence_Terminal (Short: STerm). [!.?], U+0589,
7047 U+061E..061F, U+06D4, U+0700..0702, U+07F9
7048 ...
7049 Sfc Simple_Case_Folding
7050 Simple_Case_Folding (Short: scf)
7051 Simple_Lowercase_Mapping (Short: slc)
7052 Simple_Titlecase_Mapping (Short: stc)
7053 Simple_Uppercase_Mapping (Short: suc)
7054 Slc Simple_Lowercase_Mapping
7055 Soft_Dotted (Short: SD). [i-j], U+012F, U+0249,
7056 U+0268, U+029D, U+02B2 ...
7057 Space White_Space
7058 SpacePerl XPosixSpace. (Perl extension)
7059 Stc Simple_Titlecase_Mapping
7060 STerm Sentence_Terminal
7061 Suc Simple_Uppercase_Mapping
7062 Tc Titlecase_Mapping
7063 Term Terminal_Punctuation
7064 Terminal_Punctuation (Short: Term). [!,.:;?], U+037E, U+0387,
7065 U+0589, U+05C3, U+060C ...
7066 Title Titlecase. (Perl extension)
7067 Titlecase (Short: Title). (Perl extension). (=
7068 \p{Gc=Lt}). U+01C5, U+01C8, U+01CB,
7069 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
7070 Titlecase_Mapping (Short: tc)
7071 Uc Uppercase_Mapping
7072 UIdeo Unified_Ideograph
7073 Unicode Any. (Perl extension)
7074 Unicode_1_Name (Short: na1)
7075 Unified_Ideograph (Short: UIdeo). U+3400..4DB5,
7076 U+4E00..9FEF, U+FA0E..FA0F, U+FA11,
7077 U+FA13..FA14, U+FA1F ...
7078 Upper Uppercase
7079 Uppercase (Short: Upper). [A-Z\xc0-\xd6\xd8-\xde],
7080 U+0100, U+0102, U+0104, U+0106, U+0108 ...
7081 Uppercase_Mapping (Short: uc)
7082 Variation_Selector (Short: VS). U+180B..180D, U+FE00..FE0F,
7083 U+E0100..E01EF
7084 Vertical_Orientation (Short: vo)
7085 VertSpace (Perl extension). \v. [\n\cK\f\r\x85],
7086 U+2028..2029
7087 Vo Vertical_Orientation
7088 VS Variation_Selector
7089 WB Word_Break
7090 White_Space (Short: WSpace).
7091 [\t\n\cK\f\r\x20\x85\xa0], U+1680,
7092 U+2000..200A, U+2028..2029, U+202F, U+205F
7093 ...
7094 Word XPosixWord. (Perl extension)
7095 Word_Break (Short: WB)
7096 WSpace White_Space
7097 XDigit XPosixXDigit (=Hex_Digit). (Perl
7098 extension)
7099 XID_Continue (Short: XIDC). [0-9A-Z_a-
7100 z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
7101 \xff], U+0100..02C1, U+02C6..02D1,
7102 U+02E0..02E4, U+02EC, U+02EE ...
7103 XID_Start (Short: XIDS). [A-Za-z\xaa\xb5\xba\xc0-
7104 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7105 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7106 ...
7107 XIDC XID_Continue
7108 XIDS XID_Start
7109 XPerlSpace XPosixSpace. (Perl extension)
7110 XPosixAlnum (Short: Alnum). (Perl extension).
7111 Alphabetic and (decimal) Numeric. [0-9A-
7112 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7113 \xff], U+0100..02C1, U+02C6..02D1,
7114 U+02E0..02E4, U+02EC, U+02EE ...
7115 XPosixAlpha Alphabetic. (Perl extension). [A-Za-
7116 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7117 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7118 U+02EC, U+02EE ...
7119 XPosixBlank (Short: Blank). (Perl extension). \h,
7120 Horizontal white space. [\t\x20\xa0],
7121 U+1680, U+2000..200A, U+202F, U+205F,
7122 U+3000
7123 XPosixCntrl General_Category=Control (Short: Cntrl).
7124 (Perl extension). Control characters.
7125 [\x00-\x1f\x7f-\x9f]
7126 XPosixDigit General_Category=Decimal_Number (Short:
7127 Digit). (Perl extension). [0-9] + all
7128 other decimal digits. [0-9],
7129 U+0660..0669, U+06F0..06F9, U+07C0..07C9,
7130 U+0966..096F, U+09E6..09EF ...
7131 XPosixGraph (Short: Graph). (Perl extension).
7132 Characters that are graphical.
7133 [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
7134 Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
7135 U+0100..0377, U+037A..037F, U+0384..038A,
7136 U+038C, U+038E..03A1 ...
7137 XPosixLower Lowercase. (Perl extension). [a-
7138 z\xaa\xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
7139 U+0103, U+0105, U+0107, U+0109 ...
7140 XPosixPrint (Short: Print). (Perl extension).
7141 Characters that are graphical plus space
7142 characters (but no controls). [\x20-
7143 \x7e\xa0-\xff], U+0100..0377,
7144 U+037A..037F, U+0384..038A, U+038C,
7145 U+038E..03A1 ...
7146 XPosixPunct (Perl extension). \p{Punct} + ASCII-range
7147 \p{Symbol}. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7148 >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
7149 \xb7\xbb\xbf], U+037E, U+0387,
7150 U+055A..055F, U+0589..058A, U+05BE ...
7151 XPosixSpace (Perl extension). \s including beyond
7152 ASCII and vertical tab.
7153 [\t\n\cK\f\r\x20\x85\xa0], U+1680,
7154 U+2000..200A, U+2028..2029, U+202F, U+205F
7155 ...
7156 XPosixUpper Uppercase. (Perl extension). [A-Z\xc0-
7157 \xd6\xd8-\xde], U+0100, U+0102, U+0104,
7158 U+0106, U+0108 ...
7159 XPosixWord (Short: Word). (Perl extension). \w,
7160 including beyond ASCII; = \p{Alnum} + \pM
7161 + \p{Pc} + \p{Join_Control}. [0-9A-Z_a-
7162 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7163 U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7164 U+02EC, U+02EE ...
7165 XPosixXDigit Hex_Digit (Short: XDigit). (Perl
7166 extension). [0-9A-Fa-f], U+FF10..FF19,
7167 U+FF21..FF26, U+FF41..FF46
7168
7170 Certain properties are accessible also via core function calls. These
7171 are:
7172
7173 Lowercase_Mapping lc() and lcfirst()
7174 Titlecase_Mapping ucfirst()
7175 Uppercase_Mapping uc()
7176
7177 Also, Case_Folding is accessible through the "/i" modifier in regular
7178 expressions, the "\F" transliteration escape, and the "fc" operator.
7179
7180 And, the Name and Name_Aliases properties are accessible through the
7181 "\N{}" interpolation in double-quoted strings and regular expressions;
7182 and functions "charnames::viacode()", "charnames::vianame()", and
7183 "charnames::string_vianame()" (which require a "use charnames ();" to
7184 be specified.
7185
7186 Finally, most properties related to decomposition are accessible via
7187 Unicode::Normalize.
7188
7190 Perl will generate an error for a few character properties in Unicode
7191 when used in a regular expression. The non-Unihan ones are listed
7192 below, with the reasons they are not accepted, perhaps with work-
7193 arounds. The short names for the properties are listed enclosed in
7194 (parentheses). As described after the list, an installation can change
7195 the defaults and choose to accept any of these. The list is machine
7196 generated based on the choices made for the installation that generated
7197 this document.
7198
7199 Expands_On_NFC (XO_NFC)
7200 Expands_On_NFD (XO_NFD)
7201 Expands_On_NFKC (XO_NFKC)
7202 Expands_On_NFKD (XO_NFKD)
7203 Deprecated by Unicode. These are characters that expand to more
7204 than one character in the specified normalization form, but whether
7205 they actually take up more bytes or not depends on the encoding
7206 being used. For example, a UTF-8 encoded character may expand to a
7207 different number of bytes than a UTF-32 encoded character.
7208
7209 Extended_Pictographic (XPG)
7210 Not part of the Unicode Character Database
7211
7212 Grapheme_Link (Gr_Link)
7213 Duplicates ccc=vr (Canonical_Combining_Class=Virama)
7214
7215 Jamo_Short_Name (JSN)
7216 Other_Alphabetic (OAlpha)
7217 Other_Default_Ignorable_Code_Point (ODI)
7218 Other_Grapheme_Extend (OGr_Ext)
7219 Other_ID_Continue (OIDC)
7220 Other_ID_Start (OIDS)
7221 Other_Lowercase (OLower)
7222 Other_Math (OMath)
7223 Other_Uppercase (OUpper)
7224 Used by Unicode internally for generating other properties and not
7225 intended to be used stand-alone
7226
7227 Script=Katakana_Or_Hiragana (sc=Hrkt)
7228 Obsolete. All code points previously matched by this have been
7229 moved to "Script=Common". Consider instead using
7230 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
7231 both)
7232
7233 Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
7234 All code points that would be matched by this are matched by either
7235 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
7236
7237 An installation can choose to allow any of these to be matched by
7238 downloading the Unicode database from <http://www.unicode.org/Public/>
7239 to $Config{privlib}/unicore/ in the Perl source tree, changing the
7240 controlling lists contained in the program
7241 $Config{privlib}/unicore/mktables and then re-compiling and installing.
7242 (%Config is available from the Config module).
7243
7244 Also, perl can be recompiled to operate on an earlier version of the
7245 Unicode standard. Further information is at
7246 $Config{privlib}/unicore/README.perl.
7247
7249 The Unicode data base is delivered in two different formats. The XML
7250 version is valid for more modern Unicode releases. The other version
7251 is a collection of files. The two are intended to give equivalent
7252 information. Perl uses the older form; this allows you to recompile
7253 Perl to use early Unicode releases.
7254
7255 The only non-character property that Perl currently supports is Named
7256 Sequences, in which a sequence of code points is given a name and
7257 generally treated as a single entity. (Perl supports these via the
7258 "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
7259 in charnames, and "namedseq()" in Unicode::UCD.
7260
7261 Below is a list of the files in the Unicode data base that Perl doesn't
7262 currently use, along with very brief descriptions of their purposes.
7263 Some of the names of the files have been shortened from those that
7264 Unicode uses, in order to allow them to be distinguishable from
7265 similarly named files on file systems for which only the first 8
7266 characters of a name are significant.
7267
7268 auxiliary/GraphemeBreakTest.html
7269 auxiliary/LineBreakTest.html
7270 auxiliary/SentenceBreakTest.html
7271 auxiliary/WordBreakTest.html
7272 Documentation of validation Tests
7273
7274 BidiCharacterTest.txt
7275 BidiTest.txt
7276 NormTest.txt
7277 Validation Tests
7278
7279 CJKRadicals.txt
7280 Maps the kRSUnicode property values to corresponding code points
7281
7282 EmojiSources.txt
7283 Maps certain Unicode code points to their legacy Japanese cell-
7284 phone values
7285
7286 extracted/DName.txt
7287 This file adds no new information not already present in other
7288 files
7289
7290 Index.txt
7291 Alphabetical index of Unicode characters
7292
7293 NamedSqProv.txt
7294 Named sequences proposed for inclusion in a later version of the
7295 Unicode Standard; if you need them now, you can append this file to
7296 NamedSequences.txt and recompile perl
7297
7298 NamesList.html
7299 Describes the format and contents of NamesList.txt
7300
7301 NamesList.txt
7302 Annotated list of characters
7303
7304 NormalizationCorrections.txt
7305 Documentation of corrections already incorporated into the Unicode
7306 data base
7307
7308 NushuSources.txt
7309 Specifies source material for Nushu characters
7310
7311 ReadMe.txt
7312 Documentation
7313
7314 StandardizedVariants.html
7315 Obsoleted as of Unicode 9.0, but previously provided a visual
7316 display of the standard variant sequences derived from
7317 StandardizedVariants.txt.
7318
7319 StandardizedVariants.txt
7320 Certain glyph variations for character display are standardized.
7321 This lists the non-Unihan ones; the Unihan ones are also not used
7322 by Perl, and are in a separate Unicode data base
7323 <http://www.unicode.org/ivd>
7324
7325 TangutSources.txt
7326 Specifies source mappings for Tangut ideographs and components.
7327 This data file also includes informative radical-stroke values that
7328 are used internally by Unicode
7329
7330 USourceData.txt
7331 Documentation of status and cross reference of proposals for
7332 encoding by Unicode of Unihan characters
7333
7334 USourceGlyphs.pdf
7335 Pictures of the characters in USourceData.txt
7336
7338 <http://www.unicode.org/reports/tr44/>
7339
7340 perlrecharclass
7341
7342 perlunicode
7343
7344
7345
7346perl v5.30.1 2019-11-29 PERLUNIPROPS(1)