1PERLUNIPROPS(1) Perl Programmers Reference Guide PERLUNIPROPS(1)
2
3
4
6 perluniprops - Index of Unicode Version 9.0.0 character properties in
7 Perl
8
10 This document provides information about the portion of the Unicode
11 database that deals with character properties, that is, the portion that
12 is defined on single code points. ("Other information in the Unicode
13 data base" below briefly mentions other data that Unicode provides.)
14
15 Perl can provide access to all non-provisional Unicode character
16 properties, though not all are enabled by default. The omitted ones
17 are the Unihan properties (accessible via the CPAN module
18 Unicode::Unihan) and certain deprecated or Unicode-internal properties.
19 (An installation may choose to recompile Perl's tables to change this.
20 See "Unicode character properties that are NOT accepted by Perl".)
21
22 For most purposes, access to Unicode properties from the Perl core is
23 through regular expression matches, as described in the next section.
24 For some special purposes, and to access the properties that are not
25 suitable for regular expression matching, all the Unicode character
26 properties that Perl handles are accessible via the standard
27 Unicode::UCD module, as described in the section "Properties accessible
28 through Unicode::UCD".
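
As a minimal sketch of the Unicode::UCD route, the following looks up a few properties of a single code point. It assumes only the long-standing "charinfo", "charscript", and "charblock" functions; the variable names are illustrative, and the exact spelling of the block name can vary between Perl versions.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo charscript charblock);

# Look up several properties of U+03B1 GREEK SMALL LETTER ALPHA.
my $info   = charinfo(0x03B1);     # hash ref of basic properties
my $name   = $info->{name};        # character name
my $gc     = $info->{category};    # short General_Category value
my $script = charscript(0x03B1);   # Script property value
my $block  = charblock(0x03B1);    # Block property value

print "$name: gc=$gc script=$script block=$block\n";
```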
29
30 Perl also provides some additional extensions and short-cut synonyms
31 for Unicode properties.
32
33 This document merely lists all available properties and does not
34 attempt to explain what each property really means. There is a brief
35 description of each Perl extension; see "Other Properties" in
36 perlunicode for more information on these. There is some detail about
37 Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
38 to find out about the intricacies of the official Unicode properties,
39 refer to the Unicode standard. A good starting place is
40 <http://www.unicode.org/reports/tr44/>.
41
42 Note that you can define your own properties; see "User-Defined
43 Character Properties" in perlunicode.
44
46 The Perl regular expression "\p{}" and "\P{}" constructs give access to
47 most of the Unicode character properties. The table below shows all
48 these constructs, both single and compound forms.
49
50 Compound forms consist of two components, separated by an equals sign
51 or a colon. The first component is the property name, and the second
52 component is the particular value of the property to match against, for
53 example, "\p{Script: Greek}" and "\p{Script=Greek}" both mean to match
54 characters whose Script property value is Greek.
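
The two compound spellings can be checked side by side. This is only an illustrative sketch; the variable names are not part of any API.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";   # GREEK SMALL LETTER ALPHA
my $latin = "a";         # LATIN SMALL LETTER A

# Both separators name the same property/value pair.
my $with_equals = $alpha =~ /\p{Script=Greek}/  ? 1 : 0;
my $with_colon  = $alpha =~ /\p{Script: Greek}/ ? 1 : 0;

# A Latin letter does not have the Script value Greek.
my $latin_hit   = $latin =~ /\p{Script=Greek}/  ? 1 : 0;

printf "equals=%d colon=%d latin=%d\n",
    $with_equals, $with_colon, $latin_hit;
```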
55
56 Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
57 their equivalent compound forms. The table shows these equivalences.
58 (In our example, "\p{Greek}" is just a shortcut for
59 "\p{Script=Greek}".) There are also a few Perl-defined single forms
60 that are not shortcuts for a compound form. One such is "\p{Word}".
61 These are also listed in the table.
62
63 In parsing these constructs, Perl always ignores upper/lower case
64 differences everywhere within the {braces}. Thus "\p{Greek}" means the
65 same thing as "\p{greek}". But note that changing the case of the "p"
66 or "P" before the left brace completely changes the meaning of the
67 construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
68 Casing in this document is for improved legibility.
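
A short sketch of the distinction: case inside the braces does not matter, but the case of the "p" itself does. The variable names are illustrative only.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";   # GREEK SMALL LETTER ALPHA

# Case inside the braces is ignored, so these three tests agree.
my $mixed = $alpha =~ /\p{Greek}/ ? 1 : 0;
my $lower = $alpha =~ /\p{greek}/ ? 1 : 0;
my $upper = $alpha =~ /\p{GREEK}/ ? 1 : 0;

# An upper case "P" before the brace inverts the test.
my $negated_greek = $alpha =~ /\P{Greek}/ ? 1 : 0;   # alpha IS Greek
my $negated_latin = "a"    =~ /\P{Greek}/ ? 1 : 0;   # "a" is not Greek

print "mixed=$mixed lower=$lower upper=$upper ",
      "negated_greek=$negated_greek negated_latin=$negated_latin\n";
```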
69
70 Also, white space, hyphens, and underscores are normally ignored
71 everywhere between the {braces}, and hence can be freely added or
72 removed even if the "/x" modifier hasn't been specified on the regular
73 expression. But in the table below a 'T' at the beginning of an entry
74 means that tighter (stricter) rules are used for that entry:
75
76 Single form ("\p{name}") tighter rules:
77 White space, hyphens, and underscores ARE significant except
78 for:
79
80 · white space adjacent to a non-word character
81
82 · underscores separating digits in numbers
83
84 That means, for example, that you can freely add or remove
85 white space adjacent to (but within) the braces without
86 affecting the meaning.
87
88 Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
89 The tighter rules given above for the single form apply to
90 everything to the right of the colon or equals; the looser
91 rules still apply to everything to the left.
92
93 That means, for example, that you can freely add or remove
94 white space adjacent to (but within) the braces and the colon
95 or equal sign.
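
The loose and tight rules can be explored with a small probe. The sketch below assumes a hypothetical helper, "try_prop", which builds the pattern at run time so that a rejected spelling is reported as undef instead of aborting the program. Script is matched loosely, while Age is one of the entries flagged 'T' in the table. U+20BA is used on the assumption that it is the single code point added in Unicode 6.2.

```perl
use strict;
use warnings;

# Hypothetical helper: returns 1 or 0 for match or no match, and undef
# if Perl rejects the property specification.
sub try_prop {
    my ($char, $spec) = @_;
    my $pattern = '\p{' . $spec . '}';
    my $result  = eval { $char =~ /$pattern/ ? 1 : 0 };
    return $result;
}

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA
my $lira  = "\x{20BA}";   # TURKISH LIRA SIGN (assumed Age 6.2)

# Loose rules: case and white space may vary freely.
my %loose = map { $_ => try_prop($alpha, $_) }
    ('Script=Greek', 'script=greek', 'Script = Greek', 'SCRIPT : GREEK');

# Tight rules on the value, but white space next to the braces and the
# separator is still ignored.
my %tight = map { $_ => try_prop($lira, $_) }
    ('Age: 6.2', 'Age=V6_2', ' Age = V6_2 ');

for my $spec (sort keys %loose, sort keys %tight) {
    my $r = exists $loose{$spec} ? $loose{$spec} : $tight{$spec};
    printf "%-18s %s\n", "'$spec'", defined $r ? $r : 'rejected';
}
```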
96
97 Some properties are considered obsolete by Unicode, but still
98 available. There are several varieties of obsolescence:
99
100 Stabilized
101 A property may be stabilized. Such a determination does not
102 indicate that the property should or should not be used;
103 instead it is a declaration that the property will not be
104 maintained nor extended for newly encoded characters. Such
105 properties are marked with an 'S' in the table.
106
107 Deprecated
108 A property may be deprecated, perhaps because its original
109 intent has been replaced by another property, or because its
110 specification was somehow defective. This means that its use
111 is strongly discouraged, so much so that a warning will be
112 issued if it is used, unless the regular expression is in the scope
113 of a "no warnings 'deprecated'" statement. A 'D' flags each
114 such entry in the table, and the entry there for the longest,
115 most descriptive version of the property will give the reason
116 it is deprecated, and perhaps advice. Perl may issue such a
117 warning, even for properties that aren't officially deprecated
118 by Unicode, when there used to be characters or code points
119 that were matched by them, but no longer. This is to warn you
120 that your program may not work like it did on earlier Unicode
121 releases.
122
123 Deprecated properties may be made unavailable in a future Perl
124 version, so it is best to move away from them.
125
126 A deprecated property may also be stabilized, but this fact is
127 not shown.
128
129 Obsolete
130 Properties marked with an 'O' in the table are considered
131 (plain) obsolete. Generally this designation is given to
132 properties that Unicode once used for internal purposes (but
133 not any longer).
134
135 Discouraged
136 This is not actually a Unicode-specified obsolescence, but
137 applies to certain Perl extensions that are present for
138 backwards compatibility, but are discouraged from being used.
139 These are not obsolete, but their meanings are not stable.
140 Future Unicode versions could force any of these extensions to
141 be removed without warning, replaced by another property with
142 the same name that means something different. An 'X' flags
143 each such entry in the table. Use the equivalent shown
144 instead.
145
146 In particular, matches in the Block property have single forms
147 defined by Perl that begin with "In_", "Is_", or even with no
148 prefix at all. Like all DISCOURAGED forms, these are not
149 stable. For example, "\p{Block=Deseret}" can currently be
150 written as "\p{In_Deseret}", "\p{Is_Deseret}", or
151 "\p{Deseret}". But, a new Unicode version may come along that
152 would force Perl to change the meaning of one or more of these,
153 and your program would no longer be correct. Currently there
154 are no such conflicts with the form that begins "In_", but
155 there are many with the other two shortcuts, and Unicode
156 continues to define new properties that begin with "In", so
157 it's quite possible that a conflict will occur in the future.
158 The compound form is guaranteed to not become obsolete, and its
159 meaning is clearer anyway. See "Blocks" in perlunicode for
160 more information about this.
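
The difference between a script name and a block of the same name, and the reason the compound Block form is the safer spelling, can be sketched as follows. U+0750 is chosen on the assumption that it is an Arabic-script letter located in the Arabic Supplement block (U+0750 to U+077F); the variable names are illustrative.

```perl
use strict;
use warnings;

my $char = "\x{0750}";   # an Arabic letter in the Arabic Supplement block

# The single form tests the script, not the block.
my $is_arabic_script = $char =~ /\p{Arabic}/                  ? 1 : 0;

# The compound forms state explicitly that a block is meant.
my $in_arabic_block  = $char =~ /\p{Block=Arabic}/            ? 1 : 0;
my $in_supplement    = $char =~ /\p{Block=Arabic_Supplement}/ ? 1 : 0;
my $in_supplement_sh = $char =~ /\p{Blk=ArabicSup}/           ? 1 : 0;

print "script=$is_arabic_script block=$in_arabic_block ",
      "supplement=$in_supplement short=$in_supplement_sh\n";
```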
161
162 The table below has two columns. The left column contains the "\p{}"
163 constructs to look up, possibly preceded by the flags mentioned above;
164 and the right column contains information about them, like a
165 description, or synonyms. The table shows both the single and compound
166 forms for each property that has them. If the left column is a short
167 name for a property, the right column will give its longer, more
168 descriptive name; and if the left column is the longest name, the right
169 column will show any equivalent shortest name, in both single and
170 compound forms if applicable.
171
172 If braces are not needed to specify a property (e.g., "\pL"), the left
173 column contains both forms, with and without braces.
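
For a one-letter property name such as L (Letter), the braced and unbraced spellings behave the same. A minimal sketch, with illustrative variable names:

```perl
use strict;
use warnings;

# Braces are optional when the property name is a single letter.
my $braced   = "A" =~ /^\p{L}$/ ? 1 : 0;
my $unbraced = "A" =~ /^\pL$/   ? 1 : 0;

# The negated form follows the same convention.
my $digit_is_not_letter = "1" =~ /^\PL$/ ? 1 : 0;

print "braced=$braced unbraced=$unbraced negated=$digit_is_not_letter\n";
```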
174
175 The right column will also caution you if a property means something
176 different than what might normally be expected.
177
178 All single forms are Perl extensions; a few compound forms are as well,
179 and are noted as such.
180
181 Numbers in (parentheses) indicate the total number of Unicode code
182 points matched by the property. For emphasis, those properties that
183 match no code points at all are listed as well in a separate section
184 following the table.
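
One way to reproduce such a count for the Unicode version built into your own Perl is to sum the ranges returned by "prop_invlist" from Unicode::UCD. The helper name "count_code_points" is hypothetical, and the figures will differ from those in this document whenever the underlying Unicode version differs.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Hypothetical helper: count the Unicode code points in a property by
# summing the ranges of its inversion list.  Even-indexed elements start
# a range that is in the property; the following element ends it.
sub count_code_points {
    my ($property) = @_;
    my @invlist = prop_invlist($property);
    my $count   = 0;
    for (my $i = 0; $i < @invlist; $i += 2) {
        # An unpaired final element means the range runs to the end of
        # Unicode; above-Unicode code points are not counted.
        my $end = $i + 1 < @invlist ? $invlist[$i + 1] : 0x110000;
        $count += $end - $invlist[$i];
    }
    return $count;
}

my $ahex_count  = count_code_points('ASCII_Hex_Digit');
my $ascii_count = count_code_points('Block=Basic_Latin');

print "ASCII_Hex_Digit: $ahex_count\n";
print "Block=Basic_Latin: $ascii_count\n";
```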
185
186 Most properties match the same code points regardless of whether "/i"
187 case-insensitive matching is specified or not. But a few properties
188 are affected. These are shown with the notation "(/i= other_property)"
189 in the second column. Under case-insensitive matching they match the
190 same code points as the property other_property.
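
As an illustration, "\p{Lu}" (Uppercase_Letter) is one of the properties affected: under "/i" it is expected to match any cased letter. A minimal sketch, with illustrative variable names:

```perl
use strict;
use warnings;

# Without /i, a lower case letter is not an Uppercase_Letter.
my $plain  = "a" =~ /\p{Lu}/  ? 1 : 0;

# With /i, the property widens to cover cased letters in general.
my $folded = "a" =~ /\p{Lu}/i ? 1 : 0;

# Characters without case remain unmatched either way.
my $digit  = "1" =~ /\p{Lu}/i ? 1 : 0;

print "plain=$plain folded=$folded digit=$digit\n";
```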
191
192 There is no description given for most non-Perl defined properties (see
193 <http://www.unicode.org/reports/tr44/> for that).
194
195 For compactness, '*' is used as a wildcard instead of showing all
196 possible combinations. For example, entries like:
197
198 \p{Gc: *} \p{General_Category: *}
199
200 mean that 'Gc' is a synonym for 'General_Category', and anything that
201 is valid for the latter is also valid for the former. Similarly,
202
203 \p{Is_*} \p{*}
204
205 means that if and only if, for example, "\p{Foo}" exists, then
206 "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
207 thing. And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
208 and "\p{IsFoo=Bar}". "*" here is restricted to something not beginning
209 with an underscore.
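
The wildcard equivalences can be probed at run time. The sketch below assumes a hypothetical helper, "try_prop", that reports undef for any spelling your Perl rejects, and uses the General_Category value Lu as the example.

```perl
use strict;
use warnings;

# Hypothetical helper: returns 1 or 0 for match or no match, and undef
# if Perl rejects the property specification.
sub try_prop {
    my ($char, $spec) = @_;
    my $pattern = '\p{' . $spec . '}';
    my $result  = eval { $char =~ /$pattern/ ? 1 : 0 };
    return $result;
}

# Synonymous spellings for "is an upper case letter".
my @specs = (
    'General_Category=Uppercase_Letter',
    'General_Category=Lu',
    'Gc=Lu',
    'Lu',
    'Is_Lu',
    'IsLu',
    'Is_Gc=Lu',
);

my %result = map { $_ => try_prop("A", $_) } @specs;

for my $spec (@specs) {
    my $r = $result{$spec};
    printf "%-36s %s\n", $spec, defined $r ? $r : 'rejected';
}
```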
210
211 Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
212 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table
213 shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
214 for the other possibilities. Note that not all properties which have
215 values 'Yes' and 'No' are binary, and they have all their values
216 spelled out without using this wild card, and a "NOT" clause in their
217 description that highlights their not being binary. These also require
218 the compound form to match them, whereas true binary properties have
219 both single and compound forms available.
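
The synonyms for the values of a binary property can be checked in a loop. This sketch uses Alphabetic as the example property and builds each pattern at run time; the variable names are illustrative.

```perl
use strict;
use warnings;

# Every "yes" spelling should match a letter.
my @yes = grep {
    my $pattern = '\p{Alphabetic=' . $_ . '}';
    "A" =~ /$pattern/;
} qw(Y Yes T True);

# Every "no" spelling should match a digit, which is not alphabetic.
my @no = grep {
    my $pattern = '\p{Alphabetic=' . $_ . '}';
    "1" =~ /$pattern/;
} qw(N No F False);

# A true binary property also has a single form.
my $single = "A" =~ /\p{Alphabetic}/ ? 1 : 0;

print "yes spellings that matched: @yes\n";
print "no spellings that matched:  @no\n";
print "single form: $single\n";
```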
220
221 Note that all non-essential underscores are removed in the display of
222 the short names below.
223
224 Legend summary:
225
226 * is a wild-card
227 (\d+) in the info column gives the number of Unicode code points
228 matched by this property.
229 D means this is deprecated.
230 O means this is obsolete.
231 S means this is stabilized.
232 T means tighter (stricter) name matching applies.
233 X means use of this form is discouraged, and may not be stable.
234
235 NAME INFO
236
237 \p{Adlam} \p{Script_Extensions=Adlam} (Short:
238 \p{Adlm}; NOT \p{Block=Adlam}) (88)
239 \p{Adlm} \p{Adlam} (= \p{Script_Extensions=Adlam})
240 (NOT \p{Block=Adlam}) (88)
241 X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64)
242 T \p{Age: 1.1} \p{Age=V1_1} (33_979)
243 T \p{Age: 2.0} \p{Age=V2_0} (144_521)
244 T \p{Age: 2.1} \p{Age=V2_1} (2)
245 T \p{Age: 3.0} \p{Age=V3_0} (10_307)
246 T \p{Age: 3.1} \p{Age=V3_1} (44_978)
247 T \p{Age: 3.2} \p{Age=V3_2} (1016)
248 T \p{Age: 4.0} \p{Age=V4_0} (1226)
249 T \p{Age: 4.1} \p{Age=V4_1} (1273)
250 T \p{Age: 5.0} \p{Age=V5_0} (1369)
251 T \p{Age: 5.1} \p{Age=V5_1} (1624)
252 T \p{Age: 5.2} \p{Age=V5_2} (6648)
253 T \p{Age: 6.0} \p{Age=V6_0} (2088)
254 T \p{Age: 6.1} \p{Age=V6_1} (732)
255 T \p{Age: 6.2} \p{Age=V6_2} (1)
256 T \p{Age: 6.3} \p{Age=V6_3} (5)
257 T \p{Age: 7.0} \p{Age=V7_0} (2834)
258 T \p{Age: 8.0} \p{Age=V8_0} (7716)
259 T \p{Age: 9.0} \p{Age=V9_0} (7500)
260 \p{Age: NA} \p{Age=Unassigned} (846_293 plus all
261 above-Unicode code points)
262 \p{Age: Unassigned} Code point's usage has not been assigned
263 in any Unicode release thus far. (Short:
264 \p{Age=NA}) (846_293 plus all above-
265 Unicode code points)
266 \p{Age: V1_1} Code point's usage introduced in version
267 1.1 (33_979)
268 \p{Age: V2_0} Code point's usage was introduced in
269 version 2.0; See also Property
270 'Present_In' (144_521)
271 \p{Age: V2_1} Code point's usage was introduced in
272 version 2.1; See also Property
273 'Present_In' (2)
274 \p{Age: V3_0} Code point's usage was introduced in
275 version 3.0; See also Property
276 'Present_In' (10_307)
277 \p{Age: V3_1} Code point's usage was introduced in
278 version 3.1; See also Property
279 'Present_In' (44_978)
280 \p{Age: V3_2} Code point's usage was introduced in
281 version 3.2; See also Property
282 'Present_In' (1016)
283 \p{Age: V4_0} Code point's usage was introduced in
284 version 4.0; See also Property
285 'Present_In' (1226)
286 \p{Age: V4_1} Code point's usage was introduced in
287 version 4.1; See also Property
288 'Present_In' (1273)
289 \p{Age: V5_0} Code point's usage was introduced in
290 version 5.0; See also Property
291 'Present_In' (1369)
292 \p{Age: V5_1} Code point's usage was introduced in
293 version 5.1; See also Property
294 'Present_In' (1624)
295 \p{Age: V5_2} Code point's usage was introduced in
296 version 5.2; See also Property
297 'Present_In' (6648)
298 \p{Age: V6_0} Code point's usage was introduced in
299 version 6.0; See also Property
300 'Present_In' (2088)
301 \p{Age: V6_1} Code point's usage was introduced in
302 version 6.1; See also Property
303 'Present_In' (732)
304 \p{Age: V6_2} Code point's usage was introduced in
305 version 6.2; See also Property
306 'Present_In' (1)
307 \p{Age: V6_3} Code point's usage was introduced in
308 version 6.3; See also Property
309 'Present_In' (5)
310 \p{Age: V7_0} Code point's usage was introduced in
311 version 7.0; See also Property
312 'Present_In' (2834)
313 \p{Age: V8_0} Code point's usage was introduced in
314 version 8.0; See also Property
315 'Present_In' (7716)
316 \p{Age: V9_0} Code point's usage was introduced in
317 version 9.0; See also Property
318 'Present_In' (7500)
319 \p{Aghb} \p{Caucasian_Albanian} (=
320 \p{Script_Extensions=
321 Caucasian_Albanian}) (NOT \p{Block=
322 Caucasian_Albanian}) (53)
323 \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
324 (22)
325 \p{AHex: *} \p{ASCII_Hex_Digit: *}
326 \p{Ahom} \p{Script_Extensions=Ahom} (NOT \p{Block=
327 Ahom}) (57)
328 X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block=
329 Alchemical_Symbols}) (128)
330 X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
331 \p{InAlchemical}) (128)
332 \p{All} All code points, including those above
333 Unicode. Same as qr/./s (1_114_112 plus
334 all above-Unicode code points)
335 \p{Alnum} \p{XPosixAlnum} (118_820)
336 \p{Alpha} \p{XPosixAlpha} (= \p{Alphabetic=Y})
337 (118_240)
338 \p{Alpha: *} \p{Alphabetic: *}
339 \p{Alphabetic} \p{XPosixAlpha} (= \p{Alphabetic=Y})
340 (118_240)
341 \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (995_872
342 plus all above-Unicode code points)
343 \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (118_240)
344 X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (=
345 \p{Block=Alphabetic_Presentation_Forms})
346 (80)
347 X \p{Alphabetic_Presentation_Forms} \p{Block=
348 Alphabetic_Presentation_Forms} (Short:
349 \p{InAlphabeticPF}) (80)
350 \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
351 Anatolian_Hieroglyphs} (Short: \p{Hluw};
352 NOT \p{Block=Anatolian_Hieroglyphs})
353 (583)
354 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
355 \p{Block=
356 Ancient_Greek_Musical_Notation}) (80)
357 X \p{Ancient_Greek_Musical_Notation} \p{Block=
358 Ancient_Greek_Musical_Notation} (Short:
359 \p{InAncientGreekMusic}) (80)
360 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
361 X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64)
362 \p{Any} All Unicode code points: [\x{0000}-
363 \x{10FFFF}] (1_114_112)
364 \p{Arab} \p{Arabic} (= \p{Script_Extensions=
365 Arabic}) (NOT \p{Block=Arabic}) (1323)
366 \p{Arabic} \p{Script_Extensions=Arabic} (Short:
367 \p{Arab}; NOT \p{Block=Arabic}) (1323)
368 X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block=
369 Arabic_Extended_A}) (96)
370 X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short:
371 \p{InArabicExtA}) (96)
372 X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols}
373 (= \p{Block=
374 Arabic_Mathematical_Alphabetic_Symbols})
375 (256)
376 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
377 Arabic_Mathematical_Alphabetic_Symbols}
378 (Short: \p{InArabicMath}) (256)
379 X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (=
380 \p{Block=Arabic_Presentation_Forms_A})
381 (688)
382 X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (=
383 \p{Block=Arabic_Presentation_Forms_B})
384 (144)
385 X \p{Arabic_Presentation_Forms_A} \p{Block=
386 Arabic_Presentation_Forms_A} (Short:
387 \p{InArabicPFA}) (688)
388 X \p{Arabic_Presentation_Forms_B} \p{Block=
389 Arabic_Presentation_Forms_B} (Short:
390 \p{InArabicPFB}) (144)
391 X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block=
392 Arabic_Supplement}) (48)
393 X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short:
394 \p{InArabicSup}) (48)
395 \p{Armenian} \p{Script_Extensions=Armenian} (Short:
396 \p{Armn}; NOT \p{Block=Armenian}) (94)
397 \p{Armi} \p{Imperial_Aramaic} (=
398 \p{Script_Extensions=Imperial_Aramaic})
399 (NOT \p{Block=Imperial_Aramaic}) (31)
400 \p{Armn} \p{Armenian} (= \p{Script_Extensions=
401 Armenian}) (NOT \p{Block=Armenian}) (94)
402 X \p{Arrows} \p{Block=Arrows} (112)
403 \p{ASCII} \p{Block=Basic_Latin} [[:ASCII:]] (128)
404 \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
405 (22)
406 \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
407 plus all above-Unicode code points)
408 \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22)
409 \p{Assigned} All assigned code points (267_753)
410 \p{Avestan} \p{Script_Extensions=Avestan} (Short:
411 \p{Avst}; NOT \p{Block=Avestan}) (61)
412 \p{Avst} \p{Avestan} (= \p{Script_Extensions=
413 Avestan}) (NOT \p{Block=Avestan}) (61)
414 \p{Bali} \p{Balinese} (= \p{Script_Extensions=
415 Balinese}) (NOT \p{Block=Balinese}) (121)
416 \p{Balinese} \p{Script_Extensions=Balinese} (Short:
417 \p{Bali}; NOT \p{Block=Balinese}) (121)
418 \p{Bamu} \p{Bamum} (= \p{Script_Extensions=Bamum})
419 (NOT \p{Block=Bamum}) (657)
420 \p{Bamum} \p{Script_Extensions=Bamum} (Short:
421 \p{Bamu}; NOT \p{Block=Bamum}) (657)
422 X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block=
423 Bamum_Supplement}) (576)
424 X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short:
425 \p{InBamumSup}) (576)
426 X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128)
427 \p{Bass} \p{Bassa_Vah} (= \p{Script_Extensions=
428 Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
429 (36)
430 \p{Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short:
431 \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
432 \p{Batak} \p{Script_Extensions=Batak} (Short:
433 \p{Batk}; NOT \p{Block=Batak}) (56)
434 \p{Batk} \p{Batak} (= \p{Script_Extensions=Batak})
435 (NOT \p{Block=Batak}) (56)
436 \p{Bc: *} \p{Bidi_Class: *}
437 \p{Beng} \p{Bengali} (= \p{Script_Extensions=
438 Bengali}) (NOT \p{Block=Bengali}) (98)
439 \p{Bengali} \p{Script_Extensions=Bengali} (Short:
440 \p{Beng}; NOT \p{Block=Bengali}) (98)
441 \p{Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short:
442 \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
443 \p{Bhks} \p{Bhaiksuki} (= \p{Script_Extensions=
444 Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
445 (97)
446 \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y})
447 (12)
448 \p{Bidi_C: *} \p{Bidi_Control: *}
449 \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1420)
450 \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (51)
451 \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1420)
452 \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (51)
453 \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7)
454 \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4016)
455 \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016)
456 \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15)
457 \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15)
458 \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (158)
459 \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12)
460 \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (87)
461 \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (158)
462 \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12)
463 \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (87)
464 \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1)
465 \p{Bidi_Class: FSI} \p{Bidi_Class=First_Strong_Isolate} (1)
466 \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_097_280
467 plus all above-Unicode code points)
468 \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_097_280 plus
469 all above-Unicode code points)
470 \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1)
471 \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1)
472 \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1)
473 \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1)
474 \p{Bidi_Class: LRI} \p{Bidi_Class=Left_To_Right_Isolate} (1)
475 \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1)
476 \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1700)
477 \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1700)
478 \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (5267)
479 \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5267)
480 \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7)
481 \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1)
482 \p{Bidi_Class: PDI} \p{Bidi_Class=Pop_Directional_Isolate} (1)
483 \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1)
484 \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1)
485 \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (4070)
486 \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (4070)
487 \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1)
488 \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1)
489 \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1)
490 \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1)
491 \p{Bidi_Class: RLI} \p{Bidi_Class=Right_To_Left_Isolate} (1)
492 \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1)
493 \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3)
494 \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3)
495 \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17)
496 \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (17)
497 \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
498 \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
499 plus all above-Unicode code points)
500 \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (12)
501 \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
502 (545)
503 \p{Bidi_M: *} \p{Bidi_Mirrored: *}
504 \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
505 (545)
506 \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
507 plus all above-Unicode code points)
508 \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (545)
509 \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
510 (60)
511 \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60)
512 \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
513 (1_113_992 plus all above-Unicode code
514 points)
515 \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
516 plus all above-Unicode code points)
517 \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
518 (60)
519 \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60)
520 \p{Blank} \p{XPosixBlank} (18)
521 \p{Blk: *} \p{Block: *}
522 \p{Block: Adlam} (NOT \p{Adlam} NOR \p{Is_Adlam}) (96)
523 \p{Block: Aegean_Numbers} (64)
524 \p{Block: Ahom} (NOT \p{Ahom} NOR \p{Is_Ahom}) (64)
525 \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128)
526 \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128)
527 \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
528 (80)
529 \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
530 AlphabeticPF}) (80)
531 \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
532 NOR \p{Is_Anatolian_Hieroglyphs}) (640)
533 \p{Block: Ancient_Greek_Music} \p{Block=
534 Ancient_Greek_Musical_Notation} (80)
535 \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
536 AncientGreekMusic}) (80)
537 \p{Block: Ancient_Greek_Numbers} (80)
538 \p{Block: Ancient_Symbols} (64)
539 \p{Block: Arabic} (NOT \p{Arabic} NOR \p{Is_Arabic}) (256)
540 \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
541 \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96)
542 \p{Block: Arabic_Math} \p{Block=
543 Arabic_Mathematical_Alphabetic_Symbols}
544 (256)
545 \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
546 ArabicMath}) (256)
547 \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688)
548 \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144)
549 \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
550 (688)
551 \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
552 (144)
553 \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48)
554 \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48)
555 \p{Block: Armenian} (NOT \p{Armenian} NOR \p{Is_Armenian}) (96)
556 \p{Block: Arrows} (112)
557 \p{Block: ASCII} \p{Block=Basic_Latin} (128)
558 \p{Block: Avestan} (NOT \p{Avestan} NOR \p{Is_Avestan}) (64)
559 \p{Block: Balinese} (NOT \p{Balinese} NOR \p{Is_Balinese})
560 (128)
561 \p{Block: Bamum} (NOT \p{Bamum} NOR \p{Is_Bamum}) (96)
562 \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576)
563 \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576)
564 \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}) (128)
565 \p{Block: Bassa_Vah} (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
566 (48)
567 \p{Block: Batak} (NOT \p{Batak} NOR \p{Is_Batak}) (64)
568 \p{Block: Bengali} (NOT \p{Bengali} NOR \p{Is_Bengali}) (128)
569 \p{Block: Bhaiksuki} (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
570 (112)
571 \p{Block: Block_Elements} (32)
572 \p{Block: Bopomofo} (NOT \p{Bopomofo} NOR \p{Is_Bopomofo}) (48)
573 \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
574 \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32)
575 \p{Block: Box_Drawing} (128)
576 \p{Block: Brahmi} (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128)
577 \p{Block: Braille} \p{Block=Braille_Patterns} (256)
578 \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256)
579 \p{Block: Buginese} (NOT \p{Buginese} NOR \p{Is_Buginese}) (32)
580 \p{Block: Buhid} (NOT \p{Buhid} NOR \p{Is_Buhid}) (32)
581 \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
582 (256)
583 \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
584 ByzantineMusic}) (256)
585 \p{Block: Canadian_Syllabics} \p{Block=
586 Unified_Canadian_Aboriginal_Syllabics}
587 (640)
588 \p{Block: Carian} (NOT \p{Carian} NOR \p{Is_Carian}) (64)
589 \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
590 \p{Is_Caucasian_Albanian}) (64)
591 \p{Block: Chakma} (NOT \p{Chakma} NOR \p{Is_Chakma}) (80)
592 \p{Block: Cham} (NOT \p{Cham} NOR \p{Is_Cham}) (96)
593 \p{Block: Cherokee} (NOT \p{Cherokee} NOR \p{Is_Cherokee}) (96)
594 \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
595 \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80)
596 \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992)
597 \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256)
598 \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
599 \p{Block: CJK_Compat_Ideographs} \p{Block=
600 CJK_Compatibility_Ideographs} (512)
601 \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
602 CJK_Compatibility_Ideographs_Supplement}
603 (544)
604 \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256)
605 \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
606 (32)
607 \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
608 CJKCompatIdeographs}) (512)
609 \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
610 CJKCompatIdeographsSup}) (544)
611 \p{Block: CJK_Ext_A} \p{Block=
612 CJK_Unified_Ideographs_Extension_A}
613 (6592)
614 \p{Block: CJK_Ext_B} \p{Block=
615 CJK_Unified_Ideographs_Extension_B}
616 (42_720)
617 \p{Block: CJK_Ext_C} \p{Block=
618 CJK_Unified_Ideographs_Extension_C}
619 (4160)
620 \p{Block: CJK_Ext_D} \p{Block=
621 CJK_Unified_Ideographs_Extension_D} (224)
622 \p{Block: CJK_Ext_E} \p{Block=
623 CJK_Unified_Ideographs_Extension_E}
624 (5776)
625 \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
626 \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
627 (128)
628 \p{Block: CJK_Strokes} (48)
629 \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64)
630 \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
631 (64)
632 \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992)
633 \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
634 CJKExtA}) (6592)
635 \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
636 CJKExtB}) (42_720)
637 \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
638 CJKExtC}) (4160)
639 \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
640 CJKExtD}) (224)
641 \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
642 CJKExtE}) (5776)
643 \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
644 Diacriticals}) (112)
645 \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
646 DiacriticalsExt}) (80)
647 \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
648 DiacriticalsForSymbols}) (48)
649 \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
650 DiacriticalsSup}) (64)
651 \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16)
652 \p{Block: Combining_Marks_For_Symbols} \p{Block=
653 Combining_Diacritical_Marks_For_Symbols}
654 (48)
655 \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
656 IndicNumberForms}) (16)
657 \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96)
658 \p{Block: Control_Pictures} (64)
659 \p{Block: Coptic} (NOT \p{Coptic} NOR \p{Is_Coptic}) (128)
660 \p{Block: Coptic_Epact_Numbers} (32)
661 \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
662 \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32)
663 \p{Block: Cuneiform} (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
664 (1024)
665 \p{Block: Cuneiform_Numbers} \p{Block=
666 Cuneiform_Numbers_And_Punctuation} (128)
667 \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
668 CuneiformNumbers}) (128)
669 \p{Block: Currency_Symbols} (48)
670 \p{Block: Cypriot_Syllabary} (64)
671 \p{Block: Cyrillic} (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
672 (256)
673 \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
674 \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
675 \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
676 \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32)
677 \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96)
678 \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16)
679 \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
680 \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48)
681 \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
682 (48)
683 \p{Block: Deseret} (80)
684 \p{Block: Devanagari} (NOT \p{Devanagari} NOR \p{Is_Devanagari})
685 (128)
686 \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
687 \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32)
688 \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
689 \p{Block: Diacriticals_Ext} \p{Block=
690 Combining_Diacritical_Marks_Extended}
691 (80)
692 \p{Block: Diacriticals_For_Symbols} \p{Block=
693 Combining_Diacritical_Marks_For_Symbols}
694 (48)
695 \p{Block: Diacriticals_Sup} \p{Block=
696 Combining_Diacritical_Marks_Supplement}
697 (64)
698 \p{Block: Dingbats} (192)
699 \p{Block: Domino} \p{Block=Domino_Tiles} (112)
700 \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112)
701 \p{Block: Duployan} (NOT \p{Duployan} NOR \p{Is_Duployan})
702 (160)
703 \p{Block: Early_Dynastic_Cuneiform} (208)
704 \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
705 \p{Is_Egyptian_Hieroglyphs}) (1072)
706 \p{Block: Elbasan} (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48)
707 \p{Block: Emoticons} (80)
708 \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
709 \p{Block: Enclosed_Alphanum_Sup} \p{Block=
710 Enclosed_Alphanumeric_Supplement} (256)
711 \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
712 EnclosedAlphanumSup}) (256)
713 \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
714 EnclosedAlphanum}) (160)
715 \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
716 (256)
717 \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
718 EnclosedCJK}) (256)
719 \p{Block: Enclosed_Ideographic_Sup} \p{Block=
720 Enclosed_Ideographic_Supplement} (256)
721 \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
722 EnclosedIdeographicSup}) (256)
723 \p{Block: Ethiopic} (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
724 (384)
725 \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
726 \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
727 \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96)
728 \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48)
729 \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
730 \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32)
731 \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
732 \p{Punct} NOR \p{Is_Punctuation}) (112)
733 \p{Block: Geometric_Shapes} (96)
734 \p{Block: Geometric_Shapes_Ext} \p{Block=
735 Geometric_Shapes_Extended} (128)
736 \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
737 GeometricShapesExt}) (128)
738 \p{Block: Georgian} (NOT \p{Georgian} NOR \p{Is_Georgian}) (96)
739 \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
740 \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48)
741 \p{Block: Glagolitic} (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
742 (96)
743 \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
744 \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
745 (48)
746 \p{Block: Gothic} (NOT \p{Gothic} NOR \p{Is_Gothic}) (32)
747 \p{Block: Grantha} (NOT \p{Grantha} NOR \p{Is_Grantha}) (128)
748 \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek}
749 NOR \p{Is_Greek}) (144)
750 \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
751 NOR \p{Is_Greek}) (144)
752 \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256)
753 \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256)
754 \p{Block: Gujarati} (NOT \p{Gujarati} NOR \p{Is_Gujarati})
755 (128)
756 \p{Block: Gurmukhi} (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
757 (128)
758 \p{Block: Half_And_Full_Forms} \p{Block=
759 Halfwidth_And_Fullwidth_Forms} (240)
760 \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16)
761 \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
762 HalfAndFullForms}) (240)
763 \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
764 NOR \p{Is_Hangul}) (11_184)
765 \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
766 (96)
767 \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}) (256)
768 \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32)
769 \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80)
770 \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
771 NOR \p{Is_Hangul}) (11_184)
772 \p{Block: Hanunoo} (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32)
773 \p{Block: Hatran} (NOT \p{Hatran} NOR \p{Is_Hatran}) (32)
774 \p{Block: Hebrew} (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112)
775 \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
776 HighPUSurrogates}) (128)
777 \p{Block: High_PU_Surrogates} \p{Block=
778 High_Private_Use_Surrogates} (128)
779 \p{Block: High_Surrogates} (896)
780 \p{Block: Hiragana} (NOT \p{Hiragana} NOR \p{Is_Hiragana}) (96)
781 \p{Block: IDC} \p{Block=
782 Ideographic_Description_Characters} (NOT
783 \p{ID_Continue} NOR \p{Is_IDC}) (16)
784 \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
785 NOT \p{ID_Continue} NOR \p{Is_IDC}) (16)
786 \p{Block: Ideographic_Symbols} \p{Block=
787 Ideographic_Symbols_And_Punctuation} (32)
788 \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
789 IdeographicSymbols}) (32)
790 \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
791 \p{Is_Imperial_Aramaic}) (32)
792 \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
793 (16)
794 \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
795 NOR \p{Is_Inscriptional_Pahlavi}) (32)
796 \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
797 NOR \p{Is_Inscriptional_Parthian}) (32)
798 \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
799 \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96)
800 \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
801 \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
802 \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
803 \p{Block: Javanese} (NOT \p{Javanese} NOR \p{Is_Javanese}) (96)
804 \p{Block: Kaithi} (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80)
805 \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
806 \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256)
807 \p{Block: Kanbun} (16)
808 \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
809 \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224)
810 \p{Block: Kannada} (NOT \p{Kannada} NOR \p{Is_Kannada}) (128)
811 \p{Block: Katakana} (NOT \p{Katakana} NOR \p{Is_Katakana}) (96)
812 \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
813 \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
814 KatakanaExt}) (16)
815 \p{Block: Kayah_Li} (48)
816 \p{Block: Kharoshthi} (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
817 (96)
818 \p{Block: Khmer} (NOT \p{Khmer} NOR \p{Is_Khmer}) (128)
819 \p{Block: Khmer_Symbols} (32)
820 \p{Block: Khojki} (NOT \p{Khojki} NOR \p{Is_Khojki}) (80)
821 \p{Block: Khudawadi} (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
822 (80)
823 \p{Block: Lao} (NOT \p{Lao} NOR \p{Is_Lao}) (128)
824 \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
825 \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
826 \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128)
827 \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
828 \p{Block: Latin_Ext_Additional} \p{Block=
829 Latin_Extended_Additional} (256)
830 \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
831 \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
832 \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
833 \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64)
834 \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128)
835 \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
836 LatinExtAdditional}) (256)
837 \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208)
838 \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32)
839 \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224)
840 \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64)
841 \p{Block: Lepcha} (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80)
842 \p{Block: Letterlike_Symbols} (80)
843 \p{Block: Limbu} (NOT \p{Limbu} NOR \p{Is_Limbu}) (80)
844 \p{Block: Linear_A} (NOT \p{Linear_A} NOR \p{Is_Linear_A})
845 (384)
846 \p{Block: Linear_B_Ideograms} (128)
847 \p{Block: Linear_B_Syllabary} (128)
848 \p{Block: Lisu} (48)
849 \p{Block: Low_Surrogates} (1024)
850 \p{Block: Lycian} (NOT \p{Lycian} NOR \p{Is_Lycian}) (32)
851 \p{Block: Lydian} (NOT \p{Lydian} NOR \p{Is_Lydian}) (32)
852 \p{Block: Mahajani} (NOT \p{Mahajani} NOR \p{Is_Mahajani}) (48)
853 \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
854 \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48)
855 \p{Block: Malayalam} (NOT \p{Malayalam} NOR \p{Is_Malayalam})
856 (128)
857 \p{Block: Mandaic} (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32)
858 \p{Block: Manichaean} (NOT \p{Manichaean} NOR \p{Is_Manichaean})
859 (64)
860 \p{Block: Marchen} (NOT \p{Marchen} NOR \p{Is_Marchen}) (80)
861 \p{Block: Math_Alphanum} \p{Block=
862 Mathematical_Alphanumeric_Symbols} (1024)
863 \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
864 \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
865 MathAlphanum}) (1024)
866 \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
867 (256)
868 \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
869 \p{Is_Meetei_Mayek}) (64)
870 \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
871 \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
872 (32)
873 \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
874 \p{Is_Mende_Kikakui}) (224)
875 \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
876 \p{Is_Meroitic_Cursive}) (96)
877 \p{Block: Meroitic_Hieroglyphs} (32)
878 \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160)
879 \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
880 (256)
881 \p{Block: Misc_Math_Symbols_A} \p{Block=
882 Miscellaneous_Mathematical_Symbols_A}
883 (48)
884 \p{Block: Misc_Math_Symbols_B} \p{Block=
885 Miscellaneous_Mathematical_Symbols_B}
886 (128)
887 \p{Block: Misc_Pictographs} \p{Block=
888 Miscellaneous_Symbols_And_Pictographs}
889 (768)
890 \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
891 \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
892 \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
893 MiscMathSymbolsA}) (48)
894 \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
895 MiscMathSymbolsB}) (128)
896 \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols}) (256)
897 \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
898 MiscArrows}) (256)
899 \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
900 MiscPictographs}) (768)
901 \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
902 (256)
903 \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96)
904 \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
905 \p{Block: Modifier_Tone_Letters} (32)
906 \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian})
907 (176)
908 \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
909 \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32)
910 \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48)
911 \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48)
912 \p{Block: Music} \p{Block=Musical_Symbols} (256)
913 \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256)
914 \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160)
915 \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
916 \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
917 \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32)
918 \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32)
919 \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean})
920 (48)
921 \p{Block: NB} \p{Block=No_Block} (842_320 plus all
922 above-Unicode code points)
923 \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR
924 \p{Is_New_Tai_Lue}) (96)
925 \p{Block: Newa} (NOT \p{Newa} NOR \p{Is_Newa}) (128)
926 \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64)
927 \p{Block: No_Block} (Short: \p{Blk=NB}) (842_320 plus all
928 above-Unicode code points)
929 \p{Block: Number_Forms} (64)
930 \p{Block: OCR} \p{Block=Optical_Character_Recognition}
931 (32)
932 \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32)
933 \p{Block: Ol_Chiki} (48)
934 \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
935 \p{Is_Old_Hungarian}) (128)
936 \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
937 (48)
938 \p{Block: Old_North_Arabian} (32)
939 \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
940 (48)
941 \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR
942 \p{Is_Old_Persian}) (64)
943 \p{Block: Old_South_Arabian} (32)
944 \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
945 (80)
946 \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32)
947 \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128)
948 \p{Block: Ornamental_Dingbats} (48)
949 \p{Block: Osage} (NOT \p{Osage} NOR \p{Is_Osage}) (80)
950 \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48)
951 \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
952 \p{Is_Pahawh_Hmong}) (144)
953 \p{Block: Palmyrene} (32)
954 \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} NOR
955 \p{Is_Pau_Cin_Hau}) (64)
956 \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa}) (64)
957 \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
958 \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48)
959 \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician})
960 (32)
961 \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
962 \p{Block: Phonetic_Ext_Sup} \p{Block=
963 Phonetic_Extensions_Supplement} (64)
964 \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128)
965 \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
966 PhoneticExtSup}) (64)
967 \p{Block: Playing_Cards} (96)
968 \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
969 \p{Private_Use} NOR \p{Is_Private_Use})
970 (6400)
971 \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
972 \p{Private_Use} NOR \p{Is_Private_Use})
973 (6400)
974 \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
975 \p{Is_Psalter_Pahlavi}) (48)
976 \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
977 \p{Private_Use} NOR \p{Is_Private_Use})
978 (6400)
979 \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
980 \p{Punct} NOR \p{Is_Punctuation}) (112)
981 \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48)
982 \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
983 \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32)
984 \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96)
985 \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan})
986 (64)
987 \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
988 (96)
989 \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96)
990 \p{Block: Shavian} (48)
991 \p{Block: Shorthand_Format_Controls} (16)
992 \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128)
993 \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128)
994 \p{Block: Sinhala_Archaic_Numbers} (32)
995 \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32)
996 \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
997 \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
998 \p{Is_Sora_Sompeng}) (48)
999 \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
1000 ModifierLetters}) (80)
1001 \p{Block: Specials} (16)
1002 \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese})
1003 (64)
1004 \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
1005 \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16)
1006 \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
1007 \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
1008 \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
1009 \p{Block: Sup_Math_Operators} \p{Block=
1010 Supplemental_Mathematical_Operators}
1011 (256)
1012 \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
1013 (65_536)
1014 \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
1015 (65_536)
1016 \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
1017 \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
1018 Supplemental_Symbols_And_Pictographs}
1019 (256)
1020 \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
1021 \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
1022 SuperAndSub}) (48)
1023 \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16)
1024 \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128)
1025 \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256)
1026 \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
1027 SupMathOperators}) (256)
1028 \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
1029 SupPunctuation}) (128)
1030 \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
1031 SupSymbolsAndPictographs}) (256)
1032 \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
1033 SupPUAA}) (65_536)
1034 \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
1035 SupPUAB}) (65_536)
1036 \p{Block: Sutton_SignWriting} (688)
1037 \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
1038 \p{Is_Syloti_Nagri}) (48)
1039 \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80)
1040 \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32)
1041 \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa}) (32)
1042 \p{Block: Tags} (128)
1043 \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48)
1044 \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
1045 (144)
1046 \p{Block: Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet}) (96)
1047 \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1048 \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96)
1049 \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80)
1050 \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128)
1051 \p{Block: Tangut} (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144)
1052 \p{Block: Tangut_Components} (768)
1053 \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128)
1054 \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64)
1055 \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128)
1056 \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256)
1057 \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh}) (80)
1058 \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96)
1059 \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1060 (128)
1061 \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1062 TransportAndMap}) (128)
1063 \p{Block: UCAS} \p{Block=
1064 Unified_Canadian_Aboriginal_Syllabics}
1065 (640)
1066 \p{Block: UCAS_Ext} \p{Block=
1067 Unified_Canadian_Aboriginal_Syllabics_Extended}
1068 (80)
1069 \p{Block: Ugaritic} (NOT \p{Ugaritic} NOR \p{Is_Ugaritic}) (32)
1070 \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1071 UCAS}) (640)
1072 \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1073 \p{Blk=UCASExt}) (80)
1074 \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320)
1075 \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1076 \p{Variation_Selector} NOR \p{Is_VS})
1077 (16)
1078 \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1079 (240)
1080 \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
1081 \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48)
1082 \p{Block: Vertical_Forms} (16)
1083 \p{Block: VS} \p{Block=Variation_Selectors} (NOT
1084 \p{Variation_Selector} NOR \p{Is_VS})
1085 (16)
1086 \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
1087 (240)
1088 \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR
1089 \p{Is_Warang_Citi}) (96)
1090 \p{Block: Yi_Radicals} (64)
1091 \p{Block: Yi_Syllables} (1168)
1092 \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
1093 \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64)
1094 X \p{Block_Elements} \p{Block=Block_Elements} (32)
1095 \p{Bopo} \p{Bopomofo} (= \p{Script_Extensions=
1096 Bopomofo}) (NOT \p{Block=Bopomofo}) (110)
1097 \p{Bopomofo} \p{Script_Extensions=Bopomofo} (Short:
1098 \p{Bopo}; NOT \p{Block=Bopomofo}) (110)
1099 X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
1100 Bopomofo_Extended}) (32)
1101 X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
1102 \p{InBopomofoExt}) (32)
1103 X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
1104 \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
1105 \p{Brah} \p{Brahmi} (= \p{Script_Extensions=
1106 Brahmi}) (NOT \p{Block=Brahmi}) (109)
1107 \p{Brahmi} \p{Script_Extensions=Brahmi} (Short:
1108 \p{Brah}; NOT \p{Block=Brahmi}) (109)
1109 \p{Brai} \p{Braille} (= \p{Script_Extensions=
1110 Braille}) (256)
1111 \p{Braille} \p{Script_Extensions=Braille} (Short:
1112 \p{Brai}) (256)
1113 X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
1114 \p{InBraille}) (256)
1115 \p{Bugi} \p{Buginese} (= \p{Script_Extensions=
1116 Buginese}) (NOT \p{Block=Buginese}) (31)
1117 \p{Buginese} \p{Script_Extensions=Buginese} (Short:
1118 \p{Bugi}; NOT \p{Block=Buginese}) (31)
1119 \p{Buhd} \p{Buhid} (= \p{Script_Extensions=Buhid})
1120 (NOT \p{Block=Buhid}) (22)
1121 \p{Buhid} \p{Script_Extensions=Buhid} (Short:
1122 \p{Buhd}; NOT \p{Block=Buhid}) (22)
1123 X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
1124 Byzantine_Musical_Symbols}) (256)
1125 X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1126 (Short: \p{InByzantineMusic}) (256)
1127 \p{C} \pC \p{Other} (= \p{General_Category=Other})
1128 (986_091 plus all above-Unicode code
1129 points)
1130 \p{Cakm} \p{Chakma} (= \p{Script_Extensions=
1131 Chakma}) (NOT \p{Block=Chakma}) (87)
1132 \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1133 (Short: \p{Cans}) (710)
1134 X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
1135 (= \p{Block=
1136 Unified_Canadian_Aboriginal_Syllabics})
1137 (640)
1138 T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1139 Not_Reordered} (1_113_298 plus all
1140 above-Unicode code points)
1141 T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1142 Overlay} (32)
1143 T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1144 Nukta} (22)
1145 T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1146 Kana_Voicing} (2)
1147 T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1148 Virama} (47)
1149 T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1150 CCC10} (1)
1151 T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1152 CCC11} (1)
1153 T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1154 CCC12} (1)
1155 T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1156 CCC13} (1)
1157 T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1158 CCC14} (1)
1159 T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1160 CCC15} (1)
1161 T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1162 CCC16} (1)
1163 T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1164 CCC17} (1)
1165 T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1166 CCC18} (2)
1167 T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1168 CCC19} (2)
1169 T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1170 CCC20} (1)
1171 T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1172 CCC21} (1)
1173 T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1174 CCC22} (1)
1175 T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1176 CCC23} (1)
1177 T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1178 CCC24} (1)
1179 T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1180 CCC25} (1)
1181 T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1182 CCC26} (1)
1183 T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1184 CCC27} (2)
1185 T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1186 CCC28} (2)
1187 T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1188 CCC29} (2)
1189 T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1190 CCC30} (2)
1191 T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1192 CCC31} (2)
1193 T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1194 CCC32} (2)
1195 T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1196 CCC33} (1)
1197 T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1198 CCC34} (1)
1199 T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1200 CCC35} (1)
1201 T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1202 CCC36} (1)
1203 T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1204 CCC84} (1)
1205 T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1206 CCC91} (1)
1207 T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1208 CCC103} (2)
1209 T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1210 CCC107} (4)
1211 T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1212 CCC118} (2)
1213 T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1214 CCC122} (4)
1215 T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1216 CCC129} (1)
1217 T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1218 CCC130} (6)
1219 T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1220 CCC132} (1)
1221 T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1222 CCC133} (0)
1223 T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1224 Attached_Below_Left} (0)
1225 T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1226 Attached_Below} (5)
1227 T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1228 Attached_Above} (1)
1229 T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1230 Attached_Above_Right} (9)
1231 T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1232 Below_Left} (1)
1233 T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1234 Below} (153)
1235 T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1236 Below_Right} (4)
1237 T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1238 Left} (2)
1239 T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1240 Right} (1)
1241 T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1242 Above_Left} (3)
1243 T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1244 Above} (461)
1245 T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1246 Above_Right} (4)
1247 T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1248 Double_Below} (4)
1249 T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1250 Double_Above} (5)
1251 T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1252 Iota_Subscript} (1)
1253 \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1254 Above} (461)
1255 \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (461)
1256 \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
1257 \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
1258 \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1259 Above_Left} (3)
1260 \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1261 Above_Right} (4)
1262 \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1263 Attached_Above} (1)
1264 \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1265 Attached_Above_Right} (9)
1266 \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1267 Attached_Below} (5)
1268 \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1269 Attached_Below_Left} (0)
1270 \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1271 (1)
1272 \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1273 \p{Ccc=ATAR}) (9)
1274 \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1275 (5)
1276 \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1277 ATBL}) (0)
1278 \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1279 Below} (153)
1280 \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (153)
1281 \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
1282 \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
1283 \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1284 Below_Left} (1)
1285 \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1286 Below_Right} (4)
1287 \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1)
1288 \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2)
1289 \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4)
1290 \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1)
1291 \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2)
1292 \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1)
1293 \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4)
1294 \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1)
1295 \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1)
1296 \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6)
1297 \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1)
1298 \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1299 \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1)
1300 \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1)
1301 \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1)
1302 \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1)
1303 \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2)
1304 \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2)
1305 \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1)
1306 \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1)
\p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1)
\p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1)
\p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1)
\p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1)
\p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1)
\p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2)
\p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2)
\p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2)
\p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2)
\p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2)
\p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2)
\p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1)
\p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1)
\p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1)
\p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1)
\p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1)
\p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1)
\p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
Double_Above} (5)
\p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
Double_Below} (4)
\p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
\p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (4)
\p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
(1)
\p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
Iota_Subscript} (1)
\p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
\p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
Kana_Voicing} (2)
\p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
Left} (2)
\p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
\p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
Nukta} (22)
\p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
(1_113_298 plus all above-Unicode code
points)
\p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
Not_Reordered} (1_113_298 plus all
above-Unicode code points)
\p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (22)
\p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
Overlay} (32)
\p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32)
\p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
Right} (1)
\p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
\p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (47)
\p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
Virama} (47)
\p{Cans} \p{Canadian_Aboriginal} (=
\p{Script_Extensions=
Canadian_Aboriginal}) (710)
\p{Cari} \p{Carian} (= \p{Script_Extensions=
Carian}) (NOT \p{Block=Carian}) (49)
\p{Carian} \p{Script_Extensions=Carian} (Short:
\p{Cari}; NOT \p{Block=Carian}) (49)
\p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2240)
\p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_872 plus
all above-Unicode code points)
\p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2240)
\p{Cased} \p{Cased=Y} (4105)
\p{Cased: N*} (Single: \P{Cased}) (1_110_007 plus all
above-Unicode code points)
\p{Cased: Y*} (Single: \p{Cased}) (4105)
\p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
\p{LC}) (3796)
\p{Category: *} \p{General_Category: *}
\p{Caucasian_Albanian} \p{Script_Extensions=Caucasian_Albanian}
(Short: \p{Aghb}; NOT \p{Block=
Caucasian_Albanian}) (53)
\p{Cc} \p{XPosixCntrl} (= \p{General_Category=
Control}) (65)
\p{Ccc: *} \p{Canonical_Combining_Class: *}
\p{CE} \p{Composition_Exclusion} (=
\p{Composition_Exclusion=Y}) (81)
\p{CE: *} \p{Composition_Exclusion: *}
\p{Cf} \p{Format} (= \p{General_Category=Format})
(151)
\p{Chakma} \p{Script_Extensions=Chakma} (Short:
\p{Cakm}; NOT \p{Block=Chakma}) (87)
\p{Cham} \p{Script_Extensions=Cham} (NOT \p{Block=
Cham}) (83)
\p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
\p{CWCF}) (1377)
\p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
(1_112_735 plus all above-Unicode code
points)
\p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
(1377)
\p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
\p{CWCM}) (2669)
\p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
(1_111_443 plus all above-Unicode code
points)
\p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
(2669)
\p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
\p{CWL}) (1304)
\p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
(1_112_808 plus all above-Unicode code
points)
\p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1304)
\p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
Y} (Short: \p{CWKCF}) (10_227)
\p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
\P{CWKCF}) (1_103_885 plus all above-
Unicode code points)
\p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
\p{CWKCF}) (10_227)
\p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
\p{CWT}) (1369)
\p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
(1_112_743 plus all above-Unicode code
points)
\p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1369)
\p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
\p{CWU}) (1396)
\p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
(1_112_716 plus all above-Unicode code
points)
\p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1396)
\p{Cher} \p{Cherokee} (= \p{Script_Extensions=
Cherokee}) (NOT \p{Block=Cherokee}) (172)
\p{Cherokee} \p{Script_Extensions=Cherokee} (Short:
\p{Cher}; NOT \p{Block=Cherokee}) (172)
X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block=
Cherokee_Supplement}) (80)
X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
\p{InCherokeeSup}) (80)
\p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
Y}) (2240)
\p{CI: *} \p{Case_Ignorable: *}
X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
CJK_Unified_Ideographs}) (20_992)
X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
CJK_Compatibility}) (256)
X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
CJK_Compatibility_Forms}) (32)
X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
\p{Block=CJK_Compatibility_Ideographs})
(512)
X \p{CJK_Compat_Ideographs_Sup}
\p{CJK_Compatibility_Ideographs_-
Supplement} (= \p{Block=
CJK_Compatibility_Ideographs_-
Supplement}) (544)
X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
\p{InCJKCompat}) (256)
X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
(Short: \p{InCJKCompatForms}) (32)
X \p{CJK_Compatibility_Ideographs} \p{Block=
CJK_Compatibility_Ideographs} (Short:
\p{InCJKCompatIdeographs}) (512)
X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
CJK_Compatibility_Ideographs_Supplement}
(Short: \p{InCJKCompatIdeographsSup})
(544)
X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
\p{Block=
CJK_Unified_Ideographs_Extension_A})
(6592)
X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
\p{Block=
CJK_Unified_Ideographs_Extension_B})
(42_720)
X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
\p{Block=
CJK_Unified_Ideographs_Extension_C})
(4160)
X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
\p{Block=
CJK_Unified_Ideographs_Extension_D})
(224)
X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (=
\p{Block=
CJK_Unified_Ideographs_Extension_E})
(5776)
X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
CJK_Radicals_Supplement}) (128)
X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
(Short: \p{InCJKRadicalsSup}) (128)
X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
\p{Block=CJK_Symbols_And_Punctuation})
(64)
X \p{CJK_Symbols_And_Punctuation} \p{Block=
CJK_Symbols_And_Punctuation} (Short:
\p{InCJKSymbols}) (64)
X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
(Short: \p{InCJK}) (20_992)
X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
CJK_Unified_Ideographs_Extension_A}
(Short: \p{InCJKExtA}) (6592)
X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
CJK_Unified_Ideographs_Extension_B}
(Short: \p{InCJKExtB}) (42_720)
X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
CJK_Unified_Ideographs_Extension_C}
(Short: \p{InCJKExtC}) (4160)
X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
CJK_Unified_Ideographs_Extension_D}
(Short: \p{InCJKExtD}) (224)
X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
CJK_Unified_Ideographs_Extension_E}
(Short: \p{InCJKExtE}) (5776)
\p{Close_Punctuation} \p{General_Category=Close_Punctuation}
(Short: \p{Pe}) (73)
\p{Cn} \p{Unassigned} (= \p{General_Category=
Unassigned}) (846_359 plus all above-
Unicode code points)
\p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
Control}) (65)
\p{Co} \p{Private_Use} (= \p{General_Category=
Private_Use}) (NOT \p{Private_Use_Area})
(137_468)
X \p{Combining_Diacritical_Marks} \p{Block=
Combining_Diacritical_Marks} (Short:
\p{InDiacriticals}) (112)
X \p{Combining_Diacritical_Marks_Extended} \p{Block=
Combining_Diacritical_Marks_Extended}
(Short: \p{InDiacriticalsExt}) (80)
X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
Combining_Diacritical_Marks_For_Symbols}
(Short: \p{InDiacriticalsForSymbols})
(48)
X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
Combining_Diacritical_Marks_Supplement}
(Short: \p{InDiacriticalsSup}) (64)
X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
\p{InHalfMarks}) (16)
\p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
(2097)
X \p{Combining_Marks_For_Symbols}
\p{Combining_Diacritical_Marks_For_-
Symbols} (= \p{Block=
Combining_Diacritical_Marks_For_-
Symbols}) (48)
\p{Common} \p{Script_Extensions=Common} (Short:
\p{Zyyy}) (6864)
X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
(Short: \p{InIndicNumberForms}) (16)
\p{Comp_Ex} \p{Full_Composition_Exclusion} (=
\p{Full_Composition_Exclusion=Y}) (1120)
\p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
Hangul_Compatibility_Jamo}) (96)
\p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
\p{CE}) (81)
\p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
plus all above-Unicode code points)
\p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
\p{Connector_Punctuation} \p{General_Category=
Connector_Punctuation} (Short: \p{Pc})
(10)
\p{Control} \p{XPosixCntrl} (= \p{General_Category=
Control}) (65)
X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
\p{Copt} \p{Coptic} (= \p{Script_Extensions=
Coptic}) (NOT \p{Block=Coptic}) (165)
\p{Coptic} \p{Script_Extensions=Coptic} (Short:
\p{Copt}; NOT \p{Block=Coptic}) (165)
X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
Counting_Rod_Numerals}) (32)
X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
\p{InCountingRod}) (32)
\p{Cprt} \p{Cypriot} (= \p{Script_Extensions=
Cypriot}) (112)
\p{Cs} \p{Surrogate} (= \p{General_Category=
Surrogate}) (2048)
\p{Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
\p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
\p{Block=
Cuneiform_Numbers_And_Punctuation}) (128)
X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
Cuneiform_Numbers_And_Punctuation}
(Short: \p{InCuneiformNumbers}) (128)
\p{Currency_Symbol} \p{General_Category=Currency_Symbol}
(Short: \p{Sc}) (53)
X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
\p{CWCF} \p{Changes_When_Casefolded} (=
\p{Changes_When_Casefolded=Y}) (1377)
\p{CWCF: *} \p{Changes_When_Casefolded: *}
\p{CWCM} \p{Changes_When_Casemapped} (=
\p{Changes_When_Casemapped=Y}) (2669)
\p{CWCM: *} \p{Changes_When_Casemapped: *}
\p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
\p{Changes_When_NFKC_Casefolded=Y})
(10_227)
\p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
\p{CWL} \p{Changes_When_Lowercased} (=
\p{Changes_When_Lowercased=Y}) (1304)
\p{CWL: *} \p{Changes_When_Lowercased: *}
\p{CWT} \p{Changes_When_Titlecased} (=
\p{Changes_When_Titlecased=Y}) (1369)
\p{CWT: *} \p{Changes_When_Titlecased: *}
\p{CWU} \p{Changes_When_Uppercased} (=
\p{Changes_When_Uppercased=Y}) (1396)
\p{CWU: *} \p{Changes_When_Uppercased: *}
\p{Cypriot} \p{Script_Extensions=Cypriot} (Short:
\p{Cprt}) (112)
X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
\p{Cyrillic} \p{Script_Extensions=Cyrillic} (Short:
\p{Cyrl}; NOT \p{Block=Cyrillic}) (446)
X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
Cyrillic_Extended_A}) (32)
X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
Cyrillic_Extended_B}) (96)
X \p{Cyrillic_Ext_C} \p{Cyrillic_Extended_C} (= \p{Block=
Cyrillic_Extended_C}) (16)
X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
\p{InCyrillicExtA}) (32)
X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
\p{InCyrillicExtB}) (96)
X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
\p{InCyrillicExtC}) (16)
X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
Cyrillic_Supplement}) (48)
X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
\p{InCyrillicSup}) (48)
X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
Cyrillic_Supplement}) (48)
\p{Cyrl} \p{Cyrillic} (= \p{Script_Extensions=
Cyrillic}) (NOT \p{Block=Cyrillic}) (446)
\p{Dash} \p{Dash=Y} (28)
\p{Dash: N*} (Single: \P{Dash}) (1_114_084 plus all
above-Unicode code points)
\p{Dash: Y*} (Single: \p{Dash}) (28)
\p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
(Short: \p{Pd}) (24)
\p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category=
Decimal_Number}) (580)
\p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
(13_232)
\p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232)
\p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240)
\p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
\p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
\p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
\p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
\p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
\p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184)
\p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
\p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
\p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
\p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
\p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
\p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
\p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
\p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
\p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
\p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
\p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
\p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
\p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
Non_Canonical} (Perl extension) (3662)
\p{Decomposition_Type: Non_Canonical} Union of all non-canonical
decompositions (Short: \p{Dt=NonCanon})
(Perl extension) (3662)
\p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_218 plus
all above-Unicode code points)
\p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
\p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
\p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (285)
\p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (285)
\p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38)
\p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (152)
\p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (152)
\p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
\p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
\p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
\p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
Y} (Short: \p{DI}) (4173)
\p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
(1_109_939 plus all above-Unicode code
points)
\p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
(4173)
\p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (15)
\p{Dep: *} \p{Deprecated: *}
\p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (15)
\p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_097
plus all above-Unicode code points)
\p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (15)
\p{Deseret} \p{Script_Extensions=Deseret} (Short:
\p{Dsrt}) (80)
\p{Deva} \p{Devanagari} (= \p{Script_Extensions=
Devanagari}) (NOT \p{Block=Devanagari})
(210)
\p{Devanagari} \p{Script_Extensions=Devanagari} (Short:
\p{Deva}; NOT \p{Block=Devanagari}) (210)
X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
Devanagari_Extended}) (32)
X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
\p{InDevanagariExt}) (32)
\p{DI} \p{Default_Ignorable_Code_Point} (=
\p{Default_Ignorable_Code_Point=Y})
(4173)
\p{DI: *} \p{Default_Ignorable_Code_Point: *}
\p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (782)
\p{Dia: *} \p{Diacritic: *}
\p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (782)
\p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_330
plus all above-Unicode code points)
\p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (782)
X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
\p{Block=Combining_Diacritical_Marks})
(112)
X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended}
(= \p{Block=
Combining_Diacritical_Marks_Extended})
(80)
X \p{Diacriticals_For_Symbols}
\p{Combining_Diacritical_Marks_For_-
Symbols} (= \p{Block=
Combining_Diacritical_Marks_For_-
Symbols}) (48)
X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
(= \p{Block=
Combining_Diacritical_Marks_Supplement})
(64)
\p{Digit} \p{XPosixDigit} (= \p{General_Category=
Decimal_Number}) (580)
X \p{Dingbats} \p{Block=Dingbats} (192)
X \p{Domino} \p{Domino_Tiles} (= \p{Block=
Domino_Tiles}) (112)
X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
\p{InDomino}) (112)
\p{Dsrt} \p{Deseret} (= \p{Script_Extensions=
Deseret}) (80)
\p{Dt: *} \p{Decomposition_Type: *}
\p{Dupl} \p{Duployan} (= \p{Script_Extensions=
Duployan}) (NOT \p{Block=Duployan}) (147)
\p{Duployan} \p{Script_Extensions=Duployan} (Short:
\p{Dupl}; NOT \p{Block=Duployan}) (147)
\p{Ea: *} \p{East_Asian_Width: *}
X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
(208)
\p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
\p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739)
\p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
\p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
\p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
\p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
\p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (794_146 plus
all above-Unicode code points)
\p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
\p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
\p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (794_146 plus all
above-Unicode code points)
\p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (180_889)
\p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (180_889)
\p{Egyp} \p{Egyptian_Hieroglyphs} (=
\p{Script_Extensions=
Egyptian_Hieroglyphs}) (NOT \p{Block=
Egyptian_Hieroglyphs}) (1071)
\p{Egyptian_Hieroglyphs} \p{Script_Extensions=
Egyptian_Hieroglyphs} (Short: \p{Egyp};
NOT \p{Block=Egyptian_Hieroglyphs})
(1071)
\p{Elba} \p{Elbasan} (= \p{Script_Extensions=
Elbasan}) (NOT \p{Block=Elbasan}) (40)
\p{Elbasan} \p{Script_Extensions=Elbasan} (Short:
\p{Elba}; NOT \p{Block=Elbasan}) (40)
X \p{Emoticons} \p{Block=Emoticons} (80)
X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
Enclosed_Alphanumerics}) (160)
X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
\p{Block=
Enclosed_Alphanumeric_Supplement}) (256)
X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
Enclosed_Alphanumeric_Supplement}
(Short: \p{InEnclosedAlphanumSup}) (256)
X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
(Short: \p{InEnclosedAlphanum}) (160)
X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
\p{Block=
Enclosed_CJK_Letters_And_Months}) (256)
X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
Enclosed_CJK_Letters_And_Months} (Short:
\p{InEnclosedCJK}) (256)
X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
(= \p{Block=
Enclosed_Ideographic_Supplement}) (256)
X \p{Enclosed_Ideographic_Supplement} \p{Block=
Enclosed_Ideographic_Supplement} (Short:
\p{InEnclosedIdeographicSup}) (256)
\p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
(Short: \p{Me}) (13)
\p{Ethi} \p{Ethiopic} (= \p{Script_Extensions=
Ethiopic}) (NOT \p{Block=Ethiopic}) (495)
\p{Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
\p{Ethi}; NOT \p{Block=Ethiopic}) (495)
X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
Ethiopic_Extended}) (96)
X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
Ethiopic_Extended_A}) (48)
X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
\p{InEthiopicExt}) (96)
X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
\p{InEthiopicExtA}) (48)
X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
Ethiopic_Supplement}) (32)
X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
\p{InEthiopicSup}) (32)
\p{Ext} \p{Extender} (= \p{Extender=Y}) (42)
\p{Ext: *} \p{Extender: *}
\p{Extender} \p{Extender=Y} (Short: \p{Ext}) (42)
\p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_070
plus all above-Unicode code points)
\p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (42)
\p{Final_Punctuation} \p{General_Category=Final_Punctuation}
(Short: \p{Pf}) (10)
\p{Format} \p{General_Category=Format} (Short:
\p{Cf}) (151)
\p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
(Short: \p{CompEx}) (1120)
\p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
\P{CompEx}) (1_112_992 plus all above-
Unicode code points)
\p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
\p{CompEx}) (1120)
\p{Gc: *} \p{General_Category: *}
\p{GCB: *} \p{Grapheme_Cluster_Break: *}
\p{General_Category: C} \p{General_Category=Other} (986_091 plus
all above-Unicode code points)
\p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
\p{Gc=LC}, \p{LC}) (3796)
\p{General_Category: Cc} \p{General_Category=Control} (65)
\p{General_Category: Cf} \p{General_Category=Format} (151)
\p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
(73)
\p{General_Category: Cn} \p{General_Category=Unassigned} (846_359
plus all above-Unicode code points)
\p{General_Category: Cntrl} \p{General_Category=Control} (65)
\p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
\p{General_Category: Combining_Mark} \p{General_Category=Mark}
(2097)
\p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
\p{Pc}) (10)
\p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
\p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
\p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
(53)
\p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
(24)
\p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
(580)
\p{General_Category: Digit} \p{General_Category=Decimal_Number}
(580)
\p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
(13)
\p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
(10)
\p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (151)
\p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
\p{Pi}) (12)
\p{General_Category: L} \p{General_Category=Letter} (116_766)
X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3796)
X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
the trailing '_' matters in spite of
loose matching rules. (3796)
\p{General_Category: LC} \p{General_Category=Cased_Letter} (3796)
\p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (116_766)
\p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
(236)
\p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
\p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
(/i= General_Category=Cased_Letter)
(2063)
\p{General_Category: Lm} \p{General_Category=Modifier_Letter} (249)
\p{General_Category: Lo} \p{General_Category=Other_Letter}
(112_721)
\p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
/i= General_Category=Cased_Letter) (2063)
\p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
(/i= General_Category=Cased_Letter) (31)
\p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
(/i= General_Category=Cased_Letter)
(1702)
\p{General_Category: M} \p{General_Category=Mark} (2097)
\p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2097)
\p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948)
\p{General_Category: Mc} \p{General_Category=Spacing_Mark} (394)
\p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
\p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
(1690)
\p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
(249)
\p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
(121)
\p{General_Category: N} \p{General_Category=Number} (1492)
\p{General_Category: Nd} \p{General_Category=Decimal_Number} (580)
\p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
\p{General_Category: No} \p{General_Category=Other_Number} (676)
\p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
(1690)
\p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1492)
\p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
(75)
\p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (986_091 plus
all above-Unicode code points)
\p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
(112_721)
\p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (676)
\p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
(544)
\p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
(5777)
\p{General_Category: P} \p{General_Category=Punctuation} (748)
\p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
\p{Zp}) (1)
\p{General_Category: Pc} \p{General_Category=
Connector_Punctuation} (10)
\p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
\p{General_Category: Pe} \p{General_Category=Close_Punctuation}
(73)
\p{General_Category: Pf} \p{General_Category=Final_Punctuation}
(10)
\p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
(12)
\p{General_Category: Po} \p{General_Category=Other_Punctuation}
(544)
\p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
(137_468)
\p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
\p{General_Category: Punct} \p{General_Category=Punctuation} (748)
\p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (748)
\p{General_Category: S} \p{General_Category=Symbol} (6899)
\p{General_Category: Sc} \p{General_Category=Currency_Symbol} (53)
\p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19)
\p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121)
\p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
\p{General_Category: So} \p{General_Category=Other_Symbol} (5777)
\p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
(17)
\p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (394)
\p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048)
\p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (6899)
\p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
/i= General_Category=Cased_Letter) (31)
\p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
(846_359 plus all above-Unicode code
points)
\p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
/i= General_Category=Cased_Letter) (1702)
\p{General_Category: Z} \p{General_Category=Separator} (19)
\p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
\p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
(1)
\p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
\p{InPunctuation}) (112)
X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
\p{Block=Geometric_Shapes_Extended})
(128)
X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
(Short: \p{InGeometricShapesExt}) (128)
\p{Geor} \p{Georgian} (= \p{Script_Extensions=
Georgian}) (NOT \p{Block=Georgian}) (129)
\p{Georgian} \p{Script_Extensions=Georgian} (Short:
\p{Geor}; NOT \p{Block=Georgian}) (129)
X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
Georgian_Supplement}) (48)
X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
\p{InGeorgianSup}) (48)
\p{Glag} \p{Glagolitic} (= \p{Script_Extensions=
Glagolitic}) (NOT \p{Block=Glagolitic})
(136)
\p{Glagolitic} \p{Script_Extensions=Glagolitic} (Short:
\p{Glag}; NOT \p{Block=Glagolitic}) (136)
X \p{Glagolitic_Sup} \p{Glagolitic_Supplement} (= \p{Block=
Glagolitic_Supplement}) (48)
X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
\p{InGlagoliticSup}) (48)
\p{Goth} \p{Gothic} (= \p{Script_Extensions=
Gothic}) (NOT \p{Block=Gothic}) (27)
\p{Gothic} \p{Script_Extensions=Gothic} (Short:
\p{Goth}; NOT \p{Block=Gothic}) (27)
\p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
(126_288)
\p{Gr_Base: *} \p{Grapheme_Base: *}
\p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
Y}) (1828)
\p{Gr_Ext: *} \p{Grapheme_Extend: *}
\p{Gran} \p{Grantha} (= \p{Script_Extensions=
Grantha}) (NOT \p{Block=Grantha}) (113)
\p{Grantha} \p{Script_Extensions=Grantha} (Short:
\p{Gran}; NOT \p{Block=Grantha}) (113)
\p{Graph} \p{XPosixGraph} (265_621)
\p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
(126_288)
\p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase}) (987_824
plus all above-Unicode code points)
\p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase}) (126_288)
\p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
(5925)
\p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (5925)
\p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
\p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (79)
\p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (4)
\p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (5)
\p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
(79)
\p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
E_Base_GAZ} (4)
\p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
E_Modifier} (5)
\p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
(1828)
\p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1828)
\p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
Glue_After_Zwj} (3)
\p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (3)
\p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
\p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
\p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
\p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
\p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_356
plus all above-Unicode code points)
\p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
(13)
\p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (13)
\p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
(26)
\p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
Regional_Indicator} (26)
\p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
SpacingMark} (341)
\p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (341)
\p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
\p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
\p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
(1_094_356 plus all above-Unicode code
points)
\p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1)
\p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
(1828)
\p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_284
plus all above-Unicode code points)
\p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1828)
\p{Greek} \p{Script_Extensions=Greek} (Short:
\p{Grek}; NOT \p{Greek_And_Coptic}) (522)
X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
\p{InGreek}) (144)
X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
Greek_Extended}) (256)
X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
2059 \p{InGreekExt}) (256)
2060 \p{Grek} \p{Greek} (= \p{Script_Extensions=Greek})
2061 (NOT \p{Greek_And_Coptic}) (522)
2062 \p{Gujarati} \p{Script_Extensions=Gujarati} (Short:
2063 \p{Gujr}; NOT \p{Block=Gujarati}) (99)
2064 \p{Gujr} \p{Gujarati} (= \p{Script_Extensions=
2065 Gujarati}) (NOT \p{Block=Gujarati}) (99)
2066 \p{Gurmukhi} \p{Script_Extensions=Gurmukhi} (Short:
2067 \p{Guru}; NOT \p{Block=Gurmukhi}) (93)
2068 \p{Guru} \p{Gurmukhi} (= \p{Script_Extensions=
2069 Gurmukhi}) (NOT \p{Block=Gurmukhi}) (93)
2070 X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
2071 \p{Block=Halfwidth_And_Fullwidth_Forms})
2072 (240)
2073 X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
2074 Combining_Half_Marks}) (16)
2075 X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
2076 Halfwidth_And_Fullwidth_Forms} (Short:
2077 \p{InHalfAndFullForms}) (240)
2078 \p{Han} \p{Script_Extensions=Han} (82_013)
2079 \p{Hang} \p{Hangul} (= \p{Script_Extensions=
2080 Hangul}) (NOT \p{Hangul_Syllables})
2081 (11_775)
2082 \p{Hangul} \p{Script_Extensions=Hangul} (Short:
2083 \p{Hang}; NOT \p{Hangul_Syllables})
2084 (11_775)
2085 X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
2086 (Short: \p{InCompatJamo}) (96)
2087 X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
2088 (256)
2089 X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
2090 (Short: \p{InJamoExtA}) (32)
2091 X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
2092 (Short: \p{InJamoExtB}) (80)
2093 \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
2094 (125)
2095 \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125)
2096 \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
2097 (399)
2098 \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399)
2099 \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
2100 LVT_Syllable} (10_773)
2101 \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
2102 (10_773)
2103 \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
2104 Not_Applicable} (1_102_583 plus all
2105 above-Unicode code points)
2106 \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
2107 (1_102_583 plus all above-Unicode code
2108 points)
2109 \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
2110 (137)
2111 \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137)
2112 \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
2113 (95)
2114 \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95)
2115 X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
2116 \p{InHangul}) (11_184)
2117 \p{Hani} \p{Han} (= \p{Script_Extensions=Han})
2118 (82_013)
2119 \p{Hano} \p{Hanunoo} (= \p{Script_Extensions=
2120 Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
2121 \p{Hanunoo} \p{Script_Extensions=Hanunoo} (Short:
2122 \p{Hano}; NOT \p{Block=Hanunoo}) (23)
2123 \p{Hatr} \p{Hatran} (= \p{Script_Extensions=
2124 Hatran}) (NOT \p{Block=Hatran}) (26)
2125 \p{Hatran} \p{Script_Extensions=Hatran} (Short:
2126 \p{Hatr}; NOT \p{Block=Hatran}) (26)
2127 \p{Hebr} \p{Hebrew} (= \p{Script_Extensions=
2128 Hebrew}) (NOT \p{Block=Hebrew}) (133)
2129 \p{Hebrew} \p{Script_Extensions=Hebrew} (Short:
2130 \p{Hebr}; NOT \p{Block=Hebrew}) (133)
2131 \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2132 \p{Hex: *} \p{Hex_Digit: *}
2133 \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
2134 \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
2135 plus all above-Unicode code points)
2136 \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44)
2137 X \p{High_Private_Use_Surrogates} \p{Block=
2138 High_Private_Use_Surrogates} (Short:
2139 \p{InHighPUSurrogates}) (128)
2140 X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
2141 \p{Block=High_Private_Use_Surrogates})
2142 (128)
2143 X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
2144 \p{Hira} \p{Hiragana} (= \p{Script_Extensions=
2145 Hiragana}) (NOT \p{Block=Hiragana}) (143)
2146 \p{Hiragana} \p{Script_Extensions=Hiragana} (Short:
2147 \p{Hira}; NOT \p{Block=Hiragana}) (143)
2148 \p{Hluw} \p{Anatolian_Hieroglyphs} (=
2149 \p{Script_Extensions=
2150 Anatolian_Hieroglyphs}) (NOT \p{Block=
2151 Anatolian_Hieroglyphs}) (583)
2152 \p{Hmng} \p{Pahawh_Hmong} (= \p{Script_Extensions=
2153 Pahawh_Hmong}) (NOT \p{Block=
2154 Pahawh_Hmong}) (127)
2155 \p{HorizSpace} \p{XPosixBlank} (18)
2156 \p{Hst: *} \p{Hangul_Syllable_Type: *}
2157 \p{Hung} \p{Old_Hungarian} (= \p{Script_Extensions=
2158 Old_Hungarian}) (NOT \p{Block=
2159 Old_Hungarian}) (108)
2160 D \p{Hyphen} \p{Hyphen=Y} (11)
2161 D \p{Hyphen: N*} Supplanted by Line_Break property values;
2162 see www.unicode.org/reports/tr14
2163 (Single: \P{Hyphen}) (1_114_101 plus all
2164 above-Unicode code points)
2165 D \p{Hyphen: Y*} Supplanted by Line_Break property values;
2166 see www.unicode.org/reports/tr14
2167 (Single: \p{Hyphen}) (11)
2168 \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
2169 \p{Ideographic_Description_Characters})
2170 (119_691)
2171 \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (994_421 plus
2172 all above-Unicode code points)
2173 \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (119_691)
2174 \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (117_007)
2175 \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (997_105 plus
2176 all above-Unicode code points)
2177 \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (117_007)
2178 \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
2179 \p{Ideographic_Description_Characters})
2180 (119_691)
2181 \p{IDC: *} \p{ID_Continue: *}
2182 \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
2183 (88_284)
2184 \p{Ideo: *} \p{Ideographic: *}
2185 \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
2186 (88_284)
2187 \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_025_828
2188 plus all above-Unicode code points)
2189 \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (88_284)
2190 X \p{Ideographic_Description_Characters} \p{Block=
2191 Ideographic_Description_Characters}
2192 (Short: \p{InIDC}) (16)
2193 X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
2194 \p{Block=
2195 Ideographic_Symbols_And_Punctuation})
2196 (32)
2197 X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
2198 Ideographic_Symbols_And_Punctuation}
2199 (Short: \p{InIdeographicSymbols}) (32)
2200 \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (117_007)
2201 \p{IDS: *} \p{ID_Start: *}
2202 \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
2203 \p{IDSB}) (10)
2204 \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
2205 (1_114_102 plus all above-Unicode code
2206 points)
2207 \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10)
2208 \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
2209 \p{IDST}) (2)
2210 \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
2211 (1_114_110 plus all above-Unicode code
2212 points)
2213 \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2)
2214 \p{IDSB} \p{IDS_Binary_Operator} (=
2215 \p{IDS_Binary_Operator=Y}) (10)
2216 \p{IDSB: *} \p{IDS_Binary_Operator: *}
2217 \p{IDST} \p{IDS_Trinary_Operator} (=
2218 \p{IDS_Trinary_Operator=Y}) (2)
2219 \p{IDST: *} \p{IDS_Trinary_Operator: *}
2220 \p{Imperial_Aramaic} \p{Script_Extensions=Imperial_Aramaic}
2221 (Short: \p{Armi}; NOT \p{Block=
2222 Imperial_Aramaic}) (31)
2223 \p{In: *} \p{Present_In: *} (Perl extension)
2224 X \p{In_*} \p{Block: *}
2225 X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
2226 Common_Indic_Number_Forms}) (16)
2227 \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
2228 (300)
2229 \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
2230 BottomAndRight}) (2)
2231 \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (57)
2232 \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
2233 LeftAndRight}) (21)
2234 \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_113_069
2235 plus all above-Unicode code points)
2236 \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
2237 Overstruck}) (10)
2238 \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (258)
2239 \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (342)
2240 \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
2241 TopAndBottom}) (10)
2242 \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
2243 \p{InPC=TopAndBottomAndRight}) (1)
2244 \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
2245 TopAndLeft}) (6)
2246 \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
2247 \p{InPC=TopAndLeftAndRight}) (4)
2248 \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
2249 TopAndRight}) (13)
2250 \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
2251 VisualOrderLeft}) (19)
2252 \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
2253 (15)
2254 \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (67)
2255 \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
2256 \p{InSC=BrahmiJoiningNumber}) (20)
2257 \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
2258 CantillationMark}) (53)
2259 \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
2260 (1907)
2261 \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
2262 ConsonantDead}) (10)
2263 \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
2264 ConsonantFinal}) (62)
2265 \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
2266 \p{InSC=ConsonantHeadLetter}) (5)
2267 \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
2268 ConsonantKiller}) (2)
2269 \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
2270 ConsonantMedial}) (22)
2271 \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
2272 \p{InSC=ConsonantPlaceholder}) (16)
2273 \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
2274 \p{InSC=ConsonantPrecedingRepha}) (1)
2275 \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
2276 ConsonantPrefixed}) (2)
2277 \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
2278 ConsonantSubjoined}) (90)
2279 \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
2280 \p{InSC=ConsonantSucceedingRepha}) (4)
2281 \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
2282 \p{InSC=ConsonantWithStacker}) (4)
2283 \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
2284 GeminationMark}) (2)
2285 \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
2286 InvisibleStacker}) (7)
2287 \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1)
2288 \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
2289 ModifyingLetter}) (1)
2290 \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
2291 NonJoiner}) (1)
2292 \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (24)
2293 \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (459)
2294 \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
2295 NumberJoiner}) (1)
2296 \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
2297 (1_110_129 plus all above-Unicode code
2298 points)
2299 \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
2300 PureKiller}) (16)
2301 \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
2302 RegisterShifter}) (2)
2303 \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
2304 SyllableModifier}) (22)
2305 \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
2306 ToneLetter}) (7)
2307 \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
2308 (42)
2309 \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (24)
2310 \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga}) (31)
2311 \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30)
2312 \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
2313 VowelDependent}) (602)
2314 \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
2315 VowelIndependent}) (431)
2316 \p{Inherited} \p{Script_Extensions=Inherited} (Short:
2317 \p{Zinh}) (496)
2318 \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
2319 (Short: \p{Pi}) (12)
2320 \p{InPC: *} \p{Indic_Positional_Category: *}
2321 \p{InSC: *} \p{Indic_Syllabic_Category: *}
2322 \p{Inscriptional_Pahlavi} \p{Script_Extensions=
2323 Inscriptional_Pahlavi} (Short: \p{Phli};
2324 NOT \p{Block=Inscriptional_Pahlavi}) (27)
2325 \p{Inscriptional_Parthian} \p{Script_Extensions=
2326 Inscriptional_Parthian} (Short:
2327 \p{Prti}; NOT \p{Block=
2328 Inscriptional_Parthian}) (30)
2329 X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
2330 IPA_Extensions}) (96)
2331 X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
2332 \p{InIPAExt}) (96)
2333 \p{Is_*} \p{*} (Any exceptions are individually
2334 noted beginning with the word NOT.) If
2335 an entry has flag(s) at its beginning,
2336 like "D", the "Is_" form has the same
2337 flag(s).
2338 \p{Ital} \p{Old_Italic} (= \p{Script_Extensions=
2339 Old_Italic}) (NOT \p{Block=Old_Italic})
2340 (36)
2341 X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
2342 (256)
2343 X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
2344 Hangul_Jamo_Extended_A}) (32)
2345 X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
2346 Hangul_Jamo_Extended_B}) (80)
2347 \p{Java} \p{Javanese} (= \p{Script_Extensions=
2348 Javanese}) (NOT \p{Block=Javanese}) (91)
2349 \p{Javanese} \p{Script_Extensions=Javanese} (Short:
2350 \p{Java}; NOT \p{Block=Javanese}) (91)
2351 \p{Jg: *} \p{Joining_Group: *}
2352 \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
2353 \p{Join_C: *} \p{Join_Control: *}
2354 \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
2355 \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
2356 plus all above-Unicode code points)
2357 \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2)
2358 \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1)
2359 \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1)
2360 \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (1)
2361 \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (8)
2362 \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1)
2363 \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10)
2364 \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (24)
2365 \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2)
2366 \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
2367 BurushaskiYehBarree}) (2)
2368 \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15)
2369 \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4)
2370 \p{Joining_Group: E} (Short: \p{Jg=E}) (1)
2371 \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7)
2372 \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1)
2373 \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10)
2374 \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1)
2375 \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (14)
2376 \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3)
2377 \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (18)
2378 \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
2379 HamzaOnHehGoal}) (1)
2380 \p{Joining_Group: He} (Short: \p{Jg=He}) (1)
2381 \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1)
2382 \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2)
2383 \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1)
2384 \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6)
2385 \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1)
2386 \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1)
2387 \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2)
2388 \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (7)
2389 \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1)
2390 \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
2391 ManichaeanAleph}) (1)
2392 \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
2393 (2)
2394 \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
2395 (2)
2396 \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
2397 ManichaeanDaleth}) (1)
2398 \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
2399 ManichaeanDhamedh}) (1)
2400 \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
2401 (1)
2402 \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
2403 ManichaeanGimel}) (2)
2404 \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
2405 (1)
2406 \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
2407 ManichaeanHundred}) (1)
2408 \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
2409 (3)
2410 \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
2411 ManichaeanLamedh}) (1)
2412 \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem}) (1)
2413 \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun}) (1)
2414 \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne}) (1)
2415 \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2)
2416 \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
2417 (3)
2418 \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
2419 (1)
2420 \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
2421 ManichaeanSadhe}) (1)
2422 \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
2423 ManichaeanSamekh}) (1)
2424 \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw}) (1)
2425 \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen}) (1)
2426 \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
2427 (1)
2428 \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
2429 ManichaeanThamedh}) (1)
2430 \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
2431 ManichaeanTwenty}) (1)
2432 \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw}) (1)
2433 \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
2434 (1)
2435 \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
2436 ManichaeanZayin}) (2)
2437 \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4)
2438 \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1)
2439 \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
2440 (1_113_818 plus all above-Unicode code
2441 points)
2442 \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8)
2443 \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1)
2444 \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1)
2445 \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1)
2446 \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5)
2447 \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1)
2448 \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (19)
2449 \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1)
2450 \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1)
2451 \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6)
2452 \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1)
2453 \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11)
2454 \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1)
2455 \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1)
2456 \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1)
2457 \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1)
2458 \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1)
2459 \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4)
2460 \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1)
2461 \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3)
2462 \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
2463 Hamza_On_Heh_Goal} (1)
2464 \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2)
2465 \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16)
2466 \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (11)
2467 \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2)
2468 \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1)
2469 \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1)
2470 \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1)
2471 \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1)
2472 \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1)
2473 \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (4)
2474 \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (501)
2475 \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (501)
2476 \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4)
2477 \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (3)
2478 \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (3)
2479 \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_653 plus
2480 all above-Unicode code points)
2481 \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (112)
2482 \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (112)
2483 \p{Joining_Type: T} \p{Joining_Type=Transparent} (1839)
2484 \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1839)
2485 \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_653
2486 plus all above-Unicode code points)
2487 \p{Jt: *} \p{Joining_Type: *}
2488 \p{Kaithi} \p{Script_Extensions=Kaithi} (Short:
2489 \p{Kthi}; NOT \p{Block=Kaithi}) (86)
2490 \p{Kali} \p{Kayah_Li} (= \p{Script_Extensions=
2491 Kayah_Li}) (48)
2492 \p{Kana} \p{Katakana} (= \p{Script_Extensions=
2493 Katakana}) (NOT \p{Block=Katakana}) (352)
2494 X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
2495 Kana_Supplement}) (256)
2496 X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
2497 \p{InKanaSup}) (256)
2498 X \p{Kanbun} \p{Block=Kanbun} (16)
2499 X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
2500 Kangxi_Radicals}) (224)
2501 X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
2502 \p{InKangxi}) (224)
2503 \p{Kannada} \p{Script_Extensions=Kannada} (Short:
2504 \p{Knda}; NOT \p{Block=Kannada}) (100)
2505 \p{Katakana} \p{Script_Extensions=Katakana} (Short:
2506 \p{Kana}; NOT \p{Block=Katakana}) (352)
2507 X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
2508 \p{Block=Katakana_Phonetic_Extensions})
2509 (16)
2510 X \p{Katakana_Phonetic_Extensions} \p{Block=
2511 Katakana_Phonetic_Extensions} (Short:
2512 \p{InKatakanaExt}) (16)
2513 \p{Kayah_Li} \p{Script_Extensions=Kayah_Li} (Short:
2514 \p{Kali}) (48)
2515 \p{Khar} \p{Kharoshthi} (= \p{Script_Extensions=
2516 Kharoshthi}) (NOT \p{Block=Kharoshthi})
2517 (65)
2518 \p{Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
2519 \p{Khar}; NOT \p{Block=Kharoshthi}) (65)
2520 \p{Khmer} \p{Script_Extensions=Khmer} (Short:
2521 \p{Khmr}; NOT \p{Block=Khmer}) (146)
2522 X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
2523 \p{Khmr} \p{Khmer} (= \p{Script_Extensions=Khmer})
2524 (NOT \p{Block=Khmer}) (146)
2525 \p{Khoj} \p{Khojki} (= \p{Script_Extensions=
2526 Khojki}) (NOT \p{Block=Khojki}) (72)
2527 \p{Khojki} \p{Script_Extensions=Khojki} (Short:
2528 \p{Khoj}; NOT \p{Block=Khojki}) (72)
2529 \p{Khudawadi} \p{Script_Extensions=Khudawadi} (Short:
2530 \p{Sind}; NOT \p{Block=Khudawadi}) (81)
2531 \p{Knda} \p{Kannada} (= \p{Script_Extensions=
2532 Kannada}) (NOT \p{Block=Kannada}) (100)
2533 \p{Kthi} \p{Kaithi} (= \p{Script_Extensions=
2534 Kaithi}) (NOT \p{Block=Kaithi}) (86)
2535 \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
2536 (116_766)
2537 X \p{L&} \p{Cased_Letter} (= \p{General_Category=
2538 Cased_Letter}) (3796)
2539 X \p{L_} \p{Cased_Letter} (= \p{General_Category=
2540 Cased_Letter}) Note the trailing '_'
2541 matters in spite of loose matching
2542 rules. (3796)
2543 \p{Lana} \p{Tai_Tham} (= \p{Script_Extensions=
2544 Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
2545 \p{Lao} \p{Script_Extensions=Lao} (NOT \p{Block=
2546 Lao}) (67)
2547 \p{Laoo} \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
2548 \p{Block=Lao}) (67)
2549 \p{Latin} \p{Script_Extensions=Latin} (Short:
2550 \p{Latn}) (1370)
2551 X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
2552 Latin_1_Supplement}) (128)
2553 X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
2554 Latin_1_Supplement}) (128)
2555 X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
2556 \p{InLatin1}) (128)
2557 X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
2558 Latin_Extended_A}) (128)
2559 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
2560 \p{Block=Latin_Extended_Additional})
2561 (256)
2562 X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
2563 Latin_Extended_B}) (208)
2564 X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
2565 Latin_Extended_C}) (32)
2566 X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
2567 Latin_Extended_D}) (224)
2568 X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block=
2569 Latin_Extended_E}) (64)
2570 X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
2571 \p{InLatinExtA}) (128)
2572 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
2573 (Short: \p{InLatinExtAdditional}) (256)
2574 X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
2575 \p{InLatinExtB}) (208)
2576 X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
2577 \p{InLatinExtC}) (32)
2578 X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
2579 \p{InLatinExtD}) (224)
2580 X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short:
2581 \p{InLatinExtE}) (64)
2582 \p{Latn} \p{Latin} (= \p{Script_Extensions=Latin})
2583 (1370)
2584 \p{Lb: *} \p{Line_Break: *}
2585 \p{LC} \p{Cased_Letter} (= \p{General_Category=
2586 Cased_Letter}) (3796)
2587 \p{Lepc} \p{Lepcha} (= \p{Script_Extensions=
2588 Lepcha}) (NOT \p{Block=Lepcha}) (74)
2589 \p{Lepcha} \p{Script_Extensions=Lepcha} (Short:
2590 \p{Lepc}; NOT \p{Block=Lepcha}) (74)
2591 \p{Letter} \p{General_Category=Letter} (Short: \p{L})
2592 (116_766)
2593 \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
2594 \p{Nl}) (236)
2595 X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
2596 \p{Limb} \p{Limbu} (= \p{Script_Extensions=Limbu})
2597 (NOT \p{Block=Limbu}) (69)
2598 \p{Limbu} \p{Script_Extensions=Limbu} (Short:
2599 \p{Limb}; NOT \p{Block=Limbu}) (69)
2600 \p{Lina} \p{Linear_A} (= \p{Script_Extensions=
2601 Linear_A}) (NOT \p{Block=Linear_A}) (386)
2602 \p{Linb} \p{Linear_B} (= \p{Script_Extensions=
2603 Linear_B}) (268)
2604 \p{Line_Break: AI} \p{Line_Break=Ambiguous} (707)
2605 \p{Line_Break: AL} \p{Line_Break=Alphabetic} (19_523)
2606 \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (19_523)
2607 \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707)
2608 \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
2609 \p{Line_Break: BA} \p{Line_Break=Break_After} (218)
2610 \p{Line_Break: BB} \p{Line_Break=Break_Before} (37)
2611 \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
2612 \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (218)
2613 \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (37)
2614 \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3)
2615 \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1)
2616 \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1)
2617 \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
2618 \p{Line_Break: CJ} \p{Line_Break=
2619 Conditional_Japanese_Starter} (51)
2620 \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (90)
2621 \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2)
2622 \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (90)
2623 \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (2090)
2624 \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2090)
2625 \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (734)
2626 \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
2627 (51)
2628 \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1)
2629 \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
2630 \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
2631 \p{Line_Break: E_Base} (Short: \p{Lb=EB}) (83)
2632 \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5)
2633 \p{Line_Break: EB} \p{Line_Break=E_Base} (83)
2634 \p{Line_Break: EM} \p{Line_Break=E_Modifier} (5)
2635 \p{Line_Break: EX} \p{Line_Break=Exclamation} (37)
2636 \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37)
2637 \p{Line_Break: GL} \p{Line_Break=Glue} (18)
2638 \p{Line_Break: Glue} (Short: \p{Lb=GL}) (18)
2639 \p{Line_Break: H2} (Short: \p{Lb=H2}) (399)
2640 \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773)
2641 \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (74)
2642 \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (74)
2643 \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
2644 \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1)
2645 \p{Line_Break: ID} \p{Line_Break=Ideographic} (172_133)
2646 \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_133)
2647 \p{Line_Break: IN} \p{Line_Break=Inseparable} (6)
2648 \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13)
2649 \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6)
2650 \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
2651 \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
2652 \p{Line_Break: JL} (Short: \p{Lb=JL}) (125)
2653 \p{Line_Break: JT} (Short: \p{Lb=JT}) (137)
2654 \p{Line_Break: JV} (Short: \p{Lb=JV}) (95)
2655 \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
2656 \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1)
2657 \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4)
2658 \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1)
2659 \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
2660 \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (30)
2661 \p{Line_Break: NS} \p{Line_Break=Nonstarter} (30)
2662 \p{Line_Break: NU} \p{Line_Break=Numeric} (572)
2663 \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (572)
2664 \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (87)
2665 \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (87)
2666 \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (30)
2667 \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (30)
2668 \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (65)
2669 \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (65)
2670 \p{Line_Break: QU} \p{Line_Break=Quotation} (39)
2671 \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39)
2672 \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26)
2673 \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
2674 \p{Line_Break: SA} \p{Line_Break=Complex_Context} (734)
2675 D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
2676 \p{Line_Break: SP} \p{Line_Break=Space} (1)
2677 \p{Line_Break: Space} (Short: \p{Lb=SP}) (1)
2678 D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
2679 should never appear in well-formed text,
2680 and therefore shouldn't be the basis for
2681 line breaking (Short: \p{Lb=SG}) (2048)
2682 \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
2683 \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (903_847 plus all
2684 above-Unicode code points)
2685 \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
2686 \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2)
2687 \p{Line_Break: XX} \p{Line_Break=Unknown} (903_847 plus all
2688 above-Unicode code points)
2689 \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
2690 \p{Line_Break: ZWJ} (Short: \p{Lb=ZWJ}) (1)
2691 \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1)
2692 \p{Line_Separator} \p{General_Category=Line_Separator}
2693 (Short: \p{Zl}) (1)
2694 \p{Linear_A} \p{Script_Extensions=Linear_A} (Short:
2695 \p{Lina}; NOT \p{Block=Linear_A}) (386)
2696 \p{Linear_B} \p{Script_Extensions=Linear_B} (Short:
2697 \p{Linb}) (268)
2698 X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
2699 X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
2700 \p{Lisu} \p{Script_Extensions=Lisu} (48)
2701 \p{Ll} \p{Lowercase_Letter} (=
2702 \p{General_Category=Lowercase_Letter})
2703 (/i= General_Category=Cased_Letter)
2704 (2063)
2705 \p{Lm} \p{Modifier_Letter} (=
2706 \p{General_Category=Modifier_Letter})
2707 (249)
2708 \p{Lo} \p{Other_Letter} (= \p{General_Category=
2709 Other_Letter}) (112_721)
2710 \p{LOE} \p{Logical_Order_Exception} (=
2711 \p{Logical_Order_Exception=Y}) (19)
2712 \p{LOE: *} \p{Logical_Order_Exception: *}
2713 \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
2714 \p{LOE}) (19)
2715 \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
2716 (1_114_093 plus all above-Unicode code
2717 points)
2718 \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19)
2719 X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
2720 \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
2721 Cased=Yes) (2252)
2722 \p{Lower: *} \p{Lowercase: *}
2723 \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
2724 Cased=Yes) (2252)
2725 \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
2726 No) (1_111_860 plus all above-Unicode
2727 code points)
2728 \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
2729 Yes) (2252)
2730 \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
2731 (Short: \p{Ll}; /i= General_Category=
2732 Cased_Letter) (2063)
2733 \p{Lt} \p{Titlecase_Letter} (=
2734 \p{General_Category=Titlecase_Letter})
2735 (/i= General_Category=Cased_Letter) (31)
2736 \p{Lu} \p{Uppercase_Letter} (=
2737 \p{General_Category=Uppercase_Letter})
2738 (/i= General_Category=Cased_Letter)
2739 (1702)
2740 \p{Lyci} \p{Lycian} (= \p{Script_Extensions=
2741 Lycian}) (NOT \p{Block=Lycian}) (29)
2742 \p{Lycian} \p{Script_Extensions=Lycian} (Short:
2743 \p{Lyci}; NOT \p{Block=Lycian}) (29)
2744 \p{Lydi} \p{Lydian} (= \p{Script_Extensions=
2745 Lydian}) (NOT \p{Block=Lydian}) (27)
2746 \p{Lydian} \p{Script_Extensions=Lydian} (Short:
2747 \p{Lydi}; NOT \p{Block=Lydian}) (27)
2748 \p{M} \pM \p{Mark} (= \p{General_Category=Mark})
2749 (2097)
2750 \p{Mahajani} \p{Script_Extensions=Mahajani} (Short:
2751 \p{Mahj}; NOT \p{Block=Mahajani}) (61)
2752 \p{Mahj} \p{Mahajani} (= \p{Script_Extensions=
2753 Mahajani}) (NOT \p{Block=Mahajani}) (61)
2754 X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
2755 Mahjong_Tiles}) (48)
2756 X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
2757 \p{InMahjong}) (48)
2758 \p{Malayalam} \p{Script_Extensions=Malayalam} (Short:
2759 \p{Mlym}; NOT \p{Block=Malayalam}) (119)
2760 \p{Mand} \p{Mandaic} (= \p{Script_Extensions=
2761 Mandaic}) (NOT \p{Block=Mandaic}) (30)
2762 \p{Mandaic} \p{Script_Extensions=Mandaic} (Short:
2763 \p{Mand}; NOT \p{Block=Mandaic}) (30)
2764 \p{Mani} \p{Manichaean} (= \p{Script_Extensions=
2765 Manichaean}) (NOT \p{Block=Manichaean})
2766 (52)
2767 \p{Manichaean} \p{Script_Extensions=Manichaean} (Short:
2768 \p{Mani}; NOT \p{Block=Manichaean}) (52)
2769 \p{Marc} \p{Marchen} (= \p{Script_Extensions=
2770 Marchen}) (NOT \p{Block=Marchen}) (68)
2771 \p{Marchen} \p{Script_Extensions=Marchen} (Short:
2772 \p{Marc}; NOT \p{Block=Marchen}) (68)
2773 \p{Mark} \p{General_Category=Mark} (Short: \p{M})
2774 (2097)
2775 \p{Math} \p{Math=Y} (2310)
2776 \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all
2777 above-Unicode code points)
2778 \p{Math: Y*} (Single: \p{Math}) (2310)
2779 X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
2780 \p{Block=
2781 Mathematical_Alphanumeric_Symbols})
2782 (1024)
2783 X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
2784 Mathematical_Operators}) (256)
2785 \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
2786 \p{Sm}) (948)
2787 X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
2788 Mathematical_Alphanumeric_Symbols}
2789 (Short: \p{InMathAlphanum}) (1024)
2790 X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
2791 (Short: \p{InMathOperators}) (256)
2792 \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
2793 Spacing_Mark}) (394)
2794 \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
2795 Enclosing_Mark}) (13)
2796 \p{Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} (Short:
2797 \p{Mtei}; NOT \p{Block=Meetei_Mayek})
2798 (79)
2799 X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
2800 Meetei_Mayek_Extensions}) (32)
2801 X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
2802 (Short: \p{InMeeteiMayekExt}) (32)
2803 \p{Mend} \p{Mende_Kikakui} (= \p{Script_Extensions=
2804 Mende_Kikakui}) (NOT \p{Block=
2805 Mende_Kikakui}) (213)
2806 \p{Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
2807 (Short: \p{Mend}; NOT \p{Block=
2808 Mende_Kikakui}) (213)
2809 \p{Merc} \p{Meroitic_Cursive} (=
2810 \p{Script_Extensions=Meroitic_Cursive})
2811 (NOT \p{Block=Meroitic_Cursive}) (90)
2812 \p{Mero} \p{Meroitic_Hieroglyphs} (=
2813 \p{Script_Extensions=
2814 Meroitic_Hieroglyphs}) (32)
2815 \p{Meroitic_Cursive} \p{Script_Extensions=Meroitic_Cursive}
2816 (Short: \p{Merc}; NOT \p{Block=
2817 Meroitic_Cursive}) (90)
2818 \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
2819 Meroitic_Hieroglyphs} (Short: \p{Mero})
2820 (32)
2821 \p{Miao} \p{Script_Extensions=Miao} (NOT \p{Block=
2822 Miao}) (133)
2823 X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
2824 \p{Block=
2825 Miscellaneous_Symbols_And_Arrows}) (256)
2826 X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
2827 (= \p{Block=
2828 Miscellaneous_Mathematical_Symbols_A})
2829 (48)
2830 X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
2831 (= \p{Block=
2832 Miscellaneous_Mathematical_Symbols_B})
2833 (128)
2834 X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
2835 (= \p{Block=
2836 Miscellaneous_Symbols_And_Pictographs})
2837 (768)
2838 X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
2839 Miscellaneous_Symbols}) (256)
2840 X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
2841 Miscellaneous_Technical}) (256)
2842 X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
2843 Miscellaneous_Mathematical_Symbols_A}
2844 (Short: \p{InMiscMathSymbolsA}) (48)
2845 X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
2846 Miscellaneous_Mathematical_Symbols_B}
2847 (Short: \p{InMiscMathSymbolsB}) (128)
2848 X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
2849 \p{InMiscSymbols}) (256)
2850 X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
2851 Miscellaneous_Symbols_And_Arrows}
2852 (Short: \p{InMiscArrows}) (256)
2853 X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
2854 Miscellaneous_Symbols_And_Pictographs}
2855 (Short: \p{InMiscPictographs}) (768)
2856 X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
2857 (Short: \p{InMiscTechnical}) (256)
2858 \p{Mlym} \p{Malayalam} (= \p{Script_Extensions=
2859 Malayalam}) (NOT \p{Block=Malayalam})
2860 (119)
2861 \p{Mn} \p{Nonspacing_Mark} (=
2862 \p{General_Category=Nonspacing_Mark})
2863 (1690)
2864 \p{Modi} \p{Script_Extensions=Modi} (NOT \p{Block=
2865 Modi}) (89)
2866 \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
2867 (Short: \p{Lm}) (249)
2868 X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
2869 Spacing_Modifier_Letters}) (80)
2870 \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
2871 (Short: \p{Sk}) (121)
2872 X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
2873 \p{Mong} \p{Mongolian} (= \p{Script_Extensions=
2874 Mongolian}) (NOT \p{Block=Mongolian})
2875 (169)
2876 \p{Mongolian} \p{Script_Extensions=Mongolian} (Short:
2877 \p{Mong}; NOT \p{Block=Mongolian}) (169)
2878 X \p{Mongolian_Sup} \p{Mongolian_Supplement} (= \p{Block=
2879 Mongolian_Supplement}) (32)
2880 X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
2881 \p{InMongolianSup}) (32)
2882 \p{Mro} \p{Script_Extensions=Mro} (NOT \p{Block=
2883 Mro}) (43)
2884 \p{Mroo} \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
2885 \p{Block=Mro}) (43)
2886 \p{Mtei} \p{Meetei_Mayek} (= \p{Script_Extensions=
2887 Meetei_Mayek}) (NOT \p{Block=
2888 Meetei_Mayek}) (79)
2889 \p{Mult} \p{Multani} (= \p{Script_Extensions=
2890 Multani}) (NOT \p{Block=Multani}) (48)
2891 \p{Multani} \p{Script_Extensions=Multani} (Short:
2892 \p{Mult}; NOT \p{Block=Multani}) (48)
2893 X \p{Music} \p{Musical_Symbols} (= \p{Block=
2894 Musical_Symbols}) (256)
2895 X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
2896 \p{InMusic}) (256)
2897 \p{Myanmar} \p{Script_Extensions=Myanmar} (Short:
2898 \p{Mymr}; NOT \p{Block=Myanmar}) (224)
2899 X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
2900 Myanmar_Extended_A}) (32)
2901 X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block=
2902 Myanmar_Extended_B}) (32)
2903 X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
2904 \p{InMyanmarExtA}) (32)
2905 X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short:
2906 \p{InMyanmarExtB}) (32)
2907 \p{Mymr} \p{Myanmar} (= \p{Script_Extensions=
2908 Myanmar}) (NOT \p{Block=Myanmar}) (224)
2909 \p{N} \pN \p{Number} (= \p{General_Category=Number})
2910 (1492)
2911 \p{Nabataean} \p{Script_Extensions=Nabataean} (Short:
2912 \p{Nbat}; NOT \p{Block=Nabataean}) (40)
2913 \p{Narb} \p{Old_North_Arabian} (=
2914 \p{Script_Extensions=Old_North_Arabian})
2915 (32)
2916 X \p{NB} \p{No_Block} (= \p{Block=No_Block})
2917 (842_320 plus all above-Unicode code
2918 points)
2919 \p{Nbat} \p{Nabataean} (= \p{Script_Extensions=
2920 Nabataean}) (NOT \p{Block=Nabataean})
2921 (40)
2922 \p{NChar} \p{Noncharacter_Code_Point} (=
2923 \p{Noncharacter_Code_Point=Y}) (66)
2924 \p{NChar: *} \p{Noncharacter_Code_Point: *}
2925 \p{Nd} \p{XPosixDigit} (= \p{General_Category=
2926 Decimal_Number}) (580)
2927 \p{New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
2928 \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
2929 \p{Newa} \p{Script_Extensions=Newa} (NOT \p{Block=
2930 Newa}) (92)
2931 \p{NFC_QC: *} \p{NFC_Quick_Check: *}
2932 \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (110)
2933 \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110)
2934 \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
2935 \P{NFC_Quick_Check} NOR \P{NFC_QC})
2936 (1120)
2937 \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
2938 \P{NFC_Quick_Check} NOR \P{NFC_QC})
2939 (1120)
2940 \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
2941 \p{NFC_Quick_Check} NOR \p{NFC_QC})
2942 (1_112_882 plus all above-Unicode code
2943 points)
2944 \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
2945 \p{NFC_Quick_Check} NOR \p{NFC_QC})
2946 (1_112_882 plus all above-Unicode code
2947 points)
2948 \p{NFD_QC: *} \p{NFD_Quick_Check: *}
2949 \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
2950 \P{NFD_Quick_Check} NOR \P{NFD_QC})
2951 (13_232)
2952 \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
2953 \P{NFD_Quick_Check} NOR \P{NFD_QC})
2954 (13_232)
2955 \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
2956 \p{NFD_Quick_Check} NOR \p{NFD_QC})
2957 (1_100_880 plus all above-Unicode code
2958 points)
2959 \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
2960 \p{NFD_Quick_Check} NOR \p{NFD_QC})
2961 (1_100_880 plus all above-Unicode code
2962 points)
2963 \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
2964 \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
2965 \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110)
2966 \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
2967 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
2968 (4794)
2969 \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
2970 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
2971 (4794)
2972 \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
2973 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
2974 (1_109_208 plus all above-Unicode code
2975 points)
2976 \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
2977 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
2978 (1_109_208 plus all above-Unicode code
2979 points)
2980 \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
2981 \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
2982 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
2983 (16_894)
2984 \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
2985 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
2986 (16_894)
2987 \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
2988 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
2989 (1_097_218 plus all above-Unicode code
2990 points)
2991 \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
2992 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
2993 (1_097_218 plus all above-Unicode code
2994 points)
2995 \p{Nko} \p{Script_Extensions=Nko} (NOT \p{NKo})
2996 (59)
2997 \p{Nkoo} \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
2998 \p{NKo}) (59)
2999 \p{Nl} \p{Letter_Number} (= \p{General_Category=
3000 Letter_Number}) (236)
3001 \p{No} \p{Other_Number} (= \p{General_Category=
3002 Other_Number}) (676)
3003 X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
3004 (842_320 plus all above-Unicode code
3005 points)
3006 \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
3007 \p{NChar}) (66)
3008 \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
3009 (1_114_046 plus all above-Unicode code
3010 points)
3011 \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
3012 (66)
3013 \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
3014 (Short: \p{Mn}) (1690)
3015 \p{Nt: *} \p{Numeric_Type: *}
3016 \p{Number} \p{General_Category=Number} (Short: \p{N})
3017 (1492)
3018 X \p{Number_Forms} \p{Block=Number_Forms} (64)
3019 \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (580)
3020 \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (580)
3021 \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
3022 \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128)
3023 \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_539 plus all
3024 above-Unicode code points)
3025 \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (865)
3026 \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (865)
3027 T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1)
3028 T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (74)
3029 T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (1)
3030 T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (1)
3031 T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (1)
3032 T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (1)
3033 T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (4)
3034 T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1)
3035 T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (2)
3036 T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1)
3037 T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (6)
3038 T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1)
3039 T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (1)
3040 T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (3)
3041 T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (4)
3042 T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (2)
3043 T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (12)
3044 T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (6)
3045 T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1)
3046 T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1)
3047 T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1)
3048 T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (13)
3049 T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1)
3050 T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1)
3051 T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1)
3052 T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (7)
3053 T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (7)
3054 T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1)
3055 T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (3)
3056 T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1)
3057 T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1)
3058 T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (121)
3059 T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1)
3060 T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (121)
3061 T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1)
3062 T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (123)
3063 T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1)
3064 T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (115)
3065 T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1)
3066 T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (113)
3067 T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
3068 T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (100)
3069 T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
3070 T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (99)
3071 T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
3072 T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (95)
3073 T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
3074 T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (99)
3075 T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (54)
3076 T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (6)
3077 T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (6)
3078 T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (4)
3079 T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (4)
3080 T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (4)
3081 T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (5)
3082 T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (5)
3083 T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (5)
3084 T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (5)
3085 T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (31)
3086 T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1)
3087 T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1)
3088 T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1)
3089 T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1)
3090 T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1)
3091 T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1)
3092 T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1)
3093 T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1)
3094 T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1)
3095 T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (16)
3096 T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1)
3097 T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1)
3098 T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1)
3099 T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1)
3100 T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1)
3101 T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1)
3102 T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1)
3103 T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1)
3104 T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1)
3105 T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (16)
3106 T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1)
3107 T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1)
3108 T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1)
3109 T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1)
3110 T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1)
3111 T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1)
3112 T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1)
3113 T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1)
3114 T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1)
3115 T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (27)
3116 T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (11)
3117 T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (11)
3118 T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (10)
3119 T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (10)
3120 T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (30)
3121 T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (4)
3122 T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (5)
3123 T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (4)
3124 T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (14)
3125 T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (4)
3126 T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (4)
3127 T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (4)
3128 T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (5)
3129 T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (20)
3130 T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (2)
3131 T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (2)
3132 T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (2)
3133 T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (6)
3134 T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (2)
3135 T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (2)
3136 T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (2)
3137 T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (2)
3138 T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (9)
3139 T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (2)
3140 T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (2)
3141 T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (2)
3142 T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (5)
3143 T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (2)
3144 T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (2)
3145 T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (2)
3146 T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (2)
3147 T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (2)
3148 T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (1)
3149 T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1)
3150 T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1)
3151 T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1)
3152 T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1)
3153 T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1)
3154 T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1)
3155 T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1)
3156 T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1)
3157 T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1)
3158 T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1)
3159 T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
3160 (3)
3161 T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
3162 10000000000}) (1)
3163 T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
3164 1000000000000}) (2)
3165 \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_539 plus all
3166 above-Unicode code points)
3167 \p{Nv: *} \p{Numeric_Value: *}
3168 X \p{OCR} \p{Optical_Character_Recognition} (=
3169 \p{Block=Optical_Character_Recognition})
3170 (32)
3171 \p{Ogam} \p{Ogham} (= \p{Script_Extensions=Ogham})
3172 (NOT \p{Block=Ogham}) (29)
3173 \p{Ogham} \p{Script_Extensions=Ogham} (Short:
3174 \p{Ogam}; NOT \p{Block=Ogham}) (29)
3175 \p{Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short:
3176 \p{Olck}) (48)
3177 \p{Olck} \p{Ol_Chiki} (= \p{Script_Extensions=
3178 Ol_Chiki}) (48)
3179 \p{Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
3180 (Short: \p{Hung}; NOT \p{Block=
3181 Old_Hungarian}) (108)
3182 \p{Old_Italic} \p{Script_Extensions=Old_Italic} (Short:
3183 \p{Ital}; NOT \p{Block=Old_Italic}) (36)
3184 \p{Old_North_Arabian} \p{Script_Extensions=Old_North_Arabian}
3185 (Short: \p{Narb}) (32)
3186 \p{Old_Permic} \p{Script_Extensions=Old_Permic} (Short:
3187 \p{Perm}; NOT \p{Block=Old_Permic}) (44)
3188 \p{Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
3189 \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
3190 \p{Old_South_Arabian} \p{Script_Extensions=Old_South_Arabian}
3191 (Short: \p{Sarb}) (32)
3192 \p{Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short:
3193 \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
3194 \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
3195 (Short: \p{Ps}) (75)
3196 X \p{Optical_Character_Recognition} \p{Block=
3197 Optical_Character_Recognition} (Short:
3198 \p{InOCR}) (32)
3199 \p{Oriya} \p{Script_Extensions=Oriya} (Short:
3200 \p{Orya}; NOT \p{Block=Oriya}) (94)
3201 \p{Orkh} \p{Old_Turkic} (= \p{Script_Extensions=
3202 Old_Turkic}) (NOT \p{Block=Old_Turkic})
3203 (73)
3204 X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
3205 \p{Orya} \p{Oriya} (= \p{Script_Extensions=Oriya})
3206 (NOT \p{Block=Oriya}) (94)
3207 \p{Osage} \p{Script_Extensions=Osage} (Short:
3208 \p{Osge}; NOT \p{Block=Osage}) (72)
3209 \p{Osge} \p{Osage} (= \p{Script_Extensions=Osage})
3210 (NOT \p{Block=Osage}) (72)
3211 \p{Osma} \p{Osmanya} (= \p{Script_Extensions=
3212 Osmanya}) (NOT \p{Block=Osmanya}) (40)
3213 \p{Osmanya} \p{Script_Extensions=Osmanya} (Short:
3214 \p{Osma}; NOT \p{Block=Osmanya}) (40)
3215 \p{Other} \p{General_Category=Other} (Short: \p{C})
3216 (986_091 plus all above-Unicode code
3217 points)
3218 \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
3219 \p{Lo}) (112_721)
3220 \p{Other_Number} \p{General_Category=Other_Number} (Short:
3221 \p{No}) (676)
3222 \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
3223 (Short: \p{Po}) (544)
3224 \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
3225 \p{So}) (5777)
3226 \p{P} \pP \p{Punct} (= \p{General_Category=
3227 Punctuation}) (NOT
3228 \p{General_Punctuation}) (748)
3229 \p{Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} (Short:
3230 \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
3231 (127)
3232 \p{Palm} \p{Palmyrene} (= \p{Script_Extensions=
3233 Palmyrene}) (32)
3234 \p{Palmyrene} \p{Script_Extensions=Palmyrene} (Short:
3235 \p{Palm}) (32)
3236 \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
3237 (Short: \p{Zp}) (1)
3238 \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
3239 Y}) (2760)
3240 \p{Pat_Syn: *} \p{Pattern_Syntax: *}
3241 \p{Pat_WS} \p{Pattern_White_Space} (=
3242 \p{Pattern_White_Space=Y}) (11)
3243 \p{Pat_WS: *} \p{Pattern_White_Space: *}
3244 \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
3245 (2760)
3246 \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
3247 (1_111_352 plus all above-Unicode code
3248 points)
3249 \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
3250 \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
3251 \p{PatWS}) (11)
3252 \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
3253 (1_114_101 plus all above-Unicode code
3254 points)
3255 \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
3256 \p{Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
3257 \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
3258 \p{Pauc} \p{Pau_Cin_Hau} (= \p{Script_Extensions=
3259 Pau_Cin_Hau}) (NOT \p{Block=
3260 Pau_Cin_Hau}) (57)
3261 \p{Pc} \p{Connector_Punctuation} (=
3262 \p{General_Category=
3263 Connector_Punctuation}) (10)
3264 \p{PCM} \p{Prepended_Concatenation_Mark} (=
3265 \p{Prepended_Concatenation_Mark=Y}) (10)
3266 \p{PCM: *} \p{Prepended_Concatenation_Mark: *}
3267 \p{Pd} \p{Dash_Punctuation} (=
3268 \p{General_Category=Dash_Punctuation})
3269 (24)
3270 \p{Pe} \p{Close_Punctuation} (=
3271 \p{General_Category=Close_Punctuation})
3272 (73)
3273 \p{PerlSpace} \p{PosixSpace} (6)
3274 \p{PerlWord} \p{PosixWord} (63)
3275 \p{Perm} \p{Old_Permic} (= \p{Script_Extensions=
3276 Old_Permic}) (NOT \p{Block=Old_Permic})
3277 (44)
3278 \p{Pf} \p{Final_Punctuation} (=
3279 \p{General_Category=Final_Punctuation})
3280 (10)
3281 \p{Phag} \p{Phags_Pa} (= \p{Script_Extensions=
3282 Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
3283 \p{Phags_Pa} \p{Script_Extensions=Phags_Pa} (Short:
3284 \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
3285 X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
3286 Phaistos_Disc}) (48)
3287 X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
3288 \p{InPhaistos}) (48)
3289 \p{Phli} \p{Inscriptional_Pahlavi} (=
3290 \p{Script_Extensions=
3291 Inscriptional_Pahlavi}) (NOT \p{Block=
3292 Inscriptional_Pahlavi}) (27)
3293 \p{Phlp} \p{Psalter_Pahlavi} (=
3294 \p{Script_Extensions=Psalter_Pahlavi})
3295 (NOT \p{Block=Psalter_Pahlavi}) (30)
3296 \p{Phnx} \p{Phoenician} (= \p{Script_Extensions=
3297 Phoenician}) (NOT \p{Block=Phoenician})
3298 (29)
3299 \p{Phoenician} \p{Script_Extensions=Phoenician} (Short:
3300 \p{Phnx}; NOT \p{Block=Phoenician}) (29)
3301 X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
3302 Phonetic_Extensions}) (128)
3303 X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
3304 \p{Block=
3305 Phonetic_Extensions_Supplement}) (64)
3306 X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
3307 \p{InPhoneticExt}) (128)
3308 X \p{Phonetic_Extensions_Supplement} \p{Block=
3309 Phonetic_Extensions_Supplement} (Short:
3310 \p{InPhoneticExtSup}) (64)
3311 \p{Pi} \p{Initial_Punctuation} (=
3312 \p{General_Category=
3313 Initial_Punctuation}) (12)
3314 X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
3315 \p{Plrd} \p{Miao} (= \p{Script_Extensions=Miao})
3316 (NOT \p{Block=Miao}) (133)
3317 \p{Po} \p{Other_Punctuation} (=
3318 \p{General_Category=Other_Punctuation})
3319 (544)
3320 \p{PosixAlnum} [A-Za-z0-9] (62)
3321 \p{PosixAlpha} [A-Za-z] (52)
3322 \p{PosixBlank} \t and ' ' (2)
3323 \p{PosixCntrl} ASCII control characters: NUL, SOH, STX,
3324 ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT,
3325 FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4,
3326 NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS,
3327 GS, RS, US, and DEL (33)
3328 \p{PosixDigit} [0-9] (10)
3329 \p{PosixGraph} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~0-9A-Za-z]
3330 (94)
3331 \p{PosixLower} [a-z] (/i= PosixAlpha) (26)
3332 \p{PosixPrint} [- 0-9A-Za-z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
3333 (95)
3334 \p{PosixPunct} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~] (32)
3335 \p{PosixSpace} \t, \n, \cK, \f, \r, and ' '. (\cK is
3336 vertical tab) (Short: \p{PerlSpace}) (6)
3337 \p{PosixUpper} [A-Z] (/i= PosixAlpha) (26)
3338 \p{PosixWord} \w, restricted to ASCII = [A-Za-z0-9_]
3339 (Short: \p{PerlWord}) (63)
3340 \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} [0-9A-Fa-f] (Short:
3341 \p{AHex}) (22)
3342 \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
3343 Y} (Short: \p{PCM}) (10)
3344 \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
3345 (1_114_102 plus all above-Unicode code
3346 points)
3347 \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
3348 (10)
3349 T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
3350 extension) (33_979)
3351 T \p{Present_In: 2.0} Code point's usage introduced in version
3352 2.0 or earlier (Short: \p{In=2.0}) (Perl
3353 extension) (178_500)
3354 T \p{Present_In: 2.1} Code point's usage introduced in version
3355 2.1 or earlier (Short: \p{In=2.1}) (Perl
3356 extension) (178_502)
3357 T \p{Present_In: 3.0} Code point's usage introduced in version
3358 3.0 or earlier (Short: \p{In=3.0}) (Perl
3359 extension) (188_809)
3360 T \p{Present_In: 3.1} Code point's usage introduced in version
3361 3.1 or earlier (Short: \p{In=3.1}) (Perl
3362 extension) (233_787)
3363 T \p{Present_In: 3.2} Code point's usage introduced in version
3364 3.2 or earlier (Short: \p{In=3.2}) (Perl
3365 extension) (234_803)
3366 T \p{Present_In: 4.0} Code point's usage introduced in version
3367 4.0 or earlier (Short: \p{In=4.0}) (Perl
3368 extension) (236_029)
3369 T \p{Present_In: 4.1} Code point's usage introduced in version
3370 4.1 or earlier (Short: \p{In=4.1}) (Perl
3371 extension) (237_302)
3372 T \p{Present_In: 5.0} Code point's usage introduced in version
3373 5.0 or earlier (Short: \p{In=5.0}) (Perl
3374 extension) (238_671)
3375 T \p{Present_In: 5.1} Code point's usage introduced in version
3376 5.1 or earlier (Short: \p{In=5.1}) (Perl
3377 extension) (240_295)
3378 T \p{Present_In: 5.2} Code point's usage introduced in version
3379 5.2 or earlier (Short: \p{In=5.2}) (Perl
3380 extension) (246_943)
3381 T \p{Present_In: 6.0} Code point's usage introduced in version
3382 6.0 or earlier (Short: \p{In=6.0}) (Perl
3383 extension) (249_031)
3384 T \p{Present_In: 6.1} Code point's usage introduced in version
3385 6.1 or earlier (Short: \p{In=6.1}) (Perl
3386 extension) (249_763)
3387 T \p{Present_In: 6.2} Code point's usage introduced in version
3388 6.2 or earlier (Short: \p{In=6.2}) (Perl
3389 extension) (249_764)
3390 T \p{Present_In: 6.3} Code point's usage introduced in version
3391 6.3 or earlier (Short: \p{In=6.3}) (Perl
3392 extension) (249_769)
3393 T \p{Present_In: 7.0} Code point's usage introduced in version
3394 7.0 or earlier (Short: \p{In=7.0}) (Perl
3395 extension) (252_603)
3396 T \p{Present_In: 8.0} Code point's usage introduced in version
3397 8.0 or earlier (Short: \p{In=8.0}) (Perl
3398 extension) (260_319)
3399 T \p{Present_In: 9.0} Code point's usage introduced in version
3400 9.0 or earlier (Short: \p{In=9.0}) (Perl
3401 extension) (267_819)
3402 \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
3403 Unassigned}) (Perl extension) (846_293
3404 plus all above-Unicode code points)
3405 \p{Print} \p{XPosixPrint} (265_638)
3406 \p{Private_Use} \p{General_Category=Private_Use} (Short:
3407 \p{Co}; NOT \p{Private_Use_Area})
3408 (137_468)
3409 X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
3410 \p{InPUA}) (6400)
3411 \p{Prti} \p{Inscriptional_Parthian} (=
3412 \p{Script_Extensions=
3413 Inscriptional_Parthian}) (NOT \p{Block=
3414 Inscriptional_Parthian}) (30)
3415 \p{Ps} \p{Open_Punctuation} (=
3416 \p{General_Category=Open_Punctuation})
3417 (75)
3418 \p{Psalter_Pahlavi} \p{Script_Extensions=Psalter_Pahlavi}
3419 (Short: \p{Phlp}; NOT \p{Block=
3420 Psalter_Pahlavi}) (30)
3421 X \p{PUA} \p{Private_Use_Area} (= \p{Block=
3422 Private_Use_Area}) (6400)
3423 \p{Punct} \p{General_Category=Punctuation} (Short:
3424 \p{P}; NOT \p{General_Punctuation}) (748)
3425 \p{Punctuation} \p{Punct} (= \p{General_Category=
3426 Punctuation}) (NOT
3427 \p{General_Punctuation}) (748)
3428 \p{Qaac} \p{Coptic} (= \p{Script_Extensions=
3429 Coptic}) (NOT \p{Block=Coptic}) (165)
3430 \p{Qaai} \p{Inherited} (= \p{Script_Extensions=
3431 Inherited}) (496)
3432 \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
3433 Y}) (30)
3434 \p{QMark: *} \p{Quotation_Mark: *}
3435 \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
3436 (30)
3437 \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_082
3438 plus all above-Unicode code points)
3439 \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30)
3440 \p{Radical} \p{Radical=Y} (329)
3441 \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all
3442 above-Unicode code points)
3443 \p{Radical: Y*} (Single: \p{Radical}) (329)
3444 \p{Rejang} \p{Script_Extensions=Rejang} (Short:
3445 \p{Rjng}; NOT \p{Block=Rejang}) (37)
3446 \p{Rjng} \p{Rejang} (= \p{Script_Extensions=
3447 Rejang}) (NOT \p{Block=Rejang}) (37)
3448 X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
3449 Rumi_Numeral_Symbols}) (32)
3450 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
3451 \p{InRumi}) (32)
3452 \p{Runic} \p{Script_Extensions=Runic} (Short:
3453 \p{Runr}; NOT \p{Block=Runic}) (86)
3454 \p{Runr} \p{Runic} (= \p{Script_Extensions=Runic})
3455 (NOT \p{Block=Runic}) (86)
3456 \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol})
3457 (6899)
3458 \p{Samaritan} \p{Script_Extensions=Samaritan} (Short:
3459 \p{Samr}; NOT \p{Block=Samaritan}) (61)
3460 \p{Samr} \p{Samaritan} (= \p{Script_Extensions=
3461 Samaritan}) (NOT \p{Block=Samaritan})
3462 (61)
3463 \p{Sarb} \p{Old_South_Arabian} (=
3464 \p{Script_Extensions=Old_South_Arabian})
3465 (32)
3466 \p{Saur} \p{Saurashtra} (= \p{Script_Extensions=
3467 Saurashtra}) (NOT \p{Block=Saurashtra})
3468 (82)
3469 \p{Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
3470 \p{Saur}; NOT \p{Block=Saurashtra}) (82)
3471 \p{SB: *} \p{Sentence_Break: *}
3472 \p{Sc} \p{Currency_Symbol} (=
3473 \p{General_Category=Currency_Symbol})
3474 (53)
3475 \p{Sc: *} \p{Script: *}
3476 \p{Script: Adlam} (Short: \p{Sc=Adlm}) (87)
3477 \p{Script: Adlm} \p{Script=Adlam} (87)
3478 \p{Script: Aghb} \p{Script=Caucasian_Albanian} (53)
3479 \p{Script: Ahom} (Short: \p{Sc=Ahom}) (57)
3480 \p{Script: Anatolian_Hieroglyphs} (Short: \p{Sc=Hluw}) (583)
3481 \p{Script: Arab} \p{Script=Arabic} (1279)
3482 \p{Script: Arabic} (Short: \p{Sc=Arab}) (1279)
3483 \p{Script: Armenian} (Short: \p{Sc=Armn}) (93)
3484 \p{Script: Armi} \p{Script=Imperial_Aramaic} (31)
3485 \p{Script: Armn} \p{Script=Armenian} (93)
3486 \p{Script: Avestan} (Short: \p{Sc=Avst}) (61)
3487 \p{Script: Avst} \p{Script=Avestan} (61)
3488 \p{Script: Bali} \p{Script=Balinese} (121)
3489 \p{Script: Balinese} (Short: \p{Sc=Bali}) (121)
3490 \p{Script: Bamu} \p{Script=Bamum} (657)
3491 \p{Script: Bamum} (Short: \p{Sc=Bamu}) (657)
3492 \p{Script: Bass} \p{Script=Bassa_Vah} (36)
3493 \p{Script: Bassa_Vah} (Short: \p{Sc=Bass}) (36)
3494 \p{Script: Batak} (Short: \p{Sc=Batk}) (56)
3495 \p{Script: Batk} \p{Script=Batak} (56)
3496 \p{Script: Beng} \p{Script=Bengali} (93)
3497 \p{Script: Bengali} (Short: \p{Sc=Beng}) (93)
3498 \p{Script: Bhaiksuki} (Short: \p{Sc=Bhks}) (97)
3499 \p{Script: Bhks} \p{Script=Bhaiksuki} (97)
3500 \p{Script: Bopo} \p{Script=Bopomofo} (70)
3501 \p{Script: Bopomofo} (Short: \p{Sc=Bopo}) (70)
3502 \p{Script: Brah} \p{Script=Brahmi} (109)
3503 \p{Script: Brahmi} (Short: \p{Sc=Brah}) (109)
3504 \p{Script: Brai} \p{Script=Braille} (256)
3505 \p{Script: Braille} (Short: \p{Sc=Brai}) (256)
3506 \p{Script: Bugi} \p{Script=Buginese} (30)
3507 \p{Script: Buginese} (Short: \p{Sc=Bugi}) (30)
3508 \p{Script: Buhd} \p{Script=Buhid} (20)
3509 \p{Script: Buhid} (Short: \p{Sc=Buhd}) (20)
3510 \p{Script: Cakm} \p{Script=Chakma} (67)
3511 \p{Script: Canadian_Aboriginal} (Short: \p{Sc=Cans}) (710)
3512 \p{Script: Cans} \p{Script=Canadian_Aboriginal} (710)
3513 \p{Script: Cari} \p{Script=Carian} (49)
3514 \p{Script: Carian} (Short: \p{Sc=Cari}) (49)
3515 \p{Script: Caucasian_Albanian} (Short: \p{Sc=Aghb}) (53)
3516 \p{Script: Chakma} (Short: \p{Sc=Cakm}) (67)
3517 \p{Script: Cham} (Short: \p{Sc=Cham}) (83)
3518 \p{Script: Cher} \p{Script=Cherokee} (172)
3519 \p{Script: Cherokee} (Short: \p{Sc=Cher}) (172)
3520 \p{Script: Common} (Short: \p{Sc=Zyyy}) (7279)
3521 \p{Script: Copt} \p{Script=Coptic} (137)
3522 \p{Script: Coptic} (Short: \p{Sc=Copt}) (137)
3523 \p{Script: Cprt} \p{Script=Cypriot} (55)
3524 \p{Script: Cuneiform} (Short: \p{Sc=Xsux}) (1234)
3525 \p{Script: Cypriot} (Short: \p{Sc=Cprt}) (55)
3526 \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}) (443)
3527 \p{Script: Cyrl} \p{Script=Cyrillic} (443)
3528 \p{Script: Deseret} (Short: \p{Sc=Dsrt}) (80)
3529 \p{Script: Deva} \p{Script=Devanagari} (154)
3530 \p{Script: Devanagari} (Short: \p{Sc=Deva}) (154)
3531 \p{Script: Dsrt} \p{Script=Deseret} (80)
3532 \p{Script: Dupl} \p{Script=Duployan} (143)
3533 \p{Script: Duployan} (Short: \p{Sc=Dupl}) (143)
3534 \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (1071)
3535 \p{Script: Egyptian_Hieroglyphs} (Short: \p{Sc=Egyp}) (1071)
3536 \p{Script: Elba} \p{Script=Elbasan} (40)
3537 \p{Script: Elbasan} (Short: \p{Sc=Elba}) (40)
3538 \p{Script: Ethi} \p{Script=Ethiopic} (495)
3539 \p{Script: Ethiopic} (Short: \p{Sc=Ethi}) (495)
3540 \p{Script: Geor} \p{Script=Georgian} (127)
3541 \p{Script: Georgian} (Short: \p{Sc=Geor}) (127)
3542 \p{Script: Glag} \p{Script=Glagolitic} (132)
3543 \p{Script: Glagolitic} (Short: \p{Sc=Glag}) (132)
3544 \p{Script: Goth} \p{Script=Gothic} (27)
3545 \p{Script: Gothic} (Short: \p{Sc=Goth}) (27)
3546 \p{Script: Gran} \p{Script=Grantha} (85)
3547 \p{Script: Grantha} (Short: \p{Sc=Gran}) (85)
3548 \p{Script: Greek} (Short: \p{Sc=Grek}) (518)
3549 \p{Script: Grek} \p{Script=Greek} (518)
3550 \p{Script: Gujarati} (Short: \p{Sc=Gujr}) (85)
3551 \p{Script: Gujr} \p{Script=Gujarati} (85)
3552 \p{Script: Gurmukhi} (Short: \p{Sc=Guru}) (79)
3553 \p{Script: Guru} \p{Script=Gurmukhi} (79)
3554 \p{Script: Han} (Short: \p{Sc=Han}) (81_734)
3555 \p{Script: Hang} \p{Script=Hangul} (11_739)
3556 \p{Script: Hangul} (Short: \p{Sc=Hang}) (11_739)
3557 \p{Script: Hani} \p{Script=Han} (81_734)
3558 \p{Script: Hano} \p{Script=Hanunoo} (21)
3559 \p{Script: Hanunoo} (Short: \p{Sc=Hano}) (21)
3560 \p{Script: Hatr} \p{Script=Hatran} (26)
3561 \p{Script: Hatran} (Short: \p{Sc=Hatr}) (26)
3562 \p{Script: Hebr} \p{Script=Hebrew} (133)
3563 \p{Script: Hebrew} (Short: \p{Sc=Hebr}) (133)
3564 \p{Script: Hira} \p{Script=Hiragana} (91)
3565 \p{Script: Hiragana} (Short: \p{Sc=Hira}) (91)
3566 \p{Script: Hluw} \p{Script=Anatolian_Hieroglyphs} (583)
3567 \p{Script: Hmng} \p{Script=Pahawh_Hmong} (127)
3568 \p{Script: Hung} \p{Script=Old_Hungarian} (108)
3569 \p{Script: Imperial_Aramaic} (Short: \p{Sc=Armi}) (31)
3570 \p{Script: Inherited} (Short: \p{Sc=Zinh}) (564)
3571 \p{Script: Inscriptional_Pahlavi} (Short: \p{Sc=Phli}) (27)
3572 \p{Script: Inscriptional_Parthian} (Short: \p{Sc=Prti}) (30)
3573 \p{Script: Ital} \p{Script=Old_Italic} (36)
3574 \p{Script: Java} \p{Script=Javanese} (90)
3575 \p{Script: Javanese} (Short: \p{Sc=Java}) (90)
3576 \p{Script: Kaithi} (Short: \p{Sc=Kthi}) (66)
3577 \p{Script: Kali} \p{Script=Kayah_Li} (47)
3578 \p{Script: Kana} \p{Script=Katakana} (300)
3579 \p{Script: Kannada} (Short: \p{Sc=Knda}) (88)
3580 \p{Script: Katakana} (Short: \p{Sc=Kana}) (300)
3581 \p{Script: Kayah_Li} (Short: \p{Sc=Kali}) (47)
3582 \p{Script: Khar} \p{Script=Kharoshthi} (65)
3583 \p{Script: Kharoshthi} (Short: \p{Sc=Khar}) (65)
3584 \p{Script: Khmer} (Short: \p{Sc=Khmr}) (146)
3585 \p{Script: Khmr} \p{Script=Khmer} (146)
3586 \p{Script: Khoj} \p{Script=Khojki} (62)
3587 \p{Script: Khojki} (Short: \p{Sc=Khoj}) (62)
3588 \p{Script: Khudawadi} (Short: \p{Sc=Sind}) (69)
3589 \p{Script: Knda} \p{Script=Kannada} (88)
3590 \p{Script: Kthi} \p{Script=Kaithi} (66)
3591 \p{Script: Lana} \p{Script=Tai_Tham} (127)
3592 \p{Script: Lao} (Short: \p{Sc=Lao}) (67)
3593 \p{Script: Laoo} \p{Script=Lao} (67)
3594 \p{Script: Latin} (Short: \p{Sc=Latn}) (1350)
3595 \p{Script: Latn} \p{Script=Latin} (1350)
3596 \p{Script: Lepc} \p{Script=Lepcha} (74)
3597 \p{Script: Lepcha} (Short: \p{Sc=Lepc}) (74)
3598 \p{Script: Limb} \p{Script=Limbu} (68)
3599 \p{Script: Limbu} (Short: \p{Sc=Limb}) (68)
3600 \p{Script: Lina} \p{Script=Linear_A} (341)
3601 \p{Script: Linb} \p{Script=Linear_B} (211)
3602 \p{Script: Linear_A} (Short: \p{Sc=Lina}) (341)
3603 \p{Script: Linear_B} (Short: \p{Sc=Linb}) (211)
3604 \p{Script: Lisu} (Short: \p{Sc=Lisu}) (48)
3605 \p{Script: Lyci} \p{Script=Lycian} (29)
3606 \p{Script: Lycian} (Short: \p{Sc=Lyci}) (29)
3607 \p{Script: Lydi} \p{Script=Lydian} (27)
3608 \p{Script: Lydian} (Short: \p{Sc=Lydi}) (27)
3609 \p{Script: Mahajani} (Short: \p{Sc=Mahj}) (39)
3610 \p{Script: Mahj} \p{Script=Mahajani} (39)
3611 \p{Script: Malayalam} (Short: \p{Sc=Mlym}) (114)
3612 \p{Script: Mand} \p{Script=Mandaic} (29)
3613 \p{Script: Mandaic} (Short: \p{Sc=Mand}) (29)
3614 \p{Script: Mani} \p{Script=Manichaean} (51)
3615 \p{Script: Manichaean} (Short: \p{Sc=Mani}) (51)
3616 \p{Script: Marc} \p{Script=Marchen} (68)
3617 \p{Script: Marchen} (Short: \p{Sc=Marc}) (68)
3618 \p{Script: Meetei_Mayek} (Short: \p{Sc=Mtei}) (79)
3619 \p{Script: Mend} \p{Script=Mende_Kikakui} (213)
3620 \p{Script: Mende_Kikakui} (Short: \p{Sc=Mend}) (213)
3621 \p{Script: Merc} \p{Script=Meroitic_Cursive} (90)
3622 \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (32)
3623 \p{Script: Meroitic_Cursive} (Short: \p{Sc=Merc}) (90)
3624 \p{Script: Meroitic_Hieroglyphs} (Short: \p{Sc=Mero}) (32)
3625 \p{Script: Miao} (Short: \p{Sc=Miao}) (133)
3626 \p{Script: Mlym} \p{Script=Malayalam} (114)
3627 \p{Script: Modi} (Short: \p{Sc=Modi}) (79)
3628 \p{Script: Mong} \p{Script=Mongolian} (166)
3629 \p{Script: Mongolian} (Short: \p{Sc=Mong}) (166)
3630 \p{Script: Mro} (Short: \p{Sc=Mro}) (43)
3631 \p{Script: Mroo} \p{Script=Mro} (43)
3632 \p{Script: Mtei} \p{Script=Meetei_Mayek} (79)
3633 \p{Script: Mult} \p{Script=Multani} (38)
3634 \p{Script: Multani} (Short: \p{Sc=Mult}) (38)
3635 \p{Script: Myanmar} (Short: \p{Sc=Mymr}) (223)
3636 \p{Script: Mymr} \p{Script=Myanmar} (223)
3637 \p{Script: Nabataean} (Short: \p{Sc=Nbat}) (40)
3638 \p{Script: Narb} \p{Script=Old_North_Arabian} (32)
3639 \p{Script: Nbat} \p{Script=Nabataean} (40)
3640 \p{Script: New_Tai_Lue} (Short: \p{Sc=Talu}) (83)
3641 \p{Script: Newa} (Short: \p{Sc=Newa}) (92)
3642 \p{Script: Nko} (Short: \p{Sc=Nko}) (59)
3643 \p{Script: Nkoo} \p{Script=Nko} (59)
3644 \p{Script: Ogam} \p{Script=Ogham} (29)
3645 \p{Script: Ogham} (Short: \p{Sc=Ogam}) (29)
3646 \p{Script: Ol_Chiki} (Short: \p{Sc=Olck}) (48)
3647 \p{Script: Olck} \p{Script=Ol_Chiki} (48)
3648 \p{Script: Old_Hungarian} (Short: \p{Sc=Hung}) (108)
3649 \p{Script: Old_Italic} (Short: \p{Sc=Ital}) (36)
3650 \p{Script: Old_North_Arabian} (Short: \p{Sc=Narb}) (32)
3651 \p{Script: Old_Permic} (Short: \p{Sc=Perm}) (43)
3652 \p{Script: Old_Persian} (Short: \p{Sc=Xpeo}) (50)
3653 \p{Script: Old_South_Arabian} (Short: \p{Sc=Sarb}) (32)
3654 \p{Script: Old_Turkic} (Short: \p{Sc=Orkh}) (73)
3655 \p{Script: Oriya} (Short: \p{Sc=Orya}) (90)
3656 \p{Script: Orkh} \p{Script=Old_Turkic} (73)
3657 \p{Script: Orya} \p{Script=Oriya} (90)
3658 \p{Script: Osage} (Short: \p{Sc=Osge}) (72)
3659 \p{Script: Osge} \p{Script=Osage} (72)
3660 \p{Script: Osma} \p{Script=Osmanya} (40)
3661 \p{Script: Osmanya} (Short: \p{Sc=Osma}) (40)
3662 \p{Script: Pahawh_Hmong} (Short: \p{Sc=Hmng}) (127)
3663 \p{Script: Palm} \p{Script=Palmyrene} (32)
3664 \p{Script: Palmyrene} (Short: \p{Sc=Palm}) (32)
3665 \p{Script: Pau_Cin_Hau} (Short: \p{Sc=Pauc}) (57)
3666 \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (57)
3667 \p{Script: Perm} \p{Script=Old_Permic} (43)
3668 \p{Script: Phag} \p{Script=Phags_Pa} (56)
3669 \p{Script: Phags_Pa} (Short: \p{Sc=Phag}) (56)
3670 \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (27)
3671 \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29)
3672 \p{Script: Phnx} \p{Script=Phoenician} (29)
3673 \p{Script: Phoenician} (Short: \p{Sc=Phnx}) (29)
3674 \p{Script: Plrd} \p{Script=Miao} (133)
3675 \p{Script: Prti} \p{Script=Inscriptional_Parthian} (30)
3676 \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29)
3677 \p{Script: Qaac} \p{Script=Coptic} (137)
3678 \p{Script: Qaai} \p{Script=Inherited} (564)
3679 \p{Script: Rejang} (Short: \p{Sc=Rjng}) (37)
3680 \p{Script: Rjng} \p{Script=Rejang} (37)
3681 \p{Script: Runic} (Short: \p{Sc=Runr}) (86)
3682 \p{Script: Runr} \p{Script=Runic} (86)
3683 \p{Script: Samaritan} (Short: \p{Sc=Samr}) (61)
3684 \p{Script: Samr} \p{Script=Samaritan} (61)
3685 \p{Script: Sarb} \p{Script=Old_South_Arabian} (32)
3686 \p{Script: Saur} \p{Script=Saurashtra} (82)
3687 \p{Script: Saurashtra} (Short: \p{Sc=Saur}) (82)
3688 \p{Script: Sgnw} \p{Script=SignWriting} (672)
3689 \p{Script: Sharada} (Short: \p{Sc=Shrd}) (94)
3690 \p{Script: Shavian} (Short: \p{Sc=Shaw}) (48)
3691 \p{Script: Shaw} \p{Script=Shavian} (48)
3692 \p{Script: Shrd} \p{Script=Sharada} (94)
3693 \p{Script: Sidd} \p{Script=Siddham} (92)
3694 \p{Script: Siddham} (Short: \p{Sc=Sidd}) (92)
3695 \p{Script: SignWriting} (Short: \p{Sc=Sgnw}) (672)
3696 \p{Script: Sind} \p{Script=Khudawadi} (69)
3697 \p{Script: Sinh} \p{Script=Sinhala} (110)
3698 \p{Script: Sinhala} (Short: \p{Sc=Sinh}) (110)
3699 \p{Script: Sora} \p{Script=Sora_Sompeng} (35)
3700 \p{Script: Sora_Sompeng} (Short: \p{Sc=Sora}) (35)
3701 \p{Script: Sund} \p{Script=Sundanese} (72)
3702 \p{Script: Sundanese} (Short: \p{Sc=Sund}) (72)
3703 \p{Script: Sylo} \p{Script=Syloti_Nagri} (44)
3704 \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (44)
3705 \p{Script: Syrc} \p{Script=Syriac} (77)
3706 \p{Script: Syriac} (Short: \p{Sc=Syrc}) (77)
3707 \p{Script: Tagalog} (Short: \p{Sc=Tglg}) (20)
3708 \p{Script: Tagb} \p{Script=Tagbanwa} (18)
3709 \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}) (18)
3710 \p{Script: Tai_Le} (Short: \p{Sc=Tale}) (35)
3711 \p{Script: Tai_Tham} (Short: \p{Sc=Lana}) (127)
3712 \p{Script: Tai_Viet} (Short: \p{Sc=Tavt}) (72)
3713 \p{Script: Takr} \p{Script=Takri} (66)
3714 \p{Script: Takri} (Short: \p{Sc=Takr}) (66)
3715 \p{Script: Tale} \p{Script=Tai_Le} (35)
3716 \p{Script: Talu} \p{Script=New_Tai_Lue} (83)
3717 \p{Script: Tamil} (Short: \p{Sc=Taml}) (72)
3718 \p{Script: Taml} \p{Script=Tamil} (72)
3719 \p{Script: Tang} \p{Script=Tangut} (6881)
3720 \p{Script: Tangut} (Short: \p{Sc=Tang}) (6881)
3721 \p{Script: Tavt} \p{Script=Tai_Viet} (72)
3722 \p{Script: Telu} \p{Script=Telugu} (96)
3723 \p{Script: Telugu} (Short: \p{Sc=Telu}) (96)
3724 \p{Script: Tfng} \p{Script=Tifinagh} (59)
3725 \p{Script: Tglg} \p{Script=Tagalog} (20)
3726 \p{Script: Thaa} \p{Script=Thaana} (50)
3727 \p{Script: Thaana} (Short: \p{Sc=Thaa}) (50)
3728 \p{Script: Thai} (Short: \p{Sc=Thai}) (86)
3729 \p{Script: Tibetan} (Short: \p{Sc=Tibt}) (207)
3730 \p{Script: Tibt} \p{Script=Tibetan} (207)
3731 \p{Script: Tifinagh} (Short: \p{Sc=Tfng}) (59)
3732 \p{Script: Tirh} \p{Script=Tirhuta} (82)
3733 \p{Script: Tirhuta} (Short: \p{Sc=Tirh}) (82)
3734 \p{Script: Ugar} \p{Script=Ugaritic} (31)
3735 \p{Script: Ugaritic} (Short: \p{Sc=Ugar}) (31)
3736 \p{Script: Unknown} (Short: \p{Sc=Zzzz}) (985_875 plus all
3737 above-Unicode code points)
3738 \p{Script: Vai} (Short: \p{Sc=Vai}) (300)
3739 \p{Script: Vaii} \p{Script=Vai} (300)
3740 \p{Script: Wara} \p{Script=Warang_Citi} (84)
3741 \p{Script: Warang_Citi} (Short: \p{Sc=Wara}) (84)
3742 \p{Script: Xpeo} \p{Script=Old_Persian} (50)
3743 \p{Script: Xsux} \p{Script=Cuneiform} (1234)
3744 \p{Script: Yi} (Short: \p{Sc=Yi}) (1220)
3745 \p{Script: Yiii} \p{Script=Yi} (1220)
3746 \p{Script: Zinh} \p{Script=Inherited} (564)
3747 \p{Script: Zyyy} \p{Script=Common} (7279)
3748 \p{Script: Zzzz} \p{Script=Unknown} (985_875 plus all
3749 above-Unicode code points)
3750 \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (88)
3751 \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (88)
3752 \p{Script_Extensions: Aghb} \p{Script_Extensions=
3753 Caucasian_Albanian} (53)
3754 \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (57)
3755 \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
3756 \p{Hluw}) (583)
3757 \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1323)
3758 \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
3759 (1323)
3760 \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
3761 (94)
3762 \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
3763 (31)
3764 \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (94)
3765 \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst}) (61)
3766 \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
3767 \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
3768 \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
3769 (121)
3770 \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
3771 \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657)
3772 \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
3773 \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
3774 (36)
3775 \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56)
3776 \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
3777 \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (98)
3778 \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng}) (98)
3779 \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
3780 (97)
3781 \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
3782 \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (110)
3783 \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
3784 (110)
3785 \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
3786 \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah}) (109)
3787 \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
3788 \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
3789 (256)
3790 \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
3791 \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
3792 (31)
3793 \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
3794 \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22)
3795 \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (87)
3796 \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
3797 \p{Cans}) (710)
3798 \p{Script_Extensions: Cans} \p{Script_Extensions=
3799 Canadian_Aboriginal} (710)
3800 \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
3801 \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49)
3802 \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
3803 \p{Aghb}) (53)
3804 \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (87)
3805 \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83)
3806 \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
3807 \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
3808 (172)
3809 \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
3810 (6864)
3811 \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
3812 \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt}) (165)
3813 \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
3814 \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
3815 (1234)
3816 \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
3817 (112)
3818 \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
3819 (446)
3820 \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (446)
3821 \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt}) (80)
3822 \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
3823 \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
3824 (210)
3825 \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
3826 \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
3827 \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
3828 (147)
3829 \p{Script_Extensions: Egyp} \p{Script_Extensions=
3830 Egyptian_Hieroglyphs} (1071)
3831 \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
3832 \p{Egyp}) (1071)
3833 \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
3834 \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba}) (40)
3835 \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
3836 \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
3837 (495)
3838 \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (129)
3839 \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
3840 (129)
3841 \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136)
3842 \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
3843 (136)
3844 \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
3845 \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27)
3846 \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (113)
3847 \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
3848 (113)
3849 \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522)
3850 \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
3851 \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
3852 (99)
3853 \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (99)
3854 \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
3855 (93)
3856 \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (93)
3857 \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (82_013)
3858 \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
3859 \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
3860 (11_775)
3861 \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (82_013)
3862 \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
3863 \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano}) (23)
3864 \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
3865 \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26)
3866 \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (133)
3867 \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr}) (133)
3868 \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (143)
3869 \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
3870 (143)
3871 \p{Script_Extensions: Hluw} \p{Script_Extensions=
3872 Anatolian_Hieroglyphs} (583)
3873 \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
3874 (127)
3875 \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
3876 (108)
3877 \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
3878 \p{Armi}) (31)
3879 \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
3880 (496)
3881 \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
3882 \p{Phli}) (27)
3883 \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
3884 Prti}, \p{Prti}) (30)
3885 \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (36)
3886 \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
3887 \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
3888 (91)
3889 \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (86)
3890 \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
3891 \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (352)
3892 \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
3893 (100)
3894 \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
3895 (352)
3896 \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
3897 (48)
3898 \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (65)
3899 \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
3900 (65)
3901 \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146)
3902 \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
3903 \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (72)
3904 \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (72)
3905 \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
3906 (81)
3907 \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (100)
3908 \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (86)
3909 \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
3910 \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (67)
3911 \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (67)
3912 \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn}) (1370)
3913 \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1370)
3914 \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
3915 \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74)
3916 \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
3917 \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69)
3918 \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
3919 \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
3920 \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
3921 (386)
3922 \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
3923 (268)
3924 \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (48)
3925 \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
3926 \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29)
3927 \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
3928 \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27)
3929 \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
3930 (61)
3931 \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
3932 \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
3933 (119)
3934 \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
3935 \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand}) (30)
3936 \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
3937 \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
3938 (52)
3939 \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
3940 \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc}) (68)
3941 \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
3942 \p{Mtei}) (79)
3943 \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
3944 (213)
3945 \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
3946 \p{Mend}) (213)
3947 \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
3948 (90)
3949 \p{Script_Extensions: Mero} \p{Script_Extensions=
3950 Meroitic_Hieroglyphs} (32)
3951 \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
3952 \p{Merc}) (90)
3953 \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
3954 \p{Mero}) (32)
3955 \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (133)
3956 \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (119)
3957 \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89)
3958 \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (169)
3959 \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
3960 (169)
3961 \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43)
3962 \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
3963 \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
3964 \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
3965 \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult}) (48)
3966 \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
3967 (224)
3968 \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
3969 \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
3970 (40)
3971 \p{Script_Extensions: Narb} \p{Script_Extensions=
3972 Old_North_Arabian} (32)
3973 \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
3974 \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
3975 (83)
3976 \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (92)
3977 \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (59)
3978 \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (59)
3979 \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
3980 \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29)
3981 \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
3982 (48)
3983 \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
3984 \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
3985 \p{Hung}) (108)
3986 \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
3987 (36)
3988 \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
3989 \p{Narb}) (32)
3990 \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
3991 (44)
3992 \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
3993 (50)
3994 \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
3995 \p{Sarb}) (32)
3996 \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
3997 (73)
3998 \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (94)
3999 \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
4000 \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (94)
4001 \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72)
4002 \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
4003 \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
4004 \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma}) (40)
4005 \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
4006 \p{Hmng}) (127)
4007 \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
4008 \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
4009 (32)
4010 \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
4011 (57)
4012 \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
4013 \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
4014 \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
4015 \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
4016 (59)
4017 \p{Script_Extensions: Phli} \p{Script_Extensions=
4018 Inscriptional_Pahlavi} (27)
4019 \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
4020 (30)
4021 \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
4022 \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
4023 (29)
4024 \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (133)
4025 \p{Script_Extensions: Prti} \p{Script_Extensions=
4026 Inscriptional_Parthian} (30)
4027 \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
4028 \p{Phlp}) (30)
4029 \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
4030 \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (496)
4031 \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37)
4032 \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
4033 \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86)
4034 \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
4035 \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
4036 (61)
4037 \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
4038 \p{Script_Extensions: Sarb} \p{Script_Extensions=
4039 Old_South_Arabian} (32)
4040 \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
4041 \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
4042 (82)
4043 \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
4044 \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
4045 (100)
4046 \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw}) (48)
4047 \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
4048 \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100)
4049 \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
4050 \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd}) (92)
4051 \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
4052 (672)
4053 \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
4054 \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112)
4055 \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
4056 (112)
4057 \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
4058 \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
4059 \p{Sora}) (35)
4060 \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
4061 \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
4062 (72)
4063 \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56)
4064 \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
4065 \p{Sylo}) (56)
4066 \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (93)
4067 \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc}) (93)
4068 \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg}) (22)
4069 \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
4070 \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
4071 (20)
4072 \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45)
4073 \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
4074 (127)
4075 \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
4076 (72)
4077 \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (78)
4078 \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (78)
4079 \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
4080 \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
4081 \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (80)
4082 \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (80)
4083 \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6881)
4084 \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
4085 (6881)
4086 \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
4087 \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (101)
4088 \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu}) (101)
4089 \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
4090 \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
4091 \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (65)
4092 \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (65)
4093 \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86)
4094 \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
4095 (207)
4096 \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
4097 \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
4098 (59)
4099 \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (94)
4100 \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh}) (94)
4101 \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
4102 \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
4103 (31)
4104 \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
4105 (985_875 plus all above-Unicode code
4106 points)
4107 \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300)
4108 \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
4109 \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
4110 \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
4111 (84)
4112 \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
4113 \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
4114 \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246)
4115 \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
4116 \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (496)
4117 \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (6864)
4118 \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (985_875
4119 plus all above-Unicode code points)
4120 \p{Scx: *} \p{Script_Extensions: *}
4121 \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
4122 \p{SD: *} \p{Soft_Dotted: *}
4123 \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
4124 \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
4125 \p{Sentence_Break: CL} \p{Sentence_Break=Close} (187)
4126 \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187)
4127 \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1)
4128 \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (2197)
4129 \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2197)
4130 \p{Sentence_Break: FO} \p{Sentence_Break=Format} (53)
4131 \p{Sentence_Break: Format} (Short: \p{SB=FO}) (53)
4132 \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (113_027)
4133 \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1)
4134 \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2251)
4135 \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2251)
4136 \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (572)
4137 \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (572)
4138 \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (113_027)
4139 \p{Sentence_Break: Other} (Short: \p{SB=XX}) (993_796 plus all
4140 above-Unicode code points)
4141 \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
4142 \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
4143 \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
4144 \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
4145 \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20)
4146 \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (121)
4147 \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (121)
4148 \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1853)
4149 \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1853)
4150 \p{Sentence_Break: XX} \p{Sentence_Break=Other} (993_796 plus all
4151 above-Unicode code points)
4152 \p{Sentence_Terminal} \p{Sentence_Terminal=Y} (Short: \p{STerm})
4153 (124)
4154 \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
4155 (1_113_988 plus all above-Unicode code
4156 points)
4157 \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (124)
4158 \p{Separator} \p{General_Category=Separator} (Short:
4159 \p{Z}) (19)
4160 \p{Sgnw} \p{SignWriting} (= \p{Script_Extensions=
4161 SignWriting}) (672)
4162 \p{Sharada} \p{Script_Extensions=Sharada} (Short:
4163 \p{Shrd}; NOT \p{Block=Sharada}) (100)
4164 \p{Shavian} \p{Script_Extensions=Shavian} (Short:
4165 \p{Shaw}) (48)
4166 \p{Shaw} \p{Shavian} (= \p{Script_Extensions=
4167 Shavian}) (48)
4168 X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
4169 (16)
4170 \p{Shrd} \p{Sharada} (= \p{Script_Extensions=
4171 Sharada}) (NOT \p{Block=Sharada}) (100)
4172 \p{Sidd} \p{Siddham} (= \p{Script_Extensions=
4173 Siddham}) (NOT \p{Block=Siddham}) (92)
4174 \p{Siddham} \p{Script_Extensions=Siddham} (Short:
4175 \p{Sidd}; NOT \p{Block=Siddham}) (92)
4176 \p{SignWriting} \p{Script_Extensions=SignWriting} (Short:
4177 \p{Sgnw}) (672)
4178 \p{Sind} \p{Khudawadi} (= \p{Script_Extensions=
4179 Khudawadi}) (NOT \p{Block=Khudawadi})
4180 (81)
4181 \p{Sinh} \p{Sinhala} (= \p{Script_Extensions=
4182 Sinhala}) (NOT \p{Block=Sinhala}) (112)
4183 \p{Sinhala} \p{Script_Extensions=Sinhala} (Short:
4184 \p{Sinh}; NOT \p{Block=Sinhala}) (112)
4185 X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
4186 \p{Sk} \p{Modifier_Symbol} (=
4187 \p{General_Category=Modifier_Symbol})
4188 (121)
4189 \p{Sm} \p{Math_Symbol} (= \p{General_Category=
4190 Math_Symbol}) (948)
4191 X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
4192 \p{InSmallForms}) (32)
4193 X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
4194 Small_Form_Variants}) (32)
4195 \p{So} \p{Other_Symbol} (= \p{General_Category=
4196 Other_Symbol}) (5777)
4197 \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
4198 \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
4199 all above-Unicode code points)
4200 \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46)
4201 \p{Sora} \p{Sora_Sompeng} (= \p{Script_Extensions=
4202 Sora_Sompeng}) (NOT \p{Block=
4203 Sora_Sompeng}) (35)
4204 \p{Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} (Short:
4205 \p{Sora}; NOT \p{Block=Sora_Sompeng})
4206 (35)
4207 \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25)
4208 \p{Space: *} \p{White_Space: *}
4209 \p{Space_Separator} \p{General_Category=Space_Separator}
4210 (Short: \p{Zs}) (17)
4211 \p{SpacePerl} \p{XPosixSpace} (25)
4212 \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
4213 \p{Mc}) (394)
4214 X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
4215 (Short: \p{InModifierLetters}) (80)
4216 X \p{Specials} \p{Block=Specials} (16)
4217 \p{STerm} \p{Sentence_Terminal} (=
4218 \p{Sentence_Terminal=Y}) (124)
4219 \p{STerm: *} \p{Sentence_Terminal: *}
4220 \p{Sund} \p{Sundanese} (= \p{Script_Extensions=
4221 Sundanese}) (NOT \p{Block=Sundanese})
4222 (72)
4223 \p{Sundanese} \p{Script_Extensions=Sundanese} (Short:
4224 \p{Sund}; NOT \p{Block=Sundanese}) (72)
4225 X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
4226 Sundanese_Supplement}) (16)
4227 X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
4228 \p{InSundaneseSup}) (16)
4229 X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
4230 Supplemental_Arrows_A}) (16)
4231 X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
4232 Supplemental_Arrows_B}) (128)
4233 X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block=
4234 Supplemental_Arrows_C}) (256)
4235 X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
4236 \p{Block=
4237 Supplemental_Mathematical_Operators})
4238 (256)
4239 X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
4240 \p{Block=
4241 Supplementary_Private_Use_Area_A})
4242 (65_536)
4243 X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
4244 \p{Block=
4245 Supplementary_Private_Use_Area_B})
4246 (65_536)
4247 X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
4248 Supplemental_Punctuation}) (128)
4249 X \p{Sup_Symbols_And_Pictographs}
4250 \p{Supplemental_Symbols_And_Pictographs}
4251 (= \p{Block=
4252 Supplemental_Symbols_And_Pictographs})
4253 (256)
4254 X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
4255 \p{Block=Superscripts_And_Subscripts})
4256 (48)
4257 X \p{Superscripts_And_Subscripts} \p{Block=
4258 Superscripts_And_Subscripts} (Short:
4259 \p{InSuperAndSub}) (48)
4260 X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
4261 \p{InSupArrowsA}) (16)
4262 X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
4263 \p{InSupArrowsB}) (128)
4264 X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
4265 \p{InSupArrowsC}) (256)
4266 X \p{Supplemental_Mathematical_Operators} \p{Block=
4267 Supplemental_Mathematical_Operators}
4268 (Short: \p{InSupMathOperators}) (256)
4269 X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
4270 (Short: \p{InSupPunctuation}) (128)
4271 X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
4272 Supplemental_Symbols_And_Pictographs}
4273 (Short: \p{InSupSymbolsAndPictographs})
4274 (256)
4275 X \p{Supplementary_Private_Use_Area_A} \p{Block=
4276 Supplementary_Private_Use_Area_A}
4277 (Short: \p{InSupPUAA}) (65_536)
4278 X \p{Supplementary_Private_Use_Area_B} \p{Block=
4279 Supplementary_Private_Use_Area_B}
4280 (Short: \p{InSupPUAB}) (65_536)
4281 \p{Surrogate} \p{General_Category=Surrogate} (Short:
4282 \p{Cs}) (2048)
4283 X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688)
4284 \p{Sylo} \p{Syloti_Nagri} (= \p{Script_Extensions=
4285 Syloti_Nagri}) (NOT \p{Block=
4286 Syloti_Nagri}) (56)
4287 \p{Syloti_Nagri} \p{Script_Extensions=Syloti_Nagri} (Short:
4288 \p{Sylo}; NOT \p{Block=Syloti_Nagri})
4289 (56)
4290 \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
4291 (6899)
4292 \p{Syrc} \p{Syriac} (= \p{Script_Extensions=
4293 Syriac}) (NOT \p{Block=Syriac}) (93)
4294 \p{Syriac} \p{Script_Extensions=Syriac} (Short:
4295 \p{Syrc}; NOT \p{Block=Syriac}) (93)
4296 \p{Tagalog} \p{Script_Extensions=Tagalog} (Short:
4297 \p{Tglg}; NOT \p{Block=Tagalog}) (22)
4298 \p{Tagb} \p{Tagbanwa} (= \p{Script_Extensions=
4299 Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
4300 \p{Tagbanwa} \p{Script_Extensions=Tagbanwa} (Short:
4301 \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
4302 X \p{Tags} \p{Block=Tags} (128)
4303 \p{Tai_Le} \p{Script_Extensions=Tai_Le} (Short:
4304 \p{Tale}; NOT \p{Block=Tai_Le}) (45)
4305 \p{Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
4306 \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
4307 \p{Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
4308 \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
4309 X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
4310 Tai_Xuan_Jing_Symbols}) (96)
4311 X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
4312 \p{InTaiXuanJing}) (96)
4313 \p{Takr} \p{Takri} (= \p{Script_Extensions=Takri})
4314 (NOT \p{Block=Takri}) (78)
4315 \p{Takri} \p{Script_Extensions=Takri} (Short:
4316 \p{Takr}; NOT \p{Block=Takri}) (78)
4317 \p{Tale} \p{Tai_Le} (= \p{Script_Extensions=
4318 Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
4319 \p{Talu} \p{New_Tai_Lue} (= \p{Script_Extensions=
4320 New_Tai_Lue}) (NOT \p{Block=
4321 New_Tai_Lue}) (83)
4322 \p{Tamil} \p{Script_Extensions=Tamil} (Short:
4323 \p{Taml}; NOT \p{Block=Tamil}) (80)
4324 \p{Taml} \p{Tamil} (= \p{Script_Extensions=Tamil})
4325 (NOT \p{Block=Tamil}) (80)
4326 \p{Tang} \p{Tangut} (= \p{Script_Extensions=
4327 Tangut}) (NOT \p{Block=Tangut}) (6881)
4328 \p{Tangut} \p{Script_Extensions=Tangut} (Short:
4329 \p{Tang}; NOT \p{Block=Tangut}) (6881)
4330 X \p{Tangut_Components} \p{Block=Tangut_Components} (768)
4331 \p{Tavt} \p{Tai_Viet} (= \p{Script_Extensions=
4332 Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
4333 \p{Telu} \p{Telugu} (= \p{Script_Extensions=
4334 Telugu}) (NOT \p{Block=Telugu}) (101)
4335 \p{Telugu} \p{Script_Extensions=Telugu} (Short:
4336 \p{Telu}; NOT \p{Block=Telugu}) (101)
4337 \p{Term} \p{Terminal_Punctuation} (=
4338 \p{Terminal_Punctuation=Y}) (246)
4339 \p{Term: *} \p{Terminal_Punctuation: *}
4340 \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
4341 \p{Term}) (246)
4342 \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
4343 (1_113_866 plus all above-Unicode code
4344 points)
4345 \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (246)
4346 \p{Tfng} \p{Tifinagh} (= \p{Script_Extensions=
4347 Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
4348 \p{Tglg} \p{Tagalog} (= \p{Script_Extensions=
4349 Tagalog}) (NOT \p{Block=Tagalog}) (22)
4350 \p{Thaa} \p{Thaana} (= \p{Script_Extensions=
4351 Thaana}) (NOT \p{Block=Thaana}) (65)
4352 \p{Thaana} \p{Script_Extensions=Thaana} (Short:
4353 \p{Thaa}; NOT \p{Block=Thaana}) (65)
4354 \p{Thai} \p{Script_Extensions=Thai} (NOT \p{Block=
4355 Thai}) (86)
4356 \p{Tibetan} \p{Script_Extensions=Tibetan} (Short:
4357 \p{Tibt}; NOT \p{Block=Tibetan}) (207)
4358 \p{Tibt} \p{Tibetan} (= \p{Script_Extensions=
4359 Tibetan}) (NOT \p{Block=Tibetan}) (207)
4360 \p{Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
4361 \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
4362 \p{Tirh} \p{Tirhuta} (= \p{Script_Extensions=
4363 Tirhuta}) (NOT \p{Block=Tirhuta}) (94)
4364 \p{Tirhuta} \p{Script_Extensions=Tirhuta} (Short:
4365 \p{Tirh}; NOT \p{Block=Tirhuta}) (94)
4366 \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
4367 \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
4368 Cased=Yes) (31)
4369 \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
4370 (Short: \p{Lt}; /i= General_Category=
4371 Cased_Letter) (31)
4372 X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
4373 Transport_And_Map_Symbols}) (128)
4374 X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
4375 (Short: \p{InTransportAndMap}) (128)
4376 X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
4377 (= \p{Block=
4378 Unified_Canadian_Aboriginal_Syllabics})
4379 (640)
4380 X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
4381 Extended} (= \p{Block=
4382 Unified_Canadian_Aboriginal_Syllabics_-
4383 Extended}) (80)
4384 \p{Ugar} \p{Ugaritic} (= \p{Script_Extensions=
4385 Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
4386 \p{Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
4387 \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
4388 \p{UIdeo} \p{Unified_Ideograph} (=
4389 \p{Unified_Ideograph=Y}) (80_388)
4390 \p{UIdeo: *} \p{Unified_Ideograph: *}
4391 \p{Unassigned} \p{General_Category=Unassigned} (Short:
4392 \p{Cn}) (846_359 plus all above-Unicode
4393 code points)
4394 \p{Unicode} \p{Any} (1_114_112)
4395 X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
4396 Unified_Canadian_Aboriginal_Syllabics}
4397 (Short: \p{InUCAS}) (640)
4398 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
4399 Unified_Canadian_Aboriginal_Syllabics_-
4400 Extended} (Short: \p{InUCASExt}) (80)
4401 \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
4402 (80_388)
4403 \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
4404 (1_033_724 plus all above-Unicode code
4405 points)
4406 \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (80_388)
4407 \p{Unknown} \p{Script_Extensions=Unknown} (Short:
4408 \p{Zzzz}) (985_875 plus all above-
4409 Unicode code points)
4410 \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
4411 Cased=Yes) (1822)
4412 \p{Upper: *} \p{Uppercase: *}
4413 \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
4414 Cased=Yes) (1822)
4415 \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
4416 No) (1_112_290 plus all above-Unicode
4417 code points)
4418 \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
4419 Yes) (1822)
4420 \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
4421 (Short: \p{Lu}; /i= General_Category=
4422 Cased_Letter) (1702)
4423 \p{Vai} \p{Script_Extensions=Vai} (NOT \p{Block=
4424 Vai}) (300)
4425 \p{Vaii} \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
4426 \p{Block=Vai}) (300)
4427 \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
4428 NOT \p{Variation_Selectors}) (259)
4429 \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
4430 plus all above-Unicode code points)
4431 \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
4432 X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
4433 \p{InVS}) (16)
4434 X \p{Variation_Selectors_Supplement} \p{Block=
4435 Variation_Selectors_Supplement} (Short:
4436 \p{InVSSup}) (240)
4437 X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
4438 Vedic_Extensions}) (48)
4439 X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
4440 \p{InVedicExt}) (48)
4441 X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
4442 \p{VertSpace} \v (7)
4443 \p{VS} \p{Variation_Selector} (=
4444 \p{Variation_Selector=Y}) (NOT
4445 \p{Variation_Selectors}) (259)
4446 \p{VS: *} \p{Variation_Selector: *}
4447 X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
4448 \p{Block=
4449 Variation_Selectors_Supplement}) (240)
4450 \p{Wara} \p{Warang_Citi} (= \p{Script_Extensions=
4451 Warang_Citi}) (NOT \p{Block=
4452 Warang_Citi}) (84)
4453 \p{Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
4454 \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
4455 \p{WB: *} \p{Word_Break: *}
4456 \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25)
4457 \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087
4458 plus all above-Unicode code points)
4459 \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25)
4460 \p{Word} \p{XPosixWord} (119_821)
4461 \p{Word_Break: ALetter} (Short: \p{WB=LE}) (27_992)
4462 \p{Word_Break: CR} (Short: \p{WB=CR}) (1)
4463 \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1)
4464 \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
4465 \p{Word_Break: E_Base} (Short: \p{WB=EB}) (79)
4466 \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (4)
4467 \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (5)
4468 \p{Word_Break: EB} \p{Word_Break=E_Base} (79)
4469 \p{Word_Break: EBG} \p{Word_Break=E_Base_GAZ} (4)
4470 \p{Word_Break: EM} \p{Word_Break=E_Modifier} (5)
4471 \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (11)
4472 \p{Word_Break: Extend} (Short: \p{WB=Extend}) (2196)
4473 \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11)
4474 \p{Word_Break: FO} \p{Word_Break=Format} (52)
4475 \p{Word_Break: Format} (Short: \p{WB=FO}) (52)
4476 \p{Word_Break: GAZ} \p{Word_Break=Glue_After_Zwj} (3)
4477 \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (3)
4478 \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (74)
4479 \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (74)
4480 \p{Word_Break: KA} \p{Word_Break=Katakana} (310)
4481 \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310)
4482 \p{Word_Break: LE} \p{Word_Break=ALetter} (27_992)
4483 \p{Word_Break: LF} (Short: \p{WB=LF}) (1)
4484 \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
4485 \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9)
4486 \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15)
4487 \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7)
4488 \p{Word_Break: ML} \p{Word_Break=MidLetter} (9)
4489 \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
4490 \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
4491 \p{Word_Break: NL} \p{Word_Break=Newline} (5)
4492 \p{Word_Break: NU} \p{Word_Break=Numeric} (571)
4493 \p{Word_Break: Numeric} (Short: \p{WB=NU}) (571)
4494 \p{Word_Break: Other} (Short: \p{WB=XX}) (1_082_748 plus all
4495 above-Unicode code points)
4496 \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26)
4497 \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
4498 \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1)
4499 \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
4500 \p{Word_Break: XX} \p{Word_Break=Other} (1_082_748 plus all
4501 above-Unicode code points)
4502 \p{Word_Break: ZWJ} (Short: \p{WB=ZWJ}) (1)
4503 \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
4504 \p{WSpace: *} \p{White_Space: *}
4505 \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
4506 \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
4507 (119_672)
4508 \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (994_440
4509 plus all above-Unicode code points)
4510 \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (119_672)
4511 \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (116_984)
4512 \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (997_128
4513 plus all above-Unicode code points)
4514 \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (116_984)
4515 \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
4516 (119_672)
4517 \p{XIDC: *} \p{XID_Continue: *}
4518 \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (116_984)
4519 \p{XIDS: *} \p{XID_Start: *}
4520 \p{Xpeo} \p{Old_Persian} (= \p{Script_Extensions=
4521 Old_Persian}) (NOT \p{Block=
4522 Old_Persian}) (50)
4523 \p{XPerlSpace} \p{XPosixSpace} (25)
4524 \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short:
4525 \p{Alnum}) (118_820)
4526 \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha})
4527 (118_240)
4528 \p{XPosixBlank} \h, Horizontal white space (Short:
4529 \p{Blank}) (18)
4530 \p{XPosixCntrl} \p{General_Category=Control} Control
4531 characters (Short: \p{Cc}) (65)
4532 \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9]
4533 + all other decimal digits (Short:
4534 \p{Nd}) (580)
4535 \p{XPosixGraph} Characters that are graphical (Short:
4536 \p{Graph}) (265_621)
4537 \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i=
4538 Cased=Yes) (2252)
4539 \p{XPosixPrint} Characters that are graphical plus space
4540 characters (but no controls) (Short:
4541 \p{Print}) (265_638)
4542 \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (757)
4543 \p{XPosixSpace} \s including beyond ASCII and vertical tab
4544 (Short: \p{SpacePerl}) (25)
4545 \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i=
4546 Cased=Yes) (1822)
4547 \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} +
4548 \pM + \p{Pc} + \p{Join_Control} (Short:
4549 \p{Word}) (119_821)
4550 \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
4551 \p{Xsux} \p{Cuneiform} (= \p{Script_Extensions=
4552 Cuneiform}) (NOT \p{Block=Cuneiform})
4553 (1234)
4554 \p{Yi} \p{Script_Extensions=Yi} (1246)
4555 X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
4556 X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
4557 \p{Yiii} \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
4558 X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
4559 Yijing_Hexagram_Symbols}) (64)
4560 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
4561 (Short: \p{InYijing}) (64)
4562 \p{Z} \pZ \p{Separator} (= \p{General_Category=
4563 Separator}) (19)
4564 \p{Zinh} \p{Inherited} (= \p{Script_Extensions=
4565 Inherited}) (496)
4566 \p{Zl} \p{Line_Separator} (= \p{General_Category=
4567 Line_Separator}) (1)
4568 \p{Zp} \p{Paragraph_Separator} (=
4569 \p{General_Category=
4570 Paragraph_Separator}) (1)
4571 \p{Zs} \p{Space_Separator} (=
4572 \p{General_Category=Space_Separator})
4573 (17)
4574 \p{Zyyy} \p{Common} (= \p{Script_Extensions=
4575 Common}) (6864)
4576 \p{Zzzz} \p{Unknown} (= \p{Script_Extensions=
4577 Unknown}) (985_875 plus all above-
4578 Unicode code points)
4579 TX\p{_CanonDCIJ} (For internal use by Perl, not necessarily
4580 stable) (= \p{Soft_Dotted=Y}) (46)
4581 TX\p{_Case_Ignorable} (For internal use by Perl, not necessarily
4582 stable) (= \p{Case_Ignorable=Y}) (2240)
4583 TX\p{_CombAbove} (For internal use by Perl, not necessarily
4584 stable) (= \p{Canonical_Combining_Class=
4585 Above}) (461)
4586
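As a brief illustration of how the entries in the table above are used,
the following sketch checks that a long form and its listed short forms
accept the same character, and that a single-form script property is
not the same thing as the like-named block. The two sample code points
(U+1681 and U+0E3F) are choices made for this example; they do not come
from the table.

```perl
use strict;
use warnings;

# U+1681 OGHAM LETTER BEITH: an Ogham-script character (example choice).
my $ogham = "\x{1681}";

# The long form and both short forms listed in the table are synonyms.
my @forms = (
    qr/\p{Script_Extensions=Ogham}/,
    qr/\p{Scx=Ogam}/,
    qr/\p{Ogam}/,
);
my $all_match = !grep { $ogham !~ $_ } @forms;
print "all Ogham forms match\n" if $all_match;

# U+0E3F THAI CURRENCY SYMBOL BAHT lies in the Thai block, but its
# script is Common, so the single form \p{Thai} (a script property,
# "NOT \p{Block=Thai}") does not match it.
my $baht      = "\x{0E3F}";
my $in_block  = $baht =~ /\p{Block=Thai}/ ? 1 : 0;
my $in_script = $baht =~ /\p{Thai}/       ? 1 : 0;
print "Block=Thai: $in_block, Thai: $in_script\n";
```

Spelling out "Block=" whenever a block is meant avoids relying on how
Perl resolves the single form.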
4587 Legal "\p{}" and "\P{}" constructs that match no characters
4588 Unicode has some property-value pairs that currently don't match
4589 anything. This happens generally either because they are obsolete, or
4590 they exist for symmetry with other forms, but no language has yet been
4591 encoded that uses them. In this version of Unicode, the following
4592 match zero code points:
4593
4594 \p{Canonical_Combining_Class=Attached_Below_Left}
4595 \p{Canonical_Combining_Class=CCC133}
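
A minimal sketch of how one might confirm this on a given installation:
it compiles one of the constructs above and counts the Unicode code
points it matches. Under the Unicode version this document describes,
the text above implies the count is zero; a Perl built with a different
Unicode version may differ.

```perl
use strict;
use warnings;
no warnings 'utf8';    # the loop visits surrogates and noncharacters

# The construct is legal, so this compiles even if it matches nothing.
my $empty = qr/\p{Canonical_Combining_Class=Attached_Below_Left}/;

# Brute-force count over every Unicode code point.
my $count = 0;
for my $cp (0 .. 0x10FFFF) {
    $count++ if chr($cp) =~ $empty;
}
print "code points matched: $count\n";
```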
4596
4597 Properties accessible through Unicode::UCD
4598 The value of any Unicode (not including Perl extensions) character
4599 property mentioned above for any single code point is available through
4600 "charprop()" in Unicode::UCD. "charprops_all()" in Unicode::UCD
4601 returns the values of all the Unicode properties for a given code
4602 point.
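
A short sketch of both calls, assuming a Unicode::UCD recent enough to
export "charprop()" and "charprops_all()". The code point U+0041 is an
example choice. The exact spelling of the returned value (short or full
name) is left unasserted here.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charprop charprops_all);

my $cp = 0x41;    # LATIN CAPITAL LETTER A (example choice)

# The value of one property for one code point.
my $gc = charprop($cp, "General_Category");
print "General_Category of U+0041: $gc\n";

# The values of all Unicode properties for that code point, returned
# as a reference to a hash keyed by property name.
my $all = charprops_all($cp);
printf "%d properties returned\n", scalar keys %$all;
```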
4603
4604 Besides these, all the Unicode character properties mentioned above
4605 (except for those marked as for internal use by Perl) are also
4606 accessible by "prop_invlist()" in Unicode::UCD.
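
"prop_invlist()" returns an inversion list: elements at even indexes
begin ranges of code points that match, and elements at odd indexes
begin ranges that do not. The sketch below uses that layout to count
the matching Unicode code points, which should correspond to the
parenthesized counts in the table above. The helper
"count_code_points" is a hypothetical name introduced for this example.

```perl
use strict;
use warnings;
use Unicode::UCD 'prop_invlist';

# Hypothetical helper: sum the lengths of the matching ranges of an
# inversion list, counting only code points up to U+10FFFF.
sub count_code_points {
    my @invlist = @_;
    my $count = 0;
    for (my $i = 0; $i < @invlist; $i += 2) {
        my $start = $invlist[$i];
        # A final range with no terminator runs to the end of Unicode.
        my $end = $i + 1 < @invlist ? $invlist[$i + 1] : 0x110000;
        $count += $end - $start;
    }
    return $count;
}

my @ogham = prop_invlist("Script_Extensions=Ogham");
my $n = count_code_points(@ogham);
print "Script_Extensions=Ogham: $n code points\n";
```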
4607
4608 Due to their nature, not all Unicode character properties are suitable
4609 for regular expression matches, nor for "prop_invlist()". The remaining
4610 non-provisional, non-internal ones are accessible via "prop_invmap()"
4611 in Unicode::UCD (except for those that this Perl installation hasn't
4612 included; see below for which those are).
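
A minimal sketch of a lookup through "prop_invmap()", assuming the
property's map holds plain string values (as General_Category does);
properties whose maps use other formats need extra handling that is
not shown. It uses "search_invlist()", also from Unicode::UCD, to find
the range containing the code point. U+0041 is an example choice.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invmap search_invlist);

# In list context prop_invmap() returns a reference to the list of
# range starts, a reference to the parallel list of values, the map
# format, and the default value.
my ($ranges, $values, $format, $default) =
    prop_invmap("General_Category");

my $cp = 0x41;    # LATIN CAPITAL LETTER A (example choice)

# Index of the range that contains $cp; the same index selects the
# value that applies to that range.
my $i     = search_invlist($ranges, $cp);
my $value = $values->[$i];
print "General_Category of U+0041: $value (format '$format')\n";
```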
4613
4614 For compatibility with other parts of Perl, all the single forms given
4615 in the table in the section above are recognized. However, there are some
4616 ambiguities between some Perl extensions and the Unicode properties,
4617 all of which are silently resolved in favor of the official Unicode
4618 property. To avoid surprises, you should only use "prop_invmap()" for
4619 forms listed in the table below, which omits the non-recommended ones.
4620 The affected forms are the Perl single form equivalents of Unicode
4621 properties, such as "\p{sc}" being a single-form equivalent of
4622 "\p{gc=sc}", which is treated by "prop_invmap()" as the "Script"
4623 property, whose short name is "sc". The table indicates the current
4624 ambiguities in the INFO column, beginning with the word "NOT".
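
One hedged way to see which property a short name denotes before
passing it to "prop_invmap()" is to ask "prop_aliases()", also in
Unicode::UCD, for the name's full form. The three names tried here are
ones the text and the table below flag as ambiguous; the %full hash is
a name introduced for this sketch.

```perl
use strict;
use warnings;
use Unicode::UCD 'prop_aliases';

# Map each ambiguous short name to the full name of the Unicode
# property it stands for.
my %full;
for my $name (qw(sc cf lc)) {
    my ($short, $full_name) = prop_aliases($name);
    $full{$name} = $full_name;
    print "$name => ", (defined $full_name ? $full_name : "(unknown)"), "\n";
}
```

To get the General_Category value instead, spell it out in full, for
example "gc=sc" in a "\p{}" construct.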
4625
4626 The standard Unicode properties listed below are documented in
4627 <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
4628 documented in "prop_invmap()" in Unicode::UCD. The other Perl
4629 extensions are in "Other Properties" in perlunicode.
4630
4631 The first column in the table is a name for the property; the second
4632 column is an alternative name, if any, plus possibly some annotations.
4633 The alternative name is the property's full name, unless that would
4634 simply repeat the first column, in which case the second column
4635 indicates the property's short name (if different). The annotations
4636 are given only in the entry for the full name. If a property is
4637 obsolete, etc., the entry will be flagged with the same characters used
4638 in the table in the section above, like D or S.
4639
4640 NAME INFO
4641
4642 Age
4643 AHex ASCII_Hex_Digit
4644 All (Perl extension). All code points,
4645 including those above Unicode. Same as
4646 qr/./s
4647 Alnum XPosixAlnum. (Perl extension)
4648 Alpha Alphabetic
4649 Alphabetic (Short: Alpha)
4650 Any (Perl extension). All Unicode code
4651 points: [\x{0000}-\x{10FFFF}]
4652 ASCII Block=ASCII. (Perl extension).
4653 [[:ASCII:]]
4654 ASCII_Hex_Digit (Short: AHex)
4655 Assigned (Perl extension). All assigned code points
4656 Bc Bidi_Class
4657 Bidi_C Bidi_Control
4658 Bidi_Class (Short: bc)
4659 Bidi_Control (Short: Bidi_C)
4660 Bidi_M Bidi_Mirrored
4661 Bidi_Mirrored (Short: Bidi_M)
4662 Bidi_Mirroring_Glyph (Short: bmg)
4663 Bidi_Paired_Bracket (Short: bpb)
4664 Bidi_Paired_Bracket_Type (Short: bpt)
4665 Blank XPosixBlank. (Perl extension)
4666 Blk Block
4667 Block (Short: blk)
4668 Bmg Bidi_Mirroring_Glyph
4669 Bpb Bidi_Paired_Bracket
4670 Bpt Bidi_Paired_Bracket_Type
4671 Canonical_Combining_Class (Short: ccc)
4672 Case_Folding (Short: cf)
4673 Case_Ignorable (Short: CI)
4674 Cased
4675 Category General_Category
4676 Ccc Canonical_Combining_Class
4677 CE Composition_Exclusion
4678 Cf Case_Folding; NOT 'cf' meaning
4679 'General_Category=Format'
4680 Changes_When_Casefolded (Short: CWCF)
4681 Changes_When_Casemapped (Short: CWCM)
4682 Changes_When_Lowercased (Short: CWL)
4683 Changes_When_NFKC_Casefolded (Short: CWKCF)
4684 Changes_When_Titlecased (Short: CWT)
4685 Changes_When_Uppercased (Short: CWU)
4686 CI Case_Ignorable
4687 Cntrl General_Category=XPosixCntrl. (Perl
4688 extension)
4689 Comp_Ex Full_Composition_Exclusion
4690 Composition_Exclusion (Short: CE)
4691 CWCF Changes_When_Casefolded
4692 CWCM Changes_When_Casemapped
4693 CWKCF Changes_When_NFKC_Casefolded
4694 CWL Changes_When_Lowercased
4695 CWT Changes_When_Titlecased
4696 CWU Changes_When_Uppercased
4697 Dash
4698 Decomposition_Mapping (Short: dm)
4699 Decomposition_Type (Short: dt)
4700 Default_Ignorable_Code_Point (Short: DI)
4701 Dep Deprecated
4702 Deprecated (Short: Dep)
4703 DI Default_Ignorable_Code_Point
4704 Dia Diacritic
4705 Diacritic (Short: Dia)
4706 Digit General_Category=XPosixDigit. (Perl
4707 extension)
4708 Dm Decomposition_Mapping
4709 Dt Decomposition_Type
4710 Ea East_Asian_Width
4711 East_Asian_Width (Short: ea)
4712 Ext Extender
4713 Extender (Short: Ext)
4714 Full_Composition_Exclusion (Short: Comp_Ex)
4715 Gc General_Category
4716 GCB Grapheme_Cluster_Break
4717 General_Category (Short: gc)
4718 Gr_Base Grapheme_Base
4719 Gr_Ext Grapheme_Extend
4720 Graph XPosixGraph. (Perl extension)
4721 Grapheme_Base (Short: Gr_Base)
4722 Grapheme_Cluster_Break (Short: GCB)
4723 Grapheme_Extend (Short: Gr_Ext)
4724 Hangul_Syllable_Type (Short: hst)
4725 Hex Hex_Digit
4726 Hex_Digit (Short: Hex)
4727 HorizSpace XPosixBlank. (Perl extension)
4728 Hst Hangul_Syllable_Type
4729 D Hyphen Supplanted by Line_Break property values;
4730 see www.unicode.org/reports/tr14
4731 ID_Continue (Short: IDC)
4732 ID_Start (Short: IDS)
4733 IDC ID_Continue
4734 Ideo Ideographic
4735 Ideographic (Short: Ideo)
4736 IDS ID_Start
4737 IDS_Binary_Operator (Short: IDSB)
4738 IDS_Trinary_Operator (Short: IDST)
4739 IDSB IDS_Binary_Operator
4740 IDST IDS_Trinary_Operator
4741 In Present_In. (Perl extension)
4742 Indic_Positional_Category (Short: InPC)
4743 Indic_Syllabic_Category (Short: InSC)
4744 InPC Indic_Positional_Category
4745 InSC Indic_Syllabic_Category
4746 Isc ISO_Comment; NOT 'isc' meaning
4747 'General_Category=Other'
4748 ISO_Comment (Short: isc)
4749 Jg Joining_Group
4750 Join_C Join_Control
4751 Join_Control (Short: Join_C)
4752 Joining_Group (Short: jg)
4753 Joining_Type (Short: jt)
4754 Jt Joining_Type
4755 Lb Line_Break
4756 Lc Lowercase_Mapping; NOT 'lc' meaning
4757 'General_Category=Cased_Letter'
4758 Line_Break (Short: lb)
4759 LOE Logical_Order_Exception
4760 Logical_Order_Exception (Short: LOE)
4761 Lower Lowercase
4762 Lowercase (Short: Lower)
4763 Lowercase_Mapping (Short: lc)
4764 Math
4765 Na Name
4766 Na1 Unicode_1_Name
4767 Name (Short: na)
4768 Name_Alias
4769 NChar Noncharacter_Code_Point
4770 NFC_QC NFC_Quick_Check
4771 NFC_Quick_Check (Short: NFC_QC)
4772 NFD_QC NFD_Quick_Check
4773 NFD_Quick_Check (Short: NFD_QC)
4774 NFKC_Casefold (Short: NFKC_CF)
4775 NFKC_CF NFKC_Casefold
4776 NFKC_QC NFKC_Quick_Check
4777 NFKC_Quick_Check (Short: NFKC_QC)
4778 NFKD_QC NFKD_Quick_Check
4779 NFKD_Quick_Check (Short: NFKD_QC)
4780 Noncharacter_Code_Point (Short: NChar)
4781 Nt Numeric_Type
4782 Numeric_Type (Short: nt)
4783 Numeric_Value (Short: nv)
4784 Nv Numeric_Value
4785 Pat_Syn Pattern_Syntax
4786 Pat_WS Pattern_White_Space
4787 Pattern_Syntax (Short: Pat_Syn)
4788 Pattern_White_Space (Short: Pat_WS)
4789 PCM Prepended_Concatenation_Mark
4790 Perl_Decimal_Digit (Perl extension)
4791 PerlSpace PosixSpace. (Perl extension)
4792 PerlWord PosixWord. (Perl extension)
4793 PosixAlnum (Perl extension). [A-Za-z0-9]
4794 PosixAlpha (Perl extension). [A-Za-z]
4795 PosixBlank (Perl extension). \t and ' '
4796 PosixCntrl (Perl extension). ASCII control
4797 characters: NUL, SOH, STX, ETX, EOT, ENQ,
4798 ACK, BEL, BS, HT, LF, VT, FF, CR, SO, SI,
4799 DLE, DC1, DC2, DC3, DC4, NAK, SYN, ETB,
4800 CAN, EOM, SUB, ESC, FS, GS, RS, US, and DEL
4801 PosixDigit (Perl extension). [0-9]
4802 PosixGraph (Perl extension). [-!"#$%&'()*+,./:;<=
4803 >?@[\\]^_`{|}~0-9A-Za-z]
4804 PosixLower (Perl extension). [a-z]
4805 PosixPrint (Perl extension). [- 0-9A-Za-
4806 z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
4807 PosixPunct (Perl extension). [-!"#$%&'()*+,./:;<=
4808 >?@[\\]^_`{|}~]
4809 PosixSpace (Perl extension). \t, \n, \cK, \f, \r,
4810 and ' '. (\cK is vertical tab)
4811 PosixUpper (Perl extension). [A-Z]
4812 PosixWord (Perl extension). \w, restricted to ASCII
4813 = [A-Za-z0-9_]
4814 PosixXDigit (Perl extension). [0-9A-Fa-f]
4815 Prepended_Concatenation_Mark (Short: PCM)
4816 Present_In (Short: In). (Perl extension)
4817 Print XPosixPrint. (Perl extension)
4818 Punct General_Category=Punct. (Perl extension)
4819 QMark Quotation_Mark
4820 Quotation_Mark (Short: QMark)
4821 Radical
4822 SB Sentence_Break
4823 Sc Script; NOT 'sc' meaning
4824 'General_Category=Currency_Symbol'
4825 Scf Simple_Case_Folding
4826 Script (Short: sc)
4827 Script_Extensions (Short: scx)
4828 Scx Script_Extensions
4829 SD Soft_Dotted
4830 Sentence_Break (Short: SB)
4831 Sentence_Terminal (Short: STerm)
4832 Sfc Simple_Case_Folding
4833 Simple_Case_Folding (Short: scf)
4834 Simple_Lowercase_Mapping (Short: slc)
4835 Simple_Titlecase_Mapping (Short: stc)
4836 Simple_Uppercase_Mapping (Short: suc)
4837 Slc Simple_Lowercase_Mapping
4838 Soft_Dotted (Short: SD)
4839 Space White_Space
4840 SpacePerl XPosixSpace. (Perl extension)
4841 Stc Simple_Titlecase_Mapping
4842 STerm Sentence_Terminal
4843 Suc Simple_Uppercase_Mapping
4844 Tc Titlecase_Mapping
4845 Term Terminal_Punctuation
4846 Terminal_Punctuation (Short: Term)
4847 Title Titlecase. (Perl extension)
4848 Titlecase (Short: Title). (Perl extension). (=
4849 \p{Gc=Lt})
4850 Titlecase_Mapping (Short: tc)
4851 Uc Uppercase_Mapping
4852 UIdeo Unified_Ideograph
4853 Unicode Any. (Perl extension)
4854 Unicode_1_Name (Short: na1)
4855 Unified_Ideograph (Short: UIdeo)
4856 Upper Uppercase
4857 Uppercase (Short: Upper)
4858 Uppercase_Mapping (Short: uc)
4859 Variation_Selector (Short: VS)
4860 VertSpace (Perl extension). \v
4861 VS Variation_Selector
4862 WB Word_Break
4863 White_Space (Short: WSpace)
4864 Word XPosixWord. (Perl extension)
4865 Word_Break (Short: WB)
4866 WSpace White_Space
4867 XDigit XPosixXDigit. (Perl extension)
4868 XID_Continue (Short: XIDC)
4869 XID_Start (Short: XIDS)
4870 XIDC XID_Continue
4871 XIDS XID_Start
4872 XPerlSpace XPosixSpace. (Perl extension)
4873 XPosixAlnum (Short: Alnum). (Perl extension).
4874 Alphabetic and (decimal) Numeric
4875 XPosixAlpha (Perl extension)
4876 XPosixBlank (Short: Blank). (Perl extension). \h,
4877 Horizontal white space
4878 XPosixCntrl General_Category=XPosixCntrl (Short:
4879 Cntrl). (Perl extension). Control
4880 characters
4881 XPosixDigit General_Category=XPosixDigit (Short:
4882 Digit). (Perl extension). [0-9] + all
4883 other decimal digits
4884 XPosixGraph (Short: Graph). (Perl extension).
4885 Characters that are graphical
4886 XPosixLower (Perl extension)
4887 XPosixPrint (Short: Print). (Perl extension).
4888 Characters that are graphical plus space
4889 characters (but no controls)
4890 XPosixPunct (Perl extension). \p{Punct} + ASCII-range
4891 \p{Symbol}
4892 XPosixSpace (Perl extension). \s including beyond
4893 ASCII and vertical tab
4894 XPosixUpper (Perl extension)
4895 XPosixWord (Short: Word). (Perl extension). \w,
4896 including beyond ASCII; = \p{Alnum} + \pM
4897 + \p{Pc} + \p{Join_Control}
4898 XPosixXDigit (Short: XDigit). (Perl extension)
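
As a rough illustration of how the names in the table above can be used with Unicode::UCD, the sketch below looks up the aliases of one property and the General_Category of one code point. It assumes a Perl recent enough to provide charprop() in Unicode::UCD; the variable names are purely illustrative.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_aliases charprop);

# In list context prop_aliases() returns the short name first,
# then the full name, then any other synonyms.
my ($short, $full) = prop_aliases("Bidi_Class");

# charprop() takes a code point and a property name; any alias from
# the table should be accepted as the name.
my $gc_of_A = charprop(ord("A"), "General_Category");

print "short=$short full=$full gc(A)=$gc_of_A\n";
```

Either the short or the full name can be passed to these functions, which is why the table lists each property under both.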
4899
4901 Certain properties are accessible also via core function calls. These
4902 are:
4903
4904 Lowercase_Mapping lc() and lcfirst()
4905 Titlecase_Mapping ucfirst()
4906 Uppercase_Mapping uc()
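
A minimal sketch of the three mappings, using the DZ digraph because its uppercase and titlecase forms differ. The code points are written as escapes so the example does not depend on the source encoding; the variable names are illustrative only.

```perl
use strict;
use warnings;

my $small_dz = "\x{1C6}";   # LATIN SMALL LETTER DZ WITH CARON
my $cap_dz   = "\x{1C4}";   # LATIN CAPITAL LETTER DZ WITH CARON

my $upper   = uc $small_dz;           # Uppercase_Mapping
my $title   = ucfirst $small_dz;      # Titlecase_Mapping
my $lower   = lc $cap_dz;             # Lowercase_Mapping
my $lcfirst = lcfirst "${cap_dz}X";   # lowercases only the first character

printf "uc=U+%04X ucfirst=U+%04X lc=U+%04X\n",
    ord($upper), ord($title), ord($lower);
```

Note that ucfirst() yields the titlecase letter U+01C5, not the uppercase U+01C4 that uc() returns.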
4907
4908 Also, Case_Folding is accessible through the "/i" modifier in regular
4909 expressions, the "\F" escape in double-quotish strings, and the "fc"
4909 function.
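
The three routes to Case_Folding can be compared with a short sketch. It assumes Perl 5.16 or later with the "fc" feature enabled; the two sigma variables are illustrative.

```perl
use strict;
use warnings;
use feature 'fc';   # enables fc() and the \F escape

my $capital_sigma = "\x{3A3}";   # GREEK CAPITAL LETTER SIGMA
my $final_sigma   = "\x{3C2}";   # GREEK SMALL LETTER FINAL SIGMA

# Both letters fold to U+03C3, so a caseless comparison treats them alike.
my $same_fold = fc($capital_sigma) eq fc($final_sigma) ? 1 : 0;

# The /i modifier applies the same folding during matching.
my $matches_i = $final_sigma =~ /\A$capital_sigma\z/i ? 1 : 0;

# \F folds the interpolated text that follows it, up to \E.
my $folded = "\F$capital_sigma\E";

printf "same_fold=%d matches_i=%d folded=U+%04X\n",
    $same_fold, $matches_i, ord($folded);
```

Comparing fc() results is generally the safer way to test strings for caseless equality, since lc() and uc() do not always agree for such characters.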
4910
4911 And, the Name and Name_Alias properties are accessible through the
4912 "\N{}" interpolation in double-quoted strings and regular expressions,
4913 and through the functions "charnames::viacode()",
4914 "charnames::vianame()", and "charnames::string_vianame()" (which
4915 require a "use charnames ();" to be specified).
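
A brief sketch of the name lookups in both directions. The character chosen is arbitrary and the variable names are illustrative. The "\N{...}" line relies on the automatic loading of charnames that Perl 5.16 and later perform.

```perl
use strict;
use warnings;
use charnames ();

my $name    = charnames::viacode(0x263A);                    # code point to name
my $ordinal = charnames::vianame("WHITE SMILING FACE");      # name to code point
my $string  = charnames::string_vianame("WHITE SMILING FACE"); # name to string
my $literal = "\N{WHITE SMILING FACE}";                      # compile-time lookup

printf "name=%s ordinal=U+%04X same_string=%d\n",
    $name, $ordinal, ($string eq $literal ? 1 : 0);
```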
4916
4917 Finally, most properties related to decomposition are accessible via
4918 Unicode::Normalize.
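
As a hedged example of the decomposition-related access, the sketch below decomposes one precomposed letter with Unicode::Normalize and reads the Canonical_Combining_Class of each resulting code point. The choice of character is arbitrary and the variable names are illustrative.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD NFC getCombinClass);

# LATIN SMALL LETTER S WITH DOT BELOW AND DOT ABOVE
my $composed   = "\x{1E69}";
my $decomposed = NFD($composed);   # canonical decomposition

# Canonical_Combining_Class of each code point in the decomposition.
my @ccc = map { getCombinClass(ord($_)) } split //, $decomposed;

# Recomposing should give back the original character.
my $roundtrip = NFC($decomposed);

printf "decomposed=%s ccc=%s\n",
    join(" ", map { sprintf "U+%04X", ord($_) } split //, $decomposed),
    join(",", @ccc);
```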
4919
4921 Perl will generate an error for a few Unicode character properties
4922 when they are used in a regular expression. The non-Unihan ones are
4923 listed below, with the reasons they are not accepted and, where
4924 available, workarounds. The short name of each property is given in
4925 parentheses. As described after the list, an installation can change
4926 the defaults and choose to accept any of these. The list is machine
4927 generated based on the choices made for the installation that generated
4928 this document.
4929
4930 Expands_On_NFC (XO_NFC)
4931 Expands_On_NFD (XO_NFD)
4932 Expands_On_NFKC (XO_NFKC)
4933 Expands_On_NFKD (XO_NFKD)
4934 Deprecated by Unicode. These are characters that expand to more
4935 than one character in the specified normalization form, but whether
4936 they actually take up more bytes or not depends on the encoding
4937 being used. For example, a UTF-8 encoded character may expand to a
4938 different number of bytes than a UTF-32 encoded character.
4939
4940 Grapheme_Link (Gr_Link)
4941 Deprecated by Unicode: Duplicates ccc=vr
4942 (Canonical_Combining_Class=Virama)
4943
4944 Jamo_Short_Name (JSN)
4945 Other_Alphabetic (OAlpha)
4946 Other_Default_Ignorable_Code_Point (ODI)
4947 Other_Grapheme_Extend (OGr_Ext)
4948 Other_ID_Continue (OIDC)
4949 Other_ID_Start (OIDS)
4950 Other_Lowercase (OLower)
4951 Other_Math (OMath)
4952 Other_Uppercase (OUpper)
4953 Used by Unicode internally for generating other properties and not
4954 intended to be used stand-alone
4955
4956 Script=Katakana_Or_Hiragana (sc=Hrkt)
4957 Obsolete. All code points previously matched by this have been
4958 moved to "Script=Common". Consider instead using
4959 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or
4960 both)
4961
4962 Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
4963 All code points that would be matched by this are matched by either
4964 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
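
The workaround suggested for the two Katakana_Or_Hiragana entries can be sketched as a single character class. The pattern name $kana and the sample code points are illustrative assumptions, not part of Perl.

```perl
use strict;
use warnings;

# Match anything whose Script_Extensions include Katakana or Hiragana.
my $kana = qr/[\p{Script_Extensions=Katakana}\p{Script_Extensions=Hiragana}]/;

my %sample = (
    katakana_a     => "\x{30A2}",   # KATAKANA LETTER A
    hiragana_a     => "\x{3042}",   # HIRAGANA LETTER A
    prolonged_mark => "\x{30FC}",   # KATAKANA-HIRAGANA PROLONGED SOUND MARK
    latin_a        => "A",
);

# Record which samples match, keyed by the labels above.
my %is_kana = map { $_ => ($sample{$_} =~ $kana ? 1 : 0) } keys %sample;

print "$_: $is_kana{$_}\n" for sort keys %is_kana;
```

The prolonged sound mark has Script=Common, so it is matched here only because Script_Extensions is used rather than Script.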
4965
4966 An installation can choose to allow any of these to be matched by
4967 downloading the Unicode database from <http://www.unicode.org/Public/>
4968 to $Config{privlib}/unicore/ in the Perl source tree, changing the
4969 controlling lists contained in the program
4970 $Config{privlib}/unicore/mktables and then re-compiling and installing.
4971 (%Config is available from the Config module).
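
To find where these paths point on a given installation, something like the following sketch can be used. It only reports locations; whether mktables is actually present depends on how Perl was installed, and the variable names are illustrative.

```perl
use strict;
use warnings;
use Config;       # exports %Config
use File::Spec;

# Build the paths named above from this installation's privlib setting.
my $unicore  = File::Spec->catdir($Config{privlib}, 'unicore');
my $mktables = File::Spec->catfile($unicore, 'mktables');

print "unicore directory: $unicore\n";
print "mktables: ", (-e $mktables ? "found" : "not found here"), "\n";
```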
4972
4973 Also, perl can be recompiled to operate on an earlier version of the
4974 Unicode standard. Further information is at
4975 $Config{privlib}/unicore/README.perl.
4976
4978 The Unicode data base is delivered in two different formats. The XML
4979 version is available only for more recent Unicode releases. The other
4980 version is a collection of files. The two are intended to give
4981 equivalent information. Perl uses the older, file-based form; this
4982 allows you to recompile Perl to use early Unicode releases.
4983
4984 The only non-character property that Perl currently supports is Named
4985 Sequences, in which a sequence of code points is given a name and
4986 generally treated as a single entity. Perl supports these via the
4987 "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
4988 in charnames, and "namedseq()" in Unicode::UCD.
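
A small sketch of looking up one named sequence through both modules. The sequence name is taken from Unicode's NamedSequences.txt and the variable names are illustrative.

```perl
use strict;
use warnings;
use charnames ();
use Unicode::UCD qw(namedseq);

my $seq_name = "LATIN CAPITAL LETTER A WITH MACRON AND GRAVE";

my $from_ucd       = namedseq($seq_name);   # scalar context: the string
my @ordinals       = namedseq($seq_name);   # list context: the code points
my $from_charnames = charnames::string_vianame($seq_name);

printf "%s = %s\n", $seq_name,
    join(" ", map { sprintf "U+%04X", $_ } @ordinals);
```

Because a named sequence is more than one code point, charnames::vianame(), which returns a single ordinal, is not the right function for it; string_vianame() is.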
4989
4990 Below is a list of the files in the Unicode data base that Perl doesn't
4991 currently use, along with very brief descriptions of their purposes.
4992 Some of the names of the files have been shortened from those that
4993 Unicode uses, in order to allow them to be distinguishable from
4994 similarly named files on file systems for which only the first 8
4995 characters of a name are significant.
4996
4997 auxiliary/GraphemeBreakTest.html
4998 auxiliary/LineBreakTest.html
4999 auxiliary/SentenceBreakTest.html
5000 auxiliary/WordBreakTest.html
5001 Documentation of validation Tests
5002
5003 BidiCharacterTest.txt
5004 BidiTest.txt
5005 NormTest.txt
5006 Validation Tests
5007
5008 CJKRadicals.txt
5009 Maps the kRSUnicode property values to corresponding code points
5010
5011 EmojiSources.txt
5012 Maps certain Unicode code points to their legacy Japanese cell-
5013 phone values
5014
5015 Index.txt
5016 Alphabetical index of Unicode characters
5017
5018 NamedSqProv.txt
5019 Named sequences proposed for inclusion in a later version of the
5020 Unicode Standard; if you need them now, you can append this file to
5021 NamedSequences.txt and recompile perl
5022
5023 NamesList.html
5024 Describes the format and contents of NamesList.txt
5025
5026 NamesList.txt
5027 Annotated list of characters
5028
5029 NormalizationCorrections.txt
5030 Documentation of corrections already incorporated into the Unicode
5031 data base
5032
5033 ReadMe.txt
5034 Documentation
5035
5036 StandardizedVariants.html
5037 Obsoleted as of Unicode 9.0, but previously provided a visual
5038 display of the standard variant sequences derived from
5039 StandardizedVariants.txt.
5040
5041 StandardizedVariants.txt
5042 Certain glyph variations for character display are standardized.
5043 This lists the non-Unihan ones; the Unihan ones are also not used
5044 by Perl, and are in a separate Unicode data base
5045 <http://www.unicode.org/ivd>
5046
5047 TangutSources.txt
5048 Specifies source mappings for Tangut ideographs and components.
5049 This data file also includes informative radical-stroke values that
5050 are used internally by Unicode
5051
5052 USourceData.txt
5053 Documentation of status and cross reference of proposals for
5054 encoding by Unicode of Unihan characters
5055
5056 USourceGlyphs.pdf
5057 Pictures of the characters in USourceData.txt
5058
5060 <http://www.unicode.org/reports/tr44/>
5061
5062 perlrecharclass
5063
5064 perlunicode
5065
5066
5067
5068perl v5.26.3 2019-05-11 PERLUNIPROPS(1)