1PERLUNIPROPS(1) Perl Programmers Reference Guide PERLUNIPROPS(1)
2
3
4
6 perluniprops - Index of Unicode Version 6.1.0 character properties in
7 Perl
8
10 This document provides information about the portion of the Unicode
11 database that deals with character properties, that is the portion that
12 is defined on single code points. ("Other information in the Unicode
13 data base" below briefly mentions other data that Unicode provides.)
14
15 Perl can provide access to all non-provisional Unicode character
16 properties, though not all are enabled by default. The omitted ones
17 are the Unihan properties (accessible via the CPAN module
18 Unicode::Unihan) and certain deprecated or Unicode-internal properties.
19 (An installation may choose to recompile Perl's tables to change this.
20 See "Unicode character properties that are NOT accepted by Perl".)
21
22 For most purposes, access to Unicode properties from the Perl core is
23 through regular expression matches, as described in the next section.
24 For some special purposes, and to access the properties that are not
25 suitable for regular expression matching, all the Unicode character
26 properties that Perl handles are accessible via the standard
27 Unicode::UCD module, as described in the section "Properties accessible
28 through Unicode::UCD".
29
30 Perl also provides some additional extensions and short-cut synonyms
31 for Unicode properties.
32
33 This document merely lists all available properties and does not
34 attempt to explain what each property really means. There is a brief
35 description of each Perl extension; see "Other Properties" in
36 perlunicode for more information on these. There is some detail about
37 Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but
38 to find out about the intricacies of the official Unicode properties,
39 refer to the Unicode standard. A good starting place is
40 <http://www.unicode.org/reports/tr44/>.
41
42 Note that you can define your own properties; see "User-Defined
43 Character Properties" in perlunicode.
44
46 The Perl regular expression "\p{}" and "\P{}" constructs give access to
47 most of the Unicode character properties. The table below shows all
48 these constructs, both single and compound forms.
49
50 Compound forms consist of two components, separated by an equals sign
51 or a colon. The first component is the property name, and the second
52 component is the particular value of the property to match against, for
53 example, "\p{Script: Greek}" and "\p{Script=Greek}" both mean to match
54 characters whose Script property is Greek.
55
56 Single forms, like "\p{Greek}", are mostly Perl-defined shortcuts for
57 their equivalent compound forms. The table shows these equivalences.
58 (In our example, "\p{Greek}" is a just a shortcut for
59 "\p{Script=Greek}".) There are also a few Perl-defined single forms
60 that are not shortcuts for a compound form. One such is "\p{Word}".
61 These are also listed in the table.
62
63 In parsing these constructs, Perl always ignores Upper/lower case
64 differences everywhere within the {braces}. Thus "\p{Greek}" means the
65 same thing as "\p{greek}". But note that changing the case of the "p"
66 or "P" before the left brace completely changes the meaning of the
67 construct, from "match" (for "\p{}") to "doesn't match" (for "\P{}").
68 Casing in this document is for improved legibility.
69
70 Also, white space, hyphens, and underscores are also normally ignored
71 everywhere between the {braces}, and hence can be freely added or
72 removed even if the "/x" modifier hasn't been specified on the regular
73 expression. But a 'T' at the beginning of an entry in the table below
74 means that tighter (stricter) rules are used for that entry:
75
76 Single form ("\p{name}") tighter rules:
77 White space, hyphens, and underscores ARE significant except for:
78
79 · white space adjacent to a non-word character
80
81 · underscores separating digits in numbers
82
83 That means, for example, that you can freely add or remove white
84 space adjacent to (but within) the braces without affecting the
85 meaning.
86
87 Compound form ("\p{name=value}" or "\p{name:value}") tighter rules:
88 The tighter rules given above for the single form apply to
89 everything to the right of the colon or equals; the looser rules
90 still apply to everything to the left.
91
92 That means, for example, that you can freely add or remove white
93 space adjacent to (but within) the braces and the colon or equal
94 sign.
95
96 Some properties are considered obsolete by Unicode, but still
97 available. There are several varieties of obsolescence:
98
99 Stabilized
100 A property may be stabilized. Such a determination does not
101 indicate that the property should or should not be used; instead it
102 is a declaration that the property will not be maintained nor
103 extended for newly encoded characters. Such properties are marked
104 with an 'S' in the table.
105
106 Deprecated
107 A property may be deprecated, perhaps because its original intent
108 has been replaced by another property, or because its specification
109 was somehow defective. This means that its use is strongly
110 discouraged, so much so that a warning will be issued if used,
111 unless the regular expression is in the scope of a
112 "no warnings 'deprecated'" statement. A 'D' flags each such entry
113 in the table, and the entry there for the longest, most descriptive
114 version of the property will give the reason it is deprecated, and
115 perhaps advice. Perl may issue such a warning, even for properties
116 that aren't officially deprecated by Unicode, when there used to be
117 characters or code points that were matched by them, but no longer.
118 This is to warn you that your program may not work like it did on
119 earlier Unicode releases.
120
121 A deprecated property may be made unavailable in a future Perl
122 version, so it is best to move away from them.
123
124 A deprecated property may also be stabilized, but this fact is not
125 shown.
126
127 Obsolete
128 Properties marked with an 'O' in the table are considered (plain)
129 obsolete. Generally this designation is given to properties that
130 Unicode once used for internal purposes (but not any longer).
131
132 Some Perl extensions are present for backwards compatibility and are
133 discouraged from being used, but are not obsolete. An 'X' flags each
134 such entry in the table. Future Unicode versions may force some of
135 these extensions to be removed without warning, replaced by another
136 property with the same name that means something different. Use the
137 equivalent shown instead.
138
139 Matches in the Block property have shortcuts that begin with "In_".
140 For example, "\p{Block=Latin1}" can be written as "\p{In_Latin1}". For
141 backward compatibility, if there is no conflict with another shortcut,
142 these may also be written as "\p{Latin1}" or "\p{Is_Latin1}". But,
143 N.B., there are numerous such conflicting shortcuts. Use of these
144 forms for Block is discouraged, and are flagged as such, not only
145 because of the potential confusion as to what is meant, but also
146 because a later release of Unicode may preempt the shortcut, and your
147 program would no longer be correct. Use the "In_" form instead to
148 avoid this, or even more clearly, use the compound form, e.g.,
149 "\p{blk:latin1}". See "Blocks" in perlunicode for more information
150 about this.
151
152 The table below has two columns. The left column contains the "\p{}"
153 constructs to look up, possibly preceded by the flags mentioned above;
154 and the right column contains information about them, like a
155 description, or synonyms. It shows both the single and compound forms
156 for each property that has them. If the left column is a short name
157 for a property, the right column will give its longer, more descriptive
158 name; and if the left column is the longest name, the right column will
159 show any equivalent shortest name, in both single and compound forms if
160 applicable.
161
162 The right column will also caution you if a property means something
163 different than what might normally be expected.
164
165 All single forms are Perl extensions; a few compound forms are as well,
166 and are noted as such.
167
168 Numbers in (parentheses) indicate the total number of code points
169 matched by the property. For emphasis, those properties that match no
170 code points at all are listed as well in a separate section following
171 the table.
172
173 Most properties match the same code points regardless of whether "/i"
174 case-insensitive matching is specified or not. But a few properties
175 are affected. These are shown with the notation
176
177 (/i= other_property)
178
179 in the second column. Under case-insensitive matching they match the
180 same code pode points as the property "other_property".
181
182 There is no description given for most non-Perl defined properties (See
183 <http://www.unicode.org/reports/tr44/> for that).
184
185 For compactness, '*' is used as a wildcard instead of showing all
186 possible combinations. For example, entries like:
187
188 \p{Gc: *} \p{General_Category: *}
189
190 mean that 'Gc' is a synonym for 'General_Category', and anything that
191 is valid for the latter is also valid for the former. Similarly,
192
193 \p{Is_*} \p{*}
194
195 means that if and only if, for example, "\p{Foo}" exists, then
196 "\p{Is_Foo}" and "\p{IsFoo}" are also valid and all mean the same
197 thing. And similarly, "\p{Foo=Bar}" means the same as "\p{Is_Foo=Bar}"
198 and "\p{IsFoo=Bar}". "*" here is restricted to something not beginning
199 with an underscore.
200
201 Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for
202 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table
203 shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries
204 for the other possibilities. Note that not all properties which have
205 values 'Yes' and 'No' are binary, and they have all their values
206 spelled out without using this wild card, and a "NOT" clause in their
207 description that highlights their not being binary. These also require
208 the compound form to match them, whereas true binary properties have
209 both single and compound forms available.
210
211 Note that all non-essential underscores are removed in the display of
212 the short names below.
213
214 Legend summary:
215
216 * is a wild-card
217 (\d+) in the info column gives the number of code points matched by
218 this property.
219 D means this is deprecated.
220 O means this is obsolete.
221 S means this is stabilized.
222 T means tighter (stricter) name matching applies.
223 X means use of this form is discouraged, and may not be stable.
224
225 NAME INFO
226
227 X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64)
228 T \p{Age: 1.1} \p{Age=V1_1} (33_979)
229 T \p{Age: 2.0} \p{Age=V2_0} (144_521)
230 T \p{Age: 2.1} \p{Age=V2_1} (2)
231 T \p{Age: 3.0} \p{Age=V3_0} (10_307)
232 T \p{Age: 3.1} \p{Age=V3_1} (44_978)
233 T \p{Age: 3.2} \p{Age=V3_2} (1016)
234 T \p{Age: 4.0} \p{Age=V4_0} (1226)
235 T \p{Age: 4.1} \p{Age=V4_1} (1273)
236 T \p{Age: 5.0} \p{Age=V5_0} (1369)
237 T \p{Age: 5.1} \p{Age=V5_1} (1624)
238 T \p{Age: 5.2} \p{Age=V5_2} (6648)
239 T \p{Age: 6.0} \p{Age=V6_0} (2088)
240 T \p{Age: 6.1} \p{Age=V6_1} (732)
241 \p{Age: NA} \p{Age=Unassigned} (864_349)
242 \p{Age: Unassigned} Code point's usage has not been assigned
243 in any Unicode release thus far. (Short:
244 \p{Age=NA}) (864_349)
245 \p{Age: V1_1} Code point's usage introduced in version
246 1.1 (33_979)
247 \p{Age: V2_0} Code point's usage was introduced in
248 version 2.0; See also Property
249 'Present_In' (144_521)
250 \p{Age: V2_1} Code point's usage was introduced in
251 version 2.1; See also Property
252 'Present_In' (2)
253 \p{Age: V3_0} Code point's usage was introduced in
254 version 3.0; See also Property
255 'Present_In' (10_307)
256 \p{Age: V3_1} Code point's usage was introduced in
257 version 3.1; See also Property
258 'Present_In' (44_978)
259 \p{Age: V3_2} Code point's usage was introduced in
260 version 3.2; See also Property
261 'Present_In' (1016)
262 \p{Age: V4_0} Code point's usage was introduced in
263 version 4.0; See also Property
264 'Present_In' (1226)
265 \p{Age: V4_1} Code point's usage was introduced in
266 version 4.1; See also Property
267 'Present_In' (1273)
268 \p{Age: V5_0} Code point's usage was introduced in
269 version 5.0; See also Property
270 'Present_In' (1369)
271 \p{Age: V5_1} Code point's usage was introduced in
272 version 5.1; See also Property
273 'Present_In' (1624)
274 \p{Age: V5_2} Code point's usage was introduced in
275 version 5.2; See also Property
276 'Present_In' (6648)
277 \p{Age: V6_0} Code point's usage was introduced in
278 version 6.0; See also Property
279 'Present_In' (2088)
280 \p{Age: V6_1} Code point's usage was introduced in
281 version 6.1; See also Property
282 'Present_In' (732)
283 \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
284 (22)
285 \p{AHex: *} \p{ASCII_Hex_Digit: *}
286 X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block=
287 Alchemical_Symbols}) (128)
288 X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
289 \p{InAlchemical}) (128)
290 \p{All} \p{Any} (1_114_112)
291 \p{Alnum} Alphabetic and (decimal) Numeric (102_619)
292 \p{Alpha} \p{Alphabetic=Y} (102_159)
293 \p{Alpha: *} \p{Alphabetic: *}
294 \p{Alphabetic} \p{Alpha} (= \p{Alphabetic=Y}) (102_159)
295 \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (1_011_953)
296 \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (102_159)
297 X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (=
298 \p{Block=Alphabetic_Presentation_Forms})
299 (80)
300 X \p{Alphabetic_Presentation_Forms} \p{Block=
301 Alphabetic_Presentation_Forms} (Short:
302 \p{InAlphabeticPF}) (80)
303 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
304 \p{Block=
305 Ancient_Greek_Musical_Notation}) (80)
306 X \p{Ancient_Greek_Musical_Notation} \p{Block=
307 Ancient_Greek_Musical_Notation} (Short:
308 \p{InAncientGreekMusic}) (80)
309 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
310 X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64)
311 \p{Any} [\x{0000}-\x{10FFFF}] (1_114_112)
312 \p{Arab} \p{Arabic} (= \p{Script=Arabic}) (NOT
313 \p{Block=Arabic}) (1234)
314 \p{Arabic} \p{Script=Arabic} (Short: \p{Arab}; NOT
315 \p{Block=Arabic}) (1234)
316 X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block=
317 Arabic_Extended_A}) (96)
318 X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short:
319 \p{InArabicExtA}) (96)
320 X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols}
321 (= \p{Block=
322 Arabic_Mathematical_Alphabetic_Symbols})
323 (256)
324 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
325 Arabic_Mathematical_Alphabetic_Symbols}
326 (Short: \p{InArabicMath}) (256)
327 X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (=
328 \p{Block=Arabic_Presentation_Forms_A})
329 (688)
330 X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (=
331 \p{Block=Arabic_Presentation_Forms_B})
332 (144)
333 X \p{Arabic_Presentation_Forms_A} \p{Block=
334 Arabic_Presentation_Forms_A} (Short:
335 \p{InArabicPFA}) (688)
336 X \p{Arabic_Presentation_Forms_B} \p{Block=
337 Arabic_Presentation_Forms_B} (Short:
338 \p{InArabicPFB}) (144)
339 X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block=
340 Arabic_Supplement}) (48)
341 X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short:
342 \p{InArabicSup}) (48)
343 \p{Armenian} \p{Script=Armenian} (Short: \p{Armn}; NOT
344 \p{Block=Armenian}) (91)
345 \p{Armi} \p{Imperial_Aramaic} (= \p{Script=
346 Imperial_Aramaic}) (NOT \p{Block=
347 Imperial_Aramaic}) (31)
348 \p{Armn} \p{Armenian} (= \p{Script=Armenian}) (NOT
349 \p{Block=Armenian}) (91)
350 X \p{Arrows} \p{Block=Arrows} (112)
351 \p{ASCII} \p{Block=Basic_Latin} [[:ASCII:]] (128)
352 \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
353 (22)
354 \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090)
355 \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22)
356 \p{Assigned} All assigned code points (249_697)
357 \p{Avestan} \p{Script=Avestan} (Short: \p{Avst}; NOT
358 \p{Block=Avestan}) (61)
359 \p{Avst} \p{Avestan} (= \p{Script=Avestan}) (NOT
360 \p{Block=Avestan}) (61)
361 \p{Bali} \p{Balinese} (= \p{Script=Balinese}) (NOT
362 \p{Block=Balinese}) (121)
363 \p{Balinese} \p{Script=Balinese} (Short: \p{Bali}; NOT
364 \p{Block=Balinese}) (121)
365 \p{Bamu} \p{Bamum} (= \p{Script=Bamum}) (NOT
366 \p{Block=Bamum}) (657)
367 \p{Bamum} \p{Script=Bamum} (Short: \p{Bamu}; NOT
368 \p{Block=Bamum}) (657)
369 X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block=
370 Bamum_Supplement}) (576)
371 X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short:
372 \p{InBamumSup}) (576)
373 X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128)
374 \p{Batak} \p{Script=Batak} (Short: \p{Batk}; NOT
375 \p{Block=Batak}) (56)
376 \p{Batk} \p{Batak} (= \p{Script=Batak}) (NOT
377 \p{Block=Batak}) (56)
378 \p{Bc: *} \p{Bidi_Class: *}
379 \p{Beng} \p{Bengali} (= \p{Script=Bengali}) (NOT
380 \p{Block=Bengali}) (92)
381 \p{Bengali} \p{Script=Bengali} (Short: \p{Beng}; NOT
382 \p{Block=Bengali}) (92)
383 \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y}) (7)
384 \p{Bidi_C: *} \p{Bidi_Control: *}
385 \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1438)
386 \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (49)
387 \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1438)
388 \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (49)
389 \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7)
390 \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4015)
391 \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4015)
392 \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15)
393 \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15)
394 \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (131)
395 \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12)
396 \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (65)
397 \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (131)
398 \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12)
399 \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (65)
400 \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_098_531)
401 \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_098_531)
402 \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1)
403 \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1)
404 \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1)
405 \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1)
406 \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1290)
407 \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1290)
408 \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (4447)
409 \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (4447)
410 \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7)
411 \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1)
412 \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1)
413 \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (4086)
414 \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (4086)
415 \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1)
416 \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1)
417 \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1)
418 \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1)
419 \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3)
420 \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3)
421 \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (18)
422 \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (18)
423 \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (7)
424 \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_105)
425 \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (7)
426 \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
427 (545)
428 \p{Bidi_M: *} \p{Bidi_Mirrored: *}
429 \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
430 (545)
431 \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567)
432 \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (545)
433 \p{Blank} \h, Horizontal white space (19)
434 \p{Blk: *} \p{Block: *}
435 \p{Block: Aegean_Numbers} (Single: \p{InAegeanNumbers}) (64)
436 \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128)
437 \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical},
438 \p{InAlchemical}) (128)
439 \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
440 (80)
441 \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
442 AlphabeticPF}, \p{InAlphabeticPF}) (80)
443 \p{Block: Ancient_Greek_Music} \p{Block=
444 Ancient_Greek_Musical_Notation} (80)
445 \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
446 AncientGreekMusic},
447 \p{InAncientGreekMusic}) (80)
448 \p{Block: Ancient_Greek_Numbers} (Single:
449 \p{InAncientGreekNumbers}) (80)
450 \p{Block: Ancient_Symbols} (Single: \p{InAncientSymbols}) (64)
451 \p{Block: Arabic} (Single: \p{InArabic}; NOT \p{Arabic} NOR
452 \p{Is_Arabic}) (256)
453 \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
454 \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA},
455 \p{InArabicExtA}) (96)
456 \p{Block: Arabic_Math} \p{Block=
457 Arabic_Mathematical_Alphabetic_Symbols}
458 (256)
459 \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
460 ArabicMath}, \p{InArabicMath}) (256)
461 \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688)
462 \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144)
463 \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA},
464 \p{InArabicPFA}) (688)
465 \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB},
466 \p{InArabicPFB}) (144)
467 \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48)
468 \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup},
469 \p{InArabicSup}) (48)
470 \p{Block: Armenian} (Single: \p{InArmenian}; NOT \p{Armenian}
471 NOR \p{Is_Armenian}) (96)
472 \p{Block: Arrows} (Single: \p{InArrows}) (112)
473 \p{Block: ASCII} \p{Block=Basic_Latin} (128)
474 \p{Block: Avestan} (Single: \p{InAvestan}; NOT \p{Avestan}
475 NOR \p{Is_Avestan}) (64)
476 \p{Block: Balinese} (Single: \p{InBalinese}; NOT \p{Balinese}
477 NOR \p{Is_Balinese}) (128)
478 \p{Block: Bamum} (Single: \p{InBamum}; NOT \p{Bamum} NOR
479 \p{Is_Bamum}) (96)
480 \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576)
481 \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup},
482 \p{InBamumSup}) (576)
483 \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}, \p{ASCII}) (128)
484 \p{Block: Batak} (Single: \p{InBatak}; NOT \p{Batak} NOR
485 \p{Is_Batak}) (64)
486 \p{Block: Bengali} (Single: \p{InBengali}; NOT \p{Bengali}
487 NOR \p{Is_Bengali}) (128)
488 \p{Block: Block_Elements} (Single: \p{InBlockElements}) (32)
489 \p{Block: Bopomofo} (Single: \p{InBopomofo}; NOT \p{Bopomofo}
490 NOR \p{Is_Bopomofo}) (48)
491 \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
492 \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt},
493 \p{InBopomofoExt}) (32)
494 \p{Block: Box_Drawing} (Single: \p{InBoxDrawing}) (128)
495 \p{Block: Brahmi} (Single: \p{InBrahmi}; NOT \p{Brahmi} NOR
496 \p{Is_Brahmi}) (128)
497 \p{Block: Braille} \p{Block=Braille_Patterns} (256)
498 \p{Block: Braille_Patterns} (Short: \p{Blk=Braille},
499 \p{InBraille}) (256)
500 \p{Block: Buginese} (Single: \p{InBuginese}; NOT \p{Buginese}
501 NOR \p{Is_Buginese}) (32)
502 \p{Block: Buhid} (Single: \p{InBuhid}; NOT \p{Buhid} NOR
503 \p{Is_Buhid}) (32)
504 \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
505 (256)
506 \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
507 ByzantineMusic}, \p{InByzantineMusic})
508 (256)
509 \p{Block: Canadian_Syllabics} \p{Block=
510 Unified_Canadian_Aboriginal_Syllabics}
511 (640)
512 \p{Block: Carian} (Single: \p{InCarian}; NOT \p{Carian} NOR
513 \p{Is_Carian}) (64)
514 \p{Block: Chakma} (Single: \p{InChakma}; NOT \p{Chakma} NOR
515 \p{Is_Chakma}) (80)
516 \p{Block: Cham} (Single: \p{InCham}; NOT \p{Cham} NOR
517 \p{Is_Cham}) (96)
518 \p{Block: Cherokee} (Single: \p{InCherokee}; NOT \p{Cherokee}
519 NOR \p{Is_Cherokee}) (96)
520 \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992)
521 \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256)
522 \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
523 \p{Block: CJK_Compat_Ideographs} \p{Block=
524 CJK_Compatibility_Ideographs} (512)
525 \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
526 CJK_Compatibility_Ideographs_Supplement}
527 (544)
528 \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat},
529 \p{InCJKCompat}) (256)
530 \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms},
531 \p{InCJKCompatForms}) (32)
532 \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
533 CJKCompatIdeographs},
534 \p{InCJKCompatIdeographs}) (512)
535 \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
536 CJKCompatIdeographsSup},
537 \p{InCJKCompatIdeographsSup}) (544)
538 \p{Block: CJK_Ext_A} \p{Block=
539 CJK_Unified_Ideographs_Extension_A}
540 (6592)
541 \p{Block: CJK_Ext_B} \p{Block=
542 CJK_Unified_Ideographs_Extension_B}
543 (42_720)
544 \p{Block: CJK_Ext_C} \p{Block=
545 CJK_Unified_Ideographs_Extension_C}
546 (4160)
547 \p{Block: CJK_Ext_D} \p{Block=
548 CJK_Unified_Ideographs_Extension_D} (224)
549 \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
550 \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup},
551 \p{InCJKRadicalsSup}) (128)
552 \p{Block: CJK_Strokes} (Single: \p{InCJKStrokes}) (48)
553 \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64)
554 \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols},
555 \p{InCJKSymbols}) (64)
556 \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}, \p{InCJK})
557 (20_992)
558 \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
559 CJKExtA}, \p{InCJKExtA}) (6592)
560 \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
561 CJKExtB}, \p{InCJKExtB}) (42_720)
562 \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
563 CJKExtC}, \p{InCJKExtC}) (4160)
564 \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
565 CJKExtD}, \p{InCJKExtD}) (224)
566 \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
567 Diacriticals}, \p{InDiacriticals}) (112)
568 \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
569 DiacriticalsForSymbols},
570 \p{InDiacriticalsForSymbols}) (48)
571 \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
572 DiacriticalsSup}, \p{InDiacriticalsSup})
573 (64)
574 \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks},
575 \p{InHalfMarks}) (16)
576 \p{Block: Combining_Marks_For_Symbols} \p{Block=
577 Combining_Diacritical_Marks_For_Symbols}
578 (48)
579 \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
580 IndicNumberForms},
581 \p{InIndicNumberForms}) (16)
582 \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96)
583 \p{Block: Control_Pictures} (Single: \p{InControlPictures}) (64)
584 \p{Block: Coptic} (Single: \p{InCoptic}; NOT \p{Coptic} NOR
585 \p{Is_Coptic}) (128)
586 \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
587 \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod},
588 \p{InCountingRod}) (32)
589 \p{Block: Cuneiform} (Single: \p{InCuneiform}; NOT
590 \p{Cuneiform} NOR \p{Is_Cuneiform})
591 (1024)
592 \p{Block: Cuneiform_Numbers} \p{Block=
593 Cuneiform_Numbers_And_Punctuation} (128)
594 \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
595 CuneiformNumbers},
596 \p{InCuneiformNumbers}) (128)
597 \p{Block: Currency_Symbols} (Single: \p{InCurrencySymbols}) (48)
598 \p{Block: Cypriot_Syllabary} (Single: \p{InCypriotSyllabary}) (64)
599 \p{Block: Cyrillic} (Single: \p{InCyrillic}; NOT \p{Cyrillic}
600 NOR \p{Is_Cyrillic}) (256)
601 \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
602 \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
603 \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA},
604 \p{InCyrillicExtA}) (32)
605 \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB},
606 \p{InCyrillicExtB}) (96)
607 \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
608 \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup},
609 \p{InCyrillicSup}) (48)
610 \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
611 (48)
612 \p{Block: Deseret} (Single: \p{InDeseret}) (80)
613 \p{Block: Devanagari} (Single: \p{InDevanagari}; NOT
614 \p{Devanagari} NOR \p{Is_Devanagari})
615 (128)
616 \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
617 \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt},
618 \p{InDevanagariExt}) (32)
619 \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
620 \p{Block: Diacriticals_For_Symbols} \p{Block=
621 Combining_Diacritical_Marks_For_Symbols}
622 (48)
623 \p{Block: Diacriticals_Sup} \p{Block=
624 Combining_Diacritical_Marks_Supplement}
625 (64)
626 \p{Block: Dingbats} (Single: \p{InDingbats}) (192)
627 \p{Block: Domino} \p{Block=Domino_Tiles} (112)
628 \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}, \p{InDomino}) (112)
629 \p{Block: Egyptian_Hieroglyphs} (Single:
630 \p{InEgyptianHieroglyphs}; NOT
631 \p{Egyptian_Hieroglyphs} NOR
632 \p{Is_Egyptian_Hieroglyphs}) (1072)
633 \p{Block: Emoticons} (Single: \p{InEmoticons}) (80)
634 \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
635 \p{Block: Enclosed_Alphanum_Sup} \p{Block=
636 Enclosed_Alphanumeric_Supplement} (256)
637 \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
638 EnclosedAlphanumSup},
639 \p{InEnclosedAlphanumSup}) (256)
640 \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
641 EnclosedAlphanum},
642 \p{InEnclosedAlphanum}) (160)
643 \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
644 (256)
645 \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
646 EnclosedCJK}, \p{InEnclosedCJK}) (256)
647 \p{Block: Enclosed_Ideographic_Sup} \p{Block=
648 Enclosed_Ideographic_Supplement} (256)
649 \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
650 EnclosedIdeographicSup},
651 \p{InEnclosedIdeographicSup}) (256)
652 \p{Block: Ethiopic} (Single: \p{InEthiopic}; NOT \p{Ethiopic}
653 NOR \p{Is_Ethiopic}) (384)
654 \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
655 \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
656 \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt},
657 \p{InEthiopicExt}) (96)
658 \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA},
659 \p{InEthiopicExtA}) (48)
660 \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
661 \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup},
662 \p{InEthiopicSup}) (32)
663 \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation},
664 \p{InPunctuation}; NOT \p{Punct} NOR
665 \p{Is_Punctuation}) (112)
666 \p{Block: Geometric_Shapes} (Single: \p{InGeometricShapes}) (96)
667 \p{Block: Georgian} (Single: \p{InGeorgian}; NOT \p{Georgian}
668 NOR \p{Is_Georgian}) (96)
669 \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
670 \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup},
671 \p{InGeorgianSup}) (48)
672 \p{Block: Glagolitic} (Single: \p{InGlagolitic}; NOT
673 \p{Glagolitic} NOR \p{Is_Glagolitic})
674 (96)
675 \p{Block: Gothic} (Single: \p{InGothic}; NOT \p{Gothic} NOR
676 \p{Is_Gothic}) (32)
677 \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek}
678 NOR \p{Is_Greek}) (144)
679 \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}, \p{InGreek};
680 NOT \p{Greek} NOR \p{Is_Greek}) (144)
681 \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256)
682 \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt},
683 \p{InGreekExt}) (256)
684 \p{Block: Gujarati} (Single: \p{InGujarati}; NOT \p{Gujarati}
685 NOR \p{Is_Gujarati}) (128)
686 \p{Block: Gurmukhi} (Single: \p{InGurmukhi}; NOT \p{Gurmukhi}
687 NOR \p{Is_Gurmukhi}) (128)
688 \p{Block: Half_And_Full_Forms} \p{Block=
689 Halfwidth_And_Fullwidth_Forms} (240)
690 \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16)
691 \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
692 HalfAndFullForms},
693 \p{InHalfAndFullForms}) (240)
694 \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul}
695 NOR \p{Is_Hangul}) (11_184)
696 \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo},
697 \p{InCompatJamo}) (96)
698 \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}, \p{InJamo}) (256)
699 \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA},
700 \p{InJamoExtA}) (32)
701 \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB},
702 \p{InJamoExtB}) (80)
703 \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}, \p{InHangul};
704 NOT \p{Hangul} NOR \p{Is_Hangul})
705 (11_184)
706 \p{Block: Hanunoo} (Single: \p{InHanunoo}; NOT \p{Hanunoo}
707 NOR \p{Is_Hanunoo}) (32)
708 \p{Block: Hebrew} (Single: \p{InHebrew}; NOT \p{Hebrew} NOR
709 \p{Is_Hebrew}) (112)
710 \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
711 HighPUSurrogates},
712 \p{InHighPUSurrogates}) (128)
713 \p{Block: High_PU_Surrogates} \p{Block=
714 High_Private_Use_Surrogates} (128)
715 \p{Block: High_Surrogates} (Single: \p{InHighSurrogates}) (896)
716 \p{Block: Hiragana} (Single: \p{InHiragana}; NOT \p{Hiragana}
717 NOR \p{Is_Hiragana}) (96)
718 \p{Block: IDC} \p{Block=
719 Ideographic_Description_Characters} (NOT
720 \p{ID_Continue} NOR \p{Is_IDC}) (16)
721 \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC},
722 \p{InIDC}; NOT \p{ID_Continue} NOR
723 \p{Is_IDC}) (16)
724 \p{Block: Imperial_Aramaic} (Single: \p{InImperialAramaic}; NOT
725 \p{Imperial_Aramaic} NOR
726 \p{Is_Imperial_Aramaic}) (32)
727 \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
728 (16)
729 \p{Block: Inscriptional_Pahlavi} (Single:
730 \p{InInscriptionalPahlavi}; NOT
731 \p{Inscriptional_Pahlavi} NOR
732 \p{Is_Inscriptional_Pahlavi}) (32)
733 \p{Block: Inscriptional_Parthian} (Single:
734 \p{InInscriptionalParthian}; NOT
735 \p{Inscriptional_Parthian} NOR
736 \p{Is_Inscriptional_Parthian}) (32)
737 \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
738 \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}, \p{InIPAExt})
739 (96)
740 \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
741 \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
742 \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
743 \p{Block: Javanese} (Single: \p{InJavanese}; NOT \p{Javanese}
744 NOR \p{Is_Javanese}) (96)
745 \p{Block: Kaithi} (Single: \p{InKaithi}; NOT \p{Kaithi} NOR
746 \p{Is_Kaithi}) (80)
747 \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
748 \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}, \p{InKanaSup})
749 (256)
750 \p{Block: Kanbun} (Single: \p{InKanbun}) (16)
751 \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
752 \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}, \p{InKangxi})
753 (224)
754 \p{Block: Kannada} (Single: \p{InKannada}; NOT \p{Kannada}
755 NOR \p{Is_Kannada}) (128)
756 \p{Block: Katakana} (Single: \p{InKatakana}; NOT \p{Katakana}
757 NOR \p{Is_Katakana}) (96)
758 \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
759 \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
760 KatakanaExt}, \p{InKatakanaExt}) (16)
761 \p{Block: Kayah_Li} (Single: \p{InKayahLi}) (48)
762 \p{Block: Kharoshthi} (Single: \p{InKharoshthi}; NOT
763 \p{Kharoshthi} NOR \p{Is_Kharoshthi})
764 (96)
765 \p{Block: Khmer} (Single: \p{InKhmer}; NOT \p{Khmer} NOR
766 \p{Is_Khmer}) (128)
767 \p{Block: Khmer_Symbols} (Single: \p{InKhmerSymbols}) (32)
768 \p{Block: Lao} (Single: \p{InLao}; NOT \p{Lao} NOR
769 \p{Is_Lao}) (128)
770 \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
771 \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
772 \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1},
773 \p{InLatin1}) (128)
774 \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
775 \p{Block: Latin_Ext_Additional} \p{Block=
776 Latin_Extended_Additional} (256)
777 \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
778 \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
779 \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
780 \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA},
781 \p{InLatinExtA}) (128)
782 \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
783 LatinExtAdditional},
784 \p{InLatinExtAdditional}) (256)
785 \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB},
786 \p{InLatinExtB}) (208)
787 \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC},
788 \p{InLatinExtC}) (32)
789 \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD},
790 \p{InLatinExtD}) (224)
791 \p{Block: Lepcha} (Single: \p{InLepcha}; NOT \p{Lepcha} NOR
792 \p{Is_Lepcha}) (80)
793 \p{Block: Letterlike_Symbols} (Single: \p{InLetterlikeSymbols})
794 (80)
795 \p{Block: Limbu} (Single: \p{InLimbu}; NOT \p{Limbu} NOR
796 \p{Is_Limbu}) (80)
797 \p{Block: Linear_B_Ideograms} (Single: \p{InLinearBIdeograms})
798 (128)
799 \p{Block: Linear_B_Syllabary} (Single: \p{InLinearBSyllabary})
800 (128)
801 \p{Block: Lisu} (Single: \p{InLisu}) (48)
802 \p{Block: Low_Surrogates} (Single: \p{InLowSurrogates}) (1024)
803 \p{Block: Lycian} (Single: \p{InLycian}; NOT \p{Lycian} NOR
804 \p{Is_Lycian}) (32)
805 \p{Block: Lydian} (Single: \p{InLydian}; NOT \p{Lydian} NOR
806 \p{Is_Lydian}) (32)
807 \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
808 \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}, \p{InMahjong})
809 (48)
810 \p{Block: Malayalam} (Single: \p{InMalayalam}; NOT
811 \p{Malayalam} NOR \p{Is_Malayalam}) (128)
812 \p{Block: Mandaic} (Single: \p{InMandaic}; NOT \p{Mandaic}
813 NOR \p{Is_Mandaic}) (32)
814 \p{Block: Math_Alphanum} \p{Block=
815 Mathematical_Alphanumeric_Symbols} (1024)
816 \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
817 \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
818 MathAlphanum}, \p{InMathAlphanum}) (1024)
819 \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators},
820 \p{InMathOperators}) (256)
821 \p{Block: Meetei_Mayek} (Single: \p{InMeeteiMayek}; NOT
822 \p{Meetei_Mayek} NOR
823 \p{Is_Meetei_Mayek}) (64)
824 \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
825 \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt},
826 \p{InMeeteiMayekExt}) (32)
827 \p{Block: Meroitic_Cursive} (Single: \p{InMeroiticCursive}; NOT
828 \p{Meroitic_Cursive} NOR
829 \p{Is_Meroitic_Cursive}) (96)
830 \p{Block: Meroitic_Hieroglyphs} (Single:
831 \p{InMeroiticHieroglyphs}) (32)
832 \p{Block: Miao} (Single: \p{InMiao}; NOT \p{Miao} NOR
833 \p{Is_Miao}) (160)
834 \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
835 (256)
836 \p{Block: Misc_Math_Symbols_A} \p{Block=
837 Miscellaneous_Mathematical_Symbols_A}
838 (48)
839 \p{Block: Misc_Math_Symbols_B} \p{Block=
840 Miscellaneous_Mathematical_Symbols_B}
841 (128)
842 \p{Block: Misc_Pictographs} \p{Block=
843 Miscellaneous_Symbols_And_Pictographs}
844 (768)
845 \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
846 \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
847 \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
848 MiscMathSymbolsA},
849 \p{InMiscMathSymbolsA}) (48)
850 \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
851 MiscMathSymbolsB},
852 \p{InMiscMathSymbolsB}) (128)
853 \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols},
854 \p{InMiscSymbols}) (256)
855 \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
856 MiscArrows}, \p{InMiscArrows}) (256)
857 \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
858 MiscPictographs}, \p{InMiscPictographs})
859 (768)
860 \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical},
861 \p{InMiscTechnical}) (256)
862 \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
863 \p{Block: Modifier_Tone_Letters} (Single:
864 \p{InModifierToneLetters}) (32)
865 \p{Block: Mongolian} (Single: \p{InMongolian}; NOT
866 \p{Mongolian} NOR \p{Is_Mongolian}) (176)
867 \p{Block: Music} \p{Block=Musical_Symbols} (256)
868 \p{Block: Musical_Symbols} (Short: \p{Blk=Music}, \p{InMusic})
869 (256)
870 \p{Block: Myanmar} (Single: \p{InMyanmar}; NOT \p{Myanmar}
871 NOR \p{Is_Myanmar}) (160)
872 \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
873 \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA},
874 \p{InMyanmarExtA}) (32)
875 \p{Block: NB} \p{Block=No_Block} (860_672)
876 \p{Block: New_Tai_Lue} (Single: \p{InNewTaiLue}; NOT
877 \p{New_Tai_Lue} NOR \p{Is_New_Tai_Lue})
878 (96)
879 \p{Block: NKo} (Single: \p{InNKo}; NOT \p{Nko} NOR
880 \p{Is_NKo}) (64)
881 \p{Block: No_Block} (Short: \p{Blk=NB}, \p{InNB}) (860_672)
882 \p{Block: Number_Forms} (Single: \p{InNumberForms}) (64)
883 \p{Block: OCR} \p{Block=Optical_Character_Recognition}
884 (32)
885 \p{Block: Ogham} (Single: \p{InOgham}; NOT \p{Ogham} NOR
886 \p{Is_Ogham}) (32)
887 \p{Block: Ol_Chiki} (Single: \p{InOlChiki}) (48)
888 \p{Block: Old_Italic} (Single: \p{InOldItalic}; NOT
889 \p{Old_Italic} NOR \p{Is_Old_Italic})
890 (48)
891 \p{Block: Old_Persian} (Single: \p{InOldPersian}; NOT
892 \p{Old_Persian} NOR \p{Is_Old_Persian})
893 (64)
894 \p{Block: Old_South_Arabian} (Single: \p{InOldSouthArabian}) (32)
895 \p{Block: Old_Turkic} (Single: \p{InOldTurkic}; NOT
896 \p{Old_Turkic} NOR \p{Is_Old_Turkic})
897 (80)
898 \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR},
899 \p{InOCR}) (32)
900 \p{Block: Oriya} (Single: \p{InOriya}; NOT \p{Oriya} NOR
901 \p{Is_Oriya}) (128)
902 \p{Block: Osmanya} (Single: \p{InOsmanya}; NOT \p{Osmanya}
903 NOR \p{Is_Osmanya}) (48)
904 \p{Block: Phags_Pa} (Single: \p{InPhagsPa}; NOT \p{Phags_Pa}
905 NOR \p{Is_Phags_Pa}) (64)
906 \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
907 \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}, \p{InPhaistos})
908 (48)
909 \p{Block: Phoenician} (Single: \p{InPhoenician}; NOT
910 \p{Phoenician} NOR \p{Is_Phoenician})
911 (32)
912 \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
913 \p{Block: Phonetic_Ext_Sup} \p{Block=
914 Phonetic_Extensions_Supplement} (64)
915 \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt},
916 \p{InPhoneticExt}) (128)
917 \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
918 PhoneticExtSup}, \p{InPhoneticExtSup})
919 (64)
920 \p{Block: Playing_Cards} (Single: \p{InPlayingCards}) (96)
921 \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
922 \p{Private_Use} NOR \p{Is_Private_Use})
923 (6400)
924 \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}, \p{InPUA}; NOT
925 \p{Private_Use} NOR \p{Is_Private_Use})
926 (6400)
927 \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
928 \p{Private_Use} NOR \p{Is_Private_Use})
929 (6400)
930 \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
931 \p{Punct} NOR \p{Is_Punctuation}) (112)
932 \p{Block: Rejang} (Single: \p{InRejang}; NOT \p{Rejang} NOR
933 \p{Is_Rejang}) (48)
934 \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
935 \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}, \p{InRumi})
936 (32)
937 \p{Block: Runic} (Single: \p{InRunic}; NOT \p{Runic} NOR
938 \p{Is_Runic}) (96)
939 \p{Block: Samaritan} (Single: \p{InSamaritan}; NOT
940 \p{Samaritan} NOR \p{Is_Samaritan}) (64)
941 \p{Block: Saurashtra} (Single: \p{InSaurashtra}; NOT
942 \p{Saurashtra} NOR \p{Is_Saurashtra})
943 (96)
944 \p{Block: Sharada} (Single: \p{InSharada}; NOT \p{Sharada}
945 NOR \p{Is_Sharada}) (96)
946 \p{Block: Shavian} (Single: \p{InShavian}) (48)
947 \p{Block: Sinhala} (Single: \p{InSinhala}; NOT \p{Sinhala}
948 NOR \p{Is_Sinhala}) (128)
949 \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms},
950 \p{InSmallForms}) (32)
951 \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
952 \p{Block: Sora_Sompeng} (Single: \p{InSoraSompeng}; NOT
953 \p{Sora_Sompeng} NOR
954 \p{Is_Sora_Sompeng}) (48)
955 \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
956 ModifierLetters}, \p{InModifierLetters})
957 (80)
958 \p{Block: Specials} (Single: \p{InSpecials}) (16)
959 \p{Block: Sundanese} (Single: \p{InSundanese}; NOT
960 \p{Sundanese} NOR \p{Is_Sundanese}) (64)
961 \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
962 \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup},
963 \p{InSundaneseSup}) (16)
964 \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
965 \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
966 \p{Block: Sup_Math_Operators} \p{Block=
967 Supplemental_Mathematical_Operators}
968 (256)
969 \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
970 (65_536)
971 \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
972 (65_536)
973 \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
974 \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
975 \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
976 SuperAndSub}, \p{InSuperAndSub}) (48)
977 \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA},
978 \p{InSupArrowsA}) (16)
979 \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB},
980 \p{InSupArrowsB}) (128)
981 \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
982 SupMathOperators},
983 \p{InSupMathOperators}) (256)
984 \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
985 SupPunctuation}, \p{InSupPunctuation})
986 (128)
987 \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
988 SupPUAA}, \p{InSupPUAA}) (65_536)
989 \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
990 SupPUAB}, \p{InSupPUAB}) (65_536)
991 \p{Block: Syloti_Nagri} (Single: \p{InSylotiNagri}; NOT
992 \p{Syloti_Nagri} NOR
993 \p{Is_Syloti_Nagri}) (48)
994 \p{Block: Syriac} (Single: \p{InSyriac}; NOT \p{Syriac} NOR
995 \p{Is_Syriac}) (80)
996 \p{Block: Tagalog} (Single: \p{InTagalog}; NOT \p{Tagalog}
997 NOR \p{Is_Tagalog}) (32)
998 \p{Block: Tagbanwa} (Single: \p{InTagbanwa}; NOT \p{Tagbanwa}
999 NOR \p{Is_Tagbanwa}) (32)
1000 \p{Block: Tags} (Single: \p{InTags}) (128)
1001 \p{Block: Tai_Le} (Single: \p{InTaiLe}; NOT \p{Tai_Le} NOR
1002 \p{Is_Tai_Le}) (48)
1003 \p{Block: Tai_Tham} (Single: \p{InTaiTham}; NOT \p{Tai_Tham}
1004 NOR \p{Is_Tai_Tham}) (144)
1005 \p{Block: Tai_Viet} (Single: \p{InTaiViet}; NOT \p{Tai_Viet}
1006 NOR \p{Is_Tai_Viet}) (96)
1007 \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1008 \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing},
1009 \p{InTaiXuanJing}) (96)
1010 \p{Block: Takri} (Single: \p{InTakri}; NOT \p{Takri} NOR
1011 \p{Is_Takri}) (80)
1012 \p{Block: Tamil} (Single: \p{InTamil}; NOT \p{Tamil} NOR
1013 \p{Is_Tamil}) (128)
1014 \p{Block: Telugu} (Single: \p{InTelugu}; NOT \p{Telugu} NOR
1015 \p{Is_Telugu}) (128)
1016 \p{Block: Thaana} (Single: \p{InThaana}; NOT \p{Thaana} NOR
1017 \p{Is_Thaana}) (64)
1018 \p{Block: Thai} (Single: \p{InThai}; NOT \p{Thai} NOR
1019 \p{Is_Thai}) (128)
1020 \p{Block: Tibetan} (Single: \p{InTibetan}; NOT \p{Tibetan}
1021 NOR \p{Is_Tibetan}) (256)
1022 \p{Block: Tifinagh} (Single: \p{InTifinagh}; NOT \p{Tifinagh}
1023 NOR \p{Is_Tifinagh}) (80)
1024 \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1025 (128)
1026 \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1027 TransportAndMap}, \p{InTransportAndMap})
1028 (128)
1029 \p{Block: UCAS} \p{Block=
1030 Unified_Canadian_Aboriginal_Syllabics}
1031 (640)
1032 \p{Block: UCAS_Ext} \p{Block=
1033 Unified_Canadian_Aboriginal_Syllabics_-
1034 Extended} (80)
1035 \p{Block: Ugaritic} (Single: \p{InUgaritic}; NOT \p{Ugaritic}
1036 NOR \p{Is_Ugaritic}) (32)
1037 \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1038 UCAS}, \p{InUCAS}) (640)
1039 \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1040 \p{Blk=UCASExt}, \p{InUCASExt}) (80)
1041 \p{Block: Vai} (Single: \p{InVai}; NOT \p{Vai} NOR
1042 \p{Is_Vai}) (320)
1043 \p{Block: Variation_Selectors} (Short: \p{Blk=VS}, \p{InVS}; NOT
1044 \p{Variation_Selector} NOR \p{Is_VS})
1045 (16)
1046 \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup},
1047 \p{InVSSup}) (240)
1048 \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
1049 \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt},
1050 \p{InVedicExt}) (48)
1051 \p{Block: Vertical_Forms} (Single: \p{InVerticalForms}) (16)
1052 \p{Block: VS} \p{Block=Variation_Selectors} (NOT
1053 \p{Variation_Selector} NOR \p{Is_VS})
1054 (16)
1055 \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
1056 (240)
1057 \p{Block: Yi_Radicals} (Single: \p{InYiRadicals}) (64)
1058 \p{Block: Yi_Syllables} (Single: \p{InYiSyllables}) (1168)
1059 \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
1060 \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing},
1061 \p{InYijing}) (64)
1062 X \p{Block_Elements} \p{Block=Block_Elements} (32)
1063 \p{Bopo} \p{Bopomofo} (= \p{Script=Bopomofo}) (NOT
1064 \p{Block=Bopomofo}) (70)
1065 \p{Bopomofo} \p{Script=Bopomofo} (Short: \p{Bopo}; NOT
1066 \p{Block=Bopomofo}) (70)
1067 X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
1068 Bopomofo_Extended}) (32)
1069 X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
1070 \p{InBopomofoExt}) (32)
1071 X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
1072 \p{Brah} \p{Brahmi} (= \p{Script=Brahmi}) (NOT
1073 \p{Block=Brahmi}) (108)
1074 \p{Brahmi} \p{Script=Brahmi} (Short: \p{Brah}; NOT
1075 \p{Block=Brahmi}) (108)
1076 \p{Brai} \p{Braille} (= \p{Script=Braille}) (256)
1077 \p{Braille} \p{Script=Braille} (Short: \p{Brai}) (256)
1078 X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
1079 \p{InBraille}) (256)
1080 \p{Bugi} \p{Buginese} (= \p{Script=Buginese}) (NOT
1081 \p{Block=Buginese}) (30)
1082 \p{Buginese} \p{Script=Buginese} (Short: \p{Bugi}; NOT
1083 \p{Block=Buginese}) (30)
1084 \p{Buhd} \p{Buhid} (= \p{Script=Buhid}) (NOT
1085 \p{Block=Buhid}) (20)
1086 \p{Buhid} \p{Script=Buhid} (Short: \p{Buhd}; NOT
1087 \p{Block=Buhid}) (20)
1088 X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
1089 Byzantine_Musical_Symbols}) (256)
1090 X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1091 (Short: \p{InByzantineMusic}) (256)
1092 \p{C} \p{Other} (= \p{General_Category=Other})
1093 (1_004_135)
1094 \p{Cakm} \p{Chakma} (= \p{Script=Chakma}) (NOT
1095 \p{Block=Chakma}) (67)
1096 \p{Canadian_Aboriginal} \p{Script=Canadian_Aboriginal} (Short:
1097 \p{Cans}) (710)
1098 X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
1099 (= \p{Block=
1100 Unified_Canadian_Aboriginal_Syllabics})
1101 (640)
1102 T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1103 Not_Reordered} (1_113_459)
1104 T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1105 Overlay} (26)
1106 T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1107 Nukta} (13)
1108 T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1109 Kana_Voicing} (2)
1110 T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1111 Virama} (37)
1112 T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1113 CCC10} (1)
1114 T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1115 CCC11} (1)
1116 T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1117 CCC12} (1)
1118 T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1119 CCC13} (1)
1120 T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1121 CCC14} (1)
1122 T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1123 CCC15} (1)
1124 T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1125 CCC16} (1)
1126 T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1127 CCC17} (1)
1128 T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1129 CCC18} (2)
1130 T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1131 CCC19} (2)
1132 T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1133 CCC20} (1)
1134 T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1135 CCC21} (1)
1136 T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1137 CCC22} (1)
1138 T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1139 CCC23} (1)
1140 T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1141 CCC24} (1)
1142 T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1143 CCC25} (1)
1144 T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1145 CCC26} (1)
1146 T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1147 CCC27} (2)
1148 T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1149 CCC28} (2)
1150 T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1151 CCC29} (2)
1152 T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1153 CCC30} (2)
1154 T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1155 CCC31} (2)
1156 T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1157 CCC32} (2)
1158 T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1159 CCC33} (1)
1160 T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1161 CCC34} (1)
1162 T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1163 CCC35} (1)
1164 T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1165 CCC36} (1)
1166 T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1167 CCC84} (1)
1168 T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1169 CCC91} (1)
1170 T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1171 CCC103} (2)
1172 T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1173 CCC107} (4)
1174 T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1175 CCC118} (2)
1176 T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1177 CCC122} (4)
1178 T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1179 CCC129} (1)
1180 T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1181 CCC130} (6)
1182 T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1183 CCC132} (1)
1184 T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1185 Attached_Below_Left} (0)
1186 T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1187 Attached_Below} (5)
1188 T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1189 Attached_Above} (1)
1190 T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1191 Attached_Above_Right} (9)
1192 T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1193 Below_Left} (1)
1194 T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1195 Below} (129)
1196 T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1197 Below_Right} (4)
1198 T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1199 Left} (2)
1200 T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1201 Right} (1)
1202 T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1203 Above_Left} (3)
1204 T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1205 Above} (349)
1206 T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1207 Above_Right} (4)
1208 T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1209 Double_Below} (4)
1210 T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1211 Double_Above} (5)
1212 T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1213 Iota_Subscript} (1)
1214 \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1215 Above} (349)
1216 \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (349)
1217 \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
1218 \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
1219 \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1220 Above_Left} (3)
1221 \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1222 Above_Right} (4)
1223 \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1224 Attached_Above} (1)
1225 \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1226 Attached_Above_Right} (9)
1227 \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1228 Attached_Below} (5)
1229 \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1230 Attached_Below_Left} (0)
1231 \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1232 (1)
1233 \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1234 \p{Ccc=ATAR}) (9)
1235 \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1236 (5)
1237 \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1238 ATBL}) (0)
1239 \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1240 Below} (129)
1241 \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (129)
1242 \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
1243 \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
1244 \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1245 Below_Left} (1)
1246 \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1247 Below_Right} (4)
1248 \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1)
1249 \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2)
1250 \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4)
1251 \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1)
1252 \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2)
1253 \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1)
1254 \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4)
1255 \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1)
1256 \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1)
1257 \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6)
1258 \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1)
1259 \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1)
1260 \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1)
1261 \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1)
1262 \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1)
1263 \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2)
1264 \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2)
1265 \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1)
1266 \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1)
1267 \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1)
1268 \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1)
1269 \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1)
1270 \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1)
1271 \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1)
1272 \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2)
1273 \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2)
1274 \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2)
1275 \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2)
1276 \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2)
1277 \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2)
1278 \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1)
1279 \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1)
1280 \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1)
1281 \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1)
1282 \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1)
1283 \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1)
1284 \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1285 Double_Above} (5)
1286 \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1287 Double_Below} (4)
1288 \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
1289 \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (4)
1290 \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1291 (1)
1292 \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1293 Iota_Subscript} (1)
1294 \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
1295 \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1296 Kana_Voicing} (2)
1297 \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1298 Left} (2)
1299 \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
1300 \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1301 Nukta} (13)
1302 \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1303 (1_113_459)
1304 \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1305 Not_Reordered} (1_113_459)
1306 \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (13)
1307 \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1308 Overlay} (26)
1309 \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (26)
1310 \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1311 Right} (1)
1312 \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
1313 \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (37)
1314 \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1315 Virama} (37)
1316 \p{Cans} \p{Canadian_Aboriginal} (= \p{Script=
1317 Canadian_Aboriginal}) (710)
1318 \p{Cari} \p{Carian} (= \p{Script=Carian}) (NOT
1319 \p{Block=Carian}) (49)
1320 \p{Carian} \p{Script=Carian} (Short: \p{Cari}; NOT
1321 \p{Block=Carian}) (49)
1322 \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (1799)
1323 \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_112_313)
1324 \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (1799)
1325 \p{Cased} \p{Cased=Y} (3448)
1326 \p{Cased: N*} (Single: \P{Cased}) (1_110_664)
1327 \p{Cased: Y*} (Single: \p{Cased}) (3448)
1328 \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
1329 \p{LC}) (3223)
1330 \p{Category: *} \p{General_Category: *}
1331 \p{Cc} \p{Cntrl} (= \p{General_Category=Control})
1332 (65)
1333 \p{Ccc: *} \p{Canonical_Combining_Class: *}
1334 \p{CE} \p{Composition_Exclusion} (=
1335 \p{Composition_Exclusion=Y}) (81)
1336 \p{CE: *} \p{Composition_Exclusion: *}
1337 \p{Cf} \p{Format} (= \p{General_Category=Format})
1338 (139)
1339 \p{Chakma} \p{Script=Chakma} (Short: \p{Cakm}; NOT
1340 \p{Block=Chakma}) (67)
1341 \p{Cham} \p{Script=Cham} (NOT \p{Block=Cham}) (83)
1342 \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
1343 \p{CWCF}) (1107)
1344 \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
1345 (1_113_005)
1346 \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
1347 (1107)
1348 \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
1349 \p{CWCM}) (2138)
1350 \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
1351 (1_111_974)
1352 \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
1353 (2138)
1354 \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
1355 \p{CWL}) (1043)
1356 \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
1357 (1_113_069)
1358 \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1043)
1359 \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
1360 Y} (Short: \p{CWKCF}) (9944)
1361 \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
1362 \P{CWKCF}) (1_104_168)
1363 \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
1364 \p{CWKCF}) (9944)
1365 \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
1366 \p{CWT}) (1099)
1367 \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
1368 (1_113_013)
1369 \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1099)
1370 \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
1371 \p{CWU}) (1126)
1372 \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
1373 (1_112_986)
1374 \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1126)
1375 \p{Cher} \p{Cherokee} (= \p{Script=Cherokee}) (NOT
1376 \p{Block=Cherokee}) (85)
1377 \p{Cherokee} \p{Script=Cherokee} (Short: \p{Cher}; NOT
1378 \p{Block=Cherokee}) (85)
1379 \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
1380 Y}) (1799)
1381 \p{CI: *} \p{Case_Ignorable: *}
1382 X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
1383 CJK_Unified_Ideographs}) (20_992)
1384 X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
1385 CJK_Compatibility}) (256)
1386 X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
1387 CJK_Compatibility_Forms}) (32)
1388 X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
1389 \p{Block=CJK_Compatibility_Ideographs})
1390 (512)
1391 X \p{CJK_Compat_Ideographs_Sup}
1392 \p{CJK_Compatibility_Ideographs_-
1393 Supplement} (= \p{Block=
1394 CJK_Compatibility_Ideographs_-
1395 Supplement}) (544)
1396 X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
1397 \p{InCJKCompat}) (256)
1398 X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
1399 (Short: \p{InCJKCompatForms}) (32)
1400 X \p{CJK_Compatibility_Ideographs} \p{Block=
1401 CJK_Compatibility_Ideographs} (Short:
1402 \p{InCJKCompatIdeographs}) (512)
1403 X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
1404 CJK_Compatibility_Ideographs_Supplement}
1405 (Short: \p{InCJKCompatIdeographsSup})
1406 (544)
1407 X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
1408 \p{Block=
1409 CJK_Unified_Ideographs_Extension_A})
1410 (6592)
1411 X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
1412 \p{Block=
1413 CJK_Unified_Ideographs_Extension_B})
1414 (42_720)
1415 X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
1416 \p{Block=
1417 CJK_Unified_Ideographs_Extension_C})
1418 (4160)
1419 X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
1420 \p{Block=
1421 CJK_Unified_Ideographs_Extension_D})
1422 (224)
1423 X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
1424 CJK_Radicals_Supplement}) (128)
1425 X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
1426 (Short: \p{InCJKRadicalsSup}) (128)
1427 X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
1428 X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
1429 \p{Block=CJK_Symbols_And_Punctuation})
1430 (64)
1431 X \p{CJK_Symbols_And_Punctuation} \p{Block=
1432 CJK_Symbols_And_Punctuation} (Short:
1433 \p{InCJKSymbols}) (64)
1434 X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
1435 (Short: \p{InCJK}) (20_992)
1436 X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
1437 CJK_Unified_Ideographs_Extension_A}
1438 (Short: \p{InCJKExtA}) (6592)
1439 X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
1440 CJK_Unified_Ideographs_Extension_B}
1441 (Short: \p{InCJKExtB}) (42_720)
1442 X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
1443 CJK_Unified_Ideographs_Extension_C}
1444 (Short: \p{InCJKExtC}) (4160)
1445 X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
1446 CJK_Unified_Ideographs_Extension_D}
1447 (Short: \p{InCJKExtD}) (224)
1448 \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
1449 (Short: \p{Pe}) (71)
1450 \p{Cn} \p{Unassigned} (= \p{General_Category=
1451 Unassigned}) (864_415)
1452 \p{Cntrl} \p{General_Category=Control} Control
1453 characters (Short: \p{Cc}) (65)
1454 \p{Co} \p{Private_Use} (= \p{General_Category=
1455 Private_Use}) (NOT \p{Private_Use_Area})
1456 (137_468)
1457 X \p{Combining_Diacritical_Marks} \p{Block=
1458 Combining_Diacritical_Marks} (Short:
1459 \p{InDiacriticals}) (112)
1460 X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
1461 Combining_Diacritical_Marks_For_Symbols}
1462 (Short: \p{InDiacriticalsForSymbols})
1463 (48)
1464 X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
1465 Combining_Diacritical_Marks_Supplement}
1466 (Short: \p{InDiacriticalsSup}) (64)
1467 X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
1468 \p{InHalfMarks}) (16)
1469 \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
1470 (1645)
1471 X \p{Combining_Marks_For_Symbols}
1472 \p{Combining_Diacritical_Marks_For_-
1473 Symbols} (= \p{Block=
1474 Combining_Diacritical_Marks_For_-
1475 Symbols}) (48)
1476 \p{Common} \p{Script=Common} (Short: \p{Zyyy}) (6412)
1477 X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1478 (Short: \p{InIndicNumberForms}) (16)
1479 \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
1480 \p{Full_Composition_Exclusion=Y}) (1120)
1481 \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
1482 X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
1483 Hangul_Compatibility_Jamo}) (96)
1484 \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
1485 \p{CE}) (81)
1486 \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031)
1487 \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
1488 \p{Connector_Punctuation} \p{General_Category=
1489 Connector_Punctuation} (Short: \p{Pc})
1490 (10)
1491 \p{Control} \p{Cntrl} (= \p{General_Category=Control})
1492 (65)
1493 X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
1494 \p{Copt} \p{Coptic} (= \p{Script=Coptic}) (NOT
1495 \p{Block=Coptic}) (137)
1496 \p{Coptic} \p{Script=Coptic} (Short: \p{Copt}; NOT
1497 \p{Block=Coptic}) (137)
1498 X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
1499 Counting_Rod_Numerals}) (32)
1500 X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
1501 \p{InCountingRod}) (32)
1502 \p{Cprt} \p{Cypriot} (= \p{Script=Cypriot}) (55)
1503 \p{Cs} \p{Surrogate} (= \p{General_Category=
1504 Surrogate}) (2048)
1505 \p{Cuneiform} \p{Script=Cuneiform} (Short: \p{Xsux}; NOT
1506 \p{Block=Cuneiform}) (982)
1507 X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
1508 \p{Block=
1509 Cuneiform_Numbers_And_Punctuation}) (128)
1510 X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
1511 Cuneiform_Numbers_And_Punctuation}
1512 (Short: \p{InCuneiformNumbers}) (128)
1513 \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
1514 (Short: \p{Sc}) (48)
1515 X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
1516 \p{CWCF} \p{Changes_When_Casefolded} (=
1517 \p{Changes_When_Casefolded=Y}) (1107)
1518 \p{CWCF: *} \p{Changes_When_Casefolded: *}
1519 \p{CWCM} \p{Changes_When_Casemapped} (=
1520 \p{Changes_When_Casemapped=Y}) (2138)
1521 \p{CWCM: *} \p{Changes_When_Casemapped: *}
1522 \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
1523 \p{Changes_When_NFKC_Casefolded=Y})
1524 (9944)
1525 \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
1526 \p{CWL} \p{Changes_When_Lowercased} (=
1527 \p{Changes_When_Lowercased=Y}) (1043)
1528 \p{CWL: *} \p{Changes_When_Lowercased: *}
1529 \p{CWT} \p{Changes_When_Titlecased} (=
1530 \p{Changes_When_Titlecased=Y}) (1099)
1531 \p{CWT: *} \p{Changes_When_Titlecased: *}
1532 \p{CWU} \p{Changes_When_Uppercased} (=
1533 \p{Changes_When_Uppercased=Y}) (1126)
1534 \p{CWU: *} \p{Changes_When_Uppercased: *}
1535 \p{Cypriot} \p{Script=Cypriot} (Short: \p{Cprt}) (55)
1536 X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
1537 \p{Cyrillic} \p{Script=Cyrillic} (Short: \p{Cyrl}; NOT
1538 \p{Block=Cyrillic}) (417)
1539 X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
1540 Cyrillic_Extended_A}) (32)
1541 X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
1542 Cyrillic_Extended_B}) (96)
1543 X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
1544 \p{InCyrillicExtA}) (32)
1545 X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
1546 \p{InCyrillicExtB}) (96)
1547 X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
1548 Cyrillic_Supplement}) (48)
1549 X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
1550 \p{InCyrillicSup}) (48)
1551 X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
1552 Cyrillic_Supplement}) (48)
1553 \p{Cyrl} \p{Cyrillic} (= \p{Script=Cyrillic}) (NOT
1554 \p{Block=Cyrillic}) (417)
1555 \p{Dash} \p{Dash=Y} (27)
1556 \p{Dash: N*} (Single: \P{Dash}) (1_114_085)
1557 \p{Dash: Y*} (Single: \p{Dash}) (27)
1558 \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
1559 (Short: \p{Pd}) (23)
1560 \p{Decimal_Number} \p{Digit} (= \p{General_Category=
1561 Decimal_Number}) (460)
1562 \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
1563 (13_225)
1564 \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_225)
1565 \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240)
1566 \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
1567 \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
1568 \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
1569 \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
1570 \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
1571 \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184)
1572 \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
1573 \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
1574 \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
1575 \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
1576 \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
1577 \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
1578 \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
1579 \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
1580 \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
1581 \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
1582 \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
1583 \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
1584 \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
1585 Non_Canonical} (Perl extension) (3655)
1586 \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
1587 decompositions (Short: \p{Dt=NonCanon})
1588 (Perl extension) (3655)
1589 \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_232)
1590 \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
1591 \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
1592 \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (284)
1593 \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (284)
1594 \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38)
1595 \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (146)
1596 \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (146)
1597 \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
1598 \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
1599 \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
1600 \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
1601 Y} (Short: \p{DI}) (4167)
1602 \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
1603 (1_109_945)
1604 \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
1605 (4167)
1606 \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (111)
1607 \p{Dep: *} \p{Deprecated: *}
1608 \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (111)
1609 \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_001)
1610 \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (111)
1611 \p{Deseret} \p{Script=Deseret} (Short: \p{Dsrt}) (80)
1612 \p{Deva} \p{Devanagari} (= \p{Script=Devanagari})
1613 (NOT \p{Block=Devanagari}) (151)
1614 \p{Devanagari} \p{Script=Devanagari} (Short: \p{Deva};
1615 NOT \p{Block=Devanagari}) (151)
1616 X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
1617 Devanagari_Extended}) (32)
1618 X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
1619 \p{InDevanagariExt}) (32)
1620 \p{DI} \p{Default_Ignorable_Code_Point} (=
1621 \p{Default_Ignorable_Code_Point=Y})
1622 (4167)
1623 \p{DI: *} \p{Default_Ignorable_Code_Point: *}
1624 \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (693)
1625 \p{Dia: *} \p{Diacritic: *}
1626 \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (693)
1627 \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_419)
1628 \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (693)
1629 X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
1630 \p{Block=Combining_Diacritical_Marks})
1631 (112)
1632 X \p{Diacriticals_For_Symbols}
1633 \p{Combining_Diacritical_Marks_For_-
1634 Symbols} (= \p{Block=
1635 Combining_Diacritical_Marks_For_-
1636 Symbols}) (48)
1637 X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
1638 (= \p{Block=
1639 Combining_Diacritical_Marks_Supplement})
1640 (64)
1641 \p{Digit} \p{General_Category=Decimal_Number} [0-9]
1642 + all other decimal digits (Short:
1643 \p{Nd}) (460)
1644 X \p{Dingbats} \p{Block=Dingbats} (192)
1645 X \p{Domino} \p{Domino_Tiles} (= \p{Block=
1646 Domino_Tiles}) (112)
1647 X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
1648 \p{InDomino}) (112)
1649 \p{Dsrt} \p{Deseret} (= \p{Script=Deseret}) (80)
1650 \p{Dt: *} \p{Decomposition_Type: *}
1651 \p{Ea: *} \p{East_Asian_Width: *}
1652 \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_746)
1653 \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_746)
1654 \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
1655 \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
1656 \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
1657 \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
1658 \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (801_811)
1659 \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
1660 \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
1661 \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (801_811)
1662 \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (173_217)
1663 \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (173_217)
1664 \p{Egyp} \p{Egyptian_Hieroglyphs} (= \p{Script=
1665 Egyptian_Hieroglyphs}) (NOT \p{Block=
1666 Egyptian_Hieroglyphs}) (1071)
1667 \p{Egyptian_Hieroglyphs} \p{Script=Egyptian_Hieroglyphs} (Short:
1668 \p{Egyp}; NOT \p{Block=
1669 Egyptian_Hieroglyphs}) (1071)
1670 X \p{Emoticons} \p{Block=Emoticons} (80)
1671 X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
1672 Enclosed_Alphanumerics}) (160)
1673 X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
1674 \p{Block=
1675 Enclosed_Alphanumeric_Supplement}) (256)
1676 X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
1677 Enclosed_Alphanumeric_Supplement}
1678 (Short: \p{InEnclosedAlphanumSup}) (256)
1679 X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
1680 (Short: \p{InEnclosedAlphanum}) (160)
1681 X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
1682 \p{Block=
1683 Enclosed_CJK_Letters_And_Months}) (256)
1684 X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
1685 Enclosed_CJK_Letters_And_Months} (Short:
1686 \p{InEnclosedCJK}) (256)
1687 X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
1688 (= \p{Block=
1689 Enclosed_Ideographic_Supplement}) (256)
1690 X \p{Enclosed_Ideographic_Supplement} \p{Block=
1691 Enclosed_Ideographic_Supplement} (Short:
1692 \p{InEnclosedIdeographicSup}) (256)
1693 \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
1694 (Short: \p{Me}) (12)
1695 \p{Ethi} \p{Ethiopic} (= \p{Script=Ethiopic}) (NOT
1696 \p{Block=Ethiopic}) (495)
1697 \p{Ethiopic} \p{Script=Ethiopic} (Short: \p{Ethi}; NOT
1698 \p{Block=Ethiopic}) (495)
1699 X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
1700 Ethiopic_Extended}) (96)
1701 X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
1702 Ethiopic_Extended_A}) (48)
1703 X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
1704 \p{InEthiopicExt}) (96)
1705 X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
1706 \p{InEthiopicExtA}) (48)
1707 X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
1708 Ethiopic_Supplement}) (32)
1709 X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
1710 \p{InEthiopicSup}) (32)
1711 \p{Ext} \p{Extender} (= \p{Extender=Y}) (31)
1712 \p{Ext: *} \p{Extender: *}
1713 \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (31)
1714 \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_081)
1715 \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (31)
1716 \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
1717 (Short: \p{Pf}) (10)
1718 \p{Format} \p{General_Category=Format} (Short:
1719 \p{Cf}) (139)
1720 \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
1721 (Short: \p{CompEx}) (1120)
1722 \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
1723 \P{CompEx}) (1_112_992)
1724 \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
1725 \p{CompEx}) (1120)
1726 \p{Gc: *} \p{General_Category: *}
1727 \p{GCB: *} \p{Grapheme_Cluster_Break: *}
1728 \p{General_Category: C} \p{General_Category=Other} (1_004_135)
1729 \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
1730 \p{Gc=LC}, \p{LC}) (3223)
1731 \p{General_Category: Cc} \p{General_Category=Control} (65)
1732 \p{General_Category: Cf} \p{General_Category=Format} (139)
1733 \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
1734 (71)
1735 \p{General_Category: Cn} \p{General_Category=Unassigned} (864_415)
1736 \p{General_Category: Cntrl} \p{General_Category=Control} (65)
1737 \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
1738 \p{General_Category: Combining_Mark} \p{General_Category=Mark}
1739 (1645)
1740 \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
1741 \p{Pc}) (10)
1742 \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
1743 \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
1744 \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
1745 (48)
1746 \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
1747 (23)
1748 \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
1749 (460)
1750 \p{General_Category: Digit} \p{General_Category=Decimal_Number}
1751 (460)
1752 \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
1753 (12)
1754 \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
1755 (10)
1756 \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (139)
1757 \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
1758 \p{Pi}) (12)
1759 \p{General_Category: L} \p{General_Category=Letter} (101_013)
1760 X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3223)
1761 X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
1762 the trailing '_' matters in spite of
1763 loose matching rules. (3223)
1764 \p{General_Category: LC} \p{General_Category=Cased_Letter} (3223)
1765 \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (101_013)
1766 \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
1767 (224)
1768 \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
1769 \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
1770 (/i= General_Category=Cased_Letter)
1771 (1751)
1772 \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (237)
1773 \p{General_Category: Lo} \p{General_Category=Other_Letter} (97_553)
1774 \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
1775 /i= General_Category=Cased_Letter) (1751)
1776 \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
1777 (/i= General_Category=Cased_Letter) (31)
1778 \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
1779 (/i= General_Category=Cased_Letter)
1780 (1441)
1781 \p{General_Category: M} \p{General_Category=Mark} (1645)
1782 \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (1645)
1783 \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (952)
1784 \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (353)
1785 \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (12)
1786 \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
1787 (1280)
1788 \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
1789 (237)
1790 \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
1791 (115)
1792 \p{General_Category: N} \p{General_Category=Number} (1148)
1793 \p{General_Category: Nd} \p{General_Category=Decimal_Number} (460)
1794 \p{General_Category: Nl} \p{General_Category=Letter_Number} (224)
1795 \p{General_Category: No} \p{General_Category=Other_Number} (464)
1796 \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
1797 (1280)
1798 \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1148)
1799 \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
1800 (72)
1801 \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (1_004_135)
1802 \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
1803 (97_553)
1804 \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (464)
1805 \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
1806 (434)
1807 \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
1808 (4404)
1809 \p{General_Category: P} \p{General_Category=Punctuation} (632)
1810 \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
1811 \p{Zp}) (1)
1812 \p{General_Category: Pc} \p{General_Category=
1813 Connector_Punctuation} (10)
1814 \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (23)
1815 \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
1816 (71)
1817 \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
1818 (10)
1819 \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
1820 (12)
1821 \p{General_Category: Po} \p{General_Category=Other_Punctuation}
1822 (434)
1823 \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
1824 (137_468)
1825 \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (72)
1826 \p{General_Category: Punct} \p{General_Category=Punctuation} (632)
1827 \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (632)
1828 \p{General_Category: S} \p{General_Category=Symbol} (5519)
1829 \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (48)
1830 \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (20)
1831 \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (115)
1832 \p{General_Category: Sm} \p{General_Category=Math_Symbol} (952)
1833 \p{General_Category: So} \p{General_Category=Other_Symbol} (4404)
1834 \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
1835 (18)
1836 \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (353)
1837 \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048)
1838 \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (5519)
1839 \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
1840 /i= General_Category=Cased_Letter) (31)
1841 \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
1842 (864_415)
1843 \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
1844 /i= General_Category=Cased_Letter) (1441)
1845 \p{General_Category: Z} \p{General_Category=Separator} (20)
1846 \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
1847 \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
1848 (1)
1849 \p{General_Category: Zs} \p{General_Category=Space_Separator} (18)
1850 X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
1851 \p{InPunctuation}) (112)
1852 X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
1853 \p{Geor} \p{Georgian} (= \p{Script=Georgian}) (NOT
1854 \p{Block=Georgian}) (127)
1855 \p{Georgian} \p{Script=Georgian} (Short: \p{Geor}; NOT
1856 \p{Block=Georgian}) (127)
1857 X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
1858 Georgian_Supplement}) (48)
1859 X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
1860 \p{InGeorgianSup}) (48)
1861 \p{Glag} \p{Glagolitic} (= \p{Script=Glagolitic})
1862 (NOT \p{Block=Glagolitic}) (94)
1863 \p{Glagolitic} \p{Script=Glagolitic} (Short: \p{Glag};
1864 NOT \p{Block=Glagolitic}) (94)
1865 \p{Goth} \p{Gothic} (= \p{Script=Gothic}) (NOT
1866 \p{Block=Gothic}) (27)
1867 \p{Gothic} \p{Script=Gothic} (Short: \p{Goth}; NOT
1868 \p{Block=Gothic}) (27)
1869 \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
1870 (108_660)
1871 \p{Gr_Base: *} \p{Grapheme_Base: *}
1872 \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
1873 Y}) (1317)
1874 \p{Gr_Ext: *} \p{Grapheme_Extend: *}
1875 \p{Graph} Characters that are graphical (247_564)
1876 \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
1877 (108_660)
1878 \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase})
1879 (1_005_452)
1880 \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase}) (108_660)
1881 \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
1882 (6023)
1883 \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (6023)
1884 \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
1885 \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
1886 (1317)
1887 \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1317)
1888 \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
1889 \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
1890 \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
1891 \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
1892 \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_950)
1893 \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
1894 (0)
1895 \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (0)
1896 \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
1897 SpacingMark} (291)
1898 \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (291)
1899 \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
1900 \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
1901 \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
1902 (1_094_950)
1903 \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
1904 (1317)
1905 \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_795)
1906 \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1317)
1907 \p{Greek} \p{Script=Greek} (Short: \p{Grek}; NOT
1908 \p{Greek_And_Coptic}) (511)
1909 X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
1910 \p{InGreek}) (144)
1911 X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
1912 Greek_Extended}) (256)
1913 X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
1914 \p{InGreekExt}) (256)
1915 \p{Grek} \p{Greek} (= \p{Script=Greek}) (NOT
1916 \p{Greek_And_Coptic}) (511)
1917 \p{Gujarati} \p{Script=Gujarati} (Short: \p{Gujr}; NOT
1918 \p{Block=Gujarati}) (84)
1919 \p{Gujr} \p{Gujarati} (= \p{Script=Gujarati}) (NOT
1920 \p{Block=Gujarati}) (84)
1921 \p{Gurmukhi} \p{Script=Gurmukhi} (Short: \p{Guru}; NOT
1922 \p{Block=Gurmukhi}) (79)
1923 \p{Guru} \p{Gurmukhi} (= \p{Script=Gurmukhi}) (NOT
1924 \p{Block=Gurmukhi}) (79)
1925 X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
1926 \p{Block=Halfwidth_And_Fullwidth_Forms})
1927 (240)
1928 X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
1929 Combining_Half_Marks}) (16)
1930 X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
1931 Halfwidth_And_Fullwidth_Forms} (Short:
1932 \p{InHalfAndFullForms}) (240)
1933 \p{Han} \p{Script=Han} (75_963)
1934 \p{Hang} \p{Hangul} (= \p{Script=Hangul}) (NOT
1935 \p{Hangul_Syllables}) (11_739)
1936 \p{Hangul} \p{Script=Hangul} (Short: \p{Hang}; NOT
1937 \p{Hangul_Syllables}) (11_739)
1938 X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
1939 (Short: \p{InCompatJamo}) (96)
1940 X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
1941 (256)
1942 X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
1943 (Short: \p{InJamoExtA}) (32)
1944 X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
1945 (Short: \p{InJamoExtB}) (80)
1946 \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
1947 (125)
1948 \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125)
1949 \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
1950 (399)
1951 \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399)
1952 \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
1953 LVT_Syllable} (10_773)
1954 \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
1955 (10_773)
1956 \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
1957 Not_Applicable} (1_102_583)
1958 \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
1959 (1_102_583)
1960 \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
1961 (137)
1962 \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137)
1963 \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
1964 (95)
1965 \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95)
1966 X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
1967 \p{InHangul}) (11_184)
1968 \p{Hani} \p{Han} (= \p{Script=Han}) (75_963)
1969 \p{Hano} \p{Hanunoo} (= \p{Script=Hanunoo}) (NOT
1970 \p{Block=Hanunoo}) (21)
1971 \p{Hanunoo} \p{Script=Hanunoo} (Short: \p{Hano}; NOT
1972 \p{Block=Hanunoo}) (21)
1973 \p{Hebr} \p{Hebrew} (= \p{Script=Hebrew}) (NOT
1974 \p{Block=Hebrew}) (133)
1975 \p{Hebrew} \p{Script=Hebrew} (Short: \p{Hebr}; NOT
1976 \p{Block=Hebrew}) (133)
1977 \p{Hex} \p{XDigit} (= \p{Hex_Digit=Y}) (44)
1978 \p{Hex: *} \p{Hex_Digit: *}
1979 \p{Hex_Digit} \p{XDigit} (= \p{Hex_Digit=Y}) (44)
1980 \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068)
1981 \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44)
1982 X \p{High_Private_Use_Surrogates} \p{Block=
1983 High_Private_Use_Surrogates} (Short:
1984 \p{InHighPUSurrogates}) (128)
1985 X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
1986 \p{Block=High_Private_Use_Surrogates})
1987 (128)
1988 X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
1989 \p{Hira} \p{Hiragana} (= \p{Script=Hiragana}) (NOT
1990 \p{Block=Hiragana}) (91)
1991 \p{Hiragana} \p{Script=Hiragana} (Short: \p{Hira}; NOT
1992 \p{Block=Hiragana}) (91)
1993 \p{HorizSpace} \p{Blank} (19)
1994 \p{Hst: *} \p{Hangul_Syllable_Type: *}
1995 D \p{Hyphen} \p{Hyphen=Y} (11)
1996 D \p{Hyphen: N*} Supplanted by Line_Break property values;
1997 see www.unicode.org/reports/tr14
1998 (Single: \P{Hyphen}) (1_114_101)
1999 D \p{Hyphen: Y*} Supplanted by Line_Break property values;
2000 see www.unicode.org/reports/tr14
2001 (Single: \p{Hyphen}) (11)
2002 \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
2003 \p{Ideographic_Description_Characters})
2004 (103_355)
2005 \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (1_010_757)
2006 \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (103_355)
2007 \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (101_240)
2008 \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (1_012_872)
2009 \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (101_240)
2010 \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
2011 \p{Ideographic_Description_Characters})
2012 (103_355)
2013 \p{IDC: *} \p{ID_Continue: *}
2014 \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
2015 (75_633)
2016 \p{Ideo: *} \p{Ideographic: *}
2017 \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
2018 (75_633)
2019 \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_038_479)
2020 \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (75_633)
2021 X \p{Ideographic_Description_Characters} \p{Block=
2022 Ideographic_Description_Characters}
2023 (Short: \p{InIDC}) (16)
2024 \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (101_240)
2025 \p{IDS: *} \p{ID_Start: *}
2026 \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
2027 \p{IDSB}) (10)
2028 \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
2029 (1_114_102)
2030 \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10)
2031 \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
2032 \p{IDST}) (2)
2033 \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
2034 (1_114_110)
2035 \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2)
2036 \p{IDSB} \p{IDS_Binary_Operator} (=
2037 \p{IDS_Binary_Operator=Y}) (10)
2038 \p{IDSB: *} \p{IDS_Binary_Operator: *}
2039 \p{IDST} \p{IDS_Trinary_Operator} (=
2040 \p{IDS_Trinary_Operator=Y}) (2)
2041 \p{IDST: *} \p{IDS_Trinary_Operator: *}
2042 \p{Imperial_Aramaic} \p{Script=Imperial_Aramaic} (Short:
2043 \p{Armi}; NOT \p{Block=
2044 Imperial_Aramaic}) (31)
2045 \p{In: *} \p{Present_In: *} (Perl extension)
2046 \p{In_*} \p{Block: *}
2047 X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
2048 Common_Indic_Number_Forms}) (16)
2049 \p{Inherited} \p{Script=Inherited} (Short: \p{Zinh})
2050 (524)
2051 \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
2052 (Short: \p{Pi}) (12)
2053 \p{Inscriptional_Pahlavi} \p{Script=Inscriptional_Pahlavi} (Short:
2054 \p{Phli}; NOT \p{Block=
2055 Inscriptional_Pahlavi}) (27)
2056 \p{Inscriptional_Parthian} \p{Script=Inscriptional_Parthian}
2057 (Short: \p{Prti}; NOT \p{Block=
2058 Inscriptional_Parthian}) (30)
2059 X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
2060 IPA_Extensions}) (96)
2061 X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
2062 \p{InIPAExt}) (96)
2063 \p{Is_*} \p{*} (Any exceptions are individually
2064 noted beginning with the word NOT.) If
2065 an entry has flag(s) at its beginning,
2066 like "D", the "Is_" form has the same
2067 flag(s)
2068 \p{Ital} \p{Old_Italic} (= \p{Script=Old_Italic})
2069 (NOT \p{Block=Old_Italic}) (35)
2070 X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
2071 (256)
2072 X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
2073 Hangul_Jamo_Extended_A}) (32)
2074 X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
2075 Hangul_Jamo_Extended_B}) (80)
2076 \p{Java} \p{Javanese} (= \p{Script=Javanese}) (NOT
2077 \p{Block=Javanese}) (91)
2078 \p{Javanese} \p{Script=Javanese} (Short: \p{Java}; NOT
2079 \p{Block=Javanese}) (91)
2080 \p{Jg: *} \p{Joining_Group: *}
2081 \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
2082 \p{Join_C: *} \p{Join_Control: *}
2083 \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
2084 \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110)
2085 \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2)
2086 \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (7)
2087 \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1)
2088 \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10)
2089 \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (20)
2090 \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2)
2091 \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
2092 BurushaskiYehBarree}) (2)
2093 \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (14)
2094 \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4)
2095 \p{Joining_Group: E} (Short: \p{Jg=E}) (1)
2096 \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7)
2097 \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1)
2098 \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10)
2099 \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1)
2100 \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (13)
2101 \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3)
2102 \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (18)
2103 \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
2104 HamzaOnHehGoal}) (1)
2105 \p{Joining_Group: He} (Short: \p{Jg=He}) (1)
2106 \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1)
2107 \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2)
2108 \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1)
2109 \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (5)
2110 \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1)
2111 \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1)
2112 \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2)
2113 \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (7)
2114 \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1)
2115 \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4)
2116 \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1)
2117 \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
2118 (1_113_870)
2119 \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8)
2120 \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1)
2121 \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1)
2122 \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1)
2123 \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5)
2124 \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1)
2125 \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (17)
2126 \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1)
2127 \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1)
2128 \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (5)
2129 \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1)
2130 \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11)
2131 \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1)
2132 \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1)
2133 \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1)
2134 \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1)
2135 \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4)
2136 \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1)
2137 \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3)
2138 \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
2139 Hamza_On_Heh_Goal} (1)
2140 \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2)
2141 \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16)
2142 \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (10)
2143 \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2)
2144 \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1)
2145 \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1)
2146 \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1)
2147 \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1)
2148 \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1)
2149 \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (3)
2150 \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (215)
2151 \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (215)
2152 \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (3)
2153 \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (0)
2154 \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (0)
2155 \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_112_389)
2156 \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (82)
2157 \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (82)
2158 \p{Joining_Type: T} \p{Joining_Type=Transparent} (1423)
2159 \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1423)
2160 \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_112_389)
2161 \p{Jt: *} \p{Joining_Type: *}
2162 \p{Kaithi} \p{Script=Kaithi} (Short: \p{Kthi}; NOT
2163 \p{Block=Kaithi}) (66)
2164 \p{Kali} \p{Kayah_Li} (= \p{Script=Kayah_Li}) (48)
2165 \p{Kana} \p{Katakana} (= \p{Script=Katakana}) (NOT
2166 \p{Block=Katakana}) (300)
2167 X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
2168 Kana_Supplement}) (256)
2169 X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
2170 \p{InKanaSup}) (256)
2171 X \p{Kanbun} \p{Block=Kanbun} (16)
2172 X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
2173 Kangxi_Radicals}) (224)
2174 X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
2175 \p{InKangxi}) (224)
2176 \p{Kannada} \p{Script=Kannada} (Short: \p{Knda}; NOT
2177 \p{Block=Kannada}) (86)
2178 \p{Katakana} \p{Script=Katakana} (Short: \p{Kana}; NOT
2179 \p{Block=Katakana}) (300)
2180 X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
2181 \p{Block=Katakana_Phonetic_Extensions})
2182 (16)
2183 X \p{Katakana_Phonetic_Extensions} \p{Block=
2184 Katakana_Phonetic_Extensions} (Short:
2185 \p{InKatakanaExt}) (16)
2186 \p{Kayah_Li} \p{Script=Kayah_Li} (Short: \p{Kali}) (48)
2187 \p{Khar} \p{Kharoshthi} (= \p{Script=Kharoshthi})
2188 (NOT \p{Block=Kharoshthi}) (65)
2189 \p{Kharoshthi} \p{Script=Kharoshthi} (Short: \p{Khar};
2190 NOT \p{Block=Kharoshthi}) (65)
2191 \p{Khmer} \p{Script=Khmer} (Short: \p{Khmr}; NOT
2192 \p{Block=Khmer}) (146)
2193 X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
2194 \p{Khmr} \p{Khmer} (= \p{Script=Khmer}) (NOT
2195 \p{Block=Khmer}) (146)
2196 \p{Knda} \p{Kannada} (= \p{Script=Kannada}) (NOT
2197 \p{Block=Kannada}) (86)
2198 \p{Kthi} \p{Kaithi} (= \p{Script=Kaithi}) (NOT
2199 \p{Block=Kaithi}) (66)
2200 \p{L} \p{Letter} (= \p{General_Category=Letter})
2201 (101_013)
2202 X \p{L&} \p{Cased_Letter} (= \p{General_Category=
2203 Cased_Letter}) (3223)
2204 X \p{L_} \p{Cased_Letter} (= \p{General_Category=
2205 Cased_Letter}) Note the trailing '_'
2206 matters in spite of loose matching
2207 rules. (3223)
2208 \p{Lana} \p{Tai_Tham} (= \p{Script=Tai_Tham}) (NOT
2209 \p{Block=Tai_Tham}) (127)
2210 \p{Lao} \p{Script=Lao} (NOT \p{Block=Lao}) (67)
2211 \p{Laoo} \p{Lao} (= \p{Script=Lao}) (NOT \p{Block=
2212 Lao}) (67)
2213 \p{Latin} \p{Script=Latin} (Short: \p{Latn}) (1272)
2214 X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
2215 Latin_1_Supplement}) (128)
2216 X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
2217 Latin_1_Supplement}) (128)
2218 X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
2219 \p{InLatin1}) (128)
2220 X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
2221 Latin_Extended_A}) (128)
2222 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
2223 \p{Block=Latin_Extended_Additional})
2224 (256)
2225 X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
2226 Latin_Extended_B}) (208)
2227 X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
2228 Latin_Extended_C}) (32)
2229 X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
2230 Latin_Extended_D}) (224)
2231 X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
2232 \p{InLatinExtA}) (128)
2233 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
2234 (Short: \p{InLatinExtAdditional}) (256)
2235 X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
2236 \p{InLatinExtB}) (208)
2237 X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
2238 \p{InLatinExtC}) (32)
2239 X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
2240 \p{InLatinExtD}) (224)
2241 \p{Latn} \p{Latin} (= \p{Script=Latin}) (1272)
2242 \p{Lb: *} \p{Line_Break: *}
2243 \p{LC} \p{Cased_Letter} (= \p{General_Category=
2244 Cased_Letter}) (3223)
2245 \p{Lepc} \p{Lepcha} (= \p{Script=Lepcha}) (NOT
2246 \p{Block=Lepcha}) (74)
2247 \p{Lepcha} \p{Script=Lepcha} (Short: \p{Lepc}; NOT
2248 \p{Block=Lepcha}) (74)
2249 \p{Letter} \p{General_Category=Letter} (Short: \p{L})
2250 (101_013)
2251 \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
2252 \p{Nl}) (224)
2253 X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
2254 \p{Limb} \p{Limbu} (= \p{Script=Limbu}) (NOT
2255 \p{Block=Limbu}) (66)
2256 \p{Limbu} \p{Script=Limbu} (Short: \p{Limb}; NOT
2257 \p{Block=Limbu}) (66)
2258 \p{Linb} \p{Linear_B} (= \p{Script=Linear_B}) (211)
2259 \p{Line_Break: AI} \p{Line_Break=Ambiguous} (724)
2260 \p{Line_Break: AL} \p{Line_Break=Alphabetic} (16_251)
2261 \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (16_251)
2262 \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (724)
2263 \p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
2264 \p{Line_Break: BA} \p{Line_Break=Break_After} (151)
2265 \p{Line_Break: BB} \p{Line_Break=Break_Before} (19)
2266 \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
2267 \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (151)
2268 \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (19)
2269 \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3)
2270 \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1)
2271 \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1)
2272 \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
2273 \p{Line_Break: CJ} \p{Line_Break=
2274 Conditional_Japanese_Starter} (51)
2275 \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (87)
2276 \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2)
2277 \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (87)
2278 \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (1628)
2279 \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (1628)
2280 \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (665)
2281 \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
2282 (51)
2283 \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1)
2284 \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
2285 \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
2286 \p{Line_Break: EX} \p{Line_Break=Exclamation} (34)
2287 \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (34)
2288 \p{Line_Break: GL} \p{Line_Break=Glue} (18)
2289 \p{Line_Break: Glue} (Short: \p{Lb=GL}) (18)
2290 \p{Line_Break: H2} (Short: \p{Lb=H2}) (399)
2291 \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773)
2292 \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (74)
2293 \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (74)
2294 \p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
2295 \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1)
2296 \p{Line_Break: ID} \p{Line_Break=Ideographic} (161_793)
2297 \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (161_793)
2298 \p{Line_Break: IN} \p{Line_Break=Inseparable} (4)
2299 \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13)
2300 \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (4)
2301 \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (4)
2302 \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
2303 \p{Line_Break: JL} (Short: \p{Lb=JL}) (125)
2304 \p{Line_Break: JT} (Short: \p{Lb=JT}) (137)
2305 \p{Line_Break: JV} (Short: \p{Lb=JV}) (95)
2306 \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
2307 \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1)
2308 \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4)
2309 \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1)
2310 \p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
2311 \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (26)
2312 \p{Line_Break: NS} \p{Line_Break=Nonstarter} (26)
2313 \p{Line_Break: NU} \p{Line_Break=Numeric} (452)
2314 \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (452)
2315 \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (81)
2316 \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (81)
2317 \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (28)
2318 \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (28)
2319 \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (45)
2320 \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (45)
2321 \p{Line_Break: QU} \p{Line_Break=Quotation} (34)
2322 \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (34)
2323 \p{Line_Break: SA} \p{Line_Break=Complex_Context} (665)
2324 D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
2325 \p{Line_Break: SP} \p{Line_Break=Space} (1)
2326 \p{Line_Break: Space} (Short: \p{Lb=SP}) (1)
2327 D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates
2328 should never appear in well-formed text,
2329 and therefore shouldn't be the basis for
2330 line breaking (Short: \p{Lb=SG}) (2048)
2331 \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
2332 \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (918_338)
2333 \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
2334 \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2)
2335 \p{Line_Break: XX} \p{Line_Break=Unknown} (918_338)
2336 \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
2337 \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1)
2338 \p{Line_Separator} \p{General_Category=Line_Separator}
2339 (Short: \p{Zl}) (1)
2340 \p{Linear_B} \p{Script=Linear_B} (Short: \p{Linb}) (211)
2341 X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
2342 X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
2343 \p{Lisu} \p{Script=Lisu} (48)
2344 \p{Ll} \p{Lowercase_Letter} (=
2345 \p{General_Category=Lowercase_Letter})
2346 (/i= General_Category=Cased_Letter)
2347 (1751)
2348 \p{Lm} \p{Modifier_Letter} (=
2349 \p{General_Category=Modifier_Letter})
2350 (237)
2351 \p{Lo} \p{Other_Letter} (= \p{General_Category=
2352 Other_Letter}) (97_553)
2353 \p{LOE} \p{Logical_Order_Exception} (=
2354 \p{Logical_Order_Exception=Y}) (15)
2355 \p{LOE: *} \p{Logical_Order_Exception: *}
2356 \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
2357 \p{LOE}) (15)
2358 \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
2359 (1_114_097)
2360 \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (15)
2361 X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
2362 \p{Lower} \p{Lowercase=Y} (/i= Cased=Yes) (1934)
2363 \p{Lower: *} \p{Lowercase: *}
2364 \p{Lowercase} \p{Lower} (= \p{Lowercase=Y}) (/i= Cased=
2365 Yes) (1934)
2366 \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
2367 No) (1_112_178)
2368 \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
2369 Yes) (1934)
2370 \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter}
2371 (Short: \p{Ll}; /i= General_Category=
2372 Cased_Letter) (1751)
2373 \p{Lt} \p{Titlecase_Letter} (=
2374 \p{General_Category=Titlecase_Letter})
2375 (/i= General_Category=Cased_Letter) (31)
2376 \p{Lu} \p{Uppercase_Letter} (=
2377 \p{General_Category=Uppercase_Letter})
2378 (/i= General_Category=Cased_Letter)
2379 (1441)
2380 \p{Lyci} \p{Lycian} (= \p{Script=Lycian}) (NOT
2381 \p{Block=Lycian}) (29)
2382 \p{Lycian} \p{Script=Lycian} (Short: \p{Lyci}; NOT
2383 \p{Block=Lycian}) (29)
2384 \p{Lydi} \p{Lydian} (= \p{Script=Lydian}) (NOT
2385 \p{Block=Lydian}) (27)
2386 \p{Lydian} \p{Script=Lydian} (Short: \p{Lydi}; NOT
2387 \p{Block=Lydian}) (27)
2388 \p{M} \p{Mark} (= \p{General_Category=Mark})
2389 (1645)
2390 X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=
2391 Mahjong_Tiles}) (48)
2392 X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short:
2393 \p{InMahjong}) (48)
2394 \p{Malayalam} \p{Script=Malayalam} (Short: \p{Mlym}; NOT
2395 \p{Block=Malayalam}) (98)
2396 \p{Mand} \p{Mandaic} (= \p{Script=Mandaic}) (NOT
2397 \p{Block=Mandaic}) (29)
2398 \p{Mandaic} \p{Script=Mandaic} (Short: \p{Mand}; NOT
2399 \p{Block=Mandaic}) (29)
2400 \p{Mark} \p{General_Category=Mark} (Short: \p{M})
2401 (1645)
2402 \p{Math} \p{Math=Y} (2310)
2403 \p{Math: N*} (Single: \P{Math}) (1_111_802)
2404 \p{Math: Y*} (Single: \p{Math}) (2310)
2405 X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (=
2406 \p{Block=
2407 Mathematical_Alphanumeric_Symbols})
2408 (1024)
2409 X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=
2410 Mathematical_Operators}) (256)
2411 \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short:
2412 \p{Sm}) (952)
2413 X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
2414 Mathematical_Alphanumeric_Symbols}
2415 (Short: \p{InMathAlphanum}) (1024)
2416 X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
2417 (Short: \p{InMathOperators}) (256)
2418 \p{Mc} \p{Spacing_Mark} (= \p{General_Category=
2419 Spacing_Mark}) (353)
2420 \p{Me} \p{Enclosing_Mark} (= \p{General_Category=
2421 Enclosing_Mark}) (12)
2422 \p{Meetei_Mayek} \p{Script=Meetei_Mayek} (Short: \p{Mtei};
2423 NOT \p{Block=Meetei_Mayek}) (79)
2424 X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=
2425 Meetei_Mayek_Extensions}) (32)
2426 X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
2427 (Short: \p{InMeeteiMayekExt}) (32)
2428 \p{Merc} \p{Meroitic_Cursive} (= \p{Script=
2429 Meroitic_Cursive}) (NOT \p{Block=
2430 Meroitic_Cursive}) (26)
2431 \p{Mero} \p{Meroitic_Hieroglyphs} (= \p{Script=
2432 Meroitic_Hieroglyphs}) (32)
2433 \p{Meroitic_Cursive} \p{Script=Meroitic_Cursive} (Short:
2434 \p{Merc}; NOT \p{Block=
2435 Meroitic_Cursive}) (26)
2436 \p{Meroitic_Hieroglyphs} \p{Script=Meroitic_Hieroglyphs} (Short:
2437 \p{Mero}) (32)
2438 \p{Miao} \p{Script=Miao} (NOT \p{Block=Miao}) (133)
2439 X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (=
2440 \p{Block=
2441 Miscellaneous_Symbols_And_Arrows}) (256)
2442 X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
2443 (= \p{Block=
2444 Miscellaneous_Mathematical_Symbols_A})
2445 (48)
2446 X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
2447 (= \p{Block=
2448 Miscellaneous_Mathematical_Symbols_B})
2449 (128)
2450 X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs}
2451 (= \p{Block=
2452 Miscellaneous_Symbols_And_Pictographs})
2453 (768)
2454 X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=
2455 Miscellaneous_Symbols}) (256)
2456 X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=
2457 Miscellaneous_Technical}) (256)
2458 X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
2459 Miscellaneous_Mathematical_Symbols_A}
2460 (Short: \p{InMiscMathSymbolsA}) (48)
2461 X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
2462 Miscellaneous_Mathematical_Symbols_B}
2463 (Short: \p{InMiscMathSymbolsB}) (128)
2464 X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
2465 \p{InMiscSymbols}) (256)
2466 X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
2467 Miscellaneous_Symbols_And_Arrows}
2468 (Short: \p{InMiscArrows}) (256)
2469 X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
2470 Miscellaneous_Symbols_And_Pictographs}
2471 (Short: \p{InMiscPictographs}) (768)
2472 X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
2473 (Short: \p{InMiscTechnical}) (256)
2474 \p{Mlym} \p{Malayalam} (= \p{Script=Malayalam})
2475 (NOT \p{Block=Malayalam}) (98)
2476 \p{Mn} \p{Nonspacing_Mark} (=
2477 \p{General_Category=Nonspacing_Mark})
2478 (1280)
2479 \p{Modifier_Letter} \p{General_Category=Modifier_Letter}
2480 (Short: \p{Lm}) (237)
2481 X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=
2482 Spacing_Modifier_Letters}) (80)
2483 \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol}
2484 (Short: \p{Sk}) (115)
2485 X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
2486 \p{Mong} \p{Mongolian} (= \p{Script=Mongolian})
2487 (NOT \p{Block=Mongolian}) (153)
2488 \p{Mongolian} \p{Script=Mongolian} (Short: \p{Mong}; NOT
2489 \p{Block=Mongolian}) (153)
2490 \p{Mtei} \p{Meetei_Mayek} (= \p{Script=
2491 Meetei_Mayek}) (NOT \p{Block=
2492 Meetei_Mayek}) (79)
2493 X \p{Music} \p{Musical_Symbols} (= \p{Block=
2494 Musical_Symbols}) (256)
2495 X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short:
2496 \p{InMusic}) (256)
2497 \p{Myanmar} \p{Script=Myanmar} (Short: \p{Mymr}; NOT
2498 \p{Block=Myanmar}) (188)
2499 X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=
2500 Myanmar_Extended_A}) (32)
2501 X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short:
2502 \p{InMyanmarExtA}) (32)
2503 \p{Mymr} \p{Myanmar} (= \p{Script=Myanmar}) (NOT
2504 \p{Block=Myanmar}) (188)
2505 \p{N} \p{Number} (= \p{General_Category=Number})
2506 (1148)
2507 X \p{NB} \p{No_Block} (= \p{Block=No_Block})
2508 (860_672)
2509 \p{NChar} \p{Noncharacter_Code_Point} (=
2510 \p{Noncharacter_Code_Point=Y}) (66)
2511 \p{NChar: *} \p{Noncharacter_Code_Point: *}
2512 \p{Nd} \p{Digit} (= \p{General_Category=
2513 Decimal_Number}) (460)
2514 \p{New_Tai_Lue} \p{Script=New_Tai_Lue} (Short: \p{Talu};
2515 NOT \p{Block=New_Tai_Lue}) (83)
2516 \p{NFC_QC: *} \p{NFC_Quick_Check: *}
2517 \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (104)
2518 \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (104)
2519 \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT
2520 \P{NFC_Quick_Check} NOR \P{NFC_QC})
2521 (1120)
2522 \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
2523 \P{NFC_Quick_Check} NOR \P{NFC_QC})
2524 (1120)
2525 \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT
2526 \p{NFC_Quick_Check} NOR \p{NFC_QC})
2527 (1_112_888)
2528 \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
2529 \p{NFC_Quick_Check} NOR \p{NFC_QC})
2530 (1_112_888)
2531 \p{NFD_QC: *} \p{NFD_Quick_Check: *}
2532 \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT
2533 \P{NFD_Quick_Check} NOR \P{NFD_QC})
2534 (13_225)
2535 \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
2536 \P{NFD_Quick_Check} NOR \P{NFD_QC})
2537 (13_225)
2538 \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT
2539 \p{NFD_Quick_Check} NOR \p{NFD_QC})
2540 (1_100_887)
2541 \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
2542 \p{NFD_Quick_Check} NOR \p{NFD_QC})
2543 (1_100_887)
2544 \p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
2545 \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (104)
2546 \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (104)
2547 \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
2548 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
2549 (4787)
2550 \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
2551 \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
2552 (4787)
2553 \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
2554 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
2555 (1_109_221)
2556 \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
2557 \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
2558 (1_109_221)
2559 \p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
2560 \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
2561 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
2562 (16_880)
2563 \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
2564 \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
2565 (16_880)
2566 \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
2567 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
2568 (1_097_232)
2569 \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
2570 \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
2571 (1_097_232)
2572 \p{Nko} \p{Script=Nko} (NOT \p{NKo}) (59)
2573 \p{Nkoo} \p{Nko} (= \p{Script=Nko}) (NOT \p{NKo})
2574 (59)
2575 \p{Nl} \p{Letter_Number} (= \p{General_Category=
2576 Letter_Number}) (224)
2577 \p{No} \p{Other_Number} (= \p{General_Category=
2578 Other_Number}) (464)
2579 X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB})
2580 (860_672)
2581 \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
2582 \p{NChar}) (66)
2583 \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
2584 (1_114_046)
2585 \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
2586 (66)
2587 \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark}
2588 (Short: \p{Mn}) (1280)
2589 \p{Nt: *} \p{Numeric_Type: *}
2590 \p{Number} \p{General_Category=Number} (Short: \p{N})
2591 (1148)
2592 X \p{Number_Forms} \p{Block=Number_Forms} (64)
2593 \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (460)
2594 \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (460)
2595 \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
2596 \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128)
2597 \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_887)
2598 \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (637)
2599 \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (637)
2600 T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1)
2601 T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (60)
2602 T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (3)
2603 T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (1)
2604 T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1)
2605 T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (5)
2606 T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1)
2607 T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (2)
2608 T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (3)
2609 T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (1)
2610 T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (9)
2611 T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (4)
2612 T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1)
2613 T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1)
2614 T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (10)
2615 T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1)
2616 T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1)
2617 T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (5)
2618 T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (6)
2619 T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1)
2620 T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (2)
2621 T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1)
2622 T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (97)
2623 T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1)
2624 T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (100)
2625 T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1)
2626 T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (102)
2627 T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1)
2628 T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (93)
2629 T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1)
2630 T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (90)
2631 T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
2632 T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (82)
2633 T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
2634 T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (81)
2635 T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
2636 T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (77)
2637 T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
2638 T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (81)
2639 T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (40)
2640 T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (6)
2641 T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (6)
2642 T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (4)
2643 T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (4)
2644 T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (4)
2645 T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (5)
2646 T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (5)
2647 T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (5)
2648 T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (5)
2649 T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (19)
2650 T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1)
2651 T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1)
2652 T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1)
2653 T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1)
2654 T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1)
2655 T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1)
2656 T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1)
2657 T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1)
2658 T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1)
2659 T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (11)
2660 T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1)
2661 T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1)
2662 T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1)
2663 T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1)
2664 T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1)
2665 T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1)
2666 T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1)
2667 T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1)
2668 T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1)
2669 T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (10)
2670 T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1)
2671 T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1)
2672 T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1)
2673 T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1)
2674 T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1)
2675 T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1)
2676 T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1)
2677 T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1)
2678 T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1)
2679 T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (20)
2680 T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (6)
2681 T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (6)
2682 T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (6)
2683 T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (6)
2684 T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (20)
2685 T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (2)
2686 T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (3)
2687 T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (2)
2688 T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (12)
2689 T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (2)
2690 T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (2)
2691 T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (2)
2692 T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (3)
2693 T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (17)
2694 T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (1)
2695 T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (1)
2696 T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (1)
2697 T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (5)
2698 T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (1)
2699 T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (1)
2700 T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (1)
2701 T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (1)
2702 T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (7)
2703 T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (1)
2704 T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (1)
2705 T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (1)
2706 T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (4)
2707 T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (1)
2708 T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (1)
2709 T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (1)
2710 T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (1)
2711 T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (1)
2712 T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
2713 (2)
2714 T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
2715 1000000000000}) (1)
2716 \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_887)
2717 \p{Nv: *} \p{Numeric_Value: *}
2718 X \p{OCR} \p{Optical_Character_Recognition} (=
2719 \p{Block=Optical_Character_Recognition})
2720 (32)
2721 \p{Ogam} \p{Ogham} (= \p{Script=Ogham}) (NOT
2722 \p{Block=Ogham}) (29)
2723 \p{Ogham} \p{Script=Ogham} (Short: \p{Ogam}; NOT
2724 \p{Block=Ogham}) (29)
2725 \p{Ol_Chiki} \p{Script=Ol_Chiki} (Short: \p{Olck}) (48)
2726 \p{Olck} \p{Ol_Chiki} (= \p{Script=Ol_Chiki}) (48)
2727 \p{Old_Italic} \p{Script=Old_Italic} (Short: \p{Ital};
2728 NOT \p{Block=Old_Italic}) (35)
2729 \p{Old_Persian} \p{Script=Old_Persian} (Short: \p{Xpeo};
2730 NOT \p{Block=Old_Persian}) (50)
2731 \p{Old_South_Arabian} \p{Script=Old_South_Arabian} (Short:
2732 \p{Sarb}) (32)
2733 \p{Old_Turkic} \p{Script=Old_Turkic} (Short: \p{Orkh};
2734 NOT \p{Block=Old_Turkic}) (73)
2735 \p{Open_Punctuation} \p{General_Category=Open_Punctuation}
2736 (Short: \p{Ps}) (72)
2737 X \p{Optical_Character_Recognition} \p{Block=
2738 Optical_Character_Recognition} (Short:
2739 \p{InOCR}) (32)
2740 \p{Oriya} \p{Script=Oriya} (Short: \p{Orya}; NOT
2741 \p{Block=Oriya}) (90)
2742 \p{Orkh} \p{Old_Turkic} (= \p{Script=Old_Turkic})
2743 (NOT \p{Block=Old_Turkic}) (73)
2744 \p{Orya} \p{Oriya} (= \p{Script=Oriya}) (NOT
2745 \p{Block=Oriya}) (90)
2746 \p{Osma} \p{Osmanya} (= \p{Script=Osmanya}) (NOT
2747 \p{Block=Osmanya}) (40)
2748 \p{Osmanya} \p{Script=Osmanya} (Short: \p{Osma}; NOT
2749 \p{Block=Osmanya}) (40)
2750 \p{Other} \p{General_Category=Other} (Short: \p{C})
2751 (1_004_135)
2752 \p{Other_Letter} \p{General_Category=Other_Letter} (Short:
2753 \p{Lo}) (97_553)
2754 \p{Other_Number} \p{General_Category=Other_Number} (Short:
2755 \p{No}) (464)
2756 \p{Other_Punctuation} \p{General_Category=Other_Punctuation}
2757 (Short: \p{Po}) (434)
2758 \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short:
2759 \p{So}) (4404)
2760 \p{P} \p{Punct} (= \p{General_Category=
2761 Punctuation}) (NOT
2762 \p{General_Punctuation}) (632)
2763 \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
2764 (Short: \p{Zp}) (1)
2765 \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=
2766 Y}) (2760)
2767 \p{Pat_Syn: *} \p{Pattern_Syntax: *}
2768 \p{Pat_WS} \p{Pattern_White_Space} (=
2769 \p{Pattern_White_Space=Y}) (11)
2770 \p{Pat_WS: *} \p{Pattern_White_Space: *}
2771 \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
2772 (2760)
2773 \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn})
2774 (1_111_352)
2775 \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
2776 \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
2777 \p{PatWS}) (11)
2778 \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
2779 (1_114_101)
2780 \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
2781 \p{Pc} \p{Connector_Punctuation} (=
2782 \p{General_Category=
2783 Connector_Punctuation}) (10)
2784 \p{Pd} \p{Dash_Punctuation} (=
2785 \p{General_Category=Dash_Punctuation})
2786 (23)
2787 \p{Pe} \p{Close_Punctuation} (=
2788 \p{General_Category=Close_Punctuation})
2789 (71)
2790 \p{PerlSpace} \s, restricted to ASCII = [ \f\n\r\t] (5)
2791 \p{PerlWord} \w, restricted to ASCII = [A-Za-z0-9_] (63)
2792 \p{Pf} \p{Final_Punctuation} (=
2793 \p{General_Category=Final_Punctuation})
2794 (10)
2795 \p{Phag} \p{Phags_Pa} (= \p{Script=Phags_Pa}) (NOT
2796 \p{Block=Phags_Pa}) (56)
2797 \p{Phags_Pa} \p{Script=Phags_Pa} (Short: \p{Phag}; NOT
2798 \p{Block=Phags_Pa}) (56)
2799 X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=
2800 Phaistos_Disc}) (48)
2801 X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short:
2802 \p{InPhaistos}) (48)
2803 \p{Phli} \p{Inscriptional_Pahlavi} (= \p{Script=
2804 Inscriptional_Pahlavi}) (NOT \p{Block=
2805 Inscriptional_Pahlavi}) (27)
2806 \p{Phnx} \p{Phoenician} (= \p{Script=Phoenician})
2807 (NOT \p{Block=Phoenician}) (29)
2808 \p{Phoenician} \p{Script=Phoenician} (Short: \p{Phnx};
2809 NOT \p{Block=Phoenician}) (29)
2810 X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=
2811 Phonetic_Extensions}) (128)
2812 X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (=
2813 \p{Block=
2814 Phonetic_Extensions_Supplement}) (64)
2815 X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
2816 \p{InPhoneticExt}) (128)
2817 X \p{Phonetic_Extensions_Supplement} \p{Block=
2818 Phonetic_Extensions_Supplement} (Short:
2819 \p{InPhoneticExtSup}) (64)
2820 \p{Pi} \p{Initial_Punctuation} (=
2821 \p{General_Category=
2822 Initial_Punctuation}) (12)
2823 X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
2824 \p{Plrd} \p{Miao} (= \p{Script=Miao}) (NOT
2825 \p{Block=Miao}) (133)
2826 \p{Po} \p{Other_Punctuation} (=
2827 \p{General_Category=Other_Punctuation})
2828 (434)
2829 \p{PosixAlnum} [A-Za-z0-9] (62)
2830 \p{PosixAlpha} [A-Za-z] (52)
2831 \p{PosixBlank} \t and ' ' (2)
2832 \p{PosixCntrl} ASCII control characters: NUL, SOH, STX,
2833 ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT,
2834 FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4,
2835 NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS,
2836 GS, RS, US, and DEL (33)
2837 \p{PosixDigit} [0-9] (10)
2838 \p{PosixGraph} [-!"#$%&'()*+,./:;<>?@[\\]^_`{|}~0-9A-Za-
2839 z] (94)
2840 \p{PosixLower} [a-z] (/i= PosixAlpha) (26)
2841 \p{PosixPrint} [- 0-9A-Za-
2842 z!"#$%&'()*+,./:;<>?@[\\]^_`{|}~] (95)
2843 \p{PosixPunct} [-!"#$%&'()*+,./:;<>?@[\\]^_`{|}~] (32)
2844 \p{PosixSpace} \t, \n, \cK, \f, \r, and ' '. (\cK is
2845 vertical tab) (6)
2846 \p{PosixUpper} [A-Z] (/i= PosixAlpha) (26)
2847 \p{PosixWord} \p{PerlWord} (63)
2848 \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} [0-9A-Fa-f] (Short:
2849 \p{AHex}) (22)
2850 T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
2851 extension) (33_979)
2852 T \p{Present_In: 2.0} Code point's usage introduced in version
2853 2.0 or earlier (Short: \p{In=2.0}) (Perl
2854 extension) (178_500)
2855 T \p{Present_In: 2.1} Code point's usage introduced in version
2856 2.1 or earlier (Short: \p{In=2.1}) (Perl
2857 extension) (178_502)
2858 T \p{Present_In: 3.0} Code point's usage introduced in version
2859 3.0 or earlier (Short: \p{In=3.0}) (Perl
2860 extension) (188_809)
2861 T \p{Present_In: 3.1} Code point's usage introduced in version
2862 3.1 or earlier (Short: \p{In=3.1}) (Perl
2863 extension) (233_787)
2864 T \p{Present_In: 3.2} Code point's usage introduced in version
2865 3.2 or earlier (Short: \p{In=3.2}) (Perl
2866 extension) (234_803)
2867 T \p{Present_In: 4.0} Code point's usage introduced in version
2868 4.0 or earlier (Short: \p{In=4.0}) (Perl
2869 extension) (236_029)
2870 T \p{Present_In: 4.1} Code point's usage introduced in version
2871 4.1 or earlier (Short: \p{In=4.1}) (Perl
2872 extension) (237_302)
2873 T \p{Present_In: 5.0} Code point's usage introduced in version
2874 5.0 or earlier (Short: \p{In=5.0}) (Perl
2875 extension) (238_671)
2876 T \p{Present_In: 5.1} Code point's usage introduced in version
2877 5.1 or earlier (Short: \p{In=5.1}) (Perl
2878 extension) (240_295)
2879 T \p{Present_In: 5.2} Code point's usage introduced in version
2880 5.2 or earlier (Short: \p{In=5.2}) (Perl
2881 extension) (246_943)
2882 T \p{Present_In: 6.0} Code point's usage introduced in version
2883 6.0 or earlier (Short: \p{In=6.0}) (Perl
2884 extension) (249_031)
2885 T \p{Present_In: 6.1} Code point's usage introduced in version
2886 6.1 or earlier (Short: \p{In=6.1}) (Perl
2887 extension) (249_763)
2888 \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
2889 Unassigned}) (Perl extension) (864_349)
2890 \p{Print} Characters that are graphical plus space
2891 characters (but no controls) (247_582)
2892 \p{Private_Use} \p{General_Category=Private_Use} (Short:
2893 \p{Co}; NOT \p{Private_Use_Area})
2894 (137_468)
2895 X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
2896 \p{InPUA}) (6400)
2897 \p{Prti} \p{Inscriptional_Parthian} (= \p{Script=
2898 Inscriptional_Parthian}) (NOT \p{Block=
2899 Inscriptional_Parthian}) (30)
2900 \p{Ps} \p{Open_Punctuation} (=
2901 \p{General_Category=Open_Punctuation})
2902 (72)
2903 X \p{PUA} \p{Private_Use_Area} (= \p{Block=
2904 Private_Use_Area}) (6400)
2905 \p{Punct} \p{General_Category=Punctuation} (Short:
2906 \p{P}; NOT \p{General_Punctuation}) (632)
2907 \p{Punctuation} \p{Punct} (= \p{General_Category=
2908 Punctuation}) (NOT
2909 \p{General_Punctuation}) (632)
2910 \p{Qaac} \p{Coptic} (= \p{Script=Coptic}) (NOT
2911 \p{Block=Coptic}) (137)
2912 \p{Qaai} \p{Inherited} (= \p{Script=Inherited})
2913 (524)
2914 \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
2915 Y}) (29)
2916 \p{QMark: *} \p{Quotation_Mark: *}
2917 \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
2918 (29)
2919 \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_083)
2920 \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (29)
2921 \p{Radical} \p{Radical=Y} (329)
2922 \p{Radical: N*} (Single: \P{Radical}) (1_113_783)
2923 \p{Radical: Y*} (Single: \p{Radical}) (329)
2924 \p{Rejang} \p{Script=Rejang} (Short: \p{Rjng}; NOT
2925 \p{Block=Rejang}) (37)
2926 \p{Rjng} \p{Rejang} (= \p{Script=Rejang}) (NOT
2927 \p{Block=Rejang}) (37)
2928 X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
2929 Rumi_Numeral_Symbols}) (32)
2930 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
2931 \p{InRumi}) (32)
2932 \p{Runic} \p{Script=Runic} (Short: \p{Runr}; NOT
2933 \p{Block=Runic}) (78)
2934 \p{Runr} \p{Runic} (= \p{Script=Runic}) (NOT
2935 \p{Block=Runic}) (78)
2936 \p{S} \p{Symbol} (= \p{General_Category=Symbol})
2937 (5519)
2938 \p{Samaritan} \p{Script=Samaritan} (Short: \p{Samr}; NOT
2939 \p{Block=Samaritan}) (61)
2940 \p{Samr} \p{Samaritan} (= \p{Script=Samaritan})
2941 (NOT \p{Block=Samaritan}) (61)
2942 \p{Sarb} \p{Old_South_Arabian} (= \p{Script=
2943 Old_South_Arabian}) (32)
2944 \p{Saur} \p{Saurashtra} (= \p{Script=Saurashtra})
2945 (NOT \p{Block=Saurashtra}) (81)
2946 \p{Saurashtra} \p{Script=Saurashtra} (Short: \p{Saur};
2947 NOT \p{Block=Saurashtra}) (81)
2948 \p{SB: *} \p{Sentence_Break: *}
2949 \p{Sc} \p{Currency_Symbol} (=
2950 \p{General_Category=Currency_Symbol})
2951 (48)
2952 \p{Sc: *} \p{Script: *}
2953 \p{Script: Arab} \p{Script=Arabic} (1234)
2954 \p{Script: Arabic} (Short: \p{Sc=Arab}, \p{Arab}) (1234)
2955 \p{Script: Armenian} (Short: \p{Sc=Armn}, \p{Armn}) (91)
2956 \p{Script: Armi} \p{Script=Imperial_Aramaic} (31)
2957 \p{Script: Armn} \p{Script=Armenian} (91)
2958 \p{Script: Avestan} (Short: \p{Sc=Avst}, \p{Avst}) (61)
2959 \p{Script: Avst} \p{Script=Avestan} (61)
2960 \p{Script: Bali} \p{Script=Balinese} (121)
2961 \p{Script: Balinese} (Short: \p{Sc=Bali}, \p{Bali}) (121)
2962 \p{Script: Bamu} \p{Script=Bamum} (657)
2963 \p{Script: Bamum} (Short: \p{Sc=Bamu}, \p{Bamu}) (657)
2964 \p{Script: Batak} (Short: \p{Sc=Batk}, \p{Batk}) (56)
2965 \p{Script: Batk} \p{Script=Batak} (56)
2966 \p{Script: Beng} \p{Script=Bengali} (92)
2967 \p{Script: Bengali} (Short: \p{Sc=Beng}, \p{Beng}) (92)
2968 \p{Script: Bopo} \p{Script=Bopomofo} (70)
2969 \p{Script: Bopomofo} (Short: \p{Sc=Bopo}, \p{Bopo}) (70)
2970 \p{Script: Brah} \p{Script=Brahmi} (108)
2971 \p{Script: Brahmi} (Short: \p{Sc=Brah}, \p{Brah}) (108)
2972 \p{Script: Brai} \p{Script=Braille} (256)
2973 \p{Script: Braille} (Short: \p{Sc=Brai}, \p{Brai}) (256)
2974 \p{Script: Bugi} \p{Script=Buginese} (30)
2975 \p{Script: Buginese} (Short: \p{Sc=Bugi}, \p{Bugi}) (30)
2976 \p{Script: Buhd} \p{Script=Buhid} (20)
2977 \p{Script: Buhid} (Short: \p{Sc=Buhd}, \p{Buhd}) (20)
2978 \p{Script: Cakm} \p{Script=Chakma} (67)
2979 \p{Script: Canadian_Aboriginal} (Short: \p{Sc=Cans}, \p{Cans})
2980 (710)
2981 \p{Script: Cans} \p{Script=Canadian_Aboriginal} (710)
2982 \p{Script: Cari} \p{Script=Carian} (49)
2983 \p{Script: Carian} (Short: \p{Sc=Cari}, \p{Cari}) (49)
2984 \p{Script: Chakma} (Short: \p{Sc=Cakm}, \p{Cakm}) (67)
2985 \p{Script: Cham} (Short: \p{Sc=Cham}, \p{Cham}) (83)
2986 \p{Script: Cher} \p{Script=Cherokee} (85)
2987 \p{Script: Cherokee} (Short: \p{Sc=Cher}, \p{Cher}) (85)
2988 \p{Script: Common} (Short: \p{Sc=Zyyy}, \p{Zyyy}) (6412)
2989 \p{Script: Copt} \p{Script=Coptic} (137)
2990 \p{Script: Coptic} (Short: \p{Sc=Copt}, \p{Copt}) (137)
2991 \p{Script: Cprt} \p{Script=Cypriot} (55)
2992 \p{Script: Cuneiform} (Short: \p{Sc=Xsux}, \p{Xsux}) (982)
2993 \p{Script: Cypriot} (Short: \p{Sc=Cprt}, \p{Cprt}) (55)
2994 \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}, \p{Cyrl}) (417)
2995 \p{Script: Cyrl} \p{Script=Cyrillic} (417)
2996 \p{Script: Deseret} (Short: \p{Sc=Dsrt}, \p{Dsrt}) (80)
2997 \p{Script: Deva} \p{Script=Devanagari} (151)
2998 \p{Script: Devanagari} (Short: \p{Sc=Deva}, \p{Deva}) (151)
2999 \p{Script: Dsrt} \p{Script=Deseret} (80)
3000 \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (1071)
3001 \p{Script: Egyptian_Hieroglyphs} (Short: \p{Sc=Egyp}, \p{Egyp})
3002 (1071)
3003 \p{Script: Ethi} \p{Script=Ethiopic} (495)
3004 \p{Script: Ethiopic} (Short: \p{Sc=Ethi}, \p{Ethi}) (495)
3005 \p{Script: Geor} \p{Script=Georgian} (127)
3006 \p{Script: Georgian} (Short: \p{Sc=Geor}, \p{Geor}) (127)
3007 \p{Script: Glag} \p{Script=Glagolitic} (94)
3008 \p{Script: Glagolitic} (Short: \p{Sc=Glag}, \p{Glag}) (94)
3009 \p{Script: Goth} \p{Script=Gothic} (27)
3010 \p{Script: Gothic} (Short: \p{Sc=Goth}, \p{Goth}) (27)
3011 \p{Script: Greek} (Short: \p{Sc=Grek}, \p{Grek}) (511)
3012 \p{Script: Grek} \p{Script=Greek} (511)
3013 \p{Script: Gujarati} (Short: \p{Sc=Gujr}, \p{Gujr}) (84)
3014 \p{Script: Gujr} \p{Script=Gujarati} (84)
3015 \p{Script: Gurmukhi} (Short: \p{Sc=Guru}, \p{Guru}) (79)
3016 \p{Script: Guru} \p{Script=Gurmukhi} (79)
3017 \p{Script: Han} (Short: \p{Sc=Han}, \p{Han}) (75_963)
3018 \p{Script: Hang} \p{Script=Hangul} (11_739)
3019 \p{Script: Hangul} (Short: \p{Sc=Hang}, \p{Hang}) (11_739)
3020 \p{Script: Hani} \p{Script=Han} (75_963)
3021 \p{Script: Hano} \p{Script=Hanunoo} (21)
3022 \p{Script: Hanunoo} (Short: \p{Sc=Hano}, \p{Hano}) (21)
3023 \p{Script: Hebr} \p{Script=Hebrew} (133)
3024 \p{Script: Hebrew} (Short: \p{Sc=Hebr}, \p{Hebr}) (133)
3025 \p{Script: Hira} \p{Script=Hiragana} (91)
3026 \p{Script: Hiragana} (Short: \p{Sc=Hira}, \p{Hira}) (91)
3027 \p{Script: Imperial_Aramaic} (Short: \p{Sc=Armi}, \p{Armi}) (31)
3028 \p{Script: Inherited} (Short: \p{Sc=Zinh}, \p{Zinh}) (524)
3029 \p{Script: Inscriptional_Pahlavi} (Short: \p{Sc=Phli}, \p{Phli})
3030 (27)
3031 \p{Script: Inscriptional_Parthian} (Short: \p{Sc=Prti}, \p{Prti})
3032 (30)
3033 \p{Script: Ital} \p{Script=Old_Italic} (35)
3034 \p{Script: Java} \p{Script=Javanese} (91)
3035 \p{Script: Javanese} (Short: \p{Sc=Java}, \p{Java}) (91)
3036 \p{Script: Kaithi} (Short: \p{Sc=Kthi}, \p{Kthi}) (66)
3037 \p{Script: Kali} \p{Script=Kayah_Li} (48)
3038 \p{Script: Kana} \p{Script=Katakana} (300)
3039 \p{Script: Kannada} (Short: \p{Sc=Knda}, \p{Knda}) (86)
3040 \p{Script: Katakana} (Short: \p{Sc=Kana}, \p{Kana}) (300)
3041 \p{Script: Kayah_Li} (Short: \p{Sc=Kali}, \p{Kali}) (48)
3042 \p{Script: Khar} \p{Script=Kharoshthi} (65)
3043 \p{Script: Kharoshthi} (Short: \p{Sc=Khar}, \p{Khar}) (65)
3044 \p{Script: Khmer} (Short: \p{Sc=Khmr}, \p{Khmr}) (146)
3045 \p{Script: Khmr} \p{Script=Khmer} (146)
3046 \p{Script: Knda} \p{Script=Kannada} (86)
3047 \p{Script: Kthi} \p{Script=Kaithi} (66)
3048 \p{Script: Lana} \p{Script=Tai_Tham} (127)
3049 \p{Script: Lao} (Short: \p{Sc=Lao}, \p{Lao}) (67)
3050 \p{Script: Laoo} \p{Script=Lao} (67)
3051 \p{Script: Latin} (Short: \p{Sc=Latn}, \p{Latn}) (1272)
3052 \p{Script: Latn} \p{Script=Latin} (1272)
3053 \p{Script: Lepc} \p{Script=Lepcha} (74)
3054 \p{Script: Lepcha} (Short: \p{Sc=Lepc}, \p{Lepc}) (74)
3055 \p{Script: Limb} \p{Script=Limbu} (66)
3056 \p{Script: Limbu} (Short: \p{Sc=Limb}, \p{Limb}) (66)
3057 \p{Script: Linb} \p{Script=Linear_B} (211)
3058 \p{Script: Linear_B} (Short: \p{Sc=Linb}, \p{Linb}) (211)
3059 \p{Script: Lisu} (Short: \p{Sc=Lisu}, \p{Lisu}) (48)
3060 \p{Script: Lyci} \p{Script=Lycian} (29)
3061 \p{Script: Lycian} (Short: \p{Sc=Lyci}, \p{Lyci}) (29)
3062 \p{Script: Lydi} \p{Script=Lydian} (27)
3063 \p{Script: Lydian} (Short: \p{Sc=Lydi}, \p{Lydi}) (27)
3064 \p{Script: Malayalam} (Short: \p{Sc=Mlym}, \p{Mlym}) (98)
3065 \p{Script: Mand} \p{Script=Mandaic} (29)
3066 \p{Script: Mandaic} (Short: \p{Sc=Mand}, \p{Mand}) (29)
3067 \p{Script: Meetei_Mayek} (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
3068 \p{Script: Merc} \p{Script=Meroitic_Cursive} (26)
3069 \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (32)
3070 \p{Script: Meroitic_Cursive} (Short: \p{Sc=Merc}, \p{Merc}) (26)
3071 \p{Script: Meroitic_Hieroglyphs} (Short: \p{Sc=Mero}, \p{Mero})
3072 (32)
3073 \p{Script: Miao} (Short: \p{Sc=Miao}, \p{Miao}) (133)
3074 \p{Script: Mlym} \p{Script=Malayalam} (98)
3075 \p{Script: Mong} \p{Script=Mongolian} (153)
3076 \p{Script: Mongolian} (Short: \p{Sc=Mong}, \p{Mong}) (153)
3077 \p{Script: Mtei} \p{Script=Meetei_Mayek} (79)
3078 \p{Script: Myanmar} (Short: \p{Sc=Mymr}, \p{Mymr}) (188)
3079 \p{Script: Mymr} \p{Script=Myanmar} (188)
3080 \p{Script: New_Tai_Lue} (Short: \p{Sc=Talu}, \p{Talu}) (83)
3081 \p{Script: Nko} (Short: \p{Sc=Nko}, \p{Nko}) (59)
3082 \p{Script: Nkoo} \p{Script=Nko} (59)
3083 \p{Script: Ogam} \p{Script=Ogham} (29)
3084 \p{Script: Ogham} (Short: \p{Sc=Ogam}, \p{Ogam}) (29)
3085 \p{Script: Ol_Chiki} (Short: \p{Sc=Olck}, \p{Olck}) (48)
3086 \p{Script: Olck} \p{Script=Ol_Chiki} (48)
3087 \p{Script: Old_Italic} (Short: \p{Sc=Ital}, \p{Ital}) (35)
3088 \p{Script: Old_Persian} (Short: \p{Sc=Xpeo}, \p{Xpeo}) (50)
3089 \p{Script: Old_South_Arabian} (Short: \p{Sc=Sarb}, \p{Sarb}) (32)
3090 \p{Script: Old_Turkic} (Short: \p{Sc=Orkh}, \p{Orkh}) (73)
3091 \p{Script: Oriya} (Short: \p{Sc=Orya}, \p{Orya}) (90)
3092 \p{Script: Orkh} \p{Script=Old_Turkic} (73)
3093 \p{Script: Orya} \p{Script=Oriya} (90)
3094 \p{Script: Osma} \p{Script=Osmanya} (40)
3095 \p{Script: Osmanya} (Short: \p{Sc=Osma}, \p{Osma}) (40)
3096 \p{Script: Phag} \p{Script=Phags_Pa} (56)
3097 \p{Script: Phags_Pa} (Short: \p{Sc=Phag}, \p{Phag}) (56)
3098 \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (27)
3099 \p{Script: Phnx} \p{Script=Phoenician} (29)
3100 \p{Script: Phoenician} (Short: \p{Sc=Phnx}, \p{Phnx}) (29)
3101 \p{Script: Plrd} \p{Script=Miao} (133)
3102 \p{Script: Prti} \p{Script=Inscriptional_Parthian} (30)
3103 \p{Script: Qaac} \p{Script=Coptic} (137)
3104 \p{Script: Qaai} \p{Script=Inherited} (524)
3105 \p{Script: Rejang} (Short: \p{Sc=Rjng}, \p{Rjng}) (37)
3106 \p{Script: Rjng} \p{Script=Rejang} (37)
3107 \p{Script: Runic} (Short: \p{Sc=Runr}, \p{Runr}) (78)
3108 \p{Script: Runr} \p{Script=Runic} (78)
3109 \p{Script: Samaritan} (Short: \p{Sc=Samr}, \p{Samr}) (61)
3110 \p{Script: Samr} \p{Script=Samaritan} (61)
3111 \p{Script: Sarb} \p{Script=Old_South_Arabian} (32)
3112 \p{Script: Saur} \p{Script=Saurashtra} (81)
3113 \p{Script: Saurashtra} (Short: \p{Sc=Saur}, \p{Saur}) (81)
3114 \p{Script: Sharada} (Short: \p{Sc=Shrd}, \p{Shrd}) (83)
3115 \p{Script: Shavian} (Short: \p{Sc=Shaw}, \p{Shaw}) (48)
3116 \p{Script: Shaw} \p{Script=Shavian} (48)
3117 \p{Script: Shrd} \p{Script=Sharada} (83)
3118 \p{Script: Sinh} \p{Script=Sinhala} (80)
3119 \p{Script: Sinhala} (Short: \p{Sc=Sinh}, \p{Sinh}) (80)
3120 \p{Script: Sora} \p{Script=Sora_Sompeng} (35)
3121 \p{Script: Sora_Sompeng} (Short: \p{Sc=Sora}, \p{Sora}) (35)
3122 \p{Script: Sund} \p{Script=Sundanese} (72)
3123 \p{Script: Sundanese} (Short: \p{Sc=Sund}, \p{Sund}) (72)
3124 \p{Script: Sylo} \p{Script=Syloti_Nagri} (44)
3125 \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}, \p{Sylo}) (44)
3126 \p{Script: Syrc} \p{Script=Syriac} (77)
3127 \p{Script: Syriac} (Short: \p{Sc=Syrc}, \p{Syrc}) (77)
3128 \p{Script: Tagalog} (Short: \p{Sc=Tglg}, \p{Tglg}) (20)
3129 \p{Script: Tagb} \p{Script=Tagbanwa} (18)
3130 \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}, \p{Tagb}) (18)
3131 \p{Script: Tai_Le} (Short: \p{Sc=Tale}, \p{Tale}) (35)
3132 \p{Script: Tai_Tham} (Short: \p{Sc=Lana}, \p{Lana}) (127)
3133 \p{Script: Tai_Viet} (Short: \p{Sc=Tavt}, \p{Tavt}) (72)
3134 \p{Script: Takr} \p{Script=Takri} (66)
3135 \p{Script: Takri} (Short: \p{Sc=Takr}, \p{Takr}) (66)
3136 \p{Script: Tale} \p{Script=Tai_Le} (35)
3137 \p{Script: Talu} \p{Script=New_Tai_Lue} (83)
3138 \p{Script: Tamil} (Short: \p{Sc=Taml}, \p{Taml}) (72)
3139 \p{Script: Taml} \p{Script=Tamil} (72)
3140 \p{Script: Tavt} \p{Script=Tai_Viet} (72)
3141 \p{Script: Telu} \p{Script=Telugu} (93)
3142 \p{Script: Telugu} (Short: \p{Sc=Telu}, \p{Telu}) (93)
3143 \p{Script: Tfng} \p{Script=Tifinagh} (59)
3144 \p{Script: Tglg} \p{Script=Tagalog} (20)
3145 \p{Script: Thaa} \p{Script=Thaana} (50)
3146 \p{Script: Thaana} (Short: \p{Sc=Thaa}, \p{Thaa}) (50)
3147 \p{Script: Thai} (Short: \p{Sc=Thai}, \p{Thai}) (86)
3148 \p{Script: Tibetan} (Short: \p{Sc=Tibt}, \p{Tibt}) (207)
3149 \p{Script: Tibt} \p{Script=Tibetan} (207)
3150 \p{Script: Tifinagh} (Short: \p{Sc=Tfng}, \p{Tfng}) (59)
3151 \p{Script: Ugar} \p{Script=Ugaritic} (31)
3152 \p{Script: Ugaritic} (Short: \p{Sc=Ugar}, \p{Ugar}) (31)
3153 \p{Script: Unknown} (Short: \p{Sc=Zzzz}, \p{Zzzz}) (1_003_931)
3154 \p{Script: Vai} (Short: \p{Sc=Vai}, \p{Vai}) (300)
3155 \p{Script: Vaii} \p{Script=Vai} (300)
3156 \p{Script: Xpeo} \p{Script=Old_Persian} (50)
3157 \p{Script: Xsux} \p{Script=Cuneiform} (982)
3158 \p{Script: Yi} (Short: \p{Sc=Yi}, \p{Yi}) (1220)
3159 \p{Script: Yiii} \p{Script=Yi} (1220)
3160 \p{Script: Zinh} \p{Script=Inherited} (524)
3161 \p{Script: Zyyy} \p{Script=Common} (6412)
3162 \p{Script: Zzzz} \p{Script=Unknown} (1_003_931)
3163 \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1261)
3164 \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}) (1261)
3165 \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}) (92)
3166 \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
3167 (31)
3168 \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (92)
3169 \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}) (61)
3170 \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
3171 \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
3172 \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}) (121)
3173 \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
3174 \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}) (657)
3175 \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}) (56)
3176 \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
3177 \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (94)
3178 \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}) (94)
3179 \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (306)
3180 \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}) (306)
3181 \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (108)
3182 \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}) (108)
3183 \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
3184 \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}) (256)
3185 \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (30)
3186 \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}) (30)
3187 \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
3188 \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}) (22)
3189 \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (67)
3190 \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans})
3191 (710)
3192 \p{Script_Extensions: Cans} \p{Script_Extensions=
3193 Canadian_Aboriginal} (710)
3194 \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
3195 \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}) (49)
3196 \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}) (67)
3197 \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}) (83)
3198 \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (85)
3199 \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}) (85)
3200 \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}) (6059)
3201 \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (137)
3202 \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}) (137)
3203 \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
3204 \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}) (982)
3205 \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}) (112)
3206 \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}) (417)
3207 \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (417)
3208 \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}) (80)
3209 \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (163)
3210 \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}) (163)
3211 \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
3212 \p{Script_Extensions: Egyp} \p{Script_Extensions=
3213 Egyptian_Hieroglyphs} (1071)
3214 \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp})
3215 (1071)
3216 \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
3217 \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}) (495)
3218 \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (128)
3219 \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}) (128)
3220 \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (94)
3221 \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}) (94)
3222 \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
3223 \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}) (27)
3224 \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}) (511)
3225 \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (511)
3226 \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}) (94)
3227 \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (94)
3228 \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}) (91)
3229 \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (91)
3230 \p{Script_Extensions: Han} (Short: \p{Scx=Han}) (76_218)
3231 \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_971)
3232 \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}) (11_971)
3233 \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (76_218)
3234 \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
3235 \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}) (23)
3236 \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (133)
3237 \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}) (133)
3238 \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (356)
3239 \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}) (356)
3240 \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi}) (31)
3241 \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}) (506)
3242 \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli})
3243 (27)
3244 \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
3245 Prti}) (30)
3246 \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (35)
3247 \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
3248 \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}) (91)
3249 \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}) (76)
3250 \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
3251 \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (565)
3252 \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}) (86)
3253 \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}) (565)
3254 \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}) (48)
3255 \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (65)
3256 \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}) (65)
3257 \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}) (146)
3258 \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
3259 \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (86)
3260 \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (76)
3261 \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
3262 \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}) (67)
3263 \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (67)
3264 \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}) (1272)
3265 \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1272)
3266 \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
3267 \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}) (74)
3268 \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (66)
3269 \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}) (66)
3270 \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
3271 \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}) (268)
3272 \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}) (48)
3273 \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
3274 \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}) (29)
3275 \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
3276 \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}) (27)
3277 \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}) (98)
3278 \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
3279 \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}) (30)
3280 \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei}) (79)
3281 \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
3282 (26)
3283 \p{Script_Extensions: Mero} \p{Script_Extensions=
3284 Meroitic_Hieroglyphs} (32)
3285 \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc}) (26)
3286 \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero})
3287 (32)
3288 \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}) (133)
3289 \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (98)
3290 \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (156)
3291 \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}) (156)
3292 \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
3293 \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}) (188)
3294 \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (188)
3295 \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}) (83)
3296 \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}) (59)
3297 \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (59)
3298 \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
3299 \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}) (29)
3300 \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}) (48)
3301 \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
3302 \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}) (35)
3303 \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}) (50)
3304 \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb}) (32)
3305 \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}) (73)
3306 \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}) (92)
3307 \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
3308 \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (92)
3309 \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
3310 \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}) (40)
3311 \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
3312 \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}) (59)
3313 \p{Script_Extensions: Phli} \p{Script_Extensions=
3314 Inscriptional_Pahlavi} (27)
3315 \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
3316 \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}) (29)
3317 \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (133)
3318 \p{Script_Extensions: Prti} \p{Script_Extensions=
3319 Inscriptional_Parthian} (30)
3320 \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (137)
3321 \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (506)
3322 \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}) (37)
3323 \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
3324 \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}) (78)
3325 \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (78)
3326 \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}) (61)
3327 \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
3328 \p{Script_Extensions: Sarb} \p{Script_Extensions=
3329 Old_South_Arabian} (32)
3330 \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (81)
3331 \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}) (81)
3332 \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}) (83)
3333 \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}) (48)
3334 \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
3335 \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (83)
3336 \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (80)
3337 \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}) (80)
3338 \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
3339 \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora}) (35)
3340 \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
3341 \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}) (72)
3342 \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (44)
3343 \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo}) (44)
3344 \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (93)
3345 \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}) (93)
3346 \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}) (22)
3347 \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
3348 \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}) (20)
3349 \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}) (35)
3350 \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}) (127)
3351 \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}) (72)
3352 \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (78)
3353 \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}) (78)
3354 \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (35)
3355 \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
3356 \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}) (72)
3357 \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (72)
3358 \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
3359 \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (93)
3360 \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}) (93)
3361 \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
3362 \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
3363 \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (65)
3364 \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}) (65)
3365 \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}) (86)
3366 \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}) (207)
3367 \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
3368 \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}) (59)
3369 \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
3370 \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}) (31)
3371 \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}) (1_003_931)
3372 \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}) (300)
3373 \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
3374 \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
3375 \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (982)
3376 \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}) (1246)
3377 \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
3378 \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (506)
3379 \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (6059)
3380 \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown}
3381 (1_003_931)
3382 \p{Scx: *} \p{Script_Extensions: *}
3383 \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
3384 \p{SD: *} \p{Soft_Dotted: *}
3385 \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
3386 \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
3387 \p{Sentence_Break: CL} \p{Sentence_Break=Close} (177)
3388 \p{Sentence_Break: Close} (Short: \p{SB=CL}) (177)
3389 \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1)
3390 \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (1649)
3391 \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (1649)
3392 \p{Sentence_Break: FO} \p{Sentence_Break=Format} (137)
3393 \p{Sentence_Break: Format} (Short: \p{SB=FO}) (137)
3394 \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (97_841)
3395 \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1)
3396 \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (1933)
3397 \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (1933)
3398 \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (452)
3399 \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (452)
3400 \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (97_841)
3401 \p{Sentence_Break: Other} (Short: \p{SB=XX}) (1_010_273)
3402 \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
3403 \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
3404 \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
3405 \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
3406 \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (21)
3407 \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (80)
3408 \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (80)
3409 \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1514)
3410 \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1514)
3411 \p{Sentence_Break: XX} \p{Sentence_Break=Other} (1_010_273)
3412 \p{Separator} \p{General_Category=Separator} (Short:
3413 \p{Z}) (20)
3414 \p{Sharada} \p{Script=Sharada} (Short: \p{Shrd}; NOT
3415 \p{Block=Sharada}) (83)
3416 \p{Shavian} \p{Script=Shavian} (Short: \p{Shaw}) (48)
3417 \p{Shaw} \p{Shavian} (= \p{Script=Shavian}) (48)
3418 \p{Shrd} \p{Sharada} (= \p{Script=Sharada}) (NOT
3419 \p{Block=Sharada}) (83)
3420 \p{Sinh} \p{Sinhala} (= \p{Script=Sinhala}) (NOT
3421 \p{Block=Sinhala}) (80)
3422 \p{Sinhala} \p{Script=Sinhala} (Short: \p{Sinh}; NOT
3423 \p{Block=Sinhala}) (80)
3424 \p{Sk} \p{Modifier_Symbol} (=
3425 \p{General_Category=Modifier_Symbol})
3426 (115)
3427 \p{Sm} \p{Math_Symbol} (= \p{General_Category=
3428 Math_Symbol}) (952)
3429 X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
3430 \p{InSmallForms}) (32)
3431 X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
3432 Small_Form_Variants}) (32)
3433 \p{So} \p{Other_Symbol} (= \p{General_Category=
3434 Other_Symbol}) (4404)
3435 \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
3436 \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066)
3437 \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46)
3438 \p{Sora} \p{Sora_Sompeng} (= \p{Script=
3439 Sora_Sompeng}) (NOT \p{Block=
3440 Sora_Sompeng}) (35)
3441 \p{Sora_Sompeng} \p{Script=Sora_Sompeng} (Short: \p{Sora};
3442 NOT \p{Block=Sora_Sompeng}) (35)
3443 \p{Space} \p{White_Space=Y} \s including beyond
3444 ASCII plus vertical tab (26)
3445 \p{Space: *} \p{White_Space: *}
3446 \p{Space_Separator} \p{General_Category=Space_Separator}
3447 (Short: \p{Zs}) (18)
3448 \p{SpacePerl} \p{XPerlSpace} (25)
3449 \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
3450 \p{Mc}) (353)
3451 X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
3452 (Short: \p{InModifierLetters}) (80)
3453 X \p{Specials} \p{Block=Specials} (16)
3454 \p{STerm} \p{STerm=Y} (83)
3455 \p{STerm: N*} (Single: \P{STerm}) (1_114_029)
3456 \p{STerm: Y*} (Single: \p{STerm}) (83)
3457 \p{Sund} \p{Sundanese} (= \p{Script=Sundanese})
3458 (NOT \p{Block=Sundanese}) (72)
3459 \p{Sundanese} \p{Script=Sundanese} (Short: \p{Sund}; NOT
3460 \p{Block=Sundanese}) (72)
3461 X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
3462 Sundanese_Supplement}) (16)
3463 X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
3464 \p{InSundaneseSup}) (16)
3465 X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
3466 Supplemental_Arrows_A}) (16)
3467 X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
3468 Supplemental_Arrows_B}) (128)
3469 X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
3470 \p{Block=
3471 Supplemental_Mathematical_Operators})
3472 (256)
3473 X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
3474 \p{Block=
3475 Supplementary_Private_Use_Area_A})
3476 (65_536)
3477 X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
3478 \p{Block=
3479 Supplementary_Private_Use_Area_B})
3480 (65_536)
3481 X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
3482 Supplemental_Punctuation}) (128)
3483 X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
3484 \p{Block=Superscripts_And_Subscripts})
3485 (48)
3486 X \p{Superscripts_And_Subscripts} \p{Block=
3487 Superscripts_And_Subscripts} (Short:
3488 \p{InSuperAndSub}) (48)
3489 X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
3490 \p{InSupArrowsA}) (16)
3491 X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
3492 \p{InSupArrowsB}) (128)
3493 X \p{Supplemental_Mathematical_Operators} \p{Block=
3494 Supplemental_Mathematical_Operators}
3495 (Short: \p{InSupMathOperators}) (256)
3496 X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
3497 (Short: \p{InSupPunctuation}) (128)
3498 X \p{Supplementary_Private_Use_Area_A} \p{Block=
3499 Supplementary_Private_Use_Area_A}
3500 (Short: \p{InSupPUAA}) (65_536)
3501 X \p{Supplementary_Private_Use_Area_B} \p{Block=
3502 Supplementary_Private_Use_Area_B}
3503 (Short: \p{InSupPUAB}) (65_536)
3504 \p{Surrogate} \p{General_Category=Surrogate} (Short:
3505 \p{Cs}) (2048)
3506 \p{Sylo} \p{Syloti_Nagri} (= \p{Script=
3507 Syloti_Nagri}) (NOT \p{Block=
3508 Syloti_Nagri}) (44)
3509 \p{Syloti_Nagri} \p{Script=Syloti_Nagri} (Short: \p{Sylo};
3510 NOT \p{Block=Syloti_Nagri}) (44)
3511 \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
3512 (5519)
3513 \p{Syrc} \p{Syriac} (= \p{Script=Syriac}) (NOT
3514 \p{Block=Syriac}) (77)
3515 \p{Syriac} \p{Script=Syriac} (Short: \p{Syrc}; NOT
3516 \p{Block=Syriac}) (77)
3517 \p{Tagalog} \p{Script=Tagalog} (Short: \p{Tglg}; NOT
3518 \p{Block=Tagalog}) (20)
3519 \p{Tagb} \p{Tagbanwa} (= \p{Script=Tagbanwa}) (NOT
3520 \p{Block=Tagbanwa}) (18)
3521 \p{Tagbanwa} \p{Script=Tagbanwa} (Short: \p{Tagb}; NOT
3522 \p{Block=Tagbanwa}) (18)
3523 X \p{Tags} \p{Block=Tags} (128)
3524 \p{Tai_Le} \p{Script=Tai_Le} (Short: \p{Tale}; NOT
3525 \p{Block=Tai_Le}) (35)
3526 \p{Tai_Tham} \p{Script=Tai_Tham} (Short: \p{Lana}; NOT
3527 \p{Block=Tai_Tham}) (127)
3528 \p{Tai_Viet} \p{Script=Tai_Viet} (Short: \p{Tavt}; NOT
3529 \p{Block=Tai_Viet}) (72)
3530 X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
3531 Tai_Xuan_Jing_Symbols}) (96)
3532 X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
3533 \p{InTaiXuanJing}) (96)
3534 \p{Takr} \p{Takri} (= \p{Script=Takri}) (NOT
3535 \p{Block=Takri}) (66)
3536 \p{Takri} \p{Script=Takri} (Short: \p{Takr}; NOT
3537 \p{Block=Takri}) (66)
3538 \p{Tale} \p{Tai_Le} (= \p{Script=Tai_Le}) (NOT
3539 \p{Block=Tai_Le}) (35)
3540 \p{Talu} \p{New_Tai_Lue} (= \p{Script=New_Tai_Lue})
3541 (NOT \p{Block=New_Tai_Lue}) (83)
3542 \p{Tamil} \p{Script=Tamil} (Short: \p{Taml}; NOT
3543 \p{Block=Tamil}) (72)
3544 \p{Taml} \p{Tamil} (= \p{Script=Tamil}) (NOT
3545 \p{Block=Tamil}) (72)
3546 \p{Tavt} \p{Tai_Viet} (= \p{Script=Tai_Viet}) (NOT
3547 \p{Block=Tai_Viet}) (72)
3548 \p{Telu} \p{Telugu} (= \p{Script=Telugu}) (NOT
3549 \p{Block=Telugu}) (93)
3550 \p{Telugu} \p{Script=Telugu} (Short: \p{Telu}; NOT
3551 \p{Block=Telugu}) (93)
3552 \p{Term} \p{Terminal_Punctuation} (=
3553 \p{Terminal_Punctuation=Y}) (176)
3554 \p{Term: *} \p{Terminal_Punctuation: *}
3555 \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
3556 \p{Term}) (176)
3557 \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
3558 (1_113_936)
3559 \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (176)
3560 \p{Tfng} \p{Tifinagh} (= \p{Script=Tifinagh}) (NOT
3561 \p{Block=Tifinagh}) (59)
3562 \p{Tglg} \p{Tagalog} (= \p{Script=Tagalog}) (NOT
3563 \p{Block=Tagalog}) (20)
3564 \p{Thaa} \p{Thaana} (= \p{Script=Thaana}) (NOT
3565 \p{Block=Thaana}) (50)
3566 \p{Thaana} \p{Script=Thaana} (Short: \p{Thaa}; NOT
3567 \p{Block=Thaana}) (50)
3568 \p{Thai} \p{Script=Thai} (NOT \p{Block=Thai}) (86)
3569 \p{Tibetan} \p{Script=Tibetan} (Short: \p{Tibt}; NOT
3570 \p{Block=Tibetan}) (207)
3571 \p{Tibt} \p{Tibetan} (= \p{Script=Tibetan}) (NOT
3572 \p{Block=Tibetan}) (207)
3573 \p{Tifinagh} \p{Script=Tifinagh} (Short: \p{Tfng}; NOT
3574 \p{Block=Tifinagh}) (59)
3575 \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
3576 \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
3577 Cased=Yes) (31)
3578 \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
3579 (Short: \p{Lt}; /i= General_Category=
3580 Cased_Letter) (31)
3581 X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
3582 Transport_And_Map_Symbols}) (128)
3583 X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
3584 (Short: \p{InTransportAndMap}) (128)
3585 X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
3586 (= \p{Block=
3587 Unified_Canadian_Aboriginal_Syllabics})
3588 (640)
3589 X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_-
3590 Extended} (= \p{Block=
3591 Unified_Canadian_Aboriginal_Syllabics_-
3592 Extended}) (80)
3593 \p{Ugar} \p{Ugaritic} (= \p{Script=Ugaritic}) (NOT
3594 \p{Block=Ugaritic}) (31)
3595 \p{Ugaritic} \p{Script=Ugaritic} (Short: \p{Ugar}; NOT
3596 \p{Block=Ugaritic}) (31)
3597 \p{UIdeo} \p{Unified_Ideograph} (=
3598 \p{Unified_Ideograph=Y}) (74_617)
3599 \p{UIdeo: *} \p{Unified_Ideograph: *}
3600 \p{Unassigned} \p{General_Category=Unassigned} (Short:
3601 \p{Cn}) (864_415)
3602 X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
3603 Unified_Canadian_Aboriginal_Syllabics}
3604 (Short: \p{InUCAS}) (640)
3605 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
3606 Unified_Canadian_Aboriginal_Syllabics_-
3607 Extended} (Short: \p{InUCASExt}) (80)
3608 \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
3609 (74_617)
3610 \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
3611 (1_039_495)
3612 \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (74_617)
3613 \p{Unknown} \p{Script=Unknown} (Short: \p{Zzzz})
3614 (1_003_931)
3615 \p{Upper} \p{Uppercase=Y} (/i= Cased=Yes) (1483)
3616 \p{Upper: *} \p{Uppercase: *}
3617 \p{Uppercase} \p{Upper} (= \p{Uppercase=Y}) (/i= Cased=
3618 Yes) (1483)
3619 \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
3620 No) (1_112_629)
3621 \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
3622 Yes) (1483)
3623 \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
3624 (Short: \p{Lu}; /i= General_Category=
3625 Cased_Letter) (1441)
3626 \p{Vai} \p{Script=Vai} (NOT \p{Block=Vai}) (300)
3627 \p{Vaii} \p{Vai} (= \p{Script=Vai}) (NOT \p{Block=
3628 Vai}) (300)
3629 \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
3630 NOT \p{Variation_Selectors}) (259)
3631 \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853)
3632 \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
3633 X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
3634 \p{InVS}) (16)
3635 X \p{Variation_Selectors_Supplement} \p{Block=
3636 Variation_Selectors_Supplement} (Short:
3637 \p{InVSSup}) (240)
3638 X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
3639 Vedic_Extensions}) (48)
3640 X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
3641 \p{InVedicExt}) (48)
3642 X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
3643 \p{VertSpace} \v (7)
3644 \p{VS} \p{Variation_Selector} (=
3645 \p{Variation_Selector=Y}) (NOT
3646 \p{Variation_Selectors}) (259)
3647 \p{VS: *} \p{Variation_Selector: *}
3648 X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
3649 \p{Block=
3650 Variation_Selectors_Supplement}) (240)
3651 \p{WB: *} \p{Word_Break: *}
3652 \p{White_Space} \p{White_Space=Y} (Short: \p{WSpace}) (26)
3653 \p{White_Space: N*} (Short: \p{Space=N}, \P{WSpace})
3654 (1_114_086)
3655 \p{White_Space: Y*} (Short: \p{Space=Y}, \p{WSpace}) (26)
3656 \p{Word} \w, including beyond ASCII; = \p{Alnum} +
3657 \pM + \p{Pc} (103_404)
3658 \p{Word_Break: ALetter} (Short: \p{WB=LE}) (24_941)
3659 \p{Word_Break: CR} (Short: \p{WB=CR}) (1)
3660 \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (10)
3661 \p{Word_Break: Extend} (Short: \p{WB=Extend}) (1649)
3662 \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (10)
3663 \p{Word_Break: FO} \p{Word_Break=Format} (136)
3664 \p{Word_Break: Format} (Short: \p{WB=FO}) (136)
3665 \p{Word_Break: KA} \p{Word_Break=Katakana} (310)
3666 \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310)
3667 \p{Word_Break: LE} \p{Word_Break=ALetter} (24_941)
3668 \p{Word_Break: LF} (Short: \p{WB=LF}) (1)
3669 \p{Word_Break: MB} \p{Word_Break=MidNumLet} (8)
3670 \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8)
3671 \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15)
3672 \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (8)
3673 \p{Word_Break: ML} \p{Word_Break=MidLetter} (8)
3674 \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
3675 \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
3676 \p{Word_Break: NL} \p{Word_Break=Newline} (5)
3677 \p{Word_Break: NU} \p{Word_Break=Numeric} (451)
3678 \p{Word_Break: Numeric} (Short: \p{WB=NU}) (451)
3679 \p{Word_Break: Other} (Short: \p{WB=XX}) (1_086_577)
3680 \p{Word_Break: XX} \p{Word_Break=Other} (1_086_577)
3681 \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (26)
3682 \p{WSpace: *} \p{White_Space: *}
3683 \p{XDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
3684 \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC})
3685 (103_336)
3686 \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (1_010_776)
3687 \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (103_336)
3688 \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (101_217)
3689 \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (1_012_895)
3690 \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (101_217)
3691 \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y})
3692 (103_336)
3693 \p{XIDC: *} \p{XID_Continue: *}
3694 \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (101_217)
3695 \p{XIDS: *} \p{XID_Start: *}
3696 \p{Xpeo} \p{Old_Persian} (= \p{Script=Old_Persian})
3697 (NOT \p{Block=Old_Persian}) (50)
3698 \p{XPerlSpace} \s, including beyond ASCII (Short:
3699 \p{SpacePerl}) (25)
3700 \p{XPosixAlnum} \p{Alnum} (102_619)
3701 \p{XPosixAlpha} \p{Alpha} (= \p{Alphabetic=Y}) (102_159)
3702 \p{XPosixBlank} \p{Blank} (19)
3703 \p{XPosixCntrl} \p{Cntrl} (= \p{General_Category=Control})
3704 (65)
3705 \p{XPosixDigit} \p{Digit} (= \p{General_Category=
3706 Decimal_Number}) (460)
3707 \p{XPosixGraph} \p{Graph} (247_564)
3708 \p{XPosixLower} \p{Lower} (= \p{Lowercase=Y}) (/i= Cased=
3709 Yes) (1934)
3710 \p{XPosixPrint} \p{Print} (247_582)
3711 \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (641)
3712 \p{XPosixSpace} \p{Space} (= \p{White_Space=Y}) (26)
3713 \p{XPosixUpper} \p{Upper} (= \p{Uppercase=Y}) (/i= Cased=
3714 Yes) (1483)
3715 \p{XPosixWord} \p{Word} (103_404)
3716 \p{XPosixXDigit} \p{XDigit} (= \p{Hex_Digit=Y}) (44)
3717 \p{Xsux} \p{Cuneiform} (= \p{Script=Cuneiform})
3718 (NOT \p{Block=Cuneiform}) (982)
3719 \p{Yi} \p{Script=Yi} (1220)
3720 X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
3721 X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
3722 \p{Yiii} \p{Yi} (= \p{Script=Yi}) (1220)
3723 X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=
3724 Yijing_Hexagram_Symbols}) (64)
3725 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
3726 (Short: \p{InYijing}) (64)
3727 \p{Z} \p{Separator} (= \p{General_Category=
3728 Separator}) (20)
3729 \p{Zinh} \p{Inherited} (= \p{Script=Inherited})
3730 (524)
3731 \p{Zl} \p{Line_Separator} (= \p{General_Category=
3732 Line_Separator}) (1)
3733 \p{Zp} \p{Paragraph_Separator} (=
3734 \p{General_Category=
3735 Paragraph_Separator}) (1)
3736 \p{Zs} \p{Space_Separator} (=
3737 \p{General_Category=Space_Separator})
3738 (18)
3739 \p{Zyyy} \p{Common} (= \p{Script=Common}) (6412)
3740 \p{Zzzz} \p{Unknown} (= \p{Script=Unknown})
3741 (1_003_931)
3742 TX\p{_CanonDCIJ} (For internal use by Perl, not necessarily
3743 stable) (= \p{Soft_Dotted=Y}) (46)
3744 TX\p{_Case_Ignorable} (For internal use by Perl, not necessarily
3745 stable) (= \p{Case_Ignorable=Y}) (1799)
3746 TX\p{_CombAbove} (For internal use by Perl, not necessarily
3747 stable) (= \p{Canonical_Combining_Class=
3748 Above}) (349)
3749
3750 Legal "\p{}" and "\P{}" constructs that match no characters
3751 Unicode has some property-value pairs that currently don't match
3752 anything. This happens generally either because they are obsolete, or
3753 they exist for symmetry with other forms, but no language has yet been
3754 encoded that uses them. In this version of Unicode, the following
3755 match zero code points:
3756
3757 \p{Canonical_Combining_Class=Attached_Below_Left}
3758 \p{Grapheme_Cluster_Break=Prepend}
3759 \p{Joining_Type=Left_Joining}
3760
3762 All the Unicode character properties mentioned above (except for those
3763 marked as for internal use by Perl) are also accessible by
3764 "prop_invlist()" in Unicode::UCD.
3765
3766 Due to their nature, not all Unicode character properties are suitable
3767 for regular expression matches, nor "prop_invlist()". The remaining
3768 non-provisional, non-internal ones are accessible via "prop_invmap()"
3769 in Unicode::UCD (except for those that this Perl installation hasn't
3770 included; see below for which those are).
3771
3772 For compatibility with other parts of Perl, all the single forms given
3773 in the table in the section above are recognized. BUT, there are some
3774 ambiguities between some Perl extensions and the Unicode properties,
3775 all of which are silently resolved in favor of the official Unicode
3776 property. To avoid surprises, you should only use "prop_invmap()" for
3777 forms listed in the table below, which omits the non-recommended ones.
3778 The affected forms are the Perl single form equivalents of Unicode
3779 properties, such as "\p{sc}" being a single-form equivalent of
3780 "\p{gc=sc}", which is treated by "prop_invmap()" as the "Script"
3781 property, whose short name is "sc". The table indicates the current
3782 ambiguities in the INFO column, beginning with the word "NOT".
3783
3784 The standard Unicode properties listed below are documented in
3785 <http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is
3786 documented in "prop_invmap()" in Unicode::UCD. The other Perl
3787 extensions are in "Other Properties" in perlunicode;
3788
3789 The first column in the table is a name for the property; the second
3790 column is an alternative name, if any, plus possibly some annotations.
3791 The alternative name is the property's full name, unless that would
3792 simply repeat the first column, in which case the second column
3793 indicates the property's short name (if different). The annotations
3794 are given only in the entry for the full name. If a property is
3795 obsolete, etc, the entry will be flagged with the same characters used
3796 in the table in the section above, like D or S.
3797
3798 NAME INFO
3799
3800 Age
3801 AHex ASCII_Hex_Digit
3802 All Any. (Perl extension)
3803 Alnum (Perl extension). Alphabetic and
3804 (decimal) Numeric
3805 Alpha Alphabetic
3806 Alphabetic (Short: Alpha)
3807 Any (Perl extension). [\x{0000}-\x{10FFFF}]
3808 ASCII Block=ASCII. (Perl extension).
3809 [[:ASCII:]]
3810 ASCII_Hex_Digit (Short: AHex)
3811 Assigned (Perl extension). All assigned code points
3812 Bc Bidi_Class
3813 Bidi_C Bidi_Control
3814 Bidi_Class (Short: bc)
3815 Bidi_Control (Short: Bidi_C)
3816 Bidi_M Bidi_Mirrored
3817 Bidi_Mirrored (Short: Bidi_M)
3818 Bidi_Mirroring_Glyph (Short: bmg)
3819 Blank (Perl extension). \h, Horizontal white
3820 space
3821 Blk Block
3822 Block (Short: blk)
3823 Bmg Bidi_Mirroring_Glyph
3824 Canonical_Combining_Class (Short: ccc)
3825 Case_Folding (Short: cf)
3826 Case_Ignorable (Short: CI)
3827 Cased
3828 Category General_Category
3829 Ccc Canonical_Combining_Class
3830 CE Composition_Exclusion
3831 Cf Case_Folding; NOT 'cf' meaning
3832 'General_Category=Format'
3833 Changes_When_Casefolded (Short: CWCF)
3834 Changes_When_Casemapped (Short: CWCM)
3835 Changes_When_Lowercased (Short: CWL)
3836 Changes_When_NFKC_Casefolded (Short: CWKCF)
3837 Changes_When_Titlecased (Short: CWT)
3838 Changes_When_Uppercased (Short: CWU)
3839 CI Case_Ignorable
3840 Cntrl General_Category=Cntrl. (Perl extension).
3841 Control characters
3842 Comp_Ex Full_Composition_Exclusion
3843 Composition_Exclusion (Short: CE)
3844 CWCF Changes_When_Casefolded
3845 CWCM Changes_When_Casemapped
3846 CWKCF Changes_When_NFKC_Casefolded
3847 CWL Changes_When_Lowercased
3848 CWT Changes_When_Titlecased
3849 CWU Changes_When_Uppercased
3850 Dash
3851 Decomposition_Mapping (Short: dm)
3852 Decomposition_Type (Short: dt)
3853 Default_Ignorable_Code_Point (Short: DI)
3854 Dep Deprecated
3855 Deprecated (Short: Dep)
3856 DI Default_Ignorable_Code_Point
3857 Dia Diacritic
3858 Diacritic (Short: Dia)
3859 Digit General_Category=Digit. (Perl extension).
3860 [0-9] + all other decimal digits
3861 Dm Decomposition_Mapping
3862 Dt Decomposition_Type
3863 Ea East_Asian_Width
3864 East_Asian_Width (Short: ea)
3865 Ext Extender
3866 Extender (Short: Ext)
3867 Full_Composition_Exclusion (Short: Comp_Ex)
3868 Gc General_Category
3869 GCB Grapheme_Cluster_Break
3870 General_Category (Short: gc)
3871 Gr_Base Grapheme_Base
3872 Gr_Ext Grapheme_Extend
3873 Graph (Perl extension). Characters that are
3874 graphical
3875 Grapheme_Base (Short: Gr_Base)
3876 Grapheme_Cluster_Break (Short: GCB)
3877 Grapheme_Extend (Short: Gr_Ext)
3878 Hangul_Syllable_Type (Short: hst)
3879 Hex Hex_Digit
3880 Hex_Digit (Short: Hex)
3881 HorizSpace Blank. (Perl extension)
3882 Hst Hangul_Syllable_Type
3883 D Hyphen Supplanted by Line_Break property values;
3884 see www.unicode.org/reports/tr14
3885 ID_Continue (Short: IDC)
3886 ID_Start (Short: IDS)
3887 IDC ID_Continue
3888 Ideo Ideographic
3889 Ideographic (Short: Ideo)
3890 IDS ID_Start
3891 IDS_Binary_Operator (Short: IDSB)
3892 IDS_Trinary_Operator (Short: IDST)
3893 IDSB IDS_Binary_Operator
3894 IDST IDS_Trinary_Operator
3895 In Present_In. (Perl extension)
3896 Isc ISO_Comment; NOT 'isc' meaning
3897 'General_Category=Other'
3898 ISO_Comment (Short: isc)
3899 Jg Joining_Group
3900 Join_C Join_Control
3901 Join_Control (Short: Join_C)
3902 Joining_Group (Short: jg)
3903 Joining_Type (Short: jt)
3904 Jt Joining_Type
3905 Lb Line_Break
3906 Lc Lowercase_Mapping; NOT 'lc' meaning
3907 'General_Category=Cased_Letter'
3908 Line_Break (Short: lb)
3909 LOE Logical_Order_Exception
3910 Logical_Order_Exception (Short: LOE)
3911 Lower Lowercase
3912 Lowercase (Short: Lower)
3913 Lowercase_Mapping (Short: lc)
3914 Math
3915 Na Name
3916 Na1 Unicode_1_Name
3917 Name (Short: na)
3918 Name_Alias
3919 NChar Noncharacter_Code_Point
3920 NFC_QC NFC_Quick_Check
3921 NFC_Quick_Check (Short: NFC_QC)
3922 NFD_QC NFD_Quick_Check
3923 NFD_Quick_Check (Short: NFD_QC)
3924 NFKC_Casefold (Short: NFKC_CF)
3925 NFKC_CF NFKC_Casefold
3926 NFKC_QC NFKC_Quick_Check
3927 NFKC_Quick_Check (Short: NFKC_QC)
3928 NFKD_QC NFKD_Quick_Check
3929 NFKD_Quick_Check (Short: NFKD_QC)
3930 Noncharacter_Code_Point (Short: NChar)
3931 Nt Numeric_Type
3932 Numeric_Type (Short: nt)
3933 Numeric_Value (Short: nv)
3934 Nv Numeric_Value
3935 Pat_Syn Pattern_Syntax
3936 Pat_WS Pattern_White_Space
3937 Pattern_Syntax (Short: Pat_Syn)
3938 Pattern_White_Space (Short: Pat_WS)
3939 Perl_Decimal_Digit (Perl extension)
3940 PerlSpace (Perl extension). \s, restricted to ASCII
3941 = [ \f\n\r\t]
3942 PerlWord (Perl extension). \w, restricted to ASCII
3943 = [A-Za-z0-9_]
3944 PosixAlnum (Perl extension). [A-Za-z0-9]
3945 PosixAlpha (Perl extension). [A-Za-z]
3946 PosixBlank (Perl extension). \t and ' '
3947 PosixCntrl (Perl extension). ASCII control
3948 characters: NUL, SOH, STX, ETX, EOT, ENQ,
3949 ACK, BEL, BS, HT, LF, VT, FF, CR, SO, SI,
3950 DLE, DC1, DC2, DC3, DC4, NAK, SYN, ETB,
3951 CAN, EOM, SUB, ESC, FS, GS, RS, US, and DEL
3952 PosixDigit (Perl extension). [0-9]
3953 PosixGraph (Perl extension). [-
3954 !"#$%&'()*+,./:;<>?@[\\]^_`{|}~0-9A-Za-z]
3955 PosixLower (Perl extension). [a-z]
3956 PosixPrint (Perl extension). [- 0-9A-Za-
3957 z!"#$%&'()*+,./:;<>?@[\\]^_`{|}~]
3958 PosixPunct (Perl extension). [-
3959 !"#$%&'()*+,./:;<>?@[\\]^_`{|}~]
3960 PosixSpace (Perl extension). \t, \n, \cK, \f, \r,
3961 and ' '. (\cK is vertical tab)
3962 PosixUpper (Perl extension). [A-Z]
3963 PosixWord PerlWord. (Perl extension)
3964 PosixXDigit (Perl extension). [0-9A-Fa-f]
3965 Present_In (Short: In). (Perl extension)
3966 Print (Perl extension). Characters that are
3967 graphical plus space characters (but no
3968 controls)
3969 Punct General_Category=Punct. (Perl extension)
3970 QMark Quotation_Mark
3971 Quotation_Mark (Short: QMark)
3972 Radical
3973 SB Sentence_Break
3974 Sc Script; NOT 'sc' meaning
3975 'General_Category=Currency_Symbol'
3976 Scf Simple_Case_Folding
3977 Script (Short: sc)
3978 Script_Extensions (Short: scx)
3979 Scx Script_Extensions
3980 SD Soft_Dotted
3981 Sentence_Break (Short: SB)
3982 Sfc Simple_Case_Folding
3983 Simple_Case_Folding (Short: scf)
3984 Simple_Lowercase_Mapping (Short: slc)
3985 Simple_Titlecase_Mapping (Short: stc)
3986 Simple_Uppercase_Mapping (Short: suc)
3987 Slc Simple_Lowercase_Mapping
3988 Soft_Dotted (Short: SD)
3989 Space White_Space
3990 SpacePerl XPerlSpace. (Perl extension)
3991 Stc Simple_Titlecase_Mapping
3992 STerm
3993 Suc Simple_Uppercase_Mapping
3994 Tc Titlecase_Mapping
3995 Term Terminal_Punctuation
3996 Terminal_Punctuation (Short: Term)
3997 Title Titlecase. (Perl extension)
3998 Titlecase (Short: Title). (Perl extension). (=
3999 \p{Gc=Lt})
4000 Titlecase_Mapping (Short: tc)
4001 Uc Uppercase_Mapping
4002 UIdeo Unified_Ideograph
4003 Unicode_1_Name (Short: na1)
4004 Unified_Ideograph (Short: UIdeo)
4005 Upper Uppercase
4006 Uppercase (Short: Upper)
4007 Uppercase_Mapping (Short: uc)
4008 Variation_Selector (Short: VS)
4009 VertSpace (Perl extension). \v
4010 VS Variation_Selector
4011 WB Word_Break
4012 White_Space (Short: WSpace)
4013 Word (Perl extension). \w, including beyond
4014 ASCII; = \p{Alnum} + \pM + \p{Pc}
4015 Word_Break (Short: WB)
4016 WSpace White_Space
4017 XDigit (Perl extension)
4018 XID_Continue (Short: XIDC)
4019 XID_Start (Short: XIDS)
4020 XIDC XID_Continue
4021 XIDS XID_Start
4022 XPerlSpace (Perl extension). \s, including beyond
4023 ASCII
4024 XPosixAlnum Alnum. (Perl extension)
4025 XPosixAlpha Alpha. (Perl extension)
4026 XPosixBlank Blank. (Perl extension)
4027 XPosixCntrl General_Category=Cntrl. (Perl extension)
4028 XPosixDigit General_Category=Digit. (Perl extension)
4029 XPosixGraph Graph. (Perl extension)
4030 XPosixLower Lower. (Perl extension)
4031 XPosixPrint Print. (Perl extension)
4032 XPosixPunct (Perl extension). \p{Punct} + ASCII-range
4033 \p{Symbol}
4034 XPosixSpace Space. (Perl extension)
4035 XPosixUpper Upper. (Perl extension)
4036 XPosixWord Word. (Perl extension)
4037 XPosixXDigit XDigit. (Perl extension)
4038
4040 Certain properties are accessible also via core function calls. These
4041 are:
4042
4043 Lowercase_Mapping lc() and lcfirst()
4044 Titlecase_Mapping ucfirst()
4045 Uppercase_Mapping uc()
4046
4047 Also, Case_Folding is accessible through the "/i" modifier in regular
4048 expressions, the "\F" transliteration escape, and the "fc" operator.
4049
4050 And, the Name and Name_Aliases properties are accessible through the
4051 "\N{}" interpolation in double-quoted strings and regular expressions;
4052 and functions "charnames::viacode()", "charnames::vianame()", and
4053 "charnames::string_vianame()" (which require a "use charnames ();" to
4054 be specified.
4055
4056 Finally, most properties related to decomposition are accessible via
4057 Unicode::Normalize.
4058
4060 Perl will generate an error for a few character properties in Unicode
4061 when used in a regular expression. The non-Unihan ones are listed
4062 below, with the reasons they are not accepted, perhaps with work-
4063 arounds. The short names for the properties are listed enclosed in
4064 (parentheses). As described after the list, an installation can change
4065 the defaults and choose to accept any of these. The list is machine
4066 generated based on the choices made for the installation that generated
4067 this document.
4068
4069 Expands_On_NFC (XO_NFC)
4070 Expands_On_NFD (XO_NFD)
4071 Expands_On_NFKC (XO_NFKC)
4072 Expands_On_NFKD (XO_NFKD)
4073 Deprecated by Unicode. These are characters that expand to more
4074 than one character in the specified normalization form, but whether
4075 they actually take up more bytes or not depends on the encoding
4076 being used. For example, a UTF-8 encoded character may expand to a
4077 different number of bytes than a UTF-32 encoded character.
4078
4079 Grapheme_Link (Gr_Link)
4080 Deprecated by Unicode: Duplicates ccc=vr
4081 (Canonical_Combining_Class=Virama)
4082
4083 Indic_Matra_Category (InMC)
4084 Indic_Syllabic_Category (InSC)
4085 Provisional
4086
4087 Jamo_Short_Name (JSN)
4088 Other_Alphabetic (OAlpha)
4089 Other_Default_Ignorable_Code_Point (ODI)
4090 Other_Grapheme_Extend (OGr_Ext)
4091 Other_ID_Continue (OIDC)
4092 Other_ID_Start (OIDS)
4093 Other_Lowercase (OLower)
4094 Other_Math (OMath)
4095 Other_Uppercase (OUpper)
4096 Used by Unicode internally for generating other properties and not
4097 intended to be used stand-alone
4098
4099 Script=Katakana_Or_Hiragana (sc=Hrkt)
4100 Obsolete. All code points previously matched by this have been
4101 moved to "Script=Common". Consider instead using
4102 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana (or
4103 both)"
4104
4105 Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)
4106 All code points that would be matched by this are matched by either
4107 "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
4108
4109 An installation can choose to allow any of these to be matched by
4110 downloading the Unicode database from <http://www.unicode.org/Public/>
4111 to $Config{privlib}/unicore/ in the Perl source tree, changing the
4112 controlling lists contained in the program
4113 $Config{privlib}/unicore/mktables and then re-compiling and installing.
4114 (%Config is available from the Config module).
4115
4117 The Unicode data base is delivered in two different formats. The XML
4118 version is valid for more modern Unicode releases. The other version
4119 is a collection of files. The two are intended to give equivalent
4120 information. Perl uses the older form; this allows you to recompile
4121 Perl to use early Unicode releases.
4122
4123 The only non-character property that Perl currently supports is Named
4124 Sequences, in which a sequence of code points is given a name and
4125 generally treated as a single entity. (Perl supports these via the
4126 "\N{...}" double-quotish construct, "charnames::string_vianame(name)"
4127 in charnames, and "namedseq()" in Unicode::UCD.
4128
4129 Below is a list of the files in the Unicode data base that Perl doesn't
4130 currently use, along with very brief descriptions of their purposes.
4131 Some of the names of the files have been shortened from those that
4132 Unicode uses, in order to allow them to be distinguishable from
4133 similarly named files on file systems for which only the first 8
4134 characters of a name are significant.
4135
4136 auxiliary/GraphemeBreakTest.html
4137 auxiliary/LineBreakTest.html
4138 auxiliary/SentenceBreakTest.html
4139 auxiliary/WordBreakTest.html
4140 Documentation of validation tests
4141
4142 auxiliary/LBTest.txt
4143 auxiliary/SBTest.txt
4144 auxiliary/WBTest.txt
4145 BidiTest.txt
4146 NormalizationTest.txt
4147 Validation Tests
4148
4149 CJKRadicals.txt
4150 Maps the kRSUnicode property values to corresponding code points
4151
4152 EmojiSources.txt
4153 Maps certain Unicode code points to their legacy Japanese cell-
4154 phone values
4155
4156 Index.txt
4157 Alphabetical index of Unicode characters
4158
4159 IndicMatraCategory.txt
4160 IndicSyllabicCategory.txt
4161 Provisional; for the analysis and processing of Indic scripts
4162
4163 NamedSqProv.txt
4164 Named sequences proposed for inclusion in a later version of the
4165 Unicode Standard; if you need them now, you can append this file to
4166 NamedSequences.txt and recompile perl
4167
4168 NamesList.txt
4169 Annotated list of characters
4170
4171 NormalizationCorrections.txt
4172 Documentation of corrections already incorporated into the Unicode
4173 data base
4174
4175 Props.txt
4176 Only in very early releases; is a subset of PropList.txt (which is
4177 used instead)
4178
4179 ReadMe.txt
4180 Documentation
4181
4182 StandardizedVariants.txt
4183 Certain glyph variations for character display are standardized.
4184 This lists the non-Unihan ones; the Unihan ones are also not used
4185 by Perl, and are in a separate Unicode data base
4186 <http://www.unicode.org/ivd>
4187
4189 <http://www.unicode.org/reports/tr44/>
4190
4191 perlrecharclass
4192
4193 perlunicode
4194
4195
4196
4197perl v5.16.3 2019-01-21 PERLUNIPROPS(1)