1UNICODE_CATEGORY_LOO(3) Courier Unicode Library UNICODE_CATEGORY_LOO(3)
2
3
4
6 unicode_category_lookup, unicode_isalnum, unicode_isalpha,
7 unicode_isblank, unicode_isdigit, unicode_isgraph, unicode_islower,
8 unicode_ispunct, unicode_isspace, unicode_isupper - unicode character
9 categorization
10
12 #include <courier-unicode.h>
13
14 uint32_t unicode_category_lookup(char32_t c);
15
16 int unicode_isalnum(char32_t c);
17
18 int unicode_isalpha(char32_t c);
19
20 int unicode_isblank(char32_t c);
21
22 int unicode_isdigit(char32_t c);
23
24 int unicode_isgraph(char32_t c);
25
26 int unicode_islower(char32_t c);
27
28 int unicode_ispunct(char32_t c);
29
30 int unicode_isspace(char32_t c);
31
32 int unicode_isupper(char32_t c);
33
35 unicode_category_lookup() looks up the unicode character's
36 categorization[1]. unicode_category_lookup() returns a 32 bit value.
37 The value's UNICODE_CATEGORY_1 bits specify the first level of the
38 unicode character's category, with UNICODE_CATEGORY_2,
39 UNICODE_CATEGORY_3, and UNICODE_CATEGORY_4 bits specifying the 2nd,
40 3rd, and 4th level, if given. A value of 0 for each corresponding bit
41 set indicates that no category is specified for this level, for this
42 character; otherwise the possible values are defined in
43 <courier-unicode.h>.
44
45 The remaining functions implement comparable equivalents of their
46 non-unicode versions in the standard C library, as follows:
47
48 unicode_isalnum()
49 Returns non-0 for all unicode_isalpha() or unicode_isdigit().
50
51 unicode_isalpha()
52 Returns non-0 for all UNICODE_CATEGORY_1_LETTER.
53
54 unicode_isblank()
55 Return non-0 for TAB, and all UNICODE_CATEGORY_2_SPACE.
56
57 unicode_isdigit()
58 Returns non-0 for all UNICODE_CATEGORY_1_NUMBER |
59 UNICODE_CATEGORY_2_DIGIT, only (no third categories).
60
61 unicode_isgraph()
62 Returns non-0 for all codepoints above SPACE which are not
63 unicode_isspace().
64
65 unicode_islower()
66 Returns non-0 for all unicode_isalpha() for which the character is
67 equal to unicode_lc(3) of itself.
68
69 unicode_ispunct()
70 Returns non-0 for all UNICODE_CATEGORY_1_PUNCTUATION.
71
72 unicode_isspace()
73 Returns non-0 for unicode_isblank() or for unicode characters with
74 linebreaking properties of BK, CR, LF, NL, and SP.
75
76 unicode_isupper()
77 Returns non-0 for all unicode_isalpha() for which the character is
78 equal to unicode_uc(3) of itself.
79
81 courier-unicode(7), unicode_convert_tocase(3).
82
84 Sam Varshavchik
85 Author
86
88 1. unicode character's categorization
89 http://unicode.org/notes/tn36/
90
91
92
93Courier Unicode Library 03/11/2017 UNICODE_CATEGORY_LOO(3)