gensprep(8)                     ICU 71.1 Manual                     gensprep(8)

NAME
gensprep - compile StringPrep data from files filtered by filterRFC3454.pl

SYNOPSIS
gensprep [ -h, -?, --help ] [ -v, --verbose ] [ -c, --copyright ]
         [ -s, --sourcedir source ] [ -d, --destdir destination ]

DESCRIPTION
gensprep reads filtered RFC 3454 files and compiles their information
into a binary form. The resulting file, <name>.icu, can then be read
directly by ICU, or used by pkgdata(8) for incorporation into a larger
archive or library.

The files read by gensprep are described in the FILES section.

OPTIONS
-h, -?, --help
       Print help about usage and exit.

-v, --verbose
       Display extra informative messages during execution.

-c, --copyright
       Include a copyright notice in the binary data.

-s, --sourcedir source
       Set the source directory to source. The default source
       directory is specified by the environment variable ICU_DATA.

-d, --destdir destination
       Set the destination directory to destination. The default
       destination directory is specified by the environment variable
       ICU_DATA.

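As a rough illustration of how the options above combine, the sketch below
assembles an argument list for gensprep from Python. The helper name
build_gensprep_args is hypothetical, and only the flags listed in this page
are used. Input file arguments are not covered by the synopsis here, so the
sketch leaves them out and does not run the tool.

```python
from typing import List, Optional


def build_gensprep_args(sourcedir: Optional[str] = None,
                        destdir: Optional[str] = None,
                        verbose: bool = False,
                        copyright_notice: bool = False) -> List[str]:
    """Assemble an argument vector from the options documented above.

    Hypothetical helper: it only builds the list and never runs gensprep.
    """
    args = ["gensprep"]
    if verbose:
        args.append("--verbose")
    if copyright_notice:
        args.append("--copyright")
    # When a directory is omitted, gensprep falls back to ICU_DATA.
    if sourcedir is not None:
        args += ["--sourcedir", sourcedir]
    if destdir is not None:
        args += ["--destdir", destdir]
    return args
```

The resulting list could be handed to subprocess.run on a system where
gensprep is installed.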
ENVIRONMENT
ICU_DATA  Specifies the directory containing ICU data. Defaults to
          /usr/share/icu/71.1/. Some tools in ICU depend on the
          presence of the trailing slash, so make sure it is present
          whenever ICU_DATA is set.

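Because a missing trailing slash can matter, a wrapper script might normalize
ICU_DATA before calling any ICU tool. The following is a minimal sketch under
stated assumptions: the function name icu_data_dir is hypothetical, and
treating an empty ICU_DATA like an unset one is an assumption of this example,
not documented behavior.

```python
import os

# Default taken from the ENVIRONMENT section of this page.
DEFAULT_ICU_DATA = "/usr/share/icu/71.1/"


def icu_data_dir(environ=None):
    """Return the ICU data directory, always ending in a slash."""
    env = os.environ if environ is None else environ
    # Assumption: an empty value is handled the same as an unset variable.
    path = env.get("ICU_DATA") or DEFAULT_ICU_DATA
    if not path.endswith("/"):
        path += "/"
    return path
```

Passing a plain dictionary instead of os.environ makes the helper easy to
check without touching the real environment.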
FILES
gensprep reads the following files. It looks for the rfc3454_*.txt
files in source/misc and for NormalizationCorrections.txt in
source/unidata.

rfc3454_A_1.txt  Contains the list of unassigned code points in
                 Unicode version 3.2.0....

rfc3454_B_1.txt  Contains the list of code points that are commonly
                 mapped to nothing....

rfc3454_B_2.txt  Contains the list of mappings for casefolding of
                 code points when Normalization form NFKC is
                 specified....

rfc3454_C_X.txt  Contains the list of code points that are
                 prohibited for IDNA.

NormalizationCorrections.txt
                 Contains the list of code points whose
                 normalization has changed since Unicode Version
                 3.2.0.

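Before running gensprep it can help to confirm that the source directory has
the layout described above. The sketch below is one possible check. The
function name find_gensprep_inputs is hypothetical, and it inspects only the
two locations named in this section.

```python
import glob
import os


def find_gensprep_inputs(source):
    """Locate the input files under the given source directory.

    Returns the sorted rfc3454_*.txt paths found in source/misc and the
    path to NormalizationCorrections.txt in source/unidata, or None for
    the latter if it is missing.
    """
    rfc_files = sorted(glob.glob(os.path.join(source, "misc", "rfc3454_*.txt")))
    corrections = os.path.join(source, "unidata", "NormalizationCorrections.txt")
    return {
        "rfc3454": rfc_files,
        "corrections": corrections if os.path.isfile(corrections) else None,
    }
```

An empty "rfc3454" list or a None value for "corrections" suggests that the
source directory passed with -s is not the one gensprep expects.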
VERSION
71.1

COPYRIGHT
Copyright (C) 2000-2002 IBM, Inc. and others.

SEE ALSO
pkgdata(8)

ICU MANPAGE                      18 March 2003                      gensprep(8)