1YAZ-ICONV(1) [FIXME: manual] YAZ-ICONV(1)
2
3
4
6 yaz-iconv - YAZ Character set conversion utility
7
9 yaz-iconv [-f from] [-t to] [-v] [file...]
10
12 yaz-iconv converts data in file in character set specified by from to
13 output in character set as specified by to.
14
15 This yaz-iconv utility similar to the iconv found on many POSIX systems
16 (Glibc, Solaris, etc).
17
18 If no file is specified, yaz-iconv reads from standard input.
19
21 -ffrom]
22 Specify the character set from of the input file. Should be used in
23 conjunction with option -t.
24
25 -tto]
26 Specify the character set of of the output. Should be used in
27 conjunction with option -f.
28
29 -v
30 Print more information about the conversion process.
31
33 The yaz-iconv command and the API as defined in yaz/yaz-iconv.h is a
34 wrapper for the library system call iconv. But YAZ´ iconv utility also
35 implements conversions on its own. The table below lists characters
36 sets (or encodings). that are supported by YAZ. Each character set is
37 marked with either encode or decode. If an encoding is encode-enabled
38 YAZ may convert to to the designated encoding. If an encoding is
39 decode-enabled, YAZ may convert from the designated encoding.
40
41 marc8 (encode, decode)
42 The MARC8[1] encoding as defined by the Library of Congress. Most
43 MARC21/USMARC records usees this encoding.
44
45 marc8s (encode, decode)
46 Like MARC8 but with conversion prefers non-combined characters in
47 the Latin-1 plane over combined characters.
48
49 marc8lossy (encode)
50 Lossy encoding of MARC-8.
51
52 marc8lossless (encode)
53 Lossless encoding of MARC8.
54
55 utf8 (encode, decode)
56 The most commonly used UNICODE encoding on the Internet.
57
58 iso8859-1 (encode, decode)
59 ISO-8859-1, AKA Latin-1.
60
61 iso5426 (decode)
62 ISO 5426. Some MARC records (UNIMARC) uses this encoding.
63
64 iso5428:1984 (encode, decode)
65 ISO 5428:1984.
66
67 advancegreek (encode, decode)
68 An encoding for Greek used by some vendors (Advance).
69
70 danmarc (decode)
71
72 Danmarc (in danish)[2] is an encoding based on UNICODE which is
73 used for DanMARC2 records.
74
76 The following command converts from ISO-8859-1 (Latin-1) to UTF-8.
77
78 yaz-iconv -f ISO-8859-1 -t UTF-8 -X <input.lst >output.lst
79
80
81
83 prefix/bin/yaz-iconv
84
85 prefix/include/yaz/yaz-iconv.h
86
88 yaz(7) iconv(1)
89
91 1. MARC8
92 http://www.loc.gov/marc/specifications/speccharmarc8.html
93
94 2. Danmarc (in danish)
95 http://www.kat-format.dk/danMARC2/Danmarc2.4.htm#felt+Indl.+4
96
97
98
99YAZ 4.0.2 03/09/2010 YAZ-ICONV(1)