User Commands YAZ-ICONV(1) NAME yaz-iconv - YAZ Character set conversion utility SYNOPSIS yaz-iconv [-f _f_r_o_m] [-t _t_o] [-v] [file...] DESCRIPTION yaz-iconv converts data in file in character set specified by _f_r_o_m to output in character set as specified by _t_o. This yaz-iconv utility similar to the iconv found on many POSIX systems (Glibc, Solaris, etc). If no _f_i_l_e is specified, yaz-iconv reads from standard input. OPTIONS -f_f_r_o_m] Specify the character set _f_r_o_m of the input file. Should be used in conjunction with option -t. -t_t_o] Specify the character set _o_f of the output. Should be used in conjunction with option -f. -v Print more information about the conversion process. ENCODINGS The yaz-iconv command and the API as defined in _y_a_z/_y_a_z-_i_c_o_n_v._h is a wrapper for the library system call iconv. But YAZ' iconv utility also implements conversions on its own. The table below lists characters sets (or encodings). that are supported by YAZ. Each character set is marked with either _e_n_c_o_d_e or _d_e_c_o_d_e. If an encoding is encode-enabled YAZ may convert _t_o to the designated encoding. If an encoding is decode-enabled, YAZ may convert _f_r_o_m the designated encoding. marc8 (encode, decode) The _M_A_R_C_8[1] encoding as defined by the Library of Congress. Most MARC21/USMARC records usees this encoding. marc8s (encode, decode) Like MARC8 but with conversion prefers non-combined characters in the Latin-1 plane over combined characters. marc8lossy (encode) Lossy encoding of MARC-8. YAZ 3.0.46 Last change: 06/08/2009 1 User Commands YAZ-ICONV(1) marc8lossless (encode) Lossless encoding of MARC8. utf8 (encode, decode) The most commonly used UNICODE encoding on the Internet. iso8859-1 (encode, decode) ISO-8859-1, AKA Latin-1. iso5426 (decode) ISO 5426. Some MARC records (UNIMARC) uses this encoding. iso5428:1984 (encode, decode) ISO 5428:1984. advancegreek (encode, decode) An encoding for Greek used by some vendors (Advance). danmarc (decode) _D_a_n_m_a_r_c (_i_n _d_a_n_i_s_h)[2] is an encoding based on UNICODE which is used for DanMARC2 records. EXAMPLES The following command converts from ISO-8859-1 (Latin-1) to UTF-8. yaz-iconv -f ISO-8859-1 -t UTF-8 -X output.lst FILES _p_r_e_f_i_x/_b_i_n/_y_a_z-_i_c_o_n_v _p_r_e_f_i_x/_i_n_c_l_u_d_e/_y_a_z/_y_a_z-_i_c_o_n_v._h SEE ALSO yaz(7) iconv(1) NOTES 1. MARC8 http://www.loc.gov/marc/specifications/speccharmarc8.html 2. Danmarc (in danish) http://www.kat-format.dk/danMARC2/Danmarc2.4.htm#felt+Indl.+4 ATTRIBUTES See attributes(5) for descriptions of the following attributes: YAZ 3.0.46 Last change: 06/08/2009 2 User Commands YAZ-ICONV(1) _______________________________________ | ATTRIBUTE TYPE | ATTRIBUTE VALUE| |_________________________________________|____________________________________|_ | Availability | SUNWlibyaz | |_________________________________________|____________________________________|_ | Interface Stability| Uncommitted | |_____________________|__________________| NOTES Source for YAZ is available on http://opensolaris.org. YAZ 3.0.46 Last change: 06/08/2009 3