gensprep − compile StringPrep data from files filtered by filterRFC3454.pl
gensprep [ −h, −?, −−help ] [ −v, −−verbose ] [ −c, −−copyright ] [ −s, −−sourcedir source ] [ −d, −−destdir destination ]
gensprep reads filtered RFC 3454 files and compiles their information into a binary form. The resulting file, <name>.icu, can then be read directly by ICU, or used by pkgdata(8) for incorporation into a larger archive or library.
The files read by gensprep are described in the FILES section.
−h, −?, −−help
Print help about usage and exit.
−v, −−verbose
Display extra informative messages during execution.
−c, −−copyright
Include a copyright notice into the binary data.
−s, −−sourcedir source
Set the source directory to source. The default source directory is specified by the environment variable ICU_DATA.
−d, −−destdir destination
Set the destination directory to destination. The default destination directory is specified by the environment variable ICU_DATA.
ICU_DATA |
Specifies the directory containing ICU data. Defaults to ${prefix}/share/icu/74.1/. Some tools in ICU depend on the presence of the trailing slash. It is thus important to make sure that it is present if ICU_DATA is set. |
The following files are read by gensprep and are looked for in the source /misc for rfc3454_*.txt files and in source /unidata for NormalizationCorrections.txt.
rfc3453_A_1.txt |
Contains the list of unassigned codepoints in Unicode version 3.2.0.... | ||
rfc3454_B_1.txt |
Contains the list of code points that are commonly mapped to nothing.... | ||
rfc3454_B_2.txt |
Contains the list of mappings for casefolding of code points when Normalization form NFKC is specified.... | ||
rfc3454_C_X.txt |
Contains the list of code points that are prohibited for IDNA. |
NormalizationCorrections.txt
Contains the list of code points whose normalization has changed since Unicode Version 3.2.0.
74.1
Copyright (C) 2000-2002 IBM, Inc. and others.
pkgdata(8)