标准编号:ISO/IEC 14651:2016
中文名称:信息技术 国际串排序和比较 字符串比较与公共模板可裁剪排序描述的方法
英文名称:Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering
发布日期:2016-02
标准范围
This International Standard defines the following.— A reference comparison method. This method is applicable to two character strings to determinetheir collating order in a sorted list. The method can be applied to strings containing characters fromthe full repertoire of ISO/IEC 10646. This method is also applicable to subsets of that repertoire,such as those of the different ISO/IEC 8-bit standard character sets, or any other character set,standardized or not, to produce ordering results valid (after tailoring) for a given set of languagesfor each script. This method uses collation tables derived either from the Common Template Tabledefined in this International Standard or from one of its tailorings. This method provides a referenceformat. The format is described using the Backus-Naur Form (BNF). This format is used to describethe Common Template Table. The format is used normatively within this International Standard.— A Common Template Table. A given tailoring of the Common Template Table is used by the referencecomparison method. The Common Template Table describes an order for all characters encodedin the Unicode 8.0 standard, included in ISO/IEC 10646:2014 and its Amendment 1. It allows fora specification of a fully deterministic ordering. This table enables the specification of a stringordering adapted to local ordering rules, without requiring an implementer to have knowledge ofall the different scripts already encoded in the Universal Coded Character Set (UCS).NOTE 1 This Common Template Table is to be modified to suit the needs of a local environment. The mainworldwide benefit is that, for other scripts, often no modification is required and the order will remain asconsistent as possible and predictable from an international point of view.NOTE 2 The character repertoire used in this International Standard is equivalent to that of the UnicodeStandard version 6.0.— A reference name. The reference name refers to this particular version of the Common TemplateTable, for use as a reference when tailoring. In particular, this name implies that the table is linkedto a particular stage of development of the ISO/IEC 10646 Universal coded character set.— Requirements for a declaration of the differences (delta) between the collation table and the CommonTemplate Table.This International Standard does not mandate the following.— A specific comparison method; any equivalent method giving the same results is acceptable.— A specific format for describing or tailoring tables in a given implementation.— Specific symbols to be used by implementations, except for the name of the Common Template Table.— Any specific user interface for choosing options.— Any specific internal format for intermediate keys used when comparing, nor for the table used. Theuse of numeric keys is not mandated either.— A context-dependent ordering.— Any particular preparation of character strings prior to comparison.
标准预览图

