Release Notes for HP-UX 11.0 Extension Pack, December 1998: HP 9000 Computers > Chapter 3 Functionality Changes Included in the Integrated Patch Bundle

Corrected Character Mappings to iconv(1) and iconv(3C)


The 11.0 Extension Pack fixes defects in the iconv character mappings for Simplified Chinese, Traditional Chinese, Japanese, and Korean on HP-UX.

The corrected converter mappings improve interoperability when exchanging converted character data with Unicode-aware systems.

Correction for Simplified Chinese

A patch corrects an incorrect character mapping that occurs when converting between hp15CN and Unicode (UCS2)/UTF-8 for Simplified Chinese.

Specifically, the Simplified Chinese character "Double Vertical Line" was mapped incorrectly when converting between hp15CN and UCS2/UTF-8. It was mapped to "Parallel To", which is a different character.

The following table summarizes the change applied to iconv tables:

hp15CN    Incorrect UCS2    Correct UCS2    Character Name
0xA1CE    -                 0x2225          Parallel To
0xA1AC    0x2225            0x2016          Double Vertical Line

The hp15CN=ucs2 and ucs2=hp15CN iconv converter tables are affected. These tables are shared by both UCS2 and UTF-8 conversions.

No compatibility problems are anticipated. However, if persistent data stored in Unicode (UCS2) or UTF-8 on an HP-UX system raises compatibility concerns, a simple conversion script can search for each occurrence of an incorrect UCS2 or UTF-8 value and replace it with the correct value, based on the following mapping:

Old UCS2    Correct UCS2    Old UTF-8    Correct UTF-8    Character Name
0x2225      0x2016          0xe288a5     0xe28096         Double Vertical Line
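
The conversion script mentioned above could look roughly like the following Python sketch for UTF-8 data. It is an illustration rather than an HP-supplied tool: the function name `fix_hp15cn_utf8` is hypothetical, and the sketch assumes the data was produced only by the old hp15CN tables. A genuine "Parallel To" character that entered the data from another source would be rewritten as well.

```python
# Byte sequences taken from the mapping table above.
OLD_UTF8 = bytes.fromhex("e288a5")  # U+2225, the value the old table produced
NEW_UTF8 = bytes.fromhex("e28096")  # U+2016, Double Vertical Line


def fix_hp15cn_utf8(data: bytes) -> bytes:
    """Replace every old UTF-8 sequence with the corrected one.

    Assumption: `data` is valid UTF-8 that came from the old hp15CN
    converter tables, so every U+2225 in it is a mis-mapped character.
    """
    return data.replace(OLD_UTF8, NEW_UTF8)
```

A byte-level replacement is sufficient here because, in valid UTF-8, the lead byte 0xe2 can only appear at the start of a character, so the three-byte pattern cannot match across a character boundary.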

Correction for Traditional Chinese

A patch corrects several incorrect character mappings that occur when converting between Big-5/EUC and Unicode (UCS2)/UTF-8 for Traditional Chinese.

For conversions between Big-5 and UCS2/UTF-8, the "Ideographic Space" character was missing from the Unicode conversion table:

big5      Incorrect UCS2    Correct UCS2    Character Name
0xA140    -                 0x3000          Ideographic Space

The following table summarizes the changes applied for conversions between eucTW and UCS2:

eucTW     Incorrect UCS2    Correct UCS2    Character Name
0xa1a6    0x30fb            0x2022          Bullet
0xa1b7    0x2014            0x2013          EN Dash
0xa1b9    0x2013            0x2014          EM Dash
0xa1b6    0xfe31            0xff5c          Fullwidth Vertical Line
0xa1b8    0xfe32            0xfe31          Presentation form Vertical EN Dash
0xa1ea    0x2032            0x2035          Reversed Prime
0xa1eb    0x2035            0x2032          Prime
0xa2b9    0x2264            0x2266          Less-than over equal to
0xa2ba    0x2265            0x2267          Greater-than over equal to
0xa2c2    0xfe66            0xfe65          Small Greater-Than
0xa2c3    0xfe65            0xfe66          Small Equals Sign
0xa2de    0xff5c            0x2223          Divides
0xa2e1    0xfe67            0xff0f          Full-width Solidus
0xa2e4    0xffe5            0x00a5          Yen Sign
0xa2e6    0xffe0            0x00a2          Cent Sign
0xa2e7    0xffe1            0x00a3          Pound Sign

iconv conversions between eucTW and UCS2 or UTF-8 may be affected.

Big-5 conversions with UCS2/UTF-8 are not directly impacted as only a missing table entry has been added.

The eucTW=ucs2, ucs2=eucTW, big5=ucs2 and ucs2=big5 iconv converter tables are affected. These tables are shared by both UCS2 and UTF-8 conversions.

No compatibility problems are anticipated. However, if persistent data stored in Unicode (UCS2) or UTF-8 on an HP-UX system raises compatibility concerns, a simple conversion script can search for each occurrence of an incorrect UCS2 or UTF-8 value and replace it with the correct value, based on the following mappings:

Old UCS2    Correct UCS2    Old UTF-8    Correct UTF-8    Character Name
0x30fb      0x2022          0xe383bb     0xe280a2         Bullet
0x2014      0x2013          0xe28094     0xe28093         EN Dash
0x2013      0x2014          0xe28093     0xe28094         EM Dash
0xfe31      0xff5c          0xefb8b1     0xefbd9c         Fullwidth Vertical Line
0xfe32      0xfe31          0xefb8b2     0xefb8b1         Presentation form Vertical EN Dash
0x2032      0x2035          0xe280b2     0xe280b5         Reversed Prime
0x2035      0x2032          0xe280b5     0xe280b2         Prime
0x2264      0x2266          0xe289a4     0xe289a6         Less-than over equal to
0x2265      0x2267          0xe289a5     0xe289a7         Greater-than over equal to
0xfe66      0xfe65          0xefb9a6     0xefb9a5         Small Greater-Than
0xfe65      0xfe66          0xefb9a5     0xefb9a6         Small Equals Sign
0xff5c      0x2223          0xefbd9c     0xe288a3         Divides
0xfe67      0xff0f          0xefb9a7     0xefbc8f         Full-width Solidus
0xffe5      0x00a5          0xefbfa5     0xc2a5           Yen Sign
0xffe0      0x00a2          0xefbfa0     0xc2a2           Cent Sign
0xffe1      0x00a3          0xefbfa1     0xc2a3           Pound Sign
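
Several rows in this table swap two values (for example 0x2013 and 0x2014) or form a chain (0xfe32 becomes 0xfe31, while 0xfe31 becomes 0xff5c, and 0xff5c becomes 0x2223). A script that applies the replacements one after another would therefore convert some characters twice. The Python sketch below avoids this by translating every character exactly once in a single pass. The names `EUCTW_FIXES`, `fix_euctw_text`, and `fix_euctw_utf8` are hypothetical, and the sketch assumes the data was produced only by the old eucTW tables.

```python
# Old UCS2 value -> corrected UCS2 value, copied from the table above.
EUCTW_FIXES = {
    0x30FB: 0x2022,  # Bullet
    0x2014: 0x2013,  # EN Dash
    0x2013: 0x2014,  # EM Dash
    0xFE31: 0xFF5C,  # Fullwidth Vertical Line
    0xFE32: 0xFE31,  # Presentation form Vertical EN Dash
    0x2032: 0x2035,  # Reversed Prime
    0x2035: 0x2032,  # Prime
    0x2264: 0x2266,  # Less-than over equal to
    0x2265: 0x2267,  # Greater-than over equal to
    0xFE66: 0xFE65,  # Small Greater-Than
    0xFE65: 0xFE66,  # Small Equals Sign
    0xFF5C: 0x2223,  # Divides
    0xFE67: 0xFF0F,  # Full-width Solidus
    0xFFE5: 0x00A5,  # Yen Sign
    0xFFE0: 0x00A2,  # Cent Sign
    0xFFE1: 0x00A3,  # Pound Sign
}


def fix_euctw_text(text: str) -> str:
    """Map each character once; swapped and chained rows are not re-applied."""
    return text.translate(EUCTW_FIXES)


def fix_euctw_utf8(data: bytes) -> bytes:
    """Correct UTF-8 data that was produced by the old eucTW tables."""
    return fix_euctw_text(data.decode("utf-8")).encode("utf-8")
```

Because the translation looks up each input character independently, running the script a second time on already corrected data would corrupt it again, so it should be applied exactly once per file.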

Correction for Japanese

A patch corrects four incorrect Japanese character mappings that occur between Shift-JIS/EUC and Unicode (UCS2)/UTF-8.

The following table summarizes the changes applied:

sjis      eucJP       Incorrect UCS2    Correct UCS2    Character Name
0x8150    0xA1B1      0xFFE3            0x203E          Overline
0x815C    0xA1BD      0x2015            0x2014          Em Dash
0x818F    0xA1EF      0xFFE5            0x00A5          Yen Sign
n/a       0x8FA2B7    0x02DC            0xFF5E          Full-width Tilde

The affected iconv conversions are those between sjis and UCS2 or UTF-8, and those between eucJP and UCS2 or UTF-8.

The affected iconv converter tables are sjis=ucs2, ucs2=sjis, eucJP=ucs2 and ucs2=eucJP. These tables are shared by both UCS2 and UTF-8 conversions.

No compatibility problems are anticipated. However, if persistent data stored in Unicode (UCS2) or UTF-8 on an HP-UX system raises compatibility concerns, a simple conversion script can search for each occurrence of an incorrect UCS2 or UTF-8 value and replace it with the correct value, based on the following mappings:

Old UCS2    Correct UCS2    Old UTF-8    Correct UTF-8    Character Name
0xFFE3      0x203E          0xefbfa3     0xe280be         Overline
0x2015      0x2014          0xe28095     0xe28094         Em Dash
0xFFE5      0x00A5          0xefbfa5     0xc2a5           Yen Sign
0x02DC      0xFF5E          0xcb9c       0xefbd9e         Full-width Tilde

Correction for Korean

A patch corrects the Korean Unicode (UCS2)/UTF-8 character mappings, which did not conform to current standards.

The currently supplied Korean iconv converter tables do not conform to the Unicode 2.0 and ISO-10646 (with 1997 amendments) standards, or to the Korean national standard, KSC-5700. All of these standards organizations consider the current mappings obsolete.

The fix provides a set of standards-conformant iconv converter tables for converting between eucKR and Unicode/UTF-8. Specifically, the obsolete region of 0x3d2e - 0x4dff has been remapped to the 0xac00 - 0xd7ff region specified in Unicode 2.0 for Hangul.

Without this modification, data cannot be shared with any system that conforms to the Unicode 2.0/ISO-10646/KSC-5700 standards.

Affected iconv conversions are any conversions between eucKR and UCS2 or UTF-8.

The iconv conversion tables affected by this modification are eucKR=ucs2 and ucs2=eucKR. These tables are shared by both UCS2 and UTF-8 conversions.

No compatibility problems are anticipated. However, if compatibility concerns arise with regard to persistent data stored either in Unicode (UCS2) or UTF-8 on an HP-UX system, it is recommended that the previously installed ucs2=eucKR table be saved and renamed prior to installation of this fix. Persistent data can then be converted back to eucKR using this old table and then reconverted to the correct Unicode/UTF-8 representation.
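
The two-step recovery described above can be sketched in Python as follows. This is only an illustration under several assumptions: the saved old ucs2=eucKR table has already been loaded into a dictionary (the parameter `old_ucs2_to_euckr` and the loading step are hypothetical, since the table file format is not described here), Python's built-in `euc_kr` codec stands in for the corrected eucKR=ucs2 table, and code points missing from the dictionary are assumed to have unchanged mappings.

```python
from typing import Mapping


def reconvert_korean(text: str, old_ucs2_to_euckr: Mapping[int, bytes]) -> str:
    """Undo the old Unicode mapping, then redo it with a conformant table.

    Step 1 converts each character back to eucKR bytes using the saved
    old table. Step 2 decodes those bytes with Python's euc_kr codec,
    used here as a stand-in for the corrected eucKR=ucs2 table.
    """
    euckr = bytearray()
    for ch in text:
        code_point = ord(ch)
        if code_point in old_ucs2_to_euckr:
            # Character affected by the fix: use the saved old table.
            euckr += old_ucs2_to_euckr[code_point]
        else:
            # Assumption: this mapping did not change, so the current
            # codec yields the same eucKR bytes the old table did.
            euckr += ch.encode("euc_kr")
    return bytes(euckr).decode("euc_kr")
```

On HP-UX itself the same two steps would be performed with iconv and the renamed table rather than with Python's codec.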

© 1998 Hewlett-Packard Development Company, L.P.