New Forum | Previous | Next | (P-PDF) Developers
Topic: Character code for unicode
Conf: (P-PDF) Developers, Msg: 130868
Date: 4/11/2005 08:58 PM
When we are creating PDF with unicode strings, with different CMaps, we insert different character codes.
when UniJIS-UCS2-HW-H CMap is used, the character codes (to print, say - Hello) that actually go into the PDF file are:
where 0048 - H
0065 - e
006C - l
006C - l
006f - o
But when I use, Identity-H which is most widely used CMPA for most of the languages, the character codes in the PDF for the same text are:
0029 - H
0046 - e
004D - l
I read in the PDF reference that Identity-H CMap can be used to refer characters directly by their CIDs.
But how do I decide what character codes should be put in the PDF file. Should I directly put the binary data in the PDF file instead of the HEX values ? so that whatever CMap is specified, it is automatically taken care of?
Basically, how do I know that PDF should contain - 0029 or 0048 for 'H'?