New Forum | Previous | Next | (P-PDF) Developers
Topic: Unicode strings in PDF
Conf: (P-PDF) Developers, Msg: 95048
Date: 8/23/2003 04:54 AM
From all the subjects posted on this forum, dealing with Unicode amounts for a fair number of queries. There seems to be no easy way in PDF to display a unicode text. If I want to show an ascii (latin iso) string, and assuming that I use one of the base 14 fonts available for PDF documents, I can simply specify the encoding (windows, macroman, standard,etc...) and put my ASCII string in parentheses followed by Tj. It will always work. For Unicode, this is a totally different matter. I will need to specify the font with tons of properties associated with it (description), a subset as it is usually a bad idea to embed unicode fonts due to their huge size, and then point to a stream with unicode data. Or is this the case? I admit that after going through the PDF specs, reading posts associated with unicode on this forum, dumping PDF with unicode chars (PDF including Japanese/Chinese/Korean), I am a little bit confused at to what needs to be really done as for a document to display Unicode strings.
For example, what encoding needs to be specified? Can I just put 0xFEFF in front of a string inside a ()Tj tag and be done with it? Where do I get the Font description for basic fonts if I want the document to work with Acrobat reader + Asian pack on Windows/Mac/Solaris? What about obscure dictionary properties documented in TrueType and HP specs (PANOSE? What is that?)
I guess what I am asking is a sample of a minimum document that would display text in CJK by including double byte strings inside the PDF and if this is at all possible.
Thanks for the help and clarifications.