Topic: When does PDFontXlateToUCS fail?, 1 Attachment
Conf: (P-PDF) Developers, Msg: 131763
Date: 4/26/2005 05:15 PM
In a project on text extraction from PDF, we are using PDFontXlateToUCS to translate a string from whatever encoding the PDFont uses to Unicode. In some cases (we have not identified which), PDFontXlateToUCS fails partially or completely, i.e. it does not convert the whole text-run string (obtained from PDETextGetText) passed to it. It converts either none of the characters (complete failure) or only a few of them (partial failure).
If anybody knows, please tell us in which cases PDFontXlateToUCS fails, and how the correct Unicode value for each character can be retrieved.
The file in which this problem occurs is attached.
Thanks and Regards,
Pages 27 from chinease text book.pdf