Topic: Highlight file problems
Conf: (P-PDF) Developers, Msg: 84889
Date: 3/25/2003 11:58 PM
I'm trying to write a tool that creates a PDF highlight file after a search over the text of a PDF document. I extract the text from the document so that the user can search it for words. The tool then displays the document in a browser and generates a highlight file that marks the words the user searched for.
While testing my tool, I ran into some unexpected behaviour. The problems relate to characters like ' ` ( ) , and others.
Is there a list of characters that need special treatment when counting the words in a PDF document?
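To illustrate the kind of discrepancy I mean, here is a minimal Python sketch. Both tokenization rules and all function names are hypothetical, made up for this example; neither rule is confirmed to be the one the viewer uses. The point is only that the word index of the same search term shifts depending on how punctuation is treated.

```python
import re


def words_split_on_whitespace(text):
    # Rule A (assumption): a word is any run of non-whitespace characters,
    # so punctuation stays attached to the neighbouring letters.
    return text.split()


def words_split_on_punctuation(text):
    # Rule B (assumption): a word is a run of ASCII letters or digits,
    # so every punctuation character acts as a word separator.
    return re.findall(r"[A-Za-z0-9]+", text)


def word_offsets(words, target):
    # Zero-based indices of the words that exactly equal the search term.
    return [i for i, w in enumerate(words) if w == target]


sample = "It's a (small) test, isn't it"

rule_a = words_split_on_whitespace(sample)
rule_b = words_split_on_punctuation(sample)

# The same search term lands on a different word index under each rule.
print(len(rule_a), word_offsets(rule_a, "it"))
print(len(rule_b), word_offsets(rule_b, "it"))
```

If my extractor counts words by one rule and the viewer counts by another, every highlight after the first apostrophe or bracket ends up on the wrong word, which matches what I am seeing.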
There also seem to be differences in how words are counted between PDF documents of version 1.2 and version 1.3.
Adobe's document 'Highlight File Format' does not give enough information about these problems.
I hope someone can help me with these problems.
Thanks in advance,