New Forum | Previous | Next | (P-PDF) Developers
Topic: Extracting word offset from pdf documents
Conf: (P-PDF) Developers, Msg: 123498
Date: 12/8/2004 07:09 AM
I would like to know if there's any tool or code that can be used to extract the words and word offsets in a PDF document..
I have an application that's like a web search engine that's supposed to hilight the words in pdf docs... I have my own code to extract the words and offset and generate the XML file to be sent to the browser .. The thing is I'm not sure that my offsets are the same as acrobats offset or not.. so I would like to have the word offsets of pdf as seen by the acrobat plugin to compare with the offsets my code generate..
Could anyone please help me on this? Thanks a whole