Topic: Library to extract text from PDFs?
Conf: (P-PDF) Developers, Msg: 54526
Date: 5/29/2002 05:06 PM
I've looked high and low, and I've given up waiting for Adobe to release the API they once promised for June. Perhaps someone here can help.
Is there a library -- $ware or freeware, COM/DLL or source code -- that we can use from our WinNT-based Web applications to extract the text from PDFs for indexing purposes? We don't care about layout, we don't care about order, we don't care about structure. We just want "the words", quickly, easily, and efficiently.
We DO care about not having to do it manually via an Acrobat Reader plug-in, or via automated submission to http://access.adobe.com! We'd just like a handy leeettle library which runs ON the computer in question and gives us the words.
Thanks in advance.