Topic: Re: Getting Text from PDFs using VB6
Conf: (P-PDF) Developers, Msg: 57181
From: LeonardR
Date: 5/29/2002 05:24 PM
At 7:08 PM -0500 1/29/01, p-pdf-developer Listmanager wrote:
>I am having some trouble figuring out how to automate
>the acquisition of text from PDFs. ... but am having trouble figuring out how to
>reference the PDDoc.CreateTextSelect,
>PDTextSelect.GetNumText and PDTextSelect.GetText and
>have it return anything. I only get errors. I am
>trying to get a specific page of a PDF, get the text
>on that page, and find certain information from the
>text. (Win2000, VB6, Acrobat.tlb)
>
I am pretty sure that what you are attempting is not
supported through the OLE interfaces, but only through the standard
APIs and plug-ins. Also, even if it did work, it would only work
with the full Acrobat, and NOT with Reader (which may be OK with you,
I don't know).
I would suggest that you look into licensing a third-party
library or application that provides this functionality without requiring
Acrobat in any way. You may also want to look into the open source
world for command-line applications that can extract text from PDF
pages and could be called from VB.
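
As a rough sketch of the command-line route, the example below assumes
Xpdf's pdftotext is installed and on the PATH. The message above does not
name a specific tool, so that choice is an assumption, and the helper names
(build_command, extract_page_text, find_lines) are hypothetical. It is
written in Python for brevity; from VB the same command line could be
launched and its output read back in a similar way.

```python
import subprocess


def build_command(pdf_path, page, tool="pdftotext"):
    """Build the argument list that extracts a single page to stdout.

    Assumes an Xpdf-style pdftotext: -f and -l set the first and last
    page, and "-" as the output file sends the text to stdout.
    """
    return [tool, "-f", str(page), "-l", str(page), pdf_path, "-"]


def extract_page_text(pdf_path, page, tool="pdftotext"):
    """Run the extractor and return the page text as a string."""
    result = subprocess.run(
        build_command(pdf_path, page, tool),
        capture_output=True,
        text=True,
        check=True,  # raise if the tool reports an error
    )
    return result.stdout


def find_lines(text, needle):
    """Return the stripped lines that contain needle, ignoring case."""
    needle = needle.lower()
    return [line.strip() for line in text.splitlines()
            if needle in line.lower()]
```

A call such as find_lines(extract_page_text("report.pdf", 3), "invoice")
would then cover the "specific page, then search the text" case from the
question, where report.pdf and the search term are placeholders.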
Leonard
--
----------------------------------------------------------------------------
Leonard Rosenthol
Sr. Software Engineer (215) 922-3509 (voice)
Digital Applications (215) 440-0504 (fax)
PGP Fingerprint: 8CC9 8878 921E C627 0BC1 15BB FC19 64A9 0016 1397