Topic: RE: Contents of a pdf (Via Email)
Conf: (P-PDF) PDF Accessibility, Msg: 144372
From: Duff_Johnson
Date: 12/29/2005 07:05 AM
> I have a disk full of PDFs created either by a PDF printer or
> via OpenOffice output.
>
> The OpenOffice output has selectable text, whereas the PDF printer
> output is purely a picture.
That's unfortunate...
> I would like to run an OCR program on all my picture-only
> PDFs, leaving alone the ones already in ASCII form.
>
> I therefore need to have a way of knowing what is in each
> file without manually opening it to check if it has any text there.
>
> Is there a program that can test a PDF and give me a code
> signifying the result? If not, how can I discover the text
> properties of a PDF without fully opening it? For example, is
> there a way of finding out what layers are in it?
See Apago's pdfspy application - it can do this. If you know a little
about command-line applications, that will help a lot.
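
If you would rather script a first pass yourself, here is a rough sketch in Python. It is not how pdfspy works, and the names (looks_like_text_pdf, main) are made up for illustration. It rests on one assumption: a PDF that never mentions a font resource ("/Font") most likely contains only images. It also looks inside Flate-compressed streams, since newer PDFs can pack their dictionaries there. It will misjudge encrypted files, and a file that embeds a font but draws no text will be reported as "text", so treat the result as a hint.

```python
import re
import sys
import zlib

# Matches the raw bytes between the "stream" and "endstream" keywords.
STREAM_RE = re.compile(rb"stream\r?\n(.*?)endstream", re.DOTALL)


def looks_like_text_pdf(data: bytes) -> bool:
    """Heuristic only: True if the file references any font resource."""
    if b"/Font" in data:
        return True
    # PDF 1.5+ may keep dictionaries inside Flate-compressed object streams,
    # so try to inflate each stream and look again.
    for match in STREAM_RE.finditer(data):
        try:
            inflated = zlib.decompressobj().decompress(match.group(1))
        except zlib.error:
            continue  # not Flate data (for example a JPEG image); skip it
        if b"/Font" in inflated:
            return True
    return False


def main(paths):
    """Print one line per file; return 0 only if every file seems to have text."""
    all_text = True
    for path in paths:
        with open(path, "rb") as handle:
            has_text = looks_like_text_pdf(handle.read())
        print(f"{'text' if has_text else 'image-only'}\t{path}")
        all_text = all_text and has_text
    return 0 if all_text else 1


if __name__ == "__main__" and len(sys.argv) > 1:
    sys.exit(main(sys.argv[1:]))
```

Saved under a name of your choice and run with one or more PDF paths as arguments, it prints "text" or "image-only" next to each file and exits with status 1 if any file looks image-only, so a shell loop can pick out the files that still need OCR.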
Duff Johnson
Document Solutions, Inc.
http://www.document-solutions.com