Planet PDF Forum Archive

Planet PDF  ForumThe page you are viewing is part of our 160,000 page discussion forum archive. See below for PDF-related discussions spanning 1999-2008. To ask questions and get help, head to the live Planet PDF Forum.


How to search this archive. The quickest way is to use the filters on our Advanced Search page so that only archive pages are included in the results.


Previous | Next | (P-PDF) Paper to PDF


Topic: Re: Optimizing OCR Process - GUIDANCE PLEASE!!! (Via Email)
Conf: (P-PDF) Paper to PDF, Msg: 23871
From: dhgpj
Date: 7/12/2001 04:56 PM

>In the process of creating an imaging solution for my Center, I have hit =a HUGE snag...

....and a very common one, BTW.

>Approx. 20% of the documents I will be archiving are 'onion skins' which =I intend to photocopy.

Why photocopy? Why add yet another generation? Scan the onionskin!

>The problem that I'm running into is that many of the skins are 7 or 8 =generation, causing the
>letters to be fuzzy. This fuzz is failing to be recognized in the OCR =process!!!

No surprise there, believe me! 7 paper generations can kill OCR for =almost any document.

>Using Acrobat Capture 3.01, I've tried exporting to PDF as Image & Text, =Searchable Image
>Exact, as well as Searchable Image Compact. The best result, with two =paragraphs identified as
>text, was with the Image & Text option; but this is by know means =acceptable.

What is acceptable? Unrealistic expectations are by far the largest =source of "problems" with OCR applications.

>What can be done on both software & hardware ends to optimize the OCR =process??

Your documents may be simply too poor in quality to get any meaningful OCR =results no matter what you try. There are other OCR products that are =somewhat more document-quality tolerant than Capture. (I am not in the =habit of making specific reccomendations - every case is different). With =the documents you describe, however, your uncorrected accuracy is going to =be lousy no matter what.

Duff Johnson
Document Solutions, Inc.
www.document-solutions.com


PDF In-Depth Free Product Trials Ubiquitous PDF

Pitstop Pro

Now graphic arts professionals have even broader and more expert control over their PDF documents. With...

Download free demo

ARTS PDF Aerialist

The ultimate plug-in for Adobe Acrobat and #1 selling product at PDF Store. Advanced splitting, merging,...

Download free demo

Ubiquitous PDF: Got holiday catering?

Bret Thompson, chef and owner of LA's Milk restaurant on Beverly, has joined forces with the California Milk Processor Board to compile a set of dairy-based recipes for the holiday season.

November 20, 2008
Search Planet PDF
more searching options...




PDF Resources
Platinum Sponsor
Create & Edit PDF - Nitro PDF Software

ARTS PDF

Silver Sponsors

PDF-Tools enfocus

FileOpen PDFNet SDK: PDF Component. Cross-platform PDF library

PDF Converter: Create, Convert PDF to Word/Excel Solid Documents - PDF to Word Converter

Debenu: Desktop Document Management, Archiving and Tagging Software Visual Integrity: Convert, Create, Extract, Merge and Modify PDF

SmartSoft

Get Nitro PDF Professional
Featured Product

NITRO PDF Professional

Built from the ground up, the perfect desktop PDF product for business and enterprise. Nitro PDF Professional has an uncompromising feature set so you can create, combine, edit, collaborate on and...

Featured Event

Adobe Acrobat & PDF Central Conference 2008

September 23 - 25, 2008 - Minneapolis, Minnesota

Dedicated exclusively to Adobe Acrobat and PDF, the 2008 Adobe Acrobat and PDF Central Conference presents a unique opportunity to broaden your understanding of how PDF technologies can positively affect your business productivity.

Discover the new features of Acrobat 9 and expand your knowledge of PDF technology. Participate in various educational seminars, such as Acrobat Form Fundamentals, Acrobat Security, Creating PDF Portfolios, Scripting with the new Flash Annotation in PDF, JavaScript for Acrobat, and much more.

PDF Store Categories