Planet PDF Forum Archive



Topic: After OCR in Acrobat 7.0.8 and save, some text is garbled
Conf: (P-PDF) What's Wrong with my PDF?, Msg: 151718
From: dbindle
Date: 7/5/2006 07:16 AM

I'm currently trying to create searchable PDFs of fairly long documents, namely theses and dissertations.

I've had success creating a searchable PDF by running the "Recognize text using OCR" function on a clean, image-only scanned PDF.

Now I'm trying to OCR a PDF that was probably scanned from microfilm, and the text is clearly not as crisp.

I realize the OCR is not going to be as accurate, but if that text is hidden behind the scanned image, 90% accuracy could be enough. The problem is that after the OCR, the PDF page is "re-imaged", and it seems the words that the OCR choked on are now distorted or garbled. Some letters look worse, some words have spelling mistakes, some are bolded, etc.

Can't I OCR the documents and have the OCR'd text (poor as it might be) hidden in the background, while the originally scanned image is still what's shown to anyone who opens the PDF to read it?

Thanks for any help
David
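
To make the "hidden text behind the scanned image" idea concrete, here is a minimal Python sketch of what such a page's content stream could look like. It relies on PDF text rendering mode 3 (`3 Tr`), which draws text with neither fill nor stroke, so the text is searchable but never visible and the scan is shown untouched. This is only an illustration of the layout, not a description of what Acrobat 7 does internally. The resource names `Im0` and `F1`, the function names, and the word positions are all assumptions made up for the example; a real file would also need the page, image, and font objects around this stream.

```python
def escape_pdf_string(text):
    """Escape characters that are special inside a PDF literal string."""
    # Backslash must be handled first so later escapes are not doubled.
    return text.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")


def hidden_text_page_content(image_name, page_width, page_height, words):
    """Build a content stream: full-page scan plus invisible OCR text.

    image_name  -- hypothetical XObject name of the scanned page image
    page_width, page_height -- page size in points
    words       -- iterable of (text, x, y, font_size) from an OCR pass
    """
    lines = [
        "q",                                        # save graphics state
        f"{page_width} 0 0 {page_height} 0 0 cm",   # scale image to the page
        f"/{image_name} Do",                        # paint the original scan
        "Q",                                        # restore graphics state
        "BT",                                       # begin text object
        "3 Tr",                                     # mode 3: invisible text
    ]
    for text, x, y, size in words:
        lines.append(f"/F1 {size} Tf")              # hypothetical font name F1
        lines.append(f"1 0 0 1 {x} {y} Tm")         # position the word
        lines.append(f"({escape_pdf_string(text)}) Tj")
    lines.append("ET")                              # end text object
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up OCR output for a US Letter page (612 x 792 points).
    sample_words = [("thesis", 72, 700, 12), ("chapter (1)", 72, 680, 12)]
    print(hidden_text_page_content("Im0", 612, 792, sample_words))
```

With this layout, OCR mistakes only affect what a search finds; the page the reader sees is still the original scan, because the image is painted as-is and the text never renders.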

