Planet PDF Forum Archive

Planet PDF ForumThis is from our 160,000 page PDF discussion forum archive one & two spanning 1999-2011. Use the filters on our Advanced Search page to search archive only. Head to the LIVE Planet PDF Forum. It features more than 10 conferences, covering everything from beginner to in-depth developer and pre-press discussions.


Previous | Next | (P-PDF) What's Wrong with my PDF?


Topic: After OCR in Acrobat 7.0.8 and save, some text is garbled
Conf: (P-PDF) What's Wrong with my PDF?, Msg: 151718
From: dbindle
Date: 7/5/2006 07:16 AM

I'm currently trying to create searchable pdf's of fairly long documents... namely theses and dissertations.

I've had success creating a searchable pdf by using the "Recognize text using OCR" function with a clean "image only" scanned pdf document.

Now I'm trying to OCR a pdf that was probably scanned from microfilm and the text is clearly not as crisp.

I realize the OCR is not going to be as accurate, but if that text is hidden behind the scanned image... 90% accuracy could possibly be enough. The problem is... is that after the OCR, the pdf page is "re-imaged" and it seems like the words that the OCR choked on... are now distorted or garbled. Some letters are made to look worse... some words have spelling mistakes... some are bolded etc...

Can't I OCR the documents... and have the OCR'd text (poor as it might be) hidden in the background, but still have the originally scanned image being shown to the user who opens up the pdf to read it?

Thanks for any help
David


PDF In-Depth Free Product Trials Ubiquitous PDF

Debenu Aerialist 11

The ultimate plug-in for Adobe Acrobat. Advanced splitting, merging, stamping, bookmarking, and link...

Download free demo

Debenu PDF Tools Pro

It's simple to use and will let you preview and edit PDF files, it's a Windows application that makes...

Download free demo

Two Passwords Are Better Than One: The Low-Down On PDF Security

For people who don't spend their time looking at PDF files in text editors*, PDF security is a sometimes misunderstood beast.

For example, those document restrictions that PDF files sometimes have -- no Printing, Content Copying, Page Extraction, etc -- are essentially useless unless the PDF also has a User Password.

January 09, 2014
Platinum Sponsor



Search Planet PDF
more searching options...
Planet PDF Newsletter
Most Popular Articles
Featured Product

Debenu PDF Aerialist 11

The ultimate plug-in for Adobe Acrobat. Advanced splitting, merging, stamping, bookmarking, and link control. Take Acrobat to the next level.

Features

Adding a PDF Stamp Comment

OK, so you want to stamp your document. Maybe you need to give reviewers some advice about the document's status or sensitivity. This tip from author Ted Padova demonstrates how to add stamps with the Stamp Tool along with related comments.