Planet PDF Forum Archive

The page you are viewing is part of our 160,000-page discussion forum archive. See below for PDF-related discussions spanning 1999-2008. To ask questions and get help, head to the live Planet PDF Forum.




Topic: OCR compatibility with tag tree/searching
Conf: (P-PDF) PDF Accessibility, Msg: 134325
From: amdickson
Date: 6/11/2005 12:46 AM

Hello.

I'm trying to create accessible, searchable PDFs from paper texts.

I scan them as 300 dpi bitonal TIFFs and then perform OCR using OmniPage Pro 11. Then I save back as PDF with image on text. Retaining the image is important for these archival documents.

The first problem I'm having is with the tag tree. In Acrobat 6.0 Professional, I use the Add Tags to Document option in the Accessibility portion of the Advanced menu. This invariably creates a Figure tag for the page image and a Table tag for the page of text. Each line of text (or portion thereof) is given a Paragraph tag. Also, the graphic zones that I've created in OmniPage are not recognized as Figures in Acrobat. This, to me, makes the tag tree relatively useless. But the Web Accessibility people at my institution teach "an accessible PDF is a tagged PDF."

Do you have any experience with this or know of any solution? Do other OCR programs work better with Acrobat?

Also, some of my documents are math papers with formulas. The powers that be want the formulas to be expressed in natural language for screen readers and also to be searchable by their LaTeX source. Any suggestions on where to hide all of this text? My current (but untested) solution is to put the natural language in the OCR layer and the LaTeX in Bookmarks, although this seems less than ideal, and I'm not sure how it will work across platforms.

Thanks in advance for your ideas.
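
To make the formula requirement concrete, here is a minimal Python sketch of the data I have in mind: each formula carries both a natural-language reading and its LaTeX source, and a search matches against either. It says nothing about where the two strings would actually live inside the PDF (OCR layer, Bookmarks, or elsewhere). The names `FormulaEntry`, `collapse`, and `search`, and the two sample formulas, are made up for illustration and do not come from Acrobat or OmniPage.

```python
from dataclasses import dataclass


@dataclass
class FormulaEntry:
    """One formula, stored in the two forms the requirement asks for."""
    page: int
    spoken: str  # natural-language reading, meant for screen readers
    latex: str   # LaTeX source, meant for searching


def collapse(text: str) -> str:
    """Collapse runs of whitespace so spacing differences do not block a match."""
    return " ".join(text.split())


def search(entries, query):
    """Return entries whose spoken text or LaTeX source contains the query.

    Spoken text is matched case-insensitively. LaTeX is matched
    case-sensitively, since case can change the meaning of a command.
    """
    q = collapse(query)
    if not q:
        return []
    hits = []
    for entry in entries:
        in_spoken = q.lower() in collapse(entry.spoken).lower()
        in_latex = q in collapse(entry.latex)
        if in_spoken or in_latex:
            hits.append(entry)
    return hits


# Hypothetical sample data, not taken from any real document.
entries = [
    FormulaEntry(3, "x squared plus y squared equals r squared", r"x^2 + y^2 = r^2"),
    FormulaEntry(7, "the integral from zero to one of x d x", r"\int_0^1 x \, dx"),
]

if __name__ == "__main__":
    for hit in search(entries, r"\int_0^1"):
        print(hit.page, hit.spoken)
```

Keeping the two strings as separate fields, rather than merging them into one block of hidden text, would at least let a screen reader be given only the spoken form while a search could still see both.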
