PDF In-Depth

Scanned documents to high-compressed PDF

April 11, 2016

Advertisement

 

Color scanning is state-of-the-art today. Scanner hardware can produce color scans with high-quality from small MFP scanners up to high-end production scanners.

Scanners also deliver a high performance for color scanning with the typical 300dpi resolution in business environments.

A high-quality scan with 300dpi in color typically creates a large data volume with e.g. 25 MB as totally uncompressed TIFF for one page letter size and this not practical in the network and applications. Therefore, black-white TIFF or JPEG for color was used but has obviously many disadvantages.

PDF is a modern file format which allows for advanced compression of scanned documents. PDF supports compression schemas like JPEG 2000 and JBIG2 and especially, the so-called Mixed-Raster-Compression allows for high compression while preservering a very good quality of the scanned document.

Compression and quality depend heavily on the used tools for PDF conversion and sophisticated solutions can create color scans with approx. 40-80 KB per page which is normally the size if black-white TIFF is used.

Another major benefit of PDF for scanned documents is that PDF allows easily to make the pixel raster images of a scanner full text searchable. By deploying OCR to the scanned document, the document becomes ?intelligent? instead of being a simple image. Advanced solutions create full-text searchable PDF files and have the option of extracting the OCR recognition results separately e.g. for adding this into a full text search database for all documents. It is more ?by the way? starting from the archiving perspective). Very often, the scanned documents are processed in the according business workflows and at the end of processing, they are stored in the archive.

Long term archiving is the goal of the ISO standard PDF/A which ensures that those important business documents will have an identical reproduction in the unknown future. Simply speaking, PDF/A guarantees a long term safe digital paper.

For scanned documents, PDF/A-2u is best practice since many years. Digital mailroom is one of the main applications where all incoming paper letters are already scanned in the beginning. A lot of enterprises have digital mailroom applications already in place and color scanning, OCR and PDF/A are reasonable optimization steps for this operation.

Organisations also have a lot of digitization projects where existing paper like insurance files, customer files, etc. shall be digitized for a complete document management system.

About Thomas Zellman

Foxit has now added LuraTech to the FoxitONE group and LuraTech is a long term specialist for document conversion and especially for scanned documents to high-compressed PDF and PDF/A files.

PDF Compressor is a production-level application for compression, conversion to PDF and PDF/A with OCR.

PDF In-Depth Free Product Trials Ubiquitous PDF

Debenu Aerialist 12

The ultimate plug-in for Adobe Acrobat. Advanced splitting, merging, stamping, bookmarking, and link...

Download free demo

Debenu PDF Tools Pro

It's simple to use and will let you preview and edit PDF files, it's a Windows application that makes...

Download free demo

PDF Master Series III: Eugene Y. Xiong talks with Planet PDF

Planet PDF talks with another Master of the PDF Universe, Eugene Y. Xiong, Founder and Chairman of the Board at Foxit Software Inc. in Fremont California. Xiong is a quiet yet astounding achiever, you (usually) won't find him talking at conferences, exhibits, or publishings, but what you will find is the result of his leadership in places you would never expect.

September 14, 2016
Platinum Sponsor



Search Planet PDF
more searching options...
Planet PDF Newsletter
Most Popular Articles
Featured Product

Debenu PDF Aerialist 12

The ultimate plug-in for Adobe Acrobat. Advanced splitting, merging, stamping, bookmarking, and link control. Take Acrobat to the next level.

Features

Adding a PDF Stamp Comment

OK, so you want to stamp your document. Maybe you need to give reviewers some advice about the document's status or sensitivity. This tip from author Ted Padova demonstrates how to add stamps with the Stamp Tool along with related comments.