Big Faceless Organization has released an updated version of their Java PDF Library and Viewer.
Mike Bremford, CTO of BFO said, "Significant modifications have been made to the PageExtractor class. Text that was previously being extracted as individual letters or smaller groups are now being reassembled where possible into longer phrases. Some words had previously been broken up by the PDF structure."
The PDF Viewer enables users to extract and index both text and images from PDF documents, and includes integration with Apache Lucene. This version also offers a SWING component for displaying PDFs, along with the option to convert PDF to TIFF and boosted support for annotations with PDF document printing.
BFO offers the viewer as an extension to both the Java PDF Library Extended Edition and Java Report Generator Extended Edition.
A free fully functional trial version of the Big Faceless PDF Library can be downloaded at the
company Web site.
Despite the numerous benefits, there can be potential issues with the conversion of paper documents into electronic archives. When scanning paper pages into PDF, it's possible to end up with the odd- and even-numbered pages in separate PDF files. It can be very time-consuming to collate them manually, but there is an easier way. Sean Stewart explains.
BCL easyPDF SDK is a set of PDF Programming Libraries designed specifically to help Software Developers / Programmers build and deploy enterprise class PDF applications for corporate wide PDF...