PDFlib releases new text extraction package

July 06, 2005


The latest addition to the Munich-based PDFlib's suite of developer products is entitled PDFlib Text Extraction Toolkit. Also known as PDFlib TET, the software package is used to extract text from PDF documents, converting it to Unicode strings while preserving font and glyph information. The toolkit is currently available as a library, component and a command-line tool.

Suggested uses include the development of software for searching text, implementing a search engine to process large PDF archives, extracting text for storage or translation, converting PDF text into other formats, content-based processing of PDF documents (e.g. highlighting keywords) and comparing text between multiple PDF documents.

TET has been designed for standalone use, and does not require any third-party software to run effectively. Additionally, the product is robust enough for multi-threaded server use, significantly increasing its capacity. Language bindings including Windows, Macintosh and a several UNIX versions are available for use with various programming environments.

PDFlib TET is currently available for purchase and download.

PDF In-Depth Free Product Trials Ubiquitous PDF

Debenu Quick PDF Library

Get products to market faster with this amazing PDF developer SDK. Over 900 functions and an equally...

Download free demo

Five visions of a PDF Day

In the world of PDFs or as we like to say Planet (of) PDF, a year isn't a real PDF year without an intense few days of industry knowledge sharing.

May 15, 2018
Platinum Sponsor

Search Planet PDF
more searching options...
Planet PDF Newsletter
Most Popular Articles
Featured Product

Debenu PDF Aerialist

The ultimate plug-in for Adobe Acrobat. Advanced splitting, merging, stamping, bookmarking, and link control. Take Acrobat to the next level.


Adding a PDF Stamp Comment

OK, so you want to stamp your document. Maybe you need to give reviewers some advice about the document's status or sensitivity. This tip from author Ted Padova demonstrates how to add stamps with the Stamp Tool along with related comments.