Google now includes automatic PDF-to-text conversion
Enhancement allows PDF-based content to be indexed and searched

5 February 2001

Users of the Adobe Acrobat software and portable document format (PDF) files have been aware for some time that most major Internet search engines have been oblivious to the information contained in PDF files. In short, since PDF documents couldn't be indexed, they have not been part of what most engines could search.

A number of PDF-aware indexing tools and products have become available in recent years, but for the most part, the major online search sites have continued to all but ignore information stored inside PDFs. The result: A large and growing amount of good information has remained hidden from view (and vice versa, a lot of old, useless information in other formats remains, lowering further the overall quality of search results).

Google Does PDF!

Google search logoNo matter which criteria you use to rank the top Internet search sites, there's no argument today that Google (www.google.com) -- with more than 13 million files indexed -- has become one of the best. In addition to its relevant matches of up-to-date content and tailored descriptions, Google's search results offer a "cached" version of most pages -- its "spider" has crawled the Internet and maintained an archive of pages ("snapshots," Google calls them) that in some cases no longer exist on the Web. The search query term is also highlighted in the results.

The latest in Google's efforts to bring more of the so-called "Invisible Web" into view: PDF files are now converted to text, making the content indexable and searchable. Results provide a link to both the standard PDF document -- indicated by the term [pdf] in blue text -- and to Google's text version. Currently the results only show the PDF document's title as "Untitled," but further updates are expected to correct this and to add other new capabilities.

Google reportedly will have the first release of its PDF search functionality installed and operating on all 7,000 of its file servers by February 5.

MORE INFO:


PDF In-Depth Free Product Trials Ubiquitous PDF

Debenu Aerialist

The ultimate plug-in for Adobe Acrobat. Advanced splitting, merging, stamping, bookmarking, and link...

Download free demo

Debenu PDF Tools Pro

It's simple to use and will let you preview and edit PDF files, it's a Windows application that makes...

Download free demo

Back to the past, 15 years ago! Open Publish 2002

Looking back to 2002, it's amazing how much of the prediction became a reality. Take a read and see what you think!

September 14, 2017
Platinum Sponsor





Search Planet PDF
more searching options...
Planet PDF Newsletter
Most Popular Articles
Featured Product

Debenu PDF Aerialist

The ultimate plug-in for Adobe Acrobat. Advanced splitting, merging, stamping, bookmarking, and link control. Take Acrobat to the next level.

Features

Adding a PDF Stamp Comment

OK, so you want to stamp your document. Maybe you need to give reviewers some advice about the document's status or sensitivity. This tip from author Ted Padova demonstrates how to add stamps with the Stamp Tool along with related comments.