Planet PDF Forum Archive



Topic: RE: Text Pattern Recognition
Conf: (P-PDF) Developers, Msg: 58631
From: skajan
Date: 5/29/2002 05:33 PM

Hi Mark

Actually, I am trying to extract text from a PDF document. When I extract the
text, it gets split up in a strange way. For example, if the document
contains the following text

"This is the test Text"

when I extract it, it comes out split as

"This is"


"test Text"

or something like this, with the pieces placed in different locations. Also,
if the document has several paragraphs and lines of text, they are not
extracted in the same order as in the original PDF.

So I am confused. I would rather not buy third-party plug-ins, because
this is a web-based project. I thought I could find a solution using AI
(maybe I am wrong).

If you have any ideas please help me.
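
For illustration only, here is a minimal sketch of one common way to tackle the ordering problem described above. It assumes the extraction step can report an (x, y) position for each text fragment, which the post does not confirm. The names Fragment, reading_order and y_tolerance are hypothetical and do not come from any particular PDF library. The idea is to group fragments whose baselines are close together into one line, then sort each line from left to right.

```python
from typing import List, Tuple

# Hypothetical fragment shape: (x, y, text), in PDF user-space units.
Fragment = Tuple[float, float, str]


def reading_order(fragments: List[Fragment], y_tolerance: float = 2.0) -> List[str]:
    """Rebuild lines of text from positioned fragments.

    Assumes PDF-style coordinates where y grows upward, so the
    fragment with the largest y is nearest the top of the page.
    """
    # Visit fragments from the top of the page to the bottom.
    ordered = sorted(fragments, key=lambda f: -f[1])

    lines: List[List[Fragment]] = []
    for frag in ordered:
        # Same line if the baseline is close to the line's first fragment.
        if lines and abs(lines[-1][0][1] - frag[1]) <= y_tolerance:
            lines[-1].append(frag)
        else:
            lines.append([frag])

    # Within each line, order fragments left to right and join them.
    return [
        " ".join(text for _, _, text in sorted(line, key=lambda f: f[0]))
        for line in lines
    ]


if __name__ == "__main__":
    sample = [
        (120.0, 700.0, "test Text"),
        (72.0, 700.5, "This is the"),
        (72.0, 680.0, "Second line"),
    ]
    for line in reading_order(sample):
        print(line)
```

This simple grouping would not handle multi-column layouts or rotated text; those need extra logic, and the tolerance value is a guess that would have to be tuned to the font size.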


-----Original Message-----
From: p-pdf-developer Listmanager
Sent: 18 October 2001 15:35
To: Recipients of 'p-pdf-developer' suppressed
Subject: Re: Text Pattern Recognition

From: Mark Gavin


>I would like to ask a question which is not related to this forum.
>Q) Does anybody come across an algorithm or solution for "Text Pattern
>Recognition" ?
>It can be AI.

There are dozens of standard algorithms for text pattern
recognition. What specifically are you trying to do?


Mark Gavin
Chief Technology Officer
Appligent, Inc. ( formerly Digital Applications, Inc. )
60 South Lansdowne Avenue
Lansdowne, PA 19050
(610) 284-4006

