Planet PDF Forum Archive

Planet PDF ForumThe page you are viewing is part of our 160,000 page PDF discussion forum archive spanning 1999-2008. Would you believe we have a 2nd forum archive which covers 2008 - 2011? But... if you really want to bust-a-move head to the LIVE Planet PDF Forum. It features more than 10 conferences, covering everything from beginner discussions to in-depth developer and pre-press discussions.


How to search this archive. The quickest way is to use the filters on our Advanced Search page so that only archive pages are included in the results.


Previous | Next | (P-PDF) General


Topic: Re: Strange character set when copying text to Word
Conf: (P-PDF) General, Msg: 50315
From: picax
Date: 5/29/2002 04:37 PM



En/Na Mark Zempel ha escrit:
>
> the first line should read ZERO %
>
> Any help will be greatly appreciated
>
> Mark Zempel
>
> ------------------------------------------------------------------------
> Name: textsample.pdf
> textsample.pdf Type: Portable Document Format (application/pdf)
> Encoding: base64


Hi Mark,

This is a curious PDF. This file has made created or manipulated with
PitStop application. This PDF have internally all the text totally and
highly altered. I will explain this complex structure and the reasons
for this strange character behaviour:

A. When you open the document with the Acrobat viewer, you will see all
the
characters with the same font style. Internally it's not the same. It
have
18 different font types (see the menu File > Document Info > Fonts)

B. Every font have an 'strange' and 'different' font encoding. In this
simple PDF there are 18 different font encodings (font encoding is the
correspondence with the 'character name' and this ascii code). For
example: in the particular font called 'FMNBBE+F19726528.0' the
character '9' have
the ascii code '1', when the standard ascii code for this is '57'

C. Also all the characters have their internal name altered. For
example: in the same font called 'FMNBBE+F19726528.0' the character '9'
have
the name 'c57', when the standard character name is 'nine'

D. By this reasons you will see well the text document only in the
Acrobat viewer, but when you 'copy and paste' the characters, you only
'transport' the 'rare and particular' characters codes, and the
disaster appears in the wordprocessor.

Have you tried to use a plug_in to extract the PDF text ? Do you know
if they extracts only strange characters ?

If you can't resolve the problem and you are interested on find
a tool to extract correctly the text, lets me know, and I can
develope it in a few days.

I hope this can help you.


Marc Antoni Malagarriga
@:)

PDF In-Depth Free Product Trials Ubiquitous PDF

Debenu Aerialist 11

The ultimate plug-in for Adobe Acrobat. Advanced splitting, merging, stamping, bookmarking, and link...

Download free demo

LockLizard Safeguard PDF Security

Made specifically for publishers of high value information published in PDF format, it protects your PDF...

Download free demo

Ubiquitous PDF: DIY PDF magazines, courtesy of CNET and Magazinify

Thanks to Magazinify.com, it's possible to have web articles delivered right to your inbox in PDF form. If that weren't enough, the nice folks at CNET have been nice enough to publish a step-by-step guide about how to set this all up using just a little time and a free Magazinify account.

September 06, 2011
Search Planet PDF
more searching options...
PDF Resources
Platinum Sponsor

Debenu - Unrivaled PDF Productivity | PDF Library, Acrobat Plug-Ins

Create & Edit PDF - Nitro PDF Software

Silver Sponsors

LockLizard DRM PDF Security Quick PDF Library: The Unrivaled PDF Developer Toolkit

Featured Product

Debenu PDF Aerialist 11

The ultimate plug-in for Adobe Acrobat. Advanced splitting, merging, stamping, bookmarking, and link control. Take Acrobat to the next level.

Featured Event

Adobe Digital Marketing Summit

March 20-23, 2012 -- Salt Palace Convention Center, Salt Lake City, Utah

The Digital Marketing Summit is the premier event for digital marketers and advertisers to learn about and share key strategies for driving marketing innovation. Attend Summit to learn how you can create, measure, and optimize digital experiences to revolutionize how the world engages with ideas and information.