New Forum | Previous | Next | (P-PDF) Beginners
Topic: Copy/Paste of Hebrew or Arabic
Conf: (P-PDF) Beginners, Msg: 90051
Date: 6/17/2003 06:41 AM
Hi, I usually use Ghostscript + GSView + "Apple LaserWriter" Driver to make PDF files. I have also tried out Acrobat Distiller, but it hasn't solved my problem.
Trying out the first method: print to Postscript driver, drag PS file to GSView and convert to PDF (using pdfwrite) - well, the Postscript driver outputs the Hebrew or Arabic using a Built-In Encoding, so when I paste the Hebrew or Arabic into Notepad (on WinXP) I get a whole set of garbage ie C0 control characters.
Trying out the second method: print to Acrobat Distiller. Now the Distiller outputs Hebrew or Arabic using Identity-H Unicode encoding, so I don't get garbage when I paste the text into Notepad. Instead, I get Hebrew or Arabic in reverse reading order (ie LTR instead of RTL). For Hebrew this is merely an annoyance, for I have a tool for reversing the reading order for that language. For Arabic, however, I can't do a thing, because the text is not only in reverse reading order, but, instead of being correct Arabic from the U+06xx Unicode range, it's Presentation Forms Arabic from the U+0Fxx range - the sort of Arabic text that shouldn't be used in text files at all. It's no use.
If there's a workaround, I'll be glad to hear about it. If not, it seems that, as far as Hebrew and Arabic are concerned, PDF files are not a source for copying text. At any rate, since the commercial Acrobat Distiller hasn't solved my problem, I've uninstalled it and I'm continuing to use the free Ghostscript/GSView. As a copying source I'll have to keep offering a UTF-8 plaintext or HTML version.