Some letters are in wrong order in the output of pdftotext
Submitted by Bassem JARKAS
Assigned to poppler-bugs
I have a pdf file created in Adobe InDesign CS3 (5.0.4) with an embedded Arabic font called AXtManal, this font was created to work around the limitation of publishing softwares of creating Arabic documents.
pdftotext v0.15.3 (and older versions) renders some letters in wrong order, for eample: the word "abcd" appears "acbd", and this error repeated with many groups of letters, like "l" and "a", "m" and "j", "r" and "y" ..etc
Evince displayed the file correctly with the correct order and the correct layout. the problem is only in the extracting.
Any idea how to fix that?
you can find the pdf sample here: https://sites.google.com/site/jarkas/Home/049.pdf?attredirects=0&d=1 and the text output: https://sites.google.com/site/jarkas/Home/049_0.15.3.txt?attredirects=0&d=1