pdftotext fails with signed PDFs
Dear developers,
First of all, thank you for your effort and work for the open source community.
Recently, pdftotext has started failing to extract pages from signed PDFs in our web application framework.
The PDFs are signed with the AIS.py package from Swisscom, together with their signing service.
Running pdftotext sample_signed.pdf sample.txt
produces:
Syntax Error (48502): Command token too long
Syntax Error (48640): Command token too long
Internal Error: xref num 10 not found but needed, try to reconstruct
Syntax Error (48502): Command token too long
Syntax Error (48640): Command token too long
Syntax Error: Couldn't find trailer dictionary
Syntax Error: Catalog object is wrong type (null)
Syntax Error: Couldn't find trailer dictionary
Syntax Error: Invalid XRef entry 10
Internal Error: xref num 10 not found but needed, try to reconstruct
Syntax Error: Invalid XRef entry 10
Syntax Error: Couldn't find trailer dictionary
Syntax Error: Catalog object is wrong type (null)
Syntax Error: Couldn't read page catalog
Version: pdftotext version 0.86.1
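Since the errors point at the trailer and the xref table, a rough check of the file's tail might help narrow things down. The following is only a minimal Python sketch of our own (the function name inspect_pdf_tail is hypothetical and not part of pdftotext or AIS.py). It assumes the signature was appended as an incremental update, and it reports how many %%EOF markers the file contains, the last startxref offset, and the bytes found at that offset:

```python
import re
import sys


def inspect_pdf_tail(data: bytes) -> dict:
    """Collect basic facts about the xref/trailer structure of a PDF."""
    # Each incremental update (such as an appended signature) normally
    # ends with its own %%EOF marker.
    eof_count = data.count(b"%%EOF")
    # The last startxref keyword gives the byte offset of the most
    # recent cross-reference section.
    matches = re.findall(rb"startxref\s+(\d+)", data)
    offset = int(matches[-1]) if matches else None
    target = None
    if offset is not None and offset < len(data):
        # A classic table starts with "xref"; a cross-reference stream
        # starts with an object header such as "12 0 obj".
        target = data[offset:offset + 20]
    return {"eof_count": eof_count, "startxref": offset, "target": target}


if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1], "rb") as fh:
        print(inspect_pdf_tail(fh.read()))
```

If the bytes at the reported offset start with neither "xref" nor an object header, the offset written by the signing step would be a likely suspect, though this check alone does not prove it.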
Any hints would be appreciated. If the signed files turn out to be malformed, we may have to raise the issue with the signing service.