Polish characters incorrectly extracted
Submitted by Urmas
Assigned to poppler-bugs
Description
Poppler extracts ż as Ŝ and Ż as ś. The extracted Ż is then indistinguishable from a real ś, which results in data loss.
The file contains encoding objects like this:
1648 0 obj <</Type/Encoding/BaseEncoding/WinAnsiEncoding/Differences[ 1/eogonek/sacute/nacute/zdot/cacute/aogonek/Zdot/zacute/Sacute]>> endobj
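For reference, a minimal Python sketch of how a /Differences array like the one above assigns character codes to glyph names: the integer sets the starting code, and each following name takes the next code. The function names and the name-to-Unicode table are assumptions made for illustration (in particular, that zdot and Zdot are meant as ż and Ż). This is not Poppler's code or its actual lookup table.

```python
import re

# Hypothetical name-to-Unicode table for this sketch only.
# Assumes zdot/Zdot stand for z/Z with dot above.
GLYPH_TO_UNICODE = {
    "eogonek": "\u0119",  # ę
    "sacute": "\u015b",   # ś
    "nacute": "\u0144",   # ń
    "zdot": "\u017c",     # ż
    "cacute": "\u0107",   # ć
    "aogonek": "\u0105",  # ą
    "Zdot": "\u017b",     # Ż
    "zacute": "\u017a",   # ź
    "Sacute": "\u015a",   # Ś
}


def parse_differences(array_text):
    """Return {code: glyph_name} for the body of a /Differences array."""
    codes = {}
    code = None
    # Tokens are either /Name or an integer starting code.
    for token in re.findall(r"/[^\s/\[\]]+|\d+", array_text):
        if token.startswith("/"):
            if code is None:
                raise ValueError("glyph name before any starting code")
            codes[code] = token[1:]
            code += 1  # each name takes the next consecutive code
        else:
            code = int(token)
    return codes


def decode(data, codes):
    """Decode bytes through the code map; unknown names become U+FFFD."""
    return "".join(
        GLYPH_TO_UNICODE.get(codes.get(byte, ""), "\ufffd") for byte in data
    )


DIFFERENCES = "1/eogonek/sacute/nacute/zdot/cacute/aogonek/Zdot/zacute/Sacute"
CODES = parse_differences(DIFFERENCES)
```

With this array, code 4 is zdot and code 7 is Zdot, so decode(b"\x04\x07", CODES) gives ż and Ż under the table assumed above. An extractor that does not recognize those two names would have to fall back to something else for codes 4 and 7.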