poppler, issue #891
Issue created Mar 09, 2020 by Rava (@Rava)

[pdftotext] fails on German umlauts

I upgraded Poppler on my Linux system to 0.86.1-1-x86_64, hoping that the failure to convert a PDF containing German umlauts was a bug that had already been fixed. Unfortunately, it is not. pdftotext 0.86.1 creates exactly the same erroneous txt file as the older 0.68.0, which is the most recent version in Slackware. When I read the PDF with PDF Viewer 0.1.8, all umlauts are displayed correctly.

The created txt file is UTF-8 encoded, but the umlauts are all wrong, e.g. Ü is converted into † and ö into š. The list goes on. The PDF is linked from a Wikipedia page and is freely accessible: https://de.wikipedia.org/wiki/Willkommen_und_Abschied. The URL of the PDF is http://revistas.ucm.es/index.php/RFAL/article/download/RFAL0000110081A/33794 and the filename is "34939-Texto del artículo-34955-1-10-20110610.PDF".
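A note on the two substitutions reported above: Ü is byte 0x86 and ö is byte 0x9A in Mac OS Roman, and those same two bytes are † and š in Windows-1252. So one possible explanation (an assumption, not confirmed anywhere in this report) is that the PDF's fonts carry Mac OS Roman character codes that end up being read as Windows-1252. The Python sketch below only illustrates that mapping and a possible workaround for the txt output; `undo_macroman_mixup` is a hypothetical helper name, not part of Poppler.

```python
# Sketch under an assumption: the wrong characters come from Mac OS Roman
# bytes being interpreted as Windows-1252. This is not Poppler code.

def undo_macroman_mixup(text: str) -> str:
    """Re-read each Windows-1252 character as the Mac OS Roman byte it stands for."""
    out = []
    for ch in text:
        try:
            out.append(ch.encode("cp1252").decode("mac_roman"))
        except UnicodeError:
            # Character has no Windows-1252 byte; leave it untouched.
            out.append(ch)
    return "".join(out)


# The forward direction reproduces the two symptoms from the report.
assert "Ü".encode("mac_roman").decode("cp1252") == "†"
assert "ö".encode("mac_roman").decode("cp1252") == "š"

# The reverse direction repairs text that was damaged this way.
print(undo_macroman_mixup("†ber"))
print(undo_macroman_mixup("sch\u0161n"))
```

This is only safe on text where every non-ASCII character was damaged in the same way; characters that were extracted correctly (a real é, for instance) would be remapped to something else.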
