Skip to content

GitLab

  • Menu
Projects Groups Snippets
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • P poppler
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 656
    • Issues 656
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 43
    • Merge requests 43
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Packages & Registries
    • Packages & Registries
    • Container Registry
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • poppler
  • poppler
  • Issues
  • #321

Closed
Open
Created Jul 16, 2017 by Bugzilla Migration User@bugzilla-migration

pdftohtml: fakebold and dropshadow duplicated text

Submitted by Jason Crain

Assigned to poppler-bugs

Link to original bug (#101807)

Description

If you run pdftohtml on the PDF in bug #101770 (https://bugs.freedesktop.org/attachment.cgi?id=132659) It results in duplicated and jumbled characters.

Some PDFs draw text multiple times to emulate bold text or drop shadows. The main TextOutputDev goes to a lot of trouble to remove this duplicated text. pdftohtml should do this too.

To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Assignee
Assign to
Time tracking