Skip to content

GitLab

  • Projects
  • Groups
  • Snippets
  • Help
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
P
poppler
  • Project overview
    • Project overview
    • Details
    • Activity
    • Releases
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 614
    • Issues 614
    • List
    • Boards
    • Labels
    • Service Desk
    • Milestones
  • Merge Requests 38
    • Merge Requests 38
  • CI / CD
    • CI / CD
    • Pipelines
    • Jobs
    • Schedules
  • Operations
    • Operations
    • Incidents
    • Environments
  • Packages & Registries
    • Packages & Registries
    • Container Registry
  • Analytics
    • Analytics
    • CI / CD
    • Repository
    • Value Stream
  • Snippets
    • Snippets
  • Members
    • Members
  • Collapse sidebar
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
  • poppler
  • poppler
  • Issues
  • #127

Closed
Open
Opened Sep 17, 2012 by Bugzilla Migration User@bugzilla-migration

-xml does not render all images despite -c rendering correctly

Submitted by Jamie Carl

Assigned to poppler-bugs

Link to original bug (#55037)

Description

I've been trying to incorporate pdftohtml into my frontend renderer and have had some success with some documents. Other more complex documents though are having problems.

My test document is the Nikon D3s brochure:

wget http://imaging.nikon.com/products/imaging/lineup/digitalcamera/slr/d3s/pdf/d3s_16p.pdf

Rendering with the following produces a pretty accurate representation of the document:

pdftohtml -c d3s_16p.pdf

However, when I output to XML using -xml some of the images that worked previously are not output. They are not extracted or even included in the XML output.

Also, the images that are extracted are included with the wrong dimensions so the resulting page looks very out of whack.

All of the text is rendered correctly though.

Tried latest version from git with same results.

To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Assignee
Assign to
None
Milestone
None
Assign milestone
Time tracking
None
Due date
None
Reference: poppler/poppler#127