Add image names to pdftohtml dump in xml mode
Submitted by Raphaël Monrouzeau
Assigned to poppler-bugs
Created attachment 37764 First patch: Add image names to pdftohtml dump in xml mode
I wanted pdftohtml to dump information about images in a page in xml mode.
The first patch below makes pdftohtml generate images as without the -xml switch and dumps its name in the xml file; please review it, I'm open to suggestions, style requests and everything required.
Here is the description of the patch:
The -c (complex) and -xml modes are not linked anymore. The -c switch has no real effect on -xml mode (as before).
However the -i switch is now looked at in -xml mode. Without it images are now generated and image tags do reference their name. The DTD has been updated.
Patch 37764, "First patch: Add image names to pdftohtml dump in xml mode":