pdftohtml: single-page HTML files using data-urls, stdout-capable
These changes add the command line argument
-dataurls to the
pdftohtml utility. When used with
-s, this allows a user to write to a single HTML file while preserving imagery from the PDF. Images are stored in the HTML as data URLs (RFC 2937). The automatic squashing of the
stdout flag based on
-s was removed, as it works quite well and makes sense for this application.
In order to avoid extensive rework against
ImgWriter and its friends, I am using the GLIBC-specific
fopencookie method. Header-guards prevent this feature from being activated on Android or MinGW, where
fopencookie is not available.
I tested against 0.70.1 and the master; the current master (
8315a1234) is subject to a bug fixed in my branch
goostring-fromint-fix (see other pull request).
Please let me know if any further information is needed.