"pdftohtml -s" produces multiple files.
Submitted by Peter Easthope
Assigned to poppler-bugs
The Debian maintainer suggested filing upstream. If any further information is needed please let me know. Thanks, ... peter at easthope. ca
Subject: poppler-utils: "pdftohtml -s <file>.pdf" produces multiple files.
Date: Sat, 10 Jan 2015 10:01:45 -0800
Package: poppler-utils
Version: 0.26.5-2
Severity: important
Tags: newcomer patch
*** Reporter, please consider answering these questions, where appropriate ***
What led up to the situation? pdftohtml was applied to a pdf file containing pixmap images.
What exactly did you do (or not do) that was effective (or ineffective)? The command was "pdftohtml -s <file>.pdf".
What was the outcome of this action? All the text was in one <file>.html but each picture was an additional file.
What outcome did you expect instead? With the -s option, text and pictures should all be in one <file>.html. A JPEG or PNG picture can be included in an HTML document with Base64 encoding. The syntax is very simple. Examples are at http://easthope.ca/Category2.html; scroll down to the heading "Inline, Base64 encoded PNG bitmaps". The first instance of an embedded bitmap is
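To illustrate the Base64 embedding requested above, here is a minimal post-processing sketch in Python. It is not part of poppler and not a proposed patch. It only shows how an image file referenced from an HTML page can be replaced by an inline data: URI. The function names (to_data_uri, inline_images) are hypothetical, and the sketch assumes the img tags use a double-quoted src attribute with a path relative to the HTML file, which may not match what pdftohtml actually writes.

```python
import base64
import mimetypes
import re
from pathlib import Path

# Assumption: images are referenced as <img ... src="relative/path">
# with double quotes. Real pdftohtml output may differ.
IMG_SRC = re.compile(r'(<img\b[^>]*?\bsrc=")([^"]+)(")', re.IGNORECASE)


def to_data_uri(image_path: Path) -> str:
    """Return a data: URI holding the Base64-encoded bytes of image_path."""
    mime, _ = mimetypes.guess_type(image_path.name)
    if mime is None:
        # Unknown extension: fall back to a generic type.
        mime = "application/octet-stream"
    payload = base64.b64encode(image_path.read_bytes()).decode("ascii")
    return f"data:{mime};base64,{payload}"


def inline_images(html: str, base_dir: Path) -> str:
    """Replace each local image reference in html with an inline data: URI.

    References that are already data: URIs, or that do not resolve to a
    file under base_dir, are left unchanged.
    """
    def replace(match: re.Match) -> str:
        src = match.group(2)
        if src.startswith("data:"):
            return match.group(0)
        candidate = base_dir / src
        if not candidate.is_file():
            return match.group(0)
        return match.group(1) + to_data_uri(candidate) + match.group(3)

    return IMG_SRC.sub(replace, html)
```

Under those assumptions, calling inline_images(page_text, Path("output_dir")) on the text of the generated page would return a single self-contained HTML string, at the cost of roughly a one-third size increase for each embedded picture because of the Base64 encoding.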
-- System Information:
Debian Release: 8.0
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'testing'), (500, 'stable')
Architecture: i386 (i686)

Kernel: Linux 3.2.0-0.bpo.4-686-pae (SMP w/1 CPU core)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages poppler-utils depends on:
ii  libc6         2.19-13
ii  libcairo2     1.14.0-2.1
ii  libfreetype6  2.5.2-2
ii  libgcc1       1:4.9.1-19
ii  liblcms2-2    2.6-3+b3
ii  libpoppler46  0.26.5-2
ii  libstdc++6    4.9.1-19
ii  zlib1g        1:1.2.8.dfsg-2+b1

poppler-utils recommends no packages.
poppler-utils suggests no packages.
-- no debconf information