poppler issueshttps://gitlab.freedesktop.org/poppler/poppler/-/issues2023-04-06T12:39:20Zhttps://gitlab.freedesktop.org/poppler/poppler/-/issues/398pdfunite 0.55 damage files with Optional Content Groups2023-04-06T12:39:20ZBugzilla Migration Userpdfunite 0.55 damage files with Optional Content Groups## Submitted by Valerio Messina
Assigned to **poppler-bugs**
**[Link to original bug (#101449)](https://bugs.freedesktop.org/show_bug.cgi?id=101449)**
## Description
Created attachment 131980
two files of 1 page with layers, and t...## Submitted by Valerio Messina
Assigned to **poppler-bugs**
**[Link to original bug (#101449)](https://bugs.freedesktop.org/show_bug.cgi?id=101449)**
## Description
Created attachment 131980
two files of 1 page with layers, and the result of pdfunite
pdfunite 0.55 damage files with Optional Content Groups (layers).
For example see the attach.
PDFlayers_pag1.pdf:pag.1_layers:
- Information , on page:YES, visibile:NO
- Layout D , on page:YES, visibile:YES
- Kopf D , on page:YES, visibile:YES
- LOGO EDI BFS D , on page:YES, visibile:YES
- Layout F , on page:YES, visibile:NO
- Kopf F , on page:YES, visibile:NO
- LOGO EDI BFS F , on page:YES, visibile:NO
- LOGO EDI BFS I , on page:YES, visibile:NO
- Hilfslinien und Raster, on page:NO , visibile:NO
PDFlayers_pag2.pdf:pag.1_layers:
- Information , on page:NO , visibile:NO
- Layout D , on page:YES, visibile:YES
- Kopf D , on page:NO , visibile:YES
- LOGO EDI BFS D , on page:NO , visibile:YES
- Layout F , on page:YES, visibile:NO
- Kopf F , on page:NO , visibile:NO
- LOGO EDI BFS F , on page:NO , visibile:NO
- LOGO EDI BFS I , on page:NO , visibile:NO
- Hilfslinien und Raster, on page:NO , visibile:NO
$ pdunite PDFlayers_pag1.pdf PDFlayers_pag2.pdf PDFlayers_PDFunite0.55.pdf
PDFlayers_PDFunite0.55.pdf:pag.1,2_layers:
pag.1
- Information , on page:YES, visibile:NO
- Layout D , on page:YES, visibile:YES
- Kopf D , on page:YES, visibile:YES
- LOGO EDI BFS D , on page:YES, visibile:YES
- Layout F , on page:YES, visibile:NO
- Kopf F , on page:YES, visibile:NO
- LOGO EDI BFS F , on page:YES, visibile:NO
- LOGO EDI BFS I , on page:YES, visibile:NO
- Hilfslinien und Raster, on page:NO , visibile:NO
pag.2
- Information , on page:NO , visibile:NO
- Layout D , on page:NO , visibile:YES
- Kopf D , on page:NO , visibile:YES
- LOGO EDI BFS D , on page:NO , visibile:YES
- Layout F , on page:NO , visibile:NO
- Kopf F , on page:NO , visibile:NO
- LOGO EDI BFS F , on page:NO , visibile:NO
- LOGO EDI BFS I , on page:NO , visibile:NO
- Hilfslinien und Raster, on page:NO , visibile:NO
On resulting file, pag.1 is right, but pag.2 has no layer on page, and all layers are always visible.
Results checked with:
Acrobat Reader 11.0.20 on Windows 7 64 bit
Evince 2.32.0.145 on Windows 7 64 bit
Evince 3.18.2 on Ubuntu 16.04 64 bit
**Attachment 131980**, "two files of 1 page with layers, and the result of pdfunite":
[PDFlayersTest.7z](/uploads/d22b65b4e1cc649766dc1f7b3ae4d45b/PDFlayersTest.7z)https://gitlab.freedesktop.org/poppler/poppler/-/issues/397Evince needs 10 minutes to open certain PDFs2018-08-21T10:51:47ZBugzilla Migration UserEvince needs 10 minutes to open certain PDFs## Submitted by Germán Poo-Caamaño
Assigned to **poppler-bugs**
**[Link to original bug (#106867)](https://bugs.freedesktop.org/show_bug.cgi?id=106867)**
## Description
Created attachment 140102
PDF test case
As reported in https...## Submitted by Germán Poo-Caamaño
Assigned to **poppler-bugs**
**[Link to original bug (#106867)](https://bugs.freedesktop.org/show_bug.cgi?id=106867)**
## Description
Created attachment 140102
PDF test case
As reported in https://gitlab.gnome.org/GNOME/evince/issues/745
-->snip<--
Opening PDFs scanned with my Brother Laser Printer is virtually impossible. The opening causes a 100% CPU spike on one core and does take forever.
The PDFs were generated in PDF/A by the Brother scanner unit. I have attached one for testing. I did not have this problem on Mac OSX, but Microsoft Edge seems slow on opening these files too. Chrome does process them with no mayor delay.
PDFs not from my scanner, like digital invoices I get regularly, do not produce any problem.
-->snip<--
It is noticeable slower than Acroread, which is instant, and xpdf, which is almost instant.
$ time pdftoppm -png brother_001793.pdf out-ppm
real 0m9.454s
user 0m9.404s
sys 0m0.036s
$ time pdftocairo -png brother_001793.pdf out-cairo
real 0m34.103s
user 0m33.884s
sys 0m0.200s
**Attachment 140102**, "PDF test case":
[brother_001793.pdf](/uploads/941d79784f33649aab2ce4c79fa9b45d/brother_001793.pdf)https://gitlab.freedesktop.org/poppler/poppler/-/issues/396SVG output backend2018-10-07T00:30:41ZBugzilla Migration UserSVG output backend## Submitted by Luke Hutchison
Assigned to **poppler-bugs**
**[Link to original bug (#3348)](https://bugs.freedesktop.org/show_bug.cgi?id=3348)**
## Description
It would be awesome if Poppler had a backend that created an SVG file...## Submitted by Luke Hutchison
Assigned to **poppler-bugs**
**[Link to original bug (#3348)](https://bugs.freedesktop.org/show_bug.cgi?id=3348)**
## Description
It would be awesome if Poppler had a backend that created an SVG file from a PDF
document. Looking at the Cairo backend, I think this could be done relatively
easily, and could prove very useful, as it would allow easy editing of PDF files.
There is a version of SVG that handles multiple pages, called SVGPrint.
Inkscape doesn't handle SVGPrint yet, but it is planned.
Currently there are a couple of ways of getting from PDF to SVG, but it takes
2-3 tools and is quite a convoluted process.https://gitlab.freedesktop.org/poppler/poppler/-/issues/395FreeText annotation ignores font2019-03-08T01:39:51ZBugzilla Migration UserFreeText annotation ignores font## Submitted by Phil
Assigned to **poppler-bugs**
**[Link to original bug (#81748)](https://bugs.freedesktop.org/show_bug.cgi?id=81748)**
## Description
Created attachment 103449
Annotation example
I have been creating PDFs dynam...## Submitted by Phil
Assigned to **poppler-bugs**
**[Link to original bug (#81748)](https://bugs.freedesktop.org/show_bug.cgi?id=81748)**
## Description
Created attachment 103449
Annotation example
I have been creating PDFs dynamically, with FreeText and signature annotations.
When shown in Evince, FreeText annotations generally show as I formatted them,
but they ignore a specified True Type font. Adobe Reader with the same document
shows the font correctly.
Note: I originally submitted this bug with Evince, not realizing that Poppler was the PDF frontend for the application. Hopefully I made it to the right place this time.
In the attached example, the yellow block is a FreeText annotation. The font
which appears in the first line of the document as "Test this font" is also
applied to the FreeText annotation. As you can see, the default Helvetica font
is what is actually displayed in the yellow annotation, when it should match
the font in the "Test this font" line. Other than that, font size, color and
position seems to be applied correctly.
Not a horrible bug, but when adding annotations as I am to go with signatures,
the lack of the TT font makes things look a little clunky.
BTW, I'm running up to date Ubuntu, Evince 3.10, and generating the PDF with Ruby Prawn gem, and adding the annotations with Ruby Origami PDF gem.
Let me know if I can provide any additional information.
**Attachment 103449**, "Annotation example":
[outfile.pdf](/uploads/4bd82d4141b0a8ac481369311066df65/outfile.pdf)https://gitlab.freedesktop.org/poppler/poppler/-/issues/394image scaling is ugly / not antialiased in Evince 2.282018-08-21T10:50:30ZBugzilla Migration Userimage scaling is ugly / not antialiased in Evince 2.28## Submitted by Jean-François Fortin Tam
Assigned to **poppler-bugs**
**[Link to original bug (#25268)](https://bugs.freedesktop.org/show_bug.cgi?id=25268)**
## Description
For some reason, Evince 2.28's rendering/scaling quality ...## Submitted by Jean-François Fortin Tam
Assigned to **poppler-bugs**
**[Link to original bug (#25268)](https://bugs.freedesktop.org/show_bug.cgi?id=25268)**
## Description
For some reason, Evince 2.28's rendering/scaling quality of images in PDFs is horrible. It was OK in previous versions.
PDF files affected, jpeg and png images dragged into evince don't exhibit the behavior.
poppler 0.12.0 (cairo)
Sample file: http://go2.wordpress.com/?id=725X1342&site=doctormo.wordpress.com&url=http%3A%2F%2Fdivajutta.com%2Fdoctormo%2Fubunchu%2Fubunchu-episode-01.pdf
Will attach screenshots.https://gitlab.freedesktop.org/poppler/poppler/-/issues/393SEGV in StreamPredictor::getChar when using ENABLE_ZLIB_UNCOMPRESS2021-12-09T06:25:31ZBugzilla Migration UserSEGV in StreamPredictor::getChar when using ENABLE_ZLIB_UNCOMPRESS## Submitted by Antti Husa
Assigned to **poppler-bugs**
**[Link to original bug (#76631)](https://bugs.freedesktop.org/show_bug.cgi?id=76631)**
## Description
Created attachment 96408
Fuzzed PDF file that causes SEGV
Segfault whe...## Submitted by Antti Husa
Assigned to **poppler-bugs**
**[Link to original bug (#76631)](https://bugs.freedesktop.org/show_bug.cgi?id=76631)**
## Description
Created attachment 96408
Fuzzed PDF file that causes SEGV
Segfault when malformed PDF file is opened.
Reproduced on Evince and Zathura with Poppler version 0.25.1 (git master branch).
Distrubution: Gentoo Linux 64bit
Evince version: 3.10.3
Zathura version: 0.2.1
Zathura-pdf-poppler version: 0.2.3
Malformed file is given as an attachment.
ASAN report:
==9396== ERROR: AddressSanitizer: SEGV on unknown address 0x000000000000 (pc 0x7effa86af3a3 sp 0x7effa5c9d810 bp 0x7effa5c9d820 T3)
AddressSanitizer can not provide additional info.
#0 0x7effa86af3a2 in StreamPredictor::getChar() /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/Stream.cc:615
#1 0x7effa87f5655 in FlateStream::getChar() /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/FlateStream.cc:58
#2 0x7effa856bf60 in Stream::doGetChars(int, unsigned char*) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/Stream.h:126
#3 0x7effa86ae19f in ImageStream::getLine() /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/Stream.cc:517
#4 0x7effa8d5c301 in RescaleDrawImage::getRow(int, unsigned int*) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/CairoOutputDev.cc:2836
#5 0x7effa8d5d674 in CairoRescaleBox::downScaleImage(unsigned int, unsigned int, int, int, unsigned short, unsigned short, unsigned short, unsigned short, _cairo_surface*) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/CairoRescaleBox.cc:338
#6 0x7effa8d5c121 in RescaleDrawImage::getSourceImage(Stream*, int, int, int, int, bool, GfxImageColorMap*, int*) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/CairoOutputDev.cc:2817
#7 0x7effa8d56fe8 in CairoOutputDev::drawImage(GfxState*, Object*, Stream*, int, int, GfxImageColorMap*, bool, int*, bool) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/CairoOutputDev.cc:2896
#8 0x7effa85d9840 in Gfx::doImage(Object*, Stream*, bool) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/Gfx.cc:4653
#9 0x7effa85d68f2 in Gfx::opXObject(Object*, int) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/Gfx.cc:4179
#10 0x7effa85b049c in Gfx::execOp(Object*, Object*, int) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/Gfx.cc:903
#11 0x7effa85af685 in Gfx::go(bool) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/Gfx.cc:762
#12 0x7effa85af2d9 in Gfx::display(Object*, bool) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/Gfx.cc:728
#13 0x7effa86928cd in Page::displaySlice(OutputDev*, double, double, int, bool, bool, int, int, int, int, bool, bool (*)(void*), void*, bool (*)(Annot*, void*), void*, bool) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/Page.cc:585
#14 0x7effa8d17f53 in _poppler_page_render(_PopplerPage*, _cairo*, bool, PopplerPrintFlags) /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/glib/poppler-page.cc:362
#15 0x7effa8d1807a in poppler_page_render /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/glib/poppler-page.cc:385
#16 0x7effa8f9cf2c in pdf_page_render_cairo /var/tmp/portage/app-text/zathura-pdf-poppler-0.2.3/work/zathura-pdf-poppler-0.2.3/pdf.c:809
#17 0x42f947 in render /var/tmp/portage/app-text/zathura-0.2.1/work/zathura-0.2.1/render.c:183
#18 0x42f947 in render_job /var/tmp/portage/app-text/zathura-0.2.1/work/zathura-0.2.1/render.c:37
#19 0x7effb11cbea5 (/usr/lib64/libglib-2.0.so.0+0x6fea5)
#20 0x7effb11cb4e4 (/usr/lib64/libglib-2.0.so.0+0x6f4e4)
#21 0x7effb287ec07 in __asan::AsanThread::ThreadStart() /home/aki/opt/fu/work/tmp/gcc-4.8.1/x86_64-unknown-linux-gnu/libsanitizer/asan/../../.././libsanitizer/asan/asan_thread.cc:99
#22 0x7effb0b41f39 in start_thread /var/tmp/portage/sys-libs/glibc-2.17/work/glibc-2.17/nptl/pthread_create.c:308
#23 0x7effb057ec3c (/lib64/libc.so.6+0xedc3c)
SUMMARY: AddressSanitizer: SEGV /var/tmp/portage/app-text/poppler-9999/work/poppler-9999/poppler/Stream.cc:615 StreamPredictor::getChar()
Thread T3 (pool) created by T0 here:
#0 0x7effb2870c5b in __interceptor_pthread_create /home/aki/opt/fu/work/tmp/gcc-4.8.1/x86_64-unknown-linux-gnu/libsanitizer/asan/../../.././libsanitizer/asan/asan_interceptors.cc:122
#1 0x7effb11e6941 (/usr/lib64/libglib-2.0.so.0+0x8a941)
==9396== ABORTING
--
Antti Husa
Research Assistant, OUSPG
**Attachment 96408**, "Fuzzed PDF file that causes SEGV":
[empty.pdf](/uploads/e568c127506ce11f6dcab4349cf6f94b/empty.pdf)https://gitlab.freedesktop.org/poppler/poppler/-/issues/392"pdftohtml -s" produces multiple files.2018-08-21T18:30:55ZBugzilla Migration User"pdftohtml -s" produces multiple files.## Submitted by Peter Easthope
Assigned to **poppler-bugs**
**[Link to original bug (#89696)](https://bugs.freedesktop.org/show_bug.cgi?id=89696)**
## Description
The Debian maintainer suggested filing upstream.
If any further inf...## Submitted by Peter Easthope
Assigned to **poppler-bugs**
**[Link to original bug (#89696)](https://bugs.freedesktop.org/show_bug.cgi?id=89696)**
## Description
The Debian maintainer suggested filing upstream.
If any further information is needed please let me know.
Thanks, ... peter at easthope. ca
====https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=775049====
From: peter@easthope.ca
To: submit@bugs.debian.org
Cc: peter@easthope.ca
Subject: poppler-utils: "pdftohtml -s `<file>`.pdf" produces multiple files.
Date: Sat, 10 Jan 2015 10:01:45 -0800
Package: poppler-utils
Version: 0.26.5-2
Severity: important
Tags: newcomer patch
Dear Maintainer,
*** Reporter, please consider answering these questions, where appropriate ***
* What led up to the situation?
pdftohtml was applied to a pdf file containing pixmap images.
* What exactly did you do (or not do) that was effective (or ineffective)?
The command was "pdftohtml -s `<file>`.pdf".
* What was the outcome of this action?
All the text was in one `<file>`.html but each picture was an additional file.
* What outcome did you expect instead?
With the -s option, text and pictures should all be in one `<file>`.html.
A JPEG or PNG picture can be included in an html document with Base64
encoding. The syntax is very simple. Examples here.
http://easthope.ca/Category2.html
Scroll down to the heading "Inline, Base64 encoded PNG bitmaps".
The first instance of an embedded bitmap is
<img src="data:image/png;base64,iVBORw0K ..."
alt="Diagram for 0x0 with test object 0,`<br>`represented in PNG.`<br>`">
-- System Information:
Debian Release: 8.0
APT prefers stable-updates
APT policy: (500, 'stable-updates'), (500, 'testing'), (500, 'stable')
Architecture: i386 (i686)
Kernel: Linux 3.2.0-0.bpo.4-686-pae (SMP w/1 CPU core)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
Versions of packages poppler-utils depends on:
ii libc6 2.19-13
ii libcairo2 1.14.0-2.1
ii libfreetype6 2.5.2-2
ii libgcc1 1:4.9.1-19
ii liblcms2-2 2.6-3+b3
ii libpoppler46 0.26.5-2
ii libstdc++6 4.9.1-19
ii zlib1g 1:1.2.8.dfsg-2+b1
poppler-utils recommends no packages.
poppler-utils suggests no packages.
-- no debconf informationhttps://gitlab.freedesktop.org/poppler/poppler/-/issues/391pdftohtml not working on some PDF2018-08-21T10:50:19ZBugzilla Migration Userpdftohtml not working on some PDF## Submitted by sam
Assigned to **poppler-bugs**
**[Link to original bug (#73697)](https://bugs.freedesktop.org/show_bug.cgi?id=73697)**
## Description
The 'calibre' programme (which I use for conversion for PDF to EPUB) fail to c...## Submitted by sam
Assigned to **poppler-bugs**
**[Link to original bug (#73697)](https://bugs.freedesktop.org/show_bug.cgi?id=73697)**
## Description
The 'calibre' programme (which I use for conversion for PDF to EPUB) fail to convert some PDF.According to 'calibre's devloper the bug is in windows version of pdftohtml.However he states that Linux and os x doesn't have any issues.https://gitlab.freedesktop.org/poppler/poppler/-/issues/390svg embedded in pdf file is not displayed2022-01-06T22:57:40ZBugzilla Migration Usersvg embedded in pdf file is not displayed## Submitted by wrobell
Assigned to **poppler-bugs**
**[Link to original bug (#12554)](https://bugs.freedesktop.org/show_bug.cgi?id=12554)**
## Description
acrobat reader allows to display svg files, which are embedded
in pdf file...## Submitted by wrobell
Assigned to **poppler-bugs**
**[Link to original bug (#12554)](https://bugs.freedesktop.org/show_bug.cgi?id=12554)**
## Description
acrobat reader allows to display svg files, which are embedded
in pdf files. poppler lacks this capability.https://gitlab.freedesktop.org/poppler/poppler/-/issues/389[patch] inline some frequently called stream functions2018-10-05T23:19:49ZBugzilla Migration User[patch] inline some frequently called stream functions## Submitted by William Bader
Assigned to **poppler-bugs**
**[Link to original bug (#89097)](https://bugs.freedesktop.org/show_bug.cgi?id=89097)**
## Description
Created attachment 113384
patch to inline some functions in Stream.c...## Submitted by William Bader
Assigned to **poppler-bugs**
**[Link to original bug (#89097)](https://bugs.freedesktop.org/show_bug.cgi?id=89097)**
## Description
Created attachment 113384
patch to inline some functions in Stream.cc and Stream.h
The various Stream classes have getChar() functions that are called once for each byte in the stream.
This patch makes some of the simpler functions inline.
Most of the functions are virtual, so they probably can't be inlined, but a few are not virtual, and the change seems to make a small difference.
The profiles are from running "pdftops 2-DESCR_648853-IT-EN-DE_MANITOU_RIGENERATO.pdf x.ps" three times on the test file from https://bugs.freedesktop.org/show_bug.cgi?id=89076
pdftops before the patch
% cumulative self self total
time seconds seconds calls ms/call ms/call name
15.25 0.93 0.93 8905797 0.00 0.00 GooString::formatDouble(double, char*, int, int, bool, char**, int*)
9.34 1.50 0.57 6172503 0.00 0.00 GooString::appendfv(char const*, __va_list_tag*)
7.87 1.98 0.48 34515024 0.00 0.00 FlateStream::getHuffmanCodeWord(FlateHuffmanTab*)
7.05 2.41 0.43 22730064 0.00 0.00 FlateStream::readSome()
6.48 2.81 0.40 18514623 0.00 0.00 Lexer::getObj(Object*, int)
5.90 3.17 0.36 16607355 0.00 0.00 Parser::getObj(Object*, bool, unsigned char*, CryptAlgorithm, int, int, int, int, bool)
5.90 3.53 0.36 106901274 0.00 0.00 JBIG2Segment::~JBIG2Segment()
4.92 3.83 0.30 111594801 0.00 0.00 FlateStream::getChar()
4.34 4.09 0.27 38266749 0.00 0.00 Object::free()
3.69 4.32 0.23 87202734 0.00 0.00 Lexer::lookChar()
3.61 4.54 0.22 19199391 0.00 0.00 GooString::append(char const*, int)
2.79 4.71 0.17 438 0.39 12.47 Gfx::go(bool)
2.62 4.87 0.16 38745 0.00 0.00 FlateStream::compHuffmanCodes(int*, int, FlateHuffmanTab*)
1.39 4.95 0.09 10967463 0.00 0.00 gmallocn
pdftops after the inline patch
12.99 0.73 0.73 8905797 0.00 0.00 GooString::formatDouble(double, char*, int, int, bool, char**, int*)
9.96 1.29 0.56 6172503 0.00 0.00 GooString::appendfv(char const*, __va_list_tag*)
8.10 1.75 0.46 18514623 0.00 0.00 Lexer::getObj(Object*, int)
7.38 2.16 0.42 22730064 0.00 0.00 FlateStream::readSome()
6.23 2.51 0.35 106901274 0.00 0.00 JBIG2Segment::~JBIG2Segment()
6.14 2.86 0.35 34515024 0.00 0.00 FlateStream::getHuffmanCodeWord(FlateHuffmanTab*)
4.98 3.14 0.28 16607355 0.00 0.00 Parser::getObj(Object*, bool, unsigned char*, CryptAlgorithm, int, int, int, int, bool)
4.72 3.40 0.27 111594801 0.00 0.00 FlateStream::getChar()
3.02 3.57 0.17 438 0.39 11.37 Gfx::go(bool)
2.67 3.72 0.15 38745 0.00 0.00 FlateStream::compHuffmanCodes(int*, int, FlateHuffmanTab*)
2.40 3.86 0.14 38266749 0.00 0.00 Object::free()
2.14 3.98 0.12 87202734 0.00 0.00 Lexer::lookChar()
2.05 4.09 0.12 43505223 0.00 0.00 FileStream::getChar()
1.96 4.20 0.11 5449956 0.00 0.00 Gfx::findOp(char*)
1.87 4.31 0.11 18343965 0.00 0.00 Parser::shift(int)
1.78 4.41 0.10 10967463 0.00 0.00 gmallocn
1.78 4.51 0.10 5449956 0.00 0.00 Gfx::execOp(Object*, Object*, int)
1.51 4.59 0.09 19199391 0.00 0.00 GooString::append(char const*, int)
1.07 4.65 0.06 2904 0.02 0.07 SampledFunction::SampledFunction(Object*, Dict*)
0.98 4.71 0.06 13247493 0.00 0.00 FlateStream::getCodeWord(int)
0.89 4.76 0.05 1208397 0.00 0.00 RunLengthStream::getChars(int, unsigned char*)
0.71 4.80 0.04 579486 0.00 0.00 GooString::formatInt(long long, char*, int, bool, int, int, char**, int*, bool)
0.62 4.83 0.04 72426 0.00 0.00 Parser::~Parser()
0.62 4.87 0.04 19782 0.00 0.00 BaseStream::getDict()
0.62 4.90 0.04 654 0.05 0.05 Lexer::getChar(bool)
0.62 4.94 0.04 LZWStream::getRawChar()
**Patch 113384**, "patch to inline some functions in Stream.cc and Stream.h":
[poppler-inline-12feb15.pat](/uploads/5542fd90e3ca8efdefc639e265ec5a6c/poppler-inline-12feb15.pat)https://gitlab.freedesktop.org/poppler/poppler/-/issues/388okular doesn't display asymptote graphs correctly2018-09-01T00:57:49ZBugzilla Migration Userokular doesn't display asymptote graphs correctly## Submitted by Matthias Berndt
Assigned to **Matthias Berndt**
**[Link to original bug (#14897)](https://bugs.freedesktop.org/show_bug.cgi?id=14897)**
## Description
Please see this bug in the KDE bugzilla for a closer descriptio...## Submitted by Matthias Berndt
Assigned to **Matthias Berndt**
**[Link to original bug (#14897)](https://bugs.freedesktop.org/show_bug.cgi?id=14897)**
## Description
Please see this bug in the KDE bugzilla for a closer description:
http://bugs.kde.org/show_bug.cgi?id=157704https://gitlab.freedesktop.org/poppler/poppler/-/issues/387Several texts (with monospaced fonts) are seen with spaces within2021-10-13T15:29:31ZBugzilla Migration UserSeveral texts (with monospaced fonts) are seen with spaces within## Submitted by kub..@..mx.com
Assigned to **poppler-bugs**
**[Link to original bug (#66922)](https://bugs.freedesktop.org/show_bug.cgi?id=66922)**
## Description
Created attachment 82442
Text badly seen with (Okular 0.15.2, KDE 4...## Submitted by kub..@..mx.com
Assigned to **poppler-bugs**
**[Link to original bug (#66922)](https://bugs.freedesktop.org/show_bug.cgi?id=66922)**
## Description
Created attachment 82442
Text badly seen with (Okular 0.15.2, KDE 4.9.2) or (Okular 0.16.4; KDE 4.9.4)
Several texts (with monospaced fonts) are seen with spaces within. That has happened with several pdf files; one example can be seen in the text "Command prompt" that exists in the page 10 of that file:
http://www.3ds.com/fileadmin/PRODUCTS/DRAFT_SIGHT/PDF/GETTING-STARTED-GUIDE.pdf
I used one computer with
(Okular 0.15.2, KDE 4.9.2)
and it had the same bad result as another computer with
(Okular 0.16.4; KDE 4.9.4).
I attach a screenshot, in the file:
Okular 0.15.2, KDE 4.9.2; Okular 0.16.4; KDE 4.9.4.png
I used another computer with Acrobat Reader, and the text was seen correctly. I attach a screenshot, in the file:
Adobe Reader.png
Thanks for all the good work!
**Attachment 82442**, "Text badly seen with (Okular 0.15.2, KDE 4.9.2) or (Okular 0.16.4; KDE 4.9.4)":
![Okular_0.15.2__KDE_4.9.2__Okular_0.16.4__KDE_4.9.4](/uploads/3e3c50940d75a9b7173c2c4decbe861c/Okular_0.15.2__KDE_4.9.2__Okular_0.16.4__KDE_4.9.4.png)https://gitlab.freedesktop.org/poppler/poppler/-/issues/386[pdftotext pdftops pdftoppm] Infinite loop in file parsing dumps infinite dat...2018-10-26T15:29:48ZBugzilla Migration User[pdftotext pdftops pdftoppm] Infinite loop in file parsing dumps infinite data to stderr.## Submitted by MH
Assigned to **poppler-bugs**
**[Link to original bug (#84986)](https://bugs.freedesktop.org/show_bug.cgi?id=84986)**
## Description
Created attachment 107815
Crash reproduce bug
utils]$ ./pdftotext ~/870-fuzz-1...## Submitted by MH
Assigned to **poppler-bugs**
**[Link to original bug (#84986)](https://bugs.freedesktop.org/show_bug.cgi?id=84986)**
## Description
Created attachment 107815
Crash reproduce bug
utils]$ ./pdftotext ~/870-fuzz-11.pdf
...
Syntax Error (582): Command token too long
Syntax Error (582): Unknown operator '`<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>``<81>`'
... <<- eternally repeats (did not check for inf time, but when testing it did 600MB to stderr in <1min)
Repeats in master.
**Attachment 107815**, "Crash reproduce bug":
[870-fuzz-11.pdf](/uploads/2dfe92f496873ea5324337b1caf86988/870-fuzz-11.pdf)https://gitlab.freedesktop.org/poppler/poppler/-/issues/385When extracting as XML all new lines are stripped2018-08-21T10:49:11ZBugzilla Migration UserWhen extracting as XML all new lines are stripped## Submitted by cla..@..eat.dk
Assigned to **poppler-bugs**
**[Link to original bug (#104230)](https://bugs.freedesktop.org/show_bug.cgi?id=104230)**
## Description
Created attachment 136123
test pdf
pdftohtml -s -i -xml test.pdf...## Submitted by cla..@..eat.dk
Assigned to **poppler-bugs**
**[Link to original bug (#104230)](https://bugs.freedesktop.org/show_bug.cgi?id=104230)**
## Description
Created attachment 136123
test pdf
pdftohtml -s -i -xml test.pdf out.xml
VS
pdftohtml -s -i test.pdf out.html
When you extract the text as HTML alle new lines are kept, but if you extract the text as XML they are stripped out and each new line is put in a new tag
**Attachment 136123**, "test pdf":
[001.pdf](/uploads/bf1e5ee7969b7768ee43ddcde91d5e95/001.pdf)https://gitlab.freedesktop.org/poppler/poppler/-/issues/384pdftohtml -c generates html with very ugly/unusual spacing2018-10-27T15:09:44ZBugzilla Migration Userpdftohtml -c generates html with very ugly/unusual spacing## Submitted by Luis Villa
Assigned to **poppler-bugs**
**[Link to original bug (#12522)](https://bugs.freedesktop.org/show_bug.cgi?id=12522)**
## Description
pdftohtml -c is trying to respect the different font sizes present in t...## Submitted by Luis Villa
Assigned to **poppler-bugs**
**[Link to original bug (#12522)](https://bugs.freedesktop.org/show_bug.cgi?id=12522)**
## Description
pdftohtml -c is trying to respect the different font sizes present in this document:
http://altlaw.org/v1/cases/157903.pdf
but it generates HTML that looks very unusual:
http://altlaw.org/v1/cases/157903 (everything in the frames are generated with pdftohtml.)
Note all the extra/unusual spacing, at least in FFox 2.https://gitlab.freedesktop.org/poppler/poppler/-/issues/383can't display correctly of Chinese book2018-10-07T00:32:06ZBugzilla Migration Usercan't display correctly of Chinese book## Submitted by ni shengyue
Assigned to **poppler-bugs**
**[Link to original bug (#91013)](https://bugs.freedesktop.org/show_bug.cgi?id=91013)**
## Description
Created attachment 116567
the book that can't read
This is a case clo...## Submitted by ni shengyue
Assigned to **poppler-bugs**
**[Link to original bug (#91013)](https://bugs.freedesktop.org/show_bug.cgi?id=91013)**
## Description
Created attachment 116567
the book that can't read
This is a case cloned from GNOME Bugzilla – [Bug 751115](https://bugs.freedesktop.org/show_bug.cgi?id=751115)
the book that can't read
Evince can't display book correcly, just wrong words
but some other tools can display well, such as mupdf, foxit reader (close source software).
I use mupdf and can read it correcly, and I notice there is print as below:
nisy@nisy-ubuntu:/media/nisy/sda1_WinXP/_Technology/98_my_project/_27_改进开源软件/乱码问题$ mupdf ./1_\[雅舍菁华\].梁秋实.文字版.PDF
warning: workaround for S22PDF lying about chinese font encodings
warning: ... repeated 2 times ...
warning: workaround for S22PDF lying about chinese font encodings
warning: ... repeated 2 times ...
so I guess mupdf had workaround to avoid font issue, so I checked in mupdf code,find its code should be:
pdf_load_font
|__pdf_load_simple_font
|__ pdf_load_simple_font_by_name
|__ fz_warn(ctx, "workaround for S22PDF lying about chinese font encodings");
Because evince also have font mechanism, so can evince consider to check mupdf 's mechanism, and merge mupdf 's method to avoid this issue? Thanks.
nishengyue
**Attachment 116567**, "the book that can't read":
[1.PDF](/uploads/3aeb1dd9698fb0e7e3339da6561d1a80/1.PDF)https://gitlab.freedesktop.org/poppler/poppler/-/issues/382"ld: library not found for -lopenjp2" on MacOS2019-01-09T13:01:50ZBugzilla Migration User"ld: library not found for -lopenjp2" on MacOS## Submitted by Tom Duck
Assigned to **poppler-bugs**
**[Link to original bug (#107063)](https://bugs.freedesktop.org/show_bug.cgi?id=107063)**
## Description
The following error occurs during `make` on MacOS High Sierra 10.13.5:
...## Submitted by Tom Duck
Assigned to **poppler-bugs**
**[Link to original bug (#107063)](https://bugs.freedesktop.org/show_bug.cgi?id=107063)**
## Description
The following error occurs during `make` on MacOS High Sierra 10.13.5:
ld: library not found for -lopenjp2
The library is in /usr/local/lib and was installed from source:
$ cd /usr/local/lib
$ ls -1 *libopenjp2*
libopenjp2.2.3.0.dylib*
libopenjp2.7.dylib@
libopenjp2.a
libopenjp2.dylib@
Inspection of CMakeFiles/poppler.dir/link.txt in the poppler build directory shows lots of fully resolved paths, and two references to `-lopenjp2`. I removed them both and added the following path instead:
/usr/local/lib/libopenjp2.dylib
After this the build and install complete successfully.https://gitlab.freedesktop.org/poppler/poppler/-/issues/381Vertical punctuation become Horizontal punctuation2018-10-27T14:19:09ZBugzilla Migration UserVertical punctuation become Horizontal punctuation## Submitted by liaotianlei
Assigned to **poppler-bugs**
**[Link to original bug (#98868)](https://bugs.freedesktop.org/show_bug.cgi?id=98868)**
## Description
Created attachment 128213
Vertical punctuation become Horizontal punct...## Submitted by liaotianlei
Assigned to **poppler-bugs**
**[Link to original bug (#98868)](https://bugs.freedesktop.org/show_bug.cgi?id=98868)**
## Description
Created attachment 128213
Vertical punctuation become Horizontal punctuation
In this PDF:
Vertical punctuation :︻、︼、︵、︶
pdftohtml get horizontal punctuation:【、】、(、)
In acrobat reader,copy and paste to notepad,these punctuations are vertical。
more:
《、︽
》、︾
—,|
︹,〔
︺,〕
︿,〈
﹀,〉
︷,{
︸,}
『,﹃
』,﹄
......
**Attachment 128213**, "Vertical punctuation become Horizontal punctuation":
[punc.pdf](/uploads/0d51becee84c21d9e52545630e9a261e/punc.pdf)https://gitlab.freedesktop.org/poppler/poppler/-/issues/380Math symbols rendered as squares2018-10-27T13:48:25ZBugzilla Migration UserMath symbols rendered as squares## Submitted by Maris Nartiss
Assigned to **poppler-bugs**
**[Link to original bug (#104661)](https://bugs.freedesktop.org/show_bug.cgi?id=104661)**
## Description
Created attachment 136784
Test PDF (starting from page 5)
In one ...## Submitted by Maris Nartiss
Assigned to **poppler-bugs**
**[Link to original bug (#104661)](https://bugs.freedesktop.org/show_bug.cgi?id=104661)**
## Description
Created attachment 136784
Test PDF (starting from page 5)
In one PDF file math symbols are rendered as squares (see 5th page of attached PDF). On the same system FF built in PDF viewer and MuPDF have no problems with rendering. Inkscape import gives mixed results – poppler/Cairo backend displays squares, but "built in (poppler derived)" backend imports file just fine (with or without "font replacement override").
I tested most of PDF samples I found in bugzilla and none of math symbols related ones had any issues. Although it is not a proof, it hints on quite good work of poppler finding a correct font for other PDFs. Thus submitting this "specimen" for examination.
Tested with:
poppler-0.62.0-r1
poppler-data-0.4.8
Gentoo ~AMD64
pdffonts math_symbol_render_issue.pdf
name type encoding emb sub uni object ID
------------------------------------ ----------------- ---------------- --- --- --- ---------
TimesNewRoman TrueType WinAnsi no no no 6 0
TimesNewRoman,Bold TrueType WinAnsi no no no 8 0
TimesNewRoman,Italic TrueType WinAnsi no no no 12 0
TimesNewRoman,BoldItalic TrueType WinAnsi no no no 20 0
Symbol TrueType Symbol no no no 22 0
Symbol,Italic TrueType Symbol no no no 33 0
PADFIF+MTExtra TrueType WinAnsi yes yes no 42 0
Arial TrueType WinAnsi no no no 94 0
CourierNew TrueType WinAnsi no no no 116 0
pdffonts -subst math_symbol_render_issue.pdf
name object ID substitute font substitute font file
------------------------------------ --------- ------------------------------------ ------------------------------------
TimesNewRoman 6 0 Liberation Serif /usr/share/fonts/liberation-fonts/LiberationSerif-Regular.ttf
TimesNewRoman,Bold 8 0 Liberation Serif Bold /usr/share/fonts/liberation-fonts/LiberationSerif-Bold.ttf
TimesNewRoman,Italic 12 0 Liberation Serif Italic /usr/share/fonts/liberation-fonts/LiberationSerif-Italic.ttf
TimesNewRoman,BoldItalic 20 0 Liberation Serif Bold Italic /usr/share/fonts/liberation-fonts/LiberationSerif-BoldItalic.ttf
Symbol 22 0 Liberation Sans /usr/share/fonts/liberation-fonts/LiberationSans-Regular.ttf
Symbol,Italic 33 0 Liberation Sans Italic /usr/share/fonts/liberation-fonts/LiberationSans-Italic.ttf
Arial 94 0 Liberation Sans /usr/share/fonts/liberation-fonts/LiberationSans-Regular.ttf
CourierNew 116 0 Liberation Mono /usr/share/fonts/liberation-fonts/LiberationMono-Regular.ttf
**Attachment 136784**, "Test PDF (starting from page 5)":
[math_symbol_render_issue.pdf](/uploads/16a0332e477b67a826432c7fb336c350/math_symbol_render_issue.pdf)https://gitlab.freedesktop.org/poppler/poppler/-/issues/379PDF compliance: Always respect Annot Rect2018-10-07T00:31:17ZBugzilla Migration UserPDF compliance: Always respect Annot Rect## Submitted by Tobias Deiminger
Assigned to **poppler-bugs**
**[Link to original bug (#105692)](https://bugs.freedesktop.org/show_bug.cgi?id=105692)**
## Description
To be compliant to ISO 32k, we always have to respect the geome...## Submitted by Tobias Deiminger
Assigned to **poppler-bugs**
**[Link to original bug (#105692)](https://bugs.freedesktop.org/show_bug.cgi?id=105692)**
## Description
To be compliant to ISO 32k, we always have to respect the geometry of Annot /Rect. See [0]. Currently there are cases where poppler draws annotations at another size. Namely this happens when poppler generates appearances at runtime to substitute missing AP entries for AnnotText, AnnotLine, AnnotTextMarkup, AnnotPolygon and AnnotInk (e.g. [1], [2]).
For example, you can end up with a popup note drawn as 24 x 24 pts even if Annot Rect says 16 x 16 pts. Poppler does not update Annot Rect, it keeps telling 16 x 16. Therefore the new size is also not reflected in public APIs. So the bug is not only about PDF non-conformance, but it also leaves clients unaware of the actually rendered size [4].
ISO 32k contains a detailed description about how to fit appearance streams into Annot Rect. See "Algorithm: Appearance streams" from ISO 32000-1:2008, 12.5.5. To sum up:
a) make a quadrilateral from AP BBox by applying apperances transformation Matrix, and calculate a temporary transformed AP BBox that encompasses the quadrilateral
b) calculate a temporary Matrix A that scales the transformed AP BBox into Annot Rect
c) use this Matrix A to draw the appearance
The existing method Gfx::drawAnnot nicely implements this algorithm. We just have to call Gfx::drawAnnot with the original Annot Rect size instead passing other values.
Patches will follow, comments are appreciated in advance.
[0] https://lists.freedesktop.org/archives/poppler/2018-March/012909.html
[1] https://cgit.freedesktop.org/poppler/poppler/tree/poppler/Annot.cc?h=poppler-0.63.0#n2486
[2] https://cgit.freedesktop.org/poppler/poppler/tree/poppler/Annot.cc?h=poppler-0.63.0#n5890
[3] https://cgit.freedesktop.org/poppler/poppler/tree/poppler/Gfx.cc?h=poppler-0.63.0#n5125
[4] https://bugs.kde.org/show_bug.cgi?id=388458