poppler issues
https://gitlab.freedesktop.org/poppler/poppler/-/issues
Updated: 2023-10-11T01:35:00Z

# Fonts with uniXXXX in mapping tables
https://gitlab.freedesktop.org/poppler/poppler/-/issues/5
Author: Bugzilla Migration User · Updated: 2023-10-11T01:35:00Z

## Submitted by Ed Catmur
Assigned to **poppler-bugs**
**[Link to original bug (#8985)](https://bugs.freedesktop.org/show_bug.cgi?id=8985)**
## Description
http://bugzilla.gnome.org/show_bug.cgi?id=341947 and [bug 7002](https://bugs.freedesktop.org/show_bug.cgi?id=7002) have attached a
PDF where the font (Minion) has entries of the form uniXXXX (e.g. uni015E) in
its mapping tables.
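
As an illustration of the naming scheme, a minimal Python sketch could decode such a glyph name as follows. The helper name `glyph_name_to_char` is hypothetical, this is not poppler's code, and it only handles the simple case of `uni` followed by exactly four uppercase hex digits.

```python
import re

# Hypothetical helper, not poppler's implementation: map a glyph name of the
# form uniXXXX (four uppercase hex digits) to the character U+XXXX.
_UNI_NAME = re.compile(r"uni([0-9A-F]{4})")


def glyph_name_to_char(name):
    match = _UNI_NAME.fullmatch(name)
    if match is None:
        return None  # not a uniXXXX name; a real mapper would consult other tables
    code_point = int(match.group(1), 16)
    if 0xD800 <= code_point <= 0xDFFF:
        return None  # surrogate code points are not valid characters on their own
    return chr(code_point)


print(glyph_name_to_char("uni015E"))  # Ş (U+015E)
```
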
Poppler should probably handle these (as U+XXXX).

# pdfimages miscalculates image PPI for rotated and scaled images (patch provided)
https://gitlab.freedesktop.org/poppler/poppler/-/issues/3
Author: Bugzilla Migration User · Updated: 2023-10-11T01:35:00Z

## Submitted by fre..@..et.com
Assigned to **poppler-bugs**
**[Link to original bug (#105614)](https://bugs.freedesktop.org/show_bug.cgi?id=105614)**
## Description
Created attachment 138211
Patch to correct the error
pdfimages miscalculates image PPI for rotated and scaled images
Tested on https://github.com/angea/PDF101/blob/master/handcoded/111_current-transformation-matrix-ctm.pdf
UNPATCHED
>pdfimages -list 111_current-transformation-matrix-ctm.pdf
page num type width height color comp bpc enc interp object ID x-ppi y-ppi size ratio
--------------------------------------------------------------------------------------------
1 0 image 2 2 rgb 3 8 image no 5 0 4 4 13B 108%
1 1 image 2 2 rgb 3 8 image no 5 0 5 3 13B 108%
1 2 image 2 2 rgb 3 8 image no 5 0 3 5 13B 108%
1 3 image 2 2 rgb 3 8 image no 5 0 6 3 13B 108%
1 4 image 2 2 rgb 3 8 image no 5 0 3 10 13B 108%
1 5 image 2 2 rgb 3 8 image no 5 0 4 72000 13B 108%
1 6 image 2 2 rgb 3 8 image no 5 0 4 2 13B 108%
1 7 image 2 2 rgb 3 8 image no 5 0 2 4 13B 108%
1 8 image 2 2 rgb 3 8 image no 5 0 14401 1 13B 108%
1 9 image 2 2 rgb 3 8 image no 5 0 1 2 13B 108%
1 10 image 2 2 rgb 3 8 image no 5 0 0.950 4 13B 108%
1 11 image 2 2 rgb 3 8 image no 5 0 4 0.950 13B 108%
1 12 image 2 2 rgb 3 8 image no 5 0 0.950 4 13B 108%
1 13 image 2 2 rgb 3 8 image no 5 0 1 4 13B 108%
1 14 image 2 2 rgb 3 8 image no 5 0 0.950 4 13B 108%
1 15 image 2 2 rgb 3 8 image no 5 0 0.950 4 13B 108%
1 16 image 2 2 rgb 3 8 image no 5 0 4 0.950 13B 108%
PATCHED
>pdfimages -list 111_current-transformation-matrix-ctm.pdf
page num type width height color comp bpc enc interp object ID x-ppi y-ppi size ratio
--------------------------------------------------------------------------------------------
1 0 image 2 2 rgb 3 8 image no 5 0 4 4 13B 108%
1 1 image 2 2 rgb 3 8 image no 5 0 5 3 13B 108%
1 2 image 2 2 rgb 3 8 image no 5 0 3 5 13B 108%
1 3 image 2 2 rgb 3 8 image no 5 0 4 4 13B 108%
1 4 image 2 2 rgb 3 8 image no 5 0 4 4 13B 108%
1 5 image 2 2 rgb 3 8 image no 5 0 3 4 13B 108%
1 6 image 2 2 rgb 3 8 image no 5 0 3 4 13B 108%
1 7 image 2 2 rgb 3 8 image no 5 0 4 3 13B 108%
1 8 image 2 2 rgb 3 8 image no 5 0 0.720 0.509 13B 108%
1 9 image 2 2 rgb 3 8 image no 5 0 0.720 0.624 13B 108%
1 10 image 2 2 rgb 3 8 image no 5 0 0.450 4 13B 108%
1 11 image 2 2 rgb 3 8 image no 5 0 4 0.450 13B 108%
1 12 image 2 2 rgb 3 8 image no 5 0 0.450 4 13B 108%
1 13 image 2 2 rgb 3 8 image no 5 0 0.600 4 13B 108%
1 14 image 2 2 rgb 3 8 image no 5 0 0.450 4 13B 108%
1 15 image 2 2 rgb 3 8 image no 5 0 0.450 4 13B 108%
1 16 image 2 2 rgb 3 8 image no 5 0 4 0.450 13B 108%
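
For reference, the resolution of an image placed through a transformation matrix can be estimated from the lengths of the transformed unit-square edges. The Python sketch below shows that calculation. It assumes the usual PDF convention that the matrix `[a b c d e f]` maps the unit image square to user space in points (1/72 inch). The function name `image_ppi` is made up for this example, and the code is not taken from the attached patch.

```python
import math


def image_ppi(width_px, height_px, ctm):
    """Estimate (x_ppi, y_ppi) for an image drawn with the matrix ctm.

    ctm is (a, b, c, d, e, f). The image's horizontal edge maps to the
    vector (a, b) and its vertical edge to (c, d), both in points.
    """
    a, b, c, d, _e, _f = ctm
    width_pt = math.hypot(a, b)   # rendered length of the horizontal edge
    height_pt = math.hypot(c, d)  # rendered length of the vertical edge
    if width_pt == 0 or height_pt == 0:
        raise ValueError("degenerate matrix: image has zero rendered size")
    return 72.0 * width_px / width_pt, 72.0 * height_px / height_pt


# A 2x2 pixel image drawn 36pt x 36pt and rotated by 45 degrees.
angle = math.radians(45)
rotated = (36 * math.cos(angle), 36 * math.sin(angle),
           -36 * math.sin(angle), 36 * math.cos(angle), 0, 0)
print(image_ppi(2, 2, rotated))
```

Because the edge lengths are used instead of the individual matrix entries, a pure rotation leaves the result unchanged: both values here come out at about 4 ppi, the same as for the unrotated image.
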
**Patch 138211**, "Patch to correct the error":
[pdfimages.patch](/uploads/330d8cbd2d9b1e3a37e414d4b562859d/pdfimages.patch)

# Poppler-23.09.0 fails to build on Solaris
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1438
Author: Martin Řehák · Updated: 2023-10-02T07:15:26Z

poppler-23.09.0 fails to build with the following error on Solaris:
```
[ 48%] Linking CXX shared library libpoppler.so
ld: fatal: option --version-script requires option -z gnu-version-script-compat to be specified
collect2: error: ld returned 1 exit status
```
CMakeLists.txt contains this:
```
if(UNIX AND (NOT APPLE))
set_target_properties(poppler PROPERTIES LINK_OPTIONS LINKER:--version-script=${LINKER_SCRIPT})
endif()
```
I tried to use `-z gnu-version-script-compat` as suggested, but then the build fails with the following error:
```
[ 64%] Linking CXX executable pdfseparate
Undefined first referenced
symbol in file
__iob CMakeFiles/pdfseparate.dir/parseargs.cc.o (symbol belongs to unavailable version ../libpoppler.so.131.0.0 ((null)))
ld: fatal: symbol referencing errors
collect2: error: ld returned 1 exit status
```
To be honest I don't understand the latter issue at all, so the only way I was able to work around it was to guard the option this way:
```
-if(UNIX AND (NOT APPLE))
+if(UNIX AND (NOT APPLE AND NOT CMAKE_HOST_SOLARIS))
```
Solaris has its own ld, which tries to be compatible with GNU ld, but for some reason it doesn't behave the same way here.
Is this linker setting important, or would it be fine to merge the change above to make it build?
Thank you,
Martin

# Poppler 23.09 and Qt 6.5.2 - #include "poppler-export.h" <--- File not found.
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1439
Author: Onevoid · Updated: 2023-09-27T11:21:46Z

While trying to build **Poppler 23.09** on **Qt 6.5.2**, there's the following critical error in **qt6/src/poppler-annotation.h**, line #45.
**#include "poppler-export.h"** <--- File not found.
This critical file **"poppler-export.h"** is nowhere to be found in the Poppler file system structure.
Thanks in advance.

# Space understood as column separator when copy-paste
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1093
Author: Denis Bitouzé · Updated: 2023-09-26T23:42:42Z

I'm not sure that this issue is a `poppler` one per se, but I encountered it with several `poppler`-based PDF readers on Linux (`Zathura`, `Okular` and `Evince`): if the code between "Or through a dictionary:" and "Or if you want to exclude the possibility [...]" on page 3 of [this PDF document](http://mirrors.ctan.org/macros/latex/contrib/pdfmanagement-testphase/l3pdfannot.pdf):
```latex
\pdfdict_new:n {l_my_action_dict}
\pdfdict_put:nnn {l_my_action_dict}{Type}{/Action}
\pdfdict_put:nnn {l_my_action_dict}{S}{/URI}
\pdfdict_put:nnn {l_my_action_dict}{URI}{(https://www.latex-project.org)}
\pdfannot_dict_put:nnn {link/URI} { C } {[1~0~0]} %red border
\pdfannot_link:nxn { URI }
{
/A <<\pdfdict_use:n{l_my_action_dict}>>
}
{ link text }
```
is copied, it is pasted as:
```latex
\pdfdict_new:n
\pdfdict_put:nnn
\pdfdict_put:nnn
\pdfdict_put:nnn
{l_my_action_dict}
{l_my_action_dict}{Type}{/Action}
{l_my_action_dict}{S}{/URI}
{l_my_action_dict}{URI}{(https://www.latex-project.org)}
\pdfannot_dict_put:nnn
{link/URI} { C } {[1~0~0]} %red border
\pdfannot_link:nxn { URI }
{
/A <<\pdfdict_use:n{l_my_action_dict}>>
}
{ link text }
```
[According to this comment](https://github.com/latex3/pdfresources/issues/19#issuecomment-866089474), this trouble doesn't arise with PDF readers not based on `poppler`.

# pdftotext inserts newline when there is none
https://gitlab.freedesktop.org/poppler/poppler/-/issues/904
Author: Witold Baryluk · Updated: 2023-09-26T23:42:41Z

Source pdf: https://www.ne.ch/autorites/DFS/SCSP/medecin-cantonal/maladies-vaccinations/Documents/Covid-19-Statistiques/COVID19_PublicationInternet.pdf
snapshot from archive: https://web.archive.org/web/20200408054553if_/https://www.ne.ch/autorites/DFS/SCSP/medecin-cantonal/maladies-vaccinations/Documents/Covid-19-Statistiques/COVID19_PublicationInternet.pdf
[COVID19_PublicationInternet.pdf](/uploads/ea27c1aaf22537c500145735ba474fc9/COVID19_PublicationInternet.pdf)
I am using `pdftotext -layout` for this.
This happens both with version 0.71 (Debian testing) and 0.85 (Debian experimental).
Example of problematic conversions:
Start of the document:
![first_page_header](/uploads/e26003f21f0aeb83b3f197560c1e23e2/first_page_header.png)
Output:
```
Servicedel
asantépubli
que
Donnéesbaséessurlesdéc
l ar
ati
onsdelabo
Neuc
hât
el-CasCOVI
D-19posi
tif
s
Tableauact
uali
```
End of the document (table):
![last_page_table](/uploads/b4590461d225733dcf84f1c711ac3ddb/last_page_table.png)
Output:
```
8avri
l2020 5 518 53 3 7 63 3 7 4 14 1 37
9avri
l2020 18 536 48 3 7 58 3 7 4 14
10avri
l2020 52 3 8 63 3 8 4 15
```
Notice the new line after `avri`.

# The program seems to have fallen into a loop in DCTStream.cc
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1430
Author: 刘文 · Updated: 2023-09-19T22:37:25Z

version:23.08.0
My system OS:Ubuntu 20.04
reproduce: pdfunite 1.pdf poc.pdf output.pdf
[1.pdf](/uploads/d93654243413abef9a0321e925019550/1.pdf)
[poc.pdf](/uploads/59ef1836587e3d9e10f1b7a65f3bd639/poc.pdf)
reproduce: pdfseparate poc2.pdf out/put-%d.pdf
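
When checking a suspected infinite loop like this, it can help to run the command under a time limit so the reproduction does not block forever. The Python sketch below is one way to do that. The helper name `run_with_timeout` and the time limits are arbitrary choices for illustration, and the demo uses a stand-in command that sleeps in place of the poppler tools.

```python
import subprocess
import sys


def run_with_timeout(argv, timeout_s):
    """Run argv; return its exit code, or None if it exceeded timeout_s."""
    try:
        completed = subprocess.run(
            argv,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s,  # the child is killed when this expires
        )
    except subprocess.TimeoutExpired:
        return None
    return completed.returncode


# Demo with a stand-in command that sleeps longer than the limit.
print(run_with_timeout([sys.executable, "-c", "import time; time.sleep(5)"], 1.0))  # None
```

For the report above, the call would be `run_with_timeout(["pdfseparate", "poc2.pdf", "out/put-%d.pdf"], 30)`; a `None` result only shows that the tool did not finish within the limit, not where it is looping.
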
[poc2.pdf](/uploads/e7795393163ef4b3bef0e81dcf4f193f/poc2.pdf)

# Rendering issue with literata variable font
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1432
Author: Mara M · Updated: 2023-09-12T06:10:28Z

The right-pointing double angle quotation mark (U+00BB) is rendered wrongly in Literata (variable font). In the static version there is no problem.
![image](/uploads/b6119e867533c11e971aa9520e91b2cf/image.png)
[Test3-varmodern.pdf](/uploads/4b6b2df9ec88b9429302ed58ef780962/Test3-varmodern.pdf)

# Pdfseparate: A type error occurred while calling the object
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1433
Author: 刘文 · Updated: 2023-09-11T21:37:56Z

version:23.08.0<br>
My system OS:Ubuntu 20.04<br>
reproduce: pdfseparate poc.pdf output-%d.pdf<br>
Final error message: “Internal Error (0): Call to Object where the object was type 10, not the expected type 7”<br>
I rebuilt poppler with the fix for issues/1428, and the poc.pdf from issues/1428 no longer triggers the crash. However, I found a new PDF file that seems to trigger the same issue.<br>
The stack information is as follows:<br>
Program received signal SIGABRT, Aborted.<br>
__GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:50<br>
50 ../sysdeps/unix/sysv/linux/raise.c: No such file or directory.<br>
(gdb) bt<br>
#0 __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:50<br>
#1 0x00007ffff7593859 in __GI_abort () at abort.c:79<br>
#2 0x00007ffff7c28796 in Object::getDict (this=this@entry=0x7fffffffd888) at /Oscar01/liujiahao/poppler-master/poppler/Object.h:435<br>
#3 0x00007ffff7d437ab in PDFDoc::savePageAs (this=0x653fe0, name=..., pageNo=1) at /Oscar01/liujiahao/poppler-master/poppler/PDFDoc.cc:927<br>
#4 0x00000000004055ad in extractPages (srcFileName=<optimized out>, destFileName=0x7fffffffed1e "out/put-%d.pdf")
at /Oscar01/liujiahao/poppler-master/utils/pdfseparate.cc:123<br>
#5 main (argc=<optimized out>, argv=<optimized out>) at /Oscar01/liujiahao/poppler-master/utils/pdfseparate.cc:156<br>
[poc.pdf](/uploads/9d5e293105ec15bdf00d48272bac2f87/poc.pdf)

# The program seems to have fallen into a loop
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1431
Author: 刘文 · Updated: 2023-09-06T20:18:48Z

version:23.08.0
My system OS:Ubuntu 20.04
reproduce: pdfsig poc.pdf
[1.pdf](/uploads/78f7e55de28a7571d32b528142a84818/1.pdf)
[2.pdf](/uploads/ff9da01eddb6d4f96c8c362edb1aef65/2.pdf)
[3.pdf](/uploads/f5ae01c0ba61b264ca5f84af79b21a24/3.pdf)

# Failed conversion, file attached as requested ...
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1427
Author: Andrew Teal · Updated: 2023-09-05T19:20:57Z

[RAFA_Sud-Ouest_France_History_2021_ENG.pdf](/uploads/1f983c25717ddc41d808e257881b4776/RAFA_Sud-Ouest_France_History_2021_ENG.pdf)

# Pdfseparate: A type error occurred while calling the object
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1428
Author: 刘文 · Updated: 2023-09-04T22:50:56Z

version:23.08.0
My system OS:Ubuntu 20.04
reproduce: pdfseparate poc.pdf output-%d.pdf
Final error message: “Internal Error (0): Call to Object where the object was type 5, not the expected type 7”
The stack information is as follows:
Program received signal SIGABRT, Aborted.<br>
__GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:50<br>
50 ../sysdeps/unix/sysv/linux/raise.c: No such file or directory.<br>
(gdb) bt<br>
#0 __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:50<br>
#1 0x00007ffff7594859 in __GI_abort () at abort.c:79<br>
#2 0x00007ffff7d44a22 in PDFDoc::savePageAs (this=0x654560, name=..., pageNo=1) at /Oscar01/liujiahao/poppler/poppler/Object.h:435<br>
#3 0x00000000004055ad in extractPages (srcFileName=<optimized out>, destFileName=0x7fffffffed26 "output-%d.pdf") at /Oscar01/liujiahao/poppler/utils/pdfseparate.cc:123<br>
#4 main (argc=<optimized out>, argv=<optimized out>) at /Oscar01/liujiahao/poppler/utils/pdfseparate.cc:156<br>
The poc file is attached. If this is not an error or if it has already been discovered, we apologize for wasting your time. Thank you very much indeed.
[poc.pdf](/uploads/b425f436a6911615474db5418c7501cd/poc.pdf)

# Some characters are rendered badly in PDF
https://gitlab.freedesktop.org/poppler/poppler/-/issues/734
Author: Zlopez · Updated: 2023-08-30T09:21:28Z

Hi,
I have one PDF which is incorrectly rendered in Evince. I filed a [bug](https://github.com/flathub/org.gnome.Evince/issues/21) against the Evince flatpak, but I was redirected here as a result.
The document is in Czech and it shows wrong characters when opened:
![Screenshot_from_2019-03-04_11-03-54](/uploads/f732670666fff1a63cc85354f9757261/Screenshot_from_2019-03-04_11-03-54.png)
I tried to convert it to png using `pdftocairo -png input.pdf output` and got this:
![Screenshot_from_2019-03-04_11-05-12](/uploads/cda733cbceb3fe800de31b832f80638b/Screenshot_from_2019-03-04_11-05-12.png)
Unfortunately I can't share the whole PDF, because it contains personal addresses, but I can provide any log you need.

# vector anti-aliasing: pdftoppm produces different result (darker, more black) than chromium PDF reader
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1423
Author: Milan Hauth · Updated: 2023-08-25T18:49:57Z

i want to render PDF vector graphics to high quality raster graphics
for this job im using pdftoppm, but surprisingly, it creates a slightly different result than the chromium pdf renderer:
with pdftoppm, fine black lines are darker, so overall, the graphic looks darker
https://github.com/milahu/pdf-rendering-chromium-versus-pdftoppm
![pdf-rendering-chromium-versus-pdftoppm-png-720dpi](/uploads/68cbd034a5ccbe198e1f4dcfb6c3a3bd/pdf-rendering-chromium-versus-pdftoppm-png-720dpi.png)
[page-061.input-vectors.pdf](/uploads/311dd53f2f836e75ce6e75ecde96fb53/page-061.input-vectors.pdf)
the screenshot is with chromium PDF reader at full zoom (500%) which gives the same size as a 720dpi resolution image
i have tried running pdftoppm with different options, and only `-aaVector no` makes the result worse. so i guess im looking for a PDF rendering engine with more vector anti-aliasing, or with higher-precision anti-aliasing
| `pdftoppm` | `pdftoppm -aaVector no` |
| ------ | ------ |
| ![page-061.input-vectors.pdf.720dpi.png.trim](/uploads/4d9dfe2a36fecc5b97e80e2d78cebbd5/page-061.input-vectors.pdf.720dpi.png.trim.png) | ![page-061.input-vectors.pdf.720dpi.aaVector-no.png.trim](/uploads/bdea830a3ef2c11cdf2c669eb5fb1d76/page-061.input-vectors.pdf.720dpi.aaVector-no.png.trim.png) |

# Broken link on homepage
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1420
Author: Ryan Carsten Schmidt · Updated: 2023-08-11T20:40:15Z

The second line of [the poppler homepage](https://poppler.freedesktop.org) reads:
> _What's with [the name](http://www.gotfuturama.com/Information/Encyc-41-Popplers/)?_
The link _[the name](http://www.gotfuturama.com/Information/Encyc-41-Popplers/)_ is 404 not found.

# pdftotext: some numbers are converted as U+FFFD
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1418
Author: Thomas Meyer · Updated: 2023-08-06T20:36:49Z

When converting this pdf:
https://postgrespro.com/community/books/internals
Some numbers and characters get converted into the Unicode replacement character (U+FFFD), e.g.:
"London in ����" and
"I assume that the reader has already tried using
Postgre��� and has at least some
general understanding of how it works."
Bug or feature?
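
To see how widespread the problem is in a converted file, the replacement characters can be counted per line. The Python sketch below does this on a short sample modelled on the excerpts above. The function name `replacement_char_lines` is hypothetical, and in practice the text would be the `pdftotext` output read as UTF-8.

```python
def replacement_char_lines(text):
    """Yield (line_number, count) for every line containing U+FFFD."""
    for number, line in enumerate(text.splitlines(), start=1):
        count = line.count("\ufffd")
        if count:
            yield number, count


# Sample modelled on the excerpts quoted in this report.
sample = (
    "London in \ufffd\ufffd\ufffd\ufffd\n"
    "Postgre\ufffd\ufffd\ufffd and has at least some\n"
    "general understanding of how it works."
)
print(list(replacement_char_lines(sample)))  # [(1, 4), (2, 3)]
```
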
pdftotext version 23.07.0

# Problem using pdftohtml in a php proyect
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1416
Author: innovationstudio19 · Updated: 2023-08-02T16:13:14Z

Hello
I'm using this library https://github.com/tonchik-tm/pdf-to-html in my project in order to upload a PDF file, change some strings and generate the PDF file again.
On my localhost it works fine, but when I tried it on my server there was a problem. The pdftohtml component splits each page into an HTML file named filename-pagenumber.html, but the page number is not working well: it produces some strange characters instead of the page number.
![image](/uploads/dd74968793306f2282bb758f95ede813/image.png)
I have no idea why this is happening...
I hope you can help me
Thanks

# tsv mode misses whole page
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1414
Author: Fawaz Ahmed · Updated: 2023-07-31T21:02:46Z

I have this pdf and when I convert it to tsv, it misses the whole of page 2.
`-bbox-layout` works fine, i.e. it does include page 2.

# Page emits "Bogus memory allocation size" and hangs with 100% core utilization
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1406
Author: Anatoly I · Updated: 2023-07-12T22:09:06Z

Hi.
I'm trying to render the following page in 72 dpi jpeg using splash output dev.
[bad_page.pdf](/uploads/d0cf0c1c757c5a6787c3f7393feea0c6/bad_page.pdf)
Poppler ends up trying to allocate about 3.2Gig of memory, emits `Bogus memory allocation size` to stderr, and then hangs with 100% core utilization.
I tried to debug this and it seems that something goes wrong in `doTilingPatternFill`, because it calculates really huge numbers that later go to `Splash::drawImage` as the `w` and `h` params.

# Poppler renders patch 19.2 of Ghent PDF Output Suite V5.0 incorrectly
https://gitlab.freedesktop.org/poppler/poppler/-/issues/1410
Author: Kevin Ottens · Updated: 2023-07-10T09:08:23Z

The output for `pdftoppm -png -overprint` for `GWG192_DeviceN_Overprint_White_X1a.pdf` is incorrect in square d.
![GWG192_DeviceN_Overprint_White_X1a-1](/uploads/763883e6bb9a42202b0d19c9bf409f61/GWG192_DeviceN_Overprint_White_X1a-1.png)
The expected result is to have a completely white square for d. I checked with Adobe Acrobat and it renders a white square indeed.
Here is the file I tested with:
[GWG192_DeviceN_Overprint_White_X1a.pdf](/uploads/1db7387c8c05bdbedee16dd92b18d78f/GWG192_DeviceN_Overprint_White_X1a.pdf)
I tried to investigate it but I guess I've been looking at the wrong places so far. I initially thought it could be a problem with the `setOverprintMask()` call in `SplashOutputDev::fill()` but convinced myself it's likely unrelated. Also tried to check the alpha declared on the src and dest in the state at time of filling but didn't seem conclusive either. Even ended up looking at the commands as seen by the parser but couldn't spot anything suspicious.
Since my knowledge of this code base is very limited so far, I'm hoping someone better in the topic will have better ideas to investigate this...