pdftotext -htmlmeta outputs incomplete metadata
Submitted by madbiologist
Assigned to poppler-bugs
Description
This bug was originally reported at https://bugs.launchpad.net/ubuntu/+source/poppler/+bug/993292
pdftotext -htmlmeta output is missing metadata from PDF catalog. pdfinfo does output all values known:
e.g. a pdfinfo output:
Title: Titel Author: Word Creator: WordToPDF 2.4 build 127 Producer: AFPL Ghostscript 8.54 CreationDate: Fri Jul 2 09:14:02 2007 ModDate: Fri Jul 2 09:14:02 2007 Tagged: no Pages: 6 Encrypted: no Page size: 595 x 842 pts (A4) File size: 104664 bytes Optimized: no PDF version: 1.3
in contrast the meta section of the pdftotext -htmlmeta output:
<head>
<title>
Titel</title>
</head>