Feature Request: tabs in pdftotext -layout
Is it possible to have the -layout option add tab characters between columns when working with a multi-column PDF?
Or another option, say "-columntabs", to be used with -layout to do this?
I often work with 2 or 3 column PDF files with lots of text tables. Without -layout, the tables are a useless mess, so I use the -layout option and manually edit the text file with vim to insert tab characters between the columns. I then use a perl script to split each page on the tabs and convert it to single-column.
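
To illustrate the split-on-tabs step, here is a rough sketch in Python. It is not the perl script mentioned above, which isn't shown here. It assumes pages are separated by form feed characters (which pdftotext writes between pages) and that each line has one tab at every column boundary. The function name columns_to_single is made up for this example.

```python
def columns_to_single(text, sep="\t"):
    """Turn tab-delimited multi-column pages into single-column text.

    Assumes pages are separated by form feeds and that `sep` marks
    every column boundary on a line (both are assumptions).
    """
    out_pages = []
    for page in text.split("\f"):
        columns = []  # columns[i] collects the lines belonging to column i
        for line in page.splitlines():
            for i, cell in enumerate(line.split(sep)):
                # Grow the column list the first time a column index appears.
                while len(columns) <= i:
                    columns.append([])
                columns[i].append(cell.rstrip())
        # Emit column 0 in full, then column 1, and so on.
        out_pages.append("\n".join("\n".join(col) for col in columns))
    return "\f".join(out_pages)
```

If -layout (or a companion option) emitted the tabs itself, a script along these lines could run directly on the pdftotext output, with no manual editing in between.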
Even with vim, adding the tabs can be a long and tedious process, especially with dozens or hundreds of pages.
It would be great if I could skip the "edit with vim" stage, or even minimise that step to just removing excess tabs from tables.