Feature Request: tabs in pdftotext -layout
Is it possible to have the
-layout option add tab characters between columns, when working with a multi-column PDF?
Or another option, say "
-columntabs", to be used with -layout to do this?
I often work with 2 or 3 column PDF files with lots of text tables. Without
-layout, the tables are a useless mess so I use the
-layout option and manually edit the text file with vim to insert tab characters between the columns, then use a perl script to split each page on the tabs and convert it to single-column.
Even with vim, adding the tabs can be a long and tedious process, especially with dozens or hundreds of pages.
It would be great if I could skip the "edit with vim" stage, or even minimise that step to just removing excess tabs from tables.