Alternate text for images in output XML / HTML
Is it possible to include the alt text of images (if they have one) in the output XML as well?
According to this SO post:
https://stackoverflow.com/questions/12525883/accessing-alternate-text-for-an-image-via-pdfbox
the alt text is not stored next to the image but in the StructTreeRoot, so to get it we need to traverse the PDF structure tree.
If it's not currently possible, how should we approach this? Would it be a lot of work? I am not a C++ developer, but I can give it a try if it's more or less trivial.
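
To make the traversal idea concrete, here is a rough sketch of what I have in mind. It does not use this project's code or any real PDF library: `StructElem`, `figureAltTexts` and `makeExampleTree` are hypothetical names, and the struct is only a simplified in-memory stand-in for the structure elements that hang off StructTreeRoot (type from /S, alt text from /Alt, children from /K). Mapping each alt text back to the matching image in the output would still need to be worked out.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical, simplified model of one structure element in a tagged PDF.
// This is NOT a type from any real PDF library.
struct StructElem {
    std::string type;                  // structure type (/S), e.g. "Document", "P", "Figure"
    std::string alt;                   // alternate text (/Alt); empty when the entry is absent
    std::vector<StructElem> children;  // child structure elements (/K)
};

// Walk the tree depth-first (pre-order) and collect the alt text of every
// Figure element that has one, in document order.
std::vector<std::string> figureAltTexts(const StructElem& root) {
    std::vector<std::string> result;
    std::vector<const StructElem*> stack{&root};
    while (!stack.empty()) {
        const StructElem* node = stack.back();
        stack.pop_back();
        assert(node != nullptr);
        if (node->type == "Figure" && !node->alt.empty()) {
            result.push_back(node->alt);
        }
        // Push children in reverse so the first child is visited first.
        for (auto it = node->children.rbegin(); it != node->children.rend(); ++it) {
            stack.push_back(&*it);
        }
    }
    return result;
}

// Small made-up tree for illustration: two figures with alt text, one without.
StructElem makeExampleTree() {
    return StructElem{"Document", "", {
        StructElem{"P", "", {}},
        StructElem{"Figure", "Company logo", {}},
        StructElem{"Sect", "", {
            StructElem{"Figure", "", {}},
            StructElem{"Figure", "Sales chart", {}},
        }},
    }};
}
```

Calling `figureAltTexts(makeExampleTree())` returns the two non-empty alt texts in reading order. In a real implementation the tree would of course come from the PDF parser rather than being built by hand.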