Poppler Glib: Problems related to poppler_page_get_text() and poppler_page_get_text_layout()
The description of the poppler_page_get_text_layout() method says:
"The position in the array represents an offset in the text returned by poppler_page_get_text()"
so we can get the relationship between the two arrays. I know this works with English text, but is there any way to get the corresponding relationship in other languages, including Chinese?
For example, 你好, World! (你好 means Hello) would be interpreted as
\xE4 \xBD \xA0 \xE5 \xA5 \xBD \x2C \x20 \x57 \x6F \x72 \x6C \x64 \x21
But each Chinese character takes three bytes here, so we obviously can't get the corresponding character by using the position of the rectangle as a byte offset. Is there any way to solve this problem?
Thanks.
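To make the mismatch concrete, here is a minimal sketch in plain C. It assumes (this is not confirmed above) that the index into the rectangle array counts characters rather than bytes of the UTF-8 text. Under that assumption, the rectangle index has to be converted to a byte offset before indexing the string. The helper names utf8_seq_len and char_index_to_byte_offset are made up for illustration and are not part of Poppler; in a GLib program, g_utf8_offset_to_pointer() should do the same job.

```c
#include <assert.h>
#include <stddef.h>

/* Byte length of the UTF-8 sequence that starts with lead byte c.
   Assumes the input is valid UTF-8. */
size_t utf8_seq_len(unsigned char c)
{
    if (c < 0x80) return 1;            /* ASCII */
    if ((c & 0xE0) == 0xC0) return 2;
    if ((c & 0xF0) == 0xE0) return 3;  /* e.g. most Chinese characters */
    if ((c & 0xF8) == 0xF0) return 4;
    return 1;                          /* invalid lead byte: step one byte */
}

/* Hypothetical helper: byte offset of the character at char_index.
   Returns the byte length of text if char_index is past the end. */
size_t char_index_to_byte_offset(const char *text, size_t char_index)
{
    assert(text != NULL);
    size_t byte = 0;
    while (char_index > 0 && text[byte] != '\0') {
        size_t len = utf8_seq_len((unsigned char)text[byte]);
        /* Advance over one character, never stepping past the NUL. */
        for (size_t k = 0; k < len && text[byte] != '\0'; k++)
            byte++;
        char_index--;
    }
    return byte;
}
```

With the string from the question, character 0 (你) starts at byte 0, character 1 (好) at byte 3, and character 4 (W) at byte 8. So if rectangle i really describes character i, its text starts at char_index_to_byte_offset(text, i).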