Skip to content
  • Adrián Pérez de Castro's avatar
    Tagged-PDF: Implement parsing of StructTreeRoot · a322e14d
    Adrián Pérez de Castro authored and Carlos Garcia Campos's avatar Carlos Garcia Campos committed
    Implement parsing of the StructTreeRoot entry of the Catalog. Also, the
    Catalog::getStructTreeRoot() and PDFDoc::getStructTreeRoot() methods are
    modified to return an instance of StructTreeRoot instead of an Object.
    
    All elements from the StructTreeRoot are parsed except for:
    
    - IDTree: it is a lookup tree to locate items by their ID, which would
      be barely useful because the whole structure tree is to be kept in
      memory, which should be fast enough to traverse.
    - ParentTreeNextKey: This is needed only when the ParentTree object is
      to be modified. For the moment the implementation deals only with
      reading, so this has been deliberately left out.
    
    StructElem tree nodes from the document structure tree are parsed as a
    StructElement instance. Attributes and extraction of content out from
    elements are not yet handled.
    
    https://bugs.freedesktop.org/show_bug.cgi?id=64815
    a322e14d