You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
unpin pdfminer-six (#415)
- higher versions can have bugs that user have reported
- unpin it in the base.in so leaves other packages and user's constrains
to resolve it
Pprados/fix pdfminer dep (#410)
With langchain or other libraries, forcing a version of pdfminer.six
makes it impossible to combine different modules.
---------
Co-authored-by: John J <[email protected]>
feat: add back source attribute for backward compatibility (#407)
This PR adds `source` back to `TextRegions` and `LayoutElements` for
backward compatibility.
Feat/add more attributes to layoutelements (#404)
* feat: add `text_as_html` and `table_as_cells` to `LayoutElements`
class as new attributes
* feat: replace the single valueed `source` attribute from `TextRegions`
and `LayoutElements` with an array attribute `sources`
remove layoutparser lib (#403)
this PR is to remove the layoutparser lib as we no longer rely on it
anymore
hence the README.md is also updated to drop the note on supporting
layoutparser model zoo
fix: fix bugs in data structure (#402)
- fix bug when an empty list is passed into `TextRegions.from_list`
- fix bug when concatenating a list of `LayoutElements` the class id
maps is not updated correctly