Skip to content

Tags: Unstructured-IO/unstructured-inference

Tags

0.8.9

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
unpin pdfminer-six (#415)

- higher versions can have bugs that user have reported
- unpin it in the base.in so leaves other packages and user's constrains
to resolve it

0.8.8

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Pprados/fix pdfminer dep (#410)

With langchain or other libraries, forcing a version of pdfminer.six
makes it impossible to combine different modules.

---------

Co-authored-by: John J <[email protected]>

0.8.7

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add password with PDF files (#392)

Add password with PDF files.
Must be combined with [PR 3721 in
unstructured](Unstructured-IO/unstructured#3721)

---------

Co-authored-by: John J <[email protected]>

0.8.6

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat: add back source attribute for backward compatibility (#407)

This PR adds `source` back to `TextRegions` and `LayoutElements` for
backward compatibility.

0.8.5

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
chore: update ci to test more python version (#394)

This PR expands the ci testing to include python 3.10, 3.11, and 3.12.

0.8.4

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Feat/add more attributes to layoutelements (#404)

* feat: add `text_as_html` and `table_as_cells` to `LayoutElements`
class as new attributes
* feat: replace the single valueed `source` attribute from `TextRegions`
and `LayoutElements` with an array attribute `sources`

0.8.3

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
remove layoutparser lib (#403)

this PR is to remove the layoutparser lib as we no longer rely on it
anymore

hence the README.md is also updated to drop the note on supporting
layoutparser model zoo

0.8.2

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: fix bugs in data structure (#402)

- fix bug when an empty list is passed into `TextRegions.from_list`
- fix bug when concatenating a list of `LayoutElements` the class id
maps is not updated correctly

0.8.1

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: list index out of range error caused by calling LayoutElements.f…

…rom_list() with empty list (#398)

error coming from
https://github.com/Unstructured-IO/core-product/actions/runs/11510770484/job/32043653203?pr=662#step:5:718
or lmk if there are better way to fix error w/o changing here :)

0.8.0

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
chore: make a minor bump instead of patch (#397)