Releases · VikParuchuri/surya

Refactor

This is a complete refactor of surya - the code is now cleaner and better organized. Models are now imported and used differently, here is an example for OCR:

from PIL import Image
from surya.recognition import RecognitionPredictor
from surya.detection import DetectionPredictor

image = Image.open(IMAGE_PATH)
langs = ["en"] # Replace with your languages or pass None (recommended to use None)
recognition_predictor = RecognitionPredictor()
detection_predictor = DetectionPredictor()

predictions = recognition_predictor([image], [langs], detection_predictor)

See the README for how to use other models.

Table recognition

There is a new table recognition model which detects colspans/rowspans better, along with header cells. It also isn't as complex to use, since it operates on just the images versus the images and bboxes.

What's Changed

Layout improvements by @VikParuchuri in #267
New table model; total refactor by @VikParuchuri in #279
Add ci workflow by @VikParuchuri in #284

Full Changelog: v0.8.3...v0.9.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New OCR model and streamlit app

What's Changed

New Contributors

Contributors

Refactor

Table recognition

What's Changed

Contributors

What's Changed

Contributors

What's Changed

New Contributors

Contributors

Releases: VikParuchuri/surya

Fix pytorch 2.6 bug

Pin pytorch

Add LaTeX OCR model

New OCR model and streamlit app

What's Changed

New Contributors

Contributors

Fix cli script issue

Better polygon type checking

Fix rowspan bug

Refactor surya; new table recognition model

Refactor

Table recognition

What's Changed

Contributors

Pin pypdfium2

New layout model

What's Changed

Contributors

Add bad OCR detection model

What's Changed

New Contributors

Contributors