Skip to content

Releases: VikParuchuri/surya

Fix pytorch 2.6 bug

31 Jan 02:36
780d351
Compare
Choose a tag to compare

Fix bug that caused issues on MPS (Mac) devices when using pytorch 2.6.

Pin pytorch

30 Jan 18:07
551584b
Compare
Choose a tag to compare

Pytorch 2.6.0 doesn't work well with some of the models on MPS (Mac), so pinning to the old version.

Add LaTeX OCR model

29 Jan 15:04
31d9126
Compare
Choose a tag to compare

New OCR model and streamlit app

  • Release a new LaTeX OCR model
  • Add streamlit app to interactively select and OCR equations
image

What's Changed

New Contributors

Full Changelog: v0.9.3...v0.10.0

Fix cli script issue

24 Jan 15:39
aa8ee5a
Compare
Choose a tag to compare

Fix issue with cli scripts and folders.

Better polygon type checking

24 Jan 01:57
d9213c8
Compare
Choose a tag to compare

Improve how polygons are type checked in the schema.

Fix rowspan bug

23 Jan 21:45
9e93438
Compare
Choose a tag to compare

Fixes a bug where rowspans weren't included in table model predictions.

Refactor surya; new table recognition model

22 Jan 18:45
bdda488
Compare
Choose a tag to compare

Refactor

This is a complete refactor of surya - the code is now cleaner and better organized. Models are now imported and used differently, here is an example for OCR:

from PIL import Image
from surya.recognition import RecognitionPredictor
from surya.detection import DetectionPredictor

image = Image.open(IMAGE_PATH)
langs = ["en"] # Replace with your languages or pass None (recommended to use None)
recognition_predictor = RecognitionPredictor()
detection_predictor = DetectionPredictor()

predictions = recognition_predictor([image], [langs], detection_predictor)

See the README for how to use other models.

Table recognition

There is a new table recognition model which detects colspans/rowspans better, along with header cells. It also isn't as complex to use, since it operates on just the images versus the images and bboxes.

What's Changed

Full Changelog: v0.8.3...v0.9.0

Pin pypdfium2

01 Jan 03:10
fc07060
Compare
Choose a tag to compare

Pin pypdfium2 version - newest version can cause issues.

New layout model

30 Dec 17:12
ac03917
Compare
Choose a tag to compare

Layout model is twice as fast and more accurate.

What's Changed

Full Changelog: v0.8.1...v0.8.2

Add bad OCR detection model

19 Dec 17:22
0a82cc7
Compare
Choose a tag to compare
  • Add a model to detect bad OCR text
  • Add top_k predictions to layout
  • Add in test suite

What's Changed

New Contributors

Full Changelog: v0.8.0...v0.8.1