Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Downloading detection and recognition models takes a lot of time and space on my pod #746

Open
ai-qlik opened this issue Jan 14, 2025 · 0 comments
Labels
question Further information is requested

Comments

@ai-qlik
Copy link

ai-qlik commented Jan 14, 2025

How to prevent Docling from Downloading detection and recognition models

I have deployed a REST endpoint in docker which calls Docling to parse (convert) documents. Every time the code is called, docling starts by download detection and recognition models which is time consuming and heavy on the memory. I would like to turn this feature off and prevent docling from downloading any models!

Please see my very basic code below:

def parse_with_docling(pipeline_input): 
        doc_converter = DocumentConverter()
        input_doc_path = Path(pipeline_input.input_path)
        return doc_converter.convert(input_doc_path).document

I have a hunch that this is caused by the EasyOCR. I would like to set the download_enabled to false for EasyOCR without limiting the OCR feature to EasyOCR.

Thanks in advance!
Arash

Screenshot 2025-01-14 at 9 11 38 AM
Screenshot 2025-01-14 at 9 12 55 AM

@ai-qlik ai-qlik added the question Further information is requested label Jan 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant