Downloading detection and recognition models takes a lot of time and space on my pod #746

ai-qlik · 2025-01-14T14:25:37Z

How to prevent Docling from Downloading detection and recognition models

I have deployed a REST endpoint in docker which calls Docling to parse (convert) documents. Every time the code is called, docling starts by download detection and recognition models which is time consuming and heavy on the memory. I would like to turn this feature off and prevent docling from downloading any models!

Please see my very basic code below:

def parse_with_docling(pipeline_input): 
        doc_converter = DocumentConverter()
        input_doc_path = Path(pipeline_input.input_path)
        return doc_converter.convert(input_doc_path).document

I have a hunch that this is caused by the EasyOCR. I would like to set the download_enabled to false for EasyOCR without limiting the OCR feature to EasyOCR.

Thanks in advance!
Arash

The text was updated successfully, but these errors were encountered:

ai-qlik added the question Further information is requested label Jan 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Downloading detection and recognition models takes a lot of time and space on my pod #746

Downloading detection and recognition models takes a lot of time and space on my pod #746

ai-qlik commented Jan 14, 2025

Downloading detection and recognition models takes a lot of time and space on my pod #746

Downloading detection and recognition models takes a lot of time and space on my pod #746

Comments

ai-qlik commented Jan 14, 2025

How to prevent Docling from Downloading detection and recognition models