Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Docling runs out of memory when converting a pdf #773

Open
marfago opened this issue Jan 18, 2025 · 4 comments
Open

Docling runs out of memory when converting a pdf #773

marfago opened this issue Jan 18, 2025 · 4 comments
Assignees
Labels
bug Something isn't working

Comments

@marfago
Copy link

marfago commented Jan 18, 2025

Docling fails to convert a pdf claiming all the available memory and eventually running OOM.
OS: Win11 Home

docling 2.15.1
docling-core 2.14.0
docling-ibm-models 3.1.2
docling-parse 3.1.0

from docling.document_converter import DocumentConverter

if __name__ == '__main__':
    print(DocumentConverter().convert("https://arxiv.org/pdf/2209.07650v1"))

@marfago marfago added the bug Something isn't working label Jan 18, 2025
@dolfim-ibm
Copy link
Contributor

@marfago can you please provide a few more details on your system? How much memory? Do you know if the OOM was from the main system or the GPU?

@marfago
Copy link
Author

marfago commented Jan 20, 2025

@dolfim-ibm My system runs with 48GB of RAM on Win Home 11 and It fails the same way with CPU or CUDA. With CUDA it seems that the GPU is never really used, so maybe it is unrelated.

Please let me know what other details you need.

@dolfim-ibm
Copy link
Contributor

We definitely have Docling running on systems with just a few GB of RAM. We need to think of a few other inputs which could help debugging your issue. Will keep you posted.

@marfago
Copy link
Author

marfago commented Jan 20, 2025

Please try the simple code above. My issue does not happen with all the PDFs, but only with https://arxiv.org/pdf/2209.07650v1.

@PeterStaar-IBM PeterStaar-IBM self-assigned this Jan 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants