You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am running this code in a docker container, I also tested it locally but I always get into the same issue. The pipeline never finishes and never enter the chunking phase. I am processing about 43K documents of different types (html, xml, pdf, docx, json ...) since JSON is no supported they just don't get processed. The pipeline hangs on the partition phase and never enter chunking.
The text was updated successfully, but these errors were encountered:
I ran into this problem with my pipeline as well, my quick fix was to set num_processes to 1 and that seemed to work. I'm not sure what's causing it but it seems there's some kind of problem with having more than one process -- even the default of 2 hangs for me.
If anyone's experienced this and found a way to run it with multiple processes it'd be great to hear.
I am using unstructured-ingest version 0.3.0, using the following code:
I am running this code in a docker container, I also tested it locally but I always get into the same issue. The pipeline never finishes and never enter the chunking phase. I am processing about 43K documents of different types (html, xml, pdf, docx, json ...) since JSON is no supported they just don't get processed. The pipeline hangs on the partition phase and never enter chunking.
The text was updated successfully, but these errors were encountered: