Extract images? #20

Zlitus · 2020-06-22T11:24:15Z

Would be nice if the lib could extract a list of images (identifier/pathnames by pages) and give a way to extract some of them. Especially useful to see when a pdf has 0 text but many images, then a OCR work aside can be started.

Thank you for your work 👍.

shartoo · 2021-09-16T00:58:47Z

Another repo pdf-lib may helps,is there any repo could extract both text and image conveniently?

programmerWhite · 2022-05-05T08:50:48Z

my greate author, i want know the lib could extract a list of images , had ok? i want this function。thanks

Hellsfoul · 2023-03-03T08:11:32Z

Sad, that the image stream is not in the data object at all.

Otherwise great lib!

ffalt added the help wanted label Dec 10, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extract images? #20

Extract images? #20

Zlitus commented Jun 22, 2020

shartoo commented Sep 16, 2021

programmerWhite commented May 5, 2022

Hellsfoul commented Mar 3, 2023

Extract images? #20

Extract images? #20

Comments

Zlitus commented Jun 22, 2020

shartoo commented Sep 16, 2021

programmerWhite commented May 5, 2022

Hellsfoul commented Mar 3, 2023