Extract images? #20

Open
Zlitus opened this issue Jun 22, 2020 · 3 comments

Comments

@Zlitus

Zlitus commented Jun 22, 2020

It would be nice if the lib could list the images in a PDF (identifiers/pathnames per page) and provide a way to extract some of them. This would be especially useful for detecting when a PDF has no text but many images, so that a separate OCR job can be started.

Thank you for your work 👍.
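
As a stopgap for the "no text but many images" check, here is a minimal Python sketch. It is not part of this lib and does not use its API. It assumes you already have the extracted text from whatever parser you use, plus the raw file bytes. The function names `count_image_markers` and `looks_like_scan` are hypothetical. The heuristic relies on image XObjects declaring `/Subtype /Image` in their dictionaries; it only counts markers, does not extract anything, and may miss inline images or miscount in unusual files.

```python
import re

# Image XObjects declare "/Subtype /Image" in their dictionary.
# Whitespace between the two names is optional in PDF syntax.
IMAGE_MARKER = re.compile(rb"/Subtype\s*/Image\b")


def count_image_markers(pdf_bytes: bytes) -> int:
    """Rough count of image XObject declarations in the raw PDF bytes."""
    return len(IMAGE_MARKER.findall(pdf_bytes))


def looks_like_scan(extracted_text: str, pdf_bytes: bytes) -> bool:
    """True when no text was extracted but at least one image marker exists."""
    has_text = bool(extracted_text.strip())
    return not has_text and count_image_markers(pdf_bytes) > 0
```

A caller could read the file with `open(path, "rb").read()`, pass it in together with the text the parser returned, and queue the file for OCR when `looks_like_scan` returns `True`.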

@shartoo

shartoo commented Sep 16, 2021

Another repo, pdf-lib, may help. Is there any repo that can extract both text and images conveniently?

@programmerWhite

Dear author, I would like to know whether the lib can extract a list of images. Is that possible yet? I would like to have this feature. Thanks.

@Hellsfoul

Sad that the image stream is not in the data object at all.

Otherwise great lib!


5 participants