getTextArray: Add trim check in Do command #762
Conversation
@k00ni What is the progress of this PR? Are we waiting for something to merge it?
Thank you for this PR. I am currently very busy and hope I can give more feedback by the end of this month. But to be quite frank, I have no knowledge of this topic. It would help if you could add a link to the specification which covers this case. Maybe @j0k3r, @GreyWyvern, @unixnut (or someone else from the community) has some thoughts about this that they can share in the meantime.
I just created a fix for the specific PDF (see PR). I didn't find a specification that addresses this issue. I only see the following situation:
Hopefully someone can look at it, as it's a blocker for a project I'm working on!
For neatness, can you please do a "git rebase" and squash your commits into one before making a PR? Especially when one of them fixes a typo, as is the case here. Thank you for your efforts and for including tests!
If that is the case I would decline this PR. Because of my limited time I can't look into this matter any further, but I would be more than open to reading any (technical) arguments and/or links which show that this change will not lead to a breaking change, even though I am not sure how rare this case might be in the wild. There might be an argument to use a switch in the We use semantic versioning and our users trust the behavior of PDFParser. @j0k3r what do you suggest we do here?
Type of pull request
About
See: #761
Summary: If a document contains a Do command, it returns text that does not correspond to the Text Matrix returned by getDataTm. This is because the Do command object is a Smalot\PdfParser\XObject\Image object, which always returns an empty string ''.
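The mismatch can be illustrated with a small model. This is a Python sketch, not PdfParser's PHP code: ImageObject, text_array and the trim_check flag are hypothetical names, and it assumes the fix simply skips Do results that are empty after trimming.

```python
# Simplified model of the problem; none of these names come from PdfParser.

class ImageObject:
    """Stands in for an image XObject, which has no text to offer."""

    def get_text(self):
        return ""


def text_array(commands, xobjects, trim_check):
    """Collect one text entry per command; 'Do' looks up an XObject by name."""
    texts = []
    for operator, operand in commands:
        if operator == "Tj":
            # Plain text-showing command: the operand is the text itself.
            texts.append(operand)
        elif operator == "Do":
            text = xobjects[operand].get_text()
            # Assumed fix: skip results that are blank after trimming.
            if trim_check and text.strip() == "":
                continue
            texts.append(text)
    return texts


commands = [("Tj", "Hello"), ("Do", "Im1"), ("Tj", "World")]
xobjects = {"Im1": ImageObject()}

without_check = text_array(commands, xobjects, trim_check=False)
with_check = text_array(commands, xobjects, trim_check=True)
```

Without the check, the image contributes an empty entry between the two text entries; with the check, only the two real text entries remain. Whether dropping the empty entry changes behavior that existing users rely on is the open question raised in the conversation.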