Segmentation also of the Compound ID #112

Open
HiteSit opened this issue Sep 26, 2024 · 3 comments

@HiteSit

HiteSit commented Sep 26, 2024

Dear Development team,

Looking through your code, I noticed that there is an option to segment the structures but not the relevant compound ID that accompanies them in many patents (usually in more or less the same style; an example is attached). I imagine a protocol that also segments the ID, after which a simple OCR engine (pytesseract) or a more complex one (perhaps something based on deep learning) could recognise the ID number and associate it with the structure.
I'm aware that the ID is not at a constant position in every patent: for example, sometimes it is 12ptx from the recognised structure and other times 6ptx, and sometimes it is horizontally centred and other times not. Still, I can imagine some sort of sample script in which the user adjusts a few parameters until they are satisfied with the segmentation.
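
To make the idea concrete, here is a minimal sketch of the "look below the structure, then OCR" step. It is not taken from the project's code: the function names, the `(y0, x0, y1, x1)` bounding-box order, and the `offset`, `height`, and `side_margin` parameters are all assumptions made for illustration. The only third-party call used is `pytesseract.image_to_string`, which needs the Tesseract binary installed; a different OCR function can be passed in instead.

```python
from typing import Callable, Optional, Tuple

# Hypothetical convention for this sketch: (y0, x0, y1, x1) in pixels.
BBox = Tuple[int, int, int, int]


def label_search_region(
    bbox: BBox,
    page_size: Tuple[int, int],
    offset: int = 12,
    height: int = 40,
    side_margin: int = 0,
) -> BBox:
    """Return the box just below a structure where its ID is expected.

    offset, height and side_margin are the user-tunable parameters:
    the gap below the structure, the height of the strip to search,
    and how far the strip extends past the structure on each side.
    The result is clipped to the page, given as (height, width).
    """
    y0, x0, y1, x1 = bbox
    page_h, page_w = page_size
    top = min(y1 + offset, page_h)
    bottom = min(top + height, page_h)
    left = max(x0 - side_margin, 0)
    right = min(x1 + side_margin, page_w)
    return (top, left, bottom, right)


def read_compound_id(
    page_image,
    region: BBox,
    ocr: Optional[Callable[[object], str]] = None,
) -> str:
    """Crop `region` from a PIL-style image and return the OCR'd text."""
    if ocr is None:
        # Imported lazily so the geometry helper works without Tesseract.
        import pytesseract

        def ocr(img):
            # --psm 7 asks Tesseract to treat the crop as one text line.
            return pytesseract.image_to_string(img, config="--psm 7")

    top, left, bottom, right = region
    # PIL's Image.crop expects (left, upper, right, lower).
    crop = page_image.crop((left, top, right, bottom))
    return ocr(crop).strip()
```

A user would call `label_search_region` once per segmented structure and adjust `offset`, `height`, and `side_margin` for a given patent layout until the strip reliably covers the ID, then pass the strip to `read_compound_id`. IDs placed beside or above the structure would need a different search region, which this sketch does not cover.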

Before I start to see whether I can do it myself: is there a specific reason why such a feature was not implemented, and what could the challenges be?

Thanks very much, and terrific work.

image

@Kohulan
Owner

Kohulan commented Sep 26, 2024

Hi @HiteSit,

Thank you for your interest in our work! We're continually working to improve the project, though we do face some limitations.

We’ve already been considering this idea, but we are currently focusing on journal articles rather than patents. However, if you have a solution in mind and would like to contribute, we’d be more than happy to review a pull request.

Kind regards,
Kohulan

Kohulan self-assigned this Sep 26, 2024
Kohulan added the enhancement (New feature or request) label Sep 26, 2024
@HiteSit
Author

HiteSit commented Oct 8, 2024

Yep, I do.
I created a fork and am working on it.

@Kohulan
Owner

Kohulan commented Oct 8, 2024

Hi @HiteSit,

Thank you, that would be a great addition!

Kind regards,
Kohulan
