You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Locking trought your code i've noticed that there is not an option to segment not only the structures but also the relevant ID that is often present in many patent (more or less with the same style, attached an example). I'm imagining a protocol that segment also the ID and than a simple OCR (pytesseract) or more complex OCR (maybe something based on DL) could recognise the number ID and associate it to the structure.
I'm aware of the fact that not in all the patent the ID is present in a constant position (for example sometimes is at 12ptx another times is at 6ptx from the recognised structures. Or sometimes is horizontally and centrated other times is not centrated). But again I can imagine some sort of sample script in which the user input some parameters until is not satisfied of the segmentation.
Before that I start to see if I can do it by myself there is a specific reason why such feature was not implmented and/or what could be the challenges.
Thanks much and terrific work
The text was updated successfully, but these errors were encountered:
Thank you for your interest in our work! We're continually working to improve the project, though we do face some limitations.
We’ve already been considering this idea and are currently focusing on journal articles, rather than patents. However, if you have a solution in mind and would like to contribute, we’d be more than happy to review a pull request.
Dear Development team,
Locking trought your code i've noticed that there is not an option to segment not only the structures but also the relevant ID that is often present in many patent (more or less with the same style, attached an example). I'm imagining a protocol that segment also the ID and than a simple OCR (pytesseract) or more complex OCR (maybe something based on DL) could recognise the number ID and associate it to the structure.
I'm aware of the fact that not in all the patent the ID is present in a constant position (for example sometimes is at 12ptx another times is at 6ptx from the recognised structures. Or sometimes is horizontally and centrated other times is not centrated). But again I can imagine some sort of sample script in which the user input some parameters until is not satisfied of the segmentation.
Before that I start to see if I can do it by myself there is a specific reason why such feature was not implmented and/or what could be the challenges.
Thanks much and terrific work
The text was updated successfully, but these errors were encountered: