Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finder & parser gold from the EXcite project #190

Closed
cboulanger opened this issue Aug 11, 2022 · 3 comments
Closed

Finder & parser gold from the EXcite project #190

cboulanger opened this issue Aug 11, 2022 · 3 comments

Comments

@cboulanger
Copy link
Contributor

Hi, I have translated (hopefully correctly) the gold standard annotations from the EXcite project to the anystyle format:

https://github.com/cboulanger/excite-docker/tree/main/Dataset/default/anystyle

It contains manually annotated references from 200+ German and English social science open source articles. Please review and add it if you find it useful. I can also fix bugs the tranlation algorithm should there be any problems in the output.

@inukshuk
Copy link
Owner

Nice! We can add them everything that gets parsed without errors to the gold set and review any references that produce errors for addition to the core set.

@cboulanger
Copy link
Contributor Author

I have further cleaned up and fixed the excite goldstandard based and put it into a gist for your convenience:

https://gist.github.com/cboulanger/517a209ab30f61c74105e76699d03c24

I will keep updating it when I find more annotation bugs.

@cboulanger
Copy link
Contributor Author

Superseded by #191

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants