We participated in the ARISE hackathon. We worked on aligning the Nederlands Soortenregister (Dutch Species Register) with Wikidata, with the eventual goal of connecting it to the linked data cloud. This enables reuse in a wide variety of use cases, some of which we explored during this hackathon.
graph TD;
NS[Nederlands Soortenregister]==>|Trixidata notebook|WD[Wikidata];
WD==>|Trixidata notebook|WP[Wikipedia];
G[GBIF]-->WD;
I[iNaturalist]-->WD;
DOI[Literature]-->WD;
ORCID[Person]-->WD;
WD-->EE[Entity Explosion]
The result is a Jupyter notebook, which we called the Trixidata Notebook. The notebook takes a CSV export of a (sub)list from the Nederlands Soortenregister and aligns it with Wikidata. Once aligned, we are able to identify:
- identifiers missing from Wikidata for GBIF, iNaturalist, or even the Nederlands Soortenregister itself;
- missing Wikipedia articles;
- species from that list that are not yet covered in Wikidata;
- potential references;
- images that can be reused in various use cases.
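The alignment step can be illustrated with a minimal sketch. This is not the Trixidata Notebook itself. It assumes the CSV export has a column named `scientific_name` (a hypothetical name), that matching is done on the Wikidata taxon name property (P225), and that the identifier properties are P846 (GBIF), P3151 (iNaturalist) and P3405 (Nederlands Soortenregister); please verify these property IDs on Wikidata before relying on them. The query is only built as a string here, so the sketch runs without network access.

```python
import csv
import io

# Wikidata identifier properties, to the best of our knowledge.
# Assumption: verify each ID on wikidata.org before use.
PROPS = {
    "gbif": "P846",
    "inaturalist": "P3151",
    "nsr": "P3405",
}


def read_taxon_names(csv_text, column="scientific_name"):
    """Return the non-empty taxon names from a CSV export.

    The column name is a hypothetical default, not the real export header.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row[column].strip() for row in reader if (row.get(column) or "").strip()]


def build_query(names):
    """Build a SPARQL query that matches names on taxon name (P225)."""
    values = " ".join('"{}"'.format(n.replace('"', '\\"')) for n in names)
    optionals = "\n".join(
        "  OPTIONAL {{ ?item wdt:{} ?{} . }}".format(pid, key)
        for key, pid in PROPS.items()
    )
    return (
        "SELECT ?name ?item " + " ".join("?" + k for k in PROPS) + " WHERE {\n"
        "  VALUES ?name { " + values + " }\n"
        "  ?item wdt:P225 ?name .\n"
        + optionals + "\n}"
    )


def find_gaps(names, matches):
    """Split names into those without a Wikidata item and those missing identifiers.

    `matches` maps a taxon name to a dict of the values found for it,
    e.g. {"item": "...", "gbif": "..."}; absent keys count as missing.
    """
    unmatched = [n for n in names if n not in matches]
    missing = {}
    for n in names:
        if n in matches:
            gaps = [k for k in PROPS if not matches[n].get(k)]
            if gaps:
                missing[n] = gaps
    return unmatched, missing
```

Used on a two-row export, with a hand-written `matches` dict standing in for the query results (the item and identifier values are placeholders, not real data):

```python
names = read_taxon_names("scientific_name\nVulpes vulpes\nParus major\n")
matches = {"Vulpes vulpes": {"item": "Q_PLACEHOLDER", "gbif": "ID_PLACEHOLDER"}}
unmatched, missing = find_gaps(names, matches)
# unmatched lists species not yet in Wikidata; missing lists absent identifiers
```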
We started the following list of Wikipedia articles based on the results extracted with the Notebook.
Building on earlier work done in previous hackathons (e.g. the Alien CSI Hackathon), we started linking collectors, collections and their species using Wikidata. Starting from a spreadsheet, the entries are matched to Wikidata items. Once aligned, we can link them to other parts of Wikidata.
Using the same workflow as in the Trixidata Notebook, it was possible to identify missing Wikipedia articles for a set of plant species that are eaten by iguanas. Starting from a slightly different input set than the Nederlands Soortenregister, 55 prospective new Wikipedia articles were identified for future writing.
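One way to detect missing Wikipedia articles, once species are matched to Wikidata items, is to check each item's sitelinks. The sketch below is an assumption about how this could be done, not the notebook's actual code. It expects a dictionary shaped like the JSON returned by the Wikidata `wbgetentities` API with `props=sitelinks`. The target wiki (`nlwiki` here) is a guess, since the text does not state which language edition was checked.

```python
def items_missing_article(entities_response, wiki="nlwiki"):
    """Return the sorted IDs of items that have no sitelink to `wiki`.

    `entities_response` is assumed to follow the wbgetentities layout:
    {"entities": {"<QID>": {"sitelinks": {"<wiki>": {...}}}}}
    """
    missing = []
    for qid, entity in entities_response.get("entities", {}).items():
        if wiki not in entity.get("sitelinks", {}):
            missing.append(qid)
    return sorted(missing)


# Hand-written stand-in for an API response; the IDs and titles are placeholders.
example_response = {
    "entities": {
        "Q1001": {"sitelinks": {"enwiki": {"title": "Example plant"}}},
        "Q1002": {"sitelinks": {"nlwiki": {"title": "Voorbeeldplant"}}},
        "Q1003": {},
    }
}
candidates = items_missing_article(example_response)
```

Each ID in `candidates` is then a prospective article to write in the chosen language edition.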