Burr: A Benchmark for Ontology Learning from Relational Databases

Knowledge graphs and ontologies play an essential role in integrating, standardizing, and reasoning about complex data across domains. Leveraging knowledge graphs in AI use cases, instead of traditional relational databases, leads to quality improvements by up to 38 percentage points. However, learning ontologies from relational databases remains a challenging task due to the impedance mismatch between both modeling concepts. An understanding of which ontology learning system performs best, and why, is missing, as no standardized evaluation has been conducted. We present Burr1, a benchmark for evaluating ontology learning systems from relational databases. To evaluate the ontology learning space, we introduce a novel mapping-based evaluation metric, and provide a comprehensive benchmark data collection. This collection of 46 scenarios consists of real-world database-ontology mappings, including industry data from SAP, and of a micro-benchmark evaluating the behavior of systems in encapsulated scenarios. We demonstrate the applicability of Burr by evaluating three widely used ontology learning systems using the benchmark. The results emphasize the current strengths of simple rule-based approaches compared to LLM-based systems, while also highlighting the significant research potential of LLMs in ontology learning.

Reproducibility

To reproduce our results, please execute the command docker-compose up --build. All experiments will run automatically.

Benchmark files

This section describes the structure of the benchmark files and how to use them. The micro benchmark is split into multiple parts, each having one folder. The files can be found in folder train_data. The real world databases and their mappings can be found in the folder real_world. Each test scenario consists of two files:

SQL-File This file includes the definition of the database and in most cases some instance data
Mapping file This file represents the mapping of the database to the ontology. For better readibility, we decided to choose a Json format, which is automatically translated to D2RQ. It can be easily translated to further mapping languages, such as R2RML.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.idea		.idea
.vscode		.vscode
evaluator		evaluator
experimental_evaluation		experimental_evaluation
hamilton		hamilton
mapping_converter		mapping_converter
output		output
paper		paper
real-world		real-world
statistics		statistics
tests		tests
train_data		train_data
.DS_Store		.DS_Store
.gitignore		.gitignore
Dockerfile		Dockerfile
Output.ttl		Output.ttl
README.md		README.md
benchmark_architecture.png		benchmark_architecture.png
convertR2RMLtod2rq.py		convertR2RMLtod2rq.py
convertd2rqToR2RML.py		convertd2rqToR2RML.py
d2rqoutput.ttl		d2rqoutput.ttl
docker-compose.yml		docker-compose.yml
environment.yml		environment.yml
eval.py		eval.py
graph.ttl		graph.ttl
groundtruth.ttl		groundtruth.ttl
json_data.json		json_data.json
main.py		main.py
mapping.ttl		mapping.ttl
mapping_chatgpt.ttl		mapping_chatgpt.ttl
mapping_cmt_mixed.ttl		mapping_cmt_mixed.ttl
mapping_test_1.ttl		mapping_test_1.ttl
mapping_test_2.ttl		mapping_test_2.ttl
mondial.ttl		mondial.ttl
mondial_rdb2onto.ttl		mondial_rdb2onto.ttl
ontology.owl		ontology.owl
ontology.rdf		ontology.rdf
ontology.ttl		ontology.ttl
ontology_learning.pdf		ontology_learning.pdf
ontology_learning.png		ontology_learning.png
parser.ipynb		parser.ipynb
parser.py		parser.py
r2rml.ttl		r2rml.ttl
r2rmloutput.ttl		r2rmloutput.ttl
r2rmlparser.py		r2rmlparser.py
rdb2onto.ttl		rdb2onto.ttl
requirements.txt		requirements.txt
setup.py		setup.py
test.ttl		test.ttl
test_metrics.py		test_metrics.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Burr: A Benchmark for Ontology Learning from Relational Databases

Reproducibility

Benchmark files

Architecture

About

Releases

Packages

Languages

HPI-Information-Systems/burr

Folders and files

Latest commit

History

Repository files navigation

Burr: A Benchmark for Ontology Learning from Relational Databases

Reproducibility

Benchmark files

Architecture

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages