How Effective are LLMs at Generating Accurate Data Descriptions?

Disclaimer: This is a research prototype. The code and data in this repository are provided as-is, without any warranty or guarantee of correctness. Use at your own risk.

How Effective are LLMs at Generating Accurate Data Descriptions?

We tasks 7 LLMs with producing specifications for 20 data formats. results/ contains the generated files---without any annotations on whether these files compile.

Install necessary dependencies (pip packages for the LLM libraries)
Install the runtime libraries for the DSLs
- Install Kaitai Struct's compiler
- Download Apache Daffodil binaries
- Build the Spicy tools from Source: Spicy Documentation
- Build Galois' DaeDaLus tooling from source: DaeDaLus
- Clone the repository and build Hammer
```
git clone https://github.com/UpstandingHackers/hammer.git
cd hammer
scons
sudo scons install
```
Add API keys as environment variables.
- Together API to invoke the Llama and Deepseek LLMs.
- OpenAI
- Claude
- Gemini

export GOOGLE_API_KEY=""
export ANTHROPIC_API_KEY=""
export OPENAI_API_KEY=""
export TOGETHER_API_KEY=""

Components

Together AI and Gemini are compatible with OpenAI's Python library.

Dockerfile builds a docker image with all of these compilers or executables installed.
options.json contains the paths of the various DDL executables and the list of formats and their specification versions.
test.db contains the final database with two entire runs, where the run labeled "888" was the most recent run.
Run the script to generate the DSLs using LLM queries python3 create_dsls.py
To generate the set of figures and tables used in the paper python3 analyzer.py
Test the generated library code by running a corpus of files through them. This command needs a folder containing files per format. python3 compare-parsers.py

DSLs supported:

Acknowledgments

This work was supported in part by DOE NETL (DE-CR0000017) and the ARPA-H DIGIHEALS (Contract No. SP4701-23-C-0089). The views, opinions, and/or findings expressed are those of the author(s) and should not be interpreted as representing the official views or policies of DOE, ARPA-H, or the U.S. Government.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
cargo_template		cargo_template
figs		figs
results		results
sample_specifications		sample_specifications
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile		Dockerfile
LLMFormatGeneration.py		LLMFormatGeneration.py
README.md		README.md
analyzer.py		analyzer.py
compare-parsers.py		compare-parsers.py
compile.py		compile.py
create_dsls.py		create_dsls.py
db.py		db.py
options.json		options.json
requirements.txt		requirements.txt
test.db		test.db

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How Effective are LLMs at Generating Accurate Data Descriptions?

Components

DSLs supported:

Acknowledgments

About

Releases

Packages

Contributors 2

Languages

narfindustries/llm-tests-langsec

Folders and files

Latest commit

History

Repository files navigation

How Effective are LLMs at Generating Accurate Data Descriptions?

Components

DSLs supported:

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages