Security Evaluation of Code Generated by Large Language Models

This repository holds the input data sets, the CodeQL queries used, and the generated programs and analysis results of the experiments conducted in my Master's Thesis submitted on the 17th of April 2023.

Data

In the directories of this repository, the following data can be found:

The raw set of natural language specifications of vulnerability-prone programs: Natural Language Prompts

The set of language-agnostic natural language specifications of the same vulnerability-prone programs: Language Agnostic Prompts

The CodeQL query set curated for covering the a subset of Python-possible vulnerabilities of the Top 25 CWEs of 2021 and 2022: CodeQL Query Set

The program synthesised by the LLMs for all four experimental setups: Programs

And finally the CodeQL Result files yielded when evaluating the query set on those programs: Query Results

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
CodeQL Queries/top25		CodeQL Queries/top25
CodeQL Results		CodeQL Results
Natural Language Prompts		Natural Language Prompts
Synthesised Programs		Synthesised Programs
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Security Evaluation of Code Generated by Large Language Models

Data

About

Releases

Packages

Languages

mutasDev/sec-eval-llms

Folders and files

Latest commit

History

Repository files navigation

Security Evaluation of Code Generated by Large Language Models

Data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages