The virtual assistant application is powered by LLMs, using FastAPI for the backend and a simple Tailwind CSS UI.
In this project, I used LangChain to build a conversation chain with memory on top of OpenAI's GPT-3.5-turbo-1106 model, as well as a GPT-3.5-turbo-1106 model fine-tuned for a specific set of documents/domain.
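As a rough illustration, such a conversation chain with memory can be wired up in a few lines of LangChain. The snippet below is a minimal sketch under those assumptions (variable names, temperature, and prompts are illustrative, not the project's actual code):

```python
# Minimal sketch of a LangChain conversation chain with memory.
# Assumes OPENAI_API_KEY is set in the environment; names/parameters are illustrative.
from langchain.chat_models import ChatOpenAI
from langchain.chains import ConversationChain
from langchain.memory import ConversationBufferMemory

llm = ChatOpenAI(model_name="gpt-3.5-turbo-1106", temperature=0)
chain = ConversationChain(llm=llm, memory=ConversationBufferMemory())

print(chain.predict(input="What is FastAPI?"))
print(chain.predict(input="Summarize your previous answer in one sentence."))  # memory keeps the first turn
```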
- Dataset generation and fine-tuning of OpenAI's GPT models for a specific set of documents/domain in `models`.
- Q/A chat with conversation memory using LangChain and OpenAI's GPT models.
- FastAPI WebSocket endpoint for the backend (see the sketch after this list).
- Simple Tailwind CSS for the UI.
- Dockerized application using Docker Compose.
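For reference, a FastAPI WebSocket chat endpoint follows the general shape below; the route name and the echo reply are placeholders rather than the project's real implementation:

```python
# Minimal sketch of a FastAPI WebSocket chat endpoint (route and reply are placeholders).
from fastapi import FastAPI, WebSocket, WebSocketDisconnect

app = FastAPI()

@app.websocket("/ws/chat")
async def chat(websocket: WebSocket):
    await websocket.accept()
    try:
        while True:
            question = await websocket.receive_text()
            # In the real app, the question would be passed to the LangChain conversation chain.
            await websocket.send_text(f"echo: {question}")
    except WebSocketDisconnect:
        pass  # client closed the connection
```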
llm-playground
├───data <- models downloaded for the local LLM backends
├───models <- dataset generation and fine-tuning of GPT models (see the fine-tuning sketch below the tree).
├───src <- chat application using FastAPI with a WebSocket endpoint to interact with the GPT models.
│ ├───api <- define router endpoints.
│ ├───integrations <- integrate LLM backends such as OpenAI, llama.cpp, or intel-extension-for-transformers.
│ ├───schemas <- define schemas used in the project.
│ ├───templates <- Tailwind CSS UI.
│ └───utils <- utility scripts.
└───tests <- unit tests for the project.
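The `models` directory handles dataset generation and fine-tuning. For context, starting an OpenAI fine-tuning job on a chat-format JSONL dataset looks roughly like this sketch (the file name `train.jsonl` is illustrative, not necessarily what the `models` scripts produce; shown with the `openai>=1.0` client):

```python
# Minimal sketch of launching an OpenAI fine-tuning job (openai>=1.0 client).
# `train.jsonl` is an illustrative name; each line holds one chat-format example, e.g.
# {"messages": [{"role": "user", "content": "..."}, {"role": "assistant", "content": "..."}]}
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

training_file = client.files.create(file=open("train.jsonl", "rb"), purpose="fine-tune")
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo-1106",
)
print(job.id, job.status)  # poll the job until it finishes, then use the resulting model name
```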
- Python 3.10
- Docker & Docker Compose (Get Docker)
- OpenAI API key
git clone https://github.com/michaelnguyen11/llm-playground.git
cd llm-playground
cp .env.template .env
# Edit your .env file
The easiest way to get started is with Docker Compose, which builds and runs the Dockerized application.
docker-compose build
docker-compose up -d
Then, navigate to http://0.0.0.0:8080 to chat with the Q/A virtual assistant.
Note: The LlamaCpp backend is not yet supported with Docker Compose (Reference); I will find a workaround later.
It is encouraged to run the project in a virtual environment such as venv or conda. To install the dependency packages, use the command:
pip install -r requirements.txt
To launch the server, use the command:
uvicorn src.main:app --host 0.0.0.0 --port 8080 --reload
Then, navigate to http://0.0.0.0:8080 to chat with the Q/A virtual assistant.
To download models for the LlamaCpp backend, navigate to `data` and run the command:
./download_models.sh
Then, change `ENDPOINT_TYPE` in `.env` (e.g. `ENDPOINT_TYPE=llamacpp`) to select either the `openai` or `llamacpp` WebSocket endpoint backend.
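Once a model file is in `data`, the `llamacpp` backend can load it locally. The snippet below is a sketch using LangChain's LlamaCpp wrapper; the model filename and parameters are placeholders, not the project's defaults:

```python
# Minimal sketch of loading a local GGUF model with LangChain's LlamaCpp wrapper
# (requires llama-cpp-python; the filename and parameters are placeholders).
from langchain.llms import LlamaCpp

llm = LlamaCpp(
    model_path="data/llama-2-7b-chat.Q4_K_M.gguf",  # a file fetched by the download script
    n_ctx=2048,        # context window size
    temperature=0.1,
)
print(llm("Q: What is a WebSocket? A:"))
```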
- Improve the data generation pipeline
- Explore more methods to evaluate fine-tuned LLM models.
- Implement RAG with LangChain to augment LLM knowledge with additional data (rough sketch after this list)
- Multiple LLM backends for local LLMs, optimized for specific hardware:
  - llama-cpp-python: macOS platforms
  - intel-extension-for-transformers: Intel platforms
  - TensorRT-LLM: Nvidia platforms
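The RAG roadmap item could take roughly the following shape with LangChain; everything here (file names, chunk sizes, the FAISS vector store) is a hypothetical sketch, not existing project code:

```python
# Hypothetical sketch of a LangChain RAG pipeline (requires faiss-cpu; file name is illustrative).
from langchain.chat_models import ChatOpenAI
from langchain.chains import RetrievalQA
from langchain.document_loaders import TextLoader
from langchain.embeddings import OpenAIEmbeddings
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.vectorstores import FAISS

docs = TextLoader("docs/knowledge.txt").load()  # load the additional domain data
chunks = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50).split_documents(docs)
store = FAISS.from_documents(chunks, OpenAIEmbeddings())  # embed and index the chunks

qa = RetrievalQA.from_chain_type(
    llm=ChatOpenAI(model_name="gpt-3.5-turbo-1106"),
    retriever=store.as_retriever(),
)
print(qa.run("What does the document say about deployment?"))
```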