# Llama 3.1 Self-Deploy

A work-in-progress project that runs Ollama locally for RAG purposes. You interact with the model through a REST API.

llama3.1-local-demo.mp4

## Installation

### Prerequisites

- Docker installed
- Ollama installed

### Download the model locally

```
ollama run llama3.1:8b
```
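Besides the interactive prompt, `ollama run` also leaves Ollama's local REST API available (port 11434 by default), which is what a service layer typically calls. A quick Python smoke test using `requests`:

```python
import requests

# Ollama's built-in REST API listens on localhost:11434 by default.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama3.1:8b",
        "prompt": "Reply with one short sentence.",
        "stream": False,  # ask for a single JSON object instead of a stream
    },
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["response"])  # the model's completion text
```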

### Run Redis

```
docker run --name redis-container -p 6379:6379 -d redis
```
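The llm-service presumably talks to this Redis instance (the README doesn't spell out what it is used for). To verify the container is reachable before starting the service, here is a quick check with the redis-py client; treating redis-py as an assumed dependency, not one confirmed by the repo:

```python
import redis  # redis-py client; assumed here for illustration

# Connect to the redis-container started above (default port 6379).
r = redis.Redis(host="localhost", port=6379, db=0)
print(r.ping())  # True when the container is up and accepting connections
```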

### Run the LLM service

```
cd llm-service
pip install -r requirements.txt
uvicorn llm:app --reload
```
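`uvicorn llm:app --reload` imports the module `llm` (i.e. `llm.py`) and serves the ASGI object named `app`, restarting on file changes. As orientation only, a minimal sketch of the shape uvicorn expects; this is not the repository's actual code, and the `/health` route is hypothetical:

```python
from fastapi import FastAPI  # assuming FastAPI, which auto-serves the /docs UI

# "uvicorn llm:app" resolves to this object: module "llm", attribute "app".
app = FastAPI()

@app.get("/health")  # hypothetical endpoint, for illustration only
def health() -> dict:
    return {"status": "ok"}
```

Assuming the service is FastAPI-based, the `/docs` URL below is its automatically generated Swagger UI.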

Then open http://127.0.0.1:8000/docs for the interactive API documentation.

### Run the UI frontend

1. Change into the frontend directory:

   ```
   cd frontend
   ```

2. Install the dependencies:

   ```
   npm install
   ```

3. Run the dev server:

   ```
   npm run dev
   ```

Then open http://localhost:3000 in your browser.