Skip to content

Latest commit

 

History

History
12 lines (9 loc) · 836 Bytes

README.md

File metadata and controls

12 lines (9 loc) · 836 Bytes

Falcon7B-Chat

contributions welcome

Python pytorch

This is an attempt to configure a chatbot using the falcon-7b-instruct parameter model to run locally on a machine with <8Gb VRAM. Using 4 bit quantization to reduce memory load. Source to guide.

Set up

  1. Download docker
  2. docker build -t chatbot-image:latest -f docker/Dockerfile .
  3. docker run -it --gpus all -v $(pwd):/workspace chatbot-image:latest bash
  4. ...