The open-source NLP library to craft the minds of your NPC characters for your video games.
Requires Python 3.10 or higher.
It includes the following features:
- Text generation using LLMs
- Motivations, personality, personal goals
- Knowledge and awareness about the world (RAG)
- Short and Long-term memory (RAG)
- Conversational styles
- Supervised finetuning by human feedback (SFT)
- Integration with vLLM for fast inference and streaming locally, remotelly or in Docker.
- Usage of quantized AWQ models
- Integration with API and RPC (to come!)
from mindcraft.infra.engine.llm_types import LLMType
from mindcraft.infra.vectorstore.stores_types import StoresTypes
from mindcraft.infra.embeddings.embeddings_types import EmbeddingsTypes
from mindcraft.lore.world import World
world = World(world_name="Middle Earth from the Lord of the Rings",
embeddings=EmbeddingsTypes.MINILM,
store_type=StoresTypes.CHROMA,
llm_type=LLMType.YI_6B_AWQ,
fast=False, # <--- fast=True to switch on vLLM
remote=False, # <--- remote=True to use a remote vLLM server
streaming=True) # <--- streaming=True to use vLLM streaming
Now we use some book to carry out chunk splitting and add it to our favourite Vector Database (in our case, ChromaDB)
from mindcraft.infra.splitters.text_splitters_types import TextSplitterTypes
world.book_to_world(book_path="/content/lotr1.txt",
text_splitter=TextSplitterTypes.SENTENCE_SPLITTER,
max_units=3,
overlap=1,
encoding='utf-8')
Once a world has been created and populated with lore, query the lore known by NPCs by doing:
results = world.get_lore("What do you think about the Rings of Power?", num_results=5, min_similarity=0.95)
for i, d in enumerate(results.documents):
print(f"SENTENCE {i}:\n{d}")
print()
Once a world has been created and populated with lore, instantiate an NPC in it by doing:
from mindcraft.features.motivation import Motivation
from mindcraft.features.personality import Personality
from mindcraft.features.mood import Mood
from mindcraft.mind.npc import NPC
name = "Galadriel"
description = "The Elven Queen of Lothlorien, bearer of Nenya, wife to Celeborn"
personalities = [Personality(x) for x in ['fair', 'mighty', 'wise', 'carying', 'kind']]
motivations = [Motivation(x) for x in ['Destroying the Evil', 'Protecting Middle Earth', 'Travelling West']]
mood = Mood('worried')
galadriel = NPC(name,
description,
personalities,
motivations,
mood,
StoresTypes.CHROMA,
EmbeddingsTypes.MINILM)
galadriel.add_npc_to_world()
We get an iterator for the responses, to allow inference in streaming way.
answer_iter = galadriel.react_to("What do you think about the Rings of Power",
min_similarity=0.85,
ltm_num_results=3,
world_num_results=7,
max_tokens=600)
So your answers will be in the iterator, don't forget to loop through it!
for answer in answer_iter:
print(answer)
Alas, my dear friend, the Rings of Power are a perilous burden that should not be wielded by those of mortal race. They possess a sinister force that overcomes the spirit, enslaving their very soul. I have observed their destructive potential in Eregion long ago, where many Elven-rings were created, each infused with its own unique potency. Some more potent than others, and often, they seem to have a propensity to gravitate towards the unsuspecting. In light of our shared concern for the wellbeing of Middle Earth, I implore you to heed my words; let us not succumb to the allure of these fateful rings, lest they consume us entirely.
There are two loops integrated in the framework which allow you to create your own datasets.
NPC.extract_conversational_styles_from_world
: Retrieving conversations from the world and tagging the question and mood yourselfNPC.extract_conversational_styles_talking_to_user
: Creates conversations by talking to you
You can create your own datasets with supervised finetuning, accepting or rejecting the interactions. As a result, you will come up with a csv of interactions you can use to train or finetune your own models.
name||mood||question||answer
Galadriel||default||Good night, Galadriel!||'Good night, my friends! '\nsaid Galadriel. '\nSleep in peace!
Galadriel||grave||why he could say that?||....`He would be rash indeed that said that thing,' said Galadriel gravely.
- TheBloke/mistral_7b_norobots-AWQ
- TheBloke/zephyr-7B-beta-AWQ
- TheBloke/notus-7B-v1-AWQ
- TheBloke/Starling-LM-7B-alpha-AWQ
- TheBloke/Yi-6B-AWQ
- TheBloke/dragon-yi-6B-v0-AWQ
- microsoft/phi-2
- stabilityai/stablelm-zephyr-3b
If you are running on Windows on a machine with a GPU, and you get a message about not being able to find your gpu, you need to configure CUDA for Windows.
- Go to CUDA installation webpage.
- Select your Windows version and specifics.
- Download and install
- Uninstall torch (
pip uninstall torch
) - Run the following command:
pip3 install torch -i https://download.pytorch.org/whl/cu121
- Restart your IDE (better if you restart your computer)
You torch on windows CUDA should be working. To test it:
import torch
if __name__ == "__main__":
print(torch.cuda.is_available())
You should get True
as a response if everything is working.
vLLM
has been included for Fast Inference, in local, remote installations and Docker.
To use fast-inference, just run add fast=True
to your World
object:
world = World(world_name="Lord of the Rings",
embeddings=EmbeddingsTypes.MINILM,
store_type=StoresTypes.CHROMA,
llm_type=LLMType.YI_6B_AWQ,
fast=True) # <---- HERE
To the previous fast
parameter, add also remote=True
world = World(world_name="Lord of the Rings",
embeddings=EmbeddingsTypes.MINILM,
store_type=StoresTypes.CHROMA,
llm_type=LLMType.YI_6B_AWQ,
fast=True, # <---- HERE
remote=True) # <---- HERE
To the previous fast
and remote
parameters, add also streaming=True
world = World(world_name="Lord of the Rings",
embeddings=EmbeddingsTypes.MINILM,
store_type=StoresTypes.CHROMA,
llm_type=LLMType.YI_6B_AWQ,
fast=True, # <---- HERE
remote=True,
streaming=True) # <---- HERE
You can find notebooks in the notebooks
folder of this project.
python -m pytest tests/*