Chatbot that remembers conversation history #64

Closed
rchan26 opened this issue Aug 31, 2023 · 3 comments

rchan26 commented Aug 31, 2023

In the current llama-index setup, the conversation history is not tracked, so each question independently queries the database for an answer. It would be interesting to investigate how we can have a conversation with the data (multiple back-and-forth exchanges instead of a single question and answer).

Looking at the llama-index documentation, it seems to have some ability to do this: https://gpt-index.readthedocs.io/en/latest/core_modules/query_modules/chat_engines/root.html

Would need to replace the query_engine calls with chat_engine (a sketch of this swap is below). Would also need to play around with something like the ReAct Agent (llama-index has a few implemented), which decides how the chatbot will interact with the database during the conversation.
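A minimal sketch of what the swap could look like, assuming documents have already been loaded into a vector index (API as in llama-index ~0.8, per the docs linked above; the `"data"` path and the question strings are placeholders):

```python
from llama_index import SimpleDirectoryReader, VectorStoreIndex

# Build an index over our documents ("data" is a placeholder path).
documents = SimpleDirectoryReader("data").load_data()
index = VectorStoreIndex.from_documents(documents)

# Current approach: every call is an independent query against the index.
query_engine = index.as_query_engine()
print(query_engine.query("What is the project about?"))

# Proposed approach: the chat engine keeps the conversation history,
# so follow-up questions can build on earlier turns.
chat_engine = index.as_chat_engine()
print(chat_engine.chat("What is the project about?"))
print(chat_engine.chat("Can you expand on that?"))  # follow-up uses history
```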


rwood-97 commented Sep 1, 2023

See here for examples of using the chat engine.
The 'condense_question' and 'context' modes seem to be the ones which force llama2 to use the query engine (i.e. answer from the database data rather than just pre-trained/pre-existing knowledge); see the sketch below.
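For reference, a minimal sketch of selecting these two modes, reusing the `index` from the snippet above (the question string is a placeholder):

```python
# Both modes ground answers in the index rather than in the LLM's
# pre-trained knowledge, but they use the index differently.
condense_engine = index.as_chat_engine(chat_mode="condense_question")
context_engine = index.as_chat_engine(chat_mode="context")

print(condense_engine.chat("What data sources does the project use?"))
print(context_engine.chat("What data sources does the project use?"))
```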

@rwood-97
Copy link
Contributor

rwood-97 commented Sep 1, 2023

Have played around with this a bit more in a new notebook here.

I think 'context' basically retrieves a load of context info from our database and then uses that to answer the question (i.e. the model is called once per 'chat', and it's essentially "here's a load of context, can you answer this").
'condense_question' seems to be more like just using the query engine (i.e. the model is called multiple times, once for each piece of context). For the first query I think it is basically the same as the query engine. But if you follow up, it 'condenses' your follow-up question with the chat history and then uses that as the new query for the query engine.

Overall, I think 'context' mode seems better.
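A small sketch of what a multi-turn conversation in 'context' mode looks like (again reusing `index` from the earlier snippets; the questions are placeholders):

```python
# Each turn retrieves context from the index, and the engine keeps the
# running chat history so follow-ups can refer back to earlier answers.
chat_engine = index.as_chat_engine(chat_mode="context")

print(chat_engine.chat("Summarise the documentation in the database."))
print(chat_engine.chat("Which part of that summary covers installation?"))
chat_engine.reset()  # start a fresh conversation, discarding the history
```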


rchan26 commented Sep 11, 2023

Some examples of using the chat engine from #66. Will continue using the "context" engine as it seems the most consistent.

ReAct seems to be quite volatile and doesn't always make the best decisions about whether or not to use the query engine. But it is noted here that this really depends on the quality of the LLM. We do get better performance using 13B models over 7B (quantized) ones, so perhaps it could be better in the future if we have access to higher-quality quantized LLMs. A sketch of inspecting its decisions is below.
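A minimal sketch for watching what the ReAct engine decides to do, assuming the same `index` as in the earlier snippets:

```python
# verbose=True prints the agent's reasoning steps, which makes it easy
# to see whether it actually chose to call the query engine on a turn.
react_engine = index.as_chat_engine(chat_mode="react", verbose=True)
print(react_engine.chat("What does the documentation say about setup?"))
```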

While working on this, we noticed an issue with the prompt creation in the chat engine. This has been fixed in this PR by @rwood-97 and me.
