
How to handle long conversation history #26

Open
cahuja1992 opened this issue Nov 17, 2023 · 3 comments

@cahuja1992

  1. As the history grows, summarizing the conversation would be useful.
  2. If a question only asks to reformat the previous response, or otherwise doesn't need further retrieval, shouldn't we detect whether retrieval is needed at all?
placerda transferred this issue from Azure/gpt-rag-orchestrator Nov 20, 2023
placerda transferred this issue from Azure/GPT-RAG Nov 20, 2023
@placerda
Collaborator

Thanks @cahuja1992

> 1. As the history grows, summarizing the conversation would be useful.

This makes sense, but it would add one more call to the LLM. At what point in the conversation do you think it would make sense to start summarizing?
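
One rough way to answer "how long": count the history's tokens and trigger summarization at a fraction of the model's context window. A minimal sketch, assuming tiktoken is available; the context-window size and helper names are illustrative, and the 90% figure is the one proposed later in this thread:

```python
import tiktoken

CONTEXT_WINDOW = 16_384   # illustrative token budget for the chat model
SUMMARIZE_AT = 0.9        # the 90% threshold proposed below

def history_tokens(history: list[dict]) -> int:
    """Rough token count of the chat history (role + content per turn)."""
    enc = tiktoken.encoding_for_model("gpt-3.5-turbo")
    return sum(
        len(enc.encode(turn["role"])) + len(enc.encode(turn["content"]))
        for turn in history
    )

def should_summarize(history: list[dict]) -> bool:
    """True once the history uses 90% of the token budget."""
    return history_tokens(history) >= SUMMARIZE_AT * CONTEXT_WINDOW
```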

> 2. If a question only asks to reformat the previous response, or otherwise doesn't need further retrieval, shouldn't we detect whether retrieval is needed at all?

Now we have a triage function that does exactly this.
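
For illustration, a generic version of such a triage step could ask the LLM itself whether retrieval is needed. This is a sketch, not the repo's actual triage function: the prompt, model name, and use of the OpenAI Python client are assumptions (GPT-RAG targets Azure OpenAI, whose client setup differs):

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

TRIAGE_PROMPT = (
    "Answer strictly 'yes' or 'no': does the user's latest question require "
    "retrieving new documents, or can it be answered from the conversation "
    "alone (e.g. reformatting the previous response)?"
)

def needs_retrieval(history: list[dict], question: str) -> bool:
    """Illustrative triage call: classify whether the question needs retrieval."""
    messages = [
        {"role": "system", "content": TRIAGE_PROMPT},
        *history,
        {"role": "user", "content": question},
    ]
    reply = client.chat.completions.create(model="gpt-4o-mini", messages=messages)
    return reply.choices[0].message.content.strip().lower().startswith("yes")
```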

@cahuja1992
Author

cahuja1992 commented Nov 28, 2023

@placerda Since the chat history is passed to the triage function, we will eventually hit the token limit there. One idea: once we reach 90% of the token limit, spawn a chat-summarization thread so the summary can be used for the next question.
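
A minimal sketch of this mechanism, reusing the `should_summarize` helper from the earlier sketch; the class, field names, and the choice to keep only the last four turns alongside the summary are all illustrative assumptions:

```python
import threading

class HistoryStore:
    """Holds the chat history plus an optional running summary."""

    def __init__(self):
        self.history: list[dict] = []
        self.summary: str | None = None
        self._lock = threading.Lock()

    def maybe_summarize(self, summarize_fn):
        """Spawn a background summarization thread at 90% of the token limit."""
        if should_summarize(self.history):  # helper from the earlier sketch
            threading.Thread(
                target=self._run, args=(summarize_fn,), daemon=True
            ).start()

    def _run(self, summarize_fn):
        text = summarize_fn(self.history)  # the one extra LLM call
        with self._lock:
            self.summary = text
            self.history = self.history[-4:]  # keep recent turns with the summary
```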

@cahuja1992
Author

> Since the chat history is passed to the triage function, we will eventually hit the token limit there. One idea: once we reach 90% of the token limit, spawn a chat-summarization thread so the summary can be used for the next question.

@placerda Any thoughts on this approach? If we proactively keep summarizing the conversation, we shouldn't even see a significant difference in latency.
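
Hypothetical wiring for the proactive variant: summarization is kicked off after each answered turn, off the request path, so the next triage call sees a short summary plus recent turns instead of the full history:

```python
store = HistoryStore()  # from the sketch above

def on_turn_completed(question: str, answer: str, summarize_fn):
    """Record the turn, then (maybe) summarize in the background."""
    store.history.append({"role": "user", "content": question})
    store.history.append({"role": "assistant", "content": answer})
    store.maybe_summarize(summarize_fn)  # non-blocking; summary is ready next turn
```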
