Fix error after second start_chat() for StatefulLLMPipeline #1684
base: master
Conversation
if (have_state) {
    m_model_runner.reset_state();
    m_model_runner.get_tensor("attention_mask").set_shape({1, 0});
}
You have a similar block below:
if (!m_tokenized_chat_history.empty()) {
reset_kv_state();
m_history = {};
m_templated_chat_history.clear();
m_tokenized_chat_history.clear();
}
I propose to change the condition of the old block to if (have_state) and reset everything there.
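A minimal sketch of what that merged block could look like, reusing the member names visible in the two snippets above (this is only an illustration of the suggestion, not the final patch):

// single condition: drop request state and all chat bookkeeping together
if (have_state) {
    reset_kv_state();
    m_model_runner.get_tensor("attention_mask").set_shape({1, 0});
    m_history = {};
    m_templated_chat_history.clear();
    m_tokenized_chat_history.clear();
}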
Should a similar thing be performed at the end of generate() when is_chat_conversation is false? It would be better to perform it at the beginning of generate() to improve exception safety, but we already do it at the end everywhere else, so let's stay consistent.
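For illustration only, a sketch of how such a non-chat reset at the end of generate() might look; the guard and placement are assumptions reusing names from the diff above, not code from this PR:

// end of generate(): non-chat requests also drop the request state
if (!is_chat_conversation && have_state) {
    m_model_runner.reset_state();
    m_model_runner.get_tensor("attention_mask").set_shape({1, 0});
}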
Should start_chat call stop_chat first?
Updated.
In the chat scenario we rely on the existing attention_mask, and when is_chat_conversation is false a new attention_mask is explicitly used; that is why it doesn't crash for non-chat.
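A rough sketch of the two paths being described (simplified, not the pipeline's actual code; tensor and member names follow the diff above):

if (is_chat_conversation) {
    // chat: keep extending the attention_mask tensor already held by the request,
    // so it must have been reset correctly by the previous start_chat()
    ov::Tensor attention_mask = m_model_runner.get_tensor("attention_mask");
    // ... append ones for the newly generated tokens ...
} else {
    // non-chat: a fresh attention_mask is built from the tokenized input,
    // so stale chat state cannot make this branch crash
    ov::Tensor attention_mask{ov::element::i64, input_ids.get_shape()};
    std::fill_n(attention_mask.data<int64_t>(), attention_mask.get_size(), 1);
    m_model_runner.set_tensor("attention_mask", attention_mask);
}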
if (have_state) {
    m_model_runner.reset_state();
    m_model_runner.get_tensor("attention_mask").set_shape({1, 0});
}
I think the request to add a test is valid. You can take the idea from #1674 (comment)
I think we should also cover the case when the user calls the next start_chat without stop_chat for the current one. IMO, it should automatically stop the current chat.
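A sketch of the behavior being asked for, assuming finish_chat() is the counterpart of start_chat() (member and method names here are illustrative, not the PR's code):

void StatefulLLMPipeline::start_chat(const std::string& system_message) {
    if (is_chat_conversation)
        finish_chat();              // implicitly stop the previous chat
    is_chat_conversation = true;
    // ... apply system_message to the chat template, clear histories, etc. ...
}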
Cases with a system_message, with a double start_chat, and with several chats one after another are added.
If I understand right, the tests now run for the PA backend; should a run for SDPA be added too?
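A rough C++ sketch of the scenarios those tests exercise through the public API (the model path and prompts are placeholders, not the actual test code added in this PR):

#include <string>
#include "openvino/genai/llm_pipeline.hpp"

int main() {
    std::string models_path = "<path to an exported model>";  // placeholder
    ov::genai::LLMPipeline pipe(models_path, "CPU");

    // chat with a system message
    pipe.start_chat("You are a helpful assistant.");
    pipe.generate("What is 2 + 2?", ov::genai::max_new_tokens(20));

    // second start_chat() without finish_chat() for the previous one
    pipe.start_chat();
    pipe.generate("And 3 + 3?", ov::genai::max_new_tokens(20));
    pipe.finish_chat();

    // several chats one after another
    pipe.start_chat();
    pipe.generate("Hello!", ov::genai::max_new_tokens(20));
    pipe.finish_chat();
}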
If I understand right, the tests now run for the PA backend; should a run for SDPA be added too?

They will be added as part of CVS-159925.
For now, you can check locally that the tests pass on the SDPA path by changing the default backend here:
std::string attention_backend = PA_BACKEND;
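For that local check, the default would be flipped roughly like this (assuming the SDPA constant follows the same naming convention, e.g. SDPA_BACKEND):

std::string attention_backend = SDPA_BACKEND;  // local change only, not to be committed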
Does the current PR fix #1663?
The reason looks like the point which Vladimir mentioned above: if execution is interrupted and generation does not complete correctly, this error will occur. I moved the resets to the beginning; this also fixes similar behavior for phi-3.
Force-pushed from 77c2688 to ac2474c.
Ticket: CVS-161902