Chat: unify keyword rewrite and extraction #6767

jtibshirani · 2025-01-23T04:33:35Z

In #6655 we introduced a method for extracting keywords from chat questions that look like simple searches. Having looked into its prompt, I think it's actually an improvement over our current prompt in rewriteKeywordQuery. It also fixes a bug where we failed to parse the response when it contained a single keyword. This PR switches to the new extractKeywords method to unify our approach.

Other changes:

Make the "should rewrite keyword" heuristic more lenient: we now also rewrite whenever the question includes a newline
Update Cody chat bench to include the rewritten query, so we can visualize it in cody-leaderboard

Test plan

Adapted unit tests. Reran context evals -- see next comment for results.

jtibshirani · 2025-01-23T04:43:40Z

Usually we run the context evals without query rewriting. I enabled rewriting, then ran the baseline vs. the changes in this PR. The results are extremely similar.

Moreover, when I examined the rewritten queries, they tended to be less noisy and to better preserve symbols from the query.

Aside: I think we should run evals with query rewriting enabled by default, to better match the chat experience.

jtibshirani · 2025-01-23T04:45:44Z

vscode/src/local-context/rewrite-keyword-query.ts

@@ -10,7 +10,7 @@ import { outputChannelLogger } from '../output-channel-logger'

 import { francAll } from 'franc-min'

-const containsMultipleSentences = /[.!?][\s\r\n]+\w/
+const containsMultipleSentences = /[\n\r]|[.!?]\s*\w/


In a follow-up, I'll look into removing this heuristic altogether. Previously in evals, we saw that rewriting tends to make performance much worse for simple queries. However, the new rewrite strategy is "lighter touch", so I'm hopeful it will work well even for short queries. It will be a nice behavior simplification to always rewrite, instead of having a heuristic here.
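To make the effect of the regex change concrete, here is a small sketch that runs both patterns from the diff above over a few queries. Only the two regular expressions come from the PR; the constant names, the `compareHeuristics` helper, and the sample queries are made up for illustration.

```typescript
// The two patterns are copied from the diff; the constant names are illustrative.
const oldContainsMultipleSentences = /[.!?][\s\r\n]+\w/
const newContainsMultipleSentences = /[\n\r]|[.!?]\s*\w/

// Made-up sample queries, not taken from the eval set.
const sampleQueries: string[] = [
    'symf index',
    'How does symf work? Where is it called',
    'find usages\nof extractKeywords',
    'rewrite-keyword-query.ts',
]

interface HeuristicResult {
    query: string
    oldMatch: boolean
    newMatch: boolean
}

// Hypothetical helper: reports whether each pattern matches each query.
function compareHeuristics(queries: string[]): HeuristicResult[] {
    return queries.map(query => ({
        query,
        oldMatch: oldContainsMultipleSentences.test(query),
        newMatch: newContainsMultipleSentences.test(query),
    }))
}

for (const result of compareHeuristics(sampleQueries)) {
    console.log(JSON.stringify(result))
}
```

With these samples, the short query matches neither pattern, the two-sentence query matches both, and the query containing a newline matches only the new pattern. The new pattern also matches a dot followed directly by a word character, so a bare file name like the last sample matches it too. Whether that side effect is intended is not stated here.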

jtibshirani · 2025-01-23T04:49:06Z

vscode/src/local-context/rewrite-keyword-query.ts

@@ -118,7 +60,7 @@ export async function extractKeywords(
                ...preamble,
                {
                    speaker: 'human',
-                    text: ps`You are helping the user search over a codebase. List terms that could be found literally in code snippets or file names relevant to answering the user's query. Limit your results to terms that are in the user's query. Present your results in a *single* XML list in the following format: <keywords><keyword>a single keyword</keyword></keywords>. Here is the user query: <userQuery>${query}</userQuery>`,
+                    text: ps`You are helping the user search over a codebase. List English terms that could be found literally in code snippets or file names relevant to answering the user's query. Limit your results to terms that are in the user's query. Present your results in a *single* XML list in the following format: <keywords><keyword>a single keyword</keyword></keywords>. Here is the user query: <userQuery>${query}</userQuery>`,


I had to add 'English' here to keep the nice behavior where we translate foreign languages to common English code terms. I was worried that the LLM might start excluding symbols or acronyms because of this, but I see no evidence of that... many rewritten queries still contain terms like symf, zoekt, etc.
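For readers following along, the prompt asks for a single XML list, and the PR description mentions that the old path failed to parse single-keyword responses. Below is a minimal sketch of a parser that treats one keyword and many keywords the same way. It is not the implementation in this PR: `parseKeywords` is a hypothetical helper, and it uses plain regular expressions rather than whatever XML handling the real code uses.

```typescript
// Hypothetical helper, not the code from this PR: extracts every <keyword>
// element from the first <keywords> list in an LLM response.
function parseKeywords(response: string): string[] {
    const list = /<keywords>([\s\S]*?)<\/keywords>/.exec(response)
    if (list === null) {
        // No list in the response: return no keywords rather than throwing.
        return []
    }

    const keywords: string[] = []
    const keywordPattern = /<keyword>([\s\S]*?)<\/keyword>/g
    let match: RegExpExecArray | null = keywordPattern.exec(list[1])
    while (match !== null) {
        const keyword = match[1].trim()
        // Skip empty entries and duplicates, keeping first-seen order.
        if (keyword.length > 0 && keywords.indexOf(keyword) === -1) {
            keywords.push(keyword)
        }
        match = keywordPattern.exec(list[1])
    }
    return keywords
}

// A single keyword and several keywords go through the same loop.
console.log(parseKeywords('<keywords><keyword>symf</keyword></keywords>'))
console.log(
    parseKeywords('Sure! <keywords>\n<keyword>symf</keyword>\n<keyword>zoekt</keyword>\n</keywords>')
)
```

Because the result is always built up as an array, a response with exactly one `<keyword>` element needs no special case.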

Chat: unify keyword rewrite and extraction

6ebda85
