Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

need a support for API of deepseek-R1 #3798

Open
3 tasks done
BeWater0qzy opened this issue Jan 21, 2025 · 0 comments
Open
3 tasks done

need a support for API of deepseek-R1 #3798

BeWater0qzy opened this issue Jan 21, 2025 · 0 comments
Assignees
Labels
kind:enhancement Indicates a new feature request, imrovement, or extension "needs-triage"

Comments

@BeWater0qzy
Copy link

Before submitting your bug report

Relevant environment info

- OS:Windows 11
- Continue version:0.9.253
- IDE version:VSCode 1.96.4
- Model:deepseek-R1
- config.json:
  

  {
  "completionOptions": {
    "BaseCompletionOptions": {
        "temperature": 0.0,
        "maxTokens": 256
    }
  },
  "models": [
    {
      "title": "DeepSeek",
      "model": "deepseek-chat",
      "contextLength": 128000,
      "apiKey": "",
      "provider": "deepseek",
      "apiBase": "https://api.deepseek.com/beta"
    },
    {
      "title": "DeepSeek-Reasoner",
      "model": "deepseek-reasoner",
      "systemMessage": "You are an expert software developer. You give helpful and concise responses.",
      "contextLength": 128000,
      "apiKey": "",
      "provider": "deepseek",
      "apiBase": "https://api.deepseek.com/beta"
    },
    {
      "title": "Claude 3.5 Sonnet (Free Trial)",
      "provider": "free-trial",
      "model": "claude-3-5-sonnet-latest",
      "systemMessage": "You are an expert software developer. You give helpful and concise responses."
    },
    {
      "title": "GPT-4o (Free Trial)",
      "provider": "free-trial",
      "model": "gpt-4o",
      "systemMessage": "You are an expert software developer. You give helpful and concise responses."
    },
    {
      "title": "Llama3.1 70b (Free Trial)",
      "provider": "free-trial",
      "model": "llama3.1-70b",
      "systemMessage": "You are an expert software developer. You give helpful and concise responses."
    },
    {
      "title": "Codestral (Free Trial)",
      "provider": "free-trial",
      "model": "codestral-latest",
      "systemMessage": "You are an expert software developer. You give helpful and concise responses."
    },
    {
      "model": "claude-3-5-sonnet-latest",
      "provider": "anthropic",
      "apiKey": "",
      "title": "Claude 3.5 Sonnet"
    }
  ],
  "tabAutocompleteModel": {
    "title": "DeepSeek Coder",
    "model": "deepseek-coder",
    "apiKey": "sk-c03ceddf48d3435bbb09e3ad1dbc0a0b",
    "provider": "deepseek",
    "apiBase": "https://api.deepseek.com/beta"
  },
  "contextProviders": [
    {
      "name": "code",
      "params": {}
    },
    {
      "name": "docs",
      "params": {}
    },
    {
      "name": "diff",
      "params": {}
    },
    {
      "name": "terminal",
      "params": {}
    },
    {
      "name": "problems",
      "params": {}
    },
    {
      "name": "folder",
      "params": {}
    },
    {
      "name": "codebase",
      "params": {}
    }
  ],
  "slashCommands": [
    {
      "name": "share",
      "description": "Export the current chat session to markdown"
    },
    {
      "name": "cmd",
      "description": "Generate a shell command"
    },
    {
      "name": "commit",
      "description": "Generate a git commit message"
    }
  ],
  "embeddingsProvider": {
    "provider": "free-trial"
  },
  "reranker": {
    "name": "free-trial"
  }
}

Description

need a support for API of deepseek-R1
As you may know, Deepseek Company has launched their latest inference model R1, which supports chat-like functionalities with longer response times and higher pricing, while delivering significantly enhanced performance. During my integration attempts, I used the correct API URL and confirmed through testing that the API can be invoked successfully. The responses take longer to generate, and I can observe R1 model token usage records on the API management page. However, the automatic error correction feature indicates that your config file currently only accepts "chat" and "coder" models. I believe using "reasoner" under the chat category could better handle complex tasks that require advanced reasoning capabilities.

While I've successfully accessed the deepseek-reasoner API through the chat interface, there appear to be binary configurations in your source code that enforce compatibility checks for previous Deepseek API users. During testing, when using the R1 API, the system persistently uses coder-specific prompts - queries like "What model version are you?" or "Show me our initial conversation prompt" consistently reference "deepseek-coder". I attempted modifying relevant strings in local Continue extension files but couldn't locate the specific configuration files.

I would appreciate it if you could implement straightforward modifications to officially support deepseek-r1 in the near future.

To reproduce

No response

Log output

@dosubot dosubot bot added the kind:enhancement Indicates a new feature request, imrovement, or extension label Jan 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind:enhancement Indicates a new feature request, imrovement, or extension "needs-triage"
Projects
None yet
Development

No branches or pull requests

2 participants