Add max_tokens configuration for LLM API #421

yangbobo2021 · 2024-11-12T03:56:44Z

This pull request introduces a new feature for the LLM API by adding a configurable max_tokens setting. Changes include:

Implemented loading of chat configuration from a YAML file.
Introduced get_maxtokens_by_model function to retrieve max_tokens based on model selection.
Updated chat completion functions to utilize the dynamic max_tokens setting.

This enhancement aims to provide more flexibility in managing the token limits for different models, improving the overall functionality of the API.

Closes /#issue_id

- Implemented loading of chat configuration from a YAML file. - Introduced `get_maxtokens_by_model` function to retrieve max_tokens. - Updated chat completion functions to utilize dynamic max_tokens setting.

feat: Add max_tokens configuration for LLM API

8b6dd86

- Implemented loading of chat configuration from a YAML file. - Introduced `get_maxtokens_by_model` function to retrieve max_tokens. - Updated chat completion functions to utilize dynamic max_tokens setting.

yangbobo2021 requested a review from kagami-l November 12, 2024 04:04

kagami-l approved these changes Nov 12, 2024

View reviewed changes

kagami-l merged commit 1fb5b9a into main Nov 12, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add max_tokens configuration for LLM API #421

Add max_tokens configuration for LLM API #421

yangbobo2021 commented Nov 12, 2024

Add max_tokens configuration for LLM API #421

Add max_tokens configuration for LLM API #421

Conversation

yangbobo2021 commented Nov 12, 2024