Merge pull request #612 from deeppavlov/feat/add-gpt-4o
Add new model gpt-4o
Showing 9 changed files with 115 additions and 6 deletions.
@@ -0,0 +1,25 @@
name: openai_api_gpt4o
display_name: GPT-4o
component_type: Generative
model_type: NN-based
is_customizable: false
author: [email protected]
description: GPT-4o (“o” for “omni”) is our most advanced model. It is multimodal (accepting text or image inputs and
  outputting text), and it has the same high intelligence as GPT-4 Turbo but is much more efficient—it generates text
  2x faster and is 50% cheaper. The maximum number of output tokens for this model is 128000.
ram_usage: 100M
gpu_usage: null
group: services
connector:
  protocol: http
  timeout: 120.0
  url: http://openai-api-gpt4o:8181/respond
dialog_formatter: null
response_formatter: null
previous_services: null
required_previous_services: null
state_manager_method: null
tags: null
endpoint: respond
service: services/openai_api_lm/service_configs/openai-api-gpt-4o
date_created: '2024-05-15T09:45:32'
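The connector URL in this card has to line up with the service name, port, and endpoint declared in the service configs of the same commit. A minimal sketch of that relationship follows, assuming the host part of the URL is the service name from `service.yml`. The helper `connector_url` is hypothetical and is not part of the repository.

```python
from urllib.parse import urlsplit


def connector_url(service_name: str, port: int, endpoint: str) -> str:
    """Compose an HTTP connector URL (hypothetical helper, for illustration only)."""
    return f"http://{service_name}:{port}/{endpoint}"


# Values copied by hand from the component card and service configs in this commit.
card_url = "http://openai-api-gpt4o:8181/respond"
expected = connector_url("openai-api-gpt4o", 8181, "respond")

# Split the card URL and compare each part against the declared settings.
parts = urlsplit(card_url)
url_matches = (
    card_url == expected
    and parts.hostname == "openai-api-gpt4o"
    and parts.port == 8181
    and parts.path == "/respond"
)
print(url_matches)
```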
services/openai_api_lm/service_configs/openai-api-gpt-4o/environment.yml (4 additions, 0 deletions)
@@ -0,0 +1,4 @@
SERVICE_PORT: 8181
SERVICE_NAME: openai-api-gpt4o
PRETRAINED_MODEL_NAME_OR_PATH: gpt-4o
FLASK_APP: server
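The four settings in this file are flat `KEY: value` pairs. As a rough illustration of how they could be read without a YAML library, here is a sketch. `parse_flat_config` is a hypothetical helper; it handles only this flat shape, not general YAML, and how the repository actually loads the file is not shown in the diff.

```python
def parse_flat_config(text: str) -> dict:
    """Parse flat `KEY: value` lines into a dict of strings (not a YAML parser)."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and comments.
        if not line or line.startswith("#"):
            continue
        # Split on the first colon only, so values may contain colons.
        key, _, value = line.partition(":")
        settings[key.strip()] = value.strip()
    return settings


# File content copied from the diff.
ENVIRONMENT_YML = """\
SERVICE_PORT: 8181
SERVICE_NAME: openai-api-gpt4o
PRETRAINED_MODEL_NAME_OR_PATH: gpt-4o
FLASK_APP: server
"""

env = parse_flat_config(ENVIRONMENT_YML)
port = int(env["SERVICE_PORT"])  # values are strings until converted
print(env["PRETRAINED_MODEL_NAME_OR_PATH"], port)
```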
services/openai_api_lm/service_configs/openai-api-gpt-4o/service.yml (30 additions, 0 deletions)
@@ -0,0 +1,30 @@
name: openai-api-gpt4o
endpoints:
- respond
- generate_goals
compose:
  env_file:
  - .env
  build:
    args:
      SERVICE_PORT: 8181
      SERVICE_NAME: openai_api_gpt4_o
      PRETRAINED_MODEL_NAME_OR_PATH: gpt-4o
      FLASK_APP: server
    context: .
    dockerfile: ./services/openai_api_lm/Dockerfile
  command: flask run -h 0.0.0.0 -p 8181
  environment:
  - FLASK_APP=server
  deploy:
    resources:
      limits:
        memory: 100M
      reservations:
        memory: 100M
  volumes:
  - ./services/openai_api_lm:/src
  - ./common:/src/common
  ports:
  - 8181:8181
proxy: null
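Several values appear in more than one file of this commit, so a quick cross-check can catch drift between them. The sketch below compares the `environment.yml` settings with the build args in `service.yml`, using values copied by hand from the diff. `find_mismatches` is a hypothetical helper, not something the repository provides.

```python
def find_mismatches(left: dict, right: dict) -> dict:
    """Return {key: (left_value, right_value)} for shared keys whose values differ."""
    return {
        key: (left[key], right[key])
        for key in sorted(left.keys() & right.keys())
        if str(left[key]) != str(right[key])
    }


# Copied from environment.yml in this commit.
environment_yml = {
    "SERVICE_PORT": 8181,
    "SERVICE_NAME": "openai-api-gpt4o",
    "PRETRAINED_MODEL_NAME_OR_PATH": "gpt-4o",
    "FLASK_APP": "server",
}

# Copied from the build args in service.yml in this commit.
build_args = {
    "SERVICE_PORT": 8181,
    "SERVICE_NAME": "openai_api_gpt4_o",
    "PRETRAINED_MODEL_NAME_OR_PATH": "gpt-4o",
    "FLASK_APP": "server",
}

mismatches = find_mismatches(environment_yml, build_args)
print(mismatches)
```

With the values in this diff, only `SERVICE_NAME` differs (`openai-api-gpt4o` versus `openai_api_gpt4_o`). Whether that difference matters depends on how the service uses the name, which the diff does not show.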