-
Notifications
You must be signed in to change notification settings - Fork 23
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #18 from sotopia-lab/feature/add_finetuned_model
Add new model version for gpt-3.5-turbo-finetuned
- Loading branch information
Showing
10 changed files
with
127 additions
and
14 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -141,5 +141,3 @@ deprecated/* | |
|
||
#backup | ||
backup/* | ||
|
||
scripts/* |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
MODEL_NAME_1=gpt-3.5-turbo-ft-MF | ||
MODEL_NAME_2=gpt-3.5-turbo | ||
|
||
python examples/experiment_eval.py \ | ||
--gin_file sotopia_conf/generation_utils_conf/generate.gin \ | ||
--gin_file sotopia_conf/server_conf/server.gin \ | ||
--gin_file sotopia_conf/run_async_server_in_batch.gin \ | ||
"--gin.ENV_IDS=['01H7VFHPKA2GGPPNVJWV967HZC', '01H7VFHPHWA2CYG7BC82NS4XH1', '01H7VFHPH567HKQRE0C745KH9C', '01H7VFHPMS6AJY0PFGGCFFK5GX', '01H7VFHPJKR16MD1KC71V4ZRCF', '01H7VFHPQ1712DHGTMPQFTXH02', '01H7VFHPP9SPQ8W6583JFZ7HZC', '01H7VFHPM3NVVKSGCCB4S10465', '01H7VFHPGABSWQXTACCC8C3X2F', '01H7VFHPNHZ2YYRHP0GXARD550']" \ | ||
"--gin.AGENT1_MODEL=\"${MODEL_NAME_1}\"" \ | ||
"--gin.AGENT2_MODEL=\"${MODEL_NAME_2}\"" \ | ||
'--gin.BATCH_SIZE=1' \ | ||
'--gin.TAG="finetuned_gpt3.5_gpt3.5ft_MF"' \ | ||
'--gin.TAG_TO_CHECK_EXISTING_EPISODES="finetuned_gpt3.5_gpt3.5ft_MF"' \ | ||
'--gin.PUSH_TO_DB=True' \ | ||
'--gin.VERBOSE=False' \ | ||
'--gin.LITE=False' \ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
MODEL_NAME=gpt-3.5-turbo-finetuned | ||
|
||
python examples/experiment_eval.py \ | ||
--gin_file sotopia_conf/generation_utils_conf/generate.gin \ | ||
--gin_file sotopia_conf/server_conf/server.gin \ | ||
--gin_file sotopia_conf/run_async_server_in_batch.gin \ | ||
'--gin.ENV_IDS=[]' \ | ||
"--gin.AGENT1_MODEL=\"${MODEL_NAME}\"" \ | ||
"--gin.AGENT2_MODEL=\"${MODEL_NAME}\"" \ | ||
'--gin.BATCH_SIZE=5' \ | ||
'--gin.TAG="finetuned_gpt3.5"' \ | ||
'--gin.TAG_TO_CHECK_EXISTING_EPISODES="finetuned_gpt3.5"' \ | ||
'--gin.PUSH_TO_DB=True' \ | ||
'--gin.VERBOSE=False' \ | ||
'--gin.LITE=False' \ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,12 @@ | ||
# Agent vs Storyteller Scripts | ||
|
||
### Basic Scripts | ||
Here are some of the script for running {gpt-3.5-turbo, mixtral-7b-moe} under {normal interaction, omniscient interaction, script generation} mode in {normal, lite} setting. | ||
If you need to run all interaction mode, you can use `run_all.sh`, the usage is `Usage: ./run_all.sh <model_name> <tag_base> <lite>`. For example, `./run_all.sh gpt-3.5-turbo exp0128 True`. You may find model_name in `LLM_Name`, and currently we are using `mistralai/Mixtral-8x7B-Instruct-v0.1` and `gpt-3.5-turbo`. | ||
If you want to run mode separately, you can use `run_interaction.sh` or `run_script_full.sh`. | ||
After running the above script, you may specify tags and fix those error episodes using `./fix_missing_episodes_with_tag.sh`. | ||
Current `fix_missing_episodes_with_tag.py` first detects erroneous episodes, delete them and regenerate them. | ||
|
||
### Fine-tuning | ||
|
||
* `evaluate_finetuned_full.sh`: evaluate the fine-tuned model (gpt-3.5 finetuned on the full dataset) on the sotopia lite setting. |
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters