Updates notebooks with GPT-4o results, minor bug fixes (#17)
* fix minor bug where web API responds with percentage instead of probability
* bumped version
* improve chatGPT system prompt for numeric answers
* added GPT4o plots
* added GPT4o table and results on ACSIncome
* updated score distribution plots with GPT4o results
* minor plots update
* one row score distribution plots
* added score bias plots for GPT4o
* calibration curves with multiple-choice prompting
* install now requires packaging>=22.0
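The first item (a web API answering with a percentage where a probability is expected) can be illustrated with a minimal sketch. This is not the code changed in this commit: the function name `parse_risk_score` and the rule "a `%` sign or a value above 1 means percentage" are assumptions made purely for illustration.

```python
import re


def parse_risk_score(answer: str) -> float:
    """Hypothetical helper: extract a numeric answer and return it as a probability in [0, 1].

    Assumption (not taken from the commit): an answer is treated as a
    percentage if it contains a '%' sign or if its value is greater than 1.
    """
    match = re.search(r"\d+(?:\.\d+)?", answer)
    if match is None:
        raise ValueError(f"no numeric value found in answer: {answer!r}")

    value = float(match.group())

    # Convert percentage-style answers (e.g. "70%" or "85") to probabilities.
    if "%" in answer or value > 1.0:
        value /= 100.0

    # Clamp to guard against out-of-range answers such as "150%".
    return min(max(value, 0.0), 1.0)
```

Under these assumptions, `parse_risk_score("70%")` and `parse_risk_score("0.7")` map to the same score. A bare answer of `"1"` is ambiguous (1% or probability 1); this sketch treats it as a probability.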
1 parent b75f4a6, commit ec4911f
Showing 57 changed files with 48 additions and 32 deletions.
Binary file modified (+553 Bytes, 100%): results/imgs/calibration-curves-base-and-instr.large-models.multiple-choice-prompt.pdf
Binary file modified (+541 Bytes, 100%): results/imgs/calibration-curves-base-and-instr.large-models.numeric-prompt.pdf
Binary file modified (+3.64 KB, 120%): ...ts/imgs/calibration-curves-base-and-instr.large-models.smaller.multiple-choice-prompt.pdf
Binary file modified (+3.71 KB, 120%): results/imgs/calibration-curves-base-and-instr.large-models.smaller.numeric-prompt.pdf
Binary file modified (+545 Bytes, 100%): results/imgs/calibration-curves-base-and-instr.multiple-choice-prompt.pdf
Binary file modified (+524 Bytes, 100%): results/imgs/calibration-curves-base-and-instr.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Meta-Llama-3-70B.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Meta-Llama-3-70B.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): .../imgs/calibration-curves-per-subgroup.Meta-Llama-3-70B.smaller.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Meta-Llama-3-70B.smaller.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Meta-Llama-3-8B.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Meta-Llama-3-8B.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): ...s/imgs/calibration-curves-per-subgroup.Meta-Llama-3-8B.smaller.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Meta-Llama-3-8B.smaller.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Mistral-7B-v0.1.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Mistral-7B-v0.1.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): ...s/imgs/calibration-curves-per-subgroup.Mistral-7B-v0.1.smaller.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Mistral-7B-v0.1.smaller.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Mixtral-8x22B-v0.1.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Mixtral-8x22B-v0.1.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): ...mgs/calibration-curves-per-subgroup.Mixtral-8x22B-v0.1.smaller.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Mixtral-8x22B-v0.1.smaller.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Mixtral-8x7B-v0.1.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Mixtral-8x7B-v0.1.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): ...imgs/calibration-curves-per-subgroup.Mixtral-8x7B-v0.1.smaller.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Mixtral-8x7B-v0.1.smaller.numeric-prompt.pdf
Binary file modified (-351 Bytes, 98%): results/imgs/calibration-curves-per-subgroup.None.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Yi-34B.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Yi-34B.numeric-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Yi-34B.smaller.multiple-choice-prompt.pdf
Binary file modified (+0 Bytes, 100%): results/imgs/calibration-curves-per-subgroup.Yi-34B.smaller.numeric-prompt.pdf
Binary file added (+16 KB): results/imgs/calibration-curves-per-subgroup.penai_gpt-4o-mini.multiple-choice-prompt.pdf
Binary file added (+15.8 KB): results/imgs/calibration-curves-per-subgroup.penai_gpt-4o-mini.numeric-prompt.pdf
Binary file added (+12.5 KB): ...imgs/calibration-curves-per-subgroup.penai_gpt-4o-mini.smaller.multiple-choice-prompt.pdf
Binary file added (+15.5 KB): results/imgs/calibration-curves-per-subgroup.penai_gpt-4o-mini.smaller.numeric-prompt.pdf
Binary file added (+15.6 KB): results/imgs/calibration-curves-per-subgroup.penai_gpt-4o.multiple-choice-prompt.pdf
Binary file added (+15.6 KB): results/imgs/calibration-curves-per-subgroup.penai_gpt-4o.numeric-prompt.pdf
Binary file added (+12.5 KB): results/imgs/calibration-curves-per-subgroup.penai_gpt-4o.smaller.multiple-choice-prompt.pdf
Binary file added (+15.6 KB): results/imgs/calibration-curves-per-subgroup.penai_gpt-4o.smaller.numeric-prompt.pdf
Binary file modified (+19 Bytes, 100%): results/imgs/score-distribution.multiple-choice-prompt.pdf
Binary file added (+11.9 KB): results/imgs/score-per-subgroup.penai_gpt-4o.multiple-choice-prompt.pdf
Binary file modified (+135 Bytes, 100%): results/imgs/score_bias.Asian_v_Black_score_bias.multiple-choice-prompt.pdf
Binary file modified (+127 Bytes, 100%): results/imgs/score_bias.Asian_v_Black_score_bias.numeric-prompt.pdf
Binary file added (+21 KB): results/imgs/score_bias.White_v_Black_score_bias.multiple-choice-prompt.pdf
Binary file modified (+81 Bytes, 100%): results/imgs/under_over_score.ACSIncome.multiple-choice-prompt.pdf
Binary file modified (+78 Bytes, 100%): results/imgs/under_over_score.ACSIncome.numeric-prompt.pdf