Install lm-eval-harness from source. We used git commit f3b7917091afba325af3980a35d8a6dcba03dc3f.

lm_eval  --model hf --model_args pretrained="Intel/neural-chat-v3-3-int4-inc",autogptq=True,gptq_use_triton=True --device cuda:0 --tasks lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,rte,arc_easy,arc_challenge,mmlu  --batch_size 128

| Metric | FP16 | INT4 |
| -------------- | ------ | ------ |
| Avg. | 0.6778 | 0.6748 |
| mmlu | 0.5993 | 0.5926 |
| lambada_openai | 0.7303 | 0.7370 |
| hellaswag | 0.6639 | 0.6559 |
| winogrande | 0.7632 | 0.7735 |
| piqa | 0.8101 | 0.8074 |
| truthfulqa_mc1 | 0.4737 | 0.4737 |
| openbookqa | 0.3880 | 0.3680 |
| boolq | 0.8694 | 0.8694 |
| rte | 0.7581 | 0.7509 |
| arc_easy | 0.8266 | 0.8249 |
| arc_challenge | 0.5734 | 0.5691 |
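
As a quick sanity check on the table, the sketch below recomputes the Avg. row. It assumes Avg. is the unweighted mean of the eleven task scores, which the reported numbers are consistent with; the `scores` dictionary and the `mean_column` helper are illustrative names, not part of lm-eval-harness.

```python
# Per-task accuracy copied from the table: task -> (FP16, INT4).
scores = {
    "mmlu": (0.5993, 0.5926),
    "lambada_openai": (0.7303, 0.7370),
    "hellaswag": (0.6639, 0.6559),
    "winogrande": (0.7632, 0.7735),
    "piqa": (0.8101, 0.8074),
    "truthfulqa_mc1": (0.4737, 0.4737),
    "openbookqa": (0.3880, 0.3680),
    "boolq": (0.8694, 0.8694),
    "rte": (0.7581, 0.7509),
    "arc_easy": (0.8266, 0.8249),
    "arc_challenge": (0.5734, 0.5691),
}


def mean_column(col: int) -> float:
    """Unweighted mean over all tasks for one column (0 = FP16, 1 = INT4)."""
    return sum(pair[col] for pair in scores.values()) / len(scores)


fp16_avg = mean_column(0)
int4_avg = mean_column(1)

print(f"FP16 avg: {fp16_avg:.4f}")  # 0.6778
print(f"INT4 avg: {int4_avg:.4f}")  # 0.6748
# Fraction of the FP16 average that the INT4 model retains.
print(f"INT4 / FP16: {int4_avg / fp16_avg:.4f}")
```

Both recomputed averages match the Avg. row when rounded to four decimal places.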
