pythia-160m-ppo / README.md

Wandb Run: https://wandb.ai/eleutherai/pythia-rlhf/runs/31gbxj2w

Eval Results:

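| Tasks | Version | Filter | Metric | Value | | Stderr |
|---|---|---|---|---:|---|---:|
| arc_challenge | Yaml | none | acc | 0.2159 | ± | 0.0120 |
| | | none | acc_norm | 0.2295 | ± | 0.0123 |
| arc_easy | Yaml | none | acc | 0.3266 | ± | 0.0096 |
| | | none | acc_norm | 0.3287 | ± | 0.0096 |
| lambada_openai | Yaml | none | perplexity | NaN | ± | NaN |
| | | none | acc | 0.1750 | ± | 0.0053 |
| logiqa | Yaml | none | acc | 0.2028 | ± | 0.0158 |
| | | none | acc_norm | 0.2028 | ± | 0.0158 |
| piqa | Yaml | none | acc | 0.5441 | ± | 0.0116 |
| | | none | acc_norm | 0.5446 | ± | 0.0116 |
| sciq | Yaml | none | acc | 0.2050 | ± | 0.0128 |
| | | none | acc_norm | 0.1940 | ± | 0.0125 |
| winogrande | Yaml | none | acc | 0.5043 | ± | 0.0141 |
| wsc | Yaml | none | acc | 0.6154 | ± | 0.0479 |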
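
Reading the Stderr column: the sketch below turns a few of the `Value ± Stderr` pairs from the eval results above into approximate 95% confidence intervals. It assumes a normal approximation (a multiplier of 1.96), which is an assumption of this example and not something stated by the evaluation run. The helper name `confidence_interval` is hypothetical.

```python
# Rows copied from the eval results above as (task, metric, value, stderr).
RESULTS = [
    ("arc_challenge", "acc", 0.2159, 0.0120),
    ("piqa", "acc", 0.5441, 0.0116),
    ("winogrande", "acc", 0.5043, 0.0141),
    ("wsc", "acc", 0.6154, 0.0479),
]


def confidence_interval(value, stderr, z=1.96):
    """Return (low, high) under a normal approximation; z=1.96 is roughly 95%."""
    return value - z * stderr, value + z * stderr


for task, metric, value, stderr in RESULTS:
    low, high = confidence_interval(value, stderr)
    print(f"{task:14s} {metric}: {value:.4f}  [{low:.4f}, {high:.4f}]")
```

An interval that contains a task's random-guess accuracy suggests the score may not be distinguishable from chance at this sample size.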