pythia-1b-sft-hh / README.md
wandb: https://wandb.ai/eleutherai/pythia-rlhf/runs/6y83ekqy?workspace=user-yongzx
Model Evals
| Task |Version|Filter| Metric |Value | |Stderr|
|--------------|-------|------|----------|-----:|---|-----:|
|arc_challenge |Yaml |none |acc |0.2526|± |0.0127|
| | |none |acc_norm |0.2773|± |0.0131|
|arc_easy |Yaml |none |acc |0.5791|± |0.0101|
| | |none |acc_norm |0.4912|± |0.0103|
|lambada_openai|Yaml |none |perplexity|7.0516|± |0.1979|
| | |none |acc |0.5684|± |0.0069|
|logiqa |Yaml |none |acc |0.2166|± |0.0162|
| | |none |acc_norm |0.2919|± |0.0178|
|piqa |Yaml |none |acc |0.7176|± |0.0105|
| | |none |acc_norm |0.6964|± |0.0107|
|sciq |Yaml |none |acc |0.8460|± |0.0114|
| | |none |acc_norm |0.7700|± |0.0133|
|winogrande |Yaml |none |acc |0.5399|± |0.0140|
|wsc |Yaml |none |acc |0.3654|± |0.0474|
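
The Stderr column can be turned into an approximate confidence interval when comparing these numbers against other checkpoints. The following is a minimal sketch, not part of the original evaluation: it assumes a normal approximation (z = 1.96 for roughly 95% coverage), and the helper name `confidence_interval` is hypothetical. The values are copied from the table above.

```python
def confidence_interval(value, stderr, z=1.96):
    """Return (low, high) for a normal-approximation interval around value.

    Assumption: the reported stderr is a standard error of the mean,
    so value +/- z * stderr gives an approximate confidence interval.
    """
    margin = z * stderr
    return value - margin, value + margin


# (value, stderr) pairs copied from the table above.
results = {
    "arc_easy acc": (0.5791, 0.0101),
    "wsc acc": (0.3654, 0.0474),
}

for name, (value, stderr) in results.items():
    low, high = confidence_interval(value, stderr)
    print(f"{name}: {value:.4f} (approx. 95% CI {low:.4f} to {high:.4f})")
```

Under this approximation the wsc interval is wide (roughly 0.27 to 0.46), so small differences on that task should be read with caution.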