Wandb Run: https://wandb.ai/eleutherai/pythia-rlhf/runs/31gbxj2w Eval Results: | Tasks |Version|Filter| Metric |Value | |Stderr| |--------------|-------|------|----------|-----:|---|-----:| |arc_challenge |Yaml |none |acc |0.2159|± |0.0120| | | |none |acc_norm |0.2295|± |0.0123| |arc_easy |Yaml |none |acc |0.3266|± |0.0096| | | |none |acc_norm |0.3287|± |0.0096| |lambada_openai|Yaml |none |perplexity| NaN|± | NaN| | | |none |acc |0.1750|± |0.0053| |logiqa |Yaml |none |acc |0.2028|± |0.0158| | | |none |acc_norm |0.2028|± |0.0158| |piqa |Yaml |none |acc |0.5441|± |0.0116| | | |none |acc_norm |0.5446|± |0.0116| |sciq |Yaml |none |acc |0.2050|± |0.0128| | | |none |acc_norm |0.1940|± |0.0125| |winogrande |Yaml |none |acc |0.5043|± |0.0141| |wsc |Yaml |none |acc |0.6154|± |0.0479|