something-else/9BQuetzal-rocm-HF-1

This 9B model, built on the RWKV v5 architecture, was exclusively trained using AMD GPUs. The model's training process advanced in tandem with the evolution of ROCm (upto ROCm 6.0.0), this means a lot of experimentation 😅.

Tasks	Version	Filter	Metric	Value		Stderr
mathqa	Yaml	none	acc	0.2673	±	0.0081
		none	acc_norm	0.2747	±	0.0082
copa	Yaml	none	acc	0.87	±	0.0338
boolq	Yaml	none	acc	0.6927	±	0.0081
hellaswag	Yaml	none	acc	0.5148	±	0.0050
		none	acc_norm	0.6833	±	0.0046
sciq	Yaml	none	acc	0.9430	±	0.0073
		none	acc_norm	0.9210	±	0.0085
lambada_openai	Yaml	none	perplexity	3.7234	±	0.0767
		none	acc	0.7145	±	0.0063
piqa	Yaml	none	acc	0.7568	±	0.0100
		none	acc_norm	0.7693	±	0.0098
arc_challenge	Yaml	none	acc	0.3823	±	0.0142
		none	acc_norm	0.4172	±	0.0144
arc_easy	Yaml	none	acc	0.7151	±	0.0093
		none	acc_norm	0.7109	±	0.0093