I have run this model on "high_school_mathematics" and "college_mathematics" domain of MMLU benchmark, but get a low accuracy. Does that mean this model perform relatively worse on that domains compared with other domains?
· Sign up or log in to comment