How could the metrics such as accuracy be -1?

#2
by zhiminy - opened

These results are not making a lot of sense...
1710896676747.png

SeaEval org

It means not finished.

I appreciate your reply. However, using "-1" to mark incomplete evaluation records may not align with best practices for leaderboard presentation, where unfinished entries are usually left blank or shown as NaN. Would you consider updating the main page with a detailed description of the evaluation metrics, including an explanation of the "-1" value, to make this clearer for users?
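To illustrate the suggestion, here is a minimal sketch in plain Python. It is not based on the SeaEval code: the helper names (`mask_unfinished`, `mean_of_finished`, `render`) and the sample scores are hypothetical, and it simply assumes that -1 is the placeholder for an unfinished run.

```python
import math

SENTINEL = -1  # assumed placeholder for "evaluation not finished"

def mask_unfinished(scores, sentinel=SENTINEL):
    """Replace the sentinel with NaN so it is not mistaken for a real score."""
    return [math.nan if s == sentinel else s for s in scores]

def mean_of_finished(scores):
    """Average only the finished (non-NaN) scores; NaN if none are finished."""
    finished = [s for s in scores if not math.isnan(s)]
    return sum(finished) / len(finished) if finished else math.nan

def render(score):
    """Show a blank cell for unfinished runs, two decimals otherwise."""
    return "" if math.isnan(score) else f"{score:.2f}"

raw = [0.5, -1, 0.75]  # hypothetical accuracies; -1 marks an unfinished run
masked = mask_unfinished(raw)
print([render(s) for s in masked])  # ['0.50', '', '0.75']
print(mean_of_finished(masked))     # 0.625
```

With this kind of masking, an unfinished run shows up as an empty cell and cannot be read as an accuracy of -1 or pulled into an average by accident.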

SeaEval org

Thanks for your suggestion. Yes! Will do.

Closing, since this issue disappeared after the recent leaderboard update :)
image.png

zhiminy changed discussion status to closed
