Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
AssistantBench
/
leaderboard
like
4
Running
App
Files
Files
Community
1
9200a7d
leaderboard
/
evaluation
/
evaluate_utils
4 contributors
History:
1 commit
samuelam
Upload 6 files
3891395
verified
5 months ago
evaluate_dicts.py
Safe
2.15 kB
Upload 6 files
5 months ago
evaluate_factory.py
Safe
766 Bytes
Upload 6 files
5 months ago
evaluate_numbers.py
Safe
834 Bytes
Upload 6 files
5 months ago
evaluate_strings.py
Safe
5.54 kB
Upload 6 files
5 months ago
utils.py
Safe
916 Bytes
Upload 6 files
5 months ago