Running 3 3 Emotional Intelligence Data Browser 😻 Browse and analyze LLM leaderboards and responses
MSTS: A Multimodal Safety Test Suite for Vision-Language Models Paper • 2501.10057 • Published Jan 17 • 8
Language Model Council: Benchmarking Foundation Models on Highly Subjective Tasks by Consensus Paper • 2406.08598 • Published Jun 12, 2024 • 6
Using Machine Translation to Localize Task Oriented NLG Output Paper • 2107.04512 • Published Jul 9, 2021
Language Model Council: Benchmarking Foundation Models on Highly Subjective Tasks by Consensus Paper • 2406.08598 • Published Jun 12, 2024 • 6
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published Apr 29, 2024 • 121