Clémentine Fourrier
clefourrier
AI & ML interests: None yet
clefourrier's activity
upvoted an article 15 days ago
Article: Space secrets security update • 50
upvoted an article 25 days ago
Article: Evaling llm-jp-eval (evals are hard) • 4
Article: Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent • 74
Article: LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) • 51
upvoted a collection about 1 month ago
upvoted a paper about 1 month ago
upvoted an article about 1 month ago
Article: Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face • 13
upvoted an article about 2 months ago
Article: Improving Prompt Consistency with Structured Generations • 48
Article: A guide to setting up your own Hugging Face leaderboard: an end-to-end example with Vectara's hallucination leaderboard • 5
Article: An Introduction to AI Secure LLM Safety Leaderboard • 4
Article: The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models • 7
Article: Introducing the Enterprise Scenarios Leaderboard: a Leaderboard for Real World Use Cases • 3
Article: Introducing the Open Ko-LLM Leaderboard: Leading the Korean LLM Evaluation Ecosystem • 2
Article: Introducing the Red-Teaming Resistance Leaderboard • 7
Article: TTS Arena: Benchmarking Text-to-Speech Models in the Wild • 20
Article: Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes? • 3
Article: Introducing the Chatbot Guardrails Arena • 4
Article: NPHardEval Leaderboard: Unveiling the Reasoning Abilities of Large Language Models through Complexity Classes and Dynamic Updates • 2
upvoted a paper 4 months ago
upvoted a collection 6 months ago
Paper: NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian • 2312.01314 • 2
Paper: The Falcon Series of Open Language Models • 2311.16867 • 11
Paper: GAIA: a benchmark for General AI Assistants • 2311.12983 • 174
upvoted a collection 7 months ago
upvoted a paper 8 months ago