Various benchmarks for reasoning capabilities of LLMs
Denis Gordeev
denis-gordeev
·
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 month ago
genmo/mochi-1-preview
updated
a collection
about 2 months ago
Reasoning benchmarks
updated
a collection
about 2 months ago
Reasoning benchmarks
Organizations
None yet
Collections
2
My (Denis Gordeev) collection of mostly NLP papers. You can message me at t.me/nlp_party
-
LongAlign: A Recipe for Long Context Alignment of Large Language Models
Paper • 2401.18058 • Published • 21 -
Efficient Tool Use with Chain-of-Abstraction Reasoning
Paper • 2401.17464 • Published • 16 -
Scavenging Hyena: Distilling Transformers into Long Convolution Models
Paper • 2401.17574 • Published • 15 -
Rethinking Interpretability in the Era of Large Language Models
Paper • 2402.01761 • Published • 22
models
4
denis-gordeev/whisper-large-v3-ar
Updated
denis-gordeev/whisper-small-ar
Automatic Speech Recognition
•
Updated
•
3
denis-gordeev/rured2-ner-microsoft-mdeberta-v3-base
Token Classification
•
Updated
•
472
•
5
denis-gordeev/autotrain-insightful_keywords_2-54689127880
Text Classification
•
Updated
•
13
•
3