view article Article LeRobot goes to driving school: World’s largest open-source self-driving dataset 3 days ago • 40
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 2 days ago • 199
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 6 days ago • 71
EuroBERT Collection Scaling Multilingual Encoders for European Languages • 4 items • Updated 3 days ago • 8
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • 4 days ago • 115
Jina Reader-LM Collection Convert HTML content to LLM-friendly Markdown/JSON content • 3 items • Updated Jan 16 • 10
jina-embeddings-v3: Multilingual Embeddings With Task LoRA Paper • 2409.10173 • Published Sep 16, 2024 • 31
jina-embeddings-v3 Collection Multilingual multi-task general text embedding model • 6 items • Updated Sep 19, 2024 • 22
LettuceDetect: A Hallucination Detection Framework for RAG Applications Paper • 2502.17125 • Published 17 days ago • 8
Hallucination detection Collection Trained ModernBERT (base and large) for detection hallucinations in LLM responses. The models are trained as token classifications. • 4 items • Updated 8 days ago • 15
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality 10 days ago • 65
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • Jan 15 • 43
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 24 days ago • 93
NeoBERT Collection NeoBERT is a next-generation encoder model for English text representation, pre-trained from scratch on the RefinedWeb dataset. • 1 item • Updated 14 days ago • 2