DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • about 18 hours ago • 1
AI Bookkeeper: Enhancing Accounting Document Understanding Through Supervised Fine-Tuning By jenesys-ai and 1 other • 1 day ago • 3
Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers By davidberenstein1957 • 3 days ago • 6
AI Agents for Hardware Optimization: Automating PC Gaming Performance with KaibanJS By darielnoel • 4 days ago • 1
Assessment of how well Large Language Models (LLMs) answer questions related to gender equality and women’s empowerment By CGIAR and 2 others • 4 days ago
A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons By NormalUhr • 4 days ago
From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning By NormalUhr • 4 days ago • 2
MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression By NormalUhr • 4 days ago • 3
AI Agents for Company Research: Automating Business Analysis with KaibanJS By darielnoel • 4 days ago • 1
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • about 18 hours ago • 1
AI Bookkeeper: Enhancing Accounting Document Understanding Through Supervised Fine-Tuning By jenesys-ai and 1 other • 1 day ago • 3
Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers By davidberenstein1957 • 3 days ago • 6
AI Agents for Hardware Optimization: Automating PC Gaming Performance with KaibanJS By darielnoel • 4 days ago • 1
Assessment of how well Large Language Models (LLMs) answer questions related to gender equality and women’s empowerment By CGIAR and 2 others • 4 days ago
A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons By NormalUhr • 4 days ago
From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning By NormalUhr • 4 days ago • 2
MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression By NormalUhr • 4 days ago • 3
AI Agents for Company Research: Automating Business Analysis with KaibanJS By darielnoel • 4 days ago • 1