view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • about 1 month ago • 63
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published Jan 16 • 37
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 156
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention By sirluk • Oct 7, 2024 • 20
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141
nyrahealth/CrisperWhisper Automatic Speech Recognition • Updated Dec 19, 2024 • 26.3k • • 242