view article Article Decoding Strategies in Large Language Models By mlabonne β’ about 1 month ago β’ 38
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale Paper β’ 2409.16299 β’ Published Sep 9 β’ 9
Gemma 2: Improving Open Language Models at a Practical Size Paper β’ 2408.00118 β’ Published Jul 31 β’ 75
GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression Paper β’ 2407.12077 β’ Published Jul 16 β’ 54
Searching for Best Practices in Retrieval-Augmented Generation Paper β’ 2407.01219 β’ Published Jul 1 β’ 11
view post Post 5771 Reply 5,000 new repos (models, datasets, spaces) are created EVERY DAY on HF now. The community is amazing! β€οΈ 25 25 π 20 20 π€ 4 4 π 2 2 π€ 2 2 π 1 1 +