SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Paper • 2501.18427 • Published 3 days ago • 11
NVILA: Efficient Frontier Visual Language Models Paper • 2412.04468 • Published Dec 5, 2024 • 57
Open LLM Leaderboard: Track, rank and evaluate open LLMs and chatbots • Running on CPU Upgrade • 12.4k
Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published Nov 25, 2024 • 15