TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published 3 days ago • 23
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published 4 days ago • 22
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated 3 days ago • 23
Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples Paper • 2411.08954 • Published 12 days ago • 5
MagicQuill: An Intelligent Interactive Image Editing System Paper • 2411.09703 • Published 11 days ago • 53
SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models Paper • 2411.05007 • Published 18 days ago • 16
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published 18 days ago • 109
How Far is Video Generation from World Model: A Physical Law Perspective Paper • 2411.02385 • Published 21 days ago • 32
NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks Paper • 2410.20650 • Published 29 days ago • 16
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens Paper • 2410.13863 • Published Oct 17 • 35
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation Paper • 2410.13848 • Published Oct 17 • 27
Tree of Problems: Improving structured problem solving with compositionality Paper • 2410.06634 • Published Oct 9 • 8
AuraFlow Collection AuraFlow v0.x series, to date the largest (6.8B) and highest fidelity (0.7+ on GenEval) open sourced text to image model. • 3 items • Updated Sep 6 • 5
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging Paper • 2410.01215 • Published Oct 2 • 30
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices Paper • 2410.00531 • Published Oct 1 • 29