Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published about 12 hours ago • 8
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 3 days ago • 40
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices Paper • 2411.10640 • Published 6 days ago • 37
GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation Paper • 2411.08033 • Published 9 days ago • 21
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Paper • 2411.09595 • Published 8 days ago • 65
Hermes: A Large Language Model Framework on the Journey to Autonomous Networks Paper • 2411.06490 • Published 12 days ago • 7
Sharingan: Extract User Action Sequence from Desktop Recordings Paper • 2411.08768 • Published 9 days ago • 9
MagicQuill: An Intelligent Interactive Image Editing System Paper • 2411.09703 • Published 7 days ago • 50
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation Paper • 2411.08380 • Published 9 days ago • 24
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation Paper • 2411.07975 • Published 10 days ago • 24
StdGEN: Semantic-Decomposed 3D Character Generation from Single Images Paper • 2411.05738 • Published 14 days ago • 13
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published 23 days ago • 46
Unbounded: A Generative Infinite Game of Character Life Simulation Paper • 2410.18975 • Published 29 days ago • 34
WorldSimBench: Towards Video Generation Models as World Simulators Paper • 2410.18072 • Published 30 days ago • 17
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree Paper • 2410.16268 • Published Oct 21 • 65
Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation Paper • 2410.15748 • Published Oct 21 • 12
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation Paper • 2410.13726 • Published Oct 17 • 10
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Paper • 2410.13232 • Published Oct 17 • 40