Moritz Reuss

mbreuss

https://mbreuss.github.io/

AI & ML interests

Robotics, Imitation Learning, Diffusion, Embodied AI

mbreuss's activity

upvoted a paper 21 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 22 days ago • 129

liked a model 23 days ago

mbreuss/MoDE_CALVIN_ABCD

Robotics • Updated Dec 19, 2024 • 11 • 2

upvoted 3 papers 24 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 26 days ago • 142

HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

Paper • 2502.12148 • Published 25 days ago • 16

Learning Getting-Up Policies for Real-World Humanoid Robots

Paper • 2502.12152 • Published 25 days ago • 37

upvoted a paper 25 days ago

Large Language Diffusion Models

Paper • 2502.09992 • Published 28 days ago • 103

upvoted a paper 29 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 30 days ago • 46

upvoted a collection about 1 month ago

MoDE

Collection

Collection of pretrained MoDE Diffusion Policies. Variants include finetuned versions for all CALVIN benchmarks and LIBERO 90. • 9 items • Updated Dec 19, 2024 • 2

upvoted a paper about 2 months ago

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16 • 23

liked a model about 2 months ago

mbreuss/MoDE_Pretrained

Robotics • Updated Dec 19, 2024 • 15 • 4

upvoted 2 papers 2 months ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7 • 50

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 65

upvoted a paper 3 months ago

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

Paper • 2412.15213 • Published Dec 19, 2024 • 26

authored 2 papers 3 months ago

Information Maximizing Curriculum: A Curriculum-Based Approach for Imitating Diverse Skills

Paper • 2303.15349 • Published Mar 27, 2023

Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning

Paper • 2412.12953 • Published Dec 17, 2024 • 11

updated 5 models 3 months ago