Jingfeng Yao

MapleF9

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

TPDiff: Temporal Pyramid Video Diffusion Model

upvoted a paper 1 day ago

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models

upvoted a paper 2 days ago

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

View all activity

Organizations

MapleF9's activity

upvoted a paper about 2 hours ago

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published about 17 hours ago • 27

upvoted a paper 1 day ago

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models

Paper • 2503.08686 • Published 1 day ago • 14

upvoted a paper 2 days ago

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Paper • 2503.07608 • Published 3 days ago • 15

upvoted a paper 8 days ago

Improve Representation for Imbalanced Regression through Geometric Constraints

Paper • 2503.00876 • Published 11 days ago • 6

upvoted a paper 9 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 10 days ago • 64

upvoted an article 10 days ago

Article

SigLIP 2: A better multilingual vision language encoder

20 days ago

• 133

upvoted a paper 21 days ago

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Paper • 2502.13144 • Published 23 days ago • 37

upvoted a paper 22 days ago

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Paper • 2502.13145 • Published 23 days ago • 36

upvoted a paper 24 days ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published 27 days ago • 51

upvoted a paper 30 days ago

QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Paper • 2502.05178 • Published Feb 7 • 10

upvoted 2 papers about 1 month ago

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Paper • 2502.06782 • Published about 1 month ago • 13

MatAnyone: Stable Video Matting with Consistent Memory Propagation

Paper • 2501.14677 • Published Jan 24 • 31

upvoted 3 papers about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 345

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 275

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published Jan 16 • 34

upvoted 2 papers 2 months ago

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Paper • 2501.01423 • Published Jan 2 • 37

LKCell: Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels

Paper • 2407.18054 • Published Jul 25, 2024 • 12