AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning Paper • arXiv:2503.07608 • Published Mar 2025
Unified Reward Model for Multimodal Understanding and Generation Paper • arXiv:2503.05236 • Published Mar 2025
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts Paper • arXiv:2503.05447 • Published Mar 2025
Liger: Linearizing Large Language Models to Gated Recurrent Structures Paper • arXiv:2503.01496 • Published Mar 2025
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Paper • arXiv:2502.16894 • Published Feb 2025
MoM: Linear Sequence Modeling with Mixture-of-Memories Paper • arXiv:2502.13685 • Published Feb 2025
LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid Paper • arXiv:2502.07563 • Published Feb 2025
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Paper • arXiv:2501.12895 • Published Jan 22, 2025
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • arXiv:2501.03262 • Published Jan 4, 2025
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • arXiv:2501.04519 • Published Jan 8, 2025
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models Paper • arXiv:2501.03124 • Published Jan 6, 2025
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing Paper • arXiv:2412.14711 • Published Dec 19, 2024
Diving into Self-Evolving Training for Multimodal Reasoning Paper • arXiv:2412.17451 • Published Dec 23, 2024
X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models Paper • arXiv:2412.01824 • Published Dec 2, 2024
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling Paper • arXiv:2409.19291 • Published Sep 28, 2024
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper • arXiv:2405.09818 • Published May 16, 2024
Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM Paper • arXiv:2401.02994 • Published Jan 4, 2024