Lei Wang

demolei

https://demoleiwang.github.io/HomePage/

demo_lei_wang
lei-wang-0805831a2

AI & ML interests

LLMs

Recent Activity

upvoted a paper about 9 hours ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

upvoted a paper about 9 hours ago

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

upvoted a paper 1 day ago

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Organizations

Collections 3

Papers 9

models 4

datasets

None public yet

Collections 3

Language Modeling Is Compression

SlimPajama-DC: Understanding Data Combinations for LLM Training

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Contrastive Decoding Improves Reasoning in Large Language Models

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

models 4

demolei/Qwen2.5-1.5B-Open-R1-Distill

demolei/Qwen-2.5-7B-Simple-RL

demolei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

demolei/sft_openassistant-guanaco
