Chujie Zheng's picture

Chujie Zheng

chujiezheng

·

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

upvoted a paper 3 days ago

START: Self-taught Reasoner with Tools

liked a model 4 days ago

Qwen/QwQ-32B-GGUF

liked a model 4 days ago

Qwen/QwQ-32B

View all activity

Organizations

chujiezheng's activity

upvoted a paper 3 days ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 4 days ago • 78

upvoted a paper 10 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 18 days ago • 157

upvoted a paper 17 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 18 days ago • 94

upvoted 3 papers about 2 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 63

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 70

upvoted a paper 2 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 50

upvoted a collection 3 months ago

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated Jan 1 • 45

upvoted 4 papers 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 349

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 47

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 27

upvoted a collection 3 months ago

QwQ

Qwen with Questions • 6 items • Updated 4 days ago • 76

upvoted an article 5 months ago

Article

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

By

•

Oct 24, 2024

• 10

upvoted a paper 5 months ago

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

Paper • 2410.13841 • Published Oct 17, 2024 • 17

upvoted a paper 6 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 141

upvoted a paper 7 months ago

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

Paper • 2408.08072 • Published Aug 15, 2024 • 35

upvoted a collection 9 months ago

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated Jan 17 • 162

upvoted a paper 10 months ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

upvoted a collection 11 months ago

Eurus

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Oct 22, 2024 • 24