38 37 102

Junlin Zhou

jlzhou

edwardzjl

AI & ML interests

None yet

Recent Activity

upvoted a paper about 21 hours ago

s1: Simple test-time scaling

liked a dataset 1 day ago

cognitivecomputations/dolphin-r1

upvoted a paper 4 days ago

The Differences Between Direct Alignment Algorithms are a Blur

View all activity

Organizations

jlzhou's activity

upvoted a paper about 21 hours ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 17 days ago • 102

upvoted a paper 4 days ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 15 days ago • 112

upvoted an article 7 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

10 days ago

• 32

upvoted a paper 13 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 20 days ago • 54

upvoted a paper 23 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 256

upvoted 3 papers 25 days ago

upvoted an article 26 days ago

Article

Fine-tune Llama 3 with ORPO

•

Apr 22, 2024

• 233

upvoted an article 27 days ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 16

upvoted an article 28 days ago

Article

The Large Language Model Course

•

Jan 16

• 101

upvoted 3 articles about 1 month ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

•

Apr 24, 2024

• 61

Article

Let's talk about LLM evaluation

•

May 23, 2024

• 153

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

•

Aug 26, 2024

• 50

upvoted a paper about 1 month ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 99

upvoted 2 papers about 2 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 69

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 65

upvoted a collection 2 months ago

LLM Reasoning Papers

Collection

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 116

upvoted an article 3 months ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

•

Sep 27, 2024

• 40

upvoted a paper 7 months ago

Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation

Paper • 2407.10817 • Published Jul 15, 2024 • 14