4 21 83

Richard Lian

richardlian

dachenlian

AI & ML interests

None yet

Recent Activity

upvoted an article 8 days ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

liked a model 12 days ago

KVCache-ai/DeepSeek-R1-GGML-FP8-Hybrid

liked a model about 1 month ago

unsloth/DeepSeek-R1-GGUF

View all activity

Organizations

richardlian's activity

upvoted an article 8 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

about 1 month ago

• 63

liked a model 12 days ago

KVCache-ai/DeepSeek-R1-GGML-FP8-Hybrid

Updated 6 days ago • 9

liked a model about 1 month ago

unsloth/DeepSeek-R1-GGUF

Text Generation • Updated 24 days ago • 5.01M • 978

upvoted 2 papers about 2 months ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37

upvoted an article about 2 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 156

liked a Space 2 months ago

1.18k

Big Code Models Leaderboard

📈

Submit code models for evaluation on benchmarks

upvoted an article 2 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 20

upvoted a collection 3 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141

liked a model 4 months ago

nyrahealth/CrisperWhisper

Automatic Speech Recognition • Updated Dec 19, 2024 • 26.3k • • 242