2 2 19

Zhicheng Wang

Dicer

https://blog.dicer.fun

Dicer-Zz

AI & ML interests

NLP

Recent Activity

updated a model 13 days ago

Dicer/ppo-Huggy

published a model 13 days ago

Dicer/ppo-Huggy

updated a model 13 days ago

Dicer/ppo-LunarLander-v2

View all activity

Organizations

Dicer's activity

updated a model 13 days ago

Dicer/ppo-Huggy

Reinforcement Learning • Updated 13 days ago • 93

published a model 13 days ago

Dicer/ppo-Huggy

Reinforcement Learning • Updated 13 days ago • 93

updated a model 13 days ago

Dicer/ppo-LunarLander-v2

Reinforcement Learning • Updated 13 days ago • 5

published a model 13 days ago

Dicer/ppo-LunarLander-v2

Reinforcement Learning • Updated 13 days ago • 5

upvoted 2 articles 18 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

about 1 month ago

• 63

Article

Vision Language Models Explained

Apr 11, 2024

• 283

liked 5 datasets 5 months ago

liked a model 6 months ago

XLabs-AI/flux-controlnet-collections

Text-to-Image • Updated Aug 30, 2024 • 42.2k • 451

liked a Space 12 months ago

5.03k

MTEB Leaderboard

🥇

Select benchmarks and languages for text embeddings evaluation

liked a model 12 months ago

openbmb/MiniCPM-2B-sft-fp32

Text Generation • Updated Sep 7, 2024 • 558 • 296

liked a dataset about 1 year ago

bigscience/P3

Viewer • Updated Mar 4, 2024 • 122M • 97.7k • 214

liked a model about 1 year ago

mistralai/Mistral-7B-Instruct-v0.2

Text Generation • Updated Sep 27, 2024 • 3.85M • • 2.67k

liked a dataset over 1 year ago

Muennighoff/natural-instructions

Viewer • Updated Dec 23, 2022 • 7.15M • 1.84k • 61

liked 2 models almost 2 years ago

databricks/dolly-v2-12b

Text Generation • Updated Jun 30, 2023 • 5.7k • 1.95k

huggyllama/llama-13b

Text Generation • Updated Apr 7, 2023 • 5.33k • 139

liked a dataset almost 2 years ago

RyokoAI/ShareGPT52K

Preview • Updated Apr 2, 2023 • 268 • 313