1 25 52

罗杰斯

rojasdiego

https://rojasdiego.com

AI & ML interests

LLMs for Code Generation

Recent Activity

liked a model 1 day ago

mistralai/Mistral-Small-24B-Base-2501

upvoted a paper 15 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

upvoted a paper 18 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

View all activity

Organizations

rojasdiego's activity

liked a model 1 day ago

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated 17 days ago • 16.6k • 216

upvoted a paper 15 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 17 days ago • 53

upvoted a paper 18 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 19 days ago • 105

liked a model 27 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 8 days ago • 3.98M • • 9.13k

liked a model 28 days ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 8 days ago • 29.8k • 809

liked a dataset about 1 month ago

bigcode/the-stack-v2-train-smol-ids

Viewer • Updated Apr 23, 2024 • 40.1M • 731 • 30

liked a model about 1 month ago

numind/NuExtract-1.5

Text Generation • Updated Nov 18, 2024 • 116k • 196

updated a collection about 1 month ago

Code LLMs

Collection

6 items • Updated Jan 3 • 1

liked 2 models about 1 month ago

infly/OpenCoder-1.5B-Base

Text Generation • Updated Nov 11, 2024 • 14.4k • 20

infly/OpenCoder-8B-Instruct

Text Generation • Updated Nov 14, 2024 • 11.5k • 183

updated a collection about 2 months ago

CoT Models

Collection

2 items • Updated Jan 1

liked a model about 2 months ago

PowerInfer/SmallThinker-3B-Preview

Text Generation • Updated Jan 16 • 112k • • 383

updated a collection about 2 months ago

Code LLMs

Collection

6 items • Updated Jan 3 • 1

liked 2 models about 2 months ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 211k • • 1.61k

deepseek-ai/DeepSeek-V3-Base

Updated 24 days ago • 140k • 1.56k

liked a model 2 months ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 536k • • 1.97k

upvoted a paper 3 months ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 35

liked a model 3 months ago

tencent/Tencent-Hunyuan-Large

Text Generation • Updated 29 days ago • 521 • 564

liked 2 models 4 months ago

deepseek-ai/DeepSeek-V2

Text Generation • Updated Jun 8, 2024 • 17.8k • 307

Etched/oasis-500m

Updated Nov 4, 2024 • 175 • 446