Simon DL

SimonDL

Simon-dl

AI & ML interests

Reinforcement Learning

Recent Activity

liked a Space 18 days ago

nanotron/ultrascale-playbook

upvoted an article about 1 month ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

upvoted an article about 2 months ago

The Large Language Model Course

View all activity

Organizations

None yet

SimonDL's activity

liked a Space 18 days ago

2.14k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article about 1 month ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 110

upvoted an article about 2 months ago

Article

The Large Language Model Course

•

Jan 16

• 126

upvoted 3 papers 3 months ago

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published Nov 26, 2024 • 80

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 46

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Paper • 2411.13543 • Published Nov 20, 2024 • 18

upvoted 3 papers 4 months ago

liked a dataset 4 months ago

fka/awesome-chatgpt-prompts

Viewer • Updated Jan 6 • 203 • 11.5k • 7.61k

updated a collection 4 months ago

Robotics

Collection

5 items • Updated Nov 8, 2024

upvoted a paper 4 months ago

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

Paper • 2411.03823 • Published Nov 6, 2024 • 45

updated a collection 4 months ago

Robotics

Collection

5 items • Updated Nov 8, 2024

upvoted 5 papers 4 months ago

Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning

Paper • 2410.21845 • Published Oct 29, 2024 • 13

Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Dataset

Paper • 2410.22325 • Published Oct 29, 2024 • 10

SMITE: Segment Me In TimE

Paper • 2410.18538 • Published Oct 24, 2024 • 16

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 19

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Paper • 2410.17856 • Published Oct 23, 2024 • 49