Dokyoon

leeloolee

Eruly

Recent Activity

liked a model 4 days ago

lerobot/pi0

upvoted a paper 7 days ago

Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach

liked a dataset 18 days ago

ServiceNow-AI/R1-Distill-SFT

leeloolee's activity

upvoted a paper 7 days ago

Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach

Paper • 2502.03639 • Published 11 days ago • 8

upvoted a paper 23 days ago

DiffuEraser: A Diffusion Model for Video Inpainting

Paper • 2501.10018 • Published about 1 month ago • 14

upvoted an article about 1 month ago

Context Parallelism

Article • Aug 13, 2024 • 13

upvoted a collection about 1 month ago

🔍 Interpretability & Analysis of LMs

Collection • Outstanding research in LM interpretability and evaluation, summarized • 101 items • Updated 12 days ago • 97

upvoted a paper about 1 month ago

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Paper • 2501.03124 • Published Jan 6 • 14

upvoted 2 papers about 2 months ago

GUI Agents: A Survey

Paper • 2412.13501 • Published Dec 18, 2024 • 25

Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published Dec 11, 2024 • 44

upvoted a paper 2 months ago

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 16

upvoted 2 collections 2 months ago

Multimodal-SAE

Collection • The collection of the sae that hooked on llava • 4 items • Updated Jan 6 • 6

GUI agents

Collection • A collection of papers on GUI agents • 3 items • Updated Dec 14, 2024 • 5

upvoted 2 papers 2 months ago

Granite Guardian

Paper • 2412.07724 • Published Dec 10, 2024 • 18

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 126

upvoted an article 3 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Article • Nov 19, 2024 • 11

upvoted a paper 3 months ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 58

upvoted an article 3 months ago

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Article • and 1 other • Nov 21, 2024 • 35

upvoted 3 papers 3 months ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published Nov 9, 2024 • 45

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published Nov 12, 2024 • 22

Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

Paper • 2411.05005 • Published Nov 7, 2024 • 13

upvoted 2 papers 4 months ago

GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation

Paper • 2410.20474 • Published Oct 27, 2024 • 14

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21, 2024 • 19