Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning Paper • 2502.06533 • Published 6 days ago • 16
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 4 days ago • 119
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning Paper • 2502.03275 • Published 11 days ago • 12
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 12 days ago • 168
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 17 days ago • 53
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 18 days ago • 54
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 25 days ago • 319
Do generative video models learn physical principles from watching videos? Paper • 2501.09038 • Published Jan 14 • 32
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 273
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 78
EXAONE-3.5 Collection EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated Dec 10, 2024 • 94
Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 12 days ago • 36
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 126
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability Paper • 2411.19943 • Published Nov 29, 2024 • 58