1 209 14

Chan Kim

chanmuzi

chanmuzi

AI & ML interests

None yet

Recent Activity

upvoted a paper about 10 hours ago

Preference Leakage: A Contamination Problem in LLM-as-a-judge

upvoted a paper 5 days ago

Large Language Models Think Too Fast To Explore Effectively

upvoted a paper 7 days ago

Optimizing Large Language Model Training Using FP4 Quantization

View all activity

Organizations

chanmuzi's activity

upvoted a paper about 10 hours ago

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published 3 days ago • 34

upvoted a paper 5 days ago

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published 8 days ago • 22

upvoted a paper 7 days ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 9 days ago • 32

upvoted an article 11 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

14 days ago

• 119

upvoted a paper 18 days ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 21 days ago • 67

upvoted a paper 22 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 23 days ago • 272

upvoted a paper 26 days ago

Entropy-Guided Attention for Private LLMs

Paper • 2501.03489 • Published about 1 month ago • 14

upvoted a paper 30 days ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published Jan 3 • 18

upvoted 8 papers about 1 month ago

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published Dec 31, 2024 • 25

Xmodel-2 Technical Report

Paper • 2412.19638 • Published Dec 27, 2024 • 25

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 72

A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression

Paper • 2412.17483 • Published Dec 23, 2024 • 31

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published Dec 20, 2024 • 17

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 40

Outcome-Refining Process Supervision for Code Generation

Paper • 2412.15118 • Published Dec 19, 2024 • 19

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 46

upvoted 4 papers about 2 months ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published Dec 18, 2024 • 50

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Paper • 2412.09645 • Published Dec 10, 2024 • 35

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 89

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published Dec 16, 2024 • 7