
Junjie Chen

coderchen01

https://junjie-chen.info

AI & ML interests

Efficient AI, Multimodal AI, Generative AI

Recent Activity

upvoted a paper about 2 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

liked a dataset 2 months ago

microsoft/SCBench

liked a Space 2 months ago

HuggingFaceH4/blogpost-scaling-test-time-compute


Organizations

None yet

coderchen01's activity

upvoted a paper about 2 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 133

liked a dataset 2 months ago

microsoft/SCBench

Viewer • Updated Dec 24, 2024 • 922 • 1.25k • 6

liked 2 Spaces 2 months ago

513

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

Number Tokenization Blog

📈

Explore how tokenization affects arithmetic in LLMs

upvoted a paper 2 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 127

liked a model 3 months ago

nvidia/Hymba-1.5B-Instruct

Text Generation • Updated Jan 2 • 1.92k • 224

liked a Space 3 months ago

619

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection

liked a model 3 months ago

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • Updated Dec 2, 2024 • 116k • 379

upvoted a paper 3 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 51

upvoted an article 3 months ago

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

Article • May 21, 2024 • 35

upvoted a paper 3 months ago

SlimLM: An Efficient Small Language Model for On-Device Document Assistance

Paper • 2411.09944 • Published Nov 15, 2024 • 12

updated a dataset 3 months ago

coderchen01/HarmfulGeneration-HarmBench

Viewer • Updated Nov 20, 2024 • 9.61k • 35 • 3

liked a dataset 3 months ago

Babelscape/ALERT

Viewer • Updated Jun 20, 2024 • 45.7k • 162 • 11

upvoted a paper 3 months ago

Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12, 2024 • 20

liked a Space 3 months ago

131

Hallucinations Leaderboard

🔥

View and submit LLM evaluations

upvoted a paper 3 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 48

upvoted an article 4 months ago

🕳️ Attention Sinks in LLMs for endless fluency

Article • Oct 9, 2023 • 7

upvoted a paper 4 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 145

upvoted 2 articles 4 months ago

Scaling AI-based Data Processing with Hugging Face + Dask

Article • Oct 9, 2024 • 28

How 🤗 Accelerate runs very large models thanks to PyTorch

Article • Sep 27, 2022 • 10