Learning UnkNown librAry

AI & ML interests

None defined yet.

Recent Activity

huybery authored a paper 3 days ago

Qwen2.5 Technical Report

huybery authored a paper 11 days ago

Evaluating and Aligning CodeLLMs on Human Preference

SivilTaram authored a paper about 1 month ago

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

View all activity

luna-code's activity

huybery

authored a paper 3 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 3 days ago • 290

huybery

authored a paper 11 days ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published 16 days ago • 47

SivilTaram

authored 3 papers about 1 month ago

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20 • 14

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Paper • 2407.10956 • Published Jul 15 • 6

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7 • 111

SivilTaram

authored a paper 2 months ago

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Paper • 2410.07137 • Published Oct 9 • 7

SivilTaram

authored a paper 3 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 60

huybery

authored a paper 3 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 138

huybery

authored 2 papers 4 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3 • 77

Synthesizing Text-to-SQL Data from Weak and Strong LLMs

Paper • 2408.03256 • Published Aug 6 • 10

huybery

authored a paper 5 months ago

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23 • 68

SivilTaram

authored a paper 5 months ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18 • 53

huybery

authored a paper 5 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 160

SivilTaram

posted an update 5 months ago

Post

2538

Still following your human intuition to mix corpora from different sources for pre-training 🧠? Everyone says that data mixture has a big impact on model performance, but how - and why🕵️? Did you know that web corpora are actually highly impactful for downstream tasks 🏆?

Check out our preprint "RegMix: Data Mixture as Regression for Language Model Pre-training" 📄

🔬 In this paper, we've proposed an automatic data mixture method RegMix that achieves a 6.3% improvement over human selection on the widely used HellaSwag benchmark - and it only needs a 2% extra training FLOPs! 📈

📄 Paper: RegMix: Data Mixture as Regression for Language Model Pre-training (2407.01492)
💻 Code: https://github.com/sail-sg/regmix
📊 Collection: sail/regmix-data-mixture-as-regression-6682b6caab37b9442877f0ce
🎮 Demo: https://huggingface.co/spaces/sail/RegMix

SivilTaram

authored 3 papers 6 months ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1 • 35

Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs

Paper • 2406.09136 • Published Jun 13

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22 • 45

terryyz

authored a paper 6 months ago

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22 • 45

SivilTaram

authored 2 papers 6 months ago

On Grounded Planning for Embodied Tasks with Language Models

Paper • 2209.00465 • Published Aug 29, 2022

ARKS: Active Retrieval in Knowledge Soup for Code Generation

Paper • 2402.12317 • Published Feb 19