Scalable Data Ablation Approximations for Language Models through Modular Training and Merging Paper • 2410.15661 • Published Oct 21, 2024
Scalable Data Ablations Collection Datasets and models for EMNLP paper "Scalable Data Ablation Approximations for Language Models through Modular Training and Merging" • 34 items • Updated Oct 25, 2024 • 1
Paloma: A Benchmark for Evaluating Language Model Fit Paper • 2312.10523 • Published Dec 16, 2023 • 12
Scalable Data Ablations Collection Datasets and models for EMNLP paper "Scalable Data Ablation Approximations for Language Models through Modular Training and Merging" • 34 items • Updated Oct 25, 2024 • 1
OLMo Suite Collection Artifacts for the first set of OLMo models. • 18 items • Updated 5 days ago • 70
Scalable Data Ablations Collection Datasets and models for EMNLP paper "Scalable Data Ablation Approximations for Language Models through Modular Training and Merging" • 34 items • Updated Oct 25, 2024 • 1