view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka โข Nov 19, 2024 โข 105
Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation Paper โข 2407.10817 โข Published Jul 15, 2024 โข 14
Flow Judge v0.1 held-out test datasets Collection This collection contains held-out splits for testing Flow-Judge-v0.1. โข 4 items โข Updated Sep 14, 2024 โข 2
Flow-Judge-v0.1 out-of-domain evaluation datasets Collection This collection contains out-of-domain datasets used to evaluate the generalization capabilities of Flow-Judge-v0.1 โข 5 items โข Updated Sep 13, 2024 โข 1
๐ช SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos โข 12 items โข Updated Dec 22, 2024 โข 213
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Paper โข 2311.03099 โข Published Nov 6, 2023 โข 29
Model Merging Papers Collection Collection of relevant papers about model merging โข 13 items โข Updated Apr 2, 2024 โข 6