A wider BabyBERTa model trained with curriculum learning and layer stacking for the Strict-Small track of the BabyLM Challenge.
