
SakanaAI/TinySwallow-1.5B
Text Generation • 1.99k • 23
Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"