-
Aligning Teacher with Student Preferences for Tailored Training Data Generation
Paper • 2406.19227 • Published • 24 -
Pre-training Distillation for Large Language Models: A Design Space Exploration
Paper • 2410.16215 • Published • 15 -
Baichuan Alignment Technical Report
Paper • 2410.14940 • Published • 49 -
MiniPLM: Knowledge Distillation for Pre-Training Language Models
Paper • 2410.17215 • Published • 14
By
ByRookie
AI & ML interests
None yet
Recent Activity
liked
a Space
4 days ago
HuggingFaceH4/blogpost-scaling-test-time-compute
upvoted
a
paper
8 days ago
Phi-4 Technical Report
Organizations
Collections
7
models
None public yet
datasets
None public yet