Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
chuxin-llm
/
Scaling-Laws-for-Local-SGD-in-LLM-Intermediate-Checkpoints
like
0
Follow
chuxin
14
arxiv:
2409.13198
License:
mit
Model card
Files
Files and versions
Community
main
Scaling-Laws-for-Local-SGD-in-LLM-Intermediate-Checkpoints
/
base
1 contributor
History:
1 commit
colourful-tree
Upload 40 files
e1b5902
verified
about 1 month ago
base_0.005b
Upload 40 files
about 1 month ago
base_0.012b
Upload 40 files
about 1 month ago
base_0.025b
Upload 40 files
about 1 month ago
base_0.05b
Upload 40 files
about 1 month ago
base_0.1b
Upload 40 files
about 1 month ago
base_0.2b
Upload 40 files
about 1 month ago
base_0.4b
Upload 40 files
about 1 month ago
base_0.8b
Upload 40 files
about 1 month ago