arxiv:2410.08261
Zhen Dong
zhendongucb
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
No More Adam: Learning Rate Scaling at Initialization is All You Need
liked a model about 1 month ago
Nexusflow/Athene-V2-Chat
liked a model about 1 month ago
Nexusflow/Athene-V2-Agent
Organizations
Papers
30
Models
None public yet
Datasets
None public yet