Open Science

community

AI & ML interests

None defined yet.

Recent Activity

open-science's activity

eliebak 
posted an update 7 months ago
Wow, impressive 340B model by NVIDIA with a nice permissive license! 🚀 The technical report is full of insights, and it seems to use a learning rate schedule other than cosine, probably a variant of WSD (warmup-stable-decay). Hope to get more info on that! 👀

nvidia/nemotron-4-340b-666b7ebaf1b3867caf2f1911