Yang Lee

innovation64

AI & ML interests

AGI

Organizations

Gradio-Blocks-Party, The Waifu Research Department, Blog-explorers, That Time I got Reincarnated as a Hugging Face Organization, ZeroGPU Explorers, Chinese LLMs on Hugging Face, Apocalypse-AGI-DAO

innovation64's activity

updated a Space 2 months ago
upvoted 2 articles 5 months ago

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

42

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

8