Yang Lee

innovation64

AI & ML interests

AGI

Organizations

innovation64's activity

upvoted 2 articles 4 months ago
view article
Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

39
view article
Article

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

7
upvoted an article 5 months ago