arxiv:2410.13754
Zheng Zian(Andy)
OrionZheng
AI & ML interests
LLM, Mixture-of-Experts, Data-Centric AI
Recent Activity
liked
a dataset
about 1 month ago
MixEval/MixEval-X
authored
a paper
about 1 month ago
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
authored
a paper
about 1 month ago
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures
Organizations
None yet
Papers
2
models
11
OrionZheng/openmoe-34b-200B
Text Generation
•
Updated
•
13
•
11
OrionZheng/openmoe-8b-chat
Text Generation
•
Updated
•
10
•
8
OrionZheng/openmoe-8b
Text Generation
•
Updated
•
11
•
3
OrionZheng/openmoe-8b-1T
Text Generation
•
Updated
•
88
•
2
OrionZheng/openmoe-8b-800B
Text Generation
•
Updated
•
9
•
1
OrionZheng/openmoe-8b-600B
Text Generation
•
Updated
•
5
OrionZheng/openmoe-8b-400B
Text Generation
•
Updated
•
15
OrionZheng/openmoe-8b-200B
Text Generation
•
Updated
•
11
•
2
OrionZheng/openmoe-base
Text Generation
•
Updated
•
835
•
4
OrionZheng/openmoe-8b-890B
Text Generation
•
Updated
•
4
datasets
None public yet