From your work, I found a new way to do model ensembling (1)
#14 opened 7 months ago by xxx1

Adding Evaluation Results
#12 opened 8 months ago by leaderboard-pr-bot

The function_calling and translation abilities are weaker than Mixtral 8x7B (1)
#11 opened 10 months ago by bingw5

Add mixture of experts tag
#10 opened 10 months ago by davanstrien

How does this model work? Can you share your idea or training process? Thanks
#9 opened 10 months ago by zachzhou

Add merge tag (2)
#8 opened 10 months ago by osanseviero

VRAM (2)
#7 opened 10 months ago by DKRacingFan

Source code and paper? (8)
#6 opened 10 months ago by josephykwang

How does the MoE work? (3)
#5 opened 10 months ago by PacmanIncarnate

Quants, please? (6)
#4 opened 10 months ago by Yhyu13

What is your config? (1)
#3 opened 10 months ago by Weyaxi

Should not be called Mixtral; the models made into the MoE are Yi-based (9)
#2 opened 10 months ago by teknium

Add merge tags
#1 opened 10 months ago by JusticeDike