
L3.1-Moe-4x8B-v0.1


This model is a Mixture of Experts (MoE) made with mergekit-moe. It uses the following base models:

- argilla-warehouse/Llama-3.1-8B-MagPie-Ultra
- sequelbox/Llama3.1-8B-PlumCode
- sequelbox/Llama3.1-8B-PlumMath
- ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.2

Heavily inspired by mlabonne/Beyonder-4x7B-v3.
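
As a quick start, here is a minimal inference sketch using Hugging Face Transformers; the prompt and generation settings are placeholders rather than an official recipe:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "moeru-ai/L3.1-Moe-4x8B-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Llama 3.1 chat template; the question itself is just an example.
messages = [{"role": "user", "content": "Explain how a mixture-of-experts model routes tokens."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```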

Quantized models

GGUF by mradermacher
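
For the GGUF quantizations, a sketch with llama-cpp-python is shown below; the file name and context/offload settings are assumptions, so substitute whichever quant you download from the mradermacher repository:

```python
from llama_cpp import Llama

# Placeholder file name and settings; pick any GGUF quant level you prefer.
llm = Llama(
    model_path="L3.1-Moe-4x8B-v0.1.Q4_K_M.gguf",
    n_ctx=8192,
    n_gpu_layers=-1,  # offload all layers to GPU if one is available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Write a short scene about a lighthouse keeper."}]
)
print(out["choices"][0]["message"]["content"])
```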

Configuration

base_model: argilla-warehouse/Llama-3.1-8B-MagPie-Ultra
gate_mode: hidden
dtype: bfloat16
experts:
  - source_model: argilla-warehouse/Llama-3.1-8B-MagPie-Ultra
    positive_prompts:
      - "chat"
      - "assistant"
      - "tell me"
      - "explain"
      - "I want"
  - source_model: sequelbox/Llama3.1-8B-PlumCode
    positive_prompts:
      - "code"
      - "python"
      - "javascript"
      - "programming"
      - "algorithm"
  - source_model: sequelbox/Llama3.1-8B-PlumMath
    positive_prompts:
      - "reason"
      - "math"
      - "mathematics"
      - "solve"
      - "count"
  - source_model: ArliAI/Llama-3.1-8B-ArliAI-RPMax-v1.2
    positive_prompts:
      - "storywriting"
      - "write"
      - "scene"
      - "story"
      - "character"

Open LLM Leaderboard Evaluation Results

Detailed results can be found on the Open LLM Leaderboard.

| Metric | Value |
|---|---|
| Avg. | 19.15 |
| IFEval (0-shot) | 43.47 |
| BBH (3-shot) | 27.86 |
| MATH Lvl 5 (4-shot) | 11.10 |
| GPQA (0-shot) | 1.23 |
| MuSR (0-shot) | 3.98 |
| MMLU-PRO (5-shot) | 27.27 |
Model size: 24.9B parameters (BF16 safetensors).
