speechless-mistral-moloras-7b

4-bit GGUF models for CPU+GPU inference

This model is the static version of moloras (Mixture-of-multi-LoRAs) based on the following 6 Mistral-based LoRa modules.

  • Intel/neural-chat-7b-v3-1
  • migtissera/SynthIA-7B-v1.3
  • jondurbin/airoboros-m-7b-3.1.2
  • bhenrym14/mistral-7b-platypus-fp16
  • teknium/CollectiveCognition-v1.1-Mistral-7B
  • uukuguy/speechless-mistral-dolphin-orca-platypus-samantha-7b

Totally 6 LoRA modules from speechless-mistral-7b-dare-0.85

The router of mixture-of-multi-loras enables an automatic assembling of LoRA modules, using a gradientfree approach to obtain the coefficients of LoRA modules and requiring only a handful of inference steps for unseen tasks.

Code: https://github.com/uukuguy/multi_loras?tab=readme-ov-file#mixture-of-multi-loras

LM-Evaluation-Harness

Open LLM Leaderboard

Metric Value
ARC 59.98
HellaSwag 83.29
MMLU 64.12
TruthfulQA 42.15
Winogrande 78.37
GSM8K 37.68
Average 60.93
Downloads last month
728
Safetensors
Model size
7.24B params
Tensor type
FP16
Β·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for uukuguy/speechless-mistral-moloras-7b

Quantizations
6 models

Dataset used to train uukuguy/speechless-mistral-moloras-7b

Spaces using uukuguy/speechless-mistral-moloras-7b 6

Collection including uukuguy/speechless-mistral-moloras-7b