Large Models
Collection
things I've been working on for large models (80B+)
•
2 items
•
Updated
Mistral-Large-218B-Instruct is a dense Large Language Model (LLM) with 218 billion parameters. Self-merged from the original Mistral Large 2.
Given the size of this model (218B parameters), it requires substantial computational resources for inference:
This was just a fun testing model, merged with the merge.py
script in the base of the repo.
GGUF: mradermacher/Mistral-Large-218B-Instruct-GGUF
imatrix GGUF: mradermacher/Mistral-Large-218B-Instruct-i1-GGUF
Compatible mergekit
config:
slices:
- sources:
- layer_range: [0, 20]
model: mistralai/Mistral-Large-Instruct-2407
- sources:
- layer_range: [10, 30]
model: mistralai/Mistral-Large-Instruct-2407
- sources:
- layer_range: [20, 40]
model: mistralai/Mistral-Large-Instruct-2407
- sources:
- layer_range: [30, 50]
model: mistralai/Mistral-Large-Instruct-2407
- sources:
- layer_range: [40, 60]
model: mistralai/Mistral-Large-Instruct-2407
- sources:
- layer_range: [50, 70]
model: mistralai/Mistral-Large-Instruct-2407
- sources:
- layer_range: [60, 80]
model: mistralai/Mistral-Large-Instruct-2407
- sources:
- layer_range: [70, 87]
model: mistralai/Mistral-Large-Instruct-2407
merge_method: passthrough
dtype: bfloat16