Edit model card

🧩 Configuration

models:
- model: liminerity/M7-7b
  # No parameters necessary for base model
- model: AurelPx/Percival_01-7b-slerp
  parameters:
    density: 0.53
    weight: 0.6
merge_method: dare_ties
base_model: liminerity/M7-7b
parameters:
int8_mask: true
dtype: bfloat16
random_seed: 0

πŸ’» Usage

!pip install -qU transformers accelerate
from transformers import AutoTokenizer
import transformers
import torch
model = "Ksgk-fy/M7Percival_01-7B"
messages = [{"role": "user", "content": "What is a large language model?"}]
tokenizer = AutoTokenizer.from_pretrained(model)
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    torch_dtype=torch.float16,
    device_map="auto",
)
outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])
Downloads last month
0
Safetensors
Model size
7.24B params
Tensor type
BF16
Β·
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.

Finetuned from