Edit model card

Berry_v2_7B

Berry_v2_7B is a merge of the following models using LazyMergekit:

🧩 Configuration

models:
  - model: jeiku/BerryBase+ResplendentAI/Qwen_Soul_LoRA_128
  - model: jeiku/BerryBase+ResplendentAI/Qwen_jeiku_LoRA_128
merge_method: model_stock
base_model: jeiku/qwen2base
dtype: bfloat16

πŸ’» Usage

!pip install -qU transformers accelerate

from transformers import AutoTokenizer
import transformers
import torch

model = "jeiku/Berry_v2_7B"
messages = [{"role": "user", "content": "What is a large language model?"}]

tokenizer = AutoTokenizer.from_pretrained(model)
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    torch_dtype=torch.float16,
    device_map="auto",
)

outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])
Downloads last month
45
Safetensors
Model size
7.62B params
Tensor type
BF16
Β·
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.