YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Quantization made by Richard Erkhov.

Github

Discord

Request more models

delta-4b-orange - bnb 8bits

Original model description:

widget: - text: Hello, My name is Junpei Iori, who are you? example_title: Identity - text: Describe Aurora Borealis example_title: Capabilities - text: Create a fastapi endpoint to retrieve the weather given a zip code. example_title: Coding license: apache-2.0 language: - en pipeline_tag: text-generation

delta-4b-orange is frankenmerge of phi-2-orange-v2. The purpose is to create 4B parameters model based on Phi-2.

馃捇 Usage

!pip install -qU transformers accelerate

from transformers import AutoTokenizer
import transformers
import torch

model = "gmonsoon/delta-4b-orange"
messages = [{"role": "user", "content": "What is a large language model?"}]

tokenizer = AutoTokenizer.from_pretrained(model)
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    torch_dtype=torch.float16,
    device_map="auto",
)

outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])
Downloads last month
4
Safetensors
Model size
4.67B params
Tensor type
F32
FP16
I8
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.