Model Card: Model ID

License

MIT License

Languages Supported

  • English (en)

Overview

This model is part of the VCC project and has been fine-tuned on the TESTtm7873/ChatCat dataset using the mistralai/Mistral-7B-Instruct-v0.2 as the base model. The fine-tuning process utilized QLoRA for improved performance.


Getting Started

To use this model, you'll need to set up your environment first:

Model initialization

from transformers import AutoTokenizer, AutoModelForCausalLM, GenerationConfig
from peft import PeftModel
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.2")
model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-Instruct-v0.2",
    load_in_8bit=True,
    device_map="auto",
)
model = PeftModel.from_pretrained(model, "TESTtm7873/MistralCat-1v")
model.eval()

Inference

def evaluate(question: str) -> str:
    prompt = f"The conversation between human and Virtual Cat Companion.\n[|Human|] {question}.\n[|AI|] "
    inputs = tokenizer(prompt, return_tensors="pt")
    input_ids = inputs["input_ids"].cuda()
    generation_output = model.generate(
        input_ids=input_ids,
        generation_config=generation_config,
        return_dict_in_generate=True,
        output_scores=True,
        max_new_tokens=256
    )
    output = tokenizer.decode(generation_output.sequences[0]).split("[|AI|]")[1]
    return output
your_question: str = "You have the softest fur."
print(evaluate(your_question))
  • Developed by: testtm
  • Funded by: Project TEST
  • Model type: Mistral
  • Language: English
  • Finetuned from model: mistralai/Mistral-7B-Instruct-v0.2
Downloads last month
8
Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for TESTtm7873/MistralCat-1v

Adapter
(893)
this model

Dataset used to train TESTtm7873/MistralCat-1v