help

#1
by Abhaykoul - opened

How do I use it from Transformers?

Owner

It should be exactly the same as Mixtral.

Can you please give Transformers code to run it?

Owner
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda"  # the device to load the model onto

model = AutoModelForCausalLM.from_pretrained("eastwind/tinymix-8x1b-chat")
tokenizer = AutoTokenizer.from_pretrained("eastwind/tinymix-8x1b-chat")

prompt = "My favourite condiment is"

model_inputs = tokenizer([prompt], return_tensors="pt").to(device)
model.to(device)

generated_ids = model.generate(**model_inputs, max_new_tokens=100, do_sample=True)
print(tokenizer.batch_decode(generated_ids)[0])
# "The expected output"
```

taken from https://huggingface.co/docs/transformers/model_doc/mixtral

You might also want to format the prompt into ChatML, as described in the README.
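For reference, a minimal sketch of wrapping a prompt in the ChatML turn format. The exact system prompt and any model-specific details come from the model's README; the `to_chatml` helper and its default system message here are illustrative, not part of the model card.

```python
# Hedged sketch: ChatML wraps each turn in <|im_start|>role ... <|im_end|>
# markers; the default system message below is an assumption for illustration.
def to_chatml(user_message: str,
              system_message: str = "You are a helpful assistant.") -> str:
    """Wrap a single user message in the ChatML chat format."""
    return (
        f"<|im_start|>system\n{system_message}<|im_end|>\n"
        f"<|im_start|>user\n{user_message}<|im_end|>\n"
        f"<|im_start|>assistant\n"
    )

prompt = to_chatml("My favourite condiment is")
```

The resulting `prompt` string can be passed to the tokenizer in place of the plain string in the snippet above. If the repository ships a chat template, `tokenizer.apply_chat_template(...)` would do the same job without a hand-written helper.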

thanks

eastwind changed discussion status to closed
