Method to get 8bit quantized model

#1
by kitaharatomoyo - opened

Can you tell me how you get the 8bit quantized model from falcon-7b?
I want to get my own 8bit quantized model from finetuned falcon-7b model.

Cambio ML org

Hi @kitaharatomoyo , you can load the model in 8-bit directly from the Hugging Face Hub as below:

from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", load_in_8bit=True, trust_remote_code=True)

MODEL_SAVE_FOLDER_NAME = "falcon-7b-8bit"
model.save_pretrained(MODEL_SAVE_FOLDER_NAME)
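
Note that in newer versions of transformers, passing `load_in_8bit=True` directly to `from_pretrained` is deprecated in favor of a `BitsAndBytesConfig` object. A minimal sketch of the newer form (8-bit loading still requires the bitsandbytes package and a CUDA GPU):

```python
from transformers import BitsAndBytesConfig

# 8-bit quantization settings (applied by bitsandbytes at load time)
bnb_config = BitsAndBytesConfig(load_in_8bit=True)

# Then pass it to from_pretrained, e.g.:
# model = AutoModelForCausalLM.from_pretrained(
#     model_id,
#     device_map="auto",
#     quantization_config=bnb_config,
#     trust_remote_code=True,
# )
```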