I am trying to convert the model into BitsAndBytes with NF4 (4-bit)
#1 opened by sabaridsnfuji
Could you guide me on how to save a fine-tuned LoRA model for Llama-3.2-11B-Vision-Instruct in 4-bit (NF4) precision for optimized inference, similar to the repository SeanScripts/Llama-3.2-11B-Vision-Instruct-nf4?