question

#1
by giannisan - opened

I'm interested in fine-tuning the ViViT model with low rank adaptation, how did you manage this?

Owner

You can use the same LoRA config as the ViT implementation; the target modules should also match the ones in your architecture. It works pretty well for fine-tuning.

Thanks for the feedback! I'm pretty new to vision models, as I'm coming from LLMs. Do you have documentation for this, or can you point me to where I can find how it's done? I saw a post on HF about ViT LoRA, but I'm not sure if that's what you meant. Thanks again for the help.

Owner

Yeah, I usually work on vision models. This should give you a clear idea:
https://huggingface.co/docs/peft/task_guides/image_classification_lora

NiiCole changed discussion status to closed


Thanks!
