7 3 53

jayavibhavnk

jayavibhav

AI & ML interests

None yet

Recent Activity

upvoted a paper 30 days ago

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

liked a model 3 months ago

Datou1111/flux-sincity-movie

liked a model 3 months ago

Datou1111/shou_xin

View all activity

Organizations

jayavibhav's activity

upvoted a paper 30 days ago

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12, 2024 • 67

liked 2 models 3 months ago

Datou1111/flux-sincity-movie

Text-to-Image • Updated Sep 9, 2024 • 68 • • 16

Datou1111/shou_xin

Text-to-Image • Updated Dec 9, 2024 • 2.24k • 864

liked 4 datasets 3 months ago

liked a model 3 months ago

Djrango/Qwen2vl-Flux

Text-to-Image • Updated Dec 6, 2024 • 466

liked 2 datasets 3 months ago

5CD-AI/LLaVA-CoT-o1-Instruct

Viewer • Updated Nov 27, 2024 • 58.5k • 290 • 92

yc4142/bias-CoT

Viewer • Updated Dec 31, 2023 • 6.37k • 118 • 6

liked a model 3 months ago

microsoft/LLM2CLIP-Llama-3-8B-Instruct-CC-Finetuned

Zero-Shot Classification • Updated Nov 19, 2024 • 4.16k • 32

updated a model 3 months ago

jayavibhav/llama3.2_11B_Vision_Maths_Geometry

Updated Nov 24, 2024

liked a dataset 3 months ago

lmms-lab/LLaVA-OneVision-Data

Viewer • Updated Oct 22, 2024 • 3.72M • 27.1k • 167

liked a model 3 months ago

Xkev/Llama-3.2V-11B-cot

Image-Text-to-Text • Updated Dec 16, 2024 • 2.37k • 147

liked a dataset 3 months ago

charanhu/kannada-instruct-dataset-390k

Viewer • Updated Oct 12, 2024 • 390k • 118 • 2

liked a model 3 months ago

charanhu/kannada-tokenizer

Text Generation • Updated Nov 12, 2024 • 2

reacted to MonsterMMORPG's post with ❤️ 3 months ago

Post

2670

Kohya brought massive improvements to FLUX LoRA (as low as 4 GB GPUs) and DreamBooth / Fine-Tuning (as low as 6 GB GPUs) training - check attached images in full size to see full details

You can download all configs and full instructions

> https://www.patreon.com/posts/112099700 - Fine Tuning post

> https://www.patreon.com/posts/110879657 - LoRA post

Kohya brought massive improvements to FLUX LoRA and DreamBooth / Fine-Tuning (min 6GB GPU) training.

Now as low as 4GB GPUs can train FLUX LoRA with decent quality and 24GB and below GPUs got a huge speed boost when doing Full DreamBooth / Fine-Tuning training

You need minimum 4GB GPU to do a FLUX LoRA training and minimum 6 GB GPU to do FLUX DreamBooth / Full Fine-Tuning training. It is just mind blowing.

You can download all configs and full instructions > https://www.patreon.com/posts/112099700

The above post also has 1-click installers and downloaders for Windows, RunPod and Massed Compute

The model downloader scripts also updated and downloading 30+GB models takes total 1 minute on Massed Compute

You can read the recent updates here : https://github.com/kohya-ss/sd-scripts/tree/sd3?tab=readme-ov-file#recent-updates

This is the Kohya GUI branch : https://github.com/bmaltais/kohya_ss/tree/sd3-flux.1

Key thing to reduce VRAM usage is using block swap

Kohya implemented the logic of OneTrainer to improve block swapping speed significantly and now it is supported for LoRAs as well

Now you can do FP16 training with LoRAs on 24 GB and below GPUs

Now you can train a FLUX LoRA on a 4 GB GPU - key is FP8, block swap and using certain layers training (remember single layer LoRA training)

It took me more than 1 day to test all newer configs, their VRAM demands, their relative step speeds and prepare the configs :)

upvoted a paper 3 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 114

updated 2 models 3 months ago

jayavibhav/arcane

Text-to-Image • Updated Nov 18, 2024 • 48 • • 2

jayavibhav/Llama3.2_1B_Cot_LoRa

Updated Nov 17, 2024