Sayak Paul's picture

Sayak Paul

sayakpaul

·

https://sayak.dev

AI & ML interests

Diffusion models, representation learning

Recent Activity

updated a model about 13 hours ago

sayakpaul/different-lora-from-civitai

updated a dataset 1 day ago

sayakpaul/vae-sd-imagenet-256-latents

updated a dataset 1 day ago

sayakpaul/vae-sd-imagenet-256-latents

View all activity

Articles

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

Memory-efficient Diffusion Transformers with Quanto and Diffusers

🧨 Diffusers welcomes Stable Diffusion 3

🤗 PEFT welcomes new merging methods

Welcome aMUSEd: Efficient Text-to-Image Generation

SDXL in 4 steps with Latent Consistency LoRAs

Personal Copilot: Train Your Own Coding Assistant

Exploring simple optimizations for SDXL

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Efficient Controllable Generation for SDXL with T2I-Adapters

Happy 1st anniversary 🤗 Diffusers!

Optimizing Stable Diffusion for Intel CPUs with NNCF and 🤗 Optimum

Instruction-tuning Stable Diffusion with InstructPix2Pix

Training a language model with 🤗 Transformers using TensorFlow and TPUs

ControlNet in Diffusers 🧨

🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware

A Dive into Pretraining Strategies for Vision-Language Models

The State of Computer Vision at Hugging Face 🤗

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Image Similarity with Hugging Face Datasets and Transformers

Deploying 🤗 ViT on Vertex AI

Deploying 🤗 ViT on Kubernetes with TF Serving

Deploying TensorFlow Vision Models in Hugging Face with TF Serving

Organizations

Posts 18

Post

3918

Commits speak louder than words 🤪

* 4 new video models
* Multiple image models, including SANA & Flux Control
* New quantizers -> GGUF & TorchAO
* New training scripts

Enjoy this holiday-special Diffusers release 🤗
Notes: https://github.com/huggingface/diffusers/releases/tag/v0.32.0

Post

1825

In the past seven days, the Diffusers team has shipped:

1. Two new video models
2. One new image model
3. Two new quantization backends
4. Three new fine-tuning scripts
5. Multiple fixes and library QoL improvements

Coffee on me if someone can guess 1 - 4 correctly.

Collections 2

Papers 13

arxiv:2412.03895

arxiv:2412.01487

arxiv:2408.13467

arxiv:2406.06424

spaces 19

Demo Docker Gradio

Diffusers Docs QA Chatbot

Ask questions to the Diffusers documentation.

Convert Kerascv SD to Diffusers

Inpainting Tool

Generate Custom Pokemons with Stable Diffusion

Evaluate StableDiffusionPipeline with Different Schedulers

models 63

sayakpaul/different-lora-from-civitai

Updated about 13 hours ago • 1

sayakpaul/edit-control-lr_1e-4-wd_1e-4-gs_15.0-cd_0.1

Text-to-Image • Updated 4 days ago • 16 • 2

sayakpaul/cartoon-control-lr_1e-4-wd_1e-4-gs_10.0-cd_0.1

Text-to-Image • Updated 6 days ago • 17 • 5

sayakpaul/q8-ltx-video

Updated 9 days ago • 106 • 3

sayakpaul/yarn_art_lora_sana

Text-to-Image • Updated 26 days ago • 20

sayakpaul/pose-control-lora

Text-to-Image • Updated Dec 9, 2024 • 16 • 1

sayakpaul/bnb-single-file-checkpoint-from-civitai

Updated Dec 4, 2024 • 8

sayakpaul/mochi-lora-dissolve

Text-to-Video • Updated Nov 29, 2024 • 6 • 2

sayakpaul/mochi-lora

Text-to-Video • Updated Nov 29, 2024 • 37 • 3

sayakpaul/FLUX.1-Canny-dev-nf4

Updated Nov 24, 2024 • 2

datasets 29

sayakpaul/vae-sd-imagenet-256-latents

Updated 1 day ago • 78 • 2

sayakpaul/OmniEdit-mini

Viewer • Updated 6 days ago • 21.1k • 13

sayakpaul/sample-datasets

Viewer • Updated Dec 5, 2024 • 6 • 13.7k • 1

sayakpaul/video-dataset-disney-organized

Viewer • Updated Nov 29, 2024 • 69 • 352 • 5

sayakpaul/pick-a-pic-v2-unique-prompts

Viewer • Updated Nov 9, 2024 • 59k • 32

sayakpaul/poses-controlnet-dataset

Viewer • Updated Aug 29, 2024 • 496 • 43 • 5

sayakpaul/torchao-diffusers

Updated Aug 28, 2024 • 8

sayakpaul/pickapic_v2_webdataset

Viewer • Updated Apr 4, 2024 • 8.7k • 13.7k

sayakpaul/generated-gemini-responses

Viewer • Updated Apr 1, 2024 • 115 • 32

sayakpaul/no_robots_only_coding

Viewer • Updated Mar 20, 2024 • 350 • 32 • 1