Julien BLANCHON's picture

Julien BLANCHON PRO

blanchon

·

AI & ML interests

Math

Recent Activity

published a dataset about 2 hours ago

blanchon/DESOBAv2

liked a model about 4 hours ago

hustvl/lightningdit-xl-imagenet256-800ep

liked a model about 5 hours ago

black-forest-labs/FLUX.1-Depth-dev-lora

View all activity

Organizations

blanchon's activity

upvoted a paper 3 days ago

Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding

Paper • 2401.04575 • Published Jan 9, 2024 • 17

upvoted a collection 4 days ago

Nomic Embed v2

Multilingual Embedding Models • 4 items • Updated about 22 hours ago • 10

upvoted 2 papers 4 days ago

VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation

Paper • 2502.07531 • Published 5 days ago • 11

Generating Multi-Image Synthetic Data for Text-to-Image Customization

Paper • 2502.01720 • Published 13 days ago • 6

upvoted 2 collections 4 days ago

Terminus XL

v-prediction SDXL clone with zero-terminal SNR noise schedule • 8 items • Updated Apr 24, 2024 • 7

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 72

upvoted 2 collections 5 days ago

Ultravox v0.5

Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (fine-tuned) backbone. • 3 items • Updated 6 days ago • 5

R3GAN

R3GAN: A Modern BaselineGAN https://github.com/brownvc/R3GAN/ https://arxiv.org/abs/2501.05441 • 7 items • Updated Jan 10 • 10

upvoted 3 papers 5 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 88

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published 10 days ago • 32

Material Anything: Generating Materials for Any 3D Object via Diffusion

Paper • 2411.15138 • Published Nov 22, 2024 • 44

upvoted a paper 6 days ago

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published 9 days ago • 80

upvoted an article 11 days ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

13 days ago

• 98

upvoted a paper 14 days ago

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

Paper • 2412.16112 • Published Dec 20, 2024 • 22

upvoted 2 papers 16 days ago

Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion

Paper • 2402.03162 • Published Feb 5, 2024 • 19

Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

Paper • 2401.15977 • Published Jan 29, 2024 • 38

upvoted an article 24 days ago

Article

Finetune Stable Diffusion Models with DDPO via TRL

Sep 29, 2023

• 10

upvoted a collection 24 days ago

Open Image Preferences

Containing all artifacts for the Stable Diffusion 3.5L vs Flux Dev image preference community sprint. • 14 items • Updated Dec 19, 2024 • 9

upvoted an article 24 days ago

Article

Crowd-sourced Open Preference Dataset for Text-to-Image Generation

By

and 4 others •

Jan 7

• 18

upvoted a collection about 1 month ago

Lucie LLM

Open source LLM for French, English, German, Spanish and Italian • 8 items • Updated 12 days ago • 19