It's not every day you see the No. 1 ranked paper of the day open-sourcing a very powerful image editing app!
Fascinating to see MagicQuill, an interactive image editing system that makes precise photo editing feel effortless by combining diffusion-based editing with a multimodal LLM that predicts your editing intent!
The system's architecture features three sophisticated components:
1. Editing Processor:
- Implements a dual-branch architecture integrated into a latent diffusion framework
- Uses PiDiNet for edge-map extraction and content-aware per-pixel inpainting
- Features a specialized UNet with zero-convolution layers for feature insertion
- Trains the control branch with denoising score matching
- Handles structural edits via scribble guidance and color edits via downsampled color blocks
- Maintains pixel-level control through VAE-based latent-space operations
(a rough, off-the-shelf sketch of this scribble-guided inpainting idea follows the list below)
2. Painting Assistor:
- Powered by a LLaVA multimodal LLM fine-tuned with Low-Rank Adaptation (LoRA)
- Trained on a custom dataset derived from Densely Captioned Images (DCI)
- Interprets user brushstrokes through specialized Q&A tasks for add/subtract/color operations
- Normalizes bounding-box coordinates for precise stroke localization
- Emits streamlined single-word/phrase outputs for real-time performance
(a prompting sketch for this step also follows the list below)
3. Idea Collector:
- Built as a modular ReactJS component library
- Supports cross-platform deployment via HTTP protocols
- Compatible with Gradio and ComfyUI frameworks
- Features comprehensive layer management and parameter adjustment capabilities
- Implements real-time canvas updates and preview generation
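To make the Editing Processor concrete, here is a minimal sketch of the scribble-guided inpainting idea using stock diffusers components. This is not the authors' released code: MagicQuill trains its own dual-branch control module, whereas this stand-in uses a pretrained scribble ControlNet plus a PiDiNet annotator, and the model IDs, file names, and prompt are illustrative assumptions.

```python
# Rough approximation, NOT the MagicQuill release: it mimics "scribble-guided
# inpainting through a zero-convolution control branch" with stock diffusers parts.
import torch
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetInpaintPipeline
from controlnet_aux import PidiNetDetector  # PiDiNet edge extractor

# PiDiNet edge map from the user's source image (file names are placeholders)
pidinet = PidiNetDetector.from_pretrained("lllyasviel/Annotators")
source = Image.open("source.png").convert("RGB")
mask = Image.open("edit_mask.png").convert("L")   # white = region to repaint
edges = pidinet(source)                           # stands in for the user's scribbles

# A pretrained scribble ControlNet plays the role of the control branch
controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/control_v11p_sd15_scribble", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetInpaintPipeline.from_pretrained(
    "runwayml/stable-diffusion-inpainting", controlnet=controlnet,
    torch_dtype=torch.float16,
).to("cuda")

# Structural edit: the mask limits the change, the edge map steers the structure
result = pipe(
    prompt="add a small red scarf",   # in MagicQuill this comes from the Painting Assistor
    image=source,
    mask_image=mask,
    control_image=edges,
    num_inference_steps=30,
).images[0]
result.save("edited.png")
```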
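And here is a sketch of how a LoRA-tuned LLaVA could map a brushstroke, expressed as a normalized bounding box, to a short edit intent. The adapter path and the exact question wording are hypothetical; the post does not spell out MagicQuill's actual Q&A prompt format.

```python
# Sketch only: LoRA-adapted LLaVA answering a single-phrase "what should be added
# here?" question. The adapter id and prompt wording are assumptions.
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration
from peft import PeftModel

base_id = "llava-hf/llava-1.5-7b-hf"
processor = AutoProcessor.from_pretrained(base_id)
model = LlavaForConditionalGeneration.from_pretrained(
    base_id, torch_dtype=torch.float16
).to("cuda")
model = PeftModel.from_pretrained(model, "your-org/painting-assistor-lora")  # hypothetical adapter

image = Image.open("canvas_with_stroke.png").convert("RGB")
box = (0.42, 0.31, 0.58, 0.47)  # normalized x1, y1, x2, y2 of the brushstroke
question = (
    f"The user drew an additive brushstroke inside box {box}. "
    "In one word or a short phrase, what should be added there?"
)
prompt = f"USER: <image>\n{question} ASSISTANT:"

inputs = processor(images=image, text=prompt, return_tensors="pt").to("cuda", torch.float16)
out = model.generate(**inputs, max_new_tokens=10)  # short outputs keep latency low
print(processor.decode(out[0], skip_special_tokens=True))
```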
The system outperforms existing solutions like SmartEdit and BrushNet in edge alignment and color fidelity while maintaining seamless integration with popular AI frameworks.
What are your thoughts on AI-powered creative tools?
🎙️ Listen to an audio "podcast" of every single Hugging Face Daily Papers entry.
Now, "AI Paper Reviewer" project can automatically generates audio podcasts on any papers published on arXiv, and this is integrated into the GitHub Action pipeline. I sounds pretty similar to hashtag#NotebookLM in my opinion.
The audio podcast is powered by Google technologies: 1) Google DeepMind's Gemini 1.5 Flash model generates the podcast script, then 2) Google Cloud's Text-to-Speech model on Vertex AI synthesizes the voices, turning the script into natural-sounding speech (including the recently added "Journey" voice style).
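For illustration, here is a minimal sketch of that two-step flow using the public google-generativeai and google-cloud-texttospeech Python clients. It is not the repository's actual code, and the prompt, file names, and the specific "Journey" voice name are assumptions.

```python
# Minimal two-step sketch of the described pipeline (not the repo's code):
# Gemini 1.5 Flash writes the script, Google Cloud TTS voices it.
import google.generativeai as genai
from google.cloud import texttospeech

# 1) Generate a short podcast script from a paper abstract with Gemini 1.5 Flash
genai.configure(api_key="YOUR_API_KEY")          # a free-tier key is enough for batch use
model = genai.GenerativeModel("gemini-1.5-flash")
abstract = open("paper_abstract.txt").read()
script = model.generate_content(
    "Write a two-host podcast script (~300 words) discussing this paper:\n" + abstract
).text

# 2) Synthesize the script with a "Journey"-style voice (voice name is an assumption)
tts = texttospeech.TextToSpeechClient()
audio = tts.synthesize_speech(
    input=texttospeech.SynthesisInput(text=script),
    voice=texttospeech.VoiceSelectionParams(
        language_code="en-US", name="en-US-Journey-F"
    ),
    audio_config=texttospeech.AudioConfig(
        audio_encoding=texttospeech.AudioEncoding.MP3
    ),
)
with open("episode.mp3", "wb") as f:
    f.write(audio.audio_content)
```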
"AI Paper Reviewer" is also an open source project. Anyone can use it to build and own a personal blog on any papers of your interests. Hence, checkout the project repository below if you are interested in! : https://github.com/deep-diver/paper-reviewer
The project will soon support other models, including open-weight ones, for both the text-based content generation and the voice synthesis of the podcast. The only reason I chose the Gemini model is that it offers a free tier, which is enough to shape this project with non-realtime batch generation. I'm excited to see how others will use this tool to explore the world of AI research, so feel free to share your feedback and suggestions!
I've published a new dataset to simplify model merging.
This dataset makes it easier to find compatible architectures for model merging with @arcee_ai's mergekit, streamlining the automation of high-performing merge searches.
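As a purely illustrative sketch of how such a dataset could be used: the dataset's actual name and schema are not given in this post, so the repo id and column names below are hypothetical stand-ins for a catalog of Hub models annotated with their architectures.

```python
# Hypothetical usage: filter a model catalog down to architecturally compatible
# merge candidates before handing them to mergekit.
from datasets import load_dataset

ds = load_dataset("username/model-merge-compatibility", split="train")  # hypothetical id

# mergekit merges require architecturally compatible checkpoints,
# e.g. all Mistral-7B derivatives.
target_arch = "MistralForCausalLM"
candidates = [row["model_id"] for row in ds if row["architecture"] == target_arch]
print(f"{len(candidates)} mergeable candidates with architecture {target_arch}")
```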
INTRODUCING the Hugging Face AutoTrain Client! Fine-tuning models just got even easier! Now you can fine-tune SOTA models on any compatible dataset-model pair from the Hugging Face Hub using Python, running on Hugging Face servers. Choose from a range of GPU flavors, millions of model and dataset pairs, and 10+ tasks.
To try it, install autotrain-advanced using pip. You can skip the dependencies by installing with --no-deps, but then you'll need to install some of them by hand.