Ethan KERDELHUE PRO

ethanker

AI & ML interests

French NLP

Recent Activity

liked a model 6 days ago
mistralai/Ministral-8B-Instruct-2410

Organizations

None yet

ethanker's activity

Reacted to singhsidhukuldeep's post with ❤️ 9 days ago
Post
1890
It's not every day you see the No. 1 ranked paper of the day open-sourcing a very powerful image editing app!

Fascinating to see MagicQuill - a groundbreaking interactive image editing system that makes precise photo editing effortless through advanced AI!

The system's architecture features three sophisticated components:

1. Editing Processor:
- Implements a dual-branch architecture integrated into a latent diffusion framework
- Utilizes PiDiNet for edge map extraction and content-aware per-pixel inpainting
- Features a specialized UNet architecture with zero-convolution layers for feature insertion
- Employs denoising score matching for training the control branch
- Processes both structural modifications via scribble guidance and color manipulation through downsampled color blocks
- Maintains pixel-level control through VAE-based latent space operations

2. Painting Assistor:
- Powered by a fine-tuned LLaVA multimodal LLM using Low-Rank Adaptation (LoRA)
- Trained on a custom dataset derived from Densely Captioned Images (DCI)
- Processes user brushstrokes through specialized Q&A tasks for add/subtract/color operations
- Features bounding box coordinate normalization for precise stroke localization
- Implements streamlined single-word/phrase outputs for real-time performance

3. Idea Collector:
- Built as a modular ReactJS component library
- Supports cross-platform deployment via HTTP protocols
- Compatible with Gradio and ComfyUI frameworks
- Features comprehensive layer management and parameter adjustment capabilities
- Implements real-time canvas updates and preview generation
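The "bounding box coordinate normalization" step listed under the Painting Assistor can be illustrated with a minimal sketch. This is not MagicQuill's actual code: the function names, the (x_min, y_min, x_max, y_max) box layout, and the choice of a [0, 1] range are all assumptions made for illustration.

```python
from typing import Iterable, Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def stroke_bounding_box(points: Iterable[Tuple[float, float]]) -> Box:
    """Return the tightest axis-aligned box around a brushstroke's pixel points."""
    pts = list(points)
    if not pts:
        raise ValueError("a stroke needs at least one point")
    xs = [x for x, _ in pts]
    ys = [y for _, y in pts]
    return (min(xs), min(ys), max(xs), max(ys))


def normalize_box(box: Box, width: int, height: int) -> Box:
    """Scale pixel coordinates into [0, 1] so the box is resolution-independent."""
    if width <= 0 or height <= 0:
        raise ValueError("image dimensions must be positive")

    def clamp(value: float) -> float:
        # Strokes that spill past the canvas are clipped to its border.
        return min(max(value, 0.0), 1.0)

    x0, y0, x1, y1 = box
    return (clamp(x0 / width), clamp(y0 / height),
            clamp(x1 / width), clamp(y1 / height))


# Hypothetical stroke drawn on a 512x384 canvas.
stroke = [(128, 96), (256, 192), (192, 240)]
print(normalize_box(stroke_bounding_box(stroke), width=512, height=384))
```

Normalized coordinates like these let a stroke's location be described to a language model independently of the image resolution, which is one plausible reason for the step.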

The system outperforms existing solutions like SmartEdit and BrushNet in edge alignment and color fidelity while maintaining seamless integration with popular AI frameworks.

What are your thoughts on AI-powered creative tools?
Reacted to chansung's post with 👍 9 days ago
Post
1633
๐ŸŽ™๏ธ Listen to the audio "Podcast" of every single Hugging Face Daily Papers.

The "AI Paper Reviewer" project can now automatically generate audio podcasts for any paper published on arXiv, and this is integrated into the GitHub Actions pipeline. It sounds pretty similar to #NotebookLM, in my opinion.

๐ŸŽ™๏ธ Try out yourself at https://deep-diver.github.io/ai-paper-reviewer/

The audio podcast is powered by Google technologies: 1) the Google DeepMind Gemini 1.5 Flash model generates the podcast script, then 2) Google Cloud Vertex AI's Text-to-Speech model synthesizes that script into natural-sounding voices (with the recent addition of the "Journey" voice style).

"AI Paper Reviewer" is also an open source project. Anyone can use it to build and own a personal blog covering any papers of interest. If you are interested, check out the project repository: https://github.com/deep-diver/paper-reviewer

This project will soon support other models, including open-weights ones, for both text-based content generation and voice synthesis for the podcast. The only reason I chose the Gemini model is that it offers a "free tier", which is enough to shape up this project with non-real-time batch generation. I'm excited to see how others will use this tool to explore the world of AI research, so feel free to share your feedback and suggestions!
  • 1 reply
Reacted to louisbrulenaudet's post with 🤗 12 days ago
Post
1587
I've published a new dataset to simplify model merging 🤗

This dataset facilitates the search for compatible architectures for model merging with @arcee_ai's mergekit, streamlining the automation of high-performance merge searches 📖

Dataset: louisbrulenaudet/mergekit-configs
Reacted to BlinkDL's post with 🔥🔥 12 days ago
Post
2827
RWKV-6-world-v3 (+3.1T tokens) is our best multilingual 7B model as of now: BlinkDL/rwkv-6-world

It's 100% RNN and attention-free. MMLU is 54.2% (the previous world-v2.1 scored 47.9%; note that this is without eval-boosting tricks such as annealing).

RWKV-7-world-v4 soon :)
Reacted to abhishek's post with 🔥👍❤️ 13 days ago
Post
5027
INTRODUCING Hugging Face AutoTrain Client 🔥
Fine-tuning models got even easier!!!!
Now you can fine-tune SOTA models on all compatible dataset-model pairs on the Hugging Face Hub using Python on Hugging Face servers. Choose from a number of GPU flavors, millions of model and dataset pairs, and 10+ tasks 🤗

To try it, install autotrain-advanced using pip. You can skip dependencies by installing with --no-deps, but then you would need to install some dependencies by hand.

"pip install autotrain-advanced"

Github repo: https://github.com/huggingface/autotrain-advanced
  • 6 replies
New activity in Qwen/Qwen2.5-Coder-32B-Instruct 13 days ago

This is a wonderful model

2
#6 opened 13 days ago by graelo