Oussema Harbi

Harbous

oharbi

AI & ML interests

None yet

Recent Activity

Reacted to csabakecskemeti's post with 👍 about 7 hours ago

The AMD Instinct MI50 (~$110) is surprisingly fast for inference Quantized models. This runs a Llama 3.1 8B Q8 with Llama.cpp https://huggingface.co/spaces/DevQuasar/Mi50 A little blogpost about the HW http://devquasar.com/uncategorized/amd-radeon-instinct-mi50-cheap-inference/

Reacted to freddyaboulton's post with 👍 4 days ago

Just created a cookbook of real time audio/video spaces created using Gradio and WebRTC ⚡️ Use this and the [docs](https://freddyaboulton.github.io/gradio-webrtc/) to get started building the next gen of AI apps! https://huggingface.co/collections/freddyaboulton/gradio-webrtc-cookbook-6758ba7745aeca7b1be7de0f

Reacted to qq8933's post with 👀 4 days ago

LLaMA-O1-PRM and LLaMA-O1-Reinforcement will release in this weekend. We have implemented a novel Reinforcement finetune(RFT) pipeline that taught models learning reasoning and reward labeling without human annotation.

View all activity

Organizations

None yet

Harbous's activity

reacted to csabakecskemeti's post with 👍 about 7 hours ago

Post

510

The AMD Instinct MI50 (~$110) is surprisingly fast for inference Quantized models.

This runs a Llama 3.1 8B Q8 with Llama.cpp
DevQuasar/Mi50

A little blogpost about the HW
http://devquasar.com/uncategorized/amd-radeon-instinct-mi50-cheap-inference/

reacted to freddyaboulton's post with 👍 4 days ago

Post

997

Just created a cookbook of real time audio/video spaces created using Gradio and WebRTC ⚡️

Use this and the [docs](https://freddyaboulton.github.io/gradio-webrtc/) to get started building the next gen of AI apps!

freddyaboulton/gradio-webrtc-cookbook-6758ba7745aeca7b1be7de0f

2 replies

reacted to qq8933's post with 👀 4 days ago

Post

2392

LLaMA-O1-PRM and LLaMA-O1-Reinforcement will release in this weekend.
We have implemented a novel Reinforcement finetune(RFT) pipeline that taught models learning reasoning and reward labeling without human annotation.

3 replies

reacted to etemiz's post with ➕ 7 days ago

Post

398

Apparently you can't count on centralized AI to perform similarly, some days great some days bad. They may be distilling or doing other things to dumb it down and make it cost effective. But you can count on open source LLMs that you run locally to perform same level, every day.

So you always have to watch centralized AI but you never have to watch the local LLM.

liked a model 16 days ago

MohamedRashad/arabic-large-nougat

Image-to-Text • Updated 17 days ago • 550 • 4

reacted to MohamedRashad's post with ❤️ 16 days ago

Post

1500

A while back i shared this model MohamedRashad/arabic-small-nougat that was a finetune from facebook/nougat-small for the Arabic Language.

Today this humble project has been scaled with new models, new datasets, new space, and a new paper

Check everything throught this collection here:
MohamedRashad/arabic-nougat-673a3f540bd92904c9b92a8e

1 reply

reacted to singhsidhukuldeep's post with ❤️ 29 days ago

Post

1904

It's not every day you see the No. 1 ranked paper of the day open-sourcing a very powerful image editing app!

Fascinating to see MagicQuill - a groundbreaking interactive image editing system that makes precise photo editing effortless through advanced AI!

The system's architecture features three sophisticated components:

1. Editing Processor:
- Implements a dual-branch architecture integrated into a latent diffusion framework
- Utilizes PiDiNet for edge map extraction and content-aware per-pixel inpainting
- Features a specialized UNet architecture with zero-convolution layers for feature insertion
- Employs denoising score matching for training the control branch
- Processes both structural modifications via scribble guidance and color manipulation through downsampled color blocks
- Maintains pixel-level control through VAE-based latent space operations

2. Painting Assistor:
- Powered by a fine-tuned LLaVA multimodal LLM using Low-Rank Adaptation (LoRA)
- Trained on a custom dataset derived from Densely Captioned Images (DCI)
- Processes user brushstrokes through specialized Q&A tasks for add/subtract/color operations
- Features bounding box coordinate normalization for precise stroke localization
- Implements streamlined single-word/phrase outputs for real-time performance

3. Idea Collector:
- Built as a modular ReactJS component library
- Supports cross-platform deployment via HTTP protocols
- Compatible with Gradio and ComfyUI frameworks
- Features comprehensive layer management and parameter adjustment capabilities
- Implements real-time canvas updates and preview generation

The system outperforms existing solutions like SmartEdit and BrushNet in edge alignment and color fidelity while maintaining seamless integration with popular AI frameworks.

What are your thoughts on AI-powered creative tools?

liked 4 models about 2 months ago