Shawon Ashraf

shawon

https://www.shawonashraf.com/

AI & ML interests

Multi-Modal NLP, LLM and RAG

Recent Activity

liked a dataset 6 days ago

HuggingFaceTB/finemath

liked a model 15 days ago

JeffreyXiang/TRELLIS-image-large

reacted to MohamedRashad's post with 🔥 15 days ago

For those Game Developers out there who wants a tool to generate them 3d assets of different game items. I built something for you 😅 https://huggingface.co/JeffreyXiang/TRELLIS-image-large + https://huggingface.co/Qwen/Qwen2.5-72B-Instruct + https://huggingface.co/Freepik/flux.1-lite-8B-alpha = https://huggingface.co/spaces/MohamedRashad/Game-Items-Generator Happy building 🎉

View all activity

Organizations

shawon's activity

liked a dataset 6 days ago

HuggingFaceTB/finemath

Viewer • Updated 7 days ago • 48.3M • 21.4k • 196

liked a model 15 days ago

JeffreyXiang/TRELLIS-image-large

Image-to-3D • Updated 24 days ago • 467k • 263

reacted to MohamedRashad's post with 🔥 15 days ago

Post

2468

For those Game Developers out there who wants a tool to generate them 3d assets of different game items. I built something for you 😅

JeffreyXiang/TRELLIS-image-large +
Qwen/Qwen2.5-72B-Instruct +
Freepik/flux.1-lite-8B-alpha =
MohamedRashad/Game-Items-Generator

Happy building 🎉

1 reply

liked 3 models 15 days ago

liked a model 20 days ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Nov 29 • 122k • • 1.46k

liked a model 24 days ago

mlx-community/paligemma2-10b-ft-docci-448-6bit

Image-Text-to-Text • Updated 24 days ago • 107 • 1

liked 3 datasets 24 days ago

HuggingFaceTB/smoltalk

Viewer • Updated Nov 26 • 2.2M • 10.8k • 261

O1-OPEN/OpenO1-SFT

Viewer • Updated 13 days ago • 77.7k • 2.24k • 289

alpindale/two-million-bluesky-posts

Viewer • Updated Nov 28 • 2.11M • 2.32k • 193

upvoted a collection 24 days ago

Llama 3.3

Collection

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated 24 days ago • 98

liked a model 24 days ago

lmstudio-community/Llama-3.3-70B-Instruct-GGUF

Text Generation • Updated 24 days ago • 41k • 35

reacted to gabrielchua's post with 👀 about 1 month ago

Post

1229

Sharing my first paper!

==
Large Language Models (LLMs) are powerful, but they're prone to off-topic misuse, where users push them beyond their intended scope. Think harmful prompts, jailbreaks, and misuse. So how do we build better guardrails?

Traditional guardrails rely on curated examples or classifiers. The problem?
⚠️ High false-positive rates
⚠️ Poor adaptability to new misuse types
⚠️ Require real-world data, which is often unavailable during pre-production

Our method skips the need for real-world misuse examples. Instead, we:
1️⃣ Define the problem space qualitatively
2️⃣ Use an LLM to generate synthetic misuse prompts
3️⃣ Train and test guardrails on this dataset

We apply this to the off-topic prompt detection problem, and fine-tune simple bi- and cross-encoder classifiers that outperform heuristics based on cosine similarity or prompt engineering.

Additionally, framing the problem as prompt relevance allows these fine-tuned classifiers to generalise to other risk categories (e.g., jailbreak, toxic prompts).

Through this work, we also open-source our dataset (2M examples, ~50M+ tokens) and models.

paper: A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection (2411.12946)

artifacts: govtech/off-topic-guardrail-673838a62e4c661f248e81a4

liked a Space about 1 month ago

Running on CPU Upgrade

✍

Argilla Space

liked a model about 1 month ago

google/gemma-2-instruct-9b-keras

Text Generation • Updated Nov 15 • 101 • 6

reacted to jsulz's post with 🔥 about 1 month ago

Post

2914

When the XetHub crew joined Hugging Face this fall, @erinys and I started brainstorming how to share our work to replace Git LFS on the Hub. Uploading and downloading large models and datasets takes precious time. That’s where our chunk-based approach comes in.

Instead of versioning files (like Git and Git LFS), we version variable-sized chunks of data. For the Hugging Face community, this means:

⏩ Only upload the chunks that changed.
🚀 Download just the updates, not the whole file.
🧠 We store your file as deduplicated chunks

In our benchmarks, we found that using CDC to store iterative model and dataset version led to transfer speedups of ~2x, but this isn’t just a performance boost. It’s a rethinking of how we manage models and datasets on the Hub.

We're planning on our new storage backend to the Hub in early 2025 - check out our blog to dive deeper, and let us know: how could this improve your workflows?

https://huggingface.co/blog/from-files-to-chunks

liked a dataset about 1 month ago

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1 • 1.05M • 12.5k • 405

liked a model about 2 months ago

turjo4nis/colbertv2.0-bn

Updated Nov 21 • 161 • 3

liked a Space about 2 months ago

Running on Zero

578

🖼

OmniGen

Image generator/identifier/reposer