Shawon Ashraf's picture

19 267

Shawon Ashraf

shawon

·

https://www.shawonashraf.com/

AI & ML interests

Multi-Modal NLP, LLM and RAG

Recent Activity

liked a model 6 days ago

JeffreyXiang/TRELLIS-image-large

reacted to MohamedRashad's post with 🔥 6 days ago

For those Game Developers out there who wants a tool to generate them 3d assets of different game items. I built something for you 😅 https://huggingface.co/JeffreyXiang/TRELLIS-image-large + https://huggingface.co/Qwen/Qwen2.5-72B-Instruct + https://huggingface.co/Freepik/flux.1-lite-8B-alpha = https://huggingface.co/spaces/MohamedRashad/Game-Items-Generator Happy building 🎉

liked a model 6 days ago

meta-llama/Llama-3.3-70B-Instruct

View all activity

Organizations

shawon's activity

upvoted a collection 15 days ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated 15 days ago • 87

upvoted a paper about 2 months ago

Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis

Paper • 2410.23320 • Published Oct 30 • 7

upvoted a collection about 2 months ago

LongVU

7 items • Updated Oct 31 • 27

upvoted a paper 2 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 167

upvoted a collection 3 months ago

Image / Video Gen

Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 30 items • Updated 8 days ago • 6

upvoted 5 papers 3 months ago

FreeInit: Bridging Initialization Gap in Video Diffusion Models

Paper • 2312.07537 • Published Dec 12, 2023 • 25

Image Copy Detection for Diffusion Models

Paper • 2409.19952 • Published Sep 30 • 12

Visual Question Decomposition on Multimodal Large Language Models

Paper • 2409.19339 • Published Sep 28 • 7

UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models

Paper • 2409.20551 • Published Sep 30 • 13

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Paper • 2311.05437 • Published Nov 9, 2023 • 48

upvoted 2 articles 3 months ago

Article

Data is better together

Mar 4

• 8

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25

• 180

upvoted 2 collections 3 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 15 days ago • 545

Flow-Judge-v0.1

Flow-Judge-v0.1 models • 5 items • Updated Sep 17 • 19

upvoted a collection 6 months ago

LLM Compiler

Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27 • 146

upvoted a paper 6 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10 • 65

upvoted an article 7 months ago

Article

Easily Train Models with H100 GPUs on NVIDIA DGX Cloud

Mar 18

• 7

upvoted a collection 8 months ago

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 89

upvoted a collection 9 months ago

DBRX

DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27 • 91