Mahmud ElHuseyni

MElHuseyni

AI & ML interests

Computer Vision, NLP, Machine Learning

Recent Activity

liked a model about 5 hours ago

HuggingFaceTB/SmolVLM-Instruct

liked a model about 16 hours ago

Qwen/QwQ-32B-Preview

liked a model about 23 hours ago

vidore/colsmolvlm-alpha

MElHuseyni's activity

upvoted a paper 1 day ago

TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation

Paper • 2401.14373 • Published Jan 25 • 11

upvoted an article 3 days ago

Multilabel Classification using Mistral-7B on a single GPU with quantization and LoRA

Article • Jan 22 • 13

upvoted a collection 6 days ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated 6 days ago • 56

upvoted a collection 9 days ago

OpenScholar_V1

The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated 7 days ago • 26

upvoted a collection 12 days ago

UltraVox Audio Language Model Release 🔊

3 items • Updated 13 days ago • 15

upvoted a paper 13 days ago

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1 • 108

upvoted a collection 13 days ago

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated 5 days ago • 74

upvoted a paper 14 days ago

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published 21 days ago • 35

upvoted a collection 15 days ago

SpeechT5

The SpeechT5 framework consists of a shared seq2seq and six modal-specific (speech/text) pre/post-nets that can address a few audio-related tasks. • 8 items • Updated Jul 11 • 22

upvoted 2 collections 16 days ago

Computer Vision Backbones 🧩

Collection of useful computer vision backbones to fine-tune. It also includes large image classification models, that can be used as backbone. • 22 items • Updated Sep 19, 2023 • 19

MIT Talk 31/10 Papers

14 items • Updated Oct 28 • 29

upvoted an article 16 days ago

Decoding Strategies in Large Language Models

Article • about 1 month ago • 38

upvoted 2 papers 16 days ago

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57

AudioPaLM: A Large Language Model That Can Speak and Listen

Paper • 2306.12925 • Published Jun 22, 2023 • 53