Daniel Bourke's picture

Daniel Bourke PRO

mrdbourke

·

https://www.mrdbourke.com

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Recent Activity

liked a model 3 days ago

knowledgator/gliclass-large-v1.0-init

liked a Space 3 days ago

Qwen/QVQ-72B-preview

liked a Space 11 days ago

stevengrove/YOLO-World

View all activity

Organizations

None yet

mrdbourke's activity

upvoted a collection 24 days ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated 3 days ago • 78

upvoted 2 papers about 1 month ago

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Paper • 2411.10438 • Published Nov 15, 2024 • 13

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21, 2024 • 43

upvoted an article about 2 months ago

Article

Visually Multilingual: Introducing mcdse-2b

By

•

Oct 27, 2024

• 37

upvoted 2 collections 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 12 days ago • 196

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 16 days ago • 96

upvoted a paper 2 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 126

upvoted a collection 2 months ago

Stable Diffusion 3.5

6 items • Updated Oct 29, 2024 • 118

upvoted 2 articles 3 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024

• 86

Article

Let's talk about LLM evaluation

By

•

May 23, 2024

• 143

upvoted 2 collections 3 months ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 21 days ago • 143

SigLIP

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 21 days ago • 48

upvoted a paper 3 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 87

upvoted 3 collections 3 months ago

Florence

9 items • Updated Jul 11, 2024 • 161

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Nov 27, 2024 • 290

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 28 days ago • 551

upvoted an article 3 months ago

Article

Document Similarity Search with ColPali

By

•

Sep 21, 2024

• 48

upvoted 2 articles 4 months ago

Article

Unleash ML Power on iOS: Apple Silicon Optimization Secrets

By

•

Jul 18, 2024

• 4

Article

Converting Models to Core ML

By

•

Sep 4, 2024

• 5

upvoted a collection 4 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated 28 days ago • 186