25 20 59

geronimo PRO

g-ronimo

AI & ML interests

fafo

Recent Activity

updated a model about 21 hours ago

g-ronimo/lama

liked a model 2 days ago

smartywu/big-lama

reacted to hexgrad's post with 🔥 5 days ago

Merry Christmas! 🎄 Open sourced a small TTS model at https://huggingface.co/hexgrad/Kokoro-82M

View all activity

Articles

SemScore: Evaluating LLMs with Semantic Similarity

Mar 9

• 12

Phinetuning 2.0

Jan 31

• 2

Organizations

g-ronimo's activity

upvoted a paper 8 days ago

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

Paper • 2412.16112 • Published 11 days ago • 19

upvoted 4 papers about 1 month ago

VisualLens: Personalization through Visual History

Paper • 2411.16034 • Published Nov 25 • 16

UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages

Paper • 2411.14343 • Published Nov 21 • 7

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Paper • 2411.07232 • Published Nov 11 • 63

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published Nov 12 • 27

upvoted 2 articles 3 months ago

Article

"Diffusers Image Fill" guide

•

Sep 13

• 42

Article

Extending Transformer layers as Painters to DiT's

•

Aug 31

• 9

upvoted a paper 8 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15 • 87

upvoted 2 articles 8 months ago

Article

Train custom AI models with the trainer API and adapt them to 🤗

•

Jun 29

• 33

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

•

Jun 23

• 34

upvoted a paper 8 months ago

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Paper • 2404.14047 • Published Apr 22 • 44

upvoted 3 articles 8 months ago

Article

seemore: Implement a Vision Language Model from Scratch

•

Jun 23

• 69

Article

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

•

Apr 18

• 22

Article

On Coding Your First Attention

•

Apr 21

• 7

upvoted 2 articles 9 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 170

Article

DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive

•

Apr 9

• 29

upvoted a paper 9 months ago

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30 • 41

upvoted a collection about 1 year ago

Journal Club

Collection

Candidate papers to read in the H4 journal club • 54 items • Updated Apr 21 • 28

upvoted a paper over 1 year ago

One Wide Feedforward is All You Need

Paper • 2309.01826 • Published Sep 4, 2023 • 31

geronimo PRO

AI & ML interests

Recent Activity

Articles

SemScore: Evaluating LLMs with Semantic Similarity

Phinetuning 2.0

Organizations

g-ronimo's activity

"Diffusers Image Fill" guide

Extending *Transformer layers as Painters* to DiT's

Train custom AI models with the trainer API and adapt them to 🤗

SeeMoE: Implementing a MoE Vision Language Model from Scratch

seemore: Implement a Vision Language Model from Scratch

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

On Coding Your First Attention

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive

Extending Transformer layers as Painters to DiT's