3 135 257

Anthonny Olime

Aviv-anthonnyolime

AI & ML interests

None yet

Recent Activity

liked a model about 22 hours ago

CohereForAI/c4ai-command-a-03-2025

liked a model about 22 hours ago

google/gemma-3-12b-it

liked a model about 22 hours ago

google/gemma-3-12b-pt

View all activity

Organizations

Aviv-anthonnyolime's activity

upvoted 3 papers 10 days ago

Self-Guided Diffusion Models

Paper • 2210.06462 • Published Oct 12, 2022 • 3

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 11 days ago • 72

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 79

upvoted 3 papers 14 days ago

upvoted an article 14 days ago

Article

DualPipe could be better without the Dual

•

15 days ago

• 15

upvoted an article 17 days ago

Article

SigLIP 2: A better multilingual vision language encoder

22 days ago

• 134

upvoted a paper 21 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 22 days ago • 129

upvoted 2 papers 22 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 23 days ago • 164

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Paper • 2502.04328 • Published Feb 6 • 29

upvoted 2 papers 24 days ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published about 1 month ago • 47

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 26 days ago • 142

upvoted 2 papers 25 days ago

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published about 1 month ago • 50

Large Language Diffusion Models

Paper • 2502.09992 • Published 28 days ago • 103

upvoted an article 28 days ago

Article

Fixing Open LLM Leaderboard with Math-Verify

29 days ago

• 27

upvoted an article about 1 month ago

Article

Open R1: Update #2

and 6 others •

Feb 10

• 203

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

upvoted a collection about 1 month ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 22 days ago • 246

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 808