Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published 6 days ago • 75
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published 27 days ago • 141
Article • wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR?? By catherinearnett • about 1 month ago • 33
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality Paper • 2405.21060 • Published May 31 • 63
Improving fine-grained understanding in image-text pre-training Paper • 2401.09865 • Published Jan 18 • 15
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Paper • 2401.09417 • Published Jan 17 • 58
Scalable Pre-training of Large Autoregressive Image Models Paper • 2401.08541 • Published Jan 16 • 35
Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM Paper • 2401.02994 • Published Jan 4 • 47
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training Paper • 2401.00849 • Published Jan 1 • 14
TinySAM: Pushing the Envelope for Efficient Segment Anything Model Paper • 2312.13789 • Published Dec 21, 2023 • 13
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 257
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks Paper • 2310.19909 • Published Oct 30, 2023 • 20