12 94 565

Reza Sayar PRO

Reza2kn

AI & ML interests

None yet

Recent Activity

liked a model about 10 hours ago

jinaai/jina-colbert-v2

liked a model about 10 hours ago

FacebookAI/xlm-roberta-large

liked a model about 11 hours ago

cis-lmu/glotlid

View all activity

Organizations

Reza2kn's activity

upvoted 4 papers about 17 hours ago

FastVLM: Efficient Vision Encoding for Vision Language Models

Paper • 2412.13303 • Published 4 days ago • 12

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 3 days ago • 86

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 2 days ago • 29

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 2 days ago • 255

upvoted 3 papers 2 days ago

upvoted 3 papers 4 days ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6 • 47

SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs

Paper • 2412.08347 • Published 10 days ago • 4

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

Paper • 2412.09626 • Published 9 days ago • 19

upvoted a collection 10 days ago

Gradio WebRTC Cookbook ⚡️

Collection

Collection of real-time voice and video demos built with gradio-webrtc custom component • 8 items • Updated 11 days ago • 8

upvoted 4 papers 11 days ago

Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement

Paper • 2412.04003 • Published 16 days ago • 9

SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance

Paper • 2412.02687 • Published 18 days ago • 109

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 15 days ago • 112

Towards Universal Soccer Video Understanding

Paper • 2412.01820 • Published 19 days ago • 9

upvoted 3 articles 11 days ago

Article

Releasing QwQ-LongCoT-130K

•

16 days ago

• 8

Article

Finding Moroccan Arabic (Darija) in Fineweb 2

•

13 days ago

• 19

Article

Power steering: Squeeze massive power from small LLMs

•

12 days ago

• 4

upvoted a paper 12 days ago

Large Language Model-Brained GUI Agents: A Survey

Paper • 2411.18279 • Published 24 days ago • 27

upvoted a collection 12 days ago

ChatGen

Collection

ChatGen series models • 7 items • Updated 22 days ago • 2