Sugato Ray

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

upvoted a collection about 18 hours ago

updated a collection about 18 hours ago

updated a collection about 18 hours ago

LLMs-EmbeddingModels

sugatoray's activity

upvoted a collection about 18 hours ago

SwiftKV Models

SwiftKV reduces prefill compute by up to 50% by combining model rewiring and knowledge-preserving self-distillation. • 3 items • Updated 29 days ago • 3

upvoted a paper about 19 hours ago

Xmodel-2 Technical Report

Paper • 2412.19638 • Published 7 days ago • 13

upvoted an article 1 day ago

Fine-tune ModernBERT for text classification using synthetic data

Article • 3 days ago • 16

upvoted a paper 3 days ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published 18 days ago • 43

upvoted a paper 6 days ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published 10 days ago • 31

upvoted a paper 7 days ago

GUI Agents: A Survey

Paper • 2412.13501 • Published 16 days ago • 23

upvoted a collection 7 days ago

DeepSeek-V3

2 items • Updated 8 days ago • 89

upvoted a paper 8 days ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published 9 days ago • 40

upvoted a collection 9 days ago

QVQ-72B-Preview

5 items • Updated 9 days ago • 5

upvoted an article 10 days ago

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Article • Jul 29, 2024 • 260

upvoted 2 papers 10 days ago

MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design

Paper • 2412.14590 • Published 15 days ago • 13

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Paper • 2412.12094 • Published 17 days ago • 10

upvoted a collection 10 days ago

QwQ

Qwen with Questions • 2 items • Updated Nov 28, 2024 • 52

upvoted a paper 10 days ago

Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents

Paper • 2412.13194 • Published 16 days ago • 12

upvoted 2 papers 12 days ago

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published 16 days ago • 91

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published 17 days ago • 6

upvoted a paper 13 days ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 15 days ago • 48

upvoted a paper 14 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 16 days ago • 116

upvoted 2 collections 14 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 14 days ago • 111

OmniEval

An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain • 7 items • Updated about 19 hours ago • 2