Gurumurthi V Ramanan's picture

122 410

Gurumurthi V Ramanan

GVR

·

https://surasys.co

AI & ML interests

ML

Recent Activity

liked a model 10 days ago

Qwen/QVQ-72B-Preview

upvoted a collection 11 days ago

InternVL2.5-MPO

upvoted a paper 11 days ago

Qwen2.5 Technical Report

View all activity

Organizations

GVR's activity

upvoted a collection 11 days ago

InternVL2.5-MPO

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 3 days ago • 23

upvoted a paper 11 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 15 days ago • 334

upvoted a collection 18 days ago

DeepSeek-VL2

4 items • Updated 16 days ago • 34

upvoted 2 collections 27 days ago

Llama 3.3

5 items • Updated 28 days ago • 5

Llama 3.3 (All Versions)

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 10 days ago • 26

upvoted an article 27 days ago

Article

Use Models from the Hugging Face Hub in LM Studio

By

•

Nov 28, 2024

• 127

upvoted a collection about 1 month ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated Nov 27, 2024 • 31

upvoted a paper about 1 month ago

Multi-Granularity Prediction for Scene Text Recognition

Paper • 2209.03592 • Published Sep 8, 2022 • 2

upvoted a collection about 1 month ago

OpenScholar_V1

The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 31

upvoted an article about 1 month ago

Article

Halo: Open Source Health Tracking with Wearables

By

•

Nov 19, 2024

• 99

upvoted an article about 2 months ago

Article

ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models

By

•

Oct 18, 2024

• 16

upvoted a collection about 2 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 258

upvoted an article 2 months ago

Article

Decoding Strategies in Large Language Models

By

•

Oct 29, 2024

• 38

upvoted 2 collections 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 12 days ago • 196

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101

upvoted 2 papers 2 months ago

Unbounded: A Generative Infinite Game of Character Life Simulation

Paper • 2410.18975 • Published Oct 24, 2024 • 35

OmniParser for Pure Vision Based GUI Agent

Paper • 2408.00203 • Published Aug 1, 2024 • 24

upvoted an article 2 months ago

Article

OCR Processing and Text in Image Analysis with Florence-2-base and Qwen2-VL-2B

By

•

Oct 18, 2024

• 13

upvoted 2 collections 2 months ago

LayerSkip

Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated Nov 21, 2024 • 46

DocLayout-YOLO

Dataset and model for DocLayout-YOLO • 9 items • Updated Oct 22, 2024 • 12