Dhruvajyoti Sarma

dhruva-sarma

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

updated a collection 3 days ago

Computer vision

updated a collection 15 days ago

Computer vision

Organizations

None yet

dhruva-sarma's activity

upvoted a paper 3 days ago

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published 10 days ago • 59

upvoted 5 papers 15 days ago

Reliable Tuberculosis Detection using Chest X-ray with Deep Learning, Segmentation and Visualization

Paper • 2007.14895 • Published Jul 29, 2020 • 1

Few-Shot Learning Approach on Tuberculosis Classification Based on Chest X-Ray Images

Paper • 2409.11644 • Published Sep 18, 2024 • 1

upvoted a paper 26 days ago

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Paper • 2412.02592 • Published Dec 3, 2024 • 20

upvoted an article about 1 month ago

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Article • Nov 21, 2024 • 35

upvoted an article 3 months ago

How to build a custom text classifier without days of human labeling

Article • Oct 17, 2024 • 55

upvoted 2 papers 3 months ago

A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond

Paper • 2410.02362 • Published Oct 3, 2024 • 17

Not All LLM Reasoners Are Created Equal

Paper • 2410.01748 • Published Oct 2, 2024 • 28

upvoted 3 papers 4 months ago

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Paper • 2409.06703 • Published Sep 10, 2024 • 2

GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering

Paper • 2409.06595 • Published Sep 10, 2024 • 37

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10, 2024 • 55

upvoted a collection 4 months ago

Parler-TTS: fully open-source high-quality TTS

If you want to find out more about how these models were trained, or even fine-tune them yourself, check out the Parler-TTS repository on GitHub.

Collection • 8 items • Updated Dec 2, 2024 • 49

upvoted 4 papers 4 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 71

Generative Verifiers: Reward Modeling as Next-Token Prediction

Paper • 2408.15240 • Published Aug 27, 2024 • 13

Text2SQL is Not Enough: Unifying AI and Databases with TAG

Paper • 2408.14717 • Published Aug 27, 2024 • 24

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 124

upvoted a paper 6 months ago

Scaling Retrieval-Based Language Models with a Trillion-Token Datastore

Paper • 2407.12854 • Published Jul 9, 2024 • 29