Looks great! I am currently working on simplifying the training/fine-tuning of multimodal LLMs in Torch: https://github.com/ritabratamaiti/AnyModal

The current demos in AnyModal are for visual + text tasks. We plan to add demos for other modalities, such as audio, in the future. Our goal is to make it easy for anyone to create multimodal LLMs from any input-modality tokenizer + LLM combination (hence the name AnyModal)!
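For readers unfamiliar with the pattern, the usual recipe behind such encoder + LLM combinations is: encode the non-text input with a modality-specific encoder, project the features into the LLM's token-embedding space, and prepend the projected tokens to the text embeddings. Below is a minimal PyTorch sketch of that general pattern; the class and argument names are hypothetical, and this is not AnyModal's actual API.

```python
# Minimal sketch of the encoder -> projector -> LLM pattern.
# Hypothetical names; not AnyModal's actual API.
import torch
import torch.nn as nn

class MultimodalLM(nn.Module):
    def __init__(self, encoder, llm, encoder_dim, llm_dim):
        super().__init__()
        self.encoder = encoder  # e.g. a frozen vision or audio encoder
        self.llm = llm          # a causal LM that accepts inputs_embeds
        # Linear projector mapping encoder features into the LLM embedding space
        self.projector = nn.Linear(encoder_dim, llm_dim)

    def forward(self, modality_input, input_ids):
        # Encode the non-text input; assumed shape: (batch, n_tokens, encoder_dim)
        features = self.encoder(modality_input)
        # Project into the LLM's embedding space: (batch, n_tokens, llm_dim)
        projected = self.projector(features)
        # Embed the text tokens and prepend the projected modality tokens
        text_embeds = self.llm.get_input_embeddings()(input_ids)
        inputs_embeds = torch.cat([projected, text_embeds], dim=1)
        return self.llm(inputs_embeds=inputs_embeds)

# Usage sketch (assumes a Hugging Face causal LM and any encoder returning
# features of shape (batch, n_tokens, encoder_dim)):
# llm = AutoModelForCausalLM.from_pretrained("gpt2")
# model = MultimodalLM(vision_encoder, llm, encoder_dim=768, llm_dim=768)
```

A common design choice with this setup is to freeze both the encoder and the LLM and train only the projector, which keeps fine-tuning cheap.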