AIMv2 Collection A collection of AIMv2 vision encoders that support a range of resolutions, native-resolution inputs, and a distilled checkpoint. • 19 items • Updated about 9 hours ago • 6
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state-of-the-art open post-training recipes. (See the dataset-loading sketch after this list.) • 32 items • Updated about 15 hours ago • 13
Tulu 3 Models Collection All models released with Tulu 3 -- state-of-the-art open post-training recipes. • 7 items • Updated about 15 hours ago • 10
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices Paper • 2411.10640 • Published 6 days ago • 38
LLM2CLIP Collection LLM2CLIP pushes SOTA pretrained CLIP models even further. • 7 items • Updated 3 days ago • 37
Daily Picks in Interpretability & Analysis of LMs Collection Outstanding research in interpretability and evaluation of language models, summarized. • 83 items • Updated about 8 hours ago • 91
AgentInstruct: Toward Generative Teaching with Agentic Flows Paper • 2407.03502 • Published Jul 3 • 48
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5. (See the generation sketch after this list.) • 40 items • Updated 4 days ago • 227
Ichigo v0.3 Collection An experimental model family designed to train LLMs to understand sound natively. • 6 items • Updated 11 days ago • 17
llama.vim Collection Recommended models for the llama.vim plugin. • 3 items • Updated 4 days ago • 3
Article Recipe: Preparing Multilingual Speech Datasets for TTS Training • By PHBJT • 18 days ago • 14
AMD-OLMo Collection AMD-OLMo is a series of 1-billion-parameter language models trained by AMD on AMD Instinct™ MI250 GPUs, based on OLMo. • 4 items • Updated 21 days ago • 16
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis Paper • 2410.23320 • Published 23 days ago • 6
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 15 days ago • 95
LayerSkip Collection Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 (see the self-speculative decoding sketch after this list). • 8 items • Updated about 18 hours ago • 43
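For the Tulu 3 Datasets collection above, a minimal loading sketch using the `datasets` library. The repo id and the `messages` column name are assumptions based on the collection's naming; check the collection page for the exact entries.

```python
from datasets import load_dataset

# Repo id is an assumption -- see the Tulu 3 Datasets collection for the
# exact dataset names.
ds = load_dataset("allenai/tulu-3-sft-mixture", split="train")

# Post-training mixtures are typically chat-formatted; the "messages"
# column name is likewise an assumption.
print(ds[0]["messages"])
```

For the Qwen2.5-Coder collection, a sketch of chat-style code generation with `transformers`. The checkpoint id is an assumption; substitute whichever size you pull from the collection.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Checkpoint id is an assumption -- any instruct model from the
# Qwen2.5-Coder collection should work the same way.
model_id = "Qwen/Qwen2.5-Coder-7B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Ask the code model for a small function via the chat template.
messages = [{"role": "user", "content": "Write a Python function that checks whether a string is a palindrome."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens.
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```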
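Finally, for the LayerSkip collection, a sketch of the early-exit self-speculative decoding the paper describes: draft tokens from an early layer, then verify them with the full model. The checkpoint id, the exit layer, and the `assistant_early_exit` argument (exposed in recent `transformers` releases) are all assumptions; without that argument the checkpoint still loads as an ordinary causal LM.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Checkpoint id is an assumption based on the collection's naming scheme.
model_id = "facebook/layerskip-llama2-7B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

inputs = tokenizer("Early-exit decoding works by", return_tensors="pt").to(model.device)

# Self-speculate: draft at layer 4, verify with the full forward pass.
# `assistant_early_exit` requires a recent transformers release; drop it
# to fall back to plain autoregressive generation.
output = model.generate(**inputs, max_new_tokens=64, assistant_early_exit=4)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```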
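Because drafting and verification share one set of weights, this scheme avoids loading a separate draft model, which is the design choice that makes LayerSkip attractive for memory-constrained inference.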