Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
681
2
41
Sanchit Gandhi
sanchit-gandhi
Follow
qsllxl's profile picture
meowmeowcatskill's profile picture
vilassninawe's profile picture
387 followers
·
13 following
sanchitgandhi99
sanchit-gandhi
AI & ML interests
Open-Source Speech
Articles
TTS Arena: Benchmarking Text-to-Speech Models in the Wild
Feb 27
•
19
Speculative Decoding for 2x Faster Whisper Inference
Dec 20, 2023
•
10
AudioLDM 2, but faster ⚡️
Aug 30, 2023
•
1
A Complete Guide to Audio Datasets
Dec 15, 2022
•
5
Fine-Tune Whisper with 🤗 Transformers
Nov 3, 2022
•
34
Organizations
sanchit-gandhi
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
facebook/wav2vec2-large-960h-lv60-self
11 days ago
facing issues while using access token of the following model facebook/wav2vec2-large-960h-lv60-self
1
#8 opened 15 days ago by
Webster9
New activity in
openai/whisper-large-v3
11 days ago
KeyError: 'whisper'
1
#116 opened 11 days ago by
aiyaqingzheng
New activity in
parler-tts/parler-tts-mini-expresso
11 days ago
What to use for [train] ? pip install -e .[train]
2
#2 opened 12 days ago by
Kimsui
New activity in
openai/whisper-large-v3
16 days ago
how to transcribe hundreds of local audio files once?
1
#114 opened 16 days ago by
myspace-ai
New activity in
sweet-dreambooths/musicgen-songstarter-v0.2-hf
26 days ago
Upload processor
#2 opened 26 days ago by
sanchit-gandhi
Upload MusicgenMelodyForConditionalGeneration
#1 opened 26 days ago by
sanchit-gandhi
New activity in
facebook/voxpopuli
about 1 month ago
Error loading dataset
2
#9 opened about 1 month ago by
jorgetebl
New activity in
LIUM/tedlium
about 1 month ago
FileNotFoundError when loading the LIUM/tedlium data on Windows
4
#4 opened 2 months ago by
wondav
New activity in
sanchit-gandhi/musicgen-streaming
about 1 month ago
Song doesn't appear to play (regardless of any browser)
3
#5 opened about 2 months ago by
Nothsa
New activity in
openai/whisper-large-v3
about 1 month ago
How to get accuracy of transcription from the model?
5
#98 opened 2 months ago by
Atulad
How we can use this model to achieve a real-time trans?
4
#99 opened about 2 months ago by
Von-violet
New activity in
parler-tts/parler_tts_mini
about 2 months ago
Fixed . on a different line.
1
#2 opened about 2 months ago by
blaise-tk
minor ui fix
1
#4 opened about 2 months ago by
mrfakename
New activity in
parler-tts/parler_tts_mini_v0.1
about 2 months ago
Inference speed
6
#2 opened about 2 months ago by
andreasrath
Link model to the training datasets in metadata
1
#3 opened about 2 months ago by
julien-c
Add training datasets to metadata
1
#5 opened about 2 months ago by
sanchit-gandhi
Update README.md
#4 opened about 2 months ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
about 2 months ago
Update alignment heads in gen config
#3 opened about 2 months ago by
sanchit-gandhi
New activity in
facebook/voxpopuli
about 2 months ago
LICENSE question
2
#8 opened 2 months ago by
phoneme
New activity in
sanchit-gandhi/musicgen-streaming
2 months ago
Streaming doesn't work yet with gradio 4.0
#4 opened 2 months ago by
ylacombe
New activity in
distil-whisper/distil-large-v3
2 months ago
about multiple languages?
2
#2 opened 2 months ago by
obtion
New activity in
sanchit-gandhi/whisper-small-hi
2 months ago
Adding `safetensors` variant of this model
#17 opened 7 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-lv-60-espeak-cv-ft
2 months ago
Adding `safetensors` variant of this model
1
#4 opened 7 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-large-xlsr-53
2 months ago
Adding `safetensors` variant of this model
1
#3 opened 3 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-base
2 months ago
Adding `safetensors` variant of this model
1
#2 opened 5 months ago by
SFconvertbot
New activity in
distil-whisper/distil-large-v3-ct2
2 months ago
Update README.md
3
#2 opened 2 months ago by
muhtasham
New activity in
distil-whisper/distil-large-v3-ggml
2 months ago
is it fp16?
3
#1 opened 2 months ago by
supercharge19
New activity in
distil-whisper/distil-medium.en
2 months ago
Just can't run!
3
#14 opened 3 months ago by
awesomeandy
New activity in
distil-whisper/distil-large-v3-ct2
2 months ago
Update alignment heads
#1 opened 2 months ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
2 months ago
How to do multilingual transcription?
3
#1 opened 2 months ago by
emraza110
New activity in
facebook/mms-tts-tao
3 months ago
Reference of the Dataset
1
#1 opened 3 months ago by
ChiaLingWeng
New activity in
openai/whisper-large-v3
3 months ago
How to save the loss value for each step during the training process?
2
#91 opened 3 months ago by
zhouwen999
New activity in
hf-audio/open_asr_leaderboard
3 months ago
[Average WER Calculation] Drop Common Voice WER.
4
#14 opened 3 months ago by
reach-vb
New activity in
openai/whisper-large-v3
3 months ago
Transcript an Spanish audio
4
#86 opened 3 months ago by
Andrews99
New activity in
sanchit-gandhi/whisper-medium-fleurs-lang-id
3 months ago
How do you fine tune Whisper for classification task rather than transcription?
6
#1 opened about 1 year ago by
nkburns
New activity in
openai/whisper-large-v2
3 months ago
Add missing merge to tokenizer
#100 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-large
3 months ago
Add missing merge to tokenizer
#50 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-medium
3 months ago
Add missing merge to tokenizer
#36 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-small
3 months ago
Add missing merge to tokenizer
#38 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-tiny
3 months ago
Add missing merge to tokenizer
#40 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-base
3 months ago
Upload tokenizer
2
#28 opened 6 months ago by
ArthurZ
New activity in
sanchit-gandhi/large-v3-32-2-conditioned-prompt-logic-timestamped-resumed-pt
3 months ago
Update generation_config.json
#2 opened 3 months ago by
sanchit-gandhi
Update generation_config.json
#1 opened 3 months ago by
sanchit-gandhi
New activity in
facebook/s2t-wav2vec2-large-en-de
3 months ago
Updates incorrect tokenizer configuration file
1
#3 opened 3 months ago by
lysandre
New activity in
kakao-enterprise/vits-vctk
3 months ago
List of all available speakers?
2
#2 opened 3 months ago by
Nikerino
New activity in
facebook/mms-tts-eng
3 months ago
What kind of dataset was used?
1
#8 opened 3 months ago by
f0rGoTTen000
New activity in
distil-whisper/whisper-vs-distil-whisper
3 months ago
Distil version does a bad job at Transcribing
3
#2 opened 3 months ago by
arslankas
New activity in
google/gemma-7b-it
3 months ago
error model.generate()
14
#13 opened 3 months ago by
NickyNicky
New activity in
facebook/musicgen-melody
3 months ago
Upload MusicgenMelodyForConditionalGeneration
#8 opened 3 months ago by
ylacombe
Upload processor
#9 opened 3 months ago by
ylacombe
New activity in
facebook/musicgen-stereo-melody
3 months ago
Upload MusicgenMelodyForConditionalGeneration
#2 opened 3 months ago by
ylacombe
Upload processor
#3 opened 3 months ago by
ylacombe
New activity in
facebook/musicgen-stereo-melody-large
3 months ago
Upload MusicgenMelodyForConditionalGeneration
#2 opened 3 months ago by
ylacombe
Upload processor
#3 opened 3 months ago by
ylacombe
New activity in
facebook/musicgen-melody-large
3 months ago
Upload MusicgenMelodyForConditionalGeneration
#3 opened 3 months ago by
ylacombe
Upload processor
1
#4 opened 3 months ago by
ylacombe
New activity in
google/gemma-7b
3 months ago
Upload FlaxGemmaForCausalLM
1
#3 opened 3 months ago by
pcuenq
New activity in
facebook/mms-tts-tam
3 months ago
AttributeError
1
#1 opened 3 months ago by
murthy1998
Fix code examples for transformers
#2 opened 3 months ago by
sanchit-gandhi
New activity in
hf-audio/open_asr_leaderboard
4 months ago
Smaller model sizes lead to worse RTF on Whisper
2
#8 opened 5 months ago by
lorenzopark
Load more