Update README.md
README.md
CHANGED
@@ -85,7 +85,7 @@ model-index:
 
 [The model is still training, we will be releasing the latest checkpoints soon...]
 
-SpeechLLM is a multi-modal LLM trained to predict the metadata of the speaker's turn in a conversation. speechllm-2B model is based on HubertX
+SpeechLLM is a multi-modal LLM trained to predict the metadata of the speaker's turn in a conversation. speechllm-2B model is based on HubertX audio encoder and TinyLlama LLM. The model predicts the following:
 1. **SpeechActivity** : if the audio signal contains speech (True/False)
 2. **Transcript** : ASR transcript of the audio
 3. **Gender** of the speaker (Female/Male)
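For context beyond the diff itself, here is a minimal usage sketch of the metadata the README describes, assuming the checkpoint is published as a Hugging Face custom-code model. The repository id `skit-ai/speechllm-2B` and the `generate_meta` helper are assumptions for illustration and are not confirmed by this change; only the three predicted fields come from the README text.

```python
# Illustrative only: the repository id and generate_meta() are assumptions,
# not confirmed by this README diff. The output fields mirror the list above.
from transformers import AutoModel

model = AutoModel.from_pretrained(
    "skit-ai/speechllm-2B",   # assumed repository id
    trust_remote_code=True,   # custom audio-encoder + LLM code would live in the repo
)

# Hypothetical inference call on a mono WAV file.
metadata = model.generate_meta(audio_path="sample.wav")

# Expected shape of the prediction, per the README:
# {"SpeechActivity": "True", "Transcript": "...", "Gender": "Female"}
print(metadata)
```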