MERaLiON
/

MERaLiON-AudioLLM-Whisper-SEA-LION

Automatic Speech Recognition

Model card Files Files and versions Community

hyx_194 commited on Dec 9, 2024

Commit

ee86192

·

1 Parent(s): ba814b7

update captilization

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ For more details, please refer to our [report]().
 ## Model Description
-MERaLiON-AudioLLM is designed to take in an **audio-text pair** as input and generates a text output.
 The architecture comprises three key components: an **audio encoder** that transforms speech or audio inputs into sequences of vector representations, a **text decoder** that interprets and responds to natural language instructions, and an **adaptor module** that compresses the encoder representations while aligning the encoder’s hidden dimension with the text decoder’s embedding size.

 ## Model Description
+MERaLiON-AudioLLM is designed to take in an **audio-text pair** as input and generates a **text output**.
 The architecture comprises three key components: an **audio encoder** that transforms speech or audio inputs into sequences of vector representations, a **text decoder** that interprets and responds to natural language instructions, and an **adaptor module** that compresses the encoder representations while aligning the encoder’s hidden dimension with the text decoder’s embedding size.