hyx_194 committed on
Commit ee86192 · 1 Parent(s): ba814b7
update captilization
README.md CHANGED
@@ -20,7 +20,7 @@ For more details, please refer to our [report]().
 
 ## Model Description
 
-MERaLiON-AudioLLM is designed to take in an **audio-text pair** as input and generates a text output
+MERaLiON-AudioLLM is designed to take in an **audio-text pair** as input and generates a **text output**.
 
 The architecture comprises three key components: an **audio encoder** that transforms speech or audio inputs into sequences of vector representations, a **text decoder** that interprets and responds to natural language instructions, and an **adaptor module** that compresses the encoder representations while aligning the encoder’s hidden dimension with the text decoder’s embedding size.
 
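The context line in this hunk says the adaptor module both compresses the encoder output and matches its width to the text decoder's embedding size. A minimal sketch of one way to do both is below. The README does not say how the compression is done, so the frame-stacking approach, the `make_adaptor` helper, the `stack` parameter, and all dimensions are assumptions for illustration only, not the model's actual implementation.

```python
import random


def make_adaptor(enc_dim, dec_dim, stack=2, seed=0):
    """Return a function mapping encoder frames to decoder-sized vectors.

    Hypothetical design: concatenate `stack` consecutive frames (compression),
    then apply one linear projection (dimension alignment).
    """
    rng = random.Random(seed)
    in_dim = enc_dim * stack
    # Random weights stand in for trained parameters.
    weight = [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)]
              for _ in range(dec_dim)]

    def adaptor(frames):
        # Drop trailing frames that do not fill a complete group.
        usable = len(frames) - len(frames) % stack
        out = []
        for i in range(0, usable, stack):
            # Concatenate `stack` neighbouring frames into one long vector.
            stacked = [x for frame in frames[i:i + stack] for x in frame]
            # Project the stacked vector to the decoder embedding size.
            out.append([sum(w * x for w, x in zip(row, stacked))
                        for row in weight])
        return out

    return adaptor


# Toy sizes, not the model's real dimensions.
adaptor = make_adaptor(enc_dim=4, dec_dim=6, stack=2)
encoder_out = [[float(t + d) for d in range(4)] for t in range(10)]  # 10 frames, width 4
decoder_in = adaptor(encoder_out)
print(len(decoder_in), len(decoder_in[0]))  # prints "5 6"
```

With `stack=2`, ten encoder frames of width 4 become five vectors of width 6: the sequence is halved and each vector now has the (toy) decoder embedding size.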