---
language:
- en
license: apache-2.0
tags:
- sound language model
---

## Model Details

We have developed and released the [llama3-s](https://huggingface.co/collections/homebrewltd/llama3-s-669df2139f0576abc6eb7405) model family. Models in this family natively understand both audio and text input.

We continue to extend [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) with sound understanding capabilities.

This is the initial checkpoint, with average-weight initialization applied only to the newly added vocabulary.
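
As a rough illustration of the initialization described above, the sketch below assumes that "average weight initialization" means each newly added token receives an embedding row equal to the mean of all existing rows, while existing rows stay untouched. The function name `extend_with_mean_rows` is hypothetical, and plain Python lists stand in for real model tensors; this is not the script used to produce the checkpoint.

```python
from typing import List

Matrix = List[List[float]]


def extend_with_mean_rows(embeddings: Matrix, num_new_tokens: int) -> Matrix:
    """Append `num_new_tokens` rows, each equal to the mean of the existing rows.

    Assumption: this mirrors the "average weight initialization" mentioned in
    the model card; the real procedure may differ in detail.
    """
    if not embeddings:
        raise ValueError("embeddings must contain at least one row")
    if num_new_tokens < 0:
        raise ValueError("num_new_tokens must be non-negative")

    vocab_size = len(embeddings)
    dim = len(embeddings[0])

    # Column-wise mean over the existing vocabulary: one "average token" vector.
    mean_row = [sum(row[j] for row in embeddings) / vocab_size for j in range(dim)]

    # Existing rows are copied unchanged; only the appended rows are new.
    old_rows = [list(row) for row in embeddings]
    new_rows = [list(mean_row) for _ in range(num_new_tokens)]
    return old_rows + new_rows


# Toy example: 4 existing tokens with 3-dimensional embeddings, 2 new sound tokens.
old = [[0.0, 1.0, 2.0], [3.0, 4.0, 5.0], [6.0, 7.0, 8.0], [9.0, 10.0, 11.0]]
extended = extend_with_mean_rows(old, num_new_tokens=2)
print(len(extended))   # 6
print(extended[-1])    # [4.5, 5.5, 6.5]
```

Starting new rows at the mean places the new tokens inside the region of embedding space the pretrained model already uses, which is a common reason to prefer it over random initialization.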

**Model developers** Homebrew Research.

**Input** Text and sound.

**Output** Text.

**Model Architecture** Llama-3.

**Language(s):** English.

## Intended Use

**Intended Use Cases** This family is primarily intended for research applications. This version aims to further improve the LLM's sound understanding capabilities.

**Out-of-scope** The use of Jan-Llama3-Sound in any manner that violates applicable laws or regulations is strictly prohibited.

## Citation Information

**BibTeX:**

```
@article{Llama3-S-Sound-Instruction-Language-Model-2024,
  title={Llama3-S},
  author={Homebrew Research},
  year={2024},
  month={July},
  url={https://huggingface.co/jan-hq/Jan-Llama3-0719}
}
```

## Acknowledgement

- **[WhisperSpeech](https://github.com/collabora/WhisperSpeech)**
- **[Encodec](https://github.com/facebookresearch/encodec)**
- **[Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)**