File size: 1,474 Bytes
a5dffd6
d9879f9
 
 
 
 
a5dffd6
 
 
 
31f527a
a5dffd6
d9879f9
a5dffd6
a247504
a5dffd6
 
d9879f9
a5dffd6
d9879f9
a5dffd6
d9879f9
a5dffd6
d9879f9
a5dffd6
d9879f9
a5dffd6
d9879f9
a5dffd6
d9879f9
a5dffd6
d9879f9
a5dffd6
d9879f9
a5dffd6
 
 
d9879f9
31f527a
 
d9879f9
 
 
31f527a
d9879f9
a5dffd6
d9879f9
a5dffd6
d9879f9
a5dffd6
d9879f9
a5dffd6
d9879f9
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
---
language:
- en
license: apache-2.0
tags:
- sound language model
---

## Model Details

We have developed and released the family [llama3-s](https://huggingface.co/collections/homebrewltd/llama3-s-669df2139f0576abc6eb7405). This family is natively understanding audio and text input.

We continue to expand [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) with sound understanding capabilities.

This is the initial checkpoint with average weight initialization applied only to new vocabulary.


**Model developers** Homebrew Research.

**Input** Text and sound.

**Output** Text.

**Model Architecture** Llama-3.

**Language(s):** English.

## Intended Use

**Intended Use Cases** This family is primarily intended for research applications. This version aims to further improve the LLM on sound understanding capabilities.

**Out-of-scope** The use of Jan-Llama3-Sound in any manner that violates applicable laws or regulations is strictly prohibited.

## Citation Information

**BibTeX:**

```
@article{Llama3-S: Sound Instruction Language Model 2024,
  title={Llama3-S},
  author={Homebrew Research},
  year=2024,
  month=July},
  url={https://huggingface.co/jan-hq/Jan-Llama3-0719}
```

## Acknowledgement

- **[WhisperSpeech](https://github.com/collabora/WhisperSpeech)**

- **[Encodec](https://github.com/facebookresearch/encodec)**

- **[Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)**