pmolchanov
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -82,7 +82,7 @@ docker run --gpus all -v /home/$USER:/home/$USER -it ghcr.io/tilmto/hymba:v1 bas
|
|
82 |
### Step 2: Chat with Hymba-1.5B-Base
|
83 |
After setting up the environment, you can use the following script to chat with our Model
|
84 |
|
85 |
-
```
|
86 |
from transformers import LlamaTokenizer, AutoModelForCausalLM, AutoTokenizer, AutoModel
|
87 |
import torch
|
88 |
|
@@ -117,11 +117,12 @@ Please report security vulnerabilities or NVIDIA AI Concerns [here](https://www.
|
|
117 |
|
118 |
## Citation
|
119 |
```
|
120 |
-
@
|
121 |
-
title={A Hybrid-head Architecture for Small Language Models},
|
122 |
-
author={Xin Dong and Yonggan Fu and Shizhe Diao and Wonmin Byeon and Zijia Chen and Ameya Sunil Mahabaleshwarkar and Shih-Yang Liu and Matthijs Van Keirsbilck and Min-Hung Chen and Yoshi Suhara and Yingyan
|
123 |
-
journal={arXiv preprint arXiv:xxxx},
|
124 |
year={2024},
|
125 |
-
|
126 |
-
}
|
127 |
-
|
|
|
|
|
|
82 |
### Step 2: Chat with Hymba-1.5B-Base
|
83 |
After setting up the environment, you can use the following script to chat with our Model
|
84 |
|
85 |
+
```py
|
86 |
from transformers import LlamaTokenizer, AutoModelForCausalLM, AutoTokenizer, AutoModel
|
87 |
import torch
|
88 |
|
|
|
117 |
|
118 |
## Citation
|
119 |
```
|
120 |
+
@misc{dong2024hymbahybridheadarchitecturesmall,
|
121 |
+
title={Hymba: A Hybrid-head Architecture for Small Language Models},
|
122 |
+
author={Xin Dong and Yonggan Fu and Shizhe Diao and Wonmin Byeon and Zijia Chen and Ameya Sunil Mahabaleshwarkar and Shih-Yang Liu and Matthijs Van Keirsbilck and Min-Hung Chen and Yoshi Suhara and Yingyan Lin and Jan Kautz and Pavlo Molchanov},
|
|
|
123 |
year={2024},
|
124 |
+
eprint={2411.13676},
|
125 |
+
archivePrefix={arXiv},
|
126 |
+
primaryClass={cs.CL},
|
127 |
+
url={https://arxiv.org/abs/2411.13676},
|
128 |
+
}
|