pmolchanov commited on
Commit
f8f6a65
·
verified ·
1 Parent(s): ac59e23

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -8
README.md CHANGED
@@ -82,7 +82,7 @@ docker run --gpus all -v /home/$USER:/home/$USER -it ghcr.io/tilmto/hymba:v1 bas
82
  ### Step 2: Chat with Hymba-1.5B-Base
83
  After setting up the environment, you can use the following script to chat with our Model
84
 
85
- ```
86
  from transformers import LlamaTokenizer, AutoModelForCausalLM, AutoTokenizer, AutoModel
87
  import torch
88
 
@@ -117,11 +117,12 @@ Please report security vulnerabilities or NVIDIA AI Concerns [here](https://www.
117
 
118
  ## Citation
119
  ```
120
- @article{hymba2024,
121
- title={A Hybrid-head Architecture for Small Language Models},
122
- author={Xin Dong and Yonggan Fu and Shizhe Diao and Wonmin Byeon and Zijia Chen and Ameya Sunil Mahabaleshwarkar and Shih-Yang Liu and Matthijs Van Keirsbilck and Min-Hung Chen and Yoshi Suhara and Yingyan Celine Lin and Jan Kautz and Pavlo Molchanov},
123
- journal={arXiv preprint arXiv:xxxx},
124
  year={2024},
125
- url={https://arxiv.org/abs/xxxx},
126
- }
127
- ```
 
 
 
82
  ### Step 2: Chat with Hymba-1.5B-Base
83
  After setting up the environment, you can use the following script to chat with our Model
84
 
85
+ ```py
86
  from transformers import LlamaTokenizer, AutoModelForCausalLM, AutoTokenizer, AutoModel
87
  import torch
88
 
 
117
 
118
  ## Citation
119
  ```
120
+ @misc{dong2024hymbahybridheadarchitecturesmall,
121
+ title={Hymba: A Hybrid-head Architecture for Small Language Models},
122
+ author={Xin Dong and Yonggan Fu and Shizhe Diao and Wonmin Byeon and Zijia Chen and Ameya Sunil Mahabaleshwarkar and Shih-Yang Liu and Matthijs Van Keirsbilck and Min-Hung Chen and Yoshi Suhara and Yingyan Lin and Jan Kautz and Pavlo Molchanov},
 
123
  year={2024},
124
+ eprint={2411.13676},
125
+ archivePrefix={arXiv},
126
+ primaryClass={cs.CL},
127
+ url={https://arxiv.org/abs/2411.13676},
128
+ }