lmzheng committed
Commit d978a8d
1 Parent(s): b0aa428

Update README.md

Files changed (1):
  1. README.md +3 -3
README.md CHANGED
@@ -7,12 +7,12 @@ license: llama2
 
 ## Model Details
 
- Vicuna is a chat assistant trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT.
+ Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT.
 
 - **Developed by:** [LMSYS](https://lmsys.org/)
 - **Model type:** An auto-regressive language model based on the transformer architecture
 - **License:** Non-commercial license
- - **Finetuned from model:** [LLaMA 2](https://arxiv.org/abs/2307.09288)
+ - **Finetuned from model:** [Llama 2](https://arxiv.org/abs/2307.09288)
 
 ### Model Sources
 
@@ -33,7 +33,7 @@ The primary intended users of the model are researchers and hobbyists in natural
 
 ## Training Details
 
- Vicuna v1.5 (16k) is fine-tuned from LLaMA with supervised instruction fine-tuning and linear RoPE scaling.
+ Vicuna v1.5 (16k) is fine-tuned from Llama 2 with supervised instruction fine-tuning and linear RoPE scaling.
 The training data is around 125K conversations collected from ShareGPT.com. These conversations are packed into sequences that contain 16K tokens each.
 See more details in the "Training Details of Vicuna Models" section in the appendix of this [paper](https://arxiv.org/pdf/2306.05685.pdf).
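
The Training Details hunk mentions linear RoPE scaling. A minimal sketch of the idea follows; the function names, the head dimension, and the scale factor of 4 (stretching Llama 2's 4K pretraining context to 16K) are illustrative assumptions, not details taken from this commit:

```python
def rope_inv_freq(head_dim, base=10000.0):
    # Standard RoPE inverse frequencies, one per rotated dimension pair.
    return [base ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]

def rope_angles(position, head_dim, scale=1.0):
    # Linear RoPE scaling: divide the position index by the scale factor,
    # so positions up to 16K map into the 0..4K range the base model saw
    # during pretraining (scale=4.0 here is an assumed example value).
    return [(position / scale) * f for f in rope_inv_freq(head_dim)]
```

With `scale=4.0`, position 4096 in the long-context model produces the same rotation angles as position 1024 did in the base model, which is why the fine-tuned model can reuse the pretrained positional behavior at longer ranges.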
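
The same hunk notes that the ~125K ShareGPT conversations are packed into 16K-token training sequences. One common way to do this is greedy packing; this is a hypothetical sketch, as the commit does not specify the packing algorithm used:

```python
def pack_conversations(token_lists, max_len=16384):
    # Greedily concatenate tokenized conversations into training sequences
    # of up to max_len tokens each. A conversation that would overflow the
    # current sequence starts a new one (hypothetical helper, for illustration).
    sequences, current = [], []
    for tokens in token_lists:
        if current and len(current) + len(tokens) > max_len:
            sequences.append(current)
            current = []
        current.extend(tokens)
    if current:
        sequences.append(current)
    return sequences
```

Packing keeps every position in a 16K sequence filled with real tokens instead of padding, which matters when fine-tuning specifically for long-context behavior.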