Update README.md
README.md
CHANGED
@@ -16,6 +16,11 @@ widget:
[arXiv](https://arxiv.org/abs/2311.07052) | [GitHub](https://github.com/GeneZC/MiniMA) | [HuggingFace-MiniMA](https://huggingface.co/GeneZC/MiniMA-3B) | [HuggingFace-MiniChat](https://huggingface.co/GeneZC/MiniChat-3B) | [ModelScope-MiniMA](https://modelscope.cn/models/GeneZC/MiniMA-3B) | [ModelScope-MiniChat](https://modelscope.cn/models/GeneZC/MiniChat-3B) | [HuggingFace-MiniChat-1.5](https://huggingface.co/GeneZC/MiniChat-1.5-3B) | [HuggingFace-MiniMA-2](https://huggingface.co/GeneZC/MiniMA-2-3B) | [HuggingFace-MiniChat-2](https://huggingface.co/GeneZC/MiniChat-2-3B)

+ **Updates from MiniMA-3B**:
+ - continued from MiniMA-3B without distillation;
+ - better data mixture;
+ - more trained tokens.
+
Must comply with LICENSE of LLaMA-2 since it is derived from LLaMA-2.
A language model continued from MiniMA-3B.