Commit
•
16ba04f
1
Parent(s):
64a393f
Update README.md
Browse files
README.md
CHANGED
@@ -16,6 +16,7 @@ This is multi_verse_model-10.7B, a depth-upscaled version of [MTSAIR/multi_verse
|
|
16 |
This model is intended to be used as a basis for further fine-tuning, or as a drop-in upgrade from the original 7 billion parameter model.
|
17 |
|
18 |
Paper detailing how Depth-Up Scaling works: [SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling](https://arxiv.org/abs/2312.15166)
|
|
|
19 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
20 |
|
21 |
## Merge Details
|
|
|
16 |
This model is intended to be used as a basis for further fine-tuning, or as a drop-in upgrade from the original 7 billion parameter model.
|
17 |
|
18 |
Paper detailing how Depth-Up Scaling works: [SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling](https://arxiv.org/abs/2312.15166)
|
19 |
+
|
20 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
21 |
|
22 |
## Merge Details
|