Update README.md
Browse files
README.md
CHANGED
@@ -14,7 +14,8 @@ inference:
|
|
14 |
|
15 |
## Model sheet for AstraQuasar-4B
|
16 |
|
17 |
-
**AstraQuasar-4B** is our first pre-trained Large Language Model (LLM) for text generation.
|
|
|
18 |
AstraQuasar-4B-v.0.1 is built upon the foundation of the Phi-2 architecture, with **significant enhancements including an increased number of layers and the innovative introduction of a novel technique known as the duplicate trick.**
|
19 |
|
20 |
<p align="center">
|
|
|
14 |
|
15 |
## Model sheet for AstraQuasar-4B
|
16 |
|
17 |
+
**AstraQuasar-4B** is our first pre-trained Large Language Model (LLM) for text generation.
|
18 |
+
It is a model with **4B parameters**, whithout embeddings.
|
19 |
AstraQuasar-4B-v.0.1 is built upon the foundation of the Phi-2 architecture, with **significant enhancements including an increased number of layers and the innovative introduction of a novel technique known as the duplicate trick.**
|
20 |
|
21 |
<p align="center">
|