Update README.md
Browse files
README.md
CHANGED
@@ -65,6 +65,22 @@ It achieves quite competitive results in evaluation given its training token, an
|
|
65 |
Yet, there are still large gaps (particularly in ARC, HellaSwag, MMLU and GSM8K) between nano-phi-115M-v0.1 and phi-2, where author will attempt to narrow down the gap in the future.
|
66 |
No alignment has been done yet.
|
67 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
68 |
## Some metrics
|
69 |
- model
|
70 |
- hidden_size: 768
|
|
|
65 |
Yet, there are still large gaps (particularly in ARC, HellaSwag, MMLU and GSM8K) between nano-phi-115M-v0.1 and phi-2, where author will attempt to narrow down the gap in the future.
|
66 |
No alignment has been done yet.
|
67 |
|
68 |
+
|
69 |
+
## How to use
|
70 |
+
To use the model, you will need transformer version >= 4.37.2
|
71 |
+
```
|
72 |
+
pip install transformers>=4.37.2
|
73 |
+
```
|
74 |
+
|
75 |
+
```
|
76 |
+
# Use a pipeline as a high-level helper
|
77 |
+
from transformers import pipeline
|
78 |
+
|
79 |
+
pipe = pipeline("text-generation", model="kenhktsui/nano-phi-115M-v0.1")
|
80 |
+
pipe("I am a machine learning researcher. I work on", max_new_tokens=50, repetition_penalty=10.0)
|
81 |
+
# [{'generated_text': 'I am a machine learning researcher. I work on the problem of finding patterns in data, and it is not easy to find them all at once!\nThe first step was searching for pattern matching algorithms that are used by many people who have never seen an algorithm before (or even if they do).'}]
|
82 |
+
```
|
83 |
+
|
84 |
## Some metrics
|
85 |
- model
|
86 |
- hidden_size: 768
|