pankajmathur committed on
Commit
3ecd27d
1 Parent(s): 9bedbb7

Update README.md

  1. README.md +19 -16
README.md CHANGED
@@ -27,26 +27,12 @@ A hybrid (explain + instruct) style Llama2-70b model, Please check examples below
27
  * WizardLM_orca
28
  * Dolly-V2_orca
29
 
30
-
31
  <br>
32
 
33
  **P.S. If you're interested to collaborate, please connect with me at www.linkedin.com/in/pankajam.**
34
 
35
  <br>
36
 
37
-
38
-
39
- ### quantized versions
40
- Coming Soon....
41
-
42
- <br>
43
-
44
- #### license disclaimer:
45
-
46
- This model is bound by the license & usage restrictions of the original Llama-2 model, and comes with no warranty or guarantees of any kind.
47
-
48
- <br>
49
-
50
  ## Evaluation
51
 
52
  We evaluated model_001 on a wide range of tasks using [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) from EleutherAI.
@@ -68,9 +54,9 @@ Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](htt
68
 
69
  <br>
70
 
71
- ## Example Usage
72
 
73
- Here is the Orca prompt format
74
 
75
  ```
76
  ### System:
@@ -83,6 +69,17 @@ Tell me about Orcas.
83
 
84
  ```
85
 
86
  Below is a code example showing how to use this model
87
 
88
  ```python
@@ -147,6 +144,12 @@ print(tokenizer.decode(output[0], skip_special_tokens=True))
147
 
148
  <br>
149
 
150
  #### Limitations & Biases:
151
 
152
  While this model aims for accuracy, it can occasionally produce inaccurate or misleading results.
 
27
  * WizardLM_orca
28
  * Dolly-V2_orca
29
 
30
  <br>
31
 
32
  **P.S. If you're interested to collaborate, please connect with me at www.linkedin.com/in/pankajam.**
33
 
34
  <br>
35
 
36
  ## Evaluation
37
 
38
  We evaluated model_001 on a wide range of tasks using [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) from EleutherAI.
 
54
 
55
  <br>
56
 
57
+ ## Prompt Format
58
 
59
+ Here is the prompt format
60
 
61
  ```
62
  ### System:
 
69
 
70
  ```
71
 
72
+ #### OobaBooga Instructions:
73
+
74
+ This model requires up to 45GB of GPU VRAM in 4-bit, so it can be loaded directly on a single RTX 6000/L40/A40/A100/H100 GPU or on two RTX 4090/L4/A10/RTX 3090/RTX A5000 GPUs.
75
+ So, if you have access to a machine with 45GB of GPU VRAM and have installed [OobaBooga Web UI](https://github.com/oobabooga/text-generation-webui) on it,
76
+ you can download this model by pasting the HF repo link directly into the OobaBooga Web UI "Model" tab and then selecting the **load-in-4bit** option there.
77
+
78
+
79
+ <br>
80
+
81
+ #### Code Instructions:
82
+
83
  Below is a code example showing how to use this model
84
 
85
  ```python
 
144
 
145
  <br>
146
 
147
+ #### license disclaimer:
148
+
149
+ This model is bound by the license & usage restrictions of the original Llama-2 model, and comes with no warranty or guarantees of any kind.
150
+
151
+ <br>
152
+
153
  #### Limitations & Biases:
154
 
155
  While this model aims for accuracy, it can occasionally produce inaccurate or misleading results.
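
As a rough sanity check on the 45GB figure quoted in the OobaBooga instructions above, the sketch below estimates the memory taken by the weights alone. It assumes a nominal 70 billion parameters (inferred from the "Llama2-70b" name, not stated in the README) and ignores quantization constants, activations, and the KV cache, which plausibly account for the remaining headroom. The function name `weight_memory_gb` is hypothetical and only for illustration.

```python
def weight_memory_gb(n_params: float, bits_per_param: int) -> float:
    """Memory needed for the weights alone, in GB (1 GB = 10**9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


# Assumption: nominal parameter count taken from the "70b" in the model name.
N_PARAMS = 70e9

# Figure quoted in the README for 4-bit loading.
QUOTED_VRAM_GB = 45.0

weights_4bit_gb = weight_memory_gb(N_PARAMS, 4)
weights_fp16_gb = weight_memory_gb(N_PARAMS, 16)

# Whatever is left of the quoted figure after the 4-bit weights are placed.
headroom_gb = QUOTED_VRAM_GB - weights_4bit_gb

print(f"4-bit weights: {weights_4bit_gb:.1f} GB")
print(f"fp16 weights:  {weights_fp16_gb:.1f} GB")
print(f"headroom within the quoted 45 GB: {headroom_gb:.1f} GB")
```

Under these assumptions the 4-bit weights come to about 35 GB, which is consistent with the README's "up to 45GB" once runtime overhead is included, and shows why the unquantized fp16 weights would not fit on the same hardware.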