aashish1904 committed on
Commit
fd81eb7
1 Parent(s): 08d5ee6

Upload README.md with huggingface_hub

  1. README.md +24 -14
README.md CHANGED
@@ -1,4 +1,6 @@
+
 ---
+
 library_name: transformers
 license: llama2
 datasets:
@@ -6,26 +8,34 @@ datasets:
 - microsoft/orca-math-word-problems-200k
 - m-a-p/CodeFeedback-Filtered-Instruction
 - anon8231489123/ShareGPT_Vicuna_unfiltered
-base_model: abacusai/Llama-3-Smaug-8B
-pipeline_tag: text-generation
+
 ---
 
-# Llama-3-Smaug-8B-GGUF
+[![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
 
-- This is quantized version of [abacusai/Llama-3-Smaug-8B](https://huggingface.co/abacusai/Llama-3-Smaug-8B) created using llama.cpp
 
+# QuantFactory/Llama-3-Smaug-8B-GGUF
+This is quantized version of [abacusai/Llama-3-Smaug-8B](https://huggingface.co/abacusai/Llama-3-Smaug-8B) created using llama.cpp
 
+# Original Model Card
 
-### Model Description
 
-- **Developed by:** [Abacus.AI](https://abacus.ai)
-- **License:** https://llama.meta.com/llama3/license/
-- **Finetuned from model:** [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B).
+# Llama-3-Smaug-8B
 
 ### Built with Meta Llama 3
 
+
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/64c14f95cac5f9ba52bbcd7f/OrcJyTaUtD2HxJOPPwNva.png)
+
 This model was built using the Smaug recipe for improving performance on real world multi-turn conversations applied to
-[meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B).
+[meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct).
+
+### Model Description
+
+- **Developed by:** [Abacus.AI](https://abacus.ai)
+- **License:** https://llama.meta.com/llama3/license/
+- **Finetuned from model:** [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct).
+
 
 ## Evaluation
 
@@ -36,22 +46,22 @@ This model was built using the Smaug recipe for improving performance on real w
 score
 model turn
 Llama-3-Smaug-8B 1 8.77500
-Meta-Llama-3-8B-Instruct 1 8.1
+Meta-Llama-3-8B-Instruct 1 8.31250
 ########## Second turn ##########
 score
 model turn
-Meta-Llama-3-8B-Instruct 2 8.2125
+Meta-Llama-3-8B-Instruct 2 7.8875
 Llama-3-Smaug-8B 2 7.8875
 ########## Average ##########
 score
 model
 Llama-3-Smaug-8B 8.331250
-Meta-Llama-3-8B-Instruct 8.15625
+Meta-Llama-3-8B-Instruct 8.10
 ```
 
 | Model | First turn | Second Turn | Average |
 | :---- | ---------: | ----------: | ------: |
 | Llama-3-Smaug-8B | 8.78 | 7.89 | 8.33 |
-| Llama-3-8B-Instruct | 8.1 | 8.21 | 8.16 |
+| Llama-3-8B-Instruct | 8.31 | 7.89 | 8.10 |
 
-This version of Smaug uses new techniques and new data compared to [Smaug-72B](https://huggingface.co/abacusai/Smaug-72B-v0.1), and more information will be released later on. For now, see the previous Smaug paper: https://arxiv.org/abs/2402.13228.
+This version of Smaug uses new techniques and new data compared to [Smaug-72B](https://huggingface.co/abacusai/Smaug-72B-v0.1), and more information will be released later on. For now, see the previous Smaug paper: https://arxiv.org/abs/2402.13228.
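
The averages in the updated evaluation table appear to be the unweighted mean of the first-turn and second-turn scores. The README does not state this, so treat it as an assumption that happens to match the reported figures. A minimal sketch that recomputes the averages from the per-turn scores in the new side of the diff (the names `turn_scores` and `average_scores` are illustrative only):

```python
# Per-turn scores copied from the updated evaluation output in the diff.
turn_scores = {
    "Llama-3-Smaug-8B": (8.775, 7.8875),
    "Meta-Llama-3-8B-Instruct": (8.3125, 7.8875),
}


def average_scores(scores):
    """Return the unweighted mean of the per-turn scores for each model.

    Assumption: the reported average weights both turns equally.
    """
    return {model: sum(turns) / len(turns) for model, turns in scores.items()}


averages = average_scores(turn_scores)
for model, avg in averages.items():
    # Two decimals, to match the precision used in the markdown table.
    print(f"{model}: {avg:.2f}")
```

Under that assumption the recomputed values agree with the 8.331250 and 8.10 averages shown in the updated output.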