RonanMcGovern
commited on
Commit
•
9fca5f7
1
Parent(s):
131be31
add gguf
Browse files
README.md
CHANGED
@@ -12,7 +12,7 @@ tags:
|
|
12 |
- functions
|
13 |
- function calling
|
14 |
- sharded
|
15 |
-
-
|
16 |
- gptq
|
17 |
---
|
18 |
# Function Calling Llama 2 (version 2)
|
@@ -24,7 +24,7 @@ tags:
|
|
24 |
2. Function descriptions are moved outside of the system prompt. This avoids the behaviour of function calling being affected by how the system prompt had been trained to influence the model.
|
25 |
|
26 |
Available models:
|
27 |
-
- Llama-7B-chat with function calling ([Base Model](https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-adapters-v2)) - Free
|
28 |
- Llama-13B-chat with function calling ([Base Model](https://huggingface.co/Trelis/Llama-2-13b-chat-hf-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/Llama-2-13b-chat-hf-function-calling-adapters-v2)) - Paid, [purchase here: €19.99 per user/seat.](https://buy.stripe.com/9AQ7te3lHdmbdZ68wz)
|
29 |
- CodeLlama-34B-Instruct with function calling ([Base Model](https://huggingface.co/Trelis/CodeLlama-34b-Instruct-hf-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/CodeLlama-34b-Instruct-hf-function-calling-adapters-v2)) - Paid, [purchase here: €44.99 per user/seat.](https://buy.stripe.com/cN27teg8t2Hx5sA8wM)
|
30 |
|
@@ -142,7 +142,7 @@ It is recommended to handle cases where:
|
|
142 |
|
143 |
## Inference
|
144 |
|
145 |
-
**Quick Start**
|
146 |
Try out this notebook [fLlama_Inference notebook](https://colab.research.google.com/drive/1Ow5cQ0JNv-vXsT-apCceH6Na3b4L7JyW?usp=sharing)
|
147 |
|
148 |
**Commercial Applications**
|
@@ -156,6 +156,9 @@ Below follows information on the original Llama 2 model...
|
|
156 |
|
157 |
~
|
158 |
|
|
|
|
|
|
|
159 |
# **Llama 2**
|
160 |
Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the repository for the 7B fine-tuned model, optimized for dialogue use cases and converted for the Hugging Face Transformers format. Links to other models can be found in the index at the bottom.
|
161 |
|
|
|
12 |
- functions
|
13 |
- function calling
|
14 |
- sharded
|
15 |
+
- gguf
|
16 |
- gptq
|
17 |
---
|
18 |
# Function Calling Llama 2 (version 2)
|
|
|
24 |
2. Function descriptions are moved outside of the system prompt. This avoids the behaviour of function calling being affected by how the system prompt had been trained to influence the model.
|
25 |
|
26 |
Available models:
|
27 |
+
- Llama-7B-chat with function calling ([Base Model](https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-adapters-v2)), ([GGUF - see the 'gguf' branch]) - Free
|
28 |
- Llama-13B-chat with function calling ([Base Model](https://huggingface.co/Trelis/Llama-2-13b-chat-hf-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/Llama-2-13b-chat-hf-function-calling-adapters-v2)) - Paid, [purchase here: €19.99 per user/seat.](https://buy.stripe.com/9AQ7te3lHdmbdZ68wz)
|
29 |
- CodeLlama-34B-Instruct with function calling ([Base Model](https://huggingface.co/Trelis/CodeLlama-34b-Instruct-hf-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/CodeLlama-34b-Instruct-hf-function-calling-adapters-v2)) - Paid, [purchase here: €44.99 per user/seat.](https://buy.stripe.com/cN27teg8t2Hx5sA8wM)
|
30 |
|
|
|
142 |
|
143 |
## Inference
|
144 |
|
145 |
+
**Quick Start in Google Colab**
|
146 |
Try out this notebook [fLlama_Inference notebook](https://colab.research.google.com/drive/1Ow5cQ0JNv-vXsT-apCceH6Na3b4L7JyW?usp=sharing)
|
147 |
|
148 |
**Commercial Applications**
|
|
|
156 |
|
157 |
~
|
158 |
|
159 |
+
**Run on your laptop**
|
160 |
+
Run on your laptop [video and juypter notebook](https://youtu.be/rjSWCMVbD_U)
|
161 |
+
|
162 |
# **Llama 2**
|
163 |
Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the repository for the 7B fine-tuned model, optimized for dialogue use cases and converted for the Hugging Face Transformers format. Links to other models can be found in the index at the bottom.
|
164 |
|