# Flacuna: A Vicuna made of Flan

<img src="" alt="Image" width="200" height="335">

Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is already an excellent writing assistant, and the intention behind Flacuna was to enhance Vicuna's problem-solving capabilities. To achieve this, we curated a dedicated instruction dataset called Flan-mini.

| Dataset Name | Source | Dataset Size |
| --- | --- | --- |
| ... | ... | ... |
| ShareGPT | ChatGPT | 60K |
| Total | - | 1.34M |

## Problem Solving Ability
As a result of this fine-tuning process, Flacuna exhibited notable performance improvements in problem-solving across multiple benchmark datasets, both in few-shot and zero-shot settings.

| Flacuna | 13B | 49.4 | 32.5 | 67.9 |

During training, Flacuna was initialized from the 13B Vicuna checkpoint (itself based on LLaMA) and used a maximum input sequence length of 1280. We used LoRA for parameter-efficient fine-tuning.
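
The training code itself is not included in this README. As a rough sketch, a LoRA setup of this kind can be expressed with the Hugging Face `peft` library along the following lines; the base checkpoint id, LoRA rank, and target modules are illustrative placeholders rather than Flacuna's actual hyperparameters, and only the 1280-token maximum input length comes from the description above.

```python
# Illustrative sketch only: parameter-efficient LoRA fine-tuning of a
# Vicuna-style 13B checkpoint. The base model id and LoRA hyperparameters
# below are assumptions, not the values actually used for Flacuna.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_model_id = "lmsys/vicuna-13b-v1.3"  # assumed starting checkpoint

tokenizer = AutoTokenizer.from_pretrained(base_model_id)
model = AutoModelForCausalLM.from_pretrained(base_model_id)

# Only the small LoRA adapter matrices are trained; the base weights stay frozen.
lora_config = LoraConfig(
    r=16,                                 # adapter rank (placeholder)
    lora_alpha=32,                        # scaling factor (placeholder)
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt (placeholder)
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()

# Flan-mini examples would then be tokenized with the 1280-token maximum
# input length mentioned above and trained with a standard causal
# language-modeling objective (e.g. via transformers.Trainer).
max_input_length = 1280
```
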
## Chatbot / Writing Assistant
While Flacuna primarily excels in problem-solving tasks, we made efforts to maintain the impressive writing and chatting ability of Vicuna. To achieve this, we incorporated conversational datasets, such as GPT-4-Alpaca (generated by GPT-4) and ShareGPT, into the Flan-mini collection.
To use Flacuna as a chatbot or writing assistant, we recommend you use the following template:
```
A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions. USER: {definition of the task}.\n\n
{question}\n
Output: ASSISTANT:
```
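
As a rough illustration (not the official usage instructions), the template can be combined with the Hugging Face `transformers` generation API along the following lines. The model repository id and generation settings here are assumptions made for the example; please follow the model card for the exact loading procedure.

```python
# Minimal inference sketch using the prompt template above.
# The model id and generation settings are illustrative assumptions;
# consult the official model card for the exact loading procedure.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "declare-lab/flacuna-13b-v1.0"  # assumed repository id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

task_definition = "Answer the following question concisely."       # {definition of the task}
question = "Which planet in the solar system has the most moons?"  # {question}

prompt = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the user's questions. "
    f"USER: {task_definition}\n\n"
    f"{question}\n"
    "Output: ASSISTANT:"
)

inputs = tokenizer(prompt, return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=256)

# Decode only the tokens generated after the prompt.
response = tokenizer.decode(
    output_ids[0][inputs["input_ids"].shape[1]:],
    skip_special_tokens=True,
)
print(response)
```
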
Please note that for chatbot and writing-assistant use cases we still recommend Vicuna over Flacuna; Flacuna's primary strength lies in problem-solving tasks, and it is best suited to those applications.