MBZUAI
/

LaMini-Flan-T5-77M

Text2Text Generation

Generated from Trainer

instruction fine-tuning

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

chiyuzhang commited on Apr 24, 2023

Commit

7256c50

·

1 Parent(s): ddd6b0a

Update README.md

Files changed (1) hide show

README.md +11 -10

README.md CHANGED Viewed

@@ -2,31 +2,26 @@
 license: apache-2.0
 tags:
 - generated_from_trainer
 model-index:
 - name: flan-t5-small-distil-v2
   results: []
 language:
 - en
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# flan-t5-small-distil-v2
-This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
@@ -43,6 +38,10 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - num_epochs: 5
 ## Use
 ### CPU
@@ -87,7 +86,9 @@ print("Response": generated_text)
 </details>
 ### Framework versions

 license: apache-2.0
 tags:
 - generated_from_trainer
+- instruction fine-tuning
 model-index:
 - name: flan-t5-small-distil-v2
   results: []
 language:
 - en
+pipeline_tag: text2text-generation
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# LaMini-FLAN-T5-Small
+This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on [LaMini dataset]() that contains 2.58M samples for instruction fine-tuning. For more information about our dataset, please refer to our [project repository]().
 ## Model description
+We initialize with [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) and fine-tune it on our [LaMini dataset](). Its total number of parameters is 61M.
 ## Training procedure
 - lr_scheduler_type: linear
 - num_epochs: 5
+## Training and evaluation data
+We conducted two sets of evaluations: automatic evaluation on downstream NLP tasks and human evaluation on user-oriented instructions. For more detail, please refer to our [paper]().
 ## Use
 ### CPU
 </details>
+## Intended uses & limitations
+More information needed
 ### Framework versions