This is a basic text-to-text "instruct" model, similar to Google's original [flan-t5](https://huggingface.co/collections/google/flan-t5-release-65005c39e3201fff885e22fb) model series (but not trained for as long).

<details>

<summary>Details: Click here to expand</summary>

Fine-tuned from [the base model](https://hf.co/pszemraj/tFINE-900m-e16-d32-1024ctx) on the `pszemraj/flan-subsets-deduped` dataset, subset `flan-v2`, for 1 epoch. It achieves the following results on the evaluation set:

- Loss: 1.4134
- Rouge1: 62.9142
- Gen Len: 12.0586
- Num Input Tokens Seen: 1931815668

### Model features

- pretrained & fine-tuned at 1024 context length (input)
- tokenizer with byte-pair fallback to support understanding and generating text beyond what the original T5 tokenizer does

</details>
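The byte-pair fallback can be checked directly. A minimal sketch below loads the tokenizer from the base model repo (an assumption for illustration; this model's own repo should work the same way) and round-trips text containing characters outside the original T5 vocabulary:

```python
from transformers import AutoTokenizer

# Assumption for illustration: the tokenizer ships with the base model repo.
tok = AutoTokenizer.from_pretrained("pszemraj/tFINE-900m-e16-d32-1024ctx")

# The emoji and accented characters fall outside the original T5 vocabulary;
# byte-pair fallback encodes them as byte-level tokens instead of <unk>.
text = "naïve café 🤖"
ids = tok(text).input_ids
decoded = tok.decode(ids, skip_special_tokens=True)
print(decoded)
```

With the original T5 tokenizer, the emoji would decode back as an unknown-token marker; byte fallback preserves it.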
## Usage Example

A minimal sketch using the Hugging Face `transformers` pipeline (the repo id below is a placeholder; replace it with this model's actual Hugging Face id):

```py
from transformers import pipeline

# placeholder repo id; substitute this model's actual repo id
pipe = pipeline("text2text-generation", model="<model-repo-id>")

result = pipe("What color is the sky?", max_new_tokens=64)
print(result[0]["generated_text"])
```