alonzogarbanzo committed "Update README.md" (commit 4105f1c, parent 83285c3)
results: []
---

# Bloom-1b7-dialogsum-IT

This model is an instruction-tuned version of [bigscience/bloom-1b7](https://huggingface.co/bigscience/bloom-1b7) on a dialogue summarization dataset.
## Model description
## Training and evaluation data

Instruction-tuned on the dialogue summarization task here: https://huggingface.co/datasets/adambjorn/UnrelatedForgettingOverhead/viewer/dialogsum/train
## Training procedure

Given a set of prompts:

``` python
prompts = [
    "Provide a concise summary for the following dialogue:",
    "Summarize this conversation in a few sentences:",
    "Here is a dialogue. Can you summarize it briefly?",
    "Read the following dialogue and write a short summary:",
    "Condense the essence of this conversation into a summary:"
]
```
Each example is concatenated with the prompt, the dialogue, and the summary like so:

``` python
concatenated_texts = [
    random.choice(prompts) + " " + dialogue + "<\s>" + " Summary:" + summary
    for dialogue, summary in zip(examples['dialogue'], examples['summary'])
]
```
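As a hypothetical illustration (the dialogue and summary below are made up, not taken from the dataset), the concatenation above turns one record in the dataset's `{'dialogue': ..., 'summary': ...}` shape into a single training string:

``` python
import random

random.seed(0)  # deterministic prompt choice for the illustration

prompts = [
    "Provide a concise summary for the following dialogue:",
    "Summarize this conversation in a few sentences:",
]

# Toy batch in the same column layout as the dataset
examples = {
    "dialogue": ["#Person1#: Are we still on for lunch? #Person2#: Yes, see you at noon."],
    "summary": ["Two people confirm their lunch plans."],
}

concatenated_texts = [
    # "<\\s>" reproduces the literal separator written in the card
    # (escaped here to avoid a Python SyntaxWarning)
    random.choice(prompts) + " " + dialogue + "<\\s>" + " Summary:" + summary
    for dialogue, summary in zip(examples["dialogue"], examples["summary"])
]
print(concatenated_texts[0])
```

Each resulting string is one flat sequence: prompt, then dialogue, then the separator and the target summary.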
### Training hyperparameters

The following hyperparameters were used during training:
### Training results

Final epoch results: {'loss': 0.0137, 'grad_norm': 0.6599154472351074, 'learning_rate': 7.000000000000001e-07, 'epoch': 10.0}

Average results: {'train_runtime': 1142.1524, 'train_samples_per_second': 1.751, 'train_steps_per_second': 0.438, 'train_loss': 0.37129621666669843, 'epoch': 10.0}
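As a back-of-the-envelope consistency check (these derived figures are not stated in the card, and assume the Trainer's usual definitions of the metrics), the reported runtime and throughput imply the run's size:

``` python
# Averages reported above
train_runtime = 1142.1524            # seconds
train_samples_per_second = 1.751
train_steps_per_second = 0.438
epochs = 10.0

total_samples = train_runtime * train_samples_per_second  # ~2000 samples seen in total
total_steps = train_runtime * train_steps_per_second      # ~500 optimizer steps
samples_per_epoch = total_samples / epochs                # ~200 training examples
samples_per_step = total_samples / total_steps            # ~4, i.e. the effective batch size
```

So the numbers are mutually consistent with roughly 200 training examples seen for 10 epochs at an effective batch size of about 4.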
### Framework versions