tags:
- torchtune
- grammar-correction
---

### Llama3 CoEdit
This is a Llama3 8B based model trained using [torchtune](https://pytorch.org/torchtune) on the `grammarly/coedit` dataset.

### Training details
|
20 |
+
|
21 |
+
The exact training script (`lora_finetune_distributed`) and config (`8B_lora.yaml`) are both included in this repository. Specifically, in order to add the dataset, I added the following lines to the config:
|
22 |
+
|
23 |
+
```
dataset:
  _component_: torchtune.datasets.instruct_dataset
  source: grammarly/coedit
  template: GrammarErrorCorrectionTemplate
  column_map: {"sentence": "src", "output": "tgt"}
  train_on_input: False
  split: train
```
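For intuition, the effect of `column_map` and the template can be sketched in plain Python. This is only an illustrative sketch of the data flow: the helper names (`remap_columns`, `format_gec_prompt`) are hypothetical, and the exact prompt wording used by torchtune's `GrammarErrorCorrectionTemplate` may differ.

```python
# Illustrative sketch of what column_map + a GEC template do to one
# grammarly/coedit row. Not torchtune's actual implementation.

def remap_columns(row, column_map):
    # column_map maps template keys to dataset columns,
    # e.g. {"sentence": "src", "output": "tgt"}.
    return {key: row[col] for key, col in column_map.items()}

def format_gec_prompt(sample):
    # Hypothetical stand-in for GrammarErrorCorrectionTemplate;
    # the real prompt text is an assumption here.
    return f"Correct this to standard English: {sample['sentence']}\n---\nCorrected: "

row = {"src": "Fix grammar: She go to school.", "tgt": "She goes to school."}
sample = remap_columns(row, {"sentence": "src", "output": "tgt"})
prompt = format_gec_prompt(sample)
# With train_on_input: False, the loss is computed only on the completion
# (sample["output"]), not on the prompt tokens.
```

In the real pipeline, torchtune's `instruct_dataset` builder performs this remapping and the tokenization; the sketch only shows how the columns move through the template.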

### Evaluation results