chaoweihuang
/

FactAlign-Phi-3-Mini

alignment-handbook

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

chaoweihuang commited on Oct 7

Commit

3179552

•

1 Parent(s): f097e90

Update README.md

Files changed (1) hide show

README.md +18 -13

README.md CHANGED Viewed

@@ -15,7 +15,24 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# kto-mix-14k-lf-response-phi3-f1_100_0.7-fg0.5-kto-fg-fgudw4.0
 This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on the trl-lib/kto-mix-14k and the chaoweihuang/lf-response-phi3-f1_100_0.7-fg0.5 datasets.
 It achieves the following results on the evaluation set:
@@ -39,18 +56,6 @@ It achieves the following results on the evaluation set:
 - Fg Logps/reference Kl: -20.2070
 - Fg Loss: 0.7365
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# FactAlign-Phi-3-Mini
+This model is aligned with our **FactAlign** framework for improved long-form factuality, from [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct).
+For more information, please refer to our paper: [FactAlign: Long-form Factuality Alignment of Large Language Models](https://huggingface.co/papers/2410.01691).
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
 This model is a fine-tuned version of [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) on the trl-lib/kto-mix-14k and the chaoweihuang/lf-response-phi3-f1_100_0.7-fg0.5 datasets.
 It achieves the following results on the evaluation set:
 - Fg Logps/reference Kl: -20.2070
 - Fg Loss: 0.7365
 ## Training procedure
 ### Training hyperparameters