nvidia/Nemotron-4-340B-Instruct

okuchaiev committed on Jun 13, 2024

Commit e3d6f77 · verified · 1 Parent(s): be08e80

Update README.md

Files changed (1)

README.md +1 -1

@@ -19,7 +19,7 @@ Subsequently the Nemotron-4-340B-Instruct model went through additional alignmen
 - Supervised Fine-tuning (SFT)
 - Direct Policy Optimization (DPO)
-- Additional in-house alignment techniques
+- Additional in-house alignment techniques (Publication work in progress)
 This results in a final model that is aligned for human chat preferences, improvements in mathematical reasoning, coding and instruction following.