Update README.md
README.md CHANGED
@@ -12,6 +12,7 @@ tags:
 - rlhf
 datasets:
 - Anthropic/hh-rlhf
+- nvidia/sft_datablend_v1
 ---

 # NV-Llama2-70B-RLHF-Chat
@@ -183,4 +184,4 @@ Pre-requisite: You would need at least a machine with 4 40GB or 2 80GB NVIDIA GP
 ## Limitations
 - The model was trained on data that contains toxic language and societal biases originally crawled from the Internet. Therefore, the model may amplify those biases and return toxic responses, especially when prompted with toxic prompts.
 - The model may generate answers that are inaccurate, omit key information, or include irrelevant or redundant text, producing socially unacceptable or undesirable output, even if the prompt itself does not include anything explicitly offensive.
-- We recommend deploying the model with [NeMo Guardrails](https://github.com/NVIDIA/NeMo-Guardrails) to mitigate these potential issues.
+- We recommend deploying the model with [NeMo Guardrails](https://github.com/NVIDIA/NeMo-Guardrails) to mitigate these potential issues.