soujanyaporia
committed on
Commit 3ffbf1b
Parent: a1866be
Update README.md
README.md
CHANGED
@@ -5,6 +5,8 @@ license: apache-2.0
 
 We created **Starling** by fine-tuning Vicuna-7B on HarmfulQA, a ChatGPT-distilled dataset that we collected using the Chain of Utterances (CoU) prompt. More details are on our paper [**Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment**](https://openreview.net/pdf?id=jkcHYEfPv3)
 
+<img src="https://declare-lab.net/assets/images/logos/starling-final.png" alt="Image" width="100" height="100">
+
 Experimental results on several safety benchmark datasets indicate that **Starling** is a safer model compared to the baseline model, Vicuna.
 
 <img src="https://declare-lab.net/assets/images/logos/method.png" alt="Image" width="1000" height="335">