Update index.html
index.html: +4 -1
@@ -121,7 +121,10 @@ Exploring Refusal Loss Landscapes </title>
 <ul>
 <li>Paper: <a href="https://arxiv.org/abs/2310.04451" target="_blank" rel="noopener noreferrer">
 AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models</a></li>
-<li>Brief Introduction:
+<li>Brief Introduction: AutoDAN is an automatic framework for generating stealthy jailbreak prompts, based on a carefully designed
+hierarchical genetic algorithm. AutoDAN preserves the meaningfulness and fluency (i.e., stealthiness) of jailbreak prompts,
+akin to handcrafted ones, while also ensuring automated deployment, as in prior token-level methods such as GCG.
+</li>
 </ul>
 </div>
 <h3>PAIR</h3>