gregH committed on
Commit
5bdc9df
1 Parent(s): 14275e6

Update index.html

Files changed (1)
  1. index.html +4 -1
index.html CHANGED
@@ -121,7 +121,10 @@ Exploring Refusal Loss Landscapes </title>
  <ul>
  <li>Paper: <a href="https://arxiv.org/abs/2310.04451" target="_blank" rel="noopener noreferrer">
  AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models</a></li>
- <li>Brief Introduction: </li>
+ <li>Brief Introduction: AutoDAN is an automatic framework for generating stealthy jailbreak prompts, based on a carefully designed
+ hierarchical genetic algorithm. AutoDAN preserves the meaningfulness and fluency (i.e., stealthiness) of jailbreak prompts,
+ akin to handcrafted ones, while also ensuring automated deployment as introduced in prior token-level research like GCG.
+ </li>
  </ul>
  </div>
  <h3>PAIR</h3>