Spaces:

migueldeguzmandev
/

migueldeguzmandev-GPT2XL_RLLMv19-4

Sleeping

migueldeguzmandev commited on Apr 28

Commit

5d6c426

•

1 Parent(s): 10ea528

Update app.py

Files changed (1) hide show

app.py CHANGED Viewed

@@ -22,6 +22,7 @@ def generate_response(input_text, temperature):
         attention_mask=attention_mask,
         max_length=300,
         num_return_sequences=1,
         temperature=temperature,
         do_sample=True,  # Set do_sample to True when using temperature
     )
@@ -41,8 +42,7 @@ interface = gr.Interface(
     title="TestOnlyRLLMv19Layer4",
     description=(
         """
-        RLLMv19 is a spin-off experiment focusing on improving of GPT2XL's robustness. I created this gradio app to test model outputs and compare it to RLLMv3 prototype(<a href='https://www.lesswrong.com/posts/vZ5fM6FtriyyKbwi9/betterdan-ai-machiavelli-and-oppo-jailbreaks-vs-sota-models'>see relevant post</a>).
-        If you are interested in trying a full prototype - <a href='https://huggingface.co/spaces/migueldeguzmandev/RLLMv3.2-10'>Try this gradio app!</a>.
         """
     ),
 )

         attention_mask=attention_mask,
         max_length=300,
         num_return_sequences=1,
+        no_repeat_ngram_size=2,
         temperature=temperature,
         do_sample=True,  # Set do_sample to True when using temperature
     )
     title="TestOnlyRLLMv19Layer4",
     description=(
         """
+        RLLMv19 is a spin-off experiment focusing on improving GPT2XL's robustness to jailbreaks. The 4th layer of RLLMv19 is compared to the 4th layer of RLLMv3. Why RLLMv3? This <a href='https://huggingface.co/spaces/migueldeguzmandev/RLLMv3.2-10'>prototype</a> demonstrated a capability to resist jailbreak attacks up to 67.8%, which you can read more about (<a href='https://www.lesswrong.com/posts/vZ5fM6FtriyyKbwi9/betterdan-ai-machiavelli-and-oppo-jailbreaks-vs-sota-models'>here</a>).
         """
     ),
 )