RLLMv7 / This experiment: Can RLLMv3's ability to defend against jailbreaks be attributed to datasets containing stories about Jung's shadow integration theory?
GPT2XL_RLLMv3 Post: BetterDAN, AI Machiavelli & Oppo Jailbreaks vs. SOTA models & GPT2XL_RLLMv3
Related post: Coherence (and Response Time) Test
Another Related Post: Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B)