PEFT
Safetensors
File size: 928 Bytes
644107a
 
a23bd86
644107a
 
 
 
a6add25
0019314
644107a
0019314
644107a
0019314
644107a
0019314
644107a
0019314
644107a
0019314
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
---
library_name: peft
base_model: microsoft/Phi-3-mini-4k-instruct
---

# Model Card for Model ID

In this repo are LoRa weights of the Phi-3-mini-4k-instruct model (https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) finetuned with the Continuous Adversarial Preference Optimisation (CAPO) algorithm.
For more information, see our paper "Efficient Adversarial Training in LLMs with Continuous Attacks" (https://arxiv.org/abs/2405.15589)

## Github

https://github.com/sophie-xhonneux/Continuous-AdvTrain/edit/master/README.md

## Citation

If you used this model, please cite our paper:

```
@misc{xhonneux2024efficient,
      title={Efficient Adversarial Training in LLMs with Continuous Attacks}, 
      author={Sophie Xhonneux and Alessandro Sordoni and Stephan Günnemann and Gauthier Gidel and Leo Schwinn},
      year={2024},
      eprint={2405.15589},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
```