---
license: apache-2.0
---

A LoRA-based implementation of AlpacaFarm RLHF PPO.

More details are available at https://github.com/SimengSun/alpaca_farm_lora