yunconglong
commited on
Commit
•
c1d340a
1
Parent(s):
631ecd3
Update README.md
Browse files
README.md
CHANGED
@@ -2,6 +2,8 @@
|
|
2 |
license: other
|
3 |
tags:
|
4 |
- moe
|
|
|
|
|
5 |
---
|
6 |
|
7 |
* [DPO Trainer](https://huggingface.co/docs/trl/main/en/dpo_trainer) with dataset jondurbin/truthy-dpo-v0.1
|
|
|
2 |
license: other
|
3 |
tags:
|
4 |
- moe
|
5 |
+
- DPO
|
6 |
+
- RL-TUNED
|
7 |
---
|
8 |
|
9 |
* [DPO Trainer](https://huggingface.co/docs/trl/main/en/dpo_trainer) with dataset jondurbin/truthy-dpo-v0.1
|