Update README.md
Browse files
README.md
CHANGED
@@ -11,4 +11,6 @@ tags:
|
|
11 |
- PPO
|
12 |
language:
|
13 |
- en
|
14 |
-
---
|
|
|
|
|
|
11 |
- PPO
|
12 |
language:
|
13 |
- en
|
14 |
+
---
|
15 |
+
|
16 |
+
This model is fintuned using PPO based NLPO RL algorithm, on ccdv/arxiv-summarization dataset. The base model is pretunerd version of flan-t5-base model.
|