vibhorg committed on
Commit e1542dd
1 Parent(s): e58d3bf

Update README.md

Files changed (1)
  1. README.md +3 -1
README.md CHANGED
@@ -11,4 +11,6 @@ tags:
   - PPO
   language:
   - en
- ---
+ ---
+
+ This model is fine-tuned with the PPO-based NLPO RL algorithm on the ccdv/arxiv-summarization dataset. The base model is a pretrained version of the flan-t5-base model.