vibhorg committed on
Commit
5e268f8
·
verified ·
1 Parent(s): a71d1c9

Update README.md

  1. README.md +14 -0
README.md CHANGED
@@ -1,3 +1,17 @@
  ---
  license: apache-2.0
  ---
+ datasets:
+ - scientific_papers
+ metrics:
+ - bertscore
+ - rouge
+ tags:
+ - text-generation-inference
+ - rlhf
+ - PPO
+ language:
+ - en
+ ---
+
+ This model is fine-tuned with NLPO, a PPO-based RL algorithm, on the ccdv/arxiv-summarization dataset. The base model is flan-t5-base.