sauc-abadal-lloret/gpt-j-6b-ALT-Quark-tldr

sauc-abadal-lloret committed on Sep 25

Commit 015a166 (1 parent: 56a1d93)

Update README.md

Files changed (1)

README.md +1 -1

@@ -21,7 +21,7 @@ In a nutshell, the Quark method consists on sampling new generations and scoring
 For extensive coverage on Quark, please refer to their paper.
-The reward model used for scoring the generaations can be found in [here](https://huggingface.co/CarperAI/openai_summarize_tldr_rm_checkpoint). We uses K = 5 quantile tokens, which were newly added to the tokenizer:
+The reward model used for scoring the generations can be found in [here](https://huggingface.co/CarperAI/openai_summarize_tldr_rm_checkpoint). We used K = 5 quantile tokens, which were newly added to the tokenizer:
 ```python
 {'_QUANTILE_0_', '_QUANTILE_1_', '_QUANTILE_2_', '_QUANTILE_3_', '_QUANTILE_4_'}
 ```
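
For readers unfamiliar with how reward scores might map onto the K = 5 quantile tokens named in this hunk, here is a minimal sketch. It is not the author's implementation: the function `assign_quantile_tokens`, the rank-based equal-size bucketing, and the convention that `_QUANTILE_0_` marks the highest-reward bucket are all assumptions made for illustration. Only the token names come from the README.

```python
from typing import List

# Token names are taken from the README; everything else here is hypothetical.
QUANTILE_TOKENS = [f"_QUANTILE_{i}_" for i in range(5)]


def assign_quantile_tokens(
    rewards: List[float], tokens: List[str] = QUANTILE_TOKENS
) -> List[str]:
    """Return one quantile token per reward using rank-based, equal-size buckets.

    Assumption: tokens[0] labels the highest-reward bucket.
    """
    n = len(rewards)
    k = len(tokens)
    # Sample indices ordered from highest to lowest reward.
    order = sorted(range(n), key=lambda i: rewards[i], reverse=True)
    assigned = [""] * n
    for rank, idx in enumerate(order):
        # Map the rank to one of k buckets of (roughly) equal size.
        bucket = min(rank * k // n, k - 1)
        assigned[idx] = tokens[bucket]
    return assigned


if __name__ == "__main__":
    scores = [0.1, 0.9, 0.5, 0.3, 0.7, 0.2, 0.8, 0.4, 0.6, 1.0]
    for score, token in zip(scores, assign_quantile_tokens(scores)):
        print(f"{score:.1f} -> {token}")
```

In a Quark-style setup, a token chosen this way would be attached to each sampled generation so that the model can later be conditioned on the token of the desired quantile. The exact placement and ordering used for this model are not shown in this diff.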