ryota39 committed on
Commit
e7c4964
1 Parent(s): bd7427e

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -21,6 +21,8 @@ should probably proofread and complete it, then remove this comment. -->
  - the probability (logits after passing softmax function) in last layer of this model can be used to quantify the preference from user input
  - fine-tuned [studio-ousia/mluke-large-lite](https://huggingface.co/studio-ousia/mluke-large-lite) via full parameter tuning using [open-preference-v0.3](https://huggingface.co/datasets/ryota39/open_preference-v0.3)
  - trained on bf16 format
+ - Label 0 stands for rejected sentence
+ - Label 1 stands for chosen sentence
 
  ## Metric
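
The bullets in this hunk describe how to read the model's output: apply softmax to the last-layer logits, with label 0 meaning rejected and label 1 meaning chosen. A minimal sketch of that reading follows. It assumes the classifier emits exactly two logits ordered `[rejected, chosen]`. The helper names `softmax` and `preference_score` are hypothetical and not part of the model's repository.

```python
import math
from typing import List, Sequence

# Label indices as described in the README hunk.
REJECTED, CHOSEN = 0, 1


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a 1-D sequence of logits."""
    peak = max(logits)  # subtract the max to avoid overflow in exp()
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def preference_score(logits: Sequence[float]) -> float:
    """Return the probability assigned to the 'chosen' label (index 1).

    Assumption: two logits ordered [rejected, chosen]. This ordering is
    inferred from the label description and is not verified against the model.
    """
    if len(logits) != 2:
        raise ValueError("expected exactly two logits: [rejected, chosen]")
    return softmax(logits)[CHOSEN]
```

With equal logits, `preference_score([0.0, 0.0])` evaluates to 0.5. A larger second logit pushes the score toward 1, meaning the input looks more like a chosen sentence.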