nicholasKluge commited on
Commit
2a1d63f
·
verified ·
1 Parent(s): 8043acf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -399,7 +399,7 @@ Hence, even though our models are released with a permissive license, we urge us
399
 
400
  ## Evaluations
401
 
402
- To evaluate the `Instruct` versions of our models, we used [AlpacaEval](https://github.com/tatsu-lab/alpaca_eval) 2.0 with length-controlled win rates, a fast and relatively cheap evaluation method that is highly correlated with human preferences. To learn more about our evaluation read [our documentation](https://github.com/Nkluge-correa/Tucano/blob/main/evaluations/README.md).
403
 
404
  | | Avg. Length | Wins | Base Wins | Total Matches | Length-Controlled Win Rate (%) | LC Std. Error |
405
  |-------------------------|-------------|------|-----------|---------------|--------------------------------|---------------|
 
399
 
400
  ## Evaluations
401
 
402
+ To evaluate the `Instruct` versions of our models, we used [AlpacaEval](https://github.com/tatsu-lab/alpaca_eval) 2.0 with length-controlled win rates, a fast and relatively cheap evaluation method that is highly correlated with human preferences and evaluations of pairwise comparisons. To learn more about our evaluation read [our documentation](https://github.com/Nkluge-correa/Tucano/blob/main/evaluations/README.md).
403
 
404
  | | Avg. Length | Wins | Base Wins | Total Matches | Length-Controlled Win Rate (%) | LC Std. Error |
405
  |-------------------------|-------------|------|-----------|---------------|--------------------------------|---------------|