wonhosong commited on
Commit
e2333b3
1 Parent(s): 1220a69

Update src/assets/text_content.py

Browse files
Files changed (1) hide show
  1. src/assets/text_content.py +1 -0
src/assets/text_content.py CHANGED
@@ -39,6 +39,7 @@ We have set up a benchmark using datasets translated into Korean, and applied va
39
  - Ko-Arc (provided by __[Upstage](https://www.upstage.ai/)__, human translation and variation)
40
  - Ko-Truthful QA (provided by __[Upstage](https://www.upstage.ai/)__, human translation and variation)
41
  - Ko-CommonGen V2 (provided by __[Korea University NLP&AI Lab](http://nlp.korea.ac.kr/)__, created from scratch)
 
42
  To provide an evaluation befitting the LLM era, we've selected benchmark datasets suitable for assessing these elements: expertise, inference, hallucination, and common sense. The final score is converted to the average score from each evaluation datasets.
43
 
44
  GPUs are provided by __[KT](https://cloud.kt.com/)__ for the evaluations.
 
39
  - Ko-Arc (provided by __[Upstage](https://www.upstage.ai/)__, human translation and variation)
40
  - Ko-Truthful QA (provided by __[Upstage](https://www.upstage.ai/)__, human translation and variation)
41
  - Ko-CommonGen V2 (provided by __[Korea University NLP&AI Lab](http://nlp.korea.ac.kr/)__, created from scratch)
42
+
43
  To provide an evaluation befitting the LLM era, we've selected benchmark datasets suitable for assessing these elements: expertise, inference, hallucination, and common sense. The final score is converted to the average score from each evaluation datasets.
44
 
45
  GPUs are provided by __[KT](https://cloud.kt.com/)__ for the evaluations.