llm-jp
/

llm-jp-13b-v2.0

Text Generation

text-generation-inference

Model card Files Files and versions Community

hkiyomaru commited on Apr 30

Commit

37309df

•

1 Parent(s): 1710951

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -136,6 +136,9 @@ The models have been fine-tuned on the following datasets.
 You can view the evaluation results of several LLMs on this [leaderboard](http://wandb.me/llm-jp-leaderboard). We used [llm-jp-eval](https://github.com/llm-jp/llm-jp-eval) (v1.3.0) for the evaluation.
 ## Risks and Limitations
 The models released here are still in the early stages of our research and development and have not been tuned to ensure outputs align with human intent and safety considerations.

 You can view the evaluation results of several LLMs on this [leaderboard](http://wandb.me/llm-jp-leaderboard). We used [llm-jp-eval](https://github.com/llm-jp/llm-jp-eval) (v1.3.0) for the evaluation.
+Besides, we used LLM-as-a-judge frameworks, [Japanese Vicuna QA Benchmark](https://github.com/ku-nlp/ja-vicuna-qa-benchmark/) and [Japanese MT Bench](https://github.com/Stability-AI/FastChat/tree/jp-stable/fastchat/llm_judge), for evaluation.
+For details, please refer to [our technical blog](https://llm-jp.nii.ac.jp/blog/2024/04/30/v2.0-release.html) (in Japanese).
 ## Risks and Limitations
 The models released here are still in the early stages of our research and development and have not been tuned to ensure outputs align with human intent and safety considerations.