fix: dataset link and name
Browse files- README.md +1 -1
- README_JA.md +1 -1
README.md
CHANGED
@@ -103,7 +103,7 @@ To achieve generic text embedding performance across a wide range of domains, we
|
|
103 |
|
104 |
|dataset|counts|
|
105 |
|:-:|:-:|
|
106 |
-
|[
|
107 |
|web-crawled data (ours)|47,370,649|
|
108 |
|[MQA](https://huggingface.co/datasets/hpprc/mqa-ja)|12,941,472|
|
109 |
|[llm-japanese-dataset](https://huggingface.co/datasets/izumi-lab/llm-japanese-dataset)|9,074,340|
|
|
|
103 |
|
104 |
|dataset|counts|
|
105 |
|:-:|:-:|
|
106 |
+
|[Auto Wiki QA/NLI](https://huggingface.co/datasets/hpprc/emb)|50,521,135|
|
107 |
|web-crawled data (ours)|47,370,649|
|
108 |
|[MQA](https://huggingface.co/datasets/hpprc/mqa-ja)|12,941,472|
|
109 |
|[llm-japanese-dataset](https://huggingface.co/datasets/izumi-lab/llm-japanese-dataset)|9,074,340|
|
README_JA.md
CHANGED
@@ -102,7 +102,7 @@ print(similarities.shape)
|
|
102 |
|
103 |
|dataset|counts|
|
104 |
|:-:|:-:|
|
105 |
-
|[
|
106 |
|web-crawled data (ours)|47,370,649|
|
107 |
|[MQA](https://huggingface.co/datasets/hpprc/mqa-ja)|12,941,472|
|
108 |
|[llm-japanese-dataset](https://huggingface.co/datasets/izumi-lab/llm-japanese-dataset)|9,074,340|
|
|
|
102 |
|
103 |
|dataset|counts|
|
104 |
|:-:|:-:|
|
105 |
+
|[Auto Wiki QA/NLI](https://huggingface.co/datasets/hpprc/emb)|50,521,135|
|
106 |
|web-crawled data (ours)|47,370,649|
|
107 |
|[MQA](https://huggingface.co/datasets/hpprc/mqa-ja)|12,941,472|
|
108 |
|[llm-japanese-dataset](https://huggingface.co/datasets/izumi-lab/llm-japanese-dataset)|9,074,340|
|