Spaces:
Sleeping
Sleeping
Update data/eval_board.csv
Browse files- data/eval_board.csv +4 -4
data/eval_board.csv
CHANGED
@@ -1,11 +1,11 @@
|
|
1 |
Models,Re2Text-Easy,Text2Re-Easy,Re2Text-Hard,Text2Re-Hard,Avg,Model Size,Links
|
2 |
-
|
3 |
-
gpt-3.5-turbo-0301,83.5,60.7,59.0,39.0,60.6,unknown,https://chat.openai.com/
|
4 |
text-davinci-003,85.4,83.8,55.8,34.8,65.0,175B,https://platform.openai.com/docs/models/gpt-3-5
|
|
|
5 |
claude-instant-1.1,65.7,87.2,52.3,26.2,57.9,unknown,https://www.anthropic.com/index/introducing-claude
|
6 |
-
|
7 |
-
flan-t5-xxl,79.4,96.8,20.7,4.8,50.4,11B,https://huggingface.co/google/flan-t5-xxl
|
8 |
flan-t5-xl,91.5,90.6,7.9,17.8,52.0,3B,https://huggingface.co/google/flan-t5-xl
|
9 |
flan-t5-large,71.5,77.3,26.2,29.6,51.2,780M,https://huggingface.co/google/flan-t5-large
|
10 |
flan-t5-base,84.6,51.2,17.0,50.2,50.8,250M,https://huggingface.co/google/flan-t5-base
|
|
|
11 |
flan-t5-small,51.8,50.1,46.5,49.5,49.5,60M,https://huggingface.co/google/flan-t5-small
|
|
|
1 |
Models,Re2Text-Easy,Text2Re-Easy,Re2Text-Hard,Text2Re-Hard,Avg,Model Size,Links
|
2 |
+
claude-1.3,89.7,82.3,37.3,56.6,66.5,unknown,https://www.anthropic.com/index/introducing-claude
|
|
|
3 |
text-davinci-003,85.4,83.8,55.8,34.8,65.0,175B,https://platform.openai.com/docs/models/gpt-3-5
|
4 |
+
gpt-3.5-turbo-0301,83.5,60.7,59.0,39.0,60.6,unknown,https://chat.openai.com/
|
5 |
claude-instant-1.1,65.7,87.2,52.3,26.2,57.9,unknown,https://www.anthropic.com/index/introducing-claude
|
6 |
+
gpt-4-0314,98.7,93.6,16.4,17.1,56.5,unknown,https://openai.com/research/gpt-4
|
|
|
7 |
flan-t5-xl,91.5,90.6,7.9,17.8,52.0,3B,https://huggingface.co/google/flan-t5-xl
|
8 |
flan-t5-large,71.5,77.3,26.2,29.6,51.2,780M,https://huggingface.co/google/flan-t5-large
|
9 |
flan-t5-base,84.6,51.2,17.0,50.2,50.8,250M,https://huggingface.co/google/flan-t5-base
|
10 |
+
flan-t5-xxl,79.4,96.8,20.7,4.8,50.4,11B,https://huggingface.co/google/flan-t5-xxl
|
11 |
flan-t5-small,51.8,50.1,46.5,49.5,49.5,60M,https://huggingface.co/google/flan-t5-small
|