Update README.md
README.md
CHANGED
@@ -73,7 +73,7 @@ We train on 2400 samples consisting of CovidQA, PubmedQA, DROP and RAGTruth samp
 
 ## Evaluation
 
-The model was evaluated on [PatronusAI/halubench](https://huggingface.co/datasets/PatronusAI/
+The model was evaluated on [PatronusAI/halubench](https://huggingface.co/datasets/PatronusAI/HaluBench).
 
 It outperforms GPT-3.5-Turbo, GPT-4-Turbo, GPT-4o and Claude Sonnet.
 