Added SNLI score
class Tasks(Enum):
    # task_key in the json file, metric_key in the json file, name to display in the leaderboard
    task0 = Task("custom|
    task1 = Task("custom|
    task2 = Task("custom|
    task3 = Task("custom|
NUM_FEWSHOT = 0  # Change with your few shot
class Tasks(Enum):
    # task_key in the json file, metric_key in the json file, name to display in the leaderboard
    task0 = Task("custom|snli-acc|0", "snli_acc", "SNLI Accuracy")
    task1 = Task("custom|heq-qa-tlnls|0", "heq_tlnls", "QA TLNLS (HeQ)")
    task2 = Task("custom|sentiment-acc|0", "sentiment_acc", "Sentiment Acc (Mafat)")
    task3 = Task("custom|winograd-acc|0", "winograd_acc", "Winograd (Binary) Acc (V. Schwartz)")
    task4 = Task("custom|he-en-trans-bleu|0", "sentence_bleu", "Translation BLEU")
NUM_FEWSHOT = 0  # Change with your few shot
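As a rough illustration of how these entries are meant to be consumed, the sketch below pairs each `Task` with a lookup into a results dictionary. The `Task` field names (`benchmark`, `metric`, `col_name`), the `collect_scores` helper, and the shape of the results payload are assumptions made for this example, inferred from the comment above. They are not taken from the repository.

```python
from dataclasses import dataclass
from enum import Enum


@dataclass(frozen=True)
class Task:
    # Field names are assumed for illustration.
    benchmark: str  # task_key in the results JSON
    metric: str     # metric_key in the results JSON
    col_name: str   # name to display in the leaderboard


class Tasks(Enum):
    # Two of the entries from the diff, enough to show the lookup.
    task0 = Task("custom|snli-acc|0", "snli_acc", "SNLI Accuracy")
    task4 = Task("custom|he-en-trans-bleu|0", "sentence_bleu", "Translation BLEU")


def collect_scores(results: dict) -> dict:
    """Map leaderboard column names to the scores present in `results`.

    `results` is assumed to look like {task_key: {metric_key: value}}.
    Tasks that are missing from the payload are skipped.
    """
    scores = {}
    for task in Tasks:
        entry = results.get(task.value.benchmark, {})
        if task.value.metric in entry:
            scores[task.value.col_name] = entry[task.value.metric]
    return scores


# Hypothetical payload; the number is made up for the example.
example_results = {"custom|snli-acc|0": {"snli_acc": 0.5}}
print(collect_scores(example_results))
```

Under these assumptions, a model with no SNLI entry in its results would simply have no "SNLI Accuracy" value rather than raising an error.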
5. SNLI Accuracy
- **Source**: We took a sample of documents from the test subset of the official SNLI corpus.
- **Scoring**: We compute accuracy over the predictions, expecting each prediction to be one of "סתירה" (contradiction), "התאמה" (entailment), or "כלום" (neutral).
- **Number of examples**: There are 210 examples in total, 70 from each class. Each example was translated using Dicta's translation engine and then manually reviewed and corrected as needed.
- **Few-Shot Format**: For every prompt, we provide 12 few-shot examples, 4 from each category.
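
The prompt layout and scoring rule described in this list could be sketched roughly as follows. This is a minimal sketch, not the leaderboard's actual code: the helper names (`build_prompt`, `is_balanced`, `accuracy`), the newline separator, the trailing answer cue, and exact-match comparison after stripping whitespace are all assumptions. The three labels are the Hebrew strings for contradiction, entailment, and neutral.

```python
from collections import Counter

# Hebrew labels the scorer expects: contradiction, entailment, neutral.
LABELS = ("סתירה", "התאמה", "כלום")


def build_prompt(shots, premise, hypothesis):
    """Render the few-shot examples, then the unanswered query.

    `shots` is a list of (premise, hypothesis, label) tuples. Ending the
    prompt with a bare answer cue is an assumption of this sketch.
    """
    lines = []
    for p, h, label in shots:
        lines += [f"הנחת יסוד: {p}", f"השערה: {h}", f"תשובה: {label}"]
    lines += [f"הנחת יסוד: {premise}", f"השערה: {hypothesis}", "תשובה:"]
    return "\n".join(lines)


def is_balanced(shots, per_label=4):
    """Check that every label appears exactly `per_label` times."""
    counts = Counter(label for _, _, label in shots)
    return len(shots) == per_label * len(LABELS) and all(
        counts[label] == per_label for label in LABELS
    )


def accuracy(predictions, gold):
    """Fraction of predictions that equal the gold label (assumed exact match)."""
    if not gold or len(predictions) != len(gold):
        raise ValueError("predictions and gold must be non-empty and equal in length")
    correct = sum(p.strip() == g for p, g in zip(predictions, gold))
    return correct / len(gold)
```

With 12 shots, `is_balanced` returns `True` only when each of the three labels appears 4 times, matching the few-shot format above.
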
For example:
<blockquote dir="rtl" style='text-align: right; background-color: #f0f0f0'>
הנחת יסוד: נער מנגן בחצוצרתו במהלך הופעה עם להקתו.<br/>
השערה: לאף אחד אין חצוצרה.<br/>
תשובה: סתירה<br/>
הנחת יסוד: הנערה לבושה במעיל חום, בעודה פוסעת בשלג.<br/>
השערה: הגברת הלובשת מעיל מחפשת את כלבה האובד.<br/>
תשובה: כלום<br/>
הנחת יסוד: ספינת־פאר בה אנשים עולים ויורדים.<br/>
השערה: אנשים עולים ויורדים מספינות.<br/>
תשובה: התאמה<br/>
הנחת יסוד: הנחה חדשה<br/>
השערה: השערה חדשה<br/>
