--- language: - sl - hr - en license: cc-by-4.0 --- # crosloengual-bert-si-nli CroSloEngual BERT model finetuned on the SI-NLI dataset for Slovene natural language inference. Fine-tuned in a classic sequence pair classification setting on the official training/validation/test split for 10 epochs, using validation set accuracy for model selection. Optimized using the AdamW optimizer (learning rate 2e-5) and cross-entropy loss. Using batch size `82` (selected based on the available GPU memory) and maximum sequence length `107` (99th percentile of the lengths in the training set). Achieves the following metrics: - best validation accuracy: `0.660` - test accuracy = `0.673`