SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language Identification Paper • 2004.14454 • Published Apr 29, 2020
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020) Paper • 2006.07235 • Published Jun 12, 2020
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark Paper • 2306.02349 • Published Jun 4, 2023
From Internal Conflict to Contextual Adaptation of Language Models Paper • 2407.17023 • Published Jul 24, 2024
A Reality Check on Context Utilisation for Retrieval-Augmented Generation Paper • 2412.17031 • Published 12 days ago • 1
Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan Arabic Dialect Paper • 2409.17912 • Published Sep 26, 2024 • 23
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection Paper • 2408.04284 • Published Aug 8, 2024 • 22
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models Paper • 2308.16149 • Published Aug 30, 2023 • 26