Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models Paper • 2411.00154 • Published Oct 31, 2024
Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models Paper • 2407.03181 • Published Jul 3, 2024 • 1
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published Jun 22, 2024 • 45
How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field Study Paper • 2309.08316 • Published Sep 15, 2023
Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization Paper • 2402.01375 • Published Feb 2, 2024
Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research Paper • 2306.16900 • Published Jun 29, 2023