If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs Paper • 2412.04144 • Published Dec 5, 2024
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published Nov 21, 2024
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published Oct 21, 2024
Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection Paper • 2004.07667 • Published Apr 16, 2020
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark Paper • 2311.09122 • Published Nov 15, 2023
Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models Paper • 2401.10440 • Published Jan 19, 2024
MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling Paper • 2403.10691 • Published Mar 15, 2024
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models Paper • 2301.10472 • Published Jan 25, 2023
MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization Paper • 2407.08818 • Published Jul 11, 2024
Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models Paper • 2408.06518 • Published Aug 12, 2024
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore Paper • 2407.12854 • Published Jul 9, 2024
TRAM: Bridging Trust Regions and Sharpness Aware Minimization Paper • 2310.03646 • Published Oct 5, 2023
Optimal Transport Posterior Alignment for Cross-lingual Semantic Parsing Paper • 2307.04096 • Published Jul 9, 2023
Gemma: Open Models Based on Gemini Research and Technology Paper • 2403.08295 • Published Mar 13, 2024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Paper • 2403.05530 • Published Mar 8, 2024
Fine-grained Hallucination Detection and Editing for Language Models Paper • 2401.06855 • Published Jan 12, 2024
When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories Paper • 2212.10511 • Published Dec 20, 2022