Context Filtering with Reward Modeling in Question Answering Paper • 2412.11707 • Published 24 days ago
Stable Language Model Pre-training by Reducing Embedding Variability Paper • 2409.07787 • Published Sep 12, 2024
Cross-lingual Transfer of Reward Models in Multilingual Alignment Paper • 2410.18027 • Published Oct 23, 2024
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference Paper • 2406.06424 • Published Jun 10, 2024 • 12
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean Paper • 2403.06412 • Published Mar 11, 2024 • 3
Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks Paper • 2402.13482 • Published Feb 21, 2024
Block Transformer: Global-to-Local Language Modeling for Fast Inference Paper • 2406.02657 • Published Jun 4, 2024 • 37
ProbGate at EHRSQL 2024: Enhancing SQL Query Generation Accuracy through Probabilistic Threshold Filtering and Error Handling Paper • 2404.16659 • Published Apr 25, 2024
FEVER: a large-scale dataset for Fact Extraction and VERification Paper • 1803.05355 • Published Mar 14, 2018
FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information Paper • 2106.05707 • Published Jun 10, 2021
Can Large Language Models Infer and Disagree Like Humans? Paper • 2305.13788 • Published May 23, 2023
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12, 2024 • 64
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12, 2024 • 64
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models Paper • 2310.08491 • Published Oct 12, 2023 • 53
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets Paper • 2307.10928 • Published Jul 20, 2023 • 12