Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated Nov 2, 2024 • 160
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? Paper • 2409.07703 • Published Sep 12, 2024 • 66
WPO: Enhancing RLHF with Weighted Preference Optimization Paper • 2406.11827 • Published Jun 17, 2024 • 14
WPO: Enhancing RLHF with Weighted Preference Optimization Paper • 2406.11827 • Published Jun 17, 2024 • 14
InFoBench: Evaluating Instruction Following Ability in Large Language Models Paper • 2401.03601 • Published Jan 7, 2024 • 7
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4 Paper • 2305.14702 • Published May 24, 2023 • 1
OASum: Large-Scale Open Domain Aspect-based Summarization Paper • 2212.09233 • Published Dec 19, 2022 • 2
Scoring Sentence Singletons and Pairs for Abstractive Summarization Paper • 1906.00077 • Published May 31, 2019 • 2
OASum: Large-Scale Open Domain Aspect-based Summarization Paper • 2212.09233 • Published Dec 19, 2022 • 2
Scoring Sentence Singletons and Pairs for Abstractive Summarization Paper • 1906.00077 • Published May 31, 2019 • 2
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4 Paper • 2305.14702 • Published May 24, 2023 • 1
InFoBench: Evaluating Instruction Following Ability in Large Language Models Paper • 2401.03601 • Published Jan 7, 2024 • 7