Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model Paper • 2310.09520 • Published Oct 14, 2023 • 10
Improving Large Language Model Fine-tuning for Solving Math Problems Paper • 2310.10047 • Published Oct 16, 2023 • 5
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing Paper • 2311.00571 • Published Nov 1, 2023 • 40
SmartPlay: A Benchmark for LLMs as Intelligent Agents Paper • 2310.01557 • Published Oct 2, 2023 • 12
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 602