GRS-QA -- Graph Reasoning-Structured Question Answering Dataset Paper • 2411.00369 • Published Nov 1, 2024 • 6
Generative Verifiers: Reward Modeling as Next-Token Prediction Paper • 2408.15240 • Published Aug 27, 2024 • 13 • 2
Recursive Introspection: Teaching Language Model Agents How to Self-Improve Paper • 2407.18219 • Published Jul 25, 2024 • 3
VideoGameBunny: Towards vision assistants for video games Paper • 2407.15295 • Published Jul 21, 2024 • 22
Steering Llama 2 via Contrastive Activation Addition Paper • 2312.06681 • Published Dec 9, 2023 • 11