In deep reinforcement learning, a pruned network is a good network Paper • 2402.12479 • Published Feb 19, 2024 • 18
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL Paper • 2403.03950 • Published Mar 6, 2024 • 13
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework Paper • 2405.11143 • Published May 20, 2024 • 34
Understanding and Diagnosing Deep Reinforcement Learning Paper • 2406.16979 • Published Jun 23, 2024 • 9
Efficient World Models with Context-Aware Tokenization Paper • 2406.19320 • Published Jun 27, 2024 • 7