Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2407.08250

Gradient Boosting Reinforcement Learning

Paper • 2407.08250 • Published Jul 11 • 10

Reinforcement Learning

Gradient Boosting Reinforcement Learning

Paper • 2407.08250 • Published Jul 11 • 10

Papers - XGBoost

Gradient Boosting Reinforcement Learning

Paper • 2407.08250 • Published Jul 11 • 10

Papers - RL - Structured Data - Gradient Boosting

Gradient Boosting Reinforcement Learning

Paper • 2407.08250 • Published Jul 11 • 10

Papers - RL - GBT vs GBRL vs XGBoost

Gradient Boosting Reinforcement Learning

Paper • 2407.08250 • Published Jul 11 • 10

Papers - RL - Actor-Critic

Gradient Boosting Reinforcement Learning

Paper • 2407.08250 • Published Jul 11 • 10

Papers - Markov Decision Process

Gradient Boosting Reinforcement Learning

Paper • 2407.08250 • Published Jul 11 • 10

Papers - RL - Gradient-Boosting

Gradient Boosting Reinforcement Learning

Paper • 2407.08250 • Published Jul 11 • 10

Reinforcement Learning (RL / RLHF)

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13 • 67
Understanding and Diagnosing Deep Reinforcement Learning

Paper • 2406.16979 • Published Jun 23 • 9
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4 • 60
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Paper • 2407.00617 • Published Jun 30 • 7

Papers - Nvidia

LITA: Language Instructed Temporal-Localization Assistant

Paper • 2403.19046 • Published Mar 27 • 18
Snap-it, Tap-it, Splat-it: Tactile-Informed 3D Gaussian Splatting for Reconstructing Challenging Surfaces

Paper • 2403.20275 • Published Mar 29 • 8
Condition-Aware Neural Network for Controlled Image Generation

Paper • 2404.01143 • Published Apr 1 • 11
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4 • 24

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs