arxiv:2401.14858

RESPRECT: Speeding-up Multi-fingered Grasping with Residual Reinforcement Learning

Published on Jan 26, 2024

Authors:

Federico Ceola ,

Abstract

<PRE_TAG>Deep Reinforcement Learning (DRL)</POST_TAG> has proven effective in learning control policies using robotic grippers, but much less practical for solving the problem of grasping with dexterous hands -- especially on real robotic platforms -- due to the high dimensionality of the problem. In this work, we focus on the <PRE_TAG>multi-fingered grasping</POST_TAG> task with the <PRE_TAG>anthropomorphic hand</POST_TAG> of the <PRE_TAG>iCub humanoid</POST_TAG>. We propose the RESidual learning with PREtrained CriTics (RESPRECT) method that, starting from a policy pre-trained on a large set of objects, can learn a <PRE_TAG>residual policy</POST_TAG> to grasp a novel object in a fraction (sim 5 times faster) of the timesteps required to train a policy from scratch, without requiring any task demonstration. To our knowledge, this is the first Residual Reinforcement Learning (RRL) approach that learns a residual policy on top of another policy <PRE_TAG>pre-trained with DRL</POST_TAG>. We exploit some components of the <PRE_TAG>pre-trained policy</POST_TAG> during residual learning that further speed-up the training. We benchmark our results in the iCub simulated environment, and we show that RESPRECT can be effectively used to learn a <PRE_TAG>multi-fingered grasping</POST_TAG> policy on the <PRE_TAG>real iCub robot</POST_TAG>. The code to reproduce the experiments is released together with the paper with an open source license.

View arXiv page View PDF Add to collection

Community

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2401.14858 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2401.14858 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2401.14858 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.