yunfanj
commited on
Commit
•
f949589
0
Parent(s):
update
Browse files- .gitattributes +35 -0
- LICENSE +51 -0
- README.md +71 -0
- residual/insert.pth +3 -0
- residual/lift_leaned_leg.pth +3 -0
- residual/reach_and_grasp.pth +3 -0
- residual/screw.pth +3 -0
- residual/stabilize.pth +3 -0
- rl/insert_full.pth +3 -0
- rl/insert_single.pth +3 -0
- rl/lift_leaned_leg.pth +3 -0
- rl/one_leg_grasp.pth +3 -0
- rl/reach_and_grasp_full.pth +3 -0
- rl/reach_and_grasp_single.pth +3 -0
- rl/screw.pth +3 -0
- rl/stabilize.pth +3 -0
- student/insert.pth +3 -0
- student/lift_leaned_leg.pth +3 -0
- student/reach_and_grasp.pth +3 -0
- student/screw.pth +3 -0
- student/stabilize.pth +3 -0
.gitattributes
ADDED
@@ -0,0 +1,35 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
*.7z filter=lfs diff=lfs merge=lfs -text
|
2 |
+
*.arrow filter=lfs diff=lfs merge=lfs -text
|
3 |
+
*.bin filter=lfs diff=lfs merge=lfs -text
|
4 |
+
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
5 |
+
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
6 |
+
*.ftz filter=lfs diff=lfs merge=lfs -text
|
7 |
+
*.gz filter=lfs diff=lfs merge=lfs -text
|
8 |
+
*.h5 filter=lfs diff=lfs merge=lfs -text
|
9 |
+
*.joblib filter=lfs diff=lfs merge=lfs -text
|
10 |
+
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
11 |
+
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
12 |
+
*.model filter=lfs diff=lfs merge=lfs -text
|
13 |
+
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
14 |
+
*.npy filter=lfs diff=lfs merge=lfs -text
|
15 |
+
*.npz filter=lfs diff=lfs merge=lfs -text
|
16 |
+
*.onnx filter=lfs diff=lfs merge=lfs -text
|
17 |
+
*.ot filter=lfs diff=lfs merge=lfs -text
|
18 |
+
*.parquet filter=lfs diff=lfs merge=lfs -text
|
19 |
+
*.pb filter=lfs diff=lfs merge=lfs -text
|
20 |
+
*.pickle filter=lfs diff=lfs merge=lfs -text
|
21 |
+
*.pkl filter=lfs diff=lfs merge=lfs -text
|
22 |
+
*.pt filter=lfs diff=lfs merge=lfs -text
|
23 |
+
*.pth filter=lfs diff=lfs merge=lfs -text
|
24 |
+
*.rar filter=lfs diff=lfs merge=lfs -text
|
25 |
+
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
26 |
+
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
27 |
+
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
28 |
+
*.tar filter=lfs diff=lfs merge=lfs -text
|
29 |
+
*.tflite filter=lfs diff=lfs merge=lfs -text
|
30 |
+
*.tgz filter=lfs diff=lfs merge=lfs -text
|
31 |
+
*.wasm filter=lfs diff=lfs merge=lfs -text
|
32 |
+
*.xz filter=lfs diff=lfs merge=lfs -text
|
33 |
+
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
+
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
+
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
LICENSE
ADDED
@@ -0,0 +1,51 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
title: MIT License
|
3 |
+
spdx-id: MIT
|
4 |
+
featured: true
|
5 |
+
hidden: false
|
6 |
+
|
7 |
+
description: A short and simple permissive license with conditions only requiring preservation of copyright and license notices. Licensed works, modifications, and larger works may be distributed under different terms and without source code.
|
8 |
+
|
9 |
+
how: Create a text file (typically named LICENSE or LICENSE.txt) in the root of your source code and copy the text of the license into the file. Replace [year] with the current year and [fullname] with the name (or names) of the copyright holders.
|
10 |
+
|
11 |
+
using:
|
12 |
+
Babel: https://github.com/babel/babel/blob/master/LICENSE
|
13 |
+
.NET: https://github.com/dotnet/runtime/blob/main/LICENSE.TXT
|
14 |
+
Rails: https://github.com/rails/rails/blob/master/MIT-LICENSE
|
15 |
+
|
16 |
+
permissions:
|
17 |
+
- commercial-use
|
18 |
+
- modifications
|
19 |
+
- distribution
|
20 |
+
- private-use
|
21 |
+
|
22 |
+
conditions:
|
23 |
+
- include-copyright
|
24 |
+
|
25 |
+
limitations:
|
26 |
+
- liability
|
27 |
+
- warranty
|
28 |
+
|
29 |
+
---
|
30 |
+
|
31 |
+
MIT License
|
32 |
+
|
33 |
+
Copyright (c) 2024 Yunfan Jiang
|
34 |
+
|
35 |
+
Permission is hereby granted, free of charge, to any person obtaining a copy
|
36 |
+
of this software and associated documentation files (the "Software"), to deal
|
37 |
+
in the Software without restriction, including without limitation the rights
|
38 |
+
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
39 |
+
copies of the Software, and to permit persons to whom the Software is
|
40 |
+
furnished to do so, subject to the following conditions:
|
41 |
+
|
42 |
+
The above copyright notice and this permission notice shall be included in all
|
43 |
+
copies or substantial portions of the Software.
|
44 |
+
|
45 |
+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
46 |
+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
47 |
+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
48 |
+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
49 |
+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
50 |
+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
51 |
+
SOFTWARE.
|
README.md
ADDED
@@ -0,0 +1,71 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: mit
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
pipeline_tag: robotics
|
6 |
+
---
|
7 |
+
# Model Card for TRANSIC Policies
|
8 |
+
|
9 |
+
<!-- Provide a quick summary of what the model is/does. -->
|
10 |
+
|
11 |
+
This modelcard is accompanied with the [paper](https://arxiv.org/abs/2405.10315) titled TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction.
|
12 |
+
It includes robot policies trained in the simulation and transferred to the real world for complex and contact-rich manipulation tasks.
|
13 |
+
|
14 |
+
## Model Details
|
15 |
+
|
16 |
+
### Model Description
|
17 |
+
|
18 |
+
<!-- Provide a longer summary of what this model is. -->
|
19 |
+
This model repository includes three parts, 1) teacher policies trained in simulation with reinforcement learning; 2) student policies distilled from successful trajectories generated by teacher policies; and 3) residual policies learned in the real world to augment simulation policies.
|
20 |
+
|
21 |
+
The first part can be found in the `rl` directory. We provide RL teacher policies for 8 different tasks. The second part can be found in the `student` directory. We provide 5 student policies corresponding to 5 skills used in assembling the square table from [FurnitureBench](https://github.com/clvrai/furniture-bench). The third part can be found in the `residual` directory. They augment those 5 simulation base policies (student policies).
|
22 |
+
|
23 |
+
|
24 |
+
- **Developed by:** [Yunfan Jiang](https://yunfanj.com/)
|
25 |
+
- **Model type:** [PyTorch Checkpoints](https://pytorch.org/tutorials/recipes/recipes/saving_and_loading_a_general_checkpoint.html)
|
26 |
+
- **License:** [MIT](LICENSE)
|
27 |
+
|
28 |
+
### Model Sources
|
29 |
+
|
30 |
+
- **Repositories:** [TRANSIC](https://github.com/transic-robot/transic), [TRANSIC-Envs](https://github.com/transic-robot/transic-envs)
|
31 |
+
- **Paper:** [TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction](https://arxiv.org/abs/2405.10315)
|
32 |
+
- **Demo:** Provided on Our [Website](https://transic-robot.github.io/)
|
33 |
+
|
34 |
+
## Uses & How to Get Started with the Model
|
35 |
+
|
36 |
+
Please see our [codebase](https://github.com/transic-robot/transic) for detailed usage.
|
37 |
+
|
38 |
+
## Training Details
|
39 |
+
|
40 |
+
### Training Data
|
41 |
+
|
42 |
+
We provide training data in our [🤗Hugging Face data repository](https://huggingface.co/datasets/transic-robot/data).
|
43 |
+
|
44 |
+
### Training Procedure
|
45 |
+
|
46 |
+
1. Teacher policies are first trained with reinforcement learning from scratch in simulation;
|
47 |
+
2. We then rollout teacher policies to generate successful trajectories;
|
48 |
+
3. These generated data are used to train student policies through behavior cloning. Student policies take point-cloud and proprioceptive observations and output joint actions.
|
49 |
+
4. We then deploy student policies on the real robot. A human operator monitors the execution, intervenes when necessary, and provides online correction through teleoperation. Such teleoperation data are collected.
|
50 |
+
5. We use collected correction data to learn residual policies, which then augment simulation policies for successful sim-to-real transfer.
|
51 |
+
|
52 |
+
## Evaluation
|
53 |
+
|
54 |
+
Policies are evaluated in simulation and the real world. We use task success rate as the metric.
|
55 |
+
|
56 |
+
## Citation
|
57 |
+
|
58 |
+
**BibTeX:**
|
59 |
+
|
60 |
+
```
|
61 |
+
@article{jiang2024transic,
|
62 |
+
title = {TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction},
|
63 |
+
author = {Yunfan Jiang and Chen Wang and Ruohan Zhang and Jiajun Wu and Li Fei-Fei},
|
64 |
+
year = {2024},
|
65 |
+
journal = {arXiv preprint arXiv: Arxiv-2405.10315}
|
66 |
+
}
|
67 |
+
```
|
68 |
+
|
69 |
+
## Model Card Contact
|
70 |
+
|
71 |
+
[Yunfan Jiang](https://yunfanj.com/), email: `yunfanj[at]cs[dot]stanford[dot]edu`
|
residual/insert.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f4e3a91843429063ca73174460cdbd1d3747836e92426997f9126f2247cc3cd5
|
3 |
+
size 5958913
|
residual/lift_leaned_leg.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f6760ced28c86b54a0c5d064b29f4fea62560402a53b3811bb742967090be8bb
|
3 |
+
size 10302447
|
residual/reach_and_grasp.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2457159aec58a319b29ad2e9203489b72533371ce945b2c2ef87d670848e77e5
|
3 |
+
size 10300856
|
residual/screw.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:cdf6af4b51c6bbd53bb40e2293bee0732bb055a5834d4794fec3222af6759ff8
|
3 |
+
size 10302511
|
residual/stabilize.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c6a3b581f29366f77ac00dcab6d5733dd998050d43adf8429bdf9b56bdbed991
|
3 |
+
size 6096062
|
rl/insert_full.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a8cd5f2ca60e23281d53d9d4dd025695b00705e017a1808042f7e6994bb9c479
|
3 |
+
size 2596017
|
rl/insert_single.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5629bf09973db5ef20e10010333c0f893d8e2fc8f291344da7720d8d69d3b2d1
|
3 |
+
size 2429723
|
rl/lift_leaned_leg.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e4102726c52187de8df2655aab607aa45f128664ccb18829190fcf6dd89a9078
|
3 |
+
size 2614080
|
rl/one_leg_grasp.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6f358e7d76a9122e6e5b53fe2554c092b4f0d4a5867d43a969a757e76de63835
|
3 |
+
size 2374100
|
rl/reach_and_grasp_full.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0592f67c122e3a883db02cab924d6c59adebddd1921adc351b57651dfc12c169
|
3 |
+
size 2373610
|
rl/reach_and_grasp_single.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6f358e7d76a9122e6e5b53fe2554c092b4f0d4a5867d43a969a757e76de63835
|
3 |
+
size 2374100
|
rl/screw.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:651cdaab6a3215ce73173bf928c54b2f9172193a320bca510296666fee84a75a
|
3 |
+
size 2423437
|
rl/stabilize.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d422517857f064e19d66c8e6d149bcaf5130f07823c4eb335146230c5770e666
|
3 |
+
size 2663843
|
student/insert.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5e988d50948e3d9fe86e7f11f97aa260cc181c13959f7870486b5559f09da759
|
3 |
+
size 10629471
|
student/lift_leaned_leg.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f892954c5d2e5b573ad506538527ac96539a852bb1808bc9e04ab5d4a22c39b4
|
3 |
+
size 10629407
|
student/reach_and_grasp.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c278b9746c3dba753a72a8403ce79aeb1d632e044f18add77c065873900b23cd
|
3 |
+
size 10629407
|
student/screw.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:01f59765b01cafdc3192245435b4140ad336a21b4019b1584616142bfbc2d556
|
3 |
+
size 61067373
|
student/stabilize.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:fe767c5028ada8f1a27b028c572544123fa10c942aacd0d02e3e42f9bfce4e41
|
3 |
+
size 61066035
|