sugarfreez commited on
Commit
4d5391a
1 Parent(s): 8f8a2a0

style(nyz): add naive model zoo table

Browse files
Files changed (1) hide show
  1. README.md +27 -0
README.md CHANGED
@@ -20,3 +20,30 @@ As an important part of OpenXLab from Shanghai AI Laboratory, OpenDILab features
20
  OpenDILab contributes to the integration of the latest and most comprehensive achievements in academia as well as the standardization of complex problems in the industry. Our future vision is to promote the development of AI **from perceptual intelligence to decision intelligence,** taking AI technology to a higher level of the general intelligence era.
21
 
22
  If you want to contact us & join us, you can ✉️ to our team : <opendilab@pjlab.org.cn>.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
20
  OpenDILab contributes to the integration of the latest and most comprehensive achievements in academia as well as the standardization of complex problems in the industry. Our future vision is to promote the development of AI **from perceptual intelligence to decision intelligence,** taking AI technology to a higher level of the general intelligence era.
21
 
22
  If you want to contact us & join us, you can ✉️ to our team : <opendilab@pjlab.org.cn>.
23
+
24
+
25
+ # Overview of Model Zoo
26
+
27
+ ## Deep Reinforcement Learning
28
+
29
+ | Algo.\Env. | LunarLander | BipedalWalker | Pendulum | Atari (Pong) | Atari (SpaceInvaders) | Atari (Qbert) | MuJoCo (Hopper) | MuJoCo (Halfcheetah) | MuJoCo (Walker2d) |
30
+ | ------------- | ------------- | ------------------------ | ------------ | -------------- | ------------ | ------------------ | --------- | --------- | --------- |
31
+ | [PPO](https://arxiv.org/abs/1707.06347) | [Model](https://huggingface.co/OpenDILabCommunity/LunarLander-v2-ppo) | | | | | | | | |
32
+
33
+ ## Multi-Agent Reinforcement Learning
34
+ <details close>
35
+ <summary>(Click for Details)</summary>
36
+ TBD
37
+ </details>
38
+
39
+ ## Offline Reinforcement Learning
40
+ <details close>
41
+ <summary>(Click for Details)</summary>
42
+ TBD
43
+ </details>
44
+
45
+ ## Model-Based Reinforcement Learning
46
+ <details close>
47
+ <summary>(Click for Details)</summary>
48
+ TBD
49
+ </details>