Commit bca831b

sgillen and araffin authored
Add MuJoCo Robotics Envs HER+TQC trained agents (#71)
* Added HER+TQC robotics benchmarks + update FetchSlide hyperparams to add more time
* add in HER+TQC Fetch env logs

Co-authored-by: Antonin RAFFIN <[email protected]>
1 parent ad5f4ec commit bca831b

6 files changed, +12013 -2 lines changed

benchmark.md

+4 -1
@@ -62,6 +62,10 @@ and also allow users to have access to pretrained agents.*
 |dqn |RoadRunnerNoFrameskip-v4 | 40396.350| 7069.131|10M | 603257| 137|
 |dqn |SeaquestNoFrameskip-v4 | 2000.290| 606.644|10M | 599505| 69|
 |dqn |SpaceInvadersNoFrameskip-v4| 622.742| 201.564|10M | 604311| 155|
+|her |FetchPickAndPlace-v1 | -8.921| 6.509|1M | 150000| 3000|
+|her |FetchPush-v1 | -10.526| 8.916|1M | 150000| 3000|
+|her |FetchReach-v1 | -1.677| 1.069|20k | 150000| 3000|
+|her |FetchSlide-v1 | -23.162| 10.625|2M | 150000| 3000|
 |her |parking-v0 | -6.970| 2.970|200k | 149980| 7106|
 |ppo |Acrobot-v1 | -73.506| 18.201|1M | 149979| 2013|
 |ppo |AntBulletEnv-v0 | 2865.922| 56.468|2M | 150000| 150|
@@ -91,7 +95,6 @@ and also allow users to have access to pretrained agents.*
 |qrdqn|BeamRiderNoFrameskip-v4 | 17122.941| 10769.997|10M | 596483| 17|
 |qrdqn|BreakoutNoFrameskip-v4 | 393.600| 79.828|10M | 579711| 40|
 |qrdqn|CartPole-v1 | 500.000| 0.000|50k | 150000| 300|
-|qrdqn|EnduroNoFrameskip-v4 | 3231.200| 1311.801|10M | 585728| 5|
 |qrdqn|LunarLander-v2 | 70.236| 225.491|100k | 149957| 522|
 |qrdqn|MountainCar-v0 | -106.042| 15.536|120k | 149943| 1414|
 |qrdqn|PongNoFrameskip-v4 | 20.492| 0.687|10M | 597443| 63|
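
The four `her` rows added to benchmark.md can be sanity-checked with a small script. The sketch below is illustrative only: the table header lies outside this hunk, so the column names (`algo`, `env_id`, `mean_reward`, `std_reward`, `n_timesteps`, `eval_timesteps`, `eval_episodes`) are assumptions, and `BenchmarkRow` and `parse_row` are hypothetical helpers, not part of the repository.

```python
from typing import NamedTuple


class BenchmarkRow(NamedTuple):
    # Column names are assumed; the table header is not shown in the diff.
    algo: str
    env_id: str
    mean_reward: float
    std_reward: float
    n_timesteps: str  # training budget label such as "1M" or "20k"
    eval_timesteps: int
    eval_episodes: int


def parse_row(line: str) -> BenchmarkRow:
    """Split one pipe-delimited table row into typed fields."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    algo, env_id, mean, std, budget, eval_steps, eval_eps = cells
    return BenchmarkRow(
        algo, env_id, float(mean), float(std), budget,
        int(eval_steps), int(eval_eps),
    )


# The four rows added by this commit, copied from the hunk above.
ADDED_ROWS = [
    "|her |FetchPickAndPlace-v1 | -8.921| 6.509|1M | 150000| 3000|",
    "|her |FetchPush-v1 | -10.526| 8.916|1M | 150000| 3000|",
    "|her |FetchReach-v1 | -1.677| 1.069|20k | 150000| 3000|",
    "|her |FetchSlide-v1 | -23.162| 10.625|2M | 150000| 3000|",
]

rows = [parse_row(line) for line in ADDED_ROWS]

for row in rows:
    # Evaluation timesteps divided by evaluation episodes gives the
    # average episode length during evaluation.
    steps_per_episode = row.eval_timesteps / row.eval_episodes
    print(
        f"{row.env_id}: {row.mean_reward:.3f} +/- {row.std_reward:.3f} "
        f"({steps_per_episode:.0f} steps per evaluation episode)"
    )
```

Under the assumed column meanings, every added row works out to 150000 / 3000 = 50 steps per evaluation episode.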

hyperparams/her.yml

+1 -1
@@ -52,7 +52,7 @@ FetchPush-v1:
 FetchSlide-v1:
   env_wrapper:
     - sb3_contrib.common.wrappers.TimeFeatureWrapper
-  n_timesteps: !!float 1e6
+  n_timesteps: !!float 2.5e6
   policy: 'MlpPolicy'
   model_class: 'tqc'
   n_sampled_goal: 4
