DeepMind Control (DMC)
These are the Environment Wrappers for selected DeepMind Control environments in order to use our Motion Primitive gym interface with them.
Step-Based Environments
When installing fancy_gym with the optional dmc extra (e.g. pip install fancy_gym[dmc]
), all regular dmc tasks are avaible via Shimmy.
Name |
Description |
Action Dim |
Observation Dim |
---|---|---|---|
|
Underactuated double pendulum (Acrobot) with torque applied to the second joint to swing up and balance. |
1 |
6 |
|
Similar to |
1 |
6 |
|
Planar ball-in-cup task where a receptacle must swing to catch a ball. Sparse reward for catching. |
2 |
8 |
|
Cart-pole task where the goal is to balance an unactuated pole by moving a cart, starting near upright. |
1 |
5 |
|
Similar to |
1 |
5 |
|
Cart-pole task with the pole starting downward, requiring swinging up and balancing. |
1 |
5 |
|
Similar to |
1 |
5 |
|
Extension of the Cart-pole domain with two serially connected poles, increasing the balancing challenge. |
1 |
8 |
|
Extension of the Cart-pole domain with three serially connected poles, further increasing the challenge. |
1 |
11 |
|
Planar bipedal cheetah robot tasked with running. The reward is proportional to forward velocity up to a maximum speed. |
6 |
17 |
|
Dog robot task focusing on achieving a standing posture. |
38 |
223 |
|
Dog robot tasked with walking, requiring coordination of movements. |
38 |
223 |
|
Dog robot performing a trotting gait. |
38 |
223 |
|
Dog robot tasked with running, combining speed with stability. |
38 |
223 |
|
Dog robot playing fetch, involving locomotion and object interaction. |
38 |
232 |
|
Finger robot required to rotate an unactuated body on a hinge. |
2 |
9 |
|
Finger robot task to align the tip of a free body with a target, easier version with a larger target. |
2 |
12 |
|
Similar to |
2 |
12 |
|
Fish robot task focused on righting itself in a fluid environment. |
5 |
21 |
|
Fish robot swimming to a target, incorporating fluid dynamics. |
5 |
24 |
|
One-legged hopper robot tasked with achieving a minimal torso height. |
4 |
15 |
|
One-legged hopper robot required to hop, combining height and forward velocity. |
4 |
15 |
|
Simplified humanoid robot maintaining a standing posture. |
21 |
67 |
|
Simplified humanoid robot walking at a specified speed. |
21 |
67 |
|
Simplified humanoid robot running, aiming for a high horizontal speed. |
21 |
67 |
|
Simplified humanoid robot running with a focus on pure state control. |
21 |
55 |
|
Advanced humanoid robot (CMU model) maintaining a standing posture. |
56 |
137 |
|
Advanced humanoid robot (CMU model) walking. |
56 |
137 |
|
Advanced humanoid robot (CMU model) running. |
56 |
137 |
|
Linear quadratic regulator (LQR) task with 2 masses and 1 actuator, focusing on position and control optimization. |
1 |
4 |
|
Linear quadratic regulator (LQR) task with 6 masses and 2 actuators, more complex control optimization challenge. |
2 |
12 |
|
Planar manipulator robot bringing a ball to a target location, with object initialization variations. |
5 |
44 |
|
Planar manipulator task of bringing a peg to a target peg. |
5 |
44 |
|
Planar manipulator task requiring inserting a ball into a basket. |
5 |
44 |
|
Planar manipulator challenge of inserting a peg into a slot. |
5 |
44 |
|
Classic inverted pendulum task with a torque-limited actuator, requiring multiple swings to swing up and balance. |
1 |
3 |
|
Planar point mass in an easy task, with actuators corresponding to global x and y axes |
2 |
4 |
|
Planar point mass in a hard task, with randomized control gains per episode, challenging memoryless agents. |
2 |
4 |
|
Four-legged robot (Quadruped) tasked with walking. |
12 |
78 |
|
Quadruped robot required to run, balancing speed and stability. |
12 |
78 |
|
Quadruped robot in an escape task, combining locomotion with environmental interaction. |
12 |
101 |
|
Quadruped robot playing fetch, involving complex movements and object interaction. |
12 |
90 |
|
Two-link planar reacher with a randomized target, easier version with a larger target sphere. |
2 |
6 |
|
Similar to |
2 |
6 |
|
Stacker task requiring stacking of 2 boxes, with rewards for correct placement and gripper positioning. |
5 |
49 |
|
More complex stacker task with 4 boxes, increasing the stacking challenge. |
5 |
63 |
|
Six-link planar swimmer in fluid dynamics, rewarded for positioning the nose inside a target. |
5 |
25 |
|
Fifteen-link planar swimmer, a more complex version of the swimmer task with extended dynamics. |
14 |
61 |
|
Planar walker robot tasked with maintaining an upright torso and minimal height. |
6 |
24 |
|
Planar walker robot walking task, focusing on forward velocity and stability. |
6 |
24 |
|
Planar walker robot running task, aiming for high speed and balance. |
6 |
24 |
MP Environments
Name |
Description |
Trajectory Horizon |
Action Dimension |
Context Dimension |
---|---|---|---|---|
|
A ProMP wrapped version of the “catch” task for the “ball_in_cup” environment. |
1000 |
10 |
2 |
|
A DMP wrapped version of the “catch” task for the “ball_in_cup” environment. |
1000 |
10 |
2 |
|
A ProMP wrapped version of the “easy” task for the “reacher” environment. |
1000 |
10 |
4 |
|
A DMP wrapped version of the “easy” task for the “reacher” environment. |
1000 |
10 |
4 |
|
A ProMP wrapped version of the “hard” task for the “reacher” environment. |
1000 |
10 |
4 |
|
A DMP wrapped version of the “hard” task for the “reacher” environment. |
1000 |
10 |
4 |