Simple Formation
This environment is part of the MPE environments. Please read that page first for general information.
| Import | |
|---|---|
| Actions | Discrete/Continuous |
| Parallel API | Yes |
| Manual Control | No |
| Agents | |
| Action Shape | (5) |
| Action Values | Discrete(5)/Box(0.0, 1.0, (5)) |
| Observation Shape | (6) |
| Observation Values | (-inf, inf) |
N agents must arrange themselves in a circle of radius 0.5 around a central landmark. At each step, the ideal circular positions are anchored to the agent with the smallest angle from the horizontal, and agents are assigned to these positions via bipartite matching (Hungarian algorithm). The shared reward is the negative of the mean distance between agents and their assigned target positions, with the distance clipped to [0, 2], so the reward lies in [-2, 0].
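The reward computation described above can be sketched in a few lines of standard-library Python. This is an illustrative re-implementation, not the environment's actual code: the function name `formation_reward` is hypothetical, the targets are assumed to be evenly spaced around the circle starting at the anchor angle, angles are assumed to be wrapped to [0, 2π), and the clip is assumed to apply to the mean distance. For brevity, the sketch finds the minimum-cost assignment by exhaustive search over permutations rather than the Hungarian algorithm; both give the same optimal matching, but exhaustive search is only practical for small N.

```python
import math
from itertools import permutations


def formation_reward(agent_pos, landmark_pos, radius=0.5, clip_max=2.0):
    """Hypothetical sketch of the shared formation reward for 2D positions."""
    n = len(agent_pos)
    lx, ly = landmark_pos

    # Angle of each agent around the landmark, wrapped to [0, 2*pi)
    # (assumed convention for "angle from the horizontal").
    thetas = [math.atan2(y - ly, x - lx) % (2 * math.pi) for x, y in agent_pos]
    anchor = min(thetas)

    # Assumption: targets are evenly spaced on the circle, starting at the anchor.
    targets = [
        (lx + radius * math.cos(anchor + 2 * math.pi * k / n),
         ly + radius * math.sin(anchor + 2 * math.pi * k / n))
        for k in range(n)
    ]

    # dist[i][j] is the distance from agent i to target j.
    dist = [[math.dist(a, t) for t in targets] for a in agent_pos]

    # Minimum-cost one-to-one assignment by exhaustive search (small N only).
    best_total = min(
        sum(dist[i][perm[i]] for i in range(n))
        for perm in permutations(range(n))
    )

    # Assumption: the clip is applied to the mean distance before negation.
    mean_dist = best_total / n
    return -min(max(mean_dist, 0.0), clip_max)
```

With four agents already sitting on the circle at 90-degree intervals, the sketch returns a reward of approximately zero; agents far from the landmark saturate at -2. For larger N, an assignment solver such as `scipy.optimize.linear_sum_assignment` would be the usual replacement for the permutation search.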
Agent observations: [self_vel, self_pos, landmark_rel_pos]
Agent action space: [no_action, move_left, move_right, move_down, move_up]
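As a reading aid for the two layouts above, the following sketch names the discrete action indices and splits an observation into its components. The names `Action` and `split_observation` are hypothetical helpers, not part of the package, and the sketch assumes each observation component is a 2D vector (consistent with the observation shape of 6) stored in the order listed.

```python
from enum import IntEnum


class Action(IntEnum):
    """Discrete action indices, in the order listed in the documentation."""
    NO_ACTION = 0
    MOVE_LEFT = 1
    MOVE_RIGHT = 2
    MOVE_DOWN = 3
    MOVE_UP = 4


def split_observation(obs):
    """Split a length-6 observation into named 2D components.

    Assumes the layout [self_vel, self_pos, landmark_rel_pos],
    with two values per component.
    """
    if len(obs) != 6:
        raise ValueError(f"expected 6 values, got {len(obs)}")
    return {
        "self_vel": tuple(obs[0:2]),
        "self_pos": tuple(obs[2:4]),
        "landmark_rel_pos": tuple(obs[4:6]),
    }
```

Because `Action` is an `IntEnum`, its members can be passed anywhere a plain integer action is expected.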
Arguments
simple_formation_v1.env(N=4, max_cycles=25, continuous_actions=False, terminate_on_success=False)
N: number of agents
max_cycles: number of frames until the episode terminates
continuous_actions: whether action spaces are discrete (default) or continuous
terminate_on_success: when True, the episode ends as soon as every agent is within 0.05 units of its assigned target position.
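A minimal sketch of how these arguments might be used follows. It assumes the environment follows the PettingZoo AEC interface (`reset`, `agent_iter`, `last`, `step`) and that it can be imported as `from mpe2 import simple_formation_v1`; the import path is an assumption, and `run_random_episode` is a hypothetical helper written for this example.

```python
def run_random_episode(env, seed=0):
    """Step an AEC-style environment with random actions.

    Returns the total reward accumulated by each agent.
    """
    env.reset(seed=seed)
    totals = {}
    for agent in env.agent_iter():
        observation, reward, termination, truncation, info = env.last()
        totals[agent] = totals.get(agent, 0.0) + reward
        # A finished agent must be stepped with None.
        if termination or truncation:
            action = None
        else:
            action = env.action_space(agent).sample()
        env.step(action)
    env.close()
    return totals


def main():
    # Assumption: the package exposes the environment under this name.
    from mpe2 import simple_formation_v1

    env = simple_formation_v1.env(
        N=4,
        max_cycles=25,
        continuous_actions=False,
        terminate_on_success=True,
    )
    print(run_random_episode(env))
```

Calling `main()` runs one episode with random actions. With `terminate_on_success=True` the episode may end before `max_cycles` is reached, although random actions are unlikely to produce the formation.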
API
- class mpe2.simple_formation.simple_formation.env(**kwargs)
- class mpe2.simple_formation.simple_formation.raw_env(N=4, max_cycles=25, continuous_actions=False, render_mode=None, dynamic_rescaling=False, benchmark_data=False, terminate_on_success=False)