PressurePlate is a multi-agent environment that requires agents to cooperate during the traversal of a gridworld.

Related tags

Miscellaneous pressureplate

Overview

Description

PressurePlate is a multi-agent environment that requires agents to cooperate during the traversal of a gridworld. The grid is partitioned into several rooms, and each room contains a plate and a closed doorway. Before episodes begin, each agent is assigned a plate that only they can activate. For the group of agents to proceed into the next room, an agent must remain behind, standing on their assigned plate. The task is considered solved when the goal (depicted with a treasure chest) is reached.

Currently, PressurePlate supports four-, five-, and six-player levels but is easily configurable for custom scenarios. See Customizing Scenarios for more information.

Observation Space

Each agent has a distance-limited view of the environment, as defined by the sensor_range attribute of the PressurePlate class. The PressurePlate world is made of several 2D grids, where each grid corresponds to an entity type. For example, one grid corresponds to walls, one grid corresponds to plates, and so on. When queried, the environment produces a subsection of each grid that corresponds to each agent's viewing range. Next, these subsections are flattened and concatenated together. Finally, the agent's (x,y) coordinates are concatenated to the end of the observation vector.

See the below figure for a depiction of this process for Agent 0 and the Doors grid.

Action Space

PressurePlate's action space is discrete and has five options: up, down, left, right, and no-op (do nothing).

For each call of .step(), the ordering of action-execution is randomized.

Reward Function

Each agent receives rewards independent of other agents. If an agent is in the room that contains their assigned plate, their reward is the negative normalized Manhattan distance between their current position and the plate. Otherwise, their reward is the number of rooms between their current room and the room that contains their assigned plate.

Installation

After cloning the repo, cd into pressureplate and:

pip install -e .

Using PressurePlate

Within your Python script, access the three currently-available tasks as follows:

env = gym.make('pressureplate-linear-4p-v0')
env = gym.make('pressureplate-linear-5p-v0')
env = gym.make('pressureplate-linear-6p-v0')

The PressurePlate environment is implemented within the Gym paradigm, and therefore uses the usual .step(), .reset(), and .render() methods.

Customizing Scenarios

To create a custom PressurePlate layout, you can add a layout dictionary to the pressureplate/assets.py file. The dictionary must contain lists of (x,y) coordinates of the following elements:

A unique identifier (e.g., 'FOUR_PLAYERS')
'WALLS'
'DOORS'
'PLATES'
'AGENTS'
'GOAL'

Additionally, you will need to register the new task as a gym environment within pressureplate/__init__.py. Finally, edit the PressurePlate class with pressureplate/environment.py to load your custom layout into the self.layout attribute.

For detailed instructions, please refer to the docstring within pressureplate/assets.py.

You might also like...

Python Interactive Graphical System made during Computer Graphics classes (INE5420-2021.1)

Comments

Question: Why does the observation exclude the wall grid?

Hi, thank you for this environment. I'm curious why the observation space excludes the wall grid in self._get_obs() in environment.py. Is an agent's knowledge of walls not necessary to solving this environment? Or am I missing something? Thank you in advance!

opened by jetnew 0

PressurePlate is a multi-agent environment that requires agents to cooperate during the traversal of a gridworld.

Related tags

Overview

Description

Observation Space

Action Space

Reward Function

Installation

Using PressurePlate

Customizing Scenarios

You might also like...

Python Interactive Graphical System made during Computer Graphics classes (INE5420-2021.1)

On this repo, you'll find every codes I made during my NSI classes (informatical courses)

All exercises done during the Python 3 course in the Video Course (World 1, 2 and 3)

A software dedicated to automaticaly select the agent of your desire in Valorant

The Python agent for Apache SkyWalking

The learning agent learns firstly approaching to the football and then kicking the football to the target position

RELATE is an Environment for Learning And TEaching

Transparently load variables from environment or JSON/YAML file.

AndroidEnv is a Python library that exposes an Android device as a Reinforcement Learning (RL) environment.

Comments

Question: Why does the observation exclude the wall grid?

Owner

Autonomous Agents Research Group (University of Edinburgh)

A simple wrapper to analyse and visualise reinforcement learning agents' behaviour in the environment.

Werkzeug has a debug console that requires a pin. It's possible to bypass this with an LFI vulnerability or use it as a local privilege escalation vector.

This repository requires you to solve a problem by writing some basic python code.

A community based economy bot with python works only with python 3.7.8 as web3 requires cytoolz

This repository holds those infrastructure-level modules, that every application requires that follows the core 12-factor principles.

C++ Environment InitiatorVisual Studio Code C / C++ Environment Initiator

An evolutionary multi-agent platform based on mesa and NEAT

Prints values and types during compilation!

Schemdule is a tiny tool using script as schema to schedule one day and remind you to do something during a day.

Originally used during Marketplace.tf's open period, this program was used to get the profit of items bought with keys and sold for dollars.