CowHerd is a partially-observed reinforcement learning environment

Danijar Hafner

Last update: Mar 6, 2022

Related tags

Deep Learning cowherd

Overview

CowHerd

CowHerd is a partially-observed reinforcement learning environment, where the player walks around an area and is rewarded for milking cows. The cows try to escape and the player can place fences to help capture them. The implementation of CowHerd is based on the Crafter environment.

Play Yourself

You can play the game yourself with an interactive window and keyboard input. The mapping from keys to actions, health level, and inventory state are printed to the terminal.

# Install with GUI
pip3 install 'cowherd[gui]'

# Start the game
cowherd

# Alternative way to start the game
python3 -m cowherd.run_gui

The following optional command line flags are available:

Flag	Default	Description
`--window`	800 800	Window size in pixels, used as width and height.
`--fps`	5	How many times to update the environment per second.
`--record .mp4`	None	Record a video of the trajectory.
`--num_cows`	3	The number of cows in the environment.
`--view`	7 7	The layout size in cells; determines view distance.
`--length`	None	Time limit for the episode.
`--seed`	None	Determines world generation and creatures.

Training Agents

Installation: pip3 install -U cowherd

The environment follows the OpenAI Gym interface:

import cowherd

env = cowherd.Env(seed=0)
obs = env.reset()
assert obs.shape == (64, 64, 3)

done = False
while not done:
  action = env.action_space.sample()
  obs, reward, done, info = env.step(action)

Environment Details

Reward

A reward of +1 is given every time the player milks one of the cows.

Termination

Episodes terminate after 1000 steps.

Observation Space

Each observation is an RGB image that shows a local view of the world around the player, as well as the inventory state of the agent.

Action Space

The action space is categorical. Each action is an integer index representing one of the possible actions:

Integer	Name	Description
0	`noop`	Do nothing.
1	`move_left`	Walk to the left.
2	`move_right`	Walk to the right.
3	`move_up`	Walk upwards.
4	`move_down`	Walk downwards.
5	`do`	Pick up a placed fence or milk a cow.
6	`place_fence`	Place a fence in front of the player.

Questions

Please open an issue on Github.

The Environment I built to study Reinforcement Learning + Pokemon Showdown

pokemon-showdown-rl-environment The Environment I built to study Reinforcement Learning + Pokemon Showdown Been a while since I ran this. Think it is

3 Jan 16, 2022

Wordle Env: A Daily Word Environment for Reinforcement Learning

Wordle Env: A Daily Word Environment for Reinforcement Learning Setup Steps: git pull [email protected]:alex-nooj/wordle_env.git From the wordle_env dire

2 Mar 28, 2022

Conservative Q Learning for Offline Reinforcement Reinforcement Learning in JAX

CQL-JAX This repository implements Conservative Q Learning for Offline Reinforcement Reinforcement Learning in JAX (FLAX). Implementation is built on

8 Nov 7, 2022

Reinforcement-learning - Repository of the class assignment questions for the course on reinforcement learning

DSE 314/614: Reinforcement Learning This repository containing reinforcement lea

4 Apr 15, 2022

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

A tour through tensorflow with financial data I present several models ranging in complexity from simple regression to LSTM and policy networks. The s

195 Dec 7, 2022

Python Environment for Bayesian Learning

Pebl is a python library and command line application for learning the structure of a Bayesian network given prior knowledge and observations. Pebl in

103 Jul 14, 2022

A Sklearn-like Framework for Hyperparameter Tuning and AutoML in Deep Learning projects. Finally have the right abstractions and design patterns to properly do AutoML. Let your pipeline steps have hyperparameter spaces. Enable checkpoints to cut duplicate calculations. Go from research to production environment easily.

Neuraxle Pipelines Code Machine Learning Pipelines - The Right Way. Neuraxle is a Machine Learning (ML) library for building machine learning pipeline

555 Dec 24, 2022

This project provides a stock market environment using OpenGym with Deep Q-learning and Policy Gradient.

Stock Trading Market OpenAI Gym Environment with Deep Reinforcement Learning using Keras Overview This project provides a general environment for stoc

769 Dec 25, 2022

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

TensorLayer is a novel TensorFlow-based deep learning and reinforcement learning library designed for researchers and engineers. It provides an extens

7.1k Dec 27, 2022

CowHerd is a partially-observed reinforcement learning environment

Related tags

Overview

CowHerd

Play Yourself

Training Agents

Environment Details

Reward

Termination

Observation Space

Action Space

Questions

You might also like...

The Environment I built to study Reinforcement Learning + Pokemon Showdown

Wordle Env: A Daily Word Environment for Reinforcement Learning

Conservative Q Learning for Offline Reinforcement Reinforcement Learning in JAX

Reinforcement-learning - Repository of the class assignment questions for the course on reinforcement learning

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Python Environment for Bayesian Learning

This project provides a stock market environment using OpenGym with Deep Q-learning and Policy Gradient.

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

Owner

Danijar Hafner

[CVPR2021] DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets

PyTorch implementation for Partially View-aligned Representation Learning with Noise-robust Contrastive Loss (CVPR 2021)

BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalanced Tongue Data

Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

DeepMind Alchemy task environment: a meta-reinforcement learning benchmark

Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.

Reinforcement learning models in ViZDoom environment

Predicting path with preference based on user demonstration using Maximum Entropy Deep Inverse Reinforcement Learning in a continuous environment

Multi-agent reinforcement learning algorithm and environment

CowHerd is a partially-observed reinforcement learning environment

Related tags

Overview

CowHerd

Play Yourself

Training Agents

Environment Details

Reward

Termination

Observation Space

Action Space

Questions

You might also like...

The Environment I built to study Reinforcement Learning + Pokemon Showdown

Wordle Env: A Daily Word Environment for Reinforcement Learning

Conservative Q Learning for Offline Reinforcement Reinforcement Learning in JAX

Reinforcement-learning - Repository of the class assignment questions for the course on reinforcement learning

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Python Environment for Bayesian Learning

This project provides a stock market environment using OpenGym with Deep Q-learning and Policy Gradient.

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

Owner

Danijar Hafner

[CVPR2021] DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets

PyTorch implementation for Partially View-aligned Representation Learning with Noise-robust Contrastive Loss (CVPR 2021)

BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalanced Tongue Data

Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

DeepMind Alchemy task environment: a meta-reinforcement learning benchmark

Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.

Reinforcement learning models in ViZDoom environment

Predicting path with preference based on user demonstration using Maximum Entropy Deep Inverse Reinforcement Learning in a continuous environment

Multi-agent reinforcement learning algorithm and environment

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.