A New Advantage Actor-Critic Algorithm For Multi-Agent Environments

EasyChair Preprint 4231

6 pages • Date: September 21, 2020

Abstract

Reinforcement learning is currently one of the most actively researched fields of artificial intelligence. New algorithms are continually being developed, especially for deep reinforcement learning, where the selected action is computed with the help of a neural network. One subcategory of reinforcement learning is multi-agent reinforcement learning, where multiple agents are present in the world. In this paper, we modify an existing algorithm, Advantage Actor-Critic (A2C), to make it suitable for multi-agent scenarios. We then test the modified algorithm on our testbed, a cooperative-competitive pursuit-evasion environment.

Keyphrases: Advantage Actor-Critic, deep reinforcement learning, multi-agent reinforcement learning

Links:

https://easychair.org/publications/preprint/sqv4

BibTeX entry

BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:

@booklet{EasyChair:4231,
  author    = {Gabor Paczolay and Istvan Harmati},
  title     = {A New Advantage Actor-Critic Algorithm For Multi-Agent Environments},
  howpublished = {EasyChair Preprint 4231},
  year      = {EasyChair, 2020}}
