Download PDFOpen PDF in browser

A New Advantage Actor-Critic Algorithm For Multi-Agent Environments

EasyChair Preprint no. 4231

6 pagesDate: September 21, 2020


Reinforcement learning is one of the most researched fields of artificial intelligence right now. Newer and newer algorithms are being developed, especially for deep reinforcement learning, where the selected action is computed with the assist of a neural network. One of the subcategories of reinforcement learning is multi-agent reinforcement learning, where multiple agents are present in the world. In our paper, we modify an already existing algorithm, the Advantage Actor-Critic (A2C) to be suitable for multi-agent scenarios. Afterwards, we test the modified algorithm on our testbed, a cooperative-competitive pursuit-evasion environment.

Keyphrases: Advantage Actor Critic, Deep Reinforcement Learning, multi-agent reinforcement learning

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
  author = {Gabor Paczolay and Istvan Harmati},
  title = {A New Advantage Actor-Critic Algorithm For Multi-Agent Environments},
  howpublished = {EasyChair Preprint no. 4231},

  year = {EasyChair, 2020}}
Download PDFOpen PDF in browser