A bayesian approach for learning and tracking switching, non-stationary opponents

Hernandez-Leal, P; Rosman, Benjamin S; Taylor, ME; Sucar, LE; de Cote, EM

A bayesian approach for learning and tracking switching, non-stationary opponents

http://dl.acm.org/citation.cfm?id=2937137
http://hdl.handle.net/10204/8650

Abstract:

In many situations, agents are required to use a set of strategies (behaviors) and switch among them during the course of an interaction. This work focuses on the problem of recognizing the strategy used by an agent within a small number of interactions. We propose using a Bayesian framework to address this problem. Bayesian policy reuse (BPR) has been empirically shown to be efficient at correctly detecting the best policy to use from a library in sequential decision tasks. In this paper we extend BPR to adversarial settings, in particular, to opponents that switch from one stationary strategy to another. Our proposed extension enables learning new models in an online fashion when the learning agent detects that the current policies are not performing optimally. Experiments presented in repeated games show that our approach is capable of efficiently detecting opponent strategies and reacting quickly to behavior switches, thereby yielding better performance than state-of-the-art approaches in terms of average rewards.

Reference:

Hernandez-Leal, P. Rosman, B.S. Taylor, M.E. Sucar, L.E. de Cote, E.M. 2016. A bayesian approach for learning and tracking switching, non-stationary opponents. In: Autonomous Agents and Multiagent Systems, 9-13 May 2016, Singapore

Hernandez-Leal, P., Rosman, B. S., Taylor, M., Sucar, L., & de Cote, E. (2016). A bayesian approach for learning and tracking switching, non-stationary opponents. ACM. http://hdl.handle.net/10204/8650

Hernandez-Leal, P, Benjamin S Rosman, ME Taylor, LE Sucar, and EM de Cote. "A bayesian approach for learning and tracking switching, non-stationary opponents." (2016): http://hdl.handle.net/10204/8650

Hernandez-Leal P, Rosman BS, Taylor M, Sucar L, de Cote E, A bayesian approach for learning and tracking switching, non-stationary opponents; ACM; 2016. http://hdl.handle.net/10204/8650 .

Download RIS

Autonomous Agents and Multiagent Systems, 9-13 May 2016, Singapore. Due to copyright restrictions, the attached PDF file only contains the abstract of the full text item. For access to the full text item, please consult the publisher's website.

Hernandez-Leal, P
Rosman, Benjamin S
Taylor, ME
Sucar, LE
de Cote, EM

Feb 2016

Policy reuse
Non-stationary opponents
Repeated games

Show full item record

Files in this item

Rosman_2016_ABSTRACT.pdf

This item appears in the following Collection(s)

Conference Publications

Browse

All of ResearchSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Publication Type
- Cluster
- Impact Area

Quick Links

Legislation and compliance

General Enquiries

Tel: + 27 12 841 2911
Email: callcentre@csir.co.za

Physical Address
Meiring Naudé Road
Brummeria
Pretoria
South Africa

Postal Address
PO Box 395
Pretoria 0001
South Africa

Social Connect

Resources on this site are free to download and reuse according to associated licensing provision. Please read the terms and conditions of usage of each resource.

A bayesian approach for learning and tracking switching, non-stationary opponents

A bayesian approach for learning and tracking switching, non-stationary opponents

This item appears in the following Collection(s)

Browse

All of ResearchSpace

This Collection

Quick Links

Legislation and compliance

General Enquiries

Social Connect