Abstract
Research has shown that personalization of health interventions can contribute to an improved effectiveness. Reinforcement learning algorithms can be used to perform such tailoring. In this paper, we present a cluster-based reinforcement learning approach which learns optimal policies for groups of users. Such an approach can speed up the learning process while still giving a level of personalization. We apply both online and batch learning to learn policies over the clusters and introduce a publicly available simulator which we have developed to evaluate the approach. The results show batch learning significantly outperforms online learning. Furthermore, near-optimal clustering is found which proves to be beneficial in learning significantly better policies compared to learning per user and learning across all users.
Original language | English |
---|---|
Title of host publication | PRIMA 2018: Principles and Practice of Multi-Agent Systems |
Subtitle of host publication | Lecture Notes in Computer Science |
Volume | 11224 |
DOIs | |
Publication status | Published - 2018 |
Event | International Conference on Principles and Practice of Multi-Agent Systems - Tokyo, Japan Duration: 29 Oct 2018 → 2 Nov 2018 Conference number: 21 http://2018.prima-conference.org/ |
Conference
Conference | International Conference on Principles and Practice of Multi-Agent Systems |
---|---|
Abbreviated title | PRIMA 2018 |
Country/Territory | Japan |
City | Tokyo |
Period | 29/10/18 → 2/11/18 |
Internet address |