TY - GEN
T1 - Multimodal reinforcement learning for partner specific adaptation in robot-multi-robot interaction
AU - Kirtay, Murat
AU - Hafner, Verena V.
AU - Asada, Minoru
AU - Kuhlen, Anna K.
AU - Oztop, Erhan
N1 - Funding Information:
Additional support is provided by the International Joint Research Promotion Program, Osaka University under the project “Developmentally and biologically realistic modeling of perspective invariant action understanding” and the Japan Society for the Promotion of Science, Grant-in-Aid for Scientific Research - project number 22H03670.
Funding Information:
ACKNOWLEDGMENT This research has received funding from the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy - EXC 2002/1 “Science of Intelligence” - project number 390523135.
Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Successful and efficient teamwork requires knowledge of the individual team members' expertise. Such knowledge is typically acquired in social interaction and forms the basis for socially intelligent, partner-Adapted behavior. This study aims to implement this ability in teams of multiple humanoid robots. To this end, a humanoid robot, Nao, interacted with three Pepper robots to perform a sequential audio-visual pattern recall task that required integrating multimodal information. Nao outsourced its decisions (i.e., action selections) to its robot partners to perform the task efficiently in terms of neural computational cost by applying reinforcement learning. During the interaction, Nao learned its partners' specific expertise, which allowed Nao to turn for guidance to the partner who has the expertise corresponding to the current task state. The cognitive processing of Nao included a multimodal auto-Associative memory that allowed the determination of the cost of perceptual processing (i.e., cognitive load) when processing audio-visual stimuli. In turn, the processing cost is converted into a reward signal by an internal reward generation module. In this setting, the learner robot Nao aims to minimize cognitive load by turning to the partner whose expertise corresponds to a given task state. Overall, the results indicate that the learner robot discovers the expertise of partners and exploits this information to execute its task with low neural computational cost or cognitive load.
AB - Successful and efficient teamwork requires knowledge of the individual team members' expertise. Such knowledge is typically acquired in social interaction and forms the basis for socially intelligent, partner-Adapted behavior. This study aims to implement this ability in teams of multiple humanoid robots. To this end, a humanoid robot, Nao, interacted with three Pepper robots to perform a sequential audio-visual pattern recall task that required integrating multimodal information. Nao outsourced its decisions (i.e., action selections) to its robot partners to perform the task efficiently in terms of neural computational cost by applying reinforcement learning. During the interaction, Nao learned its partners' specific expertise, which allowed Nao to turn for guidance to the partner who has the expertise corresponding to the current task state. The cognitive processing of Nao included a multimodal auto-Associative memory that allowed the determination of the cost of perceptual processing (i.e., cognitive load) when processing audio-visual stimuli. In turn, the processing cost is converted into a reward signal by an internal reward generation module. In this setting, the learner robot Nao aims to minimize cognitive load by turning to the partner whose expertise corresponds to a given task state. Overall, the results indicate that the learner robot discovers the expertise of partners and exploits this information to execute its task with low neural computational cost or cognitive load.
KW - Humanoid robots
KW - Reinforcement learning
KW - Cognitive load
KW - Turning
KW - Biology
KW - Computational efficiency
KW - Costs
UR - http://www.scopus.com/inward/record.url?scp=85146360474&partnerID=8YFLogxK
U2 - 10.1109/Humanoids53995.2022.10000205
DO - 10.1109/Humanoids53995.2022.10000205
M3 - Conference contribution
AN - SCOPUS:85146360474
T3 - IEEE-RAS International Conference on Humanoid Robots
SP - 843
EP - 850
BT - 2022 IEEE-RAS 21st International Conference on Humanoid Robots, Humanoids 2022
PB - IEEE Computer Society
T2 - 2022 IEEE-RAS 21st International Conference on Humanoid Robots, Humanoids 2022
Y2 - 28 November 2022 through 30 November 2022
ER -