In human society, communication is an effective mechanism for coordinating the behaviors of humans. In the field of deep multi-agent reinforcement learning (MARL), agents can also improve the overall learning performance and achieve their objectives by communication. MARL with communication research learns to solve multi-agent tasks (such as navigation, traffic, and video games), by communicating and sharing information.
Your jobIn this project, we focus on interpretability of the communication in MARL algorithms. We aim to bring together causality and multi-agent reinforcement learning, using causal methods to understand the effects of communication on MARL agents’ (learning) behaviors. This could cover better interpreting the potential causal relations between communication strategies and the learning performance, or using causal representation learning for developing more effective and interpretable MARL with communication algorithms.
As a PhD candidate, you will primarily perform research within the scope of the project culminating in a successful dissertation, as well as writing academic articles and presenting your work on international AI and machine learning conferences.
This position offers you rich development opportunities. You will be part of the
Hybrid Intelligence consortium, a network of excellence of universities and institutes in the Netherlands focused on the combination of human and machine intelligence. You will then have the chance to participate in international summer schools, conferences and workshops to broaden your research skills and network. In addition, you will have the opportunity to contribute to teaching and supervising AI-related thesis projects at both the Bachelor’s and Master’s levels (10-15% of your time).
This project is funded by
Hybrid Intelligence gravity project. This particular project is a collaboration between the
Intelligent Systems Lab at Utrecht University and the
Amsterdam Machine Learning Lab at University of Amsterdam. You will be part of the Intelligent Systems group at Utrecht University, and will work under the supervision of Dr Shihan Wang (daily supervisor), Dr Sara Magliacane (co-supervisor) and Professor Mehdi Dastani (promotor).