Empty Banner

MENTOR - Machine-learning based control of complex multi-agent systems
for search and rescue operations in natural disasters

WP3 - Synthesis of Control-Tutored learning strategies for multi-agent systems

This WP will be focussed on the synthesis of multi-agent reinforcement learning strategies informed by control tutors as detailed in [O2]. The work on the control of complex multi-agent systems at NA will be complemented by research on multi-agent reinforcement learning at BO to synthesise strategies combining model-based controllers with MARL. All strategies will be then validated using the two testbed scenarios (synchronisation and herding) described in [O3]. The numerical implementation will be carried  at BO with input on the implementation of the control strategies from NA. All strategies will be then tested and validated numerically so as to evaluate their control and learning performance and contrast them with those of the existing alternatives reviewed as part of WP1.