LEARNING TO SCHEDULE COMMUNICATION IN MULTI-AGENT REINFORCEMENT LEARNING

最新推荐文章于 2024-03-30 21:55:18 发布

Adam婷

最新推荐文章于 2024-03-30 21:55:18 发布

阅读量4.2k

点赞数 2

CC 4.0 BY-SA版权

分类专栏：强化学习深度强化学习论文研读

本文链接：https://blog.youkuaiyun.com/weixin_41697507/article/details/93465924

强化学习同时被 3 个专栏收录

26 篇文章 ¥19.90 ¥99.00

订阅专栏

超级会员免费看

论文研读

38 篇文章

订阅专栏

深度强化学习

19 篇文章

订阅专栏

SchedNet是一种用于多智能体深度强化学习的框架，解决在有限通信带宽和共享通信介质环境下，智能体如何自我调度、编码消息和根据接收到的信息选择行动的问题。通过学习每个代理部分观察信息的重要性，SchedNet决定哪些代理有权广播信息。在合作通信和导航以及捕食者-猎物的应用中，SchedNet相对于其他机制（如无通信和简单调度）表现出显著的性能优势。

ABSTRACT

Many real-world reinforcement learning tasks require multiple agents to make se- quential decisions under the agents’ interaction, where well-coordinated actions among the agents are crucial to achieve the target goal better at these tasks. One way to accelerate the coordination effect is to enable multiple agents to communi- cate with each other in a distributed manner and behave as a group. In this paper, we study a practical scenario when (i) the communication bandwidth is limited and (ii) the agents share the communication medium so that only a restricted num- ber of agents are able to simultaneously use the medium, as in the state-of-the-art wireless networking standards. This calls for a certain form of communication scheduling. In that re

了解本专栏