Caihong Kai (Professor)

Doctoral supervisor, Master's supervisor

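Department: Department of Information and Communication Engineering

Education: Postgraduate (doctoral) graduate

Office: Feicui Science and Education Building, Room A605-2

Gender: Female

Contact: chkai@hfut.edu.cn, QQ: 35276426

Degree: Doctorate

Employment status: Currently employed

Alma mater: The Chinese University of Hong Kong

Disciplines: Communication and Information Systems
Computer Application Technology
Software Engineering (other specialties)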

Multi-agent Reinforcement Learning Based Joint Uplink-downlink Subcarrier Assignment and Power Allocation for D2D Underlay Networks

Impact factor: 2.1

Affiliation: School of Computer Science and Information Engineering, Hefei University of Technology

Journal: Wireless Networks

Funding: the National Natural Science Foundation of China under Grants 61971176 and 61901156, the Anhui Provi

Keywords: D2D communication; Power allocation; Joint uplink-downlink subcarrier assignment; DRL; DDQN

Abstract: This paper investigates joint uplink-downlink resource allocation in time-varying device-to-device (D2D) underlay wireless cellular networks. Specifically, we formulate the joint uplink-downlink subcarrier assignment and power allocation (SAPA) of D2D pairs as an optimization problem that maximizes the sum data rate (SDR) of all D2D pairs while guaranteeing the basic data rate requirements of both cellular users and D2D pairs. To accommodate the high dynamics of wireless networks, we develop a joint uplink-downlink SAPA scheme based on distributed deep reinforcement learning (DRL), in which each D2D pair acts as an agent and adopts the model-free double deep Q-network (DDQN) algorithm to solve the joint optimization problem. Moreover, in the proposed DDQN scheme, all agents share the same reward, which encourages collaborative behavior among agents and alleviates the mutual interference caused by subcarrier reuse between cellular users and D2D pairs. Numerical results show that the proposed DDQN method converges quickly to near-optimal performance and has low computational complexity, and thus could be adopted in large-scale D2D underlay wireless cellular networks.
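The double-DQN update mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the discount factor, and the encoding of an action as an index over (subcarrier, power level) choices are assumptions made for illustration only. It shows the generic double-DQN target, where the online network selects the next action and the target network evaluates it, applied per agent with one reward shared by all agents.

```python
from typing import List, Sequence


def argmax(values: Sequence[float]) -> int:
    """Return the index of the largest value (first index on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def ddqn_target(reward: float,
                q_online_next: Sequence[float],
                q_target_next: Sequence[float],
                gamma: float = 0.9,
                done: bool = False) -> float:
    """Generic double-DQN target for one agent.

    q_online_next / q_target_next hold the Q-values of the next state
    under the online and target networks. Each index is assumed
    (hypothetically) to encode one (subcarrier, power level) choice.
    """
    if done:
        return reward
    # Online network selects the action, target network evaluates it.
    best_action = argmax(q_online_next)
    return reward + gamma * q_target_next[best_action]


def shared_reward_targets(shared_reward: float,
                          q_online_next_per_agent: Sequence[Sequence[float]],
                          q_target_next_per_agent: Sequence[Sequence[float]],
                          gamma: float = 0.9) -> List[float]:
    """Compute one target per agent (D2D pair) from a common reward."""
    return [
        ddqn_target(shared_reward, q_online, q_target, gamma)
        for q_online, q_target in zip(q_online_next_per_agent,
                                      q_target_next_per_agent)
    ]


if __name__ == "__main__":
    # Two hypothetical agents, two actions each.
    targets = shared_reward_targets(
        shared_reward=1.0,
        q_online_next_per_agent=[[0.2, 0.8], [0.5, 0.1]],
        q_target_next_per_agent=[[0.4, 0.3], [0.6, 0.9]],
        gamma=0.5,
    )
    print(targets)
```

Because every agent receives the same reward in this sketch, an action that raises one pair's rate while degrading the others lowers the target for all agents, which is the cooperative incentive the abstract describes.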

Co-authors: Xiaowei Meng, Linsheng Mei

First author: Caihong Kai

Paper type: Journal article

Corresponding author: Wei Huang

Volume: 29

Pages: 891-907

ISSN: 1572-8196

Publication date: 2023-11-09

Indexed in: SCI, EI
