基于深度强化学习的云边协同计算迁移研究

陈思光; 陈佳民; 赵传信

doi:10.12263/DZXB.20200131

您当前的位置：

首页 >

文章列表页 >

基于深度强化学习的云边协同计算迁移研究

学术论文 | 更新时间：2025-12-08

- 基于深度强化学习的云边协同计算迁移研究
- Deep Reinforcement Learning Based Cloud-Edge Collaborative Computation Offloading Mechanism
- 电子学报 2021年49卷第1期页码：157-166
- 作者机构：
  
  1. 南京邮电大学江苏省通信与网络技术工程研究中心,江苏,南京,210003
  2. 安徽师范大学网络与信息安全安徽省重点实验室,安徽,芜湖,241002
  3. 南京邮电大学江苏省宽带无线通信和物联网重点实验室,江苏,南京,210003
  4. 南京邮电大学江苏省通信与网络技术工程研究中心,江苏,南京,210003
  5. 安徽师范大学网络与信息安全安徽省重点实验室,安徽,芜湖,241002
  6. 南京邮电大学江苏省宽带无线通信和物联网重点实验室,江苏,南京,210003
- 作者简介：
- 基金信息：
  
  国家自然科学基金 (No.61971235，No.61771258，No.61871412）;六大人才高峰"高层次人才项目 (No.XYDXXJS-044）;江苏省"333高层次人才培养工程"南京邮电大学"1311"人才计划;中国博士后科学基金 (面上一等） (No.2018M630590）;南京邮电大学国家自然科学基金孵化项目 (No.NY217057，No.NY218058）;网络与信息安全安徽省重点实验室开放课题 (No.AHNIS2020001）;江苏省通信与网络技术工程研究中心开放课题重点项目 (No.JSGCZX17011）;赛尔网络下一代互联网技术创新项目 (No.NGII20190702）
- DOI：10.12263/DZXB.20200131
  中图分类号： TP393
- 网络出版：2021-01-25，
  
  纸质出版：2021
- 稿件说明：
移动端阅览
陈思光, 陈佳民, 赵传信. 基于深度强化学习的云边协同计算迁移研究[J]. 电子学报, 2021,49(1):157-166.

CHEN Si-guang, CHEN Jia-min, ZHAO Chuan-xin. Deep Reinforcement Learning Based Cloud-Edge Collaborative Computation Offloading Mechanism[J]. Acta Electronica Sinica, 2021, 49(1): 157-166.
陈思光, 陈佳民, 赵传信. 基于深度强化学习的云边协同计算迁移研究[J]. 电子学报, 2021,49(1):157-166. DOI： 10.12263/DZXB.20200131.

CHEN Si-guang, CHEN Jia-min, ZHAO Chuan-xin. Deep Reinforcement Learning Based Cloud-Edge Collaborative Computation Offloading Mechanism[J]. Acta Electronica Sinica, 2021, 49(1): 157-166. DOI： 10.12263/DZXB.20200131.

摘要

基于单一边缘节点计算、存储资源的有限性及大数据场景对高效计算服务的需求，本文提出了一种基于深度强化学习的云边协同计算迁移机制.具体地，基于计算资源、带宽和迁移决策的综合性考量，构建了一个最小化所有用户任务执行延迟与能耗权重和的优化问题.基于该优化问题提出了一个异步云边协同的深度强化学习算法，该算法充分利用了云边双方的计算能力，可有效满足大数据场景对高效计算服务的需求；同时，面向边缘云中边缘节点所处环境的多样及动态变化性，该算法能自适应地调整迁移策略以实现系统总成本的最小化.最后，大量的仿真结果表明本文所提出的算法具有收敛速度快、鲁棒性高等特点，并能够以最低的计算成本获得近似贪心算法的最优迁移决策.

Abstract

Based on the computation and storage resources limitation of single edge node and the demand for efficient computing services in big data scenarios

this paper proposes a deep reinforcement learning based cloud-edge collaborative computation offloading mechanism. Specifically

based on a comprehensive consideration of computing resources

bandwidth and offloading policy

an optimization problem is formulated to minimize the weight sum of execution delay and energy consumption of all user tasks. An asynchronous cloud-edge collaborative deep reinforcement learning (ACEC-DRL) algorithm is proposed to solve such optimization problem. This algorithm can effectively satisfy the demand of efficient computing services in big data scenario by jointly leveraging the computation capabilities of cloud and edge nodes. Meanwhile

under the various and dynamic environments of edge nodes in the edge cloud

this algorithm can adaptively adjust offloading policy to achieve the minimization of system cost. Finally

the extensive simulation results show that the proposed ACEC-DRL algorithm has the characteristics of fast convergence rate and high robustness

and its optimal offloading policy closely approximates to the solution of greedy algorithm with the lowest computation cost.

关键词

Keywords

references

浏览量

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

基于Q学习的蜂窝车联网边缘计算系统PC-5/Uu接口联合卸载策略

基于学习的能量采集认知M2M通信资源分配算法

基于因果思维树的电动汽车电池SOC预测模型