SDN环境下基于深度强化学习的流量调度研究

Research on traffic scheduling based on deep reinforcement learning in SDN

  • 摘要: 软件定义网络(software-defined networks,SDN)流量调度提升网络性能和资源利用率、实现节能和负载均衡至关重要. 传统的多目标优化算法在高流量和网络动态性增加的情况下显著影响算法的收敛速度,难以满足复杂网络环境的多样化需求. 针对此问题,提出了一种基于深度强化学习的流量预测在线路由算法——OTPR-DRL:根据流量特征预测关键流和普通流,结合网络状态和流量信息建立线性规划问题获得关键流路由的最优解. 为满足普通流不同服务质量(quality of service, QoS)需求,引入通用效用函数实现多目标优化,通过多智能体和优先级经验回放机制为普通流选择路由. 实验结果表明,在高流量强度下,OTPR-DRL与现有的算法相比,提高了收敛速度,至少降低了10.26%的网络传输时延,3.09%的丢包率,提高了1.7%的吞吐率.

     

    Abstract: In software-defined networks (SDN), traffic scheduling is crucial for enhancing network performance, resource utilization, energy efficiency, and load balancing.Traditional multi-objective optimization algorithms significantly affect the convergence speed of the algorithms under high traffic and increased network dynamics, making it difficult to meet the diverse demands of complex network environments.To address this problem, a deep reinforcement learning-based traffic prediction on-the-line routing algorithm, OTPR-DRL, is proposed: predicting critical and ordinary flows based on traffic characteristics and building a linear programming problem to obtain the optimal solution for critical flow routing by combining network state and traffic information.In order to meet the different QoS (quality of service) demands of ordinary flows, a general utility function is introduced to achieve multi-objective optimization, and routes are selected for ordinary flows through multi-intelligentsia and priority experience playback mechanisms.Experimental results show that OTPR-DRL improves the convergence speed, reduces at least 10.26% of network transmission delay, reduces 3.09% of packet loss, and improves 1.7% of throughput compared to existing algorithms under high traffic intensity.

     

/

返回文章
返回