This paper focuses on an energy efficient scheduling problem for multiple unmanned aerial vehicles (UAVs) that assist in mobile edge computing. The goal is to maximize the long-term energy efficiency of the UAVs by optimizing their trajectory planning, energy renewal, and application placement. The paper proposes a triple learner based reinforcement learning approach to address the problem, which includes a trajectory learner, an energy learner, and an application learner. Simulations show that the proposed solution outperforms existing approaches.
Jiayuan Chen,
Changyan Yi,
Jialiuyuan Li,
Kun Zhu,
Jun Cai