Reinforcement Learning-Based Collision Avoidance and Optimal Trajectory Planning in UAV Communication Networks

Yu Hsin Hsu, Rung Hung Gau*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

30 Scopus citations


In this paper, we propose a reinforcement learning approach of collision avoidance and investigate optimal trajectory planning for unmanned aerial vehicle (UAV) communication networks. Specifically, each UAV takes charge of delivering objects in the forward path and collecting data from heterogeneous ground IoT devices in the backward path. We adopt reinforcement learning for assisting UAVs to learn collision avoidance without knowing the trajectories of other UAVs in advance. In addition, for each UAV, we use optimization theory to find out a shortest backward path that assures data collection from all associated IoT devices. To obtain an optimal visiting order for IoT devices, we formulate and solve a no-return traveling salesman problem. Given a visiting order, we formulate and solve a sequence of convex optimization problems to obtain line segments of an optimal backward path for heterogeneous ground IoT devices. We use analytical results and simulation results to justify the usage of the proposed approach. Simulation results show that the proposed approach is superior to a number of alternative approaches.

Original languageEnglish
Pages (from-to)306-320
Number of pages15
JournalIEEE Transactions on Mobile Computing
Issue number1
StatePublished - 1 Jan 2022


  • convex optimization
  • optimal trajectory planning
  • Reinforcement learning
  • traveling salesman problem with neighborhood
  • UAV collision avoidance


Dive into the research topics of 'Reinforcement Learning-Based Collision Avoidance and Optimal Trajectory Planning in UAV Communication Networks'. Together they form a unique fingerprint.

Cite this