DRL-Based Improved UAV Swarm Control for Simultaneous Coverage and Tracking with Prior Experience Utilization
Abstract
:1. Introduction
- (1)
- The SCT-DRL algorithm addresses the challenge of managing interactions within UAV swarms by simplifying the problem into a series of pairwise interactions, which allows for a more manageable aggregation of target information. This methodological reduction enables the construction of a dynamic control framework that adapts to the varying number of proximate UAVs and targets, enhancing the swarm’s operational flexibility and efficiency.
- (2)
- The initialization of the value network using trajectory data from one-step optimization techniques (RC). This initialization strategy primes the UAVs to favor actions that expedite task completion, thereby reducing the need for extensive random exploration during the initial training phase. By leveraging this initialization, SCT-DRL builds upon the strengths of RC while mitigating its limitations through the incorporation of DRL. This synergy allows SCT-DRL to evolve beyond the myopic constraints of RC, offering a more foresighted value estimation that is honed through the iterative process of RL.
- (3)
- The efficacy of the SCT-DRL algorithm is empirically validated through a series of numerical experiments, which substantiate its potential to significantly enhance UAV swarm performance in simultaneous coverage and tracking tasks. These findings underscore the algorithm’s capacity to navigate the complexities of multi-UAV operations, offering a promising avenue for advancing autonomous swarm technologies.
2. Problem Formulation
2.1. Decentralized Partially Observable Markov Decision Process (Dec-POMDP)
2.2. Optimization Objective
3. Approach
3.1. Control Framework of SCT-DRL
Algorithm 1 SCT-DRL (Simultaneous Coverage and Tracking with Deep RL) |
|
3.2. Training Mechanism of SCT-DRL
3.2.1. Network Structure
3.2.2. Value Network Training
4. Experiment
4.1. Computational Complexity Analysis
4.1.1. Two-UAV Area Coverage Test
4.1.2. Swarm Area Coverage Test
4.2. Swarm Simultaneous Coverage and Tracking
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Wubben, J.; Hernández, D.; Cecilia, J.M.; Imberón, B.; Calafate, C.T.; Cano, J.C.; Manzoni, P.; Toh, C.K. Assignment and Take-Off Approaches for Large-Scale Autonomous UAV Swarms. IEEE Trans. Intell. Transp. Syst. 2023, 24, 4836–4847. [Google Scholar] [CrossRef]
- Zhang, X.; Duan, L. Energy-saving deployment algorithms of UAV swarm for sustainable wireless coverage. IEEE Trans. Veh. Technol. 2020, 69, 10320–10335. [Google Scholar] [CrossRef]
- Wu, J.; Yu, Y.; Ma, J.; Wu, J.; Han, G.; Shi, J.; Gao, L. Autonomous cooperative flocking for heterogeneous unmanned aerial vehicle group. IEEE Trans. Veh. Technol. 2021, 70, 12477–12490. [Google Scholar] [CrossRef]
- Yang, B.; Shi, H.; Xia, X. Federated imitation learning for UAV swarm coordination in urban traffic monitoring. IEEE Trans. Ind. Inform. 2022, 19, 6037–6046. [Google Scholar] [CrossRef]
- Li, J.; Sun, G.; Duan, L.; Wu, Q. Multi-objective optimization for UAV swarm-assisted IoT with virtual antenna arrays. IEEE Trans. Mob. Comput. 2023, 23, 4890–4907. [Google Scholar] [CrossRef]
- Zhang, X.; Zheng, J.; Su, T.; Ding, M.; Liu, H. An Effective Dynamic Constrained Two-Archive Evolutionary Algorithm for Cooperative Search-Track Mission Planning by UAV Swarms in Air Intelligent Transportation. IEEE Trans. Intell. Transp. Syst. 2023, 25, 944–958. [Google Scholar] [CrossRef]
- Bostelmann-Arp, L.; Steup, C.; Mostaghim, S. Free-Form Coverage Path Planning of Quadcopter Swarms for Search and Rescue Missions Using Multi-Objective Optimization. In Proceedings of the 2024 IEEE Congress on Evolutionary Computation (CEC), Yokohama, Japan, 30 June–5 July 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–8. [Google Scholar]
- Yang, Y.; Liang, Y.; Zhao, Y. An Analytical Solution for Obstacle Avoidance in Cooperative Area Coverage using UAV Swarms. In Proceedings of the 2024 36th Chinese Control and Decision Conference (CCDC), Xi’an, China, 25–27 May 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 2432–2437. [Google Scholar]
- Yang, F.; Ji, X.; Yang, C.; Li, J.; Li, B. Cooperative search of UAV swarm based on improved ant colony algorithm in uncertain environment. In Proceedings of the 2017 IEEE International Conference on Unmanned Systems (ICUS), Beijing, China, 27–29 October 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 231–236. [Google Scholar]
- Xiang, L.; Wang, F.; Xu, W.; Zhang, T.; Pan, M.; Han, Z. Dynamic uav swarm collaboration for multi-targets tracking under malicious jamming: Joint power, path and target association optimization. IEEE Trans. Veh. Technol. 2023, 73, 5410–5425. [Google Scholar] [CrossRef]
- Liang, Y.; Yang, Y.; Zhao, Y. Multi-Area Complete Coverage with Fixed-Wing UAV Swarms Based on Modified Ant Colony Algorithm. In Proceedings of the 2022 IEEE International Conference on Unmanned Systems (ICUS), Guangzhou, China, 28–30 October 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 732–737. [Google Scholar]
- Rekabi-Bana, F.; Hu, J.; Krajník, T.; Arvin, F. Unified robust path planning and optimal trajectory generation for efficient 3D area coverage of quadrotor UAVs. IEEE Trans. Intell. Transp. Syst. 2023, 25, 2492–2507. [Google Scholar] [CrossRef]
- Zhou, L.; Leng, S.; Liu, Q.; Wang, Q. Intelligent UAV swarm cooperation for multiple targets tracking. IEEE Internet Things J. 2021, 9, 743–754. [Google Scholar] [CrossRef]
- Zhou, L.; Leng, S.; Wang, Q.; Liu, Q. Integrated sensing and communication in UAV swarms for cooperative multiple targets tracking. IEEE Trans. Mob. Comput. 2022, 22, 6526–6542. [Google Scholar] [CrossRef]
- Khaledyan, M.; Vinod, A.P.; Oishi, M.; Richards, J.A. Optimal coverage control and stochastic multi-target tracking. In Proceedings of the 2019 IEEE 58th Conference on Decision and Control (CDC), Nice, France, 11–13 December 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 2467–2472. [Google Scholar]
- Chen, R.; Li, J.; Shen, L. A self-organized reciprocal control method for multi-robot simultaneous coverage and tracking. Assem. Autom. 2018, 38, 689–698. [Google Scholar] [CrossRef]
- Pimenta, L.C.; Schwager, M.; Lindsey, Q.; Kumar, V.; Rus, D.; Mesquita, R.C.; Pereira, G.A. Simultaneous coverage and tracking (SCAT) of moving targets with robot networks. In Proceedings of the Algorithmic Foundation of Robotics VIII: Selected Contributions of the Eight International Workshop on the Algorithmic Foundations of Robotics; Springer: Berlin/Heidelberg, Germany, 2010; pp. 85–99. [Google Scholar]
- Stergiopoulos, Y.; Tzes, A. Decentralized swarm coordination: A combined coverage/connectivity approach. J. Intell. Robot. Syst. 2011, 64, 603–623. [Google Scholar] [CrossRef]
- Moon, S.; Frew, E.W. Distributed cooperative control for joint optimization of sensor coverage and target tracking. In Proceedings of the 2017 International Conference on Unmanned Aircraft Systems (ICUAS), Miami, FL, USA, 13–16 June 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 759–766. [Google Scholar]
- Li, H.; Long, T.; Xu, G.; Wang, Y. Coupling-degree-based heuristic prioritized planning method for UAV swarm path generation. In Proceedings of the 2019 Chinese Automation Congress (CAC), Hangzhou, China, 22–24 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 3636–3641. [Google Scholar]
- Han, L.; Zhang, H. UAV path planning algorithm based on Global Optimal Solution Tracking Enhanced Particle Swarm Optimization. In Proceedings of the 2024 43rd Chinese Control Conference (CCC), Kunming, China, 28–31 July 2024; pp. 3125–3130. [Google Scholar]
- Chen, R.; Xu, N.; Li, J. A self-organized reciprocal decision approach for sensing coverage with multi-UAV swarms. Sensors 2018, 18, 1864. [Google Scholar] [CrossRef] [PubMed]
- Zhang, B.; Jing, T.; Lin, X.; Cui, Y.; Zhu, Y.; Zhu, Z. Deep Reinforcement Learning-based Collaborative Multi-UAV Coverage Path Planning. J. Phys. Conf. Ser. 2024, 2833, 012017. [Google Scholar] [CrossRef]
- Aydemir, F.; Cetin, A. Multi-agent dynamic area coverage based on reinforcement learning with connected agents. Comput. Syst. Sci. Eng. 2023, 45, 215–230. [Google Scholar] [CrossRef]
- Dai, A.; Li, R.; Zhao, Z.; Zhang, H. Graph convolutional multi-agent reinforcement learning for UAV coverage control. In Proceedings of the 2020 International Conference on Wireless Communications and Signal Processing (WCSP), Nanjing, China, 21–23 October 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1106–1111. [Google Scholar]
Objects | Properties | Descriptions |
---|---|---|
Algorithm | learning rate | 0.0001 |
discount factor | 0.95 | |
batch size | 150 | |
hidden layer size | (100, 100, 100) | |
loss function | Mean-Square Error (MSE) | |
optimization method | Stochastic Gradient Descent (SGD) | |
UAV | decision interval | 100 ms |
covering radius | 25 m | |
body radius | 0.5 m | |
maximum speed | 1 m/s |
Test Scenario | Task Completion Time (s) [ave/75%/90%] | ||
---|---|---|---|
n | Area (m) | RC | SCT-DRL |
2 | 90 × 90 | 53.50/59.91/68.00 | 25.00/29.13/34.00 |
4 | 100 × 100 | 68.38/77.66/89.50 | 35.00/42.25/49.00 |
6 | 150 × 100 | 87.13/100.63/112.38 | 49.00/60.00/66.50 |
8 | 100 × 200 | 102.88/118.44/131.63 | 62.00/71.00/79.00 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chen, Y.; Chen, R.; Huang, Y.; Xiong, Z.; Li, J. DRL-Based Improved UAV Swarm Control for Simultaneous Coverage and Tracking with Prior Experience Utilization. Drones 2024, 8, 784. https://doi.org/10.3390/drones8120784
Chen Y, Chen R, Huang Y, Xiong Z, Li J. DRL-Based Improved UAV Swarm Control for Simultaneous Coverage and Tracking with Prior Experience Utilization. Drones. 2024; 8(12):784. https://doi.org/10.3390/drones8120784
Chicago/Turabian StyleChen, Yiting, Runfeng Chen, Yuchong Huang, Zehao Xiong, and Jie Li. 2024. "DRL-Based Improved UAV Swarm Control for Simultaneous Coverage and Tracking with Prior Experience Utilization" Drones 8, no. 12: 784. https://doi.org/10.3390/drones8120784
APA StyleChen, Y., Chen, R., Huang, Y., Xiong, Z., & Li, J. (2024). DRL-Based Improved UAV Swarm Control for Simultaneous Coverage and Tracking with Prior Experience Utilization. Drones, 8(12), 784. https://doi.org/10.3390/drones8120784