Exploration and Rescue of Shipwreck Survivors using Reinforcement Learning-Empowered Drone Swarms

Authors

DOI:

https://doi.org/10.55972/spectrum.v26i1.421

Keywords:

Multi-Agent Systems, Reinforcement Learning, Simulation

Abstract

The goal of this project is to create a reinforcement learning algorithm that locates shipwrecked individuals using a swarm of drones. A simulated environment was developed to train and visualize the outcome of the trained algorithm, considering the ocean's dynamic circumstances. This project does not discuss image recognition of shipwrecked people, since the true focus of this project is to optimize the search routine of a drone to find the target in the most efficient way possible. The implemented Reinforce algorithm takes into account a dynamic map of probabilities, representing the chances of a person being found, as well as the position of other agents. Outcomes include an open-source python package for the environment and the implementation of the reinforcement learning algorithm. The algorithm demonstrates superiority over the predefined approach, proving the advantages of reinforcement learning in efficiency and effectiveness.

References

W. H. Organization, “Drowning fact sheets,” 2021, accessed in: Feb. 22, 2023. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/drowning.

U. S. G. Survey, “The distribution of water on, in, and above the earth,” 2019, accessed in: Feb. 22, 2023. [Online]. Available: https://www.usgs.gov/media/images/distribution-water-and-above-earth.

DJI, “Drone rescues around the world,” 2023, accessed in: Feb. 25, 2023. [Online]. Available: https://enterprise.dji.com/drone-rescue-map/.

E. Eliac¸ık, “Artificial intelligence vs. human intelligence: Can a game-changing technology play the game?” 2022, accessed in: Feb. 25, 2023. [Online]. Available: https://dataconomy.com/2022/04/is-artificial-intelligence-better-than-human-intelligence/.

E. T. Alotaibi, S. S. Alqefari, and A. Koubaa, “Lsar: Multi-uav collaboration for search and rescue missions,” IEEE Access, vol. 7, pp. 55 817–55 832, 2019.

B. Ai, M. Jia, H. Xu, J. Xu, Z. Wen, B. Li, and D. Zhang, “Coverage path planning for maritime search and rescue using reinforcement learning,” Ocean Engineering, vol. 241, p. 110098, 2021. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0029801821014220.

D. Schuldt and J. Kurucar, “Maritime search and rescue via multiple coordinated uas,” U.S. Department of Defense, Defense Technical Information Center, Tech. Rep. AD1030377, January 2017. [Online]. Available: https://apps.dtic.mil/sti/pdfs/AD1035043.pdf.

R. L. Falc˜ao, J. C. C. de Oliveira, P. H. B. A. Andrade, R. R. Rodrigues, F. J. Barth, and J. F. B. Brancalion, “Dsse: An environment for simulation of reinforcement learning-empowered drone swarm maritime search and rescue missions,” Journal of Open Source Software, vol. 9, no. 99, p. 6746, 2024. [Online]. Available: https://doi.org/10.21105/joss.06746.

F. Project, “Pettingzoo,” https://pettingzoo.farama.org/, 2023, accessed in: Mar. 23, 2023.

K. CHOUTRI, M. LAGHA, and L. DALA, “A fully autonomous search and rescue system using quadrotor uav,” International Journal of Computing and Digital Systems, vol. 10, pp. 2–12, 2021.

R. Sutton and A. G. Barto, Reinforcement learning: An introduction. MIT press, 2018.

R. J. Williams, “Simple statistical gradient-following algorithms for connectionist reinforcement learning,” Machine Learning, vol. 8, pp. 229–256, 1992. [Online]. Available: https://doi.org/10.1007/BF00992696.

A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. Kopf, E. Z. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and S. Chintala, “Pytorch: An imperative style, high-performance deep learning library,” CoRR, vol. abs/1912.01703, 2019. [Online]. Available: http://arxiv.org/abs/1912.01703.

Downloads

Published

2025-09-23

How to Cite

[1]
J. F. . Brancalion and F. Barth, “Exploration and Rescue of Shipwreck Survivors using Reinforcement Learning-Empowered Drone Swarms”, Spectrum, vol. 26, no. 1, pp. 14–20, Sep. 2025.

Issue

Section

Operational Analysis and Logistics Engineering

Categories