logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-05132023-124052


Tipo di tesi
Tesi di laurea magistrale
Autore
GANNETTI, MARCO
Indirizzo email
m.gannetti@studenti.unipi.it, marco.gannetti@gmail.com
URN
etd-05132023-124052
Titolo
Station-keeping of Sounding Balloon with Deep Reinforcement Learning
Dipartimento
INGEGNERIA CIVILE E INDUSTRIALE
Corso di studi
INGEGNERIA AEROSPAZIALE
Relatori
relatore Prof. Marcuccio, Salvo
supervisore Ing. Gemignani, Matteo
Parole chiave
  • AI
  • artificial intelligence
  • permanence
  • reinforcement learning
  • sounding balloon
  • Station-keeping
  • stratospheric platform
  • surveillance
Data inizio appello
13/06/2023
Consultabilità
Non consultabile
Data di rilascio
13/06/2063
Riassunto
Sounding balloons, often called weather balloons, are the smallest members of the aerostat family. Compared to other aerial platforms, such as drones or aircraft, or their larger aerostat counterparts (zero-pressure or super-pressure balloons), sounding balloons stay afloat in the atmosphere for relatively short periods of time, are extremely cheap to manufacture and operate and are generally considered to be a cost-effective solution for various applications such as surveillance, communication, and atmospheric research.

The capability to perform station-keeping would be crucial for these balloons to perform specific missions. However, the control of sounding balloons is quite challenging due to the strong dependence on their dynamics on atmospheric conditions. Traditional control methods based on mathematical models are not efficient for controlling these balloons in complex and uncertain environments. Therefore, there is a need for a new approach to effectively control sounding balloons.

The use of Deep Reinforcement Learning (DRL) has shown significant potential in solving complex control problems in various fields, including robotics, gaming, and autonomous vehicles. DRL combines the advantages of deep learning and reinforcement learning to learn optimal control policies. As of today, there is limited research on applying DRL to the control of high-altitude balloons.

The main objective of this thesis is to explore the use of DRL for controlling a sounding balloon to ensure permanence over a specified area. Specifically, we aim to implement the deep Q-network (DQN) algorithm to learn a control policy for the balloon, by exploiting the wind direction at different altitudes, reached by dropping ballast or releasing helium. This work aims to conduct experiments using a simulation environment and evaluate the performance of the trained DQN model in real historical wind data. The goal is to demonstrate that the DQN algorithm can effectively learn a control policy that achieves station-keeping with a high success rate and outperforms the baseline approaches.

We then used an auto-encoder to analyze wind data to investigate the underlying structures of wind data and identifying features where wind patterns were particularly conducive to station-keeping. By reducing the data into a condensed, easily interpretable form, we were able to gain new insights into the atmospheric conditions that affect station-keeping.

Finally, we demonstrated the possibility of station-keeping for the first time in history through an actual launch carried out on 05/25/2023.
File