logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-10312024-113500


Tipo di tesi
Tesi di laurea magistrale
Autore
NAVADEHRAZI, OMID
URN
etd-10312024-113500
Titolo
Adaptive Networking and Resource Allocation for Distributed Data Parallel Training of Generative AI in Constrained Environments
Dipartimento
INFORMATICA
Corso di studi
INFORMATICA E NETWORKING
Relatori
relatore Pagano, Paolo
Parole chiave
  • cpu tarining
  • Distributed Data Parallel (DDP)
  • generative AI
  • network performance
  • resource-constrained environments
Data inizio appello
29/11/2024
Consultabilità
Completa
Riassunto
As AI models and datasets grow larger, the demand for training resources has increased, leaving smaller companies with limited servers and no powerful GPUs at risk of falling behind. This study examines how small-scale environments can use Distributed Data Parallel (DDP) to participate in AI advancements, even with resource constraints. By deploying Docker containers across CPU-based servers connected via Ethernet, the experiment simulates low-resource conditions. Key network metrics are monitored, and a recommendation system is proposed for optimal node selection. The results show that distributed training is feasible in constrained environments, enabling smaller organizations to remain competitive in AI development.
File