A Unified Taxonomy of Deep Learning Optimizers for Scalable and Efficient AI Systems

Carlos Villarreal; Jonathan Luzuriaga; Emilio Quinga; Nicolas Reinoso; Bryan Morales; Diana Martinez-Mosquera

doi:10.54941/ahfe1008074

AHFE International

Accelerating Open Access Science in Human Factors Engineering and Human-Centered Computing

A Unified Taxonomy of Deep Learning Optimizers for Scalable and Efficient AI Systems

Open Access

Article

Conference Proceedings

Authors: Carlos Villarreal, Jonathan Luzuriaga, Emilio Quinga, Nicolas Reinoso, Bryan Morales, Diana Martinez-Mosquera

Abstract

The rapid advancement of artificial intelligence (AI), particularly large language models (LLMs), has created a significant socio-technical divide. The immense computational resources required for AI training increasingly limit participation to a few well-funded entities, hindering the democratization of AI research and raising concerns about environmental sustainability. While optimization algorithms are critical to reducing these resource barriers, the current landscape is highly fragmented, offering limited practical guidance for practitioners in resource-constrained environments. To address this accessibility gap, we present a unified taxonomy of deep learning optimizers that systematically organizes methods by their order of information: zeroth, first, and second order, while integrating emerging, IO-aware and Flash attention paradigms. Instead of merely enumerating algorithms, our approach emphasizes cost-efficiency, memory usage, and hardware constraints as pivotal factors for equitable AI development. Our synthesis of the literature reveals that system-level considerations, particularly IO efficiency, are essential not just for computational performance, but for making large-scale AI accessible. We introduce a decision-oriented framework that translates theoretical insights into practical guidelines, establishing a structured foundation for broader communities to train and deploy human-centered AI systems sustainably and efficiently.

Keywords: Artificial Intelligence, Computational Efficiency, Deep Learning, Efficient AI, Hardware/IO-Aware, Optimization

DOI: 10.54941/ahfe1008074

Cite this paper

Downloads

47

Visits

74

Download PDF

More from this volume

← Evaluating Public Art in Commercial Complexes: A Dual-Channel Emotion Recognition Framework Fusing Facial Micro-Expressions and Semantic Analysis

View all articles in Artificial Intelligence and Social Computing →