A Unified Taxonomy of Deep Learning Optimizers for Scalable and Efficient AI Systems
Abstract
The rapid advancement of artificial intelligence (AI), particularly large language models (LLMs), has created a significant socio-technical divide. The immense computational resources required for AI training increasingly limit participation to a few well-funded entities, hindering the democratization of AI research and raising concerns about environmental sustainability. While optimization algorithms are critical to reducing these resource barriers, the current landscape is highly fragmented, offering limited practical guidance for practitioners in resource-constrained environments. To address this accessibility gap, we present a unified taxonomy of deep learning optimizers that systematically organizes methods by their order of information: zeroth, first, and second order, while integrating emerging, IO-aware and Flash attention paradigms. Instead of merely enumerating algorithms, our approach emphasizes cost-efficiency, memory usage, and hardware constraints as pivotal factors for equitable AI development. Our synthesis of the literature reveals that system-level considerations, particularly IO efficiency, are essential not just for computational performance, but for making large-scale AI accessible. We introduce a decision-oriented framework that translates theoretical insights into practical guidelines, establishing a structured foundation for broader communities to train and deploy human-centered AI systems sustainably and efficiently.
Keywords: Artificial Intelligence, Computational Efficiency, Deep Learning, Efficient AI, Hardware/IO-Aware, Optimization
DOI: 10.54941/ahfe1008074
Cite this paper
More from this volume
- Values-Driven AI Framework for Preschool and Elderly Learning in Bulgaria
- History and Historians in the Age of AI
- Deliver cross-process automation across Finance, HR, Procurement by orchestrating actions across diverse systems -powered by AI & governed workflows
- AI With and Within User Research Across the Product Lifecycle
- Governing the Transition to Action: An Agentic Architecture for Situation-Aware LLMs
- Multi-source Food Names Mapping Using OpenAI vision, Manual Dictionary and Fuzzy Matching Techniques
- Product Design with Human-Machine Collaboration and AI Integration into Design Process
- Trust and Calibration in AI-mediated Decision Support under Conditions of Risk
- Identification of Influential Nodes and Discourse Features within Synthetical Hierarchical Communities in Online Social Networks
- Review of Supervised Machine learning cost estimation techniques for building projects
- A social media site for social well-being? The curious case of BeReal
- Methods and Tool Optimization for Similarity Avoidance in AI-Generated Graphic Design Content


AHFE Open Access