Artificial intelligence engineering

Last updated October 31, 2025

Artificial intelligence engineering (AI engineering) is a technical discipline that focuses on the design, development, and deployment of AI systems. AI engineering involves applying engineering principles and methodologies to create scalable, efficient, and reliable AI-based solutions. It merges aspects of data engineering and software engineering to create real-world applications in diverse domains such as healthcare, finance, autonomous systems, and industrial automation.^[1]^[2]

According to Chip Huyen's book AI Engineering: Building Applications with Foundation Models, the term AI engineering refers to the process of building applications that use foundation models, which are typically models developed by a small number of research laboratories and made available as a service.^[3]

Huyen distinguishes this from machine learning (ML) engineering, which involves building and deploying models developed in-house. She notes that most practical AI systems combine both approaches. For example, a customer-support chatbot may use a generative model to produce responses while also incorporating locally built components such as request classifiers or scoring mechanisms to assess response quality. As a result, the terms AI engineering and ML engineering are often used together or interchangeably in practice.^[3]

The distinction and broader usage of the term have been discussed in industry publications and interviews, where AI engineering has been described as an emerging discipline focused on productionizing applications built with foundation models.^[4]^[5]^[6]

Key components

AI engineering integrates a variety of technical domains and practices, all of which are essential to building scalable, reliable, and ethical AI systems.

Data engineering and infrastructure

Data serves as the cornerstone of AI systems, necessitating careful engineering to ensure premium quality, wide spread availability, and usability. AI engineers gather large, diverse datasets from multiple sources such as databases, APIs, and real-time streams. This data undergoes cleaning, normalization, and preprocessing, often facilitated by automated data pipelines that manage extraction, transformation, and loading (ETL) processes.^[7]

Efficient storage solutions, such as SQL (or NoSQL) databases and data lakes, must be selected based on data characteristics and use cases. Security measures, including encryption and access controls, are critical for protecting sensitive information and ensuring compliance with regulations like GDPR. Scalability is essential, frequently involving cloud services and distributed computing frameworks to handle growing data volumes effectively.^[8]^[9]^[10]

Algorithm selection and optimization

Selecting the appropriate algorithm is crucial for the success of any AI system. Engineers evaluate the problem (which could be classification or regression, for example) to determine the most suitable machine learning algorithm, including deep learning paradigms.^[11]^[12]

Once an algorithm is chosen, optimizing it through hyperparameter tuning is essential to enhance efficiency and accuracy.^[13] Techniques such as grid search or Bayesian optimization are employed, and engineers often utilize parallelization to expedite training processes, particularly for large models and datasets.^[14] For existing models, techniques like transfer learning can be applied to adapt pre-trained models for specific tasks, reducing the time and resources needed for training.^[15]

Deep learning engineering

Deep learning is particularly important for tasks involving large and complex datasets. Engineers design neural network architectures tailored to specific applications, such as convolutional neural networks for visual tasks or recurrent neural networks for sequence-based tasks. Transfer learning, where pre-trained models are fine-tuned for specific use cases, helps streamline development and often enhances performance.^[16]

Optimization for deployment in resource-constrained environments, such as mobile devices, involves techniques like pruning and quantization to minimize model size while maintaining performance. Engineers also mitigate data imbalance through augmentation and synthetic data generation, ensuring robust model performance across various classes.^[16]

Natural language processing

Natural language processing (NLP) is a crucial component of AI engineering, focused on enabling machines to understand and generate human language. The process begins with text preprocessing to prepare data for machine learning models. Recent advancements, particularly transformer-based models like BERT and GPT, have greatly improved the ability to understand context in language.^[17]

AI engineers work on various NLP tasks, including sentiment analysis, machine translation, and information extraction. These tasks require sophisticated models that utilize attention mechanisms to enhance accuracy.^[18] Applications range from virtual assistants and chatbots to more specialized tasks like named-entity recognition (NER) and Part of speech (POS) tagging.^[19]^[20]

Reasoning and decision-making systems

Developing systems capable of reasoning and decision-making is a significant aspect of AI engineering. Whether starting from scratch or building on existing frameworks, engineers create solutions that operate on data or logical rules. Symbolic AI employs formal logic and predefined rules for inference, while probabilistic reasoning techniques like Bayesian networks help address uncertainty. These models are essential for applications in dynamic environments, such as autonomous vehicles, where real-time decision-making is critical.^[21]^[22]

Security

Security is a critical consideration in AI engineering, particularly as AI systems become increasingly integrated into sensitive and mission-critical applications. AI engineers implement robust security measures to protect models from adversarial attacks, such as evasion and poisoning, which can compromise system integrity and performance. Techniques such as adversarial training, where models are exposed to malicious inputs during development, help harden systems against these attacks.^[23]^[24]

Additionally, securing the data used to train AI models is of paramount importance. Encryption, secure data storage, and access control mechanisms are employed to safeguard sensitive information from unauthorized access and breaches. AI systems also require constant monitoring to detect and mitigate vulnerabilities that may arise post-deployment. In high-stakes environments like autonomous systems and healthcare, engineers incorporate redundancy and fail-safe mechanisms to ensure that AI models continue to function correctly in the presence of security threats.^[25]

Ethics and compliance

As AI systems increasingly influence societal aspects, ethics and compliance are vital components of AI engineering. Engineers design models to mitigate risks such as data poisoning and ensure that AI systems adhere to legal frameworks, such as data protection regulations like GDPR. Privacy-preserving techniques, including data anonymization and differential privacy, are employed to safeguard personal information and ensure compliance with international standards.^[26]

Ethical considerations focus on reducing bias in AI systems, preventing discrimination based on race, gender, or other protected characteristics. By developing fair and accountable AI solutions, engineers contribute to the creation of technologies that are both technically sound and socially responsible.^[27]

Workload

An AI engineer's workload revolves around the AI system's life cycle, which is a complex, multi-stage process.^[28] This process may involve building models from scratch or using pre-existing models through transfer learning, depending on the project's requirements.^[29] Each approach presents unique challenges and influences the time, resources, and technical decisions involved.

Problem definition and requirements analysis

Regardless of whether a model is built from scratch or based on a pre-existing model, the work begins with a clear understanding of the problem. The engineer must define the scope, understand the business context, and identify specific AI objectives that align with strategic goals. This stage includes consulting with stakeholders to establish key performance indicators (KPIs) and operational requirements.^[28]

When developing a model from scratch, the engineer must also decide which algorithms are most suitable for the task.^[11] Conversely, when using a pre-trained model, the workload shifts toward evaluating existing models and selecting the one most aligned with the task. The use of pre-trained models often allows for a more targeted focus on fine-tuning, as opposed to designing an entirely new model architecture.^[30]

Data acquisition and preparation

Data acquisition and preparation are critical stages regardless of the development method chosen, as the performance of any AI system relies heavily on high-quality, representative data.

For systems built from scratch, engineers must gather comprehensive datasets that cover all aspects of the problem domain, ensuring enough diversity and representativeness in the data to train the model effectively. This involves cleansing, normalizing, and augmenting the data as needed. Creating data pipelines and addressing issues like imbalanced datasets or missing values are also essential to maintain model integrity during training.^[31]

In the case of using pre-existing models, the dataset requirements often differ. Here, engineers focus on obtaining task-specific data that will be used to fine-tune a general model. While the overall data volume may be smaller, it needs to be highly relevant to the specific problem. Pre-existing models, especially those based on transfer learning, typically require fewer data, which accelerates the preparation phase, although data quality remains equally important.^[32]

Model design and training

The workload during the model design and training phase depends significantly on whether the engineer is building the model from scratch or fine-tuning an existing one.

When creating a model from scratch, AI engineers must design the entire architecture, selecting or developing algorithms and structures that are suited to the problem. For deep learning models, this might involve designing a neural network with the right number of layers, activation functions, and optimizers.^[33] Engineers go through several iterations of testing, adjusting hyperparameters, and refining the architecture.^[13] This process can be resource-intensive, requiring substantial computational power and significant time to train the model on large datasets.

For AI systems based on pre-existing models, the focus is more on fine-tuning. Transfer learning allows engineers to take a model that has already been trained on a broad dataset and adapt it for a specific task using a smaller, task-specific dataset. This method dramatically reduces the complexity of the design and training phase. Instead of building the architecture, engineers adjust the final layers and perform hyperparameter tuning. The time and computational resources required are typically lower than training from scratch, as pre-trained models have already learned general features that only need refinement for the new task.^[29]

Whether building from scratch or fine-tuning, engineers employ optimization techniques like cross-validation and early stopping to prevent overfitting. In both cases, model training involves running numerous tests to benchmark performance and improve accuracy.^[34]

System integration

Once the model is trained, it must be integrated into the broader system, a phase that largely remains the same regardless of how the model was developed. System integration involves connecting the AI model to various software components and ensuring that it can interact with external systems, databases, and user interfaces.

For models developed from scratch, integration may require additional work to ensure that the custom-built architecture aligns with the operational environment, especially if the AI system is designed for specific hardware or edge computing environments. Pre-trained models, by contrast, are often more flexible in terms of deployment since they are built using widely adopted frameworks, which are compatible with most modern infrastructure.^[35]

Engineers use containerization tools to package the model and create consistent environments for deployment, ensuring seamless integration across cloud-based or on-premise systems. Whether starting from scratch or using pre-trained models, the integration phase requires ensuring that the model is ready to scale and perform efficiently within the existing infrastructure.^[36]^[37]

Testing and validation

Testing and validation play a crucial role in both approaches, though the depth and nature of testing might differ slightly. For models built from scratch, more exhaustive functional testing is needed to ensure that the custom-built components of the model function as intended. Stress tests are conducted to evaluate the system under various operational loads, and engineers must validate that the model can handle the specific data types and edge cases of the domain.^[38]

For pre-trained models, the focus of testing is on ensuring that fine-tuning has adequately adapted the model to the task. Functional tests validate that the pre-trained model's outputs are accurate for the new context. In both cases, bias assessments, fairness evaluations, and security reviews are critical to ensure ethical AI practices and prevent vulnerabilities, particularly in sensitive applications like finance, healthcare, or autonomous systems.^[39]

Explainability is also essential in both workflows, especially when working in regulated industries or with stakeholders who need transparency in AI decision-making processes. Engineers must ensure that the model's predictions can be understood by non-technical users and align with ethical and regulatory standards.^[40]

Deployment and monitoring

The deployment stage typically involves the same overarching strategies—whether the model is built from scratch or based on an existing model. However, models built from scratch may require more extensive fine-tuning during deployment to ensure they meet performance requirements in a production environment. For example, engineers might need to optimize memory usage, reduce latency, or adapt the model for edge computing.^[41]^[42]^[43]

When deploying pre-trained models, the workload is generally lighter. Since these models are often already optimized for production environments, engineers can focus on ensuring compatibility with the task-specific data and infrastructure. In both cases, deployment techniques such as phased rollouts, A/B testing, or canary deployments are used to minimize risks and ensure smooth transition into the live environment.^[44]^[45]^[46]

Monitoring, however, is critical in both approaches. Once the AI system is deployed, engineers set up performance monitoring to detect issues like model drift, where the model's accuracy decreases over time as data patterns change. Continuous monitoring helps identify when the model needs retraining or recalibration. For pre-trained models, periodic fine-tuning may suffice to keep the model performing optimally, while models built from scratch may require more extensive updates depending on how the system was designed.^[47]^[48]

Regular maintenance includes updates to the model, re-validation of fairness and bias checks, and security patches to protect against adversarial attacks.

Machine learning operations (MLOps)

MLOps, or Artificial Intelligence Operations (AIOps), is a critical component in modern AI engineering, integrating machine learning model development with reliable and efficient operations practices. Similar to the DevOps practices in software development, MLOps provides a framework for continuous integration, continuous delivery (CI/CD), and automated monitoring of machine learning models throughout their lifecycle. This practice bridges the gap between data scientists, AI engineers, and IT operations, ensuring that AI models are deployed, monitored, and maintained effectively in production environments.^[49]

MLOps is particularly important as AI systems scale to handle more complex tasks and larger datasets. Without robust MLOps practices, models risk underperforming or failing once deployed into production, leading to issues such as downtime, ethical concerns, or loss of stakeholder trust. By establishing automated, scalable workflows, MLOps allows AI engineers to manage the entire lifecycle of machine learning models more efficiently, from development through to deployment and ongoing monitoring.^[50]

Additionally, as regulatory frameworks around AI systems continue to evolve, MLOps practices are critical for ensuring compliance with legal requirements, including data privacy regulations and ethical AI guidelines. By incorporating best practices from MLOps, organizations can mitigate risks, maintain high performance, and scale AI solutions responsibly.^[49]

Challenges

AI engineering faces a distinctive set of challenges that differentiate it from traditional software development. One of the primary issues is model drift, where AI models degrade in performance over time due to changes in data patterns, necessitating continuous retraining and adaptation.^[51] Additionally, data privacy and security are critical concerns, particularly when sensitive data is used in cloud-based models.^[52]^[53] Ensuring model explainability is another challenge, as complex AI systems must be made interpretable for non-technical stakeholders.^[54] Bias and fairness also require careful handling to prevent discrimination and promote equitable outcomes, as biases present in training data can propagate through AI algorithms, leading to unintended results.^[55]^[27] Addressing these challenges requires a multidisciplinary approach, combining technical acumen with ethical and regulatory considerations.

Sustainability

Training large-scale AI models involves processing immense datasets over prolonged periods, consuming considerable amounts of energy. This has raised concerns about the environmental impact of AI technologies, given the expansion of data centers required to support AI training and inference.^[56]^[57]

The increasing demand for computational power has led to significant electricity consumption, with AI-driven applications often leaving a substantial carbon footprint. In response, AI engineers and researchers are exploring ways to mitigate these effects by developing more energy-efficient algorithms, employing green data centers, and leveraging renewable energy sources. Addressing the sustainability of AI systems is becoming a critical aspect of responsible AI development as the industry continues to scale globally.^[58]^[59]

Educational pathways

Education in AI engineering typically involves advanced courses in software and data engineering. Key topics include machine learning, deep learning, natural language processing and computer vision. Many universities now offer specialized programs in AI engineering at both the undergraduate and postgraduate levels, including hands-on labs, project-based learning, and interdisciplinary courses that bridge AI theory with engineering practices.^[60]

Professional certifications can also supplement formal education. Additionally, hands-on experience with real-world projects, internships, and contributions to open-source AI initiatives are highly recommended to build practical expertise.^[60]^[61]

References

↑ "What is Ai Engineering? Exploring the Roles of an Ai Engineer". ARTiBA. Retrieved 2024-10-17.
↑ Marr, Bernard. "15 Amazing Real-World Applications Of AI Everyone Should Know About". Forbes. Retrieved 2024-10-17.
1 2 Huyen, Chip (2025). AI Engineering: Building Applications with Foundation Models. O’Reilly Media. ISBN 9781098166298.
↑ Orosz, Gergely (2025-02-05). "AI Engineering with Chip Huyen". The Pragmatic Engineer. Retrieved 2025-10-30.
↑ Rachitsky, Lenny (2025-10-22). "AI Engineering 101 with Chip Huyen". Lenny’s Podcast. Retrieved 2025-10-30.
↑ Wang, Shawn (swyx) (2023-06-30). "The Rise of the AI Engineer". Latent.Space. Retrieved 2025-10-30.
↑ DEITEL, Paul J.; DEITEL, Harvey M. (2019). Intro to Python for Computer Science and Data Science: Learning to Program with AI, Big Data and The Cloud. Pearson.
↑ Tobin, Donal. "Which Database Is Right for Your Use Case?". Integrate.io. Retrieved 2024-10-18.
↑ Scalzo, Bert (2022-08-16). "Data Modeling 301 for the cloud: data lake and NoSQL data modeling and design". erwin Expert Blog. Retrieved 2024-10-18.
↑ Bahmani, Amir; Alavi, Arash; Buergel, Thore; Upadhyayula, Sushil; Wang, Qiwen; Ananthakrishnan, Srinath Krishna; Alavi, Amir; Celis, Diego; Gillespie, Dan; Young, Gregory; Xing, Ziye; Nguyen, Minh Hoang Huynh; Haque, Audrey; Mathur, Ankit; Payne, Josh (2021-10-01). "A scalable, secure, and interoperable platform for deep data-driven health management". Nature Communications. 12 (1): 5757. Bibcode:2021NatCo..12.5757B. doi:10.1038/s41467-021-26040-1. ISSN 2041-1723. PMC 8486823 . PMID 34599181.
1 2 Bischl, Bernd; Kerschke, Pascal; Kotthoff, Lars; Lindauer, Marius; Malitsky, Yuri; Fréchette, Alexandre; Hoos, Holger; Hutter, Frank; Leyton-Brown, Kevin; Tierney, Kevin; Vanschoren, Joaquin (2016-08-01). "ASlib: A benchmark library for algorithm selection". Artificial Intelligence. 237: 41–58. arXiv: 1506.02465 . doi:10.1016/j.artint.2016.04.003. ISSN 0004-3702.
↑ "Explained: Neural networks". MIT News | Massachusetts Institute of Technology. 2017-04-14. Retrieved 2024-10-18.
1 2 FEURER, Matthias; HUTTER, Frank. "Hyperparameter optimization". AutoML: Methods, Systems, Challenges. pp. 3–38.
↑ "Grid Search, Random Search, and Bayesian Optimization". Keylabs: latest news and updates. 2024-08-21. Retrieved 2024-10-18.
↑ West, Jeremy; Ventura, Dan; Warnick, Sean (2007). "Spring Research Presentation: A Theoretical Foundation for Inductive Transfer". Brigham Young University, College of Physical and Mathematical Sciences. Archived from the original on 2007-08-01. Retrieved 2024-10-18.
1 2 Chaudhury, Krishnendu (2024). Math and Architectures of Deep Learning. Manning Publications.
↑ "The Power of Natural Language Processing". Harvard Business Review. 2022-04-19. ISSN 0017-8012 . Retrieved 2024-10-18.
↑ Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N.; Kaiser, Lukasz; Polosukhin, Illia (2023-08-01), Attention Is All You Need, arXiv: 1706.03762
↑ Sun, Peng; Yang, Xuezhen; Zhao, Xiaobing; Wang, Zhijuan (November 2018). "An Overview of Named Entity Recognition". 2018 International Conference on Asian Language Processing (IALP). IEEE. pp. 273–278. doi:10.1109/IALP.2018.8629225. ISBN 978-1-7281-1175-9.
↑ Chiche, Alebachew; Yitagesu, Betselot (2022-01-24). "Part of speech tagging: a systematic review of deep learning and machine learning approaches". Journal of Big Data . 9 (1): 10. doi: 10.1186/s40537-022-00561-y . ISSN 2196-1115.
↑ Susskind, Zachary; Arden, Bryce; John, Lizy K.; Stockton, Patrick; John, Eugene B. (2021-09-13), Neuro-Symbolic AI: An Emerging Class of AI Workloads and their Characterization, arXiv: 2109.06133
↑ Garnelo, Marta; Shanahan, Murray (2019-10-01). "Reconciling deep learning with symbolic artificial intelligence: representing objects and relations". Current Opinion in Behavioral Sciences. 29: 17–23. doi: 10.1016/j.cobeha.2018.12.010 . hdl: 10044/1/67796 . ISSN 2352-1546.
↑ Fan, Jiaxin; Yan, Qi; Li, Mohan; Qu, Guanqun; Xiao, Yang (July 2022). "A Survey on Data Poisoning Attacks and Defenses". 2022 7th IEEE International Conference on Data Science in Cyberspace (DSC). IEEE. pp. 48–55. doi:10.1109/DSC55868.2022.00014. ISBN 978-1-6654-7480-1.
↑ Ren, Kui; Zheng, Tianhang; Qin, Zhan; Liu, Xue (2020-03-01). "Adversarial Attacks and Defenses in Deep Learning". Engineering. 6 (3): 346–360. Bibcode:2020Engin...6..346R. doi: 10.1016/j.eng.2019.12.012 . ISSN 2095-8099.
↑ "How should we assess security and data minimisation in AI?". ico.org.uk. 2024-07-31. Retrieved 2024-10-23.
↑ European Parliament. Directorate General for Parliamentary Research Services. (2020). The impact of the general data protection regulation on artificial intelligence. LU: Publications Office. doi:10.2861/293. ISBN 978-92-846-6771-0.
1 2 Ferrara, Emilio (March 2024). "Fairness and Bias in Artificial Intelligence: A Brief Survey of Sources, Impacts, and Mitigation Strategies". Sci. 6 (1): 3. arXiv: 2304.07683 . doi: 10.3390/sci6010003 . ISSN 2413-4155.
1 2 Haakman, Mark; Cruz, Luís; Huijgens, Hennie; van Deursen, Arie (2021-07-08). "AI lifecycle models need to be revised". Empirical Software Engineering. 26 (5): 95. doi: 10.1007/s10664-021-09993-1 . ISSN 1573-7616.
1 2 Fritz (2023-09-21). "Pre-Trained Machine Learning Models vs Models Trained from Scratch". Fritz ai. Retrieved 2024-10-18.
↑ Alshalali, Tagrid; Josyula, Darsana (December 2018). "Fine-Tuning of Pre-Trained Deep Learning Models with Extreme Learning Machine". 2018 International Conference on Computational Science and Computational Intelligence (CSCI). IEEE. pp. 469–473. doi:10.1109/CSCI46756.2018.00096. ISBN 978-1-7281-1360-9.
↑ Jain, Mitaksh (2024-05-27). "Data Acquisition: The Ultimate Guide to Master Machine Learning". Emeritus Online Courses. Retrieved 2024-10-18.
↑ Dodge, Jesse; Ilharco, Gabriel; Schwartz, Roy; Farhadi, Ali; Hajishirzi, Hannaneh; Smith, Noah (2020-02-14), Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping, arXiv: 2002.06305
↑ "What is a Model Architecture? - Hopsworks". www.hopsworks.ai. Retrieved 2024-10-18.
↑ "How the training of the AI models works - The Data Scientist". 2023-08-16. Retrieved 2024-10-18.
↑ Thórisson, Kristinn R.; Benko, Hrvoje; Abramov, Denis; Arnold, Andrew; Maskey, Sameer; Vaseekaran, Aruchunan (2004). "Constructionist Design Methodology for Interactive Intelligences" (PDF). A.I. Magazine. Archived from the original (PDF) on 2006-09-03.
↑ "AI Model Packaging Best Practices | Restackio". Restack. Retrieved 2024-10-23.
↑ Mungoli*, Neelesh (2023). Scalable, Distributed AI Frameworks: Leveraging Cloud Computing for Enhanced Deep Learning Performance and Efficiency. arXiv: 2304.13738 .
↑ "The Role of Performance Testing in AI Applications Digital Product Modernization". Round The Clock Technologies. 2023-07-11. Retrieved 2024-10-23.
↑ "Artificial intelligence validation and verification service". SQS. Retrieved 2024-10-23.
↑ Hand, David J.; Khan, Shakeel (June 2020). "Validating and Verifying AI Systems". Patterns. 1 (3) 100037. doi: 10.1016/j.patter.2020.100037 . ISSN 2666-3899. PMC 7660449 . PMID 33205105.
↑ Katsaragakis, Manolis; Papadopoulos, Lazaros; Konijnenburg, Mario; Catthoor, Francky; Soudris, Dimitrios (October 2020). "Memory Footprint Optimization Techniques for Machine Learning Applications in Embedded Systems". 2020 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE. pp. 1–4. doi:10.1109/ISCAS45731.2020.9181038. ISBN 978-1-7281-3320-1.
↑ Ramkumar, M.; Karthick, R.; Jeyashree, A. (2023). "SMART MONITORING AND ENHANCEMENT OF NETWORK LATENCY IN 5G CLOUD COMPUTING USING AI BASED MACHINE LEARNING MODEL". Ictact Journal on Data Science and Machine Learning. 04 (2): 421–425.
↑ Hua, Haochen; Li, Yutong; Wang, Tonghe; Dong, Nanqing; Li, Wei; Cao, Junwei (2023-01-16). "Edge Computing with Artificial Intelligence: A Machine Learning Perspective" . ACM Comput. Surv. 55 (9): 184:1–184:35. doi:10.1145/3555802. ISSN 0360-0300.
↑ "Rollout and Training • BT Standard". BT Standard. Retrieved 2024-10-23.
↑ Brinkmann, Demetrios (2022-09-15). "The what, why and how of Machine Learning A/B Tests". MLOps Community. Retrieved 2024-10-23.
↑ Roller, Joshua (2024-06-20). "Canary Deployment: What It Is and Why It Matters". IEEE Computer Society. Retrieved 2024-10-23.
↑ Onnes, Annet (2022-05-05), Monitoring AI systems: A Problem Analysis, Framework and Outlook, arXiv: 2205.02562
↑ "Why Continuous Monitoring is Essential for Maintaining AI Integrity – Wisecube AI – Research Intelligence Platform" . Retrieved 2024-10-23.
1 2 Treveil, Mark; Omont, Nicolas; Stenac, Clément; Lefevre, Kenji; Phan, Du; Zentici, Joachim; Lavoillotte, Adrien; Miyazaki, Makoto; Heidmann, Lynn (2020). Introducing MLOps. O'Reilly Media, Inc.
↑ Gift, Noah; Deza, Alfredo (2021). Practical MLOps. O'Reilly Media, Inc.
↑ Carter, Rickey E.; Anand, Vidhu; Harmon, David M.; Pellikka, Patricia A. (2022-01-01). "Model drift: When it can be a sign of success and when it can be an occult problem". Intelligence-Based Medicine. 6 100058. doi: 10.1016/j.ibmed.2022.100058 . ISSN 2666-5212.
↑ Luqman, Alka; Mahesh, Riya; Chattopadhyay, Anupam (2024-01-31), Privacy and Security Implications of Cloud-Based AI Services: A Survey, arXiv: 2402.00896
↑ Hassan, M. Ali (2024-03-20). "Cloud-Based AI: What Does it Mean For Cybersecurity?". Vaporvm. Retrieved 2024-10-23.
↑ Ali, Sajid; Abuhmed, Tamer; El-Sappagh, Shaker; Muhammad, Khan; Alonso-Moral, Jose M.; Confalonieri, Roberto; Guidotti, Riccardo; Del Ser, Javier; Díaz-Rodríguez, Natalia; Herrera, Francisco (2023-11-01). "Explainable Artificial Intelligence (XAI): What we know and what is left to attain Trustworthy Artificial Intelligence". Information Fusion. 99 101805. doi:10.1016/j.inffus.2023.101805. hdl: 10481/84480 . ISSN 1566-2535.
↑ "What Do We Do About the Biases in AI?". Harvard Business Review. 2019-10-25. ISSN 0017-8012 . Retrieved 2024-10-23.
↑ Editorial, MITSloan ME (2024-04-19). "Hyperscale Data Centers Growing at an Impressive Pace, Driven by Generative AI". MIT Sloan Management Review Middle East. Retrieved 2024-10-23.
↑ "Technology News | TechHQ | Latest Technology News & Analysis". TechHQ. 2024-01-26. Retrieved 2024-10-23.
↑ "AI models are devouring energy. Tools to reduce consumption are here, if data centers will adopt. | MIT Lincoln Laboratory". www.ll.mit.edu. Retrieved 2024-10-23.
↑ "Recommendations on Powering Artificial Intelligence and Data Center Infrastructure" (PDF). U.S. Department of Energy. 2024-07-30. Retrieved 2024-10-23.
1 2 "Guide: How To Become An Artificial Intelligence Engineer". Indeed. 2024-05-06.
↑ "How To Become an Artificial Intelligence (AI) Engineer". Upwork. 2023-06-22.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] "What is Ai Engineering? Exploring the Roles of an Ai Engineer". ARTiBA. Retrieved 2024-10-17.

[2] Marr, Bernard. "15 Amazing Real-World Applications Of AI Everyone Should Know About". Forbes. Retrieved 2024-10-17.

[:7-3] 1 2 Huyen, Chip (2025). AI Engineering: Building Applications with Foundation Models. O’Reilly Media. ISBN 9781098166298.

[4] Orosz, Gergely (2025-02-05). "AI Engineering with Chip Huyen". The Pragmatic Engineer. Retrieved 2025-10-30.

[5] Rachitsky, Lenny (2025-10-22). "AI Engineering 101 with Chip Huyen". Lenny’s Podcast. Retrieved 2025-10-30.

[6] Wang, Shawn (swyx) (2023-06-30). "The Rise of the AI Engineer". Latent.Space. Retrieved 2025-10-30.

[7] DEITEL, Paul J.; DEITEL, Harvey M. (2019). Intro to Python for Computer Science and Data Science: Learning to Program with AI, Big Data and The Cloud. Pearson.

[8] Tobin, Donal. "Which Database Is Right for Your Use Case?". Integrate.io. Retrieved 2024-10-18.

[9] Scalzo, Bert (2022-08-16). "Data Modeling 301 for the cloud: data lake and NoSQL data modeling and design". erwin Expert Blog. Retrieved 2024-10-18.

[10] Bahmani, Amir; Alavi, Arash; Buergel, Thore; Upadhyayula, Sushil; Wang, Qiwen; Ananthakrishnan, Srinath Krishna; Alavi, Amir; Celis, Diego; Gillespie, Dan; Young, Gregory; Xing, Ziye; Nguyen, Minh Hoang Huynh; Haque, Audrey; Mathur, Ankit; Payne, Josh (2021-10-01). "A scalable, secure, and interoperable platform for deep data-driven health management". Nature Communications. 12 (1): 5757. Bibcode:2021NatCo..12.5757B. doi:10.1038/s41467-021-26040-1. ISSN 2041-1723. PMC 8486823 . PMID 34599181.

[:1-11] 1 2 Bischl, Bernd; Kerschke, Pascal; Kotthoff, Lars; Lindauer, Marius; Malitsky, Yuri; Fréchette, Alexandre; Hoos, Holger; Hutter, Frank; Leyton-Brown, Kevin; Tierney, Kevin; Vanschoren, Joaquin (2016-08-01). "ASlib: A benchmark library for algorithm selection". Artificial Intelligence. 237: 41–58. arXiv: 1506.02465 . doi:10.1016/j.artint.2016.04.003. ISSN 0004-3702.

[12] "Explained: Neural networks". MIT News | Massachusetts Institute of Technology. 2017-04-14. Retrieved 2024-10-18.

[:2-13] 1 2 FEURER, Matthias; HUTTER, Frank. "Hyperparameter optimization". AutoML: Methods, Systems, Challenges. pp. 3–38.

[14] "Grid Search, Random Search, and Bayesian Optimization". Keylabs: latest news and updates. 2024-08-21. Retrieved 2024-10-18.

[15] West, Jeremy; Ventura, Dan; Warnick, Sean (2007). "Spring Research Presentation: A Theoretical Foundation for Inductive Transfer". Brigham Young University, College of Physical and Mathematical Sciences. Archived from the original on 2007-08-01. Retrieved 2024-10-18.

[:0-16] 1 2 Chaudhury, Krishnendu (2024). Math and Architectures of Deep Learning. Manning Publications.

[17] "The Power of Natural Language Processing". Harvard Business Review. 2022-04-19. ISSN 0017-8012 . Retrieved 2024-10-18.

[18] Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N.; Kaiser, Lukasz; Polosukhin, Illia (2023-08-01), Attention Is All You Need, arXiv: 1706.03762

[19] Sun, Peng; Yang, Xuezhen; Zhao, Xiaobing; Wang, Zhijuan (November 2018). "An Overview of Named Entity Recognition". 2018 International Conference on Asian Language Processing (IALP). IEEE. pp. 273–278. doi:10.1109/IALP.2018.8629225. ISBN 978-1-7281-1175-9.

[20] Chiche, Alebachew; Yitagesu, Betselot (2022-01-24). "Part of speech tagging: a systematic review of deep learning and machine learning approaches". Journal of Big Data . 9 (1): 10. doi: 10.1186/s40537-022-00561-y . ISSN 2196-1115.

[21] Susskind, Zachary; Arden, Bryce; John, Lizy K.; Stockton, Patrick; John, Eugene B. (2021-09-13), Neuro-Symbolic AI: An Emerging Class of AI Workloads and their Characterization, arXiv: 2109.06133

[22] Garnelo, Marta; Shanahan, Murray (2019-10-01). "Reconciling deep learning with symbolic artificial intelligence: representing objects and relations". Current Opinion in Behavioral Sciences. 29: 17–23. doi: 10.1016/j.cobeha.2018.12.010 . hdl: 10044/1/67796 . ISSN 2352-1546.

[23] Fan, Jiaxin; Yan, Qi; Li, Mohan; Qu, Guanqun; Xiao, Yang (July 2022). "A Survey on Data Poisoning Attacks and Defenses". 2022 7th IEEE International Conference on Data Science in Cyberspace (DSC). IEEE. pp. 48–55. doi:10.1109/DSC55868.2022.00014. ISBN 978-1-6654-7480-1.

[24] Ren, Kui; Zheng, Tianhang; Qin, Zhan; Liu, Xue (2020-03-01). "Adversarial Attacks and Defenses in Deep Learning". Engineering. 6 (3): 346–360. Bibcode:2020Engin...6..346R. doi: 10.1016/j.eng.2019.12.012 . ISSN 2095-8099.

[25] "How should we assess security and data minimisation in AI?". ico.org.uk. 2024-07-31. Retrieved 2024-10-23.

[26] European Parliament. Directorate General for Parliamentary Research Services. (2020). The impact of the general data protection regulation on artificial intelligence. LU: Publications Office. doi:10.2861/293. ISBN 978-92-846-6771-0.

[Ferrara_3-27] 1 2 Ferrara, Emilio (March 2024). "Fairness and Bias in Artificial Intelligence: A Brief Survey of Sources, Impacts, and Mitigation Strategies". Sci. 6 (1): 3. arXiv: 2304.07683 . doi: 10.3390/sci6010003 . ISSN 2413-4155.

[:3-28] 1 2 Haakman, Mark; Cruz, Luís; Huijgens, Hennie; van Deursen, Arie (2021-07-08). "AI lifecycle models need to be revised". Empirical Software Engineering. 26 (5): 95. doi: 10.1007/s10664-021-09993-1 . ISSN 1573-7616.

[:4-29] 1 2 Fritz (2023-09-21). "Pre-Trained Machine Learning Models vs Models Trained from Scratch". Fritz ai. Retrieved 2024-10-18.

[30] Alshalali, Tagrid; Josyula, Darsana (December 2018). "Fine-Tuning of Pre-Trained Deep Learning Models with Extreme Learning Machine". 2018 International Conference on Computational Science and Computational Intelligence (CSCI). IEEE. pp. 469–473. doi:10.1109/CSCI46756.2018.00096. ISBN 978-1-7281-1360-9.

[31] Jain, Mitaksh (2024-05-27). "Data Acquisition: The Ultimate Guide to Master Machine Learning". Emeritus Online Courses. Retrieved 2024-10-18.

[32] Dodge, Jesse; Ilharco, Gabriel; Schwartz, Roy; Farhadi, Ali; Hajishirzi, Hannaneh; Smith, Noah (2020-02-14), Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping, arXiv: 2002.06305

[33] "What is a Model Architecture? - Hopsworks". www.hopsworks.ai. Retrieved 2024-10-18.

[34] "How the training of the AI models works - The Data Scientist". 2023-08-16. Retrieved 2024-10-18.

[35] Thórisson, Kristinn R.; Benko, Hrvoje; Abramov, Denis; Arnold, Andrew; Maskey, Sameer; Vaseekaran, Aruchunan (2004). "Constructionist Design Methodology for Interactive Intelligences" (PDF). A.I. Magazine. Archived from the original (PDF) on 2006-09-03.

[36] "AI Model Packaging Best Practices | Restackio". Restack. Retrieved 2024-10-23.

[37] Mungoli*, Neelesh (2023). Scalable, Distributed AI Frameworks: Leveraging Cloud Computing for Enhanced Deep Learning Performance and Efficiency. arXiv: 2304.13738 .

[38] "The Role of Performance Testing in AI Applications Digital Product Modernization". Round The Clock Technologies. 2023-07-11. Retrieved 2024-10-23.

[39] "Artificial intelligence validation and verification service". SQS. Retrieved 2024-10-23.

[40] Hand, David J.; Khan, Shakeel (June 2020). "Validating and Verifying AI Systems". Patterns. 1 (3) 100037. doi: 10.1016/j.patter.2020.100037 . ISSN 2666-3899. PMC 7660449 . PMID 33205105.

[41] Katsaragakis, Manolis; Papadopoulos, Lazaros; Konijnenburg, Mario; Catthoor, Francky; Soudris, Dimitrios (October 2020). "Memory Footprint Optimization Techniques for Machine Learning Applications in Embedded Systems". 2020 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE. pp. 1–4. doi:10.1109/ISCAS45731.2020.9181038. ISBN 978-1-7281-3320-1.

[42] Ramkumar, M.; Karthick, R.; Jeyashree, A. (2023). "SMART MONITORING AND ENHANCEMENT OF NETWORK LATENCY IN 5G CLOUD COMPUTING USING AI BASED MACHINE LEARNING MODEL". Ictact Journal on Data Science and Machine Learning. 04 (2): 421–425.

[43] Hua, Haochen; Li, Yutong; Wang, Tonghe; Dong, Nanqing; Li, Wei; Cao, Junwei (2023-01-16). "Edge Computing with Artificial Intelligence: A Machine Learning Perspective" . ACM Comput. Surv. 55 (9): 184:1–184:35. doi:10.1145/3555802. ISSN 0360-0300.

[44] "Rollout and Training • BT Standard". BT Standard. Retrieved 2024-10-23.

[45] Brinkmann, Demetrios (2022-09-15). "The what, why and how of Machine Learning A/B Tests". MLOps Community. Retrieved 2024-10-23.

[46] Roller, Joshua (2024-06-20). "Canary Deployment: What It Is and Why It Matters". IEEE Computer Society. Retrieved 2024-10-23.

[47] Onnes, Annet (2022-05-05), Monitoring AI systems: A Problem Analysis, Framework and Outlook, arXiv: 2205.02562

[48] "Why Continuous Monitoring is Essential for Maintaining AI Integrity – Wisecube AI – Research Intelligence Platform" . Retrieved 2024-10-23.

[:5-49] 1 2 Treveil, Mark; Omont, Nicolas; Stenac, Clément; Lefevre, Kenji; Phan, Du; Zentici, Joachim; Lavoillotte, Adrien; Miyazaki, Makoto; Heidmann, Lynn (2020). Introducing MLOps. O'Reilly Media, Inc.

[50] Gift, Noah; Deza, Alfredo (2021). Practical MLOps. O'Reilly Media, Inc.

[51] Carter, Rickey E.; Anand, Vidhu; Harmon, David M.; Pellikka, Patricia A. (2022-01-01). "Model drift: When it can be a sign of success and when it can be an occult problem". Intelligence-Based Medicine. 6 100058. doi: 10.1016/j.ibmed.2022.100058 . ISSN 2666-5212.

[52] Luqman, Alka; Mahesh, Riya; Chattopadhyay, Anupam (2024-01-31), Privacy and Security Implications of Cloud-Based AI Services: A Survey, arXiv: 2402.00896

[53] Hassan, M. Ali (2024-03-20). "Cloud-Based AI: What Does it Mean For Cybersecurity?". Vaporvm. Retrieved 2024-10-23.

[54] Ali, Sajid; Abuhmed, Tamer; El-Sappagh, Shaker; Muhammad, Khan; Alonso-Moral, Jose M.; Confalonieri, Roberto; Guidotti, Riccardo; Del Ser, Javier; Díaz-Rodríguez, Natalia; Herrera, Francisco (2023-11-01). "Explainable Artificial Intelligence (XAI): What we know and what is left to attain Trustworthy Artificial Intelligence". Information Fusion. 99 101805. doi:10.1016/j.inffus.2023.101805. hdl: 10481/84480 . ISSN 1566-2535.

[55] "What Do We Do About the Biases in AI?". Harvard Business Review. 2019-10-25. ISSN 0017-8012 . Retrieved 2024-10-23.

[56] Editorial, MITSloan ME (2024-04-19). "Hyperscale Data Centers Growing at an Impressive Pace, Driven by Generative AI". MIT Sloan Management Review Middle East. Retrieved 2024-10-23.

[57] "Technology News | TechHQ | Latest Technology News & Analysis". TechHQ. 2024-01-26. Retrieved 2024-10-23.

[58] "AI models are devouring energy. Tools to reduce consumption are here, if data centers will adopt. | MIT Lincoln Laboratory". www.ll.mit.edu. Retrieved 2024-10-23.

[59] "Recommendations on Powering Artificial Intelligence and Data Center Infrastructure" (PDF). U.S. Department of Energy. 2024-07-30. Retrieved 2024-10-23.

[:6-60] 1 2 "Guide: How To Become An Artificial Intelligence Engineer". Indeed. 2024-05-06.

[61] "How To Become an Artificial Intelligence (AI) Engineer". Upwork. 2023-06-22.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]

[52]

[53]

[54]

[55]

[56]

[57]

[58]

[59]

[60]

[61]