Artificial intelligence in industry

Industrial artificial intelligence, or industrial AI, usually refers to the application of artificial intelligence to industry and business. Unlike general artificial intelligence, which is a frontier research discipline that aims to build computerized systems that perform tasks requiring human intelligence, industrial AI is more concerned with applying such technologies to industrial pain points for customer value creation, productivity improvement, cost reduction, site optimization, predictive analysis [1] and insight discovery. [2]

Artificial intelligence and machine learning have become key enablers for leveraging data in production in recent years, owing to several factors: more affordable sensors and automated data acquisition; more powerful computers that perform more complex tasks faster and at lower cost; and faster connectivity infrastructure together with more accessible cloud services for data management and outsourced computing power. [3]

Categories

Possible applications of industrial AI and machine learning in the production domain can be divided into seven application areas: [4]

Taxonomy of application areas and application scenarios for machine learning and artificial intelligence in production

Each application area can be further divided into specific application scenarios that describe concrete AI/ML scenarios in production. While some application areas are directly connected to production processes, others cover production-adjacent fields such as logistics or the factory building. [4]

Collaborative robots are an example from the application scenario Process Design & Innovation. Collaborative robotic arms are able to learn the motion and path demonstrated by human operators and then perform the same task. [5] Predictive and preventive maintenance through data-driven machine learning are exemplary application scenarios from the Machinery & Equipment application area. [4]

Challenges

In contrast to entirely virtual systems, in which ML applications are already widespread today, real-world production processes are characterized by the interaction between the virtual and the physical world. Data is recorded using sensors and processed on computational entities and, if desired, actions and decisions are translated back into the physical world via actuators or by human operators. [6] This poses major challenges for the application of ML in production engineering systems. These challenges arise from the interplay of process, data and model characteristics: the production domain's high reliability requirements, its high risk and loss potential, the multitude of heterogeneous data sources and the opacity of ML model behavior all impede faster adoption of ML in real-world production processes.
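The sense, process and act loop described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function `control_step`, its callback parameters and the confidence threshold are hypothetical names introduced here, and routing low-confidence decisions to a human operator is one possible design rather than a method taken from the cited sources.

```python
def control_step(read_sensor, decide, actuate, ask_operator, confidence_floor=0.8):
    """Run one sense -> process -> act cycle and report who carried out the action.

    All four callbacks are hypothetical stand-ins:
      read_sensor()            returns a reading from the physical world
      decide(reading)          returns (action, confidence) from some model
      actuate(action)          applies the action through an actuator
      ask_operator(reading, action) hands the decision to a human operator
    """
    reading = read_sensor()
    action, confidence = decide(reading)
    if confidence >= confidence_floor:
        actuate(action)  # the decision goes back to the physical world automatically
        return "actuator"
    ask_operator(reading, action)  # assumed fallback: a human makes the final call
    return "operator"


# Illustrative usage with made-up values and in-memory stand-ins for hardware.
executed, escalated = [], []
route = control_step(
    read_sensor=lambda: 71.5,
    decide=lambda reading: ("reduce_feed_rate", 0.95),
    actuate=executed.append,
    ask_operator=lambda reading, action: escalated.append((reading, action)),
)
```

Keeping the sensor, model, actuator and operator behind separate callbacks makes the virtual and physical parts of the loop easy to replace independently, for example with simulated hardware during testing.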

The challenges for ML applications in production engineering result from the encounter of process, data and ML model characteristics

In particular, production data comprises a variety of modalities, semantics and quality levels. [7] Furthermore, production systems are dynamic, uncertain and complex, [7] and engineering and manufacturing problems are data-rich but information-sparse. [8] In addition, due to the variety of use cases and data characteristics, problem-specific data sets are required. These are difficult to acquire, which hinders both practitioners and academic researchers in this domain. [9]

Process and Industry Characteristics

Production engineering can be considered a rather conservative industry when it comes to adopting advanced technology and integrating it into existing processes. This is due to the high reliability demands placed on production systems: reduced process effectiveness, e.g. through additional unplanned downtime or insufficient product quality, can cause high economic harm. In addition, the specifics of machining equipment and products prevent area-wide adoption across a variety of processes. Besides these technical reasons, the reluctant adoption of ML is fueled by a lack of IT and data science expertise across the domain. [4]

Data Characteristics

The data collected in production processes mainly stem from frequently sampled sensors that estimate the state of a product, a process, or the environment in the real world. Sensor readings are susceptible to noise and represent only an estimate of reality under uncertainty. Production data typically comprises multiple distributed data sources, resulting in various data modalities (e.g., images from visual quality control systems, time-series sensor readings, or cross-sectional job and product information). Inconsistencies in data acquisition lead to low signal-to-noise ratios, low data quality and great effort in data integration, cleaning and management. In addition, as a result of mechanical and chemical wear of production equipment, process data is subject to various forms of data drift.
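One simple way to make the notion of data drift concrete is to compare a recent window of sensor readings against a reference window. The sketch below is a minimal illustration, not a method from the cited sources: the function names, the threshold of 3 and the readings are assumptions made up for this example, and only a shift in the mean is checked.

```python
from statistics import mean, stdev


def mean_shift_score(reference, recent):
    """Distance of the recent mean from the reference mean, in standard errors."""
    ref_std = stdev(reference)  # sample standard deviation of the reference window
    if ref_std == 0:
        raise ValueError("reference window has zero variance")
    standard_error = ref_std / len(recent) ** 0.5
    return abs(mean(recent) - mean(reference)) / standard_error


def has_drifted(reference, recent, threshold=3.0):
    """Flag drift when the mean shift exceeds the (assumed) threshold."""
    return mean_shift_score(reference, recent) > threshold


# Made-up readings: a reference window and two later windows.
reference = [20.1, 19.9, 20.0, 20.2, 19.8, 20.0, 20.1, 19.9]
stable = [20.0, 20.1, 19.9, 20.0]
worn = [20.6, 20.7, 20.5, 20.8]

stable_flag = has_drifted(reference, stable)
worn_flag = has_drifted(reference, worn)
```

A mean-shift check of this kind misses changes in variance or in the shape of the distribution, so it should be read as a starting point rather than a complete drift monitor.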

Machine Learning Model Characteristics

ML models are considered black-box systems because of their complexity and the opacity of their input-output relation. This reduces the comprehensibility of the system behavior and thus also its acceptance by plant operators. Due to the lack of transparency and the stochasticity of these models, no deterministic proof of functional correctness can be achieved, which complicates the certification of production equipment. Because their prediction behavior is inherently unrestricted, ML models are vulnerable to erroneous or manipulated data, and this lack of robustness and safety further puts the reliability of the production system at risk. In addition to high development and deployment costs, data drift causes high maintenance costs, which is a disadvantage compared to purely deterministic programs.
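A common mitigation for the vulnerability to erroneous data is to check inputs before they reach the model. The following sketch assumes hypothetical feature names and value ranges (`spindle_temp_c`, `vibration_mm_s`) and a model that is any callable taking a dictionary; none of these come from the cited sources.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Range:
    low: float
    high: float

    def contains(self, value):
        return self.low <= value <= self.high


# Hypothetical value ranges, e.g. as observed in the training data.
TRAINING_RANGES = {
    "spindle_temp_c": Range(15.0, 90.0),
    "vibration_mm_s": Range(0.0, 12.0),
}


def out_of_range_features(sample, ranges):
    """Return the names of features that are missing, non-finite or out of range."""
    problems = []
    for name, allowed in ranges.items():
        value = sample.get(name)
        if value is None or not math.isfinite(value) or not allowed.contains(value):
            problems.append(name)
    return problems


def guarded_predict(model, sample, ranges):
    """Call the model only for plausible inputs; otherwise return the problems."""
    problems = out_of_range_features(sample, ranges)
    if problems:
        return None, problems
    return model(sample), []


# Illustrative usage with a stand-in model.
model = lambda sample: "ok"
good = guarded_predict(model, {"spindle_temp_c": 45.0, "vibration_mm_s": 3.2}, TRAINING_RANGES)
bad = guarded_predict(model, {"spindle_temp_c": 45.0, "vibration_mm_s": 250.0}, TRAINING_RANGES)
```

A range check only catches implausible values in single features. It does not detect manipulated data that stays within the expected ranges.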

Standard processes for data science in production

The development of ML applications, from the identification and selection of the use case to the deployment and maintenance of the application, follows dedicated phases that can be organized in standard process models. The process models help to structure the development process and to define the requirements that must be met in each phase before the next phase is entered. The standard processes can be classified into generic and domain-specific ones. Generic standard processes (e.g., CRISP-DM, ASUM-DM, KDD, SEMMA, or Team Data Science Process) describe a generally valid methodology and are thus independent of individual domains. [10] Domain-specific processes, on the other hand, consider the peculiarities and challenges of particular application areas.
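The idea of phases with entry requirements can be sketched as a simple gate check. The phase names below follow CRISP-DM, while the gate criteria and the function `advance` are hypothetical and chosen only for illustration. The sketch is also strictly linear, whereas CRISP-DM itself allows iteration between phases.

```python
# The six CRISP-DM phases, in their nominal order.
PHASES = [
    "business understanding",
    "data understanding",
    "data preparation",
    "modeling",
    "evaluation",
    "deployment",
]

# Hypothetical exit criteria per phase; real projects would define their own.
GATES = {
    "business understanding": {"use case selected"},
    "data understanding": {"data sources identified"},
    "data preparation": {"training set assembled"},
    "modeling": {"model trained"},
    "evaluation": {"acceptance criteria met"},
    "deployment": set(),
}


def advance(current, completed):
    """Return (phase, missing): move on only if every gate criterion is met."""
    missing = GATES[current] - set(completed)
    if missing:
        return current, missing  # stay in the current phase
    position = PHASES.index(current)
    if position == len(PHASES) - 1:
        return current, set()  # deployment is the last phase
    return PHASES[position + 1], set()


# Illustrative usage.
moved = advance("modeling", {"model trained"})
blocked = advance("modeling", set())
```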

The Machine Learning Pipeline in Production is a domain-specific data science methodology that is inspired by the CRISP-DM model and was specifically designed to be applied in fields of engineering and production technology. [11] To address the core challenges of ML in engineering (process, data, and model characteristics), the methodology focuses especially on use-case assessment, achieving a common data and process understanding, data integration, data preprocessing of real-world production data, and the deployment and certification of real-world ML applications.

Machine Learning Pipeline in Production

Industrial data sources

Most artificial intelligence and machine learning applications in industrial settings are founded on comprehensive datasets from the respective fields. These datasets act as the basis for training the employed models. [7] In other domains, such as computer vision, speech recognition or language models, extensive reference datasets (e.g. ImageNet, Librispeech, [12] The People's Speech) and data scraped from the open internet [13] are frequently used for this purpose. Such datasets rarely exist in the industrial context because of high confidentiality requirements [9] and the high specificity of the data. Industrial applications of artificial intelligence are therefore often faced with the problem of data availability. [9]

For these reasons, existing open datasets applicable to industrial applications often originate from public institutions such as governmental agencies or universities, or from data analysis competitions hosted by companies. In addition, data sharing platforms exist. However, most of these platforms have no industrial focus and offer limited filtering abilities regarding industrial data sources.

References

  1. "Reducing downtime using AI in Oil and Gas". Tech27.
  2. Sallomi, Paul. "Artificial Intelligence Goes Mainstream". WSJ. The Wall Street Journal - CIO Journal - Deloitte. Retrieved 9 May 2017.
  3. Schatsky, David; Muraskin, Craig; Gurumurthy, Ragu. "Cognitive technologies: The real opportunities for business". Deloitte Review.
  4. Krauß, J.; Hülsmann, T.; Leyendecker, L.; Schmitt, R. H. (2023). "Application Areas, Use Cases, and Data Sets for Machine Learning and Artificial Intelligence in Production". In Liewald, Mathias; Verl, Alexander; Bauernhansl, Thomas; Möhring, Hans-Christian (eds.). Production at the Leading Edge of Technology. Lecture Notes in Production Engineering. Cham: Springer International Publishing. pp. 504–513. doi:10.1007/978-3-031-18318-8_51. ISBN 978-3-031-18318-8.
  5. "What Does Collaborative Robot Mean?". Retrieved 9 May 2017.
  6. Monostori, L.; Kádár, B.; Bauernhansl, T.; Kondoh, S.; Kumara, S.; Reinhart, G.; Sauer, O.; Schuh, G.; Sihn, W.; Ueda, K. (2016-01-01). "Cyber-physical systems in manufacturing". CIRP Annals. 65 (2): 621–641. doi:10.1016/j.cirp.2016.06.005. ISSN 0007-8506.
  7. Wuest, Thorsten; Weimer, Daniel; Irgens, Christopher; Thoben, Klaus-Dieter (January 2016). "Machine learning in manufacturing: advantages, challenges, and applications". Production & Manufacturing Research. 4 (1): 23–45. doi:10.1080/21693277.2016.1192517. ISSN 2169-3277. S2CID 52037185.
  8. Lu, Stephen C-Y. (1990-01-01). "Machine learning approaches to knowledge synthesis and integration tasks for advanced engineering automation". Computers in Industry. 15 (1): 105–120. doi:10.1016/0166-3615(90)90088-7. ISSN 0166-3615.
  9. Jourdan, Nicolas; Longard, Lukas; Biegel, Tobias; Metternich, Joachim (2021). "Machine Learning For Intelligent Maintenance And Quality Control: A Review Of Existing Datasets And Corresponding Use Cases". doi:10.15488/11280.
  10. Azavedo, Ana (2008). "KDD, SEMMA and CRISP-DM: a parallel overview". IADIS European Conf. Data Mining. S2CID 15309704.
  11. Krauß, Jonathan; Dorißen, Jonas; Mende, Hendrik; Frye, Maik; Schmitt, Robert H. (2019). "Machine Learning and Artificial Intelligence in Production: Application Areas and Publicly Available Data Sets: Maschinelles Lernen und Künstliche Intelligenz in der Produktion: Anwendungsgebiete und öffentlich zugängliche Datensätze". In Wulfsberg, Jens Peter; Hintze, Wolfgang; Behrens, Bernd-Arno (eds.). Production at the leading edge of technology. Berlin, Heidelberg: Springer. pp. 493–501. doi:10.1007/978-3-662-60417-5_49. ISBN 978-3-662-60417-5. S2CID 213777444.
  12. Panayotov, Vassil; Chen, Guoguo; Povey, Daniel; Khudanpur, Sanjeev (2015). "Librispeech: An ASR corpus based on public domain audio books". 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). pp. 5206–5210. doi:10.1109/icassp.2015.7178964. ISBN 978-1-4673-6997-8. S2CID 2191379. Retrieved 2023-10-18.
  13. OpenAI (2023). "GPT-4 Technical Report". arXiv:2303.08774 [cs.CL].