Amazon SageMaker

Developer(s)	Amazon, Amazon Web Services
Initial release	29 November 2017
Type	Software as a service
Website	aws.amazon.com/sagemaker

Last updated April 30, 2024

Amazon SageMaker is a cloud-based machine-learning platform that allows developers to create, train, and deploy machine-learning (ML) models in the cloud.^[1] It can also be used to deploy ML models on embedded systems and edge devices.^[2]^[3] The platform was launched in November 2017.^[4]

Capabilities

SageMaker lets developers work at several levels of abstraction when training and deploying machine learning models. At the highest level, it provides pre-trained ML models that can be deployed as-is.^[5] It also offers a number of built-in ML algorithms that developers can train on their own data.^[6]^[7]
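As a rough illustration of training a built-in algorithm on a developer's own data, the following Python sketch assembles the request that the boto3 SDK's `create_training_job` call accepts. It is a minimal sketch, not Amazon's reference code: the job name, bucket paths, role ARN, container image URI, instance type, and the hyperparameter names `k` and `feature_dim` are all placeholder assumptions, and the real image URI depends on the chosen algorithm and region.

```python
def build_training_job_request(job_name, image_uri, role_arn,
                               train_s3_uri, output_s3_uri, hyperparameters):
    """Assemble keyword arguments for the SageMaker CreateTrainingJob API."""
    return {
        "TrainingJobName": job_name,
        "AlgorithmSpecification": {
            # Container image of the algorithm (placeholder value below).
            "TrainingImage": image_uri,
            "TrainingInputMode": "File",
        },
        "RoleArn": role_arn,
        # The API expects every hyperparameter value as a string.
        "HyperParameters": {k: str(v) for k, v in hyperparameters.items()},
        "InputDataConfig": [
            {
                "ChannelName": "train",
                "DataSource": {
                    "S3DataSource": {
                        "S3DataType": "S3Prefix",
                        "S3Uri": train_s3_uri,
                        "S3DataDistributionType": "FullyReplicated",
                    }
                },
                "ContentType": "text/csv",
            }
        ],
        "OutputDataConfig": {"S3OutputPath": output_s3_uri},
        "ResourceConfig": {
            "InstanceType": "ml.m5.xlarge",  # assumed instance type
            "InstanceCount": 1,
            "VolumeSizeInGB": 10,
        },
        "StoppingCondition": {"MaxRuntimeInSeconds": 3600},
    }


# All values below are hypothetical placeholders.
request = build_training_job_request(
    job_name="example-training-job",
    image_uri="<account>.dkr.ecr.<region>.amazonaws.com/<algorithm>:<tag>",
    role_arn="arn:aws:iam::123456789012:role/ExampleSageMakerRole",
    train_s3_uri="s3://example-bucket/train/",
    output_s3_uri="s3://example-bucket/output/",
    hyperparameters={"k": 10, "feature_dim": 784},
)

# With boto3 installed and AWS credentials configured, the request
# could be submitted like this:
#   import boto3
#   boto3.client("sagemaker").create_training_job(**request)
```

Building the request as a plain dictionary keeps the sketch runnable without an AWS account; only the commented-out final call contacts the service.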

The platform also features managed instances of TensorFlow and Apache MXNet, where developers can create their own ML algorithms from scratch.^[8] Regardless of which level of abstraction is used, a developer can connect their SageMaker-enabled ML models to other AWS services, such as the Amazon DynamoDB database for structured data storage,^[9] AWS Batch for offline batch processing,^[9]^[10] or Amazon Kinesis for real-time processing.^[11]

Development interfaces

A number of interfaces are available for developers to interact with SageMaker. First, there is a web API that remotely controls a SageMaker server instance.^[12] While the web API is agnostic to the programming language used by the developer, Amazon provides SageMaker API bindings for a number of languages, including Python, JavaScript, Ruby, Java, and Go.^[13]^[14] In addition, SageMaker provides managed Jupyter Notebook instances for interactively programming SageMaker and other applications.^[15]^[16]
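The Python binding can be sketched as follows for the common case of sending data to a deployed model. This is an assumption-laden example rather than an official one: the endpoint name, the CSV input format, and the helper names `rows_to_csv`, `predict`, and `FakeRuntimeClient` are hypothetical. The `invoke_endpoint` call and its `EndpointName`, `ContentType`, and `Body` parameters are those of the boto3 `sagemaker-runtime` client; a small stand-in client is used here so the sketch runs without AWS credentials.

```python
import io


def rows_to_csv(rows):
    """Serialize rows of numeric features into a CSV request body."""
    text = "\n".join(",".join(str(value) for value in row) for row in rows)
    return text.encode("utf-8")


def predict(runtime_client, endpoint_name, rows):
    """Send rows to a deployed endpoint and return the raw response text."""
    response = runtime_client.invoke_endpoint(
        EndpointName=endpoint_name,
        ContentType="text/csv",
        Body=rows_to_csv(rows),
    )
    # The response body is a stream that has to be read and decoded.
    return response["Body"].read().decode("utf-8")


class FakeRuntimeClient:
    """Offline stand-in that records the call and returns a canned body."""

    def __init__(self):
        self.calls = []

    def invoke_endpoint(self, **kwargs):
        self.calls.append(kwargs)
        return {"Body": io.BytesIO(b"0.12\n0.87")}


client = FakeRuntimeClient()
result = predict(client, "example-endpoint", [[5.1, 3.5], [6.2, 2.9]])
print(result)
```

Against a real deployment, `client` would instead be created with `boto3.client("sagemaker-runtime")`, and the format of the returned text would depend on the model behind the endpoint.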

History and features

2017-11-29: SageMaker is launched at the AWS re:Invent conference.^[4]^[6]^[1]
2018-02-27: Managed TensorFlow and MXNet deep neural network training and inference are supported within SageMaker.^[17]^[8]
2018-02-28: SageMaker automatically scales model inference to multiple server instances.^[18]^[19]
2018-07-13: Support is added for recurrent neural network training, word2vec training, multi-class linear learner training, and distributed deep neural network training in Chainer with Layer-wise Adaptive Rate Scaling (LARS).^[20]^[7]
2018-07-17: Batch Transform enables high-throughput, non-real-time machine learning inference in SageMaker.^[21]^[22]
2018-11-08: Support is added for training and inference with the Object2Vec embedding algorithm.^[23]^[24]
2018-11-27: SageMaker Ground Truth "makes it much easier for developers to label their data using human annotators through Mechanical Turk, third-party vendors, or their own employees."^[25]^[2]
2018-11-28: SageMaker Reinforcement Learning (RL) "enables developers and data scientists to quickly and easily develop reinforcement learning models at scale."^[26]^[2]
2018-11-28: SageMaker Neo enables deep neural network models to be deployed from SageMaker to edge-devices such as smartphones and smart cameras.^[27]^[2]
2018-11-29: The AWS Marketplace for SageMaker is launched. It enables third-party developers to buy and sell machine learning models that can be trained and deployed in SageMaker.^[28]
2019-01-27: SageMaker Neo is released as open-source software.^[29]
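
The automatic scaling of model inference noted in the 2018-02-28 entry is configured through the AWS Application Auto Scaling service. The Python sketch below builds the two requests that the boto3 `application-autoscaling` client's `register_scalable_target` and `put_scaling_policy` calls accept. The endpoint name, variant name, capacity limits, target value, and generated policy name are illustrative assumptions, and the helper `build_autoscaling_requests` is hypothetical.

```python
def build_autoscaling_requests(endpoint_name, variant_name,
                               min_instances, max_instances,
                               invocations_per_instance):
    """Build Application Auto Scaling requests for one endpoint variant."""
    if min_instances > max_instances:
        raise ValueError("min_instances must not exceed max_instances")

    # Fields shared by both requests; they identify the endpoint variant.
    common = {
        "ServiceNamespace": "sagemaker",
        "ResourceId": f"endpoint/{endpoint_name}/variant/{variant_name}",
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
    }
    # Request that registers the variant with its instance-count limits.
    target = dict(common, MinCapacity=min_instances, MaxCapacity=max_instances)
    # Target-tracking policy: keep invocations per instance near the target.
    policy = dict(
        common,
        PolicyName=f"{endpoint_name}-{variant_name}-target-tracking",
        PolicyType="TargetTrackingScaling",
        TargetTrackingScalingPolicyConfiguration={
            "TargetValue": float(invocations_per_instance),
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
            },
        },
    )
    return target, policy


# Hypothetical endpoint and limits, for illustration only.
target, policy = build_autoscaling_requests(
    endpoint_name="example-endpoint",
    variant_name="AllTraffic",
    min_instances=1,
    max_instances=4,
    invocations_per_instance=100,
)

# With boto3 installed and AWS credentials configured:
#   import boto3
#   autoscaling = boto3.client("application-autoscaling")
#   autoscaling.register_scalable_target(**target)
#   autoscaling.put_scaling_policy(**policy)
```

A target-tracking policy is only one of the available policy types; it is used here because it needs a single target value rather than hand-written scaling steps.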

Notable customers

NASCAR is using SageMaker to train deep neural networks on 70 years of video data.^[30]
Carsales.com uses SageMaker to train and deploy machine learning models to analyze and approve automotive classified ad listings.^[31]
Avis Budget Group and Slalom Consulting are using SageMaker to develop "a practical on-site solution that could address the over and under utilization of cars in real-time using an optimization engine built in Amazon SageMaker."^[32]
Volkswagen Group uses SageMaker to develop and deploy machine learning in its manufacturing plants.^[33]
Peak and Footasylum use SageMaker in a recommendation engine for footwear.^[34]

Awards

In 2019, CIOL named SageMaker one of the "5 Best Machine Learning Platforms For Developers," alongside IBM Watson, Microsoft Azure Machine Learning, Apache PredictionIO, and AiONE.^[35]

References

[1] Woodie, Alex (2017-11-29). "AWS Takes the 'Muck' Out of ML with SageMaker". datanami. Retrieved 2019-06-09.
[2] Rodriguez, Jesus (2018-11-30). "With These New Additions, AWS SageMaker is Starting to Look More Real for Data Scientists". Towards Data Science. Retrieved 2019-06-09. [permanent dead link]
[3] Terdiman, Daniel (2018-10-05). "How AI is helping Amazon become a trillion-dollar company". Fast Company. Retrieved 2019-06-09.
[4] Miller, Ron (2017-11-29). "AWS releases SageMaker to make it easier to build and deploy machine learning models". TechCrunch. Retrieved 2019-06-09.
[5] Ponnapalli, Priya (2019-01-30). "Deploy trained Keras or TensorFlow models using Amazon SageMaker". AWS. Retrieved 2019-06-09.
[6] "Introducing Amazon SageMaker". AWS. 2017-11-29. Retrieved 2019-06-09.
[7] Nagel, Becky (2018-07-16). "Amazon Updates SageMaker ML Platform Algorithms, Frameworks". Pure AI. Retrieved 2019-06-09.
[8] Roumeliotis, Rachel (2018-03-07). "How to jump start your deep learning skills using Apache MXNet". O'Reilly. Retrieved 2019-06-09.
[9] Marquez, Ernesto. "Evaluate when to use added AWS Step Functions actions". TechTarget. Retrieved 2019-06-09.
[10] "AWS Step Functions Adds Eight More Service Integrations". AWS. 2018-11-29. Retrieved 2019-06-09.
[11] "Deploy Amazon SageMaker and a Data Lake on AWS for Predictive Data Science with New Quick Start". AWS. 2018-08-15. Retrieved 2019-06-09.
[12] Olsen, Rumi (2018-07-19). "Call an Amazon SageMaker model endpoint using Amazon API Gateway and AWS Lambda". AWS. Retrieved 2019-06-09.
[13] "Amazon SageMaker developer resources". AWS. Retrieved 2019-06-09.
[14] Wiggers, Kyle (2018-11-21). "Amazon updates SageMaker with new built-in algorithms and Git integration". Retrieved 2019-06-09.
[15] "Use Notebook Instances". AWS. Retrieved 2019-06-09.
[16] Gift, Noah (2018-08-17). "Here Come The Notebooks". Forbes. Retrieved 2019-06-09.
[17] "Amazon SageMaker now supports TensorFlow 1.5, Apache MXNet 1.0, and CUDA 9 for P3 Instance Optimization". AWS. 2018-02-27. Retrieved 2019-06-09.
[18] "Auto Scaling in Amazon SageMaker is now Available". AWS. 2018-02-28. Retrieved 2019-06-09.
[19] "Amazon Sagemaker Now Uses Auto-scaling". Polar Seven. 2018-03-24. Retrieved 2019-06-09.
[20] "Amazon SageMaker Announces Several Enhancements to Built-in Algorithms and Frameworks". AWS. 2018-07-13. Retrieved 2019-06-09.
[21] "Amazon SageMaker Now Supports High Throughput Batch Transform Jobs for Non-Real Time Inferencing". AWS. 2018-07-17. Retrieved 2019-06-09.
[22] Simon, Julien (2019-01-24). "Making the most of your Machine Learning budget on Amazon SageMaker". Medium. Retrieved 2019-06-09.
[23] "Introduction to Amazon SageMaker Object2Vec". AWS. 2018-11-08. Retrieved 2019-06-09.
[24] "Amazon SageMaker Now Supports Object2Vec and IP Insights Built-in Algorithms". AWS. 2018-11-19. Retrieved 2019-06-09.
[25] "Introducing Amazon SageMaker Ground Truth - Build Highly Accurate Training Datasets Using Machine Learning". AWS. 2018-11-28. Retrieved 2019-06-09.
[26] "Introducing Reinforcement Learning Support with Amazon SageMaker RL". AWS. 2018-11-28. Retrieved 2019-06-09.
[27] "Introducing Amazon SageMaker Neo - Train Once, Run Anywhere with up to 2x in Performance Improvement". AWS. 2018-11-28. Retrieved 2019-06-09.
[28] Robuck, Mike (2018-11-29). "AWS goes deep and wide with machine learning services and capabilities". FierceTelecom. Retrieved 2019-06-09.
[29] Janakiram, MSV (2019-01-27). "Amazon Open Sources SageMaker Neo To Run Machine Learning Models At The Edge". Forbes. Retrieved 2019-06-09.
[30] Digman, Larry (2019-06-04). "NASCAR to migrate 18 petabytes of video archives to AWS". ZDNet. Retrieved 2019-06-09.
[31] Crozier, Ry (2019-05-02). "Carsales builds Tessa AI to check vehicle ads". IT News. Retrieved 2019-06-09.
[32] "Avis Budget Group and Slalom Further Digitize the Car Rental Process with Machine Learning on AWS". AWS. 2019-05-31. Retrieved 2019-06-09.
[33] "Volkswagen and AWS Join Forces to Transform Automotive Manufacturing". Metrology News. 2019-05-24. Archived from the original on 2020-10-28. Retrieved 2019-06-09.
[34] Mari, Angelica (2019-05-14). "Footasylum steps up artificial intelligence to drive customer centricity". Computer Weekly. Retrieved 2019-06-09.
[35] Pandey, Ashok (2019-02-21). "5 Best Machine Learning Platforms For Developers". CIOL. Retrieved 2019-06-09.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
