The following outline is provided as an overview of and topical guide to machine learning:
Machine learning – a subfield of soft computing within computer science that evolved from the study of pattern recognition and computational learning theory in artificial intelligence.[1] In 1959, Arthur Samuel defined machine learning as a "field of study that gives computers the ability to learn without being explicitly programmed".[2] Machine learning involves the study and construction of algorithms that can learn from and make predictions on data.[3] These algorithms operate by building a model from an example training set of input observations to make data-driven predictions or decisions expressed as outputs, rather than following strictly static program instructions.
Decision tree algorithm
Machine learning projects
Machine learning organizations
Supervised learning (SL) is a paradigm in machine learning where input objects and a desired output value train a model. The training data is processed, building a function that maps new data to expected output values. An optimal scenario will allow the algorithm to correctly determine output values for unseen instances. This requires the learning algorithm to generalize from the training data to unseen situations in a "reasonable" way. This statistical quality of an algorithm is measured through the so-called generalization error.
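As a brief illustration of this paradigm, the following sketch fits a model on labelled training pairs and estimates its generalization error on held-out data; it assumes scikit-learn, which the outline itself does not name.

```python
# Minimal supervised-learning sketch (scikit-learn assumed): fit on labelled
# training pairs, then estimate generalization error on unseen instances.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)                       # input objects and desired outputs
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)                # hold back unseen instances

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)   # build the mapping function
test_error = 1.0 - accuracy_score(y_test, model.predict(X_test))  # proxy for generalization error
print(f"estimated generalization error: {test_error:.3f}")
```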
In machine learning, a neural network is a model inspired by the structure and function of biological neural networks in animal brains.
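A minimal sketch of the layered structure such a model mimics, using nothing beyond NumPy; the weights below are random placeholders rather than a trained network.

```python
# Toy forward pass through two layers of connected units; weights are random
# placeholders, only the layered structure is the point of the example.
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=4)                           # one input vector (4 features)
W1, b1 = rng.normal(size=(8, 4)), np.zeros(8)    # hidden layer: 8 units
W2, b2 = rng.normal(size=(3, 8)), np.zeros(3)    # output layer: 3 units

h = np.maximum(0.0, W1 @ x + b1)                 # ReLU activation of the hidden units
out = W2 @ h + b2                                # raw output scores
print(out)
```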
The following outline is provided as an overview of and topical guide to statistics:
In machine learning, support vector machines (SVMs) are supervised max-margin models with associated learning algorithms that analyze data for classification and regression analysis. Developed at AT&T Bell Laboratories by Vladimir Vapnik and colleagues, SVMs are among the most studied models, being grounded in the statistical learning framework of VC theory proposed by Vapnik and Chervonenkis (1974).
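A small, hedged example of a max-margin classifier, assuming scikit-learn's SVC: the support vectors it reports are the training points that determine the margin.

```python
# Linear SVM on a toy two-class problem (scikit-learn assumed).
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

X, y = make_blobs(n_samples=60, centers=2, random_state=0)
clf = SVC(kernel="linear", C=1.0).fit(X, y)
print("number of support vectors:", clf.support_vectors_.shape[0])
print("prediction for one point:", clf.predict(X[:1]))
```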
Pattern recognition is the task of assigning a class to an observation based on patterns extracted from data. Although the terms are similar, pattern recognition (PR) is not to be confused with pattern machines (PM), which may possess PR capabilities but whose primary function is to distinguish and create emergent patterns. PR has applications in statistical data analysis, signal processing, image analysis, information retrieval, bioinformatics, data compression, computer graphics and machine learning. Pattern recognition has its origins in statistics and engineering; some modern approaches to pattern recognition include the use of machine learning, due to the increased availability of big data and a new abundance of processing power.
Binary classification is the task of classifying the elements of a set into one of two groups. Typical binary classification problems include medical testing to decide whether a patient has a certain disease, quality control in industry to decide whether a specification has been met, and spam filtering to decide whether an email is spam or not.
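A minimal binary-classification sketch, again assuming scikit-learn, in which each record is assigned to one of exactly two groups.

```python
# Two-group example: classifying tumour records as malignant or benign with a
# scaled logistic-regression pipeline (scikit-learn assumed).
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression

X, y = load_breast_cancer(return_X_y=True)         # labels take only two values, 0 or 1
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
clf = make_pipeline(StandardScaler(), LogisticRegression()).fit(X_train, y_train)
print("held-out accuracy:", round(clf.score(X_test, y_test), 3))
```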
Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of statistical algorithms that can learn from data and generalize to unseen data, and thus perform tasks without explicit instructions. Rapid progress in the field of deep learning, beginning in the 2010s, has allowed neural networks to surpass many previous approaches in performance.
In mathematics, a time series is a series of data points indexed in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. Examples of time series are heights of ocean tides, counts of sunspots, and the daily closing value of the Dow Jones Industrial Average.
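As a concrete sketch (NumPy assumed, values synthetic), a daily series sampled at equally spaced points in time and smoothed with a seven-day moving average:

```python
# A synthetic discrete-time series: 30 equally spaced daily observations with
# a weekly cycle and an upward trend, smoothed by a 7-day moving average.
import numpy as np

t = np.arange(30)                                   # 30 successive, equally spaced days
values = np.sin(2 * np.pi * t / 7) + 0.1 * t        # weekly cycle plus trend
window = 7
moving_avg = np.convolve(values, np.ones(window) / window, mode="valid")
print(moving_avg.round(2))
```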
When classification is performed by a computer, statistical methods are normally used to develop the algorithm.
In machine learning, kernel machines are a class of algorithms for pattern analysis, whose best known member is the support-vector machine (SVM). These methods involve using linear classifiers to solve nonlinear problems. The general task of pattern analysis is to find and study general types of relations in datasets. For many algorithms that solve these tasks, the data in raw representation have to be explicitly transformed into feature vector representations via a user-specified feature map; in contrast, kernel methods require only a user-specified kernel, i.e., a similarity function over all pairs of data points computed using inner products. Although the implicit feature map can be infinite-dimensional, the representer theorem guarantees that only a finite-dimensional matrix built from the training data is needed. Kernel machines are slow to compute for datasets larger than a couple of thousand examples without parallel processing, since the kernel must be evaluated over every pair of training points.
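The kernel idea can be sketched as follows, assuming scikit-learn and a hand-rolled RBF similarity: the kernel is evaluated over all pairs of training points to form a Gram matrix, which is then handed to an SVM with a precomputed kernel.

```python
# User-specified kernel: an RBF similarity over all pairs of points forms an
# n-by-n Gram matrix, which a linear max-margin classifier then consumes.
import numpy as np
from sklearn.datasets import make_moons
from sklearn.svm import SVC

X, y = make_moons(n_samples=200, noise=0.1, random_state=0)

def rbf_kernel(A, B, gamma=2.0):
    # k(a, b) = exp(-gamma * ||a - b||^2) for every pair (a, b)
    sq_dists = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * sq_dists)

K = rbf_kernel(X, X)                        # Gram matrix over the training set
clf = SVC(kernel="precomputed").fit(K, y)   # linear classifier in the implicit feature space
print("training accuracy:", clf.score(rbf_kernel(X, X), y))
```

The Gram matrix grows quadratically with the number of examples, which is the reason for the slowness noted above on larger datasets.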
The following outline is provided as an overview of and topical guide to artificial intelligence:
In statistics and machine learning, ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained from any of the constituent learning algorithms alone. Unlike a statistical ensemble in statistical mechanics, which is usually infinite, a machine learning ensemble consists of only a concrete finite set of alternative models, but typically allows for much more flexible structure to exist among those alternatives.
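A hedged sketch of the idea, assuming scikit-learn: three constituent learning algorithms are combined by majority vote into one concrete, finite ensemble.

```python
# Ensemble of three different learning algorithms combined by majority vote.
from sklearn.datasets import make_classification
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, random_state=0)
ensemble = VotingClassifier([
    ("lr", LogisticRegression(max_iter=1000)),
    ("nb", GaussianNB()),
    ("tree", DecisionTreeClassifier(max_depth=3)),
])
print("ensemble CV accuracy:", round(cross_val_score(ensemble, X, y, cv=5).mean(), 3))
```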
Structured prediction or structured (output) learning is an umbrella term for supervised machine learning techniques that involve predicting structured objects, rather than scalar discrete or real values.
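As a toy illustration (pure NumPy, with made-up scores rather than learned parameters), the output below is an entire tag sequence decoded jointly with the Viterbi algorithm, instead of a single scalar label.

```python
# Structured output: the prediction is a whole tag sequence for "the dog barks",
# decoded jointly with Viterbi. Scores are illustrative, not learned.
import numpy as np

tags = ["DET", "NOUN", "VERB"]
# emission[i][t]: score of tag t for word i
emission = np.array([[2.0, 0.1, 0.1],    # "the"
                     [0.1, 2.0, 0.5],    # "dog"
                     [0.1, 0.4, 2.0]])   # "barks"
# transition[s][t]: score of moving from tag s to tag t
transition = np.array([[0.0, 2.0, 0.1],
                       [0.1, 0.3, 2.0],
                       [0.5, 1.0, 0.2]])

n_words, n_tags = emission.shape
score = emission[0].copy()
back = np.zeros((n_words, n_tags), dtype=int)
for i in range(1, n_words):
    cand = score[:, None] + transition + emission[i]   # all (previous tag, current tag) pairs
    back[i] = cand.argmax(axis=0)
    score = cand.max(axis=0)

# Follow back-pointers to recover the best whole sequence.
best = [int(score.argmax())]
for i in range(n_words - 1, 0, -1):
    best.append(int(back[i][best[-1]]))
print([tags[t] for t in reversed(best)])   # expected: ['DET', 'NOUN', 'VERB']
```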
There are many types of artificial neural networks (ANNs).
mlpack is a free, open-source, header-only software library for machine learning and artificial intelligence written in C++, built on top of the Armadillo library and the ensmallen numerical optimization library. mlpack emphasizes scalability, speed, and ease of use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and maximum flexibility for expert users. mlpack also has a lightweight deployment infrastructure with minimal dependencies, making it well suited for embedded systems and low-resource devices. Its intended target users are scientists and engineers.
JASP is a free and open-source program for statistical analysis supported by the University of Amsterdam. It is designed to be easy to use and familiar to users of SPSS. It offers standard analysis procedures in both their classical and Bayesian forms. JASP generally produces APA-style results tables and plots to ease publication. It promotes open science via integration with the Open Science Framework, and reproducibility by integrating the analysis settings into the results. The development of JASP is financially supported by several universities and research funds. Because the JASP GUI is developed in C++ using the Qt framework, part of the team left to create a notable fork, Jamovi, whose GUI is developed in JavaScript and HTML5.
This glossary of artificial intelligence is a list of definitions of terms and concepts relevant to the study of artificial intelligence, its sub-disciplines, and related fields. Related glossaries include Glossary of computer science, Glossary of robotics, and Glossary of machine vision.
Machine learning in bioinformatics is the application of machine learning algorithms to bioinformatics, including genomics, proteomics, microarrays, systems biology, evolution, and text mining.