Rule induction is an area of machine learning in which formal rules are extracted from a set of observations. The rules extracted may represent a full scientific model of the data, or merely represent local patterns in the data.
Data mining in general, and rule induction in particular, aim to create algorithms without explicit human programming by analyzing existing data.[1]: 415 In the simplest case, rules are expressed as if-then statements, such as those produced by the ID3 algorithm for decision tree learning.[2]: 7 [1]: 348 A rule learning algorithm takes training data as input and creates rules by partitioning the table, for example with cluster analysis.[2]: 7 A possible alternative to the ID3 algorithm is genetic programming, which evolves a program until it fits the data.[3]: 2
Different algorithms can be created and tested against input data in the WEKA software.[3]: 125 Additional tools include machine learning libraries for Python, such as scikit-learn.
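For illustration, the following minimal sketch uses scikit-learn's decision tree classifier and the bundled iris data set to induce a small tree whose root-to-leaf paths can be read as if-then rules; the choice of data set and tree depth is arbitrary and not taken from the cited sources.

```python
# Minimal sketch: rule induction via a decision tree in scikit-learn
# (assumes scikit-learn is installed; iris is only an example data set).
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()
X, y = iris.data, iris.target

# Fit a small tree; each root-to-leaf path corresponds to one if-then rule.
tree = DecisionTreeClassifier(max_depth=2, random_state=0)
tree.fit(X, y)

# export_text prints the learned tree, from which rules can be read off,
# e.g. "if petal width <= 0.8 then class 0".
print(export_text(tree, feature_names=iris.feature_names))
```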
Some major rule induction paradigms, several of which are described below, are decision tree learning, learning classifier systems, grammar induction, and inductive logic programming. Well-known rule induction algorithms include ID3, C4.5, and FOIL.
Natural language processing (NLP) is an interdisciplinary subfield of linguistics, computer science, and artificial intelligence concerned with the interactions between computers and human language, in particular how to program computers to process and analyze large amounts of natural language data. The goal is a computer capable of "understanding" the contents of documents, including the contextual nuances of the language within them. The technology can then accurately extract information and insights contained in the documents as well as categorize and organize the documents themselves.
Data mining is the process of extracting and discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and statistics with an overall goal of extracting information from a data set and transforming the information into a comprehensible structure for further use. Data mining is the analysis step of the "knowledge discovery in databases" process, or KDD. Aside from the raw analysis step, it also involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating.
Machine learning (ML) is a field devoted to understanding and building methods that let machines "learn" – that is, methods that leverage data to improve computer performance on some set of tasks.
Decision tree learning is a supervised learning approach used in statistics, data mining and machine learning. In this formalism, a classification or regression decision tree is used as a predictive model to draw conclusions about a set of observations.
Learning classifier systems, or LCS, are a paradigm of rule-based machine learning methods that combine a discovery component with a learning component. Learning classifier systems seek to identify a set of context-dependent rules that collectively store and apply knowledge in a piecewise manner in order to make predictions. This approach allows complex solution spaces to be broken up into smaller, simpler parts.
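As a rough illustration of the rule representation used by many learning classifier systems, the sketch below encodes rules over a ternary alphabet in which '#' acts as a wildcard ("don't care") symbol and computes the match set for a binary-coded instance; the rules, fitness values, and instance are invented for illustration, and a full LCS would additionally form action sets and update rule fitness from feedback.

```python
# Minimal, illustrative sketch of an LCS-style rule population (not a full
# system such as XCS): conditions use the ternary alphabet {0, 1, #}.
rules = [
    {"condition": "1#0", "action": 1, "fitness": 0.8},
    {"condition": "##1", "action": 0, "fitness": 0.6},
    {"condition": "10#", "action": 1, "fitness": 0.4},
]

def matches(condition, instance):
    """A rule matches when every non-wildcard symbol agrees with the instance."""
    return all(c in ("#", b) for c, b in zip(condition, instance))

def match_set(rules, instance):
    """The match set: all rules whose condition covers the given instance."""
    return [r for r in rules if matches(r["condition"], instance)]

# For instance "100" the match set contains the first and third rule.
print(match_set(rules, "100"))
```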
C4.5 is an algorithm used to generate a decision tree developed by Ross Quinlan. C4.5 is an extension of Quinlan's earlier ID3 algorithm. The decision trees generated by C4.5 can be used for classification, and for this reason, C4.5 is often referred to as a statistical classifier. In 2011, authors of the Weka machine learning software described the C4.5 algorithm as "a landmark decision tree program that is probably the machine learning workhorse most widely used in practice to date".
John Ross Quinlan is a computer science researcher in data mining and decision theory. He has contributed extensively to the development of decision tree algorithms, including inventing the canonical C4.5 and ID3 algorithms. He also contributed to early ILP literature with First Order Inductive Learner (FOIL). He is currently running the company RuleQuest Research which he founded in 1997.
Grammar induction is the process in machine learning of learning a formal grammar from a set of observations, thus constructing a model which accounts for the characteristics of the observed objects. More generally, grammatical inference is that branch of machine learning where the instance space consists of discrete combinatorial objects such as strings, trees and graphs.
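One classical starting point in grammatical inference is the prefix tree acceptor, a trie-shaped automaton that accepts exactly the positive example strings; state-merging algorithms such as RPNI then generalize from it. The sketch below builds such an acceptor from a few invented sample strings.

```python
# Minimal sketch: building a prefix tree acceptor (PTA) from positive examples.
def build_pta(samples):
    """Return (transitions, accepting) for a trie-shaped automaton."""
    transitions = {}      # (state, symbol) -> state
    accepting = set()
    next_state = 1        # state 0 is the start state
    for word in samples:
        state = 0
        for symbol in word:
            if (state, symbol) not in transitions:
                transitions[(state, symbol)] = next_state
                next_state += 1
            state = transitions[(state, symbol)]
        accepting.add(state)
    return transitions, accepting

def accepts(transitions, accepting, word):
    """Check whether the automaton accepts a string."""
    state = 0
    for symbol in word:
        if (state, symbol) not in transitions:
            return False
        state = transitions[(state, symbol)]
    return state in accepting

trans, acc = build_pta(["ab", "abb", "b"])
print(accepts(trans, acc, "abb"))   # True
print(accepts(trans, acc, "ba"))    # False
```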
Structure mining or structured data mining is the process of finding and extracting useful information from semi-structured data sets. Graph mining, sequential pattern mining and molecule mining are special cases of structured data mining.
Process mining is a family of techniques relating the fields of data science and process management to support the analysis of operational processes based on event logs. The goal of process mining is to turn event data into insights and actions. Process mining is an integral part of data science, fueled by the availability of event data and the desire to improve processes. Process mining techniques use event data to show what people, machines, and organizations are really doing. Process mining provides novel insights that can be used to identify the execution paths taken by operational processes and address their performance and compliance problems.
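A basic building block of many process mining techniques is the directly-follows relation between activities in an event log. The sketch below counts this relation for a small, made-up log; real logs would be extracted from information systems such as an ERP.

```python
# Minimal sketch: counting the "directly follows" relation in an event log.
from collections import Counter

event_log = [
    ["register", "check", "approve", "pay"],
    ["register", "check", "reject"],
    ["register", "check", "approve", "pay"],
]

def directly_follows(log):
    """Count how often activity a is immediately followed by activity b."""
    counts = Counter()
    for trace in log:
        for a, b in zip(trace, trace[1:]):
            counts[(a, b)] += 1
    return counts

for (a, b), n in directly_follows(event_log).items():
    print(f"{a} -> {b}: {n}")
```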
An incremental decision tree algorithm is an online machine learning algorithm that outputs a decision tree. Many decision tree methods, such as C4.5, construct a tree using a complete dataset. Incremental decision tree methods allow an existing tree to be updated using only new individual data instances, without having to re-process past instances. This may be useful in situations where the entire dataset is not available when the tree is updated, the original data set is too large to process or the characteristics of the data change over time.
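As an illustration of the incremental setting, the sketch below uses the Hoeffding tree classifier from the open-source river library, one well-known incremental decision tree learner; it assumes river is installed, and the toy data stream is invented for illustration.

```python
# Minimal sketch: an incremental decision tree updated one instance at a time
# (assumes the river library is installed).
from river import tree

model = tree.HoeffdingTreeClassifier()

# Instances arrive one at a time; the tree is updated without revisiting
# earlier instances.
stream = [
    ({"x1": 0.1, "x2": 1.0}, True),
    ({"x1": 0.9, "x2": 0.2}, False),
    ({"x1": 0.2, "x2": 0.8}, True),
]
for x, y in stream:
    model.learn_one(x, y)

print(model.predict_one({"x1": 0.15, "x2": 0.9}))
```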
Fraud represents a significant problem for governments and businesses, and specialized analysis techniques for discovering fraud are required. Some of these methods include knowledge discovery in databases (KDD), data mining, machine learning and statistics. They offer applicable and successful solutions in different areas of electronic fraud crimes.
Knowledge extraction is the creation of knowledge from structured and unstructured sources. The resulting knowledge needs to be in a machine-readable and machine-interpretable format and must represent knowledge in a manner that facilitates inferencing. Although it is methodically similar to information extraction (NLP) and ETL, the main criterion is that the extraction result goes beyond the creation of structured information or the transformation into a relational schema. It requires either the reuse of existing formal knowledge or the generation of a schema based on the source data.
Data science is an interdisciplinary academic field that uses statistics, scientific computing, scientific methods, processes, algorithms and systems to extract or extrapolate knowledge and insights from noisy, structured, and unstructured data.
Inductive programming (IP) is a special area of automatic programming, covering research from artificial intelligence and programming, which addresses learning of typically declarative and often recursive programs from incomplete specifications, such as input/output examples or constraints.
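The following sketch illustrates the input/output-example idea on a deliberately tiny scale: it enumerates a hand-written set of candidate programs and returns the first one consistent with all examples. Real inductive programming systems search far larger, often recursive, program spaces.

```python
# Minimal sketch: program synthesis from input/output examples by enumeration.
# The candidate programs are invented for illustration.
candidates = {
    "double": lambda x: 2 * x,
    "square": lambda x: x * x,
    "increment": lambda x: x + 1,
    "negate": lambda x: -x,
}

examples = [(1, 1), (2, 4), (3, 9)]   # (input, expected output) pairs

def synthesize(candidates, examples):
    """Return the name of the first candidate consistent with all examples."""
    for name, program in candidates.items():
        if all(program(i) == o for i, o in examples):
            return name
    return None

print(synthesize(candidates, examples))   # "square"
```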
Feature engineering or feature extraction or feature discovery is the process of using domain knowledge to extract features from raw data. The motivation is to use these extra features to improve the quality of results from a machine learning process, compared with supplying only the raw data to the machine learning process.
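As an example of the idea, the sketch below uses pandas to derive extra features (day of week, hour, a weekend flag) from a raw timestamp column; the column names and values are invented for illustration.

```python
# Minimal sketch: feature engineering on raw timestamped data with pandas.
import pandas as pd

raw = pd.DataFrame({
    "timestamp": pd.to_datetime(["2023-01-02 08:30", "2023-01-07 22:15"]),
    "amount": [12.5, 80.0],
})

# Domain knowledge: time-of-week effects often matter, so expose them
# as explicit features alongside the raw columns.
engineered = raw.assign(
    day_of_week=raw["timestamp"].dt.dayofweek,   # 0 = Monday
    hour=raw["timestamp"].dt.hour,
    is_weekend=raw["timestamp"].dt.dayofweek >= 5,
)
print(engineered)
```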
Domain driven data mining is a data mining methodology for discovering actionable knowledge and delivering actionable insights from complex data and behaviors in a complex environment. It studies the corresponding foundations, frameworks, algorithms, models, architectures, and evaluation systems for actionable knowledge discovery.
This glossary of artificial intelligence is a list of definitions of terms and concepts relevant to the study of artificial intelligence, its sub-disciplines, and related fields. Related glossaries include Glossary of computer science, Glossary of robotics, and Glossary of machine vision.
Rule-based machine learning (RBML) is a term in computer science intended to encompass any machine learning method that identifies, learns, or evolves 'rules' to store, manipulate or apply. The defining characteristic of a rule-based machine learner is the identification and utilization of a set of relational rules that collectively represent the knowledge captured by the system. This is in contrast to other machine learners that commonly identify a singular model that can be universally applied to any instance in order to make a prediction.
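A minimal illustration of such a rule set is the ordered decision list sketched below, in which each rule pairs a condition with a prediction and the first matching rule fires; the rules are hand-written for illustration rather than learned.

```python
# Minimal sketch: a rule set applied as an ordered decision list.
# The rules and thresholds are invented for illustration, not learned.
rules = [
    (lambda x: x["temperature"] > 30 and x["humidity"] > 70, "storm"),
    (lambda x: x["temperature"] > 30, "sunny"),
    (lambda x: True, "mild"),                    # default rule
]

def predict(rules, instance):
    """Return the prediction of the first rule whose condition matches."""
    for condition, label in rules:
        if condition(instance):
            return label

print(predict(rules, {"temperature": 35, "humidity": 80}))   # "storm"
print(predict(rules, {"temperature": 20, "humidity": 50}))   # "mild"
```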
The following outline is provided as an overview of and topical guide to machine learning. Machine learning is a subfield of soft computing within computer science that evolved from the study of pattern recognition and computational learning theory in artificial intelligence. In 1959, Arthur Samuel defined machine learning as a "field of study that gives computers the ability to learn without being explicitly programmed". Machine learning explores the study and construction of algorithms that can learn from and make predictions on data. Such algorithms operate by building a model from an example training set of input observations in order to make data-driven predictions or decisions expressed as outputs, rather than following strictly static program instructions.