Andrew McCallum

Nationality	American
Alma mater	Dartmouth College; University of Rochester
Known for	Conditional random field
Awards	ICML Test of Time (2011)
	Scientific career
Fields	Computer Science, Artificial Intelligence
Institutions	WhizBang Labs; University of Massachusetts Amherst; Google Research
Doctoral advisor	Dana H. Ballard

Last updated November 08, 2024

Andrew McCallum is a professor in the computer science department at the University of Massachusetts Amherst.^[3] His primary specialties are machine learning, natural language processing, information extraction, information integration, and social network analysis.^[4]

Career

McCallum graduated summa cum laude from Dartmouth College in 1989. He completed his Ph.D. at the University of Rochester in 1995 under the supervision of Dana H. Ballard. He was then a postdoctoral fellow at Carnegie Mellon University, working with Sebastian Thrun and Tom M. Mitchell. From 1998 to 2000 he was a Research Scientist and Research Coordinator at Justsystem Pittsburgh Research Center. From 2000 to 2002 he was Vice President of Research and Development at WhizBang Labs and Director of its Pittsburgh office. Since 2002, he has worked as a professor of computer science at the University of Massachusetts Amherst. In 2020, he also joined Google as a part-time research scientist.

He was elected a fellow of the Association for the Advancement of Artificial Intelligence^[5] in 2009 and a fellow of the Association for Computing Machinery^[6] in 2017. From 2014 to 2017 he was the President of the International Machine Learning Society (IMLS),^[7] which organizes the International Conference on Machine Learning. He is also the director of the Center for Data Science at UMass, leading a new partnership with the Chan Zuckerberg Initiative. In 2018, the initiative made an initial grant of $5.5 million to the center, supporting research to facilitate new ways for scientists to explore and discover research articles.^[8]

Main contributions

In collaboration with John D. Lafferty and Fernando Pereira, McCallum developed conditional random fields, first described in a paper presented at the International Conference on Machine Learning (ICML).^[9] In 2011 this research paper won the ICML "Test of Time" (10-year best paper) award.^[10]

McCallum has written several widely used^[11] open-source software toolkits for machine learning, natural language processing and other text processing, including Rainbow,^[12] MALLET, and FACTORIE.^[13] In addition, he was instrumental in publishing the Enron Corpus, a large collection of emails that has been used as a basis for a number of academic studies of social networking and language.

McCallum instigated and directs the nonprofit project OpenReview.net, an online platform that aims to promote openness in scientific communication, particularly the peer review process, by providing a flexible cloud-based web interface and underlying database API.^[14]

Related Research Articles

Michael Irwin Jordan is an American scientist, professor at the University of California, Berkeley, research scientist at Inria Paris, and researcher in machine learning, statistics, and artificial intelligence.

Daphne Koller is an Israeli-American computer scientist. She was a professor in the department of computer science at Stanford University and a MacArthur Foundation fellowship recipient. She is one of the founders of Coursera, an online education platform. Her general research area is artificial intelligence and its applications in the biomedical sciences. Koller was featured in a 2004 article by MIT Technology Review titled "10 Emerging Technologies That Will Change Your World" concerning the topic of Bayesian machine learning.

Sebastian Thrun is a German-American entrepreneur, educator, and computer scientist. He is CEO of Kitty Hawk Corporation, and chairman and co-founder of Udacity. Before that, he was a Google VP and Fellow, a Professor of Computer Science at Stanford University, and before that at Carnegie Mellon University. At Google, he founded Google X and Google's self-driving car team. He is also an adjunct professor at Stanford University and at Georgia Tech.

Conditional random fields (CRFs) are a class of statistical modeling methods often applied in pattern recognition and machine learning and used for structured prediction. Whereas a classifier predicts a label for a single sample without considering "neighbouring" samples, a CRF can take context into account. To do so, the predictions are modelled as a graphical model, which represents the presence of dependencies between the predictions. What kind of graph is used depends on the application. For example, in natural language processing, "linear chain" CRFs are popular, for which each prediction is dependent only on its immediate neighbours. In image processing, the graph typically connects locations to nearby and/or similar locations to enforce that they receive similar predictions.

Discriminative models, also referred to as conditional models, are a class of models frequently used for classification. They are typically used to solve binary classification problems, i.e. assign labels, such as pass/fail, win/lose, alive/dead or healthy/sick, to existing datapoints.

CALO was an artificial intelligence project that attempted to integrate numerous AI technologies into a cognitive assistant. CALO is an acronym for "Cognitive Assistant that Learns and Organizes". The name was inspired by the Latin word "Calo" which means "soldier's servant". The project started in May 2003 and ran for five years, ending in 2008.

Michael Lederman Littman is a computer scientist, researcher, educator, and author. His research interests focus on reinforcement learning. He is currently a University Professor of Computer Science at Brown University, where he has taught since 2012.

Dieter Fox is a German-American roboticist and a Professor in the Department of Computer Science & Engineering at the University of Washington, Seattle. He is also the Senior Director of Robotics Research at Nvidia. He received his PhD in Computer Science at the University of Bonn in 1998 on mobile robot localization and navigation. He is most notable for his contributions to several fields including robotics, artificial intelligence, machine learning, and ubiquitous computing. Together with Wolfram Burgard and Sebastian Thrun he is a co-author of the book Probabilistic Robotics. He was elected an AAAI Fellow in 2011.

Co-training is a machine learning algorithm used when there are only small amounts of labeled data and large amounts of unlabeled data. One of its uses is in text mining for search engines. It was introduced by Avrim Blum and Tom Mitchell in 1998.

The International Conference on Machine Learning (ICML) is the leading international academic conference in machine learning. Along with NeurIPS and ICLR, it is one of the three primary conferences of high impact in machine learning and artificial intelligence research. It is supported by the International Machine Learning Society (IMLS). Precise dates vary from year to year, but paper submissions are generally due at the end of January, and the conference is generally held the following July. The first ICML was held in 1980 in Pittsburgh.

The AAAI Conference on Artificial Intelligence (AAAI) is a leading international academic conference in artificial intelligence held annually. It ranks 4th in terms of H5 Index in Google Scholar's list of top AI publications, after ICLR, NeurIPS, and ICML. It is supported by the Association for the Advancement of Artificial Intelligence. Precise dates vary from year to year, but paper submissions are generally due between the end of August and the beginning of September, and the conference is generally held the following February. The first AAAI was held in 1980 at Stanford University in Stanford, California.

Structured prediction or structured output learning is an umbrella term for supervised machine learning techniques that involve predicting structured objects, rather than discrete or real values.

In statistics, a maximum-entropy Markov model (MEMM), or conditional Markov model (CMM), is a graphical model for sequence labeling that combines features of hidden Markov models (HMMs) and maximum entropy (MaxEnt) models. An MEMM is a discriminative model that extends a standard maximum entropy classifier by assuming that the unknown values to be learnt are connected in a Markov chain rather than being conditionally independent of each other. MEMMs find applications in natural language processing, specifically in part-of-speech tagging and information extraction.

In machine learning and natural language processing, the pachinko allocation model (PAM) is a topic model. Topic models are a suite of algorithms to uncover the hidden thematic structure of a collection of documents. The algorithm improves upon earlier topic models such as latent Dirichlet allocation (LDA) by modeling correlations between topics in addition to the word correlations which constitute topics. PAM provides more flexibility and greater expressive power than latent Dirichlet allocation. While first described and implemented in the context of natural language processing, the algorithm may have applications in other fields such as bioinformatics. The model is named for pachinko machines—a game popular in Japan, in which metal balls bounce down around a complex collection of pins until they land in various bins at the bottom.

In statistics, generalized iterative scaling (GIS) and improved iterative scaling (IIS) are two early algorithms used to fit log-linear models, notably multinomial logistic regression (MaxEnt) classifiers and extensions of it such as MaxEnt Markov models and conditional random fields. These algorithms have been largely surpassed by gradient-based methods such as L-BFGS and coordinate descent algorithms.

In machine learning, a probabilistic classifier is a classifier that is able to predict, given an observation of an input, a probability distribution over a set of classes, rather than only outputting the most likely class that the observation should belong to. Probabilistic classifiers provide classification that can be useful in its own right or when combining classifiers into ensembles.

John D. Lafferty is an American scientist, Professor at Yale University and leading researcher in machine learning. He is best known for proposing conditional random fields with Andrew McCallum and Fernando C.N. Pereira.

Eric Poe Xing is an American computer scientist whose research spans machine learning, computational biology, and statistical methodology. Xing is founding President of the world’s first artificial intelligence university, Mohamed bin Zayed University of Artificial Intelligence (MBZUAI).

Dan Roth is the Eduardo D. Glandt Distinguished Professor of Computer and Information Science at the University of Pennsylvania and the Chief AI Scientist at Oracle. Until June 2024, Roth was a VP/Distinguished Scientist at AWS AI, where for three years he led the scientific effort behind the first-generation Generative AI products from AWS, including Titan Models, Amazon Q efforts, and Bedrock, from inception until they became generally available.


References

[1] "Bio for Andrew McCallum". Archived from the original on 4 June 2010. Retrieved 2010-05-31.
[2] "The Mathematics Genealogy Project - Andrew McCallum". Archived from the original on 6 May 2010. Retrieved 2010-05-31.
[3] "Faculty Directory". cs.umass.edu. Archived from the original on 4 June 2010. Retrieved 2010-05-31.
[4] "Elected AAAI Fellows". Retrieved 2010-05-31.
[5] "AAAI Fellows Program".
[6] "All Award Winners". awards.acm.org. Retrieved 2017-12-22.
[7] "IMLS Board". machinelearning.org. Retrieved 19 March 2015. President-Elect Andrew McCallum
[8] "UMass Center for Data Science Partners with Chan Zuckerberg Initiative to Accelerate Science and Medicine". umass.edu. Retrieved 2018-01-26.
[9] Lafferty, J.; McCallum, A.; Pereira, F.C.N. (2001). "Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data". International Conference on Machine Learning. pp. 79–87.
[10] "Test-of-Time Award ICML'11". ICML. 2011. Retrieved 15 December 2014. John D. Lafferty, Andrew McCallum, Fernando C. N. Pereira. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data.
[11] "Google Scholar search for +mccallum +rainbow +software". Retrieved 2010-05-31.
[12] "Rainbow". Archived from the original on 26 May 2010. Retrieved 2010-05-31.
[13] "FACTORIE". Retrieved 2014-05-20.
[14] "OpenReview".

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
