| Mengdi Wang | |
| --- | --- |
| Alma mater | Massachusetts Institute of Technology; Tsinghua University |
| Scientific career | |
| Institutions | Princeton University |
| Thesis | Stochastic methods for large-scale linear problems, variational inequalities, and convex optimization (2013) |
| Doctoral advisor | Dimitri Bertsekas |
Mengdi Wang is a theoretical computer scientist and a professor at Princeton University. Her research considers the fundamental theory that underpins reinforcement learning and machine learning. She was named one of MIT Technology Review's 35 Under 35 in 2018.
Wang was an undergraduate student at Tsinghua University, where she specialized in automation. At the age of 18, she joined the Massachusetts Institute of Technology as a graduate student, where she worked under the supervision of Dimitri Bertsekas. [1] Her doctoral research developed stochastic methods for large-scale linear systems. [2]
Wang specializes in the theoretical frameworks that underpin machine learning and reinforcement learning. [3] She joined Princeton University as an assistant professor in 2014. [4] She was the first to propose stochastic gradient methods for composition optimization. [1] Her early work used reinforcement learning to minimize risk in financial portfolios and to help hospitals identify potential complications. [3]
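Composition optimization concerns nested objectives of the form min_x E_v[f_v(E_w[g_w(x)])], where an unbiased gradient sample is unavailable because the inner expectation sits inside a nonlinear outer function. The sketch below illustrates the two-timescale idea behind stochastic compositional gradient methods on a toy problem; the quadratic objective, noise model, and step-size schedules are illustrative assumptions and are not taken from Wang's papers.

```python
import numpy as np

rng = np.random.default_rng(0)

d = 5                                # dimension of the decision variable x
A = rng.standard_normal((d, d))      # inner map g(x) = A x
target = rng.standard_normal(d)      # outer cost f(y) = 0.5 * ||y - target||^2

def g_sample(x):
    # Noisy evaluation of the inner expectation E_w[g_w(x)] = A x.
    return A @ x + 0.1 * rng.standard_normal(d)

def g_jacobian_sample(x):
    # Noisy evaluation of the Jacobian of the inner map (here the constant A).
    return A + 0.1 * rng.standard_normal((d, d))

def f_grad_sample(y):
    # Noisy gradient of the outer function at y.
    return (y - target) + 0.1 * rng.standard_normal(d)

x = np.zeros(d)
y = g_sample(x)                      # running estimate of the inner value g(x)

for t in range(1, 5001):
    alpha = 1.0 / t ** 0.75          # step size for x (decays faster)
    beta = 1.0 / t ** 0.5            # averaging weight (decays slower, so y tracks g(x) quickly)

    # Track the inner expectation with an exponential moving average.
    y = (1 - beta) * y + beta * g_sample(x)

    # Chain-rule update that plugs the tracked estimate y in place of E[g(x)].
    x = x - alpha * (g_jacobian_sample(x).T @ f_grad_sample(y))

print("final outer objective:", 0.5 * np.linalg.norm(A @ x - target) ** 2)
```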
Wang has studied Markov decision processes, a standard model for reinforcement learning. She uses state compression methods to sketch black-box Markov processes from empirical data. [4]
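As an illustration of the state compression idea, the sketch below estimates the transition matrix of an unknown chain from a simulated trajectory and keeps only a low-rank spectral sketch of it, which tends to denoise the empirical estimate when the chain has hidden low-rank structure. The simulated chain, rank choice, and use of a truncated SVD here are assumptions for exposition, not Wang's published algorithm.

```python
import numpy as np

rng = np.random.default_rng(1)

# Ground-truth chain with hidden low-rank structure: S states driven by r "meta-states".
S, r = 30, 3
U = rng.dirichlet(np.ones(r), size=S)   # state -> meta-state membership
V = rng.dirichlet(np.ones(S), size=r)   # meta-state -> next-state distribution
P_true = U @ V                          # rank-r stochastic transition matrix

# Simulate a trajectory from the black-box chain.
T = 50_000
states = np.empty(T, dtype=int)
states[0] = 0
for t in range(1, T):
    states[t] = rng.choice(S, p=P_true[states[t - 1]])

# Empirical transition frequencies.
counts = np.zeros((S, S))
np.add.at(counts, (states[:-1], states[1:]), 1)
P_hat = counts / np.maximum(counts.sum(axis=1, keepdims=True), 1)

# State compression: a truncated SVD keeps only the dominant structure of the chain.
Uh, s, Vh = np.linalg.svd(P_hat)
P_sketch = Uh[:, :r] @ np.diag(s[:r]) @ Vh[:r, :]

print("estimation error, full empirical matrix:", np.linalg.norm(P_hat - P_true))
print("estimation error, low-rank sketch:      ", np.linalg.norm(P_sketch - P_true))
```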
In 2020, Wang joined the C3.ai Digital Transformation Institute, a consortium of researchers who seek to accelerate the use of artificial intelligence in society. She proposed that reinforcement learning could be used to protect educational establishments from COVID-19. [5] She used system identification and adaptive control to develop strategies for understanding the health status of students and to deploy algorithms that recommend interventions to decision makers. [5] In 2024, she received a United States Department of Defense Multidisciplinary University Research Initiative (MURI) award to develop artificial intelligence and reinforcement learning for biological systems. [6] She showed that large language models with semantic representations can be used to design mRNA vaccines. [7]