Anima Anandkumar

Last updated
Anima Anandkumar
Alma mater Indian Institute of Technology Madras (BS)
Cornell University (MS, PhD)
Scientific career
Institutions University of California Irvine
California Institute of Technology
Thesis Scalable Algorithms for Distributed Statistical Inference  (2009)
Doctoral advisor Lang Tong

Animashree (Anima) Anandkumar is the Bren Professor of Computing at California Institute of Technology. Previously, she was a senior director of Machine Learning research at NVIDIA and a principal scientist at Amazon Web Services. Her research considers tensor-algebraic methods, deep learning and non-convex problems.

Contents

Education and early career

Anandkumar was born in Mysore. Her parents are both engineers, and her grandfather was a mathematician. [1] Her great-great-grandfather was the Sanskrit scholar R. Shamasastry. She began to study Bharatanatyam and she learnt this style of dancing for many years. [2] She studied electrical engineering at the Indian Institute of Technology Madras and graduated in 2004. [1] She joined Cornell University for her graduate studies, earning a PhD under the supervision of Lang Tong in 2009. Her first project looked at distributed statistical estimation [3] . She was an IBM Fellow at Cornell University between 2008 and 2009. Her thesis considered Scalable Algorithms for Distributed Statistical Inference. [2] During her PhD she worked in the networking group at IBM on end-to-end service-level transactions. She was a postdoctoral scholar at Massachusetts Institute of Technology until 2010, where she worked in the Stochastic Systems Group with Alan Willsky. [4]

Research

In 2010, Anandkumar joined University of California, Irvine, as an assistant professor. At the time, the technology industry was at the beginning of the big data revolution. Here she started working on tensor decompositions of latent variable models. [5] She joined Microsoft Research in New England as a visiting scientist in 2012. In 2013 she was awarded a National Science Foundation CAREER Award to investigate big data and social networks. [6] She was made an assistant professor with tenure at UC Irvine in 2016. [7] She specialised in large-scale machine learning and high-dimensional statistics. [8] Anandkumar was a Principal Scientist at Amazon Web Services from 2016 to 2018. [9] She worked with the Apache MXNet tool, introducing new functionality and developing multi-modal processing algorithms. [9] [10] She represented Amazon Web Services at the Anita Borg Institute in 2017, the Mulan forum for Chinese women entrepreneurs and Shaastra in 2018, discussing Deep Learning. [11] [12] She also worked on Amazon Rekognition, Amazon Lex and Amazon Polly. She was involved with the launch of Amazon SageMaker, an opportunity for developers to use machine learning models. [12] Anandkumar joined the Machine Learning Conference Board of Advisors in 2018. [13] In 2018, Anandkumar joined NVIDIA as Director of Machine Learning Research, and Caltech as the Bren Professor of Computing and Mathematical Sciences. [9] [14] [15] At NVIDIA she opened a new core laboratories in artificial intelligence and machine learning in Santa Clara. [16] [17] She has pushed for governments to invest in robotics and artificial intelligence. [18] She spoke at the 2018 TED xIndiana University about the algorithms she has developed to process big data. [19] [20]

Anima Anandkumar has also developed AI algorithms that with applications in various scientific domains including weather forecasting, drug discovery, scientific simulations and engineering design. She invented Neural Operators that extend deep learning to modeling multi-scale processes in these scientific domains and learn in function spaces and are orders of magnitude faster than traditional simulations. She has developed AI-based high-resolution weather models [21] , an AI-aided method for designing anti-infection medical catheters [22] . Neural operators were featured as a highlight for 2021 in Math and Computer Science by the Quanta Magazine [23] , and genome-scale foundation models with emergent behavior in predicting evolutionary dynamics and protein function in several diverse tasks and scenarios [24] , which won the Association for Computing Machinery (ACM) Gordon Bell Special Prize for High Performance Computing-Based COVID-19 Research in 2022 [25] .

Anandkumar has also done some of the early work on generalist AI agents using language models, which are capable of life-long learning using foundation models in an interactive manner. In particular, her work has shown how interactive in-context learning in language models can be used to construct actions in form of program code to solve complex open-ended tasks in environments such as Minecraft [26] and robotic reinforcement learning [27] .

While at Caltech, Anandkumar co-founded the AI for Science initiative in 2018. In 2023, she was invited by the Presidential Council of Advisors on Science and Technology (PCAST) on AI+Science [28] . In addition, she has given keynotes at the Annual Meeting of the US National Committee for Theoretical and Applied Mechanics [29] , the UCLA distinguished seminar [30] , the SIAM annual meeting [31] , and the Nature Reviews Physics, hosted by the Alan Turing Institute [32] .

Diversity in technology

Anandkumar is committed to improving diversity in the technology sector. She launched a petition to Timothy A. Gonsalves to try and convince him at the Ministry of Human Resource Development to end gender segregation in the admissions process at the Indian Institute of Technology Madras. [33] The petition calls for campus-wide systems to monitor sexual harassment, improved campus security and increased engagement with alumni. [33] [34] She has spoken openly about her own experiences of sexual harassment on social media and called for Intel to stop using female acrobats as entertainment at their conference parties. [35] She was one of several campaigners to rename the Conference on Neural Information Processing Systems 'NIPS' as NeurIPS. [36] In 2018, she was awarded a New York Times Good Tech Award. [37]

Controversies

In December 2020, Anandkumar was embroiled in a Twitter controversy, when she published a list of individuals who allegedly followed, liked or supported any Tweets made by Pedro Domingos allegedly in relation to his controversial views on the renaming of NeurIPS, Timnit Gebru's controversial exit at Google, algorithmic bias or cancel culture, or simply followed him on Twitter. She never clarified how she actually came up with that list. [38] She suggested that followers "try and change the mind of [these] fanboys of Pedro[...] Especially junior people". [39] Following prompt backlash from individuals concerned about the circulation of such a blacklist, Anandkumar deactivated her Twitter account temporarily and issued an apology stating "I am by no means perfect. I am sorry if my actions/words have ever created a threatening environment. My intention was to change hearts and minds, and to raise awareness to the struggles that women and minorities face both online and in the real world. I will find better ways to achieve that goal". [40]

Awards and honors

Anandkumar has won several awards and honours, including: [41]

Related Research Articles

<span class="mw-page-title-main">Quantum computing</span> Technology that uses quantum mechanics

A quantum computer is a computer that takes advantage of quantum mechanical phenomena.

<span class="mw-page-title-main">David Bader (computer scientist)</span> American computer scientist

David A. Bader is a Distinguished Professor and Director of the Institute for Data Science at the New Jersey Institute of Technology. Previously, he served as the Chair of the Georgia Institute of Technology School of Computational Science & Engineering, where he was also a founding professor, and the executive director of High-Performance Computing at the Georgia Tech College of Computing. In 2007, he was named the first director of the Sony Toshiba IBM Center of Competence for the Cell Processor at Georgia Tech.

<span class="mw-page-title-main">Daphne Koller</span> Israeli-American computer scientist

Daphne Koller is an Israeli-American computer scientist. She was a professor in the department of computer science at Stanford University and a MacArthur Foundation fellowship recipient. She is one of the founders of Coursera, an online education platform. Her general research area is artificial intelligence and its applications in the biomedical sciences. Koller was featured in a 2004 article by MIT Technology Review titled "10 Emerging Technologies That Will Change Your World" concerning the topic of Bayesian machine learning.

<span class="mw-page-title-main">Bill Dally</span> American computer scientist and educator (born 1960)

William James Dally is an American computer scientist and educator. He is the chief scientist and senior vice president at Nvidia and was previously a professor of Electrical Engineering and Computer Science at Stanford University and MIT. Since 2021, he has been a member of the President's Council of Advisors on Science and Technology (PCAST).

Learning to rank or machine-learned ranking (MLR) is the application of machine learning, typically supervised, semi-supervised or reinforcement learning, in the construction of ranking models for information retrieval systems. Training data may, for example, consist of lists of items with some partial order specified between items in each list. This order is typically induced by giving a numerical or ordinal score or a binary judgment for each item. The goal of constructing the ranking model is to rank new, unseen lists in a similar way to rankings in the training data.

<span class="mw-page-title-main">Jitendra Malik</span> Indian-American academic (born 1960)

Jitendra Malik is an Indian-American academic who is the Arthur J. Chick Professor of Electrical Engineering and Computer Sciences at the University of California, Berkeley. He is known for his research in computer vision.

<span class="mw-page-title-main">Alan Edelman</span> American mathematician

Alan Stuart Edelman is an American mathematician and computer scientist. He is a professor of applied mathematics at the Massachusetts Institute of Technology (MIT) and a Principal Investigator at the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL) where he leads a group in applied computing. In 2004, he founded a business called Interactive Supercomputing which was later acquired by Microsoft. Edelman is a fellow of American Mathematical Society (AMS), Society for Industrial and Applied Mathematics (SIAM), Institute of Electrical and Electronics Engineers (IEEE), and Association for Computing Machinery (ACM), for his contributions in numerical linear algebra, computational science, parallel computing, and random matrix theory. He is one of the creators of the technical programming language Julia.

<span class="mw-page-title-main">Katherine Yelick</span> American computer scientist and academic

Katherine "Kathy" Anne Yelick, an American computer scientist, is the vice chancellor for research and the Robert S. Pepper Professor of Electrical Engineering and Computer Sciences at the University of California, Berkeley. She is also a faculty scientist at Lawrence Berkeley National Laboratory, where she was Associate Laboratory Director for Computing Sciences from 2010–2019.

Steven H. Low is a Professor of the Computing and Mathematical Sciences Department and the Electrical Engineering Department at the California Institute of Technology. He is known for his work on the theory and mathematical modeling of Internet congestion control, algorithms, and optimization in power systems.

<span class="mw-page-title-main">Eric Xing</span>

Eric Poe Xing is an American computer scientist whose research spans machine learning, computational biology, and statistical methodology. Xing is founding President of the world’s first artificial intelligence university, Mohamed bin Zayed University of Artificial Intelligence (MBZUAI).

<span class="mw-page-title-main">Yoshua Bengio</span> Canadian computer scientist

Yoshua Bengio is a Canadian computer scientist, most noted for his work on artificial neural networks and deep learning. He is a professor at the Department of Computer Science and Operations Research at the Université de Montréal and scientific director of the Montreal Institute for Learning Algorithms (MILA).

An AI accelerator, deep learning processor, or neural processing unit (NPU) is a class of specialized hardware accelerator or computer system designed to accelerate artificial intelligence and machine learning applications, including artificial neural networks and machine vision. Typical applications include algorithms for robotics, Internet of Things, and other data-intensive or sensor-driven tasks. They are often manycore designs and generally focus on low-precision arithmetic, novel dataflow architectures or in-memory computing capability. As of 2024, a typical AI integrated circuit chip contains tens of billions of MOSFET transistors.

Explainable AI (XAI), often overlapping with Interpretable AI, or Explainable Machine Learning (XML), either refers to an artificial intelligence (AI) system over which it is possible for humans to retain intellectual oversight, or refers to the methods to achieve this. The main focus is usually on the reasoning behind the decisions or predictions made by the AI which are made more understandable and transparent. XAI counters the "black box" tendency of machine learning, where even the AI's designers cannot explain why it arrived at a specific decision.

ACM Conference on Fairness, Accountability, and Transparency is a peer-reviewed academic conference series about ethics and computing systems. Sponsored by the Association for Computing Machinery, this conference focuses on issues such as algorithmic transparency, fairness in machine learning, bias, and ethics from a multi-disciplinary perspective. The conference community includes computer scientists, statisticians, social scientists, scholars of law, and others.

Maria-Florina (Nina) Balcan is a Romanian-American computer scientist whose research investigates machine learning, algorithmic game theory, theoretical computer science, including active learning, kernel methods, random-sampling mechanisms and envy-free pricing. She is an associate professor of computer science at Carnegie Mellon University.

<span class="mw-page-title-main">Margaret Mitchell (scientist)</span> U.S. computer scientist

Margaret Mitchell is a computer scientist who works on algorithmic bias and fairness in machine learning. She is most well known for her work on automatically removing undesired biases concerning demographic groups from machine learning models, as well as more transparent reporting of their intended use.

<span class="mw-page-title-main">Deborah Raji</span> Nigerian-Canadian computer scientist and activist

Inioluwa Deborah Raji is a Nigerian-Canadian computer scientist and activist who works on algorithmic bias, AI accountability, and algorithmic auditing. Raji has previously worked with Joy Buolamwini, Timnit Gebru, and the Algorithmic Justice League on researching gender and racial bias in facial recognition technology. She has also worked with Google’s Ethical AI team and been a research fellow at the Partnership on AI and AI Now Institute at New York University working on how to operationalize ethical considerations in machine learning engineering practice. A current Mozilla fellow, she has been recognized by MIT Technology Review and Forbes as one of the world's top young innovators.

Himabindu "Hima" Lakkaraju is an Indian-American computer scientist who works on machine learning, artificial intelligence, algorithmic bias, and AI accountability. She is currently an Assistant Professor at the Harvard Business School and is also affiliated with the Department of Computer Science at Harvard University. Lakkaraju is known for her work on explainable machine learning. More broadly, her research focuses on developing machine learning models and algorithms that are interpretable, transparent, fair, and reliable. She also investigates the practical and ethical implications of deploying machine learning models in domains involving high-stakes decisions such as healthcare, criminal justice, business, and education. Lakkaraju was named as one of the world's top Innovators Under 35 by both Vanity Fair and the MIT Technology Review.

Elad Hazan is an Israeli-American computer scientist, academic, author and researcher. He is a Professor of Computer Science at Princeton University, and the co-founder and director of Google AI Princeton.

<span class="mw-page-title-main">Chelsea Finn</span> American computer scientist and academic

Chelsea Finn is an American computer scientist and assistant professor at Stanford University. Her research investigates intelligence through the interactions of robots, with the hope to create robotic systems that can learn how to learn. She is part of the Google Brain group.

References

  1. 1 2 "Contributors". IEEE Transactions on Information Theory. 54 (7): 3371–3382. 2008. doi:10.1109/TIT.2008.924739. ISSN   0018-9448.
  2. 1 2 Animashree., Anandkumar (2009). Scalable algorithms for distributed statistical inference. OCLC   458398906.
  3. Anandkumar, Animashree; Tong, Lang (2006). "Distributed Statistical Inference using Type Based Random Access over Multi-access Fading Channels". 2006 40th Annual Conference on Information Sciences and Systems: 38–43. doi:10.1109/CISS.2006.286427. ISBN   1-4244-0349-9.
  4. "Anima Anandkumar". Women in Data Science (WiDS) Conference 2019. Archived from the original on 2019-01-08. Retrieved 2019-01-07.
  5. Anandkumar, Anima; Ge, Rong; Hsu, Daniel; Kakade, Sham M.; Telgarsky, Matus (2012-12-08). "Tensor Decompositions for Learning Latent Variable Models". Journal of Machine Learning Research. Fort Belvoir, VA. arXiv: 1210.7559 . Bibcode:2012arXiv1210.7559A. doi:10.21236/ada604494. S2CID   14587260.
  6. 1 2 "NSF Award Search: Award#1251267 - BIGDATA: Small: DA: DCM: Measurement and Learning in Large-Scale Social Networks". www.nsf.gov. Retrieved 2019-01-07.
  7. "Anima Anandkumar". UCI Data Science Initiative. Archived from the original on 2015-03-05. Retrieved 2019-01-07.
  8. "Speakers | UCLA Anderson School of Management". www.anderson.ucla.edu. Archived from the original on 2019-01-08. Retrieved 2019-01-07.
  9. 1 2 3 "In the Research Spotlight: Anima Anandkumar". Amazon Web Services. 2017-05-16. Retrieved 2019-01-07.
  10. Lorica, Ben (2017-03-09). "Deep learning that's easy to implement and easy to scale". O'Reilly Media. Retrieved 2019-01-07.
  11. Amazon Web Services, Distributed Deep Learning , retrieved 2019-01-07
  12. 1 2 Sugandha Lahoti (2018-09-01). "Anima Anandkumar, the machine learning guru behind AWS bids adieu to AWS". Packt Hub. Retrieved 2019-01-07.
  13. Courtney (2018-07-06). "Dr. Anima Anandkumar has Joined the MLconf Board of Advisors!". The Machine Learning Conference. Retrieved 2019-01-07.
  14. "Anima Anandkumar | Research". research.nvidia.com. Retrieved 2019-01-07.
  15. "Caltech Celebrates Newest Cohort of Named Professors | Caltech". The California Institute of Technology. Archived from the original on 2019-01-09. Retrieved 2019-01-08.
  16. NVIDIA, Research at NVIDIA: New Core AI and Machine Learning Lab , retrieved 2019-01-07
  17. Daily-AI (2018-11-30). "NVIDIA Opening Core AI and ML Research Lab in Santa Clara". Daily AI. Archived from the original on 2019-04-14. Retrieved 2019-01-07.
  18. "Dr. Anima Anandkumar". World Government Summit - Speakers. Retrieved 2019-01-07.
  19. "Home". TEDxIndianaUniversity. Retrieved 2019-01-07.
  20. "'From Ashes We Rise' TEDxIU talk to feature six speakers". Indiana Daily Student. Retrieved 2019-01-07.
  21. Pathak, J., Subramanian, S., Harrington, P. B., Raja, S. S., Chattopadhyay, A., Mardani, M., Kurth, T., Hall, D., Li, Z., Azizzadenesheli, K., Hassanzadeh, P., Kashinath, K., Anandkumar, A. (22 February 2022). "FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators". PASC '23: Proceedings of the Platform for Advanced Scientific Computing Conference. abs/2202.11214. arXiv: 2202.11214 .
  22. Zhou, Tingtao; Wan, Xuan; Huang, Daniel Zhengyu; Li, Zongyi; Peng, Zhiwei; Anandkumar, Anima; Brady, John F.; Sternberg, Paul W.; Daraio, Chiara (2024). "AI-aided geometric design of anti-infection catheters". Science Advances. 10 (1): eadj1741. arXiv: 2304.14554 . Bibcode:2024SciA...10J1741Z. doi:10.1126/sciadv.adj1741. PMC   10776022 . PMID   38170782.
  23. The Year in Math and Computer Science, 2021, retrieved 30 March 2024
  24. Zvyagin, Maxim; et al. (10 October 2022). "GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics". The International Journal of High Performance Computing Applications: 2022.10.10.511571. doi:10.1101/2022.10.10.511571. PMC   9709791 . PMID   36451881.
  25. "ACM Gordon Bell Special Prize for HPC-Based COVID-19 Research Awarded to Team for Modelling How Pandemic-Causing Viruses, Especially SARS-CoV-2, are Identified and Classified" . Retrieved 30 March 2024.
  26. Wang, Guanzhi; Xie, Yuqi; Jiang, Yunfan; Mandlekar, Ajay; Xiao, Chaowei; Zhu, Yuke; Fan, Linxi; Anandkumar, Anima (30 March 2023). "Voyager: An Open-Ended Embodied Agent with Large Language Models". arXiv preprint. arXiv: 2305.16291 .
  27. Yecheng Jason Ma; Liang, William; Wang, Guanzhi; Huang, De-An; Bastani, Osbert; Jayaraman, Dinesh; Zhu, Yuke; Fan, Linxi; Anandkumar, Anima (2023). "Eureka: Human-Level Reward Design via Coding Large Language Models". arXiv preprint. arXiv: 2310.12931 .
  28. Anima Anandkumar Briefs PCAST on AI Enabling Science, 2024, retrieved 30 March 2024
  29. Machine Learning in Mechanics: The U.S. National Committee on Theoretical and Applied Mechanics (USNC/TAM), 2020, retrieved 30 March 2024
  30. Anima Anandkumar, 2024, retrieved 30 March 2024
  31. AN21 Invited Presentations, 2024, retrieved 30 March 2024
  32. Nature Reviews Physics: Machine learning research in industry, 2024, retrieved 30 March 2024
  33. 1 2 Ghosh, Shubhangi (2018-01-18). "Shaastra Spotlight Interviews: Anima Anandkumar". The Fifth Estate. Archived from the original on 2019-05-28. Retrieved 2019-01-07.
  34. "More girl students to get admission in IITs from 2018". The Economic Times. 2017-04-15. Retrieved 2019-01-07.
  35. Simonite, Tom (2018-08-17). "AI Is the Future—But Where Are the Women?". Wired. ISSN   1059-1028 . Retrieved 2019-01-07.
  36. Simonite, Tom (2018-10-26). "AI Researchers Fight Over Four Letters: NIPS". Wired. ISSN   1059-1028 . Retrieved 2019-01-07.
  37. Roose, Kevin (2018-12-21). "The 2018 Good Tech Awards". The New York Times. ISSN   0362-4331 . Retrieved 2019-01-08.
  38. Soper, Taylor (16 December 2020). "Retired UW computer science professor embroiled in Twitter spat over AI ethics and 'cancel culture'". GeekWire. Retrieved 17 December 2020.
  39. "Archive of Twitter comments, now deleted". Archived from the original on 2020-12-14.
  40. "My heartfelt apology" . Retrieved 2020-12-16.
  41. "Anima Anandkumar | Simons Institute for the Theory of Computing". simons.berkeley.edu. 22 September 2016. Retrieved 2019-01-07.
  42. "ANNOUNCING THE 2023 GUGGENHEIM FELLOWS" . Retrieved 30 March 2024.
  43. "Second Cohort of AI2050 Senior Fellows Named By Schmidt Futures". 17 October 2023. Retrieved 30 March 2024.
  44. "ACM Gordon Bell Special Prize for HPC-Based COVID-19 Research Awarded to Team for Modelling How Pandemic-Causing Viruses, Especially SARS-CoV-2, are Identified and Classified" . Retrieved 30 March 2024.
  45. "Global computing association names 57 fellows for outstanding contributions that propel technology today". Association for Computing Machinery. January 18, 2023. Retrieved 2023-01-18.
  46. "Caltech Computing + Mathematical Sciences | News | Anima Anandkumar Elevated to IEEE Fellow". Archived from the original on 2020-12-22. Retrieved 2020-12-31.
  47. "NVIDIA Leaders Honored at VentureBeat Transform 'Women in AI Awards'". 17 July 2020.
  48. "Google Research Awards: Fall 2015". Google AI Blog. 12 February 2016. Retrieved 2019-01-07.
  49. "Anandkumar receives AFOSR Young Investigator Award | Center for Machine Learning and Intelligent Systems". 22 January 2015. Retrieved 2019-01-07.
  50. Anandkumar, Anima. "Office Hour with Anima Anandkumar (UC Irvine): Big data conference: Strata + Hadoop World, March 13 - 16, 2017, San Jose, CA". conferences.oreilly.com. Retrieved 2019-01-07.
  51. "Anandkumar Receives 'Early-Career' Sloan Research Fellowship | Center for Machine Learning and Intelligent Systems". 20 February 2014. Retrieved 2019-01-07.
  52. "Microsoft Research Names Anima Anandkumar a Faculty Fellow | The Henry Samueli School of Engineering at UC Irvine". engineering.uci.edu. Retrieved 2019-01-07.
  53. "Honors and Awards". acsp.ece.cornell.edu. Retrieved 2019-01-07.
  54. Anandkumar, Animashree; Tong, Lang (October 2007). "Type-Based Random Access for Distributed Detection Over Multiaccess Fading Channels". IEEE Transactions on Signal Processing. 55 (10): 5032–5043. Bibcode:2007ITSP...55.5032A. CiteSeerX   10.1.1.71.1653 . doi:10.1109/tsp.2007.896302. ISSN   1053-587X. S2CID   10708719.