Andrej Karpathy

Karpathy at Stanford in 2016
Born
Andrej Karpathy

23 October 1986 (age 37)
Bratislava, Czechoslovakia (now Slovakia)
Alma mater
Awards Innovators Under 35 (2020)
Scientific career
Fields Machine Learning
Computer Vision
Artificial intelligence [1]
Institutions
Thesis Connecting Images and Natural Language  (2016)
Doctoral advisor Fei-Fei Li
Website karpathy.ai

Andrej Karpathy (born 23 October 1986 [2]) is a Slovak-Canadian computer scientist who served as the director of artificial intelligence and Autopilot Vision at Tesla. He co-founded and formerly worked at OpenAI, [3] [4] [5] where he specialized in deep learning and computer vision. [6] [7] [1] [8]

Education and early life

Karpathy was born in Bratislava, Czechoslovakia (now Slovakia) [9] [10] [11] [12] and moved with his family to Toronto when he was 15. [13] He completed bachelor's degrees in Computer Science and Physics at the University of Toronto in 2009 [14] and a master's degree at the University of British Columbia in 2011, [14] where he worked on physically simulated figures (for example, a simulated runner or a simulated person in a crowd).

Karpathy received a PhD from Stanford University in 2016 under the supervision of Fei-Fei Li, focusing on the intersection of natural language processing and computer vision, and deep learning models suited for this task. [15] [16]

Career and research

Karpathy authored and was the primary instructor of the first deep learning course at Stanford, CS 231n: Convolutional Neural Networks for Visual Recognition. [17] It became one of the largest classes at Stanford, growing from 150 students in 2015 to 750 in 2017. [18]

Karpathy is a founding member of the artificial intelligence research group OpenAI, [19] [20] where he was a research scientist from 2015 to 2017. [18] In June 2017, he became Tesla's director of artificial intelligence, reporting to Elon Musk. [21] [7] [22] He was named one of MIT Technology Review's Innovators Under 35 for 2020. [23] After a sabbatical of several months from Tesla, he announced in July 2022 that he was leaving the company. [24] As of February 2023, he makes YouTube videos on how to create artificial neural networks. [25]

On February 9, 2023, Karpathy announced that he was returning to OpenAI. [26]

A year later, on February 13, 2024, an OpenAI spokesperson confirmed that Karpathy had left OpenAI. [27]

References

  1. Andrej Karpathy publications indexed by Google Scholar
  2. "Self-reported on Twitter". Archived from the original on 23 August 2021. Retrieved 25 April 2019.
  3. "Tesla's Autopilot chief steps down after two years". 26 April 2018. Retrieved 9 August 2018.
  4. "A.I. Researchers Leave Elon Musk Lab to Begin Robotics Start-Up". 7 November 2017. Retrieved 9 August 2018.
  5. "A.I. Researchers Are Making More Than $1 Million, Even at a Nonprofit". 19 April 2017. Retrieved 9 August 2018.
  6. "The Guy Who Taught AI to 'Remember' Is Launching a Startup". 28 July 2018. Retrieved 9 August 2018.
  7. "Elon Musk has poached a top mind in AI research—from himself". 21 June 2017. Retrieved 9 August 2018.
  8. Andrej Karpathy on Medium
  9. "The Slovak, who leads the development of AI at Tesla, is leaving. It was an honor, says Musk". Živé.sk. Retrieved 19 July 2022.
  10. "Šéf AI v Tesle: Rodák zo Slovenska je medzi TOP 35 mladými novátormi". Živé.sk (in Slovak). 25 June 2020. Retrieved 19 July 2022.
  11. "The Slovak, who leads AI in Tesla, left the company for several months. He jokes with Musk about TikTok". Newsy Today. 28 March 2022. Retrieved 19 July 2022. [permanent dead link]
  12. "Slovák Andrej Karpathy z Tesly patrí podľa MIT medzi 35 top inovátorov". TeslaMagazin.sk (in Slovak). 23 June 2020. Retrieved 19 July 2022.
  13. "Next Generation Machine Learning - Training Deep Learning Models in a Browser: Andrej Karpathy Interview | DataScienceWeekly.org". DataScienceWeekly.org. Retrieved 12 November 2018.
  14. "Andrej Karpathy Academic Website". cs.stanford.edu. Retrieved 12 November 2018.
  15. Karpathy, Andrej (2016). Connecting Images and Natural Language. stanford.edu (PhD thesis). Stanford University.
  16. "Does 'robo-journalism' pose a threat to reporters?". 23 March 2017. Retrieved 9 August 2018.
  17. "Stanford University CS231n: Convolutional Neural Networks for Visual Recognition". cs231n.stanford.edu. Retrieved 8 September 2022.
  18. "Andrej Karpathy". karpathy.ai. Retrieved 8 September 2022.
  19. "Introducing OpenAI". OpenAI. 12 December 2015. Retrieved 8 September 2022.
  20. Fan, Shelly (20 December 2015). "Inside OpenAI: Will Transparency Protect Us From Artificial Intelligence Run Amok?". Singularity Hub. Retrieved 8 September 2022.
  21. Etherington, Darrell (21 June 2017). "Tesla hires deep learning expert Andrej Karpathy to lead Autopilot vision". TechCrunch. Retrieved 10 November 2023.
  22. "Tesla hired a top AI expert to lead a critical aspect of Autopilot -- here's what we know". 22 June 2017. Retrieved 9 August 2018.
  23. "Andrej Karpathy (Innovators Under 35 2020)". MIT Technology Review. Retrieved 8 September 2022.
  24. Kolodny, Lora (13 July 2022). "Tesla AI leader Andrej Karpathy announces he's leaving the company". CNBC. Retrieved 14 July 2022.
  25. "Andrej Karpathy - YouTube". youtube.com. Retrieved 4 February 2023.
  26. @karpathy (9 February 2023). "Some personal news: I am joining OpenAI (again :)). Like many others both in/out of AI, I am very inspired by the impact of their work and I have personally benefited greatly from it. The future potential is especially exciting; it is a great pleasure to jump back in and build!🪄" (Tweet) via Twitter.
  27. "OpenAI Researcher Andrej Karpathy Departs". 13 February 2024. Retrieved 13 February 2024.