Matei Zaharia

Last updated
Matei Zaharia
Alma mater UC Berkeley (Ph.D.)
University of Waterloo (BMath)
Known for Apache Spark
Awards ACM Doctoral Dissertation Award (2014)
Presidential Early Career Award for Scientists and Engineers (2019)
SIGOPS Mark Weiser Award (2023)
Scientific career
Fields Computer science
Institutions UC Berkeley
Stanford University
Databricks
Thesis An Architecture for Fast and General Data Processing on Large Clusters  (2013)
Doctoral advisor Ion Stoica
Scott Shenker
Website people.eecs.berkeley.edu/~matei/

Matei Zaharia is a Romanian-Canadian computer scientist, educator and the creator of Apache Spark. [1] [2] [3]

Contents

As of April 2022, Forbes ranked him and Ion Stoica as the 3rd-richest people in Romania with a net worth of $1.6 billion. [4]

Biography

Zaharia graduated from secondary school at Jarvis Collegiate Institute before moving to become an undergraduate at the University of Waterloo. [5] Zaharia was a gold medalist at the International Collegiate Programming Contest, where his team University of Waterloo placed fourth in the world and first in North America in 2005. [6] During his undergraduate degree at the University of Waterloo, he also greatly contributed to water rendering physics in the now open-source game called 0 A.D. [7] He also helped mod the Age of Mythology scenario called Norse Wars, which was re-adapted into the Age of Empires 3 scenario called Fort Wars. [8] While at University of California, Berkeley's AMPLab in 2009, he created Apache Spark as a faster alternative to MapReduce. [9] He received the 2014 ACM Doctoral Dissertation Award for his PhD research on large-scale computing. [10]

In 2013 Zaharia was one of the co-founders of Databricks where he serves as chief technology officer. [2]

He joined the faculty of MIT in 2015, and then became an assistant professor of computer science at Stanford University in 2016.

In 2019, Zaharia received the Presidential Early Career Award for Scientists and Engineers. [5]

In 2019 he was spearheading MLflow at Databricks, while still teaching. [11] [12] [13]

In 2023, he joined the faculty of the University of California, Berkeley as an associate professor.

See also

Related Research Articles

Scott J. Shenker is an American computer scientist, and professor of computer science at the University of California, Berkeley. He is also the leader of the Extensible Internet Group at the International Computer Science Institute in Berkeley, California.

Venkatesan Guruswami is a senior scientist at the Simons Institute for the Theory of Computing and Professor of EECS and Mathematics at the University of California, Berkeley. He did his high schooling at Padma Seshadri Bala Bhavan in Chennai, India. He completed his undergraduate in Computer Science from IIT Madras and his doctorate from Massachusetts Institute of Technology under the supervision of Madhu Sudan in 2001. After receiving his PhD, he spent a year at UC Berkeley as a Miller Fellow, and then was a member of the faculty at the University of Washington from 2002 to 2009. His primary area of research is computer science, and in particular on error-correcting codes. During 2007–2008, he visited the Institute for Advanced Study as a Member of School of Mathematics. He also visited SCS at Carnegie Mellon University during 2008–09 as a visiting faculty. From July 2009 through December 2020 he was a faculty member in the Computer Science Department in the School of Computer Science at Carnegie Mellon University.

Michael Jay Franklin is an American software entrepreneur and computer scientist specializing in distributed and streaming database technology. He is Liew Family Chair of Computer Science and chairman for the Department of Computer Science at the University of Chicago.

<span class="mw-page-title-main">Ren Ng</span>

Yi-Ren Ng is a Malaysian American scientist who is an associate professor in the Department of Electrical Engineering & Computer Sciences at the University of California, Berkeley. He was the founder, executive chairman and CEO of Lytro, a Mountain View, California-based startup company. Lytro was developing consumer light-field cameras based on Ng's graduate research at Stanford University. Lytro ceased operations in late March 2018.

<span class="mw-page-title-main">Constantinos Daskalakis</span> Greek computer scientist

Constantinos Daskalakis is a Greek theoretical computer scientist. He is a professor at MIT's Electrical Engineering and Computer Science department and a member of the MIT Computer Science and Artificial Intelligence Laboratory. He was awarded the Rolf Nevanlinna Prize and the Grace Murray Hopper Award in 2018.

<span class="mw-page-title-main">Ion Stoica</span> Romanian–American computer scientist

Ion Stoica is a Romanian–American computer scientist specializing in distributed systems, cloud computing and computer networking. He is a professor of computer science at the University of California, Berkeley and co-director of AMPLab. He co-founded Conviva and Databricks with other original developers of Apache Spark.

<span class="mw-page-title-main">Apache Spark</span> Open-source data analytics cluster computing framework

Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming clusters with implicit data parallelism and fault tolerance. Originally developed at the University of California, Berkeley's AMPLab, the Spark codebase was later donated to the Apache Software Foundation, which has maintained it since.

<span class="mw-page-title-main">Databricks</span> American software company

Databricks, Inc. is a global data, analytics and artificial intelligence company founded by the original creators of Apache Spark.

<span class="mw-page-title-main">Apache Mesos</span> Software to manage computer clusters

Apache Mesos is an open-source project to manage computer clusters. It was developed at the University of California, Berkeley.

The ACM SIGOPS Mark Weiser Award is awarded to an individual who has shown creativity and innovation in operating system research. The recipients began their career no earlier than 20 years prior to nomination. The special-interest-group-level award was created in 2001 and is named after Mark Weiser, the father of ubiquitous computing.

<span class="mw-page-title-main">Ali Ghodsi</span> Swedish computer scientist

Ali Ghodsi is an Swedish computer scientist and entrepreneur specializing in distributed systems and big data. He is a co-founder and CEO of Databricks and an adjunct professor at UC Berkeley. He coauthored several influential papers, including Apache Mesos and Apache Spark SQL.

Sylvia Ratnasamy is a Belgian–Indian computer scientist. She is best known as one of the inventors of the distributed hash table (DHT). Her doctoral dissertation proposed the content-addressable networks, one of the original DHTs, and she received the ACM Grace Murray Hopper Award in 2014 for this work. She is currently a professor at the University of California, Berkeley.

AMPLAB was a University of California, Berkeley lab focused on big data analytics located in Soda Hall. The name stands for the Algorithms, Machines and People Lab. It has been publishing papers since 2008 and was officially launched in 2011. The AMPLab was co-directed by Professor Michael J. Franklin, Michael I. Jordan, and Ion Stoica.

Reza Zadeh is an American-Canadian-Iranian computer scientist and technology executive working on machine learning. He is adjunct professor at Stanford University and CEO of Matroid. He has served on the technical advisory boards of Databricks and Microsoft. His work focuses on machine learning, distributed computing, and discrete applied mathematics.

Reynold Xin is a computer scientist and engineer specializing in big data, distributed systems, and cloud computing. He is a co-founder and Chief Architect of Databricks. He is best known for his work on Apache Spark, a leading open-source Big Data project. He was designer and lead developer of the GraphX, Project Tungsten, and Structured Streaming components and he co-designed DataFrames, all of which are part of the core Apache Spark distribution; he also served as the release manager for Spark's 2.0 release.

The ACM Doctoral Dissertation Award is awarded annually by the Association for Computing Machinery to the authors of the best doctoral dissertations in computer science and computer engineering. The award is accompanied by a prize of US$20,000 and winning dissertations are published in the ACM Digital Library. Honorable mentions are awarded $10,000. Financial support is provided by Google. The number of awarded dissertations may vary year-to-year.

<span class="mw-page-title-main">Jelani Nelson</span> American computer scientist (born 1984)

Jelani Osei Nelson is an Ethiopian-American Professor of Electrical Engineering and Computer Sciences at the University of California, Berkeley. He won the 2014 Presidential Early Career Award for Scientists and Engineers. Nelson is the creator of AddisCoder, a computer science summer program for Ethiopian high school students in Addis Ababa.

Boon Thau Loo is a Singaporean-American computer scientist, college administrator, and technology entrepreneur. He is currently the RCA professor in the Computer and Information Science department at the University of Pennsylvania where he leads a research lab working on distributed systems, and serves as the Associate Dean for Graduate Programs at the University of Pennsylvania School of Engineering and Applied Science.

Haoyuan (H.Y.) Li is a computer scientist and entrepreneur specializing in distributed systems, big data, and cloud computing. He is best known for proposing Virtual Distributed File System (VDFS), and creating an open-source data orchestration system, Alluxio. He is the Founder, Chairman, and CEO of Alluxio, Inc, a company commercializing the Alluxio Data Orchestration Technology. He is also an adjunct professor at Peking University. He is a frequent speaker on the topic of AI, Big Data, Cloud Computing, and Open Source at conferences.

Mosharaf Chowdhury is a Bangladeshi-American computer scientist known for his contributions to the fields of computer networking and large-scale systems for emerging machine learning and big data workloads. He is an Associate Professor of Computer Science and Engineering at the University of Michigan, Ann Arbor and leads SymbioticLab. He is the creator of coflow and the co-creator of Apache Spark.

References

  1. Fiscutean, Andrada (August 20, 2019). "Why the US has lost to Russia in these top coding trials for almost a decade". ZDNet.
  2. 1 2 "Meet the 'nerdiest rock star': Matei Zaharia co-creator of Apache Spark | Computing". computing.co.uk. 2015-10-29. Retrieved 2019-12-03.
  3. Piatetsky, Gregory (May 2015). "Exclusive Interview: Matei Zaharia, creator of Apache Spark, on Spark, Hadoop, Flink, and Big Data in 2020".
  4. "Cei mai bogaţi oameni din lume în 2022. Şase români în topul Forbes". Adevărul (in Romanian). 6 April 2022.
  5. 1 2 Iyer, Kavya (July 26, 2019). "Twelve Stanford researchers receive Presidential Early Career Award for Scientists and Engineers". Stanford Daily.
  6. Zaharia, Matei. "Programming Contest Resources". cs.stanford.edu. Retrieved 2020-04-22.
  7. "The Story of 0 A.D." Play0ad.
  8. "Fort Wars Overview".
  9. Woodie, Alex (March 8, 2019). "A Decade Later, Apache Spark Still Going Strong". Datanami.
  10. "Matei Zaharia receives ACM Doctoral Dissertation award". MIT EECS. April 28, 2015. Archived from the original on 2015-07-09.
  11. Brust, Andrew (June 6, 2019). "AI gets rigorous: Databricks announces MLflow 1.0". ZDNet.
  12. Anadiotis, George. "Unifying cloud storage and data warehouses: Delta Lake project hosted by the Linux Foundation". ZDNet. Retrieved 2019-12-03.
  13. Woodie, Alex (2019-12-02). "Will Databricks Build the First Enterprise AI Platform?". Datanami. Retrieved 2019-12-03.