Michael Stonebraker

Last updated
Michael Stonebraker
Michael Stonebraker P1120062.jpg
Michael Stonebraker giving the 2015 Turing lecture
Born (1943-10-11) October 11, 1943 (age 82)
Education Princeton University (BS)
University of Michigan (MS, PhD)
Known for Ingres, Postgres, Vertica, Streambase, Illustra, VoltDB, SciDB
SpouseBeth
Awards IEEE John von Neumann Medal (2005)
ACM Turing Award (2014)
Scientific career
FieldsComputer science
Institutions University of California, Berkeley
University of Michigan
Massachusetts Institute of Technology
Thesis The Reduction of Large Scale Markov Models for Random Chains
Doctoral advisor Arch Waugh Naylor
Notable students Joseph M. Hellerstein
Andy Pavlo
Clifford A. Lynch [2]
Margo Seltzer [2]
Dale Skeen [3]
Marti Hearst [4]
Leilani Battle [5]
Website csail.mit.edu/user/1547

Michael Ralph Stonebraker (born October 11, 1943 [6] ) is an American computer scientist specializing in database systems. Through a series of academic prototypes and commercial startups, Stonebraker's research and products are central to many relational databases. He is also the founder of many database companies, including Ingres Corporation, Illustra, Paradigm4, StreamBase Systems, Tamr, Vertica, VoltDB and Hopara, and served as chief technical officer of Informix. For his contributions to database research, Stonebraker received the 2014 Turing Award, often described as "the Nobel Prize for computing." [7]

Contents

Stonebraker's career can be broadly divided into two phases: his time at University of California, Berkeley when he focused on relational database management systems such as Ingres and Postgres, and, starting in 2001, at Massachusetts Institute of Technology (MIT) where he developed more novel data management techniques such as C-Store, H-Store, SciDB and DBOS. [8] Stonebraker is currently a professor emeritus at UC Berkeley and an adjunct professor at MIT's Computer Science and Artificial Intelligence Laboratory. [9] [10] He is also known as an editor for the book Readings in Database Systems.

Life

Stonebraker grew up in Milton Mills, New Hampshire. [11] He earned his B.S.E. in electrical engineering from Princeton University in 1965, and his M.S. and Ph.D. from the University of Michigan in 1967 and 1971 [12] respectively. His awards include the IEEE John von Neumann Medal and the first SIGMOD Edgar F. Codd Innovations Award. In 1994 he was inducted as a Fellow of the Association for Computing Machinery. [13] In 1997, he was elected a member of the National Academy of Engineering for the development and commercialization of relational and object-relational database systems. In March 2015 it was announced he won the 2014 ACM Turing Award. [7] In September 2015, he won the 2015 Commonwealth Award, chosen by council members of MassTLC. [14]

Ingres

In 1973, Stonebraker and his colleague Eugene Wong started researching relational database systems after reading a series of seminal papers published by Edgar F. Codd on the relational data model. [15]

Their project, known as Ingres (Interactive Graphics and Retrieval System), [16] was one of the first systems (along with System R from IBM) to demonstrate that it was possible to build a practical and efficient implementation of the relational model. A number of key ideas from INGRES are still widely used in relational systems, including the use of B-trees, primary-copy replication, the query rewrite approach to views and integrity constraints, and the idea of rules/triggers for integrity checking in an RDBMS. Additionally, much experimental work was done that provided insights into how to build a locking system that could provide satisfactory transaction performance. [17]

These included Stonebraker, who with fellow Berkeley professors Larry Rowe and Eugene Wong helped found Relational Technology, Inc., later called Ingres Corporation. Subsequently, sold to Computer Associates, Ingres was re-established as an independent company in 2005, and later renamed Actian. Other startups based on Ingres include Sybase, founded by Robert Epstein, a student on the project, and Britton Lee, Inc. Sybase's code was later used as a basis for Microsoft SQL Server. [18]

Postgres

After founding Relational Technology, Stonebraker and Rowe began a "post-Ingres" effort, to address the limitations of the relational model. The new project was named POSTGRES (POST inGRES). [19]

Mariposa and Cohera

After the Postgres project, Stonebraker initiated the Mariposa [20]

The MIT years (2001–present)

Aurora and StreamBase

In the Aurora Project, Stonebraker, along with colleagues from Brandeis University, Brown University, and MIT, focused on data management for streaming data, using a new data model and query language. Unlike relational systems, which "pull" data and process it a record at a time, in Aurora, data is "pushed", arriving asynchronously from external data sources (such as stock ticks, news feeds, or sensors.) The output is itself a stream of results (such as windowed averages) that are sent to users. [21]

C-Store and Vertica

In the C-Store project, started in 2005, Stonebraker, along with colleagues from Brandeis, Brown, MIT, and University of Massachusetts Boston, developed a parallel, shared-nothing column-oriented DBMS for data warehousing. By dividing and storing data in columns, C-Store is able to perform less I/O and get better compression ratios than conventional database systems that store data in rows. [22]

Stonebraker explained that it's because similar data items are side-by-side: Name,Name,Name,Name vs. Name,Address,Zip,Phone#. In 2005, Stonebraker co-founded Vertica to commercialize the technology behind C-Store. [23]

SciDB

In 2008, along with David DeWitt and researchers from Brown, MIT, Portland State University, SLAC, the University of Washington, and the University of Wisconsin–Madison, Stonebraker started SciDB [24] [25] an open-source DBMS specially designed for scientific research applications. [26]

He founded Paradigm4 with Marilyn Matz, who became CEO. Paradigm4 developed SciDB, used mostly by life sciences and financial markets. Novartis, Foundation Medicine, and the National Institutes of Health are some of the company's clients. [14] [27]

NoSQL

In 2010 and 2011, Stonebraker criticized the NoSQL movement. [28] [29] [30]

Selected works

References

  1. "Michael Stonebraker - A.M. Turing Award Winner" . Retrieved 2018-02-06.
  2. 1 2 "Ph.D. Dissertations | EECS at UC Berkeley". www2.eecs.berkeley.edu.
  3. Michael Stonebraker at the Mathematics Genealogy Project
  4. "Nice: or what it was like to be Mike's student" (PDF).
  5. Battle, Leilani Marie (2017). Behavior-driven optimization techniques for scalable data exploration (Thesis). Massachusetts Institute of Technology. hdl:1721.1/111853 . Retrieved 2023-12-27.
  6. "Contributors". IEEE Transactions on Systems, Man, and Cybernetics (4): 562–564. Sep 1972. doi:10.1109/TSMC.1972.4309174.
  7. 1 2 Conner-Simons, Adam (March 25, 2015). "Michael Stonebraker wins $1 million Turing Award". MIT News. Massachusetts Institute of Technology. Retrieved March 25, 2015.
  8. "Postgres pioneer Michael Stonebraker promises to upend the database once more". www.theregister.com. Retrieved 2023-12-27.
  9. "Michael Stonebraker". www2.eecs.berkeley.edu. Retrieved 2018-03-16.
  10. "Michael Stonebraker | MIT CSAIL". www.csail.mit.edu. Retrieved 2018-03-16.
  11. Oral History of Michael Stonebraker; 2012-08-23 Retrieved 2018-08-26.
  12. Stonebraker, Michael Ralph (1971). The Reduction of Large Scale Markov Models for Random Chains (PhD thesis). University of Michigan. OCLC   634008426. ProQuest   302585708.
  13. "Michael Ralph Stonebraker - ACM author profile page" . Retrieved 2011-07-27.
  14. 1 2 Geller, Jessica. "PTC Chief Heppelman named CEO of the year by Mass. tech council." betaBoston. The Boston Globe. Sept. 16, 2015 Archived 2016-01-07 at the Wayback Machine
  15. Codd, E. F. (1970). "A relational model of data for large shared data banks" (PDF). Communications of the ACM. 13 (6): 377–387. doi:10.1145/362384.362685. S2CID   207549016.
  16. Stonebraker, M.; Held, G.; Wong, E.; Kreps, P. (1976). "The design and implementation of INGRES". ACM Transactions on Database Systems. 1 (3): 189. CiteSeerX   10.1.1.109.957 . doi:10.1145/320473.320476. S2CID   1514658.
  17. "Relational Roots". Joseph Hellerstein. 1998. Retrieved 2009-11-24.
  18. "Motivation & DBMS Architecture Overview". Joseph Hellerstein. 1998. Retrieved 2009-11-24.
  19. Stonebraker, M.; Rowe, L. A. (1986). "The design of POSTGRES". ACM SIGMOD Record. 15 (2): 340. doi: 10.1145/16856.16888 .
  20. Stonebraker, M.; Aoki, P. M.; Litwin, W.; Pfeffer, A.; Sah, A.; Sidell, J.; Staelin, C.; Yu, A. (1996). "Mariposa: A wide-area distributed database system". The VLDB Journal the International Journal on Very Large Data Bases. 5: 48–63. CiteSeerX   10.1.1.68.5480 . doi:10.1007/s007780050015. S2CID   5062284.
  21. Abadi, D. J.; Carney, D.; Etintemel, U.; Cherniack, M.; Convey, C.; Lee, S.; Stonebraker, M.; Tatbul, N.; Zdonik, S. (2003). "Aurora: A new model and architecture for data stream management". The VLDB Journal the International Journal on Very Large Data Bases. 12 (2): 120. CiteSeerX   10.1.1.6.1187 . doi:10.1007/s00778-003-0095-z. S2CID   8101432.
  22. (Print edition title: Database Pioneer Rethinks How Data is Organized.Charles Babcock (February 21, 2008). "Database Pioneer Rethinks The Best Way To Organize Data". InformationWeek .
  23. "The Vertica Analytic Database: C-Store 7 Years Later" (PDF)" (PDF). VLDB.org. August 28, 2012.
  24. Brown, P. G. (2010). "Overview of sciDB". Proceedings of the 2010 international conference on Management of data - SIGMOD '10. pp. 963–968. doi:10.1145/1807167.1807271. ISBN   9781450300322. S2CID   14544985.
  25. Stonebraker, M.; Brown, P.; Poliakov, A.; Raman, S. (2011). "The Architecture of SciDB". Scientific and Statistical Database Management. Lecture Notes in Computer Science. Vol. 6809. pp. 1–16. doi:10.1007/978-3-642-22351-8_1. ISBN   978-3-642-22350-1.
  26. "SciDB: Relational daddy answers Google, Hadoop, NoSQL". The Register. 2010-09-13. Retrieved 2012-01-11.
  27. Alspach, Kyle. "New Money: MassChallenge Alum Gets Dorm Room Fund Investment; Drone Co. Raises Seed Round." BostInno. Nov. 30, 2015 Archived 2016-02-07 at the Wayback Machine
  28. Stonebraker, M. (2010). "SQL databases v. NoSQL databases". Communications of the ACM. 53 (4): 10–11. doi:10.1145/1721654.1721659. S2CID   13959501.
  29. Stonebraker, M. (2011). "Stonebraker on NoSQL and enterprises". Communications of the ACM. 54 (8): 10–11. doi:10.1145/1978542.1978546. S2CID   36572502.
  30. Stonebraker, M.; Abadi, D.; Dewitt, D. J.; Madden, S.; Paulson, E.; Pavlo, A.; Rasin, A. (2010). "MapReduce and parallel DBMSs". Communications of the ACM. 53: 64–71. doi:10.1145/1629175.1629197. S2CID   61484899.