Daniel Abadi

Occupation Professor of Computer Science at the University of Maryland, College Park [1]
Awards Sloan Fellowship (2011)
ACM Fellow (2020)
Scientific career
Fields Computer Science
Institutions Yale University
University of Maryland, College Park
Thesis Query Execution in Column-Oriented Database Systems (2008)
Doctoral advisor Samuel Madden
Website www.cs.umd.edu/~abadi/

Daniel Abadi is the Darnell-Kanal Professor of Computer Science at the University of Maryland, College Park. [2] His primary area of research is database systems, with contributions to stream databases, distributed databases, graph databases, and column-store databases. [3] He helped create C-Store, a column-oriented database, and HadoopDB, a hybrid of relational databases and Hadoop. Both systems were later commercialized, C-Store by Vertica and HadoopDB by Hadapt.

Abadi first described the PACELC theorem in a 2010 blog post. PACELC, a response to the CAP theorem, was formally proved in a 2018 SIGACT News article. [4]

Education and career

Abadi obtained a Bachelor of Science degree in Computer Science and Neuroscience from Brandeis University in 2002. A year later, he graduated from Cambridge University with a master's degree in Computer Speech, Text, and Internet Technology. He then pursued a PhD at the Massachusetts Institute of Technology, where he was advised by Samuel Madden. [5] At MIT, Abadi collaborated with several researchers to propose C-Store, a column-oriented database. C-Store was commercialized by Vertica, which was eventually acquired by Hewlett-Packard. [6] Abadi obtained his PhD degree in 2007, writing a dissertation titled Query Execution in Column-Oriented Database Systems. [7] [5]

He became an assistant professor at Yale University in 2007 and subsequently an associate professor in 2012. [7] In 2010, a company named Hadapt commercialized his research on HadoopDB, a hybrid of relational databases and Hadoop. [6] Hadapt was acquired by Teradata in 2014. [8]

In 2017, he joined the University of Maryland, College Park as the Darnell-Kanal Professor of Computer Science. [7]

Awards and recognitions

Abadi's 2008 dissertation, Query Execution in Column-Oriented Database Systems, received the SIGMOD Jim Gray Doctoral Dissertation Award in 2009. Two PhD students he advised, Alexander Thomson and Jose Faleiro, also received this award for their dissertations. [9]

He received an NSF CAREER award in 2009 and a Sloan Fellowship in 2011. [10] [11]

Abadi received VLDB's best paper award in 2007 for Scalable Semantic Web Data Management Using Vertical Partitioning. He also received the VLDB Test of Time Award twice: in 2015 for C-Store: A Column-oriented DBMS and in 2019 for HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads. [12] [13]

He was selected as an ACM Fellow in 2020 "for contributions to stream databases, distributed databases, graph databases, and column-store databases". [3]

References

  1. "Daniel Abadi". www.cs.umd.edu.
  2. "Daniel Abadi named Darnell-Kanal Professor of Computer Science". www.cs.umd.edu. 2017-01-26. Retrieved 2021-04-12.
  3. "2020 ACM Fellows Recognized for Work that Underpins Today's Computing Innovations". Association for Computing Machinery. Retrieved 2021-04-12.
  4. Golab, Wojciech (2018). "Proving PACELC". ACM SIGACT News. 49: 73–81. doi:10.1145/3197406.3197420. S2CID 3989621.
  5. Abadi, Daniel (2008). Query execution in column-oriented database systems (PhD thesis). Massachusetts Institute of Technology.
  6. Yu, Sherwin (2012-03-14). "Hadapt: Yale Startup – Yale Scientific Magazine". Yale Scientific Magazine. Retrieved 2023-11-20.
  7. "College Welcomes 15 New Faculty Members this Fall". University of Maryland College of Computer, Mathematical, and Natural Sciences. 2017-09-29. Retrieved 2023-11-20.
  8. Novet, Jordan (2014-07-22). "Teradata grabs startups Hadapt & Revelytix that make Hadoop easier". VentureBeat. Retrieved 2023-11-20.
  9. "SIGMOD Jim Gray Doctoral Dissertation Award". SIGMOD Website. Retrieved 2023-11-17.
  10. "CAREER: Architecting A Database Management System for Semantic Web Data". National Science Foundation. Retrieved 2023-11-17.
  11. "Fellows Database". Alfred P. Sloan Foundation. Retrieved 2023-11-17.
  12. "VLDB 2007 Best Paper Awards". Very Large Databases Endowment. Retrieved 2023-11-17.
  13. "VLDB Test of Time Award". www.vldb.org. Retrieved 2021-04-12.