Daniel Abadi | |
---|---|
Education |
|
Occupation | Professor of Computer Science at University of Maryland, College Park [1] |
Awards | Sloan Fellowship (2011) ACM Fellow (2020) |
Scientific career | |
Fields | Computer Science |
Institutions | Yale University University of Maryland, College Park |
Thesis | Query Execution in Column-Oriented Database Systems (2008) |
Doctoral advisor | Samuel Madden |
Website | www |
Daniel Abadi is the Darnell-Kanal Professor of Computer Science at University of Maryland, College Park. [2] His primary area of research is database systems, with contributions to stream databases, distributed databases, graph databases, and column-store databases. [3] He helped create C-Store, a column-oriented database, and HadoopDB, a hybrid of relational databases and Hadoop. Both database systems were commercialized by companies.
Abadi was the first to describe the PACELC theorem in a 2010 blog post. PACELC, a response to the CAP theorem, was proved formally in 2018 in a SIGACT News article. [4]
Abadi obtained a Bachelor of Science degree in Computer Science and Neuroscience from Brandeis University in 2002. A year later, he graduated from Cambridge University with a master's degree in Computer Speech, Text, and Internet Technology. He then pursued a PhD at Massachusetts Institute of Technology, where he was advised by Samuel Madden. [5] At MIT, Abadi collaborated with several researchers to propose C-Store, a column-oriented database. C-Store was commercialized by Vertica and eventually acquired by Hewlett-Packard. [6] Abadi obtained his PhD degree in 2007, writing a dissertation titled Query Execution in Column-Oriented Database Systems. [7] [5]
He became an assistant professor at Yale University in 2007 and subsequently an associate professor in 2012. [7] In 2010, a company named Hadapt commercialized his research on HadoopDB, a hybrid of relational databases and Hadoop. [6] Hadapt was acquired by Teradata in 2014. [8]
In 2017, he joined University of Maryland, College Park as the Darnell/Kanal Professor in Computer Science. [7]
Abadi's 2008 dissertation Query Execution in Column-Oriented Database Systems received a SIGMOD Jim Gray Doctoral Dissertation Award in 2009. Two PhD students advised by him, Alexander Thomson and Jose Faleiro, also received this award for their dissertations. [9]
He received a NSF CAREER award in 2009 and a Sloan Fellowship in 2011. [10] [11]
Abadi received VLDB's best paper award in 2007 for Scalable Semantic Web Data Management Using Vertical Partitioning and test of time award in 2015 and 2019 for C-Store: A Column-oriented DBMS and HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads, respectively. [12] [13]
He was selected as an ACM Fellow in 2020 "for contributions to stream databases, distributed databases, graph databases, and column-store databases". [3]
MonetDB is an open-source column-oriented relational database management system (RDBMS) originally developed at the Centrum Wiskunde & Informatica (CWI) in the Netherlands. It is designed to provide high performance on complex queries against large databases, such as combining tables with hundreds of columns and millions of rows. MonetDB has been applied in high-performance applications for online analytical processing, data mining, geographic information system (GIS), Resource Description Framework (RDF), text retrieval and sequence alignment processing.
SIGMOD is the Association for Computing Machinery's Special Interest Group on Management of Data, which specializes in large-scale data management problems and databases.
A column-oriented DBMS or columnar DBMS is a database management system (DBMS) that stores data tables by column rather than by row. Benefits include more efficient access to data when only querying a subset of columns, and more options for data compression. However, they are typically less efficient for inserting new data.
Georg Gottlob FRS is an Austrian-Italian computer scientist who works in the areas of database theory, logic, and artificial intelligence and is Professor of Informatics at the University of Calabria. He was Professor at the University of Oxford.
Samuel R. Madden is an American computer scientist specializing in database management systems. He is currently a professor of computer science at the Massachusetts Institute of Technology.
Greenplum is a big data technology based on MPP architecture and the Postgres open source database technology. The technology was created by a company of the same name headquartered in San Mateo, California around 2005. Greenplum was acquired by EMC Corporation in July 2010.
Michael Ralph Stonebraker is a computer scientist specializing in database systems. Through a series of academic prototypes and commercial startups, Stonebraker's research and products are central to many relational databases. He is also the founder of many database companies, including Ingres Corporation, Illustra, Paradigm4, StreamBase Systems, Tamr, Vertica and VoltDB, and served as chief technical officer of Informix. For his contributions to database research, Stonebraker received the 2014 Turing Award, often described as "the Nobel Prize for computing."
Vertica is an analytic database management software company. Vertica was founded in 2005 by the database researcher Michael Stonebraker with Andrew Palmer as the founding CEO. Ralph Breslauer and Christopher P. Lynch served as CEOs later on.
Tomasz Imieliński is a Polish-American computer scientist, most known in the areas of data mining, mobile computing, data extraction, and search engine technology. He is currently a professor of computer science at Rutgers University in New Jersey, United States.
Philip Alan Bernstein is a computer scientist specializing in database research in the Database Group of Microsoft Research. Bernstein is also an affiliate professor at the University of Washington and frequent committee member or chair of conferences such as VLDB and SIGMOD. He won the SIGMOD Edgar F. Codd Innovations Award in 1994, and in 2011 with Jayant Madhavan and Erhard Rahm the VLDB 10 Year Best Paper Award for their VLDB 2001 paper "Generic Schema Matching with Cupid".
Actian Vector is an SQL relational database management system designed for high performance in analytical database applications. It published record breaking results on the Transaction Processing Performance Council's TPC-H benchmark for database sizes of 100 GB, 300 GB, 1 TB and 3 TB on non-clustered hardware.
NewSQL is a class of relational database management systems that seek to provide the scalability of NoSQL systems for online transaction processing (OLTP) workloads while maintaining the ACID guarantees of a traditional database system.
Stefano Ceri is an Italian computer engineer and professor of database management at Politecnico di Milano. He has been visiting professor at Stanford University between 1983 and 1990, and received the ACM SIGMOD Edward Codd Innovations Award in 2013.
Martin L. Kersten was a computer scientist with research focus on database architectures, query optimization and their use in scientific databases. He was an architect of the MonetDB system, an open-source column store for data warehouses, online analytical processing (OLAP) and geographic information systems (GIS). He has been (co-) founder of several successful spin-offs of the Centrum Wiskunde & Informatica (CWI).
Presto is a distributed query engine for big data using the SQL query language. Its architecture allows users to query data sources such as Hadoop, Cassandra, Kafka, AWS S3, Alluxio, MySQL, MongoDB and Teradata, and allows use of multiple data sources within a query. Presto is community-driven open-source software released under the Apache License.
In theoretical computer science, the PACELC theorem is an extension to the CAP theorem. It states that in case of network partitioning (P) in a distributed computer system, one has to choose between availability (A) and consistency (C), but else (E), even when the system is running normally in the absence of partitions, one has to choose between latency (L) and loss of consistency (C).
Peter Boncz is a Dutch computer scientist specializing in database systems. He is a researcher at the Centrum Wiskunde & Informatica and professor at the Vrije Universiteit Amsterdam in the special chair of Large-Scale Analytical Data Management.
Zehra Meral Özsoyoglu is a Turkish-American computer scientist specializing in databases, including research on query languages, database model, and indexes, and applications of databases in science, bioinformatics, and medical informatics. She is the Andrew R. Jennings Professor Emeritus of Computer Science at Case Western Reserve University.
Witold Lipski Jr. was a Polish computer scientist, and an author of two books: Combinatorics for Programmers and (jointly with Wiktor Marek Combinatorial analysis. Lipski, jointly with his PhD student, Tomasz Imieliński, created foundations of the theory of incomplete information in relational databases.
Tim Kraska is a German computer scientist specializing in data systems and the intersection of systems and machine learning. He is currently an associate professor of computer science at the Massachusetts Institute of Technology.