Joseph M. Hellerstein

Born: 7 June 1968 (age 56) [1]
Alma mater: Harvard University (BA); University of California, Berkeley (MS); University of Wisconsin–Madison (PhD)
Scientific career
Fields: Computer science
Institutions: University of California, Berkeley
Doctoral advisors: Jeffrey Naughton; Michael Stonebraker
Doctoral students: Sam Madden; Boon Thau Loo
Website: db.cs.berkeley.edu/jmh

Joseph M. Hellerstein (born 7 June 1968) [1] is an American professor of computer science at the University of California, Berkeley, where he works on database systems and computer networks. In 2012 he co-founded Trifacta with Jeffrey Heer and Sean Kandel; the company grew out of their research project, Wrangler. [2]

Education

Hellerstein attended Harvard University from 1986 to 1990 (AB in computer science) and pursued his master's degree in computer science at the University of California, Berkeley from 1991 to 1992. He received his Ph.D., also in computer science, from the University of Wisconsin–Madison in 1995, [3] for a thesis on query optimization supervised by Jeffrey Naughton and Michael Stonebraker.

Research

Hellerstein has made contributions to many areas of database systems, such as ad-hoc sensor networks, [4] [5] adaptive query processing, [6] approximate query processing and online aggregation, [7] declarative networking, and data stream processing. [8]

Awards and recognition

Hellerstein's work has been recognized with an Alfred P. Sloan Fellowship, inclusion in MIT Technology Review's inaugural TR100 list and its TR10 list, [9] a place on Fortune's list of the 50 smartest people in tech, [10] and three ACM SIGMOD "Test of Time" awards. [11] He has been a Fellow of the Association for Computing Machinery since 2009. [12]

Related Research Articles

Jim Gray (computer scientist): American computer scientist

James Nicholas Gray was an American computer scientist who received the Turing Award in 1998 "for seminal contributions to database and transaction processing research and technical leadership in system implementation".

R-tree: Data structures used in spatial indexing

R-trees are tree data structures used for spatial access methods, i.e., for indexing multi-dimensional information such as geographical coordinates, rectangles or polygons. The R-tree was proposed by Antonin Guttman in 1984 and has found significant use in both theoretical and applied contexts. A common real-world usage for an R-tree might be to store spatial objects such as restaurant locations or the polygons that typical maps are made of: streets, buildings, outlines of lakes, coastlines, etc. and then find answers quickly to queries such as "Find all museums within 2 km of my current location", "retrieve all road segments within 2 km of my location" or "find the nearest gas station". The R-tree can also accelerate nearest neighbor search for various distance metrics, including great-circle distance.

In computer programming contexts, a data cube is a multi-dimensional ("n-D") array of values. Typically, the term data cube is applied in contexts where these arrays are massively larger than the hosting computer's main memory; examples include multi-terabyte/petabyte data warehouses and time series of image data.

MonetDB: Open source column-oriented relational database management system

MonetDB is an open-source column-oriented relational database management system (RDBMS) originally developed at the Centrum Wiskunde & Informatica (CWI) in the Netherlands. It is designed to provide high performance on complex queries against large databases, such as combining tables with hundreds of columns and millions of rows. MonetDB has been applied in high-performance applications for online analytical processing, data mining, geographic information system (GIS), Resource Description Framework (RDF), text retrieval and sequence alignment processing.

M. Dale Skeen is an American computer scientist. He specializes in designing and implementing large-scale computing systems, distributed computing and database management systems.

A visual sensor network or smart camera network or intelligent camera network is a network of spatially distributed smart camera devices capable of processing, exchanging data and fusing images of a scene from a variety of viewpoints into some form more useful than the individual images. A visual sensor network may be a type of wireless sensor network, and much of the theory and application of the latter applies to the former. The network generally consists of the cameras themselves, which have some local image processing, communication and storage capabilities, and possibly one or more central computers, where image data from multiple cameras is further processed and fused. Visual sensor networks also provide some high-level services to the user so that the large amount of data can be distilled into information of interest using specific queries.

Ronald Fagin: American mathematician and computer scientist

Ronald Fagin is an American mathematician and computer scientist, and IBM Fellow at the IBM Almaden Research Center. He is known for his work in database theory, finite model theory, and reasoning about knowledge.

Oscar Peter Buneman is a British computer scientist who works in the areas of database systems and database theory.

Georg Gottlob: Austrian computer scientist

Georg Gottlob FRS is an Austrian-Italian computer scientist who works in the areas of database theory, logic, and artificial intelligence and is Professor of Informatics at the University of Calabria. He was Professor at the University of Oxford.

Samuel Madden (computer scientist): American computer scientist

Samuel R. Madden is an American computer scientist specializing in database management systems. He is currently a professor of computer science at the Massachusetts Institute of Technology.

Michael Stonebraker: American computer scientist (born 1943)

Michael Ralph Stonebraker is an American computer scientist specializing in database systems. Through a series of academic prototypes and commercial startups, Stonebraker's research and products are central to many relational databases. He is also the founder of many database companies, including Ingres Corporation, Illustra, Paradigm4, StreamBase Systems, Tamr, Vertica and VoltDB, and served as chief technical officer of Informix. For his contributions to database research, Stonebraker received the 2014 Turing Award, often described as "the Nobel Prize for computing."

Tomasz Imieliński: Polish-American computer scientist (born 1954)

Tomasz Imieliński is a Polish-American computer scientist, most known in the areas of data mining, mobile computing, data extraction, and search engine technology. He is currently a professor of computer science at Rutgers University in New Jersey, United States.

Hari Balakrishnan is the Fujitsu Professor of Computer Science and Artificial Intelligence in the Department of Electrical Engineering and Computer Science at MIT, and the Co-founder and CTO at Cambridge Mobile Telematics.

Online aggregation is a technique for improving the interactive behavior of database systems processing expensive analytical queries. Almost all database operations are performed in batch mode, i.e. the user issues a query and waits until the database has finished processing the entire query. In contrast, with online aggregation the user receives estimates of an aggregate query as soon as the query is issued, and the estimates are refined while processing continues. For example, if the final answer is 1000, after k seconds the user might get an estimate in the form of a confidence interval such as [990, 1020] with 95% probability. The interval keeps shrinking as the system processes more samples.
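
The idea can be illustrated with a minimal sketch. This is not the algorithm from the cited paper or any database system's API; it is a simplified illustration that assumes the rows fit in a Python list, visits them in random order, and reports a running estimate of SUM with a normal-approximation confidence interval. The function name `online_sum_estimates` and its parameters are hypothetical.

```python
import math
import random


def online_sum_estimates(values, batch_size=100, z=1.96, seed=0):
    """Yield (rows_seen, estimate, low, high) for SUM(values) while scanning.

    Rows are visited in random order, so the rows seen so far form a uniform
    sample without replacement. The interval is a normal approximation
    (z=1.96 corresponds to roughly 95% confidence), an assumption of this sketch.
    """
    rng = random.Random(seed)
    order = list(range(len(values)))
    rng.shuffle(order)

    n_total = len(values)
    count, mean, m2 = 0, 0.0, 0.0  # Welford running mean and squared deviations
    for idx in order:
        x = values[idx]
        count += 1
        delta = x - mean
        mean += delta / count
        m2 += delta * (x - mean)
        if count % batch_size == 0 or count == n_total:
            variance = m2 / (count - 1) if count > 1 else 0.0
            # Finite population correction: the interval narrows to zero
            # width once every row has been read.
            fpc = (n_total - count) / n_total
            half_width = z * n_total * math.sqrt(variance / count * fpc)
            estimate = n_total * mean
            yield count, estimate, estimate - half_width, estimate + half_width


if __name__ == "__main__":
    data = [1.0] * 500 + [2.0] * 500
    for seen, est, low, high in online_sum_estimates(data, batch_size=250):
        print(f"rows={seen} estimate={est:.1f} interval=[{low:.1f}, {high:.1f}]")
```

Each yielded tuple corresponds to one refreshed estimate shown to the user; a caller could stop consuming the generator as soon as the interval is narrow enough for its purposes.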

A sensor network query processor (SNQP), also called a sensorDB, is a user-friendly interface for programming and running applications. It translates high-level instructions written in a declarative programming language into low-level instructions understood by the operating system. The basic idea of SNQP is the addition of a layer that models the wireless sensor network (WSN) as a distributed database searchable by a query language similar to SQL.

Laura M. Haas is an American computer scientist noted for her research in database systems and information integration. She is best known for creating systems and tools for the integration of heterogeneous data from diverse sources, including federated technology that virtualizes access to data, and mapping technology that enables non-programmers to specify how data should be integrated.

Sean Kandel

Sean Kandel is Trifacta's Chief Technical Officer and co-founder, along with Joseph M. Hellerstein and Jeffrey Heer. He is known for the development of new tools for data transformation and discovery and is the co-developer of Data Wrangler, an interactive tool for data cleaning and transformation.

Gautam Das (computer scientist): Indian computer scientist

Gautam Das is a computer scientist in the field of database research. He is an ACM Fellow and IEEE Fellow.

Tim Kraska is a German computer scientist specializing in data systems and the intersection of systems and machine learning. He is currently an associate professor of computer science at the Massachusetts Institute of Technology.

In computer science, macroprogramming is a programming paradigm aimed at expressing the macroscopic, global behaviour of an entire system of agents or computing devices. In macroprogramming, the local programs for the individual components of a distributed system are compiled or interpreted from a macro-program, typically expressed from a system-level perspective or in terms of the intended global goal. The aim of macroprogramming approaches is to support expressing the macroscopic interactive behaviour of a whole distributed system of computing devices or agents in a single program, or, similarly, to promote their collective intelligence. It should not be confused with macros, the mechanism often found in programming languages to express substitution rules for program pieces.

References

  1. Library of Congress (1998-07-06). "Hellerstein, Joseph M., 1968-". Library of Congress Name Authority File. Retrieved 2011-12-15 from http://id.loc.gov/authorities/names/n98044191.html.
  2. "Data Wrangler". vis.stanford.edu. Retrieved 2016-01-20.
  3. "Joseph M. Hellerstein". EECS. Retrieved 7 March 2021.
  4. Madden, S.; Franklin, M. J.; Hellerstein, J. M.; Hong, W. (2002). "TAG". ACM SIGOPS Operating Systems Review. 36: 131–146. doi:10.1145/844128.844142. S2CID 2003075.
  5. Madden, S.; Franklin, M. J.; Hellerstein, J. M.; Hong, W. (2003). "The design of an acquisitional query processor for sensor networks". Proceedings of the 2003 ACM SIGMOD international conference on Management of data - SIGMOD '03. p. 491. doi:10.1145/872757.872817. ISBN 158113634X. S2CID 1006062.
  6. Avnur, R.; Hellerstein, J. M. (2000). "Eddies". ACM SIGMOD Record. 29 (2): 261. doi:10.1145/335191.335420.
  7. Hellerstein, J. M.; Haas, P. J.; Wang, H. J. (1997). "Online aggregation". ACM SIGMOD Record. 26 (2): 171. doi:10.1145/253262.253291.
  8. Chandrasekaran, S.; Shah, M. A.; Cooper, O.; Deshpande, A.; Franklin, M. J.; Hellerstein, J. M.; Hong, W.; Krishnamurthy, S.; Madden, S. R.; Reiss, F. (2003). "TelegraphCQ". Proceedings of the 2003 ACM SIGMOD international conference on Management of data - SIGMOD '03. p. 668. doi:10.1145/872757.872857. ISBN 158113634X. S2CID 14965874.
  9. Naone, Erica. "TR10: Cloud Programming - MIT Technology Review". MIT Technology Review. Retrieved 2016-02-01.
  10. "The 50 smartest people in tech". Fortune. Retrieved 2016-02-01.
  11. Madden, Samuel; Franklin, Michael J.; Hellerstein, Joseph M.; Hong, Wei. "The Design of an Acquisitional Query Processor for Sensor Networks". 2013.
  12. "ACM Fellows: Joseph M Hellerstein". fellows.acm.org. Archived from the original on 8 September 2010.