Johannes Gehrke

Last updated

Johannes Gehrke
NationalityGerman
Education
Known forResearch on database systems, data science, and data privacy
Awards
Scientific career
Fields Computer science
Institutions
Academic advisors Raghu Ramakrishnan
Website www.cs.cornell.edu/johannes/ , www.microsoft.com/en-us/research/people/johannes/

Johannes Gehrke is a Technical Fellow at Microsoft focusing on AI. [1] He is an ACM Fellow, an IEEE Fellow, and he received the 2011 IEEE Computer Society Technical Achievement Award and the 2021 ACM SIGKDD Innovation Award. From 1999 to 2015, he was a faculty member in the Department of Computer Science at Cornell University, where at the time of his leaving he was the Tisch University Professor of Computer Science. [2]

Contents

Gehrke is best known for his contributions to database systems, data mining, and data privacy. He developed some of the fastest data mining algorithms for frequent pattern mining, sequential pattern mining, and decision tree construction and one of the first sensor network query processors which pioneered in-network query processing for wireless sensor networks, and he is known for his work on data privacy. His work on data privacy resulted in a new version of OnTheMap published by the US Census Bureau, the very first public data product published by any official government agency in the world with provable privacy guarantees (using a variant of Differential Privacy). [3]

Education

Johannes Gehrke studied from 1990 to 1993 computer science at the Karlsruhe Institute of Technology; he received an M.S. degree from the Department of Computer Science at the University of Texas at Austin in 1995 and a PhD from the University of Wisconsin, Madison in 1999 for a thesis in data mining.

Career

From 1999 to 2015, Gehrke was a professor in the Department of Computer Science at Cornell University. His research group was popularly known as the Big Red Data Group, and he graduated 25 PhD students. From 2005 to 2008, he was Chief Scientist at Fast Search and Transfer. He has been in product groups at Microsoft since 2012, first building Delve and the Office Graph, then building people and feed experiences across all of Microsoft 365, and then serving as chief architect and head of AI of the Microsoft Teams backend. From 2020 to 2023, he had a dual role across research and product, managing all of Microsoft Research in Redmond and CTO and head of AI for the Microsoft Teams backend.

Gehrke received a National Science Foundation Career Award, [4] a Sloan Research Fellowship, [5] and a Humboldt Research Award. In 2011, he received the IEEE Computer Society Technical Achievement Award [6] and a Blavatnik Award for Young Scientists. [7] In 2014, he became a Fellow of the Association for Computing Machinery, [8] and in 2020 he was elected an IEEE Fellow. [9] In 2021, he received the ACM SIGKDD Innovation Award, [10] which "recognizes individuals for their outstanding technical contributions to the field of knowledge discovery in data and data mining that have had lasting impact in furthering the theory and/or development of commercial systems." [11]

Books

Since its second edition, Gehrke has been a co-author of one of the main textbooks on database systems, commonly known as the Cow Book. [12]

Related Research Articles

SIGKDD, representing the Association for Computing Machinery's (ACM) Special Interest Group (SIG) on Knowledge Discovery and Data Mining, hosts an influential annual conference.

Jiawei Han is a Chinese-American computer scientist and writer. He currently holds the position of Michael Aiken Chair Professor in the Department of Computer Science at the University of Illinois at Urbana-Champaign. His research focuses on data mining, text mining, database systems, information networks, data mining from spatiotemporal data, Web data, and social/information network data.

<span class="mw-page-title-main">Osmar R. Zaiane</span>

Osmar R. Zaiane is a researcher, computer scientist, professor at the University of Alberta specializing in data mining and machine learning. He was the secretary treasurer of the Association for Computing Machinery (ACM) Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD) from 2009 to 2012 and treasurer of the ACM Special Interest Group on Health Informatics. He served as the editor-in-chief of the SIGKDD Explorations publication from 2008 to 2010. He was also the associate editor of the same publication from 2004 to 2007.

Hans-Peter Kriegel is a German computer scientist and professor at the Ludwig Maximilian University of Munich and leading the Database Systems Group in the Department of Computer Science. He was previously professor at the University of Würzburg and the University of Bremen after habilitation at the Technical University of Dortmund and doctorate from Karlsruhe Institute of Technology.

<span class="mw-page-title-main">Foster Provost</span> American computer scientist

Foster Provost is an American computer scientist, information systems researcher, and Professor of Data Science, Professor of Information Systems and Ira Rennert Professor of Entrepreneurship at New York University's Stern School of Business. He is also the Director for the Data Science and AI Initiative at Stern's Fubon Center for Technology, Business and Innovation. Professor Provost has a Bachelor of Science from Duquesne University in physics and mathematics and a Master of Science and Ph.D. in computer science from the University of Pittsburgh.

<span class="mw-page-title-main">Usama Fayyad</span> American computer scientist

Usama M. Fayyad is an American-Jordanian data scientist and co-founder of KDD conferences and ACM SIGKDD association for Knowledge Discovery and Data Mining. He is a speaker on Business Analytics, Data Mining, Data Science, and Big Data. He recently left his role as the Chief Data Officer at Barclays Bank.

Philip S. Yu is an American computer scientist and professor of information technology at the University of Illinois at Chicago. He is a prolific author, holds over 300 patents, and is known for his work in the field of data mining.

<span class="mw-page-title-main">Gregory Piatetsky-Shapiro</span> American computer scientist

Gregory I. Piatetsky-Shapiro is a data scientist and the co-founder of the KDD conferences, and co-founder and past chair of the Association for Computing Machinery SIGKDD group for Knowledge Discovery, Data Mining and Data Science. He is the founder and president of KDnuggets, a discussion and learning website for Business Analytics, Data Mining and Data Science.

Latifur Khan joined the University of Texas at Dallas in 2000, where he has been conducting research and teaching as a Professor in the Department of Computer Science.

Geoffrey J. Gordon is a professor at the Machine Learning Department at Carnegie Mellon University in Pittsburgh and director of research at the Microsoft Montréal lab. He is known for his research in statistical relational learning and on anytime dynamic variants of the A* search algorithm. His research interests include multi-agent planning, reinforcement learning, decision-theoretic planning, statistical models of difficult data, computational learning theory, and game theory.

<span class="mw-page-title-main">Gautam Das (computer scientist)</span> Indian computer scientist

Gautam Das is a computer scientist in the field of databases research. He is an ACM Fellow and IEEE Fellow.

Rediet Abebe is an Ethiopian computer scientist working in algorithms and artificial intelligence. She is an assistant professor of computer science at the University of California, Berkeley. Previously, she was a Junior Fellow at the Harvard Society of Fellows.

<span class="mw-page-title-main">Hui Xiong</span> Chinese data scientist

Hui Xiong is a data scientist. He is a distinguished professor at Rutgers University and a distinguished guest professor at the University of Science and Technology of China (USTC).

Wei Wang is a Chinese-born American computer scientist. She is the Leonard Kleinrock Chair Professor in Computer Science and Computational Medicine at University of California, Los Angeles and the director of the Scalable Analytics Institute (ScAi). Her research specializes in big data analytics and modeling, database systems, natural language processing, bioinformatics and computational biology, and computational medicine.

Himabindu "Hima" Lakkaraju is an Indian-American computer scientist who works on machine learning, artificial intelligence, algorithmic bias, and AI accountability. She is currently an Assistant Professor at the Harvard Business School and is also affiliated with the Department of Computer Science at Harvard University. Lakkaraju is known for her work on explainable machine learning. More broadly, her research focuses on developing machine learning models and algorithms that are interpretable, transparent, fair, and reliable. She also investigates the practical and ethical implications of deploying machine learning models in domains involving high-stakes decisions such as healthcare, criminal justice, business, and education. Lakkaraju was named as one of the world's top Innovators Under 35 by both Vanity Fair and the MIT Technology Review.

Yixin Chen is a computer scientist, academic, and author. He is a professor of computer science and engineering at Washington University in St. Louis.

<span class="mw-page-title-main">Nitesh Chawla</span> Computer scientist

Nitesh V. Chawla is a computer scientist and data scientist currently serving as the Frank M. Freimann Professor of Computer Science and Engineering at the University of Notre Dame. He is the Founding Director of the Lucy Family Institute for Data & Society. Chawla's research expertise lies in machine learning, data science, and network science. He is also the co-founder of Aunalytics, a data science software and cloud computing company. Chawla is a Fellow of the: American Association for the Advancement of Sciences (AAAS), Association for Computing Machinery (ACM), Association for the Advancement of Artificial Intelligence, Asia Pacific Artificial Intelligence Association, and Institute of Electrical and Electronics Engineers (IEEE). He has received multiple awards, including the 1st Source Bank Commercialization Award in 2017, Outstanding Teaching Award (twice), IEEE CIS Early Career Award, National Academy of Engineering New Faculty Award, and the IBM Big Data Award in 2013. One of Chawla's most recognized publications, with a citation count of over 30,000, is the research paper titled "SMOTE: Synthetic Minority Over-sampling Technique." Chawla's research has garnered a citation count of over 65,000 and an H-index of 81.

Ranveer Chandra is an Indian American computer scientist who is Managing Director of the Research for Industry group at Microsoft and an affiliate professor at the University of Washington. He is known for his contributions to software-defined networking, wireless networks and digital agriculture. Previously, he served as the Chief Scientist at Microsoft Azure Global and currently holds the position of Chief Technology Officer (CTO) of Agri-Food at Microsoft.

Kai Shu is a computer scientist, academic, and author. He is a Gladwin Development Chair Assistant Professor at the Illinois Institute of Technology.

<span class="mw-page-title-main">Xing Xie</span> Computer scientist at Microsoft Research Asia

Xing Xie is a partner research manager at Microsoft Research Asia. As a computer scientist, his research has focused on data mining, social computing, and responsible AI. He has published more than 400 papers which have been cited more than 60,000 times. He has been on organizing committees or helped with the programs of over 70 conferences and workshops.

References

  1. "Making the future of work work for you with Dr. Johannes Gehrke". Microsoft Research. 17 July 2019. Retrieved 19 July 2019.
  2. "Gehrke, Joachims honored for work in computer science". Cornell Chronicle. Retrieved 3 October 2019.
  3. "OnTheMap Help and Documentation, Confidentiality Protection". United States Census Bureau. Retrieved 9 December 2020.
  4. "Four Cornell faculty members receive NSF 'Early Career' awards | Cornell Chronicle". news.cornell.edu. Retrieved 25 May 2018.
  5. "2003 Annual Report, Alfred P. Sloan Foundation" (PDF).
  6. "Johannes Gehrke • IEEE Computer Society". www.computer.org. Retrieved 25 May 2018.
  7. "Johannes Gehrke | Blavatnik Awards for Young Scientists". blavatnikawards.org. Retrieved 25 May 2018.
  8. "ACM Names Fellows for Innovations in Computing". www.acm.org. Retrieved 25 May 2018.
  9. "IEEE Computer Society Announces 2020 Fellows". IEEE . Retrieved 18 July 2020.
  10. "2021 SIGKDD Innovation Award: Johannes Gehrke". www.sigkdd.org. Retrieved 1 January 2024.
  11. "SIGKDD Awards". www.sigkdd.org. Retrieved 1 January 2024.
  12. "Database Management Systems (Third Edition)". www.cs.wisc.edu. Retrieved 25 May 2018.