Ihab Ilyas | |
---|---|
Born | |
Nationality | Canadian, Egyptian |
Alma mater | Purdue University (PhD) Alexandria University (BSc, MSc) |
Known for | Data science, Data cleaning, Data integration |
Awards | ACM Fellow, 2020; IEEE Fellow, 2022; C.C. Gotlieb Computer Award, 2024; Royal Society of Canada Fellow, 2024 |
Scientific career | |
Fields | Computer science |
Institutions | University of Waterloo, Apple Inc. |
Website | https://cs.uwaterloo.ca/~ilyas/ |
Ihab Francis Ilyas (born May 13, 1973) is a computer scientist who works in data science. He is currently a professor of computer science in the David R. Cheriton School of Computer Science at the University of Waterloo. He also led the Knowledge Platform team at Apple Inc. Ihab is the holder of the Thomson Reuters-NSERC Industrial Research Chair in Data Cleaning at the University of Waterloo. [1] [2] [3]
Ilyas co-founded Tamr Inc., a start-up focusing on large-scale data integration and cleaning, with Andy Palmer and Michael Stonebraker, a Turing Award winner. Ilyas was the CEO of Inductiv Inc., an artificial intelligence start-up that uses machine learning to automate the task of identifying and correcting errors in data, which he co-founded with Theodoros Rekatsinas at the University of Wisconsin-Madison and Christopher Ré at Stanford University. [4] Inductiv was acquired by Apple Inc. in May 2020. [5] [6]
Ilyas was born and raised in Alexandria, Egypt. After completing bachelor's and master's degrees at Alexandria University in 1995 and 1999, respectively, he earned a PhD at Purdue University in 2004 under the supervision of Walid Aref and Ahmed K. Elmagarmid.
After his doctoral studies, Ilyas accepted a position in 2004 as a tenure-track professor at the University of Waterloo's David R. Cheriton School of Computer Science.
Research contributions
Ilyas is best known for the development of database systems and data science, with emphasis on data quality, data cleaning, managing uncertain data, machine learning for data curation, and rank-aware query processing.
Since 2009, he has focused research on data quality and the technical challenges in building data-cleaning systems. He and his research group introduced novel practical algorithms and system prototypes, which circumvent limitations of previous data-cleaning solutions that either focus narrowly on single types of data errors or ignore real-life considerations that prevent their adoption.
With Theodoros Rekatsinas, Christopher Ré, and Xu Chu, he introduced HoloClean, [7] an open-source statistical inference engine to impute, clean and enrich data.
Awards and honours
Ilyas received a Government of Ontario Early Researcher Award in 2008, a provincial program that funds new leading researchers at publicly funded Ontario universities to build a research team. He was named an IBM Canada Advanced Studies Fellow from 2006 to 2010.
Ilyas held a Cheriton Faculty Fellowship [8] at the David R. Cheriton School of Computer Science from 2013 to 2016 and he received the Google Faculty Award in 2014. [9]
He was named an ACM Distinguished Scientist [10] in 2014, and an ACM Fellow in 2020 for his contributions to data cleaning and data integration. [11] [12] He was also named IEEE Fellow in 2022 for his contributions in data cleaning, data integration and rank-aware query processing [13] [14] and a Fellow of the Royal Society of Canada in 2024. [15] In 2024, he also received the 2024 C.C. Gotlieb Computer Award from IEEE Canada in recognition of his contributions to building large-scale machine learning systems for data integration, data cleaning, and knowledge construction. [16]
Since 2018, Ilyas has held the Thomson Reuters-NSERC Industrial Research Chair in Data Cleaning. In 2020, he was named a faculty affiliate at the Vector Institute. [17]
Service
Ilyas was elected a member of Board of Trustees of the Very Large Data Bases Endowment [18] in 2016 and the Vice Chair of the ACM Special Interest Group on Data Management (SIGMOD) [19] in 2017.
Ian Avrum Goldberg is a cryptographer and cypherpunk. He is best known for breaking Netscape's implementation of SSL, and for his role as chief scientist of Radialpoint, a Canadian software company. Goldberg is currently a professor at the Faculty of Mathematics of the David R. Cheriton School of Computer Science within the University of Waterloo, and the Canada Research Chair in Privacy Enhancing Technologies. He was formerly Tor Project board of directors chairman, and is one of the designers of off the record messaging.
David A. Bader is a Distinguished Professor and Director of the Institute for Data Science at the New Jersey Institute of Technology. Previously, he served as the Chair of the Georgia Institute of Technology School of Computational Science & Engineering, where he was also a founding professor, and the executive director of High-Performance Computing at the Georgia Tech College of Computing. In 2007, he was named the first director of the Sony Toshiba IBM Center of Competence for the Cell Processor at Georgia Tech.
Richard Craig Holt was an American-Canadian computer scientist.
Timothy Moon-Yew Chan is a Founder Professor in the Department of Computer Science at the University of Illinois at Urbana–Champaign. He was formerly Professor and University Research Chair in the David R. Cheriton School of Computer Science, University of Waterloo, Canada.
David Ross Cheriton is a Canadian computer scientist, businessman, philanthropist, and venture capitalist. He is a computer science professor at Stanford University, where he founded and leads the Distributed Systems Group.
Margo Ilene Seltzer is an American computer scientist. She is currently the Canada 150 Research Chair in Computer Systems and the Cheriton Family Chair in Computer Science at the University of British Columbia. Previously, Seltzer was the Herchel Smith Professor of Computer Science at Harvard University's John A. Paulson School of Engineering and Applied Sciences and director at the Center for Research on Computation and Society.
Gerhard Weikum is a German computer scientist and Research Director at the Max Planck Institute for Informatics in Saarbrücken, Germany, where he is leading the databases and information systems department. His current research interests include transactional and distributed systems, self-tuning database systems, data and text integration, and the automatic construction of knowledge bases. He is one of the creators of the YAGO knowledge base. He is also the Dean of the International Max Planck Research School for Computer Science (IMPRS-CS).
Joseph M. Hellerstein is an American professor of computer science at the University of California, Berkeley, where he works on database systems and computer networks. He co-founded Trifacta with Jeffrey Heer and Sean Kandel in 2012, which stemmed from their research project, Wrangler.
Michael Jay Franklin is an American software entrepreneur and computer scientist specializing in distributed and streaming database technology. He is Liew Family Chair of Computer Science and chairman for the Department of Computer Science at the University of Chicago.
The Faculty of Engineering is one of six faculties at the University of Waterloo in Waterloo, Ontario, Canada. It has 8,698 undergraduate students, 2176 graduate students, 334 faculty and 52,750 alumni making it the largest engineering school in Canada with external research funding from 195 Canadian and international partners exceeding $86.8 million. Ranked among the top 50 engineering schools in the world, the faculty of engineering houses eight academic units and offers 15 bachelor's degree programs in a variety of disciplines.
James Ian Munro is a Canadian computer scientist. He is known for his fundamental contributions to algorithms and data structures.
Anastasia Ailamaki is a Professor of Computer Sciences at the École Polytechnique Fédérale de Lausanne (EPFL) in Switzerland and the Director of the Data-Intensive Applications and Systems (DIAS) lab. She is also the co-founder of RAW Labs SA, a Swiss company developing real-time analytics infrastructures for heterogeneous big data. Formerly, she was an associate professor of computer science at Carnegie Mellon School of Computer Science.
Ming Li is a Canadian computer scientist, known for his fundamental contributions to Kolmogorov complexity, bioinformatics, machine learning theory, and analysis of algorithms. Li is currently a University Professor at the David R. Cheriton School of Computer Science at the University of Waterloo. He holds a Tier I Canada Research Chair in Bioinformatics. In addition to academic achievements, his research has led to the founding of two independent companies.
Wenfei Fan is a Chinese-British computer scientist and professor of web data management at the University of Edinburgh. His research investigates database theory and database systems.
Raouf Boutaba is an Algerian Canadian computer scientist. His research interests are in resource, network and service management in wired and wireless networked systems. His work focuses on network virtualization, network softwarization, cloud computing, and network security.
Michael James Carey is an American computer scientist. He is currently a Distinguished Professor (Emeritus) of Computer Science in the Donald Bren School at the University of California, Irvine and a Consulting Architect at Couchbase, Inc..
Nadarajah Asokan is a professor of computer science and the David R. Cheriton Chair in Software Systems at the University of Waterloo's David R. Cheriton School of Computer Science. He is also an adjunct professor in the Department of Computer Science at Aalto University.
M. Tamer Özsu, FRSC is a Turkish Canadian computer scientist working in the area of distributed and parallel data management. He is a University Professor in the David R. Cheriton School of Computer Science at the University of Waterloo.
Mark Giesbrecht is a Canadian computer scientist who is the 12th dean of the University of Waterloo’s Faculty of Mathematics, starting from July 1, 2020. He was the Director of the David R. Cheriton School of Computer Science at the University of Waterloo, Canada from July 2014 until June 2020.
Daniel Abadi is the Darnell-Kanal Professor of Computer Science at University of Maryland, College Park. His primary area of research is database systems, with contributions to stream databases, distributed databases, graph databases, and column-store databases. He helped create C-Store, a column-oriented database, and HadoopDB, a hybrid of relational databases and Hadoop. Both database systems were commercialized by companies.