Ihab Ilyas

Born: May 13, 1973 (age 50)
Nationality: Canadian, Egyptian
Alma mater: Purdue University (PhD); Alexandria University (BSc, MSc)
Known for: Data science, data cleaning, data integration
Awards: ACM Fellow, 2020; IEEE Fellow, 2022
Scientific career
Fields: Computer science
Institutions: University of Waterloo; Apple Inc.
Website: https://cs.uwaterloo.ca/~ilyas/

Ihab Francis Ilyas (born May 13, 1973) is a computer scientist who works in data science. He is currently a professor of computer science in the David R. Cheriton School of Computer Science at the University of Waterloo. He also led the Knowledge Platform team at Apple Inc. Ilyas holds the Thomson Reuters-NSERC Industrial Research Chair in Data Cleaning at the University of Waterloo. [1] [2] [3]

Ilyas co-founded Tamr Inc., a start-up focusing on large-scale data integration and cleaning, with Andy Palmer and Turing Award winner Michael Stonebraker. Ilyas was also the CEO of Inductiv Inc., an artificial intelligence start-up that uses machine learning to automate the task of identifying and correcting errors in data, which he co-founded with Theodoros Rekatsinas of the University of Wisconsin-Madison and Christopher Ré of Stanford University. [4] Inductiv was acquired by Apple Inc. in May 2020. [5] [6]

Education and career

Ilyas was born and raised in Alexandria, Egypt. After completing bachelor's and master's degrees at Alexandria University in 1995 and 1999, respectively, he earned a PhD at Purdue University in 2004 under the supervision of Walid Aref and Ahmed K. Elmagarmid.

After his doctoral studies, Ilyas accepted a position in 2004 as a tenure-track professor at the University of Waterloo's David R. Cheriton School of Computer Science.

Research contributions

Ilyas is best known for his work on database systems and data science, with an emphasis on data quality, data cleaning, managing uncertain data, machine learning for data curation, and rank-aware query processing.

Since 2009, he has focused his research on data quality and the technical challenges in building data-cleaning systems. He and his research group introduced novel practical algorithms and system prototypes that address limitations of earlier data-cleaning solutions, which either focused narrowly on a single type of data error or ignored real-world considerations that hinder adoption.

With Theodoros Rekatsinas, Christopher Ré, and Xu Chu, he introduced HoloClean, [7] an open-source statistical inference engine to impute, clean and enrich data.

Awards and honours

Ilyas received an Early Researcher Award from the Government of Ontario in 2008; the provincial program funds leading new researchers at publicly funded Ontario universities to build a research team. He was named an IBM Canada Advanced Studies Fellow from 2006 to 2010.

Ilyas held a Cheriton Faculty Fellowship [8] at the David R. Cheriton School of Computer Science from 2013 to 2016 and he received the Google Faculty Award in 2014. [9]

He was named an ACM Distinguished Scientist [10] in 2014, and an ACM Fellow in 2020 for his contributions to data cleaning and data integration. [11] [12] He was also named an IEEE Fellow in 2022 for his contributions to data cleaning, data integration, and rank-aware query processing. [13] [14] Since 2018, Ilyas has held the Thomson Reuters-NSERC Industrial Research Chair in Data Cleaning. In 2020, he was named a faculty affiliate at the Vector Institute. [15]

Service

Ilyas was elected a member of the Board of Trustees of the Very Large Data Bases Endowment [16] in 2016 and Vice Chair of the ACM Special Interest Group on Management of Data (SIGMOD) [17] in 2017.

References

  1. "Thomson Reuters and University of Waterloo to fuel innovation in data science, finance education | Waterloo News". Waterloo News. September 28, 2016. Retrieved September 1, 2017.
  2. "UWaterloo adds research chair into data cleansing". IT World Canada. Retrieved September 1, 2017.
  3. "Chairholder Profile | Ihab Ilyas | Thomson Reuters-NSERC Industrial Research Chair in Data Cleaning". Natural Sciences and Engineering Research Council of Canada. May 20, 2014. Retrieved January 13, 2021.
  4. "Waterloo-based AI start-up Inductiv acquired by Apple". Cheriton School of Computer Science | University of Waterloo. May 29, 2020. Retrieved May 31, 2020.
  5. "Apple Buys Machine-Learning Startup to Improve Data Used in Siri". Bloomberg. May 27, 2020. Retrieved May 31, 2020.
  6. "Apple just bought another AI startup to help Siri catch up to rivals Amazon and Google". Business Insider. May 28, 2020. Retrieved May 31, 2020.
  7. "HoloClean: A Machine Learning System for Data Repair and Predictions on Structured Data". HoloClean. 2020. Retrieved January 26, 2021.
  8. "David R. Cheriton Faculty Fellowships in Computer Science | Cheriton School of Computer Science". Cheriton School of Computer Science. February 10, 2017. Archived from the original on June 12, 2018. Retrieved September 1, 2017.
  9. "Google Faculty Research Awards, August 2014" (PDF). Retrieved January 26, 2021.
  10. "ACM's Distinguished Computer Scientists, Engineers and Educators Cited for Global Reach, Real-World Impact | Association for Computing Machinery". www.acm.org. Retrieved September 1, 2017.
  11. "Cheriton School of Computer Science Professor Ihab Ilyas named ACM Fellow". David R. Cheriton School of Computer Science. January 13, 2021. Retrieved January 13, 2021.
  12. "Prof. Ihab F. Ilyas | ACM Fellows | Award Winners". Association for Computing Machinery. January 13, 2021. Retrieved January 13, 2021.
  13. "Professor Ihab Ilyas named Fellow of the Institute of Electrical and Electronics Engineers". Cheriton School of Computer Science. November 25, 2021. Retrieved November 25, 2021.
  14. https://www.ieee.org/content/dam/ieee-org/ieee/web/org/about/fellows/2022-ieee-fellows-class.pdf
  15. "Vector welcomes 2020 cohort of faculty affiliates". Vector Institute. December 22, 2020. Retrieved January 26, 2021.
  16. "Board of Trustees". www.vldb.org. Retrieved September 1, 2017.
  17. "About SIGMOD – SIGMOD Website". sigmod.org. Retrieved September 1, 2017.