Philip S. Yu

Last updated

Philip S. Yu (born c. 1952) is an American computer scientist and professor of information technology at the University of Illinois at Chicago. He is a prolific author, holds over 300 patents, and is known for his work in the field of data mining.

Contents

Biography

Yu received his BS in electrical engineering from the National Taiwan University, and his MS and PhD also in electrical engineering from Stanford University in 1978, and received an MBA from New York University in 1982.

He started his career in private enterprise, at IBM's Thomas J. Watson Research Center, where he eventually became manager of the Software Tools and Techniques group. Currently he is Distinguished Professor and Wexler Chair in Information Technology at the Department of Computer Science of the University of Illinois at Chicago

Yu holds over 300 U.S. patents, is an ACM and IEEE Fellow, is editor-in-chief of ACM Transactions on Knowledge Discovery from Data, has chaired numerous conferences, and received several awards, including from IBM, the IEEE [1] and, in 2022, he and his coauthors, Yizhou Sun, Jiawei Han, Xifeng Yan, and Tianyi Wu, received the Very Large Data Bases Endowment Inc. (VLDB) 2022 Test of Time award, for their 2011 research paper, PathSim: Meta Path-Based Top-K Similarity Search in Heterogeneous Information Networks. [2]

His research interests are in the fields of "data mining (especially on graph/network mining), social network, privacy preserving data publishing, data stream, database systems, and Internet applications and technologies." [3] Yu is an ISI Highly Cited researcher. According to Google Scholar, Yu's H-index is among the ten highest in computer science. [4]

Selected works

Yu has authored or co-authored several books and over 650 academic articles, [5] including:

Related Research Articles

Scott J. Shenker is an American computer scientist, and professor of computer science at the University of California, Berkeley. He is also the leader of the Extensible Internet Group at the International Computer Science Institute in Berkeley, California.

SIGKDD, representing the Association for Computing Machinery's (ACM) Special Interest Group (SIG) on Knowledge Discovery and Data Mining, hosts an influential annual conference.

Process mining is a family of techniques relating the fields of data science and process management to support the analysis of operational processes based on event logs. The goal of process mining is to turn event data into insights and actions. Process mining is an integral part of data science, fueled by the availability of event data and the desire to improve processes. Process mining techniques use event data to show what people, machines, and organizations are really doing. Process mining provides novel insights that can be used to identify the execution paths taken by operational processes and address their performance and compliance problems.

Jiawei Han is a Chinese-American computer scientist and writer. He currently holds the position of Michael Aiken Chair Professor in the Department of Computer Science at the University of Illinois at Urbana-Champaign. His research focuses on data mining, text mining, database systems, information networks, data mining from spatiotemporal data, Web data, and social/information network data.

In data analysis, anomaly detection is generally understood to be the identification of rare items, events or observations which deviate significantly from the majority of the data and do not conform to a well defined notion of normal behavior. Such examples may arouse suspicions of being generated by a different mechanism, or appear inconsistent with the remainder of that set of data.

In computer science, data stream clustering is defined as the clustering of data that arrive continuously such as telephone records, multimedia data, financial transactions etc. Data stream clustering is usually studied as a streaming algorithm and the objective is, given a sequence of points, to construct a good clustering of the stream, using a small amount of memory and time.

<span class="mw-page-title-main">Tomasz Imieliński</span> Polish-American computer scientist (born 1954)

Tomasz Imieliński is a Polish-American computer scientist, most known in the areas of data mining, mobile computing, data extraction, and search engine technology. He is currently a professor of computer science at Rutgers University in New Jersey, United States.

l-diversity, also written as -diversity, is a form of group based anonymization that is used to preserve privacy in data sets by reducing the granularity of a data representation. This reduction is a trade off that results in some loss of effectiveness of data management or mining algorithms in order to gain some privacy. The l-diversity model is an extension of the k-anonymity model which reduces the granularity of data representation using techniques including generalization and suppression such that any given record maps onto at least k-1 other records in the data. The l-diversity model handles some of the weaknesses in the k-anonymity model where protected identities to the level of k-individuals is not equivalent to protecting the corresponding sensitive values that were generalized or suppressed, especially when the sensitive values within a group exhibit homogeneity. The l-diversity model adds the promotion of intra-group diversity for sensitive values in the anonymization mechanism.

Bing Liu is a Chinese-American professor of computer science who specializes in data mining, machine learning, and natural language processing. In 2002, he became a scholar at University of Illinois at Chicago. He holds a PhD from the University of Edinburgh (1988). His PhD advisors were Austin Tate and Kenneth Williamson Currie, and his PhD thesis was titled Reinforcement Planning for Resource Allocation and Constraint Satisfaction.

<span class="mw-page-title-main">Shojiro Nishio</span>

Shojiro Nishio is a Japanese information scientist and technology scholar and the 18th president of Osaka University. Having co-authored or co-edited more than 55 books and more than 650 refereed journal or conference papers as well as serving on editorial boards of major information sciences journals, Nishio is considered one of the most prominent and influential researchers on database systems and networks.

Huan Liu is a Chinese-born computer scientist.

Latifur Khan joined the University of Texas at Dallas in 2000, where he has been conducting research and teaching as a Professor in the Department of Computer Science.

An associative classifier (AC) is a kind of supervised learning model that uses association rules to assign a target value. The term associative classification was coined by Bing Liu et al., in which the authors defined a model made of rules "whose right-hand side are restricted to the classification class attribute".

Wei Wang is a Chinese-born American computer scientist. She is the Leonard Kleinrock Chair Professor in Computer Science and Computational Medicine at University of California, Los Angeles and the director of the Scalable Analytics Institute (ScAi). Her research specializes in big data analytics and modeling, database systems, natural language processing, bioinformatics and computational biology, and computational medicine.

S. ("Muthu") Muthukrishnan is a computer scientist of Indian origin, known for his work in streaming algorithms, auction design, and pattern matching. He is vice president of sponsored products, Amazon (company) Advertising.

Yixin Chen is a computer scientist, academic, and author. He is a professor of computer science and engineering at Washington University in St. Louis.

<span class="mw-page-title-main">Nitesh Chawla</span> Computer scientist

Nitesh V. Chawla is a computer scientist and data scientist currently serving as the Frank M. Freimann Professor of Computer Science and Engineering at the University of Notre Dame. He is the Founding Director of the Lucy Family Institute for Data & Society. Chawla's research expertise lies in machine learning, data science, and network science. He is also the co-founder of Aunalytics, a data science software and cloud computing company. Chawla is a fellow of the Association for Computing Machinery (ACM), a Fellow of the Association for the Advancement of Artificial Intelligence, a Fellow of the Asia Pacific Artificial Intelligence Association, and a Fellow of the Institute of Electrical and Electronics Engineers (IEEE). He has received multiple awards, including the 1st Source Bank Commercialization Award in 2017, Outstanding Teaching Award (twice), IEEE CIS Early Career Award, National Academy of Engineering New Faculty Award, and the IBM Big Data Award in 2013. One of Chawla's most recognized publications, with a citation count of over 25,000, is the research paper titled "SMOTE: Synthetic Minority Over-sampling Technique." Chawla's research has garnered a citation count of over 62,000 and an H-index of 80.

References

  1. "On Mining Big Data". Illinois Institute of Technology. Retrieved October 4, 2023.
  2. "Philip S. Yu receives Test of Time Award". University of Illinois at Chicago. Retrieved October 4, 2023.
  3. Curriculum Philip S. Yu at cs.uic.edu. Accessed September 2, 2013
  4. See H-index for computer science. Google Scholar's H-index metric includes self-citations.
  5. Philip S. Yu Google Scholar profile