This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these template messages)
|
SIGKDD, representing the Association for Computing Machinery's (ACM) Special Interest Group (SIG) on Knowledge Discovery and Data Mining, hosts an influential annual conference.
The KDD Conference grew from KDD (Knowledge Discovery and Data Mining) workshops at AAAI conferences, which were started by Gregory I. Piatetsky-Shapiro in 1989, 1991, and 1993, and Usama Fayyad in 1994. [1] Conference papers of each proceedings of the SIGKDD International Conference on Knowledge Discovery and Data Mining are published through ACM. [2] KDD is widely considered the most influential forum for knowledge discovery and data mining research. [3] [4]
Year | Conference location |
---|---|
2011 | San Diego, United States |
2012 | Beijing, China |
2013 | Chicago, IL, United States |
2014 | New York City, NY, United States |
2015 | Sydney, Australia |
2016 | San Francisco, CA, United States |
2017 | Halifax, Canada |
2018 | London, England |
2019 | Anchorage, AK, United States |
2020 | San Diego, CA, United States |
2021 | Virtual Conference |
2022 | Washington, D.C., United States |
2023 | Long Beach, California, United States |
2024 [5] | Barcelona, Spain |
The KDD conference has been held each year since 1995, and SIGKDD became an official ACM Special Interest Group in 1998. Past conference locations are listed on the KDD conference web site. [6]
The annual ACM SIGKDD conference is recognized as a flagship venue in the field. Based on statistics provided by independent researcher Lexing Xie in her analysis “Visualizing Citation Patterns of Computer Science Conferences“ [7] as part of the research in Computation Media Lab at Australian National University:
The annual conference of ACM SIGKDD has received the highest rating A* from independent organization Computing Research and Education (a.k.a. CORE). [8]
Like all flagship conferences, SIGKDD imposes a high requirement to present and publish submitted papers. The focus is on innovative research in data mining, knowledge discovery, and large-scale data analytics. Papers emphasizing theoretical foundations are particularly encouraged, as are novel modeling and algorithmic approaches to specific data mining problems in scientific, business, medical, and engineering applications. Visionary papers on new and emerging topics are particularly welcomed. Authors are explicitly discouraged from submitting papers that contain only incremental results or that do not provide significant advances over existing approaches. [9]
In 2014, over 2,600 authors from at least fourteen countries submitted over a thousand papers to the conference. A final 151 papers were accepted for presentation and publication, representing an acceptance rate of 14.6%. [10] This acceptance rate is slightly lower than those of other top computer science conferences, which typically have a rate of 15–25%. [11] The acceptance rate of a conference is only a proxy measure of its quality. For example, in the field of information retrieval, the WSDM conference has a lower acceptance rate than the higher-ranked SIGIR. [12]
The group recognizes members of the KDD community with its annual Innovation Award and Service Award. [13]
Each year KDD presents a Best Paper Award [14] to recognizes papers presented at the annual SIGKDD conference that advance the fundamental understanding of the field of knowledge discovery in data and data mining. Two research paper awards are granted: Best Research Paper Award Recipients and Best Student Paper Award Recipients. [15]
Winning the ACM SIGKDD Best Paper Award (Best Research Track Paper) is widely considered an internationally recognized significant achievement in a researcher's career.[ by whom? ] Authors compete with established professionals in the field, such as tenured professors, executives, and eminent industry experts from top institutions. It is common to find press articles and news announcements from the awardees’ institutions and professional media to celebrate this achievement. [16] [17]
This award recognizes innovative scholarly articles that advance the fundamental understanding of the field of knowledge discovery in data and data mining. Each year, the award is given to authors of the strongest paper by this criterion, selected by a rigorous process. [15]
The selection process follows multiple rounds of peer reviews under stringent criteria. The selection committee consists of leading experts who provide insightful and independent analysis on the merits and degree of innovation of the scholarly articles submitted by each author. The reviewers are required to be recognized subject experts who had extensive contributions to the specific subject area addressed by the paper. Reviewers are also required to be completely unaffiliated with the authors.
First, all papers submitted to the ACM SIGKDD conference are reviewed by research track program committee members. Each submitted paper is extensively reviewed by multiple committee members and detailed feedback is given to each author. After review, decisions are made by the committee members to accept or reject the paper based on the paper’s novelty, technical quality, potential impact, clarity, and whether the experimental methods and results are clear, well executed, and repeatable. [9] During the process, committee members also evaluate the merits of each paper based on above factors, and make decision on recommending candidates for Best Paper Award (Best Research Track Paper).
The candidates for Best Paper Award (Best Research Track Paper) are extensively reviewed by conference chairs and the best paper award committee. The final determination of the award is based on the level of advancement made by authors through the paper to the understanding of the field of knowledge discovery and data mining. Authors of a single paper who are judged to have contributed the highest level of advancement to the field are selected as recipients of this award. Anyone who submits a scholarly article to SIGKDD is considered for this award.
The ACM SIGKDD Best Paper Award (Best Research Track Paper) was given to 49 individuals between 1997 and 2014. Among these individuals, most are distinguished persons and established professionals with celebrated careers, who have made significant contributions to the field.
Year | Name | Position | Affiliation |
---|---|---|---|
1997 | Foster Provost | Professor | New York University |
1997 | Tom Fawcett | Principal Data Scientist | Silicon Valley Data Science |
1998, 1999 | Pedro Domingos | Professor | University of Washington |
2000 | Anne Rogers | Associate Professor | University of Chicago |
2000 | Daryl Pregibon | (Former) Head of Statistical Research | AT&T Labs and Bell Labs |
2000 | Kathleen Fisher | Chair & Professor | Tufts University |
2000 | Corinna Cortes | Head of Research | |
2001 | Ruben H. Zamar | Professor | University of British Columbia |
2001 | Raymond T. Ng | Professor | University of British Columbia |
2001 | Edwin M. Knorr | Tenured Senior Instructor | University of British Columbia |
2002 | Padhraic Smyth | Professor | University of California, Irvine |
Associate Director | Center for Machine Learning and Intelligent Systems | ||
2002 | Darya Chudova | VP of Bioinformatics | Guardant Health |
2003 | Éva Tardos | Professor & Dean | Cornell University |
2003, 2005 | Jon Kleinberg | Professor | Cornell University |
Member | National Academy of Sciences | ||
National Academy of Engineering | |||
American Academy of Arts and Sciences | |||
2003 | David Kempe | Associate Professor | University of Southern California |
2004 | Raymond J. Mooney | Professor | The University of Texas at Austin |
2004 | Mikhail (Misha) Bilenko | Head of AI and Research | Yandex |
2004 | Sugato Basu | Principal Scientist | |
2004, 2005 | Christos Faloutsos | Professor | Carnegie Mellon University |
Fellow | ACM | ||
2005 | Jure Leskovec | Associate Professor | Stanford University |
Chief Scientist | |||
Member, Board of Directors | ACM SIGKDD | ||
2006 | Thorsten Joachims | Chair & Professor | Cornell University |
Fellow | ACM, AAAI, Humboldt | ||
2007 | Srujana Merugu | Principal Data Scientist | Flipkart |
2007 | Deepak Agarwal | VP of Engineering | |
Fellow | American Statistical Association | ||
Member, Board of Directors | ACM SIGKDD | ||
2008 | Wei Wang | Chair & Professor | University of California, Los Angeles |
Director | Scalable Analytics Institute | ||
2008 | Fei Zhou | Professor | University of Florida |
2008 | Xiang Zhang | Associate Professor | Pennsylvania State University |
2009 | Yehuda Koren | Staff Research Scientist | |
2010 | Carlos Guestrin | Director of Machine Learning | Apple Inc |
Professor | University of Washington | ||
Co-founder, CEO | Turi (a.k.a. Dato, GraphLab) | ||
2010 | Dafna Shahaf | Assistant Professor | The Hebrew University of Jerusalem |
2010 | Kai-Wei Chang | Assistant Professor | University of California, Los Angeles |
2010 | Cho-Jui Hsieh | Assistant Professor | University of California, Davis |
2010 | Hsiang-Fu Yu | Applied Scientist | Amazon |
2010 | Chih-Jen Lin | Distinguished Professor | National Taiwan University |
Fellow | ACM, AAAI, IEEE | ||
2011 | Claudia Perlich | Chief Scientist | Dstillery |
Adjunct Professor | New York University | ||
2011 | Saharon Rosset | Associate Professor | Tel Aviv University |
2011 | Shachar Kaufman | Senior Data Scientist | Metromile |
2012 | Thanawin Rakthanmanon | Assistant Professor | Kasetsart University, Thailand |
2012 | Bilson Campana | Staff Software Engineer | |
2012 | Abdullah Mueen | Assistant Professor | University of New Mexico |
2012 | Gustavo Batista | Associate Professor | Universidade de São Paulo |
2012 | Brandon Westover | Director, Critical Care EEG Monitoring Service | Massachusetts General Hospital |
2012 | Qiang Zhu | Data Science Manager | Airbnb |
2012 | Jesin Zakaria | Software Engineer | Microsoft |
2012 | Eamonn Keogh | Professor | University of California, Riverside |
2013 | Edo Liberty | Principal Scientist | Amazon |
Group Manager | Amazon AI Algorithms | ||
2014 | Alex Smola | Director of Machine Learning and Deep Learning | Amazon |
Professor | Carnegie Mellon University | ||
2014 | Sujith Ravi | Staff Research Scientist | |
2014 | Amr Ahmed | Staff Research Scientist | |
2014 | Aaron Li | Founder | Qokka.ai |
(Former) Lead Inference Engineer | Scaled Inference |
This only difference between "Best Student Paper Award" and "Best Paper Award (Best Research Track Paper)" is the limitation in competition.
All authors participating the conference are considered equally for "Best Paper Award (Best Research Track Paper)", and the award does not limit competition to any particular region, population, or age group.
However, "Best Student Paper Award" is limited to student authors only. "Best Student Paper Award" recognizes papers presented at the annual SIGKDD conference, with a student as a first author, that advance the fundamental understanding of the field of knowledge discovery in data and data mining. [15]
SIGKDD sponsors the KDD Cup [18] data mining competition every year in conjunction with the annual conference. It is aimed at members of the industry and academia, particularly students, interested in KDD.
SIGKDD has also published a biannual academic journal titled SIGKDD Explorations [19] since June 1999 [20] when Usama Fayyad took on role of Founding Editor-inChief as ACM SIGKDD was formed. Editors in Chief:
The original founding Board of Directors of SIGKDD in 1998 consist of:
Current Chair:
Former Chairpersons:
Former Executive Committee (2009–2013)
Information Directors:
Data mining is the process of extracting and discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and statistics with an overall goal of extracting information from a data set and transforming the information into a comprehensible structure for further use. Data mining is the analysis step of the "knowledge discovery in databases" process, or KDD. Aside from the raw analysis step, it also involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating.
Waikato Environment for Knowledge Analysis (Weka) is a collection of machine learning and data analysis free software licensed under the GNU General Public License. It was developed at the University of Waikato, New Zealand and is the companion software to the book "Data Mining: Practical Machine Learning Tools and Techniques".
Jiawei Han is a Chinese-American computer scientist and writer. He currently holds the position of Michael Aiken Chair Professor in the Department of Computer Science at the University of Illinois at Urbana-Champaign. His research focuses on data mining, text mining, database systems, information networks, data mining from spatiotemporal data, Web data, and social/information network data.
SIGIR is the Association for Computing Machinery's Special Interest Group on Information Retrieval. The scope of the group's specialty is the theory and application of computers to the acquisition, organization, storage, retrieval and distribution of information; emphasis is placed on working with non-numeric information, ranging from natural language to highly structured data bases.
Data Mining and Knowledge Discovery is a bimonthly peer-reviewed scientific journal focusing on data mining published by Springer Science+Business Media. It was started in 1996 and launched in 1997 by Usama Fayyad as founding Editor-in-Chief by Kluwer Academic Publishers. The first Editorial provides a summary of why it was started.
Hans-Peter Kriegel is a German computer scientist and professor at the Ludwig Maximilian University of Munich and leading the Database Systems Group in the Department of Computer Science. He was previously professor at the University of Würzburg and the University of Bremen after habilitation at the Technical University of Dortmund and doctorate from Karlsruhe Institute of Technology.
AMiner is a free online service used to index, search, and mine big scientific data.
Foster Provost is an American computer scientist, information systems researcher, and Professor of Data Science, Professor of Information Systems and Ira Rennert Professor of Entrepreneurship at New York University's Stern School of Business. He is also the Director for the Data Science and AI Initiative at Stern's Fubon Center for Technology, Business and Innovation. Professor Provost has a Bachelor of Science from Duquesne University in physics and mathematics and a Master of Science and Ph.D. in computer science from the University of Pittsburgh.
Jie Tang is a full-time professor at the Department of Computer Science of Tsinghua University. He received a PhD in computer science from the same university in 2006. He is known for building the academic social network search system AMiner, which was launched in March 2006 and now has attracted 2,766,356 independent IP accesses from 220 countries. His research interests include social networks and data mining.
Usama M. Fayyad is an American-Jordanian data scientist and co-founder of KDD conferences and ACM SIGKDD association for Knowledge Discovery and Data Mining. He is a speaker on Business Analytics, Data Mining, Data Science, and Big Data. He recently left his role as the Chief Data Officer at Barclays Bank.
Ramakrishnan Srikant is a Google Fellow at Google.
Gregory I. Piatetsky-Shapiro is a data scientist and the co-founder of the KDD conferences, and co-founder and past chair of the Association for Computing Machinery SIGKDD group for Knowledge Discovery, Data Mining and Data Science. He is the founder and president of KDnuggets, a discussion and learning website for Business Analytics, Data Mining and Data Science.
Domain driven data mining is a data mining methodology for discovering actionable knowledge and deliver actionable insights from complex data and behaviors in a complex environment. It studies the corresponding foundations, frameworks, algorithms, models, architectures, and evaluation systems for actionable knowledge discovery.
Jianchang (JC) Mao is a Chinese-American computer scientist and Vice President, Google Assistant Engineering at Google. His research spans artificial intelligence, machine learning, computational advertising, data mining, and information retrieval. He was named a Fellow of the Institute of Electrical and Electronics Engineers (IEEE) in 2012 for his contributions to pattern recognition, search, content analysis, and computational advertising.
Arthur Zimek is a professor in data mining, data science and machine learning at the University of Southern Denmark in Odense, Denmark.
An associative classifier (AC) is a kind of supervised learning model that uses association rules to assign a target value. The term associative classification was coined by Bing Liu et al., in which the authors defined a model made of rules "whose right-hand side are restricted to the classification class attribute".
Gautam Das is a computer scientist in the field of databases research. He is an ACM Fellow and IEEE Fellow.
Martin Ester is a Canadian-German Full Professor of Computing Science at Simon Fraser University. His research focuses on researcher data mining and machine learning.
Hui Xiong is a data scientist. He is a distinguished professor at Rutgers University and a distinguished guest professor at the University of Science and Technology of China (USTC).
Tina Eliassi-Rad is an American computer scientist and the inaugural President Joseph E. Aoun Professor at Northeastern University. Her research is at the intersection of artificial intelligence, network science, and applied ethics. In 2023, she won the Lagrange Prize for her work on ethical approaches to artificial intelligence.