Special Interest Group on Knowledge Discovery and Data Mining

Last updated

SIGKDD, representing the Association for Computing Machinery's (ACM) Special Interest Group (SIG) on Knowledge Discovery and Data Mining, hosts an influential annual conference.

Contents

Conference history

The KDD Conference grew from KDD (Knowledge Discovery and Data Mining) workshops at AAAI conferences, which were started by Gregory I. Piatetsky-Shapiro in 1989, 1991, and 1993, and Usama Fayyad in 1994. [1] Conference papers of each proceedings of the SIGKDD International Conference on Knowledge Discovery and Data Mining are published through ACM. [2] KDD is widely considered the most influential forum for knowledge discovery and data mining research. [3] [4]

YearConference location
2011 San Diego, United States
2012 Beijing, China
2013 Chicago, IL, United States
2014 New York City, NY, United States
2015 Sydney, Australia
2016 San Francisco, CA, United States
2017 Halifax, Canada
2018 London, England
2019 Anchorage, AK, United States
2020 San Diego, CA, United States
2021Virtual Conference
2022 Washington, D.C., United States
2023 Long Beach, California, United States
2024 [5] Barcelona, Spain

The KDD conference has been held each year since 1995, and SIGKDD became an official ACM Special Interest Group in 1998. Past conference locations are listed on the KDD conference web site. [6]

The annual ACM SIGKDD conference is recognized as a flagship venue in the field. Based on statistics provided by independent researcher Lexing Xie in her analysis “Visualizing Citation Patterns of Computer Science Conferences“ [7] as part of the research in Computation Media Lab at Australian National University:

The annual conference of ACM SIGKDD has received the highest rating A* from independent organization Computing Research and Education (a.k.a. CORE). [8]

Selection Criteria

Like all flagship conferences, SIGKDD imposes a high requirement to present and publish submitted papers. The focus is on innovative research in data mining, knowledge discovery, and large-scale data analytics. Papers emphasizing theoretical foundations are particularly encouraged, as are novel modeling and algorithmic approaches to specific data mining problems in scientific, business, medical, and engineering applications. Visionary papers on new and emerging topics are particularly welcomed. Authors are explicitly discouraged from submitting papers that contain only incremental results or that do not provide significant advances over existing approaches. [9]

In 2014, over 2,600 authors from at least fourteen countries submitted over a thousand papers to the conference. A final 151 papers were accepted for presentation and publication, representing an acceptance rate of 14.6%. [10] This acceptance rate is slightly lower than those of other top computer science conferences, which typically have a rate of 1525%. [11] The acceptance rate of a conference is only a proxy measure of its quality. For example, in the field of information retrieval, the WSDM conference has a lower acceptance rate than the higher-ranked SIGIR. [12]

Awards

The group recognizes members of the KDD community with its annual Innovation Award and Service Award. [13]

Each year KDD presents a Best Paper Award [14] to recognizes papers presented at the annual SIGKDD conference that advance the fundamental understanding of the field of knowledge discovery in data and data mining. Two research paper awards are granted: Best Research Paper Award Recipients and Best Student Paper Award Recipients. [15]

Best Paper Award (Best Research Track Paper)

Winning the ACM SIGKDD Best Paper Award (Best Research Track Paper) is widely considered an internationally recognized significant achievement in a researcher's career.[ by whom? ] Authors compete with established professionals in the field, such as tenured professors, executives, and eminent industry experts from top institutions. It is common to find press articles and news announcements from the awardees’ institutions and professional media to celebrate this achievement. [16] [17]

This award recognizes innovative scholarly articles that advance the fundamental understanding of the field of knowledge discovery in data and data mining. Each year, the award is given to authors of the strongest paper by this criterion, selected by a rigorous process. [15]

Selection Process

The selection process follows multiple rounds of peer reviews under stringent criteria. The selection committee consists of leading experts who provide insightful and independent analysis on the merits and degree of innovation of the scholarly articles submitted by each author. The reviewers are required to be recognized subject experts who had extensive contributions to the specific subject area addressed by the paper. Reviewers are also required to be completely unaffiliated with the authors.

First, all papers submitted to the ACM SIGKDD conference are reviewed by research track program committee members. Each submitted paper is extensively reviewed by multiple committee members and detailed feedback is given to each author. After review, decisions are made by the committee members to accept or reject the paper based on the paper’s novelty, technical quality, potential impact, clarity, and whether the experimental methods and results are clear, well executed, and repeatable. [9] During the process, committee members also evaluate the merits of each paper based on above factors, and make decision on recommending candidates for Best Paper Award (Best Research Track Paper).

The candidates for Best Paper Award (Best Research Track Paper) are extensively reviewed by conference chairs and the best paper award committee. The final determination of the award is based on the level of advancement made by authors through the paper to the understanding of the field of knowledge discovery and data mining. Authors of a single paper who are judged to have contributed the highest level of advancement to the field are selected as recipients of this award. Anyone who submits a scholarly article to SIGKDD is considered for this award.

Previous winners

The ACM SIGKDD Best Paper Award (Best Research Track Paper) was given to 49 individuals between 1997 and 2014. Among these individuals, most are distinguished persons and established professionals with celebrated careers, who have made significant contributions to the field.

YearNamePositionAffiliation
1997 Foster Provost ProfessorNew York University
1997 Tom Fawcett Principal Data ScientistSilicon Valley Data Science
1998, 1999 Pedro Domingos ProfessorUniversity of Washington
2000 Anne Rogers Associate ProfessorUniversity of Chicago
2000Daryl Pregibon(Former) Head of Statistical ResearchAT&T Labs and Bell Labs
2000 Kathleen Fisher Chair & ProfessorTufts University
2000 Corinna Cortes Head of ResearchGoogle
2001 Ruben H. Zamar ProfessorUniversity of British Columbia
2001 Raymond T. Ng ProfessorUniversity of British Columbia
2001 Edwin M. Knorr Tenured Senior InstructorUniversity of British Columbia
2002 Padhraic Smyth ProfessorUniversity of California, Irvine
Associate DirectorCenter for Machine Learning and Intelligent Systems
2002Darya ChudovaVP of BioinformaticsGuardant Health
2003 Éva Tardos Professor & DeanCornell University
2003, 2005 Jon Kleinberg ProfessorCornell University
MemberNational Academy of Sciences
National Academy of Engineering
American Academy of Arts and Sciences
2003 David Kempe Associate ProfessorUniversity of Southern California
2004 Raymond J. Mooney ProfessorThe University of Texas at Austin
2004Mikhail (Misha) BilenkoHead of AI and ResearchYandex
2004Sugato BasuPrincipal ScientistGoogle
2004, 2005 Christos Faloutsos ProfessorCarnegie Mellon University
FellowACM
2005 Jure Leskovec Associate ProfessorStanford University
Chief ScientistPinterest
Member, Board of DirectorsACM SIGKDD
2006 Thorsten Joachims Chair & ProfessorCornell University
FellowACM, AAAI, Humboldt
2007Srujana MeruguPrincipal Data ScientistFlipkart
2007Deepak AgarwalVP of EngineeringLinkedIn
FellowAmerican Statistical Association
Member, Board of DirectorsACM SIGKDD
2008 Wei Wang Chair & ProfessorUniversity of California, Los Angeles
DirectorScalable Analytics Institute
2008 Fei Zhou ProfessorUniversity of Florida
2008 Xiang Zhang Associate ProfessorPennsylvania State University
2009Yehuda KorenStaff Research ScientistGoogle
2010Carlos GuestrinDirector of Machine LearningApple Inc
ProfessorUniversity of Washington
Co-founder, CEOTuri (a.k.a. Dato, GraphLab)
2010Dafna ShahafAssistant ProfessorThe Hebrew University of Jerusalem
2010Kai-Wei ChangAssistant ProfessorUniversity of California, Los Angeles
2010 Cho-Jui Hsieh Assistant ProfessorUniversity of California, Davis
2010Hsiang-Fu YuApplied ScientistAmazon
2010 Chih-Jen Lin Distinguished ProfessorNational Taiwan University
FellowACM, AAAI, IEEE
2011Claudia PerlichChief ScientistDstillery
Adjunct ProfessorNew York University
2011 Saharon Rosset Associate ProfessorTel Aviv University
2011Shachar KaufmanSenior Data ScientistMetromile
2012Thanawin RakthanmanonAssistant ProfessorKasetsart University, Thailand
2012Bilson CampanaStaff Software EngineerGoogle
2012 Abdullah Mueen Assistant ProfessorUniversity of New Mexico
2012 Gustavo Batista Associate ProfessorUniversidade de São Paulo
2012Brandon WestoverDirector, Critical Care EEG Monitoring ServiceMassachusetts General Hospital
2012Qiang ZhuData Science ManagerAirbnb
2012Jesin ZakariaSoftware EngineerMicrosoft
2012 Eamonn Keogh ProfessorUniversity of California, Riverside
2013Edo LibertyPrincipal ScientistAmazon
Group ManagerAmazon AI Algorithms
2014Alex SmolaDirector of Machine Learning and Deep LearningAmazon
ProfessorCarnegie Mellon University
2014 Sujith Ravi Staff Research ScientistGoogle
2014Amr AhmedStaff Research ScientistGoogle
2014Aaron LiFounder Qokka.ai
(Former) Lead Inference EngineerScaled Inference

Best Student Paper Award

This only difference between "Best Student Paper Award" and "Best Paper Award (Best Research Track Paper)" is the limitation in competition.

All authors participating the conference are considered equally for "Best Paper Award (Best Research Track Paper)", and the award does not limit competition to any particular region, population, or age group.

However, "Best Student Paper Award" is limited to student authors only. "Best Student Paper Award" recognizes papers presented at the annual SIGKDD conference, with a student as a first author, that advance the fundamental understanding of the field of knowledge discovery in data and data mining. [15]

KDD-Cup

SIGKDD sponsors the KDD Cup [18] data mining competition every year in conjunction with the annual conference. It is aimed at members of the industry and academia, particularly students, interested in KDD.

SIGKDD Explorations

SIGKDD has also published a biannual academic journal titled SIGKDD Explorations [19] since June 1999 [20] when Usama Fayyad took on role of Founding Editor-inChief as ACM SIGKDD was formed. Editors in Chief:

People

The original founding Board of Directors of SIGKDD in 1998 consist of:

Current Chair:

Former Chairpersons:

Former Executive Committee (2009–2013)

Information Directors:

See also

Related Research Articles

Data mining is the process of extracting and discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and statistics with an overall goal of extracting information from a data set and transforming the information into a comprehensible structure for further use. Data mining is the analysis step of the "knowledge discovery in databases" process, or KDD. Aside from the raw analysis step, it also involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating.

<span class="mw-page-title-main">Weka (software)</span> Suite of machine learning software written in Java

Waikato Environment for Knowledge Analysis (Weka) is a collection of machine learning and data analysis free software licensed under the GNU General Public License. It was developed at the University of Waikato, New Zealand and is the companion software to the book "Data Mining: Practical Machine Learning Tools and Techniques".

Jiawei Han is a Chinese-American computer scientist and writer. He currently holds the position of Michael Aiken Chair Professor in the Department of Computer Science at the University of Illinois at Urbana-Champaign. His research focuses on data mining, text mining, database systems, information networks, data mining from spatiotemporal data, Web data, and social/information network data.

<span class="mw-page-title-main">Special Interest Group on Information Retrieval</span> Subgroup of the Association for Computing Machinery

SIGIR is the Association for Computing Machinery's Special Interest Group on Information Retrieval. The scope of the group's specialty is the theory and application of computers to the acquisition, organization, storage, retrieval and distribution of information; emphasis is placed on working with non-numeric information, ranging from natural language to highly structured data bases.

<i>Data Mining and Knowledge Discovery</i> Peer-reviewed scientific journal

Data Mining and Knowledge Discovery is a bimonthly peer-reviewed scientific journal focusing on data mining published by Springer Science+Business Media. It was started in 1996 and launched in 1997 by Usama Fayyad as founding Editor-in-Chief by Kluwer Academic Publishers. The first Editorial provides a summary of why it was started.

Hans-Peter Kriegel is a German computer scientist and professor at the Ludwig Maximilian University of Munich and leading the Database Systems Group in the Department of Computer Science. He was previously professor at the University of Würzburg and the University of Bremen after habilitation at the Technical University of Dortmund and doctorate from Karlsruhe Institute of Technology.

AMiner is a free online service used to index, search, and mine big scientific data.

<span class="mw-page-title-main">Foster Provost</span>

Foster Provost is an American computer scientist, information systems researcher, and Professor of Data Science, Professor of Information Systems and Ira Rennert Professor of Entrepreneurship at New York University's Stern School of Business. He is also the Director for the Data Science and AI Initiative at Stern's Fubon Center for Technology, Business and Innovation. Professor Provost has a Bachelor of Science from Duquesne University in physics and mathematics and a Master of Science and Ph.D. in computer science from the University of Pittsburgh.

Jie Tang is a full-time professor at the Department of Computer Science of Tsinghua University. He received a PhD in computer science from the same university in 2006. He is known for building the academic social network search system AMiner, which was launched in March 2006 and now has attracted 2,766,356 independent IP accesses from 220 countries. His research interests include social networks and data mining.

<span class="mw-page-title-main">Usama Fayyad</span> American computer scientist

Usama M. Fayyad is an American-Jordanian data scientist and co-founder of KDD conferences and ACM SIGKDD association for Knowledge Discovery and Data Mining. He is a speaker on Business Analytics, Data Mining, Data Science, and Big Data. He recently left his role as the Chief Data Officer at Barclays Bank.

<span class="mw-page-title-main">Ramakrishnan Srikant</span> Indian computer scientist

Ramakrishnan Srikant is a Google Fellow at Google.

<span class="mw-page-title-main">Gregory Piatetsky-Shapiro</span> American computer scientist

Gregory I. Piatetsky-Shapiro is a data scientist and the co-founder of the KDD conferences, and co-founder and past chair of the Association for Computing Machinery SIGKDD group for Knowledge Discovery, Data Mining and Data Science. He is the founder and president of KDnuggets, a discussion and learning website for Business Analytics, Data Mining and Data Science.

Domain driven data mining is a data mining methodology for discovering actionable knowledge and deliver actionable insights from complex data and behaviors in a complex environment. It studies the corresponding foundations, frameworks, algorithms, models, architectures, and evaluation systems for actionable knowledge discovery.

Jianchang (JC) Mao is a Chinese-American computer scientist and Vice President, Google Assistant Engineering at Google. His research spans artificial intelligence, machine learning, computational advertising, data mining, and information retrieval. He was named a Fellow of the Institute of Electrical and Electronics Engineers (IEEE) in 2012 for his contributions to pattern recognition, search, content analysis, and computational advertising.

Arthur Zimek is a professor in data mining, data science and machine learning at the University of Southern Denmark in Odense, Denmark.

An associative classifier (AC) is a kind of supervised learning model that uses association rules to assign a target value. The term associative classification was coined by Bing Liu et al., in which the authors defined a model made of rules "whose right-hand side are restricted to the classification class attribute".

<span class="mw-page-title-main">Gautam Das (computer scientist)</span> Indian computer scientist

Gautam Das is a computer scientist in the field of databases research. He is an ACM Fellow and IEEE Fellow.

<span class="mw-page-title-main">Martin Ester</span>

Martin Ester is a Canadian-German Full Professor of Computing Science at Simon Fraser University. His research focuses on researcher data mining and machine learning.

<span class="mw-page-title-main">Hui Xiong</span> Chinese data scientist

Hui Xiong is a data scientist. He is a distinguished professor at Rutgers University and a distinguished guest professor at the University of Science and Technology of China (USTC).

<span class="mw-page-title-main">Tina Eliassi-Rad</span> American computer scientist

Tina Eliassi-Rad is an American computer scientist and the inaugural President Joseph E. Aoun Professor at Northeastern University. Her research is at the intersection of artificial intelligence, network science, and applied ethics. In 2023, she won the Lagrange Prize for her work on ethical approaches to artificial intelligence.

References

  1. "ACM SIGKDD: Conferences". www.sigkdd.org. Archived from the original on 2006-06-15.
  2. "Event: KDD". acm.org. Archived from the original on 2017-06-16. Retrieved 2011-09-01.
  3. "Conference Ranks". www.conferenceranks.com. Archived from the original on 2020-10-22. Retrieved 2019-10-30.
  4. "Conference Ranks". www.conferenceranks.com. Archived from the original on 2016-09-11. Retrieved 2016-08-30.
  5. "KDD 2024". ACM KDD 2024. Archived from the original on 2023-12-14. Retrieved 2023-12-14.
  6. "SIGKDD - Conferences". www.kdd.org. Archived from the original on 2019-04-01. Retrieved 2019-03-08.
  7. "KDD - Knowledge Discovery and Data Mining (1994-2015)". cm.cecs.anu.edu.au. Archived from the original on 2017-12-01. Retrieved 2017-11-19.
  8. "CORE Rankings Portal - Computing Research & Education". core.edu.au. Archived from the original on 2019-10-21. Retrieved 2019-10-30.
  9. 1 2 "[Closed] Call for papers, workshop proposals, tutorial proposals | KDD 2014, 8/24-27, New York: Data Mining for Social Good". www.kdd.org. Archived from the original on 2019-10-30. Retrieved 2019-10-30.
  10. "Data Science view of the KDD 2014". August 27, 2014. Archived from the original on December 21, 2015. Retrieved November 18, 2017.
  11. "Computer Science Conferences Acceptance Rate". Haofeng Jia's Homepage. Archived from the original on 2017-12-01. Retrieved 2017-11-18.
  12. "Top Computer Science Conferences - Computer Science Conference Ranking". research.com. Archived from the original on 2019-09-30. Retrieved 2019-09-24.
  13. "Awards | Sig KDD". www.kdd.org. Archived from the original on 2012-05-26.
  14. "KDD Conference Best Paper Awards". Archived from the original on 2011-07-13. Retrieved 2012-04-07.
  15. 1 2 3 "SIGKDD BEST RESEARCH PAPER AWARDS". Archived from the original on 2017-12-07. Retrieved 2017-11-17.
  16. "Yahoo Wins Best Paper Award at KDD 2009 | research.yahoo.com". research.yahoo.com. Archived from the original on 2023-10-30. Retrieved 2023-10-23.
  17. "KDD 2015 Best Research Paper Award: "Algorithms for Public-Private Social Networks"". blog.research.google. 2015-08-17. Archived from the original on 2023-10-30. Retrieved 2023-10-23.
  18. "ACM KDD CUP". www.kdd.org. Archived from the original on 2011-03-18.
  19. SIGKDD Blog. "SIGKDD Explorations". kdd.org. Archived from the original on 2011-07-26. Retrieved 2007-07-28.
  20. Fayyad, Usama. "SIGKDD Explorations : June 1999, Volume 1, Issue 1". www.kdd.org. ACM. Archived from the original on 2016-01-13. Retrieved 2015-12-31.
  21. "Srikant's Home Page". rsrikant.com. Archived from the original on 2010-03-16. Retrieved 2009-12-18.