Lee Giles

Last updated
Clyde Lee Giles
Born
NationalityAmerican
Alma mater Rhodes College
University of Tennessee
University of Michigan
University of Arizona
Known for CiteSeer, neural networks, information retrieval, digital libraries, web search
Awards
Scientific career
Fields Computer science, search engines, artificial intelligence, deep learning
Institutions Ford Motor Company, Naval Research Laboratory, Air Force Office of Scientific Research, NEC Research Institute, Pennsylvania State University
Doctoral advisor Harrison H. Barrett [1]

Clyde Lee Giles is an American computer scientist and the David Reese Professor at the Penn State College of Information Sciences and Technology (IST) at the Pennsylvania State University. He is also Graduate Faculty Professor of Computer Science and Engineering, Courtesy Professor of Supply Chain and Information Systems, and Director of the Intelligent Systems Research Laboratory. He was Interim Associate Dean of Research in the College of IST. He graduated from Oakhaven High School in Memphis, Tennessee. His graduate degrees are from the University of Michigan and the University of Arizona and his undergraduate degrees are from Rhodes College and the University of Tennessee. His PhD is in optical sciences with advisor Harrison H. Barrett. His academic genealogy includes two Nobel laureates (Felix Bloch and Werner Heisenberg), Arnold Sommerfeld and prominent mathematicians. [2]

Contents

Research

Giles has been associated with the computer science or electrical engineering departments at Princeton University, the University of Pennsylvania, Columbia University, the University of Pisa, the University of Trento and the University of Maryland, College Park. Previous positions were at NEC Research Institute (now NEC Labs), Princeton, NJ; Air Force Research Laboratory; and the United States Naval Research Laboratory. He is best known for his work on the creation of novel scientific and academic search engines and digital libraries and is considered by some one of the founders of academic document search. Earlier research was concerned with recurrent neural networks and optical computing.

His research interests are in intelligent web and cyberinfrastructure tools, search engines and information retrieval, digital libraries, web services, knowledge and information management and extraction, machine learning, and information and data mining. He has created several vertical search engines in these areas. He has over 500 publications with some in Nature, Science and the Proceedings of the National Academy of Sciences. His research is well cited with an h-index of 117 according to Google Scholar and over 60,000 total citations as evidenced in Google Scholar. He has one of the top 200 h-indexes in Computer Science and the top 10 in Information Retrieval. At Penn State he has graduated 36 PHD students. Most of his papers author his name as C. Lee Giles or C.L. Giles.

Awards

He is a Fellow of the Association for Computing Machinery (ACM), [3] Institute of Electrical and Electronics Engineers (IEEE), [4] and International Neural Network Society, (INNS). He also received the Gabor Award [5] from the International Neural Network Society recognizing achievements in engineering/applications in neural networks. Most recently he received the 2018 Institute of Electrical and Electronics Engineers (IEEE) Computational Intelligence Society (CIS) Neural Networks Pioneer Award [6] and the 2018 National Federation of Advanced Information Services (NFAIS) Miles Conrad Award. [7]

He has twice received the IBM Distinguished Faculty Award.[ citation needed ]

Electromagnetics

Before his work on neural networks, Giles published papers on reflection and scattering of electromagnetic waves from magnetic materials for the particular cases of equal refractive indexes. His work is mentioned in the following articles: Fresnel equations, Mie scattering, and Brewster's angle. For Mie scattering, he is a coauthor on the Kerker effect, which was an extension of his work on a planar boundary effect and his idea.

Neural networks

Giles' work on neural networks showed that fundamental computational structures such as regular grammars and finite state machines could be theoretically represented in recurrent neural networks. Another contribution was the Neural Network Pushdown Automata and the first analog differentiable stack. Some of these publications are cited as early work in "deep" learning.

While at AFOSR in Washington DC, Giles in 1986 established the first neural network funding in 20 years and helped DARPA establish theirs.

CiteSeer and search engines

In 1998 and 1999 his work published in Science and Nature with Steve Lawrence estimated the size of the web and showed that search engines did not index that much of it. This work also showed that the web had significantly matured and had a diversity of material and resources.

With Steve Lawrence and Kurt Bollacker, Giles was responsible for the creation in 1997 of automatic citation indexing [8] and CiteSeer, a public academic search engine and digital library for Computer and Information Science. Under his direction CiteSeer was moved to and is being maintained at the Pennsylvania State University. CiteSeer has been replaced by the Next Generation CiteSeer, CiteSeerX.

He is the director of the Next Generation CiteSeer project, CiteSeerX, also at the Pennsylvania State University. In addition, he was responsible for the creation of an academic business search engine and digital library, BizSeer (previously known as SmealSearch). With Isaac Councill, he created automatic acknowledgement indexing, permitting for the first time the automatic search and indexing of acknowledged entities in scholarly and research documents. The search engine for this was AckSeer. He also was the cocreator of the first search engine for robots.txt, BotSeer.

Research in collaboration with Professors Prasenjit Mitra, Karl Mueller, Barbara Garrison and James Kubicki resulted in the development of a search engine and data portal for chemistry, ChemxSeer, ChemXSeer. With Yang Sun, a novel search engine, BotSeer, was designed that searches and indexes robots.txt files on web sites. The Next Generation CiteSeer, CiteSeerx, came online in February 2008, with over one million articles indexed and now with active crawling exceeds 10 million articles. RefSeerX was a context-aware citation recommendation service which recommended papers from CiteSeerX that are most relevant to a given text. These services were based on SeerSuite, a package of open sources tools for searching and indexing academic documents and data.

Expert witness

Giles has been an expert witness for Quinn Emanuel Urquhart & Sullivan, LLP and Davis Polk & Wardwell, LLP representing Google and Yahoo.

Related Research Articles

Information retrieval (IR) in computing and information science is the task of identifying and retrieving information system resources that are relevant to an information need. The information need can be specified in the form of a search query. In the case of document retrieval, queries can be based on full-text or other content-based indexing. Information retrieval is the science of searching for information in a document, searching for documents themselves, and also searching for the metadata that describes data, and for databases of texts, images or sounds.

<span class="mw-page-title-main">Web crawler</span> Software which systematically browses the World Wide Web

A Web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses the World Wide Web and that is typically operated by search engines for the purpose of Web indexing.

CiteSeerX is a public search engine and digital library for scientific and academic papers, primarily in the fields of computer and information science.

<span class="mw-page-title-main">DBLP</span> Computer science bibliography website

DBLP is a computer science bibliography website. Starting in 1993 at Universität Trier in Germany, it grew from a small collection of HTML files and became an organization hosting a database and logic programming bibliography site. Since November 2018, DBLP is a branch of Schloss Dagstuhl – Leibniz-Zentrum für Informatik (LZI). DBLP listed more than 5.4 million journal articles, conference papers, and other publications on computer science in December 2020, up from about 14,000 in 1995 and 3.66 million in July 2016. All important journals on computer science are tracked. Proceedings papers of many conferences are also tracked. It is mirrored at three sites across the Internet.

<span class="mw-page-title-main">Metasearch engine</span> Online information retrieval tool

A metasearch engine is an online information retrieval tool that uses the data of a web search engine to produce its own results. Metasearch engines take input from a user and immediately query search engines for results. Sufficient data is gathered, ranked, and presented to the users.

SmealSearch was a web portal, search engine and digital library for academic business documents that was originally hosted at the defunct eBusiness Research Center at the Pennsylvania State University. It was based on the CiteSeer digital library and search engine technology. Due to lack of support, it moved to the College of Information Sciences and Technology and became BizSeer. It was enhanced and modified by many including Lee Giles, Yang Sun, Sandip Debnath, Isaac Councill, Arvind Rangaswamy, Nirmal Pal, Yves Petinot and Pradeep Teregowda.

Citation analysis is the examination of the frequency, patterns, and graphs of citations in documents. It uses the directed graph of citations — links from one document to another document — to reveal properties of the documents. A typical aim would be to identify the most important documents in a collection. A classic example is that of the citations between academic articles and books. For another example, judges of law support their judgements by referring back to judgements made in earlier cases. An additional example is provided by patents which contain prior art, citation of earlier patents relevant to the current claim. The digitization of patent data and increasing computing power have led to a community of practice that uses these citation data to measure innovation attributes, trace knowledge flows, and map innovation networks.

<span class="mw-page-title-main">Kurt Bollacker</span> American computer scientist

Kurt Bollacker is an American computer scientist with a research background in the areas of machine learning, digital libraries, semantic networks, and electro-cardiographic modeling.

Vasant G. Honavar is an Indian-American computer scientist, and artificial intelligence, machine learning, big data, data science, causal inference, knowledge representation, bioinformatics and health informatics researcher and professor.

<span class="mw-page-title-main">Pat Hanrahan</span> American computer graphics researcher

Patrick M. Hanrahan is an American computer graphics researcher, the Canon USA Professor of Computer Science and Electrical Engineering in the Computer Graphics Laboratory at Stanford University. His research focuses on rendering algorithms, graphics processing units, as well as scientific illustration and visualization. He has received numerous awards, including the 2019 Turing Award.

ACM Multimedia (ACM-MM) is the Association for Computing Machinery (ACM)'s annual conference on multimedia, sponsored by the SIGMM special interest group on multimedia in the ACM. SIGMM specializes in the field of multimedia computing, from underlying technologies to applications, theory to practice, and servers to networks to devices.

<span class="mw-page-title-main">James Z. Wang</span> Chinese-American computer scientist

James Ze Wang is a Chinese-American computer scientist. He is a distinguished professor of the College of Information Sciences and Technology at Pennsylvania State University. He is also an affiliated professor of the Molecular, Cellular, and Integrative Biosciences Program; the Computational Science Graduate Minor; and the Social Data Analytics Graduate Program. He is co-director of the Intelligent Information Systems Laboratory. He was a visiting professor of the Robotics Institute at Carnegie Mellon University from 2007 to 2008. In 2011 and 2012, he served as a program manager in the Office of International Science and Engineering at the National Science Foundation. He is the second son of Chinese mathematician Wang Yuan.

BotSeer was a Web-based information system and search tool used for research on Web robots and trends in Robot Exclusion Protocol deployment and adherence. It was created and designed by Yang Sun, Isaac G. Councill, Ziming Zhuang and C. Lee Giles. BotSeer was in operation from 2007 to 2010, approximately.

Amit Sheth is a computer scientist at University of South Carolina in Columbia, South Carolina. He is the founding Director of the Artificial Intelligence Institute, and a Professor of Computer Science and Engineering. From 2007 to June 2019, he was the Lexis Nexis Ohio Eminent Scholar, director of the Ohio Center of Excellence in Knowledge-enabled Computing, and a Professor of Computer Science at Wright State University. Sheth's work has been cited by over 48,800 publications. He has an h-index of 106, which puts him among the top 100 computer scientists with the highest h-index. Prior to founding the Kno.e.sis Center, he served as the director of the Large Scale Distributed Information Systems Lab at the University of Georgia in Athens, Georgia.

<span class="mw-page-title-main">Dimitri Bertsekas</span> Greek electrical engineer

Dimitri Panteli Bertsekas is an applied mathematician, electrical engineer, and computer scientist, a McAfee Professor at the Department of Electrical Engineering and Computer Science in School of Engineering at the Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, and also a Fulton Professor of Computational Decision Making at Arizona State University, Tempe.

<span class="mw-page-title-main">Tomasz Imieliński</span> Polish-American computer scientist (born 1954)

Tomasz Imieliński is a Polish-American computer scientist, most known in the areas of data mining, mobile computing, data extraction, and search engine technology. He is currently a professor of computer science at Rutgers University in New Jersey, United States.

Michael David Mitzenmacher is an American computer scientist working in algorithms. He is Professor of Computer Science at the Harvard John A. Paulson School of Engineering and Applied Sciences and was area dean of computer science July 2010 to June 2013. He also runs My Biased Coin, a blog about theoretical computer science.

<span class="mw-page-title-main">Author name disambiguation</span>

Author name disambiguation is a type of disambiguation and record linkage applied to the names of individual people. The process could, for example, distinguish individuals with the name "John Smith".

Venkata Narayana Padmanabhan is a computer scientist and principal researcher at Microsoft Research India. He is known for his research in networked and mobile systems. He is an elected fellow of the Indian National Academy of Engineering, Institute of Electrical and Electronics Engineers and the Association for Computing Machinery. The Council of Scientific and Industrial Research, the apex agency of the Government of India for scientific research, awarded him the Shanti Swarup Bhatnagar Prize for Science and Technology, one of the highest Indian science awards for his contributions to Engineering Sciences in 2016.

Chin-Hui Lee is an information scientist, best known for his work in speech recognition, speaker recognition and acoustic signal processing. He joined Georgia Institute of Technology in 2002 as a professor in the school of electrical and computer engineering

References

  1. "Harrison H. Barrett's Website at University of Arizona" . Retrieved 2013-04-13.
  2. "Biography Page". 2010. Retrieved 2010-10-28.
  3. "C Lee Giles". ACM Fellows. ACM. 2006. Archived from the original on 2012-02-10. Retrieved 2010-01-23. For contributions to information processing and web analysis.
  4. "Fellows - G". IEEE Fellows. IEEE. Retrieved 2010-01-23. C. Lee Giles 1997: for contributions to the theory and practice of neural networks
  5. "INNS Awards".
  6. "IST professor named IEEE Pioneer in neural networks | Penn State University".
  7. "2018 Miles Conrad Awardee".
  8. Giles, C. Lee; Bollacker, Kurt D.; Lawrence, Steve (1998). "CiteSeer: An Automatic Citation Indexing System" (PDF). C. Lee Giles, Pennsylvania State University.