Ramanathan V. Guha

Last updated
Ramanathan V. Guha
Ramanathan V. Guha.jpg
Ramanathan V. Guha (2013)
Born1965 (age 5859)[ citation needed ]
Citizenship India
Alma mater Indian Institute of Technology Madras
University of California, Berkeley
Stanford University
Known for Cyc [1]
Schema.org
Meta Content Framework
Resource Description Framework
RSS
Datacommons.org
Scientific career
Fields Data mining
Institutions Google
IBM
Apple Inc.
Indian Institute of Technology Madras
University of California Berkeley
Stanford University
Thesis Contexts: A formalization and some applications  (1992)
Doctoral advisor John McCarthy
Edward Feigenbaum [2]
Website www.guha.com/cv.html
research.google.com/pubs/author17184.html

Ramanathan V. Guha (born 1965)[ citation needed ] is the creator of widely used web standards such as RSS, RDF and Schema.org. He is also responsible for products such as Google Custom Search. He was a co-founder of Epinions and Alpiri. He worked at Google for nearly two decades, most recently as a Google Fellow, before announcing his departure from the company in August 2024. [3] [4] [5] [6] [7]

Contents

Education

Guha did his schooling from Loyola High School, Pune and graduated with Bachelor of Technology in Mechanical Engineering from the Indian Institute of Technology Madras, Master of Science from University of California, Berkeley and Ph.D from Stanford University. [2] [8]

Career

Guha was one of the early co-leaders of the Cyc Project where he worked from 1987 through 1994 at Microelectronics and Computer Technology Corporation in collaboration with Douglas Lenat. [1] He was responsible for the design and implementation of key parts of the Cyc system, including the CycL knowledge representation language, the upper ontological layers of the Cyc Knowledge Base and some parts of the original Cyc Natural Language understanding system. [4]

Leaving what became Cycorp, Guha founded Q Technology, which created a database schema mapping tool called Babelfish. In 1994, he moved to work at Apple Computer, reporting to Alan Kay, where he developed the Meta Content Framework (MCF) format. In 1997 he joined Netscape where, in collaboration with Tim Bray, he created a new version of MCF that used the XML language and which became the main technical precursor to W3C's Resource Description Framework (RDF) standard. [9] [10]

Guha also contributed to the "smart browsing" features of Netscape 4.5 and was instrumental in Netscape's acquisition of NewHoo, which Netscape rebranded the Open Directory Project. [11] He created the first version of RSS as part of a move in which the My.Netscape portal was opened to wider participation in March 1999. [12] [13] In 1999 he left Netscape and in May co-founded Epinions, [14] where he worked until 2000. [8] Guha founded Alpiri in late 2000 which created TAP, [15] a semantic web application and knowledge base. In 2002, he became a researcher at IBM Almaden Research Center. [8]

In 2005 Guha joined Google and became a Google Fellow. While at Google he has been responsible for Google Custom Search and a number of enhancements to Adwords. He is also one of the founders of Schema.org and Datacommons.org. [8] After nearly two decades with the company, he announced his departure in August 2024. [7]

Awards

On January 26, 2013, Guha was named as a Distinguished Alumnus of the Indian Institute of Technology Madras. [16]

December 8, 2015 named as a ACM Fellow for contributions to structured data representation and specification and their impact on the Web [17]

Related Research Articles

<span class="mw-page-title-main">Cyc</span> Artificial intelligence project

Cyc is a long-term artificial intelligence project that aims to assemble a comprehensive ontology and knowledge base that spans the basic concepts and rules about how the world works.

Knowledge representation and reasoning is the field of artificial intelligence (AI) dedicated to representing information about the world in a form that a computer system can use to solve complex tasks such as diagnosing a medical condition or having a dialog in a natural language. Knowledge representation incorporates findings from psychology about how humans solve problems and represent knowledge, in order to design formalisms that will make complex systems easier to design and build. Knowledge representation and reasoning also incorporates findings from logic to automate various kinds of reasoning.

<span class="mw-page-title-main">Semantic Web</span> Extension of the Web to facilitate data exchange

The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable.

The Resource Description Framework (RDF) is a World Wide Web Consortium (W3C) standard originally designed as a data model for metadata. It has come to be used as a general method for description and exchange of graph data. RDF provides a variety of syntax notations and data serialization formats, with Turtle currently being the most widely used notation.

CycL in computer science and artificial intelligence, is an ontology language used by Douglas Lenat's Cyc artificial intelligence project. Ramanathan V. Guha was instrumental in designing early versions of the language. A close CycL variant exists named MELD.

<span class="mw-page-title-main">RSS</span> Family of web feed formats

RSS is a web feed that allows users and applications to access updates to websites in a standardized, computer-readable format. Subscribing to RSS feeds can allow a user to keep track of many different websites in a single news aggregator, which constantly monitor sites for new content, removing the need for the user to manually check them. News aggregators can be built into a browser, installed on a desktop computer, or installed on a mobile device.

<span class="mw-page-title-main">Douglas Lenat</span> Computer scientist and AI pioneer

Douglas Bruce Lenat was an American computer scientist and researcher in artificial intelligence who was the founder and CEO of Cycorp, Inc. in Austin, Texas.

The Web Ontology Language (OWL) is a family of knowledge representation languages for authoring ontologies. Ontologies are a formal way to describe taxonomies and classification networks, essentially defining the structure of knowledge for various domains: the nouns representing classes of objects and the verbs representing relations between the objects.

Meta Content Framework (MCF) is a specification of a content format for structuring metadata about web sites and other data.

RDF Schema (Resource Description Framework Schema, variously abbreviated as RDFS, RDF(S), RDF-S, or RDF/S) is a set of classes with certain properties using the RDF extensible knowledge representation data model, providing basic elements for the description of ontologies. It uses various forms of RDF vocabularies, intended to structure RDF resources. RDF and RDFS can be saved in a triplestore, then one can extract some knowledge from them using a query language, like SPARQL.

<span class="mw-page-title-main">Semantic technology</span> Technology to help machines understand data

The ultimate goal of semantic technology is to help machines understand data. To enable the encoding of semantics with the data, well-known technologies are RDF and OWL. These technologies formally represent the meaning involved in information. For example, ontology can describe concepts, relationships between things, and categories of things. These embedded semantics with the data offer significant advantages such as reasoning over data and dealing with heterogeneous data sources.

The RSS-DEV Working Group was the outgrowth of a fork in RSS format development. The private, non-commercial working group began with a dozen members in three countries, and was chaired by Rael Dornfest, researcher and developer of the Meerkat RSS-reader software.

Web syndication technologies were preceded by metadata standards such as the Meta Content Framework (MCF) and the Resource Description Framework (RDF), as well as by 'push' specifications such as Channel Definition Format (CDF). Early web syndication standards included Information and Content Exchange (ICE) and RSS. More recent specifications include Atom and GData.

Ora Lassila is a Finnish computer scientist who lives in the U.S. and works as a technologist at Amazon Web Services. He has been conducting research into the Semantic Web since 1996, and was co-author, with Tim Berners-Lee and James Hendler, of the article "The Semantic Web" which appeared in Scientific American in 2001, now the most cited paper in the Semantic Web area. His early work in this area included proposing the original RDF Specification with Ralph R. Swick and he has been an elected member of the World Wide Web Consortium (W3C) Advisory Board since 1998. He also belongs to the steering committee of the Semantic Web Science Association.

Ontology-based data integration involves the use of one or more ontologies to effectively combine data or information from multiple heterogeneous sources. It is one of the multiple data integration approaches and may be classified as Global-As-View (GAV). The effectiveness of ontology‑based data integration is closely tied to the consistency and expressivity of the ontology used in the integration process.

Amit Sheth is a computer scientist at University of South Carolina in Columbia, South Carolina. He is the founding Director of the Artificial Intelligence Institute, and a Professor of Computer Science and Engineering. From 2007 to June 2019, he was the Lexis Nexis Ohio Eminent Scholar, director of the Ohio Center of Excellence in Knowledge-enabled Computing, and a Professor of Computer Science at Wright State University. Sheth's work has been cited by over 48,800 publications. He has an h-index of 117, which puts him among the top 100 computer scientists with the highest h-index. Prior to founding the Kno.e.sis Center, he served as the director of the Large Scale Distributed Information Systems Lab at the University of Georgia in Athens, Georgia.

Freebase was a large collaborative knowledge base consisting of data composed mainly by its community members. It was an online collection of structured data harvested from many sources, including individual, user-submitted wiki contributions. Freebase aimed to create a global resource that allowed people to access common information more effectively. It was developed by the American software company Metaweb and run publicly beginning in March 2007. Metaweb was acquired by Google in a private sale announced on 16 July 2010. Google's Knowledge Graph is powered in part by Freebase.

Knowledge extraction is the creation of knowledge from structured and unstructured sources. The resulting knowledge needs to be in a machine-readable and machine-interpretable format and must represent knowledge in a manner that facilitates inferencing. Although it is methodically similar to information extraction (NLP) and ETL, the main criterion is that the extraction result goes beyond the creation of structured information or the transformation into a relational schema. It requires either the reuse of existing formal knowledge or the generation of a schema based on the source data.

In the Semantic Web and in knowledge representation, a metaclass is a class whose instances can themselves be classes. Similar to their role in programming languages, metaclasses in Semantic Web languages can have properties otherwise applicable only to individuals, while retaining the same class's ability to be classified in a concept hierarchy. This enables knowledge about instances of those metaclasses to be inferred by semantic reasoners using statements made in the metaclass. Metaclasses thus enhance the expressivity of knowledge representations in a way that can be intuitive for users. While classes are suitable to represent a population of individuals, metaclasses can, as one of their feature, be used to represent the conceptual dimension of an ontology. Metaclasses are supported in the ontology language OWL and the data-modeling vocabulary RDFS.

Datacommons.org is an open knowledge graph hosted by Google that provides a unified view across multiple public datasets, combining economic, scientific and other open datasets into an integrated data graph. The Datacommons.org site was launched in May 2018 with an initial dataset consisting of fact-checking data published in Schema.org "ClaimReview" format by several fact checkers from the International Fact-Checking Network. Google has worked with partners including the United States Census, the World Bank, and US Bureau of Labor Statistics to populate the repository, which also hosts data from Wikipedia, the National Oceanic and Atmospheric Administration and the Federal Bureau of Investigation. The service expanded during 2019 to include an RDF-style knowledge graph populated from a number of largely statistical open datasets. The service was announced to a wider audience in 2019. In 2020 the service improved its coverage of non-US datasets, while also increasing its coverage of bioinformatics and coronavirus.

References

  1. 1 2 Guha, R. V.; Lenat, Douglas B. (1990). Building large knowledge-based systems: representation and inference in the Cyc project. Reading, Mass: Addison-Wesley Pub. Co. ISBN   978-0-201-51752-1.
  2. 1 2 Ramanathan V. Guha at the Mathematics Genealogy Project
  3. Guha, R.; McCool, R.; Miller, E. (2003). Semantic search. WWW'03. Proceedings of the twelfth international conference on World Wide Web. pp. 700–709. doi:10.1145/775152.775250. ISBN   1-58113-680-3.
  4. 1 2 Ramanathan V. Guha at DBLP Bibliography Server OOjs UI icon edit-ltr-progressive.svg
  5. Ramanathan V. Guha author profile page at the ACM Digital Library
  6. Ramanathan V. Guha's publications indexed by the Scopus bibliographic database. (subscription required)
  7. 1 2 "Post by R. Guha on LinkedIn". LinkedIn. Retrieved 2024-08-12.
  8. 1 2 3 4 Guha, R.V. "CV". Guha.com. Retrieved 2015-12-31.
  9. Andreessen, Marc (1999-01-08). "Innovators of the Net: R.V. Guha and RDF". Netscape. Archived from the original on February 5, 2008. Retrieved 2014-09-14.
  10. Baldazo, Rex (1998-04-15). "The 1st annual Web Innovator Awards - R. V. Guha: Resource Description Framework (RDF)". Builder.com. Archived from the original on November 17, 1999. Retrieved 2016-02-02.
  11. Skrenta, Rich. "About Rich Skrenta" . Retrieved April 20, 2014.
  12. Hines, Matt (1999-03-15). "Netscape Broadens Portal Content Strategy". Newsbytes.
  13. Afzali, Cyrus (1999-03-15). "Netscape Launches Publishing Program". Internet News. Archived from the original on April 20, 2003. Retrieved 2015-11-21.
  14. Zaret, Elliot (2000-08-18). "The rise and fall of Netscape". ZDNet News. Archived from the original on August 18, 2000. Retrieved 2015-12-31.
  15. Guha, R.; McCool, R. (August 2003). "TAP: A Semantic Web platform". Comput. Netw. 42 (5): 557–577. CiteSeerX   10.1.1.58.2546 . doi:10.1016/S1389-1286(03)00225-1.
  16. "IIT-M announces awards for distinguished alumni". The New Indian Express. Jan 28, 2013. Archived from the original on March 4, 2016. Retrieved Feb 4, 2013.
  17. "ACM Fellows named for Computing Innovations that are Advancing Technology in the Digital Age" (PDF). Retrieved Dec 9, 2015.