WinnowTag

Last updated

winnowTag is a web-based recommender system and news aggregator in which a person tags example items as belonging to a topic, thus training statistical text classification software to find more items on that same topic. [1] Released as a publicly available web application in September 2010 [2] by Mindloom, winnowTag uses Winnow content recommendation, a Naive Bayes text classifier evolved from SpamBayes. [3]

Contents

Users of winnowTag create and share tags, and use the shared tags of others to find on-topic content in real time. Users can add feeds, import OPML feed lists, and publish feeds of items found by tags. winnowTag collects and retains items from the last three months of thousands of blog, news, and website Atom and RSS feeds. [4]

Comparison

With many news aggregators users find and add each feed they wish to read, and scan the titles of all items published in those feeds to find content they want. Thus it is possible to consider every item published in a set of feeds, but the volume of items to view grows as the set of feeds grows. [5]

In winnowTag, each tag filters all items in all feeds added by all users to automatically find information on its topic. Thus it is possible to discover on-topic items from a larger set of feeds than could be manually reviewed, but not all the on-topic items are found, and there is no facility for seeing every item published in a set of feeds. [6]

Background

The winnowTag project, funded by The Kaphan Foundation, develops computer technology to help online communities self-organize and cope with large volumes of unstructured information. A principal idea is that users need individually tailored views (tags) reflecting their own specific interests. The tags in turn represent common interests that can be shared between users. [7]

An early premise was derived from ecology. Instead of static, unchanging software, users get fragments of software that can be copied and aligned into different forms, and from these pieces they construct their own unique software that does what they want. Then they pass pieces of software around, and over time this population of fragments of software acts like an evolving ecology and adapts to demands of users. [8]

Related Research Articles

RSS Family of web feed formats

RSS is a web feed that allows users and applications to access updates to websites in a standardized, computer-readable format. These feeds can, for example, allow a user to keep track of many different websites in a single news aggregator. The news aggregator will automatically check the RSS feed for new content, allowing the list to be automatically passed from website to website or from website to user. This passing of content is called web syndication. Websites usually use RSS feeds to publish frequently updated information, such as blog entries, news headlines, or episodes of audio and video series. RSS is also used to distribute podcasts. An RSS document includes full or summarized text, and metadata, like publishing date and author's name.

Social software, also known as social apps, include communication and interactive tools often based on the Internet. Communication tools typically handle the capturing, storing and presentation of communication, usually written but increasingly including audio and video as well. Interactive tools handle mediated interactions between a pair or group of users. They focus on establishing and maintaining a connection among users, facilitating the mechanics of conversation and talk. Social software generally refers to software that makes collaborative behaviour, the organisation and moulding of communities, self-expression, social interaction and feedback possible for individuals. Another element of the existing definition of social software is that it allows for the structured mediation of opinion between people, in a centralized or self-regulating manner. The most improved area for social software is that Web 2.0 applications can all promote cooperation between people and the creation of online communities more than ever before.

Web syndication is a form of syndication in which content is made available from one website to other sites. Most commonly, websites are made available to provide either summaries or full renditions of a website's recently added content. The term may also describe other kinds of content licensing for reuse.

Atom (Web standard)

The name Atom applies to a pair of related Web standards. The Atom Syndication Format is an XML language used for web feeds, while the Atom Publishing Protocol is a simple HTTP-based protocol for creating and updating web resources.

Web feed Data format used for providing users with frequently updated content

On the World Wide Web, a web feed is a data format used for providing users with frequently updated content. Content distributors syndicate a web feed, thereby allowing users to subscribe a channel to it by adding the feed resource address to a news aggregator client. Users typically subscribe to a feed by manually entering the URL of a feed or clicking a link in a web browser or by dragging the link from the web browser to the aggregator, thus "RSS and Atom files provide news updates from a website in a simple form for your computer."

Broadcatching is the downloading of digital content that has been made available over the Internet using RSS.

Furl was a free social bookmarking website that allowed members to store searchable copies of webpages and share them with others. Every member received 5 gigabytes of storage space. The site was founded by Mike Giles in 2003 and purchased by LookSmart in September 2004. Diigo bought it from LookSmart in exchange for equity.

Liferea

Liferea is a news aggregator for online news feeds and podcasts. It supports the major feed formats including RSS/RDF and Atom and can import and export subscription lists in OPML format. Liferea is intended to be a fast, easy to use, and easy to install news aggregator for GTK+ that can be used with the GNOME desktop. Liferea features a script manager, in which users can add custom scripts that run whenever a certain action occurs.

Tag (metadata) Keyword assigned to information

In information systems, a tag is a keyword or term assigned to a piece of information. This kind of metadata helps describe an item and allows it to be found again by browsing or searching. Tags are generally chosen informally and personally by the item's creator or by its viewer, depending on the system, although they may also be chosen from a controlled vocabulary.

RSS enclosures are a way of attaching multimedia content to RSS feeds by providing the URL of a file associated with an entry, such as an MP3 file to a music recommendation or a photo to a diary entry. Unlike e-mail attachments, enclosures are merely hyperlinks to files. The actual file data is not embedded into the feed. Support and implementation among aggregators varies: if the software understands the specified file format, it may automatically download and display the content, otherwise provide a link to it or silently ignore it.

News aggregator Client software that aggregates syndicated web content

In computing, a news aggregator, also termed a feed aggregator, feed reader, news reader, RSS reader or simply an aggregator, is client software or a web application that aggregates syndicated web content such as online newspapers, blogs, podcasts, and video blogs (vlogs) in one location for easy viewing. The updates distributed may include journal tables of contents, podcasts, videos, and news items.

Google Reader Defunct RSS/Atom feed aggregator formerly operated by Google

Google Reader was an RSS/Atom feed aggregator operated by Google. It was created in early 2005 by Google engineer Chris Wetherell and launched on October 7, 2005, through Google Labs. Google Reader grew in popularity to support a number of programs which used it as a platform for serving news and information to people. Google closed Google Reader on July 1, 2013, citing declining use.

This is a list of blogging terms. Blogging, like any hobby, has developed something of a specialised vocabulary. The following is an attempt to explain a few of the more common phrases and words, including etymologies when not obvious.

FeedSync for Atom and RSS, previously Simple Sharing Extensions, are extensions to RSS and Atom feed formats designed to enable the synchronization of information by using a variety of data sources. Initially developed by Ray Ozzie, Chief Software Architect at Microsoft, it is now maintained by Jack Ozzie, George Moromisato, Matt Augustine, Paresh Suthar and Steven Lees. Dave Winer, the designer of the UserLand Software RSS specification variants, has given input for the specifications.

Product finders are information systems that help consumers to identify products within a large palette of similar alternative products. Product finders differ in complexity, the more complex among them being a special case of decision support systems. Conventional decision support systems, however, aim at specialized user groups, e.g. marketing managers, whereas product finders focus on consumers.

Enterprise social software, comprises social software as used in "enterprise" (business/commercial) contexts. It includes social and networked modifications to corporate intranets and other classic software platforms used by large companies to organize their communication. In contrast to traditional enterprise software, which imposes structure prior to use, enterprise social software tends to encourage use prior to providing structure.

GeoRSS is a specification for encoding location as part of a Web feed. (Web feeds are used to describe feeds of content, such as news articles, Audio blogs, video blogs and text blog entries. These web feeds are rendered by programs such as aggregators and web browsers.) The name "GeoRSS" is derived from RSS, the most known Web feed and syndication format.

HCL Connections is a Web 2.0 enterprise social software application developed originally by IBM and acquired by HCL Technologies in July 2019. Connections is an enterprise-collaboration platform which helps teams work more efficiently. Connections is part of HCL collaboration suite which also includes Notes / Domino, Sametime, Portal and Connections.

EMML, or Enterprise Mashup Markup Language, is an XML markup language for creating enterprise mashups, which are software applications that consume and mash data from variety of sources, often performing logical or mathematical operations as well as presenting data. Mashed data produced by enterprise mashups are presented in graphical user interfaces as mashlets, widgets, or gadgets. EMML can also be considered a declarative mashup domain-specific language (DSL). A mashup DSL eliminates the need for complex, time-consuming, and repeatable procedural programming logic to create enterprise mashups. EMML also provides a declarative language for creating visual tools for enterprise mashups.

Inoreader is a web-based content and RSS feed reader, a cloud-based service for web browsers and mobile devices running iOS and Android. It compiles news feeds from online sources for the user in unified layout to customize and share with others. Inoreader was first released by Innologica in 2013.

References