NOV (computers)

Last updated

NOV, or News Overview, is a widely deployed indexing method for Usenet articles, also found in some Internet email implementations. Written in 1992 by Geoff Collyer, NOV replaced a variety of incompatible indexing schemes used in different client programs, each typically requiring custom modifications to each news server before they could be used. In modern NNTP implementations, NOV is exposed as the XOVER and related commands.

Contents

Operation

In its original implementation, the header lines of each incoming message are examined, and a single line of text is appended to the overview files, with one overview file present for each newsgroup. Tab (ASCII code 9) characters and line breaks within the headers are converted to spaces (ASCII code 32), and the header fields within each overview line are then delimited by tab characters.

The first seven fields in a NOV line are fixed and unlabeled:

  1. Subject: header contents
  2. From: header contents
  3. Date: header contents
  4. Message-ID: header contents
  5. References: header contents
  6. Size of the article in octets
  7. Lines: header contents

The header lines are those defined in either RFC 2822 or RFC 1036. If data for any of these fields is missing, a tab alone is put in its place. The value of the size field is approximate, as servers may count line endings as one or two characters. Additionally, the lines value may be calculated by the server, supplied by the message sender, or omitted altogether.

An arbitrary number of additional fields may be added to any NOV line. The eighth and later fields must be labeled in the form "Header-Name: contents", again delimited by tabs. The order and presence of additional fields are allowed to vary from line to line, and from server to server. Some server provide a schema of what is recorded to new overview lines in the form of an NNTP LIST OVERVIEW.FMT command, but this cannot be relied upon to be accurate for older entries.

In practice, most servers supply only one optional field, the contents of the Xref: header, to allow crosspost management.

Variations

While virtually all modern news server and newsreader software employs NOV, it has also found its way into other applications such as email clients. One prominent example is Gnus, which can take advantage of overview files for faster access to large mail folders.

Many newer news servers store NOV data in a variety of formats. A specialized database is frequently used in favor of the original flat file arrangement.

Related Research Articles

Email Method of exchanging digital messages between people over a network

Electronic mail is a method of exchanging messages ("mail") between people using electronic devices. Email entered limited use in the 1960s, but users could only send to users of the same computer, and some early email systems required the author and the recipient to both be online simultaneously, similar to instant messaging. Ray Tomlinson is credited as the inventor of email; in 1971, he developed the first system able to send mail between users on different hosts across the ARPANET, using the @ sign to link the user name with a destination server. By the mid-1970s, this was the form recognized as email.

In computing, the Internet Message Access Protocol (IMAP) is an Internet standard protocol used by email clients to retrieve email messages from a mail server over a TCP/IP connection. IMAP is defined by RFC 3501.

Multipurpose Internet Mail Extensions (MIME) is an Internet standard that extends the format of email messages to support text in character sets other than ASCII, as well as attachments of audio, video, images, and application programs. Message bodies may consist of multiple parts, and header information may be specified in non-ASCII character sets. Email messages with MIME formatting are typically transmitted with standard protocols, such as the Simple Mail Transfer Protocol (SMTP), the Post Office Protocol (POP), and the Internet Message Access Protocol (IMAP).

The Simple Mail Transfer Protocol (SMTP) is a communication protocol for electronic mail transmission. As an Internet standard, SMTP was first defined in 1982 by RFC 821, and updated in 2008 by RFC 5321 to Extended SMTP additions, which is the protocol variety in widespread use today. Mail servers and other message transfer agents use SMTP to send and receive mail messages. SMTP servers commonly use the Transmission Control Protocol on port number 25.

A signature block is a personalized block of text automatically appended at the bottom of an email message, Usenet article, or forum post.

The File Transfer Protocol (FTP) is a standard network protocol used for the transfer of computer files between a client and server on a computer network.

8-bit clean describes a computer system that correctly handles 8-bit character encodings, such as the ISO 8859 series and the UTF-8 encoding of Unicode.

Mbox is a generic term for a family of related file formats used for holding collections of email messages, first implemented for Fifth Edition Unix.

A news server is a collection of software used to handle Usenet articles. It may also refer to a computer itself which is primarily or solely used for handling Usenet. Access to Usenet is only available through news server provider.

An email address identifies an email box to which email messages are delivered. A wide variety of formats were used in early email systems, but only a single format is used today, following the specifications developed for Internet mail systems since the 1980s. This article uses the term email address to refer to the addr-spec defined in RFC 5322, not to the address that is commonly used; the difference is that an address may contain a display name, a comment, or both.

A comma-separated values (CSV) file is a delimited text file that uses a comma to separate values. Each line of the file is a data record. Each record consists of one or more fields, separated by commas. The use of the comma as a field separator is the source of the name for this file format. A CSV file typically stores tabular data in plain text, in which case each line will have the same number of fields.

Kontact free Personal Information Manager

Kontact is a personal information manager and groupware software suite developed by KDE. It supports calendars, contacts, notes, to-do lists, news, and email. It offers a number of inter-changeable graphical UIs all built on top of a common core.

The following tables compare general and technical features of a number of notable email client programs.

In computing, syslog is a standard for message logging. It allows separation of the software that generates messages, the system that stores them, and the software that reports and analyzes them. Each message is labeled with a facility code, indicating the software type generating the message, and assigned a severity level.

Many email clients now offer some support for Unicode. While some use Unicode by default, many others will automatically choose between a legacy encoding and Unicode depending on the mail's content, either automatically or when the user requests it.

A binary-to-text encoding is encoding of data in plain text. More precisely, it is an encoding of binary data in a sequence of printable characters. These encodings are necessary for transmission of data when the channel does not allow binary data or is not 8-bit clean. PGP documentation uses the term "ASCII armor" for binary-to-text encoding when referring to Base64.

Control messages are a special kind of Usenet post that are used to control news servers. They differ from ordinary posts by a header field named Control. The body of the field contains control name and arguments.

Email forwarding generically refers to the operation of re-sending an email message delivered to one email address to one or more different email addresses.

International email arises from the combined provision of internationalized domain names (IDN) and email address internationalization (EAI). The result is email that contains international characters, encoded as UTF-8, in the email header and in supporting mail transfer protocols. The most significant aspect of this is the allowance of email addresses in most of the world's writing systems, at both interface and transport levels.

XOVER is a Network News Transfer Protocol (NNTP) command used to return information from a news server's overview (NOV) database.