Nearline storage

Last updated

Nearline storage (a portmanteau of "near" and "online storage") is a term used [1] in computer science to describe an intermediate type of data storage that represents a compromise between online storage (supporting frequent, very rapid access to data) and offline storage/archiving (used for backups or long-term storage, with infrequent access to data). [2] [3]

Contents

Nearline storage dates back to the IBM 3850 Mass Storage System (MSS) tape library, which was announced in 1974. [4]

Overview

The formal distinction between online, nearline, and offline storage is: [4]

For example, always-on spinning hard disk drives are online storage, while spinning drives that spin down automatically, such as in massive arrays of idle disks (MAID), are nearline storage. Removable media such as tape cartridges that can be automatically loaded, as in tape libraries, are nearline storage, while tape cartridges that must be manually loaded are offline storage.

Robotic nearline storage

A large tape library, with tape cartridges placed on shelves in the front, and a robotic arm moving in the back. Visible height of the library is about 180 cm. StorageTek Powderhorn tape library.jpg
A large tape library, with tape cartridges placed on shelves in the front, and a robotic arm moving in the back. Visible height of the library is about 180 cm.

The nearline storage system knows on which volume (cartridge) the data resides, and usually asks a robot to retrieve it from this physical location (usually: a tape library or optical jukebox) and put it into a tape drive or optical disc drive to enable access by bringing the data it contains online. [4] This process is not instant, but it only requires a few seconds. [5]

Nearline tape and optical storage has the advantage of relatively longer lifespans compared to spinning hard drives, simply due to the storage media being idle and usually stored in protected dust-free enclosures when not in use. In a robotic tape loading system, the tape drive used for accessing data experiences the most wear and may need occasional replacement, but the tapes themselves can last for years to decades. If there are sealable access doors between the access mechanism and the media, it is possible for the idle media storage enclosure to survive fire, floods, lightning strikes, and other disasters.

Hard disk drive nearline storage

MAID (massive array of idle drives) systems archive data in an array of hard disk drives, with most drives in a MAID usually stopped. The MAID system spins up each drive on demand when necessary to read (or in some cases to write) data on that drive. For a given amount of storage capacity, MAID systems have higher densities and lower power and cooling requirements than "hot" storage systems that keep all the disks spinning at full speed at all times.

Some hard drive and storage systems vendors and suppliers use the term in reference to low-rotational speed hard drives that are built to be more reliable than generic desktop and laptop computer hard drives. They are intended to be operational continuously for 24 hours a day, seven days a week, possibly for several years.

Nearline hard drives may be used in personal or small business network-attached storage (NAS) systems, or as non-critical moderate-performance data storage on servers, where greater durability is required for the drive to operate continuously.

By comparison, standard hard drives are assumed to only be in operation for a few hours each day, and are not spinning when the computer is either turned off or in sleep mode. Standard hard drives may also use data caching methods that can improve single-drive performance, but would interfere with the operation of multi-drive RAID storage systems, potentially causing data loss or corruption.

Specifically the term nearline hard drive is being used to refer to high-capacity Serial ATA drives that work with Serial Attached SCSI storage devices. Presumably this usage is by analogy to the high-capacity and low-access speed tape systems. [6]

Related Research Articles

<span class="mw-page-title-main">Computer data storage</span> Storage of digital data readable by computers

Computer data storage is a technology consisting of computer components and recording media that are used to retain digital data. It is a core function and fundamental component of computers.

<span class="mw-page-title-main">Disk storage</span> General category of storage mechanisms

Disk storage is a general category of storage mechanisms where data is recorded by various electronic, magnetic, optical, or mechanical changes to a surface layer of one or more rotating disks. A disk drive is a device implementing such a storage mechanism. Notable types are the hard disk drive (HDD) containing a non-removable disk, the floppy disk drive (FDD) and its removable floppy disk, and various optical disc drives (ODD) and associated optical disc media.

<span class="mw-page-title-main">Hard disk drive</span> Data storage device

A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with magnetic material. The platters are paired with magnetic heads, usually arranged on a moving actuator arm, which read and write data to the platter surfaces. Data is accessed in a random-access manner, meaning that individual blocks of data can be stored and retrieved in any order. HDDs are a type of non-volatile storage, retaining stored data when powered off. Modern HDDs are typically in the form of a small rectangular box.

<span class="mw-page-title-main">Tape drive</span>

A tape drive is a data storage device that reads and writes data on a magnetic tape. Magnetic tape data storage is typically used for offline, archival data storage. Tape media generally has a favorable unit cost and a long archival stability.

<span class="mw-page-title-main">Memory hierarchy</span> Computer architecture

In computer architecture, the memory hierarchy separates computer storage into a hierarchy based on response time. Since response time, complexity, and capacity are related, the levels may also be distinguished by their performance and controlling technologies. Memory hierarchy affects performance in computer architectural design, algorithm predictions, and lower level programming constructs involving locality of reference.

In computing, mass storage refers to the storage of large amounts of data in a persisting and machine-readable fashion. In general, the term is used as large in relation to contemporaneous hard disk drives, but it has been used large in relation to primary memory as for example with floppy disks on personal computers.

In information technology, a backup, or data backup is a copy of computer data taken and stored elsewhere so that it may be used to restore the original after a data loss event. The verb form, referring to the process of doing so, is "back up", whereas the noun and adjective form is "backup". Backups can be used to recover data after its loss from data deletion or corruption, or to recover data from an earlier time. Backups provide a simple form of disaster recovery; however not all backup systems are able to reconstitute a computer system or other complex configuration such as a computer cluster, active directory server, or database server.

Non-volatile memory (NVM) or non-volatile storage is a type of computer memory that can retain stored information even after power is removed. In contrast, volatile memory needs constant power in order to retain data.

IBM manufactured magnetic disk storage devices from 1956 to 2003, when it sold its hard disk drive business to Hitachi. Both the hard disk drive (HDD) and floppy disk drive (FDD) were invented by IBM and as such IBM's employees were responsible for many of the innovations in these products and their technologies. The basic mechanical arrangement of hard disk drives has not changed since the IBM 1301. Disk drive performance and characteristics are measured by the same standards now as they were in the 1950s. Few products in history have enjoyed such spectacular declines in cost and physical size along with equally dramatic improvements in capacity and performance.

<span class="mw-page-title-main">Tape library</span> Storage device containing a robot which automatically loads tapes into tape drives

In computer storage, a tape library, sometimes called a tape silo, tape robot or tape jukebox, is a storage device that contains one or more tape drives, a number of slots to hold tape cartridges, a barcode reader to identify tape cartridges and an automated method for loading tapes. Additionally, the area where tapes that are NOT currently in a silo are stored is also called a tape library. Tape libraries can contain millions of tapes.

The IBM 3850 Mass Storage System was an online tape library used to hold large amounts of infrequently accessed data. It was one of the earliest examples of nearline storage.

Hierarchical storage management (HSM), also known as Tiered storage, is a data storage and Data management technique that automatically moves data between high-cost and low-cost storage media. HSM systems exist because high-speed storage devices, such as solid state drive arrays, are more expensive than slower devices, such as hard disk drives, optical discs and magnetic tape drives. While it would be ideal to have all data available on high-speed devices all the time, this is prohibitively expensive for many organizations. Instead, HSM systems store the bulk of the enterprise's data on slower devices, and then copy data to faster disk drives when needed. The HSM system monitors the way data is used and makes best guesses as to which data can safely be moved to slower devices and which data should stay on the fast devices.

<span class="mw-page-title-main">Direct-attached storage</span>

Direct-attached storage (DAS) is digital storage directly attached to the computer accessing it, as opposed to storage accessed over a computer network. DAS consists of one or more storage units such as hard drives, solid-state drives, optical disc drives within an external enclosure. The term "DAS" is a retronym to contrast with storage area network (SAN) and network-attached storage (NAS).

A virtual tape library (VTL) is a data storage virtualization technology used typically for backup and recovery purposes. A VTL presents a storage component as tape libraries or tape drives for use with existing backup software.

In computing, external storage comprises devices that store information outside a computer. Such devices may be permanently attached to the computer, may be removable or may use removable media.

<span class="mw-page-title-main">Disk pack</span> Obsolete form of removable media

Disk packs and disk cartridges were early forms of removable media for computer data storage, introduced in the 1960s.

Magnetic-tape data storage is a system for storing digital information on magnetic tape using digital recording.

<span class="mw-page-title-main">IBM storage</span> Product portfolio of IBM

The IBM Storage product portfolio includes disk, flash, tape, NAS storage products, storage software and services. IBM's approach is to focus on data management.

The most widespread standard for configuring multiple hard disk drives is RAID, which comes in a number of standard configurations and non-standard configurations. Non-RAID drive architectures also exist, and are referred to by acronyms with tongue-in-cheek similarity to RAID:

This glossary of computer hardware terms is a list of definitions of terms and concepts related to computer hardware, i.e. the physical and structural components of computers, architectural issues, and peripheral devices.

References

  1. Inmon, W. H. (2005-10-07). "Chapter 2: The Data Warehouse Environment". Building the Data Warehouse, Fourth Edition. Whiley publishing. p. 33. ISBN   978-0-7645-9944-6.
  2. "Nearline storage" in "A Glossary of Archival and Records Terminology". Retrieved on 2009-01-30.
  3. Venkatramani, Chitra and Tzi-cker Chiueh (1993). "Survey of Near-Line Storage Technologies: Devices and Systems". Experimental Computer Systems Laboratory.
  4. 1 2 3 Pearson, Tony (2010). "Correct use of the term Nearline". IBM Developerworks, Inside System Storage. Retrieved 2015-08-16.
  5. "Hanel storage systems". Wednesday, 12 June 2019
  6. Seagate Technology Paper TP-543 (2005), . Retrieved on 2012-08-28.