Virtual tape library

Last updated

A virtual tape library (VTL) is a data storage virtualization technology used typically for backup and recovery purposes. A VTL presents a storage component (usually hard disk storage) as tape libraries or tape drives for use with existing backup software.

Contents

Virtualizing the disk storage as tape allows integration of VTLs with existing backup software and existing backup and recovery processes and policies. The benefits of such virtualization include storage consolidation and faster data restore processes. For most mainframe data centers, the storage capacity varies, however protecting its business and mission critical data is always vital.

Most current VTL solutions use SAS or SATA disk arrays as the primary storage component due to their relatively low cost. The use of array enclosures increases the scalability of the solution by allowing the addition of more disk drives and enclosures to increase the storage capacity.

The shift to VTL also eliminates streaming problems that often impair efficiency in tape drives as disk technology does not rely on streaming and can write effectively regardless of data transfer speeds.

By backing up data to disks instead of tapes, VTL often increases performance of both backup and recovery operations. Restore processes are found to be faster than backup regardless of implementations. In some cases, the data stored on the VTL's disk array is exported to other media, such as physical tapes, for disaster recovery purposes (scheme called disk-to-disk-to-tape, or D2D2T).

Alternatively, most contemporary backup software products introduced also direct usage of the file system storage (especially network-attached storage, accessed through NFS and CIFS protocols over IP networks) not requiring a tape library emulation at all. They also often offer a disk staging feature: moving the data from disk to a physical tape for a long-term storage.

While a virtual tape library is very fast, the disk storage within is not designed to be removable, and does not usually involve physically removable external disk drives to be used for data archiving in place of tape. Since the disk storage is always connected to power and data sources and is never physically electrically isolated, it is vulnerable to potential damage and corruption due to nearby building or power grid lightning strikes.

History

The first VTL solution was introduced by Cybernetics in 1992 under the name HSTC (high speed tape cache). [1] Later, IBM released a Virtual Tape Server (VTS) introduced in 1997. It was targeted for a mainframe market, where many legacy applications tend to use a lot of very short tape volumes. It used the ESCON interface, and acted as a disk cache for the IBM 3494 tape library. A competitive offering from StorageTek (acquired in 2005 by Sun Microsystems, then subsequently by Oracle Corporation) was known as Virtual Storage Manager (VSM) which leveraged the market dominant STK Powderhorn library as a back store. Each product line has been enhanced to support larger disk buffer capacities, FICON, and more recently (c. 2010) "tapeless" disk-only environments.

Other offerings in the mainframe space are also "tapeless". DLm has been developed by EMC Corporation, while Luminex has gained popularity and wide acceptance by teaming with Data Domain to provide the benefits of data deduplication behind its Channel Gateway platform. With the consequent reduction in off-site replication bandwidth afforded by deduplication, it is possible and practical for this form of virtual tape to reduce recovery point objective time and recovery time objective to near zero (or instantaneous).

Outside of the mainframe environment, tape drives and libraries mostly featured SCSI. Likewise, VTLs were developed supporting popular SCSI transport protocols such as SPI (legacy systems), Fibre Channel, and iSCSI.

The FalconStor VTL is the foundation of nearly half of the products sold in the VTL market, according to an Enterprise Strategy Group analyst. [2]

In mid-2010s VTLs got a rebirth thanks to hi-capacity "archive" drives from Seagate and HGST and more popular "tape in cloud" and Disk-to-Disk-to-Tape (often in cloud) scenarios. [3]

Amazon and StarWind Software in partnership with Veeam, BackBlaze and Wasabi Technologies offer a so-called gateway products that facilitates backing up and archiving "on premises" data as virtual tapes stored in AWS, Microsoft Azure, Wasabi Technologies and BackBlaze public clouds. [4] [5] [6] The idea is to provide a seamless integration of a backup applications incompatible with the APIs object storages expose. Say, at the time Veeam couldn't do AWS S3 and can't backup to the deep archive tier within Azure still. [7]

See also

Related Research Articles

Internet Small Computer Systems Interface or iSCSI is an Internet Protocol-based storage networking standard for linking data storage facilities. iSCSI provides block-level access to storage devices by carrying SCSI commands over a TCP/IP network. iSCSI facilitates data transfers over intranets and to manage storage over long distances. It can be used to transmit data over local area networks (LANs), wide area networks (WANs), or the Internet and can enable location-independent data storage and retrieval.

Quantum Corporation is a data storage, management, and protection company that provides technology to store, manage, archive, and protect video and unstructured data throughout the data life cycle. Their products are used by enterprises, media and entertainment companies, government agencies, big data companies, and life science organizations. Quantum is headquartered in San Jose, California and has offices around the world, supporting customers globally in addition to working with a network of distributors, VARs, DMRs, OEMs and other suppliers.

In information technology, a backup, or data backup is a copy of computer data taken and stored elsewhere so that it may be used to restore the original after a data loss event. The verb form, referring to the process of doing so, is "back up", whereas the noun and adjective form is "backup". Backups can be used to recover data after its loss from data deletion or corruption, or to recover data from an earlier time. Backups provide a simple form of IT disaster recovery; however not all backup systems are able to reconstitute a computer system or other complex configuration such as a computer cluster, active directory server, or database server.

NetApp, Inc. is an American data infrastructure company that provides unified data storage, integrated data services, and cloud operations (CloudOps) solutions to enterprise customers. The company is based in San Jose, California. It has ranked in the Fortune 500 from 2012 to 2021. Founded in 1992 with an initial public offering in 1995, NetApp offers cloud data services for management of applications and data both online and physically.

FICON is the IBM proprietary name for the ANSI FC-SB-3 Single-Byte Command Code Sets-3 Mapping Protocol for Fibre Channel (FC) protocol. It is a FC layer 4 protocol used to map both IBM's antecedent channel-to-control-unit cabling infrastructure and protocol onto standard FC services and infrastructure. The topology is fabric utilizing FC switches or directors. Valid rates include 1, 2, 4, 8, 16, and 32 Gigabit per second data rates at distances up to 100 km.

Hierarchical storage management (HSM), also known as tiered storage, is a data storage and data management technique that automatically moves data between high-cost and low-cost storage media. HSM systems exist because high-speed storage devices, such as solid-state drive arrays, are more expensive than slower devices, such as hard disk drives, optical discs and magnetic tape drives. While it would be ideal to have all data available on high-speed devices all the time, this is prohibitively expensive for many organizations. Instead, HSM systems store the bulk of the enterprise's data on slower devices, and then copy data to faster disk drives when needed. The HSM system monitors the way data is used and makes best guesses as to which data can safely be moved to slower devices and which data should stay on the fast devices.

<span class="mw-page-title-main">StorageTek</span> Data storage company

Storage Technology Corporation was a data storage technology company headquartered in Louisville, Colorado. New products include data retention systems, which it calls "information lifecycle management" (ILM).

Veritas Backup Exec is a data protection software product designed for customers with mixed physical and virtual environments, and who are moving to public cloud services. Supported platforms include VMware and Hyper-V virtualization, Windows and Linux operating systems, Amazon S3, Microsoft Azure and Google Cloud Storage, among others. All management and configuration operations are performed with a single user interface. Backup Exec also provides integrated deduplication, replication, and disaster recovery capabilities and helps to manage multiple backup servers or multi-drive tape loaders.

<span class="mw-page-title-main">Overland Storage</span>

Overland Storage Inc. is a wholly owned subsidiary of Sphere 3D Corp. It has acquired Tandberg Data shortly before being acquired by Sphere 3D itself. The two subsidiaries were later rebranded under the common Overland-Tandberg brand.

<span class="mw-page-title-main">IBM storage</span> Product portfolio of IBM

The IBM Storage product portfolio includes disk, flash, tape, NAS storage products, storage software and services. IBM's approach is to focus on data management.

Catalogic DPX is an enterprise-level data protection tool that backs up and restores data and applications for a variety of operating systems. It has data protection, disaster recovery and business continuity planning capabilities. Catalogic DPX protects physical servers or virtual machines on VMWare vSphere and Microsoft Hyper-V hypervisors, supports many database applications, including Oracle, SQL Server, SharePoint, Exchange, and SAP HANA. DPX supports agent-based or agent-less backups. Users can map to and use a backed up version of the database if something goes wrong with the primary version. DPX is managed from a single console and catalog. This allows for centralized control of both tape-based and disk-based data protection jobs across heterogeneous operating systems. DPX can protect data centers, remote sites and supports recovery from DR. DPX can protect data to disk, tape or cloud. It is used for various recovery use cases including file, application, BMR, VM or DR. DPX can spin up VMs from backup images, recover physical servers, bring up applications online from snapshot based backups, it can be used to recover from Ransomware.

<span class="mw-page-title-main">Storage area network</span> Network which provides access to consolidated, block-level data storage

A storage area network (SAN) or storage network is a computer network which provides access to consolidated, block-level data storage. SANs are primarily used to access data storage devices, such as disk arrays and tape libraries from servers so that the devices appear to the operating system as direct-attached storage. A SAN typically is a dedicated network of storage devices not accessible through the local area network (LAN).

<span class="mw-page-title-main">Luminex Software</span> American software company

Luminex Software, Inc. is a developer and provider of mainframe connectivity, storage and data protection solutions, including virtual tape and data integration products.

<span class="mw-page-title-main">StorSimple</span>

StorSimple was a privately held company based in Santa Clara, California, marketing cloud storage. It was funded by venture capital from Index Ventures, Redpoint Ventures, Ignition Partners, and Mayfield Fund for a total of $31.5 million.

Cofio Software, headquartered in San Diego, California, was a privately held software company founded in 2006 which produced a product called AIMstor. After being acquired in 2012 the product became known as the Hitachi Data Instance Director. and later became Hitachi Ops Center Protector

NetVault is a set of data protection software developed and supported by Quest Software. NetVault Backup is a backup and recovery software product. It can be used to protect data and software applications in physical and virtual environments from one central management interface. It supports many servers, application platforms, and protocols such as UNIX, Linux, Microsoft Windows, VMware, Microsoft Hyper-V, Oracle, Sybase, Microsoft SQL Server, NDMP, Oracle ACSLS, IBM DAS/ACI, Microsoft Exchange Server, DB2, and Teradata.

Disk-based backup refers to technology that allows one to back up large amounts of data to a disk storage unit. It is often supplemented by tape drives for data archival or replication to another facility for disaster recovery. Backup-to-disk is a popular in enterprise use for both technical and business reasons. Storage devices have gotten faster access time and higher storage capacity. There are different forms of disks used for back up, standard mechanical disks and solid state disks.

Software-defined storage (SDS) is a marketing term for computer data storage software for policy-based provisioning and management of data storage independent of the underlying hardware. Software-defined storage typically includes a form of storage virtualization to separate the storage hardware from the software that manages it. The software enabling a software-defined storage environment may also provide policy management for features such as data deduplication, replication, thin provisioning, snapshots and backup.

Veeam Software is a privately held US-based information technology company owned by Insight Partners. It develops backup, disaster recovery and modern data protection software for virtual, cloud-native, SaaS, Kubernetes and physical workloads. Veeam Software was co-founded by two Russian entrepreneurs, Ratmir Timashev and Andrei Baronov. While Veeam's start was built on protecting data across virtualized workloads, it has significantly expanded to protect data across a wide variety of platforms from AWS, Azure, Google Cloud, Microsoft 365, Kubernetes, etc. Veeam's current CEO, Anand Eswaran, has been pushing Veeam's strategy to accelerate share in the enterprise with adding several layers to Veeam's partnerships. Veeam took over the #1 market share in the data protection category in the second half of 2022. The company headquarters is in Kirkland, Washington, United States.

<span class="mw-page-title-main">Veeam Backup & Replication</span> Backup and disaster recovery software

Veeam Backup & Replication is a proprietary backup app developed by Veeam for virtual environments built on VMware vSphere, Nutanix AHV, and Microsoft Hyper-V hypervisors. The software provides backup, restore and replication functionality for virtual machines, physical servers and workstations as well as cloud-based workload.

References

  1. "History of VTL/IBM".
  2. "InfoStor ESG Report on FalconStor Virtual Tape Library".
  3. "The Rise, Fall, and Rise, of Virtual Tape Libraries".
  4. "Integration of AWS Storage Gateway with Veeam – Backups and backup copy in Cloud". orgedelacruz.uk. Retrieved 6 September 2017.
  5. "Setting Up a Veeam to StarWind Virtual Tape Library Configuration". mpecsinc.com. Retrieved 3 March 2020.
  6. "Archive backups with Veeam and StarWind Virtual Tape Library". tech-coffee.net. Retrieved 20 April 2018.
  7. "Complete the Backup Lifecycle with Veeam's SOBR Archive Tier". Veeam. Retrieved 4 March 2021.