Hybrid array

Last updated

A hybrid array is a form of hierarchical storage management that combines hard disk drives (HDDs) with solid-state drives (SSDs) for I/O speed improvements.

Hierarchical storage management (HSM) is a data storage technique that automatically moves data between high-cost and low-cost storage media. HSM systems exist because high-speed storage devices, such as solid state drive arrays, are more expensive than slower devices, such as hard disk drives, optical discs and magnetic tape drives. While it would be ideal to have all data available on high-speed devices all the time, this is prohibitively expensive for many organizations. Instead, HSM systems store the bulk of the enterprise's data on slower devices, and then copy data to faster disk drives when needed. In effect, HSM turns the fast disk drives into caches for the slower mass storage devices. The HSM system monitors the way data is used and makes best guesses as to which data can safely be moved to slower devices and which data should stay on the fast devices.

Hard disk drive Data storage device

A hard disk drive (HDD), hard disk, hard drive, or fixed disk, is an electromechanical data storage device that uses magnetic storage to store and retrieve digital information using one or more rigid rapidly rotating disks (platters) coated with magnetic material. The platters are paired with magnetic heads, usually arranged on a moving actuator arm, which read and write data to the platter surfaces. Data is accessed in a random-access manner, meaning that individual blocks of data can be stored or retrieved in any order and not only sequentially. HDDs are a type of non-volatile storage, retaining stored data even when powered off.

Solid-state drive data storage device

A solid-state drive (SSD) is a solid-state storage device that uses integrated circuit assemblies as memory to store data persistently. It is also sometimes called a solid-state device or a solid-state disk, although SSDs do not have physical disks.

Contents

Hybrid storage arrays aim to mitigate the ever increasing price-performance gap between HDDs and DRAM by adding a non-volatile flash level to the memory hierarchy. [1] Hybrid arrays thus aim to lower the cost per I/O, compared to using only SSDs for storage. Hybrid architectures can be as simple as involving a single SSD cache for desktop or laptop computers, or can be more complex as configurations for data centers and cloud computing.

Memory hierarchy computer architecture that separates storage into a hierarchy based on response time

In computer architecture, the memory hierarchy separates computer storage into a hierarchy based on response time. Since response time, complexity, and capacity are related, the levels may also be distinguished by their performance and controlling technologies. Memory hierarchy affects performance in computer architectural design, algorithm predictions, and lower level programming constructs involving locality of reference.

Cache (computing) computing component that transparently stores data so that future requests for that data can be served faster

In computing, a cache is a hardware or software component that stores data so that future requests for that data can be served faster; the data stored in a cache might be the result of an earlier computation or a copy of data stored elsewhere. A cache hit occurs when the requested data can be found in a cache, while a cache miss occurs when it cannot. Cache hits are served by reading data from the cache, which is faster than recomputing a result or reading from a slower data store; thus, the more requests that can be served from the cache, the faster the system performs.

Data center building or room where computer servers and related equipment are operated

A data center or data centre is a building, dedicated space within a building, or a group of buildings used to house computer systems and associated components, such as telecommunications and storage systems.

Implementations

Some commercial products for building hybrid arrays include:

Adaptec was a computer storage company and remains a brand for computer storage products. The company was an independent firm from 1981 to 2010, at which point it was acquired by PMC-Sierra, which itself was later acquired by Microsemi, which itself was later acquired by Microchip Technology.

Fusion Drive is Apple Inc.'s name for its implementation of a hybrid drive. Apple's implementation combines a hard disk drive with a NAND flash storage and presents it as a single Core Storage managed logical volume with the space of both drives combined.

Linux Family of free and open-source software operating systems based on the Linux kernel

Linux is a family of free and open-source software operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution.

See also

In computing, a hybrid drive is a logical or physical storage device that combines a faster storage medium such as solid-state drive (SSD) with a higher-capacity hard disk drive (HDD). The intent is adding some of the speed of SSDs to the cost-effective storage capacity of traditional HDDs. The purpose of the SSD in a hybrid drive is to act as a cache for the data stored on the HDD, improving the overall performance by keeping copies of the most frequently used data on the faster SSD.

Automated tiered storage is the automated progression or demotion of data across different tiers (types) of storage devices and media. The movement of data takes place in an automated way with the help of a software or embedded firmware and is assigned to the related media according to performance and capacity requirements. More advanced implementations include the ability to define rules and policies that dictate if and when data can be moved between the tiers, and in many cases provides the ability to pin data to tiers permanently or for specific periods of time. Implementations vary, but are classed into two broad categories: pure software based implementations that run on general purpose processors supporting most forms of general purpose storage media and embedded automated tiered storage controlled by firmware as part of a closed embedded storage system such as a SAN disk array. Software Defined Storage architectures commonly include a component of tiered storage as part of their primary functions.

In computer science, the five-minute rule is a rule of thumb for deciding whether a data item should be kept in memory, or stored on disk and read back into memory when required. It was first formulated by Jim Gray and G. F. Putzolu in 1985, and then subsequently revised in 1997 and 2007 to reflect changes in the relative cost and performance of memory and persistent storage.

Related Research Articles

RAID is a data storage virtualization technology that combines multiple physical disk drive components into one or more logical units for the purposes of data redundancy, performance improvement, or both. This was in contrast to the previous concept of highly reliable mainframe disk drives referred to as "single large expensive disk" (SLED).

A disk array controller is a device which manages the physical disk drives and presents them to the computer as logical units. It almost always implements hardware RAID, thus it is sometimes referred to as RAID controller. It also often provides additional disk cache.

Clariion

Clariion is a discontinued SAN disk array manufactured and sold by EMC Corporation, it occupied the entry-level and mid-range of EMC's SAN disk array products. In 2011, EMC introduced the EMC VNX Series, designed to replace both the Clariion and Celerra products.

NetApp company

NetApp, Inc. is a hybrid cloud data services and data management company headquartered in Sunnyvale, California. It has ranked in the Fortune 500 since 2012. Founded in 1992 with an IPO in 1995, NetApp offers hybrid cloud data services for management of applications and data across cloud and on-premises environments.

Input/output operations per second is an input/output performance measurement used to characterize computer storage devices like hard disk drives (HDD), solid state drives (SSD), and storage area networks (SAN). Like benchmarks, IOPS numbers published by storage device manufacturers do not directly relate to real-world application performance.

Is a computer storage product by NetApp running ONTAP operation system; terms ONTAP, AFF, FAS often used as synonyms, also filer is a synonyms though it is not an official name. There are two types of FAS systems, Hybrid and All-Flash:

  1. NetApp proprietary custom-build hardware appliances with HDD or SSD drives called hybrid Fabric-Attached Storage
  2. NetApp proprietary custom-build hardware appliances with only SSD drives and optimized ONTAP for low latency called ALL-Flash FAS.

The IBM Storage product portfolio includes disk, flash, tape, NAS storage products, storage software and services. IBM’s approach is to focus on data management.

Fusion-io American technology company

Fusion-io, Inc. was a computer hardware and software systems company based in Cottonwood Heights, Utah, that designed and manufactured products using flash memory technology. The Fusion ioMemory was marketed for applications such as databases, virtualization, cloud computing, big data. Their ioDrive product was considered around 2011 to be one of the fastest storage devices on the market.

SandForce was an American fabless semiconductor company based in Milpitas, California, that designed and manufactured flash memory controllers for solid-state drives (SSDs). On January 4, 2012, SandForce was acquired by LSI Corporation and became the Flash Components Division of LSI. LSI was subsequently acquired by Avago Technologies on May 6, 2014 and on the 29th of that same month Seagate Technology announced its intention to buy LSI's Flash Components Division.

bcache is a cache in the Linux kernel's block layer, which is used for accessing secondary storage devices. It allows one or more fast storage devices, such as flash-based solid-state drives (SSDs), to act as a cache for one or more slower storage devices, such as hard disk drives (HDDs); this effectively creates hybrid volumes and provides performance improvements.

StorTrends is a brand name of disk-based, hybrid array, and solid state storage products for computer data storage in data centers, sold by American Megatrends. StorTrends appliances utilize the iTX architecture, which includes features such as deduplication and compression, SSD caching and SSD tiering, automated tiered storage, replication, data archiving, snapshots, WAN optimization, and a VMware vSphere plug-in.

ExpressCache is a Windows-based SSD caching technology developed by Condusiv Technologies and licensed to a number of laptop manufacturers including Acer, ASUS, Samsung, Sony, Lenovo, and Fujitsu. ExpressCache is also bundled with some SanDisk products such as ReadyCache; SanDisk currently holds an exclusive ExpressCache license for stand-alone storage products.

Enterprise Storage OS, also known as ESOS, is a GNU General Public License (GPL) licensed Linux distribution that serves as a block-level storage server in a storage area network (SAN). ESOS is composed of open-source software projects that are required for a Linux distribution and several proprietary build and install time options. The SCST project is the core component of ESOS; it provides the back-end storage functionality.

Dell EMC ScaleIO is a software-defined storage product from Dell EMC that creates a server-based storage area network (SAN) from local application server storage using existing customer hardware or EMC servers. It converts direct-attached storage into shared block storage. ScaleIO is sold as software-only or preinstalled on commodity hardware. EMC promotes its ScaleIO server storage-area network software as a way to converge computing resources and commodity storage into a "single-layer architecture."

Dell EMC Unity is one of Dell EMC’s mid-range storage array product lines. It was designed from the ground up as the next generation midrange unified storage array after the EMC VNX and VNXe series, which evolved out of the EMC Clariion SAN disk array.

References

  1. Rino Micheloni; Alessia Marelli; Kam Eshghi (2012). Inside Solid State Drives (SSDs). Springer. p. 62. ISBN   978-94-007-5145-3.
  2. Charlie Demerjian (September 9, 2009). "Adaptec's MaxIQ caches RAIDs with SSDs". SemiAccurate. Retrieved October 10, 2016.
  3. Larry Dignan (February 5, 2012). "EMC unveils VFCache, targets Fusion-io". Between the Lines. ZDNet. Retrieved October 10, 2016.
  4. Chris Mellor (March 5, 2013). "One day later: EMC declares war on all-flash array, server flash card rivals: Rolls out XtremIO array, renamed VFCache". The Register. Retrieved October 10, 2016.
  5. 1 2 "Fusion-io spins up ioTurbine, enhances server flash caching". The Register. Retrieved October 10, 2016.
  6. "Fusion-io buys NexGen". theregister.co.uk. Retrieved 2015-03-26.
  7. "Big storage turns the tide in the hybrid flash array market", Computer Weekly , September 2013, retrieved 2015-03-26
  8. The SSD Guy (2013-08-20). "IBM Adds Server-Side Caching". The SSD Guy. Retrieved 2013-12-23.
  9. "LSI MegaRAID CacheCade Pro 2.0 Review". storagereview.com. Retrieved 2015-03-26.
  10. "Hands-on with the Marvell HyperDuo hybrid storage controller". CNET. CBS Interactive. Retrieved 2015-03-26.
  11. "NetApp: Flash as a STORAGE tier? You must be joking". theregister.co.uk. Retrieved 2015-03-26.
  12. Chris Mellor (September 30, 2014). "Oracle crashes all-flash bash: Behold, our hybrid FS1 arrays: Mutant flash/disk box a pillar of storage: It's axiomatic". The Register. Retrieved October 10, 2016.
  13. Les Tokar (December 4, 2011). "NVELO Dataplex SSD Caching Software Review - Seven mSATA SSDs Prove An Amazing Concept". The SSD Review. Retrieved October 10, 2016.
  14. Kristian Vättö (December 16, 2012). "Samsung Acquires SSD Caching Company NVELO". AnandTech. Retrieved October 10, 2016.
  15. Ian Barker (2014-01-27). "AMI StorTrends 3500i offers high performance storage for smaller enterprises". BetaNews. Retrieved 2014-10-17.
  16. "Mutant array upstarts feast on EMC, NetApp's leavings". The Register. Retrieved 2015-03-26.
  17. "Oracle's Flash-Friendly Sun ZFS Storage Is Ready for New SPARCs". enterprisestorageforum.com. 3 April 2013. Retrieved 2015-03-26.