Fusion Drive

Last updated

Fusion Drive is a type of hybrid drive technology created by Apple Inc. It combines a hard disk drive with a NAND flash storage (solid-state drive of 24 GB or more) [1] and presents it as a single Core Storage managed logical volume with the space of both drives combined. [2]

Contents

The operating system automatically manages the contents of the drive so the most frequently accessed files are stored on the faster flash storage, while infrequently used items move to or stay on the hard drive. [3] For example, if spreadsheet software is used often, the software will be moved to the flash storage for faster user access. In software, this logical volume speeds up performance of the computer by performing both caching for faster writes and auto tiering for faster reads.

Availability

The Fusion Drive was announced as part of an Apple event held on October 23, 2012, with the first supporting products being two desktops: the iMac and Mac Mini with OS X Mountain Lion released in late 2012. [3] Fusion Drive remains available in subsequent models of these computers, but was not expanded to other Apple devices: the latest MacBook and Mac Pro models use exclusively flash storage, and while this was an optional upgrade for the mid-2012 non-Retina MacBook Pro discontinued by Apple, it will replace the standard hard disk drive instead of complementing it in the fashion of Fusion Drive. As of November 2021, no Mac offers a fusion drive.

Release dateHDD storageFlash storage
Mac MiniLate 20121 TB128 GB
Late 2014
iMac
(all models)
Late 2012
Late 2013
2014
iMac
(27-inch non-Retina)
Late 20123 TB
Late 2013
iMac
(27-inch Retina)
Late 2014
Mid-2015
iMacLate 20151 TB24 GB
2 TB128 GB
Mid 20171 TB32 GB
2 TB128 GB
3 TB
Early 20191 TB32 GB
2 TB128 GB
3 TB
iMac
(21.5-inch)
Late 20201 TB32 GB

Design

Apple's Fusion Drive design incorporates proprietary features with limited documentation. It has been reported that the design of Fusion Drive has been influenced by a research project called Hystor. [4] According to the paper, [5] this hybrid storage system unifies a high-speed SSD and a large-capacity hard drive with several design considerations of which one has been used in the Fusion Drive.

  1. The SSD and the hard drive are logically merged into a single block device managed by the operating system, which is independent of file systems and requires no changes to applications.
  2. A portion of SSD space is used as a write-back buffer to absorb incoming write traffic, which hides perceivable latencies and boosts write performance.
  3. More frequently accessed data is stored on the SSD and the larger, less frequently accessed data stored on the HDD.
  4. Data movement is based on access patterns: if data has been on the HDD and suddenly becomes frequently accessed, it will usually get moved to the SSD by the program controlling the Fusion Drive. During idle periods, data is adaptively migrated to the most suitable device to provide sustained data processing performance for users.

Several experimental studies [3] [6] [7] [8] [ unreliable source? ] [9] [ unreliable source? ] [10] [ unreliable source? ] have been conducted to speculate about the internal mechanism of Fusion Drive. A number of speculations are available but not completely confirmed.

  1. Fusion Drive is a block-level solution based on Apple's Core Storage, a logical volume manager managing multiple physical devices. [6] [7] The capacity of a Fusion Drive is confirmed to be the sum of two devices. [6] [7] Fusion Drive is file system agnostic and effective for both HFS Plus and ZFS. [8]
  2. Part of the SSD space is used as a write buffer for incoming writes. [6] [7] In the stable state, a minimum 4 GB space is reserved for buffering writes. [3] [6] [7] A small spare area is set aside on the SSD for performance consistency. [7]
  3. Data is promoted to the SSD based on its access frequency. [6] [7] The frequency is detected at the block level [9] and below file system memory cache. [10] Data migration happens in 128 KB chunks during idle or light I/O periods. [6] [7]
  4. Operating system and other critical documents are always cached on the SSD. [6] Applications are likely to be handled similarly. [7] A regular file can reside on both devices. [9]

See also

Related Research Articles

<span class="mw-page-title-main">Computer data storage</span> Storage of digital data readable by computers

Computer data storage is a technology consisting of computer components and recording media that are used to retain digital data. It is a core function and fundamental component of computers.

<span class="mw-page-title-main">Hard disk drive</span> Electro-mechanical data storage device

A hard disk drive (HDD), hard disk, hard drive, or fixed disk, is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with magnetic material. The platters are paired with magnetic heads, usually arranged on a moving actuator arm, which read and write data to the platter surfaces. Data is accessed in a random-access manner, meaning that individual blocks of data can be stored and retrieved in any order. HDDs are a type of non-volatile storage, retaining stored data when powered off. Modern HDDs are typically in the form of a small rectangular box.

RAID is a data storage virtualization technology that combines multiple physical disk drive components into one or more logical units for the purposes of data redundancy, performance improvement, or both. This is in contrast to the previous concept of highly reliable mainframe disk drives referred to as "single large expensive disk" (SLED).

<span class="mw-page-title-main">Western Digital</span> American digital storage company

Western Digital Corporation is an American computer drive manufacturer and data storage company, headquartered in San Jose, California. It designs, manufactures and sells data technology products, including data storage devices, data center systems and cloud storage services.

Wear leveling is a technique for prolonging the service life of some kinds of erasable computer storage media, such as flash memory, which is used in solid-state drives (SSDs) and USB flash drives, and phase-change memory. There are several wear leveling mechanisms that provide varying levels of longevity enhancement in such memory systems.

In computing, data recovery is a process of retrieving deleted, inaccessible, lost, corrupted, damaged, or formatted data from secondary storage, removable media or files, when the data stored in them cannot be accessed in a usual way. The data is most often salvaged from storage media such as internal or external hard disk drives (HDDs), solid-state drives (SSDs), USB flash drives, magnetic tapes, CDs, DVDs, RAID subsystems, and other electronic devices. Recovery may be required due to physical damage to the storage devices or logical damage to the file system that prevents it from being mounted by the host operating system (OS).

Input/output operations per second is an input/output performance measurement used to characterize computer storage devices like hard disk drives (HDD), solid state drives (SSD), and storage area networks (SAN). Like benchmarks, IOPS numbers published by storage device manufacturers do not directly relate to real-world application performance.

In computing, a hybrid drive is a logical or physical storage device that combines a faster storage medium such as solid-state drive (SSD) with a higher-capacity hard disk drive (HDD). The intent is adding some of the speed of SSDs to the cost-effective storage capacity of traditional HDDs. The purpose of the SSD in a hybrid drive is to act as a cache for the data stored on the HDD, improving the overall performance by keeping copies of the most frequently used data on the faster SSD drive.

<span class="mw-page-title-main">ReadyBoost</span> Disk caching component of Microsoft Windows

ReadyBoost is a disk caching software component developed by Microsoft for Windows Vista and included in later versions of Windows. ReadyBoost enables NAND memory mass storage CompactFlash, SD card, and USB flash drive devices to be used as a cache between the hard drive and random access memory in an effort to increase computing performance. ReadyBoost relies on the SuperFetch and also adjusts its cache based on user activity. ReadyDrive for hybrid drives is implemented in a manner similar to ReadyBoost.

<span class="mw-page-title-main">Solid-state drive</span> Data storage device

A solid-state drive (SSD) is a solid-state storage device that uses integrated circuit assemblies to store data persistently, typically using flash memory, and functioning as secondary storage in the hierarchy of computer storage. It is also sometimes called a semiconductor storage device, a solid-state device or a solid-state disk, even though SSDs lack the physical spinning disks and movable read–write heads used in hard disk drives (HDDs) and floppy disks. SSD also has rich internal parallelism for data processing.

<span class="mw-page-title-main">Disk buffer</span>

In computer storage, disk buffer is the embedded memory in a hard disk drive (HDD) or solid state drive (SSD) acting as a buffer between the rest of the computer and the physical hard disk platter or flash memory that is used for storage. Modern hard disk drives come with 8 to 256 MiB of such memory, and solid-state drives come with up to 4 GB of cache memory.

A trim command allows an operating system to inform a solid-state drive (SSD) which blocks of data are no longer considered to be 'in use' and therefore can be erased internally.

A hybrid array is a form of hierarchical storage management that combines hard disk drives (HDDs) with solid-state drives (SSDs) for I/O speed improvements.

In computer data storage, Smart Response Technology is a proprietary caching mechanism introduced in 2011 by Intel for their Z68 chipset, which allows a SATA solid-state drive (SSD) to function as cache for a hard disk drive (HDD).

Higher performance in hard disk drives comes from devices which have better performance characteristics. These performance characteristics can be grouped into two categories: access time and data transfer time.

bcache is a cache in the Linux kernel's block layer, which is used for accessing secondary storage devices. It allows one or more fast storage devices, such as flash-based solid-state drives (SSDs), to act as a cache for one or more slower storage devices, such as hard disk drives (HDDs); this effectively creates hybrid volumes and provides performance improvements.

dm-cache is a component of the Linux kernel's device mapper, which is a framework for mapping block devices onto higher-level virtual block devices. It allows one or more fast storage devices, such as flash-based solid-state drives (SSDs), to act as a cache for one or more slower storage devices such as hard disk drives (HDDs); this effectively creates hybrid volumes and provides secondary storage performance improvements.

Apple File System (APFS) is a proprietary file system developed and deployed by Apple Inc. for macOS Sierra (10.12.4) and later, iOS 10.3 and later, tvOS 10.2 and later, watchOS 3.2 and later, and all versions of iPadOS. It aims to fix core problems of HFS+, APFS's predecessor on these operating systems. APFS is optimized for solid-state drive storage and supports encryption, snapshots, and increased data integrity, among other capabilities.

<span class="mw-page-title-main">ZFS</span> File system

ZFS is a file system with volume management capabilities. It began as part of the Sun Microsystems Solaris operating system in 2001. Large parts of Solaris – including ZFS – were published under an open source license as OpenSolaris for around 5 years from 2005, before being placed under a closed source license when Oracle Corporation acquired Sun in 2009–2010. During 2005 to 2010, the open source version of ZFS was ported to Linux, Mac OS X and FreeBSD. In 2010, the illumos project forked a recent version of OpenSolaris, to continue its development as an open source project, including ZFS. In 2013, OpenZFS was founded to coordinate the development of open source ZFS. OpenZFS maintains and manages the core ZFS code, while organizations using ZFS maintain the specific code and validation processes required for ZFS to integrate within their systems. OpenZFS is widely used in Unix-like systems.

References

  1. Dominguez, Alberto (3 January 2019). "The best desktop computers of 2018". Pandora FMS . Archived from the original on 3 January 2019. Retrieved 3 January 2019.
  2. Hutchinson, Lee (October 23, 2012). "Apple Fusion Drive—wait, what? How does this work?". Ars Technica. Condé Nast. Retrieved October 25, 2012.
  3. 1 2 3 4 Shimpi, Anand Lal (October 24, 2012). "Understanding Apple's Fusion Drive". AnandTech. Retrieved October 25, 2012.
  4. "Computer Science Research at Ohio State Makes Impact in Apple's Hybrid Storage Product". www.cse.ohio-state.edu. 2013-04-08. Archived from the original on 2014-10-18. Retrieved 2014-10-12.
  5. Feng Chen, David A. Koufaty and Xiaodong Zhang (2011). Hystor: making the best use of solid state drives in high performance storage systems. International Conference on Supercomputing (ICS '11). pp. 22–23. doi:10.1145/1995896.1995902.
  6. 1 2 3 4 5 6 7 8 "Achieving fusion—with a service training doc, Ars tears open Apple's Fusion Drive". www.arstechnica.com. 2012-11-05.
  7. 1 2 3 4 5 6 7 8 9 "A Month with Apple's Fusion Drive". www.anandtech.com. 2013-01-18.
  8. 1 2 "Fusion Drive - loose ends". jolly.jinx.de/. 2012-10-31.
  9. 1 2 3 "More on BYO Fusion drive". jolly.jinx.de/. 2012-10-31.
  10. 1 2 "Fusion Drive last words". jolly.jinx.de/. 2012-11-04.