This article needs to be updated.(September 2019) |
Design firm | Advanced Micro Devices |
---|---|
Introduced | 2016 |
Type | Servers |
AMD Radeon Instinct is AMD's brand of deep learning oriented GPUs. [1] [2] It replaced AMD's FirePro S brand in 2016. Compared to the Radeon brand of mainstream consumer/gamer products, the Radeon Instinct branded products are intended to accelerate deep learning, artificial neural network, and high-performance computing/GPGPU applications.
The Radeon Instinct product line directly competes with Nvidia's Tesla and Intel Xeon Phi lines of deep learning and GPGPU cards.
Since MI100 introduction in November 2020, the Radeon Instinct family is known as AMD Instinct, dropping Radeon brand from its name.
See also: List of AMD graphics processing units.
The three initial Radeon Instinct products were announced in December 2016, with each based on a different architecture.
The MI6 is a passively cooled, Polaris 10 based card with 16 GB of GDDR5 memory and with a <150 W TDP. [1] [2] At 5.7 TFLOPS (FP16 and FP32), the MI6 is expected to be used primarily for inference, rather than neural network training. The MI6 has a peak double precision (FP64) compute performance of 358 GFLOPS. [3]
The MI8 is a Fiji based card, analogous to the R9 Nano, and expected to have a <175W TDP. [1] The MI8 has 4 GB of High Bandwidth Memory. At 8.2 TFLOPS (FP16 and FP32), the MI8 is marked toward inference. The MI8 has a peak (FP64) double precision compute performance 512 GFLOPS. [4]
The MI25 is a Vega based card, utilizing HBM2 memory. The MI25 performance is expected to be 12.3 TFLOPS using FP32 numbers. In contrast to the MI6 and MI8, the MI25 is able to increase performance when using lower precision numbers, and accordingly is expected to reach 24.6 TFLOPS when using FP16 numbers. The MI25 is rated at <300W TDP with passive cooling. The MI25 also provides 768 GFLOPS peak double precision (FP64) at 1/16th rate. [5]
The MI6, MI8, and MI25 products all support AMD's MxGPU virtualization technology, enabling sharing of GPU resources across multiple users. [1] [6]
MIOpen is AMD's deep learning library to enable GPU acceleration of deep learning. [1] Much of this extends the GPUOpen's Boltzmann Initiative software. [6] This is intended to compete with the deep learning portions of Nvidia's CUDA library. It supports the deep learning frameworks: Theano, Caffe, TensorFlow, MXNet, The Microsoft Cognitive Toolkit, Torch, and Chainer. Programming is supported in OpenCL and Python, in addition to supporting the compilation of CUDA through AMD's Heterogeneous-compute Interface for Portability and Heterogeneous Compute Compiler.
Model (codename) | Release date | Architecture & Fab | Transistors & Die Size | Core | Fillrate [lower-alpha 1] [lower-alpha 2] [lower-alpha 3] | Processing power [lower-alpha 1] [lower-alpha 4] (GFLOPS) | Memory | TBP | Bus interface | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Config [lower-alpha 5] | Clock [lower-alpha 1] (MHz) | Texture (GT/s) | Pixel (GP/s) | Half | Single | Double | Bus type & width | Size (GiB) | Clock (MT/s) | Bandwidth (GB/s) | ||||||
Radeon Instinct MI6 (Polaris 10) [1] [7] [8] [9] [10] | December 2016 | GCN 4th gen 14 nm | 5.7×109 232 mm2 | 2304:144:32 36 CU | 1120 1233 | 177.6 | 39.46 | 5800 | 5800 | 358 | GDDR5 256-bit | 16 | 7000 | 224 | 150 W | PCIe 3.0 x16 |
Radeon Instinct MI8 (Fiji XT) [1] [7] [8] [11] [12] | GCN 3rd gen 28 nm | 8.9×109 596 mm2 | 4096:256:64 64 CU | 1000 | 256.0 | 64.0 | 8200 | 8200 | 512 | HBM 4096-bit | 4 | 1000 | 512 | 175 W | ||
Radeon Instinct MI25 (Vega 10 XT) [1] [7] [8] [13] [14] [15] | GCN 5th gen 14 nm | 12.5×109 510 mm2 | 4096:256:64 64 CU | 1400 1500 | 384 | 96.0 | 24600 | 12300 | 768 | HBM2 2048-bit | 16 | 1704 | 436.2 | 300 W | ||
Radeon Instinct MI50 (Vega 20 GL) [16] [17] [18] [19] | November 2018 | GCN 5th gen 7 nm | 13.2×109 331 mm2 | 3840:240:- 60 CU | 1450 1725 | 348 414 | - | 26500 | 13300 | 6600 | HBM2 4096-bit | 16 or 32 [20] | 2000 | 1024 | PCIe 4.0 x16 | |
Radeon Instinct MI60 (Vega 20 GL) [16] [21] [22] | 4096:256:- 64 CU | 1500 1800 | 384 460.8 | - | 29450 | 14725 | 7362.5 | 32 | 1024 | |||||||
AMD Instinct MI100 (MI100 XL) [23] | November 12, 2020 | CDNA 1.0 7 nm | ? 750 mm2 | 7680:480:- 120 CU | 1000 1502 | 480 720 | - | 184600 | 23100 | 11500 | 2400 | 1228.8 | ||||
AMD Instinct MI210 (?) [24] | December, 2021 | CDNA 2.0 6 nm | 58 x 109 ? mm2 | 6656:416:- 104 CU | 1000 1700 | ? ? | - | 181000 | 22630 | 22630 | HBM2e 4096-bit | 64 | 1600 | 1638 | 300 W | |
AMD Instinct MI250 (?) [25] | November 8, 2021 | 13312:832:- 208 CU | 1000 1700 | ? ? | - | 362100 | 45300 | 45300 | HBM2e 8192-bit | 128 | 3276.8 | 500 W 560 W (Peak) | ||||
AMD Instinct MI250X (?) [26] | 14080:880:- 220 CU | 1000 1700 | ? ? | - | 383000 | 47900 | 47900 |
In computing, floating point operations per second is a measure of computer performance, useful in fields of scientific computations that require floating-point calculations. For such cases, it is a more accurate measure than measuring instructions per second.
A graphics processing unit (GPU) is a specialized electronic circuit designed to rapidly manipulate and alter memory to accelerate the creation of images in a frame buffer intended for output to a display device. GPUs are used in embedded systems, mobile phones, personal computers, workstations, and game consoles.
General-purpose computing on graphics processing units is the use of a graphics processing unit (GPU), which typically handles computation only for computer graphics, to perform computation in applications traditionally handled by the central processing unit (CPU). The use of multiple video cards in one computer, or large numbers of graphics chips, further parallelizes the already parallel nature of graphics processing.
AMD FirePro was AMD's brand of graphics cards designed for use in workstations and servers running professional Computer-aided design (CAD), Computer-generated imagery (CGI), Digital content creation (DCC), and High-performance computing/GPGPU applications. The GPU chips on FirePro-branded graphics cards are identical to the ones used on Radeon-branded graphics cards. The end products differentiate substantially by the provided graphics device drivers and through the available professional support for the software. The product line is split into two categories: "W" workstation series focusing on workstation and primarily focusing on graphics and display, and "S" server series focused on virtualization and GPGPU/High-performance computing.
The AMD Accelerated Processing Unit (APU), formerly known as Fusion, is the marketing term for a series of 64-bit microprocessors from Advanced Micro Devices (AMD), designed to act as a central processing unit (CPU) and graphics processing unit (GPU) on a single die. APUs are general purpose processors that feature integrated graphics processors (IGPs).
AMD FireStream was AMD's brand name for their Radeon-based product line targeting stream processing and/or GPGPU in supercomputers. Originally developed by ATI Technologies around the Radeon X1900 XTX in 2006, the product line was previously branded as both ATI FireSTREAM and AMD Stream Processor. The AMD FireStream can also be used as a floating-point co-processor for offloading CPU calculations, which is part of the Torrenza initiative. The FireStream line has been discontinued since 2012, when GPGPU workloads were entirely folded into the AMD FirePro line.
The Evergreen series is a family of GPUs developed by Advanced Micro Devices for its Radeon line under the ATI brand name. It was employed in Radeon HD 5000 graphics card series and competed directly with Nvidia's GeForce 400 Series.
The Northern Islands series is a family of GPUs developed by Advanced Micro Devices (AMD) forming part of its Radeon-brand, based on the 40 nm process. Some models are based on TeraScale 2 (VLIW5), some on the new TeraScale 3 (VLIW4) introduced with them.
The Radeon HD 7000 series, codenamed "Southern Islands", is a family of GPUs developed by AMD, and manufactured on TSMC's 28 nm process. The primary competitor of Southern Islands, Nvidia's GeForce 600 Series, also shipped during Q1 2012, largely due to the immaturity of the 28 nm process.
Tsubame is a series of supercomputers that operates at the GSIC Center at the Tokyo Institute of Technology in Japan, designed by Satoshi Matsuoka.
Graphics Core Next (GCN) is the codename for a series of microarchitectures and an instruction set architecture that were developed by AMD for its GPUs as the successor to its TeraScale microarchitecture. The first product featuring GCN was launched on January 9, 2012.
Nvidia Tesla was the name of Nvidia's line of products targeted at stream processing or general-purpose graphics processing units (GPGPU), named after pioneering electrical engineer Nikola Tesla. Its products began using GPUs from the G80 series, and have continued to accompany the release of new chips. They are programmable using the CUDA or OpenCL APIs.
Pascal is the codename for a GPU microarchitecture developed by Nvidia, as the successor to the Maxwell architecture. The architecture was first introduced in April 2016 with the release of the Tesla P100 (GP100) on April 5, 2016, and is primarily used in the GeForce 10 series, starting with the GeForce GTX 1080 and GTX 1070, which were released on May 17, 2016 and June 10, 2016 respectively. Pascal was manufactured using TSMC's 16 nm FinFET process, and later Samsung's 14 nm FinFET process.
Volta is the codename for a GPU microarchitecture developed by Nvidia, succeeding Pascal. It was first announced on a roadmap in March 2013, although the first product was not announced until May 2017. The architecture is named after 18th–19th century Italian chemist and physicist Alessandro Volta. It was NVIDIA's first chip to feature Tensor Cores, specially designed cores that have superior deep learning performance over regular CUDA cores. The architecture is produced with TSMC's 12 nm FinFET process. The Ampere microarchitecture is the successor to Volta.
The Radeon 400 series is a series of graphics cards made by AMD. These cards were the first to feature the Polaris GPUs, using the new 14 nm FinFET manufacturing process, developed by Samsung Electronics and licensed to GlobalFoundries. The Polaris family initially included two new chips in the Graphics Core Next (GCN) family. Polaris implements the 4th generation of the Graphics Core Next instruction set, and shares commonalities with the previous GCN microarchitectures.
Radeon Pro is AMD's brand of professional oriented GPUs. It replaced AMD's FirePro brand in 2016. Compared to the Radeon brand for mainstream consumer/gamer products, the Radeon Pro brand is intended for use in workstations and the running of computer-aided design (CAD), computer-generated imagery (CGI), digital content creation (DCC), high-performance computing/GPGPU applications, and the creation and running of virtual reality programs and games.
The Radeon RX Vega series is a series of graphics processors developed by AMD. These GPUs use the Graphics Core Next (GCN) 5th generation architecture, codenamed Vega, and are manufactured on 14 nm FinFET technology, developed by Samsung Electronics and licensed to GlobalFoundries. The series consists of desktop graphics cards and APUs aimed at desktops, mobile devices, and embedded applications.
ROCm is an Advanced Micro Devices (AMD) software stack for graphics processing unit (GPU) programming. ROCm spans several domains: general-purpose computing on graphics processing units (GPGPU), high performance computing (HPC), heterogeneous computing. It offers several programming models: HIP, OpenMP/Message Passing Interface (MPI), OpenCL.
Intel Xe, earlier known unofficially as Gen12, is a GPU architecture developed by Intel.