Radeon Instinct

Last updated
AMD Radeon Instinct
AMD Radeon Instinct wordmark.svg
Design firm Advanced Micro Devices
Introduced2016
TypeServers

AMD Radeon Instinct is AMD's brand of deep learning oriented GPUs. [1] [2] It replaced AMD's FirePro S brand in 2016. Compared to the Radeon brand of mainstream consumer/gamer products, the Radeon Instinct branded products are intended to accelerate deep learning, artificial neural network, and high-performance computing/GPGPU applications.

Contents

The Radeon Instinct product line directly competes with Nvidia's Tesla and Intel Xeon Phi lines of deep learning and GPGPU cards.

Since MI100 introduction in November 2020, the Radeon Instinct family is known as AMD Instinct, dropping Radeon brand from its name.

Products

See also: List of AMD graphics processing units.

The three initial Radeon Instinct products were announced in December 2016, with each based on a different architecture.

MI6

The MI6 is a passively cooled, Polaris 10 based card with 16 GB of GDDR5 memory and with a <150 W TDP. [1] [2] At 5.7 TFLOPS (FP16 and FP32), the MI6 is expected to be used primarily for inference, rather than neural network training. The MI6 has a peak double precision (FP64) compute performance of 358 GFLOPS. [3]

MI8

The MI8 is a Fiji based card, analogous to the R9 Nano, and expected to have a <175W TDP. [1] The MI8 has 4 GB of High Bandwidth Memory. At 8.2 TFLOPS (FP16 and FP32), the MI8 is marked toward inference. The MI8 has a peak (FP64) double precision compute performance 512 GFLOPS. [4]

MI25

The MI25 is a Vega based card, utilizing HBM2 memory. The MI25 performance is expected to be 12.3 TFLOPS using FP32 numbers. In contrast to the MI6 and MI8, the MI25 is able to increase performance when using lower precision numbers, and accordingly is expected to reach 24.6 TFLOPS when using FP16 numbers. The MI25 is rated at <300W TDP with passive cooling. The MI25 also provides 768 GFLOPS peak double precision (FP64) at 1/16th rate. [5]

Software

MxGPU

The MI6, MI8, and MI25 products all support AMD's MxGPU virtualization technology, enabling sharing of GPU resources across multiple users. [1] [6]

MIOpen

MIOpen is AMD's deep learning library to enable GPU acceleration of deep learning. [1] Much of this extends the GPUOpen's Boltzmann Initiative software. [6] This is intended to compete with the deep learning portions of Nvidia's CUDA library. It supports the deep learning frameworks: Theano, Caffe, TensorFlow, MXNet, The Microsoft Cognitive Toolkit, Torch, and Chainer. Programming is supported in OpenCL and Python, in addition to supporting the compilation of CUDA through AMD's Heterogeneous-compute Interface for Portability and Heterogeneous Compute Compiler.

Chipset table

Model
(codename)
Release date Architecture
&  Fab
Transistors
& Die Size
Core Fillrate [lower-alpha 1] [lower-alpha 2] [lower-alpha 3] Processing power [lower-alpha 1] [lower-alpha 4]
(GFLOPS)
MemoryTBPBus
interface
Config [lower-alpha 5] Clock [lower-alpha 1] (MHz)Texture
(GT/s)
Pixel
(GP/s)
Half Single Double Bus type
& width
Size
(GiB)
Clock
(MT/s)
Bandwidth
(GB/s)
Radeon Instinct MI6
(Polaris 10) [1] [7] [8] [9] [10]
December 2016 GCN 4th gen
14 nm
5.7×109
232 mm2
2304:144:32
36 CU
1120
1233
177.639.4658005800358GDDR5
256-bit
167000224150 W PCIe 3.0 x16
Radeon Instinct MI8
(Fiji XT) [1] [7] [8] [11] [12]
GCN 3rd gen
28 nm
8.9×109
596 mm2
4096:256:64
64 CU
1000256.064.082008200512HBM
4096-bit
41000512175 W
Radeon Instinct MI25
(Vega 10 XT) [1] [7] [8] [13] [14] [15]
GCN 5th gen
14 nm
12.5×109
510 mm2
4096:256:64
64 CU
1400
1500
38496.02460012300768HBM2
2048-bit
161704436.2300 W
Radeon Instinct MI50
(Vega 20 GL) [16] [17] [18] [19]
November 2018 GCN 5th gen
7 nm
13.2×109
331 mm2
3840:240:-
60 CU
1450
1725
348
414
-26500133006600HBM2
4096-bit
16 or 32 [20] 20001024 PCIe 4.0 x16
Radeon Instinct MI60
(Vega 20 GL) [16] [21] [22]
4096:256:-
64 CU
1500
1800
384
460.8
-29450147257362.5321024
AMD Instinct MI100
(MI100 XL) [23]
November 12, 2020CDNA 1.0
7 nm
 ?
750 mm2
7680:480:-
120 CU
1000
1502
480
720
-184600231001150024001228.8
AMD Instinct MI210
(?) [24]
December, 2021CDNA 2.0
6 nm
58 x 109
? mm2
6656:416:-
104 CU
1000
1700
 ?
?
-1810002263022630HBM2e
4096-bit
6416001638300 W
AMD Instinct MI250
(?) [25]
November 8, 202113312:832:-
208 CU
1000
1700
 ?
?
-3621004530045300HBM2e
8192-bit
1283276.8500 W
560 W (Peak)
AMD Instinct MI250X
(?) [26]
14080:880:-
220 CU
1000
1700
 ?
?
-3830004790047900
  1. 1 2 3 Boost values (if available) are stated below the base value in italic.
  2. Texture fillrate is calculated as the number of texture mapping units multiplied by the base (or boost) core clock speed.
  3. Pixel fillrate is calculated as the number of render output units multiplied by the base (or boost) core clock speed.
  4. Precision performance is calculated from the base (or boost) core clock speed based on a FMA operation.
  5. Unified Shaders  : Texture Mapping Units  : Render Output Units and Compute Units (CU)

See also

Related Research Articles

FLOPS Measure of computer performance

In computing, floating point operations per second is a measure of computer performance, useful in fields of scientific computations that require floating-point calculations. For such cases, it is a more accurate measure than measuring instructions per second.

Graphics processing unit Specialized electronic circuit; graphics accelerator

A graphics processing unit (GPU) is a specialized electronic circuit designed to rapidly manipulate and alter memory to accelerate the creation of images in a frame buffer intended for output to a display device. GPUs are used in embedded systems, mobile phones, personal computers, workstations, and game consoles.

General-purpose computing on graphics processing units is the use of a graphics processing unit (GPU), which typically handles computation only for computer graphics, to perform computation in applications traditionally handled by the central processing unit (CPU). The use of multiple video cards in one computer, or large numbers of graphics chips, further parallelizes the already parallel nature of graphics processing.

AMD FirePro Brand by AMD

AMD FirePro was AMD's brand of graphics cards designed for use in workstations and servers running professional Computer-aided design (CAD), Computer-generated imagery (CGI), Digital content creation (DCC), and High-performance computing/GPGPU applications. The GPU chips on FirePro-branded graphics cards are identical to the ones used on Radeon-branded graphics cards. The end products differentiate substantially by the provided graphics device drivers and through the available professional support for the software. The product line is split into two categories: "W" workstation series focusing on workstation and primarily focusing on graphics and display, and "S" server series focused on virtualization and GPGPU/High-performance computing.

AMD Accelerated Processing Unit Marketing term by AMD

The AMD Accelerated Processing Unit (APU), formerly known as Fusion, is the marketing term for a series of 64-bit microprocessors from Advanced Micro Devices (AMD), designed to act as a central processing unit (CPU) and graphics processing unit (GPU) on a single die. APUs are general purpose processors that feature integrated graphics processors (IGPs).

AMD FireStream was AMD's brand name for their Radeon-based product line targeting stream processing and/or GPGPU in supercomputers. Originally developed by ATI Technologies around the Radeon X1900 XTX in 2006, the product line was previously branded as both ATI FireSTREAM and AMD Stream Processor. The AMD FireStream can also be used as a floating-point co-processor for offloading CPU calculations, which is part of the Torrenza initiative. The FireStream line has been discontinued since 2012, when GPGPU workloads were entirely folded into the AMD FirePro line.

The Evergreen series is a family of GPUs developed by Advanced Micro Devices for its Radeon line under the ATI brand name. It was employed in Radeon HD 5000 graphics card series and competed directly with Nvidia's GeForce 400 Series.

Radeon HD 6000 series Series of video cards

The Northern Islands series is a family of GPUs developed by Advanced Micro Devices (AMD) forming part of its Radeon-brand, based on the 40 nm process. Some models are based on TeraScale 2 (VLIW5), some on the new TeraScale 3 (VLIW4) introduced with them.

Radeon HD 7000 series Series of video cards

The Radeon HD 7000 series, codenamed "Southern Islands", is a family of GPUs developed by AMD, and manufactured on TSMC's 28 nm process. The primary competitor of Southern Islands, Nvidia's GeForce 600 Series, also shipped during Q1 2012, largely due to the immaturity of the 28 nm process.

Tsubame (supercomputer) Series of supercomputers

Tsubame is a series of supercomputers that operates at the GSIC Center at the Tokyo Institute of Technology in Japan, designed by Satoshi Matsuoka.

Graphics Core Next (GCN) is the codename for a series of microarchitectures and an instruction set architecture that were developed by AMD for its GPUs as the successor to its TeraScale microarchitecture. The first product featuring GCN was launched on January 9, 2012.

Nvidia Tesla Nvidias line of general purpose GPUs

Nvidia Tesla was the name of Nvidia's line of products targeted at stream processing or general-purpose graphics processing units (GPGPU), named after pioneering electrical engineer Nikola Tesla. Its products began using GPUs from the G80 series, and have continued to accompany the release of new chips. They are programmable using the CUDA or OpenCL APIs.

Pascal (microarchitecture) GPU microarchitecture by Nvidia

Pascal is the codename for a GPU microarchitecture developed by Nvidia, as the successor to the Maxwell architecture. The architecture was first introduced in April 2016 with the release of the Tesla P100 (GP100) on April 5, 2016, and is primarily used in the GeForce 10 series, starting with the GeForce GTX 1080 and GTX 1070, which were released on May 17, 2016 and June 10, 2016 respectively. Pascal was manufactured using TSMC's 16 nm FinFET process, and later Samsung's 14 nm FinFET process.

Volta is the codename for a GPU microarchitecture developed by Nvidia, succeeding Pascal. It was first announced on a roadmap in March 2013, although the first product was not announced until May 2017. The architecture is named after 18th–19th century Italian chemist and physicist Alessandro Volta. It was NVIDIA's first chip to feature Tensor Cores, specially designed cores that have superior deep learning performance over regular CUDA cores. The architecture is produced with TSMC's 12 nm FinFET process. The Ampere microarchitecture is the successor to Volta.

The Radeon 400 series is a series of graphics cards made by AMD. These cards were the first to feature the Polaris GPUs, using the new 14 nm FinFET manufacturing process, developed by Samsung Electronics and licensed to GlobalFoundries. The Polaris family initially included two new chips in the Graphics Core Next (GCN) family. Polaris implements the 4th generation of the Graphics Core Next instruction set, and shares commonalities with the previous GCN microarchitectures.

Radeon Pro Brand of AMD graphics cards intended for professional use

Radeon Pro is AMD's brand of professional oriented GPUs. It replaced AMD's FirePro brand in 2016. Compared to the Radeon brand for mainstream consumer/gamer products, the Radeon Pro brand is intended for use in workstations and the running of computer-aided design (CAD), computer-generated imagery (CGI), digital content creation (DCC), high-performance computing/GPGPU applications, and the creation and running of virtual reality programs and games.

The Radeon RX Vega series is a series of graphics processors developed by AMD. These GPUs use the Graphics Core Next (GCN) 5th generation architecture, codenamed Vega, and are manufactured on 14 nm FinFET technology, developed by Samsung Electronics and licensed to GlobalFoundries. The series consists of desktop graphics cards and APUs aimed at desktops, mobile devices, and embedded applications.

ROCm is an Advanced Micro Devices (AMD) software stack for graphics processing unit (GPU) programming. ROCm spans several domains: general-purpose computing on graphics processing units (GPGPU), high performance computing (HPC), heterogeneous computing. It offers several programming models: HIP, OpenMP/Message Passing Interface (MPI), OpenCL.

Intel Xe, earlier known unofficially as Gen12, is a GPU architecture developed by Intel.

References

  1. 1 2 3 4 5 6 7 8 Smith, Ryan (12 December 2016). "AMD Announces Radeon Instinct: GPU Accelerators for Deep Learning, Coming in 2017". Anandtech. Retrieved 12 December 2016.
  2. 1 2 Shrout, Ryan (12 December 2016). "Radeon Instinct Machine Learning GPUs include Vega, Preview Performance". PC Per. Retrieved 12 December 2016.
  3. "Radeon Instinct MI6". Radeon Instinct. AMD. Retrieved 22 June 2017.
  4. "Radeon Instinct MI8". Radeon Instinct. AMD. Retrieved 22 June 2017.
  5. "Radeon Instinct MI25". Radeon Instinct. AMD. Retrieved 22 June 2017.
  6. 1 2 Kampman, Jeff (12 December 2016). "AMD opens up machine learning with Radeon Instinct". TechReport. Retrieved 12 December 2016.
  7. 1 2 3 Shrout, Ryan (12 December 2016). "Radeon Instinct Machine Learning GPUs include Vega, Preview Performance". PC Per. Retrieved 12 December 2016.
  8. 1 2 3 Kampman, Jeff (12 December 2016). "AMD opens up machine learning with Radeon Instinct". TechReport. Retrieved 12 December 2016.
  9. "Radeon Instinct MI6". Radeon Instinct. AMD. Retrieved 22 June 2017.
  10. "AMD Radeon Instinct MI6 Specs". TechPowerUp.
  11. "Radeon Instinct MI8". Radeon Instinct. AMD. Retrieved 22 June 2017.
  12. "AMD Radeon Instinct MI8 Specs". TechPowerUp.
  13. Smith, Ryan (5 January 2017). "The AMD Vega Architecture Teaser: Higher IPC, Tiling, & More, coming in H1'2017". Anandtech.com. Retrieved 10 January 2017.
  14. "Radeon Instinct MI25". Radeon Instinct. AMD. Retrieved 22 June 2017.
  15. "AMD Radeon Instinct MI25 Specs". TechPowerUp.
  16. 1 2 "Next Horizon – David Wang Presentation" (PDF). AMD. AMD.
  17. "Radeon Instinct MI50". AMD. AMD.
  18. "Radeon Instinct MI50 Datasheet" (PDF). AMD. AMD.
  19. "Hands on with the AMD Radeon VII". PC Gamer. Jarred Walton.
  20. "AMD Radeon Instinct™ MI50 Accelerator (32GB)".
  21. "Radeon Instinct MI60". AMD. AMD.
  22. "Radeon Instinct MI60 Datasheet" (PDF). AMD. AMD.
  23. "Radeon Instinct MI100". AMD. AMD.
  24. "MI210 Accelerator". AMD Radeon Instinct MI210. TechPowerUp.
  25. "MI200 Series Accelerator" (PDF). AMD. AMD.
  26. "MI200 Series Accelerator" (PDF). AMD. AMD.