Radeon Instinct

AMD Radeon Instinct
Design firm	Advanced Micro Devices
Introduced	2016
Type	Servers

Last updated January 24, 2022

AMD Radeon Instinct is AMD's brand of deep-learning-oriented GPUs.^[1]^[2] It replaced AMD's FirePro S brand in 2016. Whereas the Radeon brand covers mainstream consumer and gaming products, Radeon Instinct products are intended to accelerate deep learning, artificial neural network, and high-performance computing/GPGPU applications.

Products

The three initial Radeon Instinct products were announced in December 2016, with each based on a different architecture.

MI6

The MI6 is a passively cooled, Polaris 10 based card with 16 GB of GDDR5 memory and a TDP below 150 W.^[1]^[2] At 5.7 TFLOPS (FP16 and FP32), the MI6 is expected to be used primarily for inference rather than neural network training. The MI6 has a peak double precision (FP64) compute performance of 358 GFLOPS.^[3]

MI8

The MI8 is a Fiji based card, analogous to the R9 Nano, and expected to have a <175 W TDP.^[1] The MI8 has 4 GB of High Bandwidth Memory. At 8.2 TFLOPS (FP16 and FP32), the MI8 is marketed toward inference. The MI8 has a peak double precision (FP64) compute performance of 512 GFLOPS.^[4]

MI25

The MI25 is a Vega based card, utilizing HBM2 memory. Its performance is expected to be 12.3 TFLOPS in FP32. In contrast to the MI6 and MI8, the MI25 is able to increase performance when using lower precision numbers, and accordingly is expected to reach 24.6 TFLOPS in FP16. The MI25 is rated at <300 W TDP with passive cooling. It also provides 768 GFLOPS of peak double precision (FP64) performance, 1/16 of the FP32 rate.^[5]
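The relationship between these three figures can be checked with a little arithmetic. The sketch below assumes the 4096 shaders and 1500 MHz boost clock listed for the MI25 in the chipset table, counts one fused multiply-add as two floating-point operations, and treats FP16 as running at twice the FP32 rate and FP64 at one sixteenth of it. The function name `peak_gflops` is hypothetical and chosen only for illustration.

```python
def peak_gflops(shaders: int, clock_mhz: float, rate: float = 1.0) -> float:
    """Peak throughput in GFLOPS.

    Each shader is assumed to retire one FMA (2 floating-point ops) per clock;
    `rate` scales the result relative to FP32 (2.0 for packed FP16, 1/16 for FP64).
    """
    return shaders * 2 * clock_mhz / 1000 * rate


# Values taken from the MI25 row of the chipset table (boost clock).
MI25_SHADERS = 4096
MI25_BOOST_MHZ = 1500

fp32 = peak_gflops(MI25_SHADERS, MI25_BOOST_MHZ)
fp16 = peak_gflops(MI25_SHADERS, MI25_BOOST_MHZ, rate=2.0)
fp64 = peak_gflops(MI25_SHADERS, MI25_BOOST_MHZ, rate=1 / 16)

print(f"FP32: {fp32:.0f} GFLOPS")
print(f"FP16: {fp16:.0f} GFLOPS")
print(f"FP64: {fp64:.0f} GFLOPS")
```

Under these assumptions the results are 12288, 24576 and 768 GFLOPS, which round to the 12.3 TFLOPS, 24.6 TFLOPS and 768 GFLOPS quoted above.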

Software

MxGPU

The MI6, MI8, and MI25 products all support AMD's MxGPU virtualization technology, enabling sharing of GPU resources across multiple users.^[1]^[6]

MIOpen

MIOpen is AMD's library for GPU acceleration of deep learning.^[1] Much of it extends GPUOpen's Boltzmann Initiative software.^[6] It is intended to compete with the deep learning portions of Nvidia's CUDA library. It supports the deep learning frameworks Theano, Caffe, TensorFlow, MXNet, the Microsoft Cognitive Toolkit, Torch, and Chainer. Programming is supported in OpenCL and Python, and CUDA code can be compiled through AMD's Heterogeneous-compute Interface for Portability and Heterogeneous Compute Compiler.

Chipset table

Model (codename)	Release date	Architecture & Fab	Transistors & Die Size	Core config^{[lower-alpha 5]}	Core clock^{[lower-alpha 1]} (MHz)	Texture fillrate^{[lower-alpha 1]}^{[lower-alpha 2]} (GT/s)	Pixel fillrate^{[lower-alpha 1]}^{[lower-alpha 3]} (GP/s)	Half precision^{[lower-alpha 1]}^{[lower-alpha 4]} (GFLOPS)	Single precision (GFLOPS)	Double precision (GFLOPS)	Memory bus type & width	Memory size (GiB)	Memory clock (MT/s)	Memory bandwidth (GB/s)	TBP	Bus interface
Radeon Instinct MI6 (Polaris 10) ^[1]^[7]^[8]^[9]^[10]	December 2016	GCN 4^th gen 14 nm	5.7×10⁹ 232 mm²	2304:144:32 36 CU	1120 1233	177.6	39.46	5800	5800	358	GDDR5 256-bit	16	7000	224	150 W	PCIe 3.0 x16
Radeon Instinct MI8 (Fiji XT) ^[1]^[7]^[8]^[11]^[12]		GCN 3^rd gen 28 nm	8.9×10⁹ 596 mm²	4096:256:64 64 CU	1000	256.0	64.0	8200	8200	512	HBM 4096-bit	4	1000	512	175 W
Radeon Instinct MI25 (Vega 10 XT)^[1]^[7]^[8]^[13]^[14]^[15]		GCN 5^th gen 14 nm	12.5×10⁹ 510 mm²	4096:256:64 64 CU	1400 1500	384	96.0	24600	12300	768	HBM2 2048-bit	16	1704	436.2	300 W
Radeon Instinct MI50 (Vega 20 GL)^[16]^[17]^[18]^[19]	November 2018	GCN 5^th gen 7 nm	13.2×10⁹ 331 mm²	3840:240:- 60 CU	1450 1725	348 414	-	26500	13300	6600	HBM2 4096-bit	16 or 32^[20]	2000	1024		PCIe 4.0 x16
Radeon Instinct MI60 (Vega 20 GL)^[16]^[21]^[22]	November 2018	GCN 5^th gen 7 nm	13.2×10⁹ 331 mm²	4096:256:- 64 CU	1500 1800	384 460.8	-	29450	14725	7362.5		32	2000	1024
AMD Instinct MI100 (MI100 XL)^[23]	November 12, 2020	CDNA 1.0 7 nm	? 750 mm²	7680:480:- 120 CU	1000 1502	480 720	-	184600	23100	11500		32	2400	1228.8
AMD Instinct MI210 (?)^[24]	December 2021	CDNA 2.0 6 nm	58×10⁹ ? mm²	6656:416:- 104 CU	1000 1700	? ?	-	181000	22630	22630	HBM2e 4096-bit	64	1600	1638	300 W
AMD Instinct MI250 (?)^[25]	November 8, 2021			13312:832:- 208 CU	1000 1700	? ?	-	362100	45300	45300	HBM2e 8192-bit	128		3276.8	500 W 560 W (Peak)
AMD Instinct MI250X (?)^[26]	November 8, 2021			14080:880:- 220 CU	1000 1700	? ?	-	383000	47900	47900	HBM2e 8192-bit	128		3276.8	500 W 560 W (Peak)

[lower-alpha 1] Boost values (if available) follow the base value.
[lower-alpha 2] Texture fillrate is calculated as the number of texture mapping units multiplied by the base (or boost) core clock speed.
[lower-alpha 3] Pixel fillrate is calculated as the number of render output units multiplied by the base (or boost) core clock speed.
[lower-alpha 4] Precision performance is calculated from the base (or boost) core clock speed based on an FMA operation.
[lower-alpha 5] Unified shaders : texture mapping units : render output units, followed by the number of compute units (CU).
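The formulas in these notes can be applied directly to the core configuration column. The following is a minimal sketch rather than a definitive model: the `Card` class and its method names are hypothetical, the FMA is counted as two operations per shader per clock, and the memory bandwidth formula (bus width in bytes multiplied by the transfer rate) is an assumption that is not stated in the notes. The input values are copied from the MI6 and MI8 rows of the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Card:
    """Hypothetical container for a few columns of the chipset table."""
    name: str
    shaders: int    # unified shaders
    tmus: int       # texture mapping units
    rops: int       # render output units
    clock_mhz: int  # boost clock where listed, otherwise base clock
    bus_bits: int   # memory bus width in bits
    mem_mts: int    # memory transfer rate in MT/s

    def texture_fillrate_gtps(self) -> float:
        # TMUs x core clock
        return self.tmus * self.clock_mhz / 1000

    def pixel_fillrate_gpps(self) -> float:
        # ROPs x core clock
        return self.rops * self.clock_mhz / 1000

    def fp32_gflops(self) -> float:
        # one FMA (2 floating-point ops) per shader per clock
        return self.shaders * 2 * self.clock_mhz / 1000

    def bandwidth_gbps(self) -> float:
        # assumed formula: bytes per transfer x transfers per second
        return self.bus_bits / 8 * self.mem_mts / 1000


mi6 = Card("Radeon Instinct MI6", 2304, 144, 32, 1233, 256, 7000)
mi8 = Card("Radeon Instinct MI8", 4096, 256, 64, 1000, 4096, 1000)

for card in (mi6, mi8):
    print(
        f"{card.name}: "
        f"{card.texture_fillrate_gtps():.1f} GT/s, "
        f"{card.pixel_fillrate_gpps():.2f} GP/s, "
        f"{card.fp32_gflops():.0f} GFLOPS FP32, "
        f"{card.bandwidth_gbps():.0f} GB/s"
    )
```

For the MI6 this gives about 177.6 GT/s, 39.46 GP/s and 224 GB/s, matching the table, and roughly 5682 GFLOPS in FP32, which corresponds to the 5.7 TFLOPS quoted in the MI6 section. For the MI8 it gives 256 GT/s, 64 GP/s, 8192 GFLOPS and 512 GB/s.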

References

[1] Smith, Ryan (12 December 2016). "AMD Announces Radeon Instinct: GPU Accelerators for Deep Learning, Coming in 2017". AnandTech. Retrieved 12 December 2016.
[2] Shrout, Ryan (12 December 2016). "Radeon Instinct Machine Learning GPUs include Vega, Preview Performance". PC Per. Retrieved 12 December 2016.
[3] "Radeon Instinct MI6". Radeon Instinct. AMD. Retrieved 22 June 2017.
[4] "Radeon Instinct MI8". Radeon Instinct. AMD. Retrieved 22 June 2017.
[5] "Radeon Instinct MI25". Radeon Instinct. AMD. Retrieved 22 June 2017.
[6] Kampman, Jeff (12 December 2016). "AMD opens up machine learning with Radeon Instinct". TechReport. Retrieved 12 December 2016.
[7] Shrout, Ryan (12 December 2016). "Radeon Instinct Machine Learning GPUs include Vega, Preview Performance". PC Per. Retrieved 12 December 2016.
[8] Kampman, Jeff (12 December 2016). "AMD opens up machine learning with Radeon Instinct". TechReport. Retrieved 12 December 2016.
[9] "Radeon Instinct MI6". Radeon Instinct. AMD. Retrieved 22 June 2017.
[10] "AMD Radeon Instinct MI6 Specs". TechPowerUp.
[11] "Radeon Instinct MI8". Radeon Instinct. AMD. Retrieved 22 June 2017.
[12] "AMD Radeon Instinct MI8 Specs". TechPowerUp.
[13] Smith, Ryan (5 January 2017). "The AMD Vega Architecture Teaser: Higher IPC, Tiling, & More, coming in H1'2017". AnandTech. Retrieved 10 January 2017.
[14] "Radeon Instinct MI25". Radeon Instinct. AMD. Retrieved 22 June 2017.
[15] "AMD Radeon Instinct MI25 Specs". TechPowerUp.
[16] "Next Horizon – David Wang Presentation" (PDF). AMD.
[17] "Radeon Instinct MI50". AMD.
[18] "Radeon Instinct MI50 Datasheet" (PDF). AMD.
[19] Walton, Jarred. "Hands on with the AMD Radeon VII". PC Gamer.
[20] "AMD Radeon Instinct™ MI50 Accelerator (32GB)".
[21] "Radeon Instinct MI60". AMD.
[22] "Radeon Instinct MI60 Datasheet" (PDF). AMD.
[23] "Radeon Instinct MI100". AMD.
[24] "MI210 Accelerator". AMD Radeon Instinct MI210. TechPowerUp.
[25] "MI200 Series Accelerator" (PDF). AMD.
[26] "MI200 Series Accelerator" (PDF). AMD.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
