Aurora (supercomputer)

Last updated

Aurora
Aurora environment 1600x900.jpg
Active
  • Deployment: Nov, 2023
Operators Argonne National Laboratory and U.S. Department of Energy
Location Argonne Leadership Computing Facility
Power38.7 MW
Speed1.012   exaFLOPS (Rmax) / 1.98  exaFLOPS (Rpeak) [1]
CostUS$500 million (estimated cost)
PurposeScientific research and development
Website https://www.anl.gov/aurora

Aurora is an Exascale supercomputer that was sponsored by the United States Department of Energy (DOE) and designed by Intel and Cray for the Argonne National Laboratory. [2] It has been the second fastest supercomputer in the world since 2023. It is expected that after optimizing its performance it will exceed 2 ExaFLOPS, making it the fastest computer ever.

Contents

The cost was estimated in 2019 to be US$500 million. [3] Olivier Franza is the chief architect and principal investigator of this design. [4]

History

In 2013 DOE presented their exascale vision of one exaFLOP at 20 MW by 2020. [5] Aurora was first announced in 2015 and to be finished in 2018. It was expected to have a speed of 180 petaFLOPS [6] which would be around the speed of Summit. Aurora was meant to be the most powerful supercomputer at the time of its launch and to be built by Cray with Intel processors. Later, in 2017, Intel announced that Aurora would be delayed to 2021 but scaled up to 1 exaFLOP. In March 2019, DOE said that it would build the first supercomputer with a performance of one exaFLOP in the United States in 2021. [7]

In October 2020, DOE said that Aurora would be delayed again for a further six months, and would no longer be the first exascale computer in the US. [8] In late October 2021 Intel announced that Aurora would now exceed 2 exaFLOPS in peak double-precision compute. [9] The system was fully installed on June 22, 2023. [10]

In May 2024, Aurora appeared at number two on the Top500 supercomputer list, with a performance of 1.012 exaFLOPS, marking a second entry of an Exascale capable system on the Top500. [11] [12] [13] Aurora is still expected to exceed 2 exaFLOPS of performance once the entire system has been brought online and optimizations have been made, exceeding Frontier as the #1 supercomputer on Top500, as optimizing supercomputers can lead to significant performance improvements. [12]

Usage

Functions include research on nuclear fusion, [14] low carbon technologies, subatomic particles, cancer and cosmology. [15] [16] It will also develop new materials that will be useful for batteries and more efficient solar cells. [16] It is to be available to the general scientific community. [17]

Architecture

Aurora has over nine thousand nodes, with each node being composed of two Intel Xeon Max [18] processors, six Intel Max series GPUs and a unified memory architecture, providing a maximum computing power of 130 teraFLOPS per node. [19] It has around 10 petabytes of memory and 230 petabytes of storage.

The machine is estimated to consume around 60 MW of power. [20] For comparison, the fastest computer in the world today, Frontier uses 21 MW while Summit uses 13 MW.

See also

Related Research Articles

<span class="mw-page-title-main">Supercomputer</span> Type of extremely powerful computer

A supercomputer is a type of computer with a high level of performance as compared to a general-purpose computer. The performance of a supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of million instructions per second (MIPS). Since 2017, supercomputers have existed which can perform over 1017 FLOPS (a hundred quadrillion FLOPS, 100 petaFLOPS or 100 PFLOPS). For comparison, a desktop computer has performance in the range of hundreds of gigaFLOPS (1011) to tens of teraFLOPS (1013). Since November 2017, all of the world's fastest 500 supercomputers run on Linux-based operating systems. Additional research is being conducted in the United States, the European Union, Taiwan, Japan, and China to build faster, more powerful and technologically superior exascale supercomputers.

Floating point operations per second is a measure of computer performance in computing, useful in fields of scientific computations that require floating-point calculations.

<span class="mw-page-title-main">MareNostrum</span> Supercomputer in the Barcelona Supercomputing Center

MareNostrum is the main supercomputer in the Barcelona Supercomputing Center. It is the most powerful supercomputer in Spain, one of thirteen supercomputers in the Spanish Supercomputing Network and one of the seven supercomputers of the European infrastructure PRACE.

<span class="mw-page-title-main">National Energy Research Scientific Computing Center</span> Supercomputer facility operated by the US Department of Energy in Berkeley, California

The National Energy Research Scientific Computing Center (NERSC), is a high-performance computing (supercomputer) National User Facility operated by Lawrence Berkeley National Laboratory for the United States Department of Energy Office of Science. As the mission computing center for the Office of Science, NERSC houses high performance computing and data systems used by 9,000 scientists at national laboratories and universities around the country. Research at NERSC is focused on fundamental and applied research in energy efficiency, storage, and generation; Earth systems science, and understanding of fundamental forces of nature and the universe. The largest research areas are in High Energy Physics, Materials Science, Chemical Sciences, Climate and Environmental Sciences, Nuclear Physics, and Fusion Energy research. NERSC's newest and largest supercomputer is Perlmutter, which debuted in 2021 ranked 5th on the TOP500 list of world's fastest supercomputers.

The Oak Ridge Leadership Computing Facility (OLCF), formerly the National Leadership Computing Facility, is a designated user facility operated by Oak Ridge National Laboratory and the Department of Energy. It contains several supercomputers, the largest of which is an HPE OLCF-5 named Frontier, which was ranked 1st on the TOP500 list of world's fastest supercomputers as of June 2023. It is located in Oak Ridge, Tennessee.

<span class="mw-page-title-main">TOP500</span> Database project devoted to the ranking of computers

The TOP500 project ranks and details the 500 most powerful non-distributed computer systems in the world. The project was started in 1993 and publishes an updated list of the supercomputers twice a year. The first of these updates always coincides with the International Supercomputing Conference in June, and the second is presented at the ACM/IEEE Supercomputing Conference in November. The project aims to provide a reliable basis for tracking and detecting trends in high-performance computing and bases rankings on HPL benchmarks, a portable implementation of the high-performance LINPACK benchmark written in Fortran for distributed-memory computers.

Petascale computing refers to computing systems capable of calculating at least 1015 floating point operations per second (1 petaFLOPS). Petascale computing allowed faster processing of traditional supercomputer applications. The first system to reach this milestone was the IBM Roadrunner in 2008. Petascale supercomputers were succeeded by exascale computers.

The National Center for Computational Sciences (NCCS) is a United States Department of Energy (DOE) Leadership Computing Facility that houses the Oak Ridge Leadership Computing Facility (OLCF), a DOE Office of Science User Facility charged with helping researchers solve challenging scientific problems of global interest with a combination of leading high-performance computing (HPC) resources and international expertise in scientific computing.

<span class="mw-page-title-main">Jaguar (supercomputer)</span> Cray supercomputer at Oak Ridge National Laboratory

Jaguar or OLCF-2 was a petascale supercomputer built by Cray at Oak Ridge National Laboratory (ORNL) in Oak Ridge, Tennessee. The massively parallel Jaguar had a peak performance of just over 1,750 teraFLOPS. It had 224,256 x86-based AMD Opteron processor cores, and operated with a version of Linux called the Cray Linux Environment. Jaguar was a Cray XT5 system, a development from the Cray XT4 supercomputer.

Exascale computing refers to computing systems capable of calculating at least "1018 IEEE 754 Double Precision (64-bit) operations (multiplications and/or additions) per second (exaFLOPS)"; it is a measure of supercomputer performance.

This list compares various amounts of computing power in instructions per second organized by order of magnitude in FLOPS.

<span class="mw-page-title-main">History of supercomputing</span>

The history of supercomputing goes back to the 1960s when a series of computers at Control Data Corporation (CDC) were designed by Seymour Cray to use innovative designs and parallelism to achieve superior computational peak performance. The CDC 6600, released in 1964, is generally considered the first supercomputer. However, some earlier computers were considered supercomputers for their day such as the 1954 IBM NORC in the 1950s, and in the early 1960s, the UNIVAC LARC (1960), the IBM 7030 Stretch (1962), and the Manchester Atlas (1962), all of which were of comparable power.

<span class="mw-page-title-main">Supercomputing in Europe</span> Overview of supercomputing in Europe

Several centers for supercomputing exist across Europe, and distributed access to them is coordinated by European initiatives to facilitate high-performance computing. One such initiative, the HPC Europa project, fits within the Distributed European Infrastructure for Supercomputing Applications (DEISA), which was formed in 2002 as a consortium of eleven supercomputing centers from seven European countries. Operating within the CORDIS framework, HPC Europa aims to provide access to supercomputers across Europe.

<span class="mw-page-title-main">Xeon Phi</span> Series of x86 manycore processors from Intel

Xeon Phi is a discontinued series of x86 manycore processors designed and made by Intel. It was intended for use in supercomputers, servers, and high-end workstations. Its architecture allowed use of standard programming languages and application programming interfaces (APIs) such as OpenMP.

<span class="mw-page-title-main">Cray XC40</span> Supercomputer manufactured by Cray

The Cray XC40 is a massively parallel multiprocessor supercomputer manufactured by Cray. It consists of Intel Haswell Xeon processors, with optional Nvidia Tesla or Intel Xeon Phi accelerators, connected together by Cray's proprietary "Aries" interconnect, stored in air-cooled or liquid-cooled cabinets. The XC series supercomputers are available with the Cray DataWarp applications I/O accelerator technology.

<span class="mw-page-title-main">Summit (supercomputer)</span> Supercomputer developed by IBM

Summit or OLCF-4 is a supercomputer developed by IBM for use at Oak Ridge Leadership Computing Facility (OLCF), a facility at the Oak Ridge National Laboratory, capable of 200 petaFLOPS thus making it the 5th fastest supercomputer in the world after Frontier (OLCF-5), Fugaku, LUMI, and Leonardo, with Frontier being the fastest. It held the number 1 position from November 2018 to June 2020. Its current LINPACK benchmark is clocked at 148.6 petaFLOPS.

<span class="mw-page-title-main">Frontier (supercomputer)</span> American supercomputer

Hewlett Packard Enterprise Frontier, or OLCF-5, is the world's first exascale supercomputer. It is hosted at the Oak Ridge Leadership Computing Facility (OLCF) in Tennessee, United States and became operational in 2022. As of December 2023, Frontier is the world's fastest supercomputer. It is based on the Cray EX and is the successor to Summit (OLCF-4). Frontier achieved an Rmax of 1.102 exaFLOPS, which is 1.102 quintillion floating-point operations per second, using AMD CPUs and GPUs.

<span class="mw-page-title-main">Fugaku (supercomputer)</span> Japanese supercomputer

Fugaku(Japanese: 富岳) is a petascale supercomputer at the Riken Center for Computational Science in Kobe, Japan. It started development in 2014 as the successor to the K computer and made its debut in 2020. It is named after an alternative name for Mount Fuji.

<span class="mw-page-title-main">LUMI</span> Supercomputer in Finland

LUMI is a petascale supercomputer located at the CSC data center in Kajaani, Finland. As of January 2023, the computer is the fastest supercomputer in Europe.

Zettascale computing refers to computing systems capable of calculating at least "1021 IEEE 754 Double Precision (64-bit) operations (multiplications and/or additions) per second (zettaFLOPS)". It is a measure of supercomputer performance, and as of July 2022 is a hypothetical performance barrier. A zettascale computer system could generate more single floating point data in one second than was stored by the total digital means on Earth in the first quarter of 2011.

References

  1. "TOP500 May 2024". May 13, 2024. Retrieved May 13, 2024.
  2. Zarley, B. David (March 18, 2019). "America's first exascale supercomputer to be built by 2021". The Verge. Archived from the original on June 5, 2021. Retrieved September 17, 2020.
  3. "Intel and Cray are building a $500 million 'exascale' supercomputer for Argonne National Lab". Archived from the original on February 10, 2023. Retrieved September 26, 2020.
  4. Intel Corporation, "Architecting the Future of Supercomputing", https://www.intel.com/content/www/us/en/newsroom/news/architecting-future-supercomputing.html#gs.4sfi85
  5. "DOE Exascale Initiative" (PDF). Archived (PDF) from the original on March 30, 2021.
  6. Burt, Jeff. "Intel, Cray Awarded $200 Million to Build Powerful Supercomputer". eWEEK. Retrieved September 17, 2020.
  7. "The Argonne National Laboratory Supercomputer will Enable High Performance Computing and Artificial Intelligence at Exascale by 2021". Archived from the original on March 19, 2019.
  8. Black, Doug. "DOE Under Secretary for Science Dabbar's Exascale Update: Frontier to Be First, Aurora to Be Monitored". insideHPC. Archived from the original on October 28, 2020. Retrieved November 6, 2020.
  9. "Intel Innovation Spotlights New Products, Technology and Tools for..." Intel. Archived from the original on October 27, 2021. Retrieved October 27, 2021.
  10. Intel Corproation, "Aurora Supercomputer Blade Installation Complete", https://www.intel.com/content/www/us/en/newsroom/news/aurora-supercomputer-blade-installation-complete.html#gs.20v5fr
  11. "Top 500: Aurora Breaks into Exascale, but Can't Get to the Frontier of HPC". HPCwire. Retrieved May 13, 2024.
  12. 1 2 Shilov, Anton. "The Aurora Supercomputer Is Installed: 2 ExaFLOPS, Tens of Thousands of CPUs and GPUs". www.anandtech.com. Retrieved November 14, 2023.
  13. "Aurora - HPE Cray EX - Intel Exascale Compute Blade, Xeon CPU Max 9470 52C 2.4GHz, Intel Data Center GPU Max, Slingshot-11 | TOP500". www.top500.org. Retrieved May 13, 2024.
  14. "Using Exascale Supercomputers to Make Clean Fusion Energy Possible". September 2, 2022. Archived from the original on December 29, 2022. Retrieved February 10, 2023.
  15. Johnson, Rob. "Aurora Supercomputer to Assist in the Fight Against Cancer". TECHNOLOGY NETWORKS. Archived from the original on June 5, 2021. Retrieved September 26, 2020.
  16. 1 2 "Energy Department to spend 200 million on new aurora supercomputer". NBC News. Archived from the original on June 5, 2021. Retrieved September 26, 2020.
  17. "Aurora, Argonne supercomputer will be the most powerful in the U.S., will be installed at Argonne National Laboratory in the Chicago area". Archived from the original on November 25, 2020. Retrieved September 26, 2020.
  18. Papka, Michael (December 8, 2020), IEEE Chicago and ACM Chicago webinar: Supercomputing and ALCF - Dec 7 2020, archived from the original on November 15, 2021, retrieved December 9, 2020
  19. "Intel's 2021 Exascale Vision in Aurora". anandtech. Archived from the original on June 5, 2021. Retrieved November 24, 2020.
  20. "How Argonne Is Preparing for Exascale in 2022". HPCwire. September 8, 2021. Archived from the original on June 5, 2022. Retrieved June 14, 2022.