Aurora (supercomputer)

Last updated

Aurora
Aurora environment 1600x900.jpg
Active
  • Deployment: Nov, 2023
Operators Argonne National Laboratory and U.S. Department of Energy
Location Argonne Leadership Computing Facility
Power38.7 MW
Speed1.012  exaFLOPS (Rmax) / 1.98  exaFLOPS (Rpeak) [1]
CostUS$500 million (estimated cost)
PurposeScientific research and development
Website https://www.anl.gov/aurora

Aurora is an exascale supercomputer that was sponsored by the United States Department of Energy (DOE) and designed by Intel and Cray for the Argonne National Laboratory. [2] It has been the second fastest supercomputer in the world since 2023. It is expected that after optimizing its performance it will exceed 2 ExaFLOPS, making it the fastest computer ever.

Contents

The cost was estimated in 2019 to be US$500 million. [3] Olivier Franza is the chief architect and principal investigator of this design. [4]

History

In 2013 DOE presented their exascale vision of one exaFLOP at 20 MW by 2020. [5] Aurora was first announced in 2015 and to be finished in 2018. It was expected to have a speed of 180 petaFLOPS [6] which would be around the speed of Summit. Aurora was meant to be the most powerful supercomputer at the time of its launch and to be built by Cray with Intel processors. Later, in 2017, Intel announced that Aurora would be delayed to 2021 but scaled up to 1 exaFLOP. In March 2019, DOE said that it would build the first supercomputer with a performance of one exaFLOP in the United States in 2021. [7]

In October 2020, DOE said that Aurora would be delayed again for a further six months, and would no longer be the first exascale computer in the US. [8] In late October 2021 Intel announced that Aurora would now exceed 2 exaFLOPS in peak double-precision compute. [9] The system was fully installed on June 22, 2023. [10]

In May 2024, Aurora appeared at number two on the Top500 supercomputer list, with a performance of 1.012 exaFLOPS, marking the second entry of an exascale capable system on the Top500. [11] [12] [13] Aurora is still expected to exceed 2 exaFLOPS of performance once the entire system has been brought online and optimizations have been made, exceeding Frontier as the #1 supercomputer on Top500, as optimizing supercomputers can lead to significant performance improvements. [12]

Usage

Functions include research on nuclear fusion, [14] low carbon technologies, subatomic particles, cancer and cosmology. [15] [16] It will also develop new materials that will be useful for batteries and more efficient solar cells. [16] It is to be available to the general scientific community. [17]

Architecture

Aurora has over nine thousand nodes, with each node being composed of two Intel Xeon Max [18] processors, six Intel Max series GPUs and a unified memory architecture, providing a maximum computing power of 130 teraFLOPS per node. [19] It has around 10 petabytes of memory and 230 petabytes of storage.

The machine is estimated to consume around 60 MW of power. [20] For comparison, the fastest computer in the world today, Frontier uses 21 MW while Summit uses 13 MW.

See also

Related Research Articles

<span class="mw-page-title-main">Supercomputer</span> Type of extremely powerful computer

A supercomputer is a type of computer with a high level of performance as compared to a general-purpose computer. The performance of a supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of million instructions per second (MIPS). Since 2017, supercomputers have existed which can perform over 1017 FLOPS (a hundred quadrillion FLOPS, 100 petaFLOPS or 100 PFLOPS). For comparison, a desktop computer has performance in the range of hundreds of gigaFLOPS (1011) to tens of teraFLOPS (1013). Since November 2017, all of the world's fastest 500 supercomputers run on Linux-based operating systems. Additional research is being conducted in the United States, the European Union, Taiwan, Japan, and China to build faster, more powerful and technologically superior exascale supercomputers.

Floating point operations per second is a measure of computer performance in computing, useful in fields of scientific computations that require floating-point calculations.

<span class="mw-page-title-main">National Energy Research Scientific Computing Center</span> Supercomputer facility operated by the US Department of Energy in Berkeley, California

The National Energy Research Scientific Computing Center (NERSC), is a high-performance computing (supercomputer) National User Facility operated by Lawrence Berkeley National Laboratory for the United States Department of Energy Office of Science. As the mission computing center for the Office of Science, NERSC houses high performance computing and data systems used by 9,000 scientists at national laboratories and universities around the country. Research at NERSC is focused on fundamental and applied research in energy efficiency, storage, and generation; Earth systems science, and understanding of fundamental forces of nature and the universe. The largest research areas are in High Energy Physics, Materials Science, Chemical Sciences, Climate and Environmental Sciences, Nuclear Physics, and Fusion Energy research. NERSC's newest and largest supercomputer is Perlmutter, which debuted in 2021 ranked 5th on the TOP500 list of world's fastest supercomputers.

The Oak Ridge Leadership Computing Facility (OLCF), formerly the National Leadership Computing Facility, is a designated user facility operated by Oak Ridge National Laboratory and the Department of Energy. It contains several supercomputers, the largest of which is an HPE OLCF-5 named Frontier, which was ranked 1st on the TOP500 list of world's fastest supercomputers as of June 2023. It is located in Oak Ridge, Tennessee.

<span class="mw-page-title-main">TOP500</span> Database project devoted to the ranking of computers

The TOP500 project ranks and details the 500 most powerful non-distributed computer systems in the world. The project was started in 1993 and publishes an updated list of the supercomputers twice a year. The first of these updates always coincides with the International Supercomputing Conference in June, and the second is presented at the ACM/IEEE Supercomputing Conference in November. The project aims to provide a reliable basis for tracking and detecting trends in high-performance computing and bases rankings on HPL benchmarks, a portable implementation of the high-performance LINPACK benchmark written in Fortran for distributed-memory computers.

The Green500 is a biannual ranking of supercomputers, from the TOP500 list of supercomputers, in terms of energy efficiency. The list measures performance per watt using the TOP500 measure of high performance LINPACK benchmarks at double-precision floating-point format.

Petascale computing refers to computing systems capable of calculating at least 1015 floating point operations per second (1 petaFLOPS). Petascale computing allowed faster processing of traditional supercomputer applications. The first system to reach this milestone was the IBM Roadrunner in 2008. Petascale supercomputers were succeeded by exascale computers.

The National Center for Computational Sciences (NCCS) is a United States Department of Energy (DOE) Leadership Computing Facility that houses the Oak Ridge Leadership Computing Facility (OLCF), a DOE Office of Science User Facility charged with helping researchers solve challenging scientific problems of global interest with a combination of leading high-performance computing (HPC) resources and international expertise in scientific computing.

<span class="mw-page-title-main">Jaguar (supercomputer)</span> Cray supercomputer at Oak Ridge National Laboratory

Jaguar or OLCF-2 was a petascale supercomputer built by Cray at Oak Ridge National Laboratory (ORNL) in Oak Ridge, Tennessee. The massively parallel Jaguar had a peak performance of just over 1,750 teraFLOPS. It had 224,256 x86-based AMD Opteron processor cores, and operated with a version of Linux called the Cray Linux Environment. Jaguar was a Cray XT5 system, a development from the Cray XT4 supercomputer.

Exascale computing refers to computing systems capable of calculating at least "1018 IEEE 754 Double Precision (64-bit) operations (multiplications and/or additions) per second (exaFLOPS)"; it is a measure of supercomputer performance.

<span class="mw-page-title-main">History of supercomputing</span>

The history of supercomputing goes back to the 1960s when a series of computers at Control Data Corporation (CDC) were designed by Seymour Cray to use innovative designs and parallelism to achieve superior computational peak performance. The CDC 6600, released in 1964, is generally considered the first supercomputer. However, some earlier computers were considered supercomputers for their day such as the 1954 IBM NORC in the 1950s, and in the early 1960s, the UNIVAC LARC (1960), the IBM 7030 Stretch (1962), and the Manchester Atlas (1962), all of which were of comparable power.

<span class="mw-page-title-main">Supercomputing in Europe</span> Overview of supercomputing in Europe

Several centers for supercomputing exist across Europe, and distributed access to them is coordinated by European initiatives to facilitate high-performance computing. One such initiative, the HPC Europa project, fits within the Distributed European Infrastructure for Supercomputing Applications (DEISA), which was formed in 2002 as a consortium of eleven supercomputing centers from seven European countries. Operating within the CORDIS framework, HPC Europa aims to provide access to supercomputers across Europe.

<span class="mw-page-title-main">Xeon Phi</span> Series of x86 manycore processors from Intel

Xeon Phi is a discontinued series of x86 manycore processors designed and made by Intel. It was intended for use in supercomputers, servers, and high-end workstations. Its architecture allowed use of standard programming languages and application programming interfaces (APIs) such as OpenMP.

<span class="mw-page-title-main">Titan (supercomputer)</span> American supercomputer

Titan or OLCF-3 was a supercomputer built by Cray at Oak Ridge National Laboratory for use in a variety of science projects. Titan was an upgrade of Jaguar, a previous supercomputer at Oak Ridge, that uses graphics processing units (GPUs) in addition to conventional central processing units (CPUs). Titan was the first such hybrid to perform over 10 petaFLOPS. The upgrade began in October 2011, commenced stability testing in October 2012 and it became available to researchers in early 2013. The initial cost of the upgrade was US$60 million, funded primarily by the United States Department of Energy.

XK7 is a supercomputing platform, produced by Cray, launched on October 29, 2012. XK7 is the second platform from Cray to use a combination of central processing units ("CPUs") and graphical processing units ("GPUs") for computing; the hybrid architecture requires a different approach to programming to that of CPU-only supercomputers. Laboratories that host XK7 machines host workshops to train researchers in the new programming languages needed for XK7 machines. The platform is used in Titan, the world's second fastest supercomputer in the November 2013 list as ranked by the TOP500 organization. Other customers include the Swiss National Supercomputing Centre which has a 272 node machine and Blue Waters has a machine that has Cray XE6 and XK7 nodes that performs at approximately 1 petaFLOPS (1015 floating-point operations per second).

<span class="mw-page-title-main">Cray XC40</span> Supercomputer manufactured by Cray

The Cray XC40 is a massively parallel multiprocessor supercomputer manufactured by Cray. It consists of Intel Haswell Xeon processors, with optional Nvidia Tesla or Intel Xeon Phi accelerators, connected together by Cray's proprietary "Aries" interconnect, stored in air-cooled or liquid-cooled cabinets. The XC series supercomputers are available with the Cray DataWarp applications I/O accelerator technology.

<span class="mw-page-title-main">Summit (supercomputer)</span> Supercomputer developed by IBM

Summit or OLCF-4 is a supercomputer developed by IBM for use at Oak Ridge Leadership Computing Facility (OLCF), a facility at the Oak Ridge National Laboratory, United States of America. As of June 2024, it is the 9th fastest supercomputer in the world on the TOP500 list. It held the number 1 position on this list from November 2018 to June 2020. Its current LINPACK benchmark is clocked at 148.6 petaFLOPS.

<span class="mw-page-title-main">Frontier (supercomputer)</span> American supercomputer

Hewlett Packard Enterprise Frontier, or OLCF-5, is the world's first exascale supercomputer. It is hosted at the Oak Ridge Leadership Computing Facility (OLCF) in Tennessee, United States and became operational in 2022. As of December 2023, Frontier is the world's fastest supercomputer. It is based on the Cray EX and is the successor to Summit (OLCF-4). Frontier achieved an Rmax of 1.102 exaFLOPS, which is 1.102 quintillion floating-point operations per second, using AMD CPUs and GPUs.

<span class="mw-page-title-main">Fugaku (supercomputer)</span> Japanese supercomputer

Fugaku(Japanese: 富岳) is a petascale supercomputer at the Riken Center for Computational Science in Kobe, Japan. It started development in 2014 as the successor to the K computer and made its debut in 2020. It is named after an alternative name for Mount Fuji.

<span class="mw-page-title-main">LUMI</span> Supercomputer in Finland

LUMI is a petascale supercomputer located at the CSC data center in Kajaani, Finland. As of January 2023, the computer is the fastest supercomputer in Europe.

References

  1. "TOP500 May 2024". May 13, 2024. Retrieved May 13, 2024.
  2. Zarley, B. David (March 18, 2019). "America's first exascale supercomputer to be built by 2021". The Verge. Archived from the original on June 5, 2021. Retrieved September 17, 2020.
  3. "Intel and Cray are building a $500 million 'exascale' supercomputer for Argonne National Lab". Archived from the original on February 10, 2023. Retrieved September 26, 2020.
  4. Intel Corporation, Architecting the Future of Supercomputing, August 23, 2023
  5. "DOE Exascale Initiative" (PDF). Archived (PDF) from the original on March 30, 2021.
  6. Burt, Jeff (April 10, 2015). "Intel, Cray Awarded $200 Million to Build Powerful Supercomputer". eWEEK. Retrieved September 17, 2020.
  7. "The Argonne National Laboratory Supercomputer will Enable High Performance Computing and Artificial Intelligence at Exascale by 2021". Archived from the original on March 19, 2019.
  8. Black, Doug (October 9, 2020). "DOE Under Secretary for Science Dabbar's Exascale Update: Frontier to Be First, Aurora to Be Monitored". insideHPC. Archived from the original on October 28, 2020. Retrieved November 6, 2020.
  9. "Intel Innovation Spotlights New Products, Technology and Tools for..." Intel. Archived from the original on October 27, 2021. Retrieved October 27, 2021.
  10. Intel Corproation, "Aurora Supercomputer Blade Installation Complete", October 27, 2021
  11. "Top 500: Aurora Breaks into Exascale, but Can't Get to the Frontier of HPC". HPCwire. May 13, 2024. Retrieved May 13, 2024.
  12. 1 2 Shilov, Anton. "The Aurora Supercomputer Is Installed: 2 ExaFLOPS, Tens of Thousands of CPUs and GPUs". www.anandtech.com. Retrieved November 14, 2023.
  13. "Aurora - HPE Cray EX - Intel Exascale Compute Blade, Xeon CPU Max 9470 52C 2.4GHz, Intel Data Center GPU Max, Slingshot-11 | TOP500". www.top500.org. Retrieved May 13, 2024.
  14. "Using Exascale Supercomputers to Make Clean Fusion Energy Possible". September 2, 2022. Archived from the original on December 29, 2022. Retrieved February 10, 2023.
  15. Johnson, Rob. "Aurora Supercomputer to Assist in the Fight Against Cancer". TECHNOLOGY NETWORKS. Archived from the original on June 5, 2021. Retrieved September 26, 2020.
  16. 1 2 "Energy Department to spend 200 million on new aurora supercomputer". NBC News. April 9, 2015. Archived from the original on June 5, 2021. Retrieved September 26, 2020.
  17. "Aurora, Argonne supercomputer will be the most powerful in the U.S., will be installed at Argonne National Laboratory in the Chicago area". Archived from the original on November 25, 2020. Retrieved September 26, 2020.
  18. Papka, Michael (December 8, 2020), IEEE Chicago and ACM Chicago webinar: Supercomputing and ALCF - Dec 7 2020, archived from the original on November 15, 2021, retrieved December 9, 2020
  19. "Intel's 2021 Exascale Vision in Aurora". anandtech. Archived from the original on June 5, 2021. Retrieved November 24, 2020.
  20. "How Argonne Is Preparing for Exascale in 2022". HPCwire. September 8, 2021. Archived from the original on June 5, 2022. Retrieved June 14, 2022.