Sequoia (supercomputer)

Last updated

Sequoia
IBM BlueGene Q (Sequoia supercomputer)(1).jpg
Operators LLNL
Location Livermore, California,
United States
Power7.9 MW
Operating system CNK operating system
Red Hat Enterprise Linux
Space3,000 square feet (280 m2)
Memory1.5 PiB
Speed20.13 PFLOPS
CostUS$250 million [1] (undisclosed by IBM [2] ); equivalent to $325 million in 2022
Purpose Nuclear weapons, astronomy, energy, human genome, and climate change

IBM Sequoia was a petascale Blue Gene/Q supercomputer constructed by IBM for the National Nuclear Security Administration as part of the Advanced Simulation and Computing Program (ASC). It was delivered to the Lawrence Livermore National Laboratory (LLNL) in 2011 and was fully deployed in June 2012. [3] Sequoia was dismantled in 2020, its last position on the top500.org list was #22 in the November 2019 list.

Contents

On June 14, 2012, the TOP500 Project Committee announced that Sequoia replaced the K computer as the world's fastest supercomputer, with a LINPACK performance of 17.17 petaflops, 63% faster than the K computer's 10.51 petaflops, having 123% more cores than the K computer's 705,024 cores. Sequoia is also more energy efficient, as it consumes 7.9  MW, 37% less than the K computer's 12.6 MW. [4] [5]

As of November 2017, Sequoia had dropped to sixth place on the TOP500 ranking, while it was at third position on June 17, 2013, behind Tianhe-2 and Titan. [6]

Record-breaking science applications have been run on Sequoia, the first to cross 10 petaflops of sustained performance. The cosmology simulation framework HACC achieved almost 14 petaflops with a 3.6 trillion particle benchmark run, [7] while the Cardioid code, [8] [9] which models the electrophysiology of the human heart, achieved nearly 12 petaflops with a near real-time simulation.

The entire supercomputer runs on Linux, with CNK running on over 98,000 nodes, and Red Hat Enterprise Linux running on 768 I/O nodes that are connected to the Lustre filesystem. [10]

Dawn prototype

Dawn prototype U.S. Department of Energy - Science - 477 020 010 (9444537887).jpg
Dawn prototype

IBM built a prototype, called "Dawn", capable of 500 teraflops, using the Blue Gene/P design, to evaluate the Sequoia design. This system was delivered in April 2009 and entered the Top500 list at 9th place in June 2009. [11]

Purpose

Sequoia was used primarily for nuclear weapons simulation, replacing the current Blue Gene/L and ASC Purple supercomputers at Lawrence Livermore National Laboratory. Sequoia was also available for scientific purposes such as astronomy, energy, lattice QCD, study of the human genome, and climate change.

Design

Node architecture

Sequoia was a Blue Gene/Q design, based on previous Blue Gene designs. It consisted of 96 racks containing 98,304 compute nodes, i.e., 1024 per rack. The compute nodes were 16-core A2 processor chips with 16  GB of DDR3 memory each. Thus, the system contained a total of 96·1024·16 = 1,572,864 processor cores with 1.5  PiB memory. It covered an area of about 3,000 square feet (280 m2). The compute nodes were interconnected in a 5-dimensional torus topology.

Job scheduler

LLNL used the SLURM job scheduler, also used by the Dawn prototype and China's Tianhe-IA, to manage Sequoia's resources. [12]

Filesystem

LLNL uses Lustre as the parallel filesystem, and has ported ZFS to Linux as the Lustre OSD (Object Storage Device) to take advantage of the performance and advanced features of the filesystem. [13]

In September 2011, NetApp announced that the DoE had selected the company for 55 PB of storage. [14] [15]

Power usage

U.S. Department of Energy - Science - 477 022 010 (9444538239).jpg

The complete system drew about 7.8  MW of power, but had a unprecedented energy efficiency, performing 2068 Mflops/watt, about 6 times as efficient as Dawn, and more than 2.5 times as efficient as the June 2011 Top 500 leader. [16]

Application

In January 2013, Sequoia set the record for the first supercomputer using more than one million computing cores at a time for a single application. The Stanford Engineering's Center for Turbulence Research (CTR) used it to solve a complex fluid dynamics problem  the prediction of noise generated by a supersonic jet engine. [17] [18]

See also

Related Research Articles

<span class="mw-page-title-main">Supercomputer</span> Type of extremely powerful computer

A supercomputer is a computer with a high level of performance as compared to a general-purpose computer. The performance of a supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of million instructions per second (MIPS). Since 2017, supercomputers have existed which can perform over 1017 FLOPS (a hundred quadrillion FLOPS, 100 petaFLOPS or 100 PFLOPS). For comparison, a desktop computer has performance in the range of hundreds of gigaFLOPS (1011) to tens of teraFLOPS (1013). Since November 2017, all of the world's fastest 500 supercomputers run on Linux-based operating systems. Additional research is being conducted in the United States, the European Union, Taiwan, Japan, and China to build faster, more powerful and technologically superior exascale supercomputers.

In computing, floating point operations per second is a measure of computer performance, useful in fields of scientific computations that require floating-point calculations. For such cases, it is a more accurate measure than measuring instructions per second.

<span class="mw-page-title-main">IBM Blue Gene</span> Series of supercomputers by IBM

Blue Gene was an IBM project aimed at designing supercomputers that can reach operating speeds in the petaFLOPS (PFLOPS) range, with low power consumption.

<span class="mw-page-title-main">Advanced Simulation and Computing Program</span>

The Advanced Simulation and Computing Program is a super-computing program run by the National Nuclear Security Administration, in order to simulate, test, and maintain the United States nuclear stockpile. The program was created in 1995 in order to support the Stockpile Stewardship Program. The goal of the initiative is to extend the lifetime of the current aging stockpile.

Lustre is a type of parallel distributed file system, generally used for large-scale cluster computing. The name Lustre is a portmanteau word derived from Linux and cluster. Lustre file system software is available under the GNU General Public License and provides high performance file systems for computer clusters ranging in size from small workgroup clusters to large-scale, multi-site systems. Since June 2005, Lustre has consistently been used by at least half of the top ten, and more than 60 of the top 100 fastest supercomputers in the world, including the world's No. 1 ranked TOP500 supercomputer in November 2022, Frontier, as well as previous top supercomputers such as Fugaku, Titan and Sequoia.

<span class="mw-page-title-main">TOP500</span> Database project devoted to the ranking of computers

The TOP500 project ranks and details the 500 most powerful non-distributed computer systems in the world. The project was started in 1993 and publishes an updated list of the supercomputers twice a year. The first of these updates always coincides with the International Supercomputing Conference in June, and the second is presented at the ACM/IEEE Supercomputing Conference in November. The project aims to provide a reliable basis for tracking and detecting trends in high-performance computing and bases rankings on HPL benchmarks, a portable implementation of the high-performance LINPACK benchmark written in Fortran for distributed-memory computers.

<span class="mw-page-title-main">JUGENE</span> Former supercomputer in Germany

JUGENE was a supercomputer built by IBM for Forschungszentrum Jülich in Germany. It was based on the Blue Gene/P and succeeded the JUBL based on an earlier design. It was at the introduction the second fastest computer in the world, and the month before its decommissioning in July 2012 it was still at the 25th position in the TOP500 list. The computer was owned by the "Jülich Supercomputing Centre" (JSC) and the Gauss Centre for Supercomputing.

<span class="mw-page-title-main">Cray XT5</span> Family of supercomputers

The Cray XT5 is an updated version of the Cray XT4 supercomputer, launched on November 6, 2007. It includes a faster version of the XT4's SeaStar2 interconnect router called SeaStar2+, and can be configured either with XT4 compute blades, which have four dual-core AMD Opteron processor sockets, or XT5 blades, with eight sockets supporting dual or quad-core Opterons. The XT5 uses a 3-dimensional torus network topology.

The National Center for Computational Sciences (NCCS) is a United States Department of Energy (DOE) Leadership Computing Facility that houses the Oak Ridge Leadership Computing Facility (OLCF), a DOE Office of Science User Facility charged with helping researchers solve challenging scientific problems of global interest with a combination of leading high-performance computing (HPC) resources and international expertise in scientific computing.

<span class="mw-page-title-main">Jaguar (supercomputer)</span> Japans next fastest Intel x86 based supercomputer

Jaguar or OLCF-2 was a petascale supercomputer built by Cray at Oak Ridge National Laboratory (ORNL) in Oak Ridge, Tennessee. The massively parallel Jaguar had a peak performance of just over 1,750 teraFLOPS. It had 224,256 x86-based AMD Opteron processor cores, and operated with a version of Linux called the Cray Linux Environment. Jaguar was a Cray XT5 system, a development from the Cray XT4 supercomputer.

<span class="mw-page-title-main">Tianhe-1</span> Supercomputer

Tianhe-I, Tianhe-1, or TH-1 is a supercomputer capable of an Rmax of 2.5 peta FLOPS. Located at the National Supercomputing Center of Tianjin, China, it was the fastest computer in the world from October 2010 to June 2011 and was one of the few petascale supercomputers in the world.

<span class="mw-page-title-main">Slurm Workload Manager</span> Free and open-source job scheduler for Linux and similar computers

The Slurm Workload Manager, formerly known as Simple Linux Utility for Resource Management (SLURM), or simply Slurm, is a free and open-source job scheduler for Linux and Unix-like kernels, used by many of the world's supercomputers and computer clusters.

<span class="mw-page-title-main">K computer</span> Supercomputer in Kobe, Japan

The K computer – named for the Japanese word/numeral "kei" (京), meaning 10 quadrillion (1016) – was a supercomputer manufactured by Fujitsu, installed at the Riken Advanced Institute for Computational Science campus in Kobe, Hyōgo Prefecture, Japan. The K computer was based on a distributed memory architecture with over 80,000 compute nodes. It was used for a variety of applications, including climate research, disaster prevention and medical research. The K computer's operating system was based on the Linux kernel, with additional drivers designed to make use of the computer's hardware.

<span class="mw-page-title-main">Supercomputing in Europe</span> Overview of supercomputing in Europe

Several centers for supercomputing exist across Europe, and distributed access to them is coordinated by European initiatives to facilitate high-performance computing. One such initiative, the HPC Europa project, fits within the Distributed European Infrastructure for Supercomputing Applications (DEISA), which was formed in 2002 as a consortium of eleven supercomputing centers from seven European countries. Operating within the CORDIS framework, HPC Europa aims to provide access to supercomputers across Europe.

<span class="mw-page-title-main">Supercomputer architecture</span> Design of high-performance computers

Approaches to supercomputer architecture have taken dramatic turns since the earliest systems were introduced in the 1960s. Early supercomputer architectures pioneered by Seymour Cray relied on compact innovative designs and local parallelism to achieve superior computational peak performance. However, in time the demand for increased computational power ushered in the age of massively parallel systems.

<span class="mw-page-title-main">Mira (supercomputer)</span>

Mira is a retired petascale Blue Gene/Q supercomputer. As of November 2017, it is listed on TOP500 as the 11th fastest supercomputer in the world, while it debuted June 2012 in 3rd place. It has a performance of 8.59 petaflops (LINPACK) and consumes 3.9 MW. The supercomputer was constructed by IBM for Argonne National Laboratory's Argonne Leadership Computing Facility with the support of the United States Department of Energy, and partially funded by the National Science Foundation. Mira was used for scientific research, including studies in the fields of material science, climatology, seismology, and computational chemistry. The supercomputer was used initially for sixteen projects selected by the Department of Energy.

<span class="mw-page-title-main">Tianhe-2</span> Supercomputer in Guangzhou, China

Tianhe-2 or TH-2 is a 33.86-petaflops supercomputer located in the National Supercomputer Center in Guangzhou, China. It was developed by a team of 1,300 scientists and engineers.

<span class="mw-page-title-main">Fermi (supercomputer)</span> Supercomputer located at CINECA

Fermi is a 2.097 petaFLOPS supercomputer located at CINECA.

<span class="mw-page-title-main">Sierra (supercomputer)</span> Supercomputer developed by IBM

Sierra or ATS-2 is a supercomputer built for the Lawrence Livermore National Laboratory for use by the National Nuclear Security Administration as the second Advanced Technology System. It is primarily used for predictive applications in nuclear weapon stockpile stewardship, helping to assure the safety, reliability, and effectiveness of the United States' nuclear weapons.

<span class="mw-page-title-main">Michael Gschwind</span> American computer scientist

Michael Karl Gschwind is an American computer scientist who currently is a director and principal engineer at Meta Platforms in Menlo Park, California. He is recognized for his seminal contributions to the design and exploitation of general-purpose programmable accelerators, as an early advocate of sustainability in computer design and as a prolific inventor.

References

  1. Brodkin, John (June 18, 2012). "With 16 petaflops and 1.6M cores, DOE supercomputer is world's fastest". Ars Technica. Retrieved August 17, 2019.
  2. "IBM US nuke-lab beast 'Sequoia' is top of the flops". The Register.
  3. NNSA awards IBM contract to build next generation supercomputer, February 3, 2009
  4. "TOP500 Press Release: Lawrence Livermore's Sequoia Supercomputer Towers above the Rest in Latest TOP500 List". TOP500. July 14, 2012. Archived from the original on August 7, 2012.
  5. Naveena Kottoor (June 18, 2012). "BBC News – IBM supercomputer overtakes Fujitsu as world's fastest". BBC News.
  6. "China's Tianhe-2 Supercomputer Takes No. 1 Ranking on 41st TOP500 List". TOP500. June 17, 2013.
  7. S. Habib; V. Morozov; H. Finkel; A. Pope; K. Heitmann; K. Kumaran; T. Peterka; J. Insley; D. Daniel; P. Fasel; N. Frontiere; Z. Lukic (2012). "The Universe at Extreme Scale: Multi-Petaflop Sky Simulation on the BG/Q". arXiv: 1211.4864 [cs.DC].
  8. "Cardioid Cardiac Modeling Project".
  9. "Venturing into the Heart of High-Performance Computing Simulations".
  10. "IBM supercomputer overtakes Japan's Fujitsu as world's fastest". TechSpot. June 18, 2012.
  11. Dawn Ranking History Archived December 1, 2010, at the Wayback Machine
  12. Multi-Petascale Computing on the Sequoia Architecture Archived August 7, 2011, at the Wayback Machine June 17, 2009
  13. ZFS on Linux for Lustre Archived October 31, 2014, at the Wayback Machine April 13, 2011, Brian Behlendorf, LLNL
  14. U.S. Department of Energy Selects NetApp as the Storage Foundation for One of the World’s Most Powerful Supercomputers, September 28, 2011
  15. Sequoia's 55PB Lustre+ZFS Filesystem on YouTube, April 24, 2012, RichReport
  16. The Top500 List – June 2011
  17. "Stanford Researchers Break Million-core Supercomputer Barrier" Standford Engineering, January 25, 2013.
  18. Stanford engineering Videos's channel on YouTube, January 30, 2013.
Records
Preceded by
K computer
10.51 petaflops
World's most powerful supercomputer
June 2012 – November 2012
Succeeded by
Titan
17.59 petaflops