IBM Sequoia

Last updated

IBM Sequoia
Sequoia6.1000pix.jpg
Operators LLNL
Location Livermore, California,
United States
Power 7.9 MW
Operating system CNK operating system
Red Hat Enterprise Linux
Space 3,000 square feet (280 m2)
Memory 1.5 PiB
Speed 20.13 PFLOPS
Cost Unknown [1]
Purpose Nuclear weapons, astronomy, energy, human genome, and climate change

IBM Sequoia is a petascale Blue Gene/Q supercomputer constructed by IBM for the National Nuclear Security Administration as part of the Advanced Simulation and Computing Program (ASC). It was delivered to the Lawrence Livermore National Laboratory (LLNL) in 2011 and was fully deployed in June 2012. [2]

Petascale computing

In computing, petascale refers to a computer system capable of reaching performance in excess of one petaflops, i.e. one quadrillion floating point operations per second. The standard benchmark tool is LINPACK and Top500.org is the organization which tracks the fastest supercomputers. Some uniquely specialized petascale computers do not rank on the Top500 list since they cannot run LINPACK. This makes comparisons to ordinary supercomputers hard.

Supercomputer extremely powerful computer for its era

A supercomputer is a computer with a high level of performance compared to a general-purpose computer. The performance of a supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of million instructions per second (MIPS). Since 2017, there are supercomputers which can perform up to nearly a hundred quadrillion FLOPS. Since November 2017, all of the world's fastest 500 supercomputers run Linux-based operating systems. Additional research is being conducted in China, the United States, the European Union, Taiwan and Japan to build even faster, more powerful and more technologically superior exascale supercomputers.

IBM American multinational technology and consulting corporation

International Business Machines Corporation (IBM) is an American multinational information technology company headquartered in Armonk, New York, with operations in over 170 countries. The company began in 1911, founded in Endicott, New York, as the Computing-Tabulating-Recording Company (CTR) and was renamed "International Business Machines" in 1924.

Contents

On June 14, 2012, the TOP500 Project Committee announced that Sequoia replaced the K computer as the world's fastest supercomputer, with a LINPACK performance of 17.17 petaflops, 63% faster than the K computer's 10.51 petaflops, having 123% more cores than the K computer's 705,024 cores. Sequoia is also more energy efficient, as it consumes 7.9  MW, 37% less than the K computer's 12.6 MW. [3] [4]

TOP500

The TOP500 project ranks and details the 500 most powerful non-distributed computer systems in the world. The project was started in 1993 and publishes an updated list of the supercomputers twice a year. The first of these updates always coincides with the International Supercomputing Conference in June, and the second is presented at the ACM/IEEE Supercomputing Conference in November. The project aims to provide a reliable basis for tracking and detecting trends in high-performance computing and bases rankings on HPL, a portable implementation of the high-performance LINPACK benchmark written in Fortran for distributed-memory computers.

K computer supercomputer in Kobe

The K computer – named for the Japanese word "kei" (京), meaning 10 quadrillion (1016) – is a supercomputer manufactured by Fujitsu, currently installed at the Riken Advanced Institute for Computational Science campus in Kobe, Hyōgo Prefecture, Japan. The K computer is based on a distributed memory architecture with over 80,000 compute nodes. It is used for a variety of applications, including climate research, disaster prevention and medical research. The K computer's operating system is based on the Linux kernel, with additional drivers designed to make use of the computer's hardware.

LINPACK is a software library for performing numerical linear algebra on digital computers. It was written in Fortran by Jack Dongarra, Jim Bunch, Cleve Moler, and Gilbert Stewart, and was intended for use on supercomputers in the 1970s and early 1980s. It has been largely superseded by LAPACK, which runs more efficiently on modern architectures.

As of November 2017, Sequoia had dropped to sixth place on the TOP500 ranking, but first to third on June 17, 2013, behind Tianhe-2 and Titan. [5] In June 2016, it slipped again, to fourth place on the TOP500 ranking. In June 2017, it slipped again, to fifth place on the TOP500 ranking. [6]

Tianhe-2 supercomputer in Guangzhou

Tianhe-2 or TH-2 is a 33.86-petaflop supercomputer located in National Supercomputer Center in Guangzhou, China. It was developed by a team of 1,300 scientists and engineers.

Titan (supercomputer) American supercomputer

Titan or OLCF-3 is a supercomputer built by Cray at Oak Ridge National Laboratory for use in a variety of science projects. Titan is an upgrade of Jaguar, a previous supercomputer at Oak Ridge, that uses graphics processing units (GPUs) in addition to conventional central processing units (CPUs). Titan is the first such hybrid to perform over 10 petaFLOPS. The upgrade began in October 2011, commenced stability testing in October 2012 and it became available to researchers in early 2013. The initial cost of the upgrade was US$60 million, funded primarily by the United States Department of Energy.

Record-breaking science applications have been run on Sequoia, the first to cross 10 petaflops of sustained performance. The cosmology simulation framework HACC achieved almost 14 petaflops with a 3.6 trillion particle benchmark run, [7] while the Cardioid code, [8] [9] which models the electrophysiology of the human heart, achieved nearly 12 petaflops with a near real-time simulation.

The entire supercomputer runs on Linux, with CNK running on over 98,000 nodes, and Red Hat Enterprise Linux running on 768 I/O nodes that are connected to the Lustre filesystem. [10]

CNK operating system

Compute Node Kernel (CNK) is the node level operating system for the IBM Blue Gene series of supercomputers.

Red Hat Enterprise Linux Linux-based operating system developed by Red Hat

Red Hat Enterprise Linux is a Linux distribution developed by Red Hat and targeted toward the commercial market. Red Hat Enterprise Linux is released in server versions for x86-64, Power ISA, ARM64, and IBM Z, and a desktop version for x86-64. All of Red Hat's official support and training, together with the Red Hat Certification Program, focuses on the Red Hat Enterprise Linux platform. Red Hat Enterprise Linux is often abbreviated to RHEL.

Lustre is a type of parallel distributed file system, generally used for large-scale cluster computing. The name Lustre is a portmanteau word derived from Linux and cluster. Lustre file system software is available under the GNU General Public License and provides high performance file systems for computer clusters ranging in size from small workgroup clusters to large-scale, multi-site clusters.

Dawn prototype

IBM built a prototype, called "Dawn", capable of 500 teraflops, using the Blue Gene/P design, to evaluate the Sequoia design. This system was delivered in April 2009 and entered the Top500 list at 9th place in June 2009. [11]

Purpose

Sequoia will be used primarily for nuclear weapons simulation, replacing the current Blue Gene/L and ASC Purple supercomputers at Lawrence Livermore National Laboratory. Sequoia will also be available for scientific purposes such as astronomy, energy, lattice QCD, study of the human genome, and climate change.

Design

Node architecture

Sequoia is a Blue Gene/Q design, based on previous Blue Gene designs. It consists of 96 racks containing 98,304 compute nodes, i.e., 1024 per rack. The compute nodes are 16-core A2 processor chips with 16  GB of DDR3 memory each. Thus, the system contains a total of 96·1024·16 = 1,572,864 processor cores with 1.5  PiB memory. It covers an area of about 3,000 square feet (280 m2). The compute nodes are interconnected in a 5-dimensional torus topology.

Job scheduler

LLNL will use the SLURM job scheduler, which is also used by the Dawn prototype and China's Tianhe-IA, to manage Sequoia's resources. [12]

Filesystem

LLNL uses Lustre as the parallel filesystem, and has ported ZFS to Linux as the Lustre OSD (Object Storage Device) to take advantage of the performance and advanced features of the filesystem. [13]

In September 2011, NetApp announced that the DoE had selected the company for 55 PB of storage. [14] [15]

Power usage

The complete system will draw about 6  MW of power but is projected to have an unprecedented efficiency in performance per watt. The Sequoia design will perform 3000 Mflops/watt, about 7 times as efficient as the Blue Gene/P design it is replacing, and more than 3 times as efficient as a prior (June 2011) Top 500 leader. [16] [17]

Application

In January 2013, the Sequoia set the record for the first supercomputer using more than one million computing cores at a time for a single application. The Stanford Engineering's Center for Turbulence Research (CTR) used it to solve a complex fluid dynamics problem  the prediction of noise generated by a supersonic jet engine. [18] [19]

See also

Related Research Articles

In computing, floating point operations per second is a measure of computer performance, useful in fields of scientific computations that require floating-point calculations. For such cases it is a more accurate measure than measuring instructions per second.

IBM Blue Gene series of supercomputers by IBM

Blue Gene is an IBM project aimed at designing supercomputers that can reach operating speeds in the PFLOPS (petaFLOPS) range, with low power consumption.

Advanced Simulation and Computing Program

The Advanced Simulation and Computing Program is a super-computing program run by the National Nuclear Security Administration, in order to simulate, test, and maintain the United States nuclear stockpile. The program was created in 1995 in order to support the Stockpile Stewardship Program. The goal of the initiative is to extend the lifetime of the current aging stockpile.

National Energy Research Scientific Computing Center supercomputer facility operated by the US Department of Energy in Berkeley, California

The National Energy Research Scientific Computing Center, or NERSC, is a high performance computing (supercomputer) user facility operated by Lawrence Berkeley National Laboratory for the United States Department of Energy Office of Science. As the mission computing center for the Office of Science, NERSC houses high performance computing and data systems used by 7,000 scientists at national laboratories and universities around the country. NERSC's newest and largest supercomputer is Cori, which was ranked 5th on the TOP500 list of world's fastest supercomputers in November 2016. NERSC is located on the main Berkeley Lab campus in Berkeley, California.

MDGRAPE-3 is an ultra-high performance petascale supercomputer system developed by the Riken research institute in Japan. It is a special purpose system built for molecular dynamics simulations, especially protein structure prediction.

Blue Waters

Blue Waters is a petascale supercomputer at the National Center for Supercomputing Applications (NCSA) at the University of Illinois at Urbana-Champaign. On August 8, 2007, the National Science Board approved a resolution which authorized the National Science Foundation to fund "the acquisition and deployment of the world's most powerful leadership-class supercomputer." The NSF awarded $208 million for the Blue Waters project.

JUGENE

JUGENE was a supercomputer built by IBM for Forschungszentrum Jülich in Germany. It was based on the Blue Gene/P and succeeded the JUBL based on an earlier design. It was at the introduction the second fastest computer in the world, and the month before its decommissioning in July 2012 it was still at the 25th position in the TOP500 list. The computer was owned by the "Jülich Supercomputing Centre" (JSC) and the Gauss Centre for Supercomputing.

Cray XT5

The Cray XT5 is an updated version of the Cray XT4 supercomputer, launched on November 6, 2007. It includes a faster version of the XT4's SeaStar2 interconnect router called SeaStar2+, and can be configured either with XT4 compute blades, which have four dual-core AMD Opteron processor sockets, or XT5 blades, with eight sockets supporting dual or quad-core Opterons. The XT5 uses a 3-dimensional torus network topology.

The National Center for Computational Sciences (NCCS) is a United States Department of Energy Leadership Computing Facility. The NCCS provides resources for calculation and simulation in fields including astrophysics, materials, and climate research. This research is intended to enhance American competitiveness in industry. The NCCS, founded in 1992 and located at Oak Ridge National Laboratory (ORNL), currently manages a 2.33-petaflop Cray XT5 supercomputer named Jaguar for use in open research by academic and corporate researchers. Jaguar was named the world's fastest computer at SC09, a position it held until October 2010. Founded in 1992, the NCCS is a managed activity of the Advanced Scientific Computing Research program of the Department of Energy Office of Science (DOE-SC).

Jaguar (supercomputer) supercomputer that used to be at Oak Ridge National Laboratory

Jaguar or OLCF-2 was a petascale supercomputer built by Cray at Oak Ridge National Laboratory (ORNL) in Oak Ridge, Tennessee. The massively parallel Jaguar had a peak performance of just over 1,750 teraFLOPS. It had 224,256 x86-based AMD Opteron processor cores, and operated with a version of Linux called the Cray Linux Environment. Jaguar was a Cray XT5 system, a development from the Cray XT4 supercomputer.

Tianhe-I, Tianhe-1, or TH-1 is a supercomputer capable of an Rmax of 2.5 petaFLOPS. Located at the National Supercomputing Center of Tianjin, China, it was the fastest computer in the world from October 2010 to June 2011 and is one of the few petascale supercomputers in the world.

Slurm Workload Manager free and open-source job scheduler for Linux and similar computers

The Slurm Workload Manager, or Slurm, is a free and open-source job scheduler for Linux and Unix-like kernels, used by many of the world's supercomputers and computer clusters.

Supercomputer architecture

Approaches to supercomputer architecture have taken dramatic turns since the earliest systems were introduced in the 1960s. Early supercomputer architectures pioneered by Seymour Cray relied on compact innovative designs and local parallelism to achieve superior computational peak performance. However, in time the demand for increased computational power ushered in the age of massively parallel systems.

Mira (supercomputer)

Mira is a petascale Blue Gene/Q supercomputer. As of November 2017, it is listed on TOP500 as the 11th fastest supercomputer in the world, while it debuted June 2012 in 3rd place. It has a performance of 8.59 petaflops (LINPACK) and consumes 3.9 MW. The supercomputer was constructed by IBM for Argonne National Laboratory's Argonne Leadership Computing Facility with the support of the United States Department of Energy, and partially funded by the National Science Foundation. Mira will be used for scientific research, including studies in the fields of material science, climatology, seismology, and computational chemistry. The supercomputer is being utilized initially for sixteen projects, selected by the Department of Energy.

Fermi (supercomputer)

Fermi is a 2.097 petaFLOPS supercomputer located at CINECA.

References

  1. "IBM US nuke-lab beast 'Sequoia' is top of the flops". The Register.
  2. NNSA awards IBM contract to build next generation supercomputer, February 3, 2009
  3. "TOP500 Press Release: Lawrence Livermore's Sequoia Supercomputer Towers above the Rest in Latest TOP500 List". TOP500. July 14, 2012. Archived from the original on August 9, 2012.
  4. Naveena Kottoor (June 18, 2012). "BBC News – IBM supercomputer overtakes Fujitsu as world's fastest". BBC News.
  5. "China's Tianhe-2 Supercomputer Takes No. 1 Ranking on 41st TOP500 List". TOP500. June 17, 2013.
  6. "TOP500 List Refreshed". TOP500. June 2017.
  7. S. Habib; V. Morozov; H. Finkel; A. Pope; K. Heitmann; K. Kumaran; T. Peterka; J. Insley; D. Daniel; P. Fasel; N. Frontiere; Z. Lukic (2012). "The Universe at Extreme Scale: Multi-Petaflop Sky Simulation on the BG/Q". arXiv: 1211.4864 Lock-green.svg [cs.DC].
  8. "Cardioid Cardiac Modeling Project".
  9. "Venturing into the Heart of High-Performance Computing Simulations".
  10. "IBM supercomputer overtakes Japan's Fujitsu as world's fastest". TechSpot. June 18, 2012.
  11. Dawn Ranking History Archived December 1, 2010, at the Wayback Machine .
  12. Multi-Petascale Computing on the Sequoia Architecture Archived August 7, 2011, at the Wayback Machine . June 17, 2009
  13. ZFS on Linux for Lustre Archived October 31, 2014, at the Wayback Machine . April 13, 2011, Brian Behlendorf, LLNL
  14. U.S. Department of Energy Selects NetApp as the Storage Foundation for One of the World’s Most Powerful Supercomputers, September 28, 2011
  15. Sequoia's 55PB Lustre+ZFS Filesystem on YouTube, April 24, 2012, RichReport
  16. The Top500 List – June 2011
  17. The Green500 List – June 2011
  18. "Stanford Researchers Break Million-core Supercomputer Barrier" Standford Engineering, January 25, 2013.
  19. Stanford engineering Videos's channel on YouTube, January 30, 2013.
Records
Preceded by
K computer
10.51 petaflops
World's most powerful supercomputer
June 2012 – November 2012
Succeeded by
Titan
17.59 petaflops