National Energy Research Scientific Computing Center

Last updated
Shyh Wang Hall, which houses the National Energy Research Scientific Computing Center at Lawrence Berkeley National Laboratory. U.S. Department of Energy - Science - 391 009 010 (31308713210).jpg
Shyh Wang Hall, which houses the National Energy Research Scientific Computing Center at Lawrence Berkeley National Laboratory.

The National Energy Research Scientific Computing Center (NERSC), is a high-performance computing (supercomputer) National User Facility operated by Lawrence Berkeley National Laboratory for the United States Department of Energy Office of Science. As the mission computing center for the Office of Science, NERSC houses high performance computing and data systems used by 9,000 scientists at national laboratories and universities around the country. Research at NERSC is focused on fundamental and applied research in energy efficiency, storage, and generation; Earth systems science, and understanding of fundamental forces of nature and the universe. The largest research areas are in High Energy Physics, Materials Science, Chemical Sciences, Climate and Environmental Sciences, Nuclear Physics, and Fusion Energy research. NERSC's newest and largest supercomputer is Perlmutter, which debuted in 2021 ranked 5th on the TOP500 list of world's fastest supercomputers.

Contents

History

NERSC was founded in 1974 as the Controlled Thermonuclear Research Computer Center, or CTRCC, at Lawrence Livermore National Laboratory (LLNL). The center was created to provide computing resources to the fusion energy research community, and began with a Control Data Corporation 6600 computer (SN-1). The first machine procured directly by the center was a CDC 7600, installed in 1975 with a peak performance of 36 megaflop/s (36 million floating point operations per second). In 1976, the center was renamed the National Magnetic Fusion Energy Computer Center.

Subsequent supercomputers include a Cray-1 (SN-6), which was installed in May 1978 and called the "c" machine. In 1985, the world's first Cray-2 (SN-1) was installed as the "b" machine. The bubbles visible in the fluid of the Cray-2's direct liquid cooling system earned it the nickname "Bubbles."

In 1983, the center began providing a small portion of its resources to researchers outside the fusion community. As the center increasingly supported science across many research areas, it changed its name to the National Energy Research Supercomputer Center in 1990.

In 1995, the Department of Energy (DOE) moved NERSC from the Lawrence Livermore National Laboratory to Lawrence Berkeley National Laboratory. A cluster of Cray J90 systems was installed in Berkeley before the main systems at Livermore were shut down for the move in 1996 to provide continuous support for the research community. As a part of the move, the center was renamed the National Energy Research Scientific Computing Center, but kept the NERSC acronym. In 2000, NERSC moved to a new site in Oakland to accommodate the growing footprint of air-cooled supercomputers.

In November 2015, NERSC moved back to the main Berkeley Lab site and is housed in Shyh Wang Hall, an energy-efficient supercomputer facility. [1] [2] The building was financed by the University of California which manages Berkeley Lab for the U.S. Department of Energy (DOE). As with the move from LLNL, a new system was first installed in Berkeley before the machines in Oakland were taken down and moved. The utility infrastructure and computer systems are provided by the DOE.

Computers

The center names its major systems after scientists.

The newest supercomputer Perlmutter, is named after Saul Perlmutter, an astrophysicist at Berkeley Lab who shared the 2011 Nobel Prize in Physics for his contributions to research showing that the expansion of the universe is accelerating. It is a Cray system based on the Shasta architecture, with Zen 3 based AMD Epyc CPUs ("Milan") and NVIDIA Ampere GPUs. [3]

Another NERSC supercomputer is Cori, named after Gerty Cori, a biochemist who was the first American woman to receive a Nobel Prize in science. Cori is a Cray XC40 system with 622,336 Intel processor cores and a theoretical peak performance of 30 petaflop/s (30 quadrillion operations per second). Cori was delivered in two phases. The first phase—also known as the Data Partition—was installed in late 2015 and comprises 12 cabinets and more than 1,600 Intel Xeon "Haswell" compute nodes. The second phase [4] of Cori, installed in summer 2016, [5] added 52 cabinets and more than 9,300 nodes with second-generation Intel Xeon Phi processors (code-named Knights Landing, or KNL for short), making Cori the largest[ citation needed ] supercomputing system for open science based on KNL processors.

NERSC also houses a 200+ petabyte [6] High Performance Storage System (HPSS) for archival mass storage, in use since 1998.

NERSC facilities are accessible through the Energy Sciences Network, or ESnet, which is also managed by Lawrence Berkeley National Laboratory for the Department of Energy.

Projects

NERSC staff lead projects in computational science while also helping prepare the broader research community for the exascale era.

NESAP: The NERSC Exascale Science Applications Program partners with code teams and library and tool developers to prepare applications to use Cori's manycore architecture. Researchers prepare application codes for the new architecture. The NESAP partnership allows 20 projects to collaborate with NERSC, Cray, and Intel by providing access to early hardware, training, and preparation sessions with Intel and Cray staff. Eight of those 20 projects will also have an opportunity for a postdoctoral researcher to investigate computational science issues associated with energy-efficient many-core systems.

Shifter: Shifter is an open-source software tool based on Docker containers that enables NERSC users to analyze datasets from experimental facilities. Such containers allow an application to be packaged with its entire software stack—including some portions of the base OS files—as well as defining user environment variables and application "entry point".

HPC4Mfg (High Performance Computing for Manufacturing): NERSC is one of three DOE supercomputing centers working to create an ecosystem that allows experts at national laboratories to work directly with manufacturing industry members to teach them how to adopt or advance their use of high performance computing (HPC) to address manufacturing challenges with a goal of increasing energy efficiency, reducing environmental impacts and advancing clean energy technologies. The project is led by Lawrence Livermore National Laboratory.

NERSC's user community and scientific impact

In 2021 NERSC was acknowledged in more than 2,000 referenced scientific journal publications. Six Nobel Prize winning individuals or teams have used NERSC in their research.[ citation needed ]

In 2022, NERSC supported nearly 9,000 users from universities, national labs, and industries and has users in 50 US states, the District of Columbia, Puerto Rico, and 45 countries. [ citation needed ] NERSC supported researchers from 514 colleges and universities, 26 Department of Energy National Laboratories, 52 organizations in industry, 31 small businesses, 115 other government labs, and 19 non-profit organizations.[ citation needed ]

Related Research Articles

Lawrence Livermore National Laboratory (LLNL) is a federally funded research and development center in Livermore, California, United States. Originally established in 1952, the laboratory now is sponsored by the United States Department of Energy and administered privately by Lawrence Livermore National Security, LLC.

<span class="mw-page-title-main">Lawrence Berkeley National Laboratory</span> National laboratory located near Berkeley, California, U.S.

Lawrence Berkeley National Laboratory (LBNL) is a federally funded research and development center in the hills of Berkeley, California, United States. Established in 1931 by the University of California (UC), the laboratory is sponsored by the United States Department of Energy and administered by the UC system. Ernest Lawrence, who won the Nobel prize for inventing the cyclotron, founded the Lab and served as its Director until his death in 1958. Located in the Berkeley Hills, the lab overlooks the campus of the University of California, Berkeley.

<span class="mw-page-title-main">United States Department of Energy National Laboratories</span> Laboratories owned by the United States Department of Energy

The United States Department of Energy National Laboratories and Technology Centers is a system of laboratories overseen by the United States Department of Energy (DOE) for scientific and technological research. The primary mission of the DOE national laboratories is to conduct research and development (R&D) addressing national priorities: energy and climate, the environment, national security, and health. Sixteen of the seventeen DOE national laboratories are federally funded research and development centers administered, managed, operated and staffed by private-sector organizations under management and operating (M&O) contracts with the DOE.

<span class="mw-page-title-main">Energy Sciences Network</span>

The Energy Sciences Network (ESnet) is a high-speed computer network serving United States Department of Energy (DOE) scientists and their collaborators worldwide. It is managed by staff at the Lawrence Berkeley National Laboratory.

The Oak Ridge Leadership Computing Facility (OLCF), formerly the National Leadership Computing Facility, is a designated user facility operated by Oak Ridge National Laboratory and the Department of Energy. It contains several supercomputers, the largest of which is an HPE OLCF-5 named Frontier, which was ranked 1st on the TOP500 list of world's fastest supercomputers as of June 2023. It is located in Oak Ridge, Tennessee.

<span class="mw-page-title-main">TOP500</span> Database project devoted to the ranking of computers

The TOP500 project ranks and details the 500 most powerful non-distributed computer systems in the world. The project was started in 1993 and publishes an updated list of the supercomputers twice a year. The first of these updates always coincides with the International Supercomputing Conference in June, and the second is presented at the ACM/IEEE Supercomputing Conference in November. The project aims to provide a reliable basis for tracking and detecting trends in high-performance computing and bases rankings on HPL benchmarks, a portable implementation of the high-performance LINPACK benchmark written in Fortran for distributed-memory computers.

High Performance Storage System (HPSS) is a flexible, scalable, policy-based, software-defined Hierarchical Storage Management product developed by the HPSS Collaboration. It provides scalable hierarchical storage management (HSM), archive, and file system services using cluster, LAN and SAN technologies to aggregate the capacity and performance of many computers, disks, disk systems, tape drives, and tape libraries.

The Office of Science is a component of the United States Department of Energy (DOE). The Office of Science is the lead federal agency supporting fundamental scientific research for energy and the Nation’s largest supporter of basic research in the physical sciences. The Office of Science portfolio has two principal thrusts: direct support of scientific research and direct support of the development, construction, and operation of unique, open-access scientific user facilities that are made available for use by external researchers.

The National Center for Computational Sciences (NCCS) is a United States Department of Energy (DOE) Leadership Computing Facility that houses the Oak Ridge Leadership Computing Facility (OLCF), a DOE Office of Science User Facility charged with helping researchers solve challenging scientific problems of global interest with a combination of leading high-performance computing (HPC) resources and international expertise in scientific computing.

Shaheen is the name of a series of supercomputers owned and operated by King Abdullah University of Science and Technology (KAUST), Saudi Arabia. Shaheen is named after the Peregrine Falcon. The most recent model, Shaheen II, is the largest and most powerful supercomputer in the Middle East.

<span class="mw-page-title-main">Supercomputing in Europe</span> Overview of supercomputing in Europe

Several centers for supercomputing exist across Europe, and distributed access to them is coordinated by European initiatives to facilitate high-performance computing. One such initiative, the HPC Europa project, fits within the Distributed European Infrastructure for Supercomputing Applications (DEISA), which was formed in 2002 as a consortium of eleven supercomputing centers from seven European countries. Operating within the CORDIS framework, HPC Europa aims to provide access to supercomputers across Europe.

<span class="mw-page-title-main">Xeon Phi</span> Series of x86 manycore processors from Intel

Xeon Phi was a series of x86 manycore processors designed and made by Intel. It was intended for use in supercomputers, servers, and high-end workstations. Its architecture allowed use of standard programming languages and application programming interfaces (APIs) such as OpenMP.

<span class="mw-page-title-main">Appro</span> American technology company

Appro was a developer of supercomputing supporting High Performance Computing (HPC) markets focused on medium- to large-scale deployments. Appro was based in Milpitas, California with a computing center in Houston, Texas, and a manufacturing and support subsidiary in South Korea and Japan.

The Cray XC30 is a massively parallel multiprocessor supercomputer manufactured by Cray. It consists of Intel Xeon processors, with optional Nvidia Tesla or Xeon Phi accelerators, connected together by Cray's proprietary "Aries" interconnect, stored in air-cooled or liquid-cooled cabinets. Each liquid-cooled cabinet can contain up to 48 blades, each with eight CPU sockets, and uses 90 kW of power. The XC series supercomputers are available with the Cray DataWarp applications I/O accelerator technology.

<span class="mw-page-title-main">Trinity (supercomputer)</span>

Trinity is a United States supercomputer built by the National Nuclear Security Administration (NNSA) for the Advanced Simulation and Computing Program (ASC). The aim of the ASC program is to simulate, test, and maintain the United States nuclear stockpile.

<span class="mw-page-title-main">Cray XC40</span> Supercomputer manufactured by Cray

The Cray XC40 is a massively parallel multiprocessor supercomputer manufactured by Cray. It consists of Intel Haswell Xeon processors, with optional Nvidia Tesla or Intel Xeon Phi accelerators, connected together by Cray's proprietary "Aries" interconnect, stored in air-cooled or liquid-cooled cabinets. The XC series supercomputers are available with the Cray DataWarp applications I/O accelerator technology.

The Cray XC50 is a massively parallel multiprocessor supercomputer manufactured by Cray. The machine can support Intel Xeon processors, as well as Cavium ThunderX2 processors, Xeon Phi processors and NVIDIA Tesla P100 GPUs. The processors are connected by Cray's proprietary "Aries" interconnect, in a dragonfly network topology. The XC50 is an evolution of the XC40, with the main difference being the support of Tesla P100 processors and the use of Cray software release CLE 6 or 7.

Perlmutter is a supercomputer delivered to the National Energy Research Scientific Computing Center of the United States Department of Energy as the successor to Cori. It is being built by Cray and is based on their Shasta architecture which utilizes Zen 3 based AMD Epyc CPUs ("Milan") and next-generation Nvidia Tesla GPUs. Its intended use-cases are nuclear fusion simulations, climate projections and material and biological research. Phase 1, completed 27 May 2022, reached 70.9 PFLOPS of processing power.

<span class="mw-page-title-main">Aurora (supercomputer)</span> Planned supercomputer

Aurora is a planned supercomputer, originally contracted to be completed by 2018 but through a series of delays at the prime contractor, Intel Corporation, now planned to be completed in 2023. It was originally planned be the worlds’ fastest supercomputer with over 2 exaflops, however a series of delays have cast that into doubt. It is sponsored by the United States Department of Energy (DOE) and designed by Intel and Cray for the Argonne National Laboratory.

References

  1. "Less is More: LBNL Breaks New Ground in Data Center Optimization".
  2. "Berkeley Lab Opens State-of-the-Art Facility for Computational Science | Berkeley Lab". News Center. 2015-11-12. Retrieved 2018-02-08.
  3. "Perlmutter".
  4. "Cori Intel Xeon Phi (KNL) Nodes". National Energy Research Scientific Computing Center. Retrieved 2018-02-09.
  5. "Cori Supercomputer Now Fully Installed at Berkeley Lab". National Energy Research Scientific Computing Center. Retrieved 2018-02-09.
  6. "About". www.nersc.gov. Retrieved 2018-02-08.