Trinity (supercomputer)

Last updated
Trinity
Trinity (supercomputer).jpg
Operators National Nuclear Security Administration
Location Los Alamos National Laboratory
CostUS$174M [1]
PurposePrimarily utilized to perform milestone weapons calculations
Website lanl.gov/projects/trinity/

Trinity (or ATS-1) is a United States supercomputer built by the National Nuclear Security Administration (NNSA) for the Advanced Simulation and Computing Program (ASC). [2] The aim of the ASC program is to simulate, test, and maintain the United States nuclear stockpile.

Contents

History

Trinity technical specifications

Trinity High-Level Technical Specifications [10]
Operational Lifetime2015 to 2020
Architecture Cray XC40
Memory Capacity2.07 PiB
Peak Performance41.5 PF/s
Number of Compute Nodes19,420
Parallel File System Capacity78 PB (69 PiB)
Burst Buffer Capacity3.7 PB
Footprint4606 sq ft
power requirement8.6 MW

Compute Tier

Trinity was built in 2 stages. The first stage incorporated the Intel Xeon Haswell processor while the second stage added a significant performance increase using the Intel Xeon Phi Knights Landing Processor. There are 301,952 Haswell and 678,912 Knights Landing processors in the combined system, yielding a total peak performance of over 40 PF/s (petaflops) [4]

Storage Tiers

There are 5 primary storage tiers; Memory, Burst Buffer, Parallel File System, Campaign Storage, and Archive. [11]

Memory

2 PiB of DDR4 DRAM provide physical memory for the machine. Each processor also has DRAM built on to the tile, providing additional memory capacity. The data in this tier is highly transient and is typically in residence for only a few seconds, being overwritten continuously. [11]

Burst Buffer

Cray supplies the three hundred XC40 Data Warp blades that each contain 2 Burst Buffer nodes and 4 SSD drives. There is a total of 3.78 PB of storage in this tier, capable of moving data at a rate of up to 2 TB/s. In this tier, data is typically resident for a few hours, with data being overwritten in approximately that same time frame. [11]

Parallel File System

Trinity uses a Sonexion based Lustre file system with a total capacity of 78 PB. Throughput on this tier is about 1.8 TB/s (1.6 TiB/s). It is used to stage data in preparation for HPC operations. Data residence in this tier is typically several weeks.

Campaign Storage

The MarFS Filesystem fits into the Campaign Storage tier and combines properties of POSIX and Object storage models. The capacity of this tier is growing at a rate of about 30 PB/year, with a current capacity of over 100 PB. In testing, LANL scientists were able to create 968 billion files in a single directory at a rate of 835 million file creations per second. This storage is designed to be more robust than typical object storage, while sacrificing some of the end user functionality that you would get from a POSIX system. Performance of this tier is between 100-300 GB/s of throughput. Data residence in this tier is longer term, typically lasting several months.

Key Design goals

  • Transparency
  • Data protection
  • Recoverability
  • Ease of administration

MarFS is an open source filesystem. [12]

Archive

The final layer of storage is the Archive. This is a HPSS tape file system that holds approximately 100 PB of data.

Infographic on Trinity's file storage system. Click to enlarge. Trinity Storage Tiers image.jpg
Infographic on Trinity's file storage system. Click to enlarge.

See also

References

  1. 1 2 "Cray Awarded $174 Million Supercomputer Contract From the National Nuclear Security Administration" (Press release). July 10, 2014. Archived from the original on 2017-10-18. Retrieved 2014-08-24.
  2. Morgan, Timothy Prickett (1 October 2020). "With "Crossroads" Supercomputer, HPE Notches Another DOE Win". The Next Platform. Retrieved 5 November 2020.
  3. "Trinity / NERSC-8 RFP". Archived from the original on 2018-11-26. Retrieved 2018-11-26.
  4. 1 2 3 "Trinity Supercomputer's Haswell and KNL Partitions Are Merged". 19 July 2017.
  5. "Novermber [sic] 2015 | TOP500".
  6. "LANL Adds Capacity to Trinity Supercomputer for Stockpile Stewardship". 24 July 2017.
  7. "November 2016 | TOP500".
  8. "November 2018 | TOP500".
  9. "NNSA supercomputers recognized worldwide for speed and performance". Energy.gov. December 3, 2020. Retrieved 2023-11-13.
  10. "Technical Specifications". Archived from the original on 30 May 2023.
  11. 1 2 3 Grider, Gary (2018). "Storage Lessons from HPC: A Multi-Decadal Struggle" (PDF). Storage Developer Conference. www.snia.org.
  12. "MarFS". github.com. Retrieved 2024-07-22.