Graph500

Last updated

The Graph500 is a rating of supercomputer systems, focused on data-intensive loads. The project was announced on International Supercomputing Conference in June 2010. The first list was published at the ACM/IEEE Supercomputing Conference in November 2010. New versions of the list are published twice a year. The main performance metric used to rank the supercomputers is GTEPS (giga- traversed edges per second).

Contents

Richard Murphy from Sandia National Laboratories, says that "The Graph500's goal is to promote awareness of complex data problems", instead of focusing on computer benchmarks like HPL (High Performance Linpack), which TOP500 is based on. [1]

Despite its name, there were several hundreds of systems in the rating, growing up to 174 in June 2014. [2]

The algorithm and implementation that won the championship is published in the paper titled "Extreme scale breadth-first search on supercomputers". [3]

There is also list Green Graph 500, which uses same performance metric, but sorts list according to performance per Watt, like Green 500 works with TOP500 (HPL).

Benchmark

The benchmark used in Graph500 stresses the communication subsystem of the system, instead of counting double precision floating-point. [1] It is based on a breadth-first search in a large undirected graph (a model of Kronecker graph with average degree of 16). There are three computation kernels in the benchmark: the first kernel is to generate the graph and compress it into sparse structures CSR or CSC (Compressed Sparse Row/Column); the second kernel does a parallel BFS search of some random vertices (64 search iterations per run); the third kernel runs a single-source shortest paths (SSSP) computation. Six possible sizes (Scales) of graph are defined: toy (226 vertices; 17 GB of RAM), mini (229; 137 GB), small (232; 1.1 TB), medium (236; 17.6 TB), large (239; 140 TB), and huge (242; 1.1 PB of RAM). [4]

The reference implementation of the benchmark contains several versions: [5]

The implementation strategy that have won the championship on the Japanese K computer is described in. [6]

Top 10 ranking

According to June 2023 release of the list the new Wuhan supercomputer is highest ranked for the SSSP results with 19039.1 GTEPS (and Fugaku is 4th) while for the BFS results its 2nd there with a different lower measurement for GTEPS: [7]

RankCountrySiteMachine (architecture)Number of nodesNumber of coresProblem scale GTEPS
1Flag of Japan.svg  Japan RIKEN Advanced Institute for Computational Science Supercomputer Fugaku (Fujitsu A64FX)152064729907242137096
2Flag of the People's Republic of China.svg  China Wuhan Kunpeng 920+Tesla A100252699955240121804.3
3Flag of the United States.svg  USA FrontierHPE Cray EX235a924887301124029654.6
4Flag of the People's Republic of China.svg  China Pengcheng LabPengcheng Cloudbrain-II (Kunpeng 920+Ascend 910)488936964025242.9
5Flag of the People's Republic of China.svg  China National Supercomputing Center in Wuxi Sunway TaihuLight (Sunway MPP)40768105996804023755.7

Japan also has a new computer ranked 8th.

2022

According to November 2022 release of the list: [8]

RankCountrySiteMachine (architecture)Number of nodesNumber of coresProblem scale GTEPS
1Flag of Japan.svg  Japan RIKEN Advanced Institute for Computational Science Supercomputer Fugaku (Fujitsu A64FX)158976763084841102955
2Flag of the People's Republic of China.svg  China Pengcheng LabPengcheng Cloudbrain-II (Kunpeng 920+Ascend 910)488936964025242.9
3Flag of the People's Republic of China.svg  China National Supercomputing Center in Wuxi Sunway TaihuLight (Sunway MPP)40768105996804023755.7
4Flag of Japan.svg  Japan Information Technology Center, University of Tokyo Wisteria/BDEC-01 (PRIMEHPC FX1000)76803686403716118
5Flag of Japan.svg  Japan Japan Aerospace Exploration Agency TOKI-SORA (PRIMEHPC FX1000)57602764803610813
6Flag of Europe.svg  EU EuroHPC/CSC LUMI-C (HPE Cray EX)1492190976388467.71
7Flag of the United States.svg  US Oak Ridge National Laboratory OLCF Summit (IBM POWER9)204886016407665.7
8Flag of Germany.svg  Germany Leibniz Rechenzentrum SuperMUC-NG (ThinkSystem SD530 Xeon Platinum 8174 24C 3.1GHz Intel Omni-Path)4096196608396279.47
9Flag of Germany.svg  Germany Zuse Institute Berlin Lise (Intel Omni-Path)1270121920385423.94
10Flag of the People's Republic of China.svg  China National Engineering Research Center for Big Data Technology and SystemDepGraph Supernode (DepGraph (+GPU Tesla A100))1128334623.379

2020

Arm-based Fugaku took the top spot of the list. [9]

2016

According to June 2016 release of the list: [10]

RankSiteMachine (architecture)Number of nodesNumber of coresProblem scale GTEPS
1 Riken Advanced Institute for Computational Science K computer (Fujitsu custom)829446635524038621.4
2 National Supercomputing Center in Wuxi Sunway TaihuLight (NRCPC - Sunway MPP)40768105996804023755.7
3 Lawrence Livermore National Laboratory IBM Sequoia (Blue Gene/Q)9830415728644123751
4 Argonne National Laboratory IBM Mira (Blue Gene/Q)491527864324014982
5 Forschungszentrum Jülich JUQUEEN (Blue Gene/Q)16384262144385848
6 CINECA Fermi (Blue Gene/Q)8192131072372567
7 Changsha, China Tianhe-2 (NUDT custom)8192196608362061.48
8CNRS/IDRIS-GENCITuring (Blue Gene/Q)409665536361427
8Science and Technology Facilities Council – Daresbury LaboratoryBlue Joule (Blue Gene/Q)409665536361427
8 University of Edinburgh DIRAC (Blue Gene/Q)409665536361427
8EDF R&DZumbrota (Blue Gene/Q)409665536361427
8 Victorian Life Sciences Computation Initiative Avoca (Blue Gene/Q)409665536361427

2014

According to June 2014 release of the list: [2]

RankSiteMachine (architecture)Number of nodesNumber of coresProblem scale GTEPS
1 RIKEN Advanced Institute for Computational Science K computer (Fujitsu custom)655365242884017977.1
2 Lawrence Livermore National Laboratory IBM Sequoia (Blue Gene/Q)6553610485764016599
3 Argonne National Laboratory IBM Mira (Blue Gene/Q)491527864324014328
4 Forschungszentrum Jülich JUQUEEN (Blue Gene/Q)16384262144385848
5 CINECA Fermi (Blue Gene/Q)8192131072372567
6 Changsha, China Tianhe-2 (NUDT custom)8192196608362061.48
7CNRS/IDRIS-GENCITuring (Blue Gene/Q)409665536361427
7Science and Technology Facilities Council - Daresbury LaboratoryBlue Joule (Blue Gene/Q)409665536361427
7 University of Edinburgh DIRAC (Blue Gene/Q)409665536361427
7EDF R&DZumbrota (Blue Gene/Q)409665536361427
7 Victorian Life Sciences Computation Initiative Avoca (Blue Gene/Q)409665536361427

2013

According to June 2013 release of the list: [11]

RankSiteMachine (architecture)Number of nodesNumber of coresProblem scale GTEPS
1Lawrence Livermore National LaboratoryIBM Sequoia (Blue Gene/Q)6553610485764015363
2Argonne National LaboratoryIBM Mira (Blue Gene/Q)491527864324014328
3Forschungszentrum JülichJUQUEEN (Blue Gene/Q)16384262144385848
4RIKEN Advanced Institute for Computational ScienceK computer (Fujitsu custom)65536524288405524.12
5CINECAFermi (Blue Gene/Q)8192131072372567
6Changsha, ChinaTianhe-2 (NUDT custom)8192196608362061.48
7CNRS/IDRIS-GENCITuring (Blue Gene/Q)409665536361427
7Science and Technology Facilities Council - Daresbury LaboratoryBlue Joule (Blue Gene/Q)409665536361427
7University of EdinburghDIRAC (Blue Gene/Q)409665536361427
7EDF R&DZumbrota (Blue Gene/Q)409665536361427
7Victorian Life Sciences Computation InitiativeAvoca (Blue Gene/Q)409665536361427

See also

Related Research Articles

<span class="mw-page-title-main">Supercomputer</span> Type of extremely powerful computer

A supercomputer is a computer with a high level of performance as compared to a general-purpose computer. The performance of a supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of million instructions per second (MIPS). Since 2017, there have existed supercomputers which can perform over 1017 FLOPS (a hundred quadrillion FLOPS, 100 petaFLOPS or 100 PFLOPS). For comparison, a desktop computer has performance in the range of hundreds of gigaFLOPS (1011) to tens of teraFLOPS (1013). Since November 2017, all of the world's fastest 500 supercomputers run on Linux-based operating systems. Additional research is being conducted in the United States, the European Union, Taiwan, Japan, and China to build faster, more powerful and technologically superior exascale supercomputers.

<span class="mw-page-title-main">IBM Blue Gene</span> Series of supercomputers by IBM

Blue Gene is an IBM project aimed at designing supercomputers that can reach operating speeds in the petaFLOPS (PFLOPS) range, with low power consumption.

Message Passing Interface (MPI) is a standardized and portable message-passing standard designed to function on parallel computing architectures. The MPI standard defines the syntax and semantics of library routines that are useful to a wide range of users writing portable message-passing programs in C, C++, and Fortran. There are several open-source MPI implementations, which fostered the development of a parallel software industry, and encouraged development of portable and scalable large-scale parallel applications.

<span class="mw-page-title-main">High-performance computing</span> Computing with supercomputers and clusters

High-performance computing (HPC) uses supercomputers and computer clusters to solve advanced computation problems.

<span class="mw-page-title-main">ASCI Red</span> Supercomputer

ASCI Red was the first computer built under the Accelerated Strategic Computing Initiative (ASCI), the supercomputing initiative of the United States government created to help the maintenance of the United States nuclear arsenal after the 1992 moratorium on nuclear testing.

<span class="mw-page-title-main">David Bader (computer scientist)</span> American computer scientist

David A. Bader is a Distinguished Professor and Director of the Institute for Data Science at the New Jersey Institute of Technology. Previously, he served as the Chair of the Georgia Institute of Technology School of Computational Science & Engineering, where he was also a founding professor, and the executive director of High-Performance Computing at the Georgia Tech College of Computing. In 2007, he was named the first director of the Sony Toshiba IBM Center of Competence for the Cell Processor at Georgia Tech.

NAS Parallel Benchmarks (NPB) are a set of benchmarks targeting performance evaluation of highly parallel supercomputers. They are developed and maintained by the NASA Advanced Supercomputing (NAS) Division based at the NASA Ames Research Center. NAS solicits performance results for NPB from all sources.

The Parallel Virtual File System (PVFS) is an open-source parallel file system. A parallel file system is a type of distributed file system that distributes file data across multiple servers and provides for concurrent access by multiple tasks of a parallel application. PVFS was designed for use in large scale cluster computing. PVFS focuses on high performance access to large data sets. It consists of a server process and a client library, both of which are written entirely of user-level code. A Linux kernel module and pvfs-client process allow the file system to be mounted and used with standard utilities. The client library provides for high performance access via the message passing interface (MPI). PVFS is being jointly developed between The Parallel Architecture Research Laboratory at Clemson University and the Mathematics and Computer Science Division at Argonne National Laboratory, and the Ohio Supercomputer Center. PVFS development has been funded by NASA Goddard Space Flight Center, The DOE Office of Science Advanced Scientific Computing Research program, NSF PACI and HECURA programs, and other government and private agencies. PVFS is now known as OrangeFS in its newest development branch.

<span class="mw-page-title-main">TOP500</span> Database project devoted to the ranking of computers

The TOP500 project ranks and details the 500 most powerful non-distributed computer systems in the world. The project was started in 1993 and publishes an updated list of the supercomputers twice a year. The first of these updates always coincides with the International Supercomputing Conference in June, and the second is presented at the ACM/IEEE Supercomputing Conference in November. The project aims to provide a reliable basis for tracking and detecting trends in high-performance computing and bases rankings on HPL benchmarks, a portable implementation of the high-performance LINPACK benchmark written in Fortran for distributed-memory computers.

Exascale computing refers to computing systems capable of calculating at least "1018 IEEE 754 Double Precision (64-bit) operations (multiplications and/or additions) per second (exaFLOPS)"; it is a measure of supercomputer performance.

HPC Challenge Benchmark combines several benchmarks to test a number of independent attributes of the performance of high-performance computer (HPC) systems. The project has been co-sponsored by the DARPA High Productivity Computing Systems program, the United States Department of Energy and the National Science Foundation.

<span class="mw-page-title-main">K computer</span> Supercomputer in Kobe, Japan

The K computer – named for the Japanese word/numeral "kei" (京), meaning 10 quadrillion (1016) – was a supercomputer manufactured by Fujitsu, installed at the Riken Advanced Institute for Computational Science campus in Kobe, Hyōgo Prefecture, Japan. The K computer was based on a distributed memory architecture with over 80,000 compute nodes. It was used for a variety of applications, including climate research, disaster prevention and medical research. The K computer's operating system was based on the Linux kernel, with additional drivers designed to make use of the computer's hardware.

The LINPACK Benchmarks are a measure of a system's floating-point computing power. Introduced by Jack Dongarra, they measure how fast a computer solves a dense n by n system of linear equations Ax = b, which is a common task in engineering.

<span class="mw-page-title-main">Supercomputing in Pakistan</span> Overview of supercomputing in Pakistan

The high performance supercomputing program started in mid-to-late 1980s in Pakistan. Supercomputing is a recent area of Computer science in which Pakistan has made progress, driven in part by the growth of the information technology age in the country. Developing on the ingenious supercomputer program started in 1980s when the deployment of the Cray supercomputers was initially denied.

<span class="mw-page-title-main">Summit (supercomputer)</span> Supercomputer developed by IBM

Summit or OLCF-4 is a supercomputer developed by IBM for use at Oak Ridge Leadership Computing Facility (OLCF), a facility at the Oak Ridge National Laboratory, capable of 200 petaFLOPS thus making it the 5th fastest supercomputer in the world after Frontier (OLCF-5), Fugaku, LUMI, and Leonardo, with Frontier being the fastest. It held the number 1 position from November 2018 to June 2020. Its current LINPACK benchmark is clocked at 148.6 petaFLOPS.

The HPCGbenchmark is a supercomputing benchmark test proposed by Michael Heroux from Sandia National Laboratories, and Jack Dongarra and Piotr Luszczek from the University of Tennessee. It is intended to model the data access patterns of real-world applications such as sparse matrix calculations, thus testing the effect of limitations of the memory subsystem and internal interconnect of the supercomputer on its computing performance. Because it is internally I/O bound, HPCG testing generally achieves only a tiny fraction of the peak FLOPS the computer could theoretically deliver.

Torus fusion (tofu) is a proprietary computer network topology for supercomputers developed by Fujitsu. It is a variant of the torus interconnect. The system has been used in the K computer and the Fugaku supercomputer.

The breadth-first-search algorithm is a way to explore the vertices of a graph layer by layer. It is a basic algorithm in graph theory which can be used as a part of other graph algorithms. For instance, BFS is used by Dinic's algorithm to find maximum flow in a graph. Moreover, BFS is also one of the kernel algorithms in Graph500 benchmark, which is a benchmark for data-intensive supercomputing problems. This article discusses the possibility of speeding up BFS through the use of parallel computing.

<span class="mw-page-title-main">Fugaku (supercomputer)</span> Japanese supercomputer

Fugaku(Japanese: 富岳) is a petascale supercomputer at the Riken Center for Computational Science in Kobe, Japan. It started development in 2014 as the successor to the K computer and made its debut in 2020. It is named after an alternative name for Mount Fuji.

JUWELS is a supercomputer developed by Atos Forschungszentrum Jülich, capable of 70.980 petaflops. It replaced the now disused JUQUEEN supercomputer. JUWELS Booster Module is ranked as the eight fastest supercomputer in the world. The JUWELS Booster Module is part of a modular system architecture and a second Xeon based JUWELS Module ranks separately as the 52nd fastest supercomputer in the world.

References

  1. 1 2 The Exascale Report (March 15, 2012). "The Case for the Graph 500 – Really Fast or Really Productive? Pick One". Inside HPC.
  2. 1 2 "June 2014 | Graph 500". Archived from the original on June 28, 2014. Retrieved June 26, 2014.
  3. Ueno, Koji; Suzumura, Toyotaro; Maruyama, Naoya; Fujisawa, Katsuki; Matsuoka, Satoshi (2016). "Extreme scale breadth-first search on supercomputers". 2016 IEEE International Conference on Big Data (Big Data). pp. 1040–1047. doi:10.1109/BigData.2016.7840705. ISBN   978-1-4673-9005-7. S2CID   8680200.
  4. Performance Evaluation of Graph500 on Large-Scale Distributed Environment // IEEE IISWC 2011, Austin, TX; presentation
  5. "Graph500: адекватный рейтинг" (in Russian). Open Systems #1 2011.
  6. Ueno, K.; Suzumura, T.; Maruyama, N.; Fujisawa, K.; Matsuoka, S. (December 1, 2016). "Extreme scale breadth-first search on supercomputers". 2016 IEEE International Conference on Big Data (Big Data). pp. 1040–1047. doi:10.1109/BigData.2016.7840705. ISBN   978-1-4673-9005-7. S2CID   8680200.
  7. "Complete Results - Graph 500". June 14, 2017. Retrieved July 21, 2023.
  8. "November 2022; Graph 500". June 14, 2017. Retrieved November 18, 2022.
  9. "Fujitsu and RIKEN Take First Place in Graph500 Ranking with Supercomputer Fugaku". HPCwire. June 23, 2020. Retrieved August 8, 2020.
  10. "June 2016 | Graph 500". Archived from the original on June 24, 2016. Retrieved July 6, 2016.
  11. "June 2013 | Graph 500". Archived from the original on June 21, 2013. Retrieved June 19, 2013.