Tianhe-2

Tianhe-2
Sponsors 863 Program
Location National Supercomputer Center, Guangzhou, China
Architecture 32,000 Intel Xeon E5-2692 12C at 2.200 GHz; 48,000 Xeon Phi 31S1P
Power 17.6 MW (24 MW with cooling)
Operating system Kylin Linux [1]
Memory 1,375 TiB (1,000 TiB CPU and 375 TiB coprocessor) [1]
Storage 12.4 PB
Speed 33.86 PFLOPS
Cost 2.4 billion Yuan (US$390 million) [2]
Purpose Simulation, analysis, and government security applications.

Tianhe-2 or TH-2 (Chinese: 天河-2; pinyin: tiānhé-èr; lit. 'Heavenriver-2', i.e. 'Milky Way 2') is a 33.86-petaflop supercomputer located in the National Supercomputer Center in Guangzhou, China. [3] It was developed by a team of 1,300 scientists and engineers.

It was the world's fastest supercomputer according to the TOP500 lists for June 2013, November 2013, June 2014, November 2014, June 2015, and November 2015. [4] [5] The record was surpassed in June 2016 by the Sunway TaihuLight. In 2015, plans by Sun Yat-sen University in collaboration with Guangzhou district and city administration to double its computing capacities were stopped by a U.S. government rejection of Intel's application for an export license for the CPUs and coprocessor boards. [6] [7] [8]

In response to the U.S. sanctions, China introduced the Sunway TaihuLight supercomputer in 2016. It substantially outperforms Tianhe-2 and uses completely domestic technology, including the Sunway manycore microprocessor; in November 2022 it ranked eighth in the TOP500 list. The sanctions also affected the upgrade of Tianhe-2 to Tianhe-2A, which replaced U.S. technology. [9]

History

The development of Tianhe-2 was sponsored by the 863 High Technology Program, initiated by the Chinese government, the government of Guangdong province, and the government of Guangzhou city. [1] It was built by China's National University of Defense Technology (NUDT) in collaboration with the Chinese IT firm Inspur. [1] [5] Inspur manufactured the printed circuit boards and helped with the installation and testing of the system software. [1] The project was originally scheduled for completion in 2015, but was instead declared operational in June 2013. [10] As of June 2013, the supercomputer had yet to become fully operational. It was expected to reach its full computing capabilities by the end of 2013. [5]

In June 2013, Tianhe-2 topped the TOP500 list of fastest supercomputers in the world and was still listed as the fastest machine in the November 2015 list. [11] The computer beat out second-place finisher Titan by nearly a 2-to-1 margin. Titan, which is housed at the United States Department of Energy's Oak Ridge National Laboratory, achieved 17.59 petaflops, while Tianhe-2 achieved 33.86 petaflops. Tianhe-2's performance returned the title of the world's fastest supercomputer to China after Tianhe-I's début in November 2010. The Institute of Electrical and Electronics Engineers said Tianhe-2's win "symbolizes China's unflinching commitment to the supercomputing arms race". [5] In June 2013, China housed 66 of the top 500 supercomputers, second only to the United States' 252 systems. [3] The Chinese total increased to 168 of the top 500 systems by June 2016, overtaking the United States which fell to 165 of the top 500 supercomputers. [12]

Graph500 is an alternate list of top supercomputers based on a benchmark testing analysis of graphs. [13] In that benchmark, Tianhe-2 tested at 2,061 gigaTEPS (traversed edges per second). The top system, IBM Sequoia, tested at 15,363 gigaTEPS. [13] Tianhe-2 also held first place in the HPCG benchmark test proposed by Jack Dongarra, with 0.580 HPCG PFLOPS in June 2014. [14]

Tianhe-2 has been housed at the National University of Defense Technology. [15]

Specifications

According to NUDT, Tianhe-2 was intended for simulation, analysis, and government security applications. [1]

With 16,000 computer nodes, each comprising two Intel Ivy Bridge Xeon processors and three Xeon Phi coprocessor chips, it represented the world's largest installation of Ivy Bridge and Xeon Phi chips, counting a total of 3,120,000 cores. [3] Because of U.S. sanctions, the upgraded Tianhe-2A replaced the Xeon Phi accelerators with Matrix-2000 accelerators. [16] The upgraded system is faster and has 4,981,760 cores in total, but it still dropped from 2nd to 4th place as newer, faster systems were added to the list. Each of the 16,000 nodes possessed 88 gigabytes of memory (64 used by the Ivy Bridge processors, and 8 gigabytes for each of the Xeon Phi processors). The total CPU plus coprocessor memory was 1,375 TiB (approximately 1.34 PiB). [1] The system has a 12.4 PiB H2FS file system consisting of IO forwarding nodes providing a 1 TiB/s burst rate backed by a Lustre file system with 100 GiB/s sustained throughput. [17] [18]
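The node-level figures above can be cross-checked with a little arithmetic. The following is a minimal sketch, not an official configuration: the per-chip core counts (12 for each Xeon, 57 for each Xeon Phi 31S1P) and the reading of "gigabytes" as binary GiB are assumptions added here, not statements from the cited sources.

```python
# Figures quoted in the text above.
NODES = 16_000
XEONS_PER_NODE = 2
PHIS_PER_NODE = 3
CPU_MEM_PER_NODE = 64   # gigabytes, used by the two Xeons
MEM_PER_PHI = 8         # gigabytes per Xeon Phi

# Assumptions (not stated in the text above):
XEON_CORES = 12         # read from the "12C" suffix of the E5-2692
PHI_CORES = 57          # assumed core count of the Xeon Phi 31S1P
GIB_PER_TIB = 1024      # "gigabytes" treated as binary GiB


def system_totals():
    """Return aggregate chip, core, and memory counts for the whole machine."""
    xeons = NODES * XEONS_PER_NODE
    phis = NODES * PHIS_PER_NODE
    mem_per_node = CPU_MEM_PER_NODE + PHIS_PER_NODE * MEM_PER_PHI
    return {
        "xeons": xeons,
        "phis": phis,
        "cores": xeons * XEON_CORES + phis * PHI_CORES,
        "mem_per_node_gib": mem_per_node,
        "cpu_mem_tib": NODES * CPU_MEM_PER_NODE / GIB_PER_TIB,
        "phi_mem_tib": NODES * PHIS_PER_NODE * MEM_PER_PHI / GIB_PER_TIB,
        "total_mem_tib": NODES * mem_per_node / GIB_PER_TIB,
    }


if __name__ == "__main__":
    for name, value in system_totals().items():
        print(f"{name}: {value:,}")
```

Under these assumptions the sketch reproduces the quoted totals: 88 gigabytes per node, 3,120,000 cores, and 1,375 TiB of combined memory.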

During the testing phase, Tianhe-2 was laid out in a non-optimal confined space. Once assembled at its final location, the system was to have a theoretical peak performance of 54.9 petaflops. At peak power consumption, the system itself would draw 17.6 megawatts of power. Including external cooling, it would draw an aggregate of 24 megawatts. The completed computer complex would occupy 720 square meters of space. [1]
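Two derived figures help put these numbers in context: the fraction of theoretical peak achieved on LINPACK, and the energy efficiency in gigaflops per watt. The sketch below is illustrative only; it combines the 33.86-petaflop LINPACK result quoted in the History section with the peak and power figures in this paragraph, and the function names are hypothetical.

```python
# Figures quoted in the article.
RMAX_PFLOPS = 33.86       # measured LINPACK performance
RPEAK_PFLOPS = 54.9       # theoretical peak performance
SYSTEM_POWER_MW = 17.6    # system only
TOTAL_POWER_MW = 24.0     # including external cooling


def linpack_efficiency(rmax_pflops, rpeak_pflops):
    """Fraction of theoretical peak delivered on the LINPACK benchmark."""
    return rmax_pflops / rpeak_pflops


def gflops_per_watt(pflops, megawatts):
    """Convert petaflops and megawatts into gigaflops per watt."""
    gflops = pflops * 1e6   # 1 PFLOPS = 1,000,000 GFLOPS
    watts = megawatts * 1e6  # 1 MW = 1,000,000 W
    return gflops / watts


if __name__ == "__main__":
    print(f"LINPACK efficiency: {linpack_efficiency(RMAX_PFLOPS, RPEAK_PFLOPS):.1%}")
    print(f"System only: {gflops_per_watt(RMAX_PFLOPS, SYSTEM_POWER_MW):.2f} GFLOPS/W")
    print(f"With cooling: {gflops_per_watt(RMAX_PFLOPS, TOTAL_POWER_MW):.2f} GFLOPS/W")
```

On these inputs the machine delivers roughly 62% of its theoretical peak, at about 1.9 GFLOPS per watt for the system alone and about 1.4 GFLOPS per watt once cooling is included.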

The front-end system consisted of 4,096 Galaxy FT-1500 CPUs, a SPARC derivative designed and built by NUDT. Each FT-1500 has 16 cores and a 1.8 GHz clock frequency. The chip has a performance of 144 gigaflops and runs on 65 watts. The interconnect, called the TH Express-2 and designed by NUDT, utilized a fat tree topology with 13 switches, each with 576 ports. [1]
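For a sense of scale, the per-chip and per-switch figures above can be multiplied out. This is a rough sketch that assumes all 4,096 chips run at the quoted per-chip rate and that every switch port is counted once; neither aggregate appears in the cited sources.

```python
# Figures quoted in the text above.
FT1500_CHIPS = 4096
GFLOPS_PER_CHIP = 144
WATTS_PER_CHIP = 65
SWITCHES = 13
PORTS_PER_SWITCH = 576


def front_end_summary():
    """Aggregate front-end compute, chip power, and interconnect port counts."""
    return {
        "peak_tflops": FT1500_CHIPS * GFLOPS_PER_CHIP / 1000,
        "chip_power_kw": FT1500_CHIPS * WATTS_PER_CHIP / 1000,
        "switch_ports": SWITCHES * PORTS_PER_SWITCH,
    }


if __name__ == "__main__":
    for name, value in front_end_summary().items():
        print(f"{name}: {value:,}")
```

The front end therefore contributes on the order of 0.59 petaflops, a small fraction of the machine's 54.9-petaflop theoretical peak.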

Tianhe-2 ran on Kylin Linux, a version of the operating system developed by NUDT. Resource management was based on the Slurm Workload Manager. [1]

Criticisms

Researchers have criticized Tianhe-2 for being difficult to use. "It is at the world's frontier in terms of calculation capacity, but the functionality of the supercomputer is still way behind the ones in the US and Japan", says Chi Xuebin, deputy director of the Computer Network and Information Centre. "Some users would need years or even a decade to write the necessary code", he added. [19]

The location of Tianhe-2 is in Southern China, where the warmer weather and thus higher average temperatures could increase electricity consumption by about 10% compared with a location in Northern China.

References

  1. Dongarra, Jack (3 June 2013). "Visit to the National University for Defense Technology Changsha, China" (PDF). Netlib. Retrieved 17 June 2013.
  2. Chen, Stephen (20 June 2013). "World's fastest supercomputer may get little use".
  3. "June 2013". TOP500. Retrieved 17 June 2013.
  4. "The Top 500 List: June 2013". Retrieved 10 July 2014.
  5. Davey Alba (17 June 2013). "China's Tianhe-2 Caps Top 10 Supercomputers". IEEE Spectrum. Retrieved 19 June 2013.
  6. "US blocks Intel from selling Xeon chips to Chinese supercomputer projects". PCWorld. Retrieved 19 February 2017.
  7. "U.S. chip block could delay China's powerful 100 petaflop supercomputer". PCWorld. Retrieved 19 February 2017.
  8. Don Clark (9 April 2015). "U.S. Agencies Block Technology Exports for Supercomputer in China". The Wall Street Journal. Retrieved 9 April 2015.
  9. "China Tops Supercomputer Rankings with New 93-Petaflop Machine | TOP500".
  10. Michael Kan, IDG News Service (31 October 2012). "China is building a 100-petaflop supercomputer". infoworld.com. Retrieved 31 October 2012.
  11. "Novermber [sic] 2015 | TOP500".
  12. "List Statistics (June 2016)". TOP500.org. Retrieved 25 September 2016.
  13. "The Graph 500 List: June 2013". Graph 500. Archived from the original on 21 June 2013. Retrieved 19 June 2013.
  14. Hemsoth, Nicole (26 June 2014). "New HPC Benchmark Delivers Promising Results". HPCWire. Retrieved 8 September 2014.
  15. "China's Tianhe-2 Remains The World's Fastest Supercomputer". Forbes. Retrieved 24 June 2014.
  16. "Matrix-2000 – NUDT – WikiChip". en.wikichip.org. Retrieved 6 October 2019.
  17. Weixia, Xu (June 2014). "Hybrid hierarchy storage system in MilkyWay-2 supercomputer". Frontiers of Computer Science. 8 (3): 367–377. doi:10.1007/s11704-014-3499-6. S2CID 3439580.
  18. Yutong, Lu (5 March 2015). "Overview of Tianhe2 System and Application" (PDF). Idris. Retrieved 6 May 2016.
  19. Mimi Lau in Guangzhou (30 June 2014). "China's world-beating supercomputer fails to impress some potential clients". South China Morning Post.

Records
World's most powerful supercomputer: June 2013 – June 2016
Preceded by: Titan (17.59 petaflops)
Succeeded by: Sunway TaihuLight (93.01 petaflops)