Tianhe-2

Last updated

Tianhe-2
Tianhe-2.jpg
Sponsors 863 Program
Location National Supercomputer Center, Guangzhou, China
Architecture32,000 Intel Xeon E5-2692 12C with 2.200  GHz 48,000 Xeon Phi 31S1P
Power17.6 MW (24 MW with cooling)
Operating system Kylin Linux [1]
Memory1,375 TiB (1,000 TiB CPU and 375 TiB coprocessor) [1]
Storage12.4 PB
Speed33.86  PFLOPS
Cost2.4 billion Yuan (US$390 million) [2]
PurposeSimulation, analysis, and government security applications.

Tianhe-2 or TH-2 (Chinese :天河-2; pinyin :tiānhé-èr; lit. : 'Heavenriver-2', i.e. 'Milky Way 2') is a 33.86-petaflops supercomputer located in the National Supercomputer Center in Guangzhou, China. [3] It was developed by a team of 1,300 scientists and engineers.

Contents

It was the world's fastest supercomputer according to the TOP500 lists for June 2013, November 2013, June 2014, November 2014, June 2015, and November 2015. [4] [5] The record was surpassed in June 2016 by the Sunway TaihuLight. In 2015, plans of the Sun Yat-sen University in collaboration with Guangzhou district and city administration to double its computing capacities were stopped by a U.S. government rejection of Intel's application for an export license for the CPUs and coprocessor boards. [6] [7] [8]

In response to the U.S. sanction, China introduced the Sunway TaihuLight supercomputer in 2016, which substantially outperforms the Tianhe-2 (and also affected the update of Tianhe-2 to Tianhe-2A replacing US tech), and now ranks fourth in the TOP500 list while using completely domestic technology including the Sunway manycore microprocessor. [9]

History

The development of Tianhe-2 was sponsored by the 863 High Technology Program, initiated by the Chinese government, the government of Guangdong province, and the government of Guangzhou city. [1] It was built by China's National University of Defense Technology (NUDT) in collaboration with the Chinese IT firm Inspur. [1] [5] Inspur manufactured the printed circuit boards and helped with the installation and testing of the system software. [1] The project was originally scheduled for completion in 2015, but was instead declared operational in June 2013. [10] As of June 2013, the supercomputer had yet to become fully operational. It was expected to reach its full computing capabilities by the end of 2013. [5]

In June 2013, Tianhe-2 topped the TOP500 list of fastest supercomputers in the world and was still listed as the fastest machine in the November 2015 list. [11] The computer beat out second-place finisher Titan by nearly a 2-to-1 margin. Titan, which is housed at the U.S. Department of Energy's Oak Ridge National Laboratory, achieved 17.59 petaflops, while Tianhe-2 achieved 33.86 petaflops. Tianhe-2's performance returned the title of the world's fastest supercomputer to China after Tianhe-I's début in November 2010. The Institute of Electrical and Electronics Engineers said Tianhe-2's win "symbolizes China's unflinching commitment to the supercomputing arms race". [5] In June 2013, China housed 66 of the top 500 supercomputers, second only to the United States' 252 systems. [3] The Chinese total increased to 168 of the top 500 systems by June 2016, overtaking the United States which fell to 165 of the top 500 supercomputers. [12]

Graph500 is an alternate list of top supercomputers based on a benchmark testing analysis of graphs. [13] In their benchmark, the system tested at 2,061 gigaTEPS (traversed edges per second). The top system, IBM Sequoia, tested at 15,363 gigaTEPS. [13] It also has first place in the HPCG benchmark test proposed by Jack Dongarra, with 0.580 HPCG PFLOPS in June 2014. [14]

Tianhe-2 has been housed at National University of Defense Technology. [15]

Specifications

According to NUDT, Tianhe-2 would have been used for simulation, analysis, and government security applications. [1]

With 16,000 computer nodes, each comprising two Intel Ivy Bridge Xeon processors and three Xeon Phi coprocessor chips, it represented the world's largest installation of Ivy Bridge and Xeon Phi chips, counting a total of 3,120,000 cores [3] (because of US sanctions, the upgrades Tianhe-2A switched out the Xeon Phi accelerators for Matrix-2000, [16] and the upgraded faster system has 4,981,760 cores in total, but still dropped from 2nd to 4th place because of new faster systems added to the list). Each of the 16,000 nodes possessed 88 gigabytes of memory (64 used by the Ivy Bridge processors, and 8 gigabytes for each of the Xeon Phi processors). The total CPU plus coprocessor memory was 1,375  TiB (approximately 1.34  PiB). [1] The system has a 12.4 PiB H2FS file system consisting of IO forwarding nodes providing a 1 TiB/s burst rate backed by a Lustre file system with 100 GiB/s sustained throughput. [17] [18]

During the testing phase, Tianhe-2 was laid out in a non-optimal confined space. When assembled at its final location, the system will have had a theoretical peak performance of 54.9 petaflops. At peak power consumption, the system itself would have drawn 17.6 megawatts of power. Including external cooling, the system drew an aggregate of 24 megawatts. The completed computer complex would have occupied 720 square meters of space. [1]

The front-end system consisted of 4096 Galaxy FT-1500 CPUs, a SPARC derivative designed and built by NUDT. Each FT-1500 has 16 cores and a 1.8  GHz clock frequency. The chip has a performance of 144 gigaflops and runs on 65 watts. The interconnect, called the TH Express-2, designed by NUDT, utilized a fat tree topology with 13 switches each of 576 ports. [1]

Tianhe-2 ran on Kylin Linux, a version of the operating system developed by NUDT. Resource management is based on Slurm Workload Manager. [1]

Criticisms

Researchers have criticized Tianhe-2 for being difficult to use. "It is at the world's frontier in terms of calculation capacity, but the function of the supercomputer is still way behind the ones in the US and Japan", says Chi Xuebin, deputy director of the Computer Network and Information Centre. "Some users would need years or even a decade to write the necessary code", he added. [19]

The location of Tianhe-2 is in Southern China, where the warmer weather with higher temperature could increase the electricity consumption by about 10% compared with a location in Northern China.

See also

Related Research Articles

In computing, floating point operations per second is a measure of computer performance, useful in fields of scientific computations that require floating-point calculations. For such cases it is a more accurate measure than measuring instructions per second.

Cray Inc., a subsidiary of Hewlett Packard Enterprise, is an American supercomputer manufacturer headquartered in Seattle, Washington. It also manufactures systems for data storage and analytics. Several Cray supercomputer systems are listed in the TOP500, which ranks the most powerful supercomputers in the world.

Coprocessor supplementary computer processor that executes under the logical control of a main processor

A coprocessor is a computer processor used to supplement the functions of the primary processor. Operations performed by the coprocessor may be floating point arithmetic, graphics, signal processing, string processing, cryptography or I/O interfacing with peripheral devices. By offloading processor-intensive tasks from the main processor, coprocessors can accelerate system performance. Coprocessors allow a line of computers to be customized, so that customers who do not need the extra performance do not need to pay for it.

TOP500 Ranking of the 500 most powerful supercomputers

The TOP500 project ranks and details the 500 most powerful non-distributed computer systems in the world. The project was started in 1993 and publishes an updated list of the supercomputers twice a year. The first of these updates always coincides with the International Supercomputing Conference in June, and the second is presented at the ACM/IEEE Supercomputing Conference in November. The project aims to provide a reliable basis for tracking and detecting trends in high-performance computing and bases rankings on HPL, a portable implementation of the high-performance LINPACK benchmark written in Fortran for distributed-memory computers.

In computing, performance per watt is a measure of the energy efficiency of a particular computer architecture or computer hardware. Literally, it measures the rate of computation that can be delivered by a computer for every watt of power consumed. This rate is typically measured by performance on the LINPACK benchmark when trying to compare between computing systems.

Cray XT5 supercomputer

The Cray XT5 is an updated version of the Cray XT4 supercomputer, launched on November 6, 2007. It includes a faster version of the XT4's SeaStar2 interconnect router called SeaStar2+, and can be configured either with XT4 compute blades, which have four dual-core AMD Opteron processor sockets, or XT5 blades, with eight sockets supporting dual or quad-core Opterons. The XT5 uses a 3-dimensional torus network topology.

National University of Defense Technology University in China

The National University of Defense Technology, or People's Liberation Army National University of Defense Science and Technology, is a military academy and Class A Double First Class University located in Changsha, Hunan, China. It is under the direct leadership of China's Central Military Commission, and the dual management of the Ministry of National Defense and the Ministry of Education. It is designated for Project 211 and Project 985, the two national plans facilitating the development of Chinese higher education. NUDT was instrumental in the development of the Tianhe-2 supercomputer.

Sequoia (supercomputer)

IBM Sequoia is a petascale Blue Gene/Q supercomputer constructed by IBM for the National Nuclear Security Administration as part of the Advanced Simulation and Computing Program (ASC). It was delivered to the Lawrence Livermore National Laboratory (LLNL) in 2011 and was fully deployed in June 2012.

Manycore processors are specialist multi-core processors designed for a high degree of parallel processing, containing numerous simpler, independent processor cores. Manycore processors are used extensively in embedded computers and high-performance computing.

Tianhe-I, Tianhe-1, or TH-1 is a supercomputer capable of an Rmax of 2.5 petaFLOPS. Located at the National Supercomputing Center of Tianjin, China, it was the fastest computer in the world from October 2010 to June 2011 and is one of the few petascale supercomputers in the world.

Nebulae is a petascale supercomputer located at the National Supercomputing Center in Shenzhen, Guangdong, China. Built from a Dawning TC3600 Blade system with Intel Xeon X5650 processors and Nvidia Tesla C2050 GPUs, it has a peak performance of 1.271 petaflops using the LINPACK benchmark suite. Nebulae was ranked the second most powerful computer in the world in the June 2010 list of the fastest supercomputers according to TOP500. Nebulae has a theoretical peak performance of 2.9843 petaflops. This computer is used for multiple applications requiring advanced processing capabilities. It is ranked 10th among the June 2012 list of top500.org.

The National Supercomputing Center of Tianjin is located at the National Defense Science and Technology University in Tianjin, China. One of the fastest supercomputers in the world, Tianhe-1A, is located at the facility.

Supercomputing in China

China operates a number of supercomputer centers which, altogether, hold 29.3% performance share of world's fastest 500 supercomputers. The origins of these centers go back to 1989, when the State Planning Commission, the State Science and Technology Commission and the World Bank jointly launched a project to develop networking and supercomputer facilities in China. In addition to network facilities, the project included three supercomputer centers. China's Sunway TaihuLight ranks third in the TOP500 list.

K computer Supercomputer in Kobe, Japan

The K computer – named for the Japanese word/numeral "kei" (京), meaning 10 quadrillion (1016) – was a supercomputer manufactured by Fujitsu, installed at the Riken Advanced Institute for Computational Science campus in Kobe, Hyōgo Prefecture, Japan. The K computer was based on a distributed memory architecture with over 80,000 compute nodes. It was used for a variety of applications, including climate research, disaster prevention and medical research. The K computer's operating system was based on the Linux kernel, with additional drivers designed to make use of the computer's hardware.

Supercomputing in Europe overview about supercomputing in Europe

Several centers for supercomputing exist across Europe, and distributed access to them is coordinated by European initiatives to facilitate high-performance computing. One such initiative, the HPC Europa project, fits within the Distributed European Infrastructure for Supercomputing Applications (DEISA), which was formed in 2002 as a consortium of eleven supercomputing centers from seven European countries. Operating within the CORDIS framework, HPC Europa aims to provide access to supercomputers across Europe.

Xeon Phi series of x86 manycore processors from Intel

Xeon Phi is a series of x86 manycore processors designed and made by Intel. It is intended for use in supercomputers, servers, and high-end workstations. Its architecture allows use of standard programming languages and application programming interfaces (APIs) such as OpenMP.

FeiTeng is the name of several computer central processing units designed and produced in China for supercomputing applications. The microprocessors have been developed by Tianjin Phytium Technology. The processors have also been described as the YinHeFeiTeng family. This CPU family has been developed by a team directed by NUDT's Professor Xing Zuocheng.

Cray XC40 Supercomputer manufactured by Cray

The Cray XC40 is a massively parallel multiprocessor supercomputer manufactured by Cray. It consists of Intel Haswell Xeon processors, with optional Nvidia Tesla or Intel Xeon Phi accelerators, connected together by Cray's proprietary "Aries" interconnect, stored in air-cooled or liquid-cooled cabinets. The XC series supercomputers are available with the Cray DataWarp applications I/O accelerator technology.

The Sunway TaihuLight is a Chinese supercomputer which, as of November 2018, is ranked third in the TOP500 list, with a LINPACK benchmark rating of 93 petaflops. The name is translated as divine power, the light of Taihu Lake. This is nearly three times as fast as the previous Tianhe-2, which ran at 34 petaflops. As of June 2017, it is ranked as the 16th most energy-efficient supercomputer in the Green500, with an efficiency of 6.051 GFlops/watt. It was designed by the National Research Center of Parallel Computer Engineering & Technology (NRCPC) and is located at the National Supercomputing Center in Wuxi in the city of Wuxi, in Jiangsu province, China.

The SW26010 is a 260-core manycore processor designed by the National High Performance Integrated Circuit Design Center in Shanghai. It implements the Sunway architecture, a 64-bit reduced instruction set computing (RISC) architecture designed in China. The SW26010 has four clusters of 64 Compute-Processing Elements (CPEs) which are arranged in an eight-by-eight array. The CPEs support single instruction, multiple data (SIMD) instructions, and are capable of performing eight double-precision floating-point operations per cycle. Each cluster is accompanied by a more conventional general-purpose core called the Management Processing Element (MPE) that provides supervisory functions. Each cluster has its own dedicated DDR3 SDRAM controller, and a memory bank with its own address space. The processor runs at a clock speed of 1.45 GHz.

References

  1. 1 2 3 4 5 6 7 8 9 10 Dongarra, Jack (3 June 2013). "Visit to the National University for Defense Technology Changsha, China" (PDF). Netlib . Retrieved 17 June 2013.
  2. Chen, Stephen (20 June 2013). "World's fastest supercomputer may get little use".
  3. 1 2 3 "June 2013". TOP500 . Retrieved 17 June 2013.
  4. "The Top 500 List: June 2013" . Retrieved 10 July 2014.
  5. 1 2 3 4 Davey Alba (17 June 2013). "China's Tianhe-2 Caps Top 10 Supercomputers". IEEE Spectrum. Retrieved 19 June 2013.
  6. "US blocks Intel from selling Xeon chips to Chinese supercomputer projects". PCWorld. Retrieved 19 February 2017.
  7. "U.S. chip block could delay China's powerful 100 petaflop supercomputer". PCWorld. Retrieved 19 February 2017.
  8. Don Clark (9 April 2015). "U.S. Agencies Block Technology Exports for Supercomputer in China". Wall Street Journal tech. Retrieved 9 April 2015.
  9. http://www.top500.org/news/china-tops-supercomputer-rankings-with-new-93-petaflop-machine/
  10. Michael Kan, IDG News Service (31 October 2012). "China is building a 100-petaflop supercomputer". infoworld.com. Retrieved 31 October 2012.
  11. http://www.top500.org/lists/2015/11/
  12. "List Statistics (June 2016)". TOP500.org . Retrieved 25 September 2016.
  13. 1 2 "The Graph 500 List: June 2013". Graph 500. Archived from the original on 21 June 2013. Retrieved 19 June 2013.
  14. Hemsoth, Nicole (26 June 2014). "New HPC Benchmark Delivers Promising Results". HPCWire. Retrieved 8 September 2014.
  15. "China's Tianhe-2 Remains The World's Fastest Supercomputer". Forbes. Retrieved 24 June 2014.
  16. "Matrix-2000 - NUDT - WikiChip". en.wikichip.org. Retrieved 6 October 2019.
  17. Weixia, Xu (June 2014). "Hybrid hierarchy storage system in MilkyWay-2 supercomputer". Frontiers of Computer Science. 8 (3): 367. doi:10.1007/s11704-014-3499-6.
  18. Yutong, Lu (5 March 2015). "Overview of Tianhe2 System and Application" (PDF). Idris. Retrieved 6 May 2016.
  19. Mimi Lau in Guangzhou (30 June 2014). "China's world-beating supercomputer fails to impress some potential clients". South China Morning Post.

Further reading

Records
Preceded by
Titan
17.59 petaflops
World's most powerful supercomputer
June 2013 – June 2016
Succeeded by
Sunway TaihuLight
93.01 petaflops