|Location||National Supercomputer Center, Guangzhou, China|
|Architecture||32,000 Intel Xeon E5-2692 12C with 2.200 GHz 48,000 Xeon Phi 31S1P|
|Power||17.6 MW (24 MW with cooling)|
|Operating system||Kylin Linux|
|Memory||1,375 TiB (1,000 TiB CPU and 375 TiB coprocessor)|
|Cost||2.4 billion Yuan (US$390 million)|
|Purpose||Simulation, analysis, and government security applications.|
Tianhe-2 or TH-2 (Chinese :天河-2; pinyin :tiānhé-èr; lit. : 'Heavenriver-2', i.e. 'Milky Way 2') is a 33.86-petaflops supercomputer located in the National Supercomputer Center in Guangzhou, China. It was developed by a team of 1,300 scientists and engineers.
It was the world's fastest supercomputer according to the TOP500 lists for June 2013, November 2013, June 2014, November 2014, June 2015, and November 2015.The record was surpassed in June 2016 by the Sunway TaihuLight. In 2015, plans of the Sun Yat-sen University in collaboration with Guangzhou district and city administration to double its computing capacities were stopped by a U.S. government rejection of Intel's application for an export license for the CPUs and coprocessor boards.
In response to the U.S. sanction, China introduced the Sunway TaihuLight supercomputer in 2016, which substantially outperforms the Tianhe-2 (and also affected the update of Tianhe-2 to Tianhe-2A replacing US tech), and now ranks fourth in the TOP500 list while using completely domestic technology including the Sunway manycore microprocessor.
This section needs to be updated.September 2016)(
The development of Tianhe-2 was sponsored by the 863 High Technology Program, initiated by the Chinese government, the government of Guangdong province, and the government of Guangzhou city.It was built by China's National University of Defense Technology (NUDT) in collaboration with the Chinese IT firm Inspur. Inspur manufactured the printed circuit boards and helped with the installation and testing of the system software. The project was originally scheduled for completion in 2015, but was instead declared operational in June 2013. As of June 2013, the supercomputer had yet to become fully operational. It was expected to reach its full computing capabilities by the end of 2013.
In June 2013, Tianhe-2 topped the TOP500 list of fastest supercomputers in the world and was still listed as the fastest machine in the November 2015 list. petaflops, while Tianhe-2 achieved 33.86 petaflops. Tianhe-2's performance returned the title of the world's fastest supercomputer to China after Tianhe-I's début in November 2010. The Institute of Electrical and Electronics Engineers said Tianhe-2's win "symbolizes China's unflinching commitment to the supercomputing arms race". In June 2013, China housed 66 of the top 500 supercomputers, second only to the United States' 252 systems. The Chinese total increased to 168 of the top 500 systems by June 2016, overtaking the United States which fell to 165 of the top 500 supercomputers.The computer beat out second-place finisher Titan by nearly a 2-to-1 margin. Titan, which is housed at the U.S. Department of Energy's Oak Ridge National Laboratory, achieved 17.59
Graph500 is an alternate list of top supercomputers based on a benchmark testing analysis of graphs.In their benchmark, the system tested at 2,061 gigaTEPS (traversed edges per second). The top system, IBM Sequoia, tested at 15,363 gigaTEPS. It also has first place in the HPCG benchmark test proposed by Jack Dongarra, with 0.580 HPCG PFLOPS in June 2014.
Tianhe-2 has been housed at National University of Defense Technology.
According to NUDT, Tianhe-2 would have been used for simulation, analysis, and government security applications.
With 16,000 computer nodes, each comprising two Intel Ivy Bridge Xeon processors and three Xeon Phi coprocessor chips, it represented the world's largest installation of Ivy Bridge and Xeon Phi chips, counting a total of 3,120,000 cores TiB (approximately 1.34 PiB). The system has a 12.4 PiB H2FS file system consisting of IO forwarding nodes providing a 1 TiB/s burst rate backed by a Lustre file system with 100 GiB/s sustained throughput.(because of US sanctions, the upgrades Tianhe-2A switched out the Xeon Phi accelerators for Matrix-2000, and the upgraded faster system has 4,981,760 cores in total, but still dropped from 2nd to 4th place because of new faster systems added to the list). Each of the 16,000 nodes possessed 88 gigabytes of memory (64 used by the Ivy Bridge processors, and 8 gigabytes for each of the Xeon Phi processors). The total CPU plus coprocessor memory was 1,375
During the testing phase, Tianhe-2 was laid out in a non-optimal confined space. When assembled at its final location, the system will have had a theoretical peak performance of 54.9 petaflops. At peak power consumption, the system itself would have drawn 17.6 megawatts of power. Including external cooling, the system drew an aggregate of 24 megawatts. The completed computer complex would have occupied 720 square meters of space.
The front-end system consisted of 4096 Galaxy FT-1500 CPUs, a SPARC derivative designed and built by NUDT. Each FT-1500 has 16 cores and a 1.8 GHz clock frequency. The chip has a performance of 144 gigaflops and runs on 65 watts. The interconnect, called the TH Express-2, designed by NUDT, utilized a fat tree topology with 13 switches each of 576 ports.
Tianhe-2 ran on Kylin Linux, a version of the operating system developed by NUDT. Resource management is based on Slurm Workload Manager.
Researchers have criticized Tianhe-2 for being difficult to use. "It is at the world's frontier in terms of calculation capacity, but the function of the supercomputer is still way behind the ones in the US and Japan", says Chi Xuebin, deputy director of the Computer Network and Information Centre. "Some users would need years or even a decade to write the necessary code", he added.
The location of Tianhe-2 is in Southern China, where the warmer weather with higher temperature could increase the electricity consumption by about 10% compared with a location in Northern China.
In computing, floating point operations per second is a measure of computer performance, useful in fields of scientific computations that require floating-point calculations. For such cases it is a more accurate measure than measuring instructions per second.
Cray Inc., a subsidiary of Hewlett Packard Enterprise, is an American supercomputer manufacturer headquartered in Seattle, Washington. It also manufactures systems for data storage and analytics. Several Cray supercomputer systems are listed in the TOP500, which ranks the most powerful supercomputers in the world.
A coprocessor is a computer processor used to supplement the functions of the primary processor. Operations performed by the coprocessor may be floating point arithmetic, graphics, signal processing, string processing, cryptography or I/O interfacing with peripheral devices. By offloading processor-intensive tasks from the main processor, coprocessors can accelerate system performance. Coprocessors allow a line of computers to be customized, so that customers who do not need the extra performance do not need to pay for it.
The TOP500 project ranks and details the 500 most powerful non-distributed computer systems in the world. The project was started in 1993 and publishes an updated list of the supercomputers twice a year. The first of these updates always coincides with the International Supercomputing Conference in June, and the second is presented at the ACM/IEEE Supercomputing Conference in November. The project aims to provide a reliable basis for tracking and detecting trends in high-performance computing and bases rankings on HPL, a portable implementation of the high-performance LINPACK benchmark written in Fortran for distributed-memory computers.
In computing, performance per watt is a measure of the energy efficiency of a particular computer architecture or computer hardware. Literally, it measures the rate of computation that can be delivered by a computer for every watt of power consumed. This rate is typically measured by performance on the LINPACK benchmark when trying to compare between computing systems.
The Cray XT5 is an updated version of the Cray XT4 supercomputer, launched on November 6, 2007. It includes a faster version of the XT4's SeaStar2 interconnect router called SeaStar2+, and can be configured either with XT4 compute blades, which have four dual-core AMD Opteron processor sockets, or XT5 blades, with eight sockets supporting dual or quad-core Opterons. The XT5 uses a 3-dimensional torus network topology.
The National University of Defense Technology, or People's Liberation Army National University of Defense Science and Technology, is a military academy and Class A Double First Class University located in Changsha, Hunan, China. It is under the direct leadership of China's Central Military Commission, and the dual management of the Ministry of National Defense and the Ministry of Education. It is designated for Project 211 and Project 985, the two national plans facilitating the development of Chinese higher education. NUDT was instrumental in the development of the Tianhe-2 supercomputer.
IBM Sequoia is a petascale Blue Gene/Q supercomputer constructed by IBM for the National Nuclear Security Administration as part of the Advanced Simulation and Computing Program (ASC). It was delivered to the Lawrence Livermore National Laboratory (LLNL) in 2011 and was fully deployed in June 2012.
Manycore processors are specialist multi-core processors designed for a high degree of parallel processing, containing numerous simpler, independent processor cores. Manycore processors are used extensively in embedded computers and high-performance computing.
Tianhe-I, Tianhe-1, or TH-1 is a supercomputer capable of an Rmax of 2.5 petaFLOPS. Located at the National Supercomputing Center of Tianjin, China, it was the fastest computer in the world from October 2010 to June 2011 and is one of the few petascale supercomputers in the world.
Nebulae is a petascale supercomputer located at the National Supercomputing Center in Shenzhen, Guangdong, China. Built from a Dawning TC3600 Blade system with Intel Xeon X5650 processors and Nvidia Tesla C2050 GPUs, it has a peak performance of 1.271 petaflops using the LINPACK benchmark suite. Nebulae was ranked the second most powerful computer in the world in the June 2010 list of the fastest supercomputers according to TOP500. Nebulae has a theoretical peak performance of 2.9843 petaflops. This computer is used for multiple applications requiring advanced processing capabilities. It is ranked 10th among the June 2012 list of top500.org.
The National Supercomputing Center of Tianjin is located at the National Defense Science and Technology University in Tianjin, China. One of the fastest supercomputers in the world, Tianhe-1A, is located at the facility.
China operates a number of supercomputer centers which, altogether, hold 29.3% performance share of world's fastest 500 supercomputers. The origins of these centers go back to 1989, when the State Planning Commission, the State Science and Technology Commission and the World Bank jointly launched a project to develop networking and supercomputer facilities in China. In addition to network facilities, the project included three supercomputer centers. China's Sunway TaihuLight ranks third in the TOP500 list.
The K computer – named for the Japanese word/numeral "kei" (京), meaning 10 quadrillion (1016) – was a supercomputer manufactured by Fujitsu, installed at the Riken Advanced Institute for Computational Science campus in Kobe, Hyōgo Prefecture, Japan. The K computer was based on a distributed memory architecture with over 80,000 compute nodes. It was used for a variety of applications, including climate research, disaster prevention and medical research. The K computer's operating system was based on the Linux kernel, with additional drivers designed to make use of the computer's hardware.
Several centers for supercomputing exist across Europe, and distributed access to them is coordinated by European initiatives to facilitate high-performance computing. One such initiative, the HPC Europa project, fits within the Distributed European Infrastructure for Supercomputing Applications (DEISA), which was formed in 2002 as a consortium of eleven supercomputing centers from seven European countries. Operating within the CORDIS framework, HPC Europa aims to provide access to supercomputers across Europe.
Xeon Phi is a series of x86 manycore processors designed and made by Intel. It is intended for use in supercomputers, servers, and high-end workstations. Its architecture allows use of standard programming languages and application programming interfaces (APIs) such as OpenMP.
FeiTeng is the name of several computer central processing units designed and produced in China for supercomputing applications. The microprocessors have been developed by Tianjin Phytium Technology. The processors have also been described as the YinHeFeiTeng family. This CPU family has been developed by a team directed by NUDT's Professor Xing Zuocheng.
The Cray XC40 is a massively parallel multiprocessor supercomputer manufactured by Cray. It consists of Intel Haswell Xeon processors, with optional Nvidia Tesla or Intel Xeon Phi accelerators, connected together by Cray's proprietary "Aries" interconnect, stored in air-cooled or liquid-cooled cabinets. The XC series supercomputers are available with the Cray DataWarp applications I/O accelerator technology.
The Sunway TaihuLight is a Chinese supercomputer which, as of November 2018, is ranked third in the TOP500 list, with a LINPACK benchmark rating of 93 petaflops. The name is translated as divine power, the light of Taihu Lake. This is nearly three times as fast as the previous Tianhe-2, which ran at 34 petaflops. As of June 2017, it is ranked as the 16th most energy-efficient supercomputer in the Green500, with an efficiency of 6.051 GFlops/watt. It was designed by the National Research Center of Parallel Computer Engineering & Technology (NRCPC) and is located at the National Supercomputing Center in Wuxi in the city of Wuxi, in Jiangsu province, China.
The SW26010 is a 260-core manycore processor designed by the National High Performance Integrated Circuit Design Center in Shanghai. It implements the Sunway architecture, a 64-bit reduced instruction set computing (RISC) architecture designed in China. The SW26010 has four clusters of 64 Compute-Processing Elements (CPEs) which are arranged in an eight-by-eight array. The CPEs support single instruction, multiple data (SIMD) instructions, and are capable of performing eight double-precision floating-point operations per cycle. Each cluster is accompanied by a more conventional general-purpose core called the Management Processing Element (MPE) that provides supervisory functions. Each cluster has its own dedicated DDR3 SDRAM controller, and a memory bank with its own address space. The processor runs at a clock speed of 1.45 GHz.
| World's most powerful supercomputer |
June 2013 – June 2016