Anton (computer)

Last updated

Anton supercomputer.jpg

Anton is a massively parallel supercomputer designed and built by D. E. Shaw Research in New York, first running in 2008. It is a special-purpose system for molecular dynamics (MD) simulations of proteins and other biological macromolecules. An Anton machine consists of a substantial number of application-specific integrated circuits (ASICs), interconnected by a specialized high-speed, three-dimensional torus network. [1]

Contents

Unlike earlier special-purpose systems for MD simulations, such as MDGRAPE-3 developed by RIKEN in Japan, Anton runs its computations entirely on specialized ASICs, instead of dividing the computation between specialized ASICs and general-purpose host processors.

Each Anton ASIC contains two computational subsystems. Most of the calculation of electrostatic and van der Waals forces is performed by the high-throughput interaction subsystem (HTIS). [2] This subsystem contains 32 deeply pipelined modules running at 800 MHz arranged much like a systolic array. The remaining calculations, including the bond forces and the fast Fourier transforms (used for long-range electrostatics), are performed by the flexible subsystem. This subsystem contains four general-purpose Tensilica cores (each with cache and scratchpad memory) and eight specialized but programmable SIMD cores called geometry cores. The flexible subsystem runs at 400 MHz. [3]

Anton's network is a 3D torus and thus each chip has 6 inter-node links with a total in+out bandwidth of 607.2 Gbit/s. An inter-node link is composed of two equal one-way links (one traveling in each direction), with each one-way link having 50.6 Gbit/s of bandwidth. Each one-way link is composed of 11 lanes, where a lane is a differential pair of wires signaling at 4.6 Gbit/s. The per-hop latency in Anton's network is 50 ns. Each ASIC is also attached to its own DRAM bank, enabling large simulations. [4]

The performance of a 512-node Anton machine is over 17,000 nanoseconds of simulated time per day for a protein-water system consisting of 23,558 atoms. [5] In comparison, MD codes running on general-purpose parallel computers with hundreds or thousands of processor cores achieve simulation rates of up to a few hundred nanoseconds per day on the same chemical system. The first 512-node Anton machine became operational in October 2008. [6] The multiple petaFLOP, [7] distributed-computing project Folding@home has achieved similar aggregate ensemble simulation timescales, comparable to the total time of a single continuous simulation on Anton, specifically achieving the 1.5-millisecond range in January 2010. [8]

The Anton supercomputer is named after Anton van Leeuwenhoek, [9] who is often referred to as "the father of microscopy" because he built high-precision optical instruments and used them to visualize a wide variety of organisms and cell types for the first time.

The ANTON 2 machine with four 512 nodes and its substantially increased speed and problem size has been described. [10]

The National Institutes of Health have supported an ANTON for the biomedical research community at the Pittsburgh Supercomputing Center, Carnegie Mellon University, and currently (8/20) continues with an ANTON 2 system.

See also

Related Research Articles

<span class="mw-page-title-main">Supercomputer</span> Type of extremely powerful computer

A supercomputer is a computer with a high level of performance as compared to a general-purpose computer. The performance of a supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of million instructions per second (MIPS). Since 2017, there have existed supercomputers which can perform over 1017 FLOPS (a hundred quadrillion FLOPS, 100 petaFLOPS or 100 PFLOPS). For comparison, a desktop computer has performance in the range of hundreds of gigaFLOPS (1011) to tens of teraFLOPS (1013). Since November 2017, all of the world's fastest 500 supercomputers run on Linux-based operating systems. Additional research is being conducted in the United States, the European Union, Taiwan, Japan, and China to build faster, more powerful and technologically superior exascale supercomputers.

<span class="mw-page-title-main">IBM Blue Gene</span> Series of supercomputers by IBM

Blue Gene is an IBM project aimed at designing supercomputers that can reach operating speeds in the petaFLOPS (PFLOPS) range, with low power consumption.

<span class="mw-page-title-main">Parallel computing</span> Programming paradigm in which many processes are executed simultaneously

Parallel computing is a type of computation in which many calculations or processes are carried out simultaneously. Large problems can often be divided into smaller ones, which can then be solved at the same time. There are several different forms of parallel computing: bit-level, instruction-level, data, and task parallelism. Parallelism has long been employed in high-performance computing, but has gained broader interest due to the physical constraints preventing frequency scaling. As power consumption by computers has become a concern in recent years, parallel computing has become the dominant paradigm in computer architecture, mainly in the form of multi-core processors.

<span class="mw-page-title-main">Molecular dynamics</span> Computer simulations to discover and understand chemical properties

Molecular dynamics (MD) is a computer simulation method for analyzing the physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamic "evolution" of the system. In the most common version, the trajectories of atoms and molecules are determined by numerically solving Newton's equations of motion for a system of interacting particles, where forces between the particles and their potential energies are often calculated using interatomic potentials or molecular mechanical force fields. The method is applied mostly in chemical physics, materials science, and biophysics.

<span class="mw-page-title-main">Computer simulation</span> Process of mathematical modelling, performed on a computer

Computer simulation is the process of mathematical modelling, performed on a computer, which is designed to predict the behaviour of, or the outcome of, a real-world or physical system. The reliability of some mathematical models can be determined by comparing their results to the real-world outcomes they aim to predict. Computer simulations have become a useful tool for the mathematical modeling of many natural systems in physics, astrophysics, climatology, chemistry, biology and manufacturing, as well as human systems in economics, psychology, social science, health care and engineering. Simulation of a system is represented as the running of the system's model. It can be used to explore and gain new insights into new technology and to estimate the performance of systems too complex for analytical solutions.

<span class="mw-page-title-main">Folding@home</span> Distributed computing project simulating protein folding

Folding@home is a distributed computing project aimed to help scientists develop new therapeutics for a variety of diseases by the means of simulating protein dynamics. This includes the process of protein folding and the movements of proteins, and is reliant on simulations run on volunteers' personal computers. Folding@home is currently based at the University of Pennsylvania and led by Greg Bowman, a former student of Vijay Pande.

A physics processing unit (PPU) is a dedicated microprocessor designed to handle the calculations of physics, especially in the physics engine of video games. It is an example of hardware acceleration.

Advanced Numerical Research and Analysis Group (ANURAG) is a laboratory of the Defence Research and Development Organisation (DRDO). Located in Kanchanbagh, Hyderabad, it is involved in the development of computing solutions for numerical analysis and their use in other DRDO projects.

<span class="mw-page-title-main">SGI Origin 2000</span> Series of server computers

The SGI Origin 2000 is a family of mid-range and high-end server computers developed and manufactured by Silicon Graphics (SGI). They were introduced in 1996 to succeed the SGI Challenge and POWER Challenge. At the time of introduction, these ran the IRIX operating system, originally version 6.4 and later, 6.5. A variant of the Origin 2000 with graphics capability is known as the Onyx2. An entry-level variant based on the same architecture but with a different hardware implementation is known as the Origin 200. The Origin 2000 was succeeded by the Origin 3000 in July 2000, and was discontinued on June 30, 2002.

A massively parallel processor array, also known as a multi purpose processor array (MPPA) is a type of integrated circuit which has a massively parallel array of hundreds or thousands of CPUs and RAM memories. These processors pass work to one another through a reconfigurable interconnect of channels. By harnessing a large number of processors working in parallel, an MPPA chip can accomplish more demanding tasks than conventional chips. MPPAs are based on a software parallel programming model for developing high-performance embedded system applications.

D. E. Shaw Research (DESRES) is a privately held biochemistry research company based in New York City. Under the scientific direction of David E. Shaw, the group's chief scientist, D. E. Shaw Research develops technologies for molecular dynamics simulations and applies such simulations to basic scientific research in structural biology and biochemistry, and to the process of computer-aided drug design.

Desmond is a software package developed at D. E. Shaw Research to perform high-speed molecular dynamics simulations of biological systems on conventional computer clusters. The code uses novel parallel algorithms and numerical methods to achieve high performance on platforms containing multiple processors, but may also be executed on a single computer.

<span class="mw-page-title-main">Tianhe-1</span> Supercomputer

Tianhe-I, Tianhe-1, or TH-1 is a supercomputer capable of an Rmax of 2.5 peta FLOPS. Located at the National Supercomputing Center of Tianjin, China, it was the fastest computer in the world from October 2010 to June 2011 and was one of the few petascale supercomputers in the world.

<span class="mw-page-title-main">Supercomputer architecture</span> Design of high-performance computers

Approaches to supercomputer architecture have taken dramatic turns since the earliest systems were introduced in the 1960s. Early supercomputer architectures pioneered by Seymour Cray relied on compact innovative designs and local parallelism to achieve superior computational peak performance. However, in time the demand for increased computational power ushered in the age of massively parallel systems.

<span class="mw-page-title-main">Titan (supercomputer)</span> American supercomputer

Titan or OLCF-3 was a supercomputer built by Cray at Oak Ridge National Laboratory for use in a variety of science projects. Titan was an upgrade of Jaguar, a previous supercomputer at Oak Ridge, that uses graphics processing units (GPUs) in addition to conventional central processing units (CPUs). Titan was the first such hybrid to perform over 10 petaFLOPS. The upgrade began in October 2011, commenced stability testing in October 2012 and it became available to researchers in early 2013. The initial cost of the upgrade was US$60 million, funded primarily by the United States Department of Energy.

Adaptive sampling is a technique used in computational molecular biology to efficiently simulate protein folding when coupled with molecular dynamics simulations.

<span class="mw-page-title-main">Supercomputing in Pakistan</span> Overview of supercomputing in Pakistan

The high performance supercomputing program started in mid-to-late 1980s in Pakistan. Supercomputing is a recent area of Computer science in which Pakistan has made progress, driven in part by the growth of the information technology age in the country. Developing on the ingenious supercomputer program started in 1980s when the deployment of the Cray supercomputers was initially denied.

Daniel Ramot is Israeli-born American entrepreneur, and scientist. He is the co-founder and CEO of TransitTech company, Via.

A domain-specific architecture (DSA) is a programmable computer architecture specifically tailored to operate very efficiently within the confines of a given application domain. The term is often used in contrast to general-purpose architectures, such as CPUs, that are designed to operate on any computer program.

References

  1. David E. Shaw; Martin M. Deneroff; Ron O. Dror; Jeffrey S. Kuskin; Richard H. Larson; John K. Salmon; Cliff Young; Brannon Batson; Kevin J. Bowers; Jack C. Chao; Michael P. Eastwood; Joseph Gagliardo; J.P. Grossman; C. Richard Ho; Douglas J. Ierardi; István Kolossváry; John L. Klepeis; Timothy Layman; Christine McLeavey; Mark A. Moraes; Rolf Mueller; Edward C. Priest; Yibing Shan; Jochen Spengler; Michael Theobald; Brian Towles; Stanley C. Wang (July 2008). Anton, a special-purpose machine for molecular dynamics simulation. pp. 91–97. doi:10.1145/1364782.1364802. ISBN   978-1-59593-706-3. S2CID   52827083.{{cite book}}: |journal= ignored (help) (Related paper published in Proceedings of the 34th Annual International Symposium on Computer Architecture (ISCA '07), San Diego, California, June 9–13, 2007).
  2. Richard H. Larson; John K. Salmon; Ron O. Dror; Martin M. Deneroff; Cliff Young; J.P. Grossman; Yibing Shan; John L. Klepeis; David E. Shaw (2009). High-Throughput Pairwise Point Interactions in Anton, a Specialized Machine for Molecular Dynamics Simulation (PDF). ISBN   978-1-4244-2070-4. Archived from the original (PDF) on June 5, 2011. Retrieved January 13, 2009.{{cite book}}: |journal= ignored (help)
  3. Jeffrey S. Kuskin; Cliff Young; J.P. Grossman; Brannon Batson; Martin M. Deneroff; Ron O. Dror; David E. Shaw (2009). Incorporating Flexibility in Anton, a Specialized Machine for Molecular Dynamics Simulation (PDF). ISBN   978-1-4244-2070-4. Archived from the original (PDF) on December 4, 2008. Retrieved January 13, 2009.{{cite book}}: |journal= ignored (help)
  4. Cliff Young; Ron O. Dror; J. P. Grossman; John K. Salmon; Shaw, David E.; Joseph A. Bank; et al. (2009). "A 32x32x32, spatially distributed 3D FFT in four microseconds on Anton". Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. New York, NY: ACM. pp. 1–11. doi:10.1145/1654059.1654083. ISBN   978-1-60558-744-8. S2CID   5611246.
  5. "National Resource for Biomedical Supercomputing". Archived from the original on May 23, 2010. Retrieved May 14, 2010.
  6. David E. Shaw; Ron O. Dror; John K. Salmon; J.P. Grossman; Kenneth M. Mackenzie; Joseph A. Bank; Cliff Young; Martin M. Deneroff; Brannon Batson; Kevin J. Bowers; Edmond Chow; Michael P. Eastwood; Douglas J. Ierardi; John L. Klepeis; Jeffrey S. Kuskin; Richard H. Larson; Kresten Lindorff-Larsen; Paul Maragakis; Mark A. Moraes; Stefano Piana; Yibing Shan; Brian Towles (2009). Millisecond-Scale Molecular Dynamics Simulations on Anton (PDF). pp. 1–11. doi:10.1145/1654059.1654099. ISBN   978-1-60558-744-8. S2CID   4390504. Archived from the original (Portland, Oregon) on April 23, 2012. Retrieved April 20, 2012.{{cite book}}: |journal= ignored (help)
  7. Pande Group (March 2017). "Client Statistics by OS". Stanford University. Retrieved February 3, 2012.
  8. Vijay Pande (January 17, 2010). "Folding@home: Paper #72: Major new result for Folding@home: Simulation of the millisecond timescale" . Retrieved September 22, 2011.
  9. John Markoff (July 8, 2008). "Herculean Device for Molecular Mysteries". The New York Times . Retrieved April 25, 2010.
  10. Shaw, David E; Grossman, JP; Bank, Joseph; A Batson, Brannon; Butts, J Adam; Chao, Jack C; Deneroff, Martin M; Dror, Ron O; Even, Amos (2014). "Anton 2: Raising the Bar for Performance and Programmability in a Special-Purpose Molecular Dynamics Supercomputer". SC14: International Conference for High Performance Computing, Networking, Storage and Analysis. New Orleans, LA: ACM. pp. 41–53. doi:10.1109/SC.2014.9. ISBN   978-1-4799-5499-5. S2CID   3354876.