Bill Dally | |
---|---|
Born | William James Dally, August 17, 1960 |
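Alma mater | Virginia Tech (bachelor's), Stanford University (master's), California Institute of Technology (Ph.D.) |
Awards | Maurice Wilkes Award (2000), Seymour Cray Computer Science and Engineering Award (2004), Charles Babbage Award (2006), Eckert–Mauchly Award (2010) |
Scientific career | |
Institutions | MIT, Stanford University, Nvidia |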
Thesis | A VLSI Architecture for Concurrent Data Structures (1986) |
Doctoral advisor | Charles Seitz [1] |
Doctoral students |
William James Dally (born August 17, 1960) is an American computer scientist and educator. [1] [2] He is the chief scientist and senior vice president at Nvidia and was previously a professor of Electrical Engineering and Computer Science at Stanford University and MIT. Since 2021, he has been a member of the President's Council of Advisors on Science and Technology (PCAST). [3] [4]
He developed a number of techniques used in modern interconnection networks, including routing-based deadlock avoidance, wormhole routing, link-level retry, virtual channels, global adaptive routing, and high-radix routers. He has also developed efficient mechanisms for communication, synchronization, and naming in parallel computers, including message-driven computing and fast capability-based addressing. Starting in 1995, he developed a number of stream processors, including Imagine, for graphics, signal, and image processing, and Merrimac, for scientific computing. [citation needed]
He has published over 200 papers as well as the textbooks Digital Systems Engineering, with John Poulton, and Principles and Practices of Interconnection Networks, with Brian Towles. He is an inventor or co-inventor on over 70 granted patents.
He has been quoted as saying: "Locality is efficiency, Efficiency is power, Power is performance, Performance is king". [5]
Dally received a bachelor's degree in electrical engineering from Virginia Tech. [6] While working for Bell Telephone Laboratories, he contributed to the design of the Bellmac 32, an early 32-bit microprocessor, [citation needed] and earned a master's degree in electrical engineering from Stanford University in 1981. He then attended the California Institute of Technology (Caltech) from 1983 to 1986, [6] graduating with a Ph.D. in computer science in 1986. At Caltech he designed the MOSSIM simulation engine and an integrated circuit for routing. While at Caltech, he was part of the founding group of Stac Electronics in 1983. [7]
From 1986 to 1997 he taught at MIT, where he and his group built the J–Machine and the M–Machine, [8] parallel machines emphasizing low-overhead synchronization and communication. By his own account, during his time at MIT he collaborated on the design of the Cray T3D and Cray T3E supercomputers. He then became the Willard R. and Inez Kerr Bell Professor in the Stanford University School of Engineering and chairman of the computer science department at Stanford. He served as chairman for twelve years before moving on to Nvidia. [9]
Dally's corporate involvements include various collaborations with Cray Research since 1989. He worked on Internet routers at Avici Systems starting in 1997, was chief technical officer at Velio Communications from 1999 until its 2003 acquisition by LSI Logic, and was founder and chairman of Stream Processors, Inc. until it folded. [7]
Dally was elected a Fellow of the Association for Computing Machinery in 2002, and a Fellow of the IEEE, also in 2002. In 2003 he became a consultant for Nvidia for the first time and helped develop the GeForce 8800 GPU series. [10] He received the ACM/SIGARCH Maurice Wilkes Award in 2000, the Seymour Cray Computer Science and Engineering Award in 2004, and the IEEE Computer Society Charles Babbage Award in 2006. In 2007 he was elected to the American Academy of Arts and Sciences.
In January 2009 he was appointed chief scientist of Nvidia. [11] He worked full-time at Nvidia while supervising about 12 of his graduate students at Stanford. [12] He is currently chief scientist and senior vice president of Nvidia Research. [13]
Among his contributions to technology at Nvidia, Dally initiated work on optical interconnects for GPUs [14] and computing systems [15] using micro-ring modulators that operate on multiple wavelengths. [16] [17] These systems could lead to the adoption of very-high-bandwidth, low-energy-per-bit optical interconnects [18] in GPUs [19] and to circuit-switched GPU datacenters with a significant boost to AI computing efficiency.
In 2009, he was elected to the National Academy of Engineering for contributions to the design of high-performance interconnect networks and parallel computer architectures.
He received the 2010 ACM/IEEE Eckert–Mauchly Award for "outstanding contributions to the architecture of interconnection networks and parallel computers." [20]
Dally is married and has three children. In 1992, the Cessna 210 he was flying from Hanscom Field, Massachusetts, to Farmingdale, New York, in bad weather developed an oil leak. He was forced to make a crash landing in Long Island Sound and was picked up by a rescue sailboat. [21]