Ken Birman

Born: November 18, 1955 (age 68)
Alma mater: Columbia University; University of California, Berkeley
Occupation: N. Rama Rao Chair in Computer Science, College of Computing and Information Science, Cornell University
Spouse: Anne Neirynck
Website: www.cs.cornell.edu/ken/

Kenneth P. Birman (born November 18, 1955) is a professor in the Department of Computer Science at Cornell University. He currently holds the N. Rama Rao Chair in Computer Science.

Education

Birman received his B.S. from Columbia University and his Ph.D. from the University of California, Berkeley. [1]

Research and publications

Birman's research is mainly concerned with scalability of distributed systems, security technologies, and system management tools employed in cloud computing.

An ACM Fellow and IEEE Fellow, Birman was Editor in Chief of ACM Transactions on Computer Systems from 1993 to 1998. He is also the author of several books, most recently Reliable Distributed Computing: Technologies, Web Services, and Applications, published by Springer-Verlag in May 2007. [2]

Virtual Synchrony, Derecho, and the Isis Toolkit

Birman is best known for developing the Isis Toolkit, [3] [4] [5] which introduced the virtual synchrony execution model for multicast communication. He founded Isis Distributed Systems to commercialize this software, which was used by stock exchanges, in air traffic control, and in factory automation. The Isis software operated the New York and Swiss Stock Exchanges for more than a decade, and continues to be actively used in the French air traffic control system and on US Navy AEGIS warships. [6]

The technology permits distributed systems to automatically adapt themselves when failures or other disruptions occur, to securely share keys and security policy data, and to replicate critical services so that availability can be maintained even while some system components are down.

Birman's research group at Cornell has created a series of open-source systems. The most recent is Derecho, [7] a C++ library that provides Paxos in a form particularly well suited to modern datacenter networks, which run at very high speeds and can have extremely low node-to-node latencies. In such systems, it is important to adopt a protocol design that streams data as asynchronously as possible. Derecho is unusual among data replication options in this respect: it uses a new "receiver-driven opportunistic batching" approach, in which senders rarely need to pause when streaming high-volume data.
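The batching idea can be illustrated with a small sketch. This is not Derecho's code or API: the `Receiver` class and its `enqueue` and `drain` methods are hypothetical names, and the model assumes a single in-memory queue rather than a real network. It shows only the general pattern, in which the sender never waits to fill a batch and the receiver treats whatever has already arrived as one batch.

```python
from collections import deque


class Receiver:
    """Toy model of receiver-driven batching (hypothetical, not Derecho's API)."""

    def __init__(self):
        self.inbox = deque()   # messages that have arrived but are not yet delivered
        self.delivered = []    # payloads handed to the application, in order
        self.acks = []         # one acknowledgement per batch (highest sequence number)

    def enqueue(self, seq, payload):
        # Sender side: append and return immediately.
        # The sender never pauses to assemble a batch of a fixed size.
        self.inbox.append((seq, payload))

    def drain(self):
        # Receiver side: take everything that is available right now as one batch.
        batch = []
        while self.inbox:
            batch.append(self.inbox.popleft())
        if batch:
            self.delivered.extend(payload for _, payload in batch)
            # A single acknowledgement covers the whole batch.
            self.acks.append(batch[-1][0])
        return len(batch)
```

In this sketch the batch size is decided by the receiver at the moment it runs, so a receiver that falls behind naturally processes larger batches and sends fewer acknowledgements per message.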

Other research

Other results of Birman's Cornell research effort include Bimodal Multicast, [8] a probabilistically reliable broadcast protocol that uses the gossip paradigm, and Astrolabe, [9] a scalable tool for monitoring, managing, and data mining large systems.
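The gossip paradigm mentioned above can be sketched with a generic push-gossip simulation. This is not the Bimodal Multicast protocol itself: the function name `gossip_rounds` and its parameters are illustrative assumptions, and the model ignores message loss and node failure. In each round, every node that already holds the message forwards it to a few peers chosen at random.

```python
import random


def gossip_rounds(num_nodes, fanout=2, seed=0, max_rounds=1000):
    """Simulate push gossip; return (rounds used, number of informed nodes)."""
    rng = random.Random(seed)   # seeded so that runs are repeatable
    informed = {0}              # node 0 starts with the message
    rounds = 0
    while len(informed) < num_nodes and rounds < max_rounds:
        newly_informed = set()
        for _ in informed:
            # Each informed node pushes the message to `fanout` random peers.
            peers = rng.sample(range(num_nodes), k=min(fanout, num_nodes))
            newly_informed.update(peers)
        informed |= newly_informed
        rounds += 1
    return rounds, len(informed)
```

Calling `gossip_rounds(64)` simulates dissemination among 64 nodes. Because the informed set can at most triple per round with a fanout of 2, at least four rounds are needed, and the random peer choice is what makes delivery probabilistic rather than guaranteed within any fixed number of rounds.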

Selected awards and honors

[10]

References

A complete list of Birman's publications can be found here.

Birman's group has released a substantial body of free, downloadable software, notably its new Derecho platform. Derecho implements an optimal Paxos protocol, configurable to support classical durable Paxos with persistent storage, as well as an atomic multicast that conforms to the vertical Paxos specification but uses virtual synchrony for membership management. Derecho automatically maps to RDMA hardware when possible, and performs well both on RDMA and over standard TCP.

  1. "Ken Birman | Cornell Engineering". www.engineering.cornell.edu. Retrieved 2022-12-05.
  2. Birman, Kenneth (2012). Guide to Reliable Distributed Systems. Building High-Assurance Applications and Cloud-Hosted Services. Springer Verlag.
  3. Birman, Kenneth; Joseph, Thomas (Nov 1987). "Reliable communication in the presence of failures". ACM Transactions on Computer Systems. 5 (1): 47–76. doi:10.1145/7351.7478. hdl:1813/6534. S2CID 11224827.
  4. Birman, Kenneth; Van Renesse, Robbert (1994). Reliable distributed computing with the Isis toolkit. IEEE Computer Society Press.
  5. Birman, Kenneth (1993). "The process group approach to reliable distributed computing". Communications of the ACM. 36 (12): 37–53. CiteSeerX 10.1.1.107.4597. doi:10.1145/163298.163303. S2CID 12190528.
  6. Birman, Kenneth (July 1999). "A Review of Experiences with Reliable Multicast" (PDF). Software: Practice and Experience. 29 (9): 741–774. doi:10.1002/(sici)1097-024x(19990725)29:9<741::aid-spe259>3.0.co;2-i. hdl:1813/7380.
  7. Jha, Sagar; Behrens, Jonathan; Gkountouvas, Theo; Milano, Matthew; Song, Weijia; Tremel, Edward; van Renesse, Robbert; Zink, Sydney; Birman, Ken (April 2019). "Derecho: Fast State Machine Replication for Cloud Services". ACM Transactions on Computer Systems. 36 (2). doi:10.1145/3302258. S2CID 218482757.
  8. Birman, Kenneth; Hayden, Mark; Ozkasap, Oznur; Xiao, Zhen; Budiu, Mihai; Minsky, Yaron (1999). "Bimodal multicast". ACM Transactions on Computer Systems. 17 (2): 41–88. doi:10.1145/312203.312207. S2CID 207744063.
  9. Van Renesse, Robbert; Birman, Kenneth; Vogels, Werner (2003). "Astrolabe: A robust and scalable technology for distributed system monitoring, management, and data mining". ACM Transactions on Computer Systems. 21 (2): 164–206. doi:10.1145/762483.762485. S2CID 6204358.
  10. "Cornell University College of Engineering Faculty". 2013. Cornell University. Retrieved 20 August 2013.