CockroachDB

Last updated

Original author(s) Spencer Kimball, Peter Mattis, Ben Darnell
Developer(s) Cockroach Labs
Initial release2015;9 years ago (2015)
Stable release
23.1.1 / May 18, 2023;11 months ago (2023-05-18)
Repository
Written in Go
Available in English
Type RDBMS
License multiple
Cockroach Labs, Inc.
Company type Private
Industry Software
Founded2015;9 years ago (2015)
Founder Spencer Kimball, Peter Mattis, Ben Darnell
Headquarters New York City
Key people
Spencer Kimball (CEO)
Peter Mattis (VP of Engineering)
Ben Darnell (CTO)
Nate Stewart (Chief Product Officer)
Lindsay Grenawalt (Chief People Officer)
ServicesCommercial database management systems
Website cockroachlabs.com

CockroachDB is a commercial distributed SQL database management system developed by Cockroach Labs. [1] [2] CockroachDB is a distributed SQL database built on top of a transactional and consistent key-value store that can withstand datacenter failures. It scales horizontally, [3] offers high availability and can be run in a variety of environments such as Virtual machines, containers and cloud applications. [4] CockroachDB gets its name from cockroaches, as they are known for being disaster-resistant. [4] As all other databases, there are three goals of the database system. [5]

Contents

  1. Provide Long-Term Storage - Databases should be able to store data for long periods of time.
  2. Organization - The data should be organized & described
  3. Shareability - The data within the database should be shared and not excusive to a single user.

History

Cockroach Labs was founded in 2015 by ex-Google employees Spencer Kimball, Peter Mattis, and Ben Darnell. Kimball and Mattis had been key members of the Google File System team, [6] while Darnell was a key member of the Google Reader team. [7]

While at Google, all three had used Google-owned DBMS’s Bigtable and its successor, Spanner. [2] After leaving Google, they wanted to design and build something similar. [8] Spencer Kimball wrote the first iteration of the design in January 2014, and began the open-source project on GitHub in February 2014, allowing outside access and contributions. [9]

Development on GitHub attracted substantial contributions, which earned the project the Open Source Rookie of the Year award by Black Duck Software. [10]

The co-founders supported the project with conferences, networking, meet-ups, and fund-raising financial rounds.

In June 2019, Cockroach Labs announced that CockroachDB would change its license from the free software license Apache License 2.0 to its source-available license, known as the Business Source License (BSL), which forbids “offer[ing] a commercial version of CockroachDB as a service without buying a license,” while remaining free for community use. [11] [12]

Features

CockroachDB stores copies of data in multiple locations to deliver quick access. [8] [13]

It is described as a scalable, consistently-replicated, transactional data store. [14] A single instance can scale from a single laptop to thousands of servers. [2]

CockroachDB is designed to run in the cloud and has a high fault tolerance. According to popular news outlets, it is described as “almost impossible” to take down. [15] [16] [13]

CockroachDB has a consistency model that is designed to match as closely as possible to the capabilities of Google Spanner, but without a dependence on specialized hardware for time synchronization. "No stale reads" is the simplest way to describe this consistency model which has deliberately made the trade-off of having non-linearizable transaction histories. [17] Transactions containing overlapping keys are guaranteed to have external consistency. And so, in practice, systems relying on CockroachDB are very unlikely to reproduce consistency issues because nodes with high variations in clock skew can be removed from clusters, applications can rely on external consistency provided by overlapping keys and writing to the same range, and writes propagate changes to followers' timestamp caches. [18]

See also

Related Research Articles

<span class="mw-page-title-main">MySQL</span> SQL database engine software

MySQL is an open-source relational database management system (RDBMS). Its name is a combination of "My", the name of co-founder Michael Widenius's daughter My, and "SQL", the acronym for Structured Query Language. A relational database organizes data into one or more data tables in which data may be related to each other; these relations help structure the data. SQL is a language that programmers use to create, modify and extract data from the relational database, as well as control user access to the database. In addition to relational databases and SQL, an RDBMS like MySQL works with an operating system to implement a relational database in a computer's storage system, manages users, allows for network access and facilitates testing database integrity and creation of backups.

<span class="mw-page-title-main">Spencer Kimball (computer programmer)</span> American computer programmer, entrepreneur, and business

Spencer Kimball is an American computer programmer, entrepreneur, and business executive. He is the CEO of Cockroach Labs, a company he co-founded in 2014. His work as a programmer includes creating GNU Image Manipulation Program (GIMP) while still in college, and assisting the source code development of CockroachDB, the namesake software of Cockroach Labs. In addition to Cockroach Labs, Kimball was involved in the founding of other tech startups including WeGo and Viewfinder.

<span class="mw-page-title-main">Peter Mattis</span> American computer programmer, entrepreneur

Peter Mattis is an American computer programmer, entrepreneur, and business executive. He is the CTO and co-founder for Cockroach Labs, a company he co-founded in 2014. His work as a programmer includes launching GNU Image Manipulation Program (GIMP) while still in college, and assisting the source code development of CockroachDB, the namesake software of Cockroach Labs.

An embedded database system is a database management system (DBMS) which is tightly integrated with an application software; it is embedded in the application. It is a broad technology category that includes:

Redis is a source-available, in-memory storage, used as a distributed, in-memory key–value database, cache and message broker, with optional durability. Because it holds all data in memory and because of its design, Redis offers low-latency reads and writes, making it particularly suitable for use cases that require a cache. Redis is the most popular NoSQL database, and one of the most popular databases overall. Redis is used in companies like Twitter, Airbnb, Tinder, Yahoo, Adobe, Hulu, Amazon and OpenAI.

<span class="mw-page-title-main">MariaDB</span> Database management system

MariaDB is a community-developed, commercially supported fork of the MySQL relational database management system (RDBMS), intended to remain free and open-source software under the GNU General Public License. Development is led by some of the original developers of MySQL, who forked it due to concerns over its acquisition by Oracle Corporation in 2009.

<span class="mw-page-title-main">Open-core model</span> Business model monetizing commercial open-source software

The open-core model is a business model for the monetization of commercially produced open-source software. The open-core model primarily involves offering a "core" or feature-limited version of a software product as free and open-source software, while offering "commercial" versions or add-ons as proprietary software. The term was coined by Andrew Lampitt in 2008.

A cloud database is a database that typically runs on a cloud computing platform and access to the database is provided as-a-service. There are two common deployment models: users can run databases on the cloud independently, using a virtual machine image, or they can purchase access to a database service, maintained by a cloud database provider. Of the databases available on the cloud, some are SQL-based and some use a NoSQL data model.

<span class="mw-page-title-main">Basho Technologies</span>

Basho Technologies was a distributed systems' company that developed a key-value NoSQL database technology, Riak, and an object storage system built upon the Riak platform, called Riak CS.

<span class="mw-page-title-main">Spanner (database)</span> Cloud-based distributed SQL DBMS service

Spanner is a distributed SQL database management and storage service developed by Google. It provides features such as global transactions, strongly consistent reads, and automatic multi-site replication and failover. Spanner is used in Google F1, the database for its advertising business Google Ads, as well as Gmail and Google Photos.

NewSQL is a class of relational database management systems that seek to provide the scalability of NoSQL systems for online transaction processing (OLTP) workloads while maintaining the ACID guarantees of a traditional database system.

Aerospike Database is a real-time, high performance NoSQL database. Designed for applications that cannot experience any downtime and require high read & write throughput. Aerospike is optimized to run on NVMe SSDs capable of efficiently storing large datasets. Aerospike can also be deployed as a fully in-memory cache database. Aerospike offers Key-Value, JSON Document, and Graph data models. Aerospike is open source distributed NoSQL database management system, marketed by the company also named Aerospike.

<span class="mw-page-title-main">Redis (company)</span> American based private computer software company

Redis Ltd. is an Israeli private computer software company headquartered in Mountain View, California. Redis is the sponsor of the source-available in-memory NoSQL database of the same name and the provider of Redis Enterprise software, cloud services, and tools for global companies. The company’s research and development center is based in Tel Aviv and it has additional offices in London, Austin, and Bengaluru.

WebScaleSQL was an open-source relational database management system (RDBMS) created as a software branch of the production-ready community releases of MySQL. By joining efforts of a few companies and incorporating various changes and new features into MySQL, WebScaleSQL aimed toward fulfilling various needs arising from the deployment of MySQL in large-scale environments, which involve large amounts of data and numerous database servers.

<span class="mw-page-title-main">Ben Darnell</span> American computer programmer

Ben Darnell is an American computer programmer, entrepreneur, and business executive. He is the chief technology officer for Cockroach Labs, a company he co-founded in 2015. Prior to his work at Cockroach Labs, he worked for tech companies that include FriendFeed, Facebook, Brizzly, Dropbox, Viewfinder, and Square, Inc.

TiDB is an open-source NewSQL database that supports Hybrid Transactional and Analytical Processing (HTAP) workloads. Designed to be MySQL compatible, it is developed and supported primarily by PingCAP and licensed under Apache 2.0. It is also available as a paid product. TiDB drew its initial design inspiration from Google's Spanner and F1 papers.

A distributed SQL database is a single relational database which replicates data across multiple servers. Distributed SQL databases are strongly consistent and most support consistency across racks, data centers, and wide area networks including cloud availability zones and cloud geographic zones. Distributed SQL databases typically use the Paxos or Raft algorithms to achieve consensus across multiple nodes.

<span class="mw-page-title-main">YugabyteDB</span> Transactional distributed SQL database

YugabyteDB is a high-performance transactional distributed SQL database for cloud-native applications, developed by Yugabyte.

References

  1. Ovide, Shira (June 4, 2015). "CockroachDB Scampers Off With $6.3 Million to Tackle Database Shortcomings". The Wall Street Journal. Retrieved February 22, 2020.
  2. 1 2 3 Finley, Klint (June 4, 2015). "Ex-Googlers Get Millions to Help You Build the Next Google". Wired. Retrieved February 22, 2020.
  3. Heller, Martin (January 4, 2018). "CockroachDB review: A scale-out SQL database built for survival". InfoWorld. Retrieved March 19, 2024.
  4. 1 2 Pina, Eduardo; Sá, Filipe; Bernardino, Jorge (January 2023). "NewSQL Databases Assessment: CockroachDB, MariaDB Xpand, and VoltDB". Future Internet. 15 (1): 10. doi: 10.3390/fi15010010 . hdl: 10316/114892 . ISSN   1999-5903.
  5. Huawei Technologies Co., Ltd (2023). Database Principles and Technologies – Based on Huawei GaussDB. Springer Nature. hdl:20.500.12657/59331. ISBN   978-981-19-3032-4.
  6. Metz, Cade (July 10, 2012). "Google Remakes Online Empire with 'Colossus'". Wired. Retrieved February 22, 2020.
  7. Wauters, Robin (July 28, 2009). "Steal! Ben Darnell Leaves Google Reader Team, Joins FriendFeed". TechCrunch. Retrieved February 22, 2020.
  8. 1 2 Novet, Jordan (June 4, 2015). "Peter Fenton's latest investment is a database startup called Cockroach". VentureBeat. Retrieved February 22, 2020.
  9. Ewbank, Kay (June 9, 2015). "CockroachDB Released". I Programmer. Retrieved February 22, 2020.
  10. Finleey, Klint (January 28, 2015). "These are the hottest new open-source projects right now". Wired. Retrieved February 22, 2020.
  11. Mattis, Peter; Darnell, Ben; Kimball, Spencer (June 4, 2019). "Why We're Relicensing CockroachDB". CockroachLabs.com. Retrieved February 22, 2020.
  12. "CockroachDB Community License". Cockroach Labs. Retrieved July 28, 2020.
  13. 1 2 Babcock, Charles (June 4, 2015). "CockroachDB: Ultimate in Database Survival". InformationWeek. Retrieved February 22, 2020.
  14. Darfler, Benjamin (August 29, 2014). "CockroachDB: A Scalable, Geo-Replicated, Transactional Datastore". InfoQ. Retrieved February 22, 2020.
  15. Finley, Klint (July 22, 2014). "CockroachDB is the resilient cloud software built by ex-Googlers". Wired. Archived from the original on May 14, 2015. Retrieved February 22, 2020.
  16. Finley, Klint (July 21, 2014). "Out in the Open: Ex-Googlers Building Cloud Software That's Almost Impossible to Take Down". Wired. Retrieved February 22, 2020.
  17. Matei, Andrei (January 24, 2019). "CockroachDB's consistency model".
  18. Cordell, Evan. "The One Crucial Difference Between Spanner and CockroachDB".