In September 2021, ClickHouse, Inc. was incorporated in San Francisco, CA, to house the open-source technology, with an initial $50 million investment from Index Ventures and Benchmark Capital, with participation by Yandex N.V.[2] and others. On October 28, 2021, the company raised $250 million in Series B funding at a $2 billion valuation from Coatue Management, Altimeter Capital, and other investors. The company continues to develop the open-source project and to build its cloud technology.
History
ClickHouse’s technology was first developed at Yandex, Russia's largest technology company.[3] In 2009, Alexey Milovidov and a team of developers started an experimental project to test whether it was viable to generate analytical reports in real time from non-aggregated data that is also continuously added in real time. The developers spent three years proving this hypothesis, and in 2012 ClickHouse launched in production for the first time to power Yandex.Metrica.
In 2016, the ClickHouse project was released as open-source software under the Apache License 2.0 to power analytical use cases around the globe. Comparable systems at the time offered per-server throughput of around a hundred thousand rows per second; ClickHouse outperformed them with throughput of hundreds of millions of rows per second.[citation needed]
Since ClickHouse became available as open source in 2016, its popularity has grown rapidly, as evidenced by adoption at industry-leading companies such as Uber, Comcast, eBay, and Cisco.[4] ClickHouse was also implemented at CERN's LHCb experiment to store and process metadata on 10 billion events with over 1,000 attributes per event.[5]
Features
True column-oriented DBMS. No extra data is stored with the values. For example, constant-length values are supported so that a separate "length" number does not have to be stored next to the values.
Linear scalability. It's possible to extend a cluster by adding servers.
Fault tolerance. The system is a cluster of shards, where each shard is a group of replicas. ClickHouse uses asynchronous multi-master replication: data is written to any available replica and then distributed to all the remaining replicas. ClickHouse Keeper (a C++ replacement for ZooKeeper) coordinates processes such as data replication, but is not involved in query processing and execution.
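For illustration, a minimal sketch of a replicated table in ClickHouse SQL; the table name, columns, and Keeper path are hypothetical, and `{shard}`/`{replica}` are the usual server-side macros:

```sql
-- Each replica of this table registers under the given Keeper/ZooKeeper path
-- and asynchronously pulls parts written to any other replica.
CREATE TABLE hits_replicated
(
    event_time DateTime,
    url String
)
ENGINE = ReplicatedMergeTree('/clickhouse/tables/{shard}/hits', '{replica}')
ORDER BY event_time;
```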
Capability to store and process petabytes of data.
SQL support. ClickHouse supports an extended SQL-like dialect that includes arrays and nested data structures, approximate and URI functions, and the ability to connect an external key-value store.
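A brief sketch of the array and nested-structure support; the table and data are hypothetical:

```sql
-- Arrays and nested structures are first-class column types.
CREATE TABLE page_visits
(
    visit_date Date,
    user_id UInt64,
    tags Array(String),
    events Nested(name String, ts DateTime)
)
ENGINE = MergeTree
ORDER BY (visit_date, user_id);

-- Higher-order functions operate on array columns directly.
SELECT user_id, arrayFilter(t -> startsWith(t, 'promo'), tags) AS promo_tags
FROM page_visits
WHERE has(tags, 'checkout');
```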
A vectorized query engine that parallelizes execution to maximize hardware utilization, selecting the most optimized SIMD variant based on the host CPU.
Data is written as independent table parts without global coordination, enabling fast, parallel inserts. Background merge operations combine parts asynchronously to optimize query performance and storage efficiency.
Inserts are fully isolated from SELECT queries, and merging inserted data parts happens in the background so as to minimize the impact on concurrent queries.
Primary key indexes to define the sort order of table data to enable efficient binary search during query execution, reducing scan time from linear to logarithmic.
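As a sketch of how the sort order doubles as a sparse primary index (hypothetical table):

```sql
-- ORDER BY defines both the on-disk sort order and the sparse primary index.
CREATE TABLE events
(
    site_id UInt32,
    event_time DateTime,
    url String
)
ENGINE = MergeTree
ORDER BY (site_id, event_time);

-- A filter on the leading key columns is resolved by binary search over
-- the index marks rather than a full scan.
SELECT count()
FROM events
WHERE site_id = 42 AND event_time >= '2024-01-01 00:00:00';
```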
Table projections for alternative sort orders, storing internal copies of data sorted by different keys to optimize performance for multiple common filter patterns.
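A minimal projection sketch, reusing the hypothetical `events` table above:

```sql
-- The projection stores a hidden copy of the data sorted by url,
-- which the optimizer can choose for url-filtered queries.
ALTER TABLE events ADD PROJECTION by_url
(
    SELECT site_id, event_time, url ORDER BY url
);
ALTER TABLE events MATERIALIZE PROJECTION by_url;
```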
Skipping indexes to avoid unnecessary reads by adding lightweight column-level statistics (e.g. min/max, unique values) to accelerate filter evaluation by skipping irrelevant data blocks.
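A sketch of two common skipping-index types, again on the hypothetical `events` table:

```sql
-- minmax stores per-block minimum/maximum values; blocks whose range
-- cannot match the filter are skipped entirely.
ALTER TABLE events ADD INDEX time_mm event_time TYPE minmax GRANULARITY 4;

-- A bloom filter accelerates equality/IN filters on string columns.
ALTER TABLE events ADD INDEX url_bf url TYPE bloom_filter GRANULARITY 4;
ALTER TABLE events MATERIALIZE INDEX url_bf;
```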
Sampling and approximate calculations are supported.
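For illustration (hypothetical table; `SAMPLE` requires a `SAMPLE BY` key in the table definition):

```sql
-- uniq() estimates distinct counts with bounded error, far cheaper than uniqExact().
SELECT uniq(user_id) AS approx_visitors FROM visits;

-- SAMPLE reads roughly 10% of the data and extrapolates.
SELECT avg(duration_ms) FROM visits SAMPLE 0.1;
```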
Parallel and distributed query processing is available (including JOINs).
Data compression. A column-oriented design, in which values are sorted by an explicit ordering, allows data to be efficiently compressed due to similar values being adjacent on disk. Configurable compression algorithms, such as Zstandard (Zstd), which combines high speed with effective compression, and LZ4 (compression algorithm), known for its rapid (de)compression, as well as configurable codecs such as Delta encoding, allow for high compression rates to be achieved.
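A sketch of per-column codec configuration (hypothetical table):

```sql
CREATE TABLE metrics
(
    -- Delta stores successive differences, which ZSTD then compresses well.
    ts DateTime CODEC(Delta, ZSTD),
    -- Gorilla is a float-oriented codec; ZSTD compresses the residue.
    value Float64 CODEC(Gorilla, ZSTD),
    -- LZ4 favors (de)compression speed over ratio.
    label String CODEC(LZ4)
)
ENGINE = MergeTree
ORDER BY ts;
```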
Complex type support. Including semi-structured data such as JSON, where the schema is determined at write time based on the fields present.
Vector search support. Available through distance functions, with both exact matching and approximate nearest neighbor (ANN) search indices.[8]
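A sketch of exact nearest-neighbor search via a distance function; the table, the `embedding` column (an `Array(Float32)`), and the query vector are hypothetical:

```sql
-- Exact search: compute the distance to every row and keep the closest ones.
SELECT id, cosineDistance(embedding, [0.1, 0.2, 0.3, 0.4]) AS dist
FROM documents
ORDER BY dist ASC
LIMIT 5;
```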
Change Data Capture (CDC). Through its acquisition of PeerDB, an open-source CDC solution, ClickHouse can mirror inserts, updates, and deletes from external databases such as PostgreSQL in near real time.[9]
Dictionaries. ClickHouse provides in-memory key-value stores known as Dictionaries, which enable efficient enrichment and accelerate `LEFT ANY JOIN` queries.[10]
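A minimal dictionary sketch; the source table and attributes are hypothetical:

```sql
-- An in-memory hashed dictionary periodically refreshed from a ClickHouse table.
CREATE DICTIONARY country_dict
(
    country_id UInt64,
    name String
)
PRIMARY KEY country_id
SOURCE(CLICKHOUSE(TABLE 'countries'))
LIFETIME(MIN 300 MAX 600)
LAYOUT(HASHED());

-- dictGet performs a point lookup instead of a join.
SELECT dictGet('country_dict', 'name', toUInt64(42));
```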
Open file formats. ClickHouse natively supports reading and writing open formats such as Parquet and Avro.[12]
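For illustration, reading and writing Parquet via the `file()` table function; the file paths are illustrative:

```sql
-- Query a Parquet file in place, without loading it into a table first.
SELECT count() FROM file('events.parquet', 'Parquet');

-- Export query results back out as Parquet.
INSERT INTO FUNCTION file('export.parquet', 'Parquet')
SELECT * FROM events LIMIT 1000;
```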
Open table formats. The system can query modern open table formats including Apache Iceberg and Delta Lake, enabling interoperability with data lakehouse ecosystems.[13]
Row deduplication engines. Multiple table engines in the MergeTree family, such as `ReplacingMergeTree`, support asynchronous row merging logic to remove duplicates efficiently.[14]
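A `ReplacingMergeTree` sketch; the table and the `ver` version column are hypothetical:

```sql
-- Rows sharing the same sorting key are collapsed during background merges;
-- the row with the highest ver value survives.
CREATE TABLE user_profiles
(
    user_id UInt64,
    email String,
    ver UInt64
)
ENGINE = ReplacingMergeTree(ver)
ORDER BY user_id;

-- FINAL forces deduplication at query time, at some extra cost.
SELECT * FROM user_profiles FINAL WHERE user_id = 1;
```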
Incremental materialized views. ClickHouse supports incremental updates to materialized views, where partial aggregation states can be stored and refreshed without recomputing the full dataset.[15]
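A sketch of an incremental materialized view storing partial aggregation states; the target table and the source table (assumed to have `event_time` and `user_id` columns) are hypothetical:

```sql
-- Target table holds -State aggregates, merged incrementally on insert.
CREATE TABLE daily_uniques
(
    day Date,
    users AggregateFunction(uniq, UInt64)
)
ENGINE = AggregatingMergeTree
ORDER BY day;

-- The view converts each inserted block into partial states.
CREATE MATERIALIZED VIEW daily_uniques_mv TO daily_uniques AS
SELECT toDate(event_time) AS day, uniqState(user_id) AS users
FROM events
GROUP BY day;

-- -Merge finalizes the partial states at read time.
SELECT day, uniqMerge(users) AS unique_users
FROM daily_uniques
GROUP BY day;
```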
Refreshable materialized views. In addition to incremental updates, ClickHouse supports refreshable materialized views that periodically execute queries and persist their results into target tables.[16]
Historically, UPDATE and DELETE operations in ClickHouse were implemented as background mutations that rewrote table parts, a reliable but expensive process unsuited to frequent row-level changes. Lightweight deletes later reduced this cost by rewriting only a deletion mask column. More recently, ClickHouse introduced patch parts, which record only the modified values and their row positions. These patches are applied immediately at query time and merged in the background, enabling efficient updates and "featherweight deletes", where deletions are expressed as compact patches that remove rows with minimal overhead.[26]
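For illustration, the two row-level modification paths in ClickHouse SQL (table and columns are illustrative):

```sql
-- Lightweight delete: rows are masked out rather than whole parts rewritten.
DELETE FROM events WHERE url LIKE '%spam%';

-- Mutation-based update: affected parts are rewritten in the background.
ALTER TABLE events UPDATE url = '' WHERE site_id = 0;
```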
Use cases
ClickHouse was designed for OLAP queries.[6] ClickHouse performs well when:
It works with wide tables that contain a large number of columns (up to 1,000 recommended).
Queries read a large number of rows from the database, but only a small subset of columns.
Queries are relatively rare (usually around 100 requests per second per server).
High throughput is required when processing a single query (up to billions of rows per second per server).
Query results are significantly smaller than the source data; that is, the data is filtered or aggregated.
Data update uses a simple scenario (usually batch-only, without complicated transactions).
For simple queries, latencies of 50 ms are typical.
One of the common cases for ClickHouse is server log analysis. After setting up regular data uploads to ClickHouse (inserting in fairly large batches of more than 1,000 rows is recommended), it is possible to analyze incidents with instant queries or monitor service metrics such as error rates and response times.
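As an illustration of the log-analysis use case, a monitoring-style query over a hypothetical `service_logs` table:

```sql
-- Per-minute error rate and p99 latency over the last hour.
SELECT
    toStartOfMinute(ts) AS minute,
    countIf(status >= 500) / count() AS error_rate,
    quantile(0.99)(response_ms) AS p99_ms
FROM service_logs
WHERE ts >= now() - INTERVAL 1 HOUR
GROUP BY minute
ORDER BY minute;
```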
ClickHouse can also be used as an internal data warehouse for in-house analysts. ClickHouse can store data from different systems (such as Hadoop or certain logs) and analysts can build internal dashboards with the data or perform real-time analysis for business purposes.
Benchmark results
According to benchmark tests conducted by its developers,[7] for OLAP queries ClickHouse is more than 100 times faster than Hive (a DBMS based on the Hadoop technology stack) or MySQL (a common RDBMS).
ClickHouse Inc. maintains ClickBench, an open and reproducible benchmark for analytical database systems, based on real-world web analytics data and designed to evaluate performance across diverse OLAP, OLTP, and cloud-native databases using standardized SQL queries and realistic workloads.[27] A related benchmark, JSONBench, evaluates the JSON analytics capabilities of modern database systems using a real-world dataset of one billion Bluesky events.[28]