![]() | This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these messages)
|
![]() | |
Developer(s) | Zilliz |
---|---|
Initial release | October 19, 2019 |
Stable release | v2.5.4 / January 24, 2025 .: [1] |
Repository | github |
Written in | C++, Go |
Operating system | Linux, macOS |
Platform | x86, ARM |
Type | Vector database |
License | Apache License 2.0 |
Website | milvus |
Milvus is a distributed vector database developed by Zilliz. It is available as both open-source software and a cloud service.
Milvus is an open-source project under LF AI & Data Foundation [2] distributed under the Apache License 2.0.
Milvus has been developed by Zilliz since 2017. [3]
Milvus joined Linux Foundation as an incubation project in January 2020 and became a graduate in June 2021. [2] The details about its architecture and possible applications were presented on ACM SIGMOD Conference in 2021 [4]
Milvus 2.0, a major redesign of the whole product with a new architecture, [5] was released in January 2022.
Major similarity search related features that are available in the active 2.4.x Milvus branch: [6]
Milvus similarity search engine relies on heavily-modified forks of third-party open-source similarity search libraries, such as Faiss, [7] [8] DiskANN [9] [10] and hnswlib. [11]
Milvus includes optimizations for I/O data layout, specific to graph search indices. [12]
As a database, Milvus provides the following features: [6]
Milvus can be deployed as an embedded database, standalone server, or distributed cluster. Zillis Cloud offers a fully managed version. [16]
Milvus provides GPU accelerated index building and search using Nvidia CUDA technology [17] [18] via Nvidia RAFT library, [19] including a recent GPU-based graph indexing algorithm Nvidia CAGRA [20]
Milvus provides official SDK clients for Java, NodeJS, Python and Go. [21] An additional C# SDK client was contributed by Microsoft. [6] [22] The database can integrate with Prometheus and Grafana for monitoring and alerts, frameworks Haystack [23] and LangChain, [24] IBM Watsonx, [25] and OpenAI models. [26] [27]