SQream DB

Last updated
SQream
Developer(s) SQream Technologies Ltd.
Initial release2014 (2014)
Stable release
2021.2 [1] / 13 September 2021;19 months ago (2021-09-13)
Written in CUDA, C++, Haskell [2]
Operating system Linux
Platform Linux
Type RDBMS
License Proprietary
Website sqream.com

SQream is a relational database management system (RDBMS) that uses graphics processing units (GPUs) from Nvidia. SQream is designed for big data analytics using the Structured Query Language (SQL). [3]

Contents

History

SQream is the first product from SQream Technologies Ltd, founded in 2010 by Ami Gal and Kostya Varakin in Tel Aviv, Israel. [4]

SQream was first released in 2014 [5] after a partnership with an Orange S.A. in Silicon Valley. [6] [7] The company claimed Orange S.A. saved $6 million by using SQream in 2014. [8] [9] SQream is aimed at the budget multi-terabyte analytics market, due to its modest hardware requirements and use of compression. [10]

SQream is also the basis for a product named GenomeStack, for querying many DNA sequences simultaneously. [11] [12] A US$7.4M investment of venture capital was announced in June 2015. [13] It is an example of general-purpose computing on graphics processing units, alongside Omnisci and Kinetica. [14]

The company applied for patents, encompassing parallel execution queries on multi-core processors and speeding up parallel execution on vector processors. [15] [16] [17]

In February 2018, SQream Technologies partnered with Alibaba group's Alibaba Cloud to deliver a GPU Database solution on Alibaba Cloud. [18]

In December 2021, SQream announced that it had acquired no-code data platform Panoply for an undisclosed sum, as part of the push to grow its cloud offering. [19]

Software and features

The column-oriented database SQream platform was designed to manage large, fast-growing volumes of data, for compute-intensive queries. The product claims to improve query performance for very large datasets, over traditional relational database systems.

SQream is designed to run on premise or in the public cloud. [20]

Related Research Articles

<span class="mw-page-title-main">Nvidia</span> American technology company

Nvidia Corporation is an American multinational technology company incorporated in Delaware and based in Santa Clara, California. It is a software and fabless company which designs graphics processing units (GPUs), application programming interface (APIs) for data science and high-performance computing as well as system on a chip units (SoCs) for the mobile computing and automotive market. Nvidia is a dominant supplier of artificial intelligence hardware and software. Its professional line of GPUs are used in workstations for applications in such fields as architecture, engineering and construction, media and entertainment, automotive, scientific research, and manufacturing design.

<span class="mw-page-title-main">IBM Db2</span> Relational model database server

Db2 is a family of data management products, including database servers, developed by IBM. It initially supported the relational model, but was extended to support object–relational features and non-relational structures like JSON and XML. The brand name was originally styled as DB/2, then DB2 until 2017 and finally changed to its present form.

Teradata Corporation is an American software company that provides cloud database and analytics-related software, products, and services. The company was formed in 1979 in Brentwood, California, as a collaboration between researchers at Caltech and Citibank's advanced technology group.

<span class="mw-page-title-main">Graphics processing unit</span> Specialized electronic circuit; graphics accelerator

A graphics processing unit (GPU) is a specialized electronic circuit designed to manipulate and alter memory to accelerate the creation of images in a frame buffer intended for output to a display device. GPUs are used in embedded systems, mobile phones, personal computers, workstations, and game consoles.

<span class="mw-page-title-main">CUDA</span> Parallel computing platform and programming model

CUDA is a parallel computing platform and application programming interface (API) that allows software to use certain types of graphics processing units (GPUs) for general purpose processing, an approach called general-purpose computing on GPUs (GPGPU). CUDA is a software layer that gives direct access to the GPU's virtual instruction set and parallel computational elements, for the execution of compute kernels.

IBM Netezza is a subsidiary of American technology company IBM that designs and markets high-performance data warehouse appliances and advanced analytics applications for uses including enterprise data warehousing, business intelligence, predictive analytics and business continuity planning.

<span class="mw-page-title-main">Greenplum</span>

Greenplum is a big data technology based on MPP architecture and the Postgres open source database technology. The technology was created by a company of the same name headquartered in San Mateo, California around 2005. Greenplum was acquired by EMC Corporation in July 2010.

<span class="mw-page-title-main">Vertica</span> Software company

Vertica Systems is an analytic database management software company. Vertica was founded in 2005 by the database researcher Michael Stonebraker, with Andrew Palmer as the founding CEO. Ralph Breslauer and Christopher P. Lynch served as later CEOs.

HPCC, also known as DAS, is an open source, data-intensive computing system platform developed by LexisNexis Risk Solutions. The HPCC platform incorporates a software architecture implemented on commodity computing clusters to provide high-performance, data-parallel processing for applications utilizing big data. The HPCC platform includes system configurations to support both parallel batch data processing (Thor) and high-performance online query applications using indexed data files (Roxie). The HPCC platform also includes a data-centric declarative programming language for parallel data processing called ECL.

InfiniteGraph is a distributed graph database implemented in Java and C++ and is from a class of NOSQL database technologies that focus on graph data structures. Developers use InfiniteGraph to find useful and often hidden relationships in highly connected, complex big data sets. InfiniteGraph is cross-platform, scalable, cloud-enabled, and is designed to handle very high throughput.

<span class="mw-page-title-main">Actian Vector</span>

Actian Vector is an SQL relational database management system designed for high performance in analytical database applications. It published record breaking results on the Transaction Processing Performance Council's TPC-H benchmark for database sizes of 100 GB, 300 GB, 1 TB and 3 TB on non-clustered hardware.

The OpenPOWER Foundation is a collaboration around Power ISA-based products initiated by IBM and announced as the "OpenPOWER Consortium" on August 6, 2013. IBM is opening up technology surrounding their Power Architecture offerings, such as processor specifications, firmware and software with a liberal license, and will be using a collaborative development model with their partners.

<span class="mw-page-title-main">NVLink</span> High speed chip interconnect

NVLink is a wire-based serial multi-lane near-range communications link developed by Nvidia. Unlike PCI Express, a device can consist of multiple NVLinks, and devices use mesh networking to communicate instead of a central hub. The protocol was first announced in March 2014 and uses a proprietary high-speed signaling interconnect (NVHS).

Single instruction, multiple threads (SIMT) is an execution model used in parallel computing where single instruction, multiple data (SIMD) is combined with multithreading. It is different from SPMD in that all instructions in all "threads" are executed in lock-step. The SIMT execution model has been implemented on several GPUs and is relevant for general-purpose computing on graphics processing units (GPGPU), e.g. some supercomputers combine CPUs with GPUs.

Kinetica is a distributed, memory-first OLAP database developed by Kinetica DB, Inc. Kinetica is designed to use GPUs and modern vector processors to improve performance on complex queries across large volumes of real-time data. Kinetica is well suited for analytics on streaming geospatial and temporal data.

Presto is a distributed query engine for big data using the SQL query language. Its architecture allows users to query data sources such as Hadoop, Cassandra, Kafka, AWS S3, Alluxio, MySQL, MongoDB and Teradata, and allows use of multiple data sources within a query. Presto is community-driven open-source software released under the Apache License.

HEAVY.AI is an American-based software company, that uses graphics processing units (GPUs) and central processing units (CPUs) to query and visualize big data. The company was founded in 2013 by Todd Mostak and Thomas Graham and is headquartered in San Francisco, California.

<span class="mw-page-title-main">Ampere (microarchitecture)</span> GPU microarchitecture by Nvidia

Ampere is the codename for a graphics processing unit (GPU) microarchitecture developed by Nvidia as the successor to both the Volta and Turing architectures. It was officially announced on May 14, 2020 and is named after French mathematician and physicist André-Marie Ampère.

NVIDIA GTC is a global AI conference for developers that brings together developers, engineers, researchers, inventors, and IT professionals. Topics focus on artificial intelligence (AI), computer graphics, data science, machine learning and autonomous machines. Each conference begins with a keynote from Nvidia CEO and Founder Jensen Huang, followed by a variety of sessions and talks with experts from around the world.

<span class="mw-page-title-main">Ada Lovelace (microarchitecture)</span> GPU microarchitecture by Nvidia


Ada Lovelace, also referred to simply as Lovelace, is the codename for a graphics processing unit (GPU) microarchitecture developed by Nvidia as the successor to the Ampere architecture, officially announced on September 20, 2022. It is named after English mathematician Ada Lovelace who is often regarded as the first computer programmer and is the first architecture to include both a first and last name. Nvidia announced the architecture along with the new GeForce 40 series consumer GPUs and the RTX 6000 Ada Generation pro workstation graphics card. The new GPUs were revealed to use TSMC's new 5 nm "4N" process which offers increased efficiency over the previous Samsung 8 nm and TSMC N7 processes used by Nvidia for its last generation Ampere architecture.

References

  1. SQream Technologies (September 13, 2021). "What's new in 2021.2". SQream Technologies. Retrieved September 21, 2020.
  2. Jake Wheat (September 25, 2013). "Using Haskell at SQream Technologies". SQream Technologies. Retrieved July 9, 2018.
  3. Laura Rosbrow-Telem (June 9, 2015). "This insanely fast big data startup uses only one server – and just got $7.4M in funding". Geektime. Retrieved March 28, 2017.
  4. Rachel Wolfson (August 15, 2016). "Q&A with Big Data Thought Leader, Ami Gal – Data Natives Tel Aviv 2016". DataConomy. Retrieved March 28, 2017.
  5. "SQream Tech unveils new big data platform". Geektime.
  6. Timothy Prickett Morgan (March 28, 2014). "Telco Calls On GPU-Native SQream SQL Database". Enterprise Tech. Retrieved March 28, 2017.
  7. "IBM, Orange Use GPUs for Next Generation Enterprise Big Data Analytics at GTC". Nvidia Blog. Retrieved 5 October 2014.
  8. "Getting big data done on a GPU-Based database" (PDF). GPU Technology Conference. Retrieved 5 October 2014.
  9. "SQream Technologies and Orange Silicon Valley Demo Groundbreaking Big Data Platform at GTC". PRWeb. 26 March 2014.
  10. "A Shoebox-Size Data Warehouse Powered by GPUs". Datanami.
  11. "April News From the Bio-IT World Conference and Around the Industry". bio-itworld.com.
  12. "סטארט-אפ בשבוע: מאגר מידע". Israel Globes. May 18, 2015. (in Hebrew)
  13. "SQream Raises $7.4M in Funding Round". Genomeweb (Press release). June 9, 2015. Retrieved June 22, 2015. (Registration required)
  14. Timothy Prickett Morgan (September 22, 2016). "Pushing Database Scalability Up And Out With GPUs". The Next Platform. Retrieved March 28, 2017.
  15. "Patent WO 2012025915 A1 - A system and method for the parallel execution of database queries over cpus and multi core processors". Google Patents. Retrieved 5 October 2014.
  16. "Patent WO 2012025915 A8 - A system and method for the parallel execution of database queries over cpus and multi core processors". Google Patents. Retrieved 5 October 2014.
  17. "Patent WO 2014020605 A1 - A method for pre-processing and processing query operation on multiple data chunk on vector enabled architecture". Google Patents. Retrieved 5 October 2014.
  18. "SQream teams with Alibaba, doubling workforce" . Retrieved 20 February 2018.
  19. "SQream acquires no-code data platform Panoply". TechCrunch. Retrieved 2022-01-03.
  20. "SQream Technologies Launches Beta of GPU Database SQream DB on AWS Cloud". Yahoo Finance. Retrieved 5 October 2017.