Infobright

Last updated
Infobright, Inc.
TypePrivate
Industry Enterprise software & Database management & Data warehousing
Founded2005;18 years ago (2005)
FounderCas Apanowicz
Dominik Ślęzak
Piotr Synak
Jakub Wróblewski
Headquarters Toronto, Ontario, Canada
ProductsInfobright Enterprise Edition (formerly Brighthouse), Infobright Community Edition
Website infobright.com

Infobright is a commercial provider of column-oriented relational database software with a focus in machine-generated data. The company's head office is located in Toronto, Ontario, Canada. Most of its research and development is based in Warsaw, Poland. Support personnel are located in various offices around the world.

Contents

History

Infobright was founded in 2005. It became an open source company in September 2008, when it issued the first free release of its software. At the same time its community site was launched. [1]

The company is funded by venture capital investors Flybridge Capital Partners and Information Venture Partners.[ citation needed ]

In 2009, Infobright was recognized as MySQL's Partner of the Year, [2] and a Gartner Cool Vendor in Data Management and Integration. [3] It is also certified for use with Sun's Unified Storage product line. [4] It is the assignee of published patent applications on data compression, [5] query optimization, [6] and data organization. [7]

In July 2016, Infobright officially transitioned away from its open source community edition to focus on their OEM and direct customer markets.

Technology

Infobright Enterprise Edition (IEE)

Infobright's database software is integrated with MySQL, [8] but with its own proprietary data storage and query optimization layers.

Infobright uses a columnar approach to database design. When data is loaded into a table, it is broken into the groups of 216 rows, further decomposed into separate data packs for each of the columns. By breaking each column by the same number of rows, it maintains its integrity with other columns for the same entry. For example, row 1, column 1 is the first entry in the first datapack for column 1. Row 1 in column 2 is the first entry in the first datapack for column 2.

Each data pack is separately compressed to approximately 20:1 on average.

Infobright Enterprise Edition is available in PostgreSQL and MySQL.

Knowledge Grid A metadata layer (called the Database Knowledge Grid) stores compact information about the contents and relationships between the data packs, replacing the concept of a traditional database index. [9]

Query execution The optimizer uses theories of rough sets and Granular Computing by categorizing which data packs need to be decompressed and by refining such categorization using partial results obtained from the Knowledge Grid and already decompressed data packs. [10]

Related Research Articles

A relational database is a database based on the relational model of data, as proposed by E. F. Codd in 1970. A system used to maintain relational databases is a relational database management system (RDBMS). Many relational database systems are equipped with the option of using SQL for querying and updating the database.

Structured Query Language (SQL), is a domain-specific language used in programming and designed for managing data held in a relational database management system (RDBMS), or for stream processing in a relational data stream management system (RDSMS). It is particularly useful in handling structured data, i.e., data incorporating relations among entities and variables.

<span class="mw-page-title-main">Informix</span> Database management software product family

Informix is a product family within IBM's Information Management division that is centered on several relational database management system (RDBMS) and Multi-model database offerings. The Informix products were originally developed by Informix Corporation, whose Informix Software subsidiary was acquired by IBM in 2001.

<span class="mw-page-title-main">IBM Db2</span> Relational model database server

Db2 is a family of data management products, including database servers, developed by IBM. It initially supported the relational model, but was extended to support object–relational features and non-relational structures like JSON and XML. The brand name was originally styled as DB/2, then DB2 until 2017 and finally changed to its present form.

Online analytical processing, or OLAP, is an approach to answer multi-dimensional analytical (MDA) queries swiftly in computing. OLAP is part of the broader category of business intelligence, which also encompasses relational databases, report writing and data mining. Typical applications of OLAP include business reporting for sales, marketing, management reporting, business process management (BPM), budgeting and forecasting, financial reporting and similar areas, with new applications emerging, such as agriculture.

<span class="mw-page-title-main">Extract, transform, load</span> Procedure in computing

In computing, extract, transform, load (ETL) is a three-phase process where data is extracted, transformed and loaded into an output data container. The data can be collated from one or more sources and it can also be output to one or more destinations. ETL processing is typically executed using software applications but it can also be done manually by system operators. ETL software typically automates the entire process and can be run manually or on reoccurring schedules either as single jobs or aggregated into a batch of jobs.

Oracle Database is a proprietary multi-model database management system produced and marketed by Oracle Corporation.

The SQL SELECT statement returns a result set of records, from one or more tables.

A spatial database is a general-purpose database that has been enhanced to include spatial data that represents objects defined in a geometric space, along with tools for querying and analyzing such data. Most spatial databases allow the representation of simple geometric objects such as points, lines and polygons. Some spatial databases handle more complex structures such as 3D objects, topological coverages, linear networks, and triangulated irregular networks (TINs). While typical databases have developed to manage various numeric and character types of data, such databases require additional functionality to process spatial data types efficiently, and developers have often added geometry or feature data types. The Open Geospatial Consortium (OGC) developed the Simple Features specification and sets standards for adding spatial functionality to database systems. The SQL/MM Spatial ISO/IEC standard is a part of the SQL/MM multimedia standard and extends the Simple Features standard with data types that support circular interpolations. Almost all current relational and object-relational database management systems now have spatial extensions, and some GIS software vendors have developed their own spatial extensions to database management systems.

SAP IQ is a column-based, petabyte scale, relational database software system used for business intelligence, data warehousing, and data marts. Produced by Sybase Inc., now an SAP company, its primary function is to analyze large amounts of data in a low-cost, highly available environment. SAP IQ is often credited with pioneering the commercialization of column-store technology.

Microsoft SQL Server is a proprietary relational database management system developed by Microsoft. As a database server, it is a software product with the primary function of storing and retrieving data as requested by other software applications—which may run either on the same computer or on another computer across a network. Microsoft markets at least a dozen different editions of Microsoft SQL Server, aimed at different audiences and for workloads ranging from small single-machine applications to large Internet-facing applications with many concurrent users.

<span class="mw-page-title-main">Exasol</span> Database management software company

Exasol is an analytics database management software company. Its product is called Exasol, an in-memory, column-oriented, relational database management system

A database shard, or simply a shard, is a horizontal partition of data in a database or search engine. Each shard is held on a separate database server instance, to spread load.

Java Database Connectivity (JDBC) is an application programming interface (API) for the Java programming language which defines how a client may access a database. It is a Java-based data access technology used for Java database connectivity. It is part of the Java Standard Edition platform, from Oracle Corporation. It provides methods to query and update data in a database, and is oriented toward relational databases. A JDBC-to-ODBC bridge enables connections to any ODBC-accessible data source in the Java virtual machine (JVM) host environment.

SAND Nucleus CDBMS is a column-oriented DBMS software system optimized for business intelligence applications, delivering the data warehousing component, developed by SAND Technology Inc.

In computer science, in-memory processing is an emerging technology for processing of data stored in an in-memory database. In-memory processing is one method of addressing the performance and power bottlenecks caused by the movement of data between the processor and the main memory. Older systems have been based on disk storage and relational databases using SQL query language, but these are increasingly regarded as inadequate to meet business intelligence (BI) needs. Because stored data is accessed much more quickly when it is placed in random-access memory (RAM) or flash memory, in-memory processing allows data to be analysed in real time, enabling faster reporting and decision-making in business.

<span class="mw-page-title-main">Actian Zen</span>

Actian Zen is an ACID-compliant, Zero-DBA, Embedded, Nano-footprint, Multi-Model, Multi-Platform database management system (DBMS) developed originally by Pervasive Software, which was acquired by Actian Corporation in 2013.

The following is provided as an overview of and topical guide to databases:

<span class="mw-page-title-main">SingleStore</span>

SingleStore is a proprietary, cloud-native database designed for data-intensive applications. A distributed, relational, SQL database management system (RDBMS) that features ANSI SQL support, it is known for speed in data ingest, transaction processing, and query processing.

A Block Range Index or BRIN is a database indexing technique. They are intended to improve performance with extremely large tables.

References

  1. "Open Source Data Warehouse, Column Database Software, Improve SQL Performance". Community website. Archived from the original on October 19, 2011. Retrieved October 17, 2011.
  2. 2009 MySQL Partners of the Year Archived 2009-05-01 at the Wayback Machine Sun Presents 2009 MySQL Awards
  3. Cool Vendors in Data Management and Integration, 2009. Gartner Report G00165365
  4. Infobright Editions Secure Sun Certification. TMCNet.com, April 23, 2009
  5. Method and system for data compression in a relational database. Patent Application US 2008/0071818 A1
  6. Method and system for storing, organizing and processing data in a relational database. Patent Application US 2008/0071748 A1
  7. Methods and systems for database organization. Patent Application US 2009/0106210 A1
  8. Infobright Data Warehouse Archived 2009-05-08 at the Wayback Machine . MySQL, September, 2008
  9. Brighthouse: an analytic data warehouse for ad hoc queries Archived 2016-05-07 at the Wayback Machine . VLDB 2008 industry paper
  10. Rough Sets in Data Warehousing. RSCTC 2008 invited talk