Carto (company)

Last updated
Developer(s) CARTODB Inc.
Initial releaseSeptember 15, 2011
Written in TypeScript, JavaScript, React, SQL, Python
Type Location intelligence, Geographic information system
Website carto.com

CARTO (formerly CartoDB) is a software as a service (SaaS) spatial analysis platform that provides GIS, web mapping, data visualization, spatial analytics, and spatial data science features. The company is positioned as a Location Intelligence platform due to its tools for geospatial data analysis and visualization that do not require advanced GIS or development experience. As a cloud-native platform, CARTO runs natively on cloud data warehouse platforms overcoming any previous limits on data scale for spatial workloads.

Contents

CARTO is a cloud-first geospatial platform explicitly developed for accelerated and contemporary Geographic Information Systems (GIS). It operates on diverse cloud data warehouse platforms like Google BigQuery, Snowflake, AWS Redshift, Databricks, among others. This enables convenient access to scalable spatial analysis and data visualization in the cloud, streamlining spatial analytics, application development, data engineering, and related workloads. CARTO is accessible as both a cloud-based SaaS offering or a self-hosted deployment for enterprises wishing to run CARTO on their own private infrastructure.

CARTO was initially released in Beta at FOSS4G in Denver in September 2011, [1] and officially debuted as a final release at Where2.0 in April 2012. [2] Since 2014, CARTO is a company independent from Vizzuality. [3] The Spanish start-up raised $7 million from a consortium of investors in September 2014. [4] In September 2015, CARTO received a $23 million in Series B financing. [5] In May 2019, CARTO acquired Geographica, in an effort to boost their professional services offering. [6] In 2021, CARTO raised $61 million in series C financing, with Insight Partners leading the round. [7]

Technology

The CARTO platform consists of several components, ranging from browser-based applications built using React and TypeScript, to REST APIs and libraries that allow geospatial analysis and visualization on top of cloud data warehouses using SQL. Additionally, CARTO is a major contributor to deck.gl, an open-source Javascript library for data visualization.

Workspace

Within the Workspace interface, users can establish connections with various cloud data warehouses. They can integrate their spatial databases, geocode tables, enrich current datasets by incorporating both public and premium data from a data catalog. Additionally, users can easily access a diverse array of tools offered within the CARTO platform.

CARTO Builder

The web application called Builder is where users run analysis to create interactive dashboards and design custom map visualizations. Builder is intended for developers and non-developers to have access to, and be able to use advanced geospatial analysis tools. CARTO integrates lightweight data formats such as Spatial Indexes (H3 and Quadbin) and dynamic tilesets for efficient processing and analysis of large-scale geographical information.

CARTO Workflows

CARTO Workflows is a visual model builder that allows the building of complex spatial analyses and data preparation and transformation workflows without writing code. Workflows is fully cloud-native and runs directly in the data warehouse. CARTO Workflows provides a variety of analysis components for tasks ranging from data preparation to analysis functions. It also offers built-in GenAI capabilities aimed at enhancing productivity and facilitating quicker insights.

Analytics Toolbox

The Analytics Toolbox within CARTO comprises a comprehensive collection of data processing and analytical functions to efficiently amplify the geospatial functionalities accessible across various cloud data warehouses. This toolbox encompasses over 100 sophisticated spatial functions categorized into distinct modules like tiler, data, clustering, and statistics, among others. [8]

There is a specific CARTO Analytics Toolbox with different functions and modules depending on the cloud data warehouse. These toolboxes feature two distinct module types: core modules, which are open source and freely accessible, and advanced modules exclusively accessible with a CARTO account.

Map visualization with Carto of improved Page Load Times for Wikipedia Improvement in Page Load Times on Wikipedia after the ULSFO datacenter deployment (mapped with carto db).png
Map visualization with Carto of improved Page Load Times for Wikipedia

Platform APIs and libraries

CARTO has available a complete suite of APIs, frameworks, connectors, and development tools for the Developer community for building large-scale custom map and data visualization applications.

CARTO + deck.gl

CARTO uses deck.gl, an open-source webGL-based visualization library, to build all geospatial visualizations. CARTO is an active contributor to deck.gl and maintains a CARTO submodule inside deck.gl [10] that allows developers to build their own applications, wrapping seamlessly most functionalities in Maps API and SQL API.

APIs

Maps API: acts as a dynamic tile service for both vector and raster data, which creates new tiles based on client requests. It also supports pre-generated tilesets. It is meant to be used almost exclusively by the CARTO submodule inside deck.gl.

SQL API: allows pushing any kind of valid SQL statements (including parameterized queries) to the data warehouse. By using native SQL code, developers can build advanced logic, data widgets, and powerful analyses inside their geospatial applications.

Other APIs: CARTO offers additional APIs to handle Location Data Services (e.g. geocoding, routing…), import geospatial files to the data warehouse, or manage authentication and other assets in CARTO.

Basemaps

CARTO offers a high-quality basemap service to all users, based on open-source data such as OpenStreetMap. CARTO basemaps can be used for free for non-commercial purposes by applying for a grant.

Data Observatory

CARTO offers a wide range of datasets from around the globe accessible through their Data Observatory, which acts as their spatial data repository. Data scientists, developers, and GIS professionals can augment their data and broaden their analysis with more than 12.000 datasets available in the Data Observatory. [11] The datasets are public or premium covering most global markets. The open datasets include the WorldPop Project and Census data, while Experian, SafeGraph, TomTom, and others are part of the premium data repository.

Communities

The CARTO Community is a collaborative network connecting Geospatial, Analytics, and Data Science professionals with a focus on spatial data science and exploring new frontiers in location-based data applications.

The Spatial Data Science Conference (SDSC) [12] is an annual gathering that serves as a nexus for industry professionals from both private and public sectors to exchange ideas, present innovative methodologies, and showcase impactful use cases in spatial analytics.

In addition to several independent user communities, many businesses and organizations have adopted the Location Intelligence platform for their own needs. Notable groups include Vodafone, [13] Telefónica, Bumble, JLL, T-Mobile, and Coca-Cola.

See also

Related Research Articles

<span class="mw-page-title-main">Esri</span> Geospatial software & SaaS company

Environmental Systems Research Institute, Inc., doing business as Esri, is an American multinational geographic information system (GIS) software company headquartered in Redlands, California. It is best known for its ArcGIS products. With a 40% market share, Esri is the world's leading supplier of GIS software, web GIS and geodatabase management applications.

<span class="mw-page-title-main">SAGA GIS</span>

System for Automated Geoscientific Analyses is a geographic information system (GIS) computer program, used to edit spatial data. It is free and open-source software, developed originally by a small team at the Department of Physical Geography, University of Göttingen, Germany, and is now being maintained and extended by an international developer community.

A GIS software program is a computer program to support the use of a geographic information system, providing the ability to create, store, manage, query, analyze, and visualize geographic data, that is, data representing phenomena for which location is important. The GIS software industry encompasses a broad range of commercial and open-source products that provide some or all of these capabilities within various information technology architectures.

<span class="mw-page-title-main">ArcGIS</span> Geographic information system maintained by Esri

ArcGIS is a family of client, server and online geographic information system (GIS) software developed and maintained by Esri.

Business intelligence software is a type of application software designed to retrieve, analyze, transform and report data for business intelligence. The applications generally read data that has been previously stored, often - though not necessarily - in a data warehouse or data mart.

<span class="mw-page-title-main">QGIS</span> Open-source desktop GIS software

QGIS is a geographic information system (GIS) software that is free and open-source. QGIS supports Windows, macOS, and Linux. It supports viewing, editing, printing, and analysis of geospatial data in a range of data formats. QGIS was previously also known as Quantum GIS.

Oracle Spatial and Graph, formerly Oracle Spatial, is a free option component of the Oracle Database. The spatial features in Oracle Spatial and Graph aid users in managing geographic and location-data in a native type within an Oracle database, potentially supporting a wide range of applications — from automated mapping, facilities management, and geographic information systems (AM/FM/GIS), to wireless location services and location-enabled e-business. The graph features in Oracle Spatial and Graph include Oracle Network Data Model (NDM) graphs used in traditional network applications in major transportation, telcos, utilities and energy organizations and RDF semantic graphs used in social networks and social interactions and in linking disparate data sets to address requirements from the research, health sciences, finance, media and intelligence communities.

ArcGIS Server is the core server geographic information system (GIS) software made by Esri. ArcGIS Server is used for creating and managing GIS Web services, applications, and data. ArcGIS Server is typically deployed on-premises within the organization’s service-oriented architecture (SOA) or off-premises in a cloud computing environment.

A geoportal is a type of web portal used to find and access geographic information and associated geographic services via the Internet. Geoportals are important for effective use of geographic information systems (GIS) and a key element of a spatial data infrastructure (SDI).

NetCDF is a set of software libraries and self-describing, machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data. The project homepage is hosted by the Unidata program at the University Corporation for Atmospheric Research (UCAR). They are also the chief source of netCDF software, standards development, updates, etc. The format is an open standard. NetCDF Classic and 64-bit Offset Format are an international standard of the Open Geospatial Consortium.

KNIME, the Konstanz Information Miner, is a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining "Building Blocks of Analytics" concept. A graphical user interface and use of JDBC allows assembly of nodes blending different data sources, including preprocessing, for modeling, data analysis and visualization without, or with only minimal, programming.

BigQuery is a managed, serverless data warehouse product by Google, offering scalable analysis over large quantities of data. It is a Platform as a Service (PaaS) that supports querying using a dialect of SQL. It also has built-in machine learning capabilities. BigQuery was announced in May 2010 and made generally available in November 2011.

Geographic information systems (GIS) play a constantly evolving role in geospatial intelligence (GEOINT) and United States national security. These technologies allow a user to efficiently manage, analyze, and produce geospatial data, to combine GEOINT with other forms of intelligence collection, and to perform highly developed analysis and visual production of geospatial data. Therefore, GIS produces up-to-date and more reliable GEOINT to reduce uncertainty for a decisionmaker. Since GIS programs are Web-enabled, a user can constantly work with a decision maker to solve their GEOINT and national security related problems from anywhere in the world. There are many types of GIS software used in GEOINT and national security, such as Google Earth, ERDAS IMAGINE, GeoNetwork opensource, and Esri ArcGIS.

<span class="mw-page-title-main">WorldMap</span> Web platform

WorldMap is a web platform for creating, displaying, analyzing, and searching spatial data and other data forms across multiple disciplines.

Google Cloud Platform (GCP), offered by Google, is a suite of cloud computing services that provides a series of modular cloud services including computing, data storage, data analytics, and machine learning, alongside a set of management tools. It runs on the same infrastructure that Google uses internally for its end-user products, such as Google Search, Gmail, and Google Docs, according to Verma, et.al. Registration requires a credit card or bank account details.

BisQue is a free, open source web-based platform for the exchange and exploration of large, complex datasets. It is being developed at the Vision Research Lab at the University of California, Santa Barbara. BisQue specifically supports large scale, multi-dimensional multimodal-images and image analysis. Metadata is stored as arbitrarily nested and linked tag/value pairs, allowing for domain-specific data organization. Image analysis modules can be added to perform complex analysis tasks on compute clusters. Analysis results are stored within the database for further querying and processing. The data and analysis provenance is maintained for reproducibility of results. BisQue can be easily deployed in cloud computing environments or on computer clusters for scalability. BisQue has been integrated into the NSF Cyberinfrastructure project CyVerse. The user interacts with BisQue via any modern web browser.

Microsoft Power BI is an interactive data visualization software product developed by Microsoft with a primary focus on business intelligence. It is part of the Microsoft Power Platform. Power BI is a collection of software services, apps, and connectors that work together to turn various sources of data into static and interactive data visualizations. Data may be input by reading directly from a database, webpage, PDF, or structured files such as spreadsheets, CSV, XML, JSON, XLSX, and SharePoint.

<span class="mw-page-title-main">Oracle Cloud</span> Cloud computing service

Oracle Cloud is a cloud computing service offered by Oracle Corporation providing servers, storage, network, applications and services through a global network of Oracle Corporation managed data centers. The company allows these services to be provisioned on demand over the Internet.

Azure Maps is a suite of cloud-based, location-based services provided by Microsoft as part of the company's Azure platform. The platform provides geospatial and location-based services via REST APIs and software development kits (SDKs). The service is typically used to integrate maps or geospatial data into applications.

References

  1. "FOSS4G 2011 Program". FOSS4G. 2011-10-15. Archived from the original on 2013-08-03. Retrieved 2013-08-01.
  2. "Where2.0 2012 Program". O'Reilly. 2012-04-02. Archived from the original on 2013-09-21. Retrieved 2013-08-01.
  3. "CartoDB, la startup que no nació para ser rica, sino famosa". Hipertextual. 2014-12-01. Retrieved 2015-09-11.
  4. CartoDB, Interactive Mapping Start-Up, Raises $7 Million By Mark Scott, bits.blogs.nytimes.com, September 10, 2014
  5. "Announcing our $23M Series B round". CARTO Blog. 2015-09-10. Retrieved 2015-09-25.[ permanent dead link ]
  6. "CARTO acquires Geographica to expand its professional services offering". CARTO. 2019-05-29. Retrieved 2019-06-13.
  7. Dillet, Romain (December 14, 2021). "Carto raises $61 million to help you visualize data on maps". TechCrunch.
  8. "Analytics Toolbox". CARTO. Jan 18, 2024.
  9. "Making Wikimedia Sites faster – Wikimedia Blog". blog.wikimedia.org. 11 July 2014.
  10. deck.gl, Documentation. "@deck.gl/carto". deck.gl. Retrieved February 21, 2024.
  11. CARTO. "Location Data Streams for Business Management — CARTO". carto.com. Retrieved 2020-02-27.
  12. "Spatial Data Science Conference". Spatial Data Science Conference. Jan 18, 2024.
  13. "Vodafone lanza su herramienta de Big Data para empresas y Administraciones Públicas con Carto como partner tecnológico". Sala de Prensa Vodafone. Oct 19, 2017.