HHCode


A Helical Hyperspatial Code, also known as an HHCode, is a data storage format for very large spatio-temporal datasets.

HHCode was developed by scientists working at the Canadian Hydrographic Service (CHS) Atlantic regional offices at the Bedford Institute of Oceanography in Dartmouth, Nova Scotia. CHS needed to store very large spatial datasets acquired during hydrographic surveys, which therefore also carried a temporal element, and was seeking a method to store them in a relational database structure.

CHS scientists reportedly conceptualized a spatio-temporal indexing system during a hydrographic survey onboard CCGS Hudson on Georges Bank during the summer of 1989. The indexing system involved an adaptation of a Riemannian hypercube data structure, invoking a helical spiral through 3-dimensional space, which allowed for n-size of features.

The implementation of the proposed indexing system was termed a helical hyperspatial code, and it was first used in a modified installation of Oracle database version 4. The HHCode indexing system permitted highly efficient compression of not only spatial data but other data types as well, while improving search and retrieval times. The HHCode is a form of space-filling curve, and the concept was published by the CHS scientists in the International Hydrographic Review.
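As a rough illustration of how a space-filling curve can turn multi-dimensional coordinates into a single sortable key, the sketch below interleaves the bits of integer coordinates (a Z-order or Morton key). This is a generic technique chosen for illustration only; it is an assumption that it resembles HHCode in spirit, and it is not the published HHCode layout. The function names `interleave_bits` and `deinterleave_bits` are hypothetical.

```python
def interleave_bits(coords, bits):
    """Build one integer key by interleaving the low `bits` bits of each coordinate.

    Generic Z-order (Morton) interleaving, used here only to illustrate
    space-filling-curve indexing; not the actual HHCode encoding.
    """
    key = 0
    ndim = len(coords)
    for bit in range(bits):
        for dim, value in enumerate(coords):
            # Place bit `bit` of dimension `dim` at position bit * ndim + dim.
            key |= ((value >> bit) & 1) << (bit * ndim + dim)
    return key


def deinterleave_bits(key, ndim, bits):
    """Recover the original coordinates from an interleaved key."""
    coords = [0] * ndim
    for bit in range(bits):
        for dim in range(ndim):
            coords[dim] |= ((key >> (bit * ndim + dim)) & 1) << bit
    return tuple(coords)


# Two dimensions (x, y), 3 bits each.
print(interleave_bits((3, 5), bits=3))        # 39
print(deinterleave_bits(39, ndim=2, bits=3))  # (3, 5)

# A third dimension (for example a time step) is handled the same way.
key_xyt = interleave_bits((3, 5, 6), bits=3)
print(deinterleave_bits(key_xyt, ndim=3, bits=3))  # (3, 5, 6)
```

Because the key is a single integer, it can be stored in an ordinary indexed column of a relational table, and points that share high-order key bits fall within the same cell of the subdivided space.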

Oracle Corporation subsequently learned of the innovation, which led to an agreement with the Government of Canada. Under this agreement, the CHS scientists worked with a team of Oracle programmers at an office in Hull, Quebec, during the early 1990s to redevelop the Oracle database kernel. The resulting software was Oracle database version 7 with the Spatial Data Option (SDO). Subsequent development of the SDO has resulted in the Oracle Spatial extension to current Oracle databases.

Because HHCode is an open-source data format, several spatial data and software companies, namely Helical Systems Inc. and CubeWerx, have adopted it in various products targeted at very large corporate data users.

Related Research Articles

Database Organized collection of data

In computing, a database is an organized collection of data stored and accessed electronically from a computer system. Where databases are more complex they are often developed using formal design and modeling techniques.

Geographic information system System to capture, manage and present geographic data

A geographic information system (GIS) is a conceptualized framework that provides the ability to capture and analyse spatial and geographic data. GIS applications are computer-based tools that allow the user to create interactive queries, store and edit spatial and non-spatial data, analyze spatial information output, and visually share the results of these operations by presenting them as maps.

In computer science, locality of reference, also known as the principle of locality, is the tendency of a processor to access the same set of memory locations repetitively over a short period of time. There are two basic types of reference locality – temporal and spatial locality. Temporal locality refers to the reuse of specific data and/or resources within a relatively small time duration. Spatial locality refers to the use of data elements within relatively close storage locations. Sequential locality, a special case of spatial locality, occurs when data elements are arranged and accessed linearly, such as traversing the elements in a one-dimensional array.

A coverage is the digital representation of some spatio-temporal phenomenon. ISO 19123 provides the definition:

A GIS file format is a standard of encoding geographical information into a computer file. They are created mainly by government mapping agencies or by GIS software developers.

Canadian Hydrographic Service

The Canadian Hydrographic Service (CHS) is part of the federal department of Fisheries and Oceans Canada and is Canada's authoritative hydrographic office. The CHS represents Canada in the International Hydrographic Organization (IHO).

An XML database is a data persistence software system that allows data to be specified, and sometimes stored, in XML format. This data can be queried, transformed, exported and returned to a calling system. XML databases are a flavor of document-oriented databases which are in turn a category of NoSQL database.

Virtual globe 3D software model or representation of Earth or another world

A virtual globe is a three-dimensional (3D) software model or representation of Earth or another world. A virtual globe provides the user with the ability to freely move around in the virtual environment by changing the viewing angle and position. Compared to a conventional globe, virtual globes have the additional capability of representing many different views on the surface of Earth. These views may be of geographical features, man-made features such as roads and buildings, or abstract representations of demographic quantities such as population.

Shapefile Geospatial vector data format

The shapefile format is a geospatial vector data format for geographic information system (GIS) software. It is developed and regulated by Esri as a mostly open specification for data interoperability among Esri and other GIS software products. The shapefile format can spatially describe vector features: points, lines, and polygons, representing, for example, water wells, rivers, and lakes. Each item usually has attributes that describe it, such as name or temperature.

A spatial database is a database optimized for storing and querying data that represents objects defined in a geometric space. Most spatial databases allow the representation of simple geometric objects such as points, lines and polygons. Some spatial databases handle more complex structures such as 3D objects, topological coverages, linear networks, and TINs. While typical databases have developed to manage various numeric and character types of data, such databases require additional functionality to process spatial data types efficiently, and developers have often added geometry or feature data types. The Open Geospatial Consortium (OGC) developed the Simple Features specification and sets standards for adding spatial functionality to database systems. The SQL/MM Spatial ISO/IEC standard is a part of the SQL/MM multimedia standard and extends the Simple Features standard with data types that support circular interpolations.

Oracle Spatial and Graph, formerly Oracle Spatial, is a free option component of the Oracle Database. The spatial features in Oracle Spatial and Graph aid users in managing geographic and location-data in a native type within an Oracle database, potentially supporting a wide range of applications — from automated mapping, facilities management, and geographic information systems (AM/FM/GIS), to wireless location services and location-enabled e-business. The graph features in Oracle Spatial and Graph include Oracle Network Data Model (NDM) graphs used in traditional network applications in major transportation, telcos, utilities and energy organizations and RDF semantic graphs used in social networks and social interactions and in linking disparate data sets to address requirements from the research, health sciences, finance, media and intelligence communities.

A spatiotemporal database is a database that manages both space and time information. Common examples include:

Scalable Video Coding (SVC) is the name for the Annex G extension of the H.264/MPEG-4 AVC video compression standard. SVC standardizes the encoding of a high-quality video bitstream that also contains one or more subset bitstreams. A subset video bitstream is derived by dropping packets from the larger video to reduce the bandwidth required for the subset bitstream. The subset bitstream can represent a lower spatial resolution, lower temporal resolution, or lower quality video signal. H.264/MPEG-4 AVC was developed jointly by ITU-T and ISO/IEC JTC 1. These two groups created the Joint Video Team (JVT) to develop the H.264/MPEG-4 AVC standard.

A data model in geographic information systems is a mathematical and digital structure for representing geographic phenomena. Generally, geospatial data models represent various aspects of these phenomena, including spatial locations, properties, change over time, and identity. For example, the vector data model represents geography as collections of points, lines, and polygons, and the raster data model represents geography as cell matrices that store numeric values. Data models are implemented throughout the GIS ecosystem, including the software tools for data management and spatial analysis, data stored in a variety of GIS file formats, specifications and standards, and specific designs for GIS installations.

In computer science, the Bx tree is a B+ tree-based index structure for moving objects that is efficient for both queries and updates.

The following is provided as an overview of and topical guide to databases:

Array DBMS System that provides database services specifically for arrays

Array database management systems provide database services specifically for arrays, that is: homogeneous collections of data items, sitting on a regular grid of one, two, or more dimensions. Often arrays are used to represent sensor, simulation, image, or statistics data. Such arrays tend to be Big Data, with single objects frequently ranging into Terabyte and soon Petabyte sizes; for example, today's earth and space observation archives typically grow by Terabytes a day. Array databases aim at offering flexible, scalable storage and retrieval on this information category.

The neutral density, or empirical neutral density, is a density variable used in oceanography, introduced in 1997 by David R. Jackett and Trevor McDougall. It is a function of the three state variables and the geographical location, and it has the typical units of density (M/V). Isosurfaces of neutral density form "neutral density surfaces", which are closely aligned with the "neutral tangent plane". It is widely believed, although this has yet to be rigorously proven, that the flow in the deep ocean is almost entirely aligned with the neutral tangent plane, and that strong lateral mixing occurs along this plane while only weak mixing occurs across it. These surfaces are widely used in water mass analyses. Neutral density depends on the particular state of the ocean, and hence is also a function of time, though this is often ignored. In practice, it is constructed from a given hydrographic dataset by means of a computational code that contains the algorithm developed by Jackett and McDougall. Use of this code is currently restricted to the present-day ocean.

GeoSPARQL is a standard for representation and querying of geospatial linked data for the Semantic Web from the Open Geospatial Consortium (OGC). The definition of a small ontology based on well-understood OGC standards is intended to provide a standardized exchange basis for geospatial RDF data which can support both qualitative and quantitative spatial reasoning and querying with the SPARQL database query language.

Noel Cressie is an Australian and American statistician. He is Distinguished Professor and Director, Centre for Environmental Informatics, at the University of Wollongong in Wollongong, Australia.
