Image analysis

Last updated December 05, 2024

Image analysis or imagery analysis is the extraction of meaningful information from images; mainly from digital images by means of digital image processing techniques.^[1] Image analysis tasks can be as simple as reading bar coded tags or as sophisticated as identifying a person from their face.

Computers are indispensable for the analysis of large amounts of data, for tasks that require complex computation, or for the extraction of quantitative information. On the other hand, the human visual cortex is an excellent image analysis apparatus, especially for extracting higher-level information, and for many applications — including medicine, security, and remote sensing — human analysts still cannot be replaced by computers. For this reason, many important image analysis tools such as edge detectors and neural networks are inspired by human visual perception models.

Digital

Digital Image Analysis or Computer Image Analysis is when a computer or electrical device automatically studies an image to obtain useful information from it. Note that the device is often a computer but may also be an electrical circuit, a digital camera or a mobile phone. It involves the fields of computer or machine vision, and medical imaging, and makes heavy use of pattern recognition, digital geometry, and signal processing. This field of computer science developed in the 1950s at academic institutions such as the MIT A.I. Lab, originally as a branch of artificial intelligence and robotics.

It is the quantitative or qualitative characterization of two-dimensional (2D) or three-dimensional (3D) digital images. 2D images are, for example, to be analyzed in computer vision, and 3D images in medical imaging. The field was established in the 1950s—1970s, for example with pioneering contributions by Azriel Rosenfeld, Herbert Freeman, Jack E. Bresenham, or King-Sun Fu.

Techniques

There are many different techniques used in automatically analysing images. Each technique may be useful for a small range of tasks, however there still aren't any known methods of image analysis that are generic enough for wide ranges of tasks, compared to the abilities of a human's image analysing capabilities. Examples of image analysis techniques in different fields include:

2D and 3D object recognition,
image segmentation,
motion detection e.g. Single particle tracking,
video tracking,
optical flow,
medical scan analysis,
3D Pose Estimation.

Applications

The applications of digital image analysis are continuously expanding through all areas of science and industry, including:

anatomy, allows for precise measurements, visualization, and statistical analysis of anatomical structures.^[2]
assay micro plate reading, such as detecting where a chemical was manufactured.
astronomy, such as calculating the size of a planet.
automated species identification (e.g. plant and animal species)
defense
error level analysis
filtering
machine vision, such as to automatically count items in a factory conveyor belt.
materials science, such as determining if a metal weld has cracks.
medicine, such as detecting cancer in a mammography scan.
metallography, such as determining the mineral content of a rock sample.
microscopy, such as counting the germs in a swab.
automatic number plate recognition;
optical character recognition, such as automatic license plate detection.
remote sensing, such as detecting intruders in a house, and producing land cover/land use maps.^[3]^[4]
robotics, such as to avoid steering into an obstacle.
security, such as detecting a person's eye color or hair color.

Object-based

Object-based image analysis (OBIA) involves two typical processes, segmentation and classification. Segmentation helps to group pixels into homogeneous objects. The objects typically correspond to individual features of interest, although over-segmentation or under-segmentation is very likely. Classification then can be performed at object levels, using various statistics of the objects as features in the classifier. Statistics can include geometry, context and texture of image objects. Over-segmentation is often preferred over under-segmentation when classifying high-resolution images.^[5]

Object-based image analysis has been applied in many fields, such as cell biology, medicine, earth sciences, and remote sensing. For example, it can detect changes of cellular shapes in the process of cell differentiation.;^[6] it has also been widely used in the mapping community to generate land cover.^[5]^[7]

When applied to earth images, OBIA is known as geographic object-based image analysis (GEOBIA), defined as "a sub-discipline of geoinformation science devoted to (...) partitioning remote sensing (RS) imagery into meaningful image-objects, and assessing their characteristics through spatial, spectral and temporal scale".^[8]^[7] The international GEOBIA conference has been held biannually since 2006.^[9]

OBIA techniques are implemented in software such as eCognition or the Orfeo toolbox.

Related Research Articles

Computer vision tasks include methods for acquiring, processing, analyzing, and understanding digital images, and extraction of high-dimensional data from the real world in order to produce numerical or symbolic information, e.g. in the form of decisions. "Understanding" in this context signifies the transformation of visual images into descriptions of the world that make sense to thought processes and can elicit appropriate action. This image understanding can be seen as the disentangling of symbolic information from image data using models constructed with the aid of geometry, physics, statistics, and learning theory.

Lidar is a method for determining ranges by targeting an object or a surface with a laser and measuring the time for the reflected light to return to the receiver. Lidar may operate in a fixed direction or it may scan multiple directions, in which case it is known as lidar scanning or 3D laser scanning, a special combination of 3-D scanning and laser scanning. Lidar has terrestrial, airborne, and mobile applications.

A digital elevation model (DEM) or digital surface model (DSM) is a 3D computer graphics representation of elevation data to represent terrain or overlaying objects, commonly of a planet, moon, or asteroid. A "global DEM" refers to a discrete global grid. DEMs are used often in geographic information systems (GIS), and are the most common basis for digitally produced relief maps. A digital terrain model (DTM) represents specifically the ground surface while DEM and DSM may represent tree top canopy or building roofs.

Image registration is the process of transforming different sets of data into one coordinate system. Data may be multiple photographs, data from different sensors, times, depths, or viewpoints. It is used in computer vision, medical imaging, military automatic target recognition, and compiling and analyzing images and data from satellites. Registration is necessary in order to be able to compare or integrate the data obtained from these different measurements.

<span class="mw-page-title-main">Remote sensing</span> Acquisition of information at a significant distance from the subject

Remote sensing is the acquisition of information about an object or phenomenon without making physical contact with the object, in contrast to in situ or on-site observation. The term is applied especially to acquiring information about Earth and other planets. Remote sensing is used in numerous fields, including geophysics, geography, land surveying and most Earth science disciplines. It also has military, intelligence, commercial, economic, planning, and humanitarian applications, among others.

In digital image processing and computer vision, image segmentation is the process of partitioning a digital image into multiple image segments, also known as image regions or image objects. The goal of segmentation is to simplify and/or change the representation of an image into something that is more meaningful and easier to analyze. Image segmentation is typically used to locate objects and boundaries in images. More precisely, image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics.

Photogrammetry is the science and technology of obtaining reliable information about physical objects and the environment through the process of recording, measuring and interpreting photographic images and patterns of electromagnetic radiant imagery and other phenomena.

Aerial archaeology is the study of archaeological sites from the air. It is a method of archaeological investigation that uses aerial photography, remote sensing, and other techniques to identify, record, and interpret archaeological features and sites. Aerial archaeology has been used to discover and map a wide range of archaeological sites, from prehistoric settlements and ancient roads to medieval castles and World War II battlefields.

Multispectral imaging captures image data within specific wavelength ranges across the electromagnetic spectrum. The wavelengths may be separated by filters or detected with the use of instruments that are sensitive to particular wavelengths, including light from frequencies beyond the visible light range. It can allow extraction of additional information the human eye fails to capture with its visible receptors for red, green and blue. It was originally developed for military target identification and reconnaissance. Early space-based imaging platforms incorporated multispectral imaging technology to map details of the Earth related to coastal boundaries, vegetation, and landforms. Multispectral imaging has also found use in document and painting analysis.

3D scanning is the process of analyzing a real-world object or environment to collect three dimensional data of its shape and possibly its appearance. The collected data can then be used to construct digital 3D models.

The following outline is provided as an overview of and topical guide to computer vision:

Cognition Network Technology (CNT), also known as Definiens Cognition Network Technology, is an object-based image analysis method developed by Nobel laureate Gerd Binnig together with a team of researchers at Definiens AG in Munich, Germany. It serves for extracting information from images using a hierarchy of image objects, as opposed to traditional pixel processing methods.

Mobile mapping is the process of collecting geospatial data from a mobile vehicle, typically fitted with a range of GNSS, photographic, radar, laser, LiDAR or any number of remote sensing systems. Such systems are composed of an integrated array of time synchronised navigation sensors and imaging sensors mounted on a mobile platform. The primary output from such systems include GIS data, digital maps, and georeferenced images and video.

2D to 3D video conversion is the process of transforming 2D ("flat") film to 3D form, which in almost all cases is stereo, so it is the process of creating imagery for each eye from one 2D image.

<span class="mw-page-title-main">Digital outcrop model</span> Digital 3D representation of the outcrop surface

A digital outcrop model (DOM), also called a virtual outcrop model, is a digital 3D representation of the outcrop surface, mostly in a form of textured polygon mesh.

Vaa3D is an Open Source visualization and analysis software suite created mainly by Hanchuan Peng and his team at Janelia Research Campus, HHMI and Allen Institute for Brain Science. The software performs 3D, 4D and 5D rendering and analysis of very large image data sets, especially those generated using various modern microscopy methods, and associated 3D surface objects. This software has been used in several large neuroscience initiatives and a number of applications in other domains. In a recent Nature Methods review article, it has been viewed as one of the leading open-source software suites in the related research fields. In addition, research using this software was awarded the 2012 Cozzarelli Prize from the National Academy of Sciences.

The Aphelion Imaging Software Suite is a software suite that includes three base products - Aphelion Lab, Aphelion Dev, and Aphelion SDK for addressing image processing and image analysis applications. The suite also includes a set of extension programs to implement specific vertical applications that benefit from imaging techniques.

Digital archaeology is the application of information technology and digital media to archaeology. This includes the use of tools such as databases, 3D models, digital photography, virtual reality, augmented reality, and geographic information systems. Computational archaeology, which covers computer-based analytical methods, can be considered a subfield of digital archaeology, as can virtual archaeology. Digital archaeology plays a key role in data collection, analysis, and public outreach, enhancing the study and preservation of archaeological sites and artifacts.

Remote sensing is used in the geological sciences as a data acquisition method complementary to field observation, because it allows mapping of geological characteristics of regions without physical contact with the areas being explored. About one-fourth of the Earth's total surface area is exposed land where information is ready to be extracted from detailed earth observation via remote sensing. Remote sensing is conducted via detection of electromagnetic radiation by sensors. The radiation can be naturally sourced, or produced by machines and reflected off of the Earth surface. The electromagnetic radiation acts as an information carrier for two main variables. First, the intensities of reflectance at different wavelengths are detected, and plotted on a spectral reflectance curve. This spectral fingerprint is governed by the physio-chemical properties of the surface of the target object and therefore helps mineral identification and hence geological mapping, for example by hyperspectral imaging. Second, the two-way travel time of radiation from and back to the sensor can calculate the distance in active remote sensing systems, for example, Interferometric synthetic-aperture radar. This helps geomorphological studies of ground motion, and thus can illuminate deformations associated with landslides, earthquakes, etc.

Applications of machine learning (ML) in earth sciences include geological mapping, gas leakage detection and geological feature identification. Machine learning is a subdiscipline of artificial intelligence aimed at developing programs that are able to classify, cluster, identify, and analyze vast and complex data sets without the need for explicit programming to do so. Earth science is the study of the origin, evolution, and future of the Earth. The earth's system can be subdivided into four major components including the solid earth, atmosphere, hydrosphere, and biosphere.

References

↑ Solomon, C.J., Breckon, T.P. (2010). Fundamentals of Digital Image Processing: A Practical Approach with Examples in Matlab. Wiley-Blackwell. doi:10.1002/9780470689776. ISBN 978-0470844731.{{cite book}}: CS1 maint: multiple names: authors list (link)
↑ Kędzia, Alicja; Derkowski, Wojciech (2024). "Modern methods of neuroanatomical and neurophysiological research". MethodsX. 13: 102881. doi:10.1016/j.mex.2024.102881. ISSN 2215-0161. PMC 11340600 . PMID 39176151.
↑ Xie, Y.; Sha, Z.; Yu, M. (2008). "Remote sensing imagery in vegetation mapping: a review". Journal of Plant Ecology. 1 (1): 9–23. doi: 10.1093/jpe/rtm005 .
↑ Wilschut, L.I.; Addink, E.A.; Heesterbeek, J.A.P.; Dubyanskiy, V.M.; Davis, S.A.; Laudisoit, A.; Begon, M.; Burdelov, L.A.; Atshabar, B.B.; de Jong, S.M (2013). "Mapping the distribution of the main host for plague in a complex landscape in Kazakhstan: An object-based approach using SPOT-5 XS, Landsat 7 ETM+, SRTM and multiple Random Forests". International Journal of Applied Earth Observation and Geoinformation. 23 (100): 81–94. Bibcode:2013IJAEO..23...81W. doi:10.1016/j.jag.2012.11.007. PMC 4010295 . PMID 24817838.
1 2 Liu, Dan; Toman, Elizabeth; Fuller, Zane; Chen, Gang; Londo, Alexis; Xuesong, Zhang; Kaiguang, Zhao (2018). "Integration of historical map and aerial imagery to characterize long-term land-use change and landscape dynamics: An object-based analysis via Random Forests" (PDF). Ecological Indicators. 95 (1): 595–605. Bibcode:2018EcInd..95..595L. doi:10.1016/j.ecolind.2018.08.004. S2CID 92025959.
↑ Salzmann, M.; Hoesel, B.; Haase, M.; Mussbacher, M.; Schrottmaier, W. C.; Kral-Pointner, J. B.; Finsterbusch, M.; Mazharian, A.; Assinger, A. (2018-02-20). "A novel method for automated assessment of megakaryocyte differentiation and proplatelet formation" (PDF). Platelets. 29 (4): 357–364. doi: 10.1080/09537104.2018.1430359 . ISSN 1369-1635. PMID 29461915. S2CID 3785563.
1 2 Blaschke, Thomas; Hay, Geoffrey J.; Kelly, Maggi; Lang, Stefan; Hofmann, Peter; Addink, Elisabeth; Queiroz Feitosa, Raul; van der Meer, Freek; van der Werff, Harald; van Coillie, Frieke; Tiede, Dirk (2014). "Geographic Object-Based Image Analysis – Towards a new paradigm". ISPRS Journal of Photogrammetry and Remote Sensing. 87 (100). Elsevier BV: 180–191. Bibcode:2014JPRS...87..180B. doi: 10.1016/j.isprsjprs.2013.09.014 . ISSN 0924-2716. PMC 3945831 . PMID 24623958.
↑ G.J. Hay & G. Castilla: Geographic Object-Based Image Analysis (GEOBIA): A new name for a new discipline. In: T. Blaschke, S. Lang & G. Hay (eds.): Object-Based Image Analysis – Spatial Concepts for Knowledge-Driven Remote Sensing Applications. Lecture Notes in Geoinformation and Cartography, 18. Springer, Berlin/Heidelberg, Germany: 75-89 (2008)
↑ "Remote Sensing | Special Issue: Advances in Geographic Object-Based Image Analysis (GEOBIA)". Archived from the original on 2013-12-12.