Region growing

Last updated March 13, 2024

Region growing is a simple region-based image segmentation method. It is also classified as a pixel-based image segmentation method since it involves the selection of initial seed points.

This approach to segmentation examines neighboring pixels of initial seed points and determines whether the pixel neighbors should be added to the region. The process is iterated on, in the same manner as general data clustering algorithms. A general discussion of the region growing algorithm is described below.

Region-based segmentation

The main goal of segmentation is to partition an image into regions. Some segmentation methods such as thresholding achieve this goal by looking for the boundaries between regions based on discontinuities in grayscale or color properties. Region-based segmentation is a technique for determining the region directly. The basic formulation is:^[1]

(a){\text{ }}\bigcup _{i=1}^{n}{R_{i}=R.}

(b){\text{ }}R_{i}{\text{ is a connected region}},{\text{ i}}={\text{1}},{\text{ 2}},{\text{ }}...,{\text{n}}

(c){\text{ }}R_{i}\bigcap R_{j}=\varnothing ,i\neq j

(d){\text{ }}P(R_{i})=\mathrm {TRUE} {\text{ for }}i=1,2,...,n.

(e){\text{ }}P(R_{i}\bigcup R_{j})=\mathrm {FALSE} {\text{ for any adjacent region }}R_{i}{\text{ and }}R_{j}.

P(R_{i})

is a logical predicate defined over the points in set

R_{i}

and

\varnothing

is the null set.

(a) means that the segmentation must be complete; that is, every pixel must be in a region.

(b) requires that points in a region must be connected in some predefined sense.

(c) indicates that the regions must be disjoint.

(d) deals with the properties that must be satisfied by the pixels in a segmented region. For example, $P(R_{i})={\text{TRUE}}$ if all pixels in $R_{i}$ have the same grayscale.

(e) indicates that region $R_{i}$ and $R_{j}$ are different in the sense of predicate $P$ .

Basic concept of seed points

The first step in region growing is to select a set of seed points. Seed point selection is based on some user criterion (for example, pixels in a certain grayscale range, pixels evenly spaced on a grid, etc.). The initial region begins as the exact location of these seeds.

The regions are then grown from these seed points to adjacent points depending on a region membership criterion. The criterion could be, for example, pixel intensity, grayscale texture, or colour.

Since the regions are grown on the basis of the criterion, the image information itself is important. For example, if the criterion were a pixel intensity threshold value, knowledge of the histogram of the image would be of use, as one could use it to determine a suitable threshold value for the region membership criterion.

One can use 4-connected neighborhood to grow from the seed points. An alternative for pixels adjacent relationship is the 8-connected neighborhood. Pixels adjacent to the seed points are examined and classified into the seed points if they have the same intensity value. It is an iterated process until there are no change in two successive iterative stages. Other criteria can be chosen; the main goal is to classify the similarity of the image into regions.

Important issues

Suitable selection of seed points

The selection of seed points is depending on the users. For example, in a grayscale lightning image, we may want to segment the lightning from the background. Then probably, we can examine the histogram and choose the seed points from the highest range of it.

More information of the image is better

Obviously, the connectivity or pixel adjacent information is helpful for us to determine the threshold and seed points.

Minimum area threshold

No region in region growing method result will be smaller than this threshold in the segmented image.

Similarity threshold value

If the difference of pixel-value or the difference value of average grayscale of a set of pixels less than “Similarity threshold value”, the regions will be considered as a same region.

The criteria of similarities or so called homogeneity we choose are also important. It usually depends on the original image and the segmentation result we want.^[2]

Some criteria often used are grayscale (average intensity or variance), color, and texture or shape.

Advantages and disadvantages

Advantages

Can correctly separate the regions that have the same properties we define.
Can provide the original images which have clear edges with good segmentation results.
Simple concept: only need a small number of seed points to represent the property we want, then grow the region.
Can determine the seed points and the criteria we want to make.
Can choose the multiple criteria at the same time.
Theoretical very efficient due to visiting each pixel by a limited bound of times.

Disadvantages

Unless image has had a threshold function applied, a continuous path of points related to color may exist, which connects any two points in the image.
Practically random memory access slows down the algorithm, so adaption might be needed

Notes

↑ Pal, Nikhil R; Pal, Sankar K (1993). "A review on image segmentation techniques". Pattern Recognition. 26 (9): 1277–1278. Bibcode:1993PatRe..26.1277P. doi:10.1016/0031-3203(93)90135-J.
↑ Adoui, Mohammed El; Drisis, Stylianos; Benjelloun, Mohammed (2017-07-21). Analyzing breast tumor heterogeneity to predict the response to chemotherapy using 3D MR images registration. ACM. pp. 56–63. doi:10.1145/3128128.3128137. ISBN 9781450352819. S2CID 24873901.

Related Research Articles

Edge detection includes a variety of mathematical methods that aim at identifying edges, defined as curves in a digital image at which the image brightness changes sharply or, more formally, has discontinuities. The same problem of finding discontinuities in one-dimensional signals is known as step detection and the problem of finding signal discontinuities over time is known as change detection. Edge detection is a fundamental tool in image processing, machine vision and computer vision, particularly in the areas of feature detection and feature extraction.

<span class="mw-page-title-main">Grayscale</span> Image where each pixels intensity is shown only achromatic values of black, gray, and white

In digital photography, computer-generated imagery, and colorimetry, a grayscale image is one in which the value of each pixel is a single sample representing only an amount of light; that is, it carries only intensity information. Grayscale images, a kind of black-and-white or gray monochrome, are composed exclusively of shades of gray. The contrast ranges from black at the weakest intensity to white at the strongest.

The Canny edge detector is an edge detection operator that uses a multi-stage algorithm to detect a wide range of edges in images. It was developed by John F. Canny in 1986. Canny also produced a computational theory of edge detection explaining why the technique works.

In digital image processing and computer vision, image segmentation is the process of partitioning a digital image into multiple image segments, also known as image regions or image objects. The goal of segmentation is to simplify and/or change the representation of an image into something that is more meaningful and easier to analyze. Image segmentation is typically used to locate objects and boundaries in images. More precisely, image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics.

In computer vision and image processing, Otsu's method, named after Nobuyuki Otsu, is used to perform automatic image thresholding. In the simplest form, the algorithm returns a single intensity threshold that separate pixels into two classes, foreground and background. This threshold is determined by minimizing intra-class intensity variance, or equivalently, by maximizing inter-class variance. Otsu's method is a one-dimensional discrete analogue of Fisher's Discriminant Analysis, is related to Jenks optimization method, and is equivalent to a globally optimal k-means performed on the intensity histogram. The extension to multi-level thresholding was described in the original paper, and computationally efficient implementations have since been proposed.

In image processing, normalization is a process that changes the range of pixel intensity values. Applications include photographs with poor contrast due to glare, for example. Normalization is sometimes called contrast stretching or histogram stretching. In more general fields of data processing, such as digital signal processing, it is referred to as dynamic range expansion.

Connected-component labeling (CCL), connected-component analysis (CCA), blob extraction, region labeling, blob discovery, or region extraction is an algorithmic application of graph theory, where subsets of connected components are uniquely labeled based on a given heuristic. Connected-component labeling is not to be confused with segmentation.

As applied in the field of computer vision, graph cut optimization can be employed to efficiently solve a wide variety of low-level computer vision problems, such as image smoothing, the stereo correspondence problem, image segmentation, object co-segmentation, and many other computer vision problems that can be formulated in terms of energy minimization. Many of these energy minimization problems can be approximated by solving a maximum flow problem in a graph. Under most formulations of such problems in computer vision, the minimum energy solution corresponds to the maximum a posteriori estimate of a solution. Although many computer vision algorithms involve cutting a graph, the term "graph cuts" is applied specifically to those models which employ a max-flow/min-cut optimization.

In the fields of computer vision and image analysis, the Harris affine region detector belongs to the category of feature detection. Feature detection is a preprocessing step of several algorithms that rely on identifying characteristic points or interest points so to make correspondences between images, recognize textures, categorize objects or build panoramas.

In computer vision, maximally stable extremal regions (MSER) are used as a method of blob detection in images. This technique was proposed by Matas et al. to find correspondences between image elements from two images with different viewpoints. This method of extracting a comprehensive number of corresponding image elements contributes to the wide-baseline matching, and it has led to better stereo matching and object recognition algorithms.

In mathematical morphology and digital image processing, a morphological gradient is the difference between the dilation and the erosion of a given image. It is an image where each pixel value indicates the contrast intensity in the close neighborhood of that pixel. It is useful for edge detection and segmentation applications.

In mathematical morphology and digital image processing, a top-hat transform is an operation that extracts small elements and details from given images. There exist two types of top-hat transform: the white top-hat transform is defined as the difference between the input image and its opening by some structuring element, while the black top-hat transform is defined dually as the difference between the closing and the input image. Top-hat transforms are used for various image processing tasks, such as feature extraction, background equalization, image enhancement, and others.

Image segmentation strives to partition a digital image into regions of pixels with similar properties, e.g. homogeneity. The higher-level region representation simplifies image analysis tasks such as counting objects or detecting changes, because region attributes can be compared more readily than raw pixels.

The random walker algorithm is an algorithm for image segmentation. In the first description of the algorithm, a user interactively labels a small number of pixels with known labels, e.g., "object" and "background". The unlabeled pixels are each imagined to release a random walker, and the probability is computed that each pixel's random walker first arrives at a seed bearing each label, i.e., if a user places K seeds, each with a different label, then it is necessary to compute, for each pixel, the probability that a random walker leaving the pixel will first arrive at each seed. These probabilities may be determined analytically by solving a system of linear equations. After computing these probabilities for each pixel, the pixel is assigned to the label for which it is most likely to send a random walker. The image is modeled as a graph, in which each pixel corresponds to a node which is connected to neighboring pixels by edges, and the edges are weighted to reflect the similarity between the pixels. Therefore, the random walk occurs on the weighted graph.

<span class="mw-page-title-main">Image texture</span>

An image texture is a set of metrics calculated in image processing designed to quantify the perceived texture of an image. Image texture gives us information about the spatial arrangement of color or intensities in an image or selected region of an image.

In applied mathematics, lambda-connectedness deals with partial connectivity for a discrete space.

Foreground detection is one of the major tasks in the field of computer vision and image processing whose aim is to detect changes in image sequences. Background subtraction is any technique which allows an image's foreground to be extracted for further processing.

In image analysis, the average with limited data validity is an image filter for feature-preserving noise removal, consisting in a smoothing filter that only involves pixels satisfying some validity criterion. If some feature of noise elements is known, it is possible to use it to define a criterion to detect invalid pixels, and selectively smooth only invalid pixels using data coming only from valid pixels, thus avoiding to affect other features of the image.

Split and merge segmentation is an image processing technique used to segment an image. The image is successively split into quadrants based on a homogeneity criterion and similar regions are merged to create the segmented result. The technique incorporates a quadtree data structure, meaning that there is a parent-child node relationship. The total region is a parent, and each of the four splits is a child.

In medical imaging, a time-activity curve is a curve of radioactivity plotted on the y-axis against the time plotted on the x-axis. It shows the concentration of a radiotracer within a region of interest in an image, measured over time from a dynamic scan. Generally, when a time-activity curve is obtained within a tissue, it is called as a tissue time-activity curve, which represents the concentration of tracer within a region of interest inside a tissue over time.

References

Jian-Jiun Ding, The class of "Time-Frequency Analysis and Wavelet Transform", the Department of Electrical Engineering, National Taiwan University (NTU), Taipei, Taiwan, 2007.
Jian-Jiun Ding, The class of "Advanced Digital Signal Processing", the Department of Electrical Engineering, National Taiwan University (NTU), Taipei, Taiwan, 2008.
W. K. Pratt, Digital Image Processing 4th Edition, John Wiley & Sons, Inc., Los Altos, California, 2007
M. Petrou and P. Bosdogianni, Image Processing the Fundamentals, Wiley, UK, 2004.
R. C. Gonzalez and R.E. Woods, Digital Image Processing 2nd Edition, Prentice Hall, New Jersey, 2002.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Pal, Nikhil R; Pal, Sankar K (1993). "A review on image segmentation techniques". Pattern Recognition. 26 (9): 1277–1278. Bibcode:1993PatRe..26.1277P. doi:10.1016/0031-3203(93)90135-J.

[2] Adoui, Mohammed El; Drisis, Stylianos; Benjelloun, Mohammed (2017-07-21). Analyzing breast tumor heterogeneity to predict the response to chemotherapy using 3D MR images registration. ACM. pp. 56–63. doi:10.1145/3128128.3128137. ISBN 9781450352819. S2CID 24873901.

[1]

[2]