This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these template messages)
|
Robert M. Haralick (born 1943) is Distinguished Professor in Computer Science at Graduate Center of the City University of New York (CUNY). Haralick is one of the leading figures in computer vision, pattern recognition, and image analysis. He is a Fellow of the Institute of Electrical and Electronics Engineers (IEEE) and a Fellow and past president of the International Association for Pattern Recognition. Professor Haralick is the King-Sun Fu Prize winner of 2016, "for contributions in image analysis, including remote sensing, texture analysis, mathematical morphology, consistent labeling, and system performance evaluation".
Haralick received a B.A. degree in mathematics from the University of Kansas in 1964, a B.S. degree in electrical engineering in 1966, and a M.S. degree in electrical engineering in 1967. In 1969, after completing his Ph.D. at the University of Kansas, he joined the faculty of the electrical engineering department, serving as professor from 1975 to 1978. In 1979 Haralick joined the electrical engineering department at Virginia Polytechnic Institute and State University, where he was a professor and director of the spatial data analysis laboratory.
From 1984 to 1986 Haralick served as vice president of research at Machine Vision International, Ann Arbor, Michigan. Haralick occupied the Boeing Clairmont Egtvedt Professorship in the department of electrical engineering at the University of Washington from 1986 through 2000. At University of Washington, Haralick was an adjunct professor in the computer science department and the bioengineering department.
In 2000 Haralick accepted a Distinguished Professorship position at the computer science department of the Graduate Center of the City University of New York. [1]
Haralick began his work as one of the principal investigators of the NASA ERTS satellite data doing remote sensing image analysis. [2] [3] [4]
Haralick has made a series of contributions in the field of computer vision. In the high-level vision area, he has worked on inferring 3D geometry from one or more perspective projection views. [5] [6] [7] [8] He has also identified a variety of vision problems which are special cases of the consistent labeling problem. His papers on consistent labeling, [9] [10] arrangements, relation homomorphism, [11] matching, and tree search translate some specific computer vision problems to the more general combinatorial consistent labeling problem and then discuss the theory of the look-ahead operators that speed up the tree search. The most basic of these is called Forward Checking. [12] This gives a framework for the control structure required in high-level vision problems. He has also extended the forward-checking tree search technique to propositional logic. [13]
In the low-and mid-level areas, Haralick has worked in image texture analysis using spatial gray tone co-occurrence texture features. [14] [15] These features have been used with success on biological cell images, x-ray images, satellite images, aerial images and many other kinds of images taken at small and large scales. In the feature detection area, Haralick has developed the facet model for image processing. [16] The facet model states that many low-level image processing operations can be interpreted relative to what the processing does to the estimated underlying gray tone intensity surface of which the given image is a sampled noisy version. The facet papers develop techniques for edge detection, [17] [18] line detection, [19] noise removal, [20] peak and pit detection, as well as a variety of other topographic gray tone surface features. [21]
Haralick's work in shape analysis and extraction uses the techniques of mathematical morphology. [22] He has developed the morphological sampling theorem [23] which establishes a sound shape/size basis for the focus of attention mechanisms which can process image data in a multiresolution mode, thereby making some of the image feature extraction processes execute more efficiently. He has also developed recursive morphological algorithms for the computation of opening and closing transforms. The recursive algorithms permit all possible sized openings or closings for a given structuring element to be computed in constant time per pixel. [24] [25] He also developed statistical morphological methodologies for image analysis and noise removal. [26] and noise removal [27]
In the area of document image understanding, Haralick is responsible for the development of comprehensive ground-truthed databases consisting of over 1500 document images, most in English and some in Japanese. The databases are issued on CD-ROMs and are used all around the world by people developing character recognition methodologies and techniques for document image structural decomposition. [28] He has developed algorithms for document image skew angle estimation, zone delineation, and word and text line bounding box delineation. [29] [30]
In a series of papers, Haralick has helped influence the computer vision community to be more sensitive to the needs of computer vision performance characterization and covariance propagation for without this kind of analysis Computer Vision has no robust theory. [31] [32] [33]
Haralick has contributed to the medical image analysis area particularly working with X-ray ventriculargrams . [34] [35] [36] and echocardiography, [37] These papers developed techniques to identify and delineate anatomically accurate boundaries for the left ventricle of the heart.
His most recent work is in the pattern recognition area, particularly in the manifold clustering of high-dimensional data sets, [38] [39] the application of pattern recognition to mathematical combinatorial problems [40] [41] and in the area of Torah codes popularly called Bible codes. In this area he has co-authored a book with Eliyahu Rips, one of the coauthors of the original Statistical Sciences paper. Haralick's research has helped develop sophisticated algorithmic and statistical methodology for Torah code experiments, methodology that he claims can differentiate between the tables that are depicted as encodings in books like Moby Dick and War and Peace from those encodings that occur in the Torah text. [42] [43]
Haralick is a Fellow of IEEE for his contributions in computer vision and image processing and a Fellow of the International Association for Pattern Recognition (IAPR) for his contributions in pattern recognition, image processing, and for service to IAPR. He served as president of IAPR from 1996 to 1998. He has served on the editorial board of "IEEE Transactions on Pattern Analysis and Machine Intelligence" and has been the computer vision area editor for Communications of the ACM and as an associate editor for Computer Vision, Graphics, and Image Processing, The IEEE Transactions on Image Processing and Pattern Recognition. He served on the editorial board of Real Time Imaging and the editorial board of Electronic Imaging. His publications include over 570 archival papers, book chapters, conference proceedings and books. The science citation index lists over 6300 references to his papers.
Haralick has been recognized for his academic research in the Marquis Who's Who books. He is listed in the current editions for Who's Who in the East, Who's Who in America, and Who's Who in the World.
In digital image processing and computer vision, image segmentation is the process of partitioning a digital image into multiple image segments, also known as image regions or image objects. The goal of segmentation is to simplify and/or change the representation of an image into something that is more meaningful and easier to analyze. Image segmentation is typically used to locate objects and boundaries in images. More precisely, image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics.
Thomas Shi-Tao Huang was a Chinese-born American computer scientist, electrical engineer, and writer. He was a researcher and professor emeritus at the University of Illinois at Urbana-Champaign (UIUC). Huang was one of the leading figures in computer vision, pattern recognition and human computer interaction.
Azriel Rosenfeld was an American Research Professor, a Distinguished University Professor, and Director of the Center for Automation Research at the University of Maryland, College Park, Maryland, where he also held affiliate professorships in the Departments of Computer Science, Electrical Engineering, and Psychology, and a talmid chochom. He held a Ph.D. in mathematics from Columbia University (1957), rabbinic ordination (1952) and a Doctor of Hebrew Literature degree (1955) from Yeshiva University, honorary Doctor of Technology degrees from Linkoping University (1980) and Oulu University (1994), and an honorary Doctor of Humane Letters degree from Yeshiva University (2000); he was awarded an honorary Doctor of Science degree from the Technion. He was a Fellow of the Association for the Advancement of Artificial Intelligence (1990) and of the Association for Computing Machinery (1994).
King-Sun Fu was a Chinese-born American computer scientist. He was a Goss Distinguished Professor at Purdue University School of Electrical and Computer Engineering in West Lafayette, Indiana. He was instrumental in the founding of International Association for Pattern Recognition (IAPR), served as its first president, and is widely recognized for his extensive and pioneering contributions to the field of pattern recognition and machine intelligence. In honor of the memory of Professor King-Sun Fu, IAPR gives the biennial King-Sun Fu Prize to a living person in the recognition of an outstanding technical contribution to the field of pattern recognition. The first King-Sun Fu Prize was presented in 1988, to Azriel Rosenfeld.
Linda G. Shapiro is a professor in the Department of Computer Science and Engineering, a professor of electrical engineering, and adjunct professor of Biomedical Informatics and Medical Education at the University of Washington.
As applied in the field of computer vision, graph cut optimization can be employed to efficiently solve a wide variety of low-level computer vision problems, such as image smoothing, the stereo correspondence problem, image segmentation, object co-segmentation, and many other computer vision problems that can be formulated in terms of energy minimization. Many of these energy minimization problems can be approximated by solving a maximum flow problem in a graph. Under most formulations of such problems in computer vision, the minimum energy solution corresponds to the maximum a posteriori estimate of a solution. Although many computer vision algorithms involve cutting a graph, the term "graph cuts" is applied specifically to those models which employ a max-flow/min-cut optimization.
In the study of image processing, a watershed is a transformation defined on a grayscale image. The name refers metaphorically to a geological watershed, or drainage divide, which separates adjacent drainage basins. The watershed transformation treats the image it operates upon like a topographic map, with the brightness of each point representing its height, and finds the lines that run along the tops of ridges.
Pyramid, or pyramid representation, is a type of multi-scale signal representation developed by the computer vision, image processing and signal processing communities, in which a signal or an image is subject to repeated smoothing and subsampling. Pyramid representation is a predecessor to scale-space representation and multiresolution analysis.
Local binary patterns (LBP) is a type of visual descriptor used for classification in computer vision. LBP is the particular case of the Texture Spectrum model proposed in 1990. LBP was first described in 1994. It has since been found to be a powerful feature for texture classification; it has further been determined that when LBP is combined with the Histogram of oriented gradients (HOG) descriptor, it improves the detection performance considerably on some datasets. A comparison of several improvements of the original LBP in the field of background subtraction was made in 2015 by Silva et al. A full survey of the different versions of LBP can be found in Bouwmans et al.
Josef KittlerFREng is a British scientist and Distinguished Professor at University of Surrey, specialising in pattern recognition and machine intelligence.
Theodosios Pavlidis is a computer scientist and Distinguished Professor Emeritus of Computer Science at the State University of New York, Stony Brook.
Jagdishkumar Keshoram Aggarwal is an American computer scientist, who is currently retired and is Cullen Trust Endowed Emeritus Professor of the Department of Electrical and Computer Engineering at the Cockrell School of Engineering, University of Texas at Austin. He is known for his contributions in the fields of computer vision, pattern recognition and image processing focusing on human motion and activities. He served in various positions in the Department of Electrical and Computer of the University of Texas at Austin and other institutions.
Tin Kam Ho is a computer scientist at IBM Research with contributions to machine learning, data mining, and classification. Ho is noted for introducing random decision forests in 1995, and for her pioneering work in ensemble learning and data complexity analysis. She is an IEEE fellow and IAPR fellow.
Gang Hua is a Chinese-American computer scientist who specializes in the field of computer vision and pattern recognition. He is an IEEE Fellow, IAPR Fellow and ACM Distinguished Scientist. He is a key contributor to Microsoft's Facial Recognition technologies.
Jiebo Luo is a Chinese-American computer scientist, the Albert Arendt Hopeman Professor of Engineering and Professor of Computer Science at the University of Rochester. He is interested in artificial intelligence, data science and computer vision.
Song-Chun Zhu is a Chinese computer scientist and applied mathematician known for his work in computer vision, cognitive artificial intelligence and robotics. Zhu currently works at Peking University and was previously a professor in the Departments of Statistics and Computer Science at the University of California, Los Angeles. Zhu also previously served as Director of the UCLA Center for Vision, Cognition, Learning and Autonomy (VCLA).
Mark S. Nixon is an author, researcher, editor and an academic. He is the former president of IEEE Biometrics Council, and former vice-Chair of IEEE PSPB. He retired from his position as Professor of Electronics and Computer Science at University of Southampton in 2019.
Jiaya Jia is a tenured professor of the Department of Computer Science and Engineering at The Chinese University of Hong Kong (CUHK). He is an IEEE Fellow, the associate editor-in-chief of one of IEEE’s flagship and premier journals- Transactions on Pattern Analysis and Machine Intelligence (TPAMI), as well as on the editorial board of International Journal of Computer Vision (IJCV).
Mário A. T. Figueiredo is a Portuguese engineer, academic, and researcher. He is an IST Distinguished Professor and holds the Feedzai chair of machine learning at IST, University of Lisbon.