Robert Haralick

Last updated

Robert M. Haralick (born 1943) is Distinguished Professor in Computer Science at Graduate Center of the City University of New York (CUNY). Haralick is one of the leading figures in computer vision, pattern recognition, and image analysis. He is a Fellow of the Institute of Electrical and Electronics Engineers (IEEE) and a Fellow and past president of the International Association for Pattern Recognition. Professor Haralick is the King-Sun Fu Prize winner of 2016, "for contributions in image analysis, including remote sensing, texture analysis, mathematical morphology, consistent labeling, and system performance evaluation".

Contents

Biography

Haralick received a B.A. degree in mathematics from the University of Kansas in 1964, a B.S. degree in electrical engineering in 1966, and a M.S. degree in electrical engineering in 1967. In 1969, after completing his Ph.D. at the University of Kansas, he joined the faculty of the electrical engineering department, serving as professor from 1975 to 1978. In 1979 Haralick joined the electrical engineering department at Virginia Polytechnic Institute and State University, where he was a professor and director of the spatial data analysis laboratory.

From 1984 to 1986 Haralick served as vice president of research at Machine Vision International, Ann Arbor, Michigan. Haralick occupied the Boeing Clairmont Egtvedt Professorship in the department of electrical engineering at the University of Washington from 1986 through 2000. At University of Washington, Haralick was an adjunct professor in the computer science department and the bioengineering department.

In 2000 Haralick accepted a Distinguished Professorship position at the computer science department of the Graduate Center of the City University of New York. [1]

Contributions

Haralick began his work as one of the principal investigators of the NASA ERTS satellite data doing remote sensing image analysis. [2] [3] [4]

Haralick has made a series of contributions in the field of computer vision. In the high-level vision area, he has worked on inferring 3D geometry from one or more perspective projection views. [5] [6] [7] [8] He has also identified a variety of vision problems which are special cases of the consistent labeling problem. His papers on consistent labeling, [9] [10] arrangements, relation homomorphism, [11] matching, and tree search translate some specific computer vision problems to the more general combinatorial consistent labeling problem and then discuss the theory of the look-ahead operators that speed up the tree search. The most basic of these is called Forward Checking. [12] This gives a framework for the control structure required in high-level vision problems. He has also extended the forward-checking tree search technique to propositional logic. [13]

In the low-and mid-level areas, Haralick has worked in image texture analysis using spatial gray tone co-occurrence texture features. [14] [15] These features have been used with success on biological cell images, x-ray images, satellite images, aerial images and many other kinds of images taken at small and large scales. In the feature detection area, Haralick has developed the facet model for image processing. [16] The facet model states that many low-level image processing operations can be interpreted relative to what the processing does to the estimated underlying gray tone intensity surface of which the given image is a sampled noisy version. The facet papers develop techniques for edge detection, [17] [18] line detection, [19] noise removal, [20] peak and pit detection, as well as a variety of other topographic gray tone surface features. [21]

Haralick's work in shape analysis and extraction uses the techniques of mathematical morphology. [22] He has developed the morphological sampling theorem [23] which establishes a sound shape/size basis for the focus of attention mechanisms which can process image data in a multiresolution mode, thereby making some of the image feature extraction processes execute more efficiently. He has also developed recursive morphological algorithms for the computation of opening and closing transforms. The recursive algorithms permit all possible sized openings or closings for a given structuring element to be computed in constant time per pixel. [24] [25] He also developed statistical morphological methodologies for image analysis and noise removal. [26] and noise removal [27]

In the area of document image understanding, Haralick is responsible for the development of comprehensive ground-truthed databases consisting of over 1500 document images, most in English and some in Japanese. The databases are issued on CD-ROMs and are used all around the world by people developing character recognition methodologies and techniques for document image structural decomposition. [28] He has developed algorithms for document image skew angle estimation, zone delineation, and word and text line bounding box delineation. [29] [30]

In a series of papers, Haralick has helped influence the computer vision community to be more sensitive to the needs of computer vision performance characterization and covariance propagation for without this kind of analysis Computer Vision has no robust theory. [31] [32] [33]

Haralick has contributed to the medical image analysis area particularly working with X-ray ventriculargrams . [34] [35] [36] and echocardiography, [37] These papers developed techniques to identify and delineate anatomically accurate boundaries for the left ventricle of the heart.

His most recent work is in the pattern recognition area, particularly in the manifold clustering of high-dimensional data sets, [38] [39] the application of pattern recognition to mathematical combinatorial problems [40] [41] and in the area of Torah codes popularly called Bible codes. In this area he has co-authored a book with Eliyahu Rips, one of the coauthors of the original Statistical Sciences paper. Haralick's research has helped develop sophisticated algorithmic and statistical methodology for Torah code experiments, methodology that he claims can differentiate between the tables that are depicted as encodings in books like Moby Dick and War and Peace from those encodings that occur in the Torah text. [42] [43]

Haralick is a Fellow of IEEE for his contributions in computer vision and image processing and a Fellow of the International Association for Pattern Recognition (IAPR) for his contributions in pattern recognition, image processing, and for service to IAPR. He served as president of IAPR from 1996 to 1998. He has served on the editorial board of "IEEE Transactions on Pattern Analysis and Machine Intelligence" and has been the computer vision area editor for Communications of the ACM and as an associate editor for Computer Vision, Graphics, and Image Processing, The IEEE Transactions on Image Processing and Pattern Recognition. He served on the editorial board of Real Time Imaging and the editorial board of Electronic Imaging. His publications include over 570 archival papers, book chapters, conference proceedings and books. The science citation index lists over 6300 references to his papers.

Haralick has been recognized for his academic research in the Marquis Who's Who books. He is listed in the current editions for Who's Who in the East, Who's Who in America, and Who's Who in the World.

Works

See also

Related Research Articles

<span class="mw-page-title-main">Image segmentation</span> Partitioning a digital image into segments

In digital image processing and computer vision, image segmentation is the process of partitioning a digital image into multiple image segments, also known as image regions or image objects. The goal of segmentation is to simplify and/or change the representation of an image into something that is more meaningful and easier to analyze. Image segmentation is typically used to locate objects and boundaries in images. More precisely, image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics.

<span class="mw-page-title-main">Thomas Huang</span> Chinese-American engineer and computer scientist (1936–2020)

Thomas Shi-Tao Huang was a Chinese-born American computer scientist, electrical engineer, and writer. He was a researcher and professor emeritus at the University of Illinois at Urbana-Champaign (UIUC). Huang was one of the leading figures in computer vision, pattern recognition and human computer interaction.

<span class="mw-page-title-main">Azriel Rosenfeld</span>

Azriel Rosenfeld was an American Research Professor, a Distinguished University Professor, and Director of the Center for Automation Research at the University of Maryland, College Park, Maryland, where he also held affiliate professorships in the Departments of Computer Science, Electrical Engineering, and Psychology, and a talmid chochom. He held a Ph.D. in mathematics from Columbia University (1957), rabbinic ordination (1952) and a Doctor of Hebrew Literature degree (1955) from Yeshiva University, honorary Doctor of Technology degrees from Linkoping University (1980) and Oulu University (1994), and an honorary Doctor of Humane Letters degree from Yeshiva University (2000); he was awarded an honorary Doctor of Science degree from the Technion. He was a Fellow of the Association for the Advancement of Artificial Intelligence (1990) and of the Association for Computing Machinery (1994).

King-Sun Fu was a Chinese-born American computer scientist. He was a Goss Distinguished Professor at Purdue University School of Electrical and Computer Engineering in West Lafayette, Indiana. He was instrumental in the founding of International Association for Pattern Recognition (IAPR), served as its first president, and is widely recognized for his extensive and pioneering contributions to the field of pattern recognition and machine intelligence. In honor of the memory of Professor King-Sun Fu, IAPR gives the biennial King-Sun Fu Prize to a living person in the recognition of an outstanding technical contribution to the field of pattern recognition. The first King-Sun Fu Prize was presented in 1988, to Azriel Rosenfeld.

<span class="mw-page-title-main">Linda Shapiro</span>

Linda G. Shapiro is a professor in the Department of Computer Science and Engineering, a professor of electrical engineering, and adjunct professor of Biomedical Informatics and Medical Education at the University of Washington.

As applied in the field of computer vision, graph cut optimization can be employed to efficiently solve a wide variety of low-level computer vision problems, such as image smoothing, the stereo correspondence problem, image segmentation, object co-segmentation, and many other computer vision problems that can be formulated in terms of energy minimization. Many of these energy minimization problems can be approximated by solving a maximum flow problem in a graph. Under most formulations of such problems in computer vision, the minimum energy solution corresponds to the maximum a posteriori estimate of a solution. Although many computer vision algorithms involve cutting a graph, the term "graph cuts" is applied specifically to those models which employ a max-flow/min-cut optimization.

<span class="mw-page-title-main">Watershed (image processing)</span> Transformation defined on a grayscale image

In the study of image processing, a watershed is a transformation defined on a grayscale image. The name refers metaphorically to a geological watershed, or drainage divide, which separates adjacent drainage basins. The watershed transformation treats the image it operates upon like a topographic map, with the brightness of each point representing its height, and finds the lines that run along the tops of ridges.

<span class="mw-page-title-main">Pyramid (image processing)</span> Type of multi-scale signal representation

Pyramid, or pyramid representation, is a type of multi-scale signal representation developed by the computer vision, image processing and signal processing communities, in which a signal or an image is subject to repeated smoothing and subsampling. Pyramid representation is a predecessor to scale-space representation and multiresolution analysis.

Local binary patterns (LBP) is a type of visual descriptor used for classification in computer vision. LBP is the particular case of the Texture Spectrum model proposed in 1990. LBP was first described in 1994. It has since been found to be a powerful feature for texture classification; it has further been determined that when LBP is combined with the Histogram of oriented gradients (HOG) descriptor, it improves the detection performance considerably on some datasets. A comparison of several improvements of the original LBP in the field of background subtraction was made in 2015 by Silva et al. A full survey of the different versions of LBP can be found in Bouwmans et al.

Josef KittlerFREng is a British scientist and Distinguished Professor at University of Surrey, specialising in pattern recognition and machine intelligence.

<span class="mw-page-title-main">Theodosios Pavlidis</span>

Theodosios Pavlidis is a computer scientist and Distinguished Professor Emeritus of Computer Science at the State University of New York, Stony Brook.

<span class="mw-page-title-main">J. K. Aggarwal</span> American engineer (born 1936)

Jagdishkumar Keshoram Aggarwal is an American computer scientist, who is currently retired and is Cullen Trust Endowed Emeritus Professor of the Department of Electrical and Computer Engineering at the Cockrell School of Engineering, University of Texas at Austin. He is known for his contributions in the fields of computer vision, pattern recognition and image processing focusing on human motion and activities. He served in various positions in the Department of Electrical and Computer of the University of Texas at Austin and other institutions.

Tin Kam Ho is a computer scientist at IBM Research with contributions to machine learning, data mining, and classification. Ho is noted for introducing random decision forests in 1995, and for her pioneering work in ensemble learning and data complexity analysis. She is an IEEE fellow and IAPR fellow.

<span class="mw-page-title-main">Gang Hua</span> Chinese-American computer scientist (born 1979)

Gang Hua is a Chinese-American computer scientist who specializes in the field of computer vision and pattern recognition. He is an IEEE Fellow, IAPR Fellow and ACM Distinguished Scientist. He is a key contributor to Microsoft's Facial Recognition technologies.

Jiebo Luo is a Chinese-American computer scientist, the Albert Arendt Hopeman Professor of Engineering and Professor of Computer Science at the University of Rochester. He is interested in artificial intelligence, data science and computer vision.

Song-Chun Zhu is a Chinese computer scientist and applied mathematician known for his work in computer vision, cognitive artificial intelligence and robotics. Zhu currently works at Peking University and was previously a professor in the Departments of Statistics and Computer Science at the University of California, Los Angeles. Zhu also previously served as Director of the UCLA Center for Vision, Cognition, Learning and Autonomy (VCLA).

Mark S. Nixon is an author, researcher, editor and an academic. He is the former president of IEEE Biometrics Council, and former vice-Chair of IEEE PSPB. He retired from his position as Professor of Electronics and Computer Science at University of Southampton in 2019.

Jiaya Jia is a tenured professor of the Department of Computer Science and Engineering at The Chinese University of Hong Kong (CUHK). He is an IEEE Fellow, the associate editor-in-chief of one of IEEE’s flagship and premier journals- Transactions on Pattern Analysis and Machine Intelligence (TPAMI), as well as on the editorial board of International Journal of Computer Vision (IJCV).

<span class="mw-page-title-main">Mario A. T. Figueiredo</span> Portuguese engineer, academic

Mário A. T. Figueiredo is a Portuguese engineer, academic, and researcher. He is an IST Distinguished Professor and holds the Feedzai chair of machine learning at IST, University of Lisbon.

References

  1. "Robert Haralick Distinguished Professor, Ph.D. Program in Computer Science". Recently Appointed Graduate Center Faculty: 1999-present. The Graduate Center, City University of New York. Archived from the original on 2015-09-06. Retrieved 2011-12-04.
  2. Pattern Recognition with Measurement Space and Spatial Clustering for Multiple Images, (with G.L. Kelly), Proceedings of the IEEE, Volume 57, Number 4, April, 1969, pages 654-665.
  3. Using Radar Imagery for Crop Discrimination: A Statistical and Conditional Probability Study, with (F. Caspall and D.S. Simonett), Remote Sensing of Environment, Volume 1, 1970, pages 131-142
  4. An Iterative Clustering Procedure, (with I. Dinstein), IEEE Transactions on Systems, Man, and Cybernetics, Volume SMC 1, Number 3, July, 1971, pages 275-289.
  5. Haralick, Robert M. (1980). "Using Perspective Transformations in Scene Analysis" (PDF). Computer Graphics and Image Processing. 13 (3): 191–221. doi:10.1016/0146-664x(80)90046-5.
  6. Statistical Estimation For Exterior Orientation From Line to Line Correspondences, (with Chung Nan Lee), Image and Vision Computing, Volume 14, 1996, pages 379-388
  7. A Robust Linear Least squares Estimation of Camera Exterior Orientation Using Multiple Geometric Features, (with Qiang Ji, Mauro Costa, and Linda Shapiro), ISPRS Journal of Photogrammetry and Remote Sensing, Volume 55, Number 2, 2000, pages 75-93.
  8. Testing Camera Calibration With Constraints, (with Huang Youcai), Photogrammetric Engineering and Remote Sensing, March 1999, pages 249-258.
  9. The Consistent Labeling Problem: Part I, (with L.G. Shapiro), IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 1, Number 2, April, 1979, pages 173-184.
  10. The Consistent Labeling Problem: Part II, (with L.G. Shapiro), IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 2, Number 3, May, 1980, pages 193-203.
  11. Structural Pattern Recognition, Homomorphisms, and Arrangements, Pattern Recognition, Volume 10, Number 3, June, 1978, pages 223-236.
  12. Increasing Tree Search Efficiency for Constraint Satisfaction Problems, (with G.L. Elliott), Artificial Intelligence, Volume 14, 1980, pages 263-313.
  13. An Approximate Linear Time Propagate and Divide Theorem Prover for Propositional Logic, with S.H. Wu, International Journal of Pattern Recognition and Artificial Intelligence, Volume 1, Number 1, 1987, pages 141-155.
  14. Textural Features for Image Classification, with S. Shamnugam, IEEE Transactions on Cybernetics, Volume SMC-3, No. 2, 1973
  15. Statistical and Structural Approaches to Texture, Proceedings of the IEEE, Volume 67, Number 5, May, 1979, pages 786-804.
  16. A Facet Model for Image Data, (with L. Watson), Computer Graphics and Image Processing, Volume 15, February, 1981, pages 113-129.
  17. Digital Step Edges from Zero Crossing of Second Directional Derivatives, IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 6, Number 1, January, 1984, pages 58-68.
  18. Integrated Directional Derivative Gradient Operator, (with O.A. Zuniga), IEEE Transactions on Systems, Man, and Cybernetics, Volume SMC 17, Number 3, May/June, 1987, pages 508-518.
  19. Ridges and Valleys on Digital Images, Computer Vision, Graphics, and Image Processing, Volume 22, 1983, pages 28-38.
  20. Peak Noise Removal by a Facet Model, (with Y. Yasuoka), Pattern Recognition, Volume 16, Number 1, 1983, pages 23-29.
  21. The Topographic Primal Sketch, (with L.T. Watson and T.J. Laffey), The International Journal of Robotics Research, Volume 2, Number 1, Spring 1983, pages 50-72.
  22. Image Analysis Using Mathematical Morphology, (with S. Sternberg and X. Zhuang), IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 9, Number 4, July, 1987, pages 532-550.
  23. The Digital Morphological Sampling Theorem, (with X. Zhuang, C. Lin, and J.S.J. Lee), IEEE Transactions on Acoustics, Speech, and Signal Processing, Volume 37, Number 12, December, 1989, pages 2067-2090.
  24. Recursive Erosion, Dilation, Opening, and Closing Transforms, (with S. Chen), IEEE Transactions on Image Processing, March, 1995, Volume 4, Number 3, pages 335-345.
  25. Recursive Binary Dilation and Erosion Using Digital Line Structuring Elements In Arbitrary Orientations, (with Desika Nadadur), IEEE Transactions on Image Processing, May 2000
  26. Model Based Morphology: The Opening Spectrum, (with E.R. Dougherty and P.L. Katz), Computer Vision, Graphics, and Image Understanding: Graphical Models and Image Processing, Volume 57, Number 1, January, 1995, pages 1-12.
  27. Estimation of Optimal Morphological Tau opening Parameters Based on Independent Observation of Signal and Noise Pattern Spectra, (with E.R. Dougherty, Y. Chen, C. Agerskov, U. Jacobi, and P.H. Sloth), Journal of Signal Processing, Volume 29, Number 3, December, 1992, pages 265-281.
  28. CD_ROM Standard Document Database Standard, with I.T. Phillips, International Conference on Document Analysis and Recognition, Tsukuba, Japan, 1993
  29. Extraction of Text Lines and Text Blocks on Document Images Based on Statistical Modeling, (with Su Chen and Ihsin Phillips), International Journal of Imaging Systems and Technology, Volume 7, 1996, pages 343-35.
  30. Extraction of Text Words In Document Images Based On A Statistical Characterization, (with Su Chen and Ihsin Phillips), Journal of Electronic Imaging, Volume 5, 1996, page 24-34.
  31. Computer Vision Theory: The Lack Thereof, Computer Vision, Graphics, and Image Processing, Volume 36, 1986, page 372-386.
  32. Propagating Covariance in Computer Vision, International Journal of Pattern Recognition and Artificial Intelligence, Volume 10, Number 5, 1996, pages 561-572.
  33. Error Propagation in Machine Vision, (with Seungku Yi and L.G. Shapiro), Machine Vision and Applications, Volume 7, 1994, pages 93-114.
  34. A general technique for automatic left ventricle boundary validation: relation between gray scale cardioangiograms and observed boundary errors, (with J. Suri, F. Sheehan), Journal of Digital Imaging, 1997, August;10(3 Suppl 1), pages 212-217.
  35. Greedy Algorithm for Error Correction in Automatically Produced Boundaries from Low Contrast Ventriculograms, (with Jasjit S. Suri, Florence H. Sheehan), Pattern Analysis and Applications, Volume 3 Number 1, 2000, pages 39-60.
  36. A Knowledge based Boundary Delineation System For Contrast Ventriculograms", (with Lei Sui and Florence Sheehan), IEEE Transactions on Information Technology in Biomedicine, Volume 5, Number 2, 2001, pages 116-132.
  37. Integrated Surface Model Optimization for Freehand Three Dimensional Echocardiography, IEEE Transactions on Medical Imaging, (with Mingzhou Song, Florence Sheehan, and Richard Johnson), Volume 21, Number 9, September 2002.
  38. Linear Manifold Clustering In High Dimensional Spaces By Stochastic Search, (with Rave Harpaz), Pattern Recognition, Volume 40, Number 10, 2007, pages 2672-2684.
  39. Linear Manifold Correlation Clustering, (with Rave Harpaz), (invited paper) International Journal of Information Technology And Intelligent Computing, Volume 2, Number 2, 2007.
  40. Pattern Recognition Approaches To Solving Combinatorial Problems in Free Groups, (with Alex Miasnikov and Alexei Myasnikov), Contemporary Mathematics, Volume 349, 2004, pages 197-213.
  41. Heuristics for Whitehead Minimization Problem, (with Alex Miasnikov and Alexei Myasnikov), Experimental Mathematics, Volume 14, Number 1, 2005, page 7-14.
  42. Basic Concepts For Testing The Torah Code Hypothesis", International Conference on Pattern Recognition, Volume 3, 2006, pages 104-109.
  43. Testing The Torah Code Hypothesis: The Experimental Protocol, International Conference on Pattern Recognition, Volume 3, 2006, pages 110-115.