Irfan Essa

Last updated
Irfan Essa
Irfan Essa.jpeg
Alma mater MIT
Known forfacial recognition, video stabilization, computational photography, computational journalism
Scientific career
Fields Computer vision, computational journalism, machine learning, computer graphics, robotics
Institutions Georgia Tech
GVU Center
Thesis Analysis, interpretation and synthesis of facial expressions  (1995)
Doctoral advisor Alex Pentland
Website prof.irfanessa.com

Irfan Aziz Essa is a professor in the School of Interactive Computing of the College of Computing, and adjunct professor in the School of Electrical and Computer Engineering at the Georgia Institute of Technology (Georgia Tech). He is an associate dean in Georgia Tech's College of Computing [1] and the director of the new Interdisciplinary Research Center for Machine Learning at Georgia Tech (ML@GT). [2]

Contents

Education

Essa obtained his undergraduate degree in engineering at the Illinois Institute of Technology in 1988. [3] Following this, Essa attended the Massachusetts Institute of Technology, where he received his magister scientiae (Master of Science) in 1990 and his Ph.D. in 1995 at the MIT Media Lab. His doctoral research focused on the implementation of a system to detect emotions from changes in your facial expression, which was later featured in the New York Times. [4] He proceeded to hold a position as a research scientist at MIT from 1994 to 1996 before accepting a position at Georgia Tech.

Professional career

Essa's work focuses mainly in the areas of computer vision, computational photography, computer graphics and animation, robotics, computational perception, human-computer interaction, machine learning, computational journalism and artificial intelligence.

After departing MIT, Essa accepted a position as an assistant professor in the College of Computing at Georgia Tech. Today, he holds the position of a professor, and continues his research endeavors alongside his teaching career.

Essa has taught various courses over the years on digital video special effects, computer vision, computational journalism and computational photography. [5] In the spring of 2013, Essa taught a free online course on computational photography, on the MOOC platform Coursera. [6] He is affiliated with the GVU Center and RIM@GT, and is one of the faculty members of the Computational Perception Laboratory at Georgia Tech.

In addition to this, Essa has organized the Computational Journalism Symposium both in 2008 and 2013. [7] He is credited, alongside his doctoral student Nick Diakopoulos, with coining the term computational journalism back in 2006, when they taught the first class on the subject. [8]

Most recently, Essa has worked as a researcher / consultant with Google to develop a video stabilization algorithm alongside two of his doctoral students, Matthias Grundmann and Vivek Kwatra, which now runs on YouTube, and allows users to stabilize their uploaded videos in real-time. [9]

Selected bibliography

Related Research Articles

<span class="mw-page-title-main">Distance transform</span>

A distance transform, also known as distance map or distance field, is a derived representation of a digital image. The choice of the term depends on the point of view on the object in question: whether the initial image is transformed into another representation, or it is simply endowed with an additional map or field.

General-purpose computing on graphics processing units is the use of a graphics processing unit (GPU), which typically handles computation only for computer graphics, to perform computation in applications traditionally handled by the central processing unit (CPU). The use of multiple video cards in one computer, or large numbers of graphics chips, further parallelizes the already parallel nature of graphics processing.

<span class="mw-page-title-main">Gesture recognition</span> Topic in computer science and language technology

Gesture recognition is a topic in computer science and language technology with the goal of interpreting human gestures via mathematical algorithms. It is a subdiscipline of computer vision. Gestures can originate from any bodily motion or state, but commonly originate from the face or hand. Focuses in the field include emotion recognition from face and hand gesture recognition since they are all expressions. Users can make simple gestures to control or interact with devices without physically touching them. Many approaches have been made using cameras and computer vision algorithms to interpret sign language, however, the identification and recognition of posture, gait, proxemics, and human behaviors is also the subject of gesture recognition techniques. Gesture recognition can be seen as a way for computers to begin to understand human body language, thus building a better bridge between machines and humans than older text user interfaces or even GUIs, which still limit the majority of input to keyboard and mouse and interact naturally without any mechanical devices.

<span class="mw-page-title-main">Automatic image annotation</span>

Automatic image annotation is the process by which a computer system automatically assigns metadata in the form of captioning or keywords to a digital image. This application of computer vision techniques is used in image retrieval systems to organize and locate images of interest from a database.

Ramesh Chandra Jain is a scientist and entrepreneur in the field of information and computer science. He is a Bren Professor in Information & Computer Sciences, Donald Bren School of Information and Computer Sciences, University of California, Irvine.

Object recognition – technology in the field of computer vision for finding and identifying objects in an image or video sequence. Humans recognize a multitude of objects in images with little effort, despite the fact that the image of the objects may vary somewhat in different view points, in many different sizes and scales or even when they are translated or rotated. Objects can even be recognized when they are partially obstructed from view. This task is still a challenge for computer vision systems. Many approaches to the task have been implemented over multiple decades.

<span class="mw-page-title-main">Gregory Abowd</span> American computer scientist

Gregory Dominic Abowd is a computer scientist best known for his work in ubiquitous computing, software engineering, and technologies for autism. He currently serves as the Dean of the College of Engineering and Professor of Electrical and Computer Engineering at Northeastern University. Previously he was the J.Z. Liang Professor in the School of Interactive Computing at the Georgia Institute of Technology, where he joined the faculty in 1994.

The School of Interactive Computing is an academic unit located within the College of Computing at the Georgia Institute of Technology. It conducts both research and teaching activities related to interactive computing at the undergraduate and graduate levels. These activities focus on computing's interaction with users and the environment, as well as how computers impact the quality of people's lives.

Aaron F Bobick is dean of the McKelvey School of Engineering at Washington University in St. Louis. Bobick’s research is in the field of artificial intelligence and computer vision. He has chaired and published papers in top-tier academic conferences in these areas. His research and expert opinions on technology have also been reported in major news sources.

Informatics is the study of computational systems. According to the ACM Europe Council and Informatics Europe, informatics is synonymous with computer science and computing as a profession, in which the central notion is transformation of information. In other countries, the term "informatics" is used with a different meaning in the context of library science, in which case it is synonymous with data storage and retrieval.

Computational journalism can be defined as the application of computation to the activities of journalism such as information gathering, organization, sensemaking, communication and dissemination of news information, while upholding values of journalism such as accuracy and verifiability. The field draws on technical aspects of computer science including artificial intelligence, content analysis, visualization, personalization and recommender systems as well as aspects of social computing and information science.

Multilinear principal component analysis (MPCA) is a multilinear extension of principal component analysis (PCA). MPCA is employed in the analysis of M-way arrays, i.e. a cube or hyper-cube of numbers, also informally referred to as a "data tensor". M-way arrays may be modeled by

<span class="mw-page-title-main">Hanspeter Pfister</span> Swiss computer scientist

Hanspeter Pfister is a Swiss computer scientist. He is the An Wang Professor of Computer Science at the Harvard John A. Paulson School of Engineering and Applied Sciences and an affiliate faculty member of the Center for Brain Science at Harvard University. His research in visual computing lies at the intersection of scientific visualization, information visualization, computer graphics, and computer vision and spans a wide range of topics, including biomedical image analysis and visualization, image and video analysis, and visual analytics in data science.

<span class="mw-page-title-main">Gregory D. Hager</span> American computer scientist

Gregory D. Hager is the Mandell Bellmore Professor of Computer Science and founding director of the Johns Hopkins Malone Center for Engineering in Healthcare at Johns Hopkins University.

Jiebo Luo is a Chinese-American computer scientist, the Albert Arendt Hopeman Professor of Engineering and Professor of Computer Science at the University of Rochester. He is interested in artificial intelligence, data science and computer vision.

Wolfgang Heidrich is a German-Canadian computer scientist and Professor at the King Abdullah University of Science and Technology (KAUST), for which he served as the director of Visual Computing Center from 2014 to 2021. He was previously a professor at the University of British Columbia (UBC), where he was a Dolby Research Chair (2008-2013). His research has combined methods from computer graphics, optics, machine vision, imaging, inverse methods, and perception to develop new Computational Imaging and Display technologies. His more recent interest focuses on hardware-software co-design of the next generation of imaging systems, with applications such as high dynamic range (HDR) imaging, compact computational cameras, hyper-spectral cameras, wavefront sensors, to name just a few.

References

  1. Georgia Tech Directory for Irfan Essa
  2. Georgia Tech Machine Learning Center Directory
  3. Essa's MIT Alumni Page
  4. "Laugh and Your Computer Will Laugh With You, Someday (Published 1997)". The New York Times . Archived from the original on 2015-10-16.
  5. Classes Taught by Professor Essa
  6. Computational Photography by Irfan Essa
  7. Georgia Tech Explores the Digital Future of Journalism
  8. "Computational Journalism Seminar". Archived from the original on 2013-06-26. Retrieved 2013-04-29.
  9. Video Stabilization - Google Research