Krishnendu Chaudhury

Last updated

Krishnendu Chaudhury is an American Technology leader (deep learning, computer vision), inventor, and author. [1] [2] He is the co-founder of Drishti Technologies in Palo Alto, CA. Currently, he is serving as the Chief Technology officer (CTO) of the company. Chaudhury is also the author of the book Mathematics and Architectures of Deep Learning. He has 34 patents [3] to his name and several publications in the field of computer vision and deep learning. [4] Over the course of his career, he has worked at Adobe Systems, Google, and Flipkart. [1] [2]

Contents

Early life and education

In 1994, he completed his Ph.D. in computer science in computer vision and image processing from the University of Kentucky. [5]

Career

In 1995, Chaudhury started working for Adobe Systems as a senior computer scientist. He worked with Adobe for ten years working on Advanced Technology Group and Postscript Core Technology, working in different locations, including San Francisco and San Jose. [1] [2]

In 2005, he joined Google. [1] [2] He worked on image processing and computer vision projects. He worked on several projects that revolved around machine learning and computer vision, including Google's newspaper search feature, which was launched in 2008 on Google's 10th birthday. The product allows readers to go through newspaper archives from hundreds of newspapers and thousands of issues from the 1800s and early 1900s. [6] Chaudhury also managed Google's version of image compression called "WebP," which was launched in October 2010. He worked on the auto-rectification of photos at Google Photos, devising a complex mathematical way to restore parallelism of lines inside an image. He also worked on an early version of a face recognition-based login for Android. [7] After 10 years of service, he left Google and joined Flipkart.

In 2015, Chaudhury joined Flipkart as a Principal Scientist and Head of Image Sciences, handling projects related to computer vision and deep learning. [8] He worked on deep learning in the e-commerce visual search and recommendation systems. Chaudhury led the computer vision and image processing technology team across Flipkart's platforms. [8] His team led the first visual recommendation engine, which would recommend visually similar products. His notable works for Flipkart include Deep Learning based Visually Similar recommendations for Flipkart: A new Convolutional Neural Net (CNN) architecture for visual feature embeddings, [9] and Region CNN-based e-commerce product localization on wild images: Identifying Flipkart apparel similar to the ones worn by models/celebrities/friends in wild images. [10]

In 2017, he co-founded Drishti Technologies in Palo Alto, CA, with Prasad Akella and serial entrepreneur Ashish Gupta. [1] Drishti Technologies works on AI-powered video analytics and video traceability. [11] Drishti is a Silicon Valley AI startup funded by VCs including Andreessen Horowitz, Emergence Capital, and Benhamou Global Ventures. [12] Chaudhury is also on the Technical Advisory Committee of Benhamou Global Ventures since 2017. He holds the position of CTO at Drishti Technologies, working on Applying Deep Learning and Computer Vision. [2]

Selected publications

Selected patents

Books

In 2020, Chaudhury authored Math and Architectures for Deep Learning which was published by Manning Publications, New York. [13] The book demonstrates the mathematical examples of deep learning and narrows the gap between Math and Architectures of Deep Learning, defining both theory and practice, laying out the math of deep learning side by side with practical implementations in Python and PyTorch.

Awards and recognition

In 2022, Chaudhury was included in Best Startup magazine's list of "California's 101 Top CTO's in the Machine Learning Space ". [14]

Related Research Articles

<span class="mw-page-title-main">Computer vision</span> Computerized information extraction from images

Computer vision tasks include methods for acquiring, processing, analyzing and understanding digital images, and extraction of high-dimensional data from the real world in order to produce numerical or symbolic information, e.g. in the forms of decisions. Understanding in this context means the transformation of visual images into descriptions of the world that make sense to thought processes and can elicit appropriate action. This image understanding can be seen as the disentangling of symbolic information from image data using models constructed with the aid of geometry, physics, statistics, and learning theory.

<span class="mw-page-title-main">Machine vision</span> Technology and methods used to provide imaging-based automatic inspection and analysis

Machine vision (MV) is the technology and methods used to provide imaging-based automatic inspection and analysis for such applications as automatic inspection, process control, and robot guidance, usually in industry. Machine vision refers to many technologies, software and hardware products, integrated systems, actions, methods and expertise. Machine vision as a systems engineering discipline can be considered distinct from computer vision, a form of computer science. It attempts to integrate existing technologies in new ways and apply them to solve real world problems. The term is the prevalent one for these functions in industrial automation environments but is also used for these functions in other environment vehicle guidance.

Cognex Corporation is an American manufacturer of machine vision systems, software and sensors used in automated manufacturing to inspect and identify parts, detect defects, verify product assembly, and guide assembly robots. Cognex is headquartered in Natick, Massachusetts, USA and has offices in more than 20 countries.

Krishna Bharat is an Indian rapper and research scientist at Google Inc. He was formerly a founding adviser for Grokstyle Inc. a visual search company and Laserlike Inc., an interest search engine startup based on Machine Learning.

<span class="mw-page-title-main">Alex Fielding</span> American engineer and manager

Alex Fielding is an American engineer and manager. He is the CEO and co-founder of Privateer Space, a space startup with a global online marketplace that aims to connect customers seeking planetary data with orbiting satellites and AI. He co-founded the company with Apple co-founder Steve Wozniak and MacArthur Genius Moriba Jah. Privateer announced in 2023 that it had grown the business from the Google Maps of space to become the first AI powered space data ride sharing platform with an upcoming satellite autopilot system called Pono set to fly on SpaceX in December of 2023. The International Space Station National Labs, in partnership with Privateer announced a deal whereby Privateer publicly tracks and displays mission data on International Space Station telemetry, astronauts, and mission objectives live on the ISS National Labs website. He was co-founder and CEO of robotics company Ripcord, Inc from 2014 to 2021.

Object recognition – technology in the field of computer vision for finding and identifying objects in an image or video sequence. Humans recognize a multitude of objects in images with little effort, despite the fact that the image of the objects may vary somewhat in different view points, in many different sizes and scales or even when they are translated or rotated. Objects can even be recognized when they are partially obstructed from view. This task is still a challenge for computer vision systems. Many approaches to the task have been implemented over multiple decades.

<span class="mw-page-title-main">Dan Quine</span> British computer scientist

Daniel Nicholas Quine is a computer scientist, currently VP Engineering at AltSchool.

<span class="mw-page-title-main">Reverse image search</span> Content-based image retrieval

Reverse image search is a content-based image retrieval (CBIR) query technique that involves providing the CBIR system with a sample image that it will then base its search upon; in terms of information retrieval, the sample image is very useful. In particular, reverse image search is characterized by a lack of search terms. This effectively removes the need for a user to guess at keywords or terms that may or may not return a correct result. Reverse image search also allows users to discover content that is related to a specific sample image or the popularity of an image, and to discover manipulated versions and derivative works.

<span class="mw-page-title-main">Optical head-mounted display</span> Type of wearable device

An optical head-mounted display (OHMD) is a wearable device that has the capability of reflecting projected images as well as allowing the user to see through it. In some cases, this may qualify as augmented reality (AR) technology. OHMD technology has existed since 1997 in various forms, but despite a number of attempts from industry, has yet to have had major commercial success.

<span class="mw-page-title-main">Peter Karow</span> German entrepreneur

Peter Karow is a German entrepreneur, inventor and software developer. He holds several patents in the field of desktop publishing and is known for his work on computer fonts. He contributed with several books and patents to the development of operating systems for computers. He is recognized as the inventor of outline computer fonts.

<span class="mw-page-title-main">Fei-Fei Li</span> American computer scientist (born 1976)

Fei-Fei Li is an American computer scientist, recognized as the GodMother of AI, who was born in China and is known for establishing ImageNet, the dataset that enabled rapid advances in computer vision in the 2010s.She is the Sequoia Capital Professor of Computer Science at Stanford University and former board director at Twitter. Li is a Co-Director of the Stanford Institute for Human-Centered Artificial Intelligence, and a Co-Director of the Stanford Vision and Learning Lab. She served as the director of the Stanford Artificial Intelligence Laboratory (SAIL) from 2013 to 2018.

<span class="mw-page-title-main">Subhasis Chaudhuri</span>

Subhasis Chaudhuri is an Indian electrical engineer and the director at the Indian Institute of Technology, Bombay. He is a former K. N. Bajaj Chair Professor of the Department of Electrical Engineering of IIT Bombay. He is known for his pioneering studies on computer vision and is an elected fellow of all the three major Indian science academies viz. the National Academy of Sciences, India, Indian Academy of Sciences, and Indian National Science Academy. He is also a fellow of Institute of Electrical and Electronics Engineers, and the Indian National Academy of Engineering. The Council of Scientific and Industrial Research, the apex agency of the Government of India for scientific research, awarded him the Shanti Swarup Bhatnagar Prize for Science and Technology, one of the highest Indian science awards, in 2004 for his contributions to Engineering Sciences.

Dorin Comaniciu is a Romanian-American computer scientist. He is the Senior Vice President of Artificial Intelligence and Digital Innovation at Siemens Healthcare.

Lina J. Karam is a Lebanese-American electrical and computer engineer and inventor. She is an IEEE Fellow. Her areas of work span digital signal processing, image/video processing, compression/coding and transmission, computer vision, machine learning/deep learning, perceptual-based visual processing, and automated mobility. She served as an expert delegate of the ISO/IEC JTC1/SC29 Committee and participated in JPEG/MPEG standardization activities. She served as expert consultant in matters related to Intellectual Property (IP)/Patent Litigation, Image/Video Compression and Streaming, Image/Video Processing, Computer Vision, Machine Learning, and Autonomous Driving.

Ashutosh Saxena is an Indian-American computer scientist, researcher, and entrepreneur known for his contributions to the field of artificial intelligence and robotics. His research interests include deep learning, robotics, and 3-dimensional computer vision. Saxena is the co-founder and CEO of Caspar.AI, which is an artificial intelligence company that automates peoples' homes and builds applications such as fall detectors for senior living. Prior to Caspar.AI, Ashutosh co-founded Cognical Katapult, which provides a no credit required alternative to traditional financing for online and omni-channel retail. Before Katapult, Saxena was an assistant professor in the Computer Science Department and faculty director of the RoboBrain Project at Cornell University.

Jiebo Luo is a Chinese-American computer scientist, the Albert Arendt Hopeman Professor of Engineering and Professor of Computer Science at the University of Rochester. He is interested in artificial intelligence, data science and computer vision.

Jannick Rolland is the Brian J. Thompson Professor of Optical Engineering at the Institute of Optics at the University of Rochester. She is also the co-founder and CTO of LighTopTech, a women-owner business founded in 2013 to create medical imaging technologies with biomimetic noninvasive imaging technology. At the University of Rochester, she is the Director of the NSF I/UCRC Center for Freeform Optics (CeFO). She is also the Director of the R.E. Hopkins Center for Optical Design and Engineering that engages undergraduates in optical design, fabrication, and metrology.

<span class="mw-page-title-main">Anwar Chitayat</span>

Anwar Chitayat is the founder and former CEO and chairman of Anorad Corp., which was acquired in 1998 by Rockwell Automation. Mr. Chitayat holds over 95 patents in Electronics, Semiconductors and Automation including Nanotechnology, Interferometry and Linear motors. His achievements in High technology were honored by SEMI at their highest honor for Lifetime Achievement, reserved for individuals who repeatedly enable and lead the technology industry throughout their professional career. In the year 1997, Anwar was awarded the Entrepreneur of the year award by Ernst and Young, and in the year 2009, Anwar was inducted to Long Island Hall of Fame for his impacts on science and technology on Long Island.

<span class="mw-page-title-main">Thiru Vikram</span> Inventor and co-founding CEO of Buffalo Automation

Thiru Vikram is an inventor, engineer and entrepreneur, who is the CEO of Buffalo Automation, an artificial intelligence company headquartered in Buffalo, New York, that provides autonomous navigation technology for commercial ships and recreational boats.

Video matting is a technique for separating the video into two or more layers, usually foreground and background, and generating alpha mattes which determine blending of the layers. The technique is very popular in video editing because it allows to substitute the background, or process the layers individually.

References

  1. 1 2 3 4 5 Cai, Kenrick. "AI Manufacturing Startup Drishti Raises $25 Million To Go Global With Its Factory Floor Analytics". Forbes. Retrieved 2023-08-29.
  2. 1 2 3 4 5 Feldman, Amy. "How This Manufacturing-Automation Startup Signed Up Auto-Parts Giant Denso For Tech That Helps Humans Work Smarter". Forbes. Retrieved 2023-08-29.
  3. 1 2 "CHAUDHURY; Krishnendu Patent Filings". uspto.report. Retrieved 2023-08-29.
  4. "Krishnendu Chaudhury". scholar.google.com. Retrieved 2023-08-29.
  5. "Minutes of the University of Kentucky Board of Trustees, 1994-04-may3". exploreuk.uky.edu. Retrieved 2023-08-29.
  6. Chaudhury, Krishnendu; Jain, Ankur; Thirthala, Sriram; Sahasranaman, Vivek; Saxena, Shobhit; Mahalingam, Selvam (July 2009). "Google Newspaper Search - Image Processing and Analysis Pipeline". 2009 10th International Conference on Document Analysis and Recognition. pp. 621–625. doi:10.1109/ICDAR.2009.272. ISBN   978-1-4244-4500-4. S2CID   12254112.
  7. 1 2 Chaudhury, K.; Mehrotra, R. (1994-09-01). "Optical Flow Estimation Using Smoothness of Intensity Trajectories". CVGIP: Image Understanding. 60 (2): 230–244. doi:10.1006/ciun.1994.1049. ISSN   1049-9660.
  8. 1 2 "Flipkart hires Krishnendu Chaudhury to head Image Sciences". The Times of India. 2015-07-23. ISSN   0971-8257 . Retrieved 2023-08-29.
  9. Devashish, Shankar; Sujay, Narumanchi; A, Ananya, H; Pramod, Kompalli; Krishnendu, Chaudhury (2017-03-07). "Deep Learning based Large Scale Visual Recommendation and Search for E-Commerce". arXiv: 1703.02344 [cs.CV].{{cite arXiv}}: CS1 maint: multiple names: authors list (link)
  10. Shankar, Devashish; Narumanchi, Sujay; H A Ananya; Kompalli, Pramod; Chaudhury, Krishnendu (2017). "Deep Learning based Large Scale Visual Recommendation and Search for E-Commerce". arXiv: 1703.02344 [cs.CV].
  11. "Tech from Drishti's Bengaluru R&D team aiding firms close a 100-year-old data gap in manufacturing". The Times of India. 2021-09-28. ISSN   0971-8257 . Retrieved 2023-08-29.
  12. "History". Drishti. Retrieved 2023-08-30.
  13. Chaudhury, Krishnendu (2023-10-17). Math and Architectures of Deep Learning. Manning. ISBN   978-1-61729-648-2.
  14. Smith, Mark (2022-06-02). "Meet California's 101 Top CTO's in the Machine Learning Space". BestStartup.us. Retrieved 2023-08-30.