Krishnendu Chaudhury

Last updated September 10, 2024

Krishnendu Chaudhury is an American Technology leader (deep learning, computer vision), inventor, and author.^[1]^[2] He is the co-founder of Drishti Technologies in Palo Alto, CA. Currently, he is serving as the Chief Technology officer (CTO) of the company. Chaudhury is also the author of the book Mathematics and Architectures of Deep Learning. He has 34 patents ^[3] to his name and several publications in the field of computer vision and deep learning.^[4] Throughout his career, he has worked at Adobe Systems, Google, and Flipkart.^[1]^[2]

Early life and education

In 1994, Chaudhury completed his Ph.D. in computer science in computer vision and image processing from the University of Kentucky.^[5]

Career

In 1995, Chaudhury started working for Adobe Systems as a senior computer scientist. He worked with Adobe for ten years working on Advanced Technology Group and Postscript Core Technology, working in different locations, including San Francisco and San Jose.^[1]^[2]

In 2005, Chaudhury joined Google.^[1]^[2] He worked on image processing and computer vision projects. He also worked on several projects that revolved around machine learning and computer vision, including Google's newspaper search feature, which was launched in 2008 on Google's 10th birthday. The newspaper search feature allows readers to go through newspaper archives from hundreds of newspapers and thousands of issues from the 1800s and early 1900s.^[6] Chaudhury also managed Google's version of image compression called "WebP," which was launched in October 2010. He worked on the auto-rectification of photos at Google Photos, devising a complex mathematical way to restore the parallelism of lines inside an image. He also worked on an early version of a face recognition-based login for Android.^[7] After 10 years of service, he left Google.

In 2015, Chaudhury joined Flipkart as a Principal Scientist and Head of Image Sciences, handling projects related to computer vision and deep learning.^[8] He worked on deep learning in the e-commerce visual search and recommendation systems. Chaudhury led the computer vision and image processing technology team across Flipkart's platforms.^[8] His team led the first visual recommendation engine, which would recommend visually similar products. His notable works for Flipkart include Deep Learning based Visually Similar recommendations for Flipkart: A new Convolutional Neural Net (CNN) architecture for visual feature embeddings,^[9] and Region CNN-based e-commerce product localization on wild images: Identifying Flipkart apparel similar to the ones worn by models/celebrities/friends in wild images.^[10]

In 2017, he co-founded Drishti Technologies in Palo Alto, CA, with Prasad Akella and serial entrepreneur Ashish Gupta.^[1] Drishti Technologies works on AI-powered video analytics and video traceability.^[11] Drishti is a Silicon Valley AI startup funded by VCs including Andreessen Horowitz, Emergence Capital, and Benhamou Global Ventures.^[12] Chaudhury is also on the Technical Advisory Committee of Benhamou Global Ventures since 2017. He holds the position of CTO at Drishti Technologies, working on Applying Deep Learning and Computer Vision.^[2]

Selected publications

Loss and Gain of Optical Couplers with Collision Detection for CSMA/CD based LANs, A. Das, Krishnendu Chaudhury, IEEE International Conference on Communication Systems, ICCS-89, Singapore, Oct. 12-16, 1989
A Parallel Algorithm for 3D Point Pattern Matching, Krishnendu Chaudhury, R. Mehrotra, IEEE International Conference on Systems, Man and Cybernetics, Charlottesville, Virginia, Oct. 13-16 1991.
Optical Flow from an Extended Frame Sequence, Krishnendu Chaudhury, R. Mehrotra, International Conference on Robotics and Automation, Atlanta, Georgia, May 2–7, 1993.
"Detecting 3D Flow", Krishnendu Chaudhury, R. Mehrotra, International Conference on Robotics and Automation, San Diego, California, May 8–13, 1994.
"Optical Flow Estimation using Smoothness of Intensity Trajectories," Krishnendu Chaudhury, R. Mehrotra, CVGIP: Image Understanding journal, pp. 230–244, vol. 60, No. 2, September 1994.^[7]
"A Trajectory-based Computational Model for Optical Flow Computation," Krishnendu Chaudhury, R. Mehrotra, IEEE Transactions on Robotics and Automation journal, pp. 733–741, vol. 11, No. 5, Oct 1995
"Pose Estimation in Automated Visual Inspection using Genetic Algorithms," S. Hati, K. Chadhury, et al., International Journal of Neural Systems, Aug 1, 2006.
Chaudhury, Krishnendu; Jain, Ankur; Thirthala, Sriram; Sahasranaman, Vivek; Saxena, Shobhit; Mahalingam, Selvam (July 2009). "Google Newspaper Search - Image Processing and Analysis Pipeline". 2009 10th International Conference on Document Analysis and Recognition. pp. 621–625. doi:10.1109/ICDAR.2009.272. ISBN 978-1-4244-4500-4. S2CID 12254112.
"Auto-Rectification of User Photos," Krishnendu Chaudhury, Stephen DiVerdi, Sergey Ioffe, proc. of IEEE International Conference on Image Processing, ICIP 2014, Oct 27-30, 2014, Paris.
Mukherjee, Srayanta; Shankar, Devashish; Ghosh, Atin; Tathawadekar, Nilam; Kompalli, Pramod; Sarawagi, Sunita; Chaudhury, Krishnendu (March 16, 2018). "ARMDN: Associative and Recurrent Mixture Density Networks for eRetail Demand Forecasting". arXiv: 1803.03800 [cs.LG].

Selected patents

Chaudhury has registered more than 34 patents in the field of computer vision and deep learning.^[3]
"Almost Unsupervised Cycle and Action Detection," K. Chaudhury, A.Ashok, S. Narumanchi, D. Shankar, A. Mehra, Drishti Technologies, US Patent 20210216777
"Deep Learning Cycle Detection from the video," K. Chaudhury, A.Ashok, S. Narumanchi, D. Shankar, Drishti Technologies, R. Jain, DRIL-P0013, US Patent 20210117684
"Traceability Systems and Methods," P.Akella, K. Chaudhury, A.Ashok, S. Narumanchi, D. Shankar, Drishti Technologies, US Patent 10890898B2
"Automatic Rectification of Distortions in images," Krishnendu Chaudhury, Stephen Diverdi, US Patent 9064309.
"Edge Aware Smoothing in Images," US Patent US8983188, Grayson Lang, Krishnendu Chaudhury
"Session Based Character Recognition for Document Reconstruction," D. Petrou, Krishnendu Chaudhury, S. Goschin, M. Bridges, US Patent 9424668.
"Liveness Detection in Face Recognition-based Authentication Systems," Krishnendu Chaudhury, Avani Devarasetty, U.S Patent US8856541.
"Automatic extraction of character ground truth data from images," Alessandro Bissacco, Krishnendu Chaudhury, U. S. Patent 8755595.
"Systems and Methods for Line Balancing," P.Akella, K. Chaudhury, et al., US Patent 11054811
"Handheld or Wearable Document Scanner for Google," Krishnendu Chaudhury, Lu Chen, David Petrou, Blaise Aguerra, US Patent 9852348.
"Segmenting Printed Media Pages into Articles," Krishnendu Chaudhury, A. Jain & S. Thirthala, U.S Patent 8290268, Google
"Identifying a Front Page in Media Material," Krishnendu Chaudhury et al., U.S Patent 8218913, Google
"Method and Apparatus for Enhancing Object Boundary Precision in Images," Krishnendu Chaudhury, Ashutosh Kulshreshtha, U.S patent 07813582, Google.
"Digital Image Archiving in Mobile Device System," Krishnendu Chaudhury, Ashutosh Garg, Arvind Saraf, Prasenjit Phukan, U.S Patent 7986843, Google.
"Complexity Based Adaptive Tiling for Transparency Flattener," Krish Chaudhury, Dejan Markovic, U.S Patent 7385727, Adobe Systems.
"A Cubic B Spline Intensity Re-sampling Based Method for Automatic Image Feature Embedding," Krish Chaudhury, Dejan Markovic, U.S. Patent 7734118, Adobe Systems.
"Scaling of Raster Images without Edge Blurring," Krish Chaudhury, Dejan Markovic, U.S. Patent 7817871, Adobe Systems.
"Author Signature for Legal Purposes," Jim Pravetz, Krish Chaudhury, Sunil Agrawal, U.S. Patent 7774608, Adobe Systems).
"Document Modification Detection and Prevention," Krish Chaudhury, Jim Pravetz, Sunil Agrawal, U.S. Patent 7735144, Adobe Systems.
"Document Digest Allowing Selective Changes to a PDF Document," Krish Chaudhury, Jim Pravetz, International Patent, Adobe Systems, PCT/US2004/015116).
"Dynamic Enabling of Functionality in Electronic Document Readers," Krish Chaudhury, Jim Pravetz, U.S. Patent 7278168, Adobe Systems.

Book

In 2020, Chaudhury authored Math and Architectures for Deep Learning which was published by Manning Publications, New York.^[13]

Awards and recognition

In 2022, Chaudhury was included in Best Startup magazine's list of "California's 101 Top CTO's in the Machine Learning Space ".^[14]

Related Research Articles

Computer vision tasks include methods for acquiring, processing, analyzing and understanding digital images, and extraction of high-dimensional data from the real world in order to produce numerical or symbolic information, e.g. in the forms of decisions. Understanding in this context means the transformation of visual images into descriptions of the world that make sense to thought processes and can elicit appropriate action. This image understanding can be seen as the disentangling of symbolic information from image data using models constructed with the aid of geometry, physics, statistics, and learning theory.

<span class="mw-page-title-main">Machine vision</span> Technology and methods used to provide imaging-based automatic inspection and analysis

Machine vision is the technology and methods used to provide imaging-based automatic inspection and analysis for such applications as automatic inspection, process control, and robot guidance, usually in industry. Machine vision refers to many technologies, software and hardware products, integrated systems, actions, methods and expertise. Machine vision as a systems engineering discipline can be considered distinct from computer vision, a form of computer science. It attempts to integrate existing technologies in new ways and apply them to solve real world problems. The term is the prevalent one for these functions in industrial automation environments but is also used for these functions in other environment vehicle guidance.

A CAPTCHA is a type of challenge–response test used in computing to determine whether the user is human in order to deter bot attacks and spam.

Cognex Corporation is an American manufacturer of machine vision systems, software and sensors used in automated manufacturing to inspect and identify parts, detect defects, verify product assembly, and guide assembly robots. Cognex is headquartered in Natick, Massachusetts, USA and has offices in more than 20 countries.

Laser guidance directs a robotics system to a target position by means of a laser beam. The laser guidance of a robot is accomplished by projecting a laser light, image processing and communication to improve the accuracy of guidance. The key idea is to show goal positions to the robot by laser light projection instead of communicating them numerically. This intuitive interface simplifies directing the robot while the visual feedback improves the positioning accuracy and allows for implicit localization. The guidance system may serve also as a mediator for cooperative multiple robots. Examples of proof-of-concept experiments of directing a robot by a laser pointer are shown on video. Laser guidance spans areas of robotics, computer vision, user interface, video games, communication and smart home technologies.

Krishna Bharat is an Indian research scientist at Google Inc. He was formerly a founding adviser for Grokstyle Inc. a visual search company and Laserlike Inc., an interest search engine startup based on Machine Learning.

Alex Fielding is an American engineer and manager. He is the CEO and co-founder of Privateer Space, a space startup with a global online marketplace that aims to connect customers seeking planetary data with orbiting satellites and AI. He co-founded the company with Apple co-founder Steve Wozniak and MacArthur Genius Moriba Jah. Privateer announced in 2023 that it had grown the business from the Google Maps of space to become the first AI powered space data ride sharing platform with an upcoming satellite autopilot system called Pono set to fly on SpaceX in December 2023. The International Space Station National Labs, in partnership with Privateer announced a deal whereby Privateer publicly tracks and displays mission data on International Space Station telemetry, astronauts, and mission objectives live on the ISS National Labs website. He was co-founder and CEO of robotics company Ripcord, Inc from 2014 to 2021.

Object recognition – technology in the field of computer vision for finding and identifying objects in an image or video sequence. Humans recognize a multitude of objects in images with little effort, despite the fact that the image of the objects may vary somewhat in different view points, in many different sizes and scales or even when they are translated or rotated. Objects can even be recognized when they are partially obstructed from view. This task is still a challenge for computer vision systems. Many approaches to the task have been implemented over multiple decades.

<span class="mw-page-title-main">Dan Quine</span> British computer scientist

Daniel Nicholas Quine is a computer scientist, currently VP Engineering at AltSchool.

<span class="mw-page-title-main">Reverse image search</span> Content-based image retrieval

Reverse image search is a content-based image retrieval (CBIR) query technique that involves providing the CBIR system with a sample image that it will then base its search upon; in terms of information retrieval, the sample image is very useful. In particular, reverse image search is characterized by a lack of search terms. This effectively removes the need for a user to guess at keywords or terms that may or may not return a correct result. Reverse image search also allows users to discover content that is related to a specific sample image or the popularity of an image, and to discover manipulated versions and derivative works.

Like.com was a price comparison service website that billed itself as a "visual search engine for products".

An optical head-mounted display (OHMD) is a wearable device that has the capability of reflecting projected images as well as allowing the user to see through it. In some cases, this may qualify as augmented reality (AR) technology. OHMD technology has existed since 1997 in various forms, but despite a number of attempts from industry, has yet to have had major commercial success.

Peter Karow is a German entrepreneur, inventor and software developer. He holds several patents in the field of desktop publishing and is known for his work on computer fonts. He contributed with several books and patents to the development of operating systems for computers. He is recognized as the inventor of outline computer fonts.

<span class="mw-page-title-main">Subhasis Chaudhuri</span>

Subhasis Chaudhuri is an Indian electrical engineer and former director at the Indian Institute of Technology, Bombay. He is a former K. N. Bajaj Chair Professor of the Department of Electrical Engineering of IIT Bombay. He is known for his pioneering studies on computer vision and is an elected fellow of all the three major Indian science academies viz. the National Academy of Sciences, India, Indian Academy of Sciences, and Indian National Science Academy. He is also a fellow of Institute of Electrical and Electronics Engineers, and the Indian National Academy of Engineering. The Council of Scientific and Industrial Research, the apex agency of the Government of India for scientific research, awarded him the Shanti Swarup Bhatnagar Prize for Science and Technology, one of the highest Indian science awards, in 2004 for his contributions to Engineering Sciences.

Ashutosh Saxena is an Indian-American computer scientist, researcher, and entrepreneur known for his contributions to the field of artificial intelligence and large-scale robot learning. His interests include building enterprise AI agents and embodied AI. Saxena is the co-founder and CEO of Caspar.AI, where generative AI parses data from ambient 3D radar sensors to predict 20+ health & wellness markers for pro-active patient care. Prior to Caspar.AI, Ashutosh co-founded Cognical Katapult, which provides a no credit required alternative to traditional financing for online and omni-channel retail. Before Katapult, Saxena was an assistant professor in the Computer Science Department and faculty director of the RoboBrain Project at Cornell University.

Jiebo Luo is a Chinese-American computer scientist, the Albert Arendt Hopeman Professor of Engineering and Professor of Computer Science at the University of Rochester. He is interested in artificial intelligence, data science and computer vision.

Jannick Rolland is the Brian J. Thompson Professor of Optical Engineering at the Institute of Optics at the University of Rochester. She is also the co-founder and CTO of LighTopTech, a women-owner business founded in 2013 to create medical imaging technologies with biomimetic noninvasive imaging technology. At the University of Rochester, she is the Director of the NSF I/UCRC Center for Freeform Optics (CeFO). She is also the Director of the R.E. Hopkins Center for Optical Design and Engineering that engages undergraduates in optical design, fabrication, and metrology.

Anwar Chitayat is the founder and former CEO and chairman of Anorad Corp., which was acquired in 1998 by Rockwell Automation. Mr. Chitayat holds over 95 patents in Electronics, Semiconductors and Automation including Nanotechnology, Interferometry and Linear motors. His achievements in High technology were honored by SEMI in 2000 at their highest honor for Lifetime Achievement, reserved for individuals who repeatedly enable and lead the technology industry throughout their professional career. In 1997, Anwar was awarded the Entrepreneur of the year award by Ernst and Young, and in 2009, Anwar was inducted to Long Island Hall of Fame for his impacts on science and technology on Long Island.

Thiru Vikram is an inventor, engineer and entrepreneur, who is the CEO of Buffalo Automation, a technology company headquartered in Buffalo, New York, that provides autonomous navigation technology for commercial ships and recreational boats.

Video matting is a technique for separating the video into two or more layers, usually foreground and background, and generating alpha mattes which determine blending of the layers. The technique is very popular in video editing because it allows to substitute the background, or process the layers individually.

References

1 2 3 4 5 Cai, Kenrick. "AI Manufacturing Startup Drishti Raises $25 Million To Go Global With Its Factory Floor Analytics". Forbes. Retrieved 2023-08-29.
1 2 3 4 5 Feldman, Amy. "How This Manufacturing-Automation Startup Signed Up Auto-Parts Giant Denso For Tech That Helps Humans Work Smarter". Forbes. Retrieved 2023-08-29.
1 2 "CHAUDHURY; Krishnendu Patent Filings". uspto.report. Retrieved 2023-08-29.
↑ "Krishnendu Chaudhury". scholar.google.com. Retrieved 2023-08-29.
↑ "Minutes of the University of Kentucky Board of Trustees, 1994-04-may3". exploreuk.uky.edu. Retrieved 2023-08-29.
↑ Chaudhury, Krishnendu; Jain, Ankur; Thirthala, Sriram; Sahasranaman, Vivek; Saxena, Shobhit; Mahalingam, Selvam (July 2009). "Google Newspaper Search - Image Processing and Analysis Pipeline". 2009 10th International Conference on Document Analysis and Recognition. pp. 621–625. doi:10.1109/ICDAR.2009.272. ISBN 978-1-4244-4500-4. S2CID 12254112.
1 2 Chaudhury, K.; Mehrotra, R. (1994-09-01). "Optical Flow Estimation Using Smoothness of Intensity Trajectories". CVGIP: Image Understanding. 60 (2): 230–244. doi:10.1006/ciun.1994.1049. ISSN 1049-9660.
1 2 "Flipkart hires Krishnendu Chaudhury to head Image Sciences". The Times of India. 2015-07-23. ISSN 0971-8257 . Retrieved 2023-08-29.
↑ Devashish, Shankar; Sujay, Narumanchi; A, Ananya, H; Pramod, Kompalli; Krishnendu, Chaudhury (2017-03-07). "Deep Learning based Large Scale Visual Recommendation and Search for E-Commerce". arXiv: 1703.02344 [cs.CV].{{cite arXiv}}: CS1 maint: multiple names: authors list (link)
↑ Shankar, Devashish; Narumanchi, Sujay; H A Ananya; Kompalli, Pramod; Chaudhury, Krishnendu (2017). "Deep Learning based Large Scale Visual Recommendation and Search for E-Commerce". arXiv: 1703.02344 [cs.CV].
↑ "Tech from Drishti's Bengaluru R&D team aiding firms close a 100-year-old data gap in manufacturing". The Times of India. 2021-09-28. ISSN 0971-8257 . Retrieved 2023-08-29.
↑ "History". Drishti. Retrieved 2023-08-30.
↑ Chaudhury, Krishnendu (2023-10-17). Math and Architectures of Deep Learning. Manning. ISBN 978-1-61729-648-2.
↑ Smith, Mark (2022-06-02). "Meet California's 101 Top CTO's in the Machine Learning Space". BestStartup.us. Retrieved 2023-08-30.

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[:1-1] 1 2 3 4 5 Cai, Kenrick. "AI Manufacturing Startup Drishti Raises $25 Million To Go Global With Its Factory Floor Analytics". Forbes. Retrieved 2023-08-29.

[:2-2] 1 2 3 4 5 Feldman, Amy. "How This Manufacturing-Automation Startup Signed Up Auto-Parts Giant Denso For Tech That Helps Humans Work Smarter". Forbes. Retrieved 2023-08-29.

[:3-3] 1 2 "CHAUDHURY; Krishnendu Patent Filings". uspto.report. Retrieved 2023-08-29.

[Krishnendu_Chaudhury-4] "Krishnendu Chaudhury". scholar.google.com. Retrieved 2023-08-29.

[5] "Minutes of the University of Kentucky Board of Trustees, 1994-04-may3". exploreuk.uky.edu. Retrieved 2023-08-29.

[:0-6] Chaudhury, Krishnendu; Jain, Ankur; Thirthala, Sriram; Sahasranaman, Vivek; Saxena, Shobhit; Mahalingam, Selvam (July 2009). "Google Newspaper Search - Image Processing and Analysis Pipeline". 2009 10th International Conference on Document Analysis and Recognition. pp. 621–625. doi:10.1109/ICDAR.2009.272. ISBN 978-1-4244-4500-4. S2CID 12254112.

[:4-7] 1 2 Chaudhury, K.; Mehrotra, R. (1994-09-01). "Optical Flow Estimation Using Smoothness of Intensity Trajectories". CVGIP: Image Understanding. 60 (2): 230–244. doi:10.1006/ciun.1994.1049. ISSN 1049-9660.

[:5-8] 1 2 "Flipkart hires Krishnendu Chaudhury to head Image Sciences". The Times of India. 2015-07-23. ISSN 0971-8257 . Retrieved 2023-08-29.

[:6-9] Devashish, Shankar; Sujay, Narumanchi; A, Ananya, H; Pramod, Kompalli; Krishnendu, Chaudhury (2017-03-07). "Deep Learning based Large Scale Visual Recommendation and Search for E-Commerce". arXiv: 1703.02344 [cs.CV].{{cite arXiv}}: CS1 maint: multiple names: authors list (link)

[10] Shankar, Devashish; Narumanchi, Sujay; H A Ananya; Kompalli, Pramod; Chaudhury, Krishnendu (2017). "Deep Learning based Large Scale Visual Recommendation and Search for E-Commerce". arXiv: 1703.02344 [cs.CV].

[11] "Tech from Drishti's Bengaluru R&D team aiding firms close a 100-year-old data gap in manufacturing". The Times of India. 2021-09-28. ISSN 0971-8257 . Retrieved 2023-08-29.

[12] "History". Drishti. Retrieved 2023-08-30.

[13] Chaudhury, Krishnendu (2023-10-17). Math and Architectures of Deep Learning. Manning. ISBN 978-1-61729-648-2.

[14] Smith, Mark (2022-06-02). "Meet California's 101 Top CTO's in the Machine Learning Space". BestStartup.us. Retrieved 2023-08-30.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]