This article contains text that is written in a promotional tone .(April 2024) |
This article's use of external links may not follow Wikipedia's policies or guidelines.(April 2024) |
Alex Waibel | |
---|---|
Born | 2 May 1956 Heidelberg, Germany |
Awards | IEEE Senior Best Paper Award (1990), Allen Newell Award for Research Excellence (2002), Antonio Zampolli Prize (2014), Meta Prize (2011 & 2016), James L. Flanagan Speech and Audio Processing Award (2023) |
Academic background | |
Alma mater | Massachusetts Institute of Technology (BS), Carnegie Mellon University (MS, PhD) |
Doctoral advisor | Raj Reddy |
Academic work | |
Discipline | Computer Science |
Sub-discipline | Artificial Intelligence,Machine Learning,Deep Learning |
Institutions | Carnegie Mellon University,Karlsruhe Institute of Technology |
Notable students | Laurence Devillers |
Alexander Waibel (born 2 May 1956 in Heidelberg,Germany) is a professor of Computer Science at Carnegie Mellon University and Karlsruhe Institute of Technology. Waibel's research interests focus on speech recognition and translation [1] and human communication signals and systems. [2] Alex Waibel made pioneering contributions to speech translation systems,breaking down language barriers through cross-lingual speech communication. In fundamental research on machine learning,he is known for the Time Delay Neural Network (TDNN), [3] the first Convolutional Neural Network (CNN) trained by gradient descent,using backpropagation. [4] Alex Waibel introduced the TDNN in 1987 at ATR in Japan.
The BBC quoted Alex Waibel on his motivation:"We don't want to look things up in dictionaries –so I wanted to build a machine to translate speech." [5]
Alex Waibel spent part of his schooling in Barcelona,before entering the humanistisches Gymnasium in Ludwigshafen. From 1976 to 1979,he studied Electrical Engineering and Computer Science at the Massachusetts Institute of Technology. His Bachelor thesis (1979) was presented with a Guillamin award for the best undergraduate thesis. In the same year,he joined Carnegie Mellon University, [6] where he received an MS degree in 1980 and PhD degrees in computer science and cognitive science in 1986. [7]
Waibel is the director of interACT, [8] the International Center for Advanced Communication Technologies. He was one of the founders of C-STAR, [9] an international consortium for speech translation research,and served as its chairman from 1998 to 2000. In 2003,C-STAR evolved into IWSLT,the International Conference on Spoken Language Translation. Waibel has served as Chairman of its Steering Committee since its inception. He directed and coordinated several multisite research programs in Europe and the US,including the CHIL program [10] (FP-6 Integrated Project on multimodality) in Europe and NSF-ITR project STR-DUST (the first domain independent speech translation project) in the U.S. He was project coordinator of the IP EU-BRIDGE, [11] funded by the EC (2012-2014).
In C-STAR,his team developed the JANUS [12] speech translation system,the first American and European Speech Translation system,and in 2005,the first real-time simultaneous speech translation system for lectures. His lab has also developed several multimodal systems including perceptual meeting rooms,meeting recognizers,meeting browsers,and multimodal dialog systems for humanoid robots.
From 2019 to 2023,he directed OML (Organic Machine Learning) funded by the Federal Ministry of Education and Research (Germany),a fundamental research project to develop incremental and interactive machine learning,aiming to help AI better handle surprise in language and robotics.
In the areas of speech,speech translation,and multimodal interfaces Waibel holds several patents [13] and has founded and co-founded successful commercial ventures. He was the founder and chairman of Mobile Technologies,LLC,maker of the Jibbigo mobile speech-to-speech app to translate speech on a phone.
In 2005,Waibel unveiled the world's first automatic simultaneous translation service at a press conference at Carnegie Mellon University [14] stating that "the lecture translator automatically records,transcribes and translates the speech of a lecturer in real-time,and students can follow the lecture in their language on their PC or mobile phone." [15] Deployed in 2012 as a production[ clarification needed ] [16] and service [17] it serves foreign students as the pioneering service of its kind.
In 2013,Jibbigo was acquired by Facebook Inc. and Waibel joined the company to start the Language Technology Group which would eventually become part of Facebook's broader Applied Machine Learning efforts. [18] He was cofounder and director at Multimodal Technologies,Inc. [19] and at M*Modal,specializing on medical records.
In 2015,he cofounded KITES GmbH,to deploy simultaneous speech translation services to Universities and to the European Parliament. KIT and KITES demonstrated first automatic interpreting services at the European Parliament in 2012 [20] . KITES was acquired by Zoom in 2021 and now delivers automatic subtitling and simultaneous translation during Zoom video conferencing calls. Waibel serves as Research Fellow at Zoom and on Advisory Boards in related enterprises.
In October 2018,Waibel closed out a successful legal case against Wikimedia Foundation citing German libel laws. [21]
Alex Waibel is a recipient of the IEEE Senior Best Paper Award for work on the TDNN (1990). He was awarded the Alcatel-SEL "Forschungspreis Technische Kommunikation" in 1994 for his work on computer speech translation systems. In 2002,he received the Allen Newell Award for Research Excellence and the Meta Prize in 2011 with Jibbigo Mobile Translators for outstanding mobile voice translators bringing speech translation to mobile devices. In 2014,he was the recipient of the Antonio Zampolli Prize for "outstanding contributions to the advancement of Language Resources and Language Technology Evaluation within Human Language Technology",LREC. [22] With InterACT,he was awarded a second Meta Prize for "Outstanding and Innovative Contributions to Cross-Lingual and Multilingual Communication Technologies" (2016). He received the Sustained Accomplishment Award of the ACM-ICMI for his work on multimodal interfaces (2019). [23] In 2023,he became the 21st honoree to receive the IEEE James L. Flanagan Speech and Audio Processing Award for "pioneering work on speech translation and supporting technologies". [24]
Waibel has been a Life Fellow of the IEEE,a Fellow of the International Speech Communication Association (ISCA) and a Member of the National Academy of Sciences of Germany,Leopoldina [25] since 2017. In 2023,Waibel was inducted as Fellow into the Explorers Club citing aviation expeditions and deep sea exploration. [26]
Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR),computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science,linguistics and computer engineering fields. The reverse process is speech synthesis.
Carnegie Mellon University (CMU) is a private research university in Pittsburgh,Pennsylvania. The institution was established in 1900 by Andrew Carnegie as the Carnegie Technical Schools. In 1912,it became the Carnegie Institute of Technology and began granting four-year degrees. In 1967,it became Carnegie Mellon University through its merger with the Mellon Institute of Industrial Research,founded in 1913 by Andrew Mellon and Richard B. Mellon and formerly a part of the University of Pittsburgh.
The School of Computer Science (SCS) at Carnegie Mellon University in Pittsburgh,Pennsylvania,US is a school for computer science established in 1988. It has been consistently ranked among the top computer science programs over the decades. As of 2022 U.S. News &World Report ranks the graduate program as tied for second with Stanford University and University of California,Berkeley. It is ranked second in the United States on Computer Science Open Rankings,which combines scores from multiple independent rankings.
Dabbala Rajagopal "Raj" Reddy is an Indian-American computer scientist and a winner of the Turing Award. He is one of the early pioneers of artificial intelligence and has served on the faculty of Stanford and Carnegie Mellon for over 50 years. He was the founding director of the Robotics Institute at Carnegie Mellon University. He was instrumental in helping to create Rajiv Gandhi University of Knowledge Technologies in India,to cater to the educational needs of the low-income,gifted,rural youth. He was the founding chairman of International Institute of Information Technology,Hyderabad. He is the first person of Asian origin to receive the Turing Award,in 1994,known as the Nobel Prize of Computer Science,for his work in the field of artificial intelligence.
Carnegie Mellon Silicon Valley is a degree-granting branch campus of Carnegie Mellon University located in the heart of Silicon Valley in Mountain View,California. It was established in 2002 at the NASA Ames Research Center in Moffett Field.
Multimodal interaction provides the user with multiple modes of interacting with a system. A multimodal interface provides several distinct tools for input and output of data.
Randal E. Bryant is an American computer scientist and academic noted for his research on formally verifying digital hardware and software. Bryant has been a faculty member at Carnegie Mellon University since 1984. He served as the Dean of the School of Computer Science (SCS) at Carnegie Mellon from 2004 to 2014. Dr. Bryant retired and became a Founders University Professor Emeritus on June 30,2020.
Alan W Black is a Scottish computer scientist,known for his research on speech synthesis. He is a professor in the Language Technologies Institute at Carnegie Mellon University in Pittsburgh,Pennsylvania.
The Language Technologies Institute (LTI) is a research institute at Carnegie Mellon University in Pittsburgh,Pennsylvania,United States,and focuses on the area of language technologies. The institute is home to 33 faculty with the primary scholarly research of the institute focused on machine translation,speech recognition,speech synthesis,information retrieval,parsing,information extraction,and multimodal machine learning. Until 1996,the institute existed as the Center for Machine Translation,which was established in 1986. Subsequently,from 1996 onwards,it started awarding degrees,and the name was changed to The Language Technologies Institute. The institute was founded by Professor Jaime Carbonell,who served as director until his death in February 2020. He was followed by Jamie Callan,and then Carolyn Rosé,as interim directors. In August 2023,Mona Diab became the director of the institute.
Jaime Guillermo Carbonell was a computer scientist who made seminal contributions to the development of natural language processing tools and technologies. His extensive research in machine translation resulted in the development of several state-of-the-art language translation and artificial intelligence systems. He earned his B.S. degrees in Physics and in Mathematics from MIT in 1975 and did his Ph.D. under Dr. Roger Schank at Yale University in 1979. He joined Carnegie Mellon University as an assistant professor of computer science in 1979 and lived in Pittsburgh from then. He was affiliated with the Language Technologies Institute,Computer Science Department,Machine Learning Department,and Computational Biology Department at Carnegie Mellon.
Xuedong David Huang is a Chinese American computer scientist and technology executive who has made contributions to spoken language processing and artificial intelligence,including Azure AI Services. He is Zoom's chief technology officer after serving as Microsoft's Technical Fellow and Azure AI Chief Technology Officer for 30 years. Huang is a strong advocate of AI for Accessibility,and AI for Cultural Heritage.
Louis-Philippe Morency is a French Canadian researcher interested in human communication and machine learning applied to a better understanding of human behavior.
Bin He is a Chinese American biomedical engineering scientist. He is the Trustee Professor of the Department of Biomedical Engineering,professor by courtesy in the Department of Electrical and Computer Engineering,and Professor of Neuroscience Institute,and was the head of the department of Biomedical Engineering at Carnegie Mellon University. Prior,he was Distinguished McKnight University Professor of Biomedical Engineering and Medtronic-Bakken Endowed Chair for Engineering in Medicine at the University of Minnesota. He previously served as the director of the Institute for Engineering in Medicine and the Center for Neuroengineering at the University of Minnesota. He was the Editor in Chief of the IEEE Transactions on Biomedical Engineering and serves as the editor in chief of IEEE Reviews in Biomedical Engineering. He was the president of the IEEE Engineering in Medicine &Biology Society (EMBS) from 2009 to 2010 and chair of International Academy of Medical and Biological Engineering from 2018 to 2021.
Time delay neural network (TDNN) is a multilayer artificial neural network architecture whose purpose is to 1) classify patterns with shift-invariance,and 2) model context at each layer of the network.
Jibbigo was a mobile offline language translation application that was developed by Mobile Technologies,LLC and Dr. Alex Waibel,a professor at Carnegie Mellon. Jibbigo is an offline voice translator and does not need phone or data connectivity to function. Spanish-English Jibbigo was released in September,2009 as the first offline Speech Translation application. The company has since expanded its offerings to include ten language pairs sold on both Apple's App Store and Google Play.
Eric Poe Xing is an American computer scientist whose research spans machine learning,computational biology,and statistical methodology. Xing is founding President of the world’s first artificial intelligence university,Mohamed bin Zayed University of Artificial Intelligence (MBZUAI).
Shrikanth Narayanan is an Indian-American Professor at the University of Southern California. He is an interdisciplinary engineer–scientist with a focus on human-centered signal processing and machine intelligence with speech and spoken language processing at its core. A prolific award-winning researcher,educator,and inventor,with hundreds of publications and a number of acclaimed patents to his credit,he has pioneered several research areas including in computational speech science,speech and human language technologies,audio,music and multimedia engineering,human sensing and imaging technologies,emotions research and affective computing,behavioral signal processing,and computational media intelligence. His technical contributions cover a range of applications including in defense,security,health,education,media,and the arts. His contributions continue to impact numerous domains including in human health,national defense/intelligence,and the media arts including in using technologies that facilitate awareness and support of diversity and inclusion. His award-winning patents have contributed to the proliferation of speech technologies on the cloud and on mobile devices and in enabling novel emotion-aware artificial intelligence technologies.
Hideto Tomabechi is a Japanese cognitive scientist computer scientist.
Lawrence Pileggi is the Coraluppi Head and Tanoto Professor of Electrical and Computer Engineering at Carnegie Mellon University. He is a specialist in the automation of integrated circuits,and developing software tools for the optimization of power grids. Pileggi's research has been cited thousands of times in engineering papers.
Gerald Penn is an American computer scientist specializing in mathematical linguistics and speech processing. He is a Professor of Computer Science at the University of Toronto,a senior member of IEEE and AAAI,and a past chair of Association for Mathematics of Language.
{{cite web}}
: CS1 maint: multiple names: authors list (link){{cite web}}
: CS1 maint: multiple names: authors list (link){{cite web}}
: CS1 maint: multiple names: authors list (link)