Roberto Pieraccini

Last updated
Roberto Pieraccini
Roberto-Pieraccini-portrai-2021.jpg
Born (1955-11-15) November 15, 1955 (age 68)
Genova, Italy
NationalityUS, Italian
Scientific career
Fields Speech recognition, spoken dialog systems, natural language understanding, multimodal interaction
Website robertopieraccini.com

Roberto Pieraccini (born 15 November 1955 in Genoa, Italy) is an Italian and US electrical engineer working in the field of speech recognition, natural language understanding, and spoken dialog systems. He has been an active contributor to speech language research and technology since 1981. He is currently the Chief Scientist of Uniphore, a conversational automation technology company.

Contents

Education

He obtained a degree in electrical engineering from the University of Pisa in 1980 with a thesis on the equalization of data channels.

Career

After his graduation, between 1981 and 1989 he worked at CSELT (Centro Studi e Laboratori Telecomunicazioni), the then Italian telephone company's research center, at Bell Labs (Murray Hill, NJ) between 1990 and 1995, and AT&T Labs (Florham Park, NJ) between 1995 and 1999. In 1999 he was Director of the Natural Dialog group at SpeechWorks International until the company was acquired by Scansoft in 2003, and then held a position of manager for the Advanced Conversational Technologies department at IBM Research (Thomas J. Watson Research Center, Yorktown Heights, NY) from 2003 and 2005. He served as the Chief Technology Officer at SpeechCycle from 2005 to 2011. Between 2012 and 2013 he was the Director of the International Computer Science Institute. [1] [2] Between March 2014 and December 2017 he was at Jibo, Inc. as its Director of Advanced Conversational Technologies. He joined Google Zurich in March 2018, and Google New York in July 2022 as a Director of Engineering for the Natural Language Processing Team in the Google Assistant. In May 2023 he joined Uniphore as their Chief Scientist [3]

He was the elected Chair of the IEEE Speech and Language Technical Committee (SLTC) between 2007 and 2008, and on the board of several international conferences and events. He was a member of the editorial boards of the IEEE Signal Processing Magazine and of the International Journal of Speech Technology. He was also the general co-chair of the SIGdial Conference on Dialog and Discourse, held in London in September 2009, and the general technical program chair of Interspeech 2011 held in Florence, Italy, in August 2011. During his career he authored more than 120 articles, book chapters, and conference publications [4] in the fields of speech recognition, language modeling, optical character recognition, and dialog systems. [5] He was elevated to the grade of Fellow of IEEE in 2010 for contributions to statistical natural language understanding and spoken dialog management and learning. [6] He is also a Fellow of ISCA, [7] the International Speech Communication Association.

Books

He is the author of The Voice in the Machine, [8] published by MIT Press in 2011, a general audience book on the history, technology, and the business of computers that understand speech. In September 2021, still with MIT Press, he published AI Assistants, [9] an accessible account of the recent evolution of virtual digital assistant like Siri, Amazon Alexa, and the Google Assistant.

Honors and awards

He is the recipient of PrimiDieci USA 2016, [10] an award sponsored by the Italian-American Chamber of Commerce and recognizing, every year, 10 prominent Italian-Americans in fields such as science, technology, and art.

On December 10, 2019, he received a Doctor in Science honorary degree [11] from the Heriot Watt University of Edinburgh.

Related Research Articles

<span class="mw-page-title-main">Charles E. Leiserson</span> American computer scientist

Charles Eric Leiserson is a computer scientist and professor at Massachusetts Institute of Technology (M.I.T.). He specializes in the theory of parallel computing and distributed computing.

<span class="mw-page-title-main">Andrew Blake (scientist)</span> British scientist

Andrew Blake FREng, FRS, is a British scientist, former laboratory director of Microsoft Research Cambridge and Microsoft Distinguished Scientist, former director of the Alan Turing Institute, Chair of the Samsung AI Centre in Cambridge, honorary professor at the University of Cambridge, Fellow of Clare Hall, Cambridge, and a leading researcher in computer vision.

<span class="mw-page-title-main">Justine Cassell</span> American linguist, professor and human-computer interaction researcher

Justine M. Cassell is an American professor and researcher interested in human-human conversation, human-computer interaction, and storytelling. Since August 2010, she has been on the faculty of the Carnegie Mellon Human Computer Interaction Institute (HCII) and the Language Technologies Institute, with courtesy appointments in Psychology, and the Center for Neural Bases of Cognition. Cassell has served as the chair of the HCII, as associate vice-provost, and as Associate Dean of Technology Strategy and Impact for the School of Computer Science. She currently divides her time between Carnegie Mellon, where she now holds the Dean's Professorship in Language Technologies, and PRAIRIE, the Paris Institute on Interdisciplinary Research in AI, where she also holds the position of senior researcher at Inria Paris.

<span class="mw-page-title-main">Xuedong Huang</span> American computer scientist

Xuedong David Huang is a Chinese American computer scientist and technology executive who has made contributions to spoken language processing and artificial intelligence, including Azure AI Services. He is Zoom's chief technology officer after serving as Microsoft's Technical Fellow and Azure AI Chief Technology Officer for 30 years. Huang is a strong advocate of AI for Accessibility, and AI for Cultural Heritage.

<span class="mw-page-title-main">Alex Waibel</span> American computer scientist

Alexander Waibel is a professor of Computer Science at Carnegie Mellon University and Karlsruhe Institute of Technology. Waibel's research interests focus on speech recognition and translation and human communication signals and systems. Alex Waibel made pioneering contributions to speech translation systems, breaking down language barriers through cross-lingual speech communication. In fundamental research on machine learning, he is known for the Time Delay Neural Network (TDNN), the first Convolutional Neural Network (CNN) trained by gradient descent, using backpropagation. Alex Waibel introduced the TDNN in 1987 at ATR in Japan.

Nelson Harold Morgan is an American computer scientist and professor in residence (emeritus) of electrical engineering and computer science at the University of California, Berkeley. Morgan is the co-inventor of the Relative Spectral (RASTA) approach to speech signal processing, first described in a technical report published in 1991.

Victor Waito Zue is a Chinese American computer scientist and professor at Massachusetts Institute of Technology.

<span class="mw-page-title-main">Shrikanth Narayanan</span> Researcher

Shrikanth Narayanan is an Indian-American Professor at the University of Southern California. He is an interdisciplinary engineer–scientist with a focus on human-centered signal processing and machine intelligence with speech and spoken language processing at its core. A prolific award-winning researcher, educator, and inventor, with hundreds of publications and a number of acclaimed patents to his credit, he has pioneered several research areas including in computational speech science, speech and human language technologies, audio, music and multimedia engineering, human sensing and imaging technologies, emotions research and affective computing, behavioral signal processing, and computational media intelligence. His technical contributions cover a range of applications including in defense, security, health, education, media, and the arts. His contributions continue to impact numerous domains including in human health, national defense/intelligence, and the media arts including in using technologies that facilitate awareness and support of diversity and inclusion. His award-winning patents have contributed to the proliferation of speech technologies on the cloud and on mobile devices and in enabling novel emotion-aware artificial intelligence technologies.

<span class="mw-page-title-main">Pascale Fung</span> Professor

Pascale Fung (馮雁) is a professor in the Department of Electronic & Computer Engineering and the Department of Computer Science & Engineering at the Hong Kong University of Science & Technology(HKUST). She is the director of the newly established, multidisciplinary Centre for AI Research (CAiRE) at HKUST. She is an elected Fellow of the Institute of Electrical and Electronics Engineers (IEEE) for her “contributions to human-machine interactions”, an elected Fellow of the International Speech Communication Association for “fundamental contributions to the interdisciplinary area of spoken language human-machine interactions” and an elected Fellow of the Association for Computational Linguistics (ACL) for her “significant contributions toward statistical NLP, comparable corpora, and building intelligent systems that can understand and empathize with humans”.

<span class="mw-page-title-main">Larry Heck</span>

Larry Paul Heck is currently the Rhesa Screven Farmer, Jr., Advanced Computing Concepts Chair, Georgia Research Alliance Eminent Scholar, and professor at the Georgia Institute of Technology. His career spans many of the sub-disciplines of artificial intelligence, including conversational AI, speech recognition and speaker recognition, natural language processing, web search, online advertising and acoustics. He is probably best known for his role as the founder of the Microsoft Cortana Personal Assistant and his early work in deep learning for speech processing.

<span class="mw-page-title-main">Steve Young (software engineer)</span> British researcher (born 1951)

Stephen John Young is a British researcher, Professor of Information Engineering at the University of Cambridge and an entrepreneur. He is one of the pioneers of automated speech recognition and statistical spoken dialogue systems. He served as the Senior Pro-Vice-Chancellor of the University of Cambridge from 2009 to 2015, responsible for planning and resources. From 2015 to 2019, he held a joint appointment between his professorship at Cambridge and Apple, where he was a senior member of the Siri development team.

<span class="mw-page-title-main">John Makhoul</span> American computer scientist

John Makhoul is a Lebanese-American computer scientist who works in the field of speech and language processing. Dr. Makhoul's work on linear predictive coding was used in the establishment of the Network Voice Protocol, which enabled the transmission of speech signals over the ARPANET. Makhoul is recognized in the field for his vital role in the areas of speech and language processing, including speech analysis, speech coding, speech recognition and speech understanding. He has made a number of significant contributions to the mathematical modeling of speech signals, including his work on linear prediction, and vector quantization. His patented work on the direct application of speech recognition techniques for accurate, language-independent optical character recognition (OCR) has had a dramatic impact on the ability to create OCR systems in multiple languages relatively quickly.

Mari Ostendorf is a professor of electrical engineering in the area of speech and language technology and the vice provost for research at the University of Washington.

<span class="mw-page-title-main">Joseph Keshet</span> Israeli professor of Computer Science

Joseph (Yossi) Keshet is an Israeli professor in the Electrical and Computer Engineering Faculty of the Technion.

Lori Faith Lamel is a speech processing researcher known for her work with the TIMIT corpus of American English speech and for her work on voice activity detection, speaker recognition, and other non-linguistic inferences from speech signals. She works for the French National Centre for Scientific Research (CNRS) as a senior research scientist in the Spoken Language Processing Group of the Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur.

Abeer Alwan is an American electrical engineer and speech processing researcher. She is a professor of electrical and computer engineering in the UCLA Henry Samueli School of Engineering and Applied Science, and vice chair for undergraduate affairs in the Department of Electrical & Computer Engineering.

Ramalingam "Rama" Chellappa is a Bloomberg Distinguished Professor, who works at Johns Hopkins University. At Johns Hopkins University, he is a member of the Center for Language and Speech Processing, the Center for Imaging Science, the Institute for Assured Autonomy, and the Mathematical Institute for Data Sciences. He joined Johns Hopkins University after 29 years at The University of Maryland. Before that, he was an assistant, associate professor, and later, director, of the University of Southern California's Signal and Image Processing institute.

Chin-Hui Lee is an information scientist, best known for his work in speech recognition, speaker recognition and acoustic signal processing. He joined Georgia Institute of Technology in 2002 as a professor in the school of electrical and computer engineering

Yang Liu is a Chinese and American computer scientist specializing in speech processing and natural language processing, and a senior principal scientist for Amazon.

References

  1. Interview with Dr. Roberto Pieraccini, Director of the International Computer Science Institute (ICSI) at Berkeley, USA
  2. Roberto Pieraccini Named New Director of the International Computer Science Institute (ICSI)
  3. Uniphore Appoints Roberto Pieraccini as Chief Scientist
  4. Roberto Pieraccini's publication list
  5. "Roberto Pieraccini's list of publications".
  6. "IEEE 2010 Class of Newly Elevated Fellows". Institute of Electrical and Electronics Engineers (IEEE). Archived from the original on 2013-05-16.
  7. 2009 ISCA Fellows
  8. Pieraccini, Roberto (2012). The voice in the machine : building computers that understand speech. Cambridge, Mass.: MIT Press. ISBN   9780262301534. OCLC   784953196.
  9. Pieraccini, Roberto (2021). AI Assistants. Cambridge, Mass.: MIT Press. ISBN   9780262542555. OCLC   1182021119.
  10. PrimiDieci USA 2016: il sistema delle eccellenze italiane
  11. Pioneer of conversational AI is awarded Honorary Degree