Roberto Pieraccini

Roberto Pieraccini
Roberto Pieraccini
Born	November 15, 1955 (age 68); Genova, Italy
Nationality	US, Italian
	Scientific career
Fields	Speech recognition, spoken dialog systems, natural language understanding, multimodal interaction
Website	robertopieraccini.com

Last updated July 20, 2024

Roberto Pieraccini (born 15 November 1955 in Genoa, Italy) is an Italian and US electrical engineer working in the field of speech recognition, natural language understanding, and spoken dialog systems. He has been an active contributor to speech language research and technology since 1981. He is currently the Chief Scientist of Uniphore, a conversational automation technology company.

Education

He obtained a degree in electrical engineering from the University of Pisa in 1980 with a thesis on the equalization of data channels.

Career

After his graduation, between 1981 and 1989 he worked at CSELT (Centro Studi e Laboratori Telecomunicazioni), the then Italian telephone company's research center, at Bell Labs (Murray Hill, NJ) between 1990 and 1995, and AT&T Labs (Florham Park, NJ) between 1995 and 1999. In 1999 he was Director of the Natural Dialog group at SpeechWorks International until the company was acquired by Scansoft in 2003, and then held a position of manager for the Advanced Conversational Technologies department at IBM Research (Thomas J. Watson Research Center, Yorktown Heights, NY) from 2003 and 2005. He served as the Chief Technology Officer at SpeechCycle from 2005 to 2011. Between 2012 and 2013 he was the Director of the International Computer Science Institute.^[1]^[2] Between March 2014 and December 2017 he was at Jibo, Inc. as its Director of Advanced Conversational Technologies. He joined Google Zurich in March 2018, and Google New York in July 2022 as a Director of Engineering for the Natural Language Processing Team in the Google Assistant. In May 2023 he joined Uniphore as their Chief Scientist ^[3]

He was the elected Chair of the IEEE Speech and Language Technical Committee (SLTC) between 2007 and 2008, and on the board of several international conferences and events. He was a member of the editorial boards of the IEEE Signal Processing Magazine and of the International Journal of Speech Technology. He was also the general co-chair of the SIGdial Conference on Dialog and Discourse, held in London in September 2009, and the general technical program chair of Interspeech 2011 held in Florence, Italy, in August 2011. During his career he authored more than 120 articles, book chapters, and conference publications ^[4] in the fields of speech recognition, language modeling, optical character recognition, and dialog systems.^[5] He was elevated to the grade of Fellow of IEEE in 2010 for contributions to statistical natural language understanding and spoken dialog management and learning.^[6] He is also a Fellow of ISCA,^[7] the International Speech Communication Association.

Books

He is the author of The Voice in the Machine,^[8] published by MIT Press in 2011, a general audience book on the history, technology, and the business of computers that understand speech. In September 2021, still with MIT Press, he published AI Assistants,^[9] an accessible account of the recent evolution of virtual digital assistant like Siri, Amazon Alexa, and the Google Assistant.

Honors and awards

He is the recipient of PrimiDieci USA 2016,^[10] an award sponsored by the Italian-American Chamber of Commerce and recognizing, every year, 10 prominent Italian-Americans in fields such as science, technology, and art.

On December 10, 2019, he received a Doctor in Science honorary degree ^[11] from the Heriot Watt University of Edinburgh.

Related Research Articles

Charles Eric Leiserson is a computer scientist and professor at Massachusetts Institute of Technology (M.I.T.). He specializes in the theory of parallel computing and distributed computing.

The International Computer Science Institute (ICSI) is an independent, non-profit research organization located in Berkeley, California, United States. Since its founding in 1988, ICSI has maintained an affiliation agreement with the University of California, Berkeley, where several of its members hold faculty appointments.

Andrew Blake FREng, FRS, is a British scientist, former laboratory director of Microsoft Research Cambridge and Microsoft Distinguished Scientist, former director of the Alan Turing Institute, Chair of the Samsung AI Centre in Cambridge, honorary professor at the University of Cambridge, Fellow of Clare Hall, Cambridge, and a leading researcher in computer vision.

Justine M. Cassell is an American professor and researcher interested in human-human conversation, human-computer interaction, and storytelling. Since August 2010, she has been on the faculty of the Carnegie Mellon Human Computer Interaction Institute (HCII) and the Language Technologies Institute, with courtesy appointments in Psychology, and the Center for Neural Bases of Cognition. Cassell has served as the chair of the HCII, as associate vice-provost, and as Associate Dean of Technology Strategy and Impact for the School of Computer Science. She currently divides her time between Carnegie Mellon, where she now holds the Dean's Professorship in Language Technologies, and PRAIRIE, the Paris Institute on Interdisciplinary Research in AI, where she also holds the position of senior researcher at Inria Paris.

Xuedong David Huang is a Chinese American computer scientist and technology executive who has made contributions to spoken language processing and artificial intelligence, including Azure AI Services. He is Zoom's chief technology officer after serving as Microsoft's Technical Fellow and Azure AI Chief Technology Officer for 30 years. Huang is a strong advocate of AI for Accessibility, and AI for Cultural Heritage.

Alexander Waibel is a professor of Computer Science at Carnegie Mellon University and Karlsruhe Institute of Technology (KIT). Waibel’s research focuses on automatic speech recognition, translation and human-machine interaction. His work has introduced cross-lingual communication systems, such as consecutive and simultaneous interpreting systems on a variety of platforms. In fundamental research on machine learning, he is known for the Time Delay Neural Network (TDNN), the first Convolutional Neural Network (CNN) trained by gradient descent, using backpropagation. Alex Waibel introduced the TDNN in 1987 at ATR in Japan.

Nelson Harold Morgan is an American computer scientist and professor in residence (emeritus) of electrical engineering and computer science at the University of California, Berkeley. Morgan is the co-inventor of the Relative Spectral (RASTA) approach to speech signal processing, first described in a technical report published in 1991.

Victor Waito Zue is a Chinese American computer scientist and professor at Massachusetts Institute of Technology.

<span class="mw-page-title-main">Shrikanth Narayanan</span> Researcher

Shrikanth Narayanan is an Indian-American Professor at the University of Southern California. He is an interdisciplinary engineer–scientist with a focus on human-centered signal processing and machine intelligence with speech and spoken language processing at its core. A prolific award-winning researcher, educator, and inventor, with hundreds of publications and a number of acclaimed patents to his credit, he has pioneered several research areas including in computational speech science, speech and human language technologies, audio, music and multimedia engineering, human sensing and imaging technologies, emotions research and affective computing, behavioral signal processing, and computational media intelligence. His technical contributions cover a range of applications including in defense, security, health, education, media, and the arts. His contributions continue to impact numerous domains including in human health, national defense/intelligence, and the media arts including in using technologies that facilitate awareness and support of diversity and inclusion. His award-winning patents have contributed to the proliferation of speech technologies on the cloud and on mobile devices and in enabling novel emotion-aware artificial intelligence technologies.

<span class="mw-page-title-main">Pascale Fung</span> Professor

Pascale Fung (馮雁) is a professor in the Department of Electronic & Computer Engineering and the Department of Computer Science & Engineering at the Hong Kong University of Science & Technology(HKUST). She is the director of the Centre for AI Research (CAiRE) at HKUST. She is an elected Fellow of the Institute of Electrical and Electronics Engineers (IEEE) for her “contributions to human-machine interactions”, an elected Fellow of the International Speech Communication Association for “fundamental contributions to the interdisciplinary area of spoken language human-machine interactions” and an elected Fellow of the Association for Computational Linguistics (ACL) for her “significant contributions toward statistical NLP, comparable corpora, and building intelligent systems that can understand and empathize with humans”.

Larry Paul Heck is the Rhesa Screven Farmer, Jr., Advanced Computing Concepts Chair, Georgia Research Alliance Eminent Scholar, Chief Scientist of the AI Hub, Executive Director of the Machine Learning Center, and Professor at the Georgia Institute of Technology. His career spans many of the sub-disciplines of artificial intelligence, including conversational AI, speech recognition and speaker recognition, natural language processing, web search, online advertising and acoustics. He is best known for his role as a co-founder of the Microsoft] Cortana] Personal Assistant and his early work in deep learning] for speech processing.

<span class="mw-page-title-main">Steve Young (software engineer)</span> British researcher (born 1951)

Stephen John Young is a British researcher, Professor of Information Engineering at the University of Cambridge and an entrepreneur. He is one of the pioneers of automated speech recognition and statistical spoken dialogue systems. He served as the Senior Pro-Vice-Chancellor of the University of Cambridge from 2009 to 2015, responsible for planning and resources. From 2015 to 2019, he held a joint appointment between his professorship at Cambridge and Apple, where he was a senior member of the Siri development team.

John Makhoul is a Lebanese-American computer scientist who works in the field of speech and language processing. Dr. Makhoul's work on linear predictive coding was used in the establishment of the Network Voice Protocol, which enabled the transmission of speech signals over the ARPANET. Makhoul is recognized in the field for his vital role in the areas of speech and language processing, including speech analysis, speech coding, speech recognition and speech understanding. He has made a number of significant contributions to the mathematical modeling of speech signals, including his work on linear prediction, and vector quantization. His patented work on the direct application of speech recognition techniques for accurate, language-independent optical character recognition (OCR) has had a dramatic impact on the ability to create OCR systems in multiple languages relatively quickly.

Mari Ostendorf is a professor of electrical engineering in the area of speech and language technology and the vice provost for research at the University of Washington.

Joseph (Yossi) Keshet is an Israeli professor in the Electrical and Computer Engineering Faculty of the Technion.

Jason O. Mars is an American computer scientist, author, and entrepreneur. He is best known for his research into computer architecture and artificial intelligence, particularly in the design and deployment of conversational AI. The best-selling author of Breaking Bots: Inventing a New Voice in the AI Revolution, he has been involved in multiple AI initiatives and startups over the course of his career, including ZeroShotBot, Jaseci, Clinc, Myca, and ImpactfulAI.

Lori Faith Lamel is a speech processing researcher known for her work with the TIMIT corpus of American English speech and for her work on voice activity detection, speaker recognition, and other non-linguistic inferences from speech signals. She works for the French National Centre for Scientific Research (CNRS) as a senior research scientist in the Spoken Language Processing Group of the Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur.

Ramalingam "Rama" Chellappa is a Bloomberg Distinguished Professor, who works at Johns Hopkins University. At Johns Hopkins University, he is a member of the Center for Language and Speech Processing, the Center for Imaging Science, the Institute for Assured Autonomy, and the Mathematical Institute for Data Sciences. He joined Johns Hopkins University after 29 years at The University of Maryland. Before that, he was an assistant, associate professor, and later, director, of the University of Southern California's Signal and Image Processing institute.

Chin-Hui Lee is an information scientist, best known for his work in speech recognition, speaker recognition and acoustic signal processing. He joined Georgia Institute of Technology in 2002 as a professor in the school of electrical and computer engineering

Yang Liu is a Chinese and American computer scientist specializing in speech processing and natural language processing, and a senior principal scientist for Amazon.

References

↑ Wahlster, Wolfgang (2012). "Interview with Dr. Roberto Pieraccini, Director of the International Computer Science Institute (ICSI) at Berkeley, USA". Ki - Künstliche Intelligenz. 26 (3): 289–291. doi:10.1007/s13218-012-0217-0.
↑ Roberto Pieraccini Named New Director of the International Computer Science Institute (ICSI)
↑ Uniphore Appoints Roberto Pieraccini as Chief Scientist
↑ Roberto Pieraccini's publication list
↑ "Roberto Pieraccini's list of publications".
↑ "IEEE 2010 Class of Newly Elevated Fellows". Institute of Electrical and Electronics Engineers (IEEE). Archived from the original on 2013-05-16.
↑ 2009 ISCA Fellows
↑ Pieraccini, Roberto (2012). The voice in the machine : building computers that understand speech. Cambridge, Mass.: MIT Press. ISBN 9780262301534. OCLC 784953196.
↑ Pieraccini, Roberto (2021). AI Assistants. Cambridge, Mass.: MIT Press. ISBN 9780262542555. OCLC 1182021119.
↑ PrimiDieci USA 2016: il sistema delle eccellenze italiane
↑ Pioneer of conversational AI is awarded Honorary Degree Archived 2022-11-02 at the Wayback Machine

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Wahlster, Wolfgang (2012). "Interview with Dr. Roberto Pieraccini, Director of the International Computer Science Institute (ICSI) at Berkeley, USA". Ki - Künstliche Intelligenz. 26 (3): 289–291. doi:10.1007/s13218-012-0217-0.

[2] Roberto Pieraccini Named New Director of the International Computer Science Institute (ICSI)

[3] Uniphore Appoints Roberto Pieraccini as Chief Scientist

[4] Roberto Pieraccini's publication list

[5] "Roberto Pieraccini's list of publications".

[6] "IEEE 2010 Class of Newly Elevated Fellows". Institute of Electrical and Electronics Engineers (IEEE). Archived from the original on 2013-05-16.

[7] 2009 ISCA Fellows

[8] Pieraccini, Roberto (2012). The voice in the machine : building computers that understand speech. Cambridge, Mass.: MIT Press. ISBN 9780262301534. OCLC 784953196.

[9] Pieraccini, Roberto (2021). AI Assistants. Cambridge, Mass.: MIT Press. ISBN 9780262542555. OCLC 1182021119.

[10] PrimiDieci USA 2016: il sistema delle eccellenze italiane

[11] Pioneer of conversational AI is awarded Honorary Degree Archived 2022-11-02 at the Wayback Machine

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

Authority control databases
International	ISNI VIAF WorldCat
National	Norway Germany Israel United States Netherlands
Other	IdRef

Roberto Pieraccini

Born	(1955-11-15) November 15, 1955 (age 68) Genova, Italy
Nationality	US, Italian
Scientific career
Fields	Speech recognition, spoken dialog systems, natural language understanding, multimodal interaction

Website	robertopieraccini.com