Dong Yu (computer scientist)

Last updated February 08, 2026

Dong Yu is a Chinese-American computer scientist, AI researcher, and engineer, known for his research in speech recognition, deep learning, and multi-modal artificial intelligence.^[1] He is a Fellow of the Association for Computing Machinery (ACM), the Institute of Electrical and Electronics Engineers (IEEE), and the International Speech Communication Association (ISCA).^[2] Yu currently serves as Distinguished Scientist and Vice General Manager at Tencent AI Lab and Chief Scientist and Vice General Manager at Tencent Cloud AI.^[3]

Early life and education

Yu received a Bachelor of Science in Electrical Engineering from Zhejiang University and a master's degree in Pattern Recognition and Intelligent Control from the Chinese Academy of Sciences.^[2] He subsequently earned a Master of Science in Computer Science from Indiana University at Bloomington and a Ph.D. in Computer Science from the University of Idaho.^[4]

Career

Yu began his professional career at Microsoft Research in Redmond, Washington, in 1998, where he worked in the Speech and Dialog Research Group.^[5]^[6] During his time at Microsoft, he led research on automatic speech recognition, multi-modal AI, and deep learning framework, contributing to products such as Microsoft Cortana, Skype Translator, and Ford Sync.^[7]

In 2017, Yu joined Tencent America, where he holds dual leadership roles at Tencent AI Lab and Tencent Cloud AI.^[8] His work focuses on developing large language models, multi-modal AI systems, and research toward artificial general intelligence (AGI).^[9] He has led teams that created systems for conversational AI, speech and music synthesis, multi-modal interaction, and intelligent web agents.^[7]

He served as Chair of the IEEE Speech and Language Processing Technical Committee and was Technical Program Co-chair for ICASSP 2021.^[10] In addition, he has contributed as an Associate Editor and Senior Area Editor for the IEEE/ACM Transactions on Audio, Speech, and Language Processing and has acted as Guest Editor for multiple IEEE journals and conference special issues focused on deep learning, speech processing, and conversational AI.^[11]

He was also an adjunct professor at Zhejiang University, holds approximately 60 patents, and was among the founders and core contributors to CNTK, an open-source deep learning framework.^[1]

Research and contributions

Yu is known for the application of deep learning to large-vocabulary speech recognition, including the development of context-dependent deep neural networks (CD-DNN-HMM), which significantly improved recognition accuracy and influenced both academia and industry.^[12]

He has also advanced recurrent and convolutional neural networks,^[13] end-to-end deep learning architectures, multi-modal AI, and speech synthesis, contributing to technologies that underpin modern virtual assistants and human-computer interaction systems.^[11] Yu developed the Computational Network Toolkit (CNTK), an open-source deep learning framework, and introduced scalable training methods for multi-GPU systems.^[14]

More recently, his research has focused on multi-modal large language models and AGI, resulting in models such as SongGeneration, AlphaLLM, LiteSearch, WebVoyager, Cognitive Kernel, R-Zero, and WebEvolver, as well as audio front-end and speech synthesis systems used in Tencent products.^[15]

References

1 2 Jing, Meng (2017-05-02). "Tencent appoints global AI expert to head new Seattle lab". South China Morning Post. Retrieved 2025-12-25.
1 2 "Industry Leaders in Signal Processing and Machine Learning: Dong Yu | IEEE Signal Processing Society". signalprocessingsociety.org. Retrieved 2025-12-25.
↑ Jiang, Sijia. "Tencent steps up AI push with research lab in Seattle". Reuters .
↑ Jia, Marlene (2017-06-19). "Meet China's Leading AI Researchers & Innovators". TOPBOTS. Retrieved 2025-12-25.
↑ "ACM Multimedia 2020 - Industrial Invited Talk". 2020.acmmm.org. Archived from the original on 2021-03-10. Retrieved 2025-12-25.
↑ Hughes, Alyssa (2011-08-29). "Speech Recognition Leaps Forward". Microsoft Research. Retrieved 2025-12-25.
1 2 Ali, Abder Rahman. "Deep Learning (Interview With Dong Yu) – Dr. Abder-Rahman Ali". Harvard. Retrieved 2025-12-25.
↑ Nickelsburg, Monica (2017-04-28). "WeChat parent Tencent is opening an A.I. lab in Seattle led by former Microsoft researcher". GeekWire. Retrieved 2025-12-25.
↑ Yang, Yuan. "China's Tencent to open first US-based AI laboratory". Financial Times . Retrieved 2025-12-25.
↑ "Dong Yu". speakers.acm.org. Retrieved 2025-12-25.
1 2 "People of ACM - Dong Yu is a Distinguished Scientist and Vice General Manager at Tencent AI Lab". www.acm.org. Retrieved 2025-12-25.
↑ Linn, Allison (2016-10-18). "Microsoft researchers reach human parity in conversational speech recognition". The AI Blog. Retrieved 2025-12-25.
↑ Abdel-Hamid, Ossama; Mohamed, Abdel-rahman; Jiang, Hui; Deng, Li; Penn, Gerald; Yu, Dong (2014). "Convolutional Neural Networks for Speech Recognition". IEEE/ACM Transactions on Audio, Speech, and Language Processing. 22 (10): 1533–1545. Bibcode:2014ITASL..22.1533A. doi:10.1109/TASLP.2014.2339736. ISSN 2329-9304.
↑ Mayer, Tricia (2015-12-07). "Microsoft Computational Network Toolkit offers most efficient distributed deep learning computational performance". Microsoft Research. Retrieved 2025-12-25.
↑ Wang, Jiaqi; Jiang, Hanqi; Liu, Yiheng; Ma, Chong; Zhang, Xu; Pan, Yi; Liu, Mengyuan; Gu, Peiran; Xia, Sichen; Li, Wenjun; Zhang, Yutong; Wu, Zihao; Liu, Zhengliang; Zhong, Tianyang; Ge, Bao; Zhang, Tuo; Qiang, Ning; Hu, Xintao; Jiang, Xi; Zhang, Xin; Zhang, Wei; Shen, Dinggang; Liu, Tianming; Zhang, Shu (2024). "A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks". arXiv: 2408.01319v1 [cs.AI].

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[:0-1] 1 2 Jing, Meng (2017-05-02). "Tencent appoints global AI expert to head new Seattle lab". South China Morning Post. Retrieved 2025-12-25.

[:1-2] 1 2 "Industry Leaders in Signal Processing and Machine Learning: Dong Yu | IEEE Signal Processing Society". signalprocessingsociety.org. Retrieved 2025-12-25.

[3] Jiang, Sijia. "Tencent steps up AI push with research lab in Seattle". Reuters .

[4] Jia, Marlene (2017-06-19). "Meet China's Leading AI Researchers & Innovators". TOPBOTS. Retrieved 2025-12-25.

[5] "ACM Multimedia 2020 - Industrial Invited Talk". 2020.acmmm.org. Archived from the original on 2021-03-10. Retrieved 2025-12-25.

[6] Hughes, Alyssa (2011-08-29). "Speech Recognition Leaps Forward". Microsoft Research. Retrieved 2025-12-25.

[:2-7] 1 2 Ali, Abder Rahman. "Deep Learning (Interview With Dong Yu) – Dr. Abder-Rahman Ali". Harvard. Retrieved 2025-12-25.

[8] Nickelsburg, Monica (2017-04-28). "WeChat parent Tencent is opening an A.I. lab in Seattle led by former Microsoft researcher". GeekWire. Retrieved 2025-12-25.

[9] Yang, Yuan. "China's Tencent to open first US-based AI laboratory". Financial Times . Retrieved 2025-12-25.

[10] "Dong Yu". speakers.acm.org. Retrieved 2025-12-25.

[:3-11] 1 2 "People of ACM - Dong Yu is a Distinguished Scientist and Vice General Manager at Tencent AI Lab". www.acm.org. Retrieved 2025-12-25.

[12] Linn, Allison (2016-10-18). "Microsoft researchers reach human parity in conversational speech recognition". The AI Blog. Retrieved 2025-12-25.

[13] Abdel-Hamid, Ossama; Mohamed, Abdel-rahman; Jiang, Hui; Deng, Li; Penn, Gerald; Yu, Dong (2014). "Convolutional Neural Networks for Speech Recognition". IEEE/ACM Transactions on Audio, Speech, and Language Processing. 22 (10): 1533–1545. Bibcode:2014ITASL..22.1533A. doi:10.1109/TASLP.2014.2339736. ISSN 2329-9304.

[14] Mayer, Tricia (2015-12-07). "Microsoft Computational Network Toolkit offers most efficient distributed deep learning computational performance". Microsoft Research. Retrieved 2025-12-25.

[15] Wang, Jiaqi; Jiang, Hanqi; Liu, Yiheng; Ma, Chong; Zhang, Xu; Pan, Yi; Liu, Mengyuan; Gu, Peiran; Xia, Sichen; Li, Wenjun; Zhang, Yutong; Wu, Zihao; Liu, Zhengliang; Zhong, Tianyang; Ge, Bao; Zhang, Tuo; Qiang, Ning; Hu, Xintao; Jiang, Xi; Zhang, Xin; Zhang, Wei; Shen, Dinggang; Liu, Tianming; Zhang, Shu (2024). "A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks". arXiv: 2408.01319v1 [cs.AI].

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

Dong Yu (computer scientist)

Contents

Early life and education

Career

Research and contributions

References