Heng Ji

Last updated
Heng Ji
Born
China
Alma mater New York University (PhD and MSc in Computer Science)
Tsinghua University (M.A. and B.A. in Computational Linguistics)
Known for Information Extraction
Natural Language Processing
Scientific career
Fields Computer Science
Institutions University of Illinois at Urbana-Champaign
Amazon
Rensselaer Polytechnic Institute
City University of New York
Thesis Improving Information Extraction and Translation Using Component Interactions  (2008)
Doctoral advisor Ralph Grishman
Website Personal website

Heng Ji is a computer scientist who works on information extraction and natural language processing. She is well known for her work on joined named entity recognition and relation extraction, [1] as well as for her work on cross-document event extraction. [2] She has been coordinating the popular NIST TAC Knowledge Base Population task since 2010. [3] She has been recognised as one of AI's 10 to watch by IEEE Intelligent Systems in 2013, [4] and has won multiple awards, including a NSF Career Award in 2009, [5] Google Research awards in 2009 and 2014, [6] and an IBM Watson Faculty Award in 2012. [7]

Contents

Education

Heng Ji obtained a Bachelor's and master's degree in Computational Linguistics from Tsinghua University. She subsequently obtained a MSc, then PhD in Computer Science from New York University in 2008 under the supervision of Ralph Grishman. Her PhD thesis was on the topic of information extraction, with a particular focus on joint training of multiple components in the information extraction pipeline, as well as cross-lingual learning. [8]

Career

Upon graduating with a PhD from New York University, Ji took up a position as assistant professor at Queens College, City University of New York, where she founded the BLENDER Lab, [9] which focuses on research on cross-lingual, cross-documents, cross-media information extraction and fusion. In 2013, she joined Rensselaer Polytechnic Institute as an Edward P. Hamilton Development Chair and Tenured associate professor in Computer Science. [10] Since 2019, she has been a full professor at the University of Illinois at Urbana–Champaign, [11] as well as an Amazon Scholar.

Research

Heng Ji works in the area of natural language processing, machine learning and information extraction. She has published over 300 peer-reviewed research papers. [12] Her work is published in the proceedings of computer science conferences, including the Annual Meeting of the Association for Computational Linguistics, The Web Conference, and the ACM Conference on Knowledge Discovery and Data Mining (KDD). Ji is a leading researcher in information extraction, having coordinated the popular NIST TAC Knowledge Base Population shared task since 2010. [3] She is most recognised for her work on modelling interactions between subtasks in information extraction, [1] which was also the topic of her PhD thesis, [8] and for her work on event detection using cross-document signals. [2]

Selected honors and distinctions

Related Research Articles

Johanna Doris Moore FRSE is a computational linguist and cognitive scientist. Her research publications include contributions to natural language generation, spoken dialogue systems, computational models of discourse, intelligent tutoring and training systems, human-computer interaction, user modeling, and knowledge representation.

The North American Chapter of the Association for Computational Linguistics (NAACL) provides a regional focus for members of the Association for Computational Linguistics (ACL) in North America as well as in Central and South America, organizes annual conferences, promotes cooperation and information exchange among related scientific and professional societies, encourages and facilitates ACL membership by people and institutions in the Americas, and provides a source of information on regional activities for the ACL Executive Committee.

<span class="mw-page-title-main">Hans Uszkoreit</span> German computational linguist

Hans Uszkoreit is a German computational linguist.

Jun'ichi Tsujii is a Japanese computer scientist specializing in natural language processing and text mining, particularly in the field of biology and bioinformatics.

Temporal annotation is the study of how to automatically add semantic information regarding time to natural language documents. It plays a role in natural language processing and computational linguistics.

IBM SystemT is a declarative information extraction system. It was first built in 2005, as a research project at IBM's IBM Almaden Research Center. Its name is partially inspired by System R, a seminal project from the same research center.

<span class="mw-page-title-main">Iryna Gurevych</span> German computer scientist

Iryna Gurevych is a German computer scientist. She is Professor at the Department of Computer Science of the Technical University of Darmstadt and Director of Ubiquitous Knowledge Processing Lab.

Emily M. Bender is an American linguist who is a professor at the University of Washington. She specializes in computational linguistics and natural language processing. She is also the director of the University of Washington's Computational Linguistics Laboratory. She has published several papers on the risks of large language models.

Rada Mihalcea is a professor of computer science and engineering at the University of Michigan. Her research focuses on natural language processing, multimodal processing, and computational social science.

<span class="mw-page-title-main">Paola Velardi</span> Professor of computer science

Paola Velardi is a full professor of computer science at Sapienza University in Rome, Italy. Her research encompasses natural language processing, machine learning, business intelligence and semantic web, web information extraction in particular. Velardi is one of the hundred female scientists included in the database "100esperte.it". This online, open database champions the recognition of top-rated female scientists in Science, Technology, Engineering and Mathematics (STEM) areas.

Lillian Lee is a computer scientist whose research involves natural language processing, sentiment analysis, and computational social science. She is a professor of computer science and information science at Cornell University, and co-editor-in-chief of the journal Transactions of the Association for Computational Linguistics.

Martha (Stone) Palmer is an American computer scientist. She is best known for her work on verb semantics, and for the creation of ontological resources such as PropBank and VerbNet.

Bonnie Jean Dorr is an American computer scientist specializing in natural language processing, machine translation, automatic summarization, social computing, and explainable artificial intelligence. She is a professor and director of the Natural Language Processing Research Laboratory in the Department of Computer & Information Science & Engineering at the University of Florida. Gainesville, Florida She is professor emerita of computer science and linguistics and former dean at the University of Maryland, College Park, former associate director at the Florida Institute for Human and Machine Cognition,, and former president of the Association for Computational Linguistics.

Claire Cardie is an American computer scientist specializing in natural language processing. Since 2006, she has been a professor of computer science and information science at Cornell University, and from 2010 to 2011 she was the first Charles and Barbara Weiss Chair of Information Science at Cornell. Her research interests include coreference resolution and sentiment analysis.

Nissim Francez is an Israeli professor, emeritus in the computer science faculty at the Technion, and former head of computational linguistics laboratory in the faculty.

Mona Talat Diab is a computer science professor and director of Carnegie Mellon University's Language Technologies Institute. Previously, she was a professor at George Washington University and a research scientist with Facebook AI. Her research focuses on natural language processing, computational linguistics, cross lingual/multilingual processing, computational socio-pragmatics, Arabic language processing, and applied machine learning.

Marine Carpuat is a computer scientist who works on machine translation and natural language processing. She is known for her research connecting cross-lingual semantics with machine translation. She has been recognized with a NSF Career Award in 2018, a Google Research award in 2016, and Amazon Faculty Awards in 2016 and 2018.

Ellen Riloff is an American computer scientist currently serving as a professor at the School of Computing at the University of Utah. Her research focuses on Natural Language Processing and Computational Linguistics, specifically information extraction, sentiment analysis, semantic class induction, and bootstrapping methods that learn from unannotated texts.

Ani Nenkova is Principal Scientist at Adobe Research, currently on leave from her position as an Associate Professor of Computer and Information Science at the University of Pennsylvania. Her research focuses on computational linguistics and artificial intelligence, with an emphasis on developing computational methods for analysis of text quality and style, discourse, affect recognition, and summarization.

Candace Lee (Candy) Sidner is an American computer scientist whose research has applied artificial intelligence and natural language processing to problems in personal information management, intelligent user interfaces, and human–robot interaction. She is a research professor of computer science at the Worcester Polytechnic Institute, and a former president of the Association for Computational Linguistics.

References

  1. 1 2 Li, Qi; Ji, Heng (2014-06-01). "Incremental Joint Extraction of Entity Mentions and Relations". Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics (ACL2014). Annual Meeting of the Association for Computational Linguistics. Baltimore, Maryland: Association for Computational Linguistics. pp. 402–412.
  2. 1 2 Ji, Heng; Grishman, Ralph (2008-06-01). "Refining Event Extraction through Cross-Document Inference". Proceedings of ACL-08: HLT. Annual Meeting of the Association for Computational Linguistics. Columbus, Ohio: Association for Computational Linguistics. pp. 254–262.
  3. 1 2 Ji, Heng; Grishman, Ralph; Dang, Hoa Trang; Griffitt, Kira; Ellis, Joe (2010-01-01). "Overview of the TAC 2010 Knowledge Base Population Track". Proceedings of Third Text Analysis Conference (TAC). Text Analysis Conference. National Institute of Standards and Technology. CiteSeerX   10.1.1.357.1854 .
  4. 1 2 Zeng, Daniel (2013-09-16). "AI's 10 to Watch". IEEE Intelligent Systems. 28 (3): 86–96. doi:10.1109/MIS.2013.57. S2CID   18869823.
  5. 1 2 "CAREER: Cross-Document Cross-Lingual Event Extraction and Tracking". National Science Foundation. March 1, 2010. Retrieved February 21, 2021.
  6. 1 2 "Google Faculty Research Awards -- February 2014" (PDF). Google. February 2014. Retrieved February 21, 2021.
  7. 1 2 "IBM Announces Student Winners of Watson Case Competition from Cornell University -- Faculty award winners from Nine Universities Also Announced; Professors to Receive $10,000 Grants for Watson Curriculums". IBM. October 23, 2012. Retrieved February 21, 2021.
  8. 1 2 Ji, Heng (2008). Improving Information Extraction and Translation Using Component Interactions (PDF) (PhD). University of Aberdeen.
  9. "BLENDER Lab" . Retrieved February 21, 2021.
  10. "Natural Language Processing Expert Heng Ji Joins Rensselaer". Rensselaer Polytechnic Institute. 2013. Retrieved February 21, 2021.
  11. "Illinois CS Adds Eight New Faculty, Broadening Expertise in NLP, Security, Robotics, and More". Rensselaer Polytechnic Institute. 2019. Retrieved February 21, 2021.
  12. "Heng Ji - Google Scholar Citations". scholar.google.com. Retrieved 2021-02-21.
  13. "Young Scientists" (PDF). World Economic Forum. 2016. Retrieved February 21, 2021.
  14. "Young Scientists Community" (PDF). World Economic Forum. 2017. Retrieved February 21, 2021.
  15. "Best Paper Awards at ACL 2020". ACL 2020. 2020-07-08. Retrieved February 21, 2021.