Joseph Mariani

Born: 1 February 1950 (age 74)
Nationality: French
Occupation: Researcher in computer science

Joseph Mariani (born Joseph-Jean Mariani; 1 February 1950) is a French computer science researcher and pioneer in the field of speech processing.

Education and career

After obtaining a Doctor of Engineering degree from Pierre and Marie Curie University in 1977, Joseph Mariani joined the National Center for Scientific Research (CNRS) as a researcher in the Computer Science Laboratory for Mechanics and Engineering Sciences (LIMSI). He headed the Speech Communication group from 1982 to 1985, then spent 1985–1986 in the United States as a visiting researcher at the IBM T.J. Watson Research Center (Yorktown Heights, NY). Back in France, he was in charge of the Human-Machine Communication Department from 1987 to 2001 and was Director of LIMSI from 1989 to 2000. Later, he was named Director of the Department of Information and Communication Technologies at the Ministry of Research. Within the Ministry, he created the Techno-Langue and Techno-Vision programs on the development and evaluation of language and vision technologies.

During this time, he was named President of the European Language Resources Association (ELRA) and was on the boards of several organizations including the ANFr, the IGN, the OST and INRIA. He participated in the creation of many associations and international conferences such as ELSNET, COCOSDA, ESCA/ISCA, ELRA and LREC.

From 2006 through December 2013, he was director of the Institute for Multilingual and Multimedia Information (IMMI), a CNRS Mixed International Unit, part of the Quaero Program, a collaboration between LIMSI, the Karlsruhe Institute of Technology (KIT) and RWTH Aachen University. In February 2016, he was named Emeritus Senior Researcher by the CNRS.

Research areas

Mariani's research activities mainly concern Human-Machine Communication, both spoken and written, within the domain of Natural Language Processing.

Early in his career, he concentrated on automatic speech recognition and signal processing.

In the early 1980s, within the evaluation activities of the NATO RSG-10 working group, Joseph Mariani was already using the term “evaluation paradigm” to denote an open evaluation campaign in which systems are treated as black boxes, measured quantitatively with performance metrics on shared data, and their results then combined and compared, an approach now referred to as a “shared task”. This evaluation paradigm supported the continuous improvement of speech processing and the eventual appearance of voice assistants such as Siri, Cortana, Echo and Google Voice.

He was involved in NIST² becoming, in 1987, the center of automatic speech and text processing evaluation activities in the US. In 1994, with Robert Martin, then Director of the Institut National de la Langue Française (INaLF), he organized the first open francophone evaluation of morphosyntactic analyzers of French text, with the support of two CNRS departments, Humanities and Social Sciences and Engineering Sciences. The same year, he helped launch a language engineering program run by Aupelf-Uref (now AUF, the Francophone University Association) and coordinated by the Francophone Network on Language Engineering (FRANCIL) to strengthen francophone activities in this area. The program included Concerted Research Actions (CRAs), a major initiative built on the evaluation paradigm for text and speech⁴. In the early 2000s, he contributed to a major publication on automatic speech processing: Spoken Language Processing⁵.

Between 2000 and 2010, his activities focused on multilingualism, with the development of language matrices for the 24 languages of the European Union⁶. Later he worked on the publication of the META-NET White Paper Series⁷ in order to establish an inventory of the resources available for French (dictionaries, grammars and programs).

Since 2010, he has worked on the automatic processing of regional languages⁸ and is interested in ethical problems related to the use of computers in daily life.

Since 2013⁹, he has collected and studied articles from the whole field of natural language processing, including speech processing and information retrieval. This work has been carried out within the framework of the NLP4NLP project¹⁰, which began with the ISCA archives, then added those of LREC¹¹, TALN and IEEE, followed by other conferences and journals such as TREC. After this collection phase, which for the first time gathered a major part of the publications in the field, the publications were automatically analyzed from several points of view. First, all of the technical terms were extracted and compiled in a lexicon. Second, each lexical entry was attributed to the author who first used it. This is an innovation¹² in scientific publication.

The goal was to understand the mechanisms that influence the domain and thus to identify current and future trends. This work covered the creation of technical terms and their evolution (appearance, eventual decay and resurgence), as with the term “neural networks”. Another strategy was predictive analysis, which consists of building a statistical representation of the use of technical terms in order to predict their use over the following four years. The study also examined the impact of one conference on another, as well as plagiarism and reuse in scientific publications¹³.

A full synthesis of NLP4NLP was published in 2019 as a two-part article in Frontiers in Research Metrics and Analytics. [1] [2] Then, starting from this analysis of the first 50 years (1965-2015), a follow-up study covered five more years (2016-2020). [3] It identified profound changes in research topics, the emergence of a new generation of authors, and the appearance of new publications around artificial intelligence, neural networks, machine learning, and word embedding.
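The first-use attribution and term-evolution counts described above can be illustrated with a minimal sketch. It is not the NLP4NLP implementation: the toy corpus, the author names, the term list and the function names `first_use` and `yearly_counts` are all hypothetical, and simple substring matching stands in for real term extraction.

```python
from collections import defaultdict

# Hypothetical toy corpus of (year, authors, text); not real NLP4NLP data.
PAPERS = [
    (1989, ["A. Author"], "We train neural networks for phoneme recognition."),
    (1987, ["B. Writer", "C. Scholar"], "Hidden Markov models dominate speech recognition."),
    (1992, ["D. Student"], "Neural networks and hidden Markov models are combined."),
]

# Hypothetical lexicon of technical terms (assumed already extracted).
TERMS = ["neural networks", "hidden markov models", "speech recognition"]


def first_use(papers, terms):
    """Map each term to the (year, authors) of the earliest paper containing it."""
    first = {}
    # Visit papers in chronological order so the first hit is the earliest use.
    for year, authors, text in sorted(papers, key=lambda p: p[0]):
        lowered = text.lower()
        for term in terms:
            if term in lowered and term not in first:
                first[term] = (year, authors)
    return first


def yearly_counts(papers, terms):
    """Count, per term, how many papers mention it in each year."""
    counts = defaultdict(lambda: defaultdict(int))
    for year, _authors, text in papers:
        lowered = text.lower()
        for term in terms:
            if term in lowered:
                counts[term][year] += 1
    return {term: dict(by_year) for term, by_year in counts.items()}


if __name__ == "__main__":
    for term, (year, authors) in first_use(PAPERS, TERMS).items():
        print(term, year, ", ".join(authors))
    print(yearly_counts(PAPERS, TERMS))
```

The per-year counts returned by `yearly_counts` are the kind of series on which a trend model could be fitted for prediction; the sketch stops short of that step because the source does not describe the statistical model used.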

Distinctions

Joseph Mariani was named Knight of the French National Order of Merit (1985) and Officer of the Ordre des Arts et des Lettres (2016). He is an honorary member of the Francophone Association for Speech Communication (AFCP) and a fellow and life member of ISCA, where he received the Special Service Medal in 1999, and he has been honorary president of ELRA since 2010.

Bibliography

Joseph Mariani is an author, coauthor or editor of over 500 publications.

References

  1. Mariani, Joseph; Francopoulo, Gil; Paroubek, Patrick (2019), "The NLP4NLP Corpus (I): 50 Years of Publication Collaboration and Citation in Speech and Language Processing", Frontiers in Research Metrics and Analytics, 3, doi:10.3389/frma.2018.00036
  2. Mariani, Joseph; Francopoulo, Gil; Paroubek, Patrick; Vernier, Frédéric (2019), "The NLP4NLP Corpus (II): 50 Years of Research in Speech and Language Processing", Frontiers in Research Metrics and Analytics, 3, doi:10.3389/frma.2018.00037
  3. Mariani, Joseph; Francopoulo, Gil; Paroubek, Patrick; Vernier, Frédéric (2022), "NLP4NLP+5: The Deep (R)evolution in Speech and Language Processing", Frontiers in Research Metrics and Analytics, 7, doi:10.3389/frma.2022.863126, PMC 9363593
  1. Jean-Sylvain Liénard, Joseph Mariani, Système de reconnaissance de mots isolés: MOISE, Registered Technical Report ANVAR No 50312, June 1980
  2. David Pallett, The NIST Role in Automatic Speech Recognition Benchmark Tests, LREC 1998
  3. Ralph Grishman, Beth Sundheim, Message Understanding Conference-6: A Brief History, COLING 1996
  4. Survey of the State of the Art in Human Language Technology
  5. Spoken Language Processing
  6. Joseph Mariani, Gil Francopoulo, Language Matrices and the Language Resource Impact, in Language Production, Cognition and the Lexicon, edited by Gala, Rapp, Bel-Enguix, Springer
  7. Joseph Mariani, Patrick Paroubek, Gil Francopoulo, Aurélien Max, François Yvon, Pierre Zweigenbaum, META-NET White Paper Series: French, Springer
  8. Joseph Mariani, Technologies de la langue: état des lieux, in Les Technologies pour les langues régionales de France, conference of 19 and 20 February 2015 organized by the DGLFLF
  9. Joseph Mariani, Patrick Paroubek, Gil Francopoulo, Marine Delaborde, Rediscovering 25 Years of Discoveries in Spoken Language Processing: A Preliminary ISCA Archive Analysis
  10. Gil Francopoulo, Joseph Mariani, Patrick Paroubek, NLP4NLP: The Cobbler's Children Won't Go Unshod, D-Lib Magazine: The Magazine of Digital Library Research, November 2015
  11. Joseph Mariani, Patrick Paroubek, Gil Francopoulo, Olivier Hamon, Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis, LREC 2014
  12. Gil Francopoulo, Joseph Mariani, Patrick Paroubek, Text Mining for Notability Computation, LREC 2016, Workshop on Cross-Platform Text-Mining and Natural Language Processing Interoperability
  13. Gil Francopoulo, Joseph Mariani, Patrick Paroubek, A Study of Reuse and Plagiarism in LREC papers, LREC 2016, http://www.lrec-conf.org/proceedings/lrec2016/index.html