Language and Communication Technologies

Last updated

Language and Communication Technologies (LCT; also known as human language technologies or language technology for short) is the scientific study of technologies that explore language and communication. It is an interdisciplinary field that encompasses the fields of computer science, linguistics and cognitive science.

Language technology, often called human language technology (HLT), studies methods of how computer programs or electronic devices can analyze, produce, modify or respond to human texts and speech. It consists of natural language processing (NLP) and computational linguistics (CL) on the one hand, and speech technology on the other. It also includes many application oriented aspects of these. Working with language technology often requires broad knowledge not only about linguistics but also about computer science.

Computer science Study of the theoretical foundations of information and computation

Computer science is the study of processes that interact with data and that can be represented as data in the form of programs. It enables the use of algorithms to manipulate, store, and communicate digital information. A computer scientist studies the theory of computation and the practice of designing software systems.

Linguistics is the scientific study of language. It involves analysing language form, language meaning, and language in context. The earliest activities in the documentation and description of language have been attributed to the 6th-century-BC Indian grammarian Pāṇini who wrote a formal description of the Sanskrit language in his Aṣṭādhyāyī.

Contents

History

One of the first problems to be studied in the 1950s, shortly after the invention of computers, was an LCT problem, namely the translation of human languages. The large amounts of funding poured into machine translation testifies to the perceived importance of the field, right from the beginning. It was also in this period that scholars started to develop theories of language and communication based on scientific methods. In the case of language, it was Noam Chomsky who refines the goal of linguistics as a quest for a formal description of language, [1] whilst Claude Shannon and Warren Weaver provided a mathematical theory that linked communication with information. [2]

Machine translation, sometimes referred to by the abbreviation MT is a sub-field of computational linguistics that investigates the use of software to translate text or speech from one language to another.

Noam Chomsky American linguist, philosopher and activist

Avram Noam Chomsky is an American linguist, philosopher, cognitive scientist, historian, social critic, and political activist. Sometimes called "the father of modern linguistics", Chomsky is also a major figure in analytic philosophy and one of the founders of the field of cognitive science. He holds a joint appointment as Institute Professor Emeritus at the Massachusetts Institute of Technology (MIT) and laureate professor at the University of Arizona, and is the author of more than 100 books on topics such as linguistics, war, politics, and mass media. Ideologically, he aligns with anarcho-syndicalism and libertarian socialism.

Claude Shannon American mathematician and information theorist (1916-2001)

Claude Elwood Shannon was an American mathematician, electrical engineer, and cryptographer known as "the father of information theory". Shannon is noted for having founded information theory with a landmark paper, "A Mathematical Theory of Communication", that he published in 1948.

Computers and related technologies have provided a physical and conceptual framework within which scientific studies concerning the notion of communication within a computational framework could be pursued. Indeed, this framework has been fruitful on a number of levels. For a start, it has given birth to a new discipline, known as natural language processing (NLP), or computational linguistics (CL). This discipline studies, from a computational perspective, all levels of language from the production of speech to the meanings of texts and dialogues. And over the past 40 years, NLP has produced an impressive computational infrastructure of resources, techniques, and tools for analyzing sound structure (phonology), word structure (morphology), grammatical structure (syntax) and meaning structure (semantics). As well as being important for language-based applications, this computational infrastructure makes it possible to investigate the structure of human language and communication at a deeper scientific level than was ever previously possible.

Natural language processing field of computer science and linguistics

Not to be confused with Non-linear programming

Computational linguistics is an interdisciplinary field concerned with the statistical or rule-based modeling of natural language from a computational perspective, as well as the study of appropriate computational approaches to linguistic questions.

Phonology is a branch of linguistics concerned with the systematic organization of sounds in spoken languages and signs in sign languages. It used to be only the study of the systems of phonemes in spoken languages, but it may also cover any linguistic analysis either at a level beneath the word or at all levels of language where sound or signs are structured to convey linguistic meaning.

Moreover, NLP fits in naturally with other branches of computer science, and in particular, with artificial intelligence (AI). [3] From an AI perspective, language use is regarded as a manifestation of intelligent behaviour by an active agent. The emphasis in AI-based approaches to language and communication is on the computational infrastructure required to integrate linguistic performance into a general theory of intelligent agents that includes, for example, learning generalizations on the basis of particular experience, the ability to plan and reason about intentionally produced utterances, the design of utterances that will fulfill a particular set of goals. Such work tends to be highly interdisciplinary in nature, as it needs to draw on ideas from such fields as linguistics, cognitive psychology, and sociology. LCT draws on and incorporates knowledge and research from all these fields.

In computer science, artificial intelligence (AI), sometimes called machine intelligence, is intelligence demonstrated by machines, in contrast to the natural intelligence displayed by humans. Colloquially, the term "artificial intelligence" is often used to describe machines that mimic "cognitive" functions that humans associate with the human mind, such as "learning" and "problem solving".

Cognitive psychology is the scientific study of mental processes such as "attention, language use, memory, perception, problem solving, creativity, and thinking". Much of the work derived from cognitive psychology has been integrated into various other modern disciplines such as Cognitive Science and of psychological study, including educational psychology, social psychology, personality psychology, abnormal psychology, developmental psychology, linguistics, and economics.

Sociology Scientific study of human society and its origins, development, organizations, and institutions

Sociology is the study of society, patterns of social relationships, social interaction and culture of everyday life. It is a social science that uses various methods of empirical investigation and critical analysis to develop a body of knowledge about social order, acceptance, and change or social evolution. Sociology is also defined as the general science of society. While some sociologists conduct research that may be applied directly to social policy and welfare, others focus primarily on refining the theoretical understanding of social processes. Subject matter ranges from the micro-sociology level of individual agency and interaction to the macro level of systems and the social structure.

Today

Language and communication are so fundamental to human activity that it is not at all surprising to find that Language and Communication Technologies affect all major areas of society, including health, education, finance, commerce, and travel. Modern LCT is based on a dual tradition of symbols and statistics. This means that nowadays research on language requires access to large databases of information about words and their properties, to large scale computational grammars, to computational tools for working with all levels of language, and to efficient inference systems for performing reasoning. By working computationally it is possible to get to grips with the deeper structure of natural languages, and in particular, to model the crucial interactions between the various levels of language and other cognitive faculties.

Relevant areas of research in LCT include:

Information retrieval (IR) is the activity of obtaining information system resources that are relevant to an information need from a collection of those resources. Searches can be based on full-text or other content-based indexing. Information retrieval is the science of searching for information in a document, searching for documents themselves, and also searching for the metadata that describes data, and for databases of texts, images or sounds.

In linguistics, formal semantics seeks to understand linguistic meaning by constructing precise mathematical models of the principles that speakers use to define relations between expressions in a natural language and the world that supports meaningful discourse. The mathematical tools used are the confluence of formal logic and formal language theory, especially typed lambda calculi.

Knowledge representation and reasoning is the field of artificial intelligence (AI) dedicated to representing information about the world in a form that a computer system can utilize to solve complex tasks such as diagnosing a medical condition or having a dialog in a natural language. Knowledge representation incorporates findings from psychology about how humans solve problems and represent knowledge in order to design formalisms that will make complex systems easier to design and build. Knowledge representation and reasoning also incorporates findings from logic to automate various kinds of reasoning, such as the application of rules or the relations of sets and subsets.

Educational programs

The increasing interest in the field is proved by the existence of several European Masters in this dynamic research area: [4] Degree programmes of the University of Groningen include Language and Communication Technologies.

Erasmus Mundus Masters:

Related Research Articles

Cognitive science interdisciplinary scientific study of the mind and its processes

Cognitive science is the interdisciplinary, scientific study of the mind and its processes. It examines the nature, the tasks, and the functions of cognition. Cognitive scientists study intelligence and behavior, with a focus on how nervous systems represent, process, and transform information. Mental faculties of concern to cognitive scientists include language, perception, memory, attention, reasoning, and emotion; to understand these faculties, cognitive scientists borrow from fields such as linguistics, psychology, artificial intelligence, philosophy, neuroscience, and anthropology. The typical analysis of cognitive science spans many levels of organization, from learning and decision to logic and planning; from neural circuitry to modular brain organization. The fundamental concept of cognitive science is that "thinking can best be understood in terms of representational structures in the mind and computational procedures that operate on those structures."

The following outline is provided as an overview of and topical guide to linguistics:

Natural-language understanding (NLU) or natural-language interpretation (NLI) is a subtopic of natural-language processing in artificial intelligence that deals with machine reading comprehension. Natural-language understanding is considered an AI-hard problem.

Cognitive science is the scientific study either of mind or of intelligence . Practically every formal introduction to cognitive science stresses that it is a highly interdisciplinary research area in which psychology, neuroscience, linguistics, philosophy, computer science, anthropology, and biology are its principal specialized or applied branches. Therefore, we may distinguish cognitive studies of either human or animal brains, mind and brain

George Armitage Miller American cognitive psychologist

George Armitage Miller was an American psychologist who was one of the founders of the cognitive psychology, and more broadly, of cognitive science. He also contributed to the birth of psycholinguistics. Miller wrote several books and directed the development of WordNet, an online word-linkage database usable by computer programs. He authored the paper, "The Magical Number Seven, Plus or Minus Two," in which he observed that many different experimental findings considered together reveal the presence of an average limit of seven for human short-term memory capacity. This paper is frequently cited by psychologists and in the wider culture. Miller won numerous awards, including the National Medal of Science.

<i>Syntactic Structures</i> book by Noam Chomsky

Syntactic Structures is a major work in linguistics by American linguist Noam Chomsky. It was first published in 1957. It introduced the idea of transformational generative grammar. This approach to syntax was fully formal. At its base, this method uses phrase structure rules. These rules break down sentences into smaller parts. Chomsky then combines these with a new kind of rules called "transformations". This procedure gives rise to different sentence structures. Chomsky aimed to show that this limited set of rules "generates" all and only the grammatical sentences of a given language, which are unlimited in number.

Linguistic competence is the system of linguistic knowledge possessed by native speakers of a language. It is distinguished from linguistic performance, which is the way a language system is used in communication. Noam Chomsky introduced this concept in his elaboration of generative grammar, where it has been widely adopted and competence is the only level of language that is studied.

The Education, Audiovisual and Culture Executive Agency or EACEA is an agency of the European Union located in Brussels, Belgium. It manages parts of the Union's programs in education, culture, and audiovisual fields.

The Copenhagen School, officially the Linguistic Circle of Copenhagen, is a group of scholars dedicated to the study of linguistics. It was founded by Louis Hjelmslev (1899–1965) and Viggo Brøndal (1887–1942). In the mid twentieth century the Copenhagen school was one of the most important centres of linguistic structuralism together with the Geneva School and the Prague School. In the late 20th and early 21st century the Copenhagen school has turned from a purely structural approach to linguistics to a functionalist one, Danish functional grammar, which nonetheless incorporates many insights from the founders.

The European Master on Software Engineering, or European Masters Programme in Software Engineering (EMSE) is a two-year joint Master of Science (Msc) program coordinated by four European universities, funded by the Erasmus+ Programme of the European Union.

<i>Aspects of the Theory of Syntax</i> book by Noam Chomsky

Aspects of the Theory of Syntax is a book on linguistics written by American linguist Noam Chomsky, first published in 1965. In Aspects, Chomsky presented a deeper, more extensive reformulation of transformational generative grammar (TGG), a new kind of syntactic theory that he had introduced in the 1950s with the publication of his first book, Syntactic Structures. Aspects is widely considered to be the foundational document and a proper book-length articulation of Chomskyan theoretical framework of linguistics. It presented Chomsky's epistemological assumptions with a view to establishing linguistic theory-making as a formal discipline comparable to physical sciences, i.e. a domain of inquiry well-defined in its nature and scope. From a philosophical perspective, it directed mainstream linguistic research away from behaviorism, constructivism, empiricism and structuralism and towards mentalism, nativism, rationalism and generativism, respectively, taking as its main object of study the abstract, inner workings of the human mind related to language acquisition and production.

The following outline is provided as an overview of and topical guide to social science:

Meaning is a concept used in psychology as well as in other fields such as philosophy, linguistics, semiotics and sociology.

Digital infinity

Digital infinity is a technical term in theoretical linguistics. Alternative formulations are "discrete infinity" and "the infinite use of finite means". The idea is that all human languages follow a simple logical principle, according to which a limited set of digits—irreducible atomic sound elements—are combined to produce an infinite range of potentially meaningful expressions.

The following outline is provided as an overview of and topical guide to natural language processing:

Barbara J. Grosz American computer scientist

Barbara J. Grosz CorrFRSE is an American computer scientist and Higgins Professor of Natural Sciences at Harvard University. She has made seminal contributions to the fields of natural language processing and multi-agent systems.

The QEM, also known as Erasmus Mundus QEM, is an Erasmus Mundus post-graduate master's degree in economics. The program combines "core economic theory with related quantitative disciplines such as Probability, Statistics, Econometrics, Finance, Actuarial Science, Mathematical Modelling, Computation and Simulation, Experimental Design, and Political Science."

References

  1. Noam Chomsky, Syntactic Structures, London: Mouton, 1957.
  2. A Mathematical Theory of Communication. Cm.bell-labs.com (1998-05-18). Retrieved on 2011-03-21. Archived 1998-01-31 at the Wayback Machine
  3. AITopics / NaturalLanguage. Aaai.org. Retrieved on 2011-03-21. Archived 2008-07-31 at the Wayback Machine
  4. Erasmus Mundus – Action 1 – Erasmus Mundus Masters Courses (EMMCs) | EACEA. Eacea.ec.europa.eu. Retrieved on 2011-03-21.