Four-character idiom

Last updated

Four-character idiom may refer to:

Chengyu Chinese "set phrases"; proverbs that carry conventional wisdom

Chengyu are a type of traditional Chinese idiomatic expression, most of which consist of four characters. Chengyu were widely used in Classical Chinese and are still common in vernacular Chinese writing and in the spoken language today. According to the most stringent definition, there are about 5,000 chéngyǔ in the Chinese language, though some dictionaries list over 20,000. Chéngyǔ are considered the collected wisdom of the Chinese culture, and contain the experiences, moral concepts, and admonishments from previous generations of Chinese. Nowadays, chéngyǔ still plays an important role in Chinese conversations and education.

In Korean, sajaseong-eo are four-character idioms, the analog of Chinese chengyu and Japanese yojijukugo, and generally but not always of Chinese origin. They have analogous categorization to the analogs in other languages, such as gosaseong-eo for historical idioms.

Yojijukugo

Yojijukugo is a Japanese lexeme consisting of four kanji. English translations of yojijukugo include "four-character compound", "four-character idiom", "four-character idiomatic phrase", and "four-character idiomatic compound". It is equivalent to the Chinese chengyu.

Related Research Articles

Grammatical conjugation the creation of derived forms of a verb from its principal parts by inflection

In linguistics, conjugation is the creation of derived forms of a verb from its principal parts by inflection. Verbs may inflect for grammatical categories such as person, number, gender, tense, aspect, mood, voice, case, possession, definiteness, politeness, causativity, clusivity, interrogativity, transitivity, valency, polarity, telicity, volition, mirativity, evidentiality, animacy, associativity, pluractionality, and reciprocity. Verbs may also be affected by agreement, polypersonal agreement, incorporation, noun class, noun classifiers, and verb classifiers. Agglutinative and polysynthetic languages tend to have the most complex conjugations albeit some fusional languages such as Archi can also have extremely complex conjugation. Typically the principal parts are the root and/or several modifications of it (stems). All the different forms of the same verb constitute a lexeme, and the canonical form of the verb that is conventionally used to represent that lexeme is called a lemma.

A lexicon, word-hoard, wordbook, or word-stock is the vocabulary of a person, language, or branch of knowledge. In linguistics, a lexicon is a language's inventory of lexemes. The word "lexicon" derives from the Greek λεξικόν (lexicon), neuter of λεξικός (lexikos) meaning "of or for words."

A lexeme is a unit of lexical meaning that exists regardless of the number of inflectional endings it may have or the number of words it may contain. It is a basic abstract unit of meaning. Put more technically, a lexeme is an abstract unit of morphological analysis in linguistics, that roughly corresponds to a set of forms taken by a single word. For example, in English, run, runs, ran and running are forms of the same lexeme, which can be represented as RUN. A related concept is the lemma, which is a particular form of a lexeme that is chosen by convention to represent a canonical form of a lexeme. Lemmas, being a subset of lexemes, are likewise used in dictionaries as the headwords, and other forms of a lexeme are often listed later in the entry if they are not common conjugations of that word.

In linguistics, morphology is the study of words, how they are formed, and their relationship to other words in the same language. It analyzes the structure of words and parts of words, such as stems, root words, prefixes, and suffixes. Morphology also looks at parts of speech, intonation and stress, and the ways context can change a word's pronunciation and meaning. Morphology differs from morphological typology, which is the classification of languages based on their use of words, and lexicology, which is the study of words and how they make up a language's vocabulary.

In computer science, lexical analysis, lexing or tokenization is the process of converting a sequence of characters into a sequence of tokens. A program that performs lexical analysis may be termed a lexer, tokenizer, or scanner, though scanner is also a term for the first stage of a lexer. A lexer is generally combined with a parser, which together analyze the syntax of programming languages, web pages, and so forth.

Lexical may refer to:

In linguistics, a compound is a lexeme that consists of more than one stem. Compounding, composition or nominal composition is the process of word formation that creates compound lexemes. That is, in familiar terms, compounding occurs when two or more words or signs are joined to make one longer word or sign. The meaning of the compound may be similar to or different from the meaning of its components in isolation. The component stems of a compound may be of the same part of speech—as in the case of the English word footpath, composed of the two nouns foot and path—or they may belong to different parts of speech, as in the case of the English word blackbird, composed of the adjective black and the noun bird. With very few exceptions, English compound words are stressed on their first component stem.

Tokelauan is a Polynesian language spoken in Tokelau and on Swains Island in American Samoa. It is closely related to Tuvaluan and distantly related to Samoan and other Polynesian languages. Tokelauan has a co-official status with English in Tokelau. There are approximately 4,260 speakers of Tokelauan, of whom 2,100 live in New Zealand, 1,400 in Tokelau, and 17 in Swains Island. "Tokelau" means "north-northeast".

In linguistics, a grammatical construction is any syntactic string of words ranging from sentences over phrasal structures to certain complex lexemes, such as phrasal verbs.

Literal translation, direct translation, or word-for-word translation is the rendering of text from one language to another one word at a time with or without conveying the sense of the original whole.

In morphology and lexicography, a lemma is the canonical form, dictionary form, or citation form of a set of words (headword). In English, for example, run, runs, ran and running are forms of the same lexeme, with run as the lemma. Lexeme, in this context, refers to the set of all the forms that have the same meaning, and lemma refers to the particular form that is chosen by convention to represent the lexeme. In lexicography, this unit is usually also the citation form or headword by which it is indexed. Lemmas have special significance in highly inflected languages such as Arabic, Turkish and Russian. The process of determining the lemma for a given word is called lemmatisation. The lemma can be viewed as the chief of the principal parts, although lemmatisation is at least partly arbitrary.

Malayic languages language family

The Malayic languages are a branch of the Austronesian family. They include Malay, the national language of Brunei, Indonesia, and Malaysia; Minangkabau in central Sumatra; and Iban in northern Borneo.

Lampung is the language of the Indonesian province of Lampung at the southern tip of Sumatra. It is a dialect cluster with two main dialects, perhaps distinct enough to be considered distinct languages: Abung/Pepadun and Pesisir/Say Batin. A third, Komering, is sometimes considered part of Lampung Api, by others a distinct language. Lampung Api is the prestige variety.

The Pronunciation Lexicon Specification (PLS) is a W3C Recommendation, which is designed to enable interoperable specification of pronunciation information for both speech recognition and speech synthesis engines within voice browsing applications. The language is intended to be easy to use by developers while supporting the accurate specification of pronunciation information for international use.

Meaning-text theory (MTT) is a theoretical linguistic framework, first put forward in Moscow by Aleksandr Žolkovskij and Igor Mel’čuk, for the construction of models of natural language. The theory provides a large and elaborate basis for linguistic description and, due to its formal character, lends itself particularly well to computer applications, including machine translation, phraseology, and lexicography. General overviews of the theory can be found in Mel’čuk (1981) and (1988).

Combined word may refer to:

Ukaan is a poorly described Niger–Congo language or dialect cluster of uncertain affiliation. Roger Blench suspects, based on wordlists, that it may be closest to the (East) Benue–Congo languages. Blench (2012) states that "noun-classes and concord make it look Benue-Congo, but evidence is weak."

In psycholinguistics, a lemma is an abstract conceptual form of a word that has been mentally selected for utterance in the early stages of speech production. A lemma represents a specific meaning but does not have any specific sounds that are attached to it.

Wikidata Free knowledge database project

Wikidata is a collaboratively edited knowledge base hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia can use, and anyone else, under a public domain license. This is similar to the way Wikimedia Commons provides storage for media files and access to those files for all Wikimedia projects, and which are also freely available for reuse. Wikidata is powered by the software Wikibase.