I Can Eat Glass

Last updated
The phrase is used as sample text for displaying Chinese fonts in GNOME Font Viewer. This screenshot shows the Simplified Chinese translation of "I can eat glass, it does not hurt me." GNOME Font Viewer "I Can Eat Glass" Chinese Font Sample.png
The phrase is used as sample text for displaying Chinese fonts in GNOME Font Viewer. This screenshot shows the Simplified Chinese translation of "I can eat glass, it does not hurt me."

I Can Eat Glass was a linguistic project documented on the early Web by then-Harvard student Ethan Mollick. [1] The objective was to provide speakers with translations of the phrase "I can eat glass, it does not hurt me" from a wide variety of languages; the phrase was chosen because of its unorthodox nature. [2] Mollick's original page disappeared in or about June 2004, [3] but the phrase has continued as an absurdist example in linguistics. [4] [5] [6] The project is housed on the current website for The Immediate Gratification Players, a student improvisational comedy group of which Mollick was a member and which hosted the original site. [7]

Contents

As Mollick explained, visitors to a foreign country have "an irresistible urge" to say something in that language, and whatever they say (a cited example being along the lines of "Where is the bathroom?") usually marks them as tourists immediately. Saying "I can eat glass, it does not hurt me", however, ensures that the speaker "will be viewed as an insane native, and treated with dignity and respect". [8]

The project grew to considerable size since web surfers were invited to submit translations. [9] The phrase was translated into over 150 languages, including some that are fictional or invented, as well as into code from various computer languages. It became an Internet meme. [10]

See also

Related Research Articles

<span class="mw-page-title-main">Latin</span> Indo-European language of the Italic branch

Latin is a classical language belonging to the Italic branch of the Indo-European languages. Classical Latin is considered a dead language as it is no longer used to produce major texts, while Vulgar Latin evolved into the Romance Languages. Latin was originally spoken by the Latins in Latium, the lower Tiber area around Rome, Italy. Through the expansion of the Roman Republic it became the dominant language in the Italian Peninsula and subsequently throughout the Roman Empire. Even after the fall of Western Rome, Latin remained the common language of international communication, science, scholarship and academia in Europe until well into the early 19th century, when regional vernaculars supplanted it in common academic and political usage—including its own descendants, the Romance languages.

A quotation is the repetition of a sentence, phrase, or passage from speech or text that someone has said or written. In oral speech, it is the representation of an utterance that is introduced by a quotative marker, such as a verb of saying. For example: John said: "I saw Mary today". Quotations in oral speech are also signaled by special prosody in addition to quotative markers. In written text, quotations are signaled by quotation marks. Quotations are also used to present well-known statement parts that are explicitly attributed by citation to their original source; such statements are marked with quotation marks.

UTF-8 is a character encoding standard used for electronic communication. Defined by the Unicode Standard, the name is derived from Unicode Transformation Format – 8-bit. Almost every webpage is stored in UTF-8.

<span class="mw-page-title-main">UTF-16</span> Variable-width encoding of Unicode, using one or two 16-bit code units

UTF-16 (16-bit Unicode Transformation Format) is a character encoding capable of encoding all 1,112,064 valid code points of Unicode (in fact this number of code points is dictated by the design of UTF-16). The encoding is variable-length, as code points are encoded with one or two 16-bit code units. UTF-16 arose from an earlier obsolete fixed-width 16-bit encoding now known as "UCS-2" (for 2-byte Universal Character Set), once it became clear that more than 216 (65,536) code points were needed, including most emoji and important CJK characters such as for personal and place names.

An idiom is a phrase or expression that largely or exclusively carries a figurative or non-literal meaning, rather than making any literal sense. Categorized as formulaic language, an idiomatic expression's meaning is different from the literal meanings of each word inside it. Idioms occur frequently in all languages; in English alone there are an estimated twenty-five thousand idiomatic expressions. Some well known idioms in English are spill the beans, it's raining cats and dogs, and break a leg.

In grammar, an article is any member of a class of dedicated words that are used with noun phrases to mark the identifiability of the referents of the noun phrases. The category of articles constitutes a part of speech.

Greeklish, a portmanteau of the words Greek and English, also known as Grenglish, Latinoellinika/Λατινοελληνικά or ASCII Greek, is the Greek language written using the Latin script. Unlike standardized systems of Romanization of Greek, as used internationally for purposes such as rendering Greek proper names or place names, or for bibliographic purposes, the term Greeklish mainly refers to informal, ad-hoc practices of writing Greek text in environments where the use of the Greek alphabet is technically impossible or cumbersome, especially in electronic media. Greeklish was commonly used on the Internet when Greek people communicate by forum, e-mail, IRC, instant messaging and occasionally on SMS, mainly because older operating systems did not support non-Latin writing systems, or in a unicode form like UTF-8. Nowadays most Greek language content appears in the Greek alphabet.

<span class="mw-page-title-main">Spanglish</span> Hybrid language of Spanish and English

Spanglish is any language variety that results from conversationally combining Spanish and English. The term is mostly used in the United States and refers to a blend of the words and grammar of the two languages. More narrowly, Spanglish can specifically mean a variety of Spanish with heavy use of English loanwords.

<span class="mw-page-title-main">Gullah language</span> Creole language of southern US

Gullah is a creole language spoken by the Gullah people, an African American population living in coastal regions of South Carolina and Georgia as well as extreme northeastern Florida and the extreme southeast of North Carolina.

<span class="mw-page-title-main">Chinglish</span> English that is influenced by a Chinese language

Chinglish is slang for spoken or written English language that is either influenced by a Chinese language, or is poorly translated. In Hong Kong, Macau, Guangdong and Guangxi, the term "Chinglish" refers mainly to Cantonese-influenced English. This term is commonly applied to ungrammatical or nonsensical English in Chinese contexts, and may have pejorative or deprecating connotations. Other terms used to describe the phenomenon include "Chinese English", "China English", "Engrish" and "Sinicized English". The degree to which a Chinese variety of English exists or can be considered legitimate is still up for debate.

You can't have your cake and eat it (too) is a popular English idiomatic proverb or figure of speech. The proverb literally means "you cannot simultaneously retain possession of a cake and eat it, too". Once the cake is eaten, it is gone. It can be used to say that one cannot have two incompatible things, or that one should not try to have more than is reasonable. The proverb's meaning is similar to the phrases "you can't have it both ways" and "you can't have the best of both worlds."

<span class="mw-page-title-main">Telephone game</span> Childrens game

Telephone, or Chinese whispers, is an internationally popular children's game in which messages are whispered from person to person and then the original and final messages are compared. This sequential modification of information is called transmission chaining in the context of cultural evolution research, and is primarily used to identify the type of information that is more easily passed on from one person to another.

<span class="mw-page-title-main">Tiki Wiki CMS Groupware</span> Content management software

Tiki Wiki CMS Groupware or simply Tiki, originally known as TikiWiki, is a free and open source Wiki-based content management system and online office suite written primarily in PHP and distributed under the GNU Lesser General Public License (LGPL-2.1-only) license. In addition to enabling websites and portals on the internet and on intranets and extranets, Tiki contains a number of collaboration features allowing it to operate as a Geospatial Content Management System (GeoCMS) and Groupware web application.

<span class="mw-page-title-main">English-speaking world</span> Countries and regions where English is used

The English-speaking world comprises the 88 countries and territories in which English is an official, administrative, or cultural language. In the early 2000s, between one and two billion people spoke English, making it the largest language by number of speakers, the third largest language by number of native speakers and the most widespread language geographically. The countries in which English is the native language of most people are sometimes termed the Anglosphere. Speakers of English are called Anglophones.

International Components for Unicode (ICU) is an open-source project of mature C/C++ and Java libraries for Unicode support, software internationalization, and software globalization. ICU is widely portable to many operating systems and environments. It gives applications the same results on all platforms and between C, C++, and Java software. The ICU project is a technical committee of the Unicode Consortium and sponsored, supported, and used by IBM and many other companies. ICU has been included as a standard component with Microsoft Windows since Windows 10 version 1703.

Philosophy of language investigates the nature of language and the relations between language, language users, and the world. Investigations may include inquiry into the nature of meaning, intentionality, reference, the constitution of sentences, concepts, learning, and thought.

<span class="mw-page-title-main">English language</span> West Germanic language

English is a West Germanic language in the Indo-European language family, whose speakers, called Anglophones, originated in early medieval England on the island of Great Britain. The namesake of the language is the Angles, one of the ancient Germanic peoples that migrated to Britain. It is the most spoken language in the world, primarily due to the global influences of the former British Empire and the United States. English is the third-most spoken native language, after Standard Chinese and Spanish; it is also the most widely learned second language in the world, with more second-language speakers than native speakers.

Tibetan grammar describes the morphology, syntax and other grammatical features of Lhasa Tibetan, a Sino-Tibetan language. Lhasa Tibetan is typologically an ergative–absolutive language. Nouns are generally unmarked for grammatical number, but are marked for case. Adjectives are never marked and appear after the noun. Demonstratives also come after the noun but these are marked for number. Verbs are possibly the most complicated part of Tibetan grammar in terms of morphology. The dialect described here is the colloquial language of Central Tibet, especially Lhasa and the surrounding area, but the spelling used reflects classical Tibetan, not the colloquial pronunciation.

Yes and no, or similar word pairs, are expressions of the affirmative and the negative, respectively, in several languages, including English. Some languages make a distinction between answers to affirmative versus negative questions and may have three-form or four-form systems. English originally used a four-form system up to and including Early Middle English. Modern English uses a two-form system consisting of yes and no. It exists in many facets of communication, such as: eye blink communication, head movements, Morse code, and sign language. Some languages, such as Latin, do not have yes-no word systems.

Linguistics is the scientific study of language. The areas of linguistic analysis are syntax, semantics (meaning), morphology, phonetics, phonology, and pragmatics. Subdisciplines such as biolinguistics and psycholinguistics bridge many of these divisions.

References

  1. Finegan, Edward (2004). Language: its structure and use. Thomson Wadsworth. p. 254. ISBN   9780838407943 . Retrieved 22 January 2013.
  2. Pollatsek, Alexander; Treiman, Rebecca, eds. (2015). The Oxford handbook of reading. Oxford library of psychology. Oxford New York Auckland: Oxford University Press. pp. 10–11. ISBN   978-0-19-932457-6.
  3. "UTF-8 Sampler". Archived from the original on 2021-02-24. Retrieved 2016-07-01.
  4. "I Can Eat Glass without Hurting Myself". Fifty Words for Snow. Archived from the original on 25 January 2022. Retrieved 28 August 2023.
  5. "i can eat glass". mw.lojban.org. 30 June 2014. Archived from the original on 28 August 2023.
  6. Blyth, Carl (2000). Untangling the Web. Wiley. pp. 48–49. ISBN   9780471392477.
  7. "I Can Eat Glass". IGP. Archived from the original on 2023-08-28. Retrieved 2023-08-28.
  8. Bernstein, Peter, ed. (September 8, 1997). The Practical Guide to Practically Everything. New York: Random House. p. 564. ISBN   9780375750298.
  9. Blyth, Carl S. (2000). Untangling the Web: Nonce's Guide to Language and Culture on the Internet. John Wiley & Sons. p. 48. ISBN   9780471392477 . Retrieved 22 January 2013.
  10. Wooten, Adam (21 October 2011). "International Business: Potty language: Safely navigating international water closets". Deseret News . Archived from the original on 3 March 2016. Retrieved 22 January 2013.