SMOG

Last updated

The SMOG grade is a measure of readability that estimates the years of education needed to understand a piece of writing. SMOG is an acronym for "Simple Measure of Gobbledygook".

SMOG is widely used, particularly for checking health messages. [1] [2] The SMOG grade yields a 0.985 correlation with a standard error of 1.5159 grades with the grades of readers who had 100% comprehension of test materials. [3]

The formula for calculating the SMOG grade was developed by G. Harry McLaughlin as a more accurate and more easily calculated substitute for the Gunning fog index and published in 1969. To make calculating a text's readability as simple as possible an approximate formula was also given — count the words of three or more syllables in three 10-sentence samples, estimate the count's square root (from the nearest perfect square), and add 3.

A 2010 study published in the Journal of the Royal College of Physicians of Edinburgh stated that “SMOG should be the preferred measure of readability when evaluating consumer-oriented healthcare material.” The study found that “The Flesch-Kincaid formula significantly underestimated reading difficulty compared with the gold standard SMOG formula.” [4]

Applying SMOG to other languages lacks statistical validity. [5]

Formulae

To calculate SMOG Index

  1. Take three ten-sentence-long samples from the text in question.
  2. In those sentences, count the polysyllables (words of 3 or more syllables).
  3. Calculate using

This version (sometimes called the SMOG Index) is more easily used for mental math:

  1. Count the number of polysyllabic words in three samples of ten sentences each.
  2. Take the square root of the nearest perfect square
  3. Add 3

SMOG conversion tables compiled by Harold C. McGraw are slightly inaccurate because they are based on the approximate formula. Furthermore, tables for texts of fewer than 30 sentences are statistically invalid, because the formula was normed on 30-sentence samples.

Related Research Articles

<span class="mw-page-title-main">Gini coefficient</span> Measure of inequality of a distribution

In economics, the Gini coefficient, also known as the Gini index or Gini ratio, is a measure of statistical dispersion intended to represent the income inequality, the wealth inequality, or the consumption inequality within a nation or a social group. It was developed by Italian statistician and sociologist Corrado Gini.

<span class="mw-page-title-main">Standard deviation</span> In statistics, a measure of variation

In statistics, the standard deviation is a measure of the amount of variation of a random variable expected about its mean. A low standard deviation indicates that the values tend to be close to the mean of the set, while a high standard deviation indicates that the values are spread out over a wider range.

Plain language is writing designed to ensure the reader understands as quickly, easily, and completely as possible. Plain language strives to be easy to read, understand, and use. It avoids verbose, convoluted language and jargon. In many countries, laws mandate that public agencies use plain language to increase access to programs and services. The United Nations Convention on the Rights of Persons with Disabilities includes plain language in its definition of communication.

Readability is the ease with which a reader can understand a written text. The concept exists in both in natural language and programming languages though in different forms. In natural language, the readability of text depends on its content and its presentation. In programming, things such as programmer comments, choice of loop structure, and choice of names can determine the ease with which humans can read computer program code.

<span class="mw-page-title-main">Hemocytometer</span> Medical device used to count cells

The hemocytometer is a counting-chamber device originally designed and usually used for counting blood cells.

<span class="mw-page-title-main">Standard error</span> Statistical property

The standard error (SE) of a statistic is the standard deviation of its sampling distribution or an estimate of that standard deviation. If the statistic is the sample mean, it is called the standard error of the mean (SEM). The standard error is a key ingredient in producing confidence intervals.

In environmental chemistry, the chemical oxygen demand (COD) is an indicative measure of the amount of oxygen that can be consumed by reactions in a measured solution. It is commonly expressed in mass of oxygen consumed over volume of solution which in SI units is milligrams per litre (mg/L). A COD test can be used to easily quantify the amount of organics in water. The most common application of COD is in quantifying the amount of oxidizable pollutants found in surface water or wastewater. COD is useful in terms of water quality by providing a metric to determine the effect an effluent will have on the receiving body, much like biochemical oxygen demand (BOD).

<span class="mw-page-title-main">Flesch–Kincaid readability tests</span> Indicator for the complexity of texts

The Flesch–Kincaid readability tests are readability tests designed to indicate how difficult a passage in English is to understand. There are two tests: the Flesch Reading-Ease, and the Flesch–Kincaid Grade Level. Although they use the same core measures, they have different weighting factors.

A golf handicap is a numerical measure of a golfer's ability, or potential ability, that is used to enable players of different abilities to compete against one another. Better players are those with the lowest handicaps.

In linguistics, the Gunning fog index is a readability test for English writing. The index estimates the years of formal education a person needs to understand the text on the first reading. For instance, a fog index of 12 requires the reading level of a United States high school senior. The test was developed in 1952 by Robert Gunning, an American businessman who had been involved in newspaper and textbook publishing.

<span class="mw-page-title-main">Fry readability formula</span>

The Fry readability formula is a readability metric for English texts, developed by Edward Fry.

The automated readability index (ARI) is a readability test for English texts, designed to gauge the understandability of a text. Like the Flesch–Kincaid grade level, Gunning fog index, SMOG index, Fry readability formula, and Coleman–Liau index, it produces an approximate representation of the US grade level needed to comprehend the text.

The Coleman–Liau index is a readability test designed by Meri Coleman and T. L. Liau to gauge the understandability of a text. Like the Flesch–Kincaid Grade Level, Gunning fog index, SMOG index, and Automated Readability Index, its output approximates the U.S. grade level thought necessary to comprehend the text.

<span class="mw-page-title-main">Normalized difference vegetation index</span> Graphical indicator of remotely sensed live green vegetation

The normalized difference vegetation index (NDVI) is a widely-used metric for quantifying the health and density of vegetation using sensor data. It is calculated from spectrometric data at two specific bands: red and near-infrared. The spectrometric data is usually sourced from remote sensors, such as satellites.

<span class="mw-page-title-main">Moscow Mathematical Papyrus</span> Areaof a square length 6cm Ancient Egyptian mathematical manuscript

The Moscow Mathematical Papyrus, also named the Golenishchev Mathematical Papyrus after its first non-Egyptian owner, Egyptologist Vladimir Golenishchev, is an ancient Egyptian mathematical papyrus containing several problems in arithmetic, geometry, and algebra. Golenishchev bought the papyrus in 1892 or 1893 in Thebes. It later entered the collection of the Pushkin State Museum of Fine Arts in Moscow, where it remains today.

The pavement condition index (PCI) is a numerical index between 0 and 100, which is used to indicate the general condition of a pavement section. The PCI is widely used in transportation civil engineering and asset management, and many municipalities use it to measure the performance of their road infrastructure and their levels of service. It is a statistical measure and requires manual survey of the pavement. This index was originally developed by the United States Army Corps of Engineers as an airfield pavement rating system, but later modified for roadway pavements and standardized by the ASTM. The surveying processes and calculation methods have been documented and standardized by ASTM for both roads and airport pavements:

<span class="mw-page-title-main">Raygor readability estimate</span>

The Raygor estimate graph is a readability metric for English text. It was developed by Alton L. Raygor, who published it in 1977.

Linsear Write is a readability metric for English text, purportedly developed for the United States Air Force to help them calculate the readability of their technical manuals. It is one of many such readability metrics, but is specifically designed to calculate the United States grade level of a text sample based on sentence length and the number of words used that have three or more syllables. It is similar to the Fry readability formula.

The Spache readability formula is a readability test for writing in English, designed by George Spache. It works best on texts that are for children up to fourth grade. For older children, the Dale–Chall readability formula is more appropriate.

The Dale–Chall readability formula is a readability test that provides a numeric gauge of the comprehension difficulty that readers come upon when reading a text. It uses a list of 3000 words that groups of fourth-grade American students could reliably understand, considering any word not on that list to be difficult.

References

  1. Hedman, Amy S. (January 2008). "Using the SMOG formula to revise a health-related document" . American Journal of Health Education. 39 (1): 61–64. doi:10.1080/19325037.2008.10599016. S2CID   72389214 . Retrieved 2009-01-19.
  2. Ley, P.; T. Florio (February 1996). "The use of readability formulas in health care". Psychology, Health & Medicine. 1 (1): 7–28. doi:10.1080/13548509608400003.
  3. McLaughlin, G. Harry (May 1969). "SMOG Grading — a New Readability Formula" (PDF). Journal of Reading. 12 (8): 639–646. Retrieved 2018-04-26.
  4. Fitzsimmons,P.; Michael, B.; Hulley, J.; Scott, G. (2010). "A readability assessment of online Parkinson's disease information". J R Coll Physicians Edinb. 40 (4): 292–6. doi: 10.4997/JRCPE.2010.401 . PMID   21132132.
  5. Contreras, A.; Garcia-alonso, R.; Echenique, M.; Daye-contreras, F. (1999). "The SOL Formulas for Converting SMOG Readability Scores Between Health Education Materials Written in Spanish, English, and French". Journal of Health Communication. 4 (1): 21–29. doi:10.1080/108107399127066. PMID   10977275.