Cohort study

Last updated

A cohort study is a particular form of longitudinal study that samples a cohort (a group of people who share a defining characteristic, typically those who experienced a common event in a selected period, such as birth or graduation), performing a cross-section at intervals through time. It is a type of panel study where the individuals in the panel share a common characteristic.

Contents

Cohort studies represent one of the fundamental designs of epidemiology which are used in research in the fields of medicine, pharmacy, nursing, psychology, social science, and in any field reliant on 'difficult to reach' answers that are based on evidence (statistics). In medicine for instance, while clinical trials are used primarily for assessing the safety of newly developed pharmaceuticals before they are approved for sale, epidemiological analysis on how risk factors affect the incidence of diseases is often used to identify the causes of diseases in the first place, and to help provide pre-clinical justification for the plausibility of protective factors (treatments).

Comparison with controlled trials

Cohort studies differ from clinical trials in that no intervention, treatment, or exposure is administered to participants in a cohort design; and no control group is defined. Rather, cohort studies are largely about the life histories of segments of populations and the individual people who constitute these segments. [1] [2] Exposures or protective factors are identified as preexisting characteristics of participants. The study is controlled by including other common characteristics of the cohort in the statistical analysis. Both exposure/treatment and control variables are measured at baseline. Participants are then followed over time to observe the incidence rate of the disease or outcome in question. Regression analysis can then be used to evaluate the extent to which the exposure or treatment variable contributes to the incidence of the disease, while accounting for other variables that may be at play.[ citation needed ]

Double-blind randomized controlled trials (RCTs) are generally considered superior methodology in the hierarchy of evidence in treatment, because they allow for the most control over other variables that could affect the outcome, and the randomization and blinding processes reduce bias in the study design. This minimizes the chance that results will be influenced by confounding variables, particularly ones that are unknown. However, educated hypotheses based on prior research and background knowledge are used to select variables to be included in the regression model for cohort studies, and statistical methods can be used to identify and account for potential confounders from these variables. Bias can also be mitigated in a cohort study when selecting participants for the cohort.

RCTs may not be suitable in all cases; such as when the outcome is a negative health effect and the exposure is hypothesized to be a risk factor for the outcome. Ethical standards, and morality, would prevent the use of risk factors in RCTs. The natural or incidental exposure to these risk factors (e.g. time spent in the sun), or self-administered exposure (e.g. smoking), can be measured without subjecting participants to risk factors outside of their individual lifestyles, habits, and choices.

Types

Cohort studies can be retrospective (looking back in time, thus using existing data such as medical records or claims database) or prospective (requiring the collection of new data). [3] Retrospective cohort studies restrict the investigators' ability to reduce confounding and bias because collected information is restricted to data that already exists. There are advantages to this design, however, as retrospective studies are much cheaper and faster because the data has already been collected and stored.

A cohort is a group of people who share a common characteristic or experience within a defined period (e.g., are currently living, are exposed to a drug or vaccine or pollutant, or undergo a certain medical procedure). Thus a group of people who were born on a day or in a particular period, say 1948, form a birth cohort. The comparison group may be the general population from which the cohort is drawn, or it may be another cohort of persons thought to have had little or no exposure to the substance under investigation, but otherwise similar. Alternatively, subgroups within the cohort may be compared with each other.

Application

In medicine, a cohort study is often undertaken to obtain evidence to try to refute the existence of a suspected association between cause and effect; failure to refute a hypothesis often strengthens confidence in it. Crucially, the cohort is identified before the appearance of the disease under investigation. The study groups follow a group of people who do not have the disease for a period of time and see who develops the disease (new incidence). The cohort cannot therefore be defined as a group of people who already have the disease. Prospective (longitudinal) cohort studies between exposure and disease strongly aid in studying causal associations, though distinguishing true causality usually requires further corroboration from further experimental trials.

The advantage of prospective cohort study data is that it can help determine risk factors for contracting a new disease because it is a longitudinal observation of the individual through time, and the collection of data at regular intervals, so recall error is reduced. However, cohort studies are expensive to conduct, are sensitive to attrition and take a long follow-up time to generate useful data. Nevertheless, the results that are obtained from long-term cohort studies are of substantially superior quality to those obtained from retrospective/cross-sectional studies. Prospective cohort studies are considered to yield the most reliable results in observational epidemiology. They enable a wide range of exposure-disease associations to be studied.

Some cohort studies track groups of children from their birth, and record a wide range of information (exposures) about them. The value of a cohort study depends on the researchers' capacity to stay in touch with all members of the cohort. Some studies have continued for decades.

In a cohort study, the population under investigation consists of individuals who are at risk of developing a specific disease or health outcome. [4]

Examples

An example of an epidemiological question that can be answered using a cohort study is whether exposure to X (say, smoking) associates with outcome Y (say, lung cancer). For example, in 1951, the British Doctors Study was started. Using a cohort which included both smokers (the exposed group) and non-smokers (the unexposed group). The study continued through 2001. By 1956, the study provided convincing proof of the association between smoking and the incidence of lung cancer. In a cohort study, the groups are matched in terms of many other variables such as economic status and other health status so that the variable being assessed, the independent variable (in this case, smoking) can be isolated as the cause of the dependent variable (in this case, lung cancer). In this example, a statistically significant increase in the incidence of lung cancer in the smoking group as compared to the non-smoking group is evidence in favor of the hypothesis. However, rare outcomes, such as lung cancer, are generally not studied with the use of a cohort study, but are rather studied with the use of a case-control study.

Cohort Succession can explain most change in literature, art, intellectualism, political opinions, and phonology. [5]

Shorter term studies are commonly used in medical research as a form of clinical trial, or means to test a particular hypothesis of clinical importance. Such studies typically follow two groups of patients for a period of time and compare an endpoint or outcome measure between the two groups.

Randomized controlled trials, or RCTs, are a superior methodology in the hierarchy of evidence, because they limit the potential for bias by randomly assigning one patient pool to an intervention and another patient pool to non-intervention (or placebo). This minimizes the chance that the incidence of confounding variables will differ between the two groups. Nevertheless, it is sometimes not practical or ethical to perform RCTs to answer a clinical question. To take our example, if we already had reasonable evidence that smoking causes lung cancer then persuading a pool of non-smokers to take up smoking in order to test this hypothesis would generally be considered unethical.

Two examples of cohort studies that have been going on for more than 50 years are the Framingham Heart Study and the National Child Development Study (NCDS), the most widely researched of the British birth cohort studies. Key findings of NCDS and a detailed profile of the study appear in the International Journal of Epidemiology. [6]

The Dunedin Multidisciplinary Health and Development Study has been studying the thousand people born in Dunedin, New Zealand, in 1972–1973. The subjects are interviewed regularly, with Phase (age) 52 starting in 2024.

The largest cohort study in women is the Nurses' Health Study. Starting in 1976, it is tracking over 120,000 nurses and has been analyzed for many different conditions and outcomes.

The largest cohort study in Africa is the Birth to Twenty Study, which began in 1990 and tracks a cohort of over 3,000 children born in the weeks following Nelson Mandela's release from prison.

Other famous examples are the Grant Study tracking a number of Harvard graduates from ca. 1950.77, the Whitehall Study tracking 10,308 British civil servants, and the Caerphilly Heart Disease Study, which since 1979 has studied a representative sample of 2,512 men, drawn from the Welsh town of Caerphilly. [7]

The ASPREE-XT study is designed to determine whether there are long-lasting effects of an average of four–five years of treatment with daily low-dose aspirin, with outcome measures including cancer mortality. [8] As of September 2018, there were 16,703 ASPREE-XT participants in Australia. [9] It has been proposed that the existing ASPREE-XT study could provide a platform for a future multigenerational research study. [10]

Variations

Comparison of timelines of case-control and cohort studies. "OR" stands for "odds ratio" and "RR" stands for "relative risk". ExplainingCaseControlSJW-en.svg
Comparison of timelines of case-control and cohort studies. "OR" stands for "odds ratio" and "RR" stands for "relative risk".

The diagram indicates the starting point and direction of cohort and case-control studies. In case-control studies the analysis proceeds from documented disease and investigations are made to arrive at the possible causes of the disease. In cohort studies the assessment starts with the putative cause of disease, and observations are made of the occurrence of disease relative to the hypothesized causal agent. [11]

Current and historical cohorts

A current cohort study represents a true prospective study where the data concerning exposure are assembled prior to the occurrence of the fact to be studied, for instance a disease. An example of a current cohort study is the Oxford Family Planning Association Study in the United Kingdom, which aimed to provide a balanced view of the beneficial and harmful effects of different methods of contraception. This study has provided a large amount of information on the efficacy and safety of contraceptive methods, and in particular oral contraceptives (OCs), diaphragms and intrauterine device (IUDs). [12]

In a historical cohort study the data concerning exposure and occurrence of a disease, births, a political attitude or any other categorical variable are collected after the events have taken place, and the subjects (those exposed and unexposed to the agent under study) are assembled from existing records or health care registers.

A "prospective cohort" defines the groups before the study is done, while historical studies, which are sometimes referred to as "retrospective cohort", define the grouping after the data is collected. Examples of a retrospective cohort are Long-Term Mortality after Gastric Bypass Surgery [13] and The Lothian Birth Cohort Studies . [14]

Although historical studies are sometimes referred to as retrospective study, it a misnomer as the methodological principles of historical cohort studies and prospective studies are the same. [11]

Nested case-control study

A nested case-control study is a case control nested inside of a cohort study. The procedure begins like a normal cohort study, however, as participants develop the outcome of interest they are selected as cases. Once the cases are identified, controls are selected and matched to each case. The process for selecting and matching cases is identical to a normal case control study. An example of a nested case-control study is Inflammatory markers and the risk of coronary heart disease in men and women, which was a case control analyses extracted from the Framingham Heart Study cohort. [15]

Nested case-controls have the advantage of reducing the number of participants that require details follow up or diagnostic testing to assess outcome or exposure status. However, this will also reduce the power of the study, when compared to larger cohort the study population is drawn from.

Household panel survey

Panel surveys are another important sub-type of longitudinal study. They differ from cohort studies by starting with representative cross-sectional samples, rather than cohorts defined by an event. Household panels draw representative samples of households and survey them, following all individuals through time on a usual annual basis. Examples include the US Panel Study of Income Dynamics (since 1968), the German Socio-Economic Panel (since 1984), the British Household Panel Survey (since 1991) and (since 2009) its successor Understanding Society: the UK Household Longitudinal Study, the Household, Income and Labour Dynamics in Australia Survey (since 2001) and the European Community Household Panel (1994–2001).

Cohort analysis in business

For an example in business analysis, see cohort analysis.

See also

Related Research Articles

<span class="mw-page-title-main">Epidemiology</span> Study of health and disease within a population

Epidemiology is the study and analysis of the distribution, patterns and determinants of health and disease conditions in a defined population.

A case–control study is a type of observational study in which two existing groups differing in outcome are identified and compared on the basis of some supposed causal attribute. Case–control studies are often used to identify factors that may contribute to a medical condition by comparing subjects who have the condition with patients who do not have the condition but are otherwise similar. They require fewer resources but provide less evidence for causal inference than a randomized controlled trial. A case–control study is often used to produce an odds ratio. Some statistical methods make it possible to use a case–control study to also estimate relative risk, risk differences, and other quantities.

<span class="mw-page-title-main">Women's Health Initiative</span> Long-term U.S. health study

The Women's Health Initiative (WHI) was a series of clinical studies initiated by the U.S. National Institutes of Health (NIH) in 1991, to address major health issues causing morbidity and mortality in postmenopausal women. It consisted of three clinical trials (CT) and an observational study (OS). In particular, randomized controlled trials were designed and funded that addressed cardiovascular disease, cancer, and osteoporosis.

<span class="mw-page-title-main">Framingham Heart Study</span> Cardiovascular cohort study

The Framingham Heart Study is a long-term, ongoing cardiovascular cohort study of residents of the city of Framingham, Massachusetts. The study began in 1948 with 5,209 adult subjects from Framingham, and is now on its third generation of participants. Prior to the study almost nothing was known about the epidemiology of hypertensive or arteriosclerotic cardiovascular disease. Much of the now-common knowledge concerning heart disease, such as the effects of diet, exercise, and common medications such as aspirin, is based on this longitudinal study. It is a project of the National Heart, Lung, and Blood Institute, in collaboration with Boston University. Various health professionals from the hospitals and universities of Greater Boston staff the project.

<span class="mw-page-title-main">Cohort (statistics)</span> A group of subjects who share a characteristic, such as generation

In statistics, epidemiology, marketing and demography, a cohort is a group of subjects who share a defining characteristic.

A longitudinal study is a research design that involves repeated observations of the same variables over long periods of time. It is often a type of observational study, although it can also be structured as longitudinal randomized experiment.

In medical research, epidemiology, social science, and biology, a cross-sectional study is a type of observational study that analyzes data from a population, or a representative subset, at a specific point in time—that is, cross-sectional data.

The number needed to treat (NNT) or number needed to treat for an additional beneficial outcome (NNTB) is an epidemiological measure used in communicating the effectiveness of a health-care intervention, typically a treatment with medication. The NNT is the average number of patients who need to be treated to prevent one additional bad outcome. It is defined as the inverse of the absolute risk reduction, and computed as , where is the incidence in the control (unexposed) group, and is the incidence in the treated (exposed) group. This calculation implicitly assumes monotonicity, that is, no individual can be harmed by treatment. The modern approach, based on counterfactual conditionals, relaxes this assumption and yields bounds on NNT.

Clinical study design is the formulation of trials and experiments, as well as observational studies in medical, clinical and other types of research involving human beings. The goal of a clinical study is to assess the safety, efficacy, and / or the mechanism of action of an investigational medicinal product (IMP) or procedure, or new drug or device that is in development, but potentially not yet approved by a health authority. It can also be to investigate a drug, device or procedure that has already been approved but is still in need of further investigation, typically with respect to long-term effects or cost-effectiveness.

A nested case–control (NCC) study is a variation of a case–control study in which cases and controls are drawn from the population in a fully enumerated cohort.

<span class="mw-page-title-main">Confounding</span> Variable or factor in causal inference

In causal inference, a confounder is a variable that influences both the dependent variable and independent variable, causing a spurious association. Confounding is a causal concept, and as such, cannot be described in terms of correlations or associations. The existence of confounders is an important quantitative explanation why correlation does not imply causation. Some notations are explicitly designed to identify the existence, possible existence, or non-existence of confounders in causal relationships between elements of a system.

<span class="mw-page-title-main">Observational study</span> Study with uncontrolled variable of interest

In fields such as epidemiology, social sciences, psychology and statistics, an observational study draws inferences from a sample to a population where the independent variable is not under the control of the researcher because of ethical concerns or logistical constraints. One common observational study is about the possible effect of a treatment on subjects, where the assignment of subjects into a treated group versus a control group is outside the control of the investigator. This is in contrast with experiments, such as randomized controlled trials, where each subject is randomly assigned to a treated group or a control group. Observational studies, for lacking an assignment mechanism, naturally present difficulties for inferential analysis.

<span class="mw-page-title-main">Mendelian randomization</span> Statistical method in genetic epidemiology

In epidemiology, Mendelian randomization is a method using measured variation in genes to examine the causal effect of an exposure on an outcome. Under key assumptions, the design reduces both reverse causation and confounding, which often substantially impede or mislead the interpretation of results from epidemiological studies.

<span class="mw-page-title-main">Retrospective cohort study</span> Historic cohort study

A retrospective cohort study, also called a historic cohort study, is a longitudinal cohort study used in medical and psychological research. A cohort of individuals that share a common exposure factor is compared with another group of equivalent individuals not exposed to that factor, to determine the factor's influence on the incidence of a condition such as disease or death. Retrospective cohort studies have existed for approximately as long as prospective cohort studies.

<span class="mw-page-title-main">Prospective cohort study</span> Longitudinal cohort study

A prospective cohort study is a longitudinal cohort study that follows over time a group of similar individuals (cohorts) who differ with respect to certain factors under study to determine how these factors affect rates of a certain outcome. For example, one might follow a cohort of middle-aged truck drivers who vary in terms of smoking habits to test the hypothesis that the 20-year incidence rate of lung cancer will be highest among heavy smokers, followed by moderate smokers, and then non–smokers.

<span class="mw-page-title-main">Nutritional epidemiology</span> Field of medical research on disease and diet

Nutritional epidemiology examines dietary and nutritional factors in relation to disease occurrence at a population level. Nutritional epidemiology is a relatively new field of medical research that studies the relationship between nutrition and health. It is a young discipline in epidemiology that is continuing to grow in relevance to present-day health concerns. Diet and physical activity are difficult to measure accurately, which may partly explain why nutrition has received less attention than other risk factors for disease in epidemiology.

<span class="mw-page-title-main">Caerphilly Heart Disease Study</span> Medical research project

The Caerphilly Heart Disease Study, also known as the Caerphilly Prospective Study (CaPS), is an epidemiological prospective cohort, set up in 1979 in a representative population sample drawn from Caerphilly, a typical small town in South Wales, UK.

Occupational epidemiology is a subdiscipline of epidemiology that focuses on investigations of workers and the workplace. Occupational epidemiologic studies examine health outcomes among workers, and their potential association with conditions in the workplace including noise, chemicals, heat, or radiation, or work organization such as schedules.

The Strong Heart Study is an ongoing cohort study of cardiovascular disease (CVD) and its risk factors among American Indian men and women. The original cohort began in 1984 with 4,549 participants ages 35–74 from 13 tribal nations and communities in Arizona, Oklahoma, North Dakota, and South Dakota. The need for specific ethnic and cultural understanding and sensitivities was recognized from the onset, so the study has a community-based participatory research (CBPR) model. Community members were involved in all stages of conception, design, and implementation of the research. Now in its seventh phase, the extensive research has led to many important findings about heart disease and unique risk factors in native populations. It is a project funded by the National Heart, Lung, and Blood Institute (NHLBI). The study maintains field centers in Oklahoma, North and South Dakota, and Arizona and a coordinating center at the University of Oklahoma Health Sciences Center.

<span class="mw-page-title-main">Stephanie J. London</span> American physician

Stephanie J. London is an American epidemiologist and physician-scientist specializing in environmental health, respiratory diseases, and genetic susceptibility. She is the deputy chief of the epidemiology branch at the National Institute of Environmental Health Sciences.

References

  1. "Cohort Studies". Web Center for Social Research Methods. Archived from the original on 9 September 2011.
  2. Porta, Miquel, ed. (2008). A Dictionary of Epidemiology (5th ed.). New York: Oxford University Press.
  3. "FAQ: What is a cohort study?". California Department of Public Health, Environmental Health Investigations Branch. 2 June 2001. Archived from the original on 26 July 2011. Retrieved 10 September 2010.
  4. Blumenthal, Ursula J.; Fleisher, Jay M.; Esrey, Steve A.; Peasey, Anne (2001). "Chapter 7: Epidemiology: a tool for the assessment of risk" (PDF). In Fewtrell, Lorna; Bartram, Jamie (eds.). Water Quality: Guidelines, Standards, and Health : Assessment of Risk and Risk Management for Water-related Infectious Disease. IWA Publishing. pp. 143–144.
  5. Underwood, Ted; Kiley, Kevin; Shang, Wenyi; Vaisey, Stephen (2022). "Cohort Succession Explains Most Change in Literary Culture". Sociological Science. 9: 184–205. doi: 10.15195/v9.a8 . Retrieved 11 May 2022.
  6. Power C, Elliott J (2006). "Cohort profile: 1958 British Cohort Study". International Journal of Epidemiology. 35 (1): 34–41. doi: 10.1093/ije/dyi183 . PMID   16155052.
  7. "Caerphilly and Speedwell collaborative heart disease studies. The Caerphilly and Speedwell Collaborative Group". Journal of Epidemiology and Community Health. 38 (3): 259–262. 1984. doi:10.1136/jech.38.3.259. PMC   1052363 . PMID   6332166..
  8. "ASPREE Australia | ABOUT ASPREE-XT - ASPREE Australia". 21 July 2019. Archived from the original on 21 July 2019. Retrieved 21 July 2019.
  9. "Daily low-dose aspirin found to have no effect on healthy life span in older people". National Institutes of Health News Releases. 16 September 2018. Retrieved 22 March 2021.
  10. Nunn, Jack S.; Sulovski, Merrin; Tiller, Jane; Holloway, Bruce; Ayton, Darshini; Lacaze, Paul (3 May 2021). "Involving elderly research participants in the co-design of a future multi-generational cohort study". Research Involvement and Engagement. 7 (1): 23. doi: 10.1186/s40900-021-00271-4 . ISSN   2056-7529. PMC   8094476 . PMID   33941290.
  11. 1 2 Meirik, O. "Cohort and Case-Control Studies". In Campana, Aldo (ed.). Reproductive Health. Geneva Foundation for Medical Education and Research.
  12. Vessey, M. P.; Lawless, M. (1984). "The Oxford-Family Planning Association contraceptive study". Clinics in Obstetrics and Gynaecology. 11 (3): 743–757. doi:10.1016/S0306-3356(21)00625-7. ISSN   0306-3356. PMID   6509857.
  13. Adams TD, Gress RE, Smith SC, et al. (2007). "Long-term mortality after gastric bypass surgery" (PDF). N. Engl. J. Med. 357 (8): 753–61. doi:10.1056/NEJMoa066603. PMID   17715409. S2CID   8710295. Archived from the original (PDF) on 20 February 2019.
  14. "The Lothian Birth Cohort Studies". University of Edinburgh. Archived from the original on 22 October 2009. Retrieved 8 May 2011.
  15. Pai, JK; Pischon, T; Ma, J; et al. (2004). "Inflammatory markers and the risk of coronary heart disease in men and women". N. Engl. J. Med. 351 (25): 2599–2610. doi: 10.1056/NEJMoa040967 . PMID   15602020. S2CID   16142059.