UK Biobank | |
---|---|
Mission statement | "Improving the prevention, diagnosis and treatment of a wide range of serious and life-threatening illnesses – including cancer, heart diseases, stroke, diabetes, arthritis, osteoporosis, eye disorders, depression and forms of dementia." |
Commercial? | No |
Location | Stockport, Greater Manchester, UK |
Founder | Rory Collins |
Established | January 2002 |
Website | www |
UK Biobank is a long-term prospective biobank study in the United Kingdom (UK) [1] that houses de-identified [2] biological samples and health-related data [3] on half a million people. Volunteer participants aged 40-69 were recruited between 2006 and 2010 [1] from across Great Britain and consented to share their health data and to be followed for at least 30 years thereafter with the aim to enable scientific discoveries into the prevention, diagnosis, and treatment [3] of disease.
Based in Stockport, Greater Manchester, it is incorporated as a limited company and registered charity [4] in England and Wales, and registered as a charity [5] in Scotland. [6] [7] [8]
UK Biobank holds more than 10,000 variables of data on many of their 500,000 participants to inform research including biological samples, physical measurements, body and brain imaging data, bone density data, activity tracking and lifestyle questionnaire data. [9] Participants continue to provide more data over time. They have over 15 million biological samples stored, which can be requested for use by researchers, and their online database holds over 30 petabytes of data. [10] [11] [12] Their human genome sequencing database, proteomic database, and human imaging project are the largest in the world. [13] [14] The project is enabling scientists to study the onset of diseases such as cancers, heart disease, and age-related conditions in the early stages of their development. [15] [16] [17] Nature has referred to UK Biobank as an "unprecedented open access database." [18]
Since 2012, [19] 30,000 researchers from over 90 countries [20] have registered to use UK Biobank. As of November 2023 there have been over 9,000 [21] peer-reviewed publications using UK Biobank data, including over 3,000 in 2023. [22]
UK Biobank is funded by the Medical Research Council (MRC), the Wellcome Trust, UK government and charitable sources, and philanthropic donations. [23]
UK Biobank was conceived in the early 2000s, [24] with Professor Sir Rory Collins appointed as the Principal Investigator and Chief Executive of UK Biobank in 2005. [25] An incremental approach was adopted to developing the study procedures and technology, using systems designed and developed by the Clinical Trial Service Unit. This consisted of a series of pilot studies of increasing complexity and sophistication with interludes for assessment of results and additional scientific input. In-house trials were conducted during 2005, and a fully integrated clinic was run at Altrincham, Greater Manchester throughout Spring 2006 where 3,800 individuals were assessed. On 22 August 2006, it was announced that the main programme would recruit men and women aged between 40 and 69 based from up to 35 regional centres. [26]
Following the initial pilot stage in the 2005-6 period, [27] the main study began in April 2007 and by the end of that year 50,000 people had taken part. Recruitment reached 100,000 in April 2008, 200,000 in October 2008, 300,000 in May 2009, 400,000 in November 2009 and passed the 500,000 target in July 2010. The volunteers were largely healthy, wealthy and white European. Rather than recruiting more participants into the biobank, the organisation is helping other institutions establish and run similar initiatives. [28] Participant enrolment was declared complete in August 2010. [29] However recruitment proved more efficient than hoped and only 22 centres had been opened when the recruitment target of 500,000 was reached in 2010.
In May 2023, UKRI announced that UK Biobank was set to receive £127.6m to move to a purpose-built facility at Bruntwood SciTech's Manchester Science Park. [30]
In October 2023, a number of funders including Eric Schmidt and Ken Griffin announced further funding for the next five years. [31] [32]
The study is following about 500,000 volunteers in the UK, enrolled at ages from 40 to 69. Initial enrolment took place over four years from 2006, and the volunteers will be followed for at least 30 years thereafter. [33]
Prospective participants were invited to visit an assessment centre, at which they completed an automated questionnaire and were interviewed about lifestyle, medical history and nutritional habits; basic variables such as weight, height, blood pressure etc. were measured; and blood and urine samples were taken. These samples were preserved so that it was possible to later extract DNA and measure other biologically important substances. During the whole duration of the study it was intended that all disease events, drug prescriptions and deaths of the participants are recorded in a database, taking advantage of the centralized UK National Health Service. [34] [35]
During the initial physical examination, basic feedback was provided to the participant regarding their weight, height, BMI, blood pressure, lung vital capacity, bone density and intra-ocular pressure; however if any other medical problems were detected, neither the participant nor their physician would be notified. Problems detected later, such as genetic risk factors, were not conveyed to either participant or physician ("to ensure that volunteers are not penalised by insurance companies, for example, which may require customers to disclose the results of any genetic tests."). [36]
From 2012, researchers were able to apply to use the database (though they are not given access to the volunteers, who will remain strictly anonymous). A typical study using the database might compare a sample of participants who developed a particular disease, such as cancer, heart disease, diabetes or Alzheimer's disease, with a sample of those that did not, in an attempt to measure the benefits, risk contribution and interaction of specific genes, lifestyles, and medications.
In 2017 researchers were able to access the database including genetic information. [37] [38] By 2017 Biobank participants had approximately 1.3 million hospitalisations, 40,000 cancer incidents with 14,000 of them having died. [39]
The study was initially launched with a visit consisting of the following: [39]
Once the visit-based assessment method was proven, the range of investigations was extended to include: [39]
Since the completion of recruitment several new types of data have been added:
Following the initial pilot stage in the 2005-6 period, the main study began in April 2007 and by the end of that year 50,000 people had taken part. Recruitment reached 100,000 in April 2008, 200,000 in October 2008, 300,000 in May 2009, 400,000 in November 2009 and passed the 500,000 target in July 2010. Participant enrolment was declared complete in August 2010. [55] The volunteers were largely healthy, wealthy and white European. Rather than recruiting more participants into the biobank, the organisation is helping other institutions establish and run similar initiatives. [56]
The UK Biobank dataset was opened to applications from researchers in March 2012. [57] The resource is available to scientists from the UK and outside, whether they work in the public or private sector, for industry, academia or a charity, subject to verification that the research is health-related and in the public interest. Researchers must register to be approved to use UK Biobank data. [58] Researchers are required to publish their results in an open source publication site or in an academic journal and return their findings to the UK Biobank. [59]
In 2021, UK Biobank launched its cloud-based Research Analysis Platform (UKB-RAP), providing information technology infrastructure to store and analyse UK Biobank's large dataset regardless of the researcher's own technological capabilities. [60] By 2023 the platform had over 5,000 users. [61] The platform is hosted by Amazon Web Services, which also pledged $1.5 million in research credits for early career researchers and researchers from low and low-middle income countries to reduce limitations when collating, storing and securely accessing data. [62]
By 2023 30,000 researchers had registered to use the resource and over 9,000 peer-reviewed articles based on UK Biobank data had been published. [63]
A 2022 review of UK Biobank data found that road traffic noise exposure increases risk of CVD mortality, stroke and all-cause mortality. [64]
A 2023 review found that participants with sense of meaning and purpose in life have a decreased risk of dementia. [65] Two other studies of participants in the UK Biobank study found that dementia risk was higher for those who were more socially isolated. [66]
A study from the UK Biobank showed a reduction in grey matter thickness, overall reduction in brain size and greater cognitive decline in patients after COVID-19 compared with control groups. [67] [68] The UK Biobank also reported on an increased risk of hospitalization for those who contracted COVID-19 with obesity. [69]
Reviews of UK Biobank data have found that pescatarians and vegetarians have a lower risk of colorectal and prostate cancer compared to red meat eaters. [70] Consumption of processed meat increases risk of breast cancer. [71] They have also found that men with higher total and central adiposity have an increased risk of prostate cancer death. [72]
The UK Biobank project operates within the terms of an Ethics and Governance Framework. [73] [74] [75] The Framework describes a series of standards to which UK Biobank will operate during the creation, maintenance and use of the resource and it elaborates on the commitments that are involved to those participating in the project, researchers and the public more broadly. The independent UK Biobank Ethics and Governance Council provides advice to the project and monitors its conformity with the Framework. [76] The Council also advises more generally on the interests of research participants and the general public in relation to the project.[ citation needed ]
The UK Biobank Board is accountable to the members of the company (the Medical Research Council and The Wellcome Trust) and acts as company directors and as charity trustees. It is chaired by Lord Kakkar, [77] who succeeded Sir Michael Rawlins in January 2020.
The project has been generally praised for its ambitious scope and unique potential. A scientific review panel concluded, the "UK Biobank has the potential, in ways that are not currently available elsewhere, to support a wide range of research". [55] Colin Blakemore, chief executive of the MRC, predicted it "will provide scientists with extraordinary information" [78] and "grow into a unique resource for future generations." [55] There was some early criticism, however. GeneWatch UK, a pressure group that claims to promote the responsible use of genetic information, asserted that the complexity of the programme could result in the finding of "false links between genes and disease", [55] and expressed concern that the genetic information from patients could be patented for commercial purposes. Biobank's chief executive described such a risk as "extremely low, if it exists at all." [78]
Some literature has raised concerns that the UK Biobank is not representative of the diversity of the UK population or is not applicable to diverse populations. [79] [80]
Questions and concerns have been raised about the nature of some of the approved studies that have made use of UK Biobank data. Some of these studies are less obviously related to UK Biobank's self-stated mission of "Improving the prevention, diagnosis and treatment of a wide range of serious and life-threatening illnesses". In 2016, a study on "genetic contributions to social deprivation and household income" was published using UK Biobank data. [81] In 2019, researchers published a study on the genetics of same-sex behaviour, which made use of data from 23andMe and UK Biobank. [82] Following the publication, researchers and bioethicists voiced concerns about the approval and nature of the study, including how the findings were being exploited commercially, similar to a former UK Biobank-based study on educational attainment. [83]
In November 2023, The Observer reported that UK Biobank had approved access for companies in the insurance sector between 2020 and 2023 and that this runs counter some of the publicly made claims about how insurance companies and law enforcement would not access the data and records donated by volunteers. [84] In response, UK Biobank released a response, stating that the quoted passages were only made in relation to identifiable data, not the de-identified data that was ultimately shared with insurance companies. [85]
In 2023 the UK's Secretary of State for Science, Innovation and Technology Michelle Donelan described UK Biobank as "mak[ing] an unparalleled contribution to science across the whole world, by putting invaluable information at researchers' fingertips. It is without question a jewel in the crown of UK science, and an envy of the world". [86]
In October 2024, The Guardian published a story detailing that the far-right "Human Diversity Foundation" had gotten access to UK Biobank data to use it in pseudo-scientific race studies, raising questions about the governance and processes for approving and controlling access to the data. [87] Following the initial publication in The Guardian, UK Biobank released a statement that criticised the report and dismissed the main findings, claiming to have conducted a full investigation that had "found no evidence of misuse of UK Biobank data" as only publicly accessible summary statistics where used. [88] However, in a follow-up report by The Guardian after UK Biobank's statement, the newspaper reported on correspondence between a senior medic and UK Biobank chief executive, Prof Sir Rory Collins, in which Collins said that the inquiries were continuing. [89] In the same article, The Guardian also reported on the US-based startup, Heliospect Genomics that claims to use UK Biobank data to predict traits such as IQ, sex and height, as well as risk of obesity or mental illness, in human embryos for IVF treatment.
The UK Biobank is funded by the UK Department of Health, the Medical Research Council, the Scottish Executive, and the Wellcome Trust medical research charity. The cost of the initial participant recruitment and assessment phase was 62 million GBP. [90]
Epidemiology is the study and analysis of the distribution, patterns and determinants of health and disease conditions in a defined population.
Personalized medicine, also referred to as precision medicine, is a medical model that separates people into different groups—with medical decisions, practices, interventions and/or products being tailored to the individual patient based on their predicted response or risk of disease. The terms personalized medicine, precision medicine, stratified medicine and P4 medicine are used interchangeably to describe this concept, though some authors and organizations differentiate between these expressions based on particular nuances. P4 is short for "predictive, preventive, personalized and participatory".
Genetic discrimination occurs when people treat others differently because they have or are perceived to have a gene mutation(s) that causes or increases the risk of an inherited disorder. It may also refer to any and all discrimination based on the genotype of a person rather than their individual merits, including that related to race, although the latter would be more appropriately included under racial discrimination. Some legal scholars have argued for a more precise and broader definition of genetic discrimination: "Genetic discrimination should be defined as when an individual is subjected to negative treatment, not as a result of the individual's physical manifestation of disease or disability, but solely because of the individual's genetic composition." Genetic Discrimination is considered to have its foundations in genetic determinism and genetic essentialism, and is based on the concept of genism, i.e. distinctive human characteristics and capacities are determined by genes.
In genomics, a genome-wide association study, is an observational study of a genome-wide set of genetic variants in different individuals to see if any variant is associated with a trait. GWA studies typically focus on associations between single-nucleotide polymorphisms (SNPs) and traits like major human diseases, but can equally be applied to any other genetic variants and any other organisms.
The Joondalup Family Health Study (JFHS) will be an extensively characterized, community-based cohort study, which will investigate the complex interplay of environmental, lifestyle and genetic components that affect the risk of common conditions such as asthma, heart disease and diabetes. The Study will take place in Joondalup, Western Australia and will have a strong focus on families and the health of both children and adults.
A biobank is a type of biorepository that stores biological samples for use in research. Biobanks have become an important resource in medical research, supporting many types of contemporary research like genomics and personalized medicine.
Sir Rory Edwards Collins FMedSci FRS is a British physician who is Professor of Medicine and Epidemiology at the Clinical Trial Service Unit within the University of Oxford, the head of the Nuffield Department of Population Health and a Fellow of Green Templeton College, Oxford. His work has been in the establishment of large-scale epidemiological studies of the causes, prevention and treatment of heart attacks, other vascular disease, and cancer, while also being closely involved in developing approaches to the combination of results from related studies ("meta-analyses"). Since September 2005, he has been the Principal Investigator and Chief Executive of the UK Biobank, a prospective study of 500,000 British people aged 40–69 at recruitment.
Generation Scotland is a biobank, a resource of biological samples and information on health and lifestyle from thousands of volunteer donors in Scotland.
Biobank ethics refers to the ethics pertaining to all aspects of biobanks. The issues examined in the field of biobank ethics are special cases of clinical research ethics.
Return of results is a concept in research ethics which describes the extent of the duty of a researcher to reveal and explain the results of research to a research participant.
Privacy for research participants is a concept in research ethics which states that a person in human subject research has a right to privacy when participating in research. Some typical scenarios this would apply to include, or example, a surveyor doing social research conducts an interview with a participant, or a medical researcher in a clinical trial asks for a blood sample from a participant to see if there is a relationship between something which can be measured in blood and a person's health. In both cases, the ideal outcome is that any participant can join the study and neither the researcher nor the study design nor the publication of the study results would ever identify any participant in the study. Thus, the privacy rights of these individuals can be preserved.
CARTaGENE is a population-based cohort based on an ongoing and long-term health study of 40,000 men and women in Québec. It is a regional cohort member of the Canadian Partnership for Tomorrow's Health (CanPath). The project's core mandate is to identify the genetic and environmental causes of common chronic diseases affecting the Québecois population, and to develop personalized medicine and public policy initiatives targeting high-risk groups for the public.
Dynamic consent is an approach to informed consent that enables on-going engagement and communication between individuals and the users and custodians of their data. It is designed to address the many issues that are raised by the use of digital technologies in research and clinical care that enable the wide-scale use, linkage, analysis and integration of diverse datasets and the use of AI and big data analyses. These issues include how to obtain informed consent in a rapidly-changing environment; growing expectations that people should know how their data is being used; increased legal and regulatory requirements for the management of secondary use of data in biobanks and other medical research infrastructure. The approach started to be implemented in 2007 by an Italian group who introduced the ways to have an ongoing process of interaction between researcher and participant where "technology now allows the establishment of dynamic participant–researcher partnerships." The use of digital interfaces in this way was first described as 'Dynamic Consent' in the EnCoRe project. Dynamic Consent therefore describes a personalised, digital interface that enables two-way communication between participants and researchers and is a practical example of how software can be developed to give research participants greater understanding and control over how their data is used. It also enables clinical trial managers, researchers and clinicians to know what type of consent is attached to the use of data they hold and to have an easy way to seek a new consent if the use of the data changes. It is able to support greater accountability and transparency, streamlining consent processes to enable compliance with regulatory requirements.
In genetics, a polygenic score (PGS) is a number that summarizes the estimated effect of many genetic variants on an individual's phenotype. The PGS is also called the polygenic index (PGI) or genome-wide score; in the context of disease risk, it is called a polygenic risk score or genetic risk score. The score reflects an individual's estimated genetic predisposition for a given trait and can be used as a predictor for that trait. It gives an estimate of how likely an individual is to have a given trait based only on genetics, without taking environmental factors into account; and it is typically calculated as a weighted sum of trait-associated alleles.
DNA encryption is the process of hiding or perplexing genetic information by a computational method in order to improve genetic privacy in DNA sequencing processes. The human genome is complex and long, but it is very possible to interpret important, and identifying, information from smaller variabilities, rather than reading the entire genome. A whole human genome is a string of 3.2 billion base paired nucleotides, the building blocks of life, but between individuals the genetic variation differs only by 0.5%, an important 0.5% that accounts for all of human diversity, the pathology of different diseases, and ancestral story. Emerging strategies incorporate different methods, such as randomization algorithms and cryptographic approaches, to de-identify the genetic sequence from the individual, and fundamentally, isolate only the necessary information while protecting the rest of the genome from unnecessary inquiry. The priority now is to ascertain which methods are robust, and how policy should ensure the ongoing protection of genetic privacy.
Catherine Sudlow is a British neurologist. She is a professor of Neurology and Clinical Epidemiology and Head of the Centre for Medical Informatics at the Usher Institute of Population Health Sciences and Informatics at the University of Edinburgh. She is the Chief Scientist of UK Biobank, an honorary Consultant Neurologist in the Division of Clinical Neurosciences in Edinburgh and Director of the British Heart Foundation Data Science Centre.
The Pregnancy Outcome Prediction (POP) Study is a prospective cohort study of 4,512 women who have never given birth, recruited at the Rosie Hospital between January 2008 and July 2012.
Andrew M. McIntosh is a UK academic psychiatrist. He is Professor of Biological Psychiatry at the University of Edinburgh,. The main focus of his research is using genomic and neuroimaging approaches to better understand the causes and causal consequences of Major Depressive Disorder.
The Copenhagen General Population Study is an ongoing Danish longitudinal cohort study, that investigates the epidemiology of a wide range of diseases, in particular cardiovascular disease, respiratory disease, cancer, infections and other multifactorial diseases, in a representative sample of the Danish population. Headquartered at the Department of Clinical Biochemistry at Herlev Hospital, it was established in 2003 with professor Børge Nordestgaard as principal investigator, and it is the world's largest population study that follows the long-term health development of a healthy population over time, currently collecting data on over 170,000 enrolled participants.
{{cite journal}}
: CS1 maint: multiple names: authors list (link){{cite journal}}
: CS1 maint: numeric names: authors list (link)