Jonathan Marchini

Last updated
Jonathan Marchini
Jonathan Marchini.png
Born
Jonathan Laurence Marchini

Chichester
NationalityBritish
Alma mater
Awards
Scientific career
Institutions
Thesis The Statistical Analysis of Brain Images (2002)
Doctoral advisor Brian Ripley. [2]
Doctoral studentsBryan Howie, Teresa Ferreira, Claire Churchouse, Valentina Iotchkova, Jared O'Connell, Matt Kerin, Andy Dahl
Other notable studentsOlivier Delaneau
Website jmarchini.org

Jonathan Laurence Marchini (born 19 May 1973) [3] is a Bayesian statistician and professor of statistical genomics [4] in the Department of Statistics at the University of Oxford, a tutorial fellow in statistics at Somerville College, [5] Oxford and a co-founder and director of Gensci Ltd. [6] He co-leads the Haplotype Reference Consortium. [7]

Contents

Education

He obtained a Bachelor of Science degree in Pure Mathematics and Mathematical Statistics from Exeter University from 1991-94. [8] He then obtained a PGCE in Mathematics Education from the West Sussex Institute of Higher Education from 1994-5. He completed his DPhil in the Department of Statistics at the University of Oxford supervised by Professor Brian Ripley from 1998-2002. [9]

Career and research

Marchini spent three years working as a VSO volunteer teaching A-level Mathematics at Tosamaganga Secondary school, near Iringa, Tanzania, between September 1995 and September 1998. [10]

From 2002 to 2005 he held a Wellcome Trust Training Fellowship in Mathematical Biology, under the supervision of Prof Lon Cardon and Prof Peter Donnelly.

In 2006 he was appointed as a university lecturer (associate professor) in statistical genomics in the Department of Statistics at the University of Oxford and a senior research fellow at Mansfield College. In 2007 he became an affiliated group leader at the Wellcome Trust Center of Human Genetics [11] at the University of Oxford. In 2010 he was re-appointed until retirement [12]

In 2015 he was promoted to professor of statistical genomics [13] [14]

Marchini's research focusses on statistical genetics and population genetics, with a particular emphasis on methods development for genome-wide association studies. He has worked on haplotype estimation, [15] [16] [17] [18] genotype imputation, [19] genotype calling from arrays and sequencing, sparse tensor decomposition for RNA-seq datasets, [20] population structure, [21] phenotype prediction and mixed models, [22] gene-gene interactions [23] and brain imaging genetics. [24]

He was a member of the analysis team for the International HapMap Project, the Wellcome Trust Case-Control Consortium, the 1000 Genomes Project and the UK10K Project. His research group was responsible for the haplotype estimation and genotype imputation for the UK Biobank dataset. He co-leads the Haplotype Reference Consortium. [25]

He has been an ISI Highly Cited Researcher from 2014-2018 [26]

He has acted as an expert witness in a patent trial [27]

Awards

In 2012 he was awarded a Philip Leverhulme Prize [28] for "leading the way by constructing powerful and ingenious novel statistical methodology for population and medical genetics, together with associated fast computational algorithms and software."

Related Research Articles

The International HapMap Project was an organization that aimed to develop a haplotype map (HapMap) of the human genome, to describe the common patterns of human genetic variation. HapMap is used to find genetic variants affecting health, disease and responses to drugs and environmental factors. The information produced by the project is made freely available for research.

<span class="mw-page-title-main">Identity by descent</span> Identical nucleotide sequence due to inheritance without recombination from a common ancestor

A DNA segment is identical by state (IBS) in two or more individuals if they have identical nucleotide sequences in this segment. An IBS segment is identical by descent (IBD) in two or more individuals if they have inherited it from a common ancestor without recombination, that is, the segment has the same ancestral origin in these individuals. DNA segments that are IBD are IBS per definition, but segments that are not IBD can still be IBS due to the same mutations in different individuals or recombinations that do not alter the segment.

<span class="mw-page-title-main">Genome-wide association study</span> Study of genetic variants in different individuals

In genomics, a genome-wide association study, is an observational study of a genome-wide set of genetic variants in different individuals to see if any variant is associated with a trait. GWA studies typically focus on associations between single-nucleotide polymorphisms (SNPs) and traits like major human diseases, but can equally be applied to any other genetic variants and any other organisms.

Population structure is the presence of a systematic difference in allele frequencies between subpopulations. In a randomly mating population, allele frequencies are expected to be roughly similar between groups. However, mating tends to be non-random to some degree, causing structure to arise. For example, a barrier like a river can separate two groups of the same species and make it difficult for potential mates to cross; if a mutation occurs, over many generations it can spread and become common in one subpopulation while being completely absent in the other.

<span class="mw-page-title-main">Gil McVean</span> British statistical geneticist (born 1973)

Gilean Alistair Tristram McVean is a professor of statistical genetics at the University of Oxford, fellow of Linacre College, Oxford and co-founder and director of Genomics plc. He also co-chaired the 1000 Genomes Project analysis group.

<span class="mw-page-title-main">CLEC16A</span> Protein-coding gene in the species Homo sapiens

C-type lectin domain family 16, also known as CLEC16A, is a protein that in humans is encoded by the CLEC16A gene.

In genetics, association mapping, also known as "linkage disequilibrium mapping", is a method of mapping quantitative trait loci (QTLs) that takes advantage of historic linkage disequilibrium to link phenotypes to genotypes, uncovering genetic associations.

Sir Michael Rudolf Stratton, is a British clinical scientist and the third director of the Wellcome Trust Sanger Institute. He currently heads the Cancer Genome Project and is a leader of the International Cancer Genome Consortium.

<span class="mw-page-title-main">ZNF300</span> Human protein-coding gene

Zinc finger protein 300 is a protein that in humans is encoded by the ZNF300 gene. The protein encoded by this gene is a C2H2-type zinc finger DNA binding protein and a likely transcription factor.

Jonathan Karl Pritchard is an English-born professor of genetics at Stanford University, best known for his development of the STRUCTURE algorithm for studying population structure and his work on human genetic variation and evolution. His research interests lie in the study of human evolution, in particular in understanding the association between genetic variation among human individuals and human traits.

<span class="mw-page-title-main">Andrew Singleton</span> British neurogeneticist

Andrew B. Singleton is a British neurogeneticist currently working in the USA. He was born in Guernsey, the Channel Islands in 1972, where he lived until he was 18 years old. His secondary education was conducted at the Guernsey Grammar School. He earned a first class degree in Applied Physiology from Sunderland University and his PhD in neuroscience from the University of Newcastle upon Tyne where he studied the genetics of Alzheimer's disease and other dementias at the Medical Research Council (MRC) Neurochemical Pathology Unit. He moved to the United States in 1999, where he began working at the Mayo Clinic in Jacksonville, Florida studying the genetic basis of Parkinson's disease, ataxia, and dystonia. He moved to the National Institutes of Health in 2001 to head the newly formed Molecular Genetics unit within the Laboratory of Neurogenetics. In 2006 he took over as Chief of the Laboratory of Neurogenetics and became an NIH Distinguished Investigator in the intramural program at the National Institute on Aging (NIA) in 2017. In 2020 he stepped down as the Chief of the Laboratory of Neurogenetics and became the Acting Director of the newly formed Center for Alzheimer's and Related Dementias at the NIA. In 2021 he became the Director of CARD.

In genetics, haplotype estimation refers to the process of statistical estimation of haplotypes from genotype data. The most common situation arises when genotypes are collected at a set of polymorphic sites from a group of individuals. For example in human genetics, genome-wide association studies collect genotypes in thousands of individuals at between 200,000-5,000,000 SNPs using microarrays. Haplotype estimation methods are used in the analysis of these datasets and allow genotype imputation of alleles from reference databases such as the HapMap Project and the 1000 Genomes Project.

Imputation in genetics refers to the statistical inference of unobserved genotypes. It is achieved by using known haplotypes in a population, for instance from the HapMap or the 1000 Genomes Project in humans, thereby allowing to test for association between a trait of interest and experimentally untyped genetic variants, but whose genotypes have been statistically inferred ("imputed"). Genotype imputation is usually performed on SNPs, the most common kind of genetic variation.

<span class="mw-page-title-main">Gonçalo Abecasis</span>

Gonçalo Rocha Abecasis is a Portuguese American biomedical researcher at the University of Michigan and was chair of the Department of Biostatistics in the School of Public Health. He leads a group at the Center for Statistical Genetics in the Department of Biostatistics, where he is also the Felix E. Moore Collegiate Professor of Biostatistics and director of the Michigan Genomic Initiative. His group develops statistical tools to analyze the genetics of human disease.

Mega2 allows the applied statistical geneticist to convert one's data from several input formats to a large number output formats suitable for analysis by commonly used software packages. In a typical human genetics study, the analyst often needs to use a variety of different software programs to analyze the data, and these programs usually require that the data be formatted to their precise input specifications. Conversion of one's data into these multiple different formats can be tedious, time-consuming, and error-prone. Mega2, by providing validated conversion pipelines, can accelerate the analyses while reducing errors.

Sabera Nazneen Rahman is a geneticist who specialises in cancer research and is a non-executive director for Astra Zeneca. She was previously head of Genetics and Epidemiology at the Institute of Cancer Research.

A multilocus genotype is the combination of alleles found at two or more loci in a single individual.

The Wellcome Trust Case Control Consortium is a collaboration between fifty research groups in the United Kingdom in the field of human genetics. Established in 2005, the WTCCC aims to conduct genome-wide association studies (GWASs) to shed light on the genetic architecture of common human diseases. The founding chairman of the consortium was University of Oxford statistician Peter Donnelly. The consortium was funded by £9 million from the Wellcome Trust. According to the consortium's website, it has identified "approximately 90" new susceptibility loci for common human diseases.

In genetics, a haplotype block is a region of an organism's genome in which there is little evidence of a history of genetic recombination, and which contain only a small number of distinct haplotypes. According to the haplotype-block model, such blocks should show high levels of linkage disequilibrium and be separated from one another by numerous recombination events. The boundaries of haplotype blocks cannot be directly observed; they must instead be inferred indirectly through the use of algorithms. However, some evidence suggests that different algorithms for identifying haplotype blocks give very different results when used on the same data, though another study suggests that their results are generally consistent. The National Institutes of Health funded the HapMap project to catalog haplotype blocks throughout the human genome.

Human genetic clustering refers to patterns of relative genetic similarity among human individuals and populations, as well as the wide range of scientific and statistical methods used to study this aspect of human genetic variation.

References

  1. "Leverhulme website" (PDF). Archived from the original (PDF) on 2018-04-02. Retrieved 2018-04-01.
  2. Marchini, J.; Ripley, B. (2000). "A New Statistical Approach to Detecting Significant Activation in Functional MRI. Molecular evolution". NeuroImage. 12 (4): 366–80. doi:10.1006/nimg.2000.0628. PMID   10988031. S2CID   54375337.
  3. "Companies house record".
  4. "Marchini website".
  5. "Somerville website".
  6. "Companies house record".
  7. "HRC website".
  8. "Somerville College Report" (PDF).
  9. Marchini, Jonathan L.; Ripley, Brian D. (October 2000). "A New Statistical Approach to Detecting Significant Activation in Functional MRI". NeuroImage. 12 (4): 366–380. doi:10.1006/nimg.2000.0628. PMID   10988031. S2CID   54375337.
  10. "Photo album".
  11. "WTCHG website".
  12. "Oxford Gazette link".
  13. "Oxford Gazette link". Archived from the original on 2015-09-16. Retrieved 2018-04-01.
  14. "Oxford Gazette link".
  15. O'Connell, Jared; Sharp, Kevin; Shrine, Nick; Wain, Louise; Hall, Ian; Tobin, Martin; Zagury, Jean-Francois; Delaneau, Olivier; Marchini, Jonathan (6 June 2016). "Haplotype estimation for biobank-scale data sets". Nature Genetics. 48 (7): 817–820. doi:10.1038/ng.3583. PMC   4926957 . PMID   27270105.
  16. O'Connell, Jared; Gurdasani, Deepti; Delaneau, Olivier; Pirastu, Nicola; Ulivi, Sheila; Cocca, Massimiliano; Traglia, Michela; Huang, Jie; Huffman, Jennifer E.; Rudan, Igor; McQuillan, Ruth; Fraser, Ross M.; Campbell, Harry; Polasek, Ozren; Asiki, Gershim; Ekoru, Kenneth; Hayward, Caroline; Wright, Alan F.; Vitart, Veronique; Navarro, Pau; Zagury, Jean-Francois; Wilson, James F.; Toniolo, Daniela; Gasparini, Paolo; Soranzo, Nicole; Sandhu, Manjinder S.; Marchini, Jonathan; Gibson, Greg (17 April 2014). "A General Approach for Haplotype Phasing across the Full Spectrum of Relatedness". PLOS Genetics. 10 (4): e1004234. doi: 10.1371/journal.pgen.1004234 . PMC   3990520 . PMID   24743097.
  17. Delaneau, Olivier; Howie, Bryan; Cox, Anthony J.; Zagury, Jean-François; Marchini, Jonathan (October 2013). "Haplotype Estimation Using Sequencing Reads". The American Journal of Human Genetics. 93 (4): 687–696. doi:10.1016/j.ajhg.2013.09.002. PMC   3791270 . PMID   24094745.
  18. Delaneau, Olivier; Zagury, Jean-Francois; Marchini, Jonathan (1 January 2013). "Improved whole-chromosome phasing for disease and population genetic studies". Nature Methods. 10 (1): 5–6. doi:10.1038/nmeth.2307. PMID   23269371. S2CID   205421216.
  19. Howie, Bryan N.; Donnelly, Peter; Marchini, Jonathan; Schork, Nicholas J. (19 June 2009). "A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies". PLOS Genetics. 5 (6): e1000529. doi: 10.1371/journal.pgen.1000529 . PMC   2689936 . PMID   19543373.
  20. Hore, Victoria; Viñuela, Ana; Buil, Alfonso; Knight, Julian; McCarthy, Mark I; Small, Kerrin; Marchini, Jonathan (1 August 2016). "Tensor decomposition for multiple-tissue gene expression experiments". Nature Genetics. 48 (9): 1094–1100. doi:10.1038/ng.3624. PMC   5010142 . PMID   27479908.
  21. Marchini, Jonathan; Cardon, Lon R; Phillips, Michael S; Donnelly, Peter (28 March 2004). "The effects of human population structure on large genetic association studies". Nature Genetics. 36 (5): 512–517. doi: 10.1038/ng1337 . PMID   15052271.
  22. Dahl, Andrew; Iotchkova, Valentina; Baud, Amelie; Johansson, Åsa; Gyllensten, Ulf; Soranzo, Nicole; Mott, Richard; Kranis, Andreas; Marchini, Jonathan (22 February 2016). "A multiple-phenotype imputation method for genetic studies". Nature Genetics. 48 (4): 466–472. doi:10.1038/ng.3513. PMC   4817234 . PMID   26901065.
  23. Marchini, Jonathan; Donnelly, Peter; Cardon, Lon R (27 March 2005). "Genome-wide strategies for detecting multiple loci that influence complex diseases". Nature Genetics. 37 (4): 413–417. doi:10.1038/ng1537. PMID   15793588. S2CID   5922358.
  24. Elliott, Lloyd T.; Sharp, Kevin; Alfaro-Almagro, Fidel; Shi, Sinan; Miller, Karla; Douaud, Gwenaëlle; Marchini, Jonathan; Smith, Stephen (2018). "BioRxiv paper". doi: 10.1101/178806 .{{cite journal}}: Cite journal requires |journal= (help)
  25. "HRC website".
  26. "ISI Highly Cited Researchers".
  27. "Illumina vs Premaitha".
  28. "Leverhulme website" (PDF). Archived from the original (PDF) on 2018-04-02. Retrieved 2018-04-01.