Charles Lee (scientist)

Last updated
Charles Lee PhD, DSc, FACMG
Charles Lee photo.jpg
Personal details
BornAge: 55
Seoul, South Korea
Nationality Canadian
Alma mater University of Alberta (BSc, MSc, PhD)

University of Cambridge (Postdoc)

Harvard Medical School (Clinical Fellow)
Known forDiscovery of widespread structural variation in the human genome

Charles Lee is the Robert Alvine Family Endowed Chair, professor, and a board certified clinical cytogeneticist who has an active research program in the identification and characterization of structural genomic variants using advanced technology platforms. His laboratory was the first to describe genome-wide structural genomic variants (in the form of copy number variants (CNVs)) among humans [1] with the subsequent development of genomic maps [2] [3] that are used in the diagnoses of array-based genetic tests. Lee served as the president of the Human Genome Organisation (HUGO) from 2017 to 2023.

Contents

Education

Career

Positions held

Other appointments

Major research publications

In the press

2024

2023

2020

2018

2016

Related Research Articles

<span class="mw-page-title-main">Human genome</span> Complete set of nucleic acid sequences for humans

The human genome is a complete set of nucleic acid sequences for humans, encoded as the DNA within each of the 24 distinct chromosomes in the cell nucleus. A small DNA molecule is found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA sequences and various types of DNA that does not encode proteins. The latter is a diverse category that includes DNA coding for non-translated RNA, such as that for ribosomal RNA, transfer RNA, ribozymes, small nuclear RNAs, and several types of regulatory RNAs. It also includes promoters and their associated gene-regulatory elements, DNA playing structural and replicatory roles, such as scaffolding regions, telomeres, centromeres, and origins of replication, plus large numbers of transposable elements, inserted viral DNA, non-functional pseudogenes and simple, highly repetitive sequences. Introns make up a large percentage of non-coding DNA. Some of this non-coding DNA is non-functional junk DNA, such as pseudogenes, but there is no firm consensus on the total amount of junk DNA.

Gene duplication is a major mechanism through which new genetic material is generated during molecular evolution. It can be defined as any duplication of a region of DNA that contains a gene. Gene duplications can arise as products of several types of errors in DNA replication and repair machinery as well as through fortuitous capture by selfish genetic elements. Common sources of gene duplications include ectopic recombination, retrotransposition event, aneuploidy, polyploidy, and replication slippage.

<span class="mw-page-title-main">Copy number variation</span> Repeated DNA variation between individuals

Copy number variation (CNV) is a phenomenon in which sections of the genome are repeated and the number of repeats in the genome varies between individuals. Copy number variation is a type of structural variation: specifically, it is a type of duplication or deletion event that affects a considerable number of base pairs. Approximately two-thirds of the entire human genome may be composed of repeats and 4.8–9.5% of the human genome can be classified as copy number variations. In mammals, copy number variations play an important role in generating necessary variation in the population as well as disease phenotype.

<span class="mw-page-title-main">Human genetic variation</span> Genetic diversity in human populations

Human genetic variation is the genetic differences in and among populations. There may be multiple variants of any given gene in the human population (alleles), a situation called polymorphism.

<span class="mw-page-title-main">Chromosomal rearrangement</span>

In genetics, a chromosomal rearrangement is a mutation that is a type of chromosome abnormality involving a change in the structure of the native chromosome. Such changes may involve several different classes of events, like deletions, duplications, inversions, and translocations. Usually, these events are caused by a breakage in the DNA double helices at two different locations, followed by a rejoining of the broken ends to produce a new chromosomal arrangement of genes, different from the gene order of the chromosomes before they were broken. Structural chromosomal abnormalities are estimated to occur in around 0.5% of newborn infants.

<span class="mw-page-title-main">ARID1B</span> Protein-coding gene in humans

AT-rich interactive domain-containing protein 1B is a protein that in humans is encoded by the ARID1B gene. ARID1B is a component of the human SWI/SNF chromatin remodeling complex.

<span class="mw-page-title-main">CDY1</span> Protein-coding gene in humans

Testis-specific chromodomain protein Y 1 is a protein that in humans is encoded by the CDY1 gene.

<span class="mw-page-title-main">BBS2</span> Protein-coding gene in the species Homo sapiens

Bardet–Biedl syndrome 2 protein is a protein that in humans is encoded by the BBS2 gene.

<span class="mw-page-title-main">EML4</span> Protein-coding gene in the species Homo sapiens

Echinoderm microtubule-associated protein-like 4 is a protein that in humans is encoded by the EML4 gene.

<span class="mw-page-title-main">1000 Genomes Project</span> International research effort on genetic variation

The 1000 Genomes Project (1KGP), taken place from January 2008 to 2015, was an international research effort to establish the most detailed catalogue of human genetic variation at the time. Scientists planned to sequence the genomes of at least one thousand anonymous healthy participants from a number of different ethnic groups within the following three years, using advancements in newly developed technologies. In 2010, the project finished its pilot phase, which was described in detail in a publication in the journal Nature. In 2012, the sequencing of 1092 genomes was announced in a Nature publication. In 2015, two papers in Nature reported results and the completion of the project and opportunities for future research.

<span class="mw-page-title-main">Koolen–De Vries syndrome</span> Rare genetic disorder caused by a deletion of six genes

Koolen–De Vries syndrome (KdVS), also known as 17q21.31 microdeletion syndrome, is a rare genetic disorder caused by a deletion of a segment of chromosome 17 which contains six genes. This deletion syndrome was discovered independently in 2006 by three different research groups.

DECIPHER is a web-based resource and database of genomic variation data from analysis of patient DNA. It documents submicroscopic chromosome abnormalities and pathogenic sequence variants, from over 25000 patients and maps them to the human genome using Ensembl or UCSC Genome Browser. In addition it catalogues the clinical characteristics from each patient and maintains a database of microdeletion/duplication syndromes, together with links to relevant scientific reports and support groups.

Non-allelic homologous recombination (NAHR) is a form of homologous recombination that occurs between two lengths of DNA that have high sequence similarity, but are not alleles.

<span class="mw-page-title-main">Stephen W. Scherer</span> Canadian scientist (born 1964)

Stephen Wayne "Steve" Scherer is a Canadian scientist who currently serves as the Chief of Research at The Hospital for Sick Children (SickKids) and distinguished University Professor at the University of Toronto. He obtained his PhD at the University of Toronto under Professor Lap-chee Tsui. Together they founded Canada's first human genome centre, the Centre for Applied Genomics (TCAG). He is a Senior Fellow of Massey College at the University of Toronto. In 2014, he was named an esteemed Clarivate Citation laureate in Physiology or Medicine for the “Discovery of large-scale gene copy number variation and its association with specific diseases.

Genomic structural variation is the variation in structure of an organism's chromosome, such as deletions, duplications, copy-number variants, insertions, inversions and translocations. Originally, a structure variation affects a sequence length about 1kb to 3Mb, which is larger than SNPs and smaller than chromosome abnormality. However, the operational range of structural variants has widened to include events > 50bp. Some structural variants are associated with genetic diseases, however most are not. Approximately 13% of the human genome is defined as structurally variant in the normal population, and there are at least 240 genes that exist as homozygous deletion polymorphisms in human populations, suggesting these genes are dispensable in humans. While humans carry a median of 3.6 Mbp in SNPs, a median of 8.9 Mbp is affected by structural variation which thus causes most genetic differences between humans in terms of raw sequence data.

<span class="mw-page-title-main">NCAPH2</span> Protein-coding gene in the species Homo sapiens

Condensin-2 complex subunit H2, also known as chromosome-associated protein H2 (CAP-H2) or non-SMC condensin II complex subunit H2 (NCAPH2), is a protein that in humans is encoded by the NCAPH2 gene. CAP-H2 is a subunit of condensin II, a large protein complex involved in chromosome condensation.

Single-cell DNA template strand sequencing, or Strand-seq, is a technique for the selective sequencing of a daughter cell's parental template strands. This technique offers a wide variety of applications, including the identification of sister chromatid exchanges in the parental cell prior to segregation, the assessment of non-random segregation of sister chromatids, the identification of misoriented contigs in genome assemblies, de novo genome assembly of both haplotypes in diploid organisms including humans, whole-chromosome haplotyping, and the identification of germline and somatic genomic structural variation, the latter of which can be detected robustly even in single cells.

<span class="mw-page-title-main">End-sequence profiling</span>

End-sequence profiling (ESP) is a method based on sequence-tagged connectors developed to facilitate de novo genome sequencing to identify high-resolution copy number and structural aberrations such as inversions and translocations.

<span class="mw-page-title-main">Structural variation in the human genome</span> Genomic alterations, varying between individuals

Structural variation in the human genome is operationally defined as genomic alterations, varying between individuals, that involve DNA segments larger than 1 kilo base (kb), and could be either microscopic or submicroscopic. This definition distinguishes them from smaller variants that are less than 1 kb in size such as short deletions, insertions, and single nucleotide variants.

Matthew Edward Hurles is director of the Wellcome Sanger Institute and an honorary professor of Human Genetics and Genomics at the University of Cambridge.

References

  1. 1 2 Iafrate, AJ; Feuk, L; Rivera, MN; et al. (2004). "Detection of large-scale variation in the human genome". Nat. Genet. 36 (9): 949–951. doi: 10.1038/ng1416 . PMID   15286789.
  2. 1 2 Redon, R; Ishikawa, S; Fitch, KR; et al. (2006). "Global variation in copy number in the human genome". Nature. 444 (7118): 444–454. Bibcode:2006Natur.444..444R. doi:10.1038/nature05329. PMC   2669898 . PMID   17122850.
  3. 1 2 Conrad, Donald F.; Pinto, Dalila; Redon, Richard; Feuk, Lars; Gokcumen, Omer; Zhang, Yujun; Aerts, Jan; Andrews, T. Daniel; Barnes, Chris; Campbell, Peter; Fitzgerald, Tomas; Hu, Min; Ihm, Chun Hwa; Kristiansson, Kati; MacArthur, Daniel G.; MacDonald, Jeffrey R.; Onyiah, Ifejinelo; Pang, Andy Wing Chun; Robson, Sam; Stirrups, Kathy; Valsesia, Armand; Walter, Klaudia; Wei, John; Tyler-Smith, Chris; Carter, Nigel P.; Lee, Charles; Scherer, Stephen W.; Hurles, Matthew E. (2010). "Common copy number variation in the human genome: mechanism, selection and disease association". Nature. 464 (7289): 704–712. Bibcode:2010Natur.464..704.. doi:10.1038/nature08516. PMC   3330748 . PMID   19812545.
  4. "University of Alberta: Distinguished Alumni Award: Charles Lee, '90 BSc(Spec), '93 MSc, '96 PhD, medical geneticist". Archived from the original on 2019-11-27. Retrieved 2019-06-16.
  5. Lee, C.; Sasi, R.; Lin, C.C. (1993). "Interstitial localization of telomeric DNA sequences in the Indian muntjac chromosomes: further evidence for tandem chromosome fusions in the karyotypic evolution of the Asian muntjacs". Cytogenet. Cell Genet. 63 (3): 156–159. doi:10.1159/000133525. PMID   8485991.
  6. Lee, C.; Wevrick, R.; Fisher, R. B.; Ferguson-Smith, M. A.; Lin, C. C. (1997). "Human centromeric DNAs". Human Genetics. 100 (3–4): 291–304. doi:10.1007/s004390050508. PMID   9272147. S2CID   615040.
  7. Perry, GH; Dominy, NJ; Claw, KG; et al. (2007). "Diet and the evolution of human gene copy number variation". Nat. Genet. 39 (10): 1256–1260. doi:10.1038/ng2123. PMC   2377015 . PMID   17828263.
  8. Lee, Charles; Iafrate, A John; Brothman, Arthur R. (2007). "Copy number variations and clinical cytogenetic diagnosis of constitutional disorders". Nat. Genet. 39 (7s): S48–54. doi:10.1038/ng2092. PMID   17597782. S2CID   23031436.
  9. Perry, G. H.; Ben-Dor, A.; Tsalenko, A.; Sampas, N.; Rodriguez-Revenga, L.; Tran, C. W.; Scheffer, A.; Steinfeld, I.; Tsang, P.; Yamada, N. A.; Park, H. S.; Kim, J. I.; Seo, J. S.; Yakhini, Z.; Laderman, S.; Bruhn, L.; Lee, C. (2008). "The fine-scale and complex architecture of human copy number variation". Am J Hum Genet. 82 (3): 685–695. doi:10.1016/j.ajhg.2007.12.010. PMC   2661628 . PMID   18304495.
  10. Mills, RE; Walter, K; Stewart, C; et al. (2011). "Mapping copy number variation by population-scale genome sequencing". Nature. 470 (7332): 59–65. Bibcode:2011Natur.470...59.. doi:10.1038/nature09708. PMC   3077050 . PMID   21293372.
  11. Brown, Kim H.; Dobrinski, Kimberly P.; Lee, Arthur S.; Gokcumen, Omer; Mills, Ryan E.; Shi, Xinghua; Chong, Wilson W. S.; Chen, Jin Yun Helen; Yoo, Paulo; David, Sthuthi; Peterson, Samuel M.; Raj, Towfique; Choy, Kwong Wai; Stranger, Barbara E.; Williamson, Robin E.; Zon, Leonard I.; Freeman, Jennifer L.; Lee, Charles (2012). "Extensive genetic diversity and sub-structuring among zebrafish strains revealed through copy number variant analysis". Proc Natl Acad Sci USA. 109 (2): 529–534. Bibcode:2012PNAS..109..529B. doi: 10.1073/pnas.1112163109 . PMC   3258620 . PMID   22203992.
  12. Gokcumen, O; Tischler, V; Tica, J; Zhu, Q; Iskow, RC; Lee, E; Fritz, MH; Langdon, A; Stütz, AM; Pavlidis, P; Benes, V; Mills, RE; Park, PJ; Lee, C; Korbel, JO (2013). "Primate genome architecture influences structural variation mechanisms and functional consequences". Proc Natl Acad Sci USA. 110 (39): 15764–15769. Bibcode:2013PNAS..11015764G. doi: 10.1073/pnas.1305904110 . PMC   3785719 . PMID   24014587.
  13. Sudmant, Peter H.; Rausch, Tobias; Gardner, Eugene J.; Handsaker, Robert E.; Abyzov, Alexej; Huddleston, John; Zhang, Yan; Ye, Kai; Jun, Goo (2015-10-01). "An integrated map of structural variation in 2,504 human genomes". Nature. 526 (7571): 75–81. Bibcode:2015Natur.526...75.. doi:10.1038/nature15394. ISSN   0028-0836. PMC   4617611 . PMID   26432246.