Karen Miga

Last updated
Karen H. Miga
Karen Miga at Bioinformatics Community Conference 2020.jpg
Born
Karen Elizabeth Hayden
Known forTelomere to Telomere Project
Scientific career
Institutions University of California, Santa Cruz
Thesis A Genomic Definition of Centromeres in Complex Genomes  (2011)
Doctoral advisor Huntington Willard
Website www.migalab.com

Karen Elizabeth Hayden Miga is an American geneticist who co-leads the Telomere-to-Telomore (T2T) consortium that released fully complete assembly of the human genome in March 2022. [1] She is an assistant professor of biomolecular engineering at the University of California, Santa Cruz and Associate Director of Human Pangenomics at the UC Santa Cruz Genomics Institute. She was named as "One to Watch" in the 2020 Nature's 10 and one of Time 100’s most influential people of 2022. [2]

Contents

Research and career

In 2012, Miga joined the laboratory of David Haussler at the University of California, Santa Cruz. [3] At UCSC she combined computational and experimental approaches. [4] There she leads the telomere-to-telomere (T2T) consortium, [4] [5] a community based effort that seeks to fully sequence and assemble the human genome. [5] Her research efforts make use of long-read sequencing strategies. [5] [6] She makes use of the Oxford Nanopore Technologies MinION sequencer, which analyses DNA by detecting changes in current flow when DNA passes through nanopores in a membrane. [6]

Miga is the director of the Human Pangenome Production Center that seeks to contribute to the next human pangenome reference map through the creation of 350 T2T diploid genomes. [5] [7] This map will support the development of personalized therapeutics. [5]

In 2022, Miga participated in the closing plenary session of the Clinton Global Initiative meeting, [8] where she discussed the future of equitable genomics research with former President Bill Clinton, U2 lead singer Bono, and Director-General of the World Health Organization (WHO) Tedros Ghebreyesus. 2023, Karen Miga was named a 2023 Searle Scholar [9] to study uncharted heterochromatin regions of the human genome.

Selected publications

Related Research Articles

<span class="mw-page-title-main">Human genome</span> Complete set of nucleic acid sequences for humans

The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA sequences and various types of DNA that does not encode proteins. The latter is a diverse category that includes DNA coding for non-translated RNA, such as that for ribosomal RNA, transfer RNA, ribozymes, small nuclear RNAs, and several types of regulatory RNAs. It also includes promoters and their associated gene-regulatory elements, DNA playing structural and replicatory roles, such as scaffolding regions, telomeres, centromeres, and origins of replication, plus large numbers of transposable elements, inserted viral DNA, non-functional pseudogenes and simple, highly repetitive sequences. Introns make up a large percentage of non-coding DNA. Some of this non-coding DNA is non-functional junk DNA, such as pseudogenes, but there is no firm consensus on the total amount of junk DNA.

<span class="mw-page-title-main">DNA sequencer</span> A scientific instrument used to automate the DNA sequencing process

A DNA sequencer is a scientific instrument used to automate the DNA sequencing process. Given a sample of DNA, a DNA sequencer is used to determine the order of the four bases: G (guanine), C (cytosine), A (adenine) and T (thymine). This is then reported as a text string, called a read. Some DNA sequencers can be also considered optical instruments as they analyze light signals originating from fluorochromes attached to nucleotides.

The National Human Genome Research Institute (NHGRI) is an institute of the National Institutes of Health, located in Bethesda, Maryland.

<span class="mw-page-title-main">Jim Kent</span> American research scientist and computer programmer

William James Kent is an American research scientist and computer programmer. He has been a contributor to genome database projects and the 2003 winner of the Benjamin Franklin Award.

<span class="mw-page-title-main">George Church (geneticist)</span> American geneticist

George McDonald Church is an American geneticist, molecular engineer, chemist, serial entrepreneur, and pioneer in personal genomics and synthetic biology. He is the Robert Winthrop Professor of Genetics at Harvard Medical School, Professor of Health Sciences and Technology at Harvard University and Massachusetts Institute of Technology, and a founding member of the Wyss Institute for Biologically Inspired Engineering at Harvard University.

The Baskin School of Engineering, known simply as Baskin Engineering, is the school of engineering at the University of California, Santa Cruz. It consists of six departments: Applied Mathematics, Biomolecular Engineering, Computational Media, Computer Science and Engineering, Electrical and Computer Engineering, and Statistics.

<span class="mw-page-title-main">Human Genome Project</span> Human genome sequencing programme

The Human Genome Project (HGP) was an international scientific research project with the goal of determining the base pairs that make up human DNA, and of identifying, mapping and sequencing all of the genes of the human genome from both a physical and a functional standpoint. It started in 1990 and was completed in 2003. It remains the world's largest collaborative biological project. Planning for the project started after it was adopted in 1984 by the US government, and it officially launched in 1990. It was declared complete on April 14, 2003, and included about 92% of the genome. Level "complete genome" was achieved in May 2021, with only 0.3% of the bases covered by potential issues. The final gapless assembly was finished in January 2022.

<span class="mw-page-title-main">David Haussler</span> American bioinformatician

David Haussler is an American bioinformatician known for his work leading the team that assembled the first human genome sequence in the race to complete the Human Genome Project and subsequently for comparative genome analysis that deepens understanding the molecular function and evolution of the genome.

<span class="mw-page-title-main">Whole genome sequencing</span> Determining nearly the entirety of the DNA sequence of an organisms genome at a single time

Whole genome sequencing (WGS), also known as full genome sequencing, complete genome sequencing, or entire genome sequencing, is the process of determining the entirety, or nearly the entirety, of the DNA sequence of an organism's genome at a single time. This entails sequencing all of an organism's chromosomal DNA as well as DNA contained in the mitochondria and, for plants, in the chloroplast.

<span class="mw-page-title-main">Pan-genome</span> All genes of all strains in a clade

In the fields of molecular biology and genetics, a pan-genome is the entire set of genes from all strains within a clade. More generally, it is the union of all the genomes of a clade. The pan-genome can be broken down into a "core pangenome" that contains genes present in all individuals, a "shell pangenome" that contains genes present in two or more strains, and a "cloud pangenome" that contains genes only found in a single strain. Some authors also refer to the cloud genome as "accessory genome" containing 'dispensable' genes present in a subset of the strains and strain-specific genes. Note that the use of the term 'dispensable' has been questioned, at least in plant genomes, as accessory genes play "an important role in genome evolution and in the complex interplay between the genome and the environment". The field of study of pangenomes is called pangenomics.

The UCSC Genome Browser is an online and downloadable genome browser hosted by the University of California, Santa Cruz (UCSC). It is an interactive website offering access to genome sequence data from a variety of vertebrate and invertebrate species and major model organisms, integrated with a large collection of aligned annotations. The Browser is a graphical viewer optimized to support fast interactive performance and is an open-source, web-based tool suite built on top of a MySQL database for rapid visualization, examination, and querying of the data at many levels. The Genome Browser Database, browsing tools, downloadable data files, and documentation can all be found on the UCSC Genome Bioinformatics website.

<span class="mw-page-title-main">Reference genome</span> Digital nucleic acid sequence database

A reference genome is a digital nucleic acid sequence database, assembled by scientists as a representative example of the set of genes in one idealized individual organism of a species. As they are assembled from the sequencing of DNA from a number of individual donors, reference genomes do not accurately represent the set of genes of any single individual organism. Instead, a reference provides a haploid mosaic of different DNA sequences from each donor. For example, one of the most recent human reference genomes, assembly GRCh38/hg38, is derived from >60 genomic clone libraries. There are reference genomes for multiple species of viruses, bacteria, fungus, plants, and animals. Reference genomes are typically used as a guide on which new genomes are built, enabling them to be assembled much more quickly and cheaply than the initial Human Genome Project. Reference genomes can be accessed online at several locations, using dedicated browsers such as Ensembl or UCSC Genome Browser.

Third-generation sequencing is a class of DNA sequencing methods which produce longer sequence reads, under active development since 2008.

<span class="mw-page-title-main">Angela N. Brooks</span> American biologist and geneticist

Angela Brooks is an Assistant Professor of Biomolecular Engineering at University of California, Santa Cruz. She is a member of the Genomics Institute.

Medical genetic ethics is a field in which the ethics of medical genetics is evaluated. Like the other field of medicine, medical genetics also face ethical issues.

The UC Santa Cruz Genomics Institute is a public research institution based in the Jack Baskin School of Engineering at the University of California, Santa Cruz. The Genomics Institute's scientists and engineers work on a variety of projects related to genome sequencing, computational biology, large data analytics, and data sharing. The institute also maintains a number of software tools used by researchers worldwide, including the UCSC Genome Browser, Dockstore, and the Xena Browser.

The Vertebrate Genomes Project (VGP) is a project which aims to generate high-quality, complete reference genomes of all 66,000 vertebrate species. It is an international cooperation project with members from more than 50 separate institutions and was launched in February 2017.

Circular consensus sequencing (CCS) is a DNA sequencing method that is used in conjunction with single-molecule real-time sequencing to yield highly accurate long-read sequencing datasets with read lengths averaging 15–25 kb with median accuracy greater than 99.9%. These long reads, which are created via the formation of consensus sequencing obtained from multiple passes on a single DNA molecule, can be used to improve results for complex applications such as single nucleotide and structural variant detection, genome assembly, assembly of difficult polyploid or highly repetitive genomes, and assembly of metagenomes.

<span class="mw-page-title-main">Human Pangenome Reference</span>

The Human Pangenome Reference is a collection of genomes from a diverse cohort of individuals compiled by the Human Pangenome Reference Consortium (HPRC). This first draft pangenome comprises 47 phased, diploid assemblies from a diverse cohort of individuals and was intended to capture the genetic diversity of the human population. The development of this pangenome seeks to address perceived shortcomings in the current human reference genome by offering a more comprehensive and inclusive resource for genomic research and analysis.

References

  1. "Researchers generate the first complete, gapless sequence of a human genome". Genome.gov. Retrieved 2023-02-07.
  2. Doudna, Jennifer (May 23, 2022). "The 100 Most Influential People of 2022: Michael Schatz, Karen Miga, Evan Eichler, and Adam Phillippy". Time. Retrieved 2023-10-18.
  3. "Speaker - Cell Symposia: 20 Years of the Human Genome: from Sequence to Substance". www.cell-symposia.com. Retrieved 2020-12-20.
  4. 1 2 Isenberg, Sara (2019-10-23). "Are we there yet? Karen Miga on the Next Era of Genomics". Santa Cruz Tech Beat. Retrieved 2020-12-20.
  5. 1 2 3 4 5 "Karen Miga | VIB Conferences". www.vibconferences.be. Retrieved 2020-12-20.
  6. 1 2 "Scientists achieve first complete assembly of human X chromosome". EurekAlert!. Retrieved 2020-12-20.
  7. "Dawn of good, fast and cheap human genome assembly". ScienceDaily. Retrieved 2020-12-20.
  8. "Karen Miga discusses the future of equitable genomics research with Clinton, Bono" . news.ucsc.edu. Retrieved 2023-11-1
  9. SEARLE SCHOLARS PROGRAM NAMES 15 SCIENTISTS AS SEARLE SCHOLARS FOR 2023. searlescholars.org. Retrieved 2023-11-1.