Sherman Weissman | |
---|---|
Alma mater | Harvard Medical School |
Known for | Sequencing the SV40 genome |
Scientific career | |
Fields | Genetics |
Institutions | Yale School of Medicine |
Sherman Morton Weissman is an American scientist and the Sterling Professor of Genetics at the Yale School of Medicine. A mentor to Francis Collins, Weissman elucidated the nucleic acid sequence of the SV40 genome.
Weissman was the son of a general practitioner. After attending Harvard Medical School, Weissman interned at Boston City Hospital and was a research fellow with the National Institutes of Health and the National Cancer Institute before taking a faculty position at Yale. [1]
Weissman mentored Francis Collins, the director of the NIH, during Collins's postdoctoral fellowship at Yale. [2] Collins called Weissman "the smartest guy" he has met [3] and credited Weissman with allowing him to establish autonomy as a researcher. In Weissman's lab, Collins developed the technique known as chromosome jumping. [4]
In 1978, Weissman published the complete nucleic acid sequence of the SV40 genome. A week later, Belgian researcher Walter Fiers published the genome sequence in another journal. Until 1+1⁄2 years earlier, the Weissman and Fiers teams had each been working on separate halves of the sequence. As technology allowed for faster sequencing, each team began to work toward sequencing the entire genome on its own. In the months before he came up with the published sequence, Weissman had to retract several "final" sequences once errors were discovered. [5] Weissman was elected to the National Academy of Sciences in 1983. [6]
Weissman's seven children include Jonathan Weissman, a scientist at the Massachusetts Institute of Technology. Jonathan's mother is Myrna Weissman, a professor of epidemiology in psychiatry at Columbia University. [7]
Bioinformatics is an interdisciplinary field of science that develops methods and software tools for understanding biological data, especially when the data sets are large and complex. Bioinformatics uses biology, chemistry, physics, computer science, computer programming, information engineering, mathematics and statistics to analyze and interpret biological data. The subsequent process of analyzing and interpreting data is referred to as computational biology.
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). If the sugar is ribose, the polymer is RNA; if the sugar is deoxyribose, a variant of ribose, the polymer is DNA.
The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA sequences and various types of DNA that does not encode proteins. The latter is a diverse category that includes DNA coding for non-translated RNA, such as that for ribosomal RNA, transfer RNA, ribozymes, small nuclear RNAs, and several types of regulatory RNAs. It also includes promoters and their associated gene-regulatory elements, DNA playing structural and replicatory roles, such as scaffolding regions, telomeres, centromeres, and origins of replication, plus large numbers of transposable elements, inserted viral DNA, non-functional pseudogenes and simple, highly repetitive sequences. Introns make up a large percentage of non-coding DNA. Some of this non-coding DNA is non-functional junk DNA, such as pseudogenes, but there is no firm consensus on the total amount of junk DNA.
Genomics is an interdisciplinary field of molecular biology focusing on the structure, function, evolution, mapping, and editing of genomes. A genome is an organism's complete set of DNA, including all of its genes as well as its hierarchical, three-dimensional structural configuration. In contrast to genetics, which refers to the study of individual genes and their roles in inheritance, genomics aims at the collective characterization and quantification of all of an organism's genes, their interrelations and influence on the organism. Genes may direct the production of proteins with the assistance of enzymes and messenger molecules. In turn, proteins make up body structures such as organs and tissues as well as control chemical reactions and carry signals between cells. Genomics also involves the sequencing and analysis of genomes through uses of high throughput DNA sequencing and bioinformatics to assemble and analyze the function and structure of entire genomes. Advances in genomics have triggered a revolution in discovery-based research and systems biology to facilitate understanding of even the most complex biological systems such as the brain.
Leroy "Lee" Edward Hood is an American biologist who has served on the faculties at the California Institute of Technology (Caltech) and the University of Washington. Hood has developed ground-breaking scientific instruments which made possible major advances in the biological sciences and the medical sciences. These include the first gas phase protein sequencer (1982), for determining the sequence of amino acids in a given protein; a DNA synthesizer (1983), to synthesize short sections of DNA; a peptide synthesizer (1984), to combine amino acids into longer peptides and short proteins; the first automated DNA sequencer (1986), to identify the order of nucleotides in DNA; ink-jet oligonucleotide technology for synthesizing DNA and nanostring technology for analyzing single molecules of DNA and RNA.
Marco A. Marra is a Distinguished Scientist and Director of Canada's Michael Smith Genome Sciences Centre at the BC Cancer Research Centre and Professor of Medical Genetics at the University of British Columbia (UBC). He also serves as UBC Canada Research Chair in Genome Science for the Canadian Institutes of Health Research and is an inductee in the Canadian Medical Hall of Fame. Marra has been instrumental in bringing genome science to Canada by demonstrating the pivotal role that genomics can play in human health and disease research.
Marshall Warren Nirenberg was an American biochemist and geneticist. He shared a Nobel Prize in Physiology or Medicine in 1968 with Har Gobind Khorana and Robert W. Holley for "breaking the genetic code" and describing how it operates in protein synthesis. In the same year, together with Har Gobind Khorana, he was awarded the Louisa Gross Horwitz Prize from Columbia University.
SV40 is an abbreviation for simian vacuolating virus 40 or simian virus 40, a polyomavirus that is found in both monkeys and humans. Like other polyomaviruses, SV40 is a DNA virus that sometimes causes tumors in animals, but most often persists as a latent infection. SV40 has been widely studied as a model eukaryotic virus, leading to many early discoveries in eukaryotic DNA replication and transcription.
DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, guanine, cytosine, and thymine. The advent of rapid DNA sequencing methods has greatly accelerated biological and medical research and discovery.
The National Human Genome Research Institute (NHGRI) is an institute of the National Institutes of Health, located in Bethesda, Maryland.
The Wellcome Sanger Institute, previously known as The Sanger Centre and Wellcome Trust Sanger Institute, is a non-profit British genomics and genetics research institute, primarily funded by the Wellcome Trust.
The Human Genome Project (HGP) was an international scientific research project with the goal of determining the base pairs that make up human DNA, and of identifying, mapping and sequencing all of the genes of the human genome from both a physical and a functional standpoint. It started in 1990 and was completed in 2003. It remains the world's largest collaborative biological project. Planning for the project started after it was adopted in 1984 by the US government, and it officially launched in 1990. It was declared complete on April 14, 2003, and included about 92% of the genome. Level "complete genome" was achieved in May 2021, with only 0.3% of the bases covered by potential issues. The final gapless assembly was finished in January 2022.
David J. Lipman is an American biologist who from 1989 to 2017 was the director of the National Center for Biotechnology Information (NCBI) at the National Institutes of Health. NCBI is the home of GenBank, the U.S. node of the International Sequence Database Consortium, and PubMed, one of the most heavily used sites in the world for the search and retrieval of biomedical information. Lipman is one of the original authors of the BLAST sequence alignment program, and a respected figure in bioinformatics. In 2017, he left NCBI and became Chief Science Officer at Impossible Foods.
Steven Lloyd Salzberg is an American computational biologist and computer scientist who is a Bloomberg Distinguished Professor of Biomedical Engineering, Computer Science, and Biostatistics at Johns Hopkins University, where he is also Director of the Center for Computational Biology.
Joachim Wilhelm "Jo" Messing was a German-American biologist who was a professor of molecular biology and the fourth director of the Waksman Institute of Microbiology at Rutgers University.
Maxine Frank Singer was an American molecular biologist and science administrator. She was known for her contributions to solving the genetic code, her role in the ethical and regulatory debates on recombinant DNA techniques, and her leadership of Carnegie Institution of Washington.
Francis Sellers Collins is an American physician-scientist who discovered the genes associated with a number of diseases and led the Human Genome Project. He served as director of the National Institutes of Health (NIH) in Bethesda, Maryland, from 17 August 2009 to 19 December 2021, serving under three presidents.
The history of genetics can be represented on a timeline of events from the earliest work in the 1850s, to the DNA era starting in the 1940s, and the genomics era beginning in the 1970s.
The European Nucleotide Archive (ENA) is a repository providing free and unrestricted access to annotated DNA and RNA sequences. It also stores complementary information such as experimental procedures, details of sequence assembly and other metadata related to sequencing projects. The archive is composed of three main databases: the Sequence Read Archive, the Trace Archive and the EMBL Nucleotide Sequence Database. The ENA is produced and maintained by the European Bioinformatics Institute and is a member of the International Nucleotide Sequence Database Collaboration (INSDC) along with the DNA Data Bank of Japan and GenBank.