Jim Kent

Last updated
Jim Kent
Jim Kent.JPG
Photo courtesy of Jim Kent.
Born (1960-02-10) February 10, 1960 (age 64)
NationalityAmerican
Alma mater University of California, Santa Cruz
Awards Overton Prize
Benjamin Franklin Award (Bioinformatics)
Scientific career
Thesis Patching and painting the working draft of the human genome  (2002)
Doctoral advisor Alan M Zahler
Other academic advisors David Haussler [1]
Website www.soe.ucsc.edu/~kent

William James Kent (born February 10, 1960) is an American research scientist and computer programmer. He has been a contributor to genome database projects and the 2003 winner of the Benjamin Franklin Award.

Contents

Early life

Kent was born in Hawaii and grew up in San Francisco, California, United States.

Computer animation

Kent began his programming career in 1983 with Island Graphics Inc. where he wrote the Aegis Animator program for the Amiga home computer. This program combined polygon tweening in 3D with simple 2D cel-based animation. In 1985 he founded and ran a software company, Dancing Flame, which adapted the Aegis Animator to the Atari ST, [2] and created Cyber Paint [3] for that machine. Cyber Paint was a 2D animation program that brought together a wide variety of animation and paint functionality and the delta-compressed animation format developed for CAD-3D. The user could move freely between animation frames and paint arbitrarily, or utilize various animation tools for automatic tweening movement across frames. Cyber Paint was one of the first, if not the first, consumer program that enabled the user to paint across time in a compressed digital video format. Later he developed a similar program, the Autodesk Animator for PC compatibles, where the image compression improved to the point it could play off of hard disk, and one could paint using "inks" that performed algorithmic transformations such as smoothing, transparency, and tiled patterns. The Autodesk Animator was used to create artwork for a wide variety of video games. [4]

Involvement with the Human Genome Project

In 2000, he wrote a program, GigAssembler, [5] that allowed the publicly funded Human Genome Project to assemble and publish the first human genome sequence. His efforts were motivated by the research needs of himself and his colleagues, but also out of concern that the data might be made proprietary via patents by Celera Genomics. [6] In their close race with Celera, Kent and the UCSC Professor David Haussler quickly built a modest cluster of 50 commodity personal computers running a Linux-based operating system to run the software. In contrast, Celera was using what was thought then to be one of the most powerful civilian supercomputers in the world. Kent's first assembly on the human genome was released on June 22. Celera finished its assembly three days later on June 25, and the dual results were announced at the White House on June 26. On July 7, 2000, the Santa Cruz data was made publicly available on the World Wide Web while the research paper describing this publicly funded genome was published in February 2001 special issue of Nature , [7] in parallel with Celera's results in the journal Science . [8] In 2002 Tim O'Reilly described Kent's work as "the most significant work of open source development in the past year". While all of Kent's genomics software is open source in the sense that the source code can be downloaded and read for free, and all of the software can be freely used for academic, nonprofit, and personal use, some of it requires a license, either from UCSC or from Kent Informatics Inc., for commercial use. [9]

After GigAssembler, Kent went on to write BLAT (BLAST-like alignment tool) [10] and the UCSC Genome Browser [11] to help analyze important genome data. Kent continues to work at UCSC primarily on web tools to help understand the human genome. He helps maintain and upgrade the browser, and has worked on comparative genomics, [12] Parasol, a job control management software for the UCSC kilocluster, and the ENCODE Project.

Related Research Articles

<span class="mw-page-title-main">Autodesk Animator</span>

Autodesk Animator is a 2D computer animation and painting program published in 1989 for MS-DOS. It was considered groundbreaking when initially released.

<span class="mw-page-title-main">Ensembl genome database project</span> Scientific project at the European Bioinformatics Institute

Ensembl genome database project is a scientific project at the European Bioinformatics Institute, which provides a centralized resource for geneticists, molecular biologists and other researchers studying the genomes of our own species and other vertebrates and model organisms. Ensembl is one of several well known genome browsers for the retrieval of genomic information.

The Baskin School of Engineering, known simply as Baskin Engineering, is the school of engineering at the University of California, Santa Cruz. It consists of six departments: Applied Mathematics, Biomolecular Engineering, Computational Media, Computer Science and Engineering, Electrical and Computer Engineering, and Statistics.

<span class="mw-page-title-main">Human Genome Project</span> Human genome sequencing programme

The Human Genome Project (HGP) was an international scientific research project with the goal of determining the base pairs that make up human DNA, and of identifying, mapping and sequencing all of the genes of the human genome from both a physical and a functional standpoint. It started in 1990 and was completed in 2003. It remains the world's largest collaborative biological project. Planning for the project started after it was adopted in 1984 by the US government, and it officially launched in 1990. It was declared complete on April 14, 2003, and included about 92% of the genome. Level "complete genome" was achieved in May 2021, with a remaining only 0.3% bases covered by potential issues. The final gapless assembly was finished in January 2022.

<span class="mw-page-title-main">David Haussler</span> American bioinformatician

David Haussler is an American bioinformatician known for his work leading the team that assembled the first human genome sequence in the race to complete the Human Genome Project and subsequently for comparative genome analysis that deepens understanding the molecular function and evolution of the genome.

BLAT is a pairwise sequence alignment algorithm that was developed by Jim Kent at the University of California Santa Cruz (UCSC) in the early 2000s to assist in the assembly and annotation of the human genome. It was designed primarily to decrease the time needed to align millions of mouse genomic reads and expressed sequence tags against the human genome sequence. The alignment tools of the time were not capable of performing these operations in a manner that would allow a regular update of the human genome assembly. Compared to pre-existing tools, BLAT was ~500 times faster with performing mRNA/DNA alignments and ~50 times faster with protein/protein alignments.

<span class="mw-page-title-main">DGLUCY</span> Protein-coding gene in the species Homo sapiens

DGLUCY is a protein that in humans is encoded by the DGLUCY gene.

The UCSC Genome Browser is an online and downloadable genome browser hosted by the University of California, Santa Cruz (UCSC). It is an interactive website offering access to genome sequence data from a variety of vertebrate and invertebrate species and major model organisms, integrated with a large collection of aligned annotations. The Browser is a graphical viewer optimized to support fast interactive performance and is an open-source, web-based tool suite built on top of a MySQL database for rapid visualization, examination, and querying of the data at many levels. The Genome Browser Database, browsing tools, downloadable data files, and documentation can all be found on the UCSC Genome Bioinformatics website.

SOAP is a suite of bioinformatics software tools from the BGI Bioinformatics department enabling the assembly, alignment, and analysis of next generation DNA sequencing data. It is particularly suited to short read sequencing data.

Adam C. Siepel is an American computational biologist known for his research in comparative genomics and population genetics, particularly the development of statistical methods and software tools for identifying evolutionarily conserved sequences. Siepel is currently Chair of the Simons Center for Quantitative Biology and Professor in the Watson School for Biological Sciences at Cold Spring Harbor Laboratory.

GenomeSpace is an environment for genomics software tools and applications. It helps users manage their analysis workflows involving multiple diverse tools, including web applications and desktop tools and facilitates the transfer of data between tools via automatic format conversion. Analyses can use data from local or cloud-based stores.

<span class="mw-page-title-main">Transmembrane Protein 205</span> Protein-coding gene in the species Homo sapiens

Transmembrane Protein 205 (TMEM205) is a protein encoded on chromosome 19 by the TMEM205 gene.

<span class="mw-page-title-main">Scaffolding (bioinformatics)</span> Bioinformatics technique

Scaffolding is a technique used in bioinformatics. It is defined as follows:

Link together a non-contiguous series of genomic sequences into a scaffold, consisting of sequences separated by gaps of known length. The sequences that are linked are typically contiguous sequences corresponding to read overlaps.

Mathieu Daniel Blanchette is a computational biologist and Director of the School of Computer Science at McGill University. His research focuses on developing new algorithms for the detection of functional regions in DNA sequences.

The BED format is a text file format used to store genomic regions as coordinates and associated annotations. The data are presented in the form of columns separated by spaces or tabs. This format was developed during the Human Genome Project and then adopted by other sequencing projects. As a result of this increasingly wide use, this format had already become a de facto standard in bioinformatics before a formal specification was written.

The UC Santa Cruz Genomics Institute is a public research institution based in the Jack Baskin School of Engineering at the University of California, Santa Cruz. The Genomics Institute's scientists and engineers work on a variety of projects related to genome sequencing, computational biology, large data analytics, and data sharing. The institute also maintains a number of software tools used by researchers worldwide, including the UCSC Genome Browser, Dockstore, and the Xena Browser.

Ann Strickler Zweig is a scientist at the University of California, Santa Cruz's Jack Baskin School of Engineering.

Katherine Snowden Pollard is the Director of the Gladstone Institute of Data Science and Biotechnology and a professor at the University of California, San Francisco (UCSF). She is a Chan Zuckerberg Biohub Investigator. She was awarded Fellowship of the International Society for Computational Biology in 2020 and the American Institute for Medical and Biological Engineering in 2021 for outstanding contributions to computational biology and bioinformatics.

Karen Elizabeth Hayden Miga is an American geneticist who co-leads the Telomere-to-Telomore (T2T) consortium that released fully complete assembly of the human genome in March 2022. She is an assistant professor of biomolecular engineering at the University of California, Santa Cruz and Associate Director of Human Pangenomics at the UC Santa Cruz Genomics Institute. She was named as "One to Watch" in the 2020 Nature's 10 and one of Time 100’s most influential people of 2022.

References

  1. Gitschier, J. (2013). "Life, the Universe, and Everything: An Interview with David Haussler". PLOS Genetics. 9 (1): e1003282. doi: 10.1371/journal.pgen.1003282 . PMC   3561096 . PMID   23382705.
  2. "Aegis Animator ST and Art Pak ST". Atarimagazines.com. Retrieved 2016-10-21.
  3. "Cyber Paint". Ataricq.org. 2009-05-23. Retrieved 2016-10-21.
  4. Breton Slivka (2009-05-08). "Busting at the Seams: Autodesk Animator". Bustingseams.blogspot.com. Retrieved 2016-10-21.
  5. Kent WJ, Haussler D (2001). "Assembly of the Working Draft of the Human Genome with GigAssembler". Genome Research. 11 (9): 1461–2. doi:10.1101/gr.183201. PMC   311095 . PMID   11544197.
  6. "Jim Kent, hero of free and open source software". Archived from the original on 2012-03-20. Retrieved 2010-09-19.
  7. International Human Genome Sequencing Consortium (2001). "Initial sequencing and analysis of the human genome" (PDF). Nature. 409 (6822): 860–921. Bibcode:2001Natur.409..860L. doi: 10.1038/35057062 . PMID   11237011.
  8. Venter, JC; et al. (2001). "The sequence of the human genome" (PDF). Science. 291 (5507): 1304–1351. Bibcode:2001Sci...291.1304V. doi: 10.1126/science.1058040 . PMID   11181995. S2CID   85981305.
  9. "Index of /~kent/src". Hgwdev.cse.ucsc.edu. Retrieved 2016-10-21.
  10. Kent, W. J. (2002). "BLAT---The BLAST-Like Alignment Tool". Genome Research. 12 (4): 656–664. doi:10.1101/gr.229202. PMC   187518 . PMID   11932250.
  11. Kent, W. J.; Sugnet, C. W.; Furey, T. S.; Roskin, K. M.; Pringle, T. H.; Zahler, A. M.; Haussler, A. D. (2002). "The Human Genome Browser at UCSC". Genome Research. 12 (6): 996–1006. doi:10.1101/gr.229102. PMC   186604 . PMID   12045153.
  12. Kent WJ, Baertsch R, Hinrichs A, Miller W, Haussler D (2003). "Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes". Proc Natl Acad Sci U S A. 100 (20): 11484–9. Bibcode:2003PNAS..10011484K. doi: 10.1073/pnas.1932072100 . PMC   208784 . PMID   14500911.