Horse genome

Last updated
Twilight, the Thoroughbred mare who was the first horse to have its genome fully sequenced Twilight20008-300.jpg
Twilight, the Thoroughbred mare who was the first horse to have its genome fully sequenced

The horse genome was first sequenced in 2006. The Horse Genome Project mapped 2.7 billion DNA base pairs, [1] and released the full map in 2009. [2] The horse genome is larger than the dog genome, but smaller than the human genome or the bovine genome. [2] It encompasses 31 pairs of autosomes and one sex chromosome pair. [3]

As horses share over 90 hereditary diseases similar to those found in humans, the sequencing of the horse genome has potential applications to both equine and human health. [2] Further, nearly half of the chromosomes in the horse genome show conserved synteny with a human chromosome, far more than between dogs and humans. [2] This is a high degree of conserved synteny and may help researchers use insights from one species to illuminate the other. Mapping the horse genome may also assist in the development of expression arrays to improve treatment of equine lameness, lung disease, reproduction, and immunology. [1] Research also has provided new insights to the development of centromeres. [2]

The $15 million project was funded by National Human Genome Research Institute (NHGRI) of the National Institutes of Health (NIH). [4] Additional funding came from the Dorothy Russell Havemeyer Foundation, the Volkswagen Foundation, the Morris Animal Foundation and the Programmi di Ricerca Scientifica di Rilevante Interesse Nazionale. [2]

Researchers on the project included Kerstin Lindblad-Toh at the Eli and Edythe L. Broad Institute of the Massachusetts Institute of Technology and Harvard University, Ottmar Distl and Tosso Leeb from the University of Veterinary Medicine, in Hanover, Germany and Helmut Blöcker from the Helmholtz Centre for Infection Research in Braunschweig, Germany, and Doug Antczak of Cornell University. [4]

The first horse to have its genome fully sequenced, in 20062007, was a Thoroughbred mare named Twilight, donated by Cornell University. Other breeds used to contribute to the initial map of horse genetic variation included the Akhal-Teke, Andalusian, Arabian, Icelandic, American Quarter Horse, Standardbred, [4] Belgian, Hanoverian, Hokkaido and Fjord horse. [2] This allowed creation of a catalogue of one million single nucleotide polymorphisms (SNPs) to compare genetic variation within and between different breeds. [2]

Next generation sequencing example Mapping Reads.png
Next generation sequencing example

In 2012, a second horse was fully sequenced at Texas A&M University, an 18-year-old Quarter Horse mare named Sugar. Sugar's genome, sequenced with newer techniques, had 3 million genetic variants from Twilight's, notably in genes governing sensory perception, signal transduction, and immunity. Researchers are in the process of sequencing the genome of seven additional horses. One stated goal of additional sequencing is to better understand the genetic basis of disease and of particular traits distinguishing individual horses and breeds in order to better predict and manage health care of horses. [5]

One result of the mapping of the horse genome was locating the mutation that creates the Leopard complex (Lp) spotting pattern seen in breeds such as the Appaloosa. [2] Horses homozygous for the Lp gene are also at risk for congenital stationary night blindness (CSNB). [6] Studies in 2008 and 2010 indicated that both CSNB and leopard complex spotting patterns are linked to TRPM1. [7] [8] As this disorder also afflicts humans, a researcher and lead author from the Broad Institute stated, "This demonstrates the utility of the horse for disease gene mapping." [2]

In 2012, researchers at the University of Copenhagen used next-generation sequencing to sequence four modern domesticated horses of different breeds, a Przewalski's horse, and a donkey, comparing these to DNA from three fossil horses dated between 13,000 and 50,000 years ago. [9] As the horse was only domesticated about 40003500 BCE, [10] this research was stated to "identify the starting point for horse selection and the raw genetic material our ancestors had available." [9]

See also

Related Research Articles

<span class="mw-page-title-main">Human genome</span> Complete set of nucleic acid sequences for humans

The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA sequences and various types of DNA that does not encode proteins. The latter is a diverse category that includes DNA coding for non-translated RNA, such as that for ribosomal RNA, transfer RNA, ribozymes, small nuclear RNAs, and several types of regulatory RNAs. It also includes promoters and their associated gene-regulatory elements, DNA playing structural and replicatory roles, such as scaffolding regions, telomeres, centromeres, and origins of replication, plus large numbers of transposable elements, inserted viral DNA, non-functional pseudogenes and simple, highly repetitive sequences. Introns make up a large percentage of non-coding DNA. Some of this non-coding DNA is non-functional junk DNA, such as pseudogenes, but there is no firm consensus on the total amount of junk DNA.

<span class="mw-page-title-main">Genomics</span> Discipline in genetics

Genomics is an interdisciplinary field of biology focusing on the structure, function, evolution, mapping, and editing of genomes. A genome is an organism's complete set of DNA, including all of its genes as well as its hierarchical, three-dimensional structural configuration. In contrast to genetics, which refers to the study of individual genes and their roles in inheritance, genomics aims at the collective characterization and quantification of all of an organism's genes, their interrelations and influence on the organism. Genes may direct the production of proteins with the assistance of enzymes and messenger molecules. In turn, proteins make up body structures such as organs and tissues as well as control chemical reactions and carry signals between cells. Genomics also involves the sequencing and analysis of genomes through uses of high throughput DNA sequencing and bioinformatics to assemble and analyze the function and structure of entire genomes. Advances in genomics have triggered a revolution in discovery-based research and systems biology to facilitate understanding of even the most complex biological systems such as the brain.

<span class="mw-page-title-main">Appaloosa</span> American horse breed noted for spotted color pattern

The Appaloosa is an American horse breed best known for its colorful spotted coat pattern. There is a wide range of body types within the breed, stemming from the influence of multiple breeds of horses throughout its history. Each horse's color pattern is genetically the result of various spotting patterns overlaid on top of one of several recognized base coat colors. The color pattern of the Appaloosa is of interest to those who study equine coat color genetics, as it and several other physical characteristics are linked to the leopard complex mutation (LP). Appaloosas are prone to develop equine recurrent uveitis and congenital stationary night blindness; the latter has been linked to the leopard complex.

<span class="mw-page-title-main">Roan (color)</span>

Roan is a coat color found in many animals, including horses, cattle, antelope, cat and dogs. It is defined generally as an even mixture of white and pigmented hairs that do not "gray out" or fade as the animal ages. There are a variety of genetic conditions which produce the colors described as "roan" in various species.

<span class="mw-page-title-main">Comparative genomics</span>

Comparative genomics is a field of biological research in which the genomic features of different organisms are compared. The genomic features may include the DNA sequence, genes, gene order, regulatory sequences, and other genomic structural landmarks. In this branch of genomics, whole or large parts of genomes resulting from genome projects are compared to study basic biological similarities and differences as well as evolutionary relationships between organisms. The major principle of comparative genomics is that common features of two organisms will often be encoded within the DNA that is evolutionarily conserved between them. Therefore, comparative genomic approaches start with making some form of alignment of genome sequences and looking for orthologous sequences in the aligned genomes and checking to what extent those sequences are conserved. Based on these, genome and molecular evolution are inferred and this may in turn be put in the context of, for example, phenotypic evolution or population genetics.

<span class="mw-page-title-main">Nyctalopia</span> Condition making it difficult or impossible to see in relatively low light

Nyctalopia, also called night-blindness, is a condition making it difficult or impossible to see in relatively low light. It is a symptom of several eye diseases. Night blindness may exist from birth, or be caused by injury or malnutrition. It can be described as insufficient adaptation to darkness.

<span class="mw-page-title-main">Synteny</span>

In genetics, the term synteny refers to two related concepts:

<span class="mw-page-title-main">Gene mapping</span> Process of locating specific genes

Gene mapping or genome mapping describes the methods used to identify the location of a gene on a chromosome and the distances between genes. Gene mapping can also describe the distances between different sites within a gene.

<span class="mw-page-title-main">Human Genome Project</span> Human genome sequencing programme

The Human Genome Project (HGP) was an international scientific research project with the goal of determining the base pairs that make up human DNA, and of identifying, mapping and sequencing all of the genes of the human genome from both a physical and a functional standpoint. It started in 1990 and was completed in 2003. It remains the world's largest collaborative biological project. Planning for the project started after it was adopted in 1984 by the US government, and it officially launched in 1990. It was declared complete on April 14, 2003, and included about 92% of the genome. Level "complete genome" was achieved in May 2021, with a remaining only 0.3% bases covered by potential issues. The final gapless assembly was finished in January 2022.

<span class="mw-page-title-main">Varnish roan</span>

Varnish roan describes a horse with coloration similar to roan, but with some changes in color over the years, though not to the extreme of a gray. This type of roaning only occurs in conjunction with the Leopard complex. Varnish roans are born with a dark base coat color, usually with some spotting. As the horse ages, white hairs increase over most of the body, and many spotted markings blur or fade. The varnish roan pattern often appears to spread from the white of any original markings. This color pattern is best known in the Appaloosa breed of horse.

<span class="mw-page-title-main">Congenital stationary night blindness</span> Medical condition

Congenital stationary night blindness (CSNB) is a rare non-progressive retinal disorder. People with CSNB often have difficulty adapting to low light situations due to impaired photoreceptor transmission. These patients may also have reduced visual acuity, myopia, nystagmus, and strabismus. CSNB has two forms -- complete, also known as type-1 (CSNB1), and incomplete, also known as type-2 (CSNB2), which are distinguished by the involvement of different retinal pathways. In CSNB1, downstream neurons called bipolar cells are unable to detect neurotransmission from photoreceptor cells. CSNB1 can be caused by mutations in various genes involved in neurotransmitter detection, including NYX. In CSNB2, the photoreceptors themselves have impaired neurotransmission function; this is caused primarily by mutations in the gene CACNA1F, which encodes a voltage-gated calcium channel important for neurotransmitter release. CSNB has been identified in horses and dogs as the result of mutations in TRPM1, GRM6, and LRIT3 .

The Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC) was established by Richard A. Gibbs in 1996 when Baylor College of Medicine was chosen as one of six worldwide sites to complete the final phase of the international Human Genome Project. Gibbs is the current director of the BCM-HGSC.

<span class="mw-page-title-main">Leopard complex</span> Coat pattern in horses

The leopard complex is a group of genetically related coat patterns in horses. These patterns range from progressive increases in interspersed white hair similar to graying or roan to distinctive, Dalmatian-like leopard spots on a white coat. Secondary characteristics associated with the leopard complex include a white sclera around the eye, striped hooves and mottled skin. The leopard complex gene is also linked to abnormalities in the eyes and vision. These patterns are most closely identified with the Appaloosa and Knabstrupper breeds, though its presence in breeds from Asia to western Europe has indicated that it is due to a very ancient mutation.

<span class="mw-page-title-main">Dominant white</span> Horse coat color and its genetics

Dominant white (W) is a group of genetically related coat color alleles on the KIT gene of the horse, best known for producing an all-white coat, but also able to produce various forms of white spotting, as well as bold white markings. Prior to the discovery of the W allelic series, many of these patterns were described by the term sabino, which is still used by some breed registries.

<span class="mw-page-title-main">Stephen J. O'Brien</span> American geneticist

Stephen J. O'Brien is an American geneticist. He is known for his research contributions in comparative genomics, virology, genetic epidemiology, mammalian systematics and species conservation. Member of the National Academy of Sciences and a Foreign Member of the Russian Academy of Sciences. Author or co-author of over 850 scientific articles and the editor of fourteen volumes.

<span class="mw-page-title-main">Roan (horse)</span> Horse coat color pattern characterized by an even mixture of colored and white hairs on the body

Roan is a horse coat color pattern characterized by an even mixture of colored and white hairs on the body, while the head and "points"—lower legs, mane, and tail—are mostly solid-colored. Horses with roan coats have white hairs evenly intermingled throughout any other color. The head, legs, mane, and tail have fewer scattered white hairs or none at all. The roan pattern is dominantly inherited, and is found in many horse breeds. While the specific mutation responsible for roan has not been exactly identified, a DNA test can determine zygosity for roan in several breeds. True roan is always present at birth, though it may be hard to see until after the foal coat sheds out. The coat may lighten or darken from winter to summer, but unlike the gray coat color, which also begins with intermixed white and colored hairs, roans do not become progressively lighter in color as they age. The silvering effect of mixed white and colored hairs can create coats that look bluish or pinkish.

<span class="mw-page-title-main">Scaffolding (bioinformatics)</span> Bioinformatics technique

Scaffolding is a technique used in bioinformatics. It is defined as follows:

Link together a non-contiguous series of genomic sequences into a scaffold, consisting of sequences separated by gaps of known length. The sequences that are linked are typically contiguous sequences corresponding to read overlaps.

Catherine Feuillet is a French geneticist who is currently the Chief Scientific Officer of Inari Agriculture, a Cambridge MA based biotechnology company. Feuillet earned a PhD in plant molecular biology on the isolation and characterization of genes involved in wood formation in eucalyptus trees. She started to work on the genetics of disease resistance in wheat in 1994 during her post-doctoral studies at the Swiss Federal Institute for Agroecology. She then moved as a junior group leader to the University of Zurich where she investigated the molecular basis of fungal disease resistance in wheat and in barley and cloned the first leaf rust resistance gene from wheat. In 2004 she was hired as a research director at the Institut National de la Recherche Agronomique (INRA) in France to lead European and international projects on wheat genomics.

<span class="mw-page-title-main">Ann T. Bowling</span> American geneticist (1943–2000)

Ann Trommershausen Bowling was an American scientist who was one of the world's leading geneticists in the study of horses, conducting research in the areas of molecular genetics and cytogenetics. She was a major figure in the development of testing to determine animal parentage, first with blood typing in the 1980s and then DNA testing in the 1990s. She later became known for her studies of hereditary diseases in horses and equine coat color genetics, as well as research on horse evolution and the development of horse breeds. She studied the population genetics of feral horses, did considerable work to help preserve the Przewalski's horse, and was one of the founding members of the international project to map the horse genome. She was an adjunct professor at the University of California, Davis (UCD), and at the time of her death in 2000 was the executive associate director of the Veterinary Genetics Laboratory (VGL) there. Her unexpected death on December 8, 2000, at age 57 was attributed to a massive stroke.

Naked foal syndrome (NFS) is a genetic disorder specific to the Akhal-Teke horse breed. A rare genodermatosis, it is characterized by almost complete hairlessness and mild ichthyosis. The condition is inherited as a monogenic autosomal recessive trait, and affected horses typically die between a few weeks and three years of age. The exact cause of death in NFS-affected horses is not clear.

References

  1. 1 2 "Sequenced horse genome expands understanding of equine, human diseases". Cornell University College of Veterinary Medicine. 2012-08-21. Retrieved 2013-04-01.
  2. 1 2 3 4 5 6 7 8 9 10 Wade, C.; et al. (2009-11-05). "Domestic Horse Genome Sequenced". Science . 326 (5954). ScienceDaily, LLC: 865–867. Bibcode:2009Sci...326..865W. doi:10.1126/science.1178158. PMC   3785132 . PMID   19892987 . Retrieved 2013-04-01.
  3. "Equus caballus (ID 145) - Genome - NCBI". National Center for Biotechnology Information. 2013-01-30. Retrieved 2013-04-01.
  4. 1 2 3 "2007 Release: Horse Genome Assembled". National Human Genome Research Institute. Retrieved 2013-04-01.
  5. "Quarter Horse Genome Sequenced". The Horse, online edition. Blood Horse Publications. 2012-02-17. Retrieved 2013-04-01.
  6. "Night Blindness in the Appaloosa (CSNB)". The Appaloosa Project. 15 November 2013. Retrieved 2017-09-01.
  7. Oke, Stacey (August 31, 2008). "Shedding Light on Night Blindness in Appaloosas" . The Horse. Retrieved February 7, 2009.
  8. Bellone, R.; Archer, S.; Wade, C. M.; Cuka-Lawson, C.; Haase, B.; Leeb, T.; Forsyth, G.; Sandmeyer, L.; Grahn, B. (December 2010). "Association analysis of candidate SNPs in TRPM1 with leopard complex spotting (LP) and congenital stationary night blindness (CSNB) in horses". Animal Genetics. 41 (s2): 207. doi:10.1111/j.1365-2052.2010.02119.x. S2CID   84404747.
  9. 1 2 "2012 Horse Genome Workshop a Success". The Horse, online edition. Blood Horse Publications. 2012-02-15. Retrieved 2013-04-03.
  10. Anthony, David W. (2007). The Horse, the Wheel, and Language: How Bronze Age Riders from the Eurasian Steppes Shaped the Modern World. Princeton, NJ: Princeton University Press. ISBN   978-0-691-05887-0.