Drosophila Genetic Reference Panel (DGRP) is a suite of Drosophila melanogaster lines derived from an out-crossed population in Raleigh, North Carolina. The founders of these lineages were collected from the Raleigh State Farmer's Market [1] 35°45′51″N78°39′47″W / 35.764254°N 78.662935°W . The suite consists of 205 fully sequenced lines which have been inbred to near homozygosity. The primary goal of the DGRP is to provide a common set of strain for quantitative genetics research in Drosophila . Each researcher who uses the lines from the DGRP will have access to other researchers' data, which will be stored in a publicly available database. This allows for analyses to be performed across studies without having to worry about complications arising from different labs using genomically different lines of fruit flies. [2] [3]
These lines are useful for performing QTL maps, as every line is fully sequenced. This allows for association mapping to be performed, which looks for genomic regions that are correlated to a phenotype. As labs produce QTL maps a comprehensive picture of the Drosophila genome will emerge with unprecedented resolution. Currently, dozens of quantitative traits are being examined by researchers, including longevity, phototaxis, mating behavior, wing morphology and oviposition site preference.
The pilot proof of concept and development of the DGRP came from the Trudy Mackay lab in Raleigh, North Carolina. The sequencing was done by Baylor College of Medicine Sequencing Center, in collaboration with Richard Gibbs and Stephen Richards.
The preliminary sequence data can be obtained from a public database hosted by Baylor College of Medicine in the Texas Medical Center. [4] Raw sequencing data from the project can be found in the NCBI Sequence Read Archive. [5]
Preliminary data has been presented at Genetics Society of America in Boston, Massachusetts. The study investigates the sleep behavior of Drosophila to uncover the genes that are responsible. The data suggest that at least 998 genes are responsible for some of the measurable variation found, including candidate genes CrebB-17A , rutabaga , Shaker and the gene encoding the epidermal growth factor receptor have all been implicated in other studies as being involved in sleep behavior. [6]
Researchers have also uncovered a genotype by diet interaction that drives phenotypic variation. Recently published data indicates that the interaction between a fly's genome and its environment plays a substantial role in determining the phenotype. This suggests that some individuals are obese on a high-fat diet, but can retain a slimmer phenotype on a high-sugar diet. [7]
The genotype of an organism is its complete set of genetic material. Genotype can also be used to refer to the alleles or variants an individual carries in a particular gene or genetic location. The number of alleles an individual can have in a specific gene depends on the number of copies of each chromosome found in that species, also referred to as ploidy. In diploid species like humans, two full sets of chromosomes are present, meaning each individual has two alleles for any given gene. If both alleles are the same, the genotype is referred to as homozygous. If the alleles are different, the genotype is referred to as heterozygous.
The human genome is a complete set of nucleic acid sequences for humans, encoded as the DNA within each of the 24 distinct chromosomes in the cell nucleus. A small DNA molecule is found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA sequences and various types of DNA that does not encode proteins. The latter is a diverse category that includes DNA coding for non-translated RNA, such as that for ribosomal RNA, transfer RNA, ribozymes, small nuclear RNAs, and several types of regulatory RNAs. It also includes promoters and their associated gene-regulatory elements, DNA playing structural and replicatory roles, such as scaffolding regions, telomeres, centromeres, and origins of replication, plus large numbers of transposable elements, inserted viral DNA, non-functional pseudogenes and simple, highly repetitive sequences. Introns make up a large percentage of non-coding DNA. Some of this non-coding DNA is non-functional junk DNA, such as pseudogenes, but there is no firm consensus on the total amount of junk DNA.
In genetics and bioinformatics, a single-nucleotide polymorphism is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in a sufficiently large fraction of the population, many publications do not apply such a frequency threshold.
Mosaicism or genetic mosaicism is a condition in which a multicellular organism possesses more than one genetic line as the result of genetic mutation. This means that various genetic lines resulted from a single fertilized egg. Mosaicism is one of several possible causes of chimerism, wherein a single organism is composed of cells with more than one distinct genotype.
A quantitative trait locus (QTL) is a locus that correlates with variation of a quantitative trait in the phenotype of a population of organisms. QTLs are mapped by identifying which molecular markers correlate with an observed trait. This is often an early step in identifying the actual genes that cause the trait variation.
Genetic architecture is the underlying genetic basis of a phenotypic trait and its variational properties. Phenotypic variation for quantitative traits is, at the most basic level, the result of the segregation of alleles at quantitative trait loci (QTL). Environmental factors and other external influences can also play a role in phenotypic variation. Genetic architecture is a broad term that can be described for any given individual based on information regarding gene and allele number, the distribution of allelic and mutational effects, and patterns of pleiotropy, dominance, and epistasis.
Michael Ashburner was an English biologist and Professor in the Department of Genetics at University of Cambridge. He was also the former joint-head and co-founder of the European Bioinformatics Institute (EBI) of the European Molecular Biology Laboratory (EMBL) and a Fellow of Churchill College, Cambridge.
A polygene is a member of a group of non-epistatic genes that interact additively to influence a phenotypic trait, thus contributing to multiple-gene inheritance, a type of non-Mendelian inheritance, as opposed to single-gene inheritance, which is the core notion of Mendelian inheritance. The term "monozygous" is usually used to refer to a hypothetical gene as it is often difficult to distinguish the effect of an individual gene from the effects of other genes and the environment on a particular phenotype. Advances in statistical methodology and high throughput sequencing are, however, allowing researchers to locate candidate genes for the trait. In the case that such a gene is identified, it is referred to as a quantitative trait locus (QTL). These genes are generally pleiotropic as well. The genes that contribute to type 2 diabetes are thought to be mostly polygenes. In July 2016, scientists reported identifying a set of 355 genes from the last universal common ancestor (LUCA) of all organisms living on Earth.
The Rat Genome Database (RGD) is a database of rat genomics, genetics, physiology and functional data, as well as data for comparative genomics between rat, human and mouse. RGD is responsible for attaching biological information to the rat genome via structured vocabulary, or ontology, annotations assigned to genes and quantitative trait loci (QTL), and for consolidating rat strain data and making it available to the research community. They are also developing a suite of tools for mining and analyzing genomic, physiologic and functional data for the rat, and comparative data for rat, mouse, human, and five other species.
A phene is an individual genetically determined characteristic or trait which can be possessed by an organism, such as eye colour, height, behavior, tooth shape or any other observable characteristic.
Paralytic is a gene in the fruit fly, Drosophila melanogaster, which encodes a voltage gated sodium channel within D. melanogaster neurons. This gene is essential for locomotive activity in the fly. There are 9 different para alleles, composed of a minimum of 26 exons within over 78kb of genomic DNA. The para gene undergoes alternative splicing to produce subtypes of the channel protein. Flies with mutant forms of paralytic are used in fly models of seizures, since seizures can be easily induced in these flies.
Population genomics is the large-scale comparison of DNA sequences of populations. Population genomics is a neologism that is associated with population genetics. Population genomics studies genome-wide effects to improve our understanding of microevolution so that we may learn the phylogenetic history and demography of a population.
Expression quantitative trait loci (eQTLs) are genomic loci that explain variation in expression levels of mRNAs.
Transposons are semi-parasitic DNA sequences which can replicate and spread through the host's genome. They can be harnessed as a genetic tool for analysis of gene and protein function. The use of transposons is well-developed in Drosophila and in Thale cress and bacteria such as Escherichia coli.
Nested association mapping (NAM) is a technique designed by the labs of Edward Buckler, James Holland, and Michael McMullen for identifying and dissecting the genetic architecture of complex traits in corn. It is important to note that nested association mapping is a specific technique that cannot be performed outside of a specifically designed population such as the Maize NAM population, the details of which are described below.
GeneNetwork is a combined database and open-source bioinformatics data analysis software resource for systems genetics. This resource is used to study gene regulatory networks that link DNA sequence differences to corresponding differences in gene and protein expression and to variation in traits such as health and disease risk. Data sets in GeneNetwork are typically made up of large collections of genotypes and phenotypes from groups of individuals, including humans, strains of mice and rats, and organisms as diverse as Drosophila melanogaster, Arabidopsis thaliana, and barley. The inclusion of genotypes makes it practical to carry out web-based gene mapping to discover those regions of genomes that contribute to differences among individuals in mRNA, protein, and metabolite levels, as well as differences in cell function, anatomy, physiology, and behavior.
A recombinant inbred strain or recombinant inbred line (RIL) is an organism with chromosomes that incorporate an essentially permanent set of recombination events between chromosomes inherited from two or more inbred strains. F1 and F2 generations are produced by intercrossing the inbred strains; pairs of the F2 progeny are then mated to establish inbred strains through long-term inbreeding.
Abraham Bentsionovich Korol is a professor in the Institute of Evolution at the University of Haifa. He is a prominent Israeli geneticist and evolutionary biologist known for his work on the evolution of sex and recombination, genome mapping and the genetics of complex traits. Korol was born in Bendery city, Moldavia, then part of the Soviet Union, and immigrated to Israel in 1991. Before immigrating to Israel, Korol was appointed in 1981 as a senior researcher and was awarded the degree of Doctor of Science by the Presidium of Academy of Science USSR in 1988, and became a full professor in 1991. After immigrating to Israel in 1991, Korol has established and headed the Laboratory of Population Genetics and Computational Biology in the Institute of Evolution at the University of Haifa. He became full professor there in 1996 and served as the director of the Institute of Evolution between 2008 and 2013. Since 1994, Korol has filled many scholarly positions including member of the steering committee of Israeli Gene Bank; member of the Human Genome Organization; member of the European Society of Evolutionary Biology; a member of the Coordinating Committee of the International Wheat Genome Sequencing Consortium; member of the Infrastructure Steering Committee of the Israeli Ministry of Science; representative of Haifa University in the Kamea program steering committee ; member of the Advisory Committee of Absorption in Science of the Israeli Ministry of Absorption.
Complex traits are phenotypes that are controlled by two or more genes and do not follow Mendel's Law of Dominance. They may have a range of expression which is typically continuous. Both environmental and genetic factors often impact the variation in expression. Human height is a continuous trait meaning that there is a wide range of heights. There are an estimated 50 genes that affect the height of a human. Environmental factors, like nutrition, also play a role in a human's height. Other examples of complex traits include: crop yield, plant color, and many diseases including diabetes and Parkinson's disease. One major goal of genetic research today is to better understand the molecular mechanisms through which genetic variants act to influence complex traits. Complex traits are also known as polygenic traits and multigenic traits.
A mutation accumulation (MA) experiment is a genetic experiment in which isolated and inbred lines of organisms are maintained such that the effect of natural selection is minimized, with the aim of quantitatively estimating the rates at which spontaneous mutations occur in the studied organism. Spontaneous mutation rates may be directly estimated using molecular techniques such as DNA sequencing, or indirectly estimated using phenotypic assays.