Molecular breeding is the application of molecular biology tools, often in plant breeding [1] [2] and animal breeding. [3] [4] In the broad sense, molecular breeding can be defined as the use of genetic manipulation performed at the level of DNA to improve traits of interest in plants and animals, and it may also include genetic engineering or gene manipulation, molecular marker-assisted selection, and genomic selection. [5] More often, however, molecular breeding implies molecular marker-assisted breeding (MAB) and is defined as the application of molecular biotechnologies, specifically molecular markers, in combination with linkage maps and genomics, to alter and improve plant or animal traits on the basis of genotypic assays. [6]
The areas of molecular breeding include:
Methods in marker assisted breeding include:
The development of single-nucleotide polymorphism (SNP) markers has revolutionized the molecular breeding process because it makes dense marker coverage of the genome possible. Another developing area is genotyping by sequencing. [10]
Genetic engineering makes possible the horizontal transfer of genes from one organism to another. Plants can thus receive genes from humans, algae, or any other organism, which greatly widens the opportunities available in breeding crop plants.
Molecular breeding resources (including multiomics data) are available for:
Genetic linkage is the tendency of DNA sequences that are close together on a chromosome to be inherited together during the meiosis phase of sexual reproduction. Two genetic markers that are physically near to each other are unlikely to be separated onto different chromatids during chromosomal crossover, and are therefore said to be more linked than markers that are far apart. In other words, the nearer two genes are on a chromosome, the lower the chance of recombination between them, and the more likely they are to be inherited together. Markers on different chromosomes are perfectly unlinked, although the penetrance of potentially deleterious alleles may be influenced by the presence of other alleles, and these other alleles may be located on other chromosomes than that on which a particular potentially deleterious allele is located.
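The relationship between recombination and distance described above can be made concrete with a mapping function. The following is a minimal sketch using Haldane's mapping function, which assumes crossovers occur independently (no interference); the function names are illustrative and do not come from any particular genetics library.

```python
import math


def haldane_distance_cm(r: float) -> float:
    """Map distance in centimorgans for a recombination fraction r.

    Uses Haldane's mapping function, d = -0.5 * ln(1 - 2r) Morgans,
    which assumes no crossover interference.
    """
    if not 0.0 <= r < 0.5:
        raise ValueError("recombination fraction must be in [0, 0.5)")
    return -50.0 * math.log(1.0 - 2.0 * r)


def haldane_recombination(d_cm: float) -> float:
    """Inverse mapping: recombination fraction for a distance in centimorgans."""
    if d_cm < 0:
        raise ValueError("map distance must be non-negative")
    return 0.5 * (1.0 - math.exp(-2.0 * d_cm / 100.0))


if __name__ == "__main__":
    for r in (0.01, 0.10, 0.30, 0.45):
        print(f"r = {r:.2f} -> {haldane_distance_cm(r):6.1f} cM")
```

As the distance grows, the recombination fraction approaches 0.5, which is the value expected for markers on different chromosomes.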
A quantitative trait locus (QTL) is a locus that correlates with variation of a quantitative trait in the phenotype of a population of organisms. QTLs are mapped by identifying which molecular markers correlate with an observed trait. This is often an early step in identifying the actual genes that cause the trait variation.
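One simple way to test whether a marker correlates with a trait is a single-marker analysis, which compares the mean phenotype of individuals grouped by their genotype at that marker. The sketch below assumes a population with two homozygous genotype classes (as in doubled haploids or recombinant inbred lines) and uses a Welch-style t statistic; the data and the function name are hypothetical, chosen only for illustration.

```python
from math import sqrt
from statistics import mean, variance


def single_marker_test(genotypes, phenotypes):
    """Return (difference in means, t statistic) for a two-class marker.

    genotypes  -- genotype label per individual (exactly two distinct labels)
    phenotypes -- trait value per individual, in the same order
    """
    groups = {}
    for g, p in zip(genotypes, phenotypes):
        groups.setdefault(g, []).append(p)
    if len(groups) != 2:
        raise ValueError("expected exactly two genotype classes")
    # Sort labels so the sign of the difference is reproducible.
    (_, a), (_, b) = sorted(groups.items())
    diff = mean(b) - mean(a)
    # Standard error without assuming equal variances in the two classes.
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return diff, diff / se


# Hypothetical data: eight lines scored at one marker and for one trait.
genotypes = ["AA", "AA", "AA", "AA", "BB", "BB", "BB", "BB"]
phenotypes = [10.1, 9.8, 10.4, 9.9, 12.0, 12.3, 11.8, 12.1]
diff, t_stat = single_marker_test(genotypes, phenotypes)

if __name__ == "__main__":
    print(f"mean difference (BB - AA): {diff:.2f}, t = {t_stat:.2f}")
```

A large absolute t statistic suggests that the marker is linked to a locus affecting the trait. In practice the test is repeated across all markers with a correction for multiple testing, and interval-based methods are often preferred.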
Backcrossing is a crossing of a hybrid with one of its parents or an individual genetically similar to its parent, to achieve offspring with a genetic identity closer to that of the parent. It is used in horticulture, animal breeding, and production of gene knockout organisms.
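How quickly backcrossing restores the genetic identity of the recurrent parent can be estimated with simple arithmetic. The sketch below assumes no selection and ignores linkage drag, in which case the expected share of the recurrent parent's genome after n backcrosses is 1 - (1/2)^(n+1); the function name is illustrative.

```python
def recurrent_parent_fraction(n_backcrosses: int) -> float:
    """Expected fraction of the recurrent parent genome after n backcrosses.

    n = 0 corresponds to the F1 hybrid (one half from each parent).
    Assumes no selection and independent segregation of loci.
    """
    if n_backcrosses < 0:
        raise ValueError("number of backcrosses must be non-negative")
    return 1.0 - 0.5 ** (n_backcrosses + 1)


if __name__ == "__main__":
    for n in range(0, 7):
        label = "F1" if n == 0 else f"BC{n}"
        print(f"{label}: {recurrent_parent_fraction(n):.4f}")
```

These values are averages: individual progeny vary around them, which is why markers are often used to pick the backcross progeny that carry the most recurrent parent genome.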
Genetic architecture is the underlying genetic basis of a phenotypic trait and its variational properties. Phenotypic variation for quantitative traits is, at the most basic level, the result of the segregation of alleles at quantitative trait loci (QTL). Environmental factors and other external influences can also play a role in phenotypic variation. Genetic architecture is a broad term that can be described for any given individual based on information regarding gene and allele number, the distribution of allelic and mutational effects, and patterns of pleiotropy, dominance, and epistasis.
A polygene is a member of a group of non-epistatic genes that interact additively to influence a phenotypic trait, thus contributing to multiple-gene inheritance, a type of non-Mendelian inheritance, as opposed to single-gene inheritance, which is the core notion of Mendelian inheritance. The term "polygene" is usually used to refer to a hypothetical gene, as it is often difficult to distinguish the effect of an individual gene from the effects of other genes and the environment on a particular phenotype. Advances in statistical methodology and high-throughput sequencing are, however, allowing researchers to locate candidate genes for the trait. In the case that such a gene is identified, it is referred to as a quantitative trait locus (QTL). These genes are generally pleiotropic as well. The genes that contribute to type 2 diabetes are thought to be mostly polygenes. In July 2016, scientists reported identifying a set of 355 genes from the last universal common ancestor (LUCA) of all organisms living on Earth.
In molecular biology and other fields, a molecular marker is a molecule, sampled from some source, that gives information about its source. For example, DNA is a molecular marker that gives information about the organism from which it was taken. Similarly, some proteins can be molecular markers of Alzheimer's disease in the person from whom they are taken. Molecular markers may also be non-biological; such markers are often used in environmental studies.
Steven Dale Tanksley is the Chief Technology Officer of Nature Source Improved Plants. Prior to founding Nature Source Improved Plants, Tanksley served as the Liberty Hyde Bailey professor of plant breeding and biometry and chair of the Genomics Initiative Task Force at Cornell University College of Agriculture and Life Sciences. He is currently a Professor Emeritus at Cornell University.
Marker-assisted selection or marker-aided selection (MAS) is an indirect selection process in which a trait of interest is selected based on a marker linked to that trait, rather than on the trait itself. This process has been extensively researched and proposed for plant and animal breeding.
A doubled haploid (DH) is a genotype formed when haploid cells undergo chromosome doubling. Artificial production of doubled haploids is important in plant breeding.
In genetics, association mapping, also known as "linkage disequilibrium mapping", is a method of mapping quantitative trait loci (QTLs) that takes advantage of historic linkage disequilibrium to link phenotypes to genotypes, uncovering genetic associations.
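Linkage disequilibrium between two loci is commonly summarized by the coefficient D and the squared correlation r². The sketch below computes both from a list of phased two-locus haplotypes; the encoding (two-character strings, with "A"/"a" at the first locus and "B"/"b" at the second) and the function name are assumptions made for this illustration.

```python
def linkage_disequilibrium(haplotypes):
    """Return (D, r_squared) for two biallelic loci.

    haplotypes -- list of two-character strings such as "AB", "Ab", "aB", "ab"
    D   = p_AB - p_A * p_B
    r^2 = D^2 / (p_A * (1 - p_A) * p_B * (1 - p_B))
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("at least one haplotype is required")
    p_a = sum(h[0] == "A" for h in haplotypes) / n
    p_b = sum(h[1] == "B" for h in haplotypes) / n
    p_ab = sum(h == "AB" for h in haplotypes) / n
    d = p_ab - p_a * p_b
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both loci must be polymorphic")
    return d, d * d / denom


if __name__ == "__main__":
    # Hypothetical sample: alleles at the two loci always travel together.
    print(linkage_disequilibrium(["AB"] * 5 + ["ab"] * 5))
    # Hypothetical sample: alleles are combined independently.
    print(linkage_disequilibrium(["AB", "Ab", "aB", "ab"]))
```

An r² near 1 means that a marker is a good proxy for a nearby causal variant, which is the property association mapping relies on.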
Nested association mapping (NAM) is a technique designed by the labs of Edward Buckler, James Holland, and Michael McMullen for identifying and dissecting the genetic architecture of complex traits in corn. Nested association mapping is a specific technique that cannot be performed outside of a specifically designed population such as the Maize NAM population.
GeneNetwork is a combined database and open-source bioinformatics data analysis software resource for systems genetics. This resource is used to study gene regulatory networks that link DNA sequence differences to corresponding differences in gene and protein expression and to variation in traits such as health and disease risk. Data sets in GeneNetwork are typically made up of large collections of genotypes and phenotypes from groups of individuals, including humans, strains of mice and rats, and organisms as diverse as Drosophila melanogaster, Arabidopsis thaliana, and barley. The inclusion of genotypes makes it practical to carry out web-based gene mapping to discover those regions of genomes that contribute to differences among individuals in mRNA, protein, and metabolite levels, as well as differences in cell function, anatomy, physiology, and behavior.
A recombinant inbred strain or recombinant inbred line (RIL) is an organism with chromosomes that incorporate an essentially permanent set of recombination events between chromosomes inherited from two or more inbred strains. F1 and F2 generations are produced by intercrossing the inbred strains; pairs of the F2 progeny are then mated to establish inbred strains through long-term inbreeding.
Plant breeding is the science of changing the traits of plants in order to produce desired characteristics. It has been used to improve the quality of nutrition in products for humans and animals. The goals of plant breeding are to produce crop varieties that boast unique and superior traits for a variety of applications. The most frequently addressed agricultural traits are those related to biotic and abiotic stress tolerance, grain or biomass yield, end-use quality characteristics such as taste or the concentrations of specific biological molecules and ease of processing.
Quantitative trait loci mapping or QTL mapping is the process of identifying genomic regions that potentially contain genes responsible for important economic, health or environmental characters. Mapping QTLs is an important activity that plant breeders and geneticists routinely use to associate potential causal genes with phenotypes of interest. Family-based QTL mapping is a variant of QTL mapping in which multiple families are used.
Genomic selection (GS) predicts the breeding values of offspring in a population by associating their traits with their high-density genetic marker scores. GS was proposed to address deficiencies of marker-assisted selection (MAS) in breeding programs. It is itself a form of MAS, but differs in that it estimates the effects of all genetic markers or haplotypes across the entire genome simultaneously in order to calculate genomic estimated breeding values (GEBV). The potential of GS lies in capturing the genetic diversity of a breeding program through high-coverage, genome-wide markers and using the estimated effects of those markers to predict breeding values.
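Once marker effects have been estimated, the prediction step of genomic selection reduces to a weighted sum over markers. The sketch below covers only that step: it assumes the effects were already estimated on a reference population by some statistical model, and all numbers, candidate names, and function names are hypothetical.

```python
def gebv(genotype, marker_effects):
    """Genomic estimated breeding value as the sum of marker effects.

    genotype       -- allele counts (0, 1 or 2) at each marker
    marker_effects -- estimated additive effect per allele copy at each marker
    """
    if len(genotype) != len(marker_effects):
        raise ValueError("genotype and marker_effects must have equal length")
    return sum(x * b for x, b in zip(genotype, marker_effects))


# Hypothetical effects for four markers, assumed to have been estimated
# beforehand on a genotyped and phenotyped reference population.
marker_effects = [0.5, -0.2, 0.1, 0.3]

# Hypothetical selection candidates with genotypes but no phenotypes yet.
candidates = {
    "cand_1": [2, 0, 1, 1],
    "cand_2": [0, 2, 2, 0],
    "cand_3": [1, 1, 0, 2],
}

# Rank candidates from highest to lowest predicted breeding value.
ranked = sorted(
    candidates,
    key=lambda name: gebv(candidates[name], marker_effects),
    reverse=True,
)

if __name__ == "__main__":
    for name in ranked:
        print(name, round(gebv(candidates[name], marker_effects), 2))
```

Real programs use thousands of markers, and the quality of the ranking depends on how well the effects were estimated, which this sketch does not address.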
"Envirome" is a concept that relates the core of environmental conditions to the successful biological performance of living beings. The concept originated in genetic epidemiology, in which an envirome is defined as the total set of environmental factors, both present and past, that affect the state, and in particular the disease state, of an organism. The study of the envirome and its effects is termed enviromics. The term was coined in the field of psychiatric epidemiology by J.C. Anthony in 1995. More recently, use of the term has been extended to the cellular domain, where cell functional enviromics studies both the genome and envirome from a systems biology perspective. In plants, enviromics is directly related to complex ecophysiology, in which the wider environment of the plants can be dissected at an omics scale and understood as a mosaic of possible growing factors and the balance of diverse resources available. In ecology, this concept can be related to Shelford's law of tolerance. Enviromics is conceived as a pillar of modern plant breeding, capable of connecting the design and development of breeding goals with the agronomic targets of climate-smart agriculture. It can also bridge the knowledge gaps between the different levels of systems biology and phenomics in the context of gene–environment interaction.
Michael Edward "Mike" Goddard is a professorial fellow in animal genetics at the University of Melbourne, Australia.
Complex traits are phenotypes that are controlled by two or more genes and do not follow Mendel's Law of Dominance. They may have a range of expression, which is typically continuous. Both environmental and genetic factors often affect the variation in expression. Human height is a continuous trait, meaning that there is a wide range of heights. There are an estimated 50 genes that affect the height of a human. Environmental factors, such as nutrition, also play a role in a human's height. Other examples of complex traits include crop yield, plant color, and many diseases, including diabetes and Parkinson's disease. One major goal of genetic research today is to better understand the molecular mechanisms through which genetic variants act to influence complex traits. Complex traits are also known as polygenic traits and multigenic traits.
Rohan L. Fernando is a Sri Lankan American geneticist who is a professor of quantitative genetics in the Department of Animal Science at Iowa State University (ISU), US. Fernando's work has focused primarily on theory and methods for the use of genetic markers in breeding, theory and methods for genetic evaluations of crossbred animals, methodology for estimating genetic parameters and predicting genetic merit in populations undergoing selection and non-random mating, Bayesian methodology for the analysis of unbalanced mixed model data, optimization of breeding programs, and the use of computer simulation to study the dynamics of genetic systems.