Reverse ecology

Last updated

Reverse ecology refers to the use of genomics to study ecology with no a priori assumptions about the organism(s) under consideration. The term was suggested in 2007 by Matthew Rockman during a conference on ecological genomics in Christchurch, New Zealand. [1] Rockman was drawing an analogy to the term reverse genetics in which gene function is studied by comparing the phenotypic effects of different genetic sequences of that gene. Most researchers employing reverse ecology make use of some sort of population genomics methodology. This requires that a genome scan is performed on multiple individuals from at least two populations in order to identify genomic regions or sites that show signs of selection. These genome scans usually utilize single nucleotide polymorphism (SNP) markers, though use of microsatellites can work as well (with reduced resolution).

Contents

Methodology

Reverse ecology has been used by researchers to understand environments and other ecological traits of organisms on Earth using genomic approaches. By examining the genes of bacteria, scientists are able to reconstruct what the organisms' environments are like today, or even from millions of years ago. The data could help us understand key events in the history of life on Earth. In 2010, researchers presented a technique to carry out reverse ecology to infer a bacteria's living temperature-range conditions based on the GC content of certain genomic regions. [2]

In 2011, researchers at the University of California, Berkeley were able to demonstrate that one can determine an organism's adaptive traits by looking first at its genome and checking for variations across a population. [3]

See also

Related Research Articles

<span class="mw-page-title-main">Genome</span> All genetic material of an organism

In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA. The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as regulatory sequences, and often a substantial fraction of junk DNA with no evident function. Almost all eukaryotes have mitochondria and a small mitochondrial genome. Algae and plants also contain chloroplasts with a chloroplast genome.

<span class="mw-page-title-main">Mutation</span> Alteration in the nucleotide sequence of a genome

In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, mitosis, or meiosis or other types of damage to DNA, which then may undergo error-prone repair, cause an error during other forms of repair, or cause an error during replication. Mutations may also result from insertion or deletion of segments of DNA due to mobile genetic elements.

<span class="mw-page-title-main">Phenotype</span> Composite of the organisms observable characteristics or traits

In genetics, the phenotype is the set of observable characteristics or traits of an organism. The term covers the organism's morphology, its developmental processes, its biochemical and physiological properties, its behavior, and the products of behavior. An organism's phenotype results from two basic factors: the expression of an organism's genetic code and the influence of environmental factors. Both factors may interact, further affecting the phenotype. When two or more clearly different phenotypes exist in the same population of a species, the species is called polymorphic. A well-documented example of polymorphism is Labrador Retriever coloring; while the coat color depends on many genes, it is clearly seen in the environment as yellow, black, and brown. Richard Dawkins in 1978 and then again in his 1982 book The Extended Phenotype suggested that one can regard bird nests and other built structures such as caddisfly larva cases and beaver dams as "extended phenotypes".

<span class="mw-page-title-main">Thermophile</span> Organism that thrives at relatively high temperatures

A thermophile is an organism—a type of extremophile—that thrives at relatively high temperatures, between 41 and 122 °C. Many thermophiles are archaea, though some of them are bacteria and fungi. Thermophilic eubacteria are suggested to have been among the earliest bacteria.

Molecular evolution is the process of change in the sequence composition of cellular molecules such as DNA, RNA, and proteins across generations. The field of molecular evolution uses principles of evolutionary biology and population genetics to explain patterns in these changes. Major topics in molecular evolution concern the rates and impacts of single nucleotide changes, neutral evolution vs. natural selection, origins of new genes, the genetic nature of complex traits, the genetic basis of speciation, the evolution of development, and ways that evolutionary forces influence genomic and phenotypic changes.

A genetic screen or mutagenesis screen is an experimental technique used to identify and select individuals who possess a phenotype of interest in a mutagenized population. Hence a genetic screen is a type of phenotypic screen. Genetic screens can provide important information on gene function as well as the molecular events that underlie a biological process or pathway. While genome projects have identified an extensive inventory of genes in many different organisms, genetic screens can provide valuable insight as to how those genes function.

<span class="mw-page-title-main">Comparative genomics</span>

Comparative genomics is a field of biological research in which the genomic features of different organisms are compared. The genomic features may include the DNA sequence, genes, gene order, regulatory sequences, and other genomic structural landmarks. In this branch of genomics, whole or large parts of genomes resulting from genome projects are compared to study basic biological similarities and differences as well as evolutionary relationships between organisms. The major principle of comparative genomics is that common features of two organisms will often be encoded within the DNA that is evolutionarily conserved between them. Therefore, comparative genomic approaches start with making some form of alignment of genome sequences and looking for orthologous sequences in the aligned genomes and checking to what extent those sequences are conserved. Based on these, genome and molecular evolution are inferred and this may in turn be put in the context of, for example, phenotypic evolution or population genetics.

<span class="mw-page-title-main">Functional genomics</span> Field of molecular biology

Functional genomics is a field of molecular biology that attempts to describe gene functions and interactions. Functional genomics make use of the vast data generated by genomic and transcriptomic projects. Functional genomics focuses on the dynamic aspects such as gene transcription, translation, regulation of gene expression and protein–protein interactions, as opposed to the static aspects of the genomic information such as DNA sequence or structures. A key characteristic of functional genomics studies is their genome-wide approach to these questions, generally involving high-throughput methods rather than a more traditional "candidate-gene" approach.

<span class="mw-page-title-main">GC-content</span> Percentage of guanine and cytosine in DNA or RNA molecules

In molecular biology and genetics, GC-content is the percentage of nitrogenous bases in a DNA or RNA molecule that are either guanine (G) or cytosine (C). This measure indicates the proportion of G and C bases out of an implied four total bases, also including adenine and thymine in DNA and adenine and uracil in RNA.

<span class="mw-page-title-main">Functional ecology</span>

Functional ecology is a branch of ecology that focuses on the roles, or functions, that species play in the community or ecosystem in which they occur. In this approach, physiological, anatomical, and life history characteristics of the species are emphasized. The term "function" is used to emphasize certain physiological processes rather than discrete properties, describe an organism's role in a trophic system, or illustrate the effects of natural selective processes on an organism. This sub-discipline of ecology represents the crossroads between ecological patterns and the processes and mechanisms that underlie them. It focuses on traits represented in large number of species and can be measured in two ways – the first being screening, which involves measuring a trait across a number of species, and the second being empiricism, which provides quantitative relationships for the traits measured in screening. Functional ecology often emphasizes an integrative approach, using organism traits and activities to understand community dynamics and ecosystem processes, particularly in response to the rapid global changes occurring in earth's environment.

A copiotroph is an organism found in environments rich in nutrients, particularly carbon. They are the opposite to oligotrophs, which survive in much lower carbon concentrations.

<i>k</i>-mer Substrings of length k contained in a biological sequence

In bioinformatics, k-mers are substrings of length contained within a biological sequence. Primarily used within the context of computational genomics and sequence analysis, in which k-mers are composed of nucleotides, k-mers are capitalized upon to assemble DNA sequences, improve heterologous gene expression, identify species in metagenomic samples, and create attenuated vaccines. Usually, the term k-mer refers to all of a sequence's subsequences of length , such that the sequence AGAT would have four monomers, three 2-mers, two 3-mers and one 4-mer (AGAT). More generally, a sequence of length will have k-mers and total possible k-mers, where is number of possible monomers.

In genetics, an isochore is a large region of genomic DNA with a high degree of uniformity in GC content; that is, guanine (G) and cytosine (C) bases. The distribution of bases within a genome is non-random: different regions of the genome have different amounts of G-C base pairs, such that regions can be classified and identified by the proportion of G-C base pairs they contain.

<i>Roseobacter</i> Genus of bacteria

In taxonomy, Roseobacter is a genus of the Rhodobacteraceae. The Roseobacter clade falls within the {alpha}-3 subclass of the class Alphaproteobacteria. The first strain descriptions appeared in 1991 which described members Roseobacterlitoralis and Roseobacterdenitrificans, both pink-pigmented bacteriochlorophyll a-producing strains isolated from marine algae. The role members of the Roseobacter lineage play in marine biogeochemical cycles and climate change cannot be overestimated. Roseobacters make up 25% of coastal marine bacteria and members of this lineage process a significant portion of the total carbon in the marine environment. Roseobacter clade plays an important role in global carbon and sulphur cycles. It can also degrade aromatic compounds, uptake trace metal, and form symbiotic relationship. In term of its application, Roseobacter clade produces bioactive compounds, has been used widely in aquaculture and quorum sensing.

Population genomics is the large-scale comparison of DNA sequences of populations. Population genomics is a neologism that is associated with population genetics. Population genomics studies genome-wide effects to improve our understanding of microevolution so that we may learn the phylogenetic history and demography of a population.

Pathogenomics is a field which uses high-throughput screening technology and bioinformatics to study encoded microbe resistance, as well as virulence factors (VFs), which enable a microorganism to infect a host and possibly cause disease. This includes studying genomes of pathogens which cannot be cultured outside of a host. In the past, researchers and medical professionals found it difficult to study and understand pathogenic traits of infectious organisms. With newer technology, pathogen genomes can be identified and sequenced in a much shorter time and at a lower cost, thus improving the ability to diagnose, treat, and even predict and prevent pathogenic infections and disease. It has also allowed researchers to better understand genome evolution events - gene loss, gain, duplication, rearrangement - and how those events impact pathogen resistance and ability to cause disease. This influx of information has created a need for bioinformatics tools and databases to analyze and make the vast amounts of data accessible to researchers, and it has raised ethical questions about the wisdom of reconstructing previously extinct and deadly pathogens in order to better understand virulence.

<span class="mw-page-title-main">Genome evolution</span> Process by which a genome changes in structure or size over time

Genome evolution is the process by which a genome changes in structure (sequence) or size over time. The study of genome evolution involves multiple fields such as structural analysis of the genome, the study of genomic parasites, gene and ancient genome duplications, polyploidy, and comparative genomics. Genome evolution is a constantly changing and evolving field due to the steadily growing number of sequenced genomes, both prokaryotic and eukaryotic, available to the scientific community and the public at large.

Bacterial genomes are generally smaller and less variant in size among species when compared with genomes of eukaryotes. Bacterial genomes can range in size anywhere from about 130 kbp to over 14 Mbp. A study that included, but was not limited to, 478 bacterial genomes, concluded that as genome size increases, the number of genes increases at a disproportionately slower rate in eukaryotes than in non-eukaryotes. Thus, the proportion of non-coding DNA goes up with genome size more quickly in non-bacteria than in bacteria. This is consistent with the fact that most eukaryotic nuclear DNA is non-gene coding, while the majority of prokaryotic, viral, and organellar genes are coding. Right now, we have genome sequences from 50 different bacterial phyla and 11 different archaeal phyla. Second-generation sequencing has yielded many draft genomes ; third-generation sequencing might eventually yield a complete genome in a few hours. The genome sequences reveal much diversity in bacteria. Analysis of over 2000 Escherichia coli genomes reveals an E. coli core genome of about 3100 gene families and a total of about 89,000 different gene families. Genome sequences show that parasitic bacteria have 500–1200 genes, free-living bacteria have 1500–7500 genes, and archaea have 1500–2700 genes. A striking discovery by Cole et al. described massive amounts of gene decay when comparing Leprosy bacillus to ancestral bacteria. Studies have since shown that several bacteria have smaller genome sizes than their ancestors did. Over the years, researchers have proposed several theories to explain the general trend of bacterial genome decay and the relatively small size of bacterial genomes. Compelling evidence indicates that the apparent degradation of bacterial genomes is owed to a deletional bias.

Genomic streamlining is a theory in evolutionary biology and microbial ecology that suggests that there is a reproductive benefit to prokaryotes having a smaller genome size with less non-coding DNA and fewer non-essential genes. There is a lot of variation in prokaryotic genome size, with the smallest free-living cell's genome being roughly ten times smaller than the largest prokaryote. Two of the bacterial taxa with the smallest genomes are Prochlorococcus and Pelagibacter ubique, both highly abundant marine bacteria commonly found in oligotrophic regions. Similar reduced genomes have been found in uncultured marine bacteria, suggesting that genomic streamlining is a common feature of bacterioplankton. This theory is typically used with reference to free-living organisms in oligotrophic environments.

Landscape genomics is one of many strategies used to identify relationships between environmental factors and the genetic adaptation of organisms in response to these factors. Landscape genomics combines aspects of landscape ecology, population genetics and landscape genetics. The latter addresses how landscape features influence the population structure and gene flow of organisms across time and space. The field of landscape genomics is distinct from landscape genetics in that it is not focused on the neutral genetic processes, but considers, in addition to neutral processes such as drift and gene flow, explicitly adaptive processes, i.e. the role of natural selection.

References

  1. Li, YF; et al. (2008). ""Reverse ecology" and the power of population genomics". Evolution. 62 (12): 2984–2994. doi:10.1111/j.1558-5646.2008.00486.x. PMC   2626434 . PMID   18752601.
  2. Zheng H, Wu H (December 2010). "Gene-centric association analysis for the correlation between the guanine-cytosine content levels and temperature range conditions of prokaryotic species". BMC Bioinformatics. 11 (Suppl 11): S7. doi: 10.1186/1471-2105-11-S11-S7 . PMC   3024870 . PMID   21172057.
  3. Ellison C, et al. (2011). "Population genomics and local adaptation in wild isolates of a model microbial eukaryote". Proceedings of the National Academy of Sciences. 108 (7): 2831–2836. Bibcode:2011PNAS..108.2831E. doi: 10.1073/pnas.1014971108 . PMC   3041088 . PMID   21282627.