Tandem exon duplication

Last updated

Tandem exon duplication is defined as duplication of exons within the same gene to give rise to the subsequent exon. A complete exon analysis of all genes in Homo sapiens , Drosophila melanogaster , and Caenorhabditis elegans has shown 12,291 instances of tandem duplication in exons in human, fly, and worm. Analysis of the intronic region has produced further 4,660 unidentified duplicated exons referred to as unannotated exons. 1,578 of these unannotated exons contained stop codons thus not considered potential exons. 35.1% of the unannotated exons were found in the EST sequence thus confirming the potential of the presence of these exons in protein transcripts. [1]

See also

Related Research Articles

An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word intron is derived from the term intragenic region, i.e. a region inside a gene. The term intron refers to both the DNA sequence within a gene and the corresponding RNA sequence in RNA transcripts. The non-intron sequences that become joined by this RNA processing to form the mature RNA are called exons.

<span class="mw-page-title-main">Pseudogene</span> Functionless relative of a gene

Pseudogenes are nonfunctional segments of DNA that resemble functional genes. Most arise as superfluous copies of functional genes, either directly by gene duplication or indirectly by reverse transcription of an mRNA transcript. Pseudogenes are usually identified when genome sequence analysis finds gene-like sequences that lack regulatory sequences needed for transcription or translation, or whose coding sequences are obviously defective due to frameshifts or premature stop codons. Pseudogenes are a type of junk DNA.

Gene duplication is a major mechanism through which new genetic material is generated during molecular evolution. It can be defined as any duplication of a region of DNA that contains a gene. Gene duplications can arise as products of several types of errors in DNA replication and repair machinery as well as through fortuitous capture by selfish genetic elements. Common sources of gene duplications include ectopic recombination, retrotransposition event, aneuploidy, polyploidy, and replication slippage.

Exon shuffling is a molecular mechanism for the formation of new genes. It is a process through which two or more exons from different genes can be brought together ectopically, or the same exon can be duplicated, to create a new exon-intron structure. There are different mechanisms through which exon shuffling occurs: transposon mediated exon shuffling, crossover during sexual recombination of parental genomes and illegitimate recombination.

<span class="mw-page-title-main">Gene cluster</span>

A gene family is a set of homologous genes within one organism. A gene cluster is a group of two or more genes found within an organism's DNA that encode similar polypeptides, or proteins, which collectively share a generalized function and are often located within a few thousand base pairs of each other. The size of gene clusters can vary significantly, from a few genes to several hundred genes. Portions of the DNA sequence of each gene within a gene cluster are found to be identical; however, the resulting protein of each gene is distinctive from the resulting protein of another gene within the cluster. Genes found in a gene cluster may be observed near one another on the same chromosome or on different, but homologous chromosomes. An example of a gene cluster is the Hox gene, which is made up of eight genes and is part of the Homeobox gene family.

Multiplex ligation-dependent probe amplification (MLPA) is a variation of the multiplex polymerase chain reaction that permits amplification of multiple targets with only a single primer pair. It detects copy number changes at the molecular level, and software programs are used for analysis. Identification of deletions or duplications can indicate pathogenic mutations, thus MLPA is an important diagnostic tool used in clinical pathology laboratories worldwide.

<span class="mw-page-title-main">Small nucleolar RNA SNORD115</span>

In molecular biology, SNORD115 is a non-coding RNA (ncRNA) molecule known as a small nucleolar RNA which usually functions in guiding the modification of other non-coding RNAs. This type of modifying RNA is usually located in the nucleolus of the eukaryotic cell which is a major site of snRNA biogenesis. HBII-52 refers to the human gene, whereas RBII-52 is used for the rat gene and MBII-52 is used for naming the mouse gene.

<span class="mw-page-title-main">HOXC6</span> Protein-coding gene in the species Homo sapiens

Homeobox protein Hox-C6 is a protein that in humans is encoded by the HOXC6 gene. Hox-C6 expression is highest in the fallopian tube and ovary. HoxC6 has been highly expressed in many types of cancers including prostate, breast, and esophageal squamous cell cancer.

<span class="mw-page-title-main">TNNT3</span> Protein-coding gene in the species Homo sapiens

Fast skeletal muscle troponin T (fTnT) is a protein that in humans is encoded by the TNNT3 gene.

<span class="mw-page-title-main">TMEM50A</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 50A is a protein that in humans is encoded by the TMEM50A gene.

<span class="mw-page-title-main">STK19</span> Protein-coding gene in the species Homo sapiens

Serine/threonine-protein kinase 19 is an enzyme that in humans is encoded by the STK19 gene.

<span class="mw-page-title-main">PCDHA6</span> Protein-coding gene in the species Homo sapiens

Protocadherin alpha-6 is a protein that in humans is encoded by the PCDHA6 gene.

<span class="mw-page-title-main">PCDHA4</span> Protein-coding gene in the species Homo sapiens

Protocadherin alpha-4 is a protein that in humans is encoded by the PCDHA4 gene.

<span class="mw-page-title-main">PCDHA2</span> Protein-coding gene in the species Homo sapiens

Protocadherin alpha-2 is a protein that in humans is encoded by the PCDHA2 gene.

<span class="mw-page-title-main">PCDHA9</span> Protein-coding gene in the species Homo sapiens

Protocadherin alpha-9 is a protein that in humans is encoded by the PCDHA9 gene.

<span class="mw-page-title-main">PCDHA5</span> Protein-coding gene in the species Homo sapiens

Protocadherin alpha-5 is a protein that in humans is encoded by the PCDHA5 gene.

<span class="mw-page-title-main">PCDHAC2</span> Protein-coding gene in the species Homo sapiens

Protocadherin alpha-C2 is a protein that in humans is encoded by the PCDHAC2 gene.

<span class="mw-page-title-main">PCDHA3</span> Protein-coding gene in the species Homo sapiens

Protocadherin alpha-3 is a protein that in humans is encoded by the PCDHA3 gene.

<span class="mw-page-title-main">Protocadherin alpha 1</span> Protein-coding gene in the species Homo sapiens

Protocadherin alpha-1 is a protein that in humans is encoded by the PCDHA1 gene.

Low copy repeats (LCRs), also known as segmental duplications (SDs), are DNA sequences present in multiple locations within a genome that share high levels of sequence identity.

References

  1. Letunic I, Copley RR, Bork P (Jun 2002). "Common exon duplication in animals and its role in alternative splicing". Hum Mol Genet. 11 (13): 1561–7. doi: 10.1093/hmg/11.13.1561 . PMID   12045209.