Orphon

An orphon is a gene located outside the main chromosomal locus of its gene family, typically as a copy dispersed to an unlinked genomic location. [1] [2]

Orphons have been found in both protein-coding and non-protein-coding gene families, which suggests that most gene transcription processes do not restrict the formation of orphons. Orphon content has also been shown to be highly polymorphic between individuals of the same species. This class of genes was first discovered in yeast, sea urchins, and fruit flies, [1] and has since been reported in the genomes of many other eukaryote groups, including molluscs, [3] amphibians, [4] and mammals, including humans. [5]

References

  1. Childs, G.; Maxson, R.; Cohn, R. H.; Kedes, L. (1981). "Orphons: Dispersed genetic elements derived from tandem repetitive genes of eucaryotes". Cell. 23 (3): 651–663. doi:10.1016/0092-8674(81)90428-1. PMID 6784929. S2CID 44633130.
  2. Borden, P.; Jaenichen, R.; Zachau, H. G. (1990). "Structural features of transposed human VK genes and implications for the mechanism of their transpositions". Nucleic Acids Research. 18 (8): 2101–2107. doi:10.1093/nar/18.8.2101. PMC 330689. PMID 2159639.
  3. Eirín-López, J. M.; González-Tizón, A. M.; Martinez, A.; Méndez, J. (2002). "Molecular and evolutionary analysis of mussel histone genes (Mytilus spp.): possible evidence of an "orphon origin" for H1 histone genes". Journal of Molecular Evolution. 55 (3): 272–283. Bibcode:2002JMolE..55..272E. doi:10.1007/s00239-002-2325-1. hdl:2183/22492. PMID 12187381. S2CID 11565940.
  4. Guimond, A.; Moss, T. (1999). "A ribosomal orphon sequence from Xenopus laevis flanked by novel low copy number repetitive elements". Biological Chemistry. 380 (2): 167–174. doi:10.1515/BC.1999.025. PMID 10195424. S2CID 30071264.
  5. Huber, C.; Thiebe, R.; Hameister, H.; Smola, H.; Lötscher, E.; Zachau, H. G. (1990). "A human immunoglobulin kappa orphon without sequence defects may be the product of a pericentric inversion". Nucleic Acids Research. 18 (12): 3475–3478. doi:10.1093/nar/18.12.3475. PMC 330999. PMID 2114012.