IRX1 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | IRX1 , IRX-5, IRXA1, iroquois homeobox 1 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 606197 MGI: 1197515 HomoloGene: 19065 GeneCards: IRX1 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Iroquois-class homeodomain protein IRX-1, also known as Iroquois homeobox protein 1, is a protein that in humans is encoded by the IRX1 gene. [5] [6] All members of the Iroquois (IRO) family of proteins share two highly conserved features, encoding both a homeodomain and a characteristic IRO sequence motif. [7] Members of this family are known to play numerous roles in early embryo patterning. [5] IRX1 has also been shown to act as a tumor suppressor gene in several forms of cancer. [8] [9] [10] [11]
IRX1 is a member of the Iroquois homeobox gene family. Members of this family play multiple roles during pattern formation in embryos of numerous vertebrate and invertebrate species. [5] [12] IRO genes are thought to function early in development to define large territories, and again later in development for further patterning specification. [7] Experimental data suggest roles for IRX1 in vertebrates may include development and patterning of lungs, limbs, heart, eyes, and nervous system. [13] [14] [15] [16] [17] [18]
IRX1 is located on the forward DNA strand (see Sense (molecular biology)) of chromosome 5, from position 3596054 - 3601403 at the 5p15.3 location. [5] The human gene product is a 1858 base pair mRNA with 4 predicted exons in humans. [19] Promoter analysis was performed using El Dorado through the Genomatix software page. [20] The predicted promoter region spans 1040 base pairs from position 3595468 through 3595468 on the forward strand of chromosome 5.
IRX1 is relatively isolated, with no other protein coding genes found from position 3177835 – 5070004. [5]
Microarray and RNA seq data suggest that IRX1 is ubiquitously expressed at low levels in adult tissues, with the highest relative levels of expression occurring in the heart, adipose, kidney, and breast tissues. [21] [22] Moderate to high levels are also indicated in the lung, prostate and stomach. [22] [23] Promoter analysis with the El Dorado program from Genomatix predicted that IRX1 expression is regulated by factors that include E2F cell cycle regulators, NRF1, and ZF5, [24] and brachyury. [20] Expression data from human, mouse, and developing mouse brains are available though the Allen Brain Atlas. [25]
The mature IRX1 protein has 480 amino acid residues, with a molecular mass of 49,600 daltons and an isoelectric point of 5.7. A BLAST search revealed that IRX1 contains two highly conserved domains: a homeodomain and a characteristic IRO motif of unknown function. [26] The homeodomain belongs to the TALE (three amino acid loop extension) class of homeodomains, and is characterized by the addition of three extra amino acids between the first and second helix of three alpha helices that comprise the domain. [27] The presence of this well characterized homeodomain strongly suggests that IRX1 acts as a transcription factor. This is further supported by the predicted localization of IRX1 to the nucleus. [28] The IRO motif is a region downstream of the homeodomain that is found only in members of the Iroquois-class homeodomain proteins, though its function is poorly understood. However, its similarity to an internal region of the Notch receptor protein suggests that it may be involved with protein-protein interaction. [7] In addition to these two characteristic domains, IRX1 contains a third domain from the HARE-HTH superfamily [29] fused to the C-terminal end of the homeodomain. [30] This domain adopts a winged helix-turn-helix fold predicted to bind DNA, and is thought to play a role in recruiting effector activities to DNA. [29] Several forms of post-translational modification are predicted, including SUMOylation, C-mannosylation, and phosphorylation, using bioinformatics tools from ExPASy. [31] Bioinformatic analysis of IRX1 with the NetPhos tool predicted 71 potential phosphorylation sites throughout the protein. [32]
Potential protein interacting partners for IRX1 were found using computational tools. The STRING database lists nine putative interacting partners supported by text mining evidence, though closer analysis of the results shows little support for most of these predicted interactions. [33] However, it is possible that one of these proteins, CDKN1A, is involved in the predicted regulation of IRX1 by E2F cell cycle regulators. [20] [33]
IRX1 has a high degree of conservation across vertebrate and invertebrate species. The entire protein is more fully conserved through vertebrate species, while only the homeodomain and IRO motif are conserved in more distant homologs. [12] Homologous sequences were found in species as distantly related to humans as the pig roundworm Ascaris suum, from the family Ascarididae, using BLAST and the ALIGN tool through the San Diego Super Computer Biology Workbench. [26] The following is a table describing the evolutionary conservation of IRX1.
Genus Species | Organism Common Name | Divergence from Humans (MYA) [34] | NCBI Protein Accession Number | Sequence Identity [26] | Protein Length | Common Gene Name |
---|---|---|---|---|---|---|
Homo sapiens [30] | Humans | -- | NP_077313 | 100% | 480 | IRX-1 |
Pongo abelii [35] | Sumatran Orangutan | 15.7 | XP_002815448 | 99% | 480 | IRX-1 |
Bos taurus [36] | Cattle | 94.2 | XP_002696496 | 92.3% | 476 | IRX-1 |
Mus musculus [37] | House Mouse | 92.3 | NP_034703 | 91.5% | 480 | IRX-1 |
Rattus norvegicus [38] | Brown rat | 92.3 | NP_001100801 | 90.4% | 480 | IRX-1 |
Gallus gallus [39] | Red Junglefowl | 296 | NP_001025509 | 72.9% | 467 | IRX-1 |
Xenopus tropicalis [40] | Western clawed frog | 371.2 | NP_001188351 | 68% | 467 | IRX-1 |
Latimeria chalumnae [41] | West Indian Ocean coelacanth | 441.9 | XP_006002089 | 65.1% | 460 | Irx-1-A-like isoform X1 |
Danio rerio [42] | Zebrafish | 400.1 | NP_997067 | 61.1% | 426 | Irx-1 isoform 1 |
Taeniopygia guttata [43] | Zebra finch | 296 | XP_002189063 | 59.7% | 400 | Irx-1-A-like |
Astyanax mexicanus [44] | Mexican tetra | 400.1 | XP_007254591.1 | 58% | 450 | IRX-1 |
Ophiophagus hannah [45] | King cobra | 296 | ETE68928 | 54.5% | 387 | Irx-1-A partial |
Ovis aries [46] | Sheep | 94.2 | XP_004017207 | 43.3% | 260 | IRX-1 |
Condylura cristata [47] | Star-nosed mole | 94.2 | XP_004678440 | 41.7% | 342 | IRX-1 |
Branchiostoma floridae [48] | Lancelet | 713.2 | ACF10237.1 | 35.5% | 461 | Iroquois A isoform 1 |
Strongylocentrotus purpuratus [49] | Purple sea urchin | 742.9 | NP_001123285 | 31.7% | 605 | Iroquois homeobox A |
Ascaris suum [50] | Pig roundworm | 937.5 | F1KXE6 | 29% | 444 | IRX-1 |
Caenorhabditis elegans [51] | Nematode roundworm | 937.5 | NP_492533.2 | 28.6% | 377 | IRX-1 |
Drosophila melanogaster [52] | Fruit fly | 782.7 | NP_524045 | 27% | 717 | Araucan isoform A |
IRX1 is one of six members of the Iroquois-class homeodomain proteins found in humans: IRX2 , IRX3 , IRX4 , IRX5 , and IRX6 . IRX1, IRX2, and IRX4 are found on human chromosome 5, and their orientation corresponds to that of IRX3, IRX5, and IRX6 found on human chromosome 16. [7] It is thought that the genomic organization of IRO genes in conserved gene clusters allows for coregulation and enhancer sharing during development.
A homeobox is a DNA sequence, around 180 base pairs long, that regulates large-scale anatomical features in the early stages of embryonic development. Mutations in a homeobox may change large-scale anatomical features of the full-grown organism.
Pre-B-cell leukemia transcription factor 1 is a protein that in humans is encoded by the PBX1 gene. The homologous protein in Drosophila is known as extradenticle, and causes changes in embryonic development.
Homeobox protein MSX-2 is a protein that in humans is encoded by the MSX2 gene.
PBX/Knotted 1 Homeobox 1 (PKNOX1) is a protein that in humans is encoded by the PKNOX1 gene.
Homeobox protein Hox-C4 is a protein that in humans is encoded by the HOXC4 gene.
Homeobox protein Meis2 is a protein that in humans is encoded by the MEIS2 gene.
Double homeobox, 4 also known as DUX4 is a protein which in humans is encoded by the DUX4 gene. Its misexpression is the cause of facioscapulohumeral muscular dystrophy (FSHD).
Iroquois-class homeodomain protein IRX-3, also known as Iroquois homeobox protein 3, is a protein that in humans is encoded by the IRX3 gene.
Iroquois-class homeodomain protein IRX-2, also known as Iroquois homeobox protein 2, is a protein that in humans is encoded by the IRX2 gene.
Iroquois-class homeodomain protein IRX-4, also known as Iroquois homeobox protein 4, is a protein that in humans is encoded by the IRX4 gene.
Iroquois-class homeodomain protein IRX-5, also known as Iroquois homeobox protein 5, is a protein that in humans is encoded by the IRX5 gene.
Iroquois-class homeodomain protein IRX-6, also known as Iroquois homeobox protein 6, is a protein that in humans is encoded by the IRX6 gene.
Homeobox protein Mohawk, also known as iroquois homeobox protein-like 1, is a protein that in humans is encoded by the MKX gene. MKX is a member of an Iroquois (IRX) family-related class of 'three-amino acid loop extension' (TALE) atypical homeobox proteins characterized by 3 additional amino acids in the loop region between helix I and helix II of the homeodomain.
Protein FAM83A also known as tumor antigen BJ-TSA-9 is a protein that in humans is encoded by the FAM83A gene.
Transmembrane protein 131 (TMEM131) is a protein that is encoded by the TMEM131 gene in humans. The TMEM131 protein contains three domains of unknown function 3651 (DUF3651) and two transmembrane domains. This protein has been implicated as having a role in T cell function and development. TMEM131 also resides in a locus (2q11.1) that is associated with Nievergelt's Syndrome when deleted.
Transmembrane Protein 205 (TMEM205) is a protein encoded on chromosome 19 by the TMEM205 gene.
PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.
Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.
BEND2 is a protein that in humans is encoded by the BEND2 gene. It is also found in other vertebrates, including mammals, birds, and reptiles. The expression of BEND2 in Homo sapiens is regulated and occurs at high levels in the skeletal muscle tissue of the male testis and in the bone marrow. The presence of the BEN domains in the BEND2 protein indicates that this protein may be involved in chromatin modification and regulation.
PBX/Knotted 1 Homeobox 2 (PKNOX2) protein belongs to the three amino acid loop extension (TALE) class of homeodomain proteins, and is encoded by PKNOX2 gene in humans. The protein regulates the transcription of other genes and affects anatomical development.
This article incorporates text from the United States National Library of Medicine, which is in the public domain.