Leucine-rich repeats and iq motif containing 1

Last updated
LRRIQ1 chromosome location LRRIQ1 Chromosomal location.png
LRRIQ1 chromosome location
LRRIQ1
Identifiers
Aliases LRRIQ1 , leucine rich repeats and IQ motif containing 1
External IDs MGI: 1922228 HomoloGene: 46007 GeneCards: LRRIQ1
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001079910
NM_032165

NM_001163559
NM_029134
NM_001361058

RefSeq (protein)

NP_001073379

NP_001157031
NP_083410
NP_001347987

Location (UCSC) Chr 12: 85.04 – 85.26 Mb Chr 10: 102.88 – 103.07 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse
Using the leucine-rich repeat Query sequence of the LRRIQ1 protein, the Phyre2 program was utilized to make a figure outlining the predicted secondary structure based on its similarity to template leucine rich motifs. Phyre2 prediction of Secondary Structure.pdf
Using the leucine-rich repeat Query sequence of the LRRIQ1 protein, the Phyre2 program was utilized to make a figure outlining the predicted secondary structure based on its similarity to template leucine rich motifs.

Leucine-rich repeats and IQ motif containing 1 is a protein that in humans is encoded by the LRRIQ1 gene. [5] The protein is likely a nuclear encoding mitochondrial protein [6] and is found in all Metazoans. [7]

Contents

Gene

LRRIQ1 is mapped on chromosome 12, at 12q21.31in humans. LRRIQ1 is near ALX1 on the positive strand, and TSPAN19 and SLC6115 on the negative strand. It covers 208.78kb, from 85430099 to 8563881 on the direct strand. The gene contains 36 exons. [8]

mRNA

The gene contains 31 distinct introns, and the transcript produces 10 different mRNAs. LRRIQ1 has two validated alternative polyadenylation sites. The most common isoform consists of 5,460 base pairs in length, and includes 28 of the total 29 exons. [7] Primates have an elongated 3’ end compared to other mammalian species. Reptiles, birds, and fish also have a truncated 3’end, compared to primate transcripts. [9]

Protein

The protein is a nuclear encoding mitochondrial protein. [6] The protein in humans has 1760 amino acids. The protein is considered largely neutral, though 17% of the primary structure is composed of the hydrophobic leucine-rich repeats. [10]

The leucine-rich repeat forms a structural horseshoe shape, which encourages protein-protein interactions. The most common translated isoform has a predicted molecular weight of 199.3 kdal. [5] [10] Compared to an average of human sequences, the internal composition is rich in Leucine, Glutamic Acid, and Lysine. [11]

Domains and Motifs

LRRIQ1 contains an IQ calmodulin-binding motif found in one isoform. The isoform contains three copies and serves as a binding site for Calmodulin or CaM-like proteins. [5] The Leucine-Rich Repeat domain is found in three isoforms of LRRIQ1. LRRIQ1 contains 4 Leucine Rich Repeats (LRR). The LRR motif provides a structural frame work for the formation of protein-protein interactions, forming a coiled horseshoe shape. [10]

Homology

There are no known paralogs of LRRIQ1 detected in humans.

There are many orthologs of LRRIQ1. Orthologous LRRIQ1 is found in all metazoans. LRRIQ1 is not found in Plants, Bacteria, Archaea, Fungi, or protists. The most distant homolog is found in Drosophila melanogaster [9] (estimated time of divergence 847 million years ago [12] ). The IQ-containing motif and Leucine-rich repeats domains are conserved in Drosophila.

Conservation

The LRRIQ1 gene has been shown to be highly conserved. The gene has true orthologs throughout the taxa mammal and is found in all Metazoans. The time of divergence versus the corrected % divergence (m) was plotted with samples from human, gorilla, domesticate cat, bison, orca whale, Arabian camel, domestic horse, African Bush Elephant, Bald Eagle, Adelie Penguin, Japanese Gecko, Carolina Anole, and Western Clawed Frog. [9] [12] To make slopes for Fibrinogen (considered a comparatively rapidly evolving protein) and Cytochrome C (comparatively slower), Xenopus tropical, Xenopus laevis, Takifugu rubripes, and Bos Taurus were utilized for comparison.

Expression

LRRIQ1 is lowly expressed (0.6 times the average gene) in lung, testis, epithelial tissue, pooled germ cell tumors, brain tissues, embryonic tissues, and adipose tissues. [7]

Interacting Proteins

The presence of the Leucine-Rich Repeat motif provides structural framework for protein-protein interactions. HES4 is the only identified protein that interacts with LRRIQ1. [13]

HES4, is a transcription factor found in humans. The protein binds DNA on N-box motifs. [14]

Clinical significance

To date, the clinical significance of this gene is not known.

Related Research Articles

Chitinase domain-containing protein 1 Protein-coding gene in the species Homo sapiens

Chitinase domain-containing protein 1 (CHID1) is a highly conserved protein of unknown function located on the short (p) arm of chromosome 11 near the telomere. The protein has 27 introns, which allows for many isoforms of this gene. It has several aliases, the most common of which is Stabilin-1 interacting chitinase-like protein (SI-CLP). As indicated by the alias, CHID1 is known to interact with the protein STAB1. CHID1 is expressed ubiquitously at levels nearly 6 times the average gene, and is conserved very far back to organisms such as Caenorhabditis elegans and possibly some prokaryotes. This protein is known to have carbohydrate binding sites, which could be involved in carbohydrate catabolysis.

LOC105377021 is a protein which in humans is encoded by the LOC105377021 gene. LOC105377021 exhibits expressional pathology related to breast cancer, specifically triple negative breast cancer. LOC105377021 contains a serine rich region in addition to predicted alpha helix motifs.

TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235.

LRRC24 Protein-coding gene in the species Homo sapiens

Leucine rich repeat containing 24 is a protein that, in humans, is encoded by the LRRC24 gene. The protein is represented by the official symbol LRRC24, and is alternatively known as LRRC14OS. The function of LRRC24 is currently unknown. It is a member of the leucine-rich repeat (LRR) superfamily of proteins.

<span class="mw-page-title-main">Trinucleotide repeat containing 18</span>

Trinucleotide repeat containing 18 is a protein that in humans is encoded by the TNRC18 gene.

BEND2 is a protein that in humans is encoded by the BEND2 gene. It is also found in other vertebrates, including mammals, birds, and reptiles. The expression of BEND2 in Homo sapiens is regulated and occurs at high levels in the skeletal muscle tissue of the male testis and in the bone marrow. The presence of the BEN domains in the BEND2 protein indicates that this protein may be involved in chromatin modification and regulation.

<span class="mw-page-title-main">TMCO4</span>

Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.

<span class="mw-page-title-main">SHLD1</span>

SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.

<span class="mw-page-title-main">FAM71E1</span> Mammalian protein found in Homo sapiens

FAM71E1, also known as Family With Sequence Similarity 71 Member E1, is a protein that in humans is encoded by the FAM71E1 gene. It is thought to be ubiquitously expressed at low levels throughout the body, and it is conserved in vertebrates, particularly mammals and some reptiles. The protein is localized to the nucleus and can be exported to the cytoplasm.

ZCCHC18 Protein-coding gene in the species Homo sapiens

Zinc finger CCHC-type containing 18 (ZCCHC18) is a protein that in humans is encoded by ZCCHC18 gene. It is also known as Smad-interacting zinc finger protein 2 (SIZN2), para-neoplastic Ma antigen family member 7b (PNMA7B), and LOC644353. Other names such as zinc finger, CCHC domain containing 12 pseudogene 1, P0CG32, ZCC18_HUMAN had been used to describe this protein.

C15orf39

C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.

C9orf25 Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">LRRIQ3</span>

LRRIQ3, which is also known as LRRC44, is a protein that in humans is encoded by the LRRIQ3 gene. It is predominantly expressed in the testes, and is linked to a number of diseases.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">KRBA1</span>

KRBA1 is a protein that in humans is encoded by the KRBA1 gene. It is located on the plus strand of chromosome 7 from 149,411,872 to 149,431,664. It is also commonly known under two other aliases: KIAA1862 and KRAB A Domain Containing 1 gene and encodes the KRBA1 protein in humans. The KRBA family of genes is understood to encode different transcriptional repressor proteins

<span class="mw-page-title-main">Fam89A</span>

ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.

<span class="mw-page-title-main">LSMEM2</span>

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

ISLR Protein-coding gene in the species Homo sapiens

In humans, the immunoglobulin super family containing leucine-rich repeat (ISLR) protein is encoded by the ISLR gene. Current RNA-seq studies show that the protein is highly expressed in the endometrium and ovary and shows expression among 25 other tissues. The protein is seen localized in the cytoplasm, plasma membrane, extracellular exosome, and platelet alpha granule lumen. Furthermore, the protein is known to play a role in platelet degranulation, cell adhesion, and response to elevated platelet cytosolic Ca2+.

<span class="mw-page-title-main">FAM98C</span> Gene

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000133640 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000019892 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 3 "Entrez Gene: Leucine-rich repeats and IQ motif containing 1" . Retrieved 2016-03-29.
  6. 1 2 Hart, Gerald W.; Akimoto, Yoshihiro (2009-01-01). Varki, Ajit; Cummings, Richard D.; Esko, Jeffrey D.; Freeze, Hudson H.; Stanley, Pamela; Bertozzi, Carolyn R.; Hart, Gerald W.; Etzler, Marilynn E. (eds.). The O-GlcNAc Modification (2nd ed.). Cold Spring Harbor (NY): Cold Spring Harbor Laboratory Press. ISBN   9780879697709. PMID   20301273.
  7. 1 2 3 Thierry-Mieg, Danielle; Thierry-Mieg, Jean. "AceView: Gene:LRRIQ1, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2016-05-03.
  8. "LRRIQ1 leucine-rich repeats and IQ motif containing 1 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-03.
  9. 1 2 3 "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2016-05-03.
  10. 1 2 3 Kelley, Lawrence. "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk. Retrieved 2016-05-03.
  11. "ProMoST: Protein Modification Screening Tool | proteomics.mcw.edu". proteomics.mcw.edu. Retrieved 2016-05-03.
  12. 1 2 "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2016-05-03.
  13. "mentha: the interactome browser". mentha.uniroma2.it. Retrieved 2016-05-03.
  14. Thierry-Mieg, Danielle; Thierry-Mieg, Jean. "AceView: Gene:HES4, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2016-05-03.

Further reading