LRRIQ3 | Identifiers |
---|---|
Aliases | LRRIQ3, LRRC44, leucine-rich repeats and IQ motif containing 3, leucine rich repeats and IQ motif containing 3 |
External IDs | OMIM: 617957, MGI: 1921685, HomoloGene: 23668, GeneCards: LRRIQ3 |

LRRIQ3 (Leucine-rich repeats and IQ motif containing 3), which is also known as LRRC44, is a protein that in humans is encoded by the LRRIQ3 gene. [5] It is predominantly expressed in the testes, and is linked to a number of diseases. [6]
LRRIQ3 is located on the minus strand of the short arm of human chromosome 1, at 1p31.1. [7]
There are a total of 7 exons in the putative sequence of LRRIQ3. [7]
LRRIQ3 is expressed as 2 primary isoforms, which produce proteins of 624 and 464 amino acids, respectively. [7] It is expressed at low levels in human and brown rat tissues, [8] [9] with the highest expression in testis tissue. Expression is also relatively high in T cells, the epididymis, the kidney, and a number of glands. [10]
Human LRRIQ3 isoform 1 consists of 624 amino acids and has a molecular weight of 73.7 kDa. Its isoelectric point is 9.73, which suggests that the protein carries a net positive charge at normal physiological pH (~7.4). [11] Antibody staining provides strong evidence that human LRRIQ3 localizes to the plasma membrane. [12] LRRIQ3 is rich in lysine, with 82 lysine residues in total, and is slightly depleted in glycine. [13]
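The figures quoted for isoform 1 can be put in proportion with a little arithmetic. The sketch below is only illustrative: it uses the length, molecular weight, lysine count, and isoelectric point given above, and the variable names are made up for this example. It does not compute anything from the actual sequence.

```python
# Values quoted in the text for human LRRIQ3 isoform 1.
LENGTH_AA = 624            # amino acids
MOL_WEIGHT_DA = 73_700     # 73.7 kDa expressed in daltons
LYSINE_COUNT = 82
ISOELECTRIC_POINT = 9.73
PHYSIOLOGICAL_PH = 7.4

# Share of residues that are lysine.
lysine_fraction = LYSINE_COUNT / LENGTH_AA

# Average mass contributed by each residue.
mean_residue_mass_da = MOL_WEIGHT_DA / LENGTH_AA

# A protein is expected to carry a net positive charge when the
# surrounding pH is below its isoelectric point.
net_positive_at_physiological_ph = PHYSIOLOGICAL_PH < ISOELECTRIC_POINT

print(f"Lysine fraction: {lysine_fraction:.1%}")
print(f"Mean residue mass: {mean_residue_mass_da:.1f} Da")
print(f"Net positive at pH {PHYSIOLOGICAL_PH}: {net_positive_at_physiological_ph}")
```

Roughly one residue in eight is a lysine, which is consistent with the high isoelectric point.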
In total, there are 4 conserved domains within LRRIQ3: 3 leucine-rich repeats and 1 IQ calmodulin-binding motif. [13] Leucine-rich repeats are typically involved in protein-protein interactions, and form a characteristic α/β horseshoe fold. [14] [15] An IQ motif provides a binding site for calmodulin (CaM) or CaM-like proteins. [16]
LRRIQ3 is predicted to be mostly alpha-helical in structure, including a long alpha-helical C-terminal domain. It is also predicted to function as a monomer. [17] [18] [19] [20]
LRRIQ3 is predicted to undergo several post-translational modifications, including O-GlcNAcylation, SUMOylation, ubiquitination, and phosphorylation. [22] [23] It is predicted to have 4 well-conserved SUMOylation sites and 1 well-conserved ubiquitination site. [22] A representation of these post-translational modifications is shown in the figure below.
Two-hybrid assays and affinity chromatography provide evidence that LRRIQ3 interacts with a number of proteins, including LYN, NCK2, GNB4, and ABL1. [25] [26] These proteins are associated with cell signalling, cytoskeletal reorganization, and cell differentiation, among other processes. [27] [28] [29] [30]
No paralogs exist for LRRIQ3 in humans. [6] However, there are a number of orthologs, as reported by BLAST, some of which are listed below. [31] The time since divergence from the human lineage, given below in millions of years ago (MYA), was estimated using TimeTree. [32]
Genus and Species | Common Name | Divergence from Human Lineage (MYA) | Accession Number | Sequence length (aa) | Sequence Identity to Human Protein | Sequence Similarity to Human Protein |
---|---|---|---|---|---|---|
Gorilla gorilla gorilla | Gorilla | 9.06 | XP_004026030.1 | 624 | 97% | 98% |
Macaca mulatta | Rhesus monkey | 29.44 | XP_001097148.2 | 623 | 93% | 95% |
Ursus maritimus | Polar bear | 96 | XP_008689049.1 | 625 | 76% | 87% |
Felis catus | Domestic cat | 96 | XP_003990274.1 | 625 | 74% | 86% |
Camelus ferus | Bactrian camel | 96 | XP_006178380.1 | 618 | 73% | 84% |
Oryctolagus cuniculus | European rabbit | 90 | XP_002715603.1 | 622 | 71% | 83% |
Bison bison bison | American bison | 96 | XP_010847739.1 | 625 | 70% | 82% |
Trichechus manatus latirostris | Manatee | 105 | XP_004369192.1 | 623 | 70% | 82% |
Loxodonta africana | African elephant | 105 | XP_003411181.1 | 625 | 68% | 80% |
Condylura cristata | Star-nosed mole | 96 | XP_004679575.1 | 627 | 67% | 80% |
Eptesicus fuscus | Big brown bat | 96 | XP_008137759.1 | 621 | 66% | 80% |
Myotis davidii | Vesper bat | 96 | XP_006775977.1 | 618 | 65% | 79% |
Rattus norvegicus | Norway rat | 90 | NP_001019478.1 | 633 | 62% | 77% |
Mus musculus | House mouse | 90 | NP_083214.2 | 633 | 63% | 76% |
Sorex araneus | Common shrew | 96 | XP_004603704.1 | 612 | 55% | 73% |
Chrysemys picta bellii | Painted turtle | 312 | XP_005285573.1 | 624 | 40% | 56% |
Pogona vitticeps | Bearded dragon | 312 | XP_020650341.1 | 651 | 35% | 54% |
Apteryx australis mantelli | Brown kiwi | 312 | XP_013800580.1 | 664 | 35% | 54% |
Struthio camelus australis | Southern ostrich | 312 | XP_009685099.1 | 628 | 34% | 51% |
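
The table shows sequence identity falling as divergence time increases. A minimal sketch of that trend follows, using the divergence times and identities listed above. The bin edges (50 and 200 MYA) and the bin labels are assumptions chosen only to separate the three visible clusters in the table; they are not taken from the cited sources.

```python
from statistics import mean

# (common name, divergence from human lineage in MYA, % identity to human protein)
# Values copied from the ortholog table.
ORTHOLOGS = [
    ("Gorilla", 9.06, 97),
    ("Rhesus monkey", 29.44, 93),
    ("Polar bear", 96, 76),
    ("Domestic cat", 96, 74),
    ("Bactrian camel", 96, 73),
    ("European rabbit", 90, 71),
    ("American bison", 96, 70),
    ("Manatee", 105, 70),
    ("African elephant", 105, 68),
    ("Star-nosed mole", 96, 67),
    ("Big brown bat", 96, 66),
    ("Vesper bat", 96, 65),
    ("Norway rat", 90, 62),
    ("House mouse", 90, 63),
    ("Common shrew", 96, 55),
    ("Painted turtle", 312, 40),
    ("Bearded dragon", 312, 35),
    ("Brown kiwi", 312, 35),
    ("Southern ostrich", 312, 34),
]


def divergence_bin(mya):
    """Assign a divergence time to one of three illustrative bins."""
    # Hypothetical bin edges, chosen only for this example.
    if mya < 50:
        return "primates (<50 MYA)"
    if mya < 200:
        return "other mammals (50-200 MYA)"
    return "reptiles and birds (>200 MYA)"


def mean_identity_by_bin(rows):
    """Average the percent identity of all orthologs in each bin."""
    bins = {}
    for _name, mya, identity in rows:
        bins.setdefault(divergence_bin(mya), []).append(identity)
    return {label: mean(values) for label, values in bins.items()}


summary = mean_identity_by_bin(ORTHOLOGS)
for label, value in summary.items():
    print(f"{label}: {value:.1f}% mean identity")
```

With only the orthologs listed here, the averages are a rough summary rather than an estimate of evolutionary rate.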
LRRIQ3 is linked to a number of cancers. RNA-seq experiments have shown that LRRIQ3 is severely down-regulated (log2 fold changes between -3.4 and -4.2) in a number of disease states, including pancreatic cancer, colorectal cancer, and breast cancer. [33] [34] [35]
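
Log2 fold changes are easier to interpret once converted back to a linear scale. The short sketch below shows that conversion for the two bounds quoted above; the function name is made up for this example, and the calculation assumes the usual definition of log2 fold change as log2(disease level / reference level).

```python
def fold_decrease(log2_fold_change):
    """Return how many times lower expression is for a negative log2 fold change."""
    return 2 ** (-log2_fold_change)


for lfc in (-3.4, -4.2):
    remaining = 2 ** lfc  # fraction of the reference expression level that remains
    print(f"log2 fold change {lfc}: about {fold_decrease(lfc):.1f}-fold lower, "
          f"{remaining:.1%} of the reference level")
```

Under that definition, the reported range corresponds to expression roughly 10- to 18-fold lower than in the reference samples.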