LRRN3 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | LRRN3 , FIGLER5, NLRR-3, NLRR3, leucine rich repeat neuronal 3 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 106036 HomoloGene: 36315 GeneCards: LRRN3 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Leucine-rich repeat neuronal protein 3, also known as neuronal leucine-rich repeat protein 3 (NLRR-3), is a protein that in humans is encoded by the LRRN3 gene. [5] [6]
The LRRN3 is located on human chromosome 7, at 7q31.1. [7] It contains 6 distinct gt-ag introns, and transcription produces five different mRNAs that appear to differ by truncation of the 3' end. There are only three main transcript variants that actually encode for the LRRN3 protein, with the longest transcript variant being 3744 base pairs in length. All three of these transcript variants have differing lengths and number of exons, but they all have the exon that includes the entire coding sequence for the LRRN3 protein. [8] There are also two paralogs to the LRRN3 gene. These include the LRRN1 and LRRN2 genes, both of which have the same leucine-rich repeats that are characteristic to this family of genes. [5]
The LRRN3 protein is 708 amino acids in length. The molecular weight of this protein is 79,424 daltons, with an isoelectric point of 8.02. [9] It is known to be a single-pass type I membrane protein because it spans the membrane once, with its N-terminus on the extracellular side of the membrane, and its signal sequence is removed. [7]
The LRRN3 protein contains 12 leucine-rich repeats, along with an LRRNT and an LRR_RI domain. Leucine-rich repeats are unusually rich in the hydrophobic amino acid leucine. They are common in protein-protein interaction motifs and are typically 20-29 amino acids in length. All major classes of LRRs are known to have a curved horseshoe structure with a parallel beta sheet on the concave side and mostly helical elements on the convex side. The LRRN3 protein also has an immunoglobulin domain, a fibronectin type III domain, and a transmembrane region toward the end of the protein. [10] The composition of this protein is shown below.
The LRRN3 gene has been shown to be extremely highly conserved. There are 21 orthologs and homologs for this gene going back to zebra fish. This gene has not been found in any invertebrates or plant species. A multiple sequence alignment has shown this very high conservation of the LRRN3 gene among many different species. All 12 of the leucine-rich repeats, along with the LRRNT and the LRR_RI regions, are highly conserved in all vertebrates for which orthologs of this protein have been obtained. The Ig and FN3 domains also show high conservation in all of the orthologous sequences for mammals and birds, but are not as highly conserved for the rest of the homologous sequences. The high conservation of the leucine-rich repeats, the Ig domain, and the FN3 domain show that these regions must be of importance to the functionality of the LRRN3 gene. [11]
Gene Expression data has shown that the LRRN3 gene is expressed at very high levels in humans, about 2.3 times the average gene. [8] It is most highly expressed in brain, heart, and testes tissues. It is also slightly expressed in kidney, muscle, pharynx, placental, and thymus tissue. The highest expression of the LRRN3 gene for the developmental stages is the fetal stage, but it is also expressed in the infant, juvenile, and adult stages, as can be seen in the EST profile. [12]
The expression data for the LRRN3 gene shows that it has its highest expression in the brain. Overall, the LRRN3 gene seems to have high to moderate expression throughout the majority of brain tissue. The highest expression being found in the cerebral cortex, the hippocampal formation, the cerebellar cortex, and the paraflocculus. All of these regions play a role in some key cognitive function, involving the processing of language and sensory stimuli, memory, and oculomotor behavior. [13] The high expression of the LRRN3 gene in these regions of the brain could show that the LRRN3 gene has some importance in these cognitive function.
The characteristic shape of leucine-rich repeat proteins is an arc or horseshoe shape. This horseshoe shape of the protein is created by a parallel beta sheet on the concave side and mostly helical elements on the convex side. Eleven residue segments of the LRRs, corresponding to the beta-strand and adjacent loop regions are conserved in LRR proteins, whereas the remaining parts of the repeats may be different. The concave face and the adjacent loops are the most common protein interaction surfaces on LRR proteins. 3D structures of some LRR protein-ligand complexes show that the concave surfact of the LRR domain is ideal for interaction with alpha-helix, thus supporting the conclusions that the elongated and curved LRR structure provides a framework for achieving diverse protein-protein interactions. [10]
Comparison of the LRRN3 protein with the chain A of the crystal structure of the LINGO1 ectodomain shows that the LRRN3 protein takes on the characteristic horseshoe shape of most leucine-rich repeat proteins. The LINGO1 ectodomain also has a very long stretch of leucine-rich repeats which is the region that has the best alignment with the LRRN3 protein. This similar region shows that the LRRN3 protein has a structure that includes mostly beta-strands that are connected by loops, with a few alpha helices throughout. [14]
SLITRK1 is a human gene that codes for a transmembrane and signalling protein that is part of the SLITRK gene family, which is responsible for synapse regulation and presynaptic differentiation in the brain. Expression of the gene has been linked to early formation of excitatory synapses through binding with receptor tyrosine phosphatase PTP (LAR-RPTP). Various studies over the years have linked mutations in the gene to conditions on the OCD spectrum, Tourette syndrome and trichotillomania, however the mutations in the genome itself vary greatly between individuals, with most mutations observed being hard to find in repeat studies.
A leucine-rich repeat (LRR) is a protein structural motif that forms an α/β horseshoe fold. It is composed of repeating 20–30 amino acid stretches that are unusually rich in the hydrophobic amino acid leucine. These tandem repeats commonly fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Typically, each repeat unit has beta strand-turn-alpha helix structure, and the assembled domain, composed of many such repeats, has a horseshoe shape with an interior parallel beta sheet and an exterior array of helices. One face of the beta sheet and one side of the helix array are exposed to solvent and are therefore dominated by hydrophilic residues. The region between the helices and sheets is the protein's hydrophobic core and is tightly sterically packed with leucine residues.
Leucine-rich repeat-containing G-protein coupled receptor 5 (LGR5) also known as G-protein coupled receptor 49 (GPR49) or G-protein coupled receptor 67 (GPR67) is a protein that in humans is encoded by the LGR5 gene. It is a member of GPCR class A receptor proteins. R-spondin proteins are the biological ligands of LGR5. LGR5 is expressed across a diverse range of tissue such as in the muscle, placenta, spinal cord and brain and particularly as a biomarker of adult stem cells in certain tissues.
Leucine rich repeat and Immunoglobin-like domain-containing protein 1 also known as LINGO-1 is a protein which is encoded by the LINGO1 gene in humans. It belongs to the family of leucine-rich repeat proteins which are known for playing key roles in the biology of the central nervous system. LINGO-1 is a functional component of the Nogo receptor also known as the reticulon 4 receptor.
SLIT and NTRK-like protein 5 is a protein that in humans is encoded by the SLITRK5 gene.
Transmembrane protein 8B is a protein that in humans is encoded by the TMEM8B gene. It encodes for a transmembrane protein that is 338 amino acids long, and is located on human chromosome 9. Aliases associated with this gene include C9orf127, NAG-5, and NGX61.
Slit is a family of secreted extracellular matrix proteins which play an important signalling role in the neural development of most bilaterians. While lower animal species, including insects and nematode worms, possess a single Slit gene, humans, mice and other vertebrates possess three Slit homologs: Slit1, Slit2 and Slit3. Human Slits have been shown to be involved in certain pathological conditions, such as cancer and inflammation.
Leucine rich repeat containing 57, also known as LRRC57 is a protein encoded in humans by the LRRC57 gene.
MORN1 containing repeat 1, also known as Morn1, is a protein that in humans is encoded by the MORN1 gene.
Leucine rich repeat containing 40 (LRRC40) is a protein that in humans is encoded by the LRRC40 gene.
Family with Sequence Similarity 203, Member B (FAM203B) is a protein encoded by the FAM203B gene (8q24.3) in humans. While FAM203B is only found in humans and possibly non-human primates, its paralog, FAM203A, is highly conserved. The FAM203B protein contains two conserved domains of unknown function, DUF383 and DUF384, and no transmembrane domains. This protein has no known function yet, although the homolog of FAM203A in Caenorhabditis elegans (Y54H5A.2) is thought to help regulate the actin cytoskeleton.
The GPIb-IX-V complex is a profuse membrane receptor complex originating in megakaryocytes and exclusively functional on the surface of platelets. It primarily functions to mediate the first critical step in platelet adhesion, by facilitating binding to von Willebrand factor (VWF) on damaged sub-endothelium under conditions of high fluid shear stress. Although the primary ligand for the GPIb-V-IX receptor is VWF, it can also bind to a number of other ligands in the circulation such as thrombin, P-selectin, factor XI, factor XII, high molecular weight kininogen as well as bacteria. GPIb-IX-V offers a critical role in thrombosis, metastasis, and the life cycle of platelets, and is implicated in a number of thrombotic pathological processes such as stroke or myocardial infarction.
KIAA1841 is a gene in humans that encodes a protein known as KIAA1841. KIAA1841 is targeted for the nucleus and it predicted to play a role in regulating transcription.
EVI5L is a protein that in humans is encoded by the EVI5L gene. EVI5L is a member of the Ras superfamily of monomeric guanine nucleotide-binding (G) proteins, and functions as a GTPase-activating protein (GAP) with a broad specificity. Measurement of in vitro Rab-GAP activity has shown that EVI5L has significant Rab2A- and Rab10-GAP activity.
Leucine rich repeat containing 24 is a protein that, in humans, is encoded by the LRRC24 gene. The protein is represented by the official symbol LRRC24, and is alternatively known as LRRC14OS. The function of LRRC24 is currently unknown. It is a member of the leucine-rich repeat (LRR) superfamily of proteins.
Leucine-rich repeats and IQ motif containing 1 is a protein that in humans is encoded by the LRRIQ1 gene. The protein is likely a nuclear encoding mitochondrial protein and is found in all Metazoans.
LRRIQ3, which is also known as LRRC44, is a protein that in humans is encoded by the LRRIQ3 gene. It is predominantly expressed in the testes, and is linked to a number of diseases.
Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.
KRBA1 is a protein that in humans is encoded by the KRBA1 gene. It is located on the plus strand of chromosome 7 from 149,411,872 to 149,431,664. It is also commonly known under two other aliases: KIAA1862 and KRAB A Domain Containing 1 gene and encodes the KRBA1 protein in humans. The KRBA family of genes is understood to encode different transcriptional repressor proteins
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.