SLC66A3 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | SLC66A3 , C2orf22, PQ loop repeat containing 3, solute carrier family 66 member 3, PQLC3 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 2444067 HomoloGene: 16454 GeneCards: SLC66A3 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Solute carrier family 66 member 3 is a gene in humans that encodes the protein SLC66A3. The function of the SLC66A3 protein is not yet well understood but belongs to a family of five evolutionarily related proteins, the SLC66 lysosomal amino acid transporters. [5] SLC66A3 is localized to the endoplasmic reticulum and has four transmembrane domains. [6]
The SLC66A3 is a gene consisting of 26,831 base pairs spanning from 11,155,467 to 11,178,856 on chromosome 2. [6] SLC66A3 mapped to the plus strand at 2p25.1 and contains 7 exons. [7] The SLC66A3 gene is neighbored by the genes ROCK2, C2orf50, and KCNF1. [6] ROCK2 and C2orf50 are both located upstream of SLC66A3 whereas KCNF1 is located upstream. [6]
SLC66A3 has 14 different mRNAs, there are 12 alternatively-spliced mRNAs that produce functional proteins and 2 unspliced variations that do not. [6] The longest transcript is transcript variant 1 which produces the longest protein at 202 amino acids in length.
The SLC66A3 protein coding gene that produces a protein with the same name. [8] The mRNA transcript variant 1 is 1,717 bp with 7 exons and produces the longest protein, transcript variant 1. SLC66A3 is a 202 amino acid, 22.6 kDa protein with a theoretical isoelectric point of 9.14. [9]
The secondary structure of SLC66A3 is predicted to consist of 7 α-helices. The protein contains a signal peptide, 4 transmembrane regions, and a splicing variant region, and is localized in the endoplasmic reticulum. [7] [8]
I-TASSER predicts that the tertiary structure contains 7 coils with high certainty. [10] DiANNA predicts disulfide bonds between positions 9 - 57, 17 - 121, and 82 - 157. [11]
The promoter of SLC66A3 is 1,169 base pairs long and is located 1000 base pairs upstream of the 5' UTR.
Many different transcription factors regulate the expression of the SLC66A3 gene. Some of these include RNA polymerase II transcription factor II B, Myc associated zinc fingers, EGR/nerve growth factor induced protein C & related factors, and EVI1-myeloid transforming protein.
Transcription Factor | Function |
---|---|
RNA polymerase II transcription factor II B | Stimulation of transcription initiation. [12] |
EVI1-myleoid transforming protein | Involved in the proliferation and differentiation of hematopoietic cells through interaction with GATA-2. [13] |
Krueppel like transcription factors | Involved in β-globin expression and the development of erythrocytes. [14] |
GATA binding factors | Involved in the regulation of transcription during developmental stages and maintaining and developing hematopoietic systems. [15] |
SLC66A3 is expressed at high levels in a variety of tissues throughout the body but is most highly abundant in the whole blood and white blood cells. [6] [16] SLC66A3 is expressed at high levels, approximately 2.1 times more than the average gene. [16]
SLC66A3 is predicted to undergo phosphorylation, N-glycosylation, and myristoylation. [17] [18]
SLC66A3 was present in the common ancestor of animals and is found in all animals but not fungi. [7]
Genus, Species | Common Name | Taxonomic Group | Divergence date from humans (MYA) | accession number | Sequence Length(AA) | Sequence Identity(%) |
Homo sapiens | Human | Primates | 0 | NP_689604.1 | 202 | 100 |
Mus musculus | Mouse | Rodentia | 89 | NP_766162.2 | 202 | 85 |
Phascolarctos cinereus | Koala | Marsupialia | 160 | XP_020841056.1 | 203 | 74 |
Anolis carolinensis | Green anole | Squamata | 318 | XP_003215467.1 | 202 | 64 |
Gallus gallus | Chicken | Galliformes | 318 | XP_040524789.1 | 201 | 71 |
Danio rerio | Zebrafish | Cypriniformes | 433 | NP_001004615.1 | 203 | 59 |
Electrophorus electricus | Electric eel | Gymnotiformes | 433 | XP_026863988.2 | 203 | 58 |
Callorhinchus milii | Elephant shark | Chimaera | 465 | XP_007895274.1 | 218 | 59 |
Petromyzon marinus | Sea lamprey | Petromyzontiformes | 599 | XP_032825995.1 | 201 | 52 |
Crassostrea gigas | Pacific oyster | Ostreida | 736 | XP_034312411.1 | 208 | 36 |
Drosophila willistoni | Fruitfly | Diptera | 736 | XP_002074377.2 | 220 | 34 |
Owenia fusiformis | Tube worm | Canalipalpata | 736 | CAC9661908.1 | 233 | 29 |
Lytechinus variegatus | Variegated sea urchin | Temnopleuroida | 627 | XP_041484893.1 | 224 | 29 |
Asterias rubens | Common starfish | Forcipulatida | 627 | XP_033644897.1 | 213 | 28 |
Amphimedon queenslandica | Sponge | Haplosclerida | 777 | XP_019856231.1 | 221 | 26 |
Protein YIF1A is a Yip1 domain family proteins that in humans is encoded by the YIF1A gene.
Receptor expression-enhancing protein 5 is a protein that in humans is encoded by the REEP5 gene. Receptor Expression Enhancing Protein is a protein encoded for in Humans by the REEP5 gene.
C11orf49 is a protein coding gene that in humans encodes for the C11orf49 protein. It is heavily expressed in brain tissue and peripheral blood mononuclear cells, with the latter being an important component of the immune system. It is predicted that the C11orf49 protein acts as a kinase, and has been shown to interact with HTT and APOE2.
UPF0687 protein C20orf27 is a protein that in humans is encoded by the C20orf27 gene. It is expressed in the majority of the human tissues. One study on this protein revealed its role in regulating cell cycle, apoptosis, and tumorigenesis via promoting the activation of NFĸB pathway.
Solute carrier family 46 member 3 (SLC46A3) is a protein that in humans is encoded by the SLC46A3 gene. Also referred to as FKSG16, the protein belongs to the major facilitator superfamily (MFS) and SLC46A family. Most commonly found in the plasma membrane and endoplasmic reticulum (ER), SLC46A3 is a multi-pass membrane protein with 11 α-helical transmembrane domains. It is mainly involved in the transport of small molecules across the membrane through the substrate translocation pores featured in the MFS domain. The protein is associated with breast and prostate cancer, hepatocellular carcinoma (HCC), papilloma, glioma, obesity, and SARS-CoV. Based on the differential expression of SLC46A3 in antibody-drug conjugate (ADC)-resistant cells and certain cancer cells, current research is focused on the potential of SLC46A3 as a prognostic biomarker and therapeutic target for cancer. While protein abundance is relatively low in humans, high expression has been detected particularly in the liver, small intestine, and kidney.
Coiled-coil domain 47 (CCDC47) is a gene located on human chromosome 17, specifically locus 17q23.3 which encodes for the protein CCDC47. The gene has several aliases including GK001 and MSTP041. The protein itself contains coiled-coil domains, the SEEEED superfamily, a domain of unknown function (DUF1682) and a transmembrane domain. The function of the protein is unknown, but it has been proposed that CCDC47 is involved in calcium ion homeostasis and the endoplasmic reticulum overload response.
TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235.
Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.
KIAA0825 is a protein that in humans is encoded by the gene of the same name, located on chromosome 5, 5q15. It is a possible risk factor in Type II Diabetes, and associated with high levels of glucose in the blood. It is a relatively fast mutating gene, compared to other coding genes. There is however one region which is highly conserved across the species that have the gene, known as DUF4495. It is predicted to travel between the nucleus and the cytoplasm.
Uncharacterized Protein C15orf32 is a protein which in humans is encoded by the C15orf32 gene and is located on chromosome 15, location 15q26.1. Variants of C15orf32 have been linked to bipolar disorder, alcohol use disorder, and acute myeloid leukemia.
TMEM128, also known as Transmembrane Protein 128, is a protein that in humans is encoded by the TMEM128 gene. TMEM128 has three variants, varying in 5' UTR's and start codon location. TMEM128 contains four transmembrane domains and is localized in the Endoplasmic Reticulum membrane. TMEM128 contains a variety of regulation at the gene, transcript, and protein level. While the function of TMEM128 is poorly understood, it interacts with several proteins associated with the cell cycle, signal transduction, and memory.
ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.
C2orf74, also known as LOC339804, is a protein encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, including primarily placental mammals and some marsupials.
SMIM19, also known as Small Integral Membrane Protein 19, encodes the SMIM19 protein. SMIM19 is a confirmed single-pass transmembrane protein passing from outside to inside, 5' to 3' respectively. SMIM19 has ubiquitously high to medium expression with among varied tissues or organs. The validated function of SMIM19 remains under review because of on sub-cellular localization uncertainty. However, all linked proteins research to interact with SMIM19 are associated with the endoplasmic reticulum (ER), presuming SMIM19 ER association
The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.
Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.
Zinc Finger Protein 548 (ZNF548) is a human protein encoded by the ZNF548 gene which is located on chromosome 19. It is found in the nucleus and is hypothesized to play a role in the regulation of transcription by RNA Polymerase II. It belongs to the Krüppel C2H2-type zinc-finger protein family as it contains many zinc-finger repeats.
KIAA2013, also known as Q8IYS2 or MGC33867, is a single-pass transmembrane protein encoded by the KIAA2013 gene in humans. The complete function of KIAA2013 has not yet been fully elucidated.
C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.
{{cite journal}}
: Cite journal requires |journal=
(help)