SLC66A3

Last updated
SLC66A3
Identifiers
Aliases SLC66A3 , C2orf22, PQ loop repeat containing 3, solute carrier family 66 member 3, PQLC3
External IDs MGI: 2444067 HomoloGene: 16454 GeneCards: SLC66A3
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001282710
NM_001282711
NM_001282712
NM_152391

NM_001161111
NM_172574

RefSeq (protein)

NP_001269639
NP_001269640
NP_001269641
NP_689604

NP_766162

Location (UCSC) Chr 2: 11.16 – 11.18 Mb Chr 12: 17.04 – 17.05 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Solute carrier family 66 member 3 is a gene in humans that encodes the protein SLC66A3. The function of the SLC66A3 protein is not yet well understood but belongs to a family of five evolutionarily related proteins, the SLC66 lysosomal amino acid transporters. [5] SLC66A3 is localized to the endoplasmic reticulum and has four transmembrane domains. [6]

Contents

Gene

The SLC66A3 is a gene consisting of 26,831 base pairs spanning from 11,155,467 to 11,178,856 on chromosome 2. [6] SLC66A3 mapped to the plus strand at 2p25.1 and contains 7 exons. [7] The SLC66A3 gene is neighbored by the genes ROCK2, C2orf50, and KCNF1. [6] ROCK2 and C2orf50 are both located upstream of SLC66A3 whereas KCNF1 is located upstream. [6]

Chromosome 2. SLC66A3 is located at 2p25.1 Human chromosome 2 ideogram.svg
Chromosome 2. SLC66A3 is located at 2p25.1

Transcripts

SLC66A3 has 14 different mRNAs, there are 12 alternatively-spliced mRNAs that produce functional proteins and 2 unspliced variations that do not. [6] The longest transcript is transcript variant 1 which produces the longest protein at 202 amino acids in length.

Protein

The SLC66A3 protein coding gene that produces a protein with the same name. [8] The mRNA transcript variant 1 is 1,717 bp with 7 exons and produces the longest protein, transcript variant 1. SLC66A3 is a 202 amino acid, 22.6 kDa protein with a theoretical isoelectric point of 9.14. [9]

Conceptual Translation of SLC66A3 Conceptual Translation of SLC66A3.png
Conceptual Translation of SLC66A3

Secondary structure

The secondary structure of SLC66A3 is predicted to consist of 7 α-helices. The protein contains a signal peptide, 4 transmembrane regions, and a splicing variant region, and is localized in the endoplasmic reticulum. [7] [8]

Tertiary structure

I-TASSER predicts that the tertiary structure contains 7 coils with high certainty. [10] DiANNA predicts disulfide bonds between positions 9 - 57, 17 - 121, and 82 - 157. [11]

Gene level regulation

Promoter

The promoter of SLC66A3 is 1,169 base pairs long and is located 1000 base pairs upstream of the 5' UTR.

Transcription factors

Many different transcription factors regulate the expression of the SLC66A3 gene. Some of these include RNA polymerase II transcription factor II B, Myc associated zinc fingers, EGR/nerve growth factor induced protein C & related factors, and EVI1-myeloid transforming protein.

Transcription FactorFunction
RNA polymerase II transcription factor II BStimulation of transcription initiation. [12]
EVI1-myleoid transforming proteinInvolved in the proliferation and differentiation of hematopoietic cells through interaction with GATA-2. [13]
Krueppel like transcription factorsInvolved in β-globin expression and the development of erythrocytes. [14]
GATA binding factorsInvolved in the regulation of transcription during developmental stages and maintaining and developing hematopoietic systems. [15]

Expression patterns

SLC66A3 is expressed at high levels in a variety of tissues throughout the body but is most highly abundant in the whole blood and white blood cells. [6] [16] SLC66A3 is expressed at high levels, approximately 2.1 times more than the average gene. [16]

Protein Level Regulation

Post translational modifications

SLC66A3 is predicted to undergo phosphorylation, N-glycosylation, and myristoylation. [17] [18]

Homology/Evolution

Date of divergence vs corrected divergence of SLC66A3 orthologs compared to cytochrome c and fibrinogen alpha. Divergence data for SLC66A3.png
Date of divergence vs corrected divergence of SLC66A3 orthologs compared to cytochrome c and fibrinogen alpha.

Orthologs

SLC66A3 was present in the common ancestor of animals and is found in all animals but not fungi. [7]

Genus, SpeciesCommon NameTaxonomic GroupDivergence date from humans (MYA)accession numberSequence Length(AA)Sequence Identity(%)
Homo sapiensHumanPrimates0NP_689604.1202100
Mus musculusMouseRodentia89NP_766162.220285
Phascolarctos cinereusKoalaMarsupialia160XP_020841056.120374
Anolis carolinensisGreen anoleSquamata318XP_003215467.120264
Gallus gallusChickenGalliformes318XP_040524789.120171
Danio rerioZebrafishCypriniformes433NP_001004615.120359
Electrophorus electricusElectric eelGymnotiformes433XP_026863988.220358
Callorhinchus miliiElephant sharkChimaera465XP_007895274.121859
Petromyzon marinusSea lampreyPetromyzontiformes599XP_032825995.120152
Crassostrea gigasPacific oysterOstreida736XP_034312411.120836
Drosophila willistoniFruitflyDiptera736XP_002074377.222034
Owenia fusiformisTube wormCanalipalpata736CAC9661908.123329
Lytechinus variegatusVariegated sea urchinTemnopleuroida627XP_041484893.122429
Asterias rubensCommon starfishForcipulatida627XP_033644897.121328
Amphimedon queenslandicaSpongeHaplosclerida777XP_019856231.122126

Related Research Articles

<span class="mw-page-title-main">YIF1A</span> Protein-coding gene in the species Homo sapiens

Protein YIF1A is a Yip1 domain family proteins that in humans is encoded by the YIF1A gene.

<span class="mw-page-title-main">REEP5</span> Protein-coding gene in the species Homo sapiens

Receptor expression-enhancing protein 5 is a protein that in humans is encoded by the REEP5 gene. Receptor Expression Enhancing Protein is a protein encoded for in Humans by the REEP5 gene.

<span class="mw-page-title-main">C11orf49</span> Protein-coding gene in the species Homo sapiens

C11orf49 is a protein coding gene that in humans encodes for the C11orf49 protein. It is heavily expressed in brain tissue and peripheral blood mononuclear cells, with the latter being an important component of the immune system. It is predicted that the C11orf49 protein acts as a kinase, and has been shown to interact with HTT and APOE2.

<span class="mw-page-title-main">C20orf27</span> Protein-coding gene in the species Homo sapiens

UPF0687 protein C20orf27 is a protein that in humans is encoded by the C20orf27 gene. It is expressed in the majority of the human tissues. One study on this protein revealed its role in regulating cell cycle, apoptosis, and tumorigenesis via promoting the activation of NFĸB pathway.

<span class="mw-page-title-main">SLC46A3</span> Protein-coding gene in the species Homo sapiens

Solute carrier family 46 member 3 (SLC46A3) is a protein that in humans is encoded by the SLC46A3 gene. Also referred to as FKSG16, the protein belongs to the major facilitator superfamily (MFS) and SLC46A family. Most commonly found in the plasma membrane and endoplasmic reticulum (ER), SLC46A3 is a multi-pass membrane protein with 11 α-helical transmembrane domains. It is mainly involved in the transport of small molecules across the membrane through the substrate translocation pores featured in the MFS domain. The protein is associated with breast and prostate cancer, hepatocellular carcinoma (HCC), papilloma, glioma, obesity, and SARS-CoV. Based on the differential expression of SLC46A3 in antibody-drug conjugate (ADC)-resistant cells and certain cancer cells, current research is focused on the potential of SLC46A3 as a prognostic biomarker and therapeutic target for cancer. While protein abundance is relatively low in humans, high expression has been detected particularly in the liver, small intestine, and kidney.

<span class="mw-page-title-main">CCDC47</span> Protein-coding gene in humans

Coiled-coil domain 47 (CCDC47) is a gene located on human chromosome 17, specifically locus 17q23.3 which encodes for the protein CCDC47. The gene has several aliases including GK001 and MSTP041. The protein itself contains coiled-coil domains, the SEEEED superfamily, a domain of unknown function (DUF1682) and a transmembrane domain. The function of the protein is unknown, but it has been proposed that CCDC47 is involved in calcium ion homeostasis and the endoplasmic reticulum overload response.

TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235.

<span class="mw-page-title-main">TMCO4</span> Protein-coding gene in the species Homo sapiens

Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.

<span class="mw-page-title-main">KIAA0825</span> Protein-coding gene in the species Homo sapiens

KIAA0825 is a protein that in humans is encoded by the gene of the same name, located on chromosome 5, 5q15. It is a possible risk factor in Type II Diabetes, and associated with high levels of glucose in the blood. It is a relatively fast mutating gene, compared to other coding genes. There is however one region which is highly conserved across the species that have the gene, known as DUF4495. It is predicted to travel between the nucleus and the cytoplasm.

<span class="mw-page-title-main">Uncharacterized protein C15orf32</span> Protein-coding gene in the species Homo sapiens

Uncharacterized Protein C15orf32 is a protein which in humans is encoded by the C15orf32 gene and is located on chromosome 15, location 15q26.1. Variants of C15orf32 have been linked to bipolar disorder, alcohol use disorder, and acute myeloid leukemia.

<span class="mw-page-title-main">TMEM128</span>

TMEM128, also known as Transmembrane Protein 128, is a protein that in humans is encoded by the TMEM128 gene. TMEM128 has three variants, varying in 5' UTR's and start codon location. TMEM128 contains four transmembrane domains and is localized in the Endoplasmic Reticulum membrane. TMEM128 contains a variety of regulation at the gene, transcript, and protein level. While the function of TMEM128 is poorly understood, it interacts with several proteins associated with the cell cycle, signal transduction, and memory.

<span class="mw-page-title-main">Fam89A</span> Human protein and gene

ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

C2orf74, also known as LOC339804, is a protein encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, including primarily placental mammals and some marsupials.

<span class="mw-page-title-main">SMIM19</span> Protein-coding gene in the species Homo sapiens

SMIM19, also known as Small Integral Membrane Protein 19, encodes the SMIM19 protein. SMIM19 is a confirmed single-pass transmembrane protein passing from outside to inside, 5' to 3' respectively. SMIM19 has ubiquitously high to medium expression with among varied tissues or organs. The validated function of SMIM19 remains under review because of on sub-cellular localization uncertainty. However, all linked proteins research to interact with SMIM19 are associated with the endoplasmic reticulum (ER), presuming SMIM19 ER association

<span class="mw-page-title-main">FAM214B</span> Protein-coding gene in the species Homo sapiens

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

<span class="mw-page-title-main">FAM98C</span> Gene

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

<span class="mw-page-title-main">ZNF548</span> Protein-coding gene in the species Homo sapiens

Zinc Finger Protein 548 (ZNF548) is a human protein encoded by the ZNF548 gene which is located on chromosome 19. It is found in the nucleus and is hypothesized to play a role in the regulation of transcription by RNA Polymerase II. It belongs to the Krüppel C2H2-type zinc-finger protein family as it contains many zinc-finger repeats.

<span class="mw-page-title-main">KIAA2013</span> Protein-coding gene in the species Homo sapiens

KIAA2013, also known as Q8IYS2 or MGC33867, is a single-pass transmembrane protein encoded by the KIAA2013 gene in humans. The complete function of KIAA2013 has not yet been fully elucidated.

<span class="mw-page-title-main">C13orf42</span> C13orf42 gene page

C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000162976 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000045679 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Gyimesi G, Hediger MA (2021-06-25). "SLC66 Lysosomal amino acid transporters in GtoPdb v.2021.2". IUPHAR/BPS Guide to Pharmacology CITE. 2021 (2). doi: 10.2218/gtopdb/F1048/2021.2 . ISSN   2633-1020. S2CID   237877056.
  6. 1 2 3 4 5 6 "SLC66A3 solute carrier family 66 member 3 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-12-19.
  7. 1 2 3 "SLC66A3 Gene". www.genecards.org. Retrieved 2021-10-06.
  8. 1 2 "Homo sapiens solute carrier family 66 member 3 (SLC66A3), transcript variant 1, mRNA". 2021-02-16.{{cite journal}}: Cite journal requires |journal= (help)
  9. "SAPS Results". www.ebi.ac.uk. Retrieved 2021-12-21.
  10. "I-TASSER results". zhanggroup.org. Retrieved 2021-12-21.
  11. "DiANNA". bioinformatics.bc.edu. Retrieved 2021-12-21.
  12. Liu X, Bushnell DA, Kornberg RD (January 2013). "RNA polymerase II transcription: structure and mechanism". Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms. 1829 (1): 2–8. doi:10.1016/j.bbagrm.2012.09.003. PMC   4244541 . PMID   23000482.
  13. Yuan X, Wang X, Bi K, Jiang G (December 2015). "The role of EVI-1 in normal hematopoiesis and myeloid malignancies (Review)". International Journal of Oncology. 47 (6): 2028–2036. doi: 10.3892/ijo.2015.3207 . PMID   26496831.
  14. Pollak NM, Hoffman M, Goldberg IJ, Drosatos K (February 2018). "Krüppel-like factors: Crippling and un-crippling metabolic pathways". JACC. Basic to Translational Science. 3 (1): 132–156. doi:10.1016/j.jacbts.2017.09.001. PMC   5985828 . PMID   29876529.
  15. Gao J, Chen YH, Peterson LC (2015-10-06). "GATA family transcriptional factors: emerging suspects in hematologic disorders". Experimental Hematology & Oncology. 4 (1): 28. doi: 10.1186/s40164-015-0024-z . PMC   4594744 . PMID   26445707.
  16. 1 2 "AceView: Gene:PQLC3, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2021-12-19.
  17. "Motif Scan". myhits.sib.swiss. Retrieved 2021-12-21.
  18. "Services". www.healthtech.dtu.dk. Retrieved 2021-12-21.