KIAA1109

Last updated
KIAA1109
Identifiers
Aliases KIAA1109 , 4932438A13Rik, 4732443H21, B830039D19Rik, D630029K19Rik, FSA, Kiaa1109, Tweek, ALKKUCS
External IDs OMIM: 611565 MGI: 2444631 HomoloGene: 52105 GeneCards: KIAA1109
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_015312
NM_032202
NM_001384125

NM_172679

RefSeq (protein)

NP_056127
NP_001371054

NP_766267

Location (UCSC) Chr 4: 122.15 – 122.36 Mb Chr 3: 36.92 – 37.11 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Uncharacterized protein KIAA1109 is a protein that in humans is encoded by the KIAA1109 gene. [5] [6] [7]

Contents

This protein has a function that is not yet understood. KIAA1109 has 3 aliases, FSA (fragile site-associated) protein, MGC110967 and DKFZp781P0474. [8]

Gene

Location

KIAA1109 is found on the long arm of chromosome 4 (4q27), with the genomic sequence starting at 118,818,167 bp and ending at 119,010,362 bp [9]

Gene Neighborhood

The gene neighborhood of KIAA1109 involves 4 other genes. KIAA1109 is a part of the KIAA1109/Tenr/IL2/IL21 gene region. This region consists of the three genes to the right of KIAA1109; ADAD1, IL2 and IL21. [10] Another gene located in the neighborhood of KIAA1109 is TRPC3. This gene is to the left of KIAA1109 on the opposite side of the genes described above. [8]

Expression

According to data on NCBI's EST Abundance Profile page for KIAA1109, the gene is expressed in many different tissues in humans. Human expression is seen most predominately in parathyroid, muscle, ear, eye, mammary gland, lymph node, thymus in addition to 27 other tissues. KIAA1109 is also expressed in various disease states including 12 different tumors as well as bladder carcinoma, chondrosarcoma, glioma, leukemia, lymphoma, non-neoplasia, retinoblastoma tissues. [11] KIAA1109 is expressed in all stages of development from embryoid body to adult, except in infants. No expression of my gene is seen during the infant stage of development. [11]

Promoter

According to Genomatix's ElDorado program the promoter region of KIAA1109 is predicted to be 601 base pairs in length. The promoter region starts 500 base pairs upstream of the 5’ UTR of KIAA1109 mRNA transcript and contains part of this 5’ UTR. [12]

Homology

KIAA1109 is conserved throughout many species. Orthologs have been found in many mammals and other vertebrates. More distant homologs have been identified in animals such as insects. See the mRNA and protein conservation sections below for more details. No human paralogs for KIAA1109 have been identified. [13]

mRNA

Splice Variants

KIAA1109 has 13 mRNA splice variants and 6 unspliced variants. Variant A is the longest and most commonly occurring variant of the gene [14] and is the subject of this article. KIAA1109 variant A is made up of 84 exons and is 15,592 base pairs in length. [10] The accession number for this nucleotide is NM_015312.3.

Conservation

The mRNA sequence of KIAA1109 is highly conserved throughout mammals. The mRNA sequence identity to mammals is no less than 81.9% (in platypus) and ranging up to 99.5% (in chimpanzees). [15] Birds also show fairly high conservation with mRNA sequence identities around 78% in zebra finches. The table blow shows information on the mRNA orthologs.

Genus and species nameCommon namemRNA accession number [16] Sequence length (bp) [16] Sequence identity to human mRNA [15]
Homo sapiensHumanNM_015312.315592
Pan troglodytesChimpanzeeXM_517422.21557899.5%
Macaca mulattaRhesus macaqueXM_001102884.21552997.9%
Callithris jacchusMarmosetXM_002745433.11556697.2%
Equus caballusHorseXM_001915982.11558993.8%
Ailuropoda melanoleucaGiant pandaXM_002923821.11501891%
Oryctolagus cuniculusRabbitXM_002717235.11501590.7%
Mus musculusMouseNM_172679.21588388.2%
Monodelphis domesticaOpossumXM_001370569.11504882%
Ornithorhynchus anatinusPlatypusXM_001513933.11503981.9%
Taeniopygia guttataZebra finchXM_002188249.11548918.9%
Gallus gallusChickenXM_420625.21512378.5%
Tribolium castaneumRed flour beetleXM_967081.21379748.6%

Protein

General Properties

KIAA1109 protein is 5005 amino acids in length, [17] and has a predicted molecular weight of 555519.38 daltons. [18] The isoelectirc point of KIAA1109 protein is predicted to be 6.12. [19]

Composition

The amino acid composition of KIAA1109 protein showed amino acid frequencies within 1.5% of that of normal human proteins for all but Alanine, Serine and Threonine. Alanine has a lower frequency in KIAA1109 than in that of a normal human protein while Serine and Threonine both have a higher frequency in KIAA1109 than in the average human protein. [20]

Conservation

The amino acid sequence of KIAA1109 is highly conserved throughout mammals. The protein identity ranges from 93.2% in Opossum to 99.8% in Chimpanzees and protein similarity is no less than 97% in all mammals included. Birds continue to show fairly high conservation with protein identities around 90% and proteins similarities at a high 96%. While conservation is still high the lower numbers may be due to small truncations on either, the 5’ and 3’ ends of these sequences. [13]

As we move to the more distant species of zebra fish and then the red four beetle and carpenter ant the conservations drops. In the insects the protein identities are down to around 34%. [13]

Genus and species nameCommon nameProtein accession number [16] Sequence length (amino acids) [16] Sequence identity to human protein [13] Sequence similarity to human protein [13]
Homo sapiensHumanNP_056127.25005
Pan troglodytesChimpanzeeXP_517422.2500599.8%99.8%
Macaca mulattaRhesus macaqueXP_001102884.1500799.2%99%
Callithris jacchusMarmosetXP_002745479.1500498.9%99%
Equus caballusHorseXP_001916017.1500698%99%
Ailuropoda melanoleucaGiant pandaXP_002923867.1500598.1%99%
Oryctolagus cuniculusRabbitXP_002717281.1500497.8%99%
Mus musculusMouseNP_766267.2500596.7%99%
Canis familiarisDogXP_540963.2494496.4%99%
Monodelphis domesticaOpossumXP_001370606.1501593.2%97%
Ornithorhynchus anatinusPlatypusXP_001513983.1501293.3%97%
Taeniopygia guttataZebra finchXP_002188285.1499990.7%96%
Gallus gallusChickenXP_420625.2504089.9%96%
Danio rerioZebra fishNP_001139056.1492274.2%84%
Tribolium castaneumRed flour beetleXP_972174.2459834.8%49%
Camponotus floridanusCarpenter antEFN75044.1497934.3

Conserved Domains

NCBI conserved domains search identified two domains in KIAA1109. The first is the fragile site associated C-terminus, which is said to be linked to celiac disease susceptibility according to genome-wide-association studies and may also be associated with polycystic kidney disease. [21] The second conserved region identified by NCBI in KIAA1109 is an uncharacterized conserved protein (DUF2246), whose function is unknown and is conserved in various species from humans to worms. [22]

Post Translation Modifications

KIAA1109 is predicted to undergo various types of post translational modifications including glycate, N-glycosylation, O-GlcNAc, O Glycosylation, Sulfonation and Phosphorylation. [23]

Subcellular Localization

KIAA1109 contains one transmembrane domain from amino acids 26–46. [17] No signal peptides, mitochondrial targeting sequences or chloroplast peptides were predicted for my protein and it is therefore not predicted to localize to secretory pathway, mitochondria or chloroplast. [24]

Interacting Proteins

MADH2 and Beta-catenin were both found to have a physical interaction with my protein as detached by display technonloy by Miyamoto-Sato et al. 2010. [25] [26]

Related Research Articles

<span class="mw-page-title-main">Morn repeat containing 1</span> Protein-coding gene in the species Homo sapiens

MORN1 containing repeat 1, also known as Morn1, is a protein that in humans is encoded by the MORN1 gene.

<span class="mw-page-title-main">KIAA0895</span> Protein-coding gene in the species Homo sapiens

KIAA0895 is a protein that in Homo sapiens is encoded by the KIAA0895 gene. The gene encodes a protein commonly known as the KIAA0895 protein. Its aliases include hypothetical protein LOC23366, OTTHUMP00000206979, OTTHUMP00000206980, 9530077C05Rik, and 1110003N12Rik. It is located at 7p14.2.

<span class="mw-page-title-main">LRRC40</span> Protein-coding gene in the species Homo sapiens

Leucine rich repeat containing 40 (LRRC40) is a protein that in humans is encoded by the LRRC40 gene.

<span class="mw-page-title-main">KIAA1704</span> Protein-coding gene in the species Homo sapiens

KIAA1704, also known as LSR7, is a protein that in humans is encoded by the GPALPP1 gene. The function of KIAA1704 is not yet well understood. KIAA1704 contains one domain of unknown function, DUF3752. The protein contains a conserved, uncharged, repeated motif GPALPP(GF) near the N terminus and an unusual, conserved, mixed charge throughout. It is predicted to be localized to the nucleus.

<span class="mw-page-title-main">TMEM69</span> Protein-coding gene in the species Homo sapiens

TMEM69, also known as Transmembrane protein 69, is a protein that in humans is encoded by the TMEM69 gene. A notable feature of the protein encoded by TMEM69 is the presence of five transmembrane segments.

<span class="mw-page-title-main">KIAA0922</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 131-like, alternatively named uncharacterized protein KIAA0922, is an integral transmembrane protein encoded by the human gene KIAA0922 that is significantly conserved in eukaryotes, at least through protists. Although the function of this gene is not yet fully elucidated, initial microarray evidence suggests that it may be involved in immune responses. Furthermore, its paralog, prolyl endopeptidase (PREP) whose function is known, provides clues as to the function of TMEM131L.

<span class="mw-page-title-main">QSER1</span> Protein-coding gene in the species Homo sapiens

Glutamine Serine Rich Protein 1 or QSER1 is a protein encoded by the QSER1 gene.

<span class="mw-page-title-main">CCDC47</span> Protein-coding gene in humans

Coiled-coil domain 47 (CCDC47) is a gene located on human chromosome 17, specifically locus 17q23.3 which encodes for the protein CCDC47. The gene has several aliases including GK001 and MSTP041. The protein itself contains coiled-coil domains, the SEEEED superfamily, a domain of unknown function (DUF1682) and a transmembrane domain. The function of the protein is unknown, but it has been proposed that CCDC47 is involved in calcium ion homeostasis and the endoplasmic reticulum overload response.

<span class="mw-page-title-main">Proser2</span> Protein-coding gene in the species Homo sapiens

PROSER2, also known as proline and serine rich 2, is a protein that in humans is encoded by the PROSER2 gene. PROSER2, or c10orf47(Chromosome 10 open reading frame 47), is found in band 14 of the short arm of chromosome 10 (10p14) and contains a highly conserved SARG domain. It is a fast evolving gene with two paralogs, c1orf116 and specifically androgen-regulated gene protein isoform 1. The PROSER2 protein has a currently uncharacterized function however, in humans, it may play a role in cell cycle regulation, reproductive functioning, and is a potential biomarker of cancer.

<span class="mw-page-title-main">C6orf201</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 201, C6orf201, is a protein that in humans is encoded by the C6orf201 gene. In humans this gene encodes for a nuclear protein that is primarily expressed in the testis.

C6orf222 is a protein that in humans is encoded by the C6orf222 gene (6p21.31). C6orf222 is conserved in mammals, birds and reptiles with the most distant ortholog being the green sea turtle, Chelonia mydas. The C6orf222 protein contains one mammalian conserved domain: DUF3293. The protein is also predicted to contain a BH3 domain, which has predicted conservation in distant orthologs from the clade Aves.

<span class="mw-page-title-main">Transmembrane protein 268</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 268 is a protein that in humans is encoded by TMEM268 gene. The protein is a transmembrane protein of 342 amino acids long with eight alternative splice variants. The protein has been identified in organisms from the common fruit fly to primates. To date, there has been no protein expression found in organisms simpler than insects.

<span class="mw-page-title-main">FAM163A</span> Protein-coding gene in the species Homo sapiens

FAM163A, also known as cebelin and neuroblastoma-derived secretory protein (NDSP) is a protein that in humans is encoded by the FAM163A gene. This protein has been implicated in promoting proliferation and anchorage-independent growth of neuroblastoma cancer cells. In addition, this protein has been found to be up-regulated in the lung tissue of chronic smokers. FAM163A is found on human chromosome 1q25.2; its protein product is 167 amino acids long. FAM163A contains a very highly conserved signal peptide sequence, coded for by the first ~37 amino acids in its sequence; albeit only conserved in eukaryotes, the most distant of which being the Japanese Rice Fish.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C9orf50</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

<span class="mw-page-title-main">SMIM19</span> Protein-coding gene in the species Homo sapiens

SMIM19, also known as Small Integral Membrane Protein 19, encodes the SMIM19 protein. SMIM19 is a confirmed single-pass transmembrane protein passing from outside to inside, 5' to 3' respectively. SMIM19 has ubiquitously high to medium expression with among varied tissues or organs. The validated function of SMIM19 remains under review because of on sub-cellular localization uncertainty. However, all linked proteins research to interact with SMIM19 are associated with the endoplasmic reticulum (ER), presuming SMIM19 ER association

<span class="mw-page-title-main">C5orf24</span> Protein-coding gene in the species Homo sapiens

C5orf24 is a protein encoded by the C5orf24 gene (5q31.1) in humans. C5orf24 is primarily localized to the nucleus and is highly conserved with orthologs in mammals, birds, reptiles, amphibians, and fish.

<span class="mw-page-title-main">TMEM212</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 212 is a protein that in humans is encoded by the TMEM212 gene. The protein consists of five transmembrane domains and localizes in the plasma membrane and endoplasmic reticulum. TMEM212 has orthologs in vertebrates but not invertebrates. TMEM212 has been associated with sporadic Parkinson's disease, facial processing, and adiposity in African Americans.

<span class="mw-page-title-main">C13orf42</span> C13orf42 gene page

C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.

<span class="mw-page-title-main">SCRN3</span> Protein-coding gene in the species Homo sapiens

Secernin-3 (SCRN3) is a protein that is encoded by the human SCRN3 gene. SCRN3 belongs to the peptidase C69 family and the secernin subfamily. As a part of this family, the protein is predicted to enable cysteine-type exopeptidase activity and dipeptidase activity, as well as be involved in proteolysis. It is ubiquitously expressed in the brain, thyroid, and 25 other tissues. Additionally, SCRN3 is conserved in a variety of species, including mammals, birds, fish, amphibians, and invertebrates. SCRN3 is predicted to be an integral component of the cytoplasm.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000138688 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000037270 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Kikuno R, Nagase T, Ishikawa K, Hirosawa M, Miyajima N, Tanaka A, Kotani H, Nomura N, Ohara O (June 1999). "Prediction of the coding sequences of unidentified human genes. XIV. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro". DNA Res. 6 (3): 197–205. doi: 10.1093/dnares/6.3.197 . PMID   10470851.
  6. Kuo MT, Wei Y, Yang X, Tatebe S, Liu J, Troncoso P, Sahin A, Ro JY, Hamilton SR, Savaraj N (Jan 2006). "Association of fragile site-associated (FSA) gene expression with epithelial differentiation and tumor development". Biochem Biophys Res Commun. 340 (3): 887–93. doi:10.1016/j.bbrc.2005.12.088. PMID   16386706.
  7. "Entrez Gene: KIAA1109 KIAA1109".
  8. 1 2 "NCBI, National Center for Biotechnology" . Retrieved 5 Feb 2011.
  9. "Genecards" . Retrieved 9 May 2011.
  10. 1 2 "NCBI, National Center for Biotechnology" . Retrieved 5 Feb 2011.
  11. 1 2 "NCBI, UniGene. EST Profile".
  12. "Genomatix, ElDorado" . Retrieved 3 April 2011.
  13. 1 2 3 4 5 "BLAST, NCBI" . Retrieved 10 March 2011.
  14. "AceView, NCBI" . Retrieved 2 April 2011.
  15. 1 2 "ALIGN, SDSC Biology Workbench" . Retrieved 12 March 2011.
  16. 1 2 3 4 "NCBI National Center for Biotechnology Information" . Retrieved 9 May 2011.
  17. 1 2 "Protein, NCBI" . Retrieved 18 March 2011.
  18. "AASTATS, SDSC Biology WorkBench" . Retrieved 23 April 2011.[ permanent dead link ]
  19. "PI/Mw, ExPasy". Archived from the original on 2003-10-03. Retrieved 6 May 2011.
  20. "CLC Protein Workbench 5.5.5" . Retrieved 6 May 2011.
  21. "Conserved Domains, NCBI" . Retrieved 9 May 2011.
  22. "Conserved Domains, NCBI" . Retrieved 9 May 2011.
  23. "ExPasy Tools" . Retrieved 21 April 2011.
  24. "ChloroP, MITOPROT, Signal P and PTS1. ExPasy" . Retrieved 21 April 2011.
  25. Miyamoto-Sato E, Fujimori S, Ishizaka M, Hirai N, Masuoka K, Saito R, Ozawa Y, Hino K, Washio T, Tomita M, Yamashita T, Oshikubo T, Akasaka H, Sugiyama J, Matsumoto Y, Yanagawa H (February 2010). "A comprehensive resource of interacting protein regions for refining human transcription factor networks". PLOS ONE. 5 (2): e9289. Bibcode:2010PLoSO...5.9289M. doi: 10.1371/journal.pone.0009289 . PMC   2827538 . PMID   20195357.
  26. "9 binary interactions found for search term KIAA1109". IntAct Molecular Interaction Database. EMBL-EBI. Retrieved 2018-08-25.

Further reading