KIAA0895

Last updated
KIAA0895
Identifiers
Aliases KIAA0895 , 9530077C05Rik, 1110003N12Rik, Kiaa0895, mKIAA0895
External IDs MGI: 1915533 HomoloGene: 19280 GeneCards: KIAA0895
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_026739
NM_183273
NM_001378982
NM_001378983

RefSeq (protein)

NP_081015
NP_001365911
NP_001365912

Location (UCSC) Chr 7: 36.32 – 36.39 Mb Chr 9: 22.32 – 22.36 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

KIAA0895 is a protein that in Homo sapiens is encoded by the KIAA0895 gene. The gene encodes a protein commonly known as the KIAA0895 protein. Its aliases include hypothetical protein LOC23366, OTTHUMP00000206979, OTTHUMP00000206980, 9530077C05Rik, and 1110003N12Rik. [5] It is located at 7p14.2. [6]

Contents

Research into the KIAA proteins has shown that they are similar to known genes with functions related to cell signaling/communication, cell structure/motility and nucleic acid management. [7]

Gene

Locus

The KIAA0895 gene is located at 7p14.2. [6] The genomic DNA is 65,976 base pairs long, [8] while the longest mRNA that it produces is 4463 bases long.

It can be transcribed into 15 transcript variants, which in turn can produce 13 different isoforms of the protein. [9]

The location of the KIAA0895 gene on chromosome 7 KIAA0895location.png
The location of the KIAA0895 gene on chromosome 7

Gene Neighborhood

KIAA0895 is surrounded by the following genes on chromosome 7: [8]

Size of gene

The gene encoded for the KIAA0895 protein is 65,975 nucleotides long, from nucleotides 36324150 to 36390125, with seven exons.

mRNA

There are ten different isoforms for KIAA0895. [6]

Protein

The longest protein isoform that is produced by the KIAA0895 gene is termed LOC23366 isoform 1 and is 520 amino acids long. [10] The predicted molecular weight is 61kDa. [11] Additionally, the theoretical isoelectric point is 10. [11]

Amino acid composition

KIAA0895 is a lysine and arginine semi-enriched protein. [12] KIAA0895 is semi-enriched in positively charged lysine and arginine groups, and positively and negatively charged lysine, arginine, glutamic acid and aspartic acid groups. [12] However, KIAA0895 is semi-depleted in non-polar alanine, glycine and proline groups. [12]

The charge distribution analysis shows that there are no negative or mixed charge clusters. [12] However, there is one positive charge cluster from amino acids 12 to 36. [12]

Domain of unknown function 1704
Identifiers
SymbolDUF1704
Pfam PF08014
InterPro IPR012548
Available protein structures:
Pfam   structures / ECOD  
PDB RCSB PDB; PDBe; PDBj
PDBsum structure summary

Regions

LOC23366 contains a protein domain of unknown function called DUF1704. [13] It also contains a region of low complexity from position 120 to position 150 in the protein, [14] and an arginine-rich area from position 12 to position 51. [15]

Promoters

Using ElDorado by Genomatrix, a promoter region sequence was found. [16] The most likely promoter for KIAA0895 starts at 36389926 and goes to 36391149, with a length of 1224. [16]

The predicted serine, threonine, and tyrosine phosphorylation sites of the KIAA0895 protein Kiaa0895 phosphorylation sites.gif
The predicted serine, threonine, and tyrosine phosphorylation sites of the KIAA0895 protein

Post-translational modification

KIAA0895 is predicted to undergo phosphorylation at several serines, threonines, and tyrosines throughout its structure. [17] Phosphorylation at these sites is a form of gene regulation. Phosphorylation results in a conformational change in the structure of many enzymes and receptors. This causes them to become activated or deactivated.

Stem loops

Using a database called Mfold, a stem-loop formation for the 5' UTR region shows that there is a lack of conservation, meaning there is some precedence that these stems and loops are not used for translation regulation. [18] The ΔG value was -12.90 kcal/mol with three loops. [18] Using the same database, a stem-loop formation for the 3' UTR region shows that there is conservation, meaning there is some precedence that they are used for translation regulation. [18] The ΔG value was -636.80 kcal/mol. [18]

Tertiary structure

KIAA0895 has a tertiary structure with alpha helices and beta sheets. [19]

Proposed Tertiary Structure for KIAA0895. Tertiary Structure of KIAA0895.gif
Proposed Tertiary Structure for KIAA0895.
Proposed Tertiary Structure for 24% of KIAA0895. Image coloured by rainbow N - C terminus. KIAA0895TertiaryStructure.png
Proposed Tertiary Structure for 24% of KIAA0895. Image coloured by rainbow N → C terminus.

Interacting proteins

There are three proteins likely to be interacting proteins with KIAA0895. These proteins are ELAVL1, [20] vata, [21] and glym. [21] These interactions have experimental evidence from the sources provided.

Expression

KIAA0895 is most commonly found in the testis, however it also has a strong expression in the kidneys, adrenal glands, and brain. [6]

RNA-seq tissue data is reported as mean TPM (transcripts per million). KIAA0895 Expression..png
RNA-seq tissue data is reported as mean TPM (transcripts per million).
The Mean RPKM values for KIAA0895 in 27 different tissue types. Human tissue-specific expression of KIAA0895.png
The Mean RPKM values for KIAA0895 in 27 different tissue types.

Per Health State

Using an EST profile from NCBI, KIAA0895 has strong expression in cervical tumors and bladder carcinoma. [22]

An EST profile was created by NCBI. EST profiles show approximate gene expression patterns as inferred from EST counts and the cDNA library sources. EST Health State Profile for KIAA0895.png
An EST profile was created by NCBI. EST profiles show approximate gene expression patterns as inferred from EST counts and the cDNA library sources.

Interacting proteins

Using three different databases, three different interacting proteins were found. These include ELAVL1, VATA, and GLYM. [21] There was experimental evidence for all three of these interacting proteins.

Homologs and orthologs

KIAA0895 has over 228 orthologs. [6] Orthologs have been found in mammals and eukaryotes. [23] There are homologs in 9 species. [6] The full list of organisms in which homologs have been found is given below.

Paralogs

KIAA0895 has 7 paralogs in Homo sapiens: [23]

Related Research Articles

<span class="mw-page-title-main">TMEM242</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 242 (TMEM242) is a protein that in humans is encoded by the TMEM242 gene. The tmem242 gene is located on chromosome 6, on the long arm, in band 2 section 5.3. This protein is also commonly called C6orf35, BM033, and UPF0463 Transmembrane Protein C6orf35. The tmem242 gene is 35,238 base pairs long, and the protein is 141 amino acids in length. The tmem242 gene contains 4 exons. The function of this protein is not well understood by the scientific community. This protein contains a DUF1358 domain.

Transmembrane protein 241 is a ubiquitous sugar transporter protein which in humans is encoded by the TMEM241 gene.

<span class="mw-page-title-main">SHOC1</span> Protein-coding gene in the species Homo sapiens

Shortage In Chiasmata 1, also known as SHOC1, is a protein that in humans is encoded by the SHOC1 gene.

<span class="mw-page-title-main">C11orf52</span> Protein-coding gene in the species Homo sapiens

C11orf52 is an uncharacterized protein that in homo sapiens is encoded by the C11orf52 gene.

<span class="mw-page-title-main">Transmembrane protein 255A</span> Mammalian protein found in Homo sapiens

Transmembrane protein 255A is a protein that is encoded by the TMEM255A gene. TMEM255A is often referred to as family with sequence similarity 70, member A (FAM70A). The TMEM255A protein is transmembrane and is predicted to be located the nuclear envelope of eukaryote organisms.

<span class="mw-page-title-main">C8orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

<span class="mw-page-title-main">TEX9</span> Protein-coding gene in the species Homo sapiens

Testis-expressed protein 9 is a protein that in humans is encoded the TEX9 gene. TEX9 that encodes a 391-long amino acid protein containing two coiled-coil regions. The gene is conserved in many species and encodes orthologous proteins in eukarya, archaea, and one species of bacteria. The function of TEX9 is not yet fully understood, but it is suggested to have ATP-binding capabilities.

<span class="mw-page-title-main">C16orf86</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.

<span class="mw-page-title-main">GOLGA8H</span>

Golgin subfamily A member 8H, also known as GOLGA8H, is a protein that in Homo sapiens is encoded by the GOLGA8H gene. Function of the GOLGA8H involves a process that is carried out at the cellular level which results in the assembly, arrangement of constituent parts, or disassembly of the Golgi apparatus.

LOC101928193 is a protein which in humans is encoded by the LOC101928193 gene. There are no known aliases for this gene or protein. Similar copies of this gene, called orthologs, are known to exist in several different species across mammals, amphibians, fish, mollusks, cnidarians, fungi, and bacteria. The human LOC101928193 gene is located on the long (q) arm of chromosome 9 with a cytogenic location at 9q34.2. The molecular location of the gene is from base pair 133,189,767 to base pair 133,192,979 on chromosome 9 for an mRNA length of 3213 nucleotides. The gene and protein are not yet well understood by the scientific community, but there is data on its genetic makeup and expression. The LOC101928193 protein is targeted for the cytoplasm and has the highest level of expression in the thyroid, ovary, skin, and testes in humans.

Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.

Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

<span class="mw-page-title-main">C16orf90</span> Protein-coding gene in the species Homo sapiens

C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.

<span class="mw-page-title-main">C7orf50</span> Mammalian protein found in Homo sapiens

C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.

<span class="mw-page-title-main">C17orf78</span> Mammalian protein found in Homo sapiens

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">CCDC121</span> Protein found in humans

Coiled-coil domain containing 121 (CCDC121) is a protein encoded by the CCDC121 gene in humans. CCDC121 is located on the minus strand of chromosome 2 and encodes three protein isoforms. All isoforms of CCDC121 contain a domain of unknown function referred to as DUF4515 or pfam14988.

<span class="mw-page-title-main">C5orf24</span> Protein-coding gene in the species Homo sapiens

C5orf24 is a protein encoded by the C5orf24 gene (5q31.1) in humans. C5orf24 is primarily localized to the nucleus and is highly conserved with orthologs in mammals, birds, reptiles, amphibians, and fish.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000164542 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000036411 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "GeneCards: KIAA0895 Gene" . Retrieved 2011-04-24.
  6. 1 2 3 4 5 6 "KIAA0895 KIAA0895 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-02-24.
  7. Nagase T, Ishikawa K, Suyama M, Kikuno R, Hirosawa M, Miyajima N, Tanaka A, Kotani H, Nomura N, Ohara O (December 1998). "Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro". DNA Research. 5 (6): 355–64. doi: 10.1093/dnares/5.6.355 . PMID   10048485.
  8. 1 2 "Entrez Gene: KIAA0895" . Retrieved 2011-04-24.
  9. "NCBI AceView: KIAA0895" . Retrieved 2011-04-24.
  10. "NCBI Protein: LOC23366 isoform 1" . Retrieved 2011-05-09.
  11. 1 2 "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2019-04-17.
  12. 1 2 3 4 5 "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2019-04-17.
  13. "Wellcome Trust Sanger Institute, Pfam". Archived from the original on 2014-06-17. Retrieved 2011-05-09.
  14. "MyHits Dotlet" . Retrieved 2011-05-09.
  15. "Uniprot" . Retrieved 2011-05-09.
  16. 1 2 "ElDorado Introduction". www.genomatix.de. Retrieved 2019-05-13.
  17. "DTU Center for Biological Sciences, NetPhos" . Retrieved 2011-05-09.
  18. 1 2 3 4 "The Mfold Web Server | mfold.rit.albany.edu". unafold.rna.albany.edu. Retrieved 2019-04-17.
  19. "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk. Retrieved 2019-04-20.
  20. "mentha: the interactome browser". mentha.uniroma2.it. Retrieved 2019-04-20.
  21. 1 2 3 "2 binary interactions found for search term KIAA0895". IntAct Molecular Interaction Database. Retrieved 2019-04-20.
  22. "Tissue expression of KIAA0895 - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2019-05-13.
  23. 1 2 "NCBI BLAST" . Retrieved 2011-05-09.