KIAA1143

Last updated
KIAA1143
Identifiers
Aliases KIAA1143
External IDs MGI: 1913452 HomoloGene: 10791 GeneCards: KIAA1143
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_020696
NM_001320334

NM_025419

RefSeq (protein)

NP_001307263
NP_065747

NP_079695

Location (UCSC) Chr 3: 44.74 – 44.76 Mb Chr 9: 122.77 – 122.78 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse
3D AI generated tertiary structure of the protein KIAA1143 generated by Alpha fold. KIA1143 Predicted Tertiary Structure 3D image.png
3D AI generated tertiary structure of the protein KIAA1143 generated by Alpha fold.
Poly clonal Antibody Staining of KIAA1143 Protein in green staining. Micro tubules are counter stained in red. Shows localization to nucleus and cytokinetic bridge. Immunohistochemistry Staining of KIAA1143 Showing Pressence in Cytokinetic Bridge.jpg
Poly clonal Antibody Staining of KIAA1143 Protein in green staining. Micro tubules are counter stained in red. Shows localization to nucleus and cytokinetic bridge.

KIAA1143 is an uncharacterized protein in humans that is encoded by the KIAA1143 gene. [5] [6] it may play a role in cell growth mechanisms and regulation/creation of cytoskeletal structure. This gene is located on chromosome 3 on the minus strand

Contents

This protein has a function that is not yet objectively understood. KIAA1143 has no alias and has the longest most functional variant named as uncharacterized protein KIAA1143 isoform 1. [7] The mature mRNA transcript is 5079 Base pairs long while the length of the KIAA1143 protein is 154 amino acids. KIAA1143 has another transcript variant called KIAA1143 variant 2, which contains an alternate 3' terminal exon, resulting in a distinct 3' coding region and 3' UTR, compared to variant 1. The encoded isoform 2 has a distinct C-terminus and is shorter than isoform 1. The KIAA1143 protein belongs to the uncharacterized protein KIAA1143-like Family, and contains DUF4604 domain of unknown function. KIAA1143 has a predicted function of cell structure/mobility by encoding heavy neurofilament subunits in neurons. [8] Immunohistochemistry staining from Thermo Fisher Scientific shows presence of KIAA1143 in a cytokinetic bridge, which is involved in cellular cytokinesis. This indicates KIAA1143 may have a role in regulation of cellular division and/or communication as well.

Gene

Location

KIAA1143 is found on chromosome 3 on the short arm (3p21) and has 3 exons. KIAA1143 is on the Minus Strand. [9] KIAA1143 is on the sense strand and spans from bases 44,690,802-44,819,561. KIAA1143 is located in the gene neighborhood of Zinc finger protein 502, ZNF501, and KIF15 [10]

Transcriptional Regulation

The KIAA1143 was found by using 500 bp of nucleotides upstream using UCSC Genome Browser [9] A number of transcription factors with a matrix similarity greater than or equal to 0.5 that are predicted to regulate transcription of KIAA1143 are listed below with their respective binding site:

Transcription FactorBinding SiteStrand
Grainy Head like 2 (GHRL2) (Grainyhead-like gene family)ACAGAAGA+
Zinc Finer Protein 317 (ZNF317)AACCTGTC-
JunD AGTTGACGTCA-
CREMAGTGACGTCAC and GTCACTGCAGT+ and -
ATF3 AGTGACGTCAC and GTCACTGCAGT+ and -
ATF7 AGTGACGTCAC and GTCACTGCAGT+ and -
Jun dimerization protein (JDP2)AGTGACGTCAC and GTCACTGCAGT+ and -
FOS::JUN 9(Transcription factor Jun)ACTGCAGT+
ZNF682CCCCGCACCGG+
KLF13 TGGAACGCC+
Kruppel like factor 16 GCCCGCCAGG+
KLF10 CGGGCGGTCC-
YY2GGCGGCC+
LIN54CTTTGAGC-
Expression of KIAA1143 in human tissue samples KIA1143 Tissue Expression.jpg
Expression of KIAA1143 in human tissue samples

Expression

KIAA1143 is expressed in all tissues, however, The expression of KIAA1143 is highest in the Ovaries, followed by the brain, thyroid, prostate, and urinary bladder [11]

KIAA1143 is predicted to have subcellular localization in the nucleus [12] [13]

mRNA

Characteristics of Isoform 1

Isoform 1 of KIAA1143 has a 5' UTR region of 17 base pairs and a 3' UTR region of 4597 base pairs. The transcript is 5079 base pairs long [14]

Additional Primary Sequence and Variants (Isoforms)

KIAA1143 has another uncharacterized isoform 2 which contains an alternate 3' terminal exon, resulting in a distinct 3' coding region and 3' UTR, compared to variant 1. The encoded isoform 2 has a distinct C-terminus and is shorter than isoform 1. [15]

Protein

The theoretical molecular weight of the 154amino acid KIAA1143 protein is 17.5kDa and the theoretical pI is 5.84. [12]

Domains, Motifs, and Secondary Structure

Since KIAA1143 is an gene with unknown function, most of the coding gene apart from the promotor and translation start site area is a Domain of unknown function, specifically, DUF4604 Spans amino acids 5-151. KIAA1143 has no Cysteine or tyrosine residues. Cysteine and tyrosine are very good nucleophiles, since they are not present, this gives some light into the possible electrophilic nature of the active site. KIAA1143 has some important Eukaryotic Linear Motif resource (ELM) Domains which give insight into its function. These ELM domains are LIG_BIR_II_1 from amino acids 1-5 and LIG_WRC_WIRS_1 from amino acids 144-149. These domains have importance in apoptotic regulation and actin cytoskeleton rearrangement mechanisms respectively [16] [17]

From Positions 1-10 there is also something called a BIG1 Big-1 (bacterial Ig-like domain 1) domain. Big-1 proteins are surface-expressed proteins that mediate mammalian host cell invasion or attachment. The tandem of Ig-like domains appears to form a rod to link the bacterial outer membrane anchor to the C-terminal lectin-like domain to interact with their receptors in the host cell membrane [18]

Post-Translational Modifications

KIAA1143 is shown to be phosphorylated at positions 2, 50, 68, 113, 115, 116 at either serine or threonine residues. There is a sumoylation consensus at position 76, as well as O-GlcNAc attachment at position 8. There is also an N-myristoylation site from 111-116 [19] [20] [21]

KIAA1143 Protein Isoform 1 significant ELM motifs, and important post translational modifications Post Translational Modifications diagram.jpg
KIAA1143 Protein Isoform 1 significant ELM motifs, and important post translational modifications
KIAA1143 iTASSER tertiary structure predictive modeling KIAA1143 Itasser.jpg
KIAA1143 iTASSER tertiary structure predictive modeling

Tertiary Structures

The KIAA1143 Tertiary structure is predicted below through iTASSER modeling, with a C score of -2.83. Coloring is similar to alphafold confidence graphing [22]

Protein Interactions

KIAA1143 is experimentally determined to have interactions with EAPP (E2F-associated phosphoprotein), ECD (Ecdysoneless Cell Cycle Regulator), GPATCH1 (Evolutionarily Conserved G-Patch Domain-Containing Protein), PRPF8 (Pre-MRNA-Processing-Splicing Factor 8), WDR83 (Mitogen-Activated Protein Kinase Organizer 1), CEP76 (centrosomal protein 76), and APP (Amyloid-beta precursor protein).

Homology and Evolution

Paralogs

There are no paralogs for KIAA1143

Orthologs

KIAA1143 has homologs in over 200 other organisms, including vertebrates, invertebrates, archaea, KIAA1143 is found in clades of organisms except land plants.

Genus speciesCommon NameTaxonomic GroupDivergence (MYA)Accession NumberSeq. Length (aa)Corr. ID to HP (%)Corr. Sim. To HP (%)
Homo sapiens Human primates 0NP_065747.1154100100
Mus musculus House Mouse Rodentia 87NP_079695.21548793
Pogona vitticeps Central Bearded Dragon Reptilla 319XP_020663315.11527284
Sphaerodactylus townsendi townsend's least gecko Reptilla 319XP_048366794.11546881
Zootoca vivipara Common Lizzard Reptila 319XP_034985594.11536477
Python bivittatus burmese python Reptilla 319XP_007437594.11516477
Nothoprocta perdicaria chilean tinamou Aves 319XP_025890652.11596373
Oxyura jamaicensis Ruddy Duck Aves 319XP_035173671.11595471
Aptenodytes forsteri Emperor Penguin Aves 319XP_019330441.11595368
Catharus ustulatus Swainson's thrush Aves 319XP_032933984.11755366
Rhinatrema bivittatum two-lined caecilian (Gymnophiona) 353XP_029445556.11565876
Geotrypetes seraphini gaboon caecilian (Gymnophiona) 353XP_033786333.11565878
Rana temporaria Common frog (anura) 353XP_040209069.11525875
Bufo Bufo Common Toad Amphibia (anura)353XP_040287819.11535774
Latimeria Chalumnae West Indian Ocean coelacanth Sarcoptergyii (lobe-finned fish)414XP_005999062.11514663
Gambusia affinis mosquitofish Cyprinodontiformes (ray-finned fish)431XP_043973326.11604863
Clupea harengusAtlantic Herring Clupeiformes (ray-finned fish)431XP_012692935.21544768
Scyliorhinus canicula Small Spotted catshark Carcharhiniformes (cartilagenous fish)464XP_038652524.11525879
Amblyraja radiataThorny skate Elasmobranchii (cartilagenous fish)464XP_032871011.11535068
Petromyzon marinus sea lamprey Agnatha (jawless fish)599XP_032833525.11784967

The relative rate of change for KIAA1143 is fairly slow compared to fibrinogen and beta-globin, but not as slow as cytochrome c.

KIAA1143s mutation rate compared to fibrinogen, beta-globin, and cytochrome c. Evolution Rate Graph for KIAA1143.png
KIAA1143s mutation rate compared to fibrinogen, beta-globin, and cytochrome c.

Homologus Domains

The DUF4604 domain in KIAA1143 is conserved across all organisms

Phylogeny

All of the orthologs of KIAA1143 are derived from the same common ancestor

Clinical Significance

Pathology

No diseases have been shown to be directly linked to KIAA1143 to this date.

Disease Association

No Disease Association is observed with KIAA1143 to date.

Related Research Articles

<span class="mw-page-title-main">HIKESHI</span> Protein-coding gene in the species Homo sapiens

HIKESHI is a protein important in lung and multicellular organismal development that, in humans, is encoded by the HIKESHI gene. HIKESHI is found on chromosome 11 in humans and chromosome 7 in mice. Similar sequences (orthologs) are found in most animal and fungal species. The mouse homolog, lethal gene on chromosome 7 Rinchik 6 protein is encoded by the l7Rn6 gene.

<span class="mw-page-title-main">Tetratricopeptide repeat protein 39B</span> Protein-coding gene in the species Homo sapiens

Tetratricopeptide repeat protein 39B is a protein that in humans is encoded by the TTC39B gene. TTC39B is also known as C9orf52 or FLJ33868. The main feature within tetratricopeptide repeat 39B is the domain of unknown function 3808 (DUF3808), spanning the majority of the protein.

<span class="mw-page-title-main">TMCO6</span> Protein-coding gene in the species Homo sapiens

Transmembrane and coiled-coil domain 6, TMCO6, is a protein that in humans is encoded by the TMCO6 gene with aliases of PRO1580, HQ1580 or FLJ39769.1.

<span class="mw-page-title-main">Glutamate rich 5</span> Protein-coding gene in the species Homo sapiens

Glutamate rich protein 5 is a protein in humans encoded by the ERICH5 gene, also known as chromosome 8 open reading frame 47 (C8orf47).

<span class="mw-page-title-main">FANCD2OS</span> Protein-coding gene in the species Homo sapiens

Fanconi Anemia Opposite Strand Transcript protein is a predicted protein that in humans is encoded by the FANCD2OS gene. The name is derived from mRNA transcribed from the strand complementary to the FANCD2 gene.

<span class="mw-page-title-main">C6orf62</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">C8orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.

<span class="mw-page-title-main">C16orf86</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.

<span class="mw-page-title-main">C9orf50</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

LOC101928193 is a protein which in humans is encoded by the LOC101928193 gene. There are no known aliases for this gene or protein. Similar copies of this gene, called orthologs, are known to exist in several different species across mammals, amphibians, fish, mollusks, cnidarians, fungi, and bacteria. The human LOC101928193 gene is located on the long (q) arm of chromosome 9 with a cytogenic location at 9q34.2. The molecular location of the gene is from base pair 133,189,767 to base pair 133,192,979 on chromosome 9 for an mRNA length of 3213 nucleotides. The gene and protein are not yet well understood by the scientific community, but there is data on its genetic makeup and expression. The LOC101928193 protein is targeted for the cytoplasm and has the highest level of expression in the thyroid, ovary, skin, and testes in humans.

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

<span class="mw-page-title-main">C7orf50</span> Mammalian protein found in Homo sapiens

C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.

<span class="mw-page-title-main">C17orf78</span> Mammalian protein found in Homo sapiens

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">FAM120AOS</span> Protein-coding gene in the species Homo sapiens

FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.

<span class="mw-page-title-main">FAM166C</span>

Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.

<span class="mw-page-title-main">TBC1D30</span> Protein-coding gene in the species Homo sapiens

TBC1D30 is a gene in the human genome that encodes the protein of the same name. This protein has two domains, one of which is involved in the processing of the Rab protein. Much of the function of this gene is not yet known, but it is expressed mostly in the brain and adrenal cortex.

<span class="mw-page-title-main">UPF0602</span> Human gene

UPF0602 is a protein in humans that is encoded by the chromosome 4 open reading frame 47 (c4orf47) gene.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">LRRC74A</span> Protein-coding gene

Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.

References

  1. 1 2 3 ENSG00000281665 GRCh38: Ensembl release 89: ENSG00000163807, ENSG00000281665 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000032551 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Strausberg RL, Feingold EA, Grouse LH, Derge JG, Klausner RD, Collins FS, et al. (December 2002). "Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences". Proceedings of the National Academy of Sciences of the United States of America. 99 (26): 16899–16903. Bibcode:2002PNAS...9916899M. doi: 10.1073/pnas.242603899 . PMC   139241 . PMID   12477932.
  6. Yu H, Tardivo L, Tam S, Weiner E, Gebreab F, Fan C, et al. (June 2011). "Next-generation sequencing to generate interactome datasets". Nature Methods. 8 (6): 478–480. doi:10.1038/nmeth.1597. PMC   3188388 . PMID   21516116.
  7. "KIAA1143 [ Homo sapiens (human) ]". National Center For Biotechnology Information. Retrieved 18 September 2022.
  8. "Characterization of cDNA clones selected by the GeneMark analysis from size-fractionated cDNA libraries from human brain [ Homo sapiens (human) ]". Oxford Academy. Retrieved 7 December 2022.
  9. 1 2 "KIAA1143 information on Gene Cards". Gene Cards. Retrieved 13 December 2022.
  10. "KIAA1143 BLAT Browsing". USCS Genome Browser. Retrieved 13 December 2022.
  11. "HPA RNA-seq of KIAA1143 to determine tissue specificity" . Retrieved 13 December 2022.
  12. 1 2 "Compositional analysis of KIAA1143 using SAPS tool" . Retrieved 13 December 2022.
  13. "KIAA1143 PSORT II for various orthologs" . Retrieved 13 December 2022.[ permanent dead link ]
  14. "NCBI (National Center for Biotechnology Information) entry on Homo Sapiens KIAA1143 transcript variant 1, mRNA". June 2022. Retrieved 13 December 2022.
  15. "KIAA1143 Gene Entry on NCBI" . Retrieved 13 December 2022.
  16. "ELM - Detail for LIG_BIR_II_1". ELM The Eukaryotic Linear Motif resource for Functional Sites in Proteins. Retrieved 16 December 2022.
  17. "The Eukaryotic Linear Motif resource for Functional Sites in Proteins" . Retrieved 16 December 2022.
  18. "InterPro". Big-1 (bacterial Ig-like domain 1) domain Interpro Entry. Retrieved 16 December 2022.
  19. "GPS SUMO for KIAA1143" . Retrieved 16 December 2022.[ permanent dead link ]
  20. "YinOYang Analysis" . Retrieved 16 December 2022.
  21. "Myhits Motif scan" . Retrieved 16 December 2022.
  22. "KIAA1143 iTASSER modeling" . Retrieved 16 December 2022.