KIAA1143 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | KIAA1143 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1913452; HomoloGene: 10791; GeneCards: KIAA1143; OMA:KIAA1143 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
KIAA1143 is an uncharacterized protein in humans that is encoded by the KIAA1143 gene. [5] [6] it may play a role in cell growth mechanisms and regulation/creation of cytoskeletal structure. This gene is located on chromosome 3 on the minus strand
This protein has a function that is not yet objectively understood. KIAA1143 has no alias and has the longest most functional variant named as uncharacterized protein KIAA1143 isoform 1. [7] The mature mRNA transcript is 5079 Base pairs long while the length of the KIAA1143 protein is 154 amino acids. KIAA1143 has another transcript variant called KIAA1143 variant 2, which contains an alternate 3' terminal exon, resulting in a distinct 3' coding region and 3' UTR, compared to variant 1. The encoded isoform 2 has a distinct C-terminus and is shorter than isoform 1. The KIAA1143 protein belongs to the uncharacterized protein KIAA1143-like Family, and contains DUF4604 domain of unknown function. KIAA1143 has a predicted function of cell structure/mobility by encoding heavy neurofilament subunits in neurons. [8] Immunohistochemistry staining from Thermo Fisher Scientific shows presence of KIAA1143 in a cytokinetic bridge, which is involved in cellular cytokinesis. This indicates KIAA1143 may have a role in regulation of cellular division and/or communication as well.
KIAA1143 is found on chromosome 3 on the short arm (3p21) and has 3 exons. KIAA1143 is on the Minus Strand. [9] KIAA1143 is on the sense strand and spans from bases 44,690,802-44,819,561. KIAA1143 is located in the gene neighborhood of Zinc finger protein 502, ZNF501, and KIF15 [10]
The KIAA1143 was found by using 500 bp of nucleotides upstream using UCSC Genome Browser [9] A number of transcription factors with a matrix similarity greater than or equal to 0.5 that are predicted to regulate transcription of KIAA1143 are listed below with their respective binding site:
Transcription Factor | Binding Site | Strand |
---|---|---|
Grainy Head like 2 (GHRL2) (Grainyhead-like gene family) | ACAGAAGA | + |
Zinc Finer Protein 317 (ZNF317) | AACCTGTC | - |
JunD | AGTTGACGTCA | - |
CREM | AGTGACGTCAC and GTCACTGCAGT | + and - |
ATF3 | AGTGACGTCAC and GTCACTGCAGT | + and - |
ATF7 | AGTGACGTCAC and GTCACTGCAGT | + and - |
Jun dimerization protein (JDP2) | AGTGACGTCAC and GTCACTGCAGT | + and - |
FOS::JUN 9(Transcription factor Jun) | ACTGCAGT | + |
ZNF682 | CCCCGCACCGG | + |
KLF13 | TGGAACGCC | + |
Kruppel like factor 16 | GCCCGCCAGG | + |
KLF10 | CGGGCGGTCC | - |
YY2 | GGCGGCC | + |
LIN54 | CTTTGAGC | - |
KIAA1143 is expressed in all tissues, however, The expression of KIAA1143 is highest in the Ovaries, followed by the brain, thyroid, prostate, and urinary bladder [11]
KIAA1143 is predicted to have subcellular localization in the nucleus [12] [13]
Isoform 1 of KIAA1143 has a 5' UTR region of 17 base pairs and a 3' UTR region of 4597 base pairs. The transcript is 5079 base pairs long [14]
KIAA1143 has another uncharacterized isoform 2 which contains an alternate 3' terminal exon, resulting in a distinct 3' coding region and 3' UTR, compared to variant 1. The encoded isoform 2 has a distinct C-terminus and is shorter than isoform 1. [15]
The theoretical molecular weight of the 154amino acid KIAA1143 protein is 17.5kDa and the theoretical pI is 5.84. [12]
Since KIAA1143 is an gene with unknown function, most of the coding gene apart from the promotor and translation start site area is a Domain of unknown function, specifically, DUF4604 Spans amino acids 5-151. KIAA1143 has no Cysteine or tyrosine residues. Cysteine and tyrosine are very good nucleophiles, since they are not present, this gives some light into the possible electrophilic nature of the active site. KIAA1143 has some important Eukaryotic Linear Motif resource (ELM) Domains which give insight into its function. These ELM domains are LIG_BIR_II_1 from amino acids 1-5 and LIG_WRC_WIRS_1 from amino acids 144-149. These domains have importance in apoptotic regulation and actin cytoskeleton rearrangement mechanisms respectively [16] [17]
From Positions 1-10 there is also something called a BIG1 Big-1 (bacterial Ig-like domain 1) domain. Big-1 proteins are surface-expressed proteins that mediate mammalian host cell invasion or attachment. The tandem of Ig-like domains appears to form a rod to link the bacterial outer membrane anchor to the C-terminal lectin-like domain to interact with their receptors in the host cell membrane [18]
KIAA1143 is shown to be phosphorylated at positions 2, 50, 68, 113, 115, 116 at either serine or threonine residues. There is a sumoylation consensus at position 76, as well as O-GlcNAc attachment at position 8. There is also an N-myristoylation site from 111-116 [19] [20] [21]
The KIAA1143 Tertiary structure is predicted below through iTASSER modeling, with a C score of -2.83. Coloring is similar to alphafold confidence graphing [22]
KIAA1143 is experimentally determined to have interactions with EAPP (E2F-associated phosphoprotein), ECD (Ecdysoneless Cell Cycle Regulator), GPATCH1 (Evolutionarily Conserved G-Patch Domain-Containing Protein), PRPF8 (Pre-MRNA-Processing-Splicing Factor 8), WDR83 (Mitogen-Activated Protein Kinase Organizer 1), CEP76 (centrosomal protein 76), and APP (Amyloid-beta precursor protein).
There are no paralogs for KIAA1143
KIAA1143 has homologs in over 200 other organisms, including vertebrates, invertebrates, archaea, KIAA1143 is found in clades of organisms except land plants.
Genus species | Common Name | Taxonomic Group | Divergence (MYA) | Accession Number | Seq. Length (aa) | Corr. ID to HP (%) | Corr. Sim. To HP (%) |
---|---|---|---|---|---|---|---|
Homo sapiens | Human | primates | 0 | NP_065747.1 | 154 | 100 | 100 |
Mus musculus | House Mouse | Rodentia | 87 | NP_079695.2 | 154 | 87 | 93 |
Pogona vitticeps | Central Bearded Dragon | Reptilla | 319 | XP_020663315.1 | 152 | 72 | 84 |
Sphaerodactylus townsendi | townsend's least gecko | Reptilla | 319 | XP_048366794.1 | 154 | 68 | 81 |
Zootoca vivipara | Common Lizzard | Reptila | 319 | XP_034985594.1 | 153 | 64 | 77 |
Python bivittatus | burmese python | Reptilla | 319 | XP_007437594.1 | 151 | 64 | 77 |
Nothoprocta perdicaria | chilean tinamou | Aves | 319 | XP_025890652.1 | 159 | 63 | 73 |
Oxyura jamaicensis | Ruddy Duck | Aves | 319 | XP_035173671.1 | 159 | 54 | 71 |
Aptenodytes forsteri | Emperor Penguin | Aves | 319 | XP_019330441.1 | 159 | 53 | 68 |
Catharus ustulatus | Swainson's thrush | Aves | 319 | XP_032933984.1 | 175 | 53 | 66 |
Rhinatrema bivittatum | two-lined caecilian | (Gymnophiona) | 353 | XP_029445556.1 | 156 | 58 | 76 |
Geotrypetes seraphini | gaboon caecilian | (Gymnophiona) | 353 | XP_033786333.1 | 156 | 58 | 78 |
Rana temporaria | Common frog | (anura) | 353 | XP_040209069.1 | 152 | 58 | 75 |
Bufo Bufo | Common Toad | Amphibia (anura) | 353 | XP_040287819.1 | 153 | 57 | 74 |
Latimeria Chalumnae | West Indian Ocean coelacanth | Sarcoptergyii (lobe-finned fish) | 414 | XP_005999062.1 | 151 | 46 | 63 |
Gambusia affinis | mosquitofish | Cyprinodontiformes (ray-finned fish) | 431 | XP_043973326.1 | 160 | 48 | 63 |
Clupea harengus | Atlantic Herring | Clupeiformes (ray-finned fish) | 431 | XP_012692935.2 | 154 | 47 | 68 |
Scyliorhinus canicula | Small Spotted catshark | Carcharhiniformes (cartilagenous fish) | 464 | XP_038652524.1 | 152 | 58 | 79 |
Amblyraja radiata | Thorny skate | Elasmobranchii (cartilagenous fish) | 464 | XP_032871011.1 | 153 | 50 | 68 |
Petromyzon marinus | sea lamprey | Agnatha (jawless fish) | 599 | XP_032833525.1 | 178 | 49 | 67 |
The relative rate of change for KIAA1143 is fairly slow compared to fibrinogen and beta-globin, but not as slow as cytochrome c.
The DUF4604 domain in KIAA1143 is conserved across all organisms
All of the orthologs of KIAA1143 are derived from the same common ancestor
No diseases have been shown to be directly linked to KIAA1143 to this date.
No Disease Association is observed with KIAA1143 to date.
Tetratricopeptide repeat protein 39B is a protein that in humans is encoded by the TTC39B gene. TTC39B is also known as C9orf52 or FLJ33868. The main feature within tetratricopeptide repeat 39B is the domain of unknown function 3808 (DUF3808), spanning the majority of the protein.
C12orf40, also known as Chromosome 12 Open Reading Frame 40, HEL-206, and Epididymis Luminal Protein 206 is a protein that in humans is encoded by the C12orf40 gene.
Fanconi Anemia Opposite Strand Transcript protein is a predicted protein that in humans is encoded by the FANCD2OS gene. The name is derived from mRNA transcribed from the strand complementary to the FANCD2 gene.
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.
Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.
LOC101928193 is a protein which in humans is encoded by the LOC101928193 gene. There are no known aliases for this gene or protein. Similar copies of this gene, called orthologs, are known to exist in several different species across mammals, amphibians, fish, mollusks, cnidarians, fungi, and bacteria. The human LOC101928193 gene is located on the long (q) arm of chromosome 9 with a cytogenic location at 9q34.2. The molecular location of the gene is from base pair 133,189,767 to base pair 133,192,979 on chromosome 9 for an mRNA length of 3213 nucleotides. The gene and protein are not yet well understood by the scientific community, but there is data on its genetic makeup and expression. The LOC101928193 protein is targeted for the cytoplasm and has the highest level of expression in the thyroid, ovary, skin, and testes in humans.
C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.
The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.
C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.
FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.
Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.
TBC1D30 is a gene in the human genome that encodes the protein of the same name. This protein has two domains, one of which is involved in the processing of the Rab protein. Much of the function of this gene is not yet known, but it is expressed mostly in the brain and adrenal cortex.
UPF0602 is a protein in humans that is encoded by the chromosome 4 open reading frame 47 (c4orf47) gene.
Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
Leucine-rich repeat-containing protein 74A (LRRC74A), is a protein encoded by the LRRC74A gene. The protein LRRC74A is localized in the cytoplasm. It has a calculated molecular weight of approximately 55 kDa. The LRRC74A protein is nominally expressed in the testis, salivary gland, and pancreas.