TEKTIP1 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | TEKTIP1 , chromosome 19 open reading frame 71, C19orf71, tektin bundle interacting protein 1 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1921072 HomoloGene: 47801 GeneCards: TEKTIP1 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
TEKTIP1, also known as tektin-bundle interacting protein 1, is a protein that in humans is encoded by the TEKTIP1 gene. [5] [6]
Tektin bundle interacting protein 1 (TEKTIP1) is a protein which in humans is encoded by the TEKTIP1 mRNA. [5] The longest mRNA produced is 826 bases long. [7]
Alias of the protein is C19orf71. [8]
The TEKTIP1 gene is located at 19p13.3. [9]
Several predicted transcription factors may bind in the promoter region: TFDP1, PLAGL1, ZSCAN4, POU2F3. [10] [11] [12] [13] [14]
Acrylamide, [15] cisplatin, [16] and silicon dioxide [17] decrease expression, while butanal (butyraldehyde), and pentanal increase expression [18] of TEKTIP1 mRNA.
Sequence orthology also suggests that valproic acid increases methylation of TEKTIP1 mRNA. [19]
The mRNA of TEKTIP1 is 654 nucleotides long. [5] [8] It contains five exons. [5] The human TEKTIP1 protein is 209 amino acids long and has a predicted molecular mass of 24.5 kDa. [8] The human protein has a theoretical isoelectric point of 9.1. [20]
There are two splice isoforms of the protein, X1 [21] and X2. [22]
X1 | X2 |
---|---|
Tibetan macaque [23] | Chimpanzee [24] |
Olive baboon [25] | Olive baboon [26] |
Panamanian white-faced capuchin [27] | Common marmoset [28] |
Golden [29] and Black [30] snub-nosed monkeys | Golden sub-nosed monkey [31] |
Elk [32] | Elk [33] |
Red deer [34] | Red deer [35] |
Greater horseshoe bat [36] | Bats [37] [38] [39] |
Red fox [40] | Polar bear [41] |
TEKTIP1 is highly enriched in tryptophan and tyrosine. [42] Additionally, the protein is semi-enriched in arginine and proline. [43]
TEKTIP1 is tissue specific and is found in higher levels in the kidney, testis and thymus. [9]
In fetal development, it is found in higher levels in the Adrenal glands during weeks 16-20. [9]
TEKTIP1 protein is said to be specific to the testis and is found in early spermatids during spermatogenesis. [44]
Over 240 organisms have orthologs with human gene TEKTIP1. [9] The most distant organism with an orthologous sequence to human TEKTIP1 is the West African lungfish (estimated date of divergence of 408 MYA). [45] [46]
Genus and Species | Common Name | Taxonomic Group | Median Date of Divergence (MYA) | Accession # | Sequence length (aa) | Sequence Identity to Human Protein % | Sequence Similarity to Human Protein % |
---|---|---|---|---|---|---|---|
Homo sapiens | Human | Primates/Hominidae | 0 | NP_001129052.1 | 209 | 100 | 100 |
Pan troglodyte | Chimpanzee | Primates | 6.4 | XP_003316050.1 | 209 | 98.1 | 98.6 |
Mus musculus | Mouse | Rodentia | 87 | NP_001014836.2 | 217 | 45.6 | 60.4 |
Bos taurus | Cow | Artiodactyla | 94 | NP_001070550 | 208 | 76.6 | 84.2 |
Panthera tigris | Tiger | Carnivora | 94 | XP_042833719.1 | 210 | 34.1 | 38.9 |
Orcinus orca | Killer whale | Cetacea | 94 | XP_012394771.2 | 216 | 63 | 73.1 |
Phascolarctos cinereus | Koala | Diprodontia | 160 | XP_020824948.1 | 211 | 52.9 | 67.4 |
Haliaeetus leucocephalus | Bald Eagle | Accipitiformes | 319 | XP_010580112.1 | 206 | 41.4 | 52 |
Dromaius novaehollandiae | Emu | Aves | 319 | XP_025961204.1 | 190 | 44.8 | 56.7 |
Gallus gallus domesticus | Chicken | Aves | 319 | XP_015155269.2 | 188 | 38.2 | 48.4 |
Crocodylus porosus | Saltwater Crocodile | Crocodilia | 319 | XP_019406144.1 | 210 | 54.3 | 68.3 |
Caretta caretta | Loggerhead Sea Turtle | Testudines | 319 | XP_048687032.1 | 216 | 47.8 | 57.3 |
Zootoca vivipara | Common lizard | Squamata | 319 | XP_034975173.1 | 200 | 48.8 | 59.5 |
Crotalus tigris | Tiger Rattlesnake | Squamata | 319 | XP_039190693.1 | 199 | 45.8 | 57.5 |
Protobothrops mucrosquamatus | Venomous Pit Viper | Squamata | 319 | XP_015665555.1 | 199 | 45.1 | 56.7 |
Bufo bufo | Common Toad | Salientia | 353 | XP_040276375 | 136 | 24.7 | 34.3 |
Geotrypetes seraphini | Gaboon Caecilian | Caecilians | 353 | XP_033813379.1 | 156 | 36.1 | 45.8 |
Microcaecilia unicolor | Tiny Caecilian | Caecilians | 353 | XP_030074922.1 | 142 | 34.3 | 42.6 |
Protopterus annectens | West African Lungfish | Actinopterygii | 408 | XP_043910031.1 | 142 | 28.6 | 39 |
Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.
C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.
C2orf81 is a human gene encoding protein c2orf81, which is predicted to have nuclear localization.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.
c7orf26 is a gene in humans that encodes a protein known as c7orf26. Based on properties of c7orf26 and its conservation over a long period of time, its suggested function is targeted for the cytoplasm and it is predicted to play a role in regulating transcription.
C7orf50 is a gene in humans that encodes a protein known as C7orf50. This gene is ubiquitously expressed in the kidneys, brain, fat, prostate, spleen, among 22 other tissues and demonstrates low tissue specificity. C7orf50 is conserved in chimpanzees, Rhesus monkeys, dogs, cows, mice, rats, and chickens, along with 307 other organisms from mammals to fungi. This protein is predicted to be involved with the import of ribosomal proteins into the nucleus to be assembled into ribosomal subunits as a part of rRNA processing. Additionally, this gene is predicted to be a microRNA (miRNA) protein coding host gene, meaning that it may contain miRNA genes in its introns and/or exons.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.
FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.
C2orf72 is a gene in humans that encodes a protein currently named after its gene, C2orf72. It is also designated LOC257407 and can be found under GenBank accession code NM_001144994.2. The protein can be found under UniProt accession code A6NCS6.
Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.
Chromosome 20 open reading frame 85, or most commonly known as C20orf85 is a gene that encodes for the C20orf85 Protein. This gene is not yet well understood by the scientific community.
THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.