CEFIP

Last updated

Cardiac-enriched FHL2-interacting protein (CEFIP) is a protein encoded by the gene C10orf71 on chromosome 10 open reading frame 71. [1] [2] It is primarily understood that this gene is moderately expressed in muscle tissue and cardiac tissue. [3] [4]

Contents

Gene

The cytogenic locus is found at 10q11.23. [1] C10orf71 encodes 28294 base pairs (bp) within chromosome 10 at 49299193-49327487 bp. [1] It is located on the plus strand and is flanked by several other genes. [1]

Chromosome 10 representation showing C10orf71 location and other neighboring genes it is flanked by. C10orf71 Location.png
Chromosome 10 representation showing C10orf71 location and other neighboring genes it is flanked by.

mRNA

The mRNA sequence of C10orf71 has 3 exons and 10 stop codons in the favorable splice form. [1] [6] The two alternative splice forms had 47 and 75 stop codons interspersed throughout the sequence so they were not utilized to obtain further sequence information. The main splice form that was analyzed had the ten stop codons interspersed throughout the 5' and 3' UTR, which was why this splice form was utilized to further analyze. The mRNA of the Homo sapiens ortholog of C10orf71was 5286 bp in length. [1] [6]

Alternative splice formsof C10orf7
Splice 1Stop Codons found starting at bp: 4, 150, 247, 310, 4645, 4774, 4783, 4855, 4986, and 5250

Exons found between bp: 431/432, 544/545, and 5285/5286

Kozak site found at bp 332-334

Splice 247 Stop Codons found interspersed throughout the entire sequence
Splice 375 stop codons found interspersed throughout the entire sequence

The three alternative splice forms found of C10orf71 mRNA sequence and the locations stop codons, exons and the Kozak site found in Splice 1. Splice 1 was utilized to analyze and obtain information about as all of the stop codons found in this splice form were found in the 5' and 3' UTR regions of the sequence. There were three exons found in Splice 1 with a Kozak consensus sequence in the overall sequence as well. [1] [6]

Protein

The mature Homo sapiens homolog of the CEFIP protein encoded by C10orf71 is 1435 amino acids (aa) in length and weighs approximately 156.5 kDa. [6] This homolog has an isoelectric point of 5.94. [6] The range of pH values from Homo sapiens to the latest ortholog analyzed, Rhincodon types, ranged from 5.94-6.93, with it gradually increasing as it went later in the divergence of the ortholog. [6]

SpeciesLength (aa)Molecular Weight (kDa)Isoelectric point
Homo sapiens1,435156.55.94
Gorilla gorilla1,435156.25.91
Mus muculus1,412154.55.81
Gallus gallus1,521167.76.15
Rhincodon typus1,253138.86.93

Comparison of some of the orthologs analyzed when compared to Homo sapiens. The orthologs are arranged from species that are most closely related to the Homo sapiens ortholog to least closely related (top to bottom respectively). [6]

Composition of protein

CEFIP is predicted to be a non-transmembrane, soluble protein. [7] It is predicted to be a nuclear protein with 91.3% confidence with it being fairly confident to be a nuclear protein throughout the orthologs. There was one positive charge cluster found in CEFIP protein sequence, that is located from amino acids 1165–1193. [6] This cluster was moderately conserved throughout the orthologs analyzed. There was also a mixed charge cluster found in the Homo sapiens' sequence of this protein, located from amino acid 750–778, although this cluster was not highly conserved throughout the analyzed orthologs. [6] There was one repeat sequence found as well, TASKPPA, located at amino acids 163-169 and 116–1172. This protein is Proline and Serine rich as well. [6]

Predicted C10orf71 location in orthologs.
SpeciesNuclearCytoplasmicCytoskeletal
Homo sapiens91.3%4.3%4.3%
Gorilla gorilla91.3%4.3%4.3%
Mus musculus82.6%13.0%
Gallus gallus69.6%17.4%4.3%
Rhincodon typus69.6%21.7%
This cartoon represents the basic layout of the CEFIP protein. The overall protein is represented by the orange rectangle, DUF4585 is represented by the blue pentagon, VAC is represented by the green oval, nuclear localization signals are represented by the gray diamonds and N-glycosylation sites are represented by the red diamonds. C10orf71 Cartoon.png
This cartoon represents the basic layout of the CEFIP protein. The overall protein is represented by the orange rectangle, DUF4585 is represented by the blue pentagon, VAC is represented by the green oval, nuclear localization signals are represented by the gray diamonds and N-glycosylation sites are represented by the red diamonds.

Domains and motifs

One confirmed domain of unknown function (DUF) was found within the CEFIP protein sequence, DUF4585. [1] DUF4585 is located on the Homo sapiens protein sequence from amino acid 311–334. DUF4585 was highly conserved throughout the orthologs that were analyzed. There was also a small vacuolar targeting motif (VAC) found within the analyzed protein sequence spanning amino acids 543–546.

Protein structure

The mature CEFIP protein contains nuclear localization signals (NLS), pat4 (RKPK at aa 382, RPRK at aa 640, KRRK at aa 1190) and pat7 (PPWRKPK at aa 379 and PWRKPKT at aa 380) with an NLS score of 0.94. A secondary structure was constructed with a 6.1% confidence level. [7]

Predicted CEFIP protein structure. 3D C10orf71.jpg
Predicted CEFIP protein structure.

Post-translational modifications

There were seven GlcNAc O-glycosylation sites predicted within the protein sequence found at amino acids 116, 120, 139, 165, 468, 470, and 844. [7] There were also several phosphorylation sites found interspersed throughout the sequence. One propeptide cleavage site was predicted at amino acid 38. [7] There were three predicted sumoylation sites found at amino acids 599, 890, and 1176.

Expression

The C10orf71 mRNA was found to be highly expressed in cardiac, muscle, and liver tissue (biology). [1]

Regulation of expression

There were 6 possible promoters found in the sequence. Promoter GXP_6729162 is 1403 bp in length. [9] This promoter had several transcription factors of interest including those involved with myocytes. [9]

Function

There is little scientific information known about the function of CEFIP.

Interacting proteins

There was a total of 25 proteins generated that were predicted to interact with CEFIP (Homo sapiens ortholog). [10] [11] Most of the interactions predicted were physical interactions with CEFIP. [10] These interactions were discovered through a variety of mechanisms including, but not limited to: affinity chromatography, microarray analysis, and tandem mass spectrometry among others. [10] [11] Refer to table for details about the interacting proteins of CEFIP. [12]

Interacting ProteinName of ProteinKnown FunctionLocation Expressed or Associated Diseases
C20orf78Chromosome 20 Open Reading Frame 78UnknownUnconfirmed
BPIFA2 [13] BPI Fold Containing Family A Member 2Plays a role in antibacterial resistance in upper respiratory pathwayExpressed in salivary glands
PPIL6 [14] Peptidyl Prolyl Isomerase Like 6Accelerates folding of proteinsUnconfirmed
KIF17 [15] Kinesin Family Member 17Transports vesicles containing NMDA receptor 2BExpressed in microtubules
KRT78 [16] Keratin 78Forms cytoplasmic network; encodes proteins with intermediate filament domainsExpressed in intermediate filaments
TBX4 [17] T-box4 Transcription FactorEncode transcription factors involved in regulation of developmental processes; assists with regulation of mesoderm differentiation; could play a role in limb pattern formationAssociated with Small Patella Syndrome and Heritable Pulmonary Arterial Hypertension
DNAH8 [18] Dynein Axonemal Heavy Chain 8heavy chain of an axonemal dynein involved in sperm and respiratory cilia motility.Associated with Colchicine Resistance and Mitochondrial Complex V Deficiency, Nuclear Type 1
TSPAN17 [19] Tetraspanin 17Predicted to regulate ADAM10 maturationUnconfirmed
C14orf80 [20] Chromosome 14 Open Reading Frame 80UnknownUnconfirmed
SLC35F4 [21] Solute Carrier Family 35 Member F4Solute transporterUnconfirmed
LHX4 [22] LIM Homeobox 4Predicted to play a role in maturing lungs, development of respiratory mechanisms, and development of the pituitary glandAssociated with Pituitary Hormone Deficiency, Combined 4 and Lhx4=Related Combined Pituitary Hormone
FAM53A [23] Family With Sequence Similarity 53 Member APlays a role in neural developmentPossibly expressed in ventricle tissue
GRIK5 [24] Glutamate Ionotropic Receptor Kainase Type Subunit 5Forms functional heteromeric kainite-preferring ionic channelsAssociated with Schizophrenia
FADS2 [25] Fatty Acid Denaturase 2Regulates unsaturation of fatty acids through introduction of double bonds between define Cysteines of the fatty acid chainsAssociated with Best Vitelliform Macular Dystrophy
GDF2 [26] Growth Differentiation Factor 2Regulates cartilage and bone development; differentiation of cholinergic receptors in CNSUnconfirmed
C17orf77 [27] Chromosome 17 Open Reading Frame 77UnknownUnconfirmed
CFAP45 (CCDC19) [28] Cilia And Flagella Associated Protein 45Associated with pharynx cancerUnconfirmed
DCST2 [29] DC-STAMP Domain Containing 2UnknownUnconfirmed
CTXN1 [30] Cortexin 1Predicted to play a role in IC or EC signaling of the cortical neurons during the development of the forebrain.Unconfirmed
C19orf68 [31] Chromosome 19 Open Reading Frame 68UnknownUnconfirmed
DCAF7 [32] DDB1 And CUL4 Associated Factor 7It's been shown to function as a scaffold protein in kinase signaling. It's also been known to be involved with craniofacial developmentUnconfirmed
DYRK1A [33] Dual Specificity YAK1-Related KinaseMay play a role in brain development and cell proliferation; nuclear localized proteinAssociated with Mental Retardation, Autosomal Dominant 7 and Microcephaly
DYRK1B [34] Dual Specificity Tyrosine-(Y)-Phosphorylation Regulated Kinase 1BPlays a role in the cell cycle; nuclear-localized proteinAssociated with Abdominal Obesity-Metabolic Syndrome 3 and Abdominal Obesity-Metabolic Syndrome
FNTA [35] Protein Farnesyltransferase/Geranylgeranyltransferase Type-1 Subunit AlphaHelps regulate neuromuscular junction developmentUnconfirmed
FNTB [36] Protein Farnesyltransferase Subunit BetaCatalyzes the transfer of a farnesyl moiety from farnesyl diphosphate to a cysteineUnconfirmed

Interacting proteins, their function if known, and any tissues they have been found or predicted to be expressed in and any diseases they have been associated with. [10] [11] [12]

Homologs

Paralogs

There are currently no known paralogs to the C10orf71 gene.

Orthologs

C10orf71 is known to have 68 orthologs in various species including primates (11 species), rodents (8 species), Laurasiatheria carnivores (14 species), Placental mammals (38 species), Sauropsida birds and reptiles (7 species), and fish (11 species). [37] The highly conserved sequences are primarily from primates with the identity percentage of these species being >90%, whereas species such as reptiles, birds, and fish had an identity percentage ≤30%. [6] Refer to table for additional information on dates of divergence, sequence length, and sequence identity and similarity for orthologs. C10orf71 is not present in prokaryotes, archaea, or fungi. [37]

Abbreviation (for Phylogenetic Tree)SpeciesCommon NameProtein Accession #Estimated Date of Divergence (MYA)Sequence Length (aa)Sequence Identity to Human mRNA/protein (%)Sequence Similarity to Human mRNA/protein (%)
HomSHomo sapiens human NP_001128668.11435100100
GorGGorilla gorilla western gorilla XP_018889898.19.06143598.398.7
RhiRRhinopithecus roxellana Golden snub-nosed monkey XP_010381152.129.44143593.495.3
OtoGOtolemur garnettiismall-eared galagoXP_003801705.174141974.981.5
TupCTupaia chinensisChinese tree shrewXP_014439281.182118661.567.8
MusMMus musculus house mouse NP_001182026.190141265.474.3
OctDOctodon degus Common degu XP_004647022.190140763.373.1
HetGHeterocephalus glaber Naked mole-rat XP_004874589.190141163.273.3
CerSCeratotherium simum simum Southern white rhinoceros XP_004432504.19614367481
OrcOOrcinus orca killer whale XP_00428643696143372.679.4
LoxALoxodonta Africana African bush elephant XP_003408977.1105143865.574.5
SarHSarcophilus harrisii Tasmanian devil XP_003755230.2159147049.062.5
GavGGavialis gangeticus crocodile XP_019358113.1312153832.846.7
TinGTinamus guttatus White-throated tinamou XP_010216992.1312152932.446.3
PelSPelodiscus sinensis turtle XP_006118195.1312150532.046.2
GalGGallus gallus chicken XP_421655.3312152130.544.7
MelUMelopsittacus undulates parrot XP_005153970312153830.345.1
OreNOreochromis niloticus Nile tilapia XP_01922182243566112.018.7
SclFScleropages formosus Asian arowana XP_018580403435312511.517.9
DanRDanio rerio Zebrafish XP_005157004.143535919.216.0
CluHClupea harengus Atlantic herring XP_012687674.143536339.113.8
RhiTRhincodon typus whale shark XP_020385611.1473125340.024.8

Ortholog table in descending order to latest ortholog diverged. This table compares the orthologs analyzed, their species names, common names, dates of divergence from Homo sapiens ortholog (MYA), length (aa), and percentage of similarity and identity. [1] [6] [37] [38]

Phylogeny

A phylogenic tree created from the orthologs. They are grouped in similar animal species. The branches are labeled with the four letter code given to them, full species name and common names of these codes can be found in table above. Tree for PS2.jpg
A phylogenic tree created from the orthologs. They are grouped in similar animal species. The branches are labeled with the four letter code given to them, full species name and common names of these codes can be found in table above.

A phylogenetic tree was constructed for the orthologs that were analyzed in comparison to Homo sapiens. With the species of latest divergence being Rhincodon types, or the whale shark. [37] [38]

Evolutionary rate

C10orf71's rate of divergence was faster than that of fibrinogen or Cytochrome C . [37]

The corrected % of divergence compared to the date of divergence for the orthologs. The ortholog points on the graph can be identified by referring to table above. C10orf71 (blue line) appears to have diverged faster than both Fibrinogens (red line) and Cytochrome C (green line). The correct % of divergence was calculated using the equation m/100 = -ln(1- n/100) where m= total number of amino acid changes which have occurred in a 100-amino-acid segment of a protein and n=observed number of amino acid changes per 100 residues. Date of Divergence.png
The corrected % of divergence compared to the date of divergence for the orthologs. The ortholog points on the graph can be identified by referring to table above. C10orf71 (blue line) appears to have diverged faster than both Fibrinogens (red line) and Cytochrome C (green line). The correct % of divergence was calculated using the equation m/100 = –ln(1- n/100) where m= total number of amino acid changes which have occurred in a 100-amino-acid segment of a protein and n=observed number of amino acid changes per 100 residues.

Clinical significance

There was a microarray experiment that also showed evidence that C10orf71's expression was lowered in skeletal muscle tissues that experienced sepsis. [39] There was clinical significance found in the expression level of C10orf71 in an experiment looking at those with Myotonic dystrophy. [39] One microarray analysis produced results that showed C10orf71's expression level decreased in those with prostate cancer as well. [39]

Related Research Articles

<span class="mw-page-title-main">METTL26</span>

METTL26, previously designated C16orf13, is a protein-coding gene for Methyltransferase Like 26, also known as JFP2. Though the function of this gene is unknown, various data have revealed that it is expressed at high levels in various cancerous tissues. Underexpression of this gene has also been linked to disease consequences in humans.

WD repeat-containing protein 90 is a protein that, in humans, is encoded by the WDR90 gene (16p13.3). This human protein is 1750 amino acids, and has a molecular weight of 187.7 kDa. It contains multiple WD40 repeat domains and one domain of unknown function. This protein is conserved all the way back to invertebrates. Proteins containing WD transducin repeating domains have been found to play a role in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control, autophagy and apoptosis.

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235.

<span class="mw-page-title-main">C10orf67</span> Protein-coding gene in the species Homo sapiens

Chromosome 10 open reading frame 67 (C10orf67), also known as C10orf115, LINC01552, and BA215C7.4, is an un-characterized human protein-coding gene. Several studies indicate a possible link between genetic polymorphisms of this and several other genes to chronic inflammatory barrier diseases such as Crohn's Disease and sarcoidosis.

<span class="mw-page-title-main">ERICH2</span> Protein-coding gene in the species Homo sapiens

Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.

Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.

<span class="mw-page-title-main">C21orf62</span> Protein-coding gene in the species Homo sapiens

C21orf62 is a protein that, in humans, is encoded by the C21orf62 gene. C21orf62 is found on human chromosome 21, and it is thought to be expressed in tissues of the brain and reproductive organs. Additionally, C21orf62 is highly expressed in ovarian surface epithelial cells during normal regulation, but is not expressed in cancerous ovarian surface epithelial cells.

<span class="mw-page-title-main">C6orf62</span>

Chromosome 6 open reading frame 62 (C6orf62), also known as X-trans-activated protein 12 (XTP12), is a gene that encodes a protein of the same name. The encoded protein is predicted to have a subcellular location within the cytosol.

<span class="mw-page-title-main">C17orf53</span>

C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

C2orf81 is a human gene encoding protein c2orf81, which is predicted to have nuclear localization.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C9orf50</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.

<span class="mw-page-title-main">C20orf202</span>

C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">C6orf136</span>

C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.

Chromosome 4 open reading frame 50 is a protein that in humans is encoded by the C4orf50 gene. The protein localizes in the nucleus. C4orf50 has orthologs in vertebrates but not invertebrates

References

  1. 1 2 3 4 5 6 7 8 9 10 "Home - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-05-07.
  2. "HGNC database of human gene names | HUGO Gene Nomenclature Committee". www.genenames.org. Retrieved 2017-05-07.
  3. "Home - UniGene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-05-07.
  4. Thierry-Mieg, Danielle; Thierry-Mieg, Jean (2012-10-16). "AceView a comprehensive annotation of human and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2017-05-07.
  5. "National Center for Biotechnology Information". www.ncbi.nlm.nih.gov. Retrieved 2017-05-07.
  6. 1 2 3 4 5 6 7 8 9 10 11 12 Workbench, NCSA Biology. "SDSC Biology Workbench". seqtool.sdsc.edu. Archived from the original on 2003-08-11. Retrieved 2017-05-07.
  7. 1 2 3 4 "ExPASy: SIB Bioinformatics Resource Portal - Home". www.expasy.org. Retrieved 2017-05-07.
  8. "i-TASSER". zhanglab.ccmb.med.umich.edu. Retrieved 2017-05-07.
  9. 1 2 "Genomatix: Genome Annotation and Browser: Query Input". www.genomatix.de. Retrieved 2017-05-07.
  10. 1 2 3 4 "GeneMANIA". genemania.org. Retrieved 2017-05-07.
  11. 1 2 3 Lab, Mike Tyers. "BioGRID | Database of Protein, Chemical, and Genetic Interactions". thebiogrid.org. Retrieved 2017-05-07.
  12. 1 2 Database, GeneCards Human Gene. "GeneCards - Human Genes | Gene Database | Gene Search". www.genecards.org. Retrieved 2017-05-07.
  13. Database, GeneCards Human Gene. "BPIFA2 Gene - GeneCards | BPIA2 Protein | BPIA2 Antibody". www.genecards.org. Retrieved 2017-05-09.
  14. Database, GeneCards Human Gene. "PPIL6 Gene - GeneCards | PPIL6 Protein | PPIL6 Antibody". www.genecards.org. Retrieved 2017-05-09.
  15. Database, GeneCards Human Gene. "KIF17 Gene - GeneCards | KIF17 Protein | KIF17 Antibody". www.genecards.org. Retrieved 2017-05-09.
  16. Database, GeneCards Human Gene. "KRT78 Gene - GeneCards | K2C78 Protein | K2C78 Antibody". www.genecards.org. Retrieved 2017-05-09.
  17. Database, GeneCards Human Gene. "TBX4 Gene - GeneCards | TBX4 Protein | TBX4 Antibody". www.genecards.org. Retrieved 2017-05-09.
  18. Database, GeneCards Human Gene. "DNAH8 Gene - GeneCards | DYH8 Protein | DYH8 Antibody". www.genecards.org. Retrieved 2017-05-09.
  19. Database, GeneCards Human Gene. "TSPAN17 Gene - GeneCards | TSN17 Protein | TSN17 Antibody". www.genecards.org. Retrieved 2017-05-09.
  20. Database, GeneCards Human Gene. "C14orf80 Gene - GeneCards | CN080 Protein | CN080 Antibody". www.genecards.org. Retrieved 2017-05-09.
  21. Database, GeneCards Human Gene. "SLC35F4 Gene - GeneCards | S35F4 Protein | S35F4 Antibody". www.genecards.org. Retrieved 2017-05-09.
  22. Database, GeneCards Human Gene. "LHX4 Gene - GeneCards | LHX4 Protein | LHX4 Antibody". www.genecards.org. Retrieved 2017-05-09.
  23. Database, GeneCards Human Gene. "FAM53A Gene - GeneCards | FA53A Protein | FA53A Antibody". www.genecards.org. Retrieved 2017-05-09.
  24. Database, GeneCards Human Gene. "GRIK5 Gene - GeneCards | GRIK5 Protein | GRIK5 Antibody". www.genecards.org. Retrieved 2017-05-09.
  25. Database, GeneCards Human Gene. "FADS2 Gene - GeneCards | FADS2 Protein | FADS2 Antibody". www.genecards.org. Retrieved 2017-05-09.
  26. Database, GeneCards Human Gene. "GDF2 Gene - GeneCards | GDF2 Protein | GDF2 Antibody". www.genecards.org. Retrieved 2017-05-09.
  27. Database, GeneCards Human Gene. "C17orf77 Gene - GeneCards | CQ077 Protein | CQ077 Antibody". www.genecards.org. Retrieved 2017-05-09.
  28. Database, GeneCards Human Gene. "CFAP45 Gene - GeneCards | CFA45 Protein | CFA45 Antibody". www.genecards.org. Retrieved 2017-05-09.
  29. Database, GeneCards Human Gene. "DCST2 Gene - GeneCards | DCST2 Protein | DCST2 Antibody". www.genecards.org. Retrieved 2017-05-09.
  30. Database, GeneCards Human Gene. "CTXN1 Gene - GeneCards | CTXN1 Protein | CTXN1 Antibody". www.genecards.org. Retrieved 2017-05-09.
  31. Database, GeneCards Human Gene. "C19orf68 Gene - GeneCards | CS068 Protein | CS068 Antibody". www.genecards.org. Retrieved 2017-05-09.
  32. Database, GeneCards Human Gene. "DCAF7 Gene - GeneCards | DCAF7 Protein | DCAF7 Antibody". www.genecards.org. Retrieved 2017-05-09.
  33. Database, GeneCards Human Gene. "DYRK1A Gene - GeneCards | DYR1A Protein | DYR1A Antibody". www.genecards.org. Retrieved 2017-05-09.
  34. Database, GeneCards Human Gene. "DYRK1B Gene - GeneCards | DYR1B Protein | DYR1B Antibody". www.genecards.org. Retrieved 2017-05-09.
  35. Database, GeneCards Human Gene. "FNTA Gene - GeneCards | FNTA Protein | FNTA Antibody". www.genecards.org. Retrieved 2017-05-09.
  36. Database, GeneCards Human Gene. "FNTB Gene - GeneCards | FNTB Protein | FNTB Antibody". www.genecards.org. Retrieved 2017-05-09.
  37. 1 2 3 4 5 "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2017-05-07.
  38. 1 2 "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2017-05-07.
  39. 1 2 3 "Home - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-05-07.