C1QL1

Last updated

C1QL1
Available structures
PDB Ortholog search: PDBe RCSB
Identifiers
Aliases C1QL1 , C1QRF, C1QTNF14, CRF, complement component 1, q subcomponent-like 1, complement C1q like 1, CTRP14
External IDs OMIM: 611586; MGI: 1344400; HomoloGene: 4867; GeneCards: C1QL1; OMA:C1QL1 - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_006688

NM_011795

RefSeq (protein)

NP_006679

NP_035925

Location (UCSC) Chr 17: 44.96 – 44.97 Mb Chr 11: 102.83 – 102.84 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

The complement component 1, q subcomponent-like 1 (or C1QL1) is encoded by a gene located at chromosome 17q21.31. It is a secreted protein and is 258 amino acids in length. [5] The protein is widely expressed but its expression is highest in the brain and may also be involved in regulation of motor control. [6] The pre-mRNA of this protein is subject to RNA editing. [7]

Contents

Protein function

Its physiological function is unknown. It is a member of the C1q domain proteins which have important signalling roles in inflammation and in adaptive immunity. [8]

RNA editing

Editing type

The pre-mRNA of this protein is subject to A to I RNA editing, which is catalyzed by a family of adenosine deaminases acting on RNA (ADARs) that specifically recognize adenosines within double-stranded regions of pre-mRNAs and deaminate them to inosine. Inosines are recognised as guanosine by the cell's translational machinery. There are three members of the ADAR family: ADARs 1-3, with ADAR 1 and ADAR 2 being the only enzymatically active members. ADAR 3 is thought to have a regulatory role in the brain. ADAR 1 and ADAR 2 are widely expressed in tissues while ADAR 3 is restricted to the brain. The double-stranded regions of RNA are formed by base-pairing between residues in a region complementary to the region of the editing site. This complementary region is usually found in a neighbouring intron but can also be located in an exonic sequence. The region that pairs with the editing region is known as an Editing Complementary Sequence (ECS).

Editing sites

The candidate editing sites were determined experimentally by comparison of cDNA sequences and genomically encoded DNA from the same individual to avoid single nucleotide polymorphisms (SNPs). Two of the three editing sites found in mouse gene were found in the human transcript. [7] However, only the Q/R site was detected in all RNA, with the T/A site detected just once. Both sites are found within exon 1. [7]

Q/R site

This site is found in exon 1 at position 66. Editing results in a codon change from a glutamine codon to an arginine codon.

T/A site

This site is also found in exon 1, at position 63. It was only detected in one genomic sample indicating that the edited residue may be an SNP. However, the secondary structure of the RNA is predicted, around the editing site, to be highly conserved in mice and humans. This indicates that the T/A site may still be shown to be a site of A to I RNA editing. Editing at this site would result in an amino acid change from a threonine to an alanine.

The ECS is also predicted to be found within exon 1 at a location 5' to the editing region. [7]

Editing regulation

Editing is differentially expressed in the cerebellum and cortex. This regulation is also present in mice suggesting conservation of editing regulation. No editing has been detected in human lung, heart, kidney or spleen tissue. [7]

Evolutionary conservation

The sequence of exon 1 is highly conserved in mammalian species and editing of the pre-mRNA of this protein is likely to occur in mice, rat, dog and cow as well as humans. Even though the ECS is not conserved in non-mammals, an alternative ECS has been predicted in Zebrafish with a similar structure but in a different location. The Ecs is found downstream of the editing sites. [7]

Effects on Protein structure

These predicted editing sites result in the translation of an arginine instead of a glutamine at the Q/R site and an alanine instead of a threonine at the T/A site. These codon changes are nonsynomonous. [7] Since the editing sites are located just before a collagen like trimerization domain, editing may effect protein oligomerization. This region is also likely to be a protease domain. It is not known if the amino acid changes caused by editing could have an effect on these domains. [7]

Related Research Articles

<span class="mw-page-title-main">RNA editing</span> Molecular process

RNA editing is a molecular process through which some cells can make discrete changes to specific nucleotide sequences within an RNA molecule after it has been generated by RNA polymerase. It occurs in all living organisms and is one of the most evolutionarily conserved properties of RNAs. RNA editing may include the insertion, deletion, and base substitution of nucleotides within the RNA molecule. RNA editing is relatively rare, with common forms of RNA processing not usually considered as editing. It can affect the activity, localization as well as stability of RNAs, and has been linked with human diseases.

<span class="mw-page-title-main">Kv1.1</span>

Potassium voltage-gated channel subfamily A member 1 also known as Kv1.1 is a shaker related voltage-gated potassium channel that in humans is encoded by the KCNA1 gene. Isaacs syndrome is a result of an autoimmune reaction against the Kv1.1 ion channel.

Missense mRNA is a messenger RNA bearing one or more mutated codons that yield polypeptides with an amino acid sequence different from the wild-type or naturally occurring polypeptide. Missense mRNA molecules are created when template DNA strands or the mRNA strands themselves undergo a missense mutation in which a protein coding sequence is mutated and an altered amino acid sequence is coded for.

<span class="mw-page-title-main">GRIA3</span> Protein-coding gene in humans

Glutamate receptor 3 is a protein that in humans is encoded by the GRIA3 gene.

5-HT<sub>2C</sub> receptor Serotonin receptor protein distributed mainly in the choroid plexus

The 5-HT2C receptor is a subtype of the 5-HT2 receptor that binds the endogenous neurotransmitter serotonin (5-hydroxytryptamine, 5-HT). Like all 5-HT2 receptors, it is a G protein-coupled receptor (GPCR) that is coupled to Gq/G11 and mediates excitatory neurotransmission. HTR2C denotes the human gene encoding for the receptor, that in humans is located on the X chromosome. As males have one copy of the gene and females have one of the two copies of the gene repressed, polymorphisms at this receptor can affect the two sexes to differing extent.

<span class="mw-page-title-main">FLNA</span> Protein-coding gene in humans

Filamin A, alpha (FLNA) is a protein that in humans is encoded by the FLNA gene.

<span class="mw-page-title-main">ADAR</span> Mammalian protein found in Homo sapiens

The double-stranded RNA-specific adenosine deaminase enzyme family are encoded by the ADAR family genes. ADAR stands for adenosine deaminase acting on RNA. This article focuses on the ADAR proteins; This article details the evolutionary history, structure, function, mechanisms and importance of all proteins within this family.

<span class="mw-page-title-main">GRIA2</span> Mammalian protein found in Homo sapiens

Glutamate ionotropic receptor AMPA type subunit 2 is a protein that in humans is encoded by the GRIA2 gene and it is a subunit found in the AMPA receptors.

<span class="mw-page-title-main">GRIK2</span> Protein-coding gene in the species Homo sapiens

Glutamate ionotropic receptor kainate type subunit 2, also known as ionotropic glutamate receptor 6 or GluR6, is a protein that in humans is encoded by the GRIK2 gene.

<span class="mw-page-title-main">GRIK1</span> Protein-coding gene in the species Homo sapiens

Glutamate receptor, ionotropic, kainate 1, also known as GRIK1, is a protein that in humans is encoded by the GRIK1 gene.

<span class="mw-page-title-main">CYFIP2</span> Protein-coding gene in the species Homo sapiens

Cytoplasmic FMR1-interacting protein 2 is a protein that in humans is encoded by the CYFIP2 gene. Cytoplasmic FMR1 interacting protein is a 1253 amino acid long protein and is highly conserved sharing 99% sequence identity to the mouse protein. It is expressed mainly in brain tissues, white blood cells and the kidney.

<span class="mw-page-title-main">GRIA4</span>

Glutamate receptor 4 is a protein that in humans is encoded by the GRIA4 gene.

<span class="mw-page-title-main">GABRA3</span> Protein-coding gene in humans

Gamma-aminobutyric acid receptor subunit alpha-3 is a protein that in humans is encoded by the GABRA3 gene.

<span class="mw-page-title-main">KIAA1109</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein KIAA1109 is a protein that in humans is encoded by the KIAA1109 gene.

<span class="mw-page-title-main">BLCAP</span> Protein-coding gene in the species Homo sapiens

Bladder cancer-associated protein is a protein that in humans is encoded by the BLCAP gene.

<span class="mw-page-title-main">ARL6IP4</span> Protein-coding gene in humans

ADP-ribosylation-like factor 6 interacting protein 4 (ARL6IP4), also called SRp25 is the product of the ARL6IP4 gene located on chromosome 12q24. 31. Its function is unknown.

<span class="mw-page-title-main">CFAP206</span> Protein-coding gene in the species Homo sapiens

Cilia And Flagella Associated Protein 206 (CFAP206) is a gene that in humans encodes a protein “DUF3508”. This protein has a function that is not currently very well understood. Other known aliases are “dJ382I10.1, UPF0704 Protein C6orf165.” In humans, the gene coding sequence is 56,501 base pairs long, with an mRNA of 2,215 base pairs, and a protein sequence of 622 amino acids. The C6orf165 gene is conserved in chimpanzee, rhesus monkey, dog, cow, mouse, rat, chicken, zebrafish, mosquito, frog, and more C6orf165 is rarely expressed in humans, with relatively high expression in brain, lungs (trachea) and testis. The molecular weight of UPF0704 is 71,193 Da and the PI is 6.38

<span class="mw-page-title-main">C12orf60</span> Protein-coding gene in humans

Uncharacterized protein C12orf60 is a protein that in humans is encoded by the C12orf60 gene. The gene is also known as LOC144608 or MGC47869. The protein lacks transmembrane domains and helices, but it is rich in alpha-helices. It is predicted to localize in the nucleus.

The split gene theory is a theory of the origin of introns, long non-coding sequences in eukaryotic genes between the exons. The theory holds that the randomness of primordial DNA sequences would only permit small (< 600bp) open reading frames (ORFs), and that important intron structures and regulatory sequences are derived from stop codons. In this introns-first framework, the spliceosomal machinery and the nucleus evolved due to the necessity to join these ORFs into larger proteins, and that intronless bacterial genes are less ancestral than the split eukaryotic genes. The theory originated with Periannan Senapathy.

<span class="mw-page-title-main">GPATCH2L</span> It is Wikipedia article of unknown gene called "GPATCH2L".

GPATCH2L is a protein that is encoded by the GPATCH2L human gene located at 14q24.3. In humans, the length of mRNA in GPATCH2L (NM_017926) is 14,021 base pairs and the gene spans bases is 62,422 nt between chr14: 76,151,922 - 76,214,343. GPATCH2L is on the positive strand. IFT43 is the gene directly before GPATCH2L on the positive strand and LOC105370575 is the uncharacterized gene on the negative strand, which is approximately one and a half the size of GPATCH2L. Known aliases for GPATCH2L contain C14orf118, FLJ20689, FLJ10033, and KIAA1152. GPATCH2L produces 28 distinct introns, 17 different mRNAs, 14 alternatively spliced variants, and 3 unspliced forms. It has 5 probable alternative promoters, 7 validated polyadenylation sites, and 6 predicted promoters of varying lengths.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000131094 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000045532 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. C1QL1 Gene - GeneCards | C1QRF Protein | C1QRF Antibody
  6. Bérubé NG, Swanson XH, Bertram MJ, et al. (January 1999). "Cloning and characterization of CRF, a novel C1q-related factor, expressed in areas of the brain involved in motor function". Brain Res. Mol. Brain Res. 63 (2): 233–40. doi:10.1016/S0169-328X(98)00278-2. PMID   9878755.
  7. 1 2 3 4 5 6 7 8 Sie CP, Maas S (April 2009). "Conserved recoding RNA editing of vertebrate C1q-related factor C1QL1". FEBS Lett. 583 (7): 1171–4. doi: 10.1016/j.febslet.2009.02.044 . PMID   19275900. S2CID   33286445.
  8. Ghai R, Waters P, Roumenina LT, et al. (2007). "C1q and its growing family". Immunobiology. 212 (4–5): 253–66. doi: 10.1016/j.imbio.2006.11.001 . PMID   17544811.

Further reading