Effective number of codons

The effective number of codons (abbreviated as ENC or Nc) is a measure of codon usage bias in genes and genomes. [1] Its computation closely parallels that of effective population size in population genetics. [2] Although ENC values are easy to compute, the measure has been shown to be one of the best indicators of codon usage bias. [3]
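
The calculation can be sketched in a few lines of Python. This is a minimal illustration of the formula as given by Wright [1], not a reference implementation. For each amino acid with k synonymous codons observed n times in total, a codon "homozygosity" F = (n Σp² − 1)/(n − 1) is computed, where p is the relative frequency of each synonymous codon. F is then averaged within each degeneracy class, and Nc = 2 + 9/F2 + 1/F3 + 5/F4 + 3/F6 under the standard genetic code. The function and variable names are hypothetical. Skipping amino acids with fewer than two codons or with F = 0, and raising an error when a whole degeneracy class is missing, are simplifying assumptions of this sketch.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code in the usual compact TCAG ordering ("*" marks stop codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Group synonymous codons by amino acid, ignoring stop codons.
FAMILIES = defaultdict(list)
for codon, aa in CODON_TABLE.items():
    if aa != "*":
        FAMILIES[aa].append(codon)

# Number of amino acids in each degeneracy class of the standard code.
CLASS_SIZES = {2: 9, 3: 1, 4: 5, 6: 3}


def effective_number_of_codons(seq):
    """Return Nc for an in-frame coding sequence (hypothetical helper)."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))

    f_by_class = defaultdict(list)
    for codons in FAMILIES.values():
        k = len(codons)
        if k == 1:
            continue  # Met and Trp contribute the constant 2 in the formula
        n = sum(counts[c] for c in codons)
        if n < 2:
            continue  # F is undefined for fewer than two observations (assumption: skip)
        sum_sq = sum((counts[c] / n) ** 2 for c in codons)
        f = (n * sum_sq - 1) / (n - 1)
        if f > 0:
            f_by_class[k].append(f)  # assumption: drop F = 0 to avoid division by zero

    nc = 2.0
    for k, size in CLASS_SIZES.items():
        if not f_by_class[k]:
            raise ValueError(f"no usable amino acids with {k} synonymous codons")
        mean_f = sum(f_by_class[k]) / len(f_by_class[k])
        nc += size / mean_f
    return min(61.0, nc)  # Nc is capped at the 61 sense codons


# Extreme bias: every amino acid is encoded by a single codon, used five times.
biased = "".join(codons[0] * 5 for codons in FAMILIES.values())
print(effective_number_of_codons(biased))  # 20.0
```

Values range from 20, when each amino acid is encoded by a single codon, to 61, when all synonymous codons are used equally. The quantity 1/F plays the same role as the effective number of alleles derived from homozygosity in population genetics, which is where the parallel with effective population size comes from.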

Since ENC was first proposed, several investigators have tried to improve the method, [4] [5] but the measure still appears to leave considerable room for improvement. [6] [7] [8] [9] [10] [11]

References

  1. Wright F. (1990). "The 'effective number of codons' used in a gene". Gene. 87 (1): 23–29. doi:10.1016/0378-1119(90)90491-9. PMID 2110097.
  2. Kimura M.; Crow J.F. (1964). "The number of alleles that can be maintained in a finite population". Genetics. 49 (4): 725–738. doi:10.1093/genetics/49.4.725. PMC 1210609. PMID 14156929.
  3. Comeron J.M.; Aguadé M. (1998). "An evaluation of measures of synonymous codon usage bias". J. Mol. Evol. 47 (3): 268–274. Bibcode:1998JMolE..47..268C. doi:10.1007/PL00006384. PMID 9732453. S2CID 21862217.
  4. Novembre J.A. (2002). "Accounting for background nucleotide composition when measuring codon usage bias". Mol. Biol. Evol. 19 (8): 1390–1394. doi:10.1093/oxfordjournals.molbev.a004201. PMID 12140252.
  5. Fuglsang A. (2004). "The 'effective number of codons' revisited". Biochem. Biophys. Res. Commun. 317 (3): 957–964. doi:10.1016/j.bbrc.2004.03.138. PMID 15081433.
  6. Marashi S.A.; Najafabadi H.S. (2004). "How reliable re-adjustment is: correspondence regarding A. Fuglsang, "The 'effective number of codons' revisited"". Biochem. Biophys. Res. Commun. 324 (1): 1–2. doi:10.1016/j.bbrc.2004.08.213. PMID 15464973.
  7. Fuglsang A. (2005). "On the methodological weakness of 'the effective number of codons': a reply to Marashi and Najafabadi". Biochem. Biophys. Res. Commun. 327 (1): 1–3. doi:10.1016/j.bbrc.2004.11.133. PMID 15629420.
  8. Banerjee T.; Gupta S.K.; Ghosh T.C. (2005). "Towards a resolution on the inherent methodological weakness of the "effective number of codons used by a gene"". Biochem. Biophys. Res. Commun. 330 (4): 1015–8. doi:10.1016/j.bbrc.2005.02.150. PMID 15823544.
  9. Fuglsang A. (2006). "Estimating the "effective number of codons": the Wright way of determining codon homozygosity leads to superior estimates". Genetics. 172 (2): 1301–7. doi:10.1534/genetics.105.049643. PMC 1456227. PMID 16299393.
  10. Fuglsang A. (2006). "Accounting for background nucleotide composition when measuring codon usage bias: brilliant idea, difficult in practice". Mol. Biol. Evol. 23 (7): 1345–7. doi:10.1093/molbev/msl009. PMID 16679346.
  11. Fuglsang A. (2008). "Impact of bias discrepancy and amino acid usage on estimates of the effective number of codons used in a gene, and a test for selection on codon usage". Gene. 410 (1): 82–8. doi:10.1016/j.gene.2007.12.001. PMID 18248919.