Effective number of codons (abbreviated as ENC or Nc) is a measure to study the state of codon usage biases in genes and genomes. [1] The way that ENC is computed has obvious similarities to the computation of effective population size in population genetics. [2] Although it is easy to compute ENC values, it has been shown that this measure is one of the best measures to show codon usage bias. [3]
Since the original suggestion of the ENC, several investigators have tried to improve the method, [4] [5] but it seems that there is much room to improve this measure. [6] [7] [8] [9] [10] [11]
Codon usage bias refers to differences in the frequency of occurrence of synonymous codons in coding DNA. A codon is a series of three nucleotides that encodes a specific amino acid residue in a polypeptide chain or for the termination of translation.
In molecular biology and genetics, GC-content is the percentage of nitrogenous bases in a DNA or RNA molecule that are either guanine (G) or cytosine (C). This measure indicates the proportion of G and C bases out of an implied four total bases, also including adenine and thymine in DNA and adenine and uracil in RNA.
A synonymous substitution is the evolutionary substitution of one base for another in an exon of a gene coding for a protein, such that the produced amino acid sequence is not modified. This is possible because the genetic code is "degenerate", meaning that some amino acids are coded for by more than one three-base-pair codon; since some of the codons for a given amino acid differ by just one base pair from others coding for the same amino acid, a mutation that replaces the "normal" base by one of the alternatives will result in incorporation of the same amino acid into the growing polypeptide chain when the gene is translated. Synonymous substitutions and mutations affecting noncoding DNA are often considered silent mutations; however, it is not always the case that the mutation is silent.
In genetics, the Ka/Ks ratio, also known as ω or dN/dS ratio, is used to estimate the balance between neutral mutations, purifying selection and beneficial mutations acting on a set of homologous protein-coding genes. It is calculated as the ratio of the number of nonsynonymous substitutions per non-synonymous site (Ka), in a given period of time, to the number of synonymous substitutions per synonymous site (Ks), in the same period. The latter are assumed to be neutral, so that the ratio indicates the net balance between deleterious and beneficial mutations. Values of Ka/Ks significantly above 1 are unlikely to occur without at least some of the mutations being advantageous. If beneficial mutations are assumed to make little contribution, then Ks estimates the degree of evolutionary constraint.
Tissue alpha-L-fucosidase is an enzyme that in humans is encoded by the FUCA1 gene.
Bradykinin receptor B2 is a G-protein coupled receptor for bradykinin, encoded by the BDKRB2 gene in humans.
Taste receptor type 2 member 3 is a protein that in humans is encoded by the TAS2R3 gene.
Taste receptor type 2 member 14 is a protein that in humans is encoded by the TAS2R14 gene.
Sulfotransferase 1A3/1A4 is an enzyme that in humans is encoded by the SULT1A3 gene.
Estrogen sulfotransferase is an enzyme that in humans is encoded by the SULT1E1 gene.
Sterile alpha motif and leucine zipper containing kinase AZK, also known as ZAK, is a human gene.
ATP-binding cassette sub-family D member 3 is a protein that in humans is encoded by the ABCD3 gene.
Rab-interacting lysosomal protein is a protein that in humans is encoded by the RILP gene.
ATP-binding cassette sub-family B member 9 is a protein that in humans is encoded by the ABCB9 gene.
Pregnancy-specific beta-1-glycoprotein 5 is a protein that in humans is encoded by the PSG5 gene.
Pregnancy-specific beta-1-glycoprotein 4 is a protein that in humans is encoded by the PSG4 gene.
Rho-related BTB domain-containing protein 2 is a protein that in humans is encoded by the RHOBTB2 gene.
UDP-glucuronosyltransferase 2B10 is an enzyme that in humans is encoded by the UGT2B10 gene.
GPR182 is a human gene which is an orphan G-protein coupled receptor.
Solute carrier family 22 member 10 (SLC22A10), also known as organic anion transporter 5 (OAT5), is a protein that in humans is encoded by the SLC22A10 gene.