CodonCode Aligner

Last updated
CodonCode Aligner
Developer(s) CodonCode Corporation
Stable release
4.2.5 / 2013
Operating system Mac OS X, Windows
Type Bioinformatics
License commercial; free for limited use (trace viewing & editing)
Website http://www.codoncode.com/aligner

CodonCode Aligner is a commercial application for DNA sequence assembly, sequence alignment, and editing on Mac OS X and Windows.

Contents

Features

Features include chromatogram editing, end clipping, and vector trimming, sequence assembly and contig editing, aligning cDNA against genomic templates, sequence alignment and editing, alignment of contigs to each other with ClustalW, MUSCLE, or built-in algorithms, mutation detection, including detection of heterozygous single-nucleotide polymorphism, analysis of heterozygous insertions and deletions, start online BLAST searches, restriction analysis (find and view restriction cut sites), trace sharpening, and support for Phred, Phrap, ClustalW, and MUSCLE.

History

The first beta version of CodonCode Aligner was released in April 2003, followed by the first full version in June 2003. Major upgrades were released in 2003, 2004, 2005, 2006, 2007, and 2008.

In April 2009, CodonCode Aligner had been cited in more than 400 scientific publications. Citations cover a wide variety of biomedical research areas, including HIV research, [1] [2] [3] biogeography and environmental biology, [4] [5] DNA methylation studies, [6] genetic diseases, [7] [8] [9] clinical microbiology, [10] [11] and evolution research and phylogenetics. [12] [13] [14]

See also

Related Research Articles

<span class="mw-page-title-main">Sequence alignment</span> Process in bioinformatics that identifies equivalent sites within molecular sequences

In bioinformatics, a sequence alignment is a way of arranging the sequences of DNA, RNA, or protein to identify regions of similarity that may be a consequence of functional, structural, or evolutionary relationships between the sequences. Aligned sequences of nucleotide or amino acid residues are typically represented as rows within a matrix. Gaps are inserted between the residues so that identical or similar characters are aligned in successive columns. Sequence alignments are also used for non-biological sequences, such as calculating the distance cost between strings in a natural language or in financial data.

In genetics, an expressed sequence tag (EST) is a short sub-sequence of a cDNA sequence. ESTs may be used to identify gene transcripts, and were instrumental in gene discovery and in gene-sequence determination. The identification of ESTs has proceeded rapidly, with approximately 74.2 million ESTs now available in public databases. EST approaches have largely been superseded by whole genome and transcriptome sequencing and metagenome sequencing.

<span class="mw-page-title-main">Clustal</span>

Clustal is a series of widely used computer programs used in bioinformatics for multiple sequence alignment. There have been many versions of Clustal over the development of the algorithm that are listed below. The analysis of each tool and its algorithm is also detailed in their respective categories. Available operating systems listed in the sidebar are a combination of the software availability and may not be supported for every current version of the Clustal tools. Clustal Omega has the widest variety of operating systems out of all the Clustal tools.

<span class="mw-page-title-main">Conserved sequence</span> Similar DNA, RNA or protein sequences within genomes or among species

In evolutionary biology, conserved sequences are identical or similar sequences in nucleic acids or proteins across species, or within a genome, or between donor and receptor taxa. Conservation indicates that a sequence has been maintained by natural selection.

<span class="mw-page-title-main">Phi X 174</span> A single-stranded DNA virus that infects bacteria

The phi X 174 bacteriophage is a single-stranded DNA (ssDNA) virus that infects Escherichia coli, and the first DNA-based genome to be sequenced. This work was completed by Fred Sanger and his team in 1977. In 1962, Walter Fiers and Robert Sinsheimer had already demonstrated the physical, covalently closed circularity of ΦX174 DNA. Nobel prize winner Arthur Kornberg used ΦX174 as a model to first prove that DNA synthesized in a test tube by purified enzymes could produce all the features of a natural virus, ushering in the age of synthetic biology. In 1972–1974, Jerard Hurwitz, Sue Wickner, and Reed Wickner with collaborators identified the genes required to produce the enzymes to catalyze conversion of the single stranded form of the virus to the double stranded replicative form. In 2003, it was reported by Craig Venter's group that the genome of ΦX174 was the first to be completely assembled in vitro from synthesized oligonucleotides. The ΦX174 virus particle has also been successfully assembled in vitro. In 2012, it was shown how its highly overlapping genome can be fully decompressed and still remain functional.

<span class="mw-page-title-main">Multiple sequence alignment</span> Alignment of more than two molecular sequences

Multiple sequence alignment (MSA) may refer to the process or the result of sequence alignment of three or more biological sequences, generally protein, DNA, or RNA. In many cases, the input set of query sequences are assumed to have an evolutionary relationship by which they share a linkage and are descended from a common ancestor. From the resulting MSA, sequence homology can be inferred and phylogenetic analysis can be conducted to assess the sequences' shared evolutionary origins. Visual depictions of the alignment as in the image at right illustrate mutation events such as point mutations that appear as differing characters in a single alignment column, and insertion or deletion mutations that appear as hyphens in one or more of the sequences in the alignment. Multiple sequence alignment is often used to assess sequence conservation of protein domains, tertiary and secondary structures, and even individual amino acids or nucleotides.

T-Coffee is a multiple sequence alignment software using a progressive approach. It generates a library of pairwise alignments to guide the multiple sequence alignment. It can also combine multiple sequences alignments obtained previously and in the latest versions can use structural information from PDB files (3D-Coffee). It has advanced features to evaluate the quality of the alignments and some capacity for identifying occurrence of motifs (Mocca). It produces alignment in the aln format (Clustal) by default, but can also produce PIR, MSF, and FASTA format. The most common input formats are supported.

<span class="mw-page-title-main">Retroviral psi packaging element</span>

The retroviral psi packaging element, also known as the Ψ RNA packaging signal, is a cis-acting RNA element identified in the genomes of the retroviruses Human immunodeficiency virus (HIV) and Simian immunodeficiency virus (SIV). It is involved in regulating the essential process of packaging the retroviral RNA genome into the viral capsid during replication. The final virion contains a dimer of two identical unspliced copies of the viral genome.

<span class="mw-page-title-main">COPS6</span> Protein-coding gene in humans

COP9 signalosome complex subunit 6 is a protein that in humans is encoded by the COPS6 gene.

<span class="mw-page-title-main">DDB1</span> Protein-coding gene in the species Homo sapiens

DNA damage-binding protein 1 is a protein that in humans is encoded by the DDB1 gene.

<span class="mw-page-title-main">Poliovirus receptor-related 1</span> Protein-coding gene in the species Homo sapiens

Poliovirus receptor-related 1 (PVRL1), also known as nectin-1 and CD111 (formerly herpesvirus entry mediator C, HVEC) is a human protein of the immunoglobulin superfamily (IgSF), also considered a member of the nectins. It is a membrane protein with three extracellular immunoglobulin domains, a single transmembrane helix and a cytoplasmic tail. The protein can mediate Ca2+-independent cellular adhesion further characterizing it as IgSF cell adhesion molecule (IgSF CAM).

<span class="mw-page-title-main">APOBEC3F</span> Protein-coding gene in the species Homo sapiens

DNA dC->dU-editing enzyme APOBEC-3F is a protein that in humans is encoded by the APOBEC3F gene.

HERV-K_19q12 provirus ancestral Pol protein is a protein that in humans is encoded by the ERVK6 gene.

<span class="mw-page-title-main">EIF3M</span> Protein-coding gene in the species Homo sapiens

Eukaryotic translation initiation factor 3, subunit M (eIF3m) also known as PCI domain containing 1 (herpesvirus entry mediator) (PCID1), is a protein that in humans is encoded by the EIF3M gene.

Human bocavirus (HBoV) is the name given to all viruses in the genus Bocaparvovirus of virus family Parvoviridae that are known to infect humans. HBoV1 and HBoV3 are members of species Primate bocaparvovirus 1 whereas viruses HBoV2 and HBoV4 belong to species Primate bocaparvovirus 2. Some of these viruses cause human disease. HBoV1 is strongly implicated in causing some cases of lower respiratory tract infection, especially in young children, and several of the viruses have been linked to gastroenteritis, although the full clinical role of this emerging infectious disease remains to be elucidated.

MacVector is a commercial sequence analysis application for Apple Macintosh computers running Mac OS X. It is intended to be used by molecular biologists to help analyze, design, research and document their experiments in the laboratory. MacVector 18.1 is a Universal Binary capable of running on Intel and Apple Silicon Macs.

<span class="mw-page-title-main">Late protein</span>

A late protein is a viral protein that is formed after replication of the virus. One example is VP4 from simian virus 40 (SV40).

The Staden Package is computer software, a set of tools for DNA sequence assembly, editing, and sequence analysis. It is open-source software, released under a BSD 3-clause license.

<span class="mw-page-title-main">Digital transcriptome subtraction</span>

Digital transcriptome subtraction (DTS) is a bioinformatics method to detect the presence of novel pathogen transcripts through computational removal of the host sequences. DTS is the direct in silico analogue of the wet-lab approach representational difference analysis (RDA), and is made possible by unbiased high-throughput sequencing and the availability of a high-quality, annotated reference genome of the host. The method specifically examines the etiological agent of infectious diseases and is best known for discovering Merkel cell polyomavirus, the suspect causative agent in Merkel-cell carcinoma.

<span class="mw-page-title-main">Desmond G. Higgins</span>

Desmond Gerard Higgins is a Professor of Bioinformatics at University College Dublin, widely known for CLUSTAL, a series of computer programs for performing multiple sequence alignment. According to Nature, Higgins' papers describing CLUSTAL are among the top ten most highly cited scientific papers of all time.

References

  1. Bailey JR, Sedaghat AR, Kieffer T, Brennan T, Lee PK, Wind-Rotolo M, Haggerty CM, Kamireddi AR, Liu Y, Lee J, Persaud D, Gallant JE, Cofrancesco J, Quinn TC, Wilke CO, Ray SC, Siliciano JD, Nettles RE, Siliciano RF (2006). "Residual Human Immunodeficiency Virus Type 1 Viremia in Some Patients on Antiretroviral Therapy Is Dominated by a Small Number of Invariant Clones Rarely Found in Circulating CD4+ T Cells'Ć". J Virol. 80 (13): 6441–6457. doi:10.1128/JVI.00591-06. PMC   1488985 . PMID   16775332.
  2. Calis JC, Rotteveel HP, van der Kuyl AC, Zorgdrager F, Kachala D, van Hensbroek MB, Cornelissen M (2008). "Severe anaemia is not associated with HIV-1 env gene characteristics in Malawian children". BMC Infect Dis. 8: 26. doi: 10.1186/1471-2334-8-26 . PMC   2311312 . PMID   18312662. Open Access logo PLoS transparent.svg
  3. Mild M, Esbjörnsson J, Fenyö EM, Medstrand P (2007). "Frequent Intrapatient Recombination between Human Immunodeficiency Virus Type 1 R5 and X4 Envelopes: Implications for Coreceptor Switch'ñø". J Virol. 81 (7): 3369–3376. doi:10.1128/JVI.01295-06. PMC   1866041 . PMID   17251288.
  4. Pendley CJ, Becker EA, Karl JA, Blasky AJ, Wiseman RW, Hughes AL, O'Connor SL, O'Connor DH (2008). "MHC class I characterization of Indonesian cynomolgus macaques". Immunogenetics. 60 (7): 339–351. doi:10.1007/s00251-008-0292-4. PMC   2612123 . PMID   18504574.
  5. Behnke A, Bunge J, Barger K, Breiner H, Alla V, Stoeck T (2006). "Microeukaryote Community Patterns along an O2/H2S Gradient in a Supersulfidic Anoxic Fjord (Framvaren, Norway)'Ć". Appl Environ Microbiol. 72 (5): 3626–3636. doi:10.1128/AEM.72.5.3626-3636.2006. PMC   1472314 . PMID   16672511.
  6. Bart A, van Passel MWJ, van Amsterdam K, van der Ende A (2005). "Direct detection of methylation in genomic DNA". Nucleic Acids Res. 33 (14): e124. doi:10.1093/nar/gni121. PMC   1184226 . PMID   16091626.
  7. Andersson LS, Juras R, Ramsey DT, Eason-Butler J, Ewart S, Cothran G, Lindgren G (2008). "Equine Multiple Congenital Ocular Anomalies maps to a 4.9 megabase interval on horse chromosome 6". BMC Genet. 9: 88. doi: 10.1186/1471-2156-9-88 . PMC   2653074 . PMID   19099555. Open Access logo PLoS transparent.svg
  8. Tremblay K, Lemire M, Potvin C, Tremblay A, Hunninghake GM, Raby BA, Hudson TJ, Perez-Iratxeta C, Andrade-Navarro MA, Laprise C (2008). "Genes to Diseases (G2D) Computational Method to Identify Asthma Candidate Genes". PLOS ONE. 3 (8): e2907. Bibcode:2008PLoSO...3.2907T. doi: 10.1371/journal.pone.0002907 . PMC   2488373 . PMID   18682798. Open Access logo PLoS transparent.svg
  9. McCullough BJ, Adams JC, Shilling DJ, Feeney MP, Sie KCY, Tempel BL (2007). "3p-Syndrome Defines a Hearing Loss Locus in 3p25.3". Hear Res. 224 (1–2): 51–60. doi:10.1016/j.heares.2006.11.006. PMC   1995240 . PMID   17208398.
  10. Pignone M, Greth KM, Cooper J, Emerson D, Tang J (2006). "Identification of Mycobacteria by Matrix-Assisted Laser Desorption Ionization-Time-of-Flight Mass Spectrometry". J Clin Microbiol. 44 (6): 1963–1970. doi:10.1128/JCM.01959-05. PMC   1489414 . PMID   16757585.
  11. van Amsterdam K, Bart A, van der Ende A (2005). "A Helicobacter pylori TolC Efflux Pump Confers Resistance to Metronidazole". Antimicrob Agents Chemother. 49 (4): 1477–1482. doi:10.1128/AAC.49.4.1477-1482.2005. PMC   1068630 . PMID   15793129.
  12. Baxter SW, Papa R, Chamberlain N, Humphray SJ, Joron M, Morrison C, Ffrench-Constant RH, McMillan WO, Jiggins CD (2008). "Convergent Evolution in the Genetic Basis of Müllerian Mimicry in Heliconius Butterflies". Genetics. 180 (3): 1567–1577. doi:10.1534/genetics.107.082982. PMC   2581958 . PMID   18791259.
  13. Siddall ME, Trontelj P, Utevsky SY, Nkamany M, Macdonald KS (2007). "Diverse molecular data demonstrate that commercially available medicinal leeches are not Hirudo medicinalis". Proc Biol Sci. 274 (1617): 1481–1487. doi:10.1098/rspb.2007.0248. PMC   2176162 . PMID   17426015.
  14. Stoeck T, Kasper J, Bunge J, Leslin C, Ilyin V, Epstein S (2007). "Protistan Diversity in the Arctic: A Case of Paleoclimate Shaping Modern Biodiversity?". PLOS ONE. 2 (8): e728. Bibcode:2007PLoSO...2..728S. doi: 10.1371/journal.pone.0000728 . PMC   1940325 . PMID   17710128. Open Access logo PLoS transparent.svg