Nucleic acid secondary structure

Last updated
The image above contains clickable links
Interactive image of nucleic acid structure (primary, secondary, tertiary, and quaternary) using DNA helices and examples from the VS ribozyme and telomerase and nucleosome. (PDB: ADNA, 1BNA, 4OCB, 4R4V, 1YMO, 1EQZ ) DNA RNA structure (2).png
The image above contains clickable links Interactive icon.svg
The image above contains clickable links
Interactive image of nucleic acid structure (primary, secondary, tertiary, and quaternary) using DNA helices and examples from the VS ribozyme and telomerase and nucleosome. ( PDB: ADNA, 1BNA, 4OCB, 4R4V, 1YMO, 1EQZ )

Nucleic acid secondary structure is the basepairing interactions within a single nucleic acid polymer or between two polymers. It can be represented as a list of bases which are paired in a nucleic acid molecule. [1] The secondary structures of biological DNAs and RNAs tend to be different: biological DNA mostly exists as fully base paired double helices, while biological RNA is single stranded and often forms complex and intricate base-pairing interactions due to its increased ability to form hydrogen bonds stemming from the extra hydroxyl group in the ribose sugar.[ citation needed ]

Contents

In a non-biological context, secondary structure is a vital consideration in the nucleic acid design of nucleic acid structures for DNA nanotechnology and DNA computing, since the pattern of basepairing ultimately determines the overall structure of the molecules.

Fundamental concepts

Base pairing

AT base pair jypx3.png
GC base pair jypx3.png
Top, an AT base pair demonstrating two intermolecular hydrogen bonds; bottom, a GC base pair demonstrating three intermolecular hydrogen bonds.

In molecular biology, two nucleotides on opposite complementary DNA or RNA strands that are connected via hydrogen bonds are called a base pair (often abbreviated bp). In the canonical Watson-Crick base pairing, adenine (A) forms a base pair with thymine (T) and guanine (G) forms one with cytosine (C) in DNA. In RNA, thymine is replaced by uracil (U). Alternate hydrogen bonding patterns, such as the wobble base pair and Hoogsteen base pair, also occurparticularly in RNAgiving rise to complex and functional tertiary structures. Importantly, pairing is the mechanism by which codons on messenger RNA molecules are recognized by anticodons on transfer RNA during protein translation. Some DNA- or RNA-binding enzymes can recognize specific base pairing patterns that identify particular regulatory regions of genes. Hydrogen bonding is the chemical mechanism that underlies the base-pairing rules described above. Appropriate geometrical correspondence of hydrogen bond donors and acceptors allows only the "right" pairs to form stably. DNA with high GC-content is more stable than DNA with low GC-content, but contrary to popular belief, the hydrogen bonds do not stabilize the DNA significantly and stabilization is mainly due to stacking interactions. [2]

The larger nucleobases, adenine and guanine, are members of a class of doubly ringed chemical structures called purines; the smaller nucleobases, cytosine and thymine (and uracil), are members of a class of singly ringed chemical structures called pyrimidines. Purines are only complementary with pyrimidines: pyrimidine-pyrimidine pairings are energetically unfavorable because the molecules are too far apart for hydrogen bonding to be established; purine-purine pairings are energetically unfavorable because the molecules are too close, leading to overlap repulsion. The only other possible pairings are GT and AC; these pairings are mismatches because the pattern of hydrogen donors and acceptors do not correspond. The GU wobble base pair, with two hydrogen bonds, does occur fairly often in RNA.

Nucleic acid hybridization

Hybridization is the process of complementary base pairs binding to form a double helix. Melting is the process by which the interactions between the strands of the double helix are broken, separating the two nucleic acid strands. These bonds are weak, easily separated by gentle heating, enzymes, or physical force. Melting occurs preferentially at certain points in the nucleic acid. [3] T and A rich sequences are more easily melted than C and G rich regions. Particular base steps are also susceptible to DNA melting, particularly T A and T G base steps. [4] These mechanical features are reflected by the use of sequences such as TATAA at the start of many genes to assist RNA polymerase in melting the DNA for transcription.

Strand separation by gentle heating, as used in PCR, is simple providing the molecules have fewer than about 10,000 base pairs (10 kilobase pairs, or 10 kbp). The intertwining of the DNA strands makes long segments difficult to separate. The cell avoids this problem by allowing its DNA-melting enzymes (helicases) to work concurrently with topoisomerases, which can chemically cleave the phosphate backbone of one of the strands so that it can swivel around the other. Helicases unwind the strands to facilitate the advance of sequence-reading enzymes such as DNA polymerase.

Secondary structure motifs

The main nucleic acid helix structures (A-, B- and Z-form) A-DNA, B-DNA and Z-DNA.png
The main nucleic acid helix structures (A-, B- and Z-form)

Nucleic acid secondary structure is generally divided into helices (contiguous base pairs), and various kinds of loops (unpaired nucleotides surrounded by helices). Frequently these elements, or combinations of them, are further classified into additional categories including, for example, tetraloops, pseudoknots, and stem-loops. Topological approaches can be used to categorize and compare complex structures that arise from combining these elements in various arrangements.

Double helix

The double helix is an important tertiary structure in nucleic acid molecules which is intimately connected with the molecule's secondary structure. A double helix is formed by regions of many consecutive base pairs.

The nucleic acid double helix is a spiral polymer, usually right-handed, containing two nucleotide strands which base pair together. A single turn of the helix constitutes about ten nucleotides, and contains a major groove and minor groove, the major groove being wider than the minor groove. [5] Given the difference in widths of the major groove and minor groove, many proteins which bind to DNA do so through the wider major groove. [6] Many double-helical forms are possible; for DNA the three biologically relevant forms are A-DNA, B-DNA, and Z-DNA, while RNA double helices have structures similar to the A form of DNA.

Stem-loop structures

An RNA stem-loop secondary structure Stem-loop.svg
An RNA stem-loop secondary structure

The secondary structure of nucleic acid molecules can often be uniquely decomposed into stems and loops. The stem-loop structure (also often referred to as an "hairpin"), in which a base-paired helix ends in a short unpaired loop, is extremely common and is a building block for larger structural motifs such as cloverleaf structures, which are four-helix junctions such as those found in transfer RNA. Internal loops (a short series of unpaired bases in a longer paired helix) and bulges (regions in which one strand of a helix has "extra" inserted bases with no counterparts in the opposite strand) are also frequent.

There are many secondary structure elements of functional importance to biological RNAs; some famous examples are the Rho-independent terminator stem-loops and the tRNA cloverleaf. Active research is on-going to determine the secondary structure of RNA molecules, with approaches including both experimental and computational methods (see also the List of RNA structure prediction software).

Pseudoknots

An RNA pseudoknot structure. For example, the RNA component of human telomerase. Pseudoknot.svg
An RNA pseudoknot structure. For example, the RNA component of human telomerase.

A pseudoknot is a nucleic acid secondary structure containing at least two stem-loop structures in which half of one stem is intercalated between the two halves of another stem. Pseudoknots fold into knot-shaped three-dimensional conformations but are not true topological knots. The base pairing in pseudoknots is not well nested; that is, base pairs occur that "overlap" one another in sequence position. This makes the presence of general pseudoknots in nucleic acid sequences impossible to predict by the standard method of dynamic programming, which uses a recursive scoring system to identify paired stems and consequently cannot detect non-nested base pairs with common algorithms. However, limited subclasses of pseudoknots can be predicted using modified dynamic programs. [8] Newer structure prediction techniques such as stochastic context-free grammars are also unable to consider pseudoknots.

Pseudoknots can form a variety of structures with catalytic activity [9] and several important biological processes rely on RNA molecules that form pseudoknots. For example, the RNA component of the human telomerase contains a pseudoknot that is critical for its activity. [7] The hepatitis delta virus ribozyme is a well known example of a catalytic RNA with a pseudoknot in its active site. [10] [11] Though DNA can also form pseudoknots, they are generally not present in standard physiological conditions.

Secondary structure prediction

Most methods for nucleic acid secondary structure prediction rely on a nearest neighbor thermodynamic model. [12] [13] A common method to determine the most probable structures given a sequence of nucleotides makes use of a dynamic programming algorithm that seeks to find structures with low free energy. [14] Dynamic programming algorithms often forbid pseudoknots, or other cases in which base pairs are not fully nested, as considering these structures becomes computationally very expensive for even small nucleic acid molecules. Other methods, such as stochastic context-free grammars can also be used to predict nucleic acid secondary structure.

For many RNA molecules, the secondary structure is highly important to the correct function of the RNA often more so than the actual sequence. This fact aids in the analysis of non-coding RNA sometimes termed "RNA genes". One application of bioinformatics uses predicted RNA secondary structures in searching a genome for noncoding but functional forms of RNA. For example, microRNAs have canonical long stem-loop structures interrupted by small internal loops.

RNA secondary structure applies in RNA splicing in certain species. In humans and other tetrapods, it has been shown that without the U2AF2 protein, the splicing process is inhibited. However, in zebrafish and other teleosts the RNA splicing process can still occur on certain genes in the absence of U2AF2. This may be because 10% of genes in zebrafish have alternating TG and AC base pairs at the 3' splice site (3'ss) and 5' splice site (5'ss) respectively on each intron, which alters the secondary structure of the RNA. This suggests that secondary structure of RNA can influence splicing, potentially without the use of proteins like U2AF2 that have been thought to be required for splicing to occur. [15]

Secondary structure determination

RNA secondary structure can be determined from atomic coordinates (tertiary structure) obtained by X-ray crystallography, often deposited in the Protein Data Bank. Current methods include 3DNA/DSSR [16] and MC-annotate. [17]

See also

Related Research Articles

<span class="mw-page-title-main">Base pair</span> Unit consisting of two nucleobases bound to each other by hydrogen bonds

A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA and RNA. Dictated by specific hydrogen bonding patterns, "Watson–Crick" base pairs allow the DNA helix to maintain a regular helical structure that is subtly dependent on its nucleotide sequence. The complementary nature of this based-paired structure provides a redundant copy of the genetic information encoded within each strand of DNA. The regular structure and data redundancy provided by the DNA double helix make DNA well suited to the storage of genetic information, while base-pairing between DNA and incoming nucleotides provides the mechanism through which DNA polymerase replicates DNA and RNA polymerase transcribes DNA into RNA. Many DNA-binding proteins can recognize specific base-pairing patterns that identify particular regulatory regions of genes.

<span class="mw-page-title-main">DNA</span> Molecule that carries genetic information

Deoxyribonucleic acid is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of all known organisms and many viruses. DNA and ribonucleic acid (RNA) are nucleic acids. Alongside proteins, lipids and complex carbohydrates (polysaccharides), nucleic acids are one of the four major types of macromolecules that are essential for all known forms of life.

<span class="mw-page-title-main">Nucleic acid</span> Class of large biomolecules essential to all known life

Nucleic acids are biopolymers, macromolecules, essential to all known forms of life. They are composed of nucleotides, which are the monomer components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). If the sugar is ribose, the polymer is RNA; if the sugar is deoxyribose, a version of ribose, the polymer is DNA.

<span class="mw-page-title-main">RNA</span> Family of large biological molecules

Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself or by forming a template for the production of proteins. RNA and deoxyribonucleic acid (DNA) are nucleic acids. The nucleic acids constitute one of the four major macromolecules essential for all known forms of life. RNA is assembled as a chain of nucleotides. Cellular organisms use messenger RNA (mRNA) to convey genetic information that directs synthesis of specific proteins. Many viruses encode their genetic information using an RNA genome.

An inverted repeat is a single stranded sequence of nucleotides followed downstream by its reverse complement. The intervening sequence of nucleotides between the initial sequence and the reverse complement can be any length including zero. For example, 5'---TTACGnnnnnnCGTAA---3' is an inverted repeat sequence. When the intervening length is zero, the composite sequence is a palindromic sequence.

<span class="mw-page-title-main">Nucleobase</span> Nitrogen-containing biological compounds that form nucleosides

Nucleobases are nitrogen-containing biological compounds that form nucleosides, which, in turn, are components of nucleotides, with all of these monomers constituting the basic building blocks of nucleic acids. The ability of nucleobases to form base pairs and to stack one upon another leads directly to long-chain helical structures such as ribonucleic acid (RNA) and deoxyribonucleic acid (DNA). Five nucleobases—adenine (A), cytosine (C), guanine (G), thymine (T), and uracil (U)—are called primary or canonical. They function as the fundamental units of the genetic code, with the bases A, G, C, and T being found in DNA while A, G, C, and U are found in RNA. Thymine and uracil are distinguished by merely the presence or absence of a methyl group on the fifth carbon (C5) of these heterocyclic six-membered rings. In addition, some viruses have aminoadenine (Z) instead of adenine. It differs in having an extra amine group, creating a more stable bond to thymine.

<span class="mw-page-title-main">Peptide nucleic acid</span> Biological molecule

Peptide nucleic acid (PNA) is an artificially synthesized polymer similar to DNA or RNA.

In a chain-like biological molecule, such as a protein or nucleic acid, a structural motif is a common three-dimensional structure that appears in a variety of different, evolutionarily unrelated molecules. A structural motif does not have to be associated with a sequence motif; it can be represented by different and completely unrelated sequences in different proteins or RNA.

<span class="mw-page-title-main">Hoogsteen base pair</span>

A Hoogsteen base pair is a variation of base-pairing in nucleic acids such as the A•T pair. In this manner, two nucleobases, one on each strand, can be held together by hydrogen bonds in the major groove. A Hoogsteen base pair applies the N7 position of the purine base and C6 amino group, which bind the Watson–Crick (N3–C4) face of the pyrimidine base.

<span class="mw-page-title-main">Triple-stranded DNA</span> DNA structure

Triple-stranded DNA is a DNA structure in which three oligonucleotides wind around each other and form a triple helix. In triple-stranded DNA, the third strand binds to a B-form DNA double helix by forming Hoogsteen base pairs or reversed Hoogsteen hydrogen bonds.

<span class="mw-page-title-main">Stem-loop</span> Intramolecular base-pairing pattern in RNA and DNA

Stem-loop intramolecular base pairing is a pattern that can occur in single-stranded RNA. The structure is also known as a hairpin or hairpin loop. It occurs when two regions of the same strand, usually complementary in nucleotide sequence when read in opposite directions, base-pair to form a double helix that ends in an unpaired loop. The resulting structure is a key building block of many RNA secondary structures. As an important secondary structure of RNA, it can direct RNA folding, protect structural stability for messenger RNA (mRNA), provide recognition sites for RNA binding proteins, and serve as a substrate for enzymatic reactions.

<span class="mw-page-title-main">Nucleic acid double helix</span> Structure formed by double-stranded molecules

In molecular biology, the term double helix refers to the structure formed by double-stranded molecules of nucleic acids such as DNA. The double helical structure of a nucleic acid complex arises as a consequence of its secondary structure, and is a fundamental component in determining its tertiary structure.The structure was discovered by Rosalind Franklin and her student Raymond Gosling, but the term "double helix" entered popular culture with the publication in 1968 of The Double Helix: A Personal Account of the Discovery of the Structure of DNA by James Watson.

<span class="mw-page-title-main">Biomolecular structure</span> 3D conformation of a biological sequence, like DNA, RNA, proteins

Biomolecular structure is the intricate folded, three-dimensional shape that is formed by a molecule of protein, DNA, or RNA, and that is important to its function. The structure of these molecules may be considered at any of several length scales ranging from the level of individual atoms to the relationships among entire protein subunits. This useful distinction among scales is often expressed as a decomposition of molecular structure into four levels: primary, secondary, tertiary, and quaternary. The scaffold for this multiscale organization of the molecule arises at the secondary level, where the fundamental structural elements are the molecule's various hydrogen bonds. This leads to several recognizable domains of protein structure and nucleic acid structure, including such secondary-structure features as alpha helixes and beta sheets for proteins, and hairpin loops, bulges, and internal loops for nucleic acids. The terms primary, secondary, tertiary, and quaternary structure were introduced by Kaj Ulrik Linderstrøm-Lang in his 1951 Lane Medical Lectures at Stanford University.

Nucleic acid structure prediction is a computational method to determine secondary and tertiary nucleic acid structure from its sequence. Secondary structure can be predicted from one or several nucleic acid sequences. Tertiary structure can be predicted from the sequence, or by comparative modeling.

Therapeutic gene modulation refers to the practice of altering the expression of a gene at one of various stages, with a view to alleviate some form of ailment. It differs from gene therapy in that gene modulation seeks to alter the expression of an endogenous gene whereas gene therapy concerns the introduction of a gene whose product aids the recipient directly.

<span class="mw-page-title-main">Nucleic acid tertiary structure</span> Three-dimensional shape of a nucleic acid polymer

Nucleic acid tertiary structure is the three-dimensional shape of a nucleic acid polymer. RNA and DNA molecules are capable of diverse functions ranging from molecular recognition to catalysis. Such functions require a precise three-dimensional structure. While such structures are diverse and seemingly complex, they are composed of recurring, easily recognizable tertiary structural motifs that serve as molecular building blocks. Some of the most common motifs for RNA and DNA tertiary structure are described below, but this information is based on a limited number of solved structures. Many more tertiary structural motifs will be revealed as new RNA and DNA molecules are structurally characterized.

<span class="mw-page-title-main">Nucleic acid structure</span> Biomolecular structure of nucleic acids such as DNA and RNA

Nucleic acid structure refers to the structure of nucleic acids such as DNA and RNA. Chemically speaking, DNA and RNA are very similar. Nucleic acid structure is often divided into four different levels: primary, secondary, tertiary, and quaternary.

<span class="mw-page-title-main">Complementarity (molecular biology)</span> Lock-and-key pairing between two structures

In molecular biology, complementarity describes a relationship between two structures each following the lock-and-key principle. In nature complementarity is the base principle of DNA replication and transcription as it is a property shared between two DNA or RNA sequences, such that when they are aligned antiparallel to each other, the nucleotide bases at each position in the sequences will be complementary, much like looking in the mirror and seeing the reverse of things. This complementary base pairing allows cells to copy information from one generation to another and even find and repair damage to the information stored in the sequences.

<span class="mw-page-title-main">Triple helix</span> Set of three congruent geometrical helices with the same axis

In the fields of geometry and biochemistry, a triple helix is a set of three congruent geometrical helices with the same axis, differing by a translation along the axis. This means that each of the helices keeps the same distance from the central axis. As with a single helix, a triple helix may be characterized by its pitch, diameter, and handedness. Examples of triple helices include triplex DNA, triplex RNA, the collagen helix, and collagen-like proteins.

Non-canonical base pairs are planar hydrogen bonded pairs of nucleobases, having hydrogen bonding patterns which differ from the patterns observed in Watson-Crick base pairs, as in the classic double helical DNA. The structures of polynucleotide strands of both DNA and RNA molecules can be understood in terms of sugar-phosphate backbones consisting of phosphodiester-linked D 2’ deoxyribofuranose sugar moieties, with purine or pyrimidine nucleobases covalently linked to them. Here, the N9 atoms of the purines, guanine and adenine, and the N1 atoms of the pyrimidines, cytosine and thymine, respectively, form glycosidic linkages with the C1’ atom of the sugars. These nucleobases can be schematically represented as triangles with one of their vertices linked to the sugar, and the three sides accounting for three edges through which they can form hydrogen bonds with other moieties, including with other nucleobases. The side opposite to the sugar linked vertex is traditionally called the Watson-Crick edge, since they are involved in forming the Watson-Crick base pairs which constitute building blocks of double helical DNA. The two sides adjacent to the sugar-linked vertex are referred to, respectively, as the Sugar and Hoogsteen edges.

References

  1. Dirks, Robert M.; Lin, Milo; Winfree, Erik & Pierce, Niles A. (2004). "Paradigms for computational nucleic acid design". Nucleic Acids Research . 32 (4): 1392–1403. doi:10.1093/nar/gkh291. PMC   390280 . PMID   14990744.
  2. Yakovchuk, Peter; Protozanova, Ekaterina; Frank-Kamenetskii, Maxim D. (2006). "Base-stacking and base-pairing contributions into thermal stability of the DNA double helix". Nucleic Acids Research. 34 (2): 564–574. doi:10.1093/nar/gkj454. PMC   1360284 . PMID   16449200.
  3. Breslauer KJ, Frank R, Blöcker H, Marky LA (1986). "Predicting DNA duplex stability from the base sequence". PNAS. 83 (11): 3746–3750. Bibcode:1986PNAS...83.3746B. doi: 10.1073/pnas.83.11.3746 . PMC   323600 . PMID   3459152.
  4. Richard Owczarzy (2008-08-28). "DNA melting temperature - How to calculate it?". High-throughput DNA biophysics. owczarzy.net. Archived from the original on 2015-04-30. Retrieved 2008-10-02.
  5. Alberts; et al. (1994). The Molecular Biology of the Cell. New York: Garland Science. ISBN   978-0-8153-4105-5.
  6. Pabo C, Sauer R (1984). "Protein-DNA recognition". Annu Rev Biochem. 53: 293–321. doi:10.1146/annurev.bi.53.070184.001453. PMID   6236744.
  7. 1 2 Chen JL, Greider CW (2005). "Functional analysis of the pseudoknot structure in human telomerase RNA". Proc Natl Acad Sci USA. 102 (23): 8080–5. Bibcode:2005PNAS..102.8080C. doi: 10.1073/pnas.0502259102 . PMC   1149427 . PMID   15849264.
  8. Rivas E, Eddy SR (1999). "A dynamic programming algorithm for RNA structure prediction including pseudoknots". J Mol Biol. 285 (5): 2053–2068. arXiv: physics/9807048 . doi:10.1006/jmbi.1998.2436. PMID   9925784. S2CID   2228845.
  9. Staple, David W.; Butcher, Samuel E. (2005-06-14). "Pseudoknots: RNA Structures with Diverse Functions". PLOS Biol. 3 (6): e213. doi:10.1371/journal.pbio.0030213. ISSN   1545-7885. PMC   1149493 . PMID   15941360.
  10. Doudna, Jennifer A.; Ferré-D'Amaré, Adrian R.; Zhou, Kaihong (October 1998). "Crystal structure of a hepatitis delta virus ribozyme". Nature. 395 (6702): 567–574. Bibcode:1998Natur.395..567F. doi:10.1038/26912. PMID   9783582. S2CID   4359811.
  11. Lai, Michael M. C. (1995-06-01). "The Molecular Biology of Hepatitis Delta Virus". Annual Review of Biochemistry. 64 (1): 259–286. doi:10.1146/annurev.bi.64.070195.001355. ISSN   0066-4154. PMID   7574482.
  12. Xia T, SantaLucia J Jr, Burkard ME, Kierzek R, Schroeder SJ, Jiao X, Cox C, Turner DH (October 1998). "Thermodynamic parameters for an expanded nearest-neighbor model for formation of RNA duplexes with Watson-Crick base pairs". Biochemistry. 37 (42): 14719–35. CiteSeerX   10.1.1.579.6653 . doi:10.1021/bi9809425. PMID   9778347.
  13. Mathews DH, Disney MD, Childs JL, Schroeder SJ, Zuker M, Turner DH (May 2004). "Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure". PNAS. 101 (19): 7287–92. Bibcode:2004PNAS..101.7287M. doi: 10.1073/pnas.0401799101 . PMC   409911 . PMID   15123812.
  14. Zuker, M. (1989-04-07). "On finding all suboptimal foldings of an RNA molecule". Science. 244 (4900): 48–52. Bibcode:1989Sci...244...48Z. doi:10.1126/science.2468181. ISSN   0036-8075. PMID   2468181.
  15. Lin, Chien-Ling; Taggart, Allison J.; Lim, Kian Huat; Cygan, Kamil J.; Ferraris, Luciana; Creton, Robert; Huang, Yen-Tsung; Fairbrother, William G. (13 November 2015). "RNA structure replaces the need for U2AF2 in splicing". Genome Research. 26 (1): 12–23. doi:10.1101/gr.181008.114. PMC   4691745 . PMID   26566657.
  16. Lu, XJ; Bussemaker, HJ; Olson, WK (2 December 2015). "DSSR: an integrated software tool for dissecting the spatial structure of RNA". Nucleic Acids Research. 43 (21): e142. doi:10.1093/nar/gkv716. PMC   4666379 . PMID   26184874.
  17. "MC-Annotate". www-lbit.iro.umontreal.ca.