| Biopolymer |

Nucleic acids carry the genetic information required for the correct functioning and reproduction of living organisms. In eukaryotic organisms the cells contain a nucleus in which the genome DNA is distributed across a number of chromosomes. In prokaryotic organisms the DNA is not contained in a separate organelle such as a nucleus. Retroviral genomic information is contained in RNA.

The definition of nucleic acids
Nucleic acids are linear polymers of four basic subunits called nucleotides. A nucleotide consists of a five-carbon furanose sugar joined to both a phosphate and a planar nitrogenous base. The phosphate and the nitrogenous base are bonded to the O5' and C1' ends of the sugar, respectively (see Figure 4). The sugar atom names end with a prime to distinguish them from base atom names.
There are two types of nucleic acids: ribonucleic acid (RNA) using the sugar ribose, and deoxyribonucleic acid (DNA) using the related sugar 2-deoxyribose. A nucleic acid polymer or strand is formed by joining the free 3'OH of nucleotide n with the phosphate group of nucleotide n+1.
Five different bases are found in nucleic acids: the two purines adenine and guanine and the three pyrimidines: cytosine, uracil, and thymine (Figure 3). Adenine, cytosine, and guanine are used in both RNA and DNA. Uracil is used only in RNA and is replaced by thymine in DNA. Adenine and guanine are attached to the sugar at the N9 position; cytosine, uracil, and thymine at N1. This arrangement permits some very specific and favorable hydrogen bonding between certain base pairs: adenine with uracil (or thymine) and cytosine with guanine.
|
Nucleic acids adopt helical structures containing one to four strands (chains). DNA is normally double-stranded or duplex with the chains oriented anti-parallel to each other. These strands are joined by Watson-Crick or ordinary base pairing. RNA is generally single stranded. However, most RNAs are folded into complex shapes containing numerous intra-strand Watson-Crick helices (Saenger 1984). DNA or RNA duplexes, wherein one strand is all purines and the other strand is all pyrimidines, can accept a third strand of pyrimidines oriented parallel to the purine strand. This strand uses Hoogsten base pairing (Watson 1976). Short synthetic sequences containing only guanines can form a four-stranded parallel helix.
Nucleic acids are highly symmetric molecules. A base and its Watson-Crick complement exhibit pseudo-dyad symmetry. In addition, the base pair at n is closely related to the pair at n + 1 by a coordinate transformation. Thus, to a first approximation, the backbones of both strands of a Watson-Crick duplex are identical. The third or Hoogsteen strand resembles the Watson-Crick pyrimidine strand, but differs enough so that it cannot be generated by a coordinate transformation.
) rotation. This displacement from the global helical axis results in two distinct grooves between the strands: a large major groove and a smaller minor groove.
Here we describe some of the tools available for building and manipulating nucleic acid models using the Biopolymer module. 
Methodology and implementation.
The Append command can create molecules with different geometries and numbers of strands (e.g., B-form DNA duplex connected to A-form DNA triplex and then back to B-form DNA duplex). The Append command always connects to the sense strand of the base pair it added. It connects the other strands only if the distance between the O3' and the P is less than 3 Å. Longer bonds must be made manually. The single stranded geometries provided by the Append command are merely the sense strand of the corresponding duplex. They are intended to build short single stranded regions of otherwise duplex molecules. This can be used to create gaps or overhangs. The geometries of all the nucleotide fragments are extracted from the work of Arnott (Arnott and Hukins 1972, Arnott et al. 1972, Arnott and Selsing 1974).
Whenever Insight II builds a duplex nucleic acid molecule (for example, called MYDNA), the structure is created in the "standard" form for nucleic acid, which has the following features:
1. The DNA is composed of two distinct molecules, one for each
strand
2. The first strand is called MYDNA_1.
3. The second strand is called MYDNA_2.
4. The two strands are in an assembly called MYDNA.
Many of the nucleic acid functions in Insight II require the structure to be in this standard form (for example the Measure command). If you import a nucleic acid structure from another program and it is not in the standard form, you need to put it into the standard form. This may require that you Unmerge the two strands, Rename them to conform to the standard form, and Associate the two molecules into an assembly.
Deleting or replacing nucleotides.
Use the Nucleotide/Delete command to remove a single nucleotide from one strand of a nucleic acid. The remaining residues (if any) are not renumbered. If the deleted base had both a 5' and 3' neighbor, the result is a gap, since the surrounding bases are not bonded.
Measuring nucleotide geometric properties.
The Nucleotide/Measure command (in the Nucleotide pulldown) is used to compute several angles and distances involving pairs of bases and the global helical axis. It is based on Dickerson's NEWHEL91 program suite.
Modifying existing nucleic acid structures
After nucleic acid molecules are built using the Append command (in the Nucleotide pulldown), the molecules should be properly terminated before further work (e.g., minimization) is done.