About TSRI
Research & Faculty
News & Publications
Scientific Calendars
Scripps Florida
PhD Program
Campus Services
Work at TSRI
TSRI in the Community
Giving to TSRI
Directory
Library
Contact
Site Map & Search
TSRI Home

Scientific Report 2005


Molecular Biology




Computation and Visualization in Structural Biology


A.J. Olson, D.S. Goodsell, M.F. Sanner, A. Gillet, Y. Hu, R. Huey, C. Li, S. Karnati, W. Lindstrom, G.M. Morris, A. Omelchenko, M. Pique, B. Norledge, R. Rosenstein, D. Stoffler, Y. Zhao

In the Molecular Graphics Laboratory, we develop novel computational methods to analyze, understand, and communicate the structure and interactions of complex biomolecular systems. This past year, we showed the effectiveness of 3-dimensional molecular models as a tangible human-computer interface in educational and research settings. Within our component-based visualization environment, we continue to develop methods for predicting biomolecular interactions, analyzing biomolecular structure and function, and presenting the biomolecular world in education and outreach.

We have applied these methods to several important systems in human health and welfare. We continue the search for inhibitors of HIV protease to fight the growing problem of drug resistance in HIV disease. We used AutoDock, a suite of programs for predicting bound conformations and binding energies for biomolecular complexes, in the virtual screening of large databases of compounds and ultimately identified new compounds for use in the treatment of cancer. We used methods for predicting protein interactions to probe the mechanism of blood coagulation.

Tangible Interfaces for Structural Biology

We are using the evolving technology of computer autofabrication (“3-dimensional printing”) to produce physical models of complex molecular assemblies (Fig. 1).

Fig. 1. A sample of the molecular models built by using automated fabrication techniques shows a wide range of molecular representations, scales, and sizes.

With this technology, a physical model based on a virtual computer model is built up layer by layer. The great advantage of autofabrication is that nearly any shape can be built; the shape is limited only by the imagination of the researcher and the structural integrity of the building material. We have used 2 technologies: 1 that is much like using a hot glue gun, in which the model is built from layers of molten plastic, and 1 in which gypsum powder and colored binders applied with an ink jet technology are used to create full-color models.

In collaboration with the Human Interfaces Technology Laboratory at the University of Washington, Seattle, Washington, we developed an augmented reality environment that embeds these 3-dimensional models within the virtual environment of the computer. The goal of this technology is to create a sense of user presence in a computational interaction, combining the intuitive tactile interaction of model manipulation with the rich bioinformatics and visualization tools that are available in the computer environment. As shown in Figure 2, the augmented reality environment tracks the position of the model, displaying a video image of the model and user and overlaying a computer-generated image that is spatially registered with the model as the user manipulates and explores the structure.

Fig. 2. Top, The augmented reality environment. The user holds the model under a FireWire camera. Bottom, A video image of the model is displayed on the computer screen, with an overlaid computer-generated image. Here, the electrostatic potential and field of superoxide dismutase are shown with volume-rendered clouds and small animated arrows.

In tests of the model, high school and college students reported that they experienced a compelling sense of realism of the virtual object and enhanced interaction with the subject matter.

We use the program Python Molecule Viewer to create a diverse range of different representations for both our virtual molecular objects and our tangible models, simplifying integration of the models with the virtual environment. Python Molecule Viewer allows us to combine backbone representation, atomic representations, and surfaces and to incorporate markers for spatial tracking. We are also using computer-aided design and manufacturing methods to design mechanical connectors and magnetic fittings that incorporate aspects of flexibility and interaction into the models. Vision, the visual programming interface, is used to integrate nonmolecular features and properties, such as electrostatics and hydrophobicity, into the virtual and physical environment.

Component-Based Visualization Environment

To facilitate the integration and interoperation of computational models and techniques from a wide variety of scientific disciplines, we continue to expand our component-based software environment. The environment is centered on Python, a high-level, object-oriented, interpretive programming language. This approach allows the compartmentalization and reuse of software components. Python provides a powerful “glue” for assembling computational components and, at the same time, a flexible language for the interactive scripting of new applications.

We recently added a visual programming environment, Vision, that supports the interactive and visual combination of computational nodes into networks that correspond to algorithms coded at a high level (Fig. 3).

Fig. 3. Vision, a visual programming environment, allows users to build networks of visualization software, creating new computational pipelines and novel visualizations of data. The canvas is shown at the center, where users interactively combine computational nodes. The network shown is a visualization of an electron micrograph reconstruction of a virus, colored by the radial depth and with a sector removed to show the interior structure.

Vision provides nonprogrammers an intuitive interface for building networks that describe new computational pipelines and novel visualizations of data. The basic molecular visualization methods of Python Molecule Viewer, a molecular symmetry generator, and a volume-rendering method are a few of the currently available nodes, and new nodes are easy to create in the Python language. The combination of the visual programming model and the ability to interactively inspect and edit nodes written in a high-level language creates an unprecedented number of levels at which users can interact with the program. The software tools developed by using our software components have been distributed to more than 10,600 users, with an average of 250 downloads a month during the past year.

We released a new version of our software tools in December 2004 that contains a large number of improvements and additions. In particular, we streamlined our distribution mechanism and included concurrent versioning system entries that allow users to update the software once it has been installed. We fixed several bugs and added new packages, including mesh decimation algorithms and support for manipulating and visualizing volumetric data. In addition, we increased the number of tests that are run on a nightly basis to more than 2500.

Modeling of Flexibility

In a project funded by the National Institutes of Health, we developed Flexibility Tree, a hierarchical and multiresolution representation of the flexibility of biological macromolecules that can be used in computational simulations. With this software, a user can encode a small subset of a protein’s conformational subspace. After implementing the core infrastructure of Flexibility Tree and integrating it with Python Molecule Viewer and Vision, we are building such trees for molecular systems, including HIV type 1 protease and protein kinases.

A number of laboratories around the world have developed software tools for extracting the information that describes how the various parts of proteins move relative to each other. We are now using Flexibility Tree to assess the quality of the decomposition of the protein structure into rigid bodies provided by these tools as well as the accuracy of the motions calculated by using these methods. Early results indicate that when small local perturbations are allowed in addition to the motions predicted by these tools, the Flexibility Tree covers a conformational space that includes both open and closed conformations of our test systems with accuracy sufficient for docking experiments. Our next step will be to design prototype docking tools that can include protein flexibility based on the Flexibility Tree.

Virtual Screening with Autodock

We have developed new interactive tools to streamline the process of virtual screening in AutoDock. With these tools, users can perform docking experiments to evaluate the binding of a database of molecules with a particular macromolecule of interest. In collaboration with I.A. Wilson, Department of Molecular Biology, we used the method to discover new inhibitors for aminoimidazole carboxamide ribonucleotide transformylase, a target for new cancer chemotherapeutic agents. The diversity set from the National Cancer Institute was screened, and 44 potential candidates were identified. In vitro inhibition assays indicated that 8 of the 44 were soluble compounds, had chemical scaffolds that differed from the general folate template, and caused inhibition when used in micromolar concentrations. Currently, we are optimizing the lead candidates; our goal is to obtain novel nonfolate inhibitors.

AutoDock is currently used in more than 3200 academic and commercial laboratories worldwide. We continued development of AutoDock by testing a new empirical free-energy force field. The force field incorporates a charge-based model for evaluation of hydrophobicity and an improved method for evaluating the geometry of hydrogen bonding. The force field was calibrated by using a set of 138 protein complexes of known structure taken from the Ligand Protein Database from the laboratory of C.L. Brooks, Department of Molecular Biology. We anticipate that the revised AutoDock, which incorporates this new force field and methods for selective flexibility in the protein target, will be released in 2005.

We also used AutoDock to predict intermolecular interactions in several biological systems. In collaboration with C.F. Barbas, Department of Molecular Biology, we investigated the binding of peptides to the catalytic aldolase antibody 93F3. To explore the large conformational space available to these peptides, we used a divide-and-conquer approach that separates the search space into searchable blocks. In studies with G. Legge, University of Texas, Austin, Texas, we explored the interaction between the cytoplasmic tail of tissue factor and the WW domain of proline isomerase PIN1, focusing on the interaction of several key phosphoserine residues.

Fighting Drug Resistance in HIV Disease

We are continuing our work on inhibitors to fight drug resistance in the treatment of AIDS (Fig. 4).

Fig. 4. The predicted bound conformation of sanguinarine, a potential lead compound for the development of novel HIV protease inhibitors.

In collaboration with K.B. Sharpless and C.-H. Wong, Department of Chemistry, we have focused on the design of inhibitors that assemble within the active site of HIV protease. We showed that the triazole formed in the click chemistry reaction is an effective mimic for the peptide group in traditional inhibitors, forming similar hydrogen-bonding interactions.

Currently, we are moving the FightAIDS@Home system from an outside provider to a new server strategy that will be implemented in the Molecular Graphics Laboratory. FightAIDS@Home enlists the worldwide community in a large computational effort to design effective therapeutic agents to fight AIDS. Personal computers are used in the program when the computers are not in use by their owners, providing an enormous, and largely untapped, computational resource. The current goal is to identify inhibitors that are effective against the wild-type virus and against common mutant forms of the virus. The large computational resources provided by FightAIDS@Home enables the screening of large databases of compounds and use of multiple mutant targets, allowing estimation of the potential of a compound to remain effective when viral mutations occur that cause resistance to drugs currently used to treat HIV disease.

Predicting Protein-Protein Interactions

With the goal of creating a comprehensive tool for predicting protein-protein interactions, we incorporated both SurfDock and AutoDock into the Python programming environment. SurfDock uses a variable-resolution spherical harmonics representation to find candidate orientations, and AutoDock is then used to explore local atomic rearrangements at the interface. We tested the method on a set of 59 protein-protein complexes of known structure and optimized the level of smoothing used in the spherical harmonics approximation of the molecular surfaces. The results of the docking test depended on the force field used to score possible orientations. The best results were obtained with a residue-based pair-wise potential of mean force.

Visual Methods from Atoms to Cells

Understanding structural molecular biology is essential to foster progress and critical decision making among students, policy makers, and the general public. In the past year, we continued our longstanding commitment to science education and outreach with a combination of presentations, popular and professional illustrations and animation, 3-dimensional tangible models, and a presence on the Worldwide Web. In these projects, we use the diverse visualization tools developed in the Molecular Graphics Laboratory to disseminate results that range from atomic structure to cellular function.

We created a 3-dimensional model that demonstrates viral assembly. The model is composed of pentamers from the structure of poliovirus, with embedded magnets on the interacting faces. When 12 or more of these pentamer models are placed in a closed container and gently shaken, they self-assemble in a matter of seconds to form a spherical capsid.

We also continued several regular features that informally present molecular structure and function. The “Molecule of the Month” at the Protein Data Bank (http://www.rcsb.org/pdb) provides an accessible introduction to this central database of biomolecular structure. Each month, a new molecule is presented with a description of its structure, function, and relevance to health and welfare (Fig. 5).

Fig. 5. Three different types of catalase. Catalase was presented as a Molecule of the Month in 2004 after a request from a high school teacher.

Visitors are then given suggestions about how to begin their own exploration of the structures in the data bank. Currently, we are collaborating with T. Herman, Milwaukee School of Engineering, Milwaukee, Wisconsin, to combine material from the “Molecule of the Month” with 3-dimensional models and multimedia tutorials to create educational modules for use at high school and college levels. Other projects include “The Molecular Perspective,” articles in the journal The Oncologist that present structures of interest to clinical oncologists and provide a source of continuing education for physicians, and “Recognition in Action,” a new series in the Journal of Molecular Recognition.

Publications

Berman, H.M., Ten Eyck, L.F., Goodsell, D.S., Haste, N.M., Kornev, A. Taylor, S.S. The cAMP binding domain: an ancient signaling module. Proc. Natl. Acad. Sci. U. S. A. 102:45, 2005.

Brik, A., Alexandros, J., Lin, Y.-C., Elder, J.H., Olson, A.J., Wlodawer, A., Goodsell, D.S., Wong, C.-H. 1,2,3-Triazole as a peptide surrogate in the rapid synthesis of HIV protease inhibitors. Chembiochem 6:1167, 2005.

Gillet, A., Sanner, M., Stoffler, D., Goodsell, D.S., Olson, A.J. Augmented reality with tangible auto-fabricated models for molecular biology applications. In: IEEE Visualization: Proceedings of the Conference on Visualization ’04. IEEE Computer Society, Washington, DC, 2004, p. 235.

Gillet, A., Sanner, M., Stoffler, D., Olson, A. Tangible augmented interfaces for structural molecular biology. IEEE Comput. Graph. Appl. 25:13, 2005.

Gillet, A., Sanner, M., Stoffler, D., Olson, A. Tangible interfaces for structural molecular biology. Structure (Camb.) 13:483, 2005.

Goodsell, D.S. Computational docking of biomolecular complexes with AutoDock. In: Protein-Protein Interactions: A Molecular Cloning Manual, 2nd ed. Golemis, E., Adams, P. (Eds.). Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, in press.

Goodsell, D.S. The molecular perspective: cyclins. Oncologist 9:592, 2004; Stem Cells 22:1121, 2004.

Goodsell, D.S. The molecular perspective: cytochrome c and apoptosis. Oncologist 9:226, 2004; Stem Cells 22:428, 2004.

Goodsell, D.S. The molecular perspective: L-asparaginase. Oncologist 10:238, 2005; Stem Cells 23:710, 2005.

Goodsell, D.S. The molecular perspective: major histocompatibility complex. Oncologist 10:80, 2005; Stem Cells 23:454, 2005.

Goodsell, D.S. The molecular perspective: morphine. Oncologist 9:717, 2004; Stem Cells 23:144, 2005.

Goodsell, D.S. The molecular perspective: nicotine and nitrosamines. Oncologist 9:353, 2004; Stem Cells 22:645, 2004.

Goodsell, D.S. The molecular perspective: polycyclic aromatic hydrocarbons. Oncologist 9:469, 2004; Stem Cells 22:873, 2004.

Goodsell, D.S. Recognition in action: flipping pyrimidine dimers. J. Mol. Recognit. 18:193, 2005.

Goodsell, D.S. Representing structural information. In: Current Protocols in Bioinformatics. Baxeranis, A.D., Davison, D.B. (Eds.). Wiley & Sons, Hoboken, NJ, in press.

Goodsell, D.S. Visual methods from atoms to cells. Structure (Camb.) 13:347, 2005.

Li, C., Xu, L., Wolan, D.W., Wilson, I.A., Olson, A.J. Virtual screening of human 5-aminoimidazole-4-carboxamide ribonucleotide transformylase against the NCI diversity set by use of AutoDock to identify novel nonfolate inhibitors. J. Med. Chem. 47:6681, 2004.

Sanner, M.F. A component-based software environment for visualizing large macromolecular assemblies. Structure (Camb.) 13:447, 2005.

Sanner, M.F. Using the Python programming language for bioinformatics. In: Encyclopedia of Genetics, Genomics, Proteomics and Bioinformatics. Jorde, L.B., Little, P.F.R., Dunn, M.J., et al. (Eds.). Wiley & Sons, Hoboken, NJ, in press.

Zhu, X., Tanaka, F., Hu, Y., Heine, A., Fuller, R., Zhong, G., Olson, A.J., Lerner, R.A., Barbas, C.F. III, Wilson, I.A. The origin of enantioselectivity in aldolase antibodies: crystal structure, site-directed mutagenesis, and computational analysis. J. Mol. Biol. 343:1269, 2004.

 

Arthur J. Olson, Ph.D.

Professor

David S. Goodsell Jr., Ph.D.
Associate Professor

Michel Sanner, Ph.D.
Associate Professor



Faculty