 |
|
Scientific Report 2005
Molecular Biology
Nuclear
Magnetic Resonance of 3-Dimensional Structure and Dynamics of Proteins in Solution
P.E.
Wright, H.J. Dyson, R. Burge, R. De Guzman, T. Dunzendorfer-Matt, J. Ferreon, N. Greenman,
T.-H. Huang, M. Kostic, J. Lansing, B. Lee, M. Landes, M. Martinez-Yamout,
T. Nishikawa, J. Wojciak, M. Zeeb, E. Manlapaz, L.L. Tennant, J. Chung, D.A.
Case, J. Gottesfeld, R. Evans,* M. Montminy*
* Salk Institute, La Jolla, California
We use multidimensional nuclear magnetic resonance (NMR) spectroscopy to investigate
the structures, dynamics, and interactions of proteins in solution. Such studies
are essential for understanding the mechanisms of action of these proteins and for
elucidating structure-function relationships. The focus of our current research
is protein-protein and proteinnucleic acid interactions involved in the regulation
of gene expression.
Transcription FactorNucleic Acid Complexes
NMR methods are being used to determine the 3-dimensional structures and intramolecular
dynamics of zinc finger motifs from several eukaryotic transcriptional regulatory
proteins, both free and complexed with target nucleic acid. Zinc fingers are among
the most abundant domains in eukaryotic genomes. They play a central role in the
regulation of gene expression at both the transcriptional and the posttranscriptional
levels, mediated through their interactions with DNA, RNA, or protein components
of the transcriptional machinery. The C2H2 zinc finger, first
identified in transcription factor IIIA (TFIIIA), is used by numerous transcription
factors to achieve sequence-specific recognition of DNA. There is growing evidence,
however, that some C2H2 zinc finger proteins control gene
expression both through their interactions with DNA regulatory elements and, at
the posttranscriptional level, by binding to RNA.
The best-characterized example of a C2H2 zinc finger protein that binds specifically
to both DNA and to RNA is TFIIIA, which contains 9 zinc fingers. We showed previously
that different subsets of zinc fingers are responsible for high-affinity binding
of TFIIIA to DNA (fingers 13) and to 5S RNA (fingers 46). To obtain
insights into the mechanism by which the TFIIIA zinc fingers recognize both DNA
and RNA, we are using NMR methods to determine the structures of the complex formed
by zf1-3 (a protein containing fingers 13) with DNA and by zf4-6 (a protein
consisting of fingers 46) with a fragment of 5S RNA.
Three-dimensional structures were determined previously for the complex of zf1-3 with the cognate
15-bp oligonucleotide duplex. The structures contain several novel features and
reveal that prevailing models of DNA recognition, which assume that zinc fingers
are independent modules that contact bases through a limited set of amino acids,
are outmoded.
In addition to its role in binding to and regulating the 5S RNA gene, TFIIIA also forms a complex
with the 5S RNA transcript. We recently determined the NMR structure of the complex
formed by zinc fingers 46 with a truncated form of 5S RNA (Fig. 1).
 |
| Fig. 1. Structure of zinc fingers 46 of TFIIIA bound to 5S RNA. The protein backbone is shown as a ribbon,
and the phosphate backbone and bases of the RNA are displayed as gray tubes. |
The structure
has provided important insights into the structural basis for 5S RNA recognition.
Finger 4 of the protein recognizes both the structure of the RNA backbone and the
specific bases in the loop E motif of the RNA, in a classic lock-and-key interaction.
Fingers 5 and 6, with a single residue between them, undergo mutual induced-fit
folding with the loop A region of the RNA, which is highly flexible in the absence
of the protein. NMR studies of 2 alternate splice variants
of the Wilms tumor zinc finger protein are in progress. These proteins differ only
through insertion of 3 additional amino acids (the tripeptide lysine-threonine-serine)
in the linker between fingers 3 and 4, yet have marked differences in their DNA-binding
properties and subcellular localization. 15N relaxation measurements
indicate that the insertion increases the flexibility of the linker between fingers
3 and 4 and abrogates binding of the fourth zinc finger to its cognate site in the
DNA major groove, thereby modulating DNA-binding activity. The x-ray structure of
the DNA complex has now been determined, and NMR studies of RNA binding are in progress.
We have also determined the structure of the first member of a novel class of C2H2
zinc finger proteins that bind specifically to double-stranded RNA.
Several novel zinc binding motifs have recently been identified that mediate gene expression at
the posttranscriptional level by regulating mRNA processing and metabolism. Regulatory
proteins of the TIS11 family bind specifically, through a pair of novel CCCH zinc
fingers, to the adenosine-uridinerich element in the 3´
untranslated region of short-lived cytokine, growth factor, and protooncogene mRNAs
and control expression by promoting rapid degradation of the message. We recently
determined the NMR structure of the complex formed between the tandem zinc finger
domain of TIS11d and its binding site on the adenosine-uridinerich element.
This structure showed sequence-specific recognition of single-stranded RNA through
formation of a network of hydrogen bonds between the polypeptide backbone and the
Watson-Crick edges of the bases.
Protein-Protein Interactions in Transcriptional Regulation
Transcriptional
regulation in eukaryotes relies on protein-protein interactions between DNA-bound
factors and coactivators that, in turn, interact with the basal transcription machinery.
The transcriptional coactivator CREB-binding protein (CBP) and its homolog p300
play an essential role in cell growth, differentiation, and development. Understanding
the molecular mechanisms by which CBP and p300 recognize their various target proteins
is of fundamental biomedical importance. CBP and p300 have been implicated in diseases
such as leukemia, cancer, and mental retardation and are novel targets for therapeutic
intervention.
We previously
determined the structure of the kinase-inducible activation domain of the transcription
factor CREB bound to its target domain (the KIX domain) in CBP. Ongoing work is
directed toward mapping the interactions between KIX and the transcriptional activation
domains of the proto-oncogene c-Myb and of the mixed-lineage leukemia protein. The
solution structure of the ternary complex composed of KIX, c-Myb, and the mixed-lineage
leukemia protein has been completed (Fig. 2) and provides insights into the structural
basis for the ability of the KIX domain to interact simultaneously and allosterically
with 2 different effectors. Our work has also provided new understanding of the
thermodynamics of the coupled folding and binding processes involved in interaction
of KIX with transcriptional activation domains.
 |
| Fig. 2. Structure of the ternary complex between the KIX domain of CBP (pale gray) and the transcriptional
activation domains of c-Myb and the mixed-lineage leukemia protein (MLL). |
Recently, we determined the structure of the complex between the hypoxia-inducible factor
Hif-1α and the CH1 domain of CBP. The interaction between Hif-1α
and CBP/p300 is of major therapeutic interest because of the central role Hif-1α
plays in tumor progression and metastasis; disruption of this interaction leads
to attenuation of tumor growth. A protein named CITED2 functions as a negative feedback
regulator of the hypoxic response by competing with Hif-1α for binding to the CH1 domain of CBP. We determined the structure of the complex
formed between CITED2 and the CH1 domain and were able to show that the CH1 domain
is folded into a stable 3-dimensional structure even in the absence of binding partners.
The intrinsically unstructured Hif-1α and CITED2 domains use partly overlapping surfaces of the CH1 motif to achieve high-affinity
binding and compete effectively with each other for CBP/p300. The structure of another
zinc-binding module of CBP, the ZZ domain, has a novel fold (Fig. 3), but its function
is not yet understood. We are continuing to map the multiplicity of interactions
between CBP/p300 domains and their numerous biological targets to understand the
complex interplay of interactions that mediate key biological processes in health
and disease.
 |
| Fig. 3. Structure of the
ZZ zinc finger domain of CBP. |
Publications
De Guzman,
R.N., Goto, N.K., Dyson, H.J., Wright, P.E. Structural
basis for cooperative transcription factor binding to the CBP coactivator. J. Mol.
Biol., in press.
De Guzman,
R.N., Wojciak, J.M., Martinez-Yamout, M.A., Dyson, H.J., Wright, P.E.
CBP/p300 TAZ1 domain forms a structural scaffold for ligand binding. Biochemistry
44:490, 2005.
Dyson,
H.J., Wright, P.E.
Intrinsically unstructured proteins and their function. Nat. Rev. Mol. Cell Biol.
6:197, 2005.
Gearhart,
M.D., Dickinson, L., Ehley, J., Melander, C., Dervan, P.B., Wright, P.E., Gottesfeld,
J.M. Inhibition of
DNA binding by human estrogen related receptor-2 and estrogen receptor α
with minor groove binding polyamides. Biochemistry 44:4196, 2005.
Legge, G.B., Martinez-Yamout, M.A., Hambly, D.M., Trinh, T., Lee, B.M., Dyson, H.J., Wright,
P.E. ZZ domain of CBP: an unusual zinc finger fold in a protein interaction module. J. Mol. Biol. 343:1081,
2004.
Möller, H.M., Martinez-Yamout, M.A., Dyson, H.J. Wright, P.E. Solution
structure of the N-terminal zinc fingers of the Xenopus laevis double-stranded
RNA-binding protein ZFa. J. Mol. Biol. 351:718, 2005.
Folding of Proteins and Protein Fragments
P.E.
Wright, H.J. Dyson, C. Nishimura, D. Felitsky, Y. Yao, J. Chung, L.L. Tennant,
V. Bychkova*
*
Institute of Protein Research, Puschino, Russia
The molecular mechanism by which proteins fold into their 3-dimensional structures remains
one of the most important unsolved problems in structural biology. Nuclear magnetic
resonance (NMR) spectroscopy is uniquely suited to provide information on the structure
of transient intermediates formed during protein folding. Previously, we used NMR
methods to show that many peptide fragments of proteins have a tendency to adopt
folded conformations in water solution. The presence of transiently populated folded
structures, including reverse turns, helices, nascent helices, and hydrophobic clusters,
in water solutions of short peptides has important implications for initiation of
protein folding. Formation of elements of secondary structure probably plays an
important role in the initiation of protein folding by reducing the number of conformations
that must be explored by the polypeptide chain and by directing subsequent folding
pathways.
Apomyoglobin Folding Pathway
A major program
in our laboratory is directed toward a structural and mechanistic description of
the apomyoglobin folding pathway. Previously, we used quenched-flow pulse labeling
methods in conjunction with 2-dimensional NMR spectroscopy to map the kinetic folding
pathway of the wild-type protein. With these methods, we showed that an intermediate
in which the A, G, and H helices adopt hydrogen-bonded secondary structure is formed
within 6 ms of the initiation of refolding. Folding then proceeds by stabilization
of structure in the B helix and then in the C and E helices. We are using carefully
selected myoglobin mutants and both optical stopped-flow spectroscopy and NMR methods
to further probe the kinetic folding pathway. For some of the mutants studied, the
changes in amino acid sequence resulted in changes in the folding pathway of the protein. These experiments
are providing novel insights into both the local and the long-range interactions
that stabilize the kinetic folding intermediate. Of particular importance, long-range
interactions have been observed that indicate nativelike packing of some of the
helices in the kinetic molten globule intermediate.
Apomyoglobin
provides a unique opportunity for detailed characterization of the structure and
dynamics of a protein-folding intermediate. Conditions were previously identified
under which the apomyoglobin molten globule intermediate is sufficiently stable
for acquisition of multidimensional heteronuclear NMR spectra. Analysis of 13C
and other chemical shifts and measurements of polypeptide dynamics provided unprecedented
insights into the structure of this state.
The A, G, and
H helices and part of the B helix are folded and form the core of the molten globule.
This core is stabilized by relatively nonspecific hydrophobic interactions that
restrict the motions of the polypeptide chain. Fluctuating helical structure is
formed in regions outside the core, although the population of helix is low and
the chain retains considerable flexibility. The F helix acts as a gate for heme
binding and only adopts stable structure in the fully folded holoprotein.
The acid-denatured
(unfolded) state of apomyoglobin is an excellent model for the fluctuating local
interactions that lead to the transient formation of unstable elements of secondary
structure and local hydrophobic clusters during the earliest stages of folding.
NMR data indicated substantial formation of helical secondary structure in the acid-denatured
state in regions that form the A and H helices in the folded protein and also revealed
nonnative structure in the D and E helix region.
Because the
A and H regions adopt stabilized helical structure in the earliest detectable folding
intermediate, these results lend strong support to folding models in which spontaneous
formation of local elements of secondary structure plays a role in initiating formation
of the A-[B]-G-H molten globule folding intermediate. In addition to formation of
transient helical structure, formation of local hydrophobic clusters has been detected
by using 15N relaxation measurements. Significantly, these clusters are
formed in regions where the average surface area buried upon folding is large. In
contrast to acid-denatured unfolded apomyoglobin, the urea-denatured state is largely
devoid of structure, although residual hydrophobic interactions have been detected
by using relaxation measurements.
We measured
residual dipolar couplings for unfolded states of apomyoglobin by using partial
alignment in strained polyacrylamide gels. These data provide novel insights into
the structure and dynamics of the unfolded polypeptide chain. We have shown that
the residual dipolar couplings arise from the well-known statistical properties
of flexible polypeptide chains. Residual dipolar couplings provide valuable insights
into the dynamic and conformational propensities of unfolded and partly folded states
of proteins and hold great promise for charting the upper reaches of protein-folding
landscapes.
To probe long-range
interactions in unfolded and partially folded states of apomyoglobin, we introduced
spin-label probes at several sites throughout the polypeptide chain. These experiments
led to the surprising discovery that structures with nativelike topology exist within
the ensemble of conformations formed by the acid-denatured state of apomyoglobin.
They also indicated that the packing of helices in the molten globule state is similar
to that in the native folded protein.
The view of
protein folding that results from our work on apomyoglobin is one in which collapse
of the polypeptide chain to form increasingly compact states leads to progressive
accumulation of secondary structure and increasing restriction of fluctuations in
the polypeptide backbone. Chain flexibility is greatest at the earliest stages of
folding, in which transient elements of secondary structure and local hydrophobic
clusters are formed. As the folding protein becomes increasingly compact, backbone
motions become more restricted, the hydrophobic core is formed and extended, and
nascent elements of secondary structure are progressively stabilized. The ordered
tertiary structure characteristic of the native protein, with well-packed side chains
and relatively low-amplitude local dynamics, appears to form rather late in folding.
We recently
introduced a variation on the classic quench-flow technique, which makes use of
the capabilities of modern NMR spectrometers and heteronuclear NMR experiments,
to study the proteins labeled along the folding pathway in an unfolded state in
an aprotic organic solvent. This method allows detection of many more amide proton
probes than in the classic method, which required formation of the fully folded
protein and the measurement of the proteins NMR spectrum in water solutions
(Fig. 1).
 |
| Fig. 1. High-resolution
view of the backbone structure of the 6.4-ms burst-phase kinetic folding intermediate
of apomyoglobin. The tube thickness and darkness indicate the extent of folding
into helical structure. Helices that are fully folded are indicated by thick, dark
tubes. Regions that are partly folded are intermediate in thickness and shade, and
regions of the protein that remain fully unstructured in the kinetic intermediate
are represented by thin lines. |
This method is particularly useful in documenting changes in the folding
pathway that result in the destabilization of parts of the protein in the molten
globule intermediate. We recently introduced
self-compensating mutations designed to change the amino acid sequence such that
the average area buried upon folding in the A and E helix regions is significantly
changed, while the 3-dimensional structure of the final folded state remains the
same. These studies indicated that the average area buried upon folding is an accurate
predictor of those parts of the apomyoglobin molecule that will fold first and participate
in the molten globule intermediate (Fig. 2).
 |
| Fig. 2. Correlation between average surface area buried upon folding (AABUF, gray line) and regions of apomyoglobin
that are folded in the kinetic burst-phase intermediate. Folded regions are indicated
by high values of the proton occupancy (A0, black circles). Data are shown for the
wild-type protein (A) and for a mutant protein (B) in which hydrophobic residues
are moved from the A helix into the E helix region, thereby changing the folding
pathway in a predictable manner.
|
Folding-Unfolding Transitions in Cellular Metabolism
Many species
of bacteria sense and respond to their own population density by an intricate autoregulatory
mechanism known as quorum sensing; the bacteria release extracellular signal molecules,
called autoinducers, for cell-cell communication within and between bacterial species.
A number of bacteria appear to use quorum sensing for regulation of gene expression
in response to fluctuations in cell population density. Processes regulated in this
way include symbiosis, virulence, competence, conjugation, production of antibiotics,
motility, sporulation, and formation of biofilms.
We determined
the 3-dimensional solution structure of a complex composed of the N-terminal 171
residues of the quorum-sensing protein SdiA of Escherichia coli and an autoinducer
molecule, N-octanoyl-1-homoserine lactone (HSL). The SdiA-HSL system shows
the folding switch behavior associated with quorum-sensing factors produced
by other bacterial species. In the presence of HSL, the SdiA protein is stable and
folded and can be produced in good yields from an E coli expression system.
In the absence of the autoinducer, the protein is expressed into inclusion bodies.
Samples of the SdiA-HSL complex can be denatured but cannot be refolded in aqueous
buffers. The solution structure of the complex provides a likely explanation for
this behavior. The autoinducer molecule is tightly bound in a deep pocket in the
hydrophobic core and is bounded by specific hydrogen bonds to the side chains of
conserved residues. The autoinducer thus forms an integral part of the hydrophobic
core of the folded SdiA.
Publications
Dyson,
H.J., Wright, P.E.
Elucidation of the protein folding landscape by NMR. Methods Enzymol. 394:299,
2005.
Dyson,
H.J., Wright, P.E.
Intrinsically unstructured proteins and their functions. Nat. Rev. Mol. Cell Biol.
6:197, 2005.
Nishimura,
C., Dyson, H.J., Wright, P.E. Enhanced
picture of protein-folding intermediates using organic solvents in H/D exchange
and quench-flow experiments. Proc. Natl Acad. Sci. U. S. A. 102:4765, 2005.
Nishimura,
C., Dyson, H.J. Wright, P.E.
Identification of native and nonnative structure in kinetic folding intermediates
of apomyoglobin. J. Mol. Biol., in press.
Nishimura,
C., Lietzow, M.A., Dyson H.J., Wright, P.E.
Sequence determinants of a protein folding pathway. J. Mol. Biol. 351:383, 2005.
|
 |