Crystal Structure of the Molybdenum Cofactor Biosynthesis Protein MobA from Escherichia coli at...

11
Structure, Vol. 8, 1115–1125, November, 2000, 2000 Elsevier Science Ltd. All rights reserved. PIIS0969-2126(00)00518-9 Crystal Structure of the Molybdenum Cofactor Biosynthesis Protein MobA from Escherichia coli at Near-Atomic Resolution pterin termed molybdopterin or MPT [1, 2] (see Figure 1a). Although this is the active form of the cofactor found in all eu- karyotic molybdoenzymes, many bacterial molybdoenzymes re- quire a modified form of the cofactor for activity. The modifica- tion reaction involves attachment of a mononucleotide to the Clare E. M. Stevenson,* Frank Sargent, Grant Buchanan, Tracy Palmer, and David M. Lawson* § * Department of Biological Chemistry and Department of Molecular Microbiology John Innes Centre terminal phosphate of molybdopterin to form a molybdopterin dinucleotide. The type of nucleotide modification is usually Norwich NR4 7UH United Kingdom specific to different molybdoenzyme families such that molyb- doenzymes of the dimethylsulphoxide (DMSO) reductase Centre for Metalloprotein Spectroscopy and Biology School of Biological Sciences group bind molybdopterin guanine dinucleotide (MGD) (see Figure 1b), whereas many bacterial members of molybdenum University of East Anglia Norwich NR4 7TJ hydroxylase family require the cytidine monophosphate (CMP)- modified form of molybdopterin (MCD) for function [1, 3]. Key United Kingdom MGD-requiring enzymes, such as nitrate reductase, play im- portant roles in anaerobic respiratory pathways, particularly in the enteric bacteria. Indeed, strains of Salmonella typhi defec- Summary tive in the synthesis of the molybdenum cofactor are impaired in their ability to infect host cells [4]. Background: All mononuclear molybdoenzymes bind molyb- denum in a complex with an organic cofactor termed molyb- The two-step pathway for MPT synthesis is highly conserved between the kingdoms and is best characterized in the Gram- dopterin (MPT). In many bacteria, including Escherichia coli, molybdopterin can be further modified by attachment of a GMP negative eubacterium Escherichia coli. In the first stage, a guanosine derivative, probably GTP, is rearranged to form pre- group to the terminal phosphate of molybdopterin to form molybdopterin guanine dinucleotide (MGD). This modification cursor Z [5]. In the second step, the molybdopterin synthase, encoded by the E. coli moaD and moaE genes, incorporates reaction is required for the functioning of many bacterial molyb- doenzymes, including the nitrate reductases, dimethylsulf- sulfur stepwise into precursor Z to generate MPT. The MoeB protein acts as direct sulfur donor for this reaction [6, 7]. The oxide (DMSO) and trimethylamine-N-oxide (TMAO) reductases, and formate dehydrogenases. The GMP attachment step is incorporation of molybdenum into MPT to form the molybde- num cofactor (Moco) is catalyzed by the homologous proteins catalyzed by the cellular enzyme MobA. MogA in E. coli, Cnx1 in plants, and gephyrin in mammals. These proteins bind molybdopterin with high affinity, and in Results: The crystal structure of the 21.6 kDa E. coli MobA has been determined by MAD phasing with selenomethionine- the case of Cnx1, the G domain has been shown to catalyze the formation of Moco from an as-yet-unknown form of Mo [8]. substituted protein and subsequently refined at 1.35 A ˚ resolu- tion against native data. The structure consists of a central, All of the characterized molybdoenzymes in E. coli require the GMP-modified form of Moco for activity. The GMP addition predominantly parallel b sheet sandwiched between two layers of a helices and resembles the dinucleotide binding Rossmann reaction is catalyzed by the MobA protein (also termed protein FA [9]). It is a reaction unique to prokaryotes, and therefore fold. One face of the molecule bears a wide depression that is lined by a number of strictly conserved residues, and this may represent a potential drug target. Although the identity of the GMP donor is not known, by analogy with the uridyl- and feature suggests that this is where substrate binding and catal- ysis take place. cytidyl-transferases [10, 11], GTP would seem to be the most likely. All of these reactions liberate pyrophosphate and have a strict requirement for Mg 21 . Indirect evidence indicates that Conclusions: Through comparisons with a number of struc- tural homologs, we have assigned plausible functions to sev- the latter is also true for MobA [12]. In E. coli the mobA gene is cotranscribed with a second gene, mobB, which is not re- eral of the residues that line the substrate binding pocket. The enzymatic mechanism probably proceeds via a nucleophilic quired for MGD synthesis [13, 14]. Mutants in mobA are pleio- tropically defective in molybdoenzyme activities and synthe- attack by MPT on the GMP donor, most likely GTP, to produce MGD and pyrophosphate. By analogy with related enzymes, size elevated levels of molybdopterin [15]. The MobA protein has been isolated previously, and it has proved to be mono- this process is likely to require magnesium ions. meric in solution. MobA shows tight binding to Cibacron blue affinity resin and has a putative P loop sequence. This suggests Introduction that it is capable of binding nucleotides and phosphate, al- though experimental evidence indicates that the isolated pro- Mononuclear molybdenum enzymes form an important class of redox enzymes since they catalyze a number of critical tein is unable to bind GTP [9, 16, 17]. Nevertheless, the binding of GTP in the presence of an accessory protein and/or MPT metabolic reactions in bacteria, plants, and animals. Recent structural analyses of molybdenum (and tungsten) enzymes cannot be ruled out. MGD-containing molybdoenzymes gener- ally incorporate a bis-MGD cofactor where two MGD units from a variety of sources have revealed that the molybdenum atom is coordinated through the dithiolene group of a tricyclic coordinate a single molybdenum in an antiparallel fashion [18]. Key words: crystallography; molybdopterin; multiwavelength anomalous § To whom correspondence should be addressed (email david.lawson@ bbsrc.ac.uk). dispersion (MAD); pyrophosphorylase; three-dimensional structure

Transcript of Crystal Structure of the Molybdenum Cofactor Biosynthesis Protein MobA from Escherichia coli at...

Structure, Vol. 8, 1115–1125, November, 2000, 2000 Elsevier Science Ltd. All rights reserved. PII S0969-2126(00)00518-9

Crystal Structure of the Molybdenum CofactorBiosynthesis Protein MobA from Escherichia coliat Near-Atomic Resolution

pterin termed molybdopterin or MPT [1, 2] (see Figure 1a).Although this is the active form of the cofactor found in all eu-karyotic molybdoenzymes, many bacterial molybdoenzymes re-quire a modified form of the cofactor for activity. The modifica-tion reaction involves attachment of a mononucleotide to the

Clare E. M. Stevenson,* Frank Sargent,†‡

Grant Buchanan,†‡ Tracy Palmer,†‡

and David M. Lawson*§

*Department of Biological Chemistry and†Department of Molecular MicrobiologyJohn Innes Centre terminal phosphate of molybdopterin to form a molybdopterin

dinucleotide. The type of nucleotide modification is usuallyNorwich NR4 7UHUnited Kingdom specific to different molybdoenzyme families such that molyb-

doenzymes of the dimethylsulphoxide (DMSO) reductase‡Centre for Metalloprotein Spectroscopy and BiologySchool of Biological Sciences group bind molybdopterin guanine dinucleotide (MGD) (see

Figure 1b), whereas many bacterial members of molybdenumUniversity of East AngliaNorwich NR4 7TJ hydroxylase family require the cytidine monophosphate (CMP)-

modified form of molybdopterin (MCD) for function [1, 3]. KeyUnited KingdomMGD-requiring enzymes, such as nitrate reductase, play im-portant roles in anaerobic respiratory pathways, particularly inthe enteric bacteria. Indeed, strains of Salmonella typhi defec-Summarytive in the synthesis of the molybdenum cofactor are impairedin their ability to infect host cells [4].Background: All mononuclear molybdoenzymes bind molyb-

denum in a complex with an organic cofactor termed molyb- The two-step pathway for MPT synthesis is highly conservedbetween the kingdoms and is best characterized in the Gram-dopterin (MPT). In many bacteria, including Escherichia coli,

molybdopterin can be further modified by attachment of a GMP negative eubacterium Escherichia coli. In the first stage, aguanosine derivative, probably GTP, is rearranged to form pre-group to the terminal phosphate of molybdopterin to form

molybdopterin guanine dinucleotide (MGD). This modification cursor Z [5]. In the second step, the molybdopterin synthase,encoded by the E. coli moaD and moaE genes, incorporatesreaction is required for the functioning of many bacterial molyb-

doenzymes, including the nitrate reductases, dimethylsulf- sulfur stepwise into precursor Z to generate MPT. The MoeBprotein acts as direct sulfur donor for this reaction [6, 7]. Theoxide (DMSO) and trimethylamine-N-oxide (TMAO) reductases,

and formate dehydrogenases. The GMP attachment step is incorporation of molybdenum into MPT to form the molybde-num cofactor (Moco) is catalyzed by the homologous proteinscatalyzed by the cellular enzyme MobA.MogA in E. coli, Cnx1 in plants, and gephyrin in mammals.These proteins bind molybdopterin with high affinity, and inResults: The crystal structure of the 21.6 kDa E. coli MobA

has been determined by MAD phasing with selenomethionine- the case of Cnx1, the G domain has been shown to catalyzethe formation of Moco from an as-yet-unknown form of Mo [8].substituted protein and subsequently refined at 1.35 A resolu-

tion against native data. The structure consists of a central, All of the characterized molybdoenzymes in E. coli requirethe GMP-modified form of Moco for activity. The GMP additionpredominantly parallel b sheet sandwiched between two layers

of a helices and resembles the dinucleotide binding Rossmann reaction is catalyzed by the MobA protein (also termed proteinFA [9]). It is a reaction unique to prokaryotes, and thereforefold. One face of the molecule bears a wide depression that

is lined by a number of strictly conserved residues, and this may represent a potential drug target. Although the identity ofthe GMP donor is not known, by analogy with the uridyl- andfeature suggests that this is where substrate binding and catal-

ysis take place. cytidyl-transferases [10, 11], GTP would seem to be the mostlikely. All of these reactions liberate pyrophosphate and havea strict requirement for Mg21. Indirect evidence indicates thatConclusions: Through comparisons with a number of struc-

tural homologs, we have assigned plausible functions to sev- the latter is also true for MobA [12]. In E. coli the mobA geneis cotranscribed with a second gene, mobB, which is not re-eral of the residues that line the substrate binding pocket. The

enzymatic mechanism probably proceeds via a nucleophilic quired for MGD synthesis [13, 14]. Mutants in mobA are pleio-tropically defective in molybdoenzyme activities and synthe-attack by MPT on the GMP donor, most likely GTP, to produce

MGD and pyrophosphate. By analogy with related enzymes, size elevated levels of molybdopterin [15]. The MobA proteinhas been isolated previously, and it has proved to be mono-this process is likely to require magnesium ions.meric in solution. MobA shows tight binding to Cibacron blueaffinity resin and has a putative P loop sequence. This suggestsIntroductionthat it is capable of binding nucleotides and phosphate, al-though experimental evidence indicates that the isolated pro-Mononuclear molybdenum enzymes form an important class

of redox enzymes since they catalyze a number of critical tein is unable to bind GTP [9, 16, 17]. Nevertheless, the bindingof GTP in the presence of an accessory protein and/or MPTmetabolic reactions in bacteria, plants, and animals. Recent

structural analyses of molybdenum (and tungsten) enzymes cannot be ruled out. MGD-containing molybdoenzymes gener-ally incorporate a bis-MGD cofactor where two MGD unitsfrom a variety of sources have revealed that the molybdenum

atom is coordinated through the dithiolene group of a tricyclic coordinate a single molybdenum in an antiparallel fashion [18].

Key words: crystallography; molybdopterin; multiwavelength anomalous§ To whom correspondence should be addressed (email [email protected]). dispersion (MAD); pyrophosphorylase; three-dimensional structure

Structure1116

Figure 1. Structures of Molybdopterin-Moand Molybdopterin Guanine Dinucleotide

(a) Molybdopterin-Mo (MPT-Mo).(b) Molybdopterin guanine dinucleotide (MGD).The exact coordination of the molybdenumis unknown in the isolated cofactors and issubject to variation in the different molyb-doenzymes [18].

The mechanisms whereby these cofactors become inserted which is illustrated schematically in Figure 3. The moleculeresembles a shallow bowl with an outer-rim diameter of ap-and “dimerize” are unknown.

In this paper we report the crystal structure of MobA, which proximately 45 A and a height of around 35 A. The concavesurface of the bowl descends some 10–15 A from the lip towe determined by the multiwavelength anomalous dispersion

(MAD) method. Additionally, we describe the features of the the base of a canyon approximately 22 A in length and 15 Aacross at its widest point. One edge of this canyon is delineatedmolecular model that has been obtained after refinement at

1.35 A resolution. As this is the first structure of a molybdopterin by a surface loop, part of which is poorly ordered in the struc-ture, that lies between strand b1 and a 310 helix. The canyonnucleotidyl transferase, it provides important insights into

structure–function relationships in this class of enzyme. would appear to be the most suitable location for a substratebinding pocket. MGD, the product of the MobA catalysis, spansapproximately 20 A when it is in an extended conformation asResults and Discussionobserved in the molybdoenzymes, such as TMAO reductase[20]. MGD thus matches the dimensions of this cleft.Overall Structure

The functional unit of MobA consists of a single 21.6 kDapolypeptide chain of 194 amino acids. It is folded into a sin- Consensus Loop

The surface loop that occurs between strands b1 and the 310gle domain comprising a central, predominantly parallel, 7stranded b sheet that is flanked on both sides by a helices helix is highly conserved in MobA sequences (see next section)

and thus is hereafter referred to as the consensus loop. Remi-(see Figure 2). The connectivity of the b sheet is 21x, 21x,13x, 12x, 21, 12 using the nomenclature of Richardson [19], niscent of the P loops (or Walker A motifs) of nucleotide binding

Figure 2. Stereoview Ca Trace of MobA withEvery Tenth Residue Numbered

The poorly ordered loop (residues 15–21) ishighlighted in bold. The arrow indicates thelocation of the putative substrate bindingpocket and shows the approximate directionof view for Figures 5, 6, and 7. The figure wasproduced with O and OPLOT [45].

Crystal Structure of E. coli MobA1117

Figure 3. Topological Arrangement of Sec-ondary Structural Elements in MobA

b strands are represented by single squaresif they point toward the viewer and doublesquares if they point in the opposite direction.The a helices are depicted as circles and thesingle 310 helix is shown as a hexagon. Theb strands of the b hairpin, h1 and h2, arerepresented as arrows lying in the plane ofthe paper. Those elements that are shadedcorrespond closely in space and connectivityto those found in the classical Rossmann di-nucleotide binding fold.

proteins that are implicated in phosphate binding [21, 22], it is interactions with reactants through their backbone atoms whilehaving highly variable side chains.rich in positively charged residues (arginine/lysine) and glycine

residues. In these proteins the phosphate groups are generallybound by hydrogen bonds with main chain NH groups of the Electrostatic Surface Potential of the Binding Pocket

The distribution of charge on the concave surface of MobA isloop (some NH groups being donated by glycine) and a con-served lysine residue. Shorter loops are associated with dinu- quite striking (see Figure 5c). The western edge of the canyon

displays a ridge of electropositivity running from north to south,cleotide binding where the ligand is a cofactor (e.g., NAD),such as in the dehydrogenases, while longer loops are associ- while the eastern edge presents an extensive electronegative

patch. These may play a role in protein–protein recognition,ated with mononucleotide binding (e.g., ATP) and the ligandis a substrate of the reaction, such as in the kinases [23]. The perhaps during the import of substrates or export of product

by accessory proteins. The extreme oxygen lability of isolatedlength of the loop in MobA suggests a closer relationship tothe mononucleotide binding P loops. The disorder seen in MPT and MGD [24] may necessitate this in order to minimize

the exposure of cofactor to solvent. A direct interaction withthe MobA consensus loop may be a crystallographic artifactcaused by the binding of a metal-bis-citrate complex in a crys- MogA, the supplier of the substrate MPT-Mo, is a strong possi-

bility. Its molecular surface is largely electronegative or neutraltal contact region directly adjacent to the loop, although itseems likely that it is naturally very flexible in the absence of in character (calculated using Protein Data Bank [PDB] entry

1D17; not shown) and thus could interact favorably with thebound ligand. The similarity of the consensus loop sequenceto ADP-glucose pyrophosphorylase, CDP-ribitol pyrophos- electropositive ridge of MobA during substrate delivery. At the

deeper, northern end of the canyon, the electronegative basephorylase, and geranylgeranyl pyrophosphate synthetase haspreviously been noted [16]. The first two of these enzymes suggests a potential to bind Mg21. Region C in particular would

appear to be eminently suitable as it lies adjacent to the clustercatalyze the formation of a pyrophosphate bond between twophosphorylated substrates, while the third liberates pyrophos- of strictly conserved residues Asp-101, Asn-180, and Asn-182.

As suggested above, by analogy with the P loops of otherphate during turnover. These observations suggest a mecha-nistic link between these enzymes and hence a role for the nucleotide binding proteins [23], the residues (including the

conserved residues Arg-19 and Lys-25) decorating the consen-consensus loop in catalysis.sus loop provide positive charges that could be involved inbinding the phosphate groups of substrates and products . ASequence Conservation in MobA Proteins

A comparison of MobA homologs from various sources, at hydrogen bond between the side chains of Lys-25 and Asp-101 may also be significant. The more neutral region D is linedleast three of which have been shown to synthesize MGD [9,

14], is shown in Figure 4. The majority of surface sequence by a number of hydrophobic groups that may favor pterinbinding through van der Waals stacking interactions.conservations occur on the concave face of the molecule,

where they either line the canyon or reside in the consensusloop (see Figure 5a). These have been mapped onto a molecular Structural Homologs of MobA

The overall topology of MobA is reminiscent of that describedsurface in Figure 5b, where warm colors indicate a high degreeof sequence identity and cold colors indicate little or no conser- as the Rossmann fold, which is associated with dinucleotide

binding [25]. The similarity extends from the N terminus ofvation. Most of these identities are clustered around the north-ern end of the canyon (regions A–C), where it is at its deepest, MobA up to and including strand b4 (see Figure 3). Thereafter,

the connectivity varies somewhat and, furthermore, the nextand this observation strongly suggests that this is where cataly-sis takes place. Nevertheless, it should be borne in mind that adjacent strand in the b sheet of MobA (b6) is antiparallel to the

rest, whereas the central b sheet of the classical dinucleotidethis kind of analysis does not flag residues that make critical

Structure1118

Figure 4. Comparison of MobA Sequences

Alignments of MobA sequences from E. coli,Haemophilus influenzae, Rhodobacter sphae-roides, Pseudomonas putida, Bacillus subtilis,and Methanobacterium thermoautotrophicumdisplayed using the program ALSCRIPT [50].The initial alignments were produced usingGCG [51] and then adjusted manually to mini-mize deletions and insertions within second-ary structural elements. Secondary-structureassignments for E. coli MobA were made us-ing DSSP [52]. Cylinders represent a helicesand arrows indicate b strands. The short 310

helix is indicated by 3t, and h1 and h2 indicatethe two arms of a b hairpin. The region of theconsensus loop, which is poorly defined inthe electron density, is indicated by a graybar. Residues that line the putative substratebinding pocket are italicized in bold. Aminoacids displaying at least 50% conservationare boxed, while those that are completelyconserved are boxed in gray. Residues thatare also structurally conserved in GlmU areindicated by the black triangles.

binding fold is strictly parallel. While the substrates of MobA In order to find structural homologs of MobA, the PDB wasinterrogated using the DALI server [27]. Matches were foundare both mononucleotides, the product, MGD, is clearly a dinu-

cleotide, and herein lies the connection. The link is further for several nucleotide binding proteins, including alcohol dehy-drogenase (PDB entry 1OHX), diaminopimelic acid dehydroge-strengthened by the observation that the dinucleotide binding

site in these proteins lies at the C-terminal end of the b sheet nase (PDB entry 1DAP [28]), CTP:glycerol-3-phosphate cytidy-lyltransferase (PDB entry 1COZ [11]), GMP synthetase (PDBin an equivalent position to that predicted to be the substrate

binding site of MobA. entry 1GPM [29]), the bacterial cell division protein FtsZ (PDBentry 1FSZ [30]), and UDP-galactose 4-epimerase (PDB entryMogA, the enzyme that precedes MobA in the MGD biosyn-

thetic pathway, is also an a/b protein with a central, predomi- 1XEL [31]). Despite low Z-scores (ranging from 3.7 to 2.1), thesestructures all contain bound nucleotides (or dinucleotides) that,nantly parallel b sheet [26]. In this case, however, there are six

strands with connectivity 21x, 12x, 13x, 21, 21 (PDB entry after superposition onto the MobA coordinates, all cluster inregions A–C of the canyon, although many of these ligands1D17). A manual superposition using interactive graphics gives

a reasonable match between strands b1, b2, and b4 of MobA made serious steric clashes with MobA polypeptide, in particu-lar with the consensus loop.and strands b1, b2, and b3 of MogA, although the agreement

elsewhere is poor. Significantly, this structural alignment Subsequently, a literature search revealed a more interestingset of homologs. The first of these was that of CMP:2-keto-3-places the consensus loop of MobA in an equivalent position

to a disordered loop in MogA that lies between b1 and a1 deoxy-manno-octonic acid synthetase (CKS) [32], although thecoordinates were not available for a superposition. This has a(residues 15–19), although a functional role for this loop has

not been proposed. As with MobA, a cavity that has been central 7 strand b sheet with identical connectivity to MobA.The remaining secondary-structural elements are conservedimplicated in pterin binding exists at the C terminus of the b

sheet [26]. in the N-terminal portion of the molecule roughly as far as

Crystal Structure of E. coli MobA1119

residue 130 (or the end of strand b5) in both structures, al-though CKS has additional elements inserted between strandsb5 and b6 and strands b6 and b7. CKS catalyzes the magne-sium-dependent activation of 2-keto-3-deoxy-manno-octonicacid (KDO) through the attachment of CMP, for which CTPserves as a substrate; the reaction liberates pyrophosphate asa product. As in MobA, there is an obvious cavity visible at theC-terminal end of the b sheet, and this cavity is partly definedby the loop between strands b1 and b2. This loop in CKScontains a pair of glycine residues in addition to an arginine(Arg-10) and a lysine (Lys-19). The latter two residues havebeen shown to be catalytically important by chemical modifica-tion and site-directed mutagenesis and are strictly conservedbetween the eight known sequences of CKS (data not shown).Unfortunately, this structure contains no bound ligand.

The recently determined crystal structure of GlmU, anN-acetylglucosamine-1-phosphate uridyltransferase [33], isperhaps the most significant known structural homolog ofMobA for the reasons outlined below. It is a bifunctional en-zyme having both acetyl transferase and pyrophosphorylaseactivities and two discrete domains that appear to be indepen-dently associated with each of these activities. Relevent to thisdiscussion is the latter activity, whereby UMP is transferredfrom UTP to N-acetylglucosamine-6-phosphate (GlcNAc-6-P)with the concomitant release of pyrophosphate. The pyrophos-phorylase domain lies at the N terminus of the polypeptidechain and is connected to the C-terminal domain by a longa-helical linker (20 residues). The N-terminal domain has a central,7 strand b sheet with the same topology as that of MobA. Thetwo structures are closely similar; a least-squares superposi-tion of the ligand-bound form of GlmU onto MobA, basedon main chain atoms from 100 equivalent residues, gave an over-all root-mean-square deviation (rmsd) of 1.6 A (see Figure 6).The final helix of MobA, a7, overlaps with the N terminus ofthe long a-helical linker of GlmU. There is some variation in thelength of loop regions, including those that define the shapeand size of the active site binding pocket. The equivalent ofthe MobA consensus loop in GlmU is 1 residue longer andadopts a different conformation such that region A is com-pletely occluded, while the b3–a3 loop adjacent to region B is2 residues shorter with respect to MobA. The difference inlength somewhat foreshortens this arm of the binding pocket.The largest difference occurs in the b5–b6 loop, which containsan extra 34 residues in GlmU. This insertion makes region Dmuch smaller in GlmU. The superposition also places the prod-uct of GlmU, UDP-GlcNAc, in the proposed substrate bindingpocket of MobA without any serious steric clashes. The pyrimi-

(b) Molecular surface displaying degrees of sequence identity for residuesin and around the pocket according to the alignment shown in Figure 4.Warm colors indicate a high level of sequence identity, while cold colorsindicate little or no conservation. Strictly conserved residues are shown inred.(b) Electrostatic surface potentials of MobA. Potentials of less than 210kT, neutral, and greater than 10 kT are displayed in red, white, and blue,respectively.Surfaces for (a) and (b) were calculated with SwissPDBviewer [55] (http://www.expasy.ch/spdbv) and rendered using POV-Ray (http://www.povray.org). In order to get a more accurate impression of the surface conservationsand charge, those side chains that could not be confidently placed in elec-tron density were inserted and modeled into energetically favorable confor-

Figure 5. The Putative Substrate Binding Pocket of MobA mations. This was necessary for only one strictly conserved residue, namelyArg-19. All parts of this figure display the same view of the molecule. The(a) Ribbon diagram showing the locations of strictly conserved surface resi-white line delineates the proposed substrate binding pocket, and the labelsdues. The diagram was produced with MOLSCRIPT [53] and Raster3D [54]A–D are for use in the discussion.(blue 5 nitrogen, red 5 oxygen, yellow 5 carbon).

Structure1120

Figure 6. Stereoview Superposition of GlmUupon MobA

Only the N-terminal domain of GlmU (residues3–236) is shown (red), while the whole ofMobA is depicted (blue). The least-square su-perposition is based on main chain atomsfrom residues 5–12, 25–46, 50–63, 71–75, 81–91, 98–120, 125–130, 171–176, and 213–217from GlmU and residues 6–13, 25–46, 49–62,66–70, 78–88, 94–116, 121–126, 134–139, and167–171 from MobA (100 residues from each).Also shown in green is the UDP-GlcNAc li-gand of GlmU. The direction of view is similarto that of Figure 5.

dine base lies in the equivalent of region B in MobA, and the in phosphate binding and is in close proximity to a conservedaspartic acid (Asp-99). These overlap the conserved lysine andpyrophosphate bridge occupies region C. The ribose moiety

sits between these two regions, while the GlcNAc group covers aspartic acid residues in MobA. Significantly, the aspartic acidin SpsA coordinates a divalent metal ion, in this case Mn21,region D. Significantly, several of the residues that are thought

to be important in MobA from the analyses above (see Figure which bridges the a- and b-phosphates of the UDP ligand andis thought to promote leaving-group departure during cataly-5a) are structurally conserved in GlmU. These include Leu-11

(Leu-12), Gly-14 (Gly-15), Arg-18 (Arg-19), Lys-25 (Lys-25), Gly- sis. A second metal ion, assigned as Mg21, occupies the mostlikely position for the donor sugar binding and is therefore81 (Gly-78), Asp-105 (Asp-101), and Asn-227 (Asn-182), where

the MobA equivalents are given in parentheses. These are probably not catalytic [34].also indicated in the alignment shown in Figure 4. Within the“consensus loop,” an arginine-to-alanine mutation at position Substrate Docking and Possible Mechanism

The extreme reactivity of MPT and the problems associated18 dramatically impairs uridyltransferase activity, while thesubstitution of alanine residues for Gly-14 and Lys-25 resulted with its isolation have hitherto made the development of a

definitive enzymatic assay somewhat problematic [24]. Al-in an 8-fold decrease in activity [33]. While Arg-18 does notinteract directly with the ligand in this structure, a potential though the nature of the nucleotide substrate (or GMP donor)

has not been experimentally established, by analogy with otherinteraction with the Pg of UTP was proposed, although thiswould involve a significant movement of this residue. The intro- enzymes, such as the uridyl- and cytidyl-transferases [10, 11],

that perform similar chemistry, GTP would appear to be theduction of a Cb at residue 14 presumably causes steric clasheswith the ribose group. Lys-25 is hydrogen bonded to both Asp- most likely candidate, with the liberation of pyrophosphate as

a by-product of the reaction.101 and Asn-227, and although it is not directly involved incatalysis, it may orient these groups for UTP recognition. The Based on the superposition of MobA upon the ligand-bound

form of GlmU (Figure 6), we have produced a docking modelformer interaction is structurally conserved in MobA, and asimilar interaction between conserved lysine and aspartic acid for GTP bound to MobA. This model is depicted in Figure 7.

The guanyl moiety has been placed in region B, where it makesresidues in GMP synthetase has been implicated in Pg recogni-tion and Mg21 binding [29]. Although no Mg21 is present in van der Waals stacking interactions with conserved residues

on either side. Namely, these residues are Gly-14, Gly-15, Gly-the GlmU structure, it has been shown to be critical for theuridyltransferase activity of GlmU and other XDP-sugar pyro- 78, Pro-79, and Gly-82. Presumably, the substitution of the

glycine residues for any other amino acid would either causephosphorylases [10]. A potential Mg21 binding site was pro-posed involving Gln-193 and Glu-195, which line the substrate steric hindrance at the Cb or reduce the flexibility of the peptide

backbone. The latter is perhaps more important at the N termi-pocket to the right-hand side of the UDP-GlcNAc in Figure 6.Structural equivalents of these residues are not present in nus of the consensus loop. As mentioned above, the b3–a3

loop, which lies N-terminal to Gly-78 in MobA, is slightly longerMobA.A more distant structural homolog is the nucleotide-diphos- than its equivalent in GlmU and therefore may be involved in the

discrimination between pyrimidines and purines. Nevertheless,pho-sugar transferase, SpsA [34]. If we ignore strand b6, whichlies at one end of the central b sheet, then SpsA has the same the lack of conserved residues lying in the plane of the guanyl

moiety makes it difficult to speculate about specific interac-sheet connectivity as MobA. A manual superposition of theSpsA ligand-bound coordinates (PDB entry 1QGQ) gave a rea- tions between this group and the protein. The ribose group

straddles regions B and C and stacks against the conservedsonable agreement with MobA over much of the structure (aleast-squares superposition based on the main chain atoms Leu-12. The triphosphate group begins in region C with the Pa

and extends toward region D. Evidence suggests that MobAof 68 equivalent residues gave an rmsd of 1.9 A). The equivalentof the MobA consensus loop in SpsA is relatively short and requires Mg21 for activity [12], although none was apparent in

the crystal structure, possibly because of the presence of ci-devoid of glycine residues. There is, however, a lysine (Lys-13) residue at the C-terminal end of the loop that is involved trate in the crystallization and the relatively low pH (5.5). Thus,

Crystal Structure of E. coli MobA1121

Figure 7. Stereoview of MobA with GTP andMg21 in the Putative Substrate BindingPocket

This docking model was produced using theligand-bound structure of GlmU as a guide.The Mg21 replaces a water molecule presentin the MobA crystal structure. No proteinatoms have been rearranged to improve thefit. The Ca trace is shown in gray, and all thestrictly conserved surface residues of MobAare shown in full. The side chain of Arg-19was disordered in the X-ray structure, but ithas been included here for completeness.The direction of view is the same as that inFigure 6 (blue, nitrogen; red, oxygen; black,protein carbon; green, GTP carbon; yellow,phosphorus; and magenta, magnesium).

this model also incorporates a Mg21 ion (which simply replaces could be involved in stabilizing the oxyanionic transition state.By analogy with Glm-U, region A in the MobA substrate bindinga water molecule from the crystal structure) in region C adja-

cent to the conserved cluster of Asp-101, Asn-180, and Asn- pocket may disappear if the consensus loop folds over theactive site. In order to promote catalysis, the nucleophilicity182 side chains, where it bridges the a- and b-phosphates in

an equivalent position to that seen for the Mn21 in SpsA [34]. of the MPT could be enhanced through proton abstractionfrom the phosphate group by a general base, but the identityThis location for the Mg21 seemed more appropriate for MobA

than did that proposed for GlmU (see previous section) be- of such a group is not clear from our model. It is possible thata residue from an accessory protein that delivers MPT to thecause the sequence conservation in the structurally equivalent

region in MobA is poor. Large insertions in the b5–b6 loop of active site performs this function.In conclusion, the high-resolution MobA crystal structureCKS and GlmU relative to that of MobA impinge on the binding

pocket, such that region D is smaller in both of these enzymes and substrate docking model provide a framework for rationalsite-directed mutagenesis in order to further probe the detailsthan that of MobA. This may be because their substrates, KDO

and GlcNAc-6-P, respectively, are considerably smaller than of the enzymatic mechanism and ultimately determine the mo-lecular basis for substrate specificity. These experiments areMPT. This observation implies a role for region D in MPT bind-

ing. However, this could not be modeled with any confidence. in progress in parallel with ligand binding studies.Nevertheless, we cannot exclude the possibility that MPT bindsin regions A–C and the nucleotide occupies region D. Biological Implications

Given the close structural similarity to GlmU and the conser-vation of several of the residues that line the substrate binding Pterin-based molybdenum cofactors (molybdopterin) are pres-

ent in the active sites of more than 30 redox enzymes. Althoughpocket, it seems likely that the synthesis of MGD proceedsby a mechanism comparable to that of GlmU and XDP-sugar molybdopterin (MPT) is the active form of the cofactor found

in eukaryotic molybdoenzymes, most prokaryotic enzymes re-pyrophosphorylases in general [10]. An NMR study of thio-substituted nucleotides demonstrated that the mechanism of quire a modification of this basic structure in order to function.

This involves the attachment of a nucleotide moiety onto theUDP-glucose pyrophosphorylase involves stereochemical in-version at the Pa position [35]. This would imply an associative terminal phosphate group of the molybdopterin, and it is crucial

for the activity of key bacterial anaerobic respiratory enzymes.mechanism whereby a nucleophilic attack by the acceptorspecies (MPT) on the nucleotide donor species (GTP) at the It is a reaction unique to prokaryotes, and it therefore may

represent a potential drug target.Pa would give rise to a pentacoordinate intermediate. The latterwould then decay to produce MGD and pyrophosphate. The We have determined the high-resolution X-ray structure of

E. coli MobA, which catalyzes the attachment of GMP to thebridging coordination of Mg21 seen in our model would bothhelp to stabilize the oxyanionic transition state (on Pa) and cofactor. The structure contains a central b sheet that is sand-

wiched between a helices and is reminiscent of the Rossmannsubsequently make the pyrophosphate a better leaving group.The conserved Lys-25 may help to correctly orient Asp-101 dinucleotide binding fold. The surface of the protein bears a

distinctive cleft that is lined by highly conserved residues andfor Mg21 binding and may possibly assist in neutralizing chargeon the phosphates. It has been suggested that the N terminus is of appropriate dimensions to accommodate the reaction

product, molybdopterin guanine dinucleotide. These featuresof the a-helical linker in GlmU (equivalent of a7 in MobA) isimportant in charge stabilization [33], although it seems likely strongly suggest that this is where catalysis takes place. A

surface loop lying adjacent to this cleft has a sequence resem-that a tilting of the helix axis may be necessary in both enzymesin order to perform this function. The role of the consensus bling the P loops of other nucleotide binding proteins that are

involved in phosphate binding. The structure is closely relatedloop in MobA is less clear, and main chain interactions with thephosphates of GTP in this binding mode would seem unlikely. to that of GlmU, an N-acetylglucosamine-1-phosphate uridyl-

transferase, which is likely to perform similar chemistry. TheNevertheless, the closing of the consensus loop over the sub-strate seems a distinct possibility, and the conserved Arg-19 latter has been crystallized with bound product, which also

Structure1122

Table 1. X-Ray Data Collection Statistics

Beamline PX9.5 (SRS) PX9.5 (SRS) PX9.5 (SRS) ID14-2 (ESRF)

Data set SeMet-peak SeMet-inflection SeMet-remote NativeWavelength (A) 0.9786 0.9802 0.9000 0.933Resolution (A) 2.0 2.0 2.0 1.35Unique reflections 22192 22210 22276 38263Completeness (%)a 98.3 (96.4) 98.3 (96.5) 98.6 (97.1) 97.9 (83.4)Redundancyb 3.9 3.9 3.9 3.8Rmerge

a 0.052 (0.172) 0.052 (0.185) 0.045 (0.133) 0.047 (0.142),I./,sI.a 18.3 (5.9) 17.2 (5.0) 19.4 (6.4) 6.9 (5.2)Riso

c 0.260 0.259 0.265 —

a Figures in parentheses refer to data in the highest resolution bin.b Anomalous pairs treated as separate reflections for MAD data.c Native data set is used as the reference.

used low-salt buffers containing 50 mM NaCl instead of the 0.5 M KCl. Theoccupies the proposed substrate binding pocket of MobAresultant samples were highly aggregated. In ensuing preparations the usewhen the two structures are superposed. A substrate-dockingof 0.5 M KCl alleviated this problem, and the result was monodispersemodel based on this superposition has enabled us to speculatesolutions with approximate molecular sizes of 22–25 kDa. The SeMet protein,

about the enzymatic mechanism and the likely function of the however, did show some minor aggregation. Subsequently, the samplesresidues that line the cavity. The molybdopterin cofactors are were concentrated to approximately 12 mg/ml by using Centricon-10 con-

centrators (Millipore). To remove any remaining traces of aggregated mate-extremely sensitive to oxygen and are unlikely to exist in freerial prior to crystallization, the sample was again filtered through a 0.1 mlsolution. An analysis of the surface charge distribution of MobAUltrafree filter. Crystals were grown by vapor diffusion in hanging drops byreveals extensive patches of both positive and negative poten-using VDX plates (Hampton Research) at 277 K. The precipitant consistedtial adjacent to the cleft that may be involved in protein–proteinof 20% (v/v) isopropanol, 2% (w/v) PEG 1500 in 100 mM citric acid [pH 5.5].

recognition during substrate delivery and/or product release. These conditions were supplemented with 1 mM DTT and 1 mM EDTA forthe SeMet protein. Drops consisted of 1 ml of protein solution mixed with

Experimental Procedures 1 ml of precipitant. The crystals began to form after 2 weeks, but they tookup to 8 weeks to reach full size (500 3 300 3 200 mm).

Expression, Purification, and CrystallizationThe E. coli mobA gene was amplified from plasmid pCI1 [13] with primersthat incorporated NcoI and BglII restriction sites and cloned into pQE60 Data Collection

All crystal manipulations were performed using Hampton Research tools.(Qiagen). This resulted in a clone, pTPEC1, that synthesized MobA as afusion protein with a C-terminal oligo-histidine tag (Ser-His6). The region Initially, crystals were cryoprotected by soaking for up to 5 min in mother

liquor containing ethylene glycol in place of 25% of the buffer volume, aftercovering most of mobA, including the C-terminal tag, was excised frompTPEC1 by digestion with DraIII and HindIII and cloned into pREM3 [17], which they were mounted in cryoloops and flash cooled by plunging into

liquid nitrogen. The Crystal Cap system was then used to transfer the crystalswhich had previously been digested with the same enzymes. The resultantclone, which was designated pTPEC3, overproduces MobA to approxi- to a dewar for storage and subsequent transportation to the synchrotron.

For data collection, crystals were mounted on the goniometer with cryotongsmately 20% of total cell protein. For production of MobA, 1 liter culturesof JM109 [F9 traD36 lacIqD(lacZ)M15 proA1B1/e14(McrA2)D(lac-proAB) thi and maintained at 100 K with an Oxford Cryosystems Cryostream cooler.

The crystals belong to space group P21212, with cell parameters of a 5gyrA96 (nalr)endA1 hsdR17 (rk2mk

1) relA1 supE44 recA1] carrying pTPEC3were grown aerobically at 378C to an absorbance of 0.6 at 600 nm in LB 76.6 A, b 5 41.8 A, and c 5 54.5 A (when measured at 100 K). Assuming a

single 22.6 kDa molecule (including affinity tag) per asymmetric unit, anmedium containing 125 mg/ml ampicillin. Cultures were then shifted to 308Cand induced for MobA production by addition of 0.4 mM (final concentration) estimation of solvent content gave a very low value of 36% [37]. X-ray data

for the native protein were collected on beamline ID14–2 (l 5 0.933 A) atisopropyl b-D-thiogalactopyranoside for 16 hr prior to harvesting. For sele-nomethionine (SeMet) substitution of MobA, the methionine auxotrophic E. the European Synchrotron Radiation Facility (ESRF) in Grenoble, France,

with an Area Detector Systems Corporation Quantum 4 charged coupledcoli strain P4X (Hfr, metB1), which had been previously transformed withpREP4 (KanR, lacI1; Roche), was transformed with plasmid pTPEC3. Cells device (CCD) detector. The crystal used for the native data collection dif-

fracted strongly to a maximum resolution of 1.35 A. This data set waswere grown at 378C in 1 liter of M9 medium [36] containing 0.4% glucosethat was supplemented with 40 mg of 19 amino acids (minus methionine) collected in two 908 passes: a high-resolution pass (1.35 A) of longer expo-

sures (8 s per 18 image) to record the relatively weak reflections at the edgeplus 60 mg of SeMet (Sigma). At an absorbance of 0.6 at 600 nm, MobAexpression was induced by addition of 1 mM isopropyl b-D-thiogalactopyra- of the detector; and a medium-resolution pass (2.5 A) of shorter exposures

(2 s per 28 image) to adequately record the strong reflections in the middlenoside, and the cells were grown for a further 24 hr at 378C. Cells wereharvested, washed, and resuspended in a buffer containing 50 mM Tris-HCl of the detector. These reflections had been saturated in the previous pass.

The SeMet X-ray data were collected by using a 165 mm MAR-Research(pH 7.6), 0.5 M KCl, 1 mM b-mercaptoethanol, and a mini EDTA-free com-plete protease inhibitor tablet (Roche). The cells were then disrupted by CCD detector on beamline PX9.5 at the Synchrotron Radiation Source (SRS)

at Daresbury, UK. All SeMet crystals taken directly from the shipping dewarFrench pressing, as described [17]. The crude supernatant was applied toa 5 ml nickel(II)-charged Hi-trap metal chelation column (Amersham Phar- showed poorly ordered diffraction and evidence of ice formation. However,

this was alleviated by “in vitro” annealing [38], whereby the crystals weremacia Biotech). Unbound proteins were removed by washing with buffer A(50 mM Tris-HCl [pH 7.6], 0.5 M KCl, and 1 mM b-mercaptoethanol). The returned to cryoprotectant (supplemented with 10 mM dithiothreitol) for 10

min before recooling in the Cryostream. In order to make contemporaneoushistidine-tagged MobA was eluted with a 0–1 M imidazole gradient in bufferA. The fractions containing MobA (and which eluted at approximately 400 measurements of anomalous pairs, care was taken to align the crystal along

the b cell axis (refined mis-setting angles were 1.98 about the vertical axismM imidazole) were collected in tubes supplemented with Chelex resin(Biorad). The protein sample was then dialyzed overnight against 20 mM and 20.18 about the X-ray beam). Prior to data collection, a fluorescence

scan revealed a broad Se K-edge (z18 eV) with an f9 minimum of 27.5 e2Tris (pH 7.6), 0.5 M KCl, and 1 mM b-mercaptoethanol. The polydispersityof protein samples was assessed at this stage by Dynamic light scattering and an f″ maximum of 4.1 e2, and this made it difficult to determine the

best wavelengths at which to collect data. In the event, a single wavelengthusing a Dynapro-MSTC instrument (Protein Solutions). The sample was fil-tered through a 0.1 mm Ultrafree filter (Millipore) before measurements were was collected for the peak (0.9786 A), a high-energy remote (0.9000 A), and

a low-energy remote (1.2000 A), and three wavelengths were collected fortaken at 48C and 208C, with the protein at around 2 mg/ml. Initial preparations

Crystal Structure of E. coli MobA1123

using REFMAC [46] and automatic solvent modeling with ARP [47]. TheTable 2. Model Parameters and Qualitysurface loop comprising residues 15–21 is poorly resolved in the electron

Contents of Model (Molecules/Nonhydrogen Atoms) density maps, and all the large side chains were omitted from the model inthis region. A clearly defined region of electron density resulting from aProtein (residues/atoms) 188/1423nonprotein entity was modeled as two citrate molecules coordinating aCitrate 2/26single metal ion. The peak height for the metal ion was comparable to thatLithium ion 1of a water molecule. Analysis of anomalous difference Fouriers calculatedWater 149using the SeMet data yielded no significant peaks at this site, even with thedata collected at a wavelength of 1.2 A (low-energy remote). By contrast,Refinementpeaks as high as 10 s were associated with the protein sulfurs. This ruled

Resolution range (A) 40–1.35 out K1 and Ca21 as possible candidates because they have theoretical f9Rcryst

a (based on 95% of data; %) 18.4 values greater than sulfur at this wavelength. Na1 and Mg21 are more likelyRfree

a (based on 5% of data; %) 21.3 as they have lower f9 values than sulfur, although a peak should neverthelesshave been observable. This leaves Li1, which has virtually no anomalousRoot-Mean-Square Deviations from Idealitysignal at 1.2 A. Subsequent refinement of a Li1-bis-citrate complex gave a

Bond distances (A) 0.011 good fit to the electron density. The Li1 ion had a final temperature factorAngle distances (A) 0.029 of 20 A2, which compared favorably to those of its oxygen neighbors havingChiral volume (A3) 0.106 an average value of 21 A2. However, Li1 was not specifically added to thePlanar torsion angles (8) 5.8 protein at any stage during the purification or crystallization and, therefore,Staggered torsion angles (8) 13.1 if the metal is Li1, its origin is unknown. The Li1-bis-citrate moiety is locatedOrthonormal torsion angles (8) 19.5 in a crystal contact region and makes several hydrogen bonds with its two

protein neighbors. It is conceivable that these interactions are essential forAverage B Factors (A2)crystallogenesis.

Main chain atoms 19Side chain atoms 22 Model QualityOther heterogens 24 The final model contained a total of 188 residues in a single chain and wasWaters 32 lacking 3 residues of the native transcript at both termini, as well as a furtherOverall 22 7 residues of the affinity tag at the C terminus. Nevertheless, density for

additional residues was present at each of the termini, but these coulda The R factors Rcryst and Rfree are calculated as follows: R 5 S(|Fobs 2 Fcalc|)/not be confidently modeled. In the final refinement phase, all atoms wereS|Fobs| 3 100, where Fobs and Fcalc are the observed and calculated structuresubjected to anisotropic temperature factor refinement. This was consideredfactor amplitudes, respectively.to be justified because the observation to parameter ratio was still relativelyhigh (approximately 10) and there was a resultant drop in the Rfree of around1.5% relative to an equivalent refinement using only isotropic thermal param-eters. The final Rcryst and Rfree values were 18.4% and 21.3%, respectively.the inflection point (0.9798 A, 0.9795 A, and 0.9802 A). All data sets wereThe mean coordinate error was estimated to be 0.06 A from the Data Preci-collected in a single 1808 sweep of 18 images to 2.0 A resolution.sion Index [48]. The final model quality was evaluated using PROCHECK[49]. Approximately 90% of the residues lay in the most favored regions ofData Processing and Structure Solutionthe Ramachandran plot, although there was a single outlier, Lys-16, whichThe native X-ray data were processed with DENZO and scaled and mergedresides in the disordered portion of the consensus loop. Otherwise, thewith SCALEPACK [39], and all subsequent downstream processing andoverall criteria required for a structure at this resolution were either satisfiedstatistical analysis was effected using programs from the CCP4 suite [40].or exceeded. The parameters of the final model are summarized in Table 2.A subset of the data comprising a random 5% of the reflections was set

aside for the calculation of “free” (Rfree) crystallographic R factors [41] duringSubstrate Docking Modelmodel refinement. For the MAD data, after SCALEPACK ran (the “no merge”The GTP-bound structure of MobA was modeled by simply docking a GTPoption was used), the resultant files were processed with the program SOLVEmolecule into the active site pocket with the program O [45]. The superposed(http://www.solve.lanl.gov) [42]. Refinement of the anomalous scatteringligand-bound structure of GlmU served as a guide [33]. Changes were madefactors showed that the data set collected at a wavelength of 0.9802 A gaveto several of the rotatable bonds of the GTP molecule in order to improvethe most negative f9 value (27.0) and hence was the closest to the truethe fit, while the protein structure remained unchanged. A Mg21 ion wasinflection point. Consequently, this data set was used in preference to theplaced in a location previously occupied by a water molecule in the crystalother two that had been collected at wavelengths close to the inflectionstructure. The resultant model was not energy minimized.point (i.e. 0.9798 A and 0.9795 A). The prefered inflection point data set was

then taken, together with the peak and high-energy remote data sets, forAcknowledgmentsthe subsequent phasing calculations in SOLVE. The low-energy remote data

were not required for structure solution but were used for another purposeThis work was funded by the BBSRC. G. B. is grateful for a special student-(see below). The 194 residue primary structure of E. coli MobA contains aship jointly funded by the John Innes Centre and the University of Easttotal of eight methionine residues. SOLVE automatically determined theAnglia. T. P. is a Royal Society Research Fellow. We are grateful for supportpositions of five of these, although subsequent analysis of the final modeland access to the SRS in Daresbury and the ESRF in Grenoble and wouldshowed that the remaining three were disordered in the structure. Phasingparticularly like to thank beamline scientists J. Nicholson (SRS) and E. Mitch-calculations based on the five ordered SeMet positions gave an overallell (ESRF). L. Delarbre is acknowledged for assistance during X-ray datafigure of merit of 0.71 at 2.0 A resolution, which increased to 0.83 aftercollection. We are indebted to S. Bornemann and D. Lowe for useful discus-solvent flattening and histogram matching with DM [43]. An experimentallysions and for critical reading of this manuscript. We would also like to thankphased Fobs electron density map calculated at this stage gave clear contrastY. Bourne for the kind provision of the GlmU X-ray coordinates.between protein and bulk solvent regions, and many secondary-structural

elements were apparent. Data collection and processing statistics of alldata used in the structure solution and refinement are summarized in Table 1. Received: July 13, 2000

Revised: September 5, 2000Accepted: September 19, 2000Model Building and Refinement

Given the quality of phases and the availability of high-resolution nativedata, this seemed to be an ideal candidate for the warpNtrace automatic Referenceschain-tracing procedure of the program wARP [44]. Indeed, this was suc-cessful in placing 163 out of the expected 201 residues into the model in 1. Romao, M.J., et al., and Huber, R. (1995). Crystal structure of the xan-

thine oxidase-related aldehyde oxidoreductase from D. gigas. Sciencethree fragments with a connectivity index of 0.95. Subsequent iterations ofmodel building using the program O [45] were alternated with refinement 270, 1170–1176.

Structure1124

2. Kisker, C., et al., and Rees, D.C. (1997). Molecular basis of sulfite oxidase 24. Rajagopalan, K.V., and Johnson, J.L. (1992). The pterin molybdenumcofactors. J. Biol. Chem. 267, 10199–10202.deficiency from the structure of sulfite oxidase. Cell 91, 973–983.

25. Rossmann, M.G., Liljas, A., Branden, C.-I., and Bansazak, L.J. (1975).3. Schindelin, H., Kisker, C., Hilton, J., Rajagpalan, K.V., and Rees, D.C.Evolutionary relationships among dehydrogenases. In The Enzymes,(1996). Crystal structure of DMSO reductase: redox-linked changes invol. 11, I.P.D. Boyer, ed. (New York: Academic Press), pp. 61–102.molybdopterin coordination. Science 272, 1615–1621.

26. Liu, M.T., Wuebbens, M.M., Rajagopalan, K.V., and Schindelin, H. (2000).4. Contreras, I., Toro, C.S., Troncoso, G., and Mora, G.C. (1997). SalmonellaCrystal structure of the gephyrin-related molybdenum cofactor biosyn-typhi mutants defective in anaerobic respiration are impaired in theirthesis protein MogA from Escherichia coli. J. Biol. Chem. 275, 1814–ability to replicate within epithelial cells. Microbiology 143, 2665–2672.1822.5. Wuebbens, M.M., and Rajagopalan, K.V. (1995). Investigation of the

27. Holm, L., and Sander, C. (1995). DALI: a network tool for protein structureearly steps of molybdopterin biosynthesis in Escherichia coli throughcomparison. Trends. Biochem. Sci. 20, 478–480.the use of in vivo labeling studies. J. Biol. Chem. 270, 1082–1087.

28. Scapin, G., Reddy, S.G., and Blanchard, J.S. (1996). Three-dimensional6. Pitterle, D.M., and Rajagopalan, K.V. (1993). The biosynthesis of molyb-structure of meso-diaminopimelic acid dehydrogenase from Coryne-dopterin in Escherichia coli. Purification and characterization of thebacterium glutamicum. Biochemistry 35, 13540–13551.converting factor. J. Biol. Chem. 268, 13499–13505.

29. Tesmer, J.J., Klem, T.J., Deras, M.L., Davisson, V.J., and Smith, J.L.7. Pitterle, D.M., Johnson, J.L., and Rajagopalan, K.V. (1993). In vitro syn-(1996). The crystal structure of GMP synthetase reveals a novel catalyticthesis of molybdopterin from precursor Z using purified converting fac-triad and is a structural paradigm for two enzyme families. Nature Struct.tor. Role of protein-bound sulfur in formation of the dithiolene. J. Biol.Biol. 3, 74–86.Chem. 268, 13506–13509.

30. Lowe, J., and Amos, L.A. (1998). Crystal structure of the bacterial cell-8. Kuper, J., Palmer, T., Mendel, R.R., and Schwarz, G. (2000). Mutations indivision protein FtsZ. Nature 391, 203–206.the molybdenum cofactor biosynthetic protein Cnx1G from Arabidopsis

31. Thoden, J.B., Frey, P.A., and Holden, H.M. (1996). Molecular structurethaliana define functions for molybdopterin binding, molybdenum inser-of the NADH/UDP-glucose abortive complex of UDP- galactose 4-epi-tion, and molybdenum cofactor stabilization. Proc. Natl. Acad. Sci. USAmerase from Escherichia coli: implications for the catalytic mechanism.97, 6475–6480.Biochemistry 35, 5137–5144.9. Palmer, T., Vasishta, A., Whitty, P.W., and Boxer, D.H. (1994). Isolation

32. Jelakovic, S., Jann, K., and Schulz, G.E. (1996). The three-dimensionalof protein FA, a product of the mob locus required for molybdenumstructure of capsule-specific CMP: 2-keto-3-deoxy-manno-octonic acidcofactor biosynthesis in Escherichia coli. Eur. J. Biochem. 222, 687–692.synthetase from Escherichia coli. FEBS Lett. 391, 157–161.10. Fukui, T., Kazuta, Y., Katsube, T., Tagaya, M., and Tanizawa, K. (1993).

33. Brown, K., Pompeo, F., Dixon, S., Mengin-Lecreulx, D., Cambillau, C.,Exploring the active site in UDP-glucose pyrophosphorylase by affinityand Bourne, Y. (1999). Crystal structure of the bifunctional N-acetylglu-labelling and site-directed mutagenesis. Biotechnol. Appl. Biochem. 18,cosamine 1-phosphate uridyltransferase from Escherichia coli: a para-209–216.digm for the related pyrophosphorylase superfamily. EMBO J. 18, 4096–11. Weber, C.H., Park, Y.S., Sanker, S., Kent, C., and Ludwig, M.L. (1999).4107.A prototypical cytidylyltransferase: CTP:glycerol-3-phosphate cytidylyl-

34. Charnock, S.J., and Davies, G.J. (1999). Structure of the nucleotide-transferase from Bacillus subtilis. Structure 7, 1113–1124.diphospho-sugar transferase, SpsA from Bacillus subtilis, in native and12. Boxer, D.H., Low, D.C., Pommier, J., and Giordano, G. (1987). Involve-nucleotide-complexed forms. Biochemistry 38, 6380–6385.ment of a low-molecular-weight substance in in vitro activation of the

35. Sheu, K.F., Richard, J.P., and Frey, P.A. (1979). Stereochemical coursesmolybdoenzyme respiratory nitrate reductase from a chlB mutant of

of nucleotidyltransferase and phosphotransferase action. Uridine di-Escherichia coli. J. Bacteriol. 169, 4678–4685.

phosphate glucose pyrophosphorylase, galactose-1- phosphate uridy-13. Iobbi-Nivol, C., Palmer, T., Whitty, P.W., McNairn, E., and Boxer, D.H.

lyltransferase, adenylate kinase, and nucleoside diphosphate kinase.(1995). The mob locus of Escherichia coli K12 required for molybdenum Biochemistry 18, 5548–5556.cofactor biosynthesis is expressed at very low levels. Microbiology 141, 36. Sambrook, J., Fritsch, E.F., and Maniatis, T. (1989). Molecular Cloning.1663–1671. A laboratory manual (Cold Spring Harbor: Cold Spring Harbor Laboratory

14. Palmer, T., Santini, C.L., Iobbi-Nivol, C., Eaves, D.J., Boxer, D.H., and Press).Giordano, G. (1996). Involvement of the narJ and mob gene products 37. Matthews, B.W. (1968). Solvent content of protein crystals. J. Mol. Biol.in distinct steps in the biosynthesis of the molybdoenzyme nitrate reduc- 33, 491–497.tase in Escherichia coli. Mol. Microbiol. 20, 875–884. 38. Stevenson, C.E.M., Mayer, S.M., Delarbre, L., and Lawson, D.M. (2000).

15. Johnson, J.L., Indermaur, L.W., and Rajagopalan, K.V. (1991). Molybde- Crystal annealing - nothing to lose. Journal of Crystal Growth, in press.num cofactor biosynthesis in Escherichia coli. Requirement of the chlB 39. Otwinowski, Z., and Minor, W. (1997). Processing of X-ray diffractiongene product for the formation of molybdopterin guanine dinucleotide. data collected in oscillation mode. Meth. Enzymol. 276, 307–326.J. Biol. Chem. 266, 12140–12145. 40. CCP4 (Collaborative Computational Project 4) (1994). The CCP4 Suite:

16. Palmer, T., Goodfellow, I.P., Sockett, R.E., McEwan, A.G., and Boxer, Programs for Protein Crystallography. Acta Crystallog. D 50, 760–763.D.H. (1998). Characterisation of the mob locus from Rhodobacter 41. Brunger, A.T. (1993). Assessment of phase accuracy by cross valida-sphaeroides required for molybdenum cofactor biosynthesis. Biochim. tion - the free R-value - methods and applications. Acta Crystallogr DBiophys. Acta 1395, 135–140. 49, 24–36.

17. Eaves, D.J., Palmer, T., and Boxer, D.H. (1997). The product of the 42. Terwilliger, T.C., and Berendzen, J. (1999). Automated MAD and MIRmolybdenum cofactor gene mobB of Escherichia coli is a GTP-binding structure solution. Acta Crystallogr. D 55, 849–861.protein. Eur. J. Biochem. 246, 690–697. 43. Cowtan, K. (1994). DM: an automated procedure for phase improvement

18. Kisker, C., Schindelin, H., and Rees, D.C. (1997). Molybdenum-cofactor- by density modification. Joint CCP4 and ESF-EACBM Newsletter oncontaining enzymes: structure and mechanism. Annu. Rev. Biochem. Protein Crystallography 31, 34–38.66, 233–267. 44. Perrakis, A., Morris, R., and Lamzin, V.S. (1999). Automated protein

19. Richardson, J.S. (1981). The anatomy and taxonomy of protein struc- model building combined with iterative structure refinement. Naturetures. Adv. Prot. Chem. 34, 167–339. Struct. Biol. 6, 458–463.

20. Czjzek, M., Dos Santos, J.P., Pommier, J., Giordano, G., Mejean, V., and 45. Jones, T.A., Zou, J.Y., Cowan, S.W., and Kjeldgaard, M. (1991). ImprovedHaser, R. (1998). Crystal structure of oxidized trimethylamine N-oxide methods for building protein models in electron density maps and thereductase from Shewanella massilia at 2.5 A resolution. J. Mol. Biol. location of errors in these models. Acta Crystallogr. D 47, 110–119.284, 435–447. 46. Murshudov, G.N., Vagin, A.A., and Dodson, E.J. (1997). Refinement of

21. Walker, J.E., Saraste, M., Runswick, M.J., and Gay, N.J. (1982). Distantly macromolecular structures by the maximum-likelihood method. Actarelated sequences in the a- and b-subunits of ATP synthase, myosin, Crystallogr. D 53, 240–255.kinases and other ATP-requiring enzymes and a common nucleotide 47. Lamzin, V.S., and Wilson, K.S. (1993). Automated refinement of proteinbinding fold. EMBO J. 1, 945–951. models. Acta Crystallogr. D 49, 129–147.

22. Saraste, M., Sibbald, P.R., and Wittinghofer, A. (1990). The P-loop - a 48. Cruickshank, D.W.J. (1999). Remarks about protein structure precision.common motif in ATP- and GTP-binding proteins. Trends Biochem. Sci. Acta Crystallogr. D 55, 583–601.15, 430–434. 49. Laskowski, R.A., Macarthur, M.W., Moss, D.S., and Thornton, J.M.

23. Schulz, G.E. (1992). Binding of nucleotides by proteins. Curr. Opin. (1993). PROCHECK: a program to check the stereochemical quality ofprotein structures. J. Appl. Cryst. 26, 283–291.Struct. Biol. 2, 61–67.

Crystal Structure of E. coli MobA1125

50. Barton, G.J. (1993). Alscript - a tool to format multiple sequence align-ments. Prot. Eng. 6, 37–40.

51. Devereux, J., Haeberli, P., and Smithies, O. (1984). A comprehensiveset of sequence-analysis programs for the Vax. Nucleic Acids Res. 12,387–395.

52. Kabsch, W., and Sander, C. (1983). Dictionary of protein secondarystructure - pattern-recognition of hydrogen-bonded and geometricalfeatures. Biopolymers 22, 2577–2637.

53. Kraulis, P.J. (1991). MOLSCRIPT: a program to produce detailed andschematic plots of protein structures. J. Appl. Cryst. 24, 946–950.

54. Merritt, E.A., and Bacon, D.J. (1997). Raster3D: photorealistic moleculargraphics. Meth. Enzymol. 277, 505–524.

55. Guex, N., and Peitsch, M.C. (1997). SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling. Electro-phoresis 18, 2714–2723.

Accession Numbers

The coordinates and structure factor data for MobA have been depositedin the Protein Data Bank with accession codes 1E5K and R1E5KSF, respec-tively.