Genomic clustering of cyanogenic glucoside biosynthetic genes aids their identification in Lotus...

14
Genomic clustering of cyanogenic glucoside biosynthetic genes aids their identification in Lotus japonicus and suggests the repeated evolution of this chemical defence pathway Adam M. Takos 1 , Camilla Knudsen 1 , Daniela Lai 1 , Rubini Kannangara 1 , Lisbeth Mikkelsen 1 , Mohammed S. Motawia 1 , Carl E. Olsen 2 , Shusei Sato 3 , Satoshi Tabata 3 , Kirsten Jørgensen 1 , Birger L. Møller 1 and Fred Rook 1,* 1 Plant Biochemistry Laboratory, Department of Plant Biology and Biotechnology, University of Copenhagen, 1871 Frederiksberg, Denmark, 2 Department of Basic Sciences and Environment, University of Copenhagen, 1871 Frederiksberg, Denmark, and 3 Kazusa DNA Research Insititute, Kisarazu, Japan Received 11 May 2011; revised 20 June 2011; accepted 21 June 2011. * For correspondence (fax +45 35333300; e-mail [email protected]). SUMMARY Cyanogenic glucosides are amino acid-derived defence compounds found in a large number of vascular plants. Their hydrolysis by specific b-glucosidases following tissue damage results in the release of hydrogen cyanide. The cyanogenesis deficient1 (cyd1) mutant of Lotus japonicus carries a partial deletion of the CYP79D3 gene, which encodes a cytochrome P450 enzyme that is responsible for the first step in cyanogenic glucoside biosynthesis. The genomic region surrounding CYP79D3 contains genes encoding the CYP736A2 protein and the UDP-glycosyltransferase UGT85K3. In combination with CYP79D3, these genes encode the enzymes that constitute the entire pathway for cyanogenic glucoside biosynthesis. The biosynthetic genes for cyanogenic glucoside biosynthesis are also co-localized in cassava (Manihot esculenta) and sorghum (Sorghum bicolor), but the three gene clusters show no other similarities. Although the individual enzymes encoded by the biosynthetic genes in these three plant species are related, they are not necessarily orthologous. The independent evolution of cyanogenic glucoside biosynthesis in several higher plant lineages by the repeated recruitment of members from similar gene families, such as the CYP79s, is a likely scenario. Keywords: cyanogenic glucosides, gene clustering, cytochrome P450, Lotus japonicus, Manihot esculenta, Sorghum bicolor. INTRODUCTION Plants have evolved a large spectrum of chemical defence compounds, and while some are produced by only a limited number of related species, others, such as cyanogenic glu- cosides, are more widely distributed. Over 60 different cya- nogenic glucosides, amino acid-derived a-hydroxynitrile glucoside compounds, are known from over 2600 plant species ranging from ferns to angiosperms, including a disproportionately large number of crops (Seigler, 1975; Conn, 1980; Jones, 1998). Upon tissue disruption, cyano- genic glucosides are degraded by specific b-glucosidases, resulting in the release of hydrogen cyanide, which provides a defence mechanism against generalist herbivores (Morant et al., 2008). Cyanogenesis is often regarded as an evolu- tionarily ancient plant defence mechanism with a single evolutionary origin implied due to the fact that all presently identified enzymes for the first committed step, the conver- sion of an amino acid into an oxime, are cytochrome P450s belonging to the CYP79 family (for reviews, see Bak et al., 2006; Bjarnholt and Møller, 2008). The first genes encoding enzymes for cyanogenic gluco- side biosynthesis were identified in Sorghum bicolor using biochemical approaches (Koch et al., 1995; Bak et al., 1998a; Jones et al., 1999), which provided the basis for the identi- fication of genes in other plant species by sequence homo- logy. Sorghum produces the aromatic cyanogenic glucoside dhurrin, derived from tyrosine, and its biosynthesis requires three sequential steps involving two cytochrome P450 enzymes and an UDP-glucosyltransferase. The first cyto- chrome P450, CYP79A1, converts tyrosine into an oxime intermediate, which is converted by the second cytochrome ª 2011 The Authors 1 The Plant Journal ª 2011 Blackwell Publishing Ltd The Plant Journal (2011) doi: 10.1111/j.1365-313X.2011.04685.x

Transcript of Genomic clustering of cyanogenic glucoside biosynthetic genes aids their identification in Lotus...

Genomic clustering of cyanogenic glucoside biosyntheticgenes aids their identification in Lotus japonicus and suggeststhe repeated evolution of this chemical defence pathway

Adam M. Takos1, Camilla Knudsen1, Daniela Lai1, Rubini Kannangara1, Lisbeth Mikkelsen1, Mohammed S. Motawia1,

Carl E. Olsen2, Shusei Sato3, Satoshi Tabata3, Kirsten Jørgensen1, Birger L. Møller1 and Fred Rook1,*

1Plant Biochemistry Laboratory, Department of Plant Biology and Biotechnology, University of Copenhagen,

1871 Frederiksberg, Denmark,2Department of Basic Sciences and Environment, University of Copenhagen, 1871 Frederiksberg, Denmark, and3Kazusa DNA Research Insititute, Kisarazu, Japan

Received 11 May 2011; revised 20 June 2011; accepted 21 June 2011.*For correspondence (fax +45 35333300; e-mail [email protected]).

SUMMARY

Cyanogenic glucosides are amino acid-derived defence compounds found in a large number of vascular plants.

Their hydrolysis by specific b-glucosidases following tissue damage results in the release of hydrogen cyanide.

The cyanogenesis deficient1 (cyd1) mutant of Lotus japonicus carries a partial deletion of the CYP79D3 gene,

which encodes a cytochrome P450 enzyme that is responsible for the first step in cyanogenic glucoside

biosynthesis. The genomic region surrounding CYP79D3 contains genes encoding the CYP736A2 protein and

the UDP-glycosyltransferase UGT85K3. In combination with CYP79D3, these genes encode the enzymes that

constitute the entire pathway for cyanogenic glucoside biosynthesis. The biosynthetic genes for cyanogenic

glucoside biosynthesis are also co-localized in cassava (Manihot esculenta) and sorghum (Sorghum bicolor),

but the three gene clusters show no other similarities. Although the individual enzymes encoded by the

biosynthetic genes in these three plant species are related, they are not necessarily orthologous. The

independent evolution of cyanogenic glucoside biosynthesis in several higher plant lineages by the repeated

recruitment of members from similar gene families, such as the CYP79s, is a likely scenario.

Keywords: cyanogenic glucosides, gene clustering, cytochrome P450, Lotus japonicus, Manihot esculenta,

Sorghum bicolor.

INTRODUCTION

Plants have evolved a large spectrum of chemical defence

compounds, and while some are produced by only a limited

number of related species, others, such as cyanogenic glu-

cosides, are more widely distributed. Over 60 different cya-

nogenic glucosides, amino acid-derived a-hydroxynitrile

glucoside compounds, are known from over 2600 plant

species ranging from ferns to angiosperms, including a

disproportionately large number of crops (Seigler, 1975;

Conn, 1980; Jones, 1998). Upon tissue disruption, cyano-

genic glucosides are degraded by specific b-glucosidases,

resulting in the release of hydrogen cyanide, which provides

a defence mechanism against generalist herbivores (Morant

et al., 2008). Cyanogenesis is often regarded as an evolu-

tionarily ancient plant defence mechanism with a single

evolutionary origin implied due to the fact that all presently

identified enzymes for the first committed step, the conver-

sion of an amino acid into an oxime, are cytochrome P450s

belonging to the CYP79 family (for reviews, see Bak et al.,

2006; Bjarnholt and Møller, 2008).

The first genes encoding enzymes for cyanogenic gluco-

side biosynthesis were identified in Sorghum bicolor using

biochemical approaches (Koch et al., 1995; Bak et al., 1998a;

Jones et al., 1999), which provided the basis for the identi-

fication of genes in other plant species by sequence homo-

logy. Sorghum produces the aromatic cyanogenic glucoside

dhurrin, derived from tyrosine, and its biosynthesis requires

three sequential steps involving two cytochrome P450

enzymes and an UDP-glucosyltransferase. The first cyto-

chrome P450, CYP79A1, converts tyrosine into an oxime

intermediate, which is converted by the second cytochrome

ª 2011 The Authors 1The Plant Journal ª 2011 Blackwell Publishing Ltd

The Plant Journal (2011) doi: 10.1111/j.1365-313X.2011.04685.x

P450, CYP71E1, into a hydroxynitrile, which is then stabilized

by glucosylation catalysed by the UDP-glucosyltransferase

UGT85B1, resulting in dhurrin formation (Figure 1).

Similarly, in other plant species, members of the CYP79

family have been shown to catalyse the first step in

cyanogenic glucoside biosynthesis. In cassava (Manihot

esculenta), the cyanogenic glucosides linamarin and lot-

australin are synthesized from the amino acids valine and

isoleucine, respectively, involving the activity of CYP79D1

and CYP79D2 (Andersen et al., 2000). The model legume

Lotus japonicus contains two genes, CYP79D3 and

CYP79D4, that encode cytochrome P450 enzymes involved

in the production of linamarin and lotaustralin and related

non-cyanogenic b- and c-hydroxynitrile glucosides called

rhodiocyanosides (Forslund et al., 2004). Rhodiocyanoside

biosynthesis is thought to diverge from cyanogenic gluco-

side biosynthesis at the level of the hydroxynitrile interme-

diate, and natural variation for the presence or absence of

rhodiocyanosides involves a genetic locus that we refer to

here as Rho (Figure 1) (Bjarnholt et al., 2008). In white clover

(Trifolium repens), an adaptive defence polymorphism

controlling the presence or absence of cyanogenic gluco-

sides in natural populations is conferred by variation at the

Ac locus. Ac encodes the CYP79D15 enzyme, which cataly-

ses the first step in lotaustralin and linamarin biosynthesis

(Olsen et al., 2008). The production of oximes from amino

acids by members of the CYP79 family is also part of the

biosynthesis of non-cyanogenic defence compounds such

as glucosinolates and camalexin (Bak et al., 1998b; Hull

et al., 2000; Reintanz et al., 2001; Glawischnig et al., 2004).

Functional homologues of the sorghum CYP71E1 and

UGT85B1 enzymes are more difficult to identify because

they are members of large protein families and lack

conserved sequence motifs indicating their specific meta-

bolic function, and only recently have they been identified in

cassava (Jørgensen et al., 2011; Kannangara et al., 2011).

The legume Lotus japonicus is used as a genetic model to

study cyanogenesis, and we recently reported a high-

throughput genetic screen for cyanogenesis deficient

mutants (Takos et al., 2010). Here we identify the cyd1 locus

as encoding the CYP79D3 gene, providing genetic evidence

that CYP79D3 is the enzyme that produces cyanogenic

Figure 1. The hydroxynitrile glucoside biosynthetic pathways in S. bicolor and L. japonicus.

In S. bicolor, the synthesis of dhurrin from tyrosine involves CYP79A1 (Koch et al., 1995), CYP71E1 (Bak et al., 1998a) and UGT85B1 (Jones et al., 1999). L. japonicus

produces the cyanogenic glucosides linamarin (from valine) and lotaustralin (from isoleucine), as well as non-cyanogenic hydroxynitrile glucosides called

rhodiocyanoside A and D (from isoleucine). CYP79D3 is the predominant enzyme in leaves; CYP79D4 is expressed in the root (Forslund et al., 2004). The isoleucine-

derived 2-methylbutanal oxime intermediate is used in the synthesis of lotaustralin and rhodiocyanosides A and D. CYP736A2 is involved in the production of

cyanogenic glucosides, while rhodiocyanoside production depends on the product encoded by the Rho locus. The UDP-glucosyltransferases identified belong to the

UGT85 family.

2 Adam M. Takos et al.

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

glucosides in leaves of mature Lotus plants. The genomic

region containing CYP79D3 contains at least five loci encod-

ing enzymes involved in hydroxynitrile glucoside biosynthe-

sis. These include the CYP736A2 and UGT85K3 genes, which

complete the cyanogenic glucoside biosynthetic pathway,

and the genetic locus responsible for natural variation in

rhodiocyanoside biosynthesis (Rho). Clustering of genes

involved in cyanogenic glucoside biosynthesis was also

observed in cassava and sorghum, but the three genomic

regions are very diverse in structure. Based on the available

plant genome sequence data, it is a likely possibility that the

recruitment of these sets of genes for cyanogenic glucoside

synthesis represent independent events.

RESULTS

The cyanogenesis deficient1 mutant is defective in CYP79D3

A screen for mutants defective in cyanogenesis (Takos et al.,

2010) identified cyd1 as an acyanogenic mutant that, under

the screening conditions, did not contain hydroxynitrile

glucosides (either cyanogenic glucosides or rhodiocyano-

sides) in leaves. As synthesis of all hydroxynitrile glucosides

in the cyd1 mutant was severely affected, a defect in an early

step of the biosynthetic pathway, possibly in the formation of

the oxime from the amino acid precursors, was suggested

(Figure 1). Two paralogous cytochrome P450 genes,

CYP79D3 and CYP79D4, had been shown previously to

convert valine and isoleucine to the corresponding oximes,

and of these CYP79D3 is predominantly expressed in leaves

(Forslund et al., 2004). To determine the chromosomal

position of the cyd1 locus, the mutant was crossed with

L. japonicus cv. Gifu B-129 (Handberg and Stougaard, 1992).

Analysis of F2 plants for the acyanogenic phenotype dem-

onstrated close linkage of the recessive cyd1 locus with the

SSLP marker TM1185 on genomic contig CM0241 of chro-

mosome 3. In the L. japonicus genome sequence (Sato et al.,

2008; http://www.kazusa.or.jp/lotus), both CYP79D3 and

CYP79D4 are localized on contig CM0241 and estimated to be

240 kb apart. The sequences of the CYP79D3 (CM0241.700)

and CYP79D4 (CM0241.310) genes were determined in the

cyd1 mutant, and while both exons of the CYP79D4 gene

were intact, it was not possible to amplify the second exon of

CYP79D3 from the cyd1 mutant.

The Lotus genome database contains a 1926 bp fragment,

LjSGA_030566, that contains a sequence that is 98% iden-

tical at the nucleotide level to part of the CYP79D3 intron and

the entire second exon. LjSGA_030566 is thought to be a

pseudo-gene lacking the first exon, and its precise chromo-

somal location is presently unknown. A number of single

nucleotide differences, and most notably a 32 bp insertion in

the LjSGA_030566 sequence, distinguish it from CYP79D3.

Primers spanning this insertion site amplified two bands

from genomic DNA of the wild-type MG-20 (191 and 223 bp),

but only the larger LjSGA_030566-derived band was ampli-

fied from the cyd1 mutant (Figure 2a). Lack of a CYP79D3

transcript in the cyd1 mutant also indicated that the gene

was defective (Figure 2b).

To confirm that loss of a functional CYP79D3 gene was

responsible for the lack of cyanogenesis in the cyd1 mutant,

the gene was re-introduced by transforming the cyd1 mutant

with the CYP79D3 coding region under the control of the 35S

CaMV promoter. Cyanogenesis was restored in multiple

independent transformed lines as shown for line 5 in

Figure 2c. Complementation of the cyd1 phenotype was

further analysed by LC-MS of leaf extracts, and showed not

only restored biosynthesis of lotaustralin and linamarin, but

also of the non-cyanogenic hydroxynitrile glucosides rhod-

iocyanoside A and D (Figure 2d).

Contig CM0241 contains a UGT85 that is involved

in cyanogenic glucoside biosynthesis

The first UDP-glucosyltransferase identified as glucosylating

hydroxynitriles was SbUGT85B1, which is involved in

dhurrin production in sorghum (Jones et al., 1999). In

L. japonicus, the CM0241 contig contains a member of an-

other UGT85 sub-family (gene CM0241.610) that has been

assigned the name UGT85K3 following the recommenda-

tions for nomenclature of Mackenzie et al. (1997). A second

gene, UGT85K2 (T45I18.30), encodes a protein that is 86%

identical at the amino acid level to UGT85K3. UGT85K2 is

located further down on chromosome 3 at approximately

31.6 cM (approximately 18 cM from CM0241). Both

UGT85K2 and UGT85K3 show 41% amino acid identity with

SbUGT85B1, and 56% identity with UGT85K4 and UGT85K5,

which are involved in cyanogenic glucoside production in

cassava (Kannangara et al., 2011).

The coding regions of UGT85K2 and UGT85K3 were

amplified as cDNAs from MG-20 and expressed in Escher-

ichia coli. To test the ability of UGT85K2 and UGT85K3 to

glucosylate the different hydroxynitriles, E. coli extracts

were incubated with 20 mM of the putative acceptors and

3.3 lM UDP-[14C]-glucose as donor. The reaction products

formed were visualized on TLC plates and verified by LC-MS

analysis. Both UGT85K2 and UGT85K3 were able to glu-

cosylate acetone cyanohydrin (the linamarin aglycone), and

2-hydroxy-2-methylbutyronitrile (the lotaustralin aglycone)

(Figure 3).

The CYP736A2 gene completes a functional biosynthetic

pathway for cyanogenic glucoside biosynthesis in Lotus

japonicus

In sorghum, the second step in cyanogenic glucoside bio-

synthesis is mediated by CYP71E1 (Bak et al., 1998a), and a

functional homologue, CYP71E7, was recently characterized

in cassava (Jørgensen et al., 2011). CYP71E7 was identified

using CYP71E1 as the query in a BLASTp search of the cas-

sava genome sequence, and the proteins show 49% identity.

Using either CYP71E1 or CYP71E7 to query the Lotus

Cyanogenic glucoside biosynthetic gene clusters 3

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

genome sequence identified members of the CYP83E sub-

family as the most similar, with identity scores of approx-

imately 40%. Using CYP71E7 as the query, the highest

identity score (42%) was obtained for CYP83E4 (CM1089.70),

and no member of the CYP71E sub-family was identified in

the Lotus genome. However, the CM0241 contig contains a

member of another cytochrome P450 gene family named

CYP736A2 (CM0241.850), as well as three closely related

pseudo-genes (CM0241.540, CM0241.880 and CM0241.940).

The CYP736A2 amino acid sequence shows approximately

35% identity with CYP71E1 and CYP71E7. The CYP71, CYP83

and CYP736 families all belong to the CYP71 clan, which

contains over 50 CYP families (Nelson and Werck-Reichhart,

2011). Within the CYP71 clan, the three families are part of

the same sub-branch formed by seven CYP families. Their

phylogenetic relationship is shown in Figure 4. The genome

sequences of sorghum, cassava and Lotus were queried

using the CYP71E1, CYP71E7, CYP736A2 and CYP83E4

sequences. The sequences with the highest homology

scores from each BLASTp search were collected, and a

phylogenetic tree was constructed using the MEGA5

package (Figure 4) (Tamura et al., 2011). Additional protein

sequences of members characteristic for the relevant CYP71

clan families, mostly from other plant species, were included

in the analysis. Families just outside this sub-branch of the

CYP71 clan are represented by CYP76A3, CYP80A1,

CYP92A1 and CYP786A1, and anchor the tree.

As shown in Figure 4, the CYP71 and CYP83 families are

most closely related to each other, and group with the

CYP726 and CYP99 families. These four families are now

Figure 2. The cyd1 mutant is defective in CYP79D3.

(a) PCR amplification showing the 32 bp insertion polymorphism between

CYP79D3 and LjSGA_030566 using MG-20 and cyd1 genomic DNA as

template.

(b) Absence of the CYP79D3 transcript in the cyd1 mutant was determined by

quantitative RT-PCR.

(c) Restoration of cyanogenesis in leaves of the cyd1 mutant transformed with

a 35S:CYP79D3 construct was detected using HCN-sensitive Feigl–Anger

paper.

(d) Extracted ion chromatogram of the MG-20 parental line, the cyd1 mutant,

and the cyd1 mutant complemented with the 35S:CYP79D3 construct. Apical

leaves of 5-month-old tissue culture-grown plants were analysed. Extracted

ion peaks are for sodium adducts: linamarin (m/z 270, cyan); rhodiocyanoside

A and D (m/z 282, red); lotaustralin (m/z 284, blue).

(a) (b)

Figure 3. Glucosylation of hydroxynitrile substrates by UGT85K2 and

UGT85K3.

(a) Activity of E. coli-expressed UGT85K2 and UGT85K3 on acetone cyano-

hydrin (AC) and 2-hydroxy-2-methylbutyronitrile (HBN) substrates using UDP-

[14C]-glucose (G) as donor and analysed by TLC.

(b) Unlabelled standard for linamarin (lin) and lotaustralin (lot) run on the

same plate and visualized following charring by H2SO4.

4 Adam M. Takos et al.

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

Figure 4. Phylogenetic analysis of CYP736A2 in relation to the CYP71Es and CYP83Es.

The protein sequences most closely related to CYP736A2, CYP71E1, CYP71E7 and CYP83E4 (indicated with black circles) were collected from the Lotus, sorghum and

cassava genomes. A phylogenetic tree was constructed by the neighbour-joining method using a 1000-replicate bootstrap test. Gene identifiers are those used in the

respective genome databases; additional assigned names are indicated. Characteristic members of several families from the CYP71 clan were included for

comparison (indicated with diamonds).

Cyanogenic glucoside biosynthetic gene clusters 5

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

considered to have merged due to expansion of the CYP71

family as more sequences have become available (Nelson

and Werck-Reichhart, 2011). The CYP736 family containing

the Lotus CYP736A2 gene is not found in monocots, and

consequently no sorghum homologues were identified. The

CYP736 family is most closely related to the CYP84 and

CYP750 families, and is clearly distinct from the CYP71E sub-

family (Figure 4).

The genomic co-localization of CYP79D3 and UGT85K3

with CYP736A2 suggested that the latter might encode the

second step of the cyanogenic glucoside pathway. To test

this possibility, all three genes were cloned into the

pJAM1502 vector under the control of the CaMV 35S

promoter, and their activity was tested by Agrobacterium-

mediated transient expression in Nicotiana benthamiana.

Testing the L. japonicus genes separately and in various

combinations showed that only the triple combination of

CYP79D3, CYP736A2 and either UGT85K2 or UGT85K3

resulted in the production of linamarin and lotaustralin

(Figures 5a and S1). Interestingly, the production of rhodi-

ocyanosides was not observed. The cassava gene CYP71E7

was able to effectively substitute for CYP736A2, but the

Lotus CYP83E4 was not (Figure 5b,c). The in planta enzy-

matic activities provide strong support for the idea that these

genes make up the cyanogenic glucoside biosynthetic

pathway in L. japonicus, and that the genes encoding the

pathway are clustered on chromosome 3.

The cyd4 mutant is defective in CYP736A2

In planta expression demonstrated the role of CYP736A2 in

the biosynthesis of the cyanogenic glucosides linamarin and

lotaustralin, and that CYP736A2 was not involved in the

synthesis of rhodiocyanosides (see Figure 5). Consequently,

a putative mutant in the CYP736A2 gene was expected to

Figure 5. Reconstitution of the L. japonicus cyanogenic glucoside biosynthetic pathway by transient expression in Nicotiana benthamiana.

(a) Extracted ion chromatogram of tobacco leaves co-infiltrated with the L. japonicus genes CYP79D3, CYP736A2 and UGT85K3. Extracted ion peaks are for sodium

adducts: linamarin (m/z 270, cyan); rhodiocyanoside A and D (m/z 282, red); lotaustralin (m/z 284, blue), as indicated in Figure 2(d).

(b) As (a), with cassava CYP71E7 replacing CYP736A2.

(c) As (a), with CYP83E4 replacing CYP736A2.

(d) As (a), with the mutated CYP736A2 (R366K) from cyd4.

6 Adam M. Takos et al.

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

show no or reduced levels of cyanogenic glucosides but

wild-type levels of rhodiocyanosides, resulting from the

presence of an intact Rho gene (Figure 1). One of the lines

obtained in the genetic screen for cyanogenesis-deficient

mutants, referred to here as cyd4, displayed the predicted

metabolic profile, with a >60% reduction in lotaustralin and

wild-type levels of rhodiocyanosides (Figure 6). Sequencing

of the CYP736A2 gene in the cyd4 mutant showed a G fi A

nucleotide substitution, changing the arginine at position

366 to a lysine. R366 in CYP736A2 is the invariant arginine

residue of the EXXR motif that is present in virtually all

cytochrome P450 proteins, and is essential for formation of

the tertiary structure (Hasemann et al., 1995; Rupasinghe

et al., 2006). When the mutated version of CYP736A2 was

cloned as a cDNA from cyd4 and transiently expressed in

Nicotiana benthamiana in combination with CYP79D3 and

UGT85K3, no production of either linamarin or lotaustralin

was observed (Figure 5d). These data provide genetic sup-

port for the role of CYP736A2 in the biosynthesis of cyano-

genic glucosides in L. japonicus.

Expression of cyanogenic glucoside biosynthetic genes in

relation to leaf age

Quantitative real-time PCR was used to examine the expres-

sion profile of CYP736A2, UGT85K2 and UGT85K3 in com-

parison with that of CYP79D3. Characteristically, CYP79D3

expression is related to leaf age, with the highest expression

observed in the apical leaves (Figure 7) (Forslund et al.,

2004). CYP736A2 expression showed a similar expression

profile to that observed for CYP79D3, with transcript levels

decreasing in older leaves. The UGT85K2 and UGT85K3 gene

expression levels did not change with leaf age. Although the

genes are not strictly co-regulated, the overlap in expression

pattern of CYP79D3, CYP736A2, UGT85K2 and UGT85K3

supports the view that these genes encode the enzymes of

the pathway for cyanogenic glucoside biosynthesis in

L. japonicus.

Natural variation in rhodiocyanoside biosynthesis is genet-

ically linked

We previously investigated natural variation in hydroxynit-

rile glucoside content in L. japonicus accessions, and

showed that absence of the non-cyanogenic rhodiocyano-

sides A and D in the MG-74 accession (see Figure 1) was due

to a single recessive genetic locus named rho (Bjarnholt

et al., 2008). The map position of rho was initially established

using 30 re-selected rho F2 lines from a MG-74 · MG-20

cross, and all showed the MG-74 genotype at marker

TM1185 on chromosome 3. The mapping population was

expanded to 350 F2 lines by initially selecting recombinants

between markers TM0436 (at 10.5 cM) and TM0403 (at

24 cM). These two markers were polymorphic between MG-

20 and MG-74 and flanked the region containing rho. The

recombinant F2 lines, and, where necessary, their segregat-

ing F3 progeny, were analysed for rhodiocyanoside pro-

duction, and genotyped using additional markers. This

established that rho was localized between markers TM1185

on contig CM0241 (at 13.3 cM) and TM0450 (at 15.3 cM). The

genomic sequence in this region contains most of contig

CM0241 but is discontinuous between CM0241 and BAC

clone TM0450, and the identity of the Rho gene remains to

be established.

Clustering of genes involved in cyanogenic glucoside

biosynthesis in the Lotus, cassava and sorghum genomes

The clustering of the genes for biosynthesis of cyanogenic

glucosides and related hydroxynitrile glucosides in the

genome of L. japonicus prompted the analysis of additional

plant genomes of cyanogenic species. Such an analysis is

possible in cassava and sorghum due to recent progress in

the availability of genome sequences (Paterson et al., 2009;

Cassava Genome Project 2010, http://www.phytozome.net/

cassava) and identification of CYP71 and UGT85 homo-

logues involved in cyanogenic glucoside biosynthesis in

cassava (Jørgensen et al., 2011; Kannangara et al., 2011).

In the sorghum genome sequence (available at http://

www.phytozome.net/sorghum), the three genes previously

identified as encoding the cyanogenic glucoside biosynthe-

tic pathway are clustered within a 104 kb region localized at

the top of chromosome 1 (Figure 8). Figure 8 also shows an

uncharacterized member of the CYP71E sub-family,

Figure 6. The cyd4 mutant defective in CYP736A2 contains reduced levels of

cyanogenic glucosides.

(a) Extracted ion chromatogram of the cyd4 mutant. Extracted ion peaks are

for sodium adducts: linamarin (m/z 270, cyan); rhodiocyanoside A and D (m/z

282, red); lotaustralin (m/z 284, blue), as indicated in Figure 2(d).

(b) Quantitative comparison of hydroxynitrile glucoside content in leaves of

MG-20 wild-type and cyd4. Values are the means of five biological repli-

cates � SE.

Cyanogenic glucoside biosynthetic gene clusters 7

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

Sb01g001160, located to the left of CYP71E1 (Sb01g001180),

that has 61% amino acid identity to CYP71E1 (see also

Figure 4). The region generally shows low gene density, but

also contains an alcohol dehydrogenase (Sb01g001170) and

two putative transposases of the hAT element superfamily.

Clustering of the biosynthetic genes for cyanogenic glu-

coside metabolism was also observed in cassava; a 100 kb

region of the 4.6 Mb Scaffold08265 (genome draft version

Cassava4.1; http://www.phytozome.net/cassava) is shown in

Figure 8. In cassava, two cytochrome P450 genes of the

Figure 7. Expression of the L. japonicus cyanogenic glucoside biosynthetic genes in relation to leaf age, determined by quantitative RT-PCR.

Values are relative to the expression level in the 3rd leaf, and are the means of three biological replicates � SE.

(a) Expression of CYP79D3.

(b) Expression of CYP736A2.

(c) Expression of UGT85K3.

(d) Expression of UGT85K2.

Figure 8. Schematic representation of the clustering of cyanogenic glucoside biosynthetic genes in the genomes of L. japonicus, S. bicolor and M. esculenta.

Functional genes are presented by arrows indicating their orientation. Confirmed genes in cyanogenic glucoside biosynthesis are labelled above each bar, with

CYP79 genes in pink, CYP71E and CYP736 genes in green, and UGT85 genes in blue. The three CYP736A2-like pseudo-genes are indicated below the L. japonicus

bar, as is the additional CYP71 in S. bicolor. The Rho locus is within 2 cM of the CM0241 contig. The remaining genes are numbered and annotated as follows: (1)

acireductone dioxygenase, (2) nucleic acid binding, OB-fold, (3,4) hypothetical proteins, (5) ribonuclease H, (6) short-chain dehydrogenase/reductase, (7,8) putative

transposases, (9) leucine-rich repeat receptor-like protein kinase, (10,11) a/b-fold hydrolases, (12) hypothetical protein, (13–15) putative hydroxynitrile lyases, (16)

hypothetical protein.

8 Adam M. Takos et al.

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

CYP79 family, CYP79D1 and CYP79D2, are known, and the

CYP79D2 gene (cassava4.1_005079) is part of the gene

cluster, which also contains the two CYP71E paralogues

(Andersen et al., 2000; Jørgensen et al., 2011) and the UDP-

glucosyltransferase genes UGT85K4 and UGT85K5 (Kann-

angara et al., 2011). The CYP79D1 gene (cassava4.1_005059)

is not part of the gene cluster and is localized on Scaf-

fold08167, which is 356 kb long. The CYP79D1 and CYP79D2

genes are each adjacent to a highly similar leucine-rich repeat

receptor-like protein kinase gene (>90% identity), suggesting

a duplication event. In contrast to the gene clusters in Lotus

and sorghum, the one in cassava is relatively gene-rich. Of

the additional genes in the region, a set of three genes with

homology to hydroxynitrile lyases is noteworthy, as these

may play a role in cyanogenic glucoside catabolism (cas

sava4.1_027709, cassava4.1_031085 and cassava4.1_014254,

numbered 13–15 in Figure 8).

The region of the cyanogenic glucoside biosynthetic gene

cluster in Lotus is the most extended in terms of overall size.

Only a limited number of functional genes are present,

including a putative aci-reductone dioxygenase (CM0241.-

600) and a retrotransposon-related ribonuclease H domain-

containing protein (CM0241.860), while additional partial

gene fragments related to retroelements are also present

(not indicated in Figure 8). Although clustering of genes for

cyanogenic glucoside biosynthesis is observed in all three

species, no other points of similarity could be identified, and

the regions differ substantially in size, gene density, the

identity of additional genes and the presence of transposon-

related sequences.

DISCUSSION

CYP79D3 as shared first enzyme in the biosynthesis of

hydroxynitrile glucosides

Cyanogenesis is the release of toxic HCN following tis-

sue damage and hydrolysis of cyanogenic glucosides, and

results from the inherent instability of the a-hydroxynitri-

le aglycones. In L. japonicus, the isoleucine-derived a-

hydroxynitrile glucoside lotaustralin co-occurs with the

non-cyanogenic compounds rhodiocyanoside A (a c-hy-

droxynitrile glucoside) and rhodiocyanoside D (a b-hy-

droxynitrile glucoside) (Forslund et al., 2004). We previously

reported natural variation in rhodiocyanoside content in

L. japonicus, and a diversification of the hydroxynitrile

glucoside biosynthetic pathway at the level of the nitrile

intermediate was suggested based on biochemical consid-

erations (Bjarnholt et al., 2008). The present study of the cyd1

mutant provides genetic evidence for the idea that the first

enzymatic step is shared. The cyd1 mutant was previously

identified in a leaf-based screen for cyanogenesis-deficient

mutants, and metabolite analysis showed that it contains

neither cyanogenic glucosides nor the related rhodiocyano-

sides (Takos et al., 2010). This suggested a defect early in the

biosynthetic pathway,possiblyat the formationof the oximes

from the amino acid precursors valine and isoleucine. The

map position of the cyd1 mutant and the absence of the sec-

ond exon indicate that cyd1 contains a defect in the CYP79D3

gene encoding a leaf-expressed oxime-producing cyto-

chrome P450 enzyme. Complementation of the cyd1 mutant

with a functional CYP79D3 gene restored the biosynthesis of

all hydroxynitrile glucosides.

Genomic clustering identified the CYP736A2 and UGT85K3

genes

The genomic region with the genes encoding CYP79D3 and

its paralogue CYP79D4 also contained candidate genes

for the remaining two steps in cyanogenic glucoside bio-

synthesis in L. japonicus: the UDP-glucosyltransferase

UGT85K3 and the cytochrome P450 CYP736A2. Both

UGT85K3 and the closely related UGT85K2 were able to

glucosylate the hydroxynitrile aglycones of linamarin and

lotaustralin in vitro. No close homologue of the sorghum

CYP71E1 or cassava CYP71E7 was identified in the Lotus

genome sequence, and the candidacy of CYP736A2 was

suggested by the observed genomic co-localization. The

activity of CYP736A2 was demonstrated by reconstituting

the Lotus biosynthetic pathway using transient co-expres-

sion in N. benthamiana. The triple combination of CYP79D3,

CYP736A2 and UGT85K3 produced the cyanogenic gluco-

sides linamarin and lotaustralin in tobacco. The cyd4 mutant

provided genetic evidence for the role of CYP736A2 in cya-

nogenic glucoside biosynthesis. The CYP736A2 protein in

the cyd4 mutant is defective in the invariant arginine of the

EXXR motif (Hasemann et al., 1995), and its substitution by a

lysine is likely to result in improper protein folding, thus

explaining the observed lack of enzymatic activity. The cyd4

mutant showed a large reduction in cyanogenic glucoside

content but wild-type levels of rhodiocyanosides, indicating

that CYP736A2 is specific for cyanogenic glucoside biosyn-

thesis. The metabolic profile of the cyd4 mutant most likely

reflects the activity of the Rho enzyme, primarily involved in

producing rhodiocyanosides but also capable of some cya-

nogenic glucoside production, although the possible exis-

tence of a third oxime-metabolizing enzyme cannot be

excluded. Consistent with the idea that the Rho enzyme does

not exclusively produces rhodiocyanosides is the observa-

tion that none of the available cyd mutants or any of eighty

natural Lotus accessions produce rhodiocyanosides in the

absence of lotaustralin. Identification and molecular char-

acterization of the Rho gene is required to confirm this.

Gene clustering in cyanogenic glucoside biosynthesis

Our work in L. japonicus demonstrated that the biosynthetic

genes for cyanogenic glucoside metabolism were co-local-

ized in the genome. Additional gene clusters containing the

genes for the biosynthesis of cyanogenic glucosides were

observed in sorghum and cassava. The regions differed

Cyanogenic glucoside biosynthetic gene clusters 9

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

considerably in size and structure, with only the presence of

the cyanogenic glucoside biosynthetic genes in common.

The Lotus cluster is 355 kb long when including CYP79D4

but not considering the Rho locus. The region containing

CYP79D3, CYP736A2 and UGT85K3 is 160 kb long, larger

than the 104 kb region containing the corresponding genes

in sorghum, or the 83 kb region observed in cassava. The

cluster in cassava is relatively gene-dense and contains a

total of 13 genes, including two CYP71E and two UGT85K

genes. The diverse genomic structure of the three gene

clusters suggests they have independent origins.

While gene clusters originating from repeated tandem

duplication events are common in eukaryotes, the genetic

and physical clustering of non-homologous genes that are

part of the same biosynthetic pathway is rare. In plants, such

clustering of non-homologous genes has only been

described for a number of chemical defence compounds,

mostly terpenoids (reviewed by Chu et al., 2011). Genes for

benzoxazinoid biosynthesis in maize (Zea mays) are

co-localized within a 6 cM region on chromosome 4 (Frey

et al., 1997), genetic loci for avenacin biosynthesis in oat

(Avena sativa) are clustered within 3.6 cM of the b-amyrin

synthase gene AsbAS1 encoding the first committed

enzyme of the pathway (Qi et al., 2004), and a 168 kb region

containing genes for the biosynthesis of momilactones has

been identified in rice (Oryza sativa) (Shimura et al., 2007).

Rice also contains a second diterpenoid biosynthetic gene

cluster that is multi-functional, containing genes for the

production of two distinct sets of phytoalexins (Swamina-

than et al., 2009). Arabidopsis contains a gene cluster for

synthesis of the triterpene thalianol (Field and Osbourn,

2008).

It is presently unclear how metabolic gene clusters arise

and why gene clustering is observed for some plant

secondary metabolic pathways, for example the biosynthe-

sis of terpenoids and cyanogenic glucosides, but not for

others, such as anthocyanin or glucosinolate biosynthesis.

Several selective advantages that may drive and maintain

the clustering of non-homologous genes in the biosynthesis

of plant defence compounds have been proposed (Gierl and

Frey, 2001; Qi et al., 2004; Chu et al., 2011). Clustering would

allow inheritance of the defensive trait as a single functional

unit, and in polymorphic populations this advantage could

drive the clustering of genes following their functional

evolution. Similarly, clustering of the genes of a biosynthetic

pathway with toxic reaction intermediates would reduce the

likelihood of its disruption by segregation. Clustering may

also allow coordinated expression of all genes in the

pathway, potentially by regulation at the chromatin level.

These proposed selective advantages for gene clusters are

not mutually exclusive, and may each contribute to estab-

lishing or maintaining this genomic organization. For exam-

ple, a two-stage selection process was suggested for cluster

formation in prokaryotes, where the proximity of genes

resulting from selection driving genes together would

establish the conditions for a separate selection for co-

regulation and operon assembly (Martin and McInerney,

2009).

When evaluating these selective advantages in relation to

gene clustering in the cyanogenic glucoside biosynthetic

pathway, the toxicity of intermediates is the most convinc-

ing amongst them. Cyanogenesis is a two-component

system, and inheritance as a single defence unit would

require the b-glucosidase to be co-localized with the bio-

synthetic genes. In L. japonicus, some linkage is observed

between LjBGD2 (Takos et al., 2010; T33P07.150) and the

biosynthetic genes, they are approximately 20 cM apart.

However in sorghum, the b-glucosidase responsible for the

hydrolysis of dhurrin is localized on chromosome 8

(Sb08g007570) while the biosynthetic genes are localized

on chromosome 1. Operon-like gene clusters that facilitate

coordinated expression of genes at the chromatin level have

been suggested (Qi et al., 2004; Field and Osbourn, 2008). In

L. japonicus, the CYP79D3 and CYP79D4 genes are differen-

tially expressed, while UGT85K2 and UGT85K3 expression

levels do not follow leaf age, arguing against co-ordination

of gene expression driving the clustering of genes in

cyanogenic glucoside biosynthesis. In contrast, the toxicity

of the intermediates in the biosynthesis of cyanogenic

glucosides has been experimentally demonstrated for the

dhurrin biosynthetic pathway. Introduction of the sorghum

genes CYP79A1 and CYP71E1 in Arabidopsis resulted in

stunted plants with pleiotropic phenotypes, which were

alleviated by the subsequent introduction of UGT85B1,

resulting in dhurrin production and restoration of plant

growth (Kristensen et al., 2005).

The CYP79 family and the repeated evolution of cyanogenic

glucoside biosynthesis

The complete set of genes required for the biosynthesis of

cyanogenic glucosides has now been identified in three

plant species: the monocot S. bicolor and the dicots

M. esculenta and L. japonicus. The genes encoding the

different enzymes in these species are related but not nec-

essarily orthologous. This suggests that cyanogenic gluco-

side biosynthesis has evolved independently in several

higher plant lineages by recruiting members from similar

gene families such as the CYP79s.

Members of the CYP79 family characteristically produce

oximes from amino acids in the biosynthetic pathways of

several chemical plant defence compounds, and besides

cyanogenic glucosides these include glucosinolates and

camalexin (Bak et al., 1998b; Hull et al., 2000; Reintanz et al.,

2001; Glawischnig et al., 2004). Presently available sequence

data show the first appearance of the CYP79 family in

angiosperms, as is the case for the CYP71 family (Nelson and

Werck-Reichhart, 2011). Because of the high substrate

specificity of the oxime-producing CYP79 enzyme, it is

10 Adam M. Takos et al.

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

regarded as the most selective step in determining which

cyanogenic glucoside is produced in a certain plant species

(Bjarnholt and Møller, 2008). A small CYP79 gene family is

present in higher plant genomes, including non-cyanogenic

species such as soybean (Glycine max) and poplar (Populus

trichocarpa). The biological role of most of these CYP79

genes, including additional CYP79 genes in cyanogenic

species, remains to be established. Phylogenetic analysis

shows that the CYP79 enzymes involved in cyanogenic

glucoside biosynthesis are more closely related to other

CYP79s in the same plant species or lineage than to CYP79s

involved in cyanogenic glucoside biosynthesis in more

distant plant species (Figure S2). For instance, CYP79D3

and CYP79D4 from L. japonicus are part of a legume-specific

clade, and only CYP79D15, which produces cyanogenic

glucosides in white clover (T. repens), is likely to be orthol-

ogous (>90% identity).

The independent evolution of a similar function from

homologous genes is referred to as ‘repeated evolution’,

and is thought to represent the majority of convergent

evolution events in plant secondary metabolism (Cseke

et al., 1998; Pichersky and Lewinsohn, 2011). For example, a

polyphyletic origin was proposed for the evolution of the

pyrrolizidine alkaloid-mediated defence system in angio-

sperms (Reimann et al., 2004). The first enzyme in this

pathway, homospermidine synthase, was independently

recruited from deoxyhypusine synthase at least four times

by loss of its ability to bind its protein substrate eIF5A. The

first enzyme in the synthesis of benzoxazinoid pesticides,

BX1, and the indole-3-glycerol phosphate lyase enzyme

(IGL) both produce indole, as an intermediate and as a

volatile defence compound, respectively, and are thought to

have originated from independent gene duplications of the

a-subunit of tryptophan synthase (Frey et al., 2000).

The ability to produce oximes from specific amino acids

seems to be the overall characteristic of the CYP79 family. It

has been suggested that the glucosinolate defence pathway,

which has evolved independently on at least two occasions,

originated from the cyanogenic glucoside pathway (Rodman

et al., 1998), but it can equally be said that both indepen-

dently evolved from the predisposition of members of the

CYP79 family to produce oximes from amino acids and the

need to convert the oximes into more stable and less

reactive compounds.

Broad substrate specificity facilitated the repeated evolu-

tion of cyanogenic glucoside biosynthesis

In contrast with the CYP79s, members of the CYP71 and UGT

families often have diverse and broad substrate specificities,

and it has been argued that enzymatic versatility enables a

limited number of glucosyltransferase genes to produce the

large number of glucosides found in plants (Jones et al.,

1999; Hansen et al., 2003; Schwab, 2003; Bjarnholt and

Møller, 2008; Jørgensen et al., 2011). Broad substrate spec-

ificity or promiscuity of enzymes provide an immediately

accessible starting point for the evolution of new functions

(Khersonsky and Tawfik, 2010), and would facilitate the

repeated evolution of cyanogenic glucoside biosynthesis

following the emergence of a more specific signature

enzyme, here for example a CYP79. It has also been

suggested that the emergence of a signature enzyme by

duplication and functional divergence could ‘seed’ the

formation of a metabolic gene cluster (Chu et al., 2011).

The independent evolution of cyanogenic glucoside

biosynthesis is most clearly demonstrated by the identifi-

cation of CYP736A2 in L. japonicus. Although it could be

argued that L. japonicus is exceptional in not possessing a

CYP71E orthologue, the primary basis for identifying

cyanogenic glucoside biosynthetic genes in plant species

to date has been their homology to the original biochem-

ically identified sorghum genes. Like the CYP79 and CYP71

families, CYP736s belong to the wider CYP71 clan. The

CYP736 family shows an early emergence in plant evolu-

tion and is first seen in liverworts, but is also absent from

several plant lineages including monocots (Nelson and

Werck-Reichhart, 2011). Recent reports connect the CYP736

family with defence responses against microbial patho-

gens in grape (Vitis vinifera) (Cheng et al., 2010), and using

microarray data, with nematode and Phytophthora infec-

tion and with nodulation in soybean (Guttikonda et al.,

2010).

Convergent evolution of cyanogenic glucoside biosynthesis

in plants and insects

In what is referred to as the ‘chemical arms race’ between

plants and herbivores, the same solution may recur fre-

quently, particularly for biosynthetic pathways involving a

relatively small number of enzymes. While cyanogenic glu-

cosides provide plants with a defence mechanism against

generalist herbivores, some specialized herbivores have

co-evolved with their host plants. The larvae of the Burnet

moth (Zygaena filipendulae) feed on Lotus corniculatus and

sequester the cyanogenic glucosides linamarin and lot-

australin from their food plants for use in their own defence

against predators (Zagrobelny et al., 2007). They are also

able to synthesize both compounds de novo when the sup-

ply from their food is insufficient. By convergent evolution,

the Burnet moth has obtained a biosynthetic pathway with

identical reaction intermediates but involving insect ver-

sions of the two cytochrome P450s, called CYP405A2 and

CYP332A3, and a UGT gene named UGT33A1 (Jensen et al.,

2011). This example from insects supports the idea of inde-

pendent recruitment of genes for cyanogenic glucoside

synthesis in plants. Further insights into the evolution of

cyanogenic defence pathways in plants can be obtained by

the identification of the biosynthetic genes in a diverse range

of species, including non-angiosperms, which seem to lack

members of the CYP79 family (Nelson and Werck-Reichhart,

Cyanogenic glucoside biosynthetic gene clusters 11

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

2011). Gene clustering, if a general feature, may aid this

identification.

EXPERIMENTAL PROCEDURES

Plant materials and growth conditions

Lotus japonicus cv. MG-20 (Legume Base, Miyazaki University, Ja-pan) and the cyd1 mutant (Takos et al., 2010) were grown on half-strength Murashige & Skoog basal salt mix containing 1% w/v su-crose in 1% w/v agar. Plants were germinated and grown at 24�Cwith a 16 h light cycle unless otherwise indicated. Nicotianabenthamiana plants were germinated from seed and grown in soilin a greenhouse at 22�C.

Chemicals

UDP-[14C]-glucose (Perkin Elmer NEN Radiochemicals, http://www.perkinelmer.com/) was dried under nitrogen gas to removeethanol and dissolved in water. The substrates acetone cyanohy-drin and 2-hydroxy-2-methylbutyronitrile were synthesized in ourlab by M.S.M. Lotaustralin was purified by preparative HPLC(Bjarnholt et al., 2008). Linamarin was purchased from AG Scientific(http://www.agscientific.com/). Amygdalin was purchased fromSigma-Aldrich (http://www.sigmaaldrich.com/).

RNA extraction and cDNA synthesis

Total RNA was prepared from 100 mg of plant tissue using anRNeasy plant mini kit (Qiagen, http://www.qiagen.com/) withon-column DNase I digestion. For cDNA synthesis, 1–2 lg of totalRNA was reverse-transcribed by SuperScript III reverse transcrip-tase (Invitrogen, http://www.invitrogen.com/) in a reaction primedusing 50 lM oligo(dT)20 or using an iScript cDNA synthesis kit (Bio-Rad, http://www.bio-rad.com/), which contains MMLV-derived re-verse transcriptase and is primed using a blend of oligo(dT) andrandom hexamer primers.

Genetic markers and genome analysis

The L. japonicus genome sequence (Sato et al., 2008) and markerinformation is available at http://www.kazusa.or.jp/lotus/index.html.Genome assembly build 2.5 was used in our analysis. Primersflanking the 32 bp insertion in LjSGA_030566 and distinguishing itfrom CYP79D3 were 5¢-GGAGAAAGATCAGAATCAATGAC-3¢ and 5¢-GATTCGCTGTGCAATGAGCTGAC -3¢. The polymorphism betweenthe PCR products was visualized on a 4% agarose gel. The S. bicolorgenome sequence (version 1.0 release) and the genome sequence ofM. esculenta (version 4.1) are both available from http://www.phy-tozome.net. Named cytochrome P450 sequences are available fromthe cytochrome P450 homepage at http://drnelson.uthsc.edu/cyto-chromeP450.html.

Preparation of constructs

The cDNA sequences of genes were amplified by PCR fromL. japonicus MG-20 cDNA prepared from RNA extracted from youngleaf tissue, and from cassava cv. TME12 cDNA. The PCR productswere cloned by Gateway recombination into the entry vector pDO-NOR207 (Invitrogen). Expression constructs were prepared by lin-earizing the pDONOR207 entry clones by restriction digest, andcloned by Gateway recombination into either the S-tag vectorpJAM1786 (similar to pJAM1784 but with reading frame C.1) forexpression in bacteria or pJAM1502 (Luo et al., 2007) for both stableand transient expression in plants. All the PCR primers containedthe attB1 and attB2 Gateway cloning sites, and the sequences aregiven in Table S1.

Complementation of the cyd1 mutant with CaMV35S:Lj-

CYP79D3

The CYP79D3 cDNA sequence under the control of the CaMV 35Spromoter/terminator elements in pJAM1502 was transferred intoAgrobacterium tumefaciens (AGL1) by electroporation. Transfor-mation of the cyd1 mutant with this construct was based on amethod using hypocotyl explants as previously described (Lombariet al., 2005).

Cyanogenesis detection and metabolite profiling

Cyanogenesis was visualized using Feigl–Anger paper (Feigl andAnger, 1966), which was prepared by wetting Whatman 3MM paper(http://www.whatman.com/) in a 5 g L)1 chloroform solution ofcopper ethylacetoacetate (Alfa Aesar, http://www.alfa.com/) and4,4¢-methylenebis(N,N-dimethylaniline) (Sigma-Aldrich). The driedpaper was stored at 4�C until use. Plant tissue was disrupted by afreeze–thaw cycle, and HCN release was visualized as describedpreviously (Takos et al., 2010).

For metabolite profiling of hydroxynitrile glucosides, plantmaterial was weighed and extracted in 300 ll of 85% v/v methanolspiked with amygdalin to 200 lM to correct for volume losses duringsample preparation. The extraction was performed by boiling in awater bath for 3 min and subsequent cooling of the sample on ice.The extracts were filtered through 45 lm Ultrafree-MC DuraporePVDF filters (Millipore, http://www.millipore.com/) and diluted 1:10in water. Extracts were stored in glass vials at 4�C prior to analysis.Analytical LC-MS was performed using an Agilent 1100 Series LC(Agilent Technologies, http://www.agilent.com) coupled to a BrukerHCT-Ultra ion trap mass spectrometer (Bruker Daltonics, http://www.bdal.com). A Zorbax SB-C18 column (Agilent, 2.1 mmid · 50 mm, 1.8 lm) was used, with chromatography conditionsas described previously (Takos et al., 2010). The mass spectrometerwas run in positive electrospray mode.

Quantitative real-time PCR

Transcript levels of genes were measured by real-time PCR using aSYBR Green I-based method on a Rotor-Gene Q cycler (Qiagen).The PCR reaction conditions were as previously described (Takoset al., 2010). For each primer pair, a single PCR product wasobtained from each cDNA as verified by gel electrophoresis andmelt curve analysis. Normalized gene expression of target geneswas obtained from the difference in cycle threshold of target genesand the reference gene, taking into account primer efficiencies cal-culated from standard curves of diluted PCR product using Q-Genesoftware (Muller et al., 2002). The L. japonicus RNA polymerase IIlarge subunit (AV777095) was used as a reference gene, and thesequences of the real-time PCR primers are given in Table S1.

Glycosyltransferase assays

Clones of the E. coli XJa autolysis strain (Zymo Research, http://www.zymoresearch.com) containing the cDNA sequences ofLjUGT85K3 and LjUGT85K2 in pJAM1786 were grown at 28�C inLuria broth containing ampicillin and 3 mM L-arabinose. Expressionwas induced after 5 h by addition of 4 mM IPTG, and incubation wascontinued overnight. Cells were harvested by centrifugation for 10min at 4�C, and briefly washed with 100 mM Tris/HCl, 2 mM EDTA,pH 7.5. Pellets were resuspended in 100 mM Tris/HCl pH 7.5, 2 mM

EDTA, 5 mM DTT, containing Complete protease inhibitor cocktail(Roche, http://www.roche.com). The resuspension was frozen inaliquots of 400 ll in liquid nitrogen, and then thawed at 37�C for5 min to lyse cells. The crude extract was clarified by centrifugation

12 Adam M. Takos et al.

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

at 20,000 · g for 20 min at 4�C, and the supernatant used for gly-cosylation assays. All steps were performed at 4�C. The amount ofrecombinant S-tagged enzyme was quantified using a FRETWorksS-Tag assay kit (Novagen, http://www.merck-chemicals.com/).

The glycosylation assay was preformed as previously described(Hansen et al., 2003) in assay mixtures of 20 ll containing 0.1 lgrecombinant S-tagged enzyme, 100 mM Tris/HCl pH 7.5, an initialconcentration of 20 mM of putative substrates (acetonecyanohydrin or 2-hydroxy-2-methylbutyronitrile) and 3.3 lM UDP-[14C]-glucose. Assay mixtures were incubated at 30�C for 60 min,and the reaction was terminated by addition of 2 ll 10% acetic acid.Glycosylation products were separated by TLC on Silica Gel 60 F254plates (Merck, http://www.merck-chemicals.com/) using a mobilephase of ethyl acetate:acetone:dichloromethane:methanol:water(40:30:12:10:8 by volume). Radiolabelled compounds were visual-ized by exposure of PhosphorImager screens (Molecular Dynamics,http://www.moleculardynamics.com/) for 4 days. Pictures weredeveloped using a Storm 860 scanner and Storm scanner controlsoftware version 5.03 (Amersham Biosciences, http://www.gelifes-ciences.com/). In addition to the assay mixtures, 5 lg of unlabelledlinamarin and lotaustralin were spotted on the same TLC plate. Afterseparation, the linamarin and lotaustralin lanes were cut from theTLC plate and the hydroxynitrile glucosides were visualized bydipping in 20% v/v H2SO4 in methanol to char the sugars.

Transient expression in leaves of N. benthamiana plants

Overnight cultures of A. tumefaciens (AGL1) containing expressionconstructs of cDNAs under the control of CaMV 35S promoter/ter-minator elements in pJAM1502 and the gene-silencing inhibitorprotein p19 (Voinnet et al., 2003) were grown. The cells were har-vested and resuspended to an OD600 of 2.0 in 10 mM MES, 10 mM

MgCl2 and 100 lM acetosyringone. After 4 h incubation at ambienttemperature, the A. tumefaciens was used to infiltrate leaves of3-week-old N. benthamiana plants. A. tumefaciens strains contain-ing expression constructs of plant cDNAs and the strain containingthe expression construct for the p19-encoding gene were co-infil-trated. After 4–5 days, leaf discs (1 cm diameter) were cut frominfiltrated leaves and extracted in 85% v/v methanol for metaboliteanalysis as described above.

ACKNOWLEDGEMENTS

This work was financially supported by the Danmark Grundforsk-ningsfond (Danish National Research Foundation) under the ‘NielsBohr Visiting Professorship’ program. B.L.M. acknowledgesfinancial support from the Villum Research Center ‘Pro-ActivePlants’ and from the Danish Council for Independent Research/Technology and Production Sciences. C.K. acknowledges a PhDstipend granted by the Faculty of Life Sciences, University ofCopenhagen, Denmark.

SUPPORTING INFORMATION

Additional Supporting Information may be found in the onlineversion of this article:Figure S1. LC-MS profiles of transient expression in N. benthami-ana.Figure S2. Phylogenetic tree of the CYP79 family.Table S1. PCR primer sequences.Please note: As a service to our authors and readers, this journalprovides supporting information supplied by the authors. Suchmaterials are peer-reviewed and may be re-organized for onlinedelivery, but are not copy-edited or typeset. Technical supportissues arising from supporting information (other than missingfiles) should be addressed to the authors.

REFERENCES

Andersen, M.D., Busk, P.K., Svendsen, I. and Møller, B.L. (2000) Cytochromes

P-450 from cassava (Manihot esculenta Crantz) catalyzing the first steps in

the biosynthesis of the cyanogenic glucosides linamarin and lotaustralin. J.

Biol. Chem. 275, 1966–1975.

Bak, S., Kahn, R.A., Nielsen, H.L., Møller, B.L. and Halkier, B.A. (1998a) Cloning

of three A-type cytochromes P450, CYP71E1, CYP98, and CYP99 from

Sorghum bicolor (L.) Moench by a PCR approach and identification by

expression in Escherichia coli of CYP71E1 as a multifunctional cytochrome

P450 in the biosynthesis of the cyanogenic glucoside dhurrin. Plant Mol.

Biol. 36, 393–405.

Bak, S., Nielsen, H.L. and Halkier, B.A. (1998b) The presence of CYP79

homologues in glucosinolate-producing plants shows evolutionary con-

servation of the enzymes in the conversion of amino acid to aldoxime in the

biosynthesis of cyanogenic glucosides and glucosinolates. Plant Mol. Biol.

38, 725–734.

Bak, S., Paquette, S.M., Morant, M. et al. (2006) Cyanogenic glucosides: a

case study for evolution and application of cytochromes P450. Phytochem.

Rev. 5, 309–329.

Bjarnholt, N. and Møller, B.L. (2008) Hydroxynitrile glucosides. Phytochem-

istry, 69, 1947–1961.

Bjarnholt, N., Rook, F., Motawia, M.S., Cornett, C., Jørgensen, C., Olsen, C.E.,

Jaroszewski, J.W., Bak, S. and Møller, B.L. (2008) Diversification of an

ancient theme: hydroxynitrile glucosides. Phytochemistry, 69, 1507–1516.

Cheng, D.W., Lin, H., Takahashi, Y., Walker, M.A., Civerolo, E.L. and Stenger,

D.C. (2010) Transcriptional regulation of the grape cytochrome P450

monooxygenase gene CYP736B expression in response to Xylella fastidi-

osa infection. BMC Plant Biol. 10, 135.

Chu, H.Y., Wegel, E. and Osbourn, A. (2011) From hormones to secondary

metabolism: the emergence of metabolic gene clusters in plant. Plant J. 66,

66–79.

Conn, E.E. (1980) Cyanogenic compounds. Annu. Rev. Plant Physiol. 31, 433–

451.

Cseke, L., Dudareva, N. and Pichersky, E. (1998) Structure and evolution of

linalool synthase. Mol. Biol. Evol. 15, 1491–1498.

Feigl, F. and Anger, V. (1966) Replacement of benzidine by copper ethylace-

toacetate and tetra base as spot-test reagent for hydrogen cyanide and

cyanogen. Analyst, 91, 282–284.

Field, B. and Osbourn, A.E. (2008) Metabolic diversification – independent

assembly of operon-like gene clusters in different plants. Science, 320, 543–

547.

Forslund, K., Morant, M., Jørgensen, B., Olsen, C.E., Asamizu, E., Sato, S.,

Tabata, S. and Bak, S. (2004) Biosynthesis of the nitrile glucosides rhodi-

ocyanoside A and D and the cyanogenic glucosides lotaustralin and

linamarin in Lotus japonicus. Plant Physiol. 135, 71–84.

Frey, M., Chomet, P., Glawischnig, E. et al. (1997) Analysis of a chemical plant

defense mechanism in grasses. Science, 277, 696–699.

Frey, M., Stettner, C., Pare, P.W., Schmelz, E.A., Tumlinson, J.H. and Gierl, A.

(2000) An herbivore elicitor activates the gene for indole emission in maize.

Proc. Natl Acad. Sci. USA, 97, 14801–14806.

Gierl, A. and Frey, M. (2001) Evolution of benzoxazinone biosynthesis and

indole production in maize. Planta, 213, 493–498.

Glawischnig, E., Hansen, B.G., Olsen, C.E. and Halkier, B.A. (2004) Camalexin

is synthesized from indole-3-acetaldoxime, a key branching point between

primary and secondary metabolism in Arabidopsis. Proc. Natl Acad. Sci.

USA, 101, 8245–8250.

Guttikonda, S.K., Trupti, J., Bisht, N.C., Chen, H., An, Y.-Q.C., Pandey, S., Xu,

D. and Yu, O. (2010) Whole genome co-expression analysis of soybean

cytochrome P450 genes identifies nodulation-specific P450 monooxygen-

ases. BMC Plant Biol. 10, 243.

Handberg, K. and Stougaard, J. (1992) Lotus japonicus, an autogamous,

diploid legume species for classical and molecular genetics. Plant J. 2, 487–

496.

Hansen, K.S., Kristensen, C., Tattersall, D.B., Jones, P.R., Olsen, C.E., Bak, S.

and Møller, B.L. (2003) The in vitro substrate regiospecificity of re-

combinant UGT85B1, the cyanohydrin glucosyltransferase from Sorghum

bicolor. Phytochemistry, 64, 143–151.

Hasemann, C.A., Kurumbail, R.G., Boddupalli, S.S., Peterson, J.A. and Deis-

enhofer, J. (1995) Structure and function of cytochromes P450: a compar-

ative analysis of three crystal structures. Structure, 2, 41–62.

Cyanogenic glucoside biosynthetic gene clusters 13

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x

Hull, A.K., Vij, R. and Celenza, J.L. (2000) Arabidopsis cytochrome P450s that

catalyze the first step of tryptophan-dependent indole-3-acetic acid bio-

synthesis. Proc. Natl Acad. Sci. USA, 97, 2379–2384.

Jensen, N.B., Zagrobelny, M., Hjernø, K., Olsen, C.E., Houghton-Larsen, J.,

Borch, J., Møller, B.L. and Bak, S. (2011) Convergent evolution in biosyn-

thesis of cyanogenic defence compounds in plants and insects. Nat.

Commun. 2, 273.

Jones, D.A. (1998) Why are so many food plants cyanogenic? Phytochemistry,

47, 155–162.

Jones, P.R., Møller, B.L. and Høj, P.B. (1999) The UDP-glucose: p-hydroxy-

mandelonitrile-O-glucosyltransferase that catalyzes the last step in

synthesis of the cyanogenic glucoside dhurrin in Sorghum bicolor. J. Biol.

Chem. 274, 35483–35491.

Jørgensen, K., Morant, A.V., Morant, M., Jensen, N.B., Olsen, C.E., Kann-

angara, R., Motawia, M.S., Møller, B.L. and Bak, S. (2011) Biosynthesis of

the cyanogenic glucosides linamarin and lotaustralin in cassava (Manihot

esculenta Crantz): isolation, biochemical characterization and expression

pattern of CYP71E7, the oxime-metabolizing cytochrome P450 enzyme.

Plant Physiol. 155, 282–292.

Kannangara, R., Motawia, M.S., Hansen, N.K.K., Paquette, S.M., Olsen, C.E.,

Møller, B.L. and Jørgensen, K. (2011) Characterization and expression

profile of two UDP-glucosyltransferases, UGT85K4 and UGT85K5, catalyz-

ing the last step in cyanogenic glucoside biosynthesis in cassava. Plant J.

doi: 10.1111/j.1365-313X.2011.04695.x.

Khersonsky, O. and Tawfik, D.S. (2010) Enzyme promiscuity: a mechanistic

and evolutionary perspective. Annu. Rev. Biochem. 79, 471–505.

Koch, B.M., Sibbesen, O., Halkier, B.A., Svendsen, I. and Møller, B.L. (1995)

The primary sequence of cytochrome P450tyr, the multifunctional

N-hydroxylase catalyzing the conversion of L-tyrosine to p-hydroxyphe-

nylacetaldehyde oxime in the biosynthesis of the cyanogenic glucoside

dhurrin in Sorghum bicolor (L.) Moench. Arch. Biochem. Biophys. 323, 177–

186.

Kristensen, C., Morant, M., Olsen, C.E., Ekstrøm, C.T., Galbraith, D.W.,

Møller, B.L. and Bak, S. (2005) Metabolic engineering of dhurrin in

transgenic Arabidopsis plants with marginal inadvertent effects on the

metabolome and transcriptome. Proc. Natl Acad. Sci. USA, 102, 1179–

1784.

Lombari, P., Ercolano, E., El Alaoui, H. and Chiurazzi, M. (2005) Agrobacte-

rium-mediated in vitro transformation. In Lotus japonicus Handbook

(Marquez, A.J., ed.). Dordrecht, The Netherlands: Springer, pp. 251–259.

Luo, J., Nishiyama, Y., Fuell, C. et al. (2007) Convergent evolution in the BAHD

family of acyl transferases: identification and characterization of anthocy-

anin acyl transferases from Arabidopsis thaliana. Plant J. 50, 678–695.

Mackenzie, P.I., Owens, I.S., Burchell, B. et al. (1997) The UDP glycosyl-

transferase gene superfamily: recommended nomenclature update based

on evolutionary divergence. Pharmacogenetics, 7, 255–269.

Martin, F.J. and McInerney, J.O. (2009) Recurring cluster and operon

assembly for phenylacetate degradation genes. BMC Evol. Biol. 9, 36.

Morant, A.V., Jørgensen, K., Jørgensen, C., Paquette, S.M., Sanchez-Perez,

R., Møller, B.L. and Bak, S. (2008) b-Glucosidases as detonators of plant

chemical defense. Phytochemistry, 69, 1795–1813.

Muller, P.Y., Janovjak, H., Miserez, A.R. and Dobbie, Z. (2002) Processing of

gene expression data generated by quantitative real-time RT-PCR. Bio-

Techniques, 32, 2–7.

Nelson, D. and Werck-Reichhart, D. (2011) A P450-centric view of plant evo-

lution. Plant J. 66, 194–211.

Olsen, K.M., Hsu, S.-C. and Small, L.L. (2008) Evidence on the molecular basis

of the Ac/ac adaptive cyanogenesis polymorphism in white clover (Trifo-

lium repens L.). Genetics, 179, 517–526.

Paterson, A.H., Bowers, J.E., Bruggmann, R. et al. (2009) The Sorghum

bicolor genome and the diversification of grasses. Nature, 457, 551–556.

Pichersky, E. and Lewinsohn, E. (2011) Convergent evolution in plant spe-

cialized metabolism. Annu. Rev. Plant Biol. 62, 549–566.

Qi, X., Bakht, S., Leggett, M., Maxwell, C., Melton, R. and Osbourn, A. (2004)

A gene cluster for secondary metabolism in oat: implications for the evo-

lution of metabolic diversity in plants. Proc. Natl Acad. Sci. USA, 101, 8233–

8238.

Reimann, A., Nurhayati, N., Backenkohler, A. and Ober, D. (2004) Repeated

evolution of the pyrrolizidine alkaloid-mediated defense system in separate

angiosperm lineages. Plant Cell, 16, 2772–2784.

Reintanz, B., Lehnen, M., Reichelt, M., Gershenzon, J., Kowalczyk, M.,

Sandberg, G., Godde, M., Uhl, R. and Palme, K. (2001) Bus, a bushy Ara-

bidopsis CYP79F1 knockout mutant with abolished synthesis of short-chain

aliphatic glucosinolates. Plant Cell, 13, 351–367.

Rodman, J.E., Soltis, P.S., Soltis, D.E., Sytsma, K.J. and Karol, K.G. (1998)

Parallel evolution of glucosinolate biosynthesis inferred from congruent

nuclear and plastid gene phylogenies. Am. J. Bot. 85, 997–1006.

Rupasinghe, S., Schuler, M.A., Kagawa, N., Yuan, H., Lei, L., Zhao, B.,

Kelly, S.L., Waterman, M.R. and Lamb, D.C. (2006) The cytochrome P450

gene family CYP157 does not contain EXXR in the K-helix reducing the

absolute conserved P450 residues to a single cysteine. FEBS Lett. 580,

6338–6342.

Sato, S., Nakamura, Y., Kaneko, T. et al. (2008) Genome structure of the

legume, Lotus japonicus. DNA Res. 15, 227–239.

Schwab, W. (2003) Metabolome diversity: too few genes, too many metabo-

lites? Phytochemistry, 62, 837–849.

Seigler, D.S. (1975) Isolation and characterization of naturally occurring cya-

nogenic compounds. Phytochemistry, 14, 9–29.

Shimura, K., Okada, A., Okada, K. et al. (2007) Identification of a biosynthetic

gene cluster in rice for momilactones. J. Biol. Chem. 282, 34013–34018.

Swaminathan, S., Morrone, D., Wang, Q., Fulton, D.B. and Peters, R.J. (2009)

CYP76M7 is an ent-cassadiene C11a-hydroxylase defining a second mul-

tifunctional diterpenoid biosynthetic gene cluster in rice. Plant Cell, 21,

3315–3325.

Takos, A., Lai, D., Mikkelsen, L., Hachem, M.A., Shelton, D., Motawia, M.S.,

Olsen, C.E., Wang, T.L., Martin, C. and Rook, F. (2010) Genetic screening

identifies cyanogenesis-deficient mutants of Lotus japonicus and reveals

enzymatic specificity in hydroxynitrile glucoside metabolism. Plant Cell, 22,

1605–1619.

Tamura, K., Peterson, D., Peterson, N., Stecher, G., Nei, M. and Kumar, S.

(2011) MEGA5: molecular evolutionary genetics analysis using maximum

likelihood, evolutionary distance, and maximum parsimony methods. Mol.

Biol. Evol. doi: 10.1093/molbev/msr121.

Voinnet, O., Rivas, S., Mestre, P. and Baulcombe, D. (2003) An enhanced

transient expression system in plants based on suppression of gene

silencing by the p19 protein of tomato bushy stunt virus. Plant J. 33, 949–

956.

Zagrobelny, M., Bak, S., Ekstrøm, C.T., Olsen, C.E. and Møller, B.L. (2007) The

cyanogenic glucoside composition of Zygaena filipendulae (Lepidoptera:

Zygaenidae) as effected by feeding on wild-type and transgenic lotus

populations with variable cyanogenic glucoside profiles. Insect Biochem.

Mol. Biol. 37, 10–18.

14 Adam M. Takos et al.

ª 2011 The AuthorsThe Plant Journal ª 2011 Blackwell Publishing Ltd, The Plant Journal, (2011), doi: 10.1111/j.1365-313X.2011.04685.x