Halloween genes and nuclear receptors in ecdysteroid biosynthesis and signalling in the pea aphid:...

14
Halloween genes and nuclear receptors in ecdysteroid biosynthesis and signalling in the pea aphid O. Christiaens*§, M. Iga*§, R. A. Velarde†, P. Rougé‡ and G. Smagghe* *Laboratory of Agrozoology, Department of Crop Protection, Ghent University, Ghent, Belgium; Department of Biology, Wake Forest University, Winston-Salem, NC, USA; and Université de Toulouse, UMR 152 IRD-Université Paul Sabatier, Faculté des Sciences Pharmaceutiques, Toulouse, France AbstractThe pea aphid (Acyrthosiphon pisum) is the first whole genome sequenced insect with a hemi- metabolic development and an emerging model organism for studies in ecology, evolution and development. The insect steroid moulting hormone 20-hydroxyecdysone (20E) controls and coordinates development in insects, especially the moulting/ metamorphosis process. We, therefore present here a comprehensive characterization of the Halloween genes phantom, disembodied, shadow, shade, spook and spookiest, coding for the P450 enzymes that control the biosynthesis of 20E. Regarding the pres- ence of nuclear receptors in the pea aphid genome, we found 19 genes, representing all of the seven known subfamilies. The annotation and phylogenetic analysis revealed a strong conservation in the class of Insecta. But compared with other sequenced insect genomes, three orthologues are missing in the Acyrthosiphon genome, namely HR96, PNR-like and Knirps. We also cloned the EcR, Usp, E75 and HR3. Finally, 3D-modelling of the ligand-binding domain of Ap-EcR exhibited the typical canonical structural scaffold with 12 a-helices associated with a short hairpin of two antiparallel b-strands. Upon docking, 20E was located in the hormone-binding groove, supporting the hypothesis that EcR has a role in 20E signalling. Keywords: ecdysteroid biosynthesis, nuclear recep- tors, ecdysteroid cascade, pea aphid, development, moulting, metamorphosis. Introduction The insect steroid hormone, 20-hydroxyecdysone (20E) controls and coordinates the development in insects. A peak titer of 20E triggers the moulting/metamorphosis process, allowing insects to lose their old exoskeleton and enter into the next developmental stage. To produce the steroid hormone 20E, the precursor hormone ecdysone (E) is synthesized in the prothoracic glands from dietary cho- lesterol or phytosterols as insects cannot synthesize the steroid precursor cholesterol de novo. This E is then secreted into the body cavity and converted into 20E in various peripheral tissues predominantly in the insect midgut and fat body (Petryk et al., 2003; Rewitz et al., 2006a; Iga & Smagghe, 2009). The whole trafficking mechanism of the ecdysteroid precursors has not been elucidated yet in insects, but it may use similar mechanisms as the steroidogenesis in vertebrates (Rees, 1995; Gilbert et al., 2002; Lafont et al., 2005; Huang et al., 2008). Cytochrome P450 (CYP) enzymes, well known for their monooxygenase activity, constitute one of the largest families and are distributed throughout a wide variety of living organisms, from bacteria to mammals (Werck-Reichhart & Feyereisen, 2000). To date, four P450 enzymes, namely CYP306A1 (Phantom, Phm), CYP302A1 (Disembodied, Dib), CYP315A1 (Shadow, Sad) and CYP314A1 (Shade, Shd), involved in the ecdys- teroid biosynthesis have been identified and character- ized. As shown in Fig. 1, the products of phm, dib and sad sequentially convert the precursor of E, 2,22,25- trideoxyecdysone (ketodiol), into 22,2-dideoxyecdysone (ketotriol), 2-deoxyecdysone and E (Chavez et al., 2000; Warren et al., 2002, 2004; Niwa et al., 2004, 2005; Rewitz et al., 2006b). Further on, the product of shd mediates the last step of the conversion from E into 20E (Petryk et al., 2003; Rewitz et al., 2006a; Maeda et al., 2008). In addi- tion, CYP307A1 (Spook, Spo), the paralogue gene of Spo, CYP307A2 (Spookier, Spok) and CYP307B1 (Spookiest, Correspondence: Guy Smagghe, Laboratory of Agrozoology, Department of Crop Protection, Ghent University, 9000 Ghent, Belgium. Tel.: +32 9 2646150; fax: +32 9 2646239; e-mail: [email protected] §Equally contributed. Insect Molecular Biology Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200 doi: 10.1111/j.1365-2583.2009.00957.x © 2010 The Authors Journal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200 187

Transcript of Halloween genes and nuclear receptors in ecdysteroid biosynthesis and signalling in the pea aphid:...

Halloween genes and nuclear receptors in ecdysteroidbiosynthesis and signalling in the pea aphid

O. Christiaens*§, M. Iga*§, R. A. Velarde†, P. Rougé‡and G. Smagghe*

*Laboratory of Agrozoology, Department of CropProtection, Ghent University, Ghent, Belgium;†Department of Biology, Wake Forest University,Winston-Salem, NC, USA; and ‡Université de Toulouse,UMR 152 IRD-Université Paul Sabatier, Faculté desSciences Pharmaceutiques, Toulouse, France

Abstractimb_957 187..200

The pea aphid (Acyrthosiphon pisum) is the firstwhole genome sequenced insect with a hemi-metabolic development and an emerging modelorganism for studies in ecology, evolution anddevelopment. The insect steroid moulting hormone20-hydroxyecdysone (20E) controls and coordinatesdevelopment in insects, especially the moulting/metamorphosis process. We, therefore present here acomprehensive characterization of the Halloweengenes phantom, disembodied, shadow, shade, spookand spookiest, coding for the P450 enzymes thatcontrol the biosynthesis of 20E. Regarding the pres-ence of nuclear receptors in the pea aphid genome,we found 19 genes, representing all of the sevenknown subfamilies. The annotation and phylogeneticanalysis revealed a strong conservation in the classof Insecta. But compared with other sequenced insectgenomes, three orthologues are missing in theAcyrthosiphon genome, namely HR96, PNR-like andKnirps. We also cloned the EcR, Usp, E75 and HR3.Finally, 3D-modelling of the ligand-binding domain ofAp-EcR exhibited the typical canonical structuralscaffold with 12 a-helices associated with a shorthairpin of two antiparallel b-strands. Upon docking,20E was located in the hormone-binding groove,supporting the hypothesis that EcR has a role in20E signalling.

Keywords: ecdysteroid biosynthesis, nuclear recep-tors, ecdysteroid cascade, pea aphid, development,moulting, metamorphosis.

Introduction

The insect steroid hormone, 20-hydroxyecdysone (20E)controls and coordinates the development in insects. Apeak titer of 20E triggers the moulting/metamorphosisprocess, allowing insects to lose their old exoskeleton andenter into the next developmental stage. To produce thesteroid hormone 20E, the precursor hormone ecdysone (E)is synthesized in the prothoracic glands from dietary cho-lesterol or phytosterols as insects cannot synthesize thesteroid precursor cholesterol de novo. This E is thensecreted into the body cavity and converted into 20E invarious peripheral tissues predominantly in the insectmidgut and fat body (Petryk et al., 2003; Rewitz et al.,2006a; Iga & Smagghe, 2009). The whole traffickingmechanism of the ecdysteroid precursors has not beenelucidated yet in insects, but it may use similar mechanismsas the steroidogenesis in vertebrates (Rees, 1995; Gilbertet al., 2002; Lafont et al., 2005; Huang et al., 2008).

Cytochrome P450 (CYP) enzymes, well known fortheir monooxygenase activity, constitute one of thelargest families and are distributed throughout a widevariety of living organisms, from bacteria to mammals(Werck-Reichhart & Feyereisen, 2000). To date, fourP450 enzymes, namely CYP306A1 (Phantom, Phm),CYP302A1 (Disembodied, Dib), CYP315A1 (Shadow,Sad) and CYP314A1 (Shade, Shd), involved in the ecdys-teroid biosynthesis have been identified and character-ized. As shown in Fig. 1, the products of phm, dib andsad sequentially convert the precursor of E, 2,22,25-trideoxyecdysone (ketodiol), into 22,2-dideoxyecdysone(ketotriol), 2-deoxyecdysone and E (Chavez et al., 2000;Warren et al., 2002, 2004; Niwa et al., 2004, 2005; Rewitzet al., 2006b). Further on, the product of shd mediates thelast step of the conversion from E into 20E (Petryk et al.,2003; Rewitz et al., 2006a; Maeda et al., 2008). In addi-tion, CYP307A1 (Spook, Spo), the paralogue gene of Spo,CYP307A2 (Spookier, Spok) and CYP307B1 (Spookiest,

Correspondence: Guy Smagghe, Laboratory of Agrozoology, Departmentof Crop Protection, Ghent University, 9000 Ghent, Belgium. Tel.: +32 92646150; fax: +32 9 2646239; e-mail: [email protected]

§Equally contributed.

InsectMolecular

Biology

Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200 doi: 10.1111/j.1365-2583.2009.00957.x

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200 187

Spot), involved in the initial conversion process from7-dehydrochoresterol into ketodiol are identified, but theirbiochemical functions are not well understood (Namikiet al., 2005; Ono et al., 2006). Spok has so far only beenidentified in Drosophila, while spot is identified in mosqui-toes (Aedes aegypti and Anopheles gambiae), honeybees (Apis mellifera) and red flour beetles (Tribolium cas-taneum). Together, they are called the Halloween genes.The Halloween genes have been identified/predicted inmultiple insect species (Niwa et al., 2004, 2005; Warrenet al., 2004; Sieglaff et al., 2005; Rewitz et al., 2006a,b,2007; Iga and Smagghe, 2009) and the function of thesegenes is characterized in the fruitfly (Drosophila melano-gaster), the silkmoth (Bombyx mori) and the tobaccohornworm (Manduca sexta). In addition to insects, theHalloween genes are also identified in the crustaceangenome of Daphnia pulex (Rewitz & Gilbert, 2008), sug-gesting a high conservation for ecdysteroid biosynthesisin the Arthropoda phylum.

After secretion into the hemolymph, 20E will start themoulting/metamorphosis process by acting directly uponthe transcriptional activity of specific target genes throughchromosome puffing. Ashburner (1973) proposed aformal model to explain control of the transcription of thevast network of genes whose activity is induced by the

hormone. As shown in Fig. 1, the first step in this cascadeis the binding of 20E to a postulated receptor protein, aheterodimer formed by the ecdysone receptor (EcR) andUltraspiracle (Usp), which are both members of thenuclear receptor (NR) superfamily (Henrich, 2005; Billaset al., 2009). Activation of this receptor complex initiatesand mediates the transcription of a number of otherNRs, in a cascade with at first expression of ‘early’, then‘early-late’ and finally ‘late’ genes for a successfulmoulting/metamorphosis.

The NR superfamily is a group of ligand-activated tran-scription factors which are present in various animals.Studies of these NRs in both mammals and arthropodsrevealed seven distinct subfamilies (NR0-NR6) in whichthese NRs can be classified. All of them possess a highly-conserved DNA-binding domain (DBD), containing twoC4-type zinc finger regions, that is responsible for bindingof the transcription factor to the DNA. Except the NR0superfamily, all NRs also contain a less-conserved ligandbinding domain (LBD) with which the receptor is able tobind its ligand. Unlike most other transcription factors,NRs can be activated by binding of small lipophilic ligandssuch as hormones and fatty acids that are capable ofgoing through the cell membrane. Besides moultingand metamorphosis, NRs are involved in, e.g. embryonic

ketodiolBlack box

(2 22 25dE)Phm

20E biosynthesis

cholesterolketotriol

2-deoxyecdysone

(2,22,25dE)

(2,22dE)

(2dE)

Spo/Spok/SpotDib

ecdysone20E

( )

Shd Sad

20E signaling

EcR

‘early’ genes

Usp

E75 E74 E93BR

‘early-late’ genes HR3 HR4 KR-H1 HR38 E78

FTZ-F1

‘late’ genes

Figure 1. Summary of the biosynthesis of20-hydroxyecdysone (20E) and the 20E regulatorycascade. In the upper part, the biosynthetic schemepresents the functions of the Halloween genes (Spo/Spok/Spot, Phm, Dib, Sad and Shd) that are boxed,while intermediate products are mentioned in bold. Inthe lower part, binding of 20E to the EcR-Uspcomplex starts the ecdysteroid cascade with theexpression of the so called ‘early’ genes (EcR, E75,BR, E74 and E93) that will then be responsible forthe upregulation of a set of ‘early-late’ genes(including HR3, HR4, HR38 and E78). Via FTZ-F1,the signal will eventually be passed on to the ‘late’genes. The nuclear receptors are boxed. Redraftedafter Rewitz et al. (2007) and Bonneton et al. (2008).

188 O. Christiaens et al.

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

development (King-Jones & Thummel, 2005), cell differ-entiation (Siaussat et al., 2007), reproduction (Raikhelet al., 1999), and are therefore also considered as impor-tant novel targets in pest insect control (Palli et al., 2005;Billas et al., 2009).

Recent genome projects of both vertebrate and insectspecies contributed a lot in identifying the different NRs. Intotal, 48 NRs are known in humans (Robinson-Rechaviet al., 2001), over 284 are present in Caenorhabditiselegans (Gissendanner et al., 2004), 49 in mouse and 47in rat (Zhang et al., 2004). In insects on the other hand,the number of NRs found is surprisingly lower. In Droso-phila, only 21 NR genes have been identified, 20 inAnopheles, 22 in Apis, 19 in Bombyx and 21 in Tribolium(Adams et al., 2000; Holt et al., 2002; Velarde et al., 2006;Bonneton et al., 2008; Cheng et al., 2008). The latterstudies also showed that these NRs, especially the DBDand LBD, are highly conserved in holometabolous insects.So far, however, no complete set of NRs from hemime-tabolous insects, which are indirect developers that donot undergo a pupal metamorphosis stage, has beendescribed. The recent genome project on the pea aphid,Acyrthosiphon pisum (International Aphid Genomic Con-sortium, 2010) gives us the unique opportunity to presenta comprehensive identification and characterization of theNRs in this important hemipteran insect, which is anemerging model organism for ecological, developmentaland evolutionary studies (Brisson & Stern, 2006; Stern,2008).

In this paper, we will first focus on the Halloween genesthat control the ecdysteroid biosynthesis pathway to buildup a peak titer of 20E hormone. In a second part, we willunravel the presence of the NRs in the pea aphid with anemphasis on understanding and identifying the pathway ofhormone signalling by 20E through a regulatory cascadeof NRs – especially the functional receptor formed by theheterodimer EcR-Usp – and also on ‘early’, ‘early/late’and ‘late’ genes. We performed a phylogenetic analysisto confirm the annotation and to investigate evolutionarytraits of the pea aphid in the phylum of Arthropoda.Besides, we also cloned the EcR, Usp, E75 and HR3.Finally, we constructed a 3D-modelling of the LBD ofAp-EcR to evaluate if it exhibits the typical canonical struc-tural scaffold with 12 a-helices, and then performed aligand docking to support the theory that EcR has a role in20E hormone signalling.

Results and discussion

Phylogenetic analysis of Halloween genes

The candidates of the A. pisum Halloween genes wereobtained from AphidBase (http://www.aphidbase.com/aphidbase/) by TBLASTN using the amino acid sequence

of Apis mellifera and T. castaneum. We found three can-didates of A. pisum spook (Ap-spo1, Ap-spo2 andAp-spo3), one candidate for phantom (Ap-phm) and dis-embodied (Ap-dib) and also three candidates for shade(Ap-shd1, Ap-shd2 and Ap-shd3). The AphidBase ID andcross reference number are shown in Table 1. The expres-sion of these predicted sequences was confirmed byreverse transcriptase-PCR (RT-PCR) (data not shown).

The predicted sequences have an open reading frame(ORF) encoding the putative protein of Ap-Spo1 (518amino acids), Ap-Spo2 (528 amino acids), Ap-Spo3(507 amino acids), Ap-Phm (492 amino acids), Ap-Dib(493 amino acids), Ap-Sad (443 amino acids), Ap-Shd1(518 amino acids), Ap-Shd2 (518 amino acids) andAp-Shd3 (505 amino acids). The product size of thesecandidates is consistent with the character of the differ-ent CYP products (approx. 500 amino acids). Alignmentof the different A. pisum Halloween genes candidateswith those of other insect orders (Lepidoptera, Diptera,Hymenoptera and Coleoptera) show high conservation ofinsect P450 motifs (helix-C, helix-I, helix-K, PERF-motifand heme-binding domain) (Fig. S1A–E). For Spo, thehelix-C and helix-I structures that are usually so typicalfor P450 proteins were not well conserved but this isconsistent with the P450 proteins in other members ofthe class of Insecta. Only the heme-binding domain ofAp-Shd3 shows a significant difference compared withthat of other insect orthologs. The Ap-Shd3 completelylacked the sequence of the heme-binding domain whichmeans the protein may not be functional at all. RT-PCRshowed however, that the protein is expressed in the peaaphid (data not shown). This leads to the hypothesis thatthe protein might have other functions or another role inthe pea aphid than just the ones attributed to the Shdproteins so far.

The result of phylogenetic analysis shows two classes:the 2 Clan with Spo/Spok/Spot and Phm, and the MitoClan with Dib, Sad and Shd (Fig. 2). In the A. pisumgenome we detected two Spo-like products, Ap-Spo1 andAp-Spo2. Both are on the same branch as Spo ortho-logues of other species, and sequence comparison showsthat both Ap-Spo1 and Ap-Spo2 are quite similar to eachother (85% identity). Ap-Spo3, however, is very differentfrom Ap-Spo1 and Ap-Spo2, showing only 38% and 39%identity, respectively. When we compare with other Hal-loween genes, we notice that Ap-Spo3 shows 33–42%identity with Spo/Spok orthologues, and 44–47% identitywith Spot orthologues, suggesting Ap-Spo3 might be aSpot orthologue rather than a Spo orthologue. In addition,phylogenetic analysis confirms this hypothesis since bothAp-Spo3 and Spot orthologues are branched together(Fig. 2). Three Shd candidates were identified in A. pisum.Two of them, Ap-Shd1 and Ap-Shd2 show a high conser-vation (94% identity), suggesting these could be

Halloween and NR genes in the pea aphid 189

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

Tab

le1.

Hal

low

een

gene

sid

entifi

edin

Acy

rtho

siph

onpi

sum

;in

clud

ing

Gen

bank

IDs

ofD

roso

phila

mel

anog

aste

r,A

pid

mel

lifer

aan

dTr

ibol

ium

cast

aneu

mar

eal

soin

clud

ed

Nam

eF

unct

ion

D.

mel

anog

aste

r(D

m)

T.ca

stan

eum

(Tc)

A.

mel

lifer

a(A

m)

Aph

idB

ase

IDR

efse

q

iden

tity

%

Dm

Am

Tc

CY

P30

7A1/

2sp

ook

/sp

ooki

erU

nkno

wn

AF

4844

15/

NM

_001

1109

90A

AJJ

0100

0951

–A

CY

PI0

0151

9(A

p-sp

o1)

XM

_001

9457

2643

/45

–52

AC

YP

I002

012

(Ap-

spo2

)X

M_0

0194

6260

43/4

3–

51C

YP

307B

1sp

ooki

est

Unk

now

n–

AA

JJ01

0011

63A

AD

G05

0050

80A

CY

PI0

0071

6(A

p-sp

o3)

XM

_001

9486

80–

4746

CY

P30

6A1

phan

tom

25-H

ydro

xyla

seA

F48

4413

XM

_963

384

XM

_391

946

AC

YP

I006

623

(Ap-

phm

)X

M_0

0194

7839

3438

39C

YP

302A

1di

sem

bodi

ed22

-Hyd

roxy

lase

AF

2375

60X

M_9

6915

9X

M_0

0112

2832

AC

YP

I006

729

(Ap-

dib)

XM

_001

9482

6444

4849

CY

P31

5A1

shad

ow2-

Hyd

roxy

lase

AY

0791

70X

M_9

6502

9X

M_3

9536

0A

CY

PI0

0097

3(A

p-sa

d)X

M_0

0194

4148

3236

38C

YP

314A

1sh

ade

20-H

ydro

xyla

seA

F48

4414

XM

_967

606

DQ

2440

74A

CY

PI0

0822

8(A

p-sh

d1)

XM

_001

9485

7242

5143

AC

YP

I006

755

(Ap-

shd2

)X

M_0

0194

8633

4250

43A

CY

PI0

0981

3(A

p-sh

d3)

XM

_001

9424

7038

4438

Inth

eca

seof

T.ca

stan

eum

CY

P30

7A1/

2an

dC

YP

307B

1,co

ntig

sw

ere

give

non

whi

chth

ege

neis

foun

d.Id

emfo

rA

pis

mel

lifer

aC

YP

307B

1.O

nth

erig

ht,

iden

tity

perc

enta

ges

betw

een

A.

pisu

mse

quen

ces

and

the

resp

ectiv

eor

thol

ogs

are

also

pres

ente

d.

Ag-Shd

Aa-Shd

Dm-Shd

Ms-Shd

Bm-Shd

Am-Shd

100

100

97

68

100

Mito

Ap-Shd3

Ap-Shd2

Ap-Shd1

Tc-Shd

Aa-Dib

Ag-Dib

100

100

99

100

100

100

99

o Clan

Dm-Dib

Ms-Dib

Bm-Dib

Am-Dib

Tc-Dib

Ap-Dib

100

85

58

100

100

Am-Sad

Ap-Sad

Ag-Sad

Aa-Sad

Dm-Sad

Tc-Sad

Bm-Sad

100

94100

Bm-Sad

Ms-Sad

Am-Phm

Tc-Phm

Ms-Phm

Bm-Phm

Dm-Phm

100

100

63

92

2 Clan

Ag-Phm

Aa-Phm

Ap-Phm

Aa-Spot

Ag-Spot

Am-Spot

100

81

100

99

67100

n

Tc-Spot

Ap-Spo3

Ap-Spo2

Ap-Spo1

Tc-Spo

Ms-Spo

B S

100

100

52

100

100

Bm-Spo

Ag-Spo

Aa-Spo

Dm-Spo

Dm-Spok

100

71

68

77

0.2

Figure 2. Phylogenetic tree of the Halloween genes. This tree wasconstructed using the neighbour-joining method performed with theamino acid sequences of the whole sequences. Bootstrap values aspercentage of a 1000 replicates >50 are indicated on the tree. Aa:Aedes aegypti, Ag: Anopheles gambiae, Am: Apis mellifera, Ap:Acyrthosiphon pisum, Bm: Bombyx mori, Dm: Drosophila melanogaster,Ms: Manduca sexta, Tc: Tribolium castaneum.

190 O. Christiaens et al.

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

duplicated genes, while another Shd candidate, Ap-Shd3,only shows 66% identity with both Ap-Shd1 and Ap-Shd2.Ap-Shd1 and Ap-Shd2 show 38–51% and 38–50% iden-tity with Shd orthologues, respectively, while Ap-Shd3exhibits 34–44% identity to those same Shd orthologues.As we described before, Ap-Shd3 will probably not func-tion as a P450 enzyme since it lacks the necessary heme-binding domain which is important for P450 enzymeactivity. We can therefore assume that only Ap-Shd1and/or Ap-Shd2 are likely to be responsible for convertingE into 20E.

Identification of the nuclear receptors in the genome ofAcyrthosiphon pisum displays strong conservationin insects

All available Gene prediction sets (Gnomon, Augustus,Genscan, GeneID) and all available A. pisum sequencedata were used to identify the NRs in the pea aphidgenome. The in silico detection of NRs in the genome isgreatly facilitated by the strongly conserved DBD andLBD regions that characterize these NRs (Table 2). Blastsearches were performed using peptide sequences of allknown NRs from D. melanogaster, Apis mellifera and T.castaneum. As a result, an initial 20 NR sequences wereidentified in the A. pisum genome, representing all of theseven NR subfamilies. Predicted mRNA sequences andgene models were also manually edited if necessaryusing the Apollo Genome Annotation Curation Tool(Lewis et al., 2002). After further analysis, two NR0sequences turned out to be duplicated Knirps-like (Kni-like) genes, which bring the total set of different NRgenes to 19.

RT-PCR was used to confirm the presence of the pre-dicted NR mRNAs in the transcriptome of the pea aphid(data not shown). All NR mRNAs were picked up byRT-PCR, except for the HR83 gene. We did not manage toget a conclusive result for this gene despite using severaldifferent primer pairs. Some of them resulted in clearsingle bands, but the fragment size was not as would bepredicted based on the annotated gene. This means thateither we picked up a wrong fragment, or the exon/intronprediction of the gene is incorrect. Further sequencing ofthis fragment should give us more information about theidentity of this fragment and about the transcription of thisgene. These results prove that the complete set of NRsfound in the A. pisum genome, except HR83, is tran-scribed and none of them are pseudogenes.

Table 2 presents all the pea aphid orthologues for eachof the previously annotated D. melanogaster (Adamset al., 2000), Apis mellifera (Velarde et al., 2006), B. mori(Cheng et al., 2008) and T. castaneum (Bonneton et al.,2008) NRs. Similar numbers of NRs were found in thesefive insect genomes. All NRs are also structurally very

similar to their orthologues. All of them possess a DBDand LBD, except for the NR0 subfamily, which onlycontains a DBD. As could be predicted from previousanalyses of NRs, pairwise alignments of the conserveddomains of D. melanogaster and A. pisum NRs show avery high (71–99%) convergence for DBDs while theLBDs are more divergent (26–97%; with 77% identity forHR39 being the second highest). The most divergent NRsare HR83 (NR2E5) and TLL (NR2E2), while SVP (NR2F3)shows the least divergence.

In general, these results prove that NRs have a verystrong conservation among insects, also outside theholometabolous insect group. All pea aphid NRs showsimilar identity percentages for its orthologues as the iden-tity percentages which were reported in earlier NR anno-tation publications, where the NRs in T. castaneum andApis mellifera were compared with the NRs in D. melano-gaster, even though we would expect bigger differencesbased on the evolutionary distances of these species. TheNRs that are part of the 20E regulatory cascade, the‘early’ gene E75 (NR1D3) and the ‘early-late’ genes HR3(NR1F4), HR4 (NR2A4), HR38 (NR4A4), E78 (NR1E1)and FTZ-F1 (NR5A3) also show the same kind of conver-gence as reported with other species, demonstrating thatall the main NR members of this cascade are present inthe pea aphid. One remarkable observation was theextremely high conservation of the SVP-LBD amonginsects, much more than for the other NRs (97%, 96% and99% compared with D. melanogaster, T. castaneum andApis mellifera orthologues, respectively). The latter phe-nomenon may suggest that the structure of SVP, theinsect orthologue of the vertebrate chicken ovalbuminupstream transcription factor (COUP-TF), is critical to itsfunction and is under strong selective pressure againstamino acid replacements in the LBD of the molecule. In D.melanogaster, where two isoforms of this protein areexpressed, SVP has multiple reported functions. It isrequired for the development of four of the eight photore-ceptors that develop in the ommatidia of the eye (Hiromiet al., 1993; Begemann et al., 1995; Kramer et al., 1995),it is a key component in the control of cell proliferation inMalpighian tubules (Kerber et al., 1998) and it also has animportant role as a regulator in the development of neu-roblasts by acting upon the Hunchback/Krüppel switchnecessary for neuroblast differentiation (Kanai et al.,2005). In Ae. aegypti, this protein also has an effect on thevitellogenesis by acting as a negative regulator in theecdysone receptor complex-mediated transactivation inthe fat body (Miura et al., 2002).

Three NRs which were previously found in other insectspecies seem to be missing in the A. pisum genome:namely the NR1 subfamily member HR96, the NR0 sub-family member Knirps (Kni), and the NR2 family memberPNR-like (NR2E6).

Halloween and NR genes in the pea aphid 191

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

Tab

le2.

Nuc

lear

rece

ptor

sin

Acy

rtho

siph

onpi

sum

NuR

eBA

SE

Nam

eP

rodu

ctD

roso

phila

mel

anog

aste

rTr

ibol

ium

cast

aneu

mA

pis

mel

lifer

aA

phid

Bas

eID

Ref

seq

Dm

/Ap

iden

tity

%Tc

/Ap

iden

tity

%A

m/A

pid

entit

y%

DB

DLB

DD

BD

LBD

DB

DLB

D

NR

1D3

Ecd

yson

e-in

duce

dpr

otei

n75

E75

NP

_524

133

TC

_124

40X

P_3

9379

0A

CY

PI0

0777

3X

M_0

0194

6050

9558

9574

9577

NR

1E1

Ecd

yson

e-in

duce

dpr

otei

n78

E78

NP

_524

195

TC

_039

35X

P_3

9652

7A

CY

PI0

0230

7X

M_0

0195

2697

9651

94*

6093

60

NR

1F4

Hor

mon

ere

cept

orlik

ein

46H

R3

NP

_788

303

TC

_089

09X

P_3

9212

8.3

(LO

C10

0162

388)

9753

100

7110

072

NR

1H1

Ecd

yson

ere

cept

orE

cRN

P_7

2445

6T

C_1

2112

NP

_001

0916

85.2

AC

YP

I001

692

XM

_001

9426

3288

6397

7599

74N

R2A

4H

epat

ocyt

enu

clea

rfa

ctor

4H

NF

4N

P_4

7688

7.2

TC

_087

26–

AC

YP

I009

409

XM

_001

9468

9388

7492

7888

–†

NR

2B4

Ultr

aspi

racl

eU

sp/R

XR

NP

_476

781

TC

_140

27N

P_0

0101

1634

.1A

CY

PI0

0593

4X

M_0

0194

7055

9253

9572

9575

NR

2D1

Hor

mon

ere

cept

orlik

ein

78H

R78

NP

_524

203

TC

_045

98X

P_3

9276

9A

CY

PI0

0423

4X

M_0

0194

8276

8733

9244

9343

NR

2E2

Taill

ess

TLL

NP

_524

596

TC

_004

41no

orth

olog

ueA

CY

PI0

0936

0X

M_0

0194

5880

8326

8326

8526

NR

2E3

Hor

mon

ere

cept

or51

HR

51N

P_7

2545

7T

C_0

9378

XP

_396

999.

3A

CY

PI0

0760

1X

M_0

0194

8835

9667

9476

9475

NR

2E4

Dis

satis

fact

ion

DS

FN

P_4

7714

0T

C_0

1069

XP

_624

265.

2(L

OC

1001

6104

0)93

7491

8284

62N

R2E

5H

orm

one

Rec

epto

r83

HR

83N

P_6

4964

7T

C_1

0460

noor

thol

ogue

AC

YP

I481

0271

–84

1476

28

NR

2F3

Sev

enup

SV

PN

P_7

3168

1T

C_0

1722

XP

_392

402

AC

YP

I005

513

XM

_001

9439

8696

9794

9696

99N

R3B

4E

stro

gen-

rela

ted

rece

ptor

ER

RN

P_7

2934

0T

C_0

9140

NP

_001

1559

88.1

AC

YP

I009

262

XM

_001

9489

6492

5295

5795

58

NR

4A4

Hor

mon

ere

cept

orlik

ein

38H

R38

NP

_477

119

TC

_131

46X

P_6

2398

7A

CY

PI0

0390

9X

M_0

0194

4676

9973

9774

9671

NR

5A3

Fus

hita

razu

tran

scrip

tion

fact

or1

FT

Z-F

1N

P_7

3035

9T

C_0

2550

noor

thol

ogue

AC

YP

I003

708

XM

_001

9454

2999

6799

7799

74

NR

5B1

Hor

mon

ere

cept

orlik

ein

39H

R39

NP

_476

932

TC

_149

86X

P_3

9691

8A

CY

PI0

0635

0X

M_0

0194

6992

8377

8882

8985

NR

6A1

Hor

mon

ere

cept

or4

HR

4N

P_0

0103

3823

TC

_005

43X

P_3

9440

1.3

AC

YP

I008

092

XM

_001

9456

9192

5891

7692

74

NR

0A2

Kni

rps-

like-

1K

NR

L-1

NP

_788

552

TC

_034

13X

P_3

9593

2.2

AC

YP

I490

9691

noLB

D95

noLB

D91

noLB

DK

nirp

s-lik

e-2

KN

RL-

2(L

OC

1001

6845

0)N

R0A

3E

agle

EG

NP

_524

206

TC

_034

09no

orth

olog

ueA

CY

PI4

8166

87no

LBD

95no

LBD

92no

LBD

NR

1J1

Hor

mon

ere

cept

or96

HR

96N

P_5

2449

3T

C_1

0645

XP

_624

213.

2no

tpr

esen

tno

tpr

esen

t

NR

2E6

Pho

tore

cept

orsp

ecifi

cN

RA

pPN

Rno

orth

olog

ueT

C_1

3148

XP

_624

042.

1no

tpr

esen

tno

tpr

esen

t

NR

0A1

Kni

rps

KN

IN

P_5

2418

7no

orth

olog

ueno

orth

olog

ueno

tpr

esen

tno

tpr

esen

t

Gen

bank

IDs

ofD

.m

elan

ogas

ter

and

Api

sm

ellif

era

orth

olog

ues

are

show

n,al

ong

with

the

Bee

tleB

ase

IDs

ofth

eT.

cast

aneu

mN

Rs.

On

the

right

,id

entit

ype

rcen

tage

sbe

twee

nth

eLB

Dof

A.

pisu

mN

Rs

and

the

LBD

ofD

.m

elan

ogas

ter,

T.ca

stan

eum

and

Api

sm

ellif

era

orth

olog

sar

eal

sopr

esen

ted.

*E78

DB

Dof

T.ca

stan

eum

was

inco

mpl

ete.

†Seq

uenc

esw

ere

not

avai

labl

e.

192 O. Christiaens et al.

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

HR96 is an orphan receptor belonging to the NR1 sub-family (NR1J1). It is closely related to the EcR itself and isbelieved to be related to the vertebrate vitamin D-receptor(VDR), PXR and CAR, all of which bind a wide variety ofxenobiotics (Laudet, 1997). HR96 is proven to play arole in the response of D. melanogaster to xenobioticsas reported by King-Jones et al. (2006), but a functionregarding development or metamorphosis has not beenreported yet. This NR is a part of the 20E signallingcascade and it is known that this ecdysteroid-induced NRcan bind to the hp27 20E response element. This sug-gests that HR96 can compete with EcR-Usp for binding toa common set of target sequences (Fisk & Thummel,1995), but since this NR has no known hormone ligand, itis difficult to speculate on its actual function regarding theecdysteroid cascade. The absence of this gene suggestsits role in this signalling pathway is redundant in aphids orbeing taken over by another protein or NR.

A member of the NR2 group that was initially identifiedin the honey bee, the NR2E6, and an orthologue forvertebrate photoreceptor-cell-specific nuclear receptors(PNRs), is missing in the A. pisum genome. This gene isalso missing in the Drosophila genomes, although it hasbeen identified in the T. castaneum genome. The absenceof NR2E6 in the A. pisum and Drosophila genomes is asecondary loss in these lineages. A function for NR2E6 inthe development of the compound eye has been proposedbased on mRNA ‘in situ’ localizations in the Apis melliferadeveloping compound eyes (Velarde et al., 2006). Thefact that A. pisum shares with D. melanogaster the vastmajority of the genes involved in compound eye differen-tiation (Shigenobu et al., 2009) suggests this gene hasbeen retained potentially to regulate lineage specific dif-ferences in compound eye architectures.

The third missing NR in Acyrthosiphon is Knirps(NR0A1), while we identified two paralogues of Knirps-likeand one orthologue of Eagle. The T. castaneum, Apismellifera, B. mori genomes also seem to lack a Knirpsorthologue gene, as seen in the phylogenetic tree of theNR0 subfamily members. In Drosophila, Knirps has beencharacterized as encoding a transcriptional repressorimportant for the segmentation pathway (Nauber et al.,1988). The two other genes in the NR0 group Knirps-like(knrl) and Eagle (egon) are present in the A. pisumgenome. Analysis of these genes in the honey beesuggested no direct involvement during segmentation(Dearden et al., 2006), as is the case in Drosophila.However, in the case of T. castaneum, Knirps-like hasbeen characterized as having specific functions duringhead segmentation (Cerny et al., 2008). Our A. pisumanalysis supports the notion that these genes have beenindependently duplicated in different insect lineages. Atleast in the case of Dipterans and Coleopterans, Knirpsand Knirps-like have retained an ancestral role during

segmentation, which has been likely lost from honey beesand pea aphids. Knirps-like and Eagle may also functionas transcriptional repressors, but it remains to be deter-mined in which pathways they participate.

Further phylogenetic analysis for nuclear receptors ofpea aphid in phylum of Arthropoda

Besides the phylogenetic analysis of the NR0 subfamily(Fig. 3), NRs from the 6 other different subfamilies (NR1–NR6) were also examined by phylogenetic analysis andcompared with NRs from several different species repre-senting the major insect orders, such as Lepidoptera,Diptera, Hymenoptera, Coleoptera, and also from Crusta-cea and Arachnida (Fig. 4). This phylogenetic analysisshowed that many of the NRs of A. pisum show closerelationship with the NRs of the human louse (Pediculushumanus), as has been observed for genes throughoutthe genome. The T. castaneum NRs also showed veryhigh convergence with both the A. pisum and P. humanusNRs for a number of NRs, even though the red flour beetleis a member of the Endopterygota, while the pea aphidand the human louse belong to the infraclass of theParaneoptera.

When we look at these phylogenetic trees in Fig. 4 indetail, we notice that A. pisum NRs show a much higherresemblance to the T. castaneum and Apis mellifera ortho-logues than to the Diptera and Lepidoptera NRs, whichoften cluster together in a separate branch, even branch-ing off before the Crustacea and Arachnida. This deviationfrom normal topology, as shown in the trees of Fig. 4A andB for EcR and E78, respectively, is due to a long branchattraction caused by an acceleration of evolutionary rate inthe Mecopterida line (Diptera + Lepidoptera). This is con-sistent with the earlier findings of Bonneton et al. (2008)who have discovered that some NRs in Mecopteridaspecies (Diptera + Lepidoptera), including EcR and E78,have undergone an increase in evolutionary rate. Otherphylogenetic trees also confirm their results (data notshown).

In order to distinguish the different NR2 subfamilygenes found in the genome, we also constructed phylo-genetic trees for this entire subfamily. The NR2 subfamilytree (Fig. 5) clearly shows that the PNR-orthologue, foundin T. castaneum and Apis mellifera is missing in the peaaphid. Genes are clustered together according to thegroup (A–F) they belong to.

Elements of the 20-hydroxyecdysoneregulatory cascade

The 20E signalling cascade, as mentioned earlier in thiswork, is involved in moulting/metamorphosis and develop-ment. NRs play a very important role in this signalling

Halloween and NR genes in the pea aphid 193

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

pathway. Binding of 20E to the EcR-Usp heterodimer isthe start of this signal. This complex, after binding to thehormone, will act as a transcription factor, immediatelyinducing expression of a number of ‘early’ genes, includ-ing the NRs E75 and HR96. These ‘early’ gene productswill then be responsible for the upregulation of a set of‘early-late’ genes, including the NRs HR3, HR4, E78 andHR39. Through FTZ-F1, this signal will then be passed on

to induce expression of the ‘late’ genes (Fig. 1; Table 2).So far, most attention in this field has gone to holometab-olous insects, which undergo a pupal metamorphosisstage. No extensive set of NRs for a hemimetabolousinsect has been identified until now. Even though the peaaphid still undergoes several larval stages, it is possiblethat there are differences between the moulting processesof hemi- and holometabolous insects.

Most of the NRs involved in the 20E regulatory cascadeproved to be present, not only in the genome of pea aphid,but also in its transcriptome, indicating they are expressedcorrectly. Only the HR96 gene, which was discussedabove, is missing from this set of ecdysone-inducible NRs.And since its function in the moulting/metamorphosis pro-cesses is still unclear, speculation about the implicationsfor the entire pathway are very difficult to make.

EcR and Usp, two NRs that are at the basis of the 20Esignalling cascade, were cloned and sequenced in orderto confirm the annotation and sequence of both genes(Fig. S2A, B). Primers used to pick up the fragment span-ning most of the cDNA are listed in Table S1. The EcR inA. pisum has the typical DBD and LBD found in its ortho-logues in other insects. The EcR-DBD shows the typicalC4 zinc finger domains in this protein, as is the case forthe P-box, the D-box and the A/T-box. Regarding theEcR-LBD we score a strong conservation, with 63%, 75%and 74% identity compared with the Drosophila, Triboliumand Apis orthologues, respectively. Furthermore, thetypical structure with 12 a-helices is well conserved. Bothretrieved sequences confirmed the in silico analysis. Twoother important NRs in the 20E signalling cascade, the socalled ‘early’ genes, E75 and HR3 were also cloned andpartially sequenced in order to confirm their presence inthe transcriptome (Fig. S2C, D).

3D-modelling of the ligand binding pocket of Ap-EcRand ligand docking

The 3D model built for Ap-EcR-LBD exhibits the canonicalstructural scaffold of the EcR-LDBs, made of 12 a-helicesassociated with a short hairpin of two antiparallelb-strands (Fig. 6A). In addition, docking of 20E into thehormone-binding groove of Ap-EcR-LBD revealed abinding scheme similar to that found for other EcR-LBD(e.g. from the beetles Leptinotarsa decemlineata, Tene-brio molitor and Anthonomus grandis) (Billas et al., 2003;Soin et al., 2009). Upon docking, the alkyl chain of thehormone becomes inserted into one of the two pocketslocated at the bottom of the hormone-binding groove(Fig. 6B). A network of nine hydrogen bonds connects thehormone to residues Glu20, Met56, Thr57, Ala 112 andTyr122, forming the binding groove (Fig. 6C). Stackinginteractions with aromatic residues Phe111 and Trp238help to complete the interaction.

DmKNRL

DpseKNRL

TcKNRL

AmKNRL

ApKNRL1

ApKNRL2

DmKNI

DpseKNI

AmEG1

AmEG2

DmEG

DpseEG

ApEG

TcEG

NR0B1 HUMAN

NR0B1 MOUSE

NR0B2 HUMAN

NR0B2 MOUSE

0.05 changes

100

74

97

77

100

100

100

100

100

100

Figure 3. Phylogenetic tree of the insect NR0 subfamily members,showing the clustering of the novel Acyrthosiphon pisum genes withtheir respective orthologues. Novel A. pisum members of this group arehighlighted in bold. The tree was rooted using the NR0B1 and NR0B2vertebrate sequences as outgroup. The tree was constructed using theneighbour-joining method with the maximum length of sequence,resulting in 160 complete aligned sites. Support for the branches, whenpresent, is indicated as a percentage of 1000 bootstrap replicates ofneighbour-joining. Am: Apis mellifera, Ap: Acyrthosiphon pisum, Dm:Drosophila melanogaster, Dpse: Drosophila pseudoobscura, Tc:Tribolium castaneum.

194 O. Christiaens et al.

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

Experimental procedures

Annotation of Halloween and nuclear receptor genes

The 1.0 release of the A. pisum genome was used as a basis forthe bioinformatic analysis. Putative Halloween genes sequenceswere searched and obtained by TBLASTN, using the knownorthologues from Apis mellifera and T. castaneum against thecomplete scaffold collection of the pea aphid genome.

NR protein sequences from D. melanogaster, T. castaneumand Apis mellifera were first used in BlastP searches against theNCBI Gnomon version 1 predicted protein sequences to findputative A. pisum orthologues. In case no orthologues were foundin the pea aphid Gnomon protein data set, we searched the Acyr1.0 assembly of the pea aphid genome (International AphidGenomic Consortium, 2010, main paper) for homologoussequences using TBLASTN. After identification and localization inthe genome, genes were examined and manually edited if nec-essary using the Apollo Genome Annotation Curation Tool (Lewiset al., 2002). This editing was done based on alignments ofD. melanogaster, T. castaneum and Apis mellifera orthologuestogether with several gene prediction programs (Gnomon, Augus-tus, Genscan and GeneID).

Phylogenetic analysis

Whole amino acid sequences for the Halloween gene ortho-logues in Apis mellifera, T. castaneum, D. melanogaster, Ae.aegypti, An. gambiae, M. sexta and B. mori and for the nuclearreceptors of D. melanogaster, T. castaneum and Apis melliferawere collected from the GenBank database. The LBD and DBDsequences of B. mori, An. gambiae, Ae. aegypti, Culex quinque-fasciatus, P. humanus corporis and Daphnia magna wereretrieved by Blast searches of A. pisum LBD sequences againstthe GenBank database or against the species’ sequencedgenome if no GenBank entry was present. The chosen NRsequences were then aligned by CLUSTALW2/CLUSTALX2(Larkin et al., 2007). The trees were made by the neighbour-joining method using MEGA4 software (Tamura et al., 2007).Bootstrap analysis with 1000 replicates for each branch positionwas used to assess support for nodes in the tree (Felsenstein,1985).

Confirmation of transcription of the Halloween and NR genes

Presence of these transcripts in the A. pisum RNA was examinedby RT-PCR. The pea aphids were taken from a continuous colony

A EcR Culex LBD

EcR Aedes LBD

92

99 EcR Aedes LBD

EcR Anopheles LBD

EcR Drosophila LBD

EcR Bombyx LBD

EcR Tribolium LBD

EcR Pediculus LBD

99

96

100

78

88 EcR Pediculus LBD

EcR Apis LBD

EcR Acyrthosiphon LBD

EcR Ixodes LBD

EcR Daphnia LBD

88

51

55

B

0.05

E78 Culex LBD

E78 Aedes LBD

E78 Anopheles LBD

E78 Drosophila LBD

84

100

100

64

E78 Apis LBD

E78 Tribolium LBD

E78 Pediculus LBD

E78 Acyrthosiphon LBD

E78 Ixodes LBD

96

E78 Daphnia LBD

0.05

Figure 4. Phylogenetic trees of EcR (A), E78 (B),HR39 (C) and ERR (D). This tree was constructedusing the neighbour-joining method performed withthe amino acid sequences of the LBD of the selectedsequences. Bootstrap values as percentage of a1000 replicates > 50 are indicated on the tree.

Halloween and NR genes in the pea aphid 195

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

in the Laboratory of Agrozoology at Ghent University. A mixture ofdifferent stages of A. pisum and a collection of newborn aphidsonly was used to extract total RNA using the TRI Reagent (Sigma,Bornem, Belgium), based on the single-step liquid phase sepa-ration method reported by Chomczynski & Sacchi (1987). Then,cDNA was synthesized from 1 mg of this RNA in a 20 ml reactionusing the First Strand cDNA synthesis kit (Roche, Berlin,Germany) according to the manufacturer’s instructions. BothcDNA samples (newborn only and a mixture of stages) were usedin these RT-PCR experiments. Primers were designed usingPrimer3 software (Rozen & Skaletsky, 2000) and are listed inTable S1.

Cloning and sequencing of EcR and USP, E75 and HR3 genes

Pea aphid EcR and Usp, E75 and HR3 were isolated byRT-PCR and afterwards sequenced. Same A. pisum cDNA asused in 2.3 was used for the initial PCR reactions. The PCRproducts were then purified using the Cycle Pure kit (OmegaBio-Tek, Norcross, GA, USA) and were ligated into a pGEM-Tvector (Promega, Madison, WI, USA) according to the manu-facturer’s instructions. Afterwards, plasmids were transformed incompetent Escherichia coli XL-1 Blue Cells by heat shock andthen plated out on a carbenicillin-containing LB agar plate. After16 h incubation, formed colonies were checked by colony PCR

and several of these positive colonies were then purified usingPlasmid mini prep kit (Omega Bio-Tek) and sent for sequencing(Agowa, Berlin, Germany).

3D-modelling of the ligand binding pocket of Ap-EcR andligand docking

Multiple amino acid sequence alignments were carried out withCLUSTAL-X (Thompson et al., 1997) using the Risler’s structuralmatrix for homologous amino acid residues (Risler et al., 1998).Molecular modelling of the EcR ligand-binding domain (EcR-LBD) from the pea aphid (Acces. NP_001152831.1), Ap-EcR-LBD, was performed on a Silicon Graphics O2 R10000workstation, using the programs InsightII, Homology and Dis-cover3 (Accelrys, San Diego, CA, USA). The atomic coordinatesof Tribolium Tc-EcR-LBD in complex with ecdysone (RCSBProtein Data Bank code 2NXX) (Iwema et al., 2007) were usedto build the 3D model of the receptor. The high percentages ofboth identity (~75%) and similarity (~90%) that ApEcR-LBDshares with the template Tc-EcR-LBD allowed us to build quitean accurate 3D model. Steric conflicts were corrected duringthe model building procedure using the rotamer library (Ponder& Richards, 1987) and the search algorithm of the Homologyprogram (Mas et al., 1992) to maintain proper side-chain orien-tation. An energy minimization of the final model was carried out

C HR39 Aedes LBD100

98 HR39 Culex LBD

HR39 Anopheles LBD

HR39 Drosophila LBD

HR39 Bombyx LBD

HR39 Tribolium LBD

98

89

HR39 Nasonia LBD

HR39 Apis LBD

HR39 Pediculus LBD

HR39 Acyrthosiphon LBD

HR39 Daphnia LBD

100

53

82

D

0.02

ERR Aedes LBD

ERR Culex LBD

ERR Anopheles LBD

ERR Drosophila LBD

96

100

83

ERR Pediculus LBD

ERR Acyrthosiphon LBD

ERR Apis LBD

ERR Daphnia LBD

ERR Ixodes LBDERR Ixodes LBD

0.05 Figure 4. Continued.

196 O. Christiaens et al.

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

by 150 cycles of steepest descent using the cvff forcefield ofDiscover. PROCHECK (Laskowski et al., 1993) was used toassess the geometric quality of the 3D model. In this respect,about 87% of the residues of the modelled Ap-EcR-LBD werecorrectly assigned to the best-allowed regions of the Ramachan-dran plot. The remaining residues were located in the gener-ously allowed regions of the plot except for three residues(Asn36, Glu39 and Glu42), which occur in the non-allowedregion (result not shown). Molecular cartoons were drawn withPyMol (W.L. DeLano, http://pymol.sourceforge.net). The fold rec-ognition program Phyre (http://www.sbg.bio.ic.ac.uk/phyre/html/index.html) (Bennett-Lovsey et al., 2008), which also used 2NXXand structurally related proteins as templates, yielded a readilysuperposable 3D model for Ap-EcR-LBD. However, some dis-crepancies that essentially deal with the shape of the loops,

connecting the a-helical stretches, were observed with our lab-made, modelled structure. Importantly, these discrepanciesoccur far from the groove responsible for the binding ofecdysone.

Docking was performed with InsightII using Discover3 as aforcefield and we took TcEcR-LBD in complex with ecdysone asa template for docking. Clipping planes of Ap-EcR-LBD com-plexed to 20E were rendered with PyMol.

Acknowledgements

The authors are grateful for the support of the SpecialResearch Fund of Ghent University and the Fund ofScientific Research (FWO-Vlaanderen, Belgium) to GS.

ADmHNF4

TcHNF4

ApHNF4

DmUSP

99

100

76

BAmUSP

TcUSP

ApUSP

DmHR78

T HR78

73

100

10074

D

F

TcHR78

AmHR78

ApHR78

TcSVP

DmSVP100

99

81

100

FAmSVP

ApSVP

DmHR83

AmHR83

71

100

100

100

TcHR83

ApHR83

AmPNR

TcPNR

DmHR51

85

77

77

100

E

DmHR51

TcHR51

AmHR51

ApHR51

TcDSF98

97

88

100

50

ApDSF

DmDSF

AmDSF

DmTLL

A TLL

86

100

87

9480

74

AmTLL

TcTLL

ApTLL

94

99

0.1

Figure 5. Phylogenetic tree of the NR2 subfamilymembers with NR2A, NR2B, NR2D, NR2E andNR2F. This tree was constructed using theneighbour-joining method performed with thefull-length protein sequences of NR2 subfamilymembers. The PNR-like NRs found in Triboliumcastaneum and Apis mellifera are also added,although no orthologue in the pea aphid could befound. Bootstrap values as percentage of a 1000replicates >50 are indicated on the tree. Am: Apismellifera, Ap: Acyrthosiphon pisum, Dm: Drosophilamelanogaster, Tc: Tribolium castaneum.

Halloween and NR genes in the pea aphid 197

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

PR acknowledges the financial support of Université PaulSabatier and CNRS.

References

Adams, M.D., Celniker, S.E., Holt, R.A., Evans, C.A., Gocayne,J.D., Amanatides, P.G et al. (2000) The genome sequence ofDrosophila melanogaster. Science 287: 2185–2195.

Ashburner, M. (1973) Sequential gene activation by ecdysone inpolytene chromosomes of Drosophila melanogaster. Dev Biol35: 47–61.

Begemann, G., Michon, A.M., Van der Voorn, L., Wepf, R. andMlodzik, M. (1995) The Drosophila orphan nuclear receptorSeven-up requires the Ras pathway for its function in photo-receptor determination. Development 121, 225–235.

Bennett-Lovsey, R.M., Hebert, A.D., Stemberg, M.J.E.and Kelley, L.A. (2008) Exploring the extremes of se-quence/structure space with ensemble fold recognition in theprogram Phyre. Proteins Struct Funct Bioinformatics 70: 611–625.

Billas, I.M.L., Iwema, T., Garnier, J.M., Mitschler, A., Rochel, N.and Moras, D. (2003) Structural adaptability in the ligand-binding pocket of the ecdysone hormone receptor. Nature426: 91–96.

Billas, I.M.I., Browning, C., Lawrence, M.C., Graham, L.D.,Moras, D. and Hill, R.J. (2009) The structure and function ofecdysone receptors. In Ecdysone: Structures and Functions(Smagghe, G., ed.), pp. 335–360. Springer, Dordrecht.

Bonneton, F., Chaumot, A. and Laudet, V. (2008) Annotation ofTribolium nuclear receptors reveals an increase in evolution-ary rate of a network controlling the ecdysone cascade. InsectBiochem Mol Biol 38: 416–429.

Brisson, J.A. and Stern, D.L. (2006) The pea aphid, Acyrthosi-phon pisum: an emerging genomic model system for ecologi-cal, developmental and evolutionary studies. Bioessays 28:747–755.

Cerny, A., Grossmann, D., Bucher, G. and Klinger, M. (2008) TheTribolium ortholog of knirps and knirps-related is crucial forhead segmentation but plays a minor role during abdominalpatterning. Dev Biol 321: 284–294.

Chavez, V.M., Marques, G., Delbecque, J.P., Kobayashi, K., Holl-ingsworth, M., Burr, J. et al. (2000) The Drosophila disembod-ied gene controls late embryonic morphogenesis and codesfor a cytochrome P450 enzyme that regulates embryonicecdysone levels. Development 127: 4115–4126.

Cheng, D., Xia, Q., Duan, J., Wei, L., Huang, C., Li, Z. et al.(2008) Nuclear receptors in Bombyx mori: insights intogenomic structure and developmental expression. InsectBiochem Mol Biol 38: 1130–1137.

A

B CB C

Figure 6. (A) Ribbon diagram of the modelledAp-EcR-LBD. The 12 a-helices and the two b-strandsforming the 3D-structure are labelled and differentlycoloured. N and C indicate the N-terminal andC-terminal ends of the polypeptide chain,respectively. (B) Clipping plane across theecdysone-binding groove showing the insertion of thealkyl chain of 20-hydroxyecdysone (20E)(represented in pink stick) in one (black star) of thetwo pockets located at the bottom of the groove. (C)Network of hydrogen bonds (black dotted lines)anchoring 20E (pink stick) to amino acid residuesforming the hormone-binding groove of Ap-EcR-LBD.Aromatic residues involved in stacking interactionswith 20E are coloured orange.

198 O. Christiaens et al.

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

Chomczynski, P. and Sacchi, N. (1987) Single-step method ofRNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal Biochem 162: 156–159.

Dearden, P.K., Wilson, M.J., Sablan, L., Osborne, P.W., Havler,M., McNaughton, E. et al. (2006) Patterns of conservation andchange in honey bee developmental genes. Genome Res 16:1376–1384.

Felsenstein, J. (1985) Confidence limits on phylogenies: anapproach using the bootstrap. Evolution 39: 783–791.

Fisk, G.J. and Thummel, C.S. (1995) Isolation, regulation, andDNA-binding properties of three Drosophila nuclear hormonereceptor superfamily members. Proc Natl Acad Sci USA 92:10604–10608.

Gilbert, L.I., Rybczynski, R. and Warren, J.T. (2002) Control andbiochemical nature of the ecdysteroidogenic pathway. AnnuRev Entomol 47: 883–916.

Gissendanner, C.R., Crossgrove, K., Kraus, K.A., Maina, C.V.and Sluder, A.E. (2004) Expression and function of conservednuclear receptor genes in C. elegans. Dev Biol 266: 399–416.

Henrich, V.C. (2005) The ecdysteroid receptor. In Comprehen-sive Molecular Insect Science, Vol. 3 (Gilbert, L.I., Iatrou, K.and Gill, S., eds), pp. 243–286. Elsevier, Oxford.

Hiromi, Y., Mlodzik, M., West, S.R., Rubin, G.M. and Goodman,C.S. (1993) Ectopic expression of seven-up causes cell fatechanges during ommatidial assembly. Development 118:1123–1135.

Holt, R.A., Subramanian, G.M., Halpern, A., Sutton, G.G.,Charlab, R., Nusskern, D.R. et al. (2002) The genomesequence of the malaria mosquito Anopheles gambiae.Science 298: 129–149.

Huang, X., Warren, J.T. and Gilbert, L.I. (2008) New players in theregulation of ecdysone biosynthesis. J Genet Genomics 35:1–10.

Iga, M. and Smagghe, G. (2009) Identification and expressionprofile of Halloween genes involved in ecdysteroid biosynthe-sis in Spodoptera littoralis. Peptides, in press.

International Aphid Genomics Consortium (2010) Genomesequence of the pea aphid Acyrthosiphon pisum. PLoS Biol,DOI 10.1371/journal.pbio.1000313.

Iwema, T., Billas, I.M.L., Beck, Y., Bonneton, F., Nierengarten, H.,Chaumot, A. et al. (2007) Structural and functional character-ization of a novel type of ligand-independent RXR-USP recep-tor. EMBO J 26: 3770–3782.

Kanai, M.I., Okabe, M. and Hiromi, Y. (2005) Seven-up controlsswitching of transcription factors that specify temporal identi-ties of Drosophila neuroblasts. Dev Cell 8: 203–213.

Kerber, B., Fellert, S. and Hoch, M. (1998) Seven-up, the Droso-phila homolog of the COUP-TF orphan receptors, controlscell proliferation in the insect kidney. Genes Dev 12: 1781–1786.

King-Jones, K. and Thummel, C.S. (2005) Nuclear receptors – aperspective from Drosophila. Nat Rev Genet 6: 311–323.

King-Jones, K., Horner, M.A., Lam, G. and Thummel, C.S. (2006)The DHR96 nuclear receptor regulates xenobiotic responsesin Drosophila. Cell Metab 4: 37–48.

Kramer, S., West, S.R. and Hiromi, Y. (1995) Cell fate control inthe Drosophila retina by the orphan receptor seven-up: its rolein the decisions mediated by the ras signaling pathway. Devel-opment 121: 1361–1372.

Lafont, R., Dauphin-Villemant, C., Warren, J.T. and Rees, H.(2005) Ecdysteroid chemistry and biochemistry. In Compre-

hensive Molecular Insect Science, Vol. 3 (Gilbert, L.I., Iatrou,K. and Gill, S., eds), pp. 125–195. Elsevier, Oxford.

Larkin, M.A., Blackshields, G., Brown, N.P., Chenna, R., McGet-tigan, P.A., McWilliam, H. et al. (2007) CLUSTALW andCLUSTALX Version 2. Bioinformatics 23: 2947–2948.

Laskowski R.A., MacArthur M.W., Moss D.S. and Thornton J.M.(1993) PROCHECK: a program to check the stereochemistryof protein structures. J Appl Cryst 26: 283–291.

Laudet, V. (1997) Evolution of the nuclear receptor superfamily:early diversification from an ancestral orphan receptor. J MolEndocrinol 19: 207–226.

Lewis, S.E., Searle, S.M.J., Harris, N., Gibson, M., Iyer, V., Ricter,J. et al. (2002) Apollo: a sequence annotation editor. GenomeBiol 3: research0082.1–research0082.14.

Maeda, S., Nakashima, A., Yamada, R., Hara, N., Fujimoto, Y.,Ito, Y. et al. (2008) Molecular cloning of ecdysone20-hydroxylase and expression pattern of the enzyme duringembryonic development of silkworm Bombyx mori. CompBiochem Physiol 149B: 507–516.

Mas, M.T., Smith, K.C., Yarmush, D.L., Aisaka, K. and Fine, R.M.(1992) Modeling the anti-CEA antibody combining site byhomology and conformational search. Proteins Struct FunctGenet 14: 483–498.

Miura, K., Zhu, J., Dittmer, N.T., Chen, L. and Raikhel, A.S. (2002)A COUP-TF/Svp homolog is highly expressed during vitello-genesis in the mosquito Aedes aegypti. J Mol Endocrinol 29:223–238.

Namiki, T., Niwa, R., Sakudoh, T., Shirai, K., Takeuchi, H. andKataoka, H. (2005) Cytochrome P450 CYP307A1/Spook: aregulator for ecdysone synthesis in insects. Biochem BiophysRes Commun 337: 367–374.

Nauber, U., Pankratz, M.J., Kienlin, A., Seifert, E., Klemm, U. andJaeckle, H. (1988) Abdominal segmentation of the Drosophilaembryo requires a hormone receptor-like protein encoded bythe gap gene knirps. Nature 336: 489–492.

Niwa, R., Matsuda, T., Yoshiyama, T., Namiki, T., Mita, K.,Fujimoto, Y. et al. (2004) CYP306A1, a cytochrome P450enzyme, is essential for ecdysteroid biosynthesis in the pro-thoracic glands of Bombyx and Drosophila. J Biol Chem 279:35942–35949.

Niwa, R., Sakudoh, T., Namiki, T., Saida, K., Fujimoto, Y. andKataoka, H. (2005) The ecdysteroidogenic P450 Cyp302a1/disembodied from the silkworm, Bombyx mori, is transcription-ally regulated by prothoracicotropic hormone. Insect Mol Biol14: 563–571.

Ono, H., Rewitz, K.F., Shinoda, T., Itoyama, K., Petryk, A., Ryb-czynski, R. et al. (2006) Spook and Spookier code for stage-specific components of the ecdysone biosynthetic pathway inDiptera. Dev Biol 298: 555–570.

Palli, S.R., Hormann, R.E., Schlattner, U. and Lezzi, M. (2005)Ecdysteroid receptors and their applications in agriculture andmedicine. Vitam Horm 73: 59–100.

Petryk, A., Warren, J.T., Marqués, G., Jarcho, M.P., Gilbert, L.I.,Kahler, J. et al. (2003) Shade is the Drosophila P450 enzymethat mediates the hydroxylation of ecdysone to the steroidinsect molting hormone 20-hydroxyecdysone. Proc Natl AcadSci USA 100: 13773–13778.

Ponder, J.W. and Richards, F.M. (1987) Tertiary templates forproteins. Use of packing criteria in the enumeration of allowedsequences for different structural classes. J Mol Biol 193:775–791.

Halloween and NR genes in the pea aphid 199

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200

Raikhel, A.S., Miura, K. and Segraves, W.A. (1999) Nuclearreceptors in mosquito vitellogenesis. Am Zool 39: 722–735.

Rees, H.H. (1995) Ecdysteroid biosynthesis and inactivation inrelation to function. Eur J Entomol 92: 9–39.

Rewitz, K.F. and Gilbert, L.I. (2008) Daphnia Halloween genesthat encode cytochrome P450s mediating the synthesis of thearthropod molting hormone: evolutionary implications. BMCEvol Biol 8: 60.

Rewitz, K.F., Rybczynski, R., Warren, J.T. and Gilbert, L.I.(2006a). Developmental expression of Manduca shade, theP450 mediating the final step in molting hormone synthesis.Mol Cell Endocrinol 247: 166–174.

Rewitz, K.F., Rybczynski, R., Warren, J.T. and Gilbert, L.I.(2006b). Identification, characterization and developmentalexpression of Halloween genes encoding P450 enzymesmediating ecdysone biosynthesis in the tobacco hornworm,Manduca sexta. Insect Biochem Mol Biol 36: 188–199.

Rewitz, K.F., O’Connor, M.B. and Gilbert, L.I. (2007) Molecularevolution of the insect Halloween family of cytochrome P450s:phylogeny, gene organization and functional conservation.Insect Biochem Mol Biol 37: 741–753.

Risler, J.-L., Delorme, M.O., Delacroix, H. and Henaut, A. (1998)Amino acid substitutions in structurally related proteins. Apattern recognition approach. Determination of a new an effi-cient scoring matrix. J Mol Biol 204: 1019–1029.

Robinson-Rechavi, M., Carpentier, A.S., Duffraisse, M. andLaudet, V. (2001) How many nuclear hormone receptorsare there in the human genome? Trends Genet 17: 554–556.

Rozen, S. and Skaletsky, H.J. (2000) Primer3 on the WWW forgeneral users and for biologist programmers. Methods MolBiol 132: 365–386.

Shigenobu, S., Bickel, R.D., Brisson, J.A., Butts, T., Chang, C.-C.,Christiaens, O et al. (2009) Comprehensive survey of devel-opmental genes in the pea aphid, Acyrthosiphon pisum:frequent lineage-specific duplications and losses ofdevelopmental genes. Insect Mol Biol, in press.

Siaussat, D., Bozzolan, F., Porcheron, P. and Debernard, S.(2007) Identification of steroid hormone signaling pathway ininsect cell differentiation. Cell Mol Life Sci 64: 365–376.

Sieglaff, D.H., Duncan, K.A. and Brown, M.R., 2005. Expressionof genes encoding proteins involved in ecdysteroidogenesis inthe female mosquito, Aedes aegypti. Insect Biochem Mol Biol35: 471–490.

Stern, D.L. (2008) Aphids. Curr Biol 18: 504–505.Soin, T., Iga, M., Swevers, L., Rougé, P., Janssen, C.R. and

Smagghe, G. (2009) Towards Coleoptera-specific high-throughput screening systems for compounds with ecdysoneactivity: development of EcR reporter assays using weevil(Anthonomus grandis)-derived cell lines and in silico analysisof ligand binding to A. grandis EcR ligand-binding pocket.Insect Biochem Mol Biol 39: 523–534.

Tamura, K., Dudley, J., Nei, M. and Kumar, S. (2007) MEGA4:Molecular Evolutionary Genetics Analysis (MEGA) softwareversion 4.0. Mol Biol Evol 24: 1596–1599.

Thompson, J.D., Gibson, T.J., Plewniak, F., Jeanmougin, F.and Higgins, D.G. (1997) The CLUSTAL-X windows inter-face: flexible strategies for multiple sequence alignmentaided by quality analysis tool. Nucleic Acids Res 15: 4876–4882.

Velarde, R.A., Robinson, G.E. and Fahrbach, S.E. (2006) Nuclearreceptors of the honey bee: annotation and expression in theadult brain. Insect Mol Biol 15: 583–595.

Warren, J.T., Petryk, A., Marques, G., Jarcho, M., Parvy, J.P.,Dauphin-Villament, C., O’Connor, M.B. and Gilbert, L.I. (2002)Molecular and biochemical characterization of two P450enzymes in the ecdysteroidogenic pathway of Drosophilamelanogaster. Proc Natl Acad Sci USA 99: 11043–11048.

Warren, J.T., Petryk, A., Marques, G., Parvy, J.P., Shinoda, T.,Itoyama, K. et al. (2004) Phantom encodes the 25-hydroxylase of Drosophila melanogaster and Bombyx mori: aP450 enzyme critical in ecdysone biosynthesis. InsectBiochem Mol Biol 34: 991–1010.

Werck-Reichhart, D. and Feyereisen, R. (2000) CytochromesP450: a success story. Genome Biol 1: reviews 3003.1–reviews 3003.1–9.

Zhang, Z., Burch, P.E., Cooney, A.J., Lanz, R.B., Pereira, F.A.,Wu, J. et al. (2004) Genomic analysis of the nuclear receptorfamily: new insights into structure, regulation, and evolutionfrom the rat genome. Genome Res 14: 580–590.

Supporting Information

Additional Supporting Information may be found in theonline version of this article under the DOI reference: DOI10.1111/j.1365-2583.2009.00957.x

Figure S1. Amino acid sequence alignment of Spo/Spok/Spot (A), Phm(B), Dib (C) Sad (D) and Shd (E). Residues in black are identities and ingray are similarities. The conserved P450 motifs are indicated: helix-C,helix-I, helix-K, PERF-motif and heme-binding domain.

Figure S2. Amino acid sequences of the cloned and sequenced fragmentsof EcR (A), Usp (B), E75 (C) and HR3 (D). For EcR and Usp, the entire DBDhas been sequenced and the P box (yellow), D box (red) and T/A box (teal)are indicated. The eight zinc-coordinating cysteines in the DBD are boxed inthe sequence. For E75, only a part of the DBD has been sequenced.

Table S1. Primers used for the RT-PCR detection in the transcriptome ofthe Halloween (upper part) and nuclear receptor genes and for the cloningof EcR, Usp, E75 and HR3 (lower part) in the pea aphid. The primers weredesigned to overlap at least one intron, except for the KNRL and EG genes,which mainly consist out of one big exon and only possess a very smallsecond exon. Expected fragment sizes are also indicated in the table

Please note: Wiley-Blackwell are not responsible for thecontent or functionality of any supporting materialssupplied by the authors. Any queries (other than missingmaterial) should be directed to the corresponding authorfor the article.

200 O. Christiaens et al.

© 2010 The AuthorsJournal compilation © 2010 The Royal Entomological Society, Insect Molecular Biology (2010), 19 (Suppl. 2), 187–200