Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells

8
Reports Regulated transcription initiation underlies state changes in cell phenotype and is coordinated by transcription factors binding to gene-proximal promoters or distal regulatory regions such as enhancers. The interaction between enhancers and transcription induction during cellular differentiation has been cited as one of the outstanding mysteries of modern biology (1). Enhancer chromatin landscapes change drastically between developing tissues and differentiated cells (24). Active enhancers initiate production of RNAs (eRNAs) (5) and enhancer action during differentiation can be assessed by sequencing of steady state (6, 7) or nascent RNA (810), demonstrating that eRNA and target gene expression are correlated. eRNA production is also correlated to physical proximity between enhancers and promoters (8, 9). However, the general temporal relationship between enhancer and promoter activation across biological system is unknown. Genome-scale 5RACE (CAGE) detects transcription start sites (TSS), including the bidirectional TSS characteristic of active enhancers (11). Based on a large set of reporter assays, CAGE-defined enhancers are 2-3 times more likely to validate (12) than un-transcribed chromatin- defined enhancer candidates from the ENCODE consortium (13). In this study, we used CAGE to dissect the relationship be- tween dynamic changes in mRNA and eRNA in 33 time courses of differentiation and activation. The time courses in- cluded stem cells (embryonic, induced pluripotent, tropho- blastic and mesenchymal stem cells) and committed progenitors undergoing terminal differentia- tion toward mesodermal, endo- dermal and ectodermal fates, as Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells Erik Arner,* Carsten O Daub,* Kristoffer Vitting-Seerup,* Robin Andersson,* Berit Lilje, Finn Drablos, Andreas Lennartsson, Michelle Rönnerblad, Olga Hrydziuszko, Morana Vitezic, Tom C Freeman, Ahmad Alhendi, Peter Arner, Richard Axton, J Kenneth Baillie, Anthony Beckhouse, Beatrice Bodega, James Briggs, Frank Brombacher, Margaret Davis, Michael Detmar, Anna Ehrlund, Mitsuhiro Endoh, Afsaneh Eslami, Michela Fagiolini, Lynsey Fairbairn, Geoffrey J Faulkner, Carmelo Ferrai, Malcolm E Fisher, Lesley Forrester, Daniel Goldowitz, Reto Guler, Thomas Ha, Mitsuko Hara, Meenhard Herlyn, Tomokatsu Ikawa, Chieko Kai, Hiroshi Kawamoto, Levon Khachigian, Peter S Klinken, Soichi Kojima, Haruhiko Koseki, Sarah Klein, Niklas Mejhert, Ken Miyaguchi, Yosuke Mizuno, Mitsuru Morimoto, Kelly J Morris, Christine Mummery, Yutaka Nakachi, Soichi Ogishima, Mariko Okada-Hatakeyama, Yasushi Okazaki, Valerio Orlando, Dmitry Ovchinnikov, Robert Passier, Margaret Patrikakis, Ana Pombo, Xian-Yang Qin, Sugata Roy, Hiroki Sato, Suzana Savvi, Alka Saxena, Anita Schwegmann, Daisuke Sugiyama, Rolf Swoboda, Hiroshi Tanaka, Andru Tomoiu, Louise N Winteringham, Ernst Wolvetang, Chiyo Yanagi-Mizuochi, Misako Yoneda, Susan Zabierowski, Peter Zhang, Imad Abugessaisa, Nicolas Bertin, Alexander D. Diehl, Shiro Fukuda, Masaki Furuno, Jayson Harshbarger, Akira Hasegawa, Fumi Hori, Sachi Ishikawa-Kato, Yuri Ishizu, Masayoshi Itoh, Tsugumi Kawashima, Miki Kojima, Naoto Kondo, Marina Lizio, Terrence F. Meehan, Christopher J Mungall, Mitsuyoshi Murata, Hiromi Nishiyori-Sueki, Serkan Sahin, Sayako Sato-Nagao, Jessica Severin, Michiel JL de Hoon, Jun Kawai, Takeya Kasukawa, Timo Lassmann, Harukazu Suzuki, Hideya Kawaji, Kim M Summers, Christine Wells, FANTOM Consortium, David A Hume, †‡ Alistair RR Forrest, †‡ Albin Sandelin, †‡ Piero Carninci, †‡ Yoshihide Hayashizaki †‡ The list of author affiliations is available in the Supplementary Text. *These authors contributed equally to this work. †FANTOM5 Phase 2 Core authors. ‡Corresponding author. E-mail: [email protected] (D.A.H.); [email protected] (A.R.R.F.); [email protected] (A.S.); [email protected] (P.C.); [email protected] (Y.H.) While it is generally accepted that cellular differentiation requires changes to transcriptional networks, dynamic regulation of promoters and enhancers at specific sets of genes has not been previously studied en masse. Exploiting the fact that active promoters and enhancers are transcribed, we simultaneously measured their activity in 19 human and 14 mouse time courses covering a wide range of cell types and biological stimuli. Enhancer RNAs, then mRNAs encoding transcription factors dominated the earliest responses. Binding sites for key lineage transcription factors were simultaneously over-represented in enhancers and promoters active in each cellular system. Our data support a highly generalizable model in which enhancer transcription is the earliest event in successive waves of transcriptional change during cellular differentiation or activation. / sciencemag.org/content/early/recent / 12 February 2015 / Page 1 / 10.1126/science.1259418 on February 16, 2015 www.sciencemag.org Downloaded from on February 16, 2015 www.sciencemag.org Downloaded from on February 16, 2015 www.sciencemag.org Downloaded from on February 16, 2015 www.sciencemag.org Downloaded from on February 16, 2015 www.sciencemag.org Downloaded from on February 16, 2015 www.sciencemag.org Downloaded from on February 16, 2015 www.sciencemag.org Downloaded from on February 16, 2015 www.sciencemag.org Downloaded from

Transcript of Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells

Reports

Regulated transcription initiation underlies state changes in cell phenotype and is coordinated by transcription factors binding to gene-proximal promoters or distal regulatory regions such as enhancers. The interaction between enhancers and transcription induction during cellular differentiation has been cited as one of the outstanding mysteries of modern biology (1). Enhancer chromatin landscapes change drastically between developing tissues and differentiated cells (2–4). Active enhancers initiate production of RNAs (eRNAs) (5) and enhancer action during differentiation can be assessed by sequencing of steady state (6, 7) or nascent RNA (8–10), demonstrating that eRNA and target gene expression are correlated. eRNA production is also correlated to physical proximity between enhancers and promoters (8, 9). However, the general temporal relationship between enhancer and promoter activation across biological system is unknown.

Genome-scale 5′RACE (CAGE) detects transcription start sites (TSS), including the bidirectional TSS characteristic of active enhancers (11). Based on a large set of reporter assays, CAGE-defined enhancers are 2-3 times more likely to validate (12) than un-transcribed chromatin-defined enhancer candidates from the ENCODE consortium (13). In this study, we used CAGE to dissect the relationship be-tween dynamic changes in mRNA and eRNA in 33 time courses of differentiation and activation. The time courses in-cluded stem cells (embryonic, induced pluripotent, tropho-blastic and mesenchymal stem cells) and committed progenitors undergoing terminal differentia-tion toward mesodermal, endo-dermal and ectodermal fates, as

Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells Erik Arner,*† Carsten O Daub,*† Kristoffer Vitting-Seerup,*† Robin Andersson,*† Berit Lilje, Finn Drablos, Andreas Lennartsson, Michelle Rönnerblad, Olga Hrydziuszko, Morana Vitezic, Tom C Freeman, Ahmad Alhendi, Peter Arner, Richard Axton, J Kenneth Baillie, Anthony Beckhouse, Beatrice Bodega, James Briggs, Frank Brombacher, Margaret Davis, Michael Detmar, Anna Ehrlund, Mitsuhiro Endoh, Afsaneh Eslami, Michela Fagiolini, Lynsey Fairbairn, Geoffrey J Faulkner, Carmelo Ferrai, Malcolm E Fisher, Lesley Forrester, Daniel Goldowitz, Reto Guler, Thomas Ha, Mitsuko Hara, Meenhard Herlyn, Tomokatsu Ikawa, Chieko Kai, Hiroshi Kawamoto, Levon Khachigian, Peter S Klinken, Soichi Kojima, Haruhiko Koseki, Sarah Klein, Niklas Mejhert, Ken Miyaguchi, Yosuke Mizuno, Mitsuru Morimoto, Kelly J Morris, Christine Mummery, Yutaka Nakachi, Soichi Ogishima, Mariko Okada-Hatakeyama, Yasushi Okazaki, Valerio Orlando, Dmitry Ovchinnikov, Robert Passier, Margaret Patrikakis, Ana Pombo, Xian-Yang Qin, Sugata Roy, Hiroki Sato, Suzana Savvi, Alka Saxena, Anita Schwegmann, Daisuke Sugiyama, Rolf Swoboda, Hiroshi Tanaka, Andru Tomoiu, Louise N Winteringham, Ernst Wolvetang, Chiyo Yanagi-Mizuochi, Misako Yoneda, Susan Zabierowski, Peter Zhang, Imad Abugessaisa, Nicolas Bertin, Alexander D. Diehl, Shiro Fukuda, Masaki Furuno, Jayson Harshbarger, Akira Hasegawa, Fumi Hori, Sachi Ishikawa-Kato, Yuri Ishizu, Masayoshi Itoh, Tsugumi Kawashima, Miki Kojima, Naoto Kondo, Marina Lizio, Terrence F. Meehan, Christopher J Mungall, Mitsuyoshi Murata, Hiromi Nishiyori-Sueki, Serkan Sahin, Sayako Sato-Nagao, Jessica Severin, Michiel JL de Hoon, Jun Kawai, Takeya Kasukawa, Timo Lassmann, Harukazu Suzuki, Hideya Kawaji,† Kim M Summers,† Christine Wells,† FANTOM Consortium, David A Hume,†‡ Alistair RR Forrest,†‡ Albin Sandelin,†‡ Piero Carninci,†‡ Yoshihide Hayashizaki†‡ The list of author affiliations is available in the Supplementary Text.

*These authors contributed equally to this work.

†FANTOM5 Phase 2 Core authors.

‡Corresponding author. E-mail: [email protected] (D.A.H.); [email protected] (A.R.R.F.); [email protected] (A.S.); [email protected] (P.C.); [email protected] (Y.H.)

While it is generally accepted that cellular differentiation requires changes to transcriptional networks, dynamic regulation of promoters and enhancers at specific sets of genes has not been previously studied en masse. Exploiting the fact that active promoters and enhancers are transcribed, we simultaneously measured their activity in 19 human and 14 mouse time courses covering a wide range of cell types and biological stimuli. Enhancer RNAs, then mRNAs encoding transcription factors dominated the earliest responses. Binding sites for key lineage transcription factors were simultaneously over-represented in enhancers and promoters active in each cellular system. Our data support a highly generalizable model in which enhancer transcription is the earliest event in successive waves of transcriptional change during cellular differentiation or activation.

/ sciencemag.org/content/early/recent / 12 February 2015 / Page 1 / 10.1126/science.1259418

on

Feb

ruar

y 16

, 201

5w

ww

.sci

ence

mag

.org

Dow

nloa

ded

from

o

n F

ebru

ary

16, 2

015

ww

w.s

cien

cem

ag.o

rgD

ownl

oade

d fr

om

on

Feb

ruar

y 16

, 201

5w

ww

.sci

ence

mag

.org

Dow

nloa

ded

from

o

n F

ebru

ary

16, 2

015

ww

w.s

cien

cem

ag.o

rgD

ownl

oade

d fr

om

on

Feb

ruar

y 16

, 201

5w

ww

.sci

ence

mag

.org

Dow

nloa

ded

from

o

n F

ebru

ary

16, 2

015

ww

w.s

cien

cem

ag.o

rgD

ownl

oade

d fr

om

on

Feb

ruar

y 16

, 201

5w

ww

.sci

ence

mag

.org

Dow

nloa

ded

from

o

n F

ebru

ary

16, 2

015

ww

w.s

cien

cem

ag.o

rgD

ownl

oade

d fr

om

well as fully differentiated primary cells and cell lines re-sponding to stimuli (growth factors and pathogens) (Fig. 1A-B; tables S1-S3, Supplementary Methods). In total, 1,189 CAGE libraries from 408 distinct time points in the 33 time courses were analyzed (Fig. 1B, Auxiliary data tables S1 and S2). Differentiation or response to stimulus was assessed by monitoring cell morphology changes, reproducible induc-tion of known lineage markers and similarity of the end-point transcriptome to differentiated cells from the steady state samples of FANTOM5 (14) (Auxiliary data table S1).

The current data expands the set of known human and mouse core promoters from the FANTOM5 body-wide steady-state atlas (14) to 201,802 and 158,966, and the set of transcribed enhancers to 65,423 and 44,459. Of all identified core promoters in human and mouse, 51% and 61% varied significantly in expression in at least one time course. Out of the 103,355 differentially expressed human promoters, 80,152 were within genes on the same strand. 55,626 of these are potential alternative promoters (See Supplemen-tary Methods), overlapping a total of 13,138 genes. We found 65 human genes that had a dynamic switch between alter-native promoters within a time course leading to exclusion of exons encoding protein domains (table S4).

Of all enhancers identified in FANTOM5, 42,274 human (65%) and 34,338 mouse (77%) enhancers were expressed in at least one CAGE library in the current study. Of these, 5,371 (13%) human and 6,824 (20%) mouse enhancers changed expression significantly over time in at least one time course. Most of these enhancer changes were time-course-specific (56% in human, 67% in mouse). In contrast, the fraction of promoters regulated in only a single time course was smaller (29% in human, 33% in mouse).

We profiled 13 cellular systems with high temporal reso-lution within the first hours of cellular induction (Fig. 1B). We focused on the first six hours in nine of these time courses (five human and four mouse having sufficient num-bers of dynamic promoters and enhancers, table S1).

Based on unsupervised clustering, we identified a set of distinct response pattern classes, shared by multiple time courses, by analyzing expression fold changes vs. time 0 in each time course. For each response class, we defined specif-ic expression rules (fig. S1), enabling consistent response class labeling of any dynamically transcribed enhancer or TSS in a time-course specific fashion (figs. S2-S4). Transcrip-tion factor (TF) promoters were analyzed as a distinct group. Since the majority of enhancers and promoters that were dynamically changing in this set were up-regulated over time (fig. S5), we focused on the six up-regulated re-sponse classes (Fig. 1C).

Multiple enhancers, TF promoters and non-TF promot-ers were found in all response classes (Fig. 1D, fig. S6, Auxil-iary data table S3), but with different preferences. Enhancers were more common in the early peaking classes (“rapid short”, “early standard” and “rapid long” responses). TF promoters were generally induced after enhancers (pre-

ferring the “late standard” response and “long response” classes) and non-TF promoters were most common in the “late gradual response” class that increased gradually with time (Fig. 1E), suggesting that many of these genes were the direct or indirect targets of the induced transcription fac-tors. Simulation studies, as well as gene-specific RNA half-life data (15), showed that differential degradation rates of RNA species (11) could not explain the observed class pref-erences (Supplementary text, Supplementary figs. S7-S8). While these patterns were evident across cell types and spe-cies, few promoters (mean 8.5% across classes) and even fewer enhancers (mean 5.1% across classes) were assigned to the same response class in two or more time courses (Fig. 1F).

We looked further at a literature-curated set of 232 im-mediate early response (IER) genes (table S5). While 65% of the IER genes had at least one promoter that was up-regulated within the first two hours in at least two time courses, no consistent pattern of IER expression was obvi-ous between time courses (fig. S9). For example, only 42 promoters were induced early in five or more human time courses (fig. S10A). Even fewer enhancers shared an early response: only 11 were induced in three or more time cours-es (fig. S10B), and of these, half neighbored a known IER gene. Thus, the immediate early response pattern is general-izable across different cell states, but the cohort of IER genes are not.

In general, up-regulated enhancers in the rapid short re-sponse class were transcribed earlier than their proximal (+/−200kb) promoters (Fig. 2A-B and S11). Proximal TF promoters were, in turn, more highly and more rapidly acti-vated than proximal non-TF promoters. In order to compare the responses over time, we employed the “Center of Mass” (CM) statistic identifying the time point by which 50% of the expression change in the enhancer or promoter had oc-curred. Enhancers changed most rapidly, followed by TF promoters, then non-TF promoters (Fig. 2C). The temporal differences were highlighted further when enhancers were compared to their proximal promoters (within +/−200kb) (Fig. 2C). For 85.8% of enhancer-non-TF-promoter pairs and 74.6% of enhancer-TF-promoter pairs, the CM occurred ear-lier for the enhancer (P < 1.0e-106, Wilcoxon signed rank test). We hypothesized that these results might reflect larger chromatin structures; indeed, enhancer-promoter pairs de-fined by topological domains (TADs) (16) gave similar re-sults (figs. S11-S12), and moreover, enhancers (or promoters) within the same TAD were more likely to be in the same response class (Fig. 2D). Similarly, groups of enhancer-promoter pairs (defined either by genomic proximity or TAD boundaries) were more similar in terms of CM shifts than expected by random (fig. S13, P<1.0e−14, Mann-Whitney U test).

We used ENCODE (13) data to demonstrate that enhanc-ers dynamically expressed in the MCF-7+HRG time course were more likely to be marked with high DNAse sensitivity,

/ sciencemag.org/content/early/recent / 12 February 2015 / Page 2 / 10.1126/science.1259418

and enriched in H3K27ac and RNAPII ChIP signal in steady-state MCF-7 cells than enhancers that were not active throughout the time course (fig. S14A). Indeed, ChIA-pet interaction data from steady-state MCF-7 cells (17) showed that these dynamic enhancers interacted with promoters to a much larger degree than non-active enhancers, but the fraction of promoter-interacting enhancers was high regard-less of response class (Fig. 2E), suggesting that many dy-namically changing enhancers are proximal to their promoter target(s) and primed beforehand in terms of chromatin state. However, chromatin patterns in the un-stimulated state were not sufficient to distinguish between temporal enhancer classes (fig. S14B).

Transcription factor binding sites implicated in regulat-ing enhancer and promoter expression were assessed by inferring motif activities (18) – a statistic that describes the ability of a DNA motif to explain observed expression changes across a given set of samples – based on motif oc-currence in the regions -300 to +100bp from the major TSS of each promoter and ±200bp from the center of each en-hancer, resulting in a derived activity profile across time for each DNA binding motif and time course. Motif sets with high predictive power in enhancers and promoters over-lapped significantly (FDR<0.05, Fisher’s Exact test) in 29/33 time courses (Fig. 3A). Many of these highly contributing motifs described binding sites for known lineage-specific regulators in specific time courses, such as FOS in MCF-7 cells stimulated by HRG, GATA6 in cardiomyocyte differen-tiation and NFKB in macrophages. On average, motif activi-ty scores correlated positively across time between enhancers and promoters, with significantly higher correla-tion for motifs identified as significantly active (Supplemen-tary text) in both enhancers and promoters (P < 6.9e-8, Mann-Whitney test) (Fig. 3B), however, in general, motif activity reached a maximum in enhancers earlier than in promoters (P < 1.8e-14, Wilcoxon signed rank test; Fig. 3C,D). Thus, the general observation of enhancer transcrip-tion waves preceding those of promoters identified above was supported by motif activity.

In summary, by using a large-scale comparative analysis across many different tissues and time-courses, and simul-taneously sampling expression at gene promoters and en-hancers, we reveal that enhancer transcription is the most common rapid transcription change occurring when cells initiate a state change. Enhancer RNA concentration peaked as early as 15 min after the transition trigger was applied in some time courses. Although earlier studies of single time courses have reported enhancer activity prior to gene activa-tion in a small set of enhancer-gene pairs (8, 9, 19), we can now establish this phenomenon as a general feature of mammalian transcriptional regulation, across a multitude of biological systems. This challenges previous models which suggested that linked enhancers and promoters are co-expressed over time (e.g., refs. (8, 15, 19, 20)). Indeed, even in the case of late response classes, candidate enhancers

appear to be activated in advance of promoters in their vi-cinity (fig. S11). The rapid burst of eRNA activity at 15 min was frequently followed by a rapid return to base line (Fig. 1D). In these cases, it may be that once the target promoter has been activated, enhancer activity is no longer required. Other enhancers were rapidly activated and then continu-ously expressed. These eRNAs may have additional func-tional roles, such as the recently suggested role in promoting elongation (15).

REFERENCES AND NOTES 1. M. Levine, C. Cattoglio, R. Tjian, Looping back to leap forward: Transcription enters

a new era. Cell 157, 13–25 (2014). Medline doi:10.1016/j.cell.2014.02.009 2. S. Bonn, R. P. Zinzen, C. Girardot, E. H. Gustafson, A. Perez-Gonzalez, N.

Delhomme, Y. Ghavi-Helm, B. Wilczyński, A. Riddell, E. E. Furlong, Tissue-specific analysis of chromatin state identifies temporal signatures of enhancer activity during embryonic development. Nat. Genet. 44, 148–156 (2012). Medline doi:10.1038/ng.1064

3. A. S. Nord, M. J. Blow, C. Attanasio, J. A. Akiyama, A. Holt, R. Hosseini, S. Phouanenavong, I. Plajzer-Frick, M. Shoukry, V. Afzal, J. L. Rubenstein, E. M. Rubin, L. A. Pennacchio, A. Visel, Rapid and pervasive changes in genome-wide enhancer usage during mammalian development. Cell 155, 1521–1531 (2013). Medline doi:10.1016/j.cell.2013.11.033

4. A. B. Stergachis, S. Neph, A. Reynolds, R. Humbert, B. Miller, S. L. Paige, B. Vernot, J. B. Cheng, R. E. Thurman, R. Sandstrom, E. Haugen, S. Heimfeld, C. E. Murry, J. M. Akey, J. A. Stamatoyannopoulos, Developmental fate and cellular maturity encoded in human regulatory DNA landscapes. Cell 154, 888–903 (2013). Medline doi:10.1016/j.cell.2013.07.020

5. T.-K. Kim, M. Hemberg, J. M. Gray, A. M. Costa, D. M. Bear, J. Wu, D. A. Harmin, M. Laptewicz, K. Barbara-Haley, S. Kuersten, E. Markenscoff-Papadimitriou, D. Kuhl, H. Bito, P. F. Worley, G. Kreiman, M. E. Greenberg, Widespread transcription at neuronal activity-regulated enhancers. Nature 465, 182–187 (2010). Medline doi:10.1038/nature09033

6. H. Wu, A. S. Nord, J. A. Akiyama, M. Shoukry, V. Afzal, E. M. Rubin, L. A. Pennacchio, A. Visel, Tissue-specific RNA expression marks distant-acting developmental enhancers. PLOS Genet. 10, e1004610 (2014). Medline doi:10.1371/journal.pgen.1004610

7. M. Rönnerblad, R. Andersson, T. Olofsson, I. Douagi, M. Karimi, S. Lehmann, I. Hoof, M. de Hoon, M. Itoh, S. Nagao-Sato, H. Kawaji, T. Lassmann, P. Carninci, Y. Hayashizaki, A. R. Forrest, A. Sandelin, K. Ekwall, E. Arner, A. Lennartsson; FANTOM consortium, Analysis of the DNA methylome and transcriptome in granulopoiesis reveals timed changes and dynamic enhancer methylation. Blood 123, e79–e89 (2014). Medline

8. M. U. Kaikkonen, N. J. Spann, S. Heinz, C. E. Romanoski, K. A. Allison, J. D. Stender, H. B. Chun, D. F. Tough, R. K. Prinjha, C. Benner, C. K. Glass, Remodeling of the enhancer landscape during macrophage activation is coupled to enhancer transcription. Mol. Cell 51, 310–325 (2013). Medline doi:10.1016/j.molcel.2013.07.010

9. N. Hah, S. Murakami, A. Nagari, C. G. Danko, W. L. Kraus, Enhancer transcripts mark active estrogen receptor binding sites. Genome Res. 23, 1210–1223 (2013). Medline doi:10.1101/gr.152306.112

10. W. Li, D. Notani, Q. Ma, B. Tanasa, E. Nunez, A. Y. Chen, D. Merkurjev, J. Zhang, K. Ohgi, X. Song, S. Oh, H. S. Kim, C. K. Glass, M. G. Rosenfeld, Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation. Nature 498, 516–520 (2013). Medline doi:10.1038/nature12210

11. R. Andersson, P. Refsing Andersen, E. Valen, L. J. Core, J. Bornholdt, M. Boyd, T. Heick Jensen, A. Sandelin, Nuclear stability and transcriptional directionality separate functionally distinct RNA species. Nat. Commun. 5, 5336 (2014). Medline doi:10.1038/ncomms6336

12. R. Andersson, C. Gebhard, I. Miguel-Escalada, I. Hoof, J. Bornholdt, M. Boyd, Y. Chen, X. Zhao, C. Schmidl, T. Suzuki, E. Ntini, E. Arner, E. Valen, K. Li, L. Schwarzfischer, D. Glatz, J. Raithel, B. Lilje, N. Rapin, F. O. Bagger, M. Jørgensen, P. R. Andersen, N. Bertin, O. Rackham, A. M. Burroughs, J. K. Baillie, Y. Ishizu, Y. Shimizu, E. Furuhata, S. Maeda, Y. Negishi, C. J. Mungall, T. F. Meehan, T. Lassmann, M. Itoh, H. Kawaji, N. Kondo, J. Kawai, A. Lennartsson, C. O. Daub, P. Heutink, D. A. Hume, T. H. Jensen, H. Suzuki, Y. Hayashizaki, F. Müller, A. R. Forrest, P. Carninci, M. Rehli, A. Sandelin; FANTOM Consortium, A. R. R. Forrest,

/ sciencemag.org/content/early/recent / 12 February 2015 / Page 3 / 10.1126/science.1259418

P. Carninci, M. Rehli, A. Sandelin, An atlas of active enhancers across human cell types and tissues. Nature 507, 455–461 (2014). Medline doi:10.1038/nature12787

13. ENCODE Project Consortium, An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012). Medline

14. FANTOM Consortium and the RIKEN PMI and CLST (DGT), A. R. R. Forrest, H. Kawaji, M. Rehli, J. K. Baillie, M. J. de Hoon, V. Haberle, T. Lassman, I. V. Kulakovskiy, M. Lizio, M. Itoh, R. Andersson, C. J. Mungall, T. F. Meehan, S. Schmeier, N. Bertin, M. Jørgensen, E. Dimont, E. Arner, C. Schmidl, U. Schaefer, Y. A. Medvedeva, C. Plessy, M. Vitezic, J. Severin, C. Semple, Y. Ishizu, R. S. Young, M. Francescatto, I. Alam, D. Albanese, G. M. Altschuler, T. Arakawa, J. A. Archer, P. Arner, M. Babina, S. Rennie, P. J. Balwierz, A. G. Beckhouse, S. Pradhan-Bhatt, J. A. Blake, A. Blumenthal, B. Bodega, A. Bonetti, J. Briggs, F. Brombacher, A. M. Burroughs, A. Califano, C. V. Cannistraci, D. Carbajo, Y. Chen, M. Chierici, Y. Ciani, H. C. Clevers, E. Dalla, C. A. Davis, M. Detmar, A. D. Diehl, T. Dohi, F. Drabløs, A. S. Edge, M. Edinger, K. Ekwall, M. Endoh, H. Enomoto, M. Fagiolini, L. Fairbairn, H. Fang, M. C. Farach-Carson, G. J. Faulkner, A. V. Favorov, M. E. Fisher, M. C. Frith, R. Fujita, S. Fukuda, C. Furlanello, M. Furino, J. Furusawa, T. B. Geijtenbeek, A. P. Gibson, T. Gingeras, D. Goldowitz, J. Gough, S. Guhl, R. Guler, S. Gustincich, T. J. Ha, M. Hamaguchi, M. Hara, M. Harbers, J. Harshbarger, A. Hasegawa, Y. Hasegawa, T. Hashimoto, M. Herlyn, K. J. Hitchens, S. J. Ho Sui, O. M. Hofmann, I. Hoof, F. Hori, L. Huminiecki, K. Iida, T. Ikawa, B. R. Jankovic, H. Jia, A. Joshi, G. Jurman, B. Kaczkowski, C. Kai, K. Kaida, A. Kaiho, K. Kajiyama, M. Kanamori-Katayama, A. S. Kasianov, T. Kasukawa, S. Katayama, S. Kato, S. Kawaguchi, H. Kawamoto, Y. I. Kawamura, T. Kawashima, J. S. Kempfle, T. J. Kenna, J. Kere, L. M. Khachigian, T. Kitamura, S. P. Klinken, A. J. Knox, M. Kojima, S. Kojima, N. Kondo, H. Koseki, S. Koyasu, S. Krampitz, A. Kubosaki, A. T. Kwon, J. F. Laros, W. Lee, A. Lennartsson, K. Li, B. Lilje, L. Lipovich, A. Mackay-Sim, R. Manabe, J. C. Mar, B. Marchand, A. Mathelier, N. Mejhert, A. Meynert, Y. Mizuno, D. A. de Lima Morais, H. Morikawa, M. Morimoto, K. Moro, E. Motakis, H. Motohashi, C. L. Mummery, M. Murata, S. Nagao-Sato, Y. Nakachi, F. Nakahara, T. Nakamura, Y. Nakamura, K. Nakazato, E. van Nimwegen, N. Ninomiya, H. Nishiyori, S. Noma, S. Noma, T. Noazaki, S. Ogishima, N. Ohkura, H. Ohimiya, H. Ohno, M. Ohshima, M. Okada-Hatakeyama, Y. Okazaki, V. Orlando, D. A. Ovchinnikov, A. Pain, R. Passier, M. Patrikakis, H. Persson, S. Piazza, J. G. Prendergast, O. J. Rackham, J. A. Ramilowski, M. Rashid, T. Ravasi, P. Rizzu, M. Roncador, S. Roy, M. B. Rye, E. Saijyo, A. Sajantila, A. Saka, S. Sakaguchi, M. Sakai, H. Sato, S. Savvi, A. Saxena, C. Schneider, E. A. Schultes, G. G. Schulze-Tanzil, A. Schwegmann, T. Sengstag, G. Sheng, H. Shimoji, Y. Shimoni, J. W. Shin, C. Simon, D. Sugiyama, T. Sugiyama, M. Suzuki, N. Suzuki, R. K. Swoboda, P. A. ’t Hoen, M. Tagami, N. Takahashi, J. Takai, H. Tanaka, H. Tatsukawa, Z. Tatum, M. Thompson, H. Toyodo, T. Toyoda, E. Valen, M. van de Wetering, L. M. van den Berg, R. Verado, D. Vijayan, I. E. Vorontsov, W. W. Wasserman, S. Watanabe, C. A. Wells, L. N. Winteringham, E. Wolvetang, E. J. Wood, Y. Yamaguchi, M. Yamamoto, M. Yoneda, Y. Yonekura, S. Yoshida, S. E. Zabierowski, P. G. Zhang, X. Zhao, S. Zucchelli, K. M. Summers, H. Suzuki, C. O. Daub, J. Kawai, P. Heutink, W. Hide, T. C. Freeman, B. Lenhard, V. B. Bajic, M. S. Taylor, V. J. Makeev, A. Sandelin, D. A. Hume, P. Carninci, Y. Hayashizaki; A promoter-level mammalian expression atlas. Nature 507, 462–470 (2014). Medline doi:10.1038/nature13182

15. K. Schaukowitch, J. Y. Joo, X. Liu, J. K. Watts, C. Martinez, T. K. Kim, Enhancer RNA facilitates NELF release from immediate early genes. Mol. Cell 56, 29–42 (2014). 10.1016/j.molcel.2014.08.023 Medline doi:10.1016/j.molcel.2014.08.023

16. J. R. Dixon, S. Selvaraj, F. Yue, A. Kim, Y. Li, Y. Shen, M. Hu, J. S. Liu, B. Ren, Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012). Medline doi:10.1038/nature11082

17. G. Li, X. Ruan, R. K. Auerbach, K. S. Sandhu, M. Zheng, P. Wang, H. M. Poh, Y. Goh, J. Lim, J. Zhang, H. S. Sim, S. Q. Peh, F. H. Mulawadi, C. T. Ong, Y. L. Orlov, S. Hong, Z. Zhang, S. Landt, D. Raha, G. Euskirchen, C. L. Wei, W. Ge, H. Wang, C. Davis, K. I. Fisher-Aylor, A. Mortazavi, M. Gerstein, T. Gingeras, B. Wold, Y. Sun, M. J. Fullwood, E. Cheung, E. Liu, W. K. Sung, M. Snyder, Y. Ruan, Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation. Cell 148, 84–98 (2012). Medline doi:10.1016/j.cell.2011.12.014

18. FANTOM Consortium, H. Suzuki, A. R. Forrest, E. van Nimwegen, C. O. Daub, P. J. Balwierz, K. M. Irvine, T. Lassmann, T. Ravasi, Y. Hasegawa, M. J. de Hoon, S. Katayama, K. Schroder, P. Carninci, Y. Tomaru, M. Kanamori-Katayama, A.

Kubosaki, A. Akalin, Y. Ando, E. Arner, M. Asada, H. Asahara, T. Bailey, V. B. Bajic, D. Bauer, A. G. Beckhouse, N. Bertin, J. Björkegren, F. Brombacher, E. Bulger, A. M. Chalk, J. Chiba, N. Cloonan, A. Dawe, J. Dostie, P. G. Engström, M. Essack, G. J. Faulkner, J. L. Fink, D. Fredman, K. Fujimori, M. Furuno, T. Gojobori, J. Gough, S. M. Grimmond, M. Gustafsson, M. Hashimoto, T. Hashimoto, M. Hatakeyama, S. Heinzel, W. Hide, O. Hofmann, M. Hörnquist, L. Huminiecki, K. Ikeo, N. Imamoto, S. Inoue, Y. Inoue, R. Ishihara, T. Iwayanagi, A. Jacobsen, M. Kaur, H. Kawaji, M. C. Kerr, R. Kimura, S. Kimura, Y. Kimura, H. Kitano, H. Koga, T. Kojima, S. Kondo, T. Konno, A. Krogh, A. Kruger, A. Kumar, B. Lenhard, A. Lennartsson, M. Lindow, M. Lizio, C. Macpherson, N. Maeda, C. A. Maher, M. Maqungo, J. Mar, N. A. Matigian, H. Matsuda, J. S. Mattick, S. Meier, S. Miyamoto, E. Miyamoto-Sato, K. Nakabayashi, Y. Nakachi, M. Nakano, S. Nygaard, T. Okayama, Y. Okazaki, H. Okuda-Yabukami, V. Orlando, J. Otomo, M. Pachkov, N. Petrovsky, C. Plessy, J. Quackenbush, A. Radovanovic, M. Rehli, R. Saito, A. Sandelin, S. Schmeier, C. Schönbach, A. S. Schwartz, C. A. Semple, M. Sera, J. Severin, K. Shirahige, C. Simons, G. St Laurent, M. Suzuki, T. Suzuki, M. J. Sweet, R. J. Taft, S. Takeda, Y. Takenaka, K. Tan, M. S. Taylor, R. D. Teasdale, J. Tegnér, S. Teichmann, E. Valen, C. Wahlestedt, K. Waki, A. Waterhouse, C. A. Wells, O. Winther, L. Wu, K. Yamaguchi, H. Yanagawa, J. Yasuda, M. Zavolan, D. A. Hume; and Riken Omics Science Center, T. Arakawa, S. Fukuda, K. Imamura, C. Kai, A. Kaiho, T. Kawashima, C. Kawazu, Y. Kitazume, M. Kojima, H. Miura, K. Murakami, M. Murata, N. Ninomiya, H. Nishiyori, S. Noma, C. Ogawa, T. Sano, C. Simon, M. Tagami, Y. Takahashi, J. Kawai, Y. HayashizakiThe transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line. Nat. Genet. 41, 553–562 (2009). Medline doi:10.1038/ng.375

19. C.-L. Hsieh, T. Fei, Y. Chen, T. Li, Y. Gao, X. Wang, T. Sun, C. J. Sweeney, G. S. Lee, S. Chen, S. P. Balk, X. S. Liu, M. Brown, P. W. Kantoff, Enhancer RNAs participate in androgen receptor-driven looping that selectively enhances gene activation. Proc. Natl. Acad. Sci. U.S.A. 111, 7319–7324 (2014). Medline doi:10.1073/pnas.1324151111

20. N. E. IIott, J. A. Heward, B. Roux, E. Tsitsiou, P. S. Fenwick, L. Lenzi, I. Goodhead, C. Hertz-Fowler, A. Heger, N. Hall, L. E. Donnelly, D. Sims, M. A. Lindsay, Long non-coding RNAs and enhancer RNAs regulate the lipopolysaccharide-induced inflammatory response in human monocytes. Nat. Commun. 5, 3979–3979 (2014). Medline doi:10.1038/ncomms4979

21. M. Kanamori-Katayama, M. Itoh, H. Kawaji, T. Lassmann, S. Katayama, M. Kojima, N. Bertin, A. Kaiho, N. Ninomiya, C. O. Daub, P. Carninci, A. R. Forrest, Y. Hayashizaki, Unamplified cap analysis of gene expression on a single-molecule sequencer. Genome Res. 21, 1150–1159 (2011). Medline doi:10.1101/gr.115469.110

22. S. Anders, W. Huber, Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010). Medline doi:10.1186/gb-2010-11-10-r106

23. M. D. Robinson, D. J. McCarthy, G. K. Smyth, edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010). Medline doi:10.1093/bioinformatics/btp616

24. R. D. Finn, A. Bateman, J. Clements, P. Coggill, R. Y. Eberhardt, S. R. Eddy, A. Heger, K. Hetherington, L. Holm, J. Mistry, E. L. Sonnhammer, J. Tate, M. Punta, Pfam: The protein families database. Nucleic Acids Res. 42, D222–D230 (2014). Medline doi:10.1093/nar/gkt1223

25. J. M. Vaquerizas, S. K. Kummerfeld, S. A. Teichmann, N. M. Luscombe, A census of human transcription factors: Function, expression and evolution. Nat. Rev. Genet. 10, 252–263 (2009). Medline doi:10.1038/nrg2538

26. J. C. Roach, K. D. Smith, K. L. Strobe, S. M. Nissen, C. D. Haudenschild, D. Zhou, T. J. Vasicek, G. A. Held, G. A. Stolovitzky, L. E. Hood, A. Aderem, Transcription factor expression in lipopolysaccharide-activated peripheral-blood-derived mononuclear cells. Proc. Natl. Acad. Sci. U.S.A. 104, 16245–16250 (2007). Medline

27. E. Wingender, T. Schoeps, J. Dönitz, TFClass: An expandable hierarchical classification of human transcription factors. Nucleic Acids Res. 41, D165–D170 (2013). Medline doi:10.1093/nar/gks1123

28. D. Karolchik, G. P. Barber, J. Casper, H. Clawson, M. S. Cline, M. Diekhans, T. R. Dreszer, P. A. Fujita, L. Guruvadoo, M. Haeussler, R. A. Harte, S. Heitner, A. S. Hinrichs, K. Learned, B. T. Lee, C. H. Li, B. J. Raney, B. Rhead, K. R. Rosenbloom, C. A. Sloan, M. L. Speir, A. S. Zweig, D. Haussler, R. M. Kuhn, W. J. Kent, The UCSC Genome Browser database: 2014 update. Nucleic Acids Res. 42, D764–D770 (2014). Medline doi:10.1093/nar/gkt1168

29. C. Notredame, D. G. Higgins, J. Heringa, T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 302, 205–217 (2000).

/ sciencemag.org/content/early/recent / 12 February 2015 / Page 4 / 10.1126/science.1259418

Medline doi:10.1006/jmbi.2000.4042 30. P. Arnold, I. Erb, M. Pachkov, N. Molina, E. van Nimwegen, MotEvo: Integrated

Bayesian probabilistic methods for inferring regulatory sites and motifs on multiple alignments of DNA sequences. Bioinformatics 28, 487–494 (2012). Medline doi:10.1093/bioinformatics/btr695

31. M. Pachkov, P. J. Balwierz, P. Arnold, E. Ozonov, E. van Nimwegen, SwissRegulon, a database of genome-wide annotations of regulatory sites: Recent updates. Nucleic Acids Res. 41, D214–D220 (2013). Medline doi:10.1093/nar/gks1145

32. J. W. Tullai, M. E. Schaffer, S. Mullenbrock, G. Sholder, S. Kasif, G. M. Cooper, Immediate-early and delayed primary response genes are distinct in function and genomic architecture. J. Biol. Chem. 282, 23981–23995 (2007). Medline doi:10.1074/jbc.M702044200

ACKNOWLEDGMENTS

For a full list of acknowledgements and contributions, see Supplementary text. Funding: FANTOM5 was made possible by a Research Grant for RIKEN Omics Science Center from MEXT to Y. Hayashizaki. It was also supported by Research Grants for RIKEN Preventive Medicine and Diagnosis Innovation Program to Y. Hayashizaki and RIKEN Centre for Life Science Technologies, Division of Genomic Technologies (from the MEXT, Japan). Additional funding is listed in Supplementary text. Data accessibility: All CAGE data needed to reproduce the study has been deposited at DDBJ under accession numbers DRA000991, DRA002711, DRA002747 and DRA002748. Additional visualizations of the data are available at http://fantom.gsc.riken.jp/5/. The human iPSC lines that were subjected to cortical neuronal differentiation can be made available following completion of an MTA with the Australian Institute for Bioengineering and Nanotechnology of The University of Queensland.

SUPPLEMENTARY MATERIALS www.sciencemag.org/cgi/content/full/science.1259418/DC1 Materials and Methods Supplementary Text Figs. S1 to S14 Tables S1 to S5 Auxiliary data tables S1 to S3 References (21–32) 31 July 2014; accepted 26 January 2015 Published online 12 February 2015 10.1126/science.1259418

/ sciencemag.org/content/early/recent / 12 February 2015 / Page 5 / 10.1126/science.1259418

Fig. 1. Time course design and definition of response classes. A. Schematic illustration of the time course experiments included in the study, arranged according to a development tree. Germ layers are shown as boxes. Black stars indicate time series sampled with high resolution. B. Overview of time courses according to sampling strategy. X-axis indicates time after induction. Each dot indicates CAGE sampling, typically done in biological triplicates. C. Stylistic representation of each of the major up-regulated response patterns (classes) identified as described in the main text. Y-axis shows log2 fold change vs. time 0, X-axis shows time in minutes. D. Mean expression log2 fold change across time courses for enhancers and promoters classified into each response pattern (as in panel C). The 95% confidence intervals of means are shown. E. Boxplots of fractions showing the preference for enhancers, TF promoters and other promoters for respective response class. F. Overlap between time courses in terms of enhancers and promoters in respective class. Barplots show the frequency (Y-axis) of the number of time courses (out of 9) sharing a specific feature (X-axis).

/ sciencemag.org/content/early/recent / 12 February 2015 / Page 6 / 10.1126/science.1259418

Fig. 2. Temporal shifts between enhancer and promoter activity. A. Smoothed mean expression over time for all enhancers classified into the rapid short response group and all differentially expressed proximal (+/−200kb) promoters, split by gene type. Controls for class specificity (dotted lines) constitute promoters proximal to randomly sampled enhancers from other classes. Shaded areas indicate 95% confidence intervals. B. Example of expression timing in an enhancer-promoter pair (EGR1), showing activation of enhancers before promoter activation. MCF-7 ChIA-PET interaction data are visualized at the bottom as green lines; each line represents a cluster of ChIA-PET paired tags consisting of at least three pairs, where line end thickness is proportional to the number of paired tags in the cluster. Right panel shows the expression level of promoter and enhancer in MCF-7 cells after induction with HRG. Error bars indicate standard deviation. C. Left: Distribution of Center of Mass (CM) of expression changes (see main text) for enhancers, TF-promoters and promoters of other genes. Right: difference in CM (“shift”) between enhancers-promoter pairs linked by proximity (+/− 200kb) split by gene type. Black dots indicate 25th, 50th and 75th percentiles. Asterisks indicate significance (P<1.0 e−106. Mann-Whitney U test). D. The similarity of enhancer/promoter response classification (Fig. 1C) within each TAD was analyzed by calculating the frequency of identically classified enhancer/promoters in all pairwise comparisons. Frequency distributions are shown as violin plots. Controls are made by randomly sampling the same number of enhancers or promoters and calculate the classification similarity as above (repeated 100 times for each TAD). Asterisks indicate significance (P < 0.01, Mann-Whitney U test), dots represent percentiles as above. E. Fraction of enhancers that interact (by RNAPII-ChIA-PET) with promoters in unstimulated MCF-7 cells, split by enhancer response class in the MCF-7+HRG time course..

/ sciencemag.org/content/early/recent / 12 February 2015 / Page 7 / 10.1126/science.1259418

Fig. 3. Motif analysis of linked enhancers and promoters over time. A. Overlap of motifs classified as significant for driving expression in enhancers and promoters. Top row: bar plot of motif overlap odds ratios, colored by significance. Bottom row: Venn diagrams of motif set overlap. B. Distributions of average Pearson correlation coefficient between motif activities in enhancers and promoters in all motifs investigated (black) and motifs significantly active in both enhancers and promoters (gray). C. Distribution of shift (minutes) in motif activity center of mass (see Fig. 2D) in promoters compared to enhancers. D. Examples of motif activity in enhancers preceding that of promoters. Motif activity is plotted as the average of activity Z-scores per time point. Error bars indicate the standard deviation.

/ sciencemag.org/content/early/recent / 12 February 2015 / Page 8 / 10.1126/science.1259418