Identification of potential miRNAs and their targets in Vriesea carinata (Poales, Bromeliaceae)
Transcript of Identification of potential miRNAs and their targets in Vriesea carinata (Poales, Bromeliaceae)
This article appeared in a journal published by Elsevier. The attachedcopy is furnished to the author for internal non-commercial researchand education use, including for instruction at the authors institution
and sharing with colleagues.
Other uses, including reproduction and distribution, or selling orlicensing copies, or posting to personal, institutional or third party
websites are prohibited.
In most cases authors are permitted to post their version of thearticle (e.g. in Word or Tex form) to their personal website orinstitutional repository. Authors requiring further information
regarding Elsevier’s archiving and manuscript policies areencouraged to visit:
http://www.elsevier.com/authorsrights
Author's personal copy
Plant Science 210 (2013) 214– 223
Contents lists available at SciVerse ScienceDirect
Plant Science
jo u r n al homep age: www.elsev ier .com/ locate /p lantsc i
Identification of potential miRNAs and their targets in Vriesea carinata(Poales, Bromeliaceae)
Frank Guzmana, Mauricio Pereira Almerãob, Ana Paula Korbesa, Ana Paula Christoff a,Camila Martini Zanellaa, Fernanda Bereda, Rogério Margisa,b,∗
a PPGBM at Federal University of Rio Grande do Sul – UFRGS, Porto Alegre, RS, Brazilb Centre of Biotechnology and PPGBCM, Laboratory of Genomes and Plant Population, Federal University of Rio Grande do Sul – UFRGS, Porto Alegre, RS,Brazil
a r t i c l e i n f o
Article history:Received 8 September 2012Received in revised form 24 April 2013Accepted 23 May 2013Available online xxx
Keywords:Vriesea carinatamiRNAsHigh-throughput sequencingStress response
a b s t r a c t
The miRNAs play important roles in regulation of gene expression at the post-transcriptional level. A smallRNA and RNA-seq of libraries were constructed to identify miRNAs in Vriesea carinata, a native bromeliadspecies from Brazilian Atlantic Rainforest. Illumina technology was used to perform high throughputsequencing and data was analyzed using bioinformatics tools. We obtained 2,191,509 mature miRNAssequences representing 54 conserved families in plant species. Further analysis allowed the predictionof secondary structures for 19 conserved and 16 novel miRNAs. Potential targets were predicted frompre-miRNAs by sequence homology and validated using RTqPCR approach. This study provides the firstidentification of miRNAs and their potential targets of a bromeliad species.
© 2013 Elsevier Ireland Ltd. All rights reserved.
1. Introduction
MicroRNAs (miRNAs) are small non-coding regulatory RNAswidely found in unicellular and multicellular organisms that actas regulators of gene expression at the post-transcriptional levelon genes containing miRNA binding sites [1]. Mature miRNAs aresingle-stranded RNA molecules of approximately 21 nucleotides(nt) in length processed from a precursor molecule (pre-miRNA)[2]. To regulate protein-coding genes the mature miRNA binds inthe mRNA target site leading to mRNA degradation or translationrepression [3]. In plants, miRNAs have diverse biological functionsand are involved in the regulation of optimal growth and devel-opment, as well as other physiological processes, including abioticand biotic stress responses [4]. Several studies showed that manymiRNAs are conserved across different plant families [5]. However,it was also reported species and family specific miRNAs that areexpressed in low levels and probably have evolved recently [6].
Bromeliaceae family, with 3248 species distributed in 58 genera[7], is an example of a large and well described adaptive radiation
Abbreviations: miRNA, microRNA; sRNA, small RNA; pre-miRNA, microRNA pre-cursor; MFEI, minimal folding energy index; nt, nucleotides; vca, Vriesea carinata.
∗ Corresponding author at: Centre of Biotechnology and PPGBCM, Laboratory ofGenomes and Plant Population, Building 43431, Federal University of Rio Grande doSul – UFRGS, P.O. Box 15005, CEP 91501-970, Porto Alegre, RS, Brazil.Tel.: +55 51 33087766; fax: +55 51 33087309.
E-mail addresses: [email protected], [email protected](R. Margis).
of plant families in the Neotropics. The family is composed of ter-restrial xerophytes and both facultative and obligatory epiphytesspecies which acquired, throughout their evolution, interesting anddifferent adaptive mechanisms such as central tanks, CAM photo-synthetic pathway, and bear absorptive trichomes, characteristicsthat allowed them to occupy a wide range of habitats [8,9]. Con-sequently, Bromeliaceae constitutes one of the most ecologicallydiverse and species-rich clades of flowering plants native to theNew World [10]. In Brazil, the Atlantic Rainforest is considered oneof the main centers of diversity and endemism of Bromeliaceae,showing 31 genera and 803 species of which 10 genera and 653species are endemic [11]. Vriesea carinata is an epiphytic or ter-restrial species distributed along the Brazilian Atlantic Rainforest.As a typical species of this biome [12], V. carinata is an interest-ing model for studying the expression of miRNAs in Bromeliaceae.The first step to study the expression of miRNAs is to identify miR-NAs and its targets in different natural conditions. For this purpose,we performed a high-throughput sequencing analysis (Solexa tech-nology) of small RNAs (sRNAs) from the endemic Brazilian AtlanticRainforest species V. carinata.
2. Materials and methods
2.1. Plant material and RNA isolation
Total RNA was isolated from V. carinata leaves using Trizolreagent (Invitrogen, CA, USA), according to manufacturer’s pro-tocol. The RNA quality was evaluated by electrophoresis on a 1%
0168-9452/$ – see front matter © 2013 Elsevier Ireland Ltd. All rights reserved.http://dx.doi.org/10.1016/j.plantsci.2013.05.013
Author's personal copy
F. Guzman et al. / Plant Science 210 (2013) 214– 223 215
agarose gel and quantification was determined using a Nanodrop(Nanodrop Technologies, Wilmington, DE, USA).
2.2. Deep sequencing
Total RNA (>10 �g) from leaves was sent to Fasteris SA (Plan-les-Ouates, Switzerland) for processing. One small RNA (sRNA) librarywas constructed and sequenced the Illumina HiSeq2000 platform.Briefly, the construction of the small RNA libraries consisted of thefollowing successive steps: acrylamide gel purification of the RNAfraction corresponding to the size range 20–30 nt, ligation of the 3pand 5p adapters to the RNA in two separate subsequent steps, eachfollowed by acrylamide gel purification, cDNA synthesis followedby acrylamide gel purification, and a final step of PCR amplificationto generate a cDNA colony template library for Illumina sequencing.
A polyadenylated transcript sequencing (mRNA-seq) was per-formed using the following successive steps: poly-A purification,cDNA synthesis using poly-T primer shotgun to generate inserts of500 nt, 3p and 5p adapters ligations, pre-amplification, colony gen-eration and sequencing. The Illumina output data correspondes tosequence tags of 100 bases.
2.3. Accession numbers
Sequencing data is available in Gene Expression Omnibus (GEO)under the series accession GSE38250 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE38250). This accession contains theRNA-seq and sRNA libraries derived from V. carinata leaves.
2.4. Analysis of small RNA library
The overall procedure for analyzing Illumina small RNA libraryis shown in Fig. S1. All low quality reads (with FASTq value bel-low 13) were removed and 5′ and 3′ adapter sequences weretrimmed using Genome Analyzer Pipeline (Illumina) by Fasteris.The remaining low quality reads with ‘n’ were removed with Prin-Seq script [13]. Sequences shorter than 18 nt and larger than 25 ntwere excluded from further analysis. Small RNAs derived fromViridiplantae rRNAs, tRNAs, snRNAs and snoRNAs deposited at thetRNAdb [14], SILVA rRNA [15], and NONCODE v3.0 [16] databasesand from Poales mtRNA and cpRNA deposited at NCBI GenBankdatabase (http://ftp.ncbi.nlm.nih.gov) were identified by mappingwith Bowtie [17]. After data cleaning (low quality reads, adaptersequences), mRNA-seq data was assembled de novo in contigs usingCLC Genome Workbench version 4.0.2 (CLCbio, Aarhus, Denmark)with default parameters. In total, 41,171 contigs were assembledand used as reference for pre-miRNA and target sequence identifi-cation.
2.5. Identification of conserved and novel miRNAs
In order to determine plant conserved miRNAs, sRNA sequenceswere aligned with conserved non-redundant Viridiplantae miRNAsdeposited at miRBase (Release 18, November 2011) using Bowtie.Complete alignment of the sequences was required and no mis-matches allowed. To search for novel miRNAs, sRNA sequenceswere matched against a set of proprietary V. carinata mRNAtranscript database using SOAP2 [18]. The SOAP2 output was fil-tered with an in house filter precursor tool to separate candidatesequences as miRNA precursors with an anchoring pattern of ablock of aligned small RNAs with perfect matches. The candidateprecursors obtained were confirmed manually using Tablet soft-ware [19] to visualize the anchoring pattern. As miRNAs precursorshave a characteristic hairpin structure, the next step to select pre-cursor candidates was the secondary structure analysis by RNAfold
with RNAfold webserver and using annotation algorithm from theUEA sRNA toolkit [20].
We used the mfold web server (http://mfold.rna.albany.edu/?q=mfold/RNA-Folding-Form) to identify the minimal foldingfree energy (MFE, �G kcal/mol) of each miRNA precursor. This valueestimates the stability of the miRNA candidate-target duplex. Then,the adjust minimal folding free energy (AMFE) and the minimalfolding free energy index (MFEI) were calculated according to theprevious report [21]. AMFE means the MFE of a RNA sequencewith 100 nt in length, which is equal to MFE/(length of a poten-tial pre-miRNA) × 100. MFEI is equal to MFE/(length of a potentialpre-miRNA)/(The percentage of nucleotides G and C). In addition,perfect stem-loop structures should have the sRNA sequence at onearm of the stem and a respective anti-sense sequence at the oppo-site arm. Finally, precursor candidate sequences were confirmed asnovel by BLASTn algorithm from the miRBase (www.mirbase.org)and NCBI databases.
2.6. Prediction of miRNA targets
The mRNA contigs previously assembled were clustered usingthe Gene Indices Clustering Tools (http://compbio.dfci.harvard.edu/tgi/software/) [22] to reduce any sequence redundancy. Theclustering output was passed to CAP3 assembler [23] for multi-ple alignment and consensus building. Contigs that cannot reachthe threshold set and fall into any assembly should remain as alist of singletons. The prediction of target genes of the most abun-dant mature miRNAs from the conserved and novel pre-miRNAswas performed by psRNAtarget [24] using V. carinata assembledunigenes longer than 600 bp (default parameters and expectationvalue of 3.5). Candidate RNA sequences were then annotated byassignment of putative gene descriptions based on sequence simi-larity with previously identified genes. These genes were annotatedwith those details deposited in the protein database of NR andSwiss Prot/Uniprot protein database using BLASTx implementedin blast2GO v2.3.5 software [25]. The annotation was improvedby analysis of conserved domains/families using InterProScan tooland Gene Ontology terms were determined by GOslim tool fromblast2GO software. At the same time the orientation of the trans-cripts were obtained from BLAST annotations.
2.7. miRNAs and target confirmation by RT-qPCR
In order to validate the in silico predicted V. carinata miRNAs andsome of their mRNAs targets, RT-qPCR reactions were performed asdescribed in previous works [26,27]. RNA samples were extractedfrom two different tissues, leaf and ovary in six biological repli-cates, with the Trizol reagent (Invitrogen, CA, USA). The RNA qualitywas accessed by electrophoresis on a 1% agarose gel. Thereafter thecDNA were obtained for 17 miRNAs based on the stem-loop method[28]. Also the cDNA for mRNA targets validation were obtainedbased on the poli-T amplification by reverse transcription of anM-MLV RNA Polymerase (Invitrogen, CA, USA), accordingly withthe manufacturer instructions. Primers used for Stem loop cDNAsynthesis, mature miRNA expression and mRNA target amplifica-tion were described in Supplementary Tables S1–S3, respectively.The RT-qPCR amplifications were performed in a CFX 384 Real-Time PCR System (Bio Rad), using SYBR Green (Invitrogen). PCRreactions were carried out in a final volume of 10 �L, containing5 �L of diluted cDNA (1:100) and 5 �L of reagents mix: 1X SYBRGreen, 0.025 mM dNTP, 1X PCR buffer, 3 mM MgCl2, 0.25 U Plati-nun Taq DNA Polimerase (Invitrogen) and 200 nM of each reverseand forward primer. The RT-qPCR conditions were set as follow:94 ◦C for 5 min, 40 cycles of 94 ◦C for 15 s, 60 ◦C for 10 s and 25 sat 72 ◦C. In the end of the PCR run, a melting curve were evalu-ated. Samples were analyzed in four technical replicates, and a no
Author's personal copy
216 F. Guzman et al. / Plant Science 210 (2013) 214– 223
Table 1Summary of data from sequencing of V. carinata small RNA libraries.
Type Number of reads Percentage (%)
Total readsa 15,986,233 10018–25 nt 13,834,378 86<18 nt 752,929 5>25 nt 1,398,926 9
a Reads with high quality.
template negative control was included. RNA input normalizationsfor miRNA were performed with the miR011 and miR397, selectedby geNorm [29] as the best combination of normalizers. For targetmRNAs the combination of the genes vca-miR011-5p-2-t (his-tone acetyl tranferase mbd9) and vca-miR396-5p-2-t (nuclear porecomplex protein nup98-nup96) were set as the best combinationfor RNA input normalization by geNorm. To calculate the relativeexpression of miRNAs and mRNA targets the 2−��ct method wereused [30]. To compare pairwise differences in expression, Student’st-test was performed considering p < 0.05.
3. Results
3.1. V. carinata RNA library sequencing
To identify miRNAs, a sRNA library was constructed from leavesof V. carinata. After library sequencing, removal of adapter, insert,and short RNAs smaller than 18 nt long and longer than 25 nt long,a total of 15,986,233 reads were obtained (Table 1). The length dis-tribution pattern and the number of reads between 18 and 25 nt(13,834,378) in the redundant and non-redundant sRNAs datasetsare shown in Fig. 1 and Table S4, respectively. The highest abun-dance was found for sequences in the range of 21–24 nt, whereasthe highest reads redundancy was observed in the 24 nt length.Around 15.84% of reads matched miRNAs, 7% matched noncodingsRNAs, (rRNA, tRNA, snRNA, snoRNA), 6% matched organellar sRNAs(mtRNA, cpRNA) and 70.96% matched other sRNAs (Table 2).
Because the genome of V. carinata is not publically available,we sequenced the mRNA transcriptome of V. carinata leaves foruse as a reference sequence in further analysis. The pooled mRNA-seq yielded 21,424,214 reads, which were imported into the CLCGenomics Workbench and de novo assembled into 41,171 contigswith an average length of 695 bp.
Size 18
Size 19
Size 20
Size 21
Size 22
Size 23
Size 24
Size 25
lenght
Redund ant reads Unique reads
Lenght
Tota
l rea
ds 1
06
0
1
2
3
4
5
6
7
8
Fig. 1. Length distribution of unique and redundant V. carinata small RNAs.
Table 2Categorization of V. carinata noncoding and organellar small RNAs.a
Class of small RNA Number of reads Percentage (%)
miRNAs 2,191,509 15.84rRNA 840,795 6.07tRNA 131,927 0.95snRNA 7108 0.05snoRNA 734 0.01mtRNA 344,331 2.48cpRNA 500,623 3.61Other sRNAs 9,817,351 70.96
a 18–25 nt reads considered.
3.2. Identification of conserved miRNAs in V. carinata
There were 4.677 miRNAs from 47 Magnoliophyta speciesdeposited in the microRNA database (miRBase, Release 18.0,November 2011). To identify conserved mature miRNAs in V. cari-nata, sRNA library was matched against a set of 2.585 plant uniquemature miRNAs from miRBase. We identified 1.101.505 reads rep-resented by 182 conserved plant miRNAs, which were distributedin 54 conserved miRNAs families with an average of about 4miRNA members per family (Fig. 2 and Table S5). The most abun-dant families were MIR166 (20 members), MIR156 (19 members),MIR396 (15 members), MIR169 (11 members), MIR167 (10 mem-bers) and MIR159 (7 members). Some of these families are oftenamong most represented miRNA families in other plant species[31–35]. Thirty-one families were represented by only one member(Table S5).
Globally, the relative representation of each miRNA family interms of number of reads was variable. For example, the most rep-resented families were MIR167, MIR159 and MIR166 with ∼39%(433.596 reads), ∼29% (320.878 reads) and ∼14% (154.768 reads)of total reads (Table S2). The frequencies of reads ranged from 1(14 families) to 433.596 reads (MIR167), indicating that expressionvaried significantly among different miRNA families. In some cases,variation of number of reads was very impressive (1–433.596 reads,MIR167; 1–136.889 reads, MIR166 and 2–319.041 reads, MIR159).These results indicated that different members have clearly dif-ferent expression levels in each miRNA family. Also, the highabundance could reflect the role of these miRNA families in fun-damental biological process.
Since the genome of V. carinata is not publically available, thesRNA library was matched against a set of de novo assembledcontigs from V. carinata mRNA-seq of leaves to identify puta-tive pre-miRNA sequences. Candidate sequences with hairpin-likestructure and mature miRNAs anchored in the 5p or 3p or inboth arms were further considered. Our analysis allowed the iden-tification of 19 pre-miRNA sequences grouped in 13 conservedmiRNA families (Table 3). The pre-miRNA length ranged from 78to 209 pb and the lower minimal folding free energy (MFE) rangedfrom −36.90 to −83.60 (an average −49.84) and minimal foldingfree energy index (MFEI) ranged from −0.66 to −1.24 (an aver-age of −1.00). MFEI is a criterion for distinguishing miRNAs fromother RNAs and previous studies have shown that it is more likelyto be a potential miRNA if a sequence has a MFEI value ≤ −0.85[21]. For conserved miRNA in V. carinata, we detected only oneexception in which conserved pre-miRNA showed MFEI under theexpected value (−0.66; vca-MIR168-2) and according to Zhanget al. [41], this value is expected for mRNA. Even though MFEIis an excellent parameter to identify pre-miRNA, some studiesshowed considerable deviation of the expected value (≥−0.85)for conserved pre-miRNA in other plant species [32,33]. All pre-miRNA predicted by RNAfold had regular stem-loop structures(Fig. S2).
Author's personal copy
F. Guzman et al. / Plant Science 210 (2013) 214– 223 217
Tab
le
3C
har
acte
rist
ics
of
pre
-miR
NA
s
iden
tifi
ed
in
V. c
arin
ata
mat
chin
g
con
serv
ed
miR
NA
fam
ilie
s
in
oth
er
pla
nt
spec
ies.
Pre-
miR
NA
Prec
urs
or
miR
NA
s
Mat
ure
miR
NA
s
Con
tig
nam
e
Len
gth
MFE
AM
FE
MFE
I
5p
mor
e
abu
nd
ant
3p
mor
e
abu
nd
ant
Tota
l rea
ds
Tota
l iso
miR
NA
s
Sequ
ence
mem
ber
Rea
ds
Sequ
ence
mem
ber
Rea
ds
vca-
MIR
156-
1
con
tig
1187
5210
3
−58.
70−5
6.99
−1.0
5U
GA
CA
GA
AG
AG
AG
UG
AG
CA
C
9473
UG
CU
CA
CU
UC
UC
UU
UC
UG
UC
AG
730
14,9
63
55vc
a-M
IR15
6-2
con
tig
1376
86
−47.
20
−54.
88
−1.2
4
UU
GA
CA
GA
AG
AU
AG
AG
AG
CA
C
8388
GC
UC
UC
UA
UG
CU
UC
UG
UC
AU
C
359
8921
22vc
a-M
IR15
9
con
tig
1404
10
209
−83.
60
−40.
00
−0.9
1
GA
GC
UU
CC
UU
GG
GU
CC
AA
AG
A
3500
UU
UG
GA
UU
GA
AG
GG
AG
CU
CU
A
319,
041
332,
989
49vc
a-M
IR16
0
con
tig
1460
36
103
−54.
40
−52.
82
−0.9
9
UG
CC
UG
GC
UC
CC
UG
AA
UG
CC
A
291
ND
–
307
4vc
a-M
IR16
6-1
con
tig
1327
16
96
−52.
60
−54.
79
−1.0
1
GG
AA
UG
UU
GU
CU
GG
UU
CG
AA
A
240
UC
GG
AC
CA
GG
CU
UC
AU
UC
CC
C
136,
889
148,
317
30vc
a-M
IR16
6-2
con
tig
1452
5512
9
−46.
31−3
5.89
−0.8
4
GG
AA
UG
UU
GU
CU
GG
CU
CG
AG
G
1564
UC
GG
AC
CA
GG
CU
UC
AU
UC
CC
C
136,
889
149,
916
34vc
a-M
IR16
6-3
con
tig
7568
4
89
−39.
80
−44.
72
−0.8
3
GG
AA
UG
UU
GU
CC
GG
CU
CG
AG
G
1842
UC
GG
AC
CA
GG
CU
UC
AU
UC
CC
C
136,
889
149,
967
35vc
a-M
IR16
7-1
con
tig
1269
3910
4
−56.
10−5
3.94
−1.2
2U
GA
AG
CU
GC
CA
GC
AU
GA
UC
UG
A
410,
104
CA
GA
UC
AU
GU
GG
CA
GU
UU
CA
U
17
411,
899
30vc
a-M
IR16
7-2
con
tig
8566
7
107
−38.
70
−36.
17
−0.7
6
UG
AA
GC
UG
CC
AG
CA
UG
AU
CU
GA
410,
104
AG
AU
CA
UG
UU
GC
AG
UU
UC
AU
C
40
411,
769
19vc
a-M
IR16
8-1
con
tig
8494
1
91
−54.
90
−60.
33
−0.9
6
UC
GC
UU
GG
UG
CA
GG
UC
GG
GA
A
80,4
71
UC
CC
GG
CU
UG
CA
CC
AA
CU
GA
A
357
87,3
36
61vc
a-M
IR16
8-2
con
tig
9753
4
90
−39.
70
−44.
11
−0.6
6
UC
GC
UU
GG
UG
CA
GG
UC
GG
GA
A
80,4
71
CC
CG
CC
UU
GC
AC
CA
AC
UG
AA
U
124
86,3
14
23vc
a-M
IR17
2-1
con
tig
1421
80
130
−50.
80
−39.
08
−1.0
8
GU
GG
CA
UC
AU
GA
AG
AU
UC
AC
A
484
AG
AA
UC
UU
GA
UG
AU
GC
UG
CA
U
1783
67,6
57
35vc
a-M
IR17
2-2
con
tig
6959
6
94
−46.
80
−49.
79
−1.1
7
CA
GC
AU
CA
UC
CA
GA
UU
CA
CA
U
697
AG
AA
UC
UU
GA
UG
AU
GC
UG
CA
U
1783
68,2
48
53vc
a-M
IR39
3
con
tig
1443
92
79
−40.
20
−50.
89
−1.2
2
UC
CA
AA
GG
GA
UC
GC
AU
UG
AU
C
1017
UC
AA
UG
CG
AU
UU
CU
UU
GG
AA
U
50,3
29
52,0
13
28vc
a-M
IR39
6
con
tig
1039
4290
−36.
90
−41.
00
−1.0
5
UU
CC
AC
AG
CU
UU
CU
UG
AA
CU
4329
GU
UC
AA
UA
AA
GC
UG
UG
GG
AA
A
1116
10,0
58
37vc
a-M
IR39
7
con
tig
2034
7
78
−37.
70
−48.
33
−0.9
9
UC
AU
UG
AG
UG
CA
GC
GU
UG
AU
G
1514
UC
AG
CG
CU
UC
AC
UC
AA
UC
AU
G
54
2423
16vc
a-M
IR40
8
con
tig
7432
7
104
−61.
50
−59.
13
−0.9
9
AC
GG
GG
AC
GA
GG
UC
GG
GC
AU
G
54
UG
CA
CU
GC
CU
CU
UC
CC
UG
GC
U
848
1953
12vc
a-M
IR52
8
con
tig
8393
83
−45.
50
−54.
82
−0.9
9
UG
GA
AG
GG
GC
AU
GC
AG
AG
GA
G
16,0
88
UU
UC
CU
GU
GC
UC
GC
CC
CU
UC
C
413
17,0
25
27vc
a-M
IR53
5
con
tig
1422
06
94
−55.
60
−59.
15
−1.1
1
UG
AC
GA
UG
AG
AG
AG
AG
CA
CG
C
46,8
06
GU
GC
UC
UC
UC
UC
GU
UG
UC
AC
C
806
49,5
98
78
ND
:
Not
det
ecte
d.
3.3. Identification of novel miRNAs in V. carinata
Following the criteria in the identification of conserved pre-miRNAs, we obtained 17 potential novel miRNAs grouped in 16miRNA families (Table 4). In addition to the hairpin structure, thedetection of complementary antisense miRNA (miRNA*) in all pre-miRNA was a strong indication to consider these sequences as truecandidates. Comparison between the mature sequences of candi-date miRNAs and sequences deposited in miRBase suggest thatthese candidates are novel miRNAs not identified in others speciesand probably specific to Bromeliacea. These sequences (miRNA*)are rarely found by cloning because of their quick degradation incells and its detection represented further evidence for the exist-ence of mature miRNAs [36].
The novel pre-miRNA sequences in V. carinata ranged from 79to 235 nt and the lower minimal folding free energy (MFE) rangedfrom −26.20 to −109.50 and minimal folding free energy index(MFEI) ranged from −0.87 to −1.48 (Table 4). All novel pre-miRNAidentified were in accordance with the expected value (≥−0.85)[21]. Likewise to conserved pre-miRNA, all novel pre-miRNA pre-dicted by RNAfold had regular stem-loop structures (Fig. S2).
3.4. IsomiRs sequences
We detected multiple mature miRNAs variants named isoformsor isomiRs in both arms (5p and 3p) of identified conserved andnovel pre-miRNAs (Tables 3 and 4). Because they vary widely inabundance, we considered all isomiRs in each pre-miRNA even ifthey showed only one read. For example, vca-nMIR016 showedsix isomiRs in 3p arm varying from 2 to 13 reads and six isomiRsin 5p arm with reads ranging from 1 to 17 (Fig. 3). The exist-ence of these multiple variants reflects the variability in miRNAbiogenesis, specifically in DCL cleavage sites or subsequent pro-cessing/degradation or later processing steps [37]. Although thebiological functions of isomiRs remain to be determined, isomiRsfrom a single pre-miRNA could be a way of broadening the regula-tory network [38,39].
3.5. Identification and classification of targets for pre-miRNAsidentified
To understand the biological function of miRNAs in V. carinata,the putative targets sites of the miRNA candidates were identifiedby aligning the most abundant mature miRNAs of each conservedand novel pre-miRNA identified to a set of V. carinata assembledunigenes using psRNAtarget. Despite the use of 4 as cut-off in pre-vious works to infer miRNA targets [40,41], we adopted the strictercut-off threshold of 3.5. Based on this criterion, we found 145 poten-tial targets in total (for 27 miRNA families), of which 79 weretargets of conserved miRNAs (13 miRNA families) and 66 were tar-gets of novel miRNAs (14 miRNA families). Only for three novelmiRNAs families (vca-nMIR002, vca-nMIR003 and vca-nMIR016)targets were not detected. Detailed annotation results are given inTables 5and S6.
All targets regulated by the identified conserved and novel miR-NAs in this study were subjected to GO analysis to evaluate theirpotential functions. The categorization of these genes according tobiological process, cellular component and molecular function aresummarized in Fig. 4. Based on biological process, these genes wereclassified into 12 categories and the four most overrepresentedGO terms were response to metabolic, cellular, developmental andmulticellular processes, suggesting that V. carinata miRNAs areinvolved in a broad range of physiological functions. Interestingly,we found 8 and 6 candidate genes in the category of responseto abiotic stimulus in conserved and novel miRNAs, respectively(Tables 5 and S6). Categories based on molecular function revealed
Author's personal copy
218 F. Guzman et al. / Plant Science 210 (2013) 214– 223Ta
ble
4C
har
acte
rist
ics
of
nov
el
pre
-miR
NA
s
iden
tifi
ed
in
V. c
arin
ata.
pre
-miR
NA
Prec
urs
or
miR
NA
s
Mat
ure
miR
NA
s
Con
tig
nam
e
Len
gth
MFE
AM
FE
MFE
I
5p
mor
e
abu
nd
ant
3p
mor
e
abu
nd
ant
Tota
l rea
ds
Tota
l iso
miR
NA
s
Sequ
ence
mem
ber
Rea
ds
Sequ
ence
mem
ber
Rea
ds
vca-
MIR
001
con
tig
1041
45
178
−60.
60
−34.
04
−0.8
7
AA
TAA
TAA
TTC
GTT
GG
GC
CC
AA
AC
12
TGA
AC
CC
AA
CG
AG
TTA
TTA
TTG
CT
48
120
24vc
a-M
IR00
2
con
tig
1041
45
180
−72.
80
−45.
50
−1.2
3
TAA
TAA
CTC
GTT
GG
GTT
CA
AA
CC
A
20
GG
TTTG
GG
CC
CA
AC
GA
ATT
1
42
9vc
a-M
IR00
3
con
tig
1093
07
90
−26.
20
−29.
11
−1.0
9
AA
ATA
ATG
ATG
TGTC
GTT
GA
TGC
G
78
ATA
TAG
CA
TTA
TTA
TGG
TTG
ATG
A
4
159
17vc
a-M
IR00
4
con
tig
1233
19
88
−55.
80
−63.
41
−1.4
0
TTG
TGC
TTTT
AG
AA
GC
ATT
CC
GG
C
18
GG
ATC
GG
AA
TGC
TTC
TAA
TAG
CA
C
1
29
8vc
a-M
IR00
5
con
tig
1245
61
222
−109
.50
−49.
32
−1.2
3
TTC
CG
GTC
TGG
CTT
AC
GA
CA
T
158
TTG
AC
AG
CTT
TCA
AA
GG
GTT
T
2444
3035
58vc
a-M
IR00
6
con
tig
1256
68
145
−58.
20
−40.
14
−1.1
4
TCC
TGG
ATA
ATA
CA
ATA
ATT
GC
GC
11
AA
TTA
TTG
TATC
ATT
TAG
GG
GG
GA
9
31
9vc
a-M
IR00
7
con
tig
7768
1
105
−32.
60
−31.
05
−1.2
5
AG
GA
CA
GA
TCA
GA
TTC
ATT
GA
ATC
4
ATT
CA
ATG
AA
TCTA
TCTG
TCA
AT
8
23
8vc
a-M
IR00
8
con
tig
7928
6
90
−37.
00
−40.
66
−1.4
8
AG
AC
AG
ATC
AG
GTA
CA
TTG
AA
TCT
144
AA
TGTA
TCTA
TCTG
TCA
ATA
GTA
A
7
174
21vc
a-M
IR00
9
con
tig
1470
42
230
−108
.00
−46.
96
−1.1
9
AG
TAA
AG
ATG
GG
ATG
GA
TGG
CA
GA
1112
AG
ATT
CA
CTA
GG
TGG
TCC
ATG
GC
T
25
1862
76vc
a-M
IR01
0
con
tig
1272
91
102
−48.
10
−47.
16
−1.1
2
TGC
GG
TTA
GG
ATT
ATA
GTG
GT
1027
AC
TATT
TCA
CC
AA
CA
AC
CC
AC
TAA
122
5183
95vc
a-M
IR01
1
con
tig
7253
910
9
−54.
30−4
9.82
−1.2
3
TTC
AG
AG
CA
TCTG
TTC
AG
TGC
717
TAG
GTG
CTC
CG
AA
TCA
TCTA
GG
23
1199
37vc
a-M
IR01
2
con
tig
1770
6
141
−84.
60
−60.
00
−1.0
2
TCC
GC
CA
CG
TCA
TCTG
TGTA
C
99
TCA
CG
GA
TGA
CG
TGG
CG
GA
CC
2399
2992
35vc
a-M
IR01
3
con
tig
6846
1
86
−50.
70
−58.
95
−1.2
4
TTC
GC
AG
GA
GA
GA
TGA
TGC
CG
457
GC
ATC
GTC
TCTC
CG
GC
GA
CG
G
16
562
19vc
a-M
IR01
4
con
tig
8322
8
235
−82.
80
−35.
23
−0.8
8
GTT
GTC
CTA
TGTT
CC
GG
CA
GA
9
TGC
AG
GA
GTG
TAG
GA
TAA
CG
C
506
527
10vc
a-M
IR01
5
con
tig
9188
9
188
−75.
00
−39.
89
−1.3
4
TCC
AC
AG
GC
TTTC
TTG
AA
CTA
310,
363
TTC
AA
GA
AA
GTT
CG
TGG
AA
A
4962
335,
732
56vc
a-M
IR01
6
con
tig
7074
1
79
−48.
00
−60.
76
−1.0
2
TCA
GG
AG
AG
ATG
AC
AC
CC
A
169
GG
TGTC
AC
CTC
TCC
TGG
AC
55
306
14
that the target genes were related to eight functions and thefour most frequent terms were hydrolase activity, protein binding,small molecule binding and nucleic acid binding. In the categoryof cellular component, the analysis revealed that the target genesare expressed on a high percentage in intracellular membrane-bounded organelle and plastid.
3.6. Biological confirmation of identified miRNAs and targets
The RT-qPCR method was used to validate the expression of themost abundant mature sequences from seventeen miRNAs (vca-miR156-1, vca-miR156-2, vca-miR159, vca-miR166, vca-miR168,vca-miR393, vca-miR396, vca-miR397, vca-miR528, vca-miR535,vca-miR009, vca-miR010, vca-miR011, vca-miR012, vca-miR013,vca-miR014 and vca-miR015) and some of their targets. We con-firmed that these miRNAs were expressed in leaf and ovary tissues(results not shown). For the miRNAs vca-miR166 and vca-miR393we found that their expressions were inversely correlated with theexpression of some of its targets (Fig. S3).
4. Discussion
Several miRNAs have been identified via computational orexperimental approaches in different plant families, but there is nosequence or functional information available about miRNAs in anyBromeliaceae species. V. carinata is a typical representative of theBromeliaceae in the Brazilian Atlantic Rainforest biome and there-fore emerges as interesting model for this botanical family. We usedIllumina technology for deep sequencing of sRNA library to identifymiRNAs in this species and results are discussed.
Size profile is an important feature to distinguish miRNA fromother sRNAs and most of the mature miRNAs are of 21–25 nt. Thelength distribution pattern indicated that the majority of the sRNAwas 24 nt, accounting for ∼51% of the total reads, followed by 21 nt(∼19%), 22 nt (∼9%) and 23 nt (∼6%). This distribution pattern ishighly consistent with previous studies in other plants [42,43] anddifferences in distribution pattern (e.g., for some species 21 nt wasthe most abundant) may reflect the composition of the sRNAs ofa given plant species according to tissue (or cell) and physiologi-cal conditions [44]. Another explanation is that molecules of 24 ntare the typical size of Dicer-digestion product and are often themost abundant endogenous plant sRNAs [45,46]. sRNAs of 24 nt areknown to be involved in heterochromatin modification, especiallyin a genome with a high content of repetitive sequences [47,48]. Thehigh percentage of 24 nt sRNAs found in V. carinata could reflect thecomplexity of the genome of bromeliads [49].
Computational methods have been successfully used to predicthundreds of miRNAs in a wide variety of plant species. Illuminasequencing is a powerful tool to estimate expression profiles ofmiRNA. This technology provides the resources to know the abun-dance of various miRNA families and even distinguish betweendifferent members of a given family. In our case, we found sig-nificant differences among the number and abundance of themembers identified in each family, similarity observed in otherstudies [50,51], suggesting that this wide variation is due to a func-tional divergence in the conserved miRNA families. In this study, weidentified 182 conserved plant miRNAs distributed in 54 miRNAsfamilies, very similar to found for barley, using the same approach(126 miRNA distributed in 58 miRNA families) [52].
Although conserved miRNAs have been identified by sequencingand by comparison again miRNAs identified in other species, mostof plant species-specific miRNAs remain unidentified due to the lev-els of expression of these are very low producing a small numberof reads sequenced in comparison to the conserved miRNAs [53].For this reason, we used a new approach to identify novel miRNAs
Author's personal copy
F. Guzman et al. / Plant Science 210 (2013) 214– 223 219
1
10
100
1.000
10.00 0
100.00 0
1.000 .00 0
miR
156
miR
157
miR
158
miR
159
miR
160
miR
161
miR
162
miR
164
miR
165
miR
166
miR
167
miR
168
miR
169
miR
170
miR
171
miR
172
miR
319
miR
390
miR
393
miR
394
miR
395
miR
396
miR
397
miR
398
miR
399
miR
400
miR
403
miR
408
miR
447
miR
528
miR
529
miR
530
miR
535
miR
824
miR
827
miR
828
miR
844
miR
845
miR
858
miR
859
miR
866
miR
1030
miR
1507
miR
1509
miR
1885
miR
2118
miR
2934
miR
3711
miR
4995
miR
5083
miR
5139
miR
5179
miR
5368
miR
5538
miRNA famili es
19
2
3
7
3
3
3
7
2
20
10
3
11
1
7 5
1
1
1
5
15
5 4
2
2
1
1
1
22
1
11
1
11
11
1
1 11 1 1 1
11
1
1 1 111
miRNA families
10
0
1
2
3
4
5
6
Tota
l rea
ds (l
og10
)
Fig. 2. Number of identified miRNAs in each conserved miRNA family in plants. The values above the bars indicate the number of members identified fot each conservedmiRNA family.
in species without availability of genomic resources using simul-taneously sequences from sRNA and RNAseq libraries. Using thismethodology, we identified 16 potential miRNAs candidates spe-cific for V. carinata. In all cases, complementary antisense miRNA
(miRNA*) was identified providing more evidence for their exist-ence as novel miRNAs [54,55].
To understand the function of the identified miRNAs, theirputative targets were predicted using a bioinformatics approach.
Fig. 3. Predicted secondary structures of conserved and novel miRNAs of V. carinata. Secondary structures of vca-nMIR013 and vca-nMIR016 precursors, their locations andthe expression of small RNAs mapped onto these precursors. The sequences corresponding to the most abundant mature miRNAs in the 5p and 3p arms are labeled in redand purple, respectively. Values on the left side of the miRNA sequence represent read counts in the leaf library.
Author's personal copy
220 F. Guzman et al. / Plant Science 210 (2013) 214– 223
Table 5Predicted putative targets of novel miRNAs in V. carinata.
Mature miRNA Contig Scoreb Inhibition Putative function
vca-miR001-3p contig 98870 2 Cleavage Rna binding proteinvca-miR004-5p contig 69835 rc 2.5 Cleavage Kinesin family member 6vca-miR005-3p contig 119481a 2 Cleavage Tetratricopeptide repeat-containing protein
contig 75544a 2.5 Cleavage Two-component response regulator arr3contig 66635 rca 3 Cleavage Heat shock 70 kda mitocondrialcontig 67788a 3 Cleavage Pyruvate kinase isozyme chloroplasticcontig 67948 rc 3.5 Cleavage E3 ubiquitin ligase big brothercontig 69145 3.5 Cleavage Fertility restorercontig 95670 3.5 Cleavage Probable lrr receptor-like serine threonine-protein kinasecontig 143421 3.5 Cleavage R3h domain-containing proteincontig 70604 rc 3.5 Cleavage Reticulon-like protein b17contig 67529 3.5 Cleavage Tho complex subunit 1
vca-miR006-5p contig 139947 rc 3 Cleavage Lysosomal alpha-mannosidasecontig 75839 3 Translation Probable beta- -galactosyltransferase 2contig 140146 3.5 Translation Chloroplastic group iia intron splicing facilitator chloroplasticcontig 81980 rc 3.5 Cleavage Omega-amidase nit2
vca-miR007-3p contig 140165 rc 0.5 Cleavage Calmodulin-like proteincontig 72901 rc 3 Cleavage Chlorophyllase- chloroplasticcontig 68350 3 Cleavage Soluble starch synthase chloroplastic amyloplasticcontig 140661 rc 3.5 Translation Tubulin-specific chaperone d
vca-miR008-5p contig 69155 rc 2 Translation Probable protein phosphatase 2c 18contig 140200 rc 3 Cleavage Abc transporter f family member 3contig 71460 rc 3 Translation Membrane proteincontig 79377 rc 3.5 Cleavage 1-phosphatidylinositol 3-phosphate 5-kinase fab1contig 67906 rc 3.5 Cleavage Aminopeptidase c
vca-miR009-5p contig 140774 rca 3 Cleavage Mitochondrial substrate carrier family proteincontig 118843 rc 3.5 Cleavage Cellulose synthase
vca-miR010-5p contig 71336a 1 Cleavage Abc transporter i family member 1vca-miR011-5p contig 89802a 3 Cleavage Dna binding protein
contig 140009a 3 Cleavage Methyl- -binding domain-containing protein 9contig 67880 rc 3.5 Cleavage Phenylalanyl-trna synthetase beta subunitcontig 122123 3.5 Cleavage Serine threonine-protein kinase nek1
vca-miR012-3p contig 71550 rca 3 Translation Ribonucleoside-diphosphate reductase small chaincontig 69862a 3 Cleavage Wrky transcription factor 11contig 34475 rc 3.5 Cleavage Oligopeptidase bcontig 140456 rca 3.5 Translation Protein resurrection 1
vca-miR013-5p contig 139979a 2.5 Cleavage Calcium-transporting atpase plasma membrane-typecontig 66522a 3 Cleavage Glucan -beta-glucosidasecontig 66366a 3 Cleavage Glutamyl-trna amidotransferase subunit acontig 68601a 3 Cleavage Probable protein phosphatase 2c 38contig 118464 rc 3.5 Cleavage Casein kinase ii subunit betacontig 705 3.5 Cleavage Retrotransposon ty3-gypsy subclasscontig 66929 rc 3.5 Translation Transcription factor gte4contig 141231 rc 3.5 Translation U3 small nucleolar ribonucleoprotein protein imp4
vca-miR014-3p contig 83228 rca 0 Cleavage Nucleoid dna-binding protein cnd41contig 66712a 3 Cleavage Uncharacterized protein ycf36contig 140254 rc 3.5 Cleavage Pumilio homolog 1
vca-miR015-5p contig 140456 rca 3 Translation Protein resurrection1contig 72376 rc 3.5 Translation Ethylene-responsive transcription factor 1contig 140188 rc 3.5 Cleavage Protein root hair defective 3contig 66270 rc 3.5 Cleavage Ubiquitin carboxyl-terminal hydrolase 7
In bold: putative targets with response to abiotic stimulus.a Targets evaluated in RTqPCR experiments.b psRNATarget value.
As expected, many putative targets of conserved miRNAs weretranscriptions factors, similar reported in other studies [56–58].Among the most important transcription factors identified, wefound squamosa promoter binding protein (SBP)-like (SPL) genes,which are targets of MIR156 and MIR535 families, affecting diversedevelopmental processes such as leaf development, shoot mat-uration, phase change and flowering in plants [59,60]. We alsoidentified the auxin response factor (ARF), a plant-specific family ofDNA binding proteins involved in hormone signal transduction thatare targets of MIR160 and MIR167 families [61,62]. Other importantgenes identified and targeted by MIR156 and MIR396 were penta-tricopeptide repeat genes (PPR) which show high importance inthe regulation of gene expression. This gene family is implicated inpost-transcriptional processes such as splicing, editing, processingand translation specifically in organelles such as mitochondria andchloroplasts [63].
In the case of the novel miRNAs, we also identified othertranscription factors as targets. WRKY is the putative target ofvca-nMIR012 and belongs to a large family of transcription factorsthat regulate various physiological processes, including pathogendefense, senescence, trichome development and signal transduc-tion [64]. The ethylene-responsive transcription factor 1 (ESR1)was targeted by vca-nMIR015 and is involved in regulating geneexpression patterns in meristems, modulating organ developmentand promoting shoot regeneration through the cytokinin signalingpathway [65,66]. Other important putative target identified for vca-nMIR013 was the transcription factor GTE4. It is involved in theactivation and maintenance of cell division in the meristems andby this controls cell numbers in differentiated organs [67].
Based in the GO analysis, we found 14 candidate genes withresponse to abiotic stimulus. These results indicate that the puta-tive targets from identified miRNAs are involved in a large number
Author's personal copy
F. Guzman et al. / Plant Science 210 (2013) 214– 223 221
Fig. 4. Gene categories and the distribution of target genes of the most abundant mature miRNAs in the conserved and novel pre-miRNA identified in V. carinata. (A) Molecularfunction, (B) biological process and (C) cellular component.
of primary physiological and metabolic processes and are likelyinvolved in the adaptation of V. carinata to different types of stressand environmental conditions observed in natura. Similar resultsin other plants confirmed the miRNA molecular regulation in dif-ferent plant abiotic stresses [68]. Future experimental validationwill determine how many of these predicted targets are genuinelytargeted by miRNAs in V. carinata in specific environmental andphysiological conditions.
This work provides a comprehensive investigation of the miRNApopulation in leaves of the bromeliad species V. carinata. The resultssupport the currently idea that miRNAs are conserved in plantsand play an important role in several physiological processes asshown by bioinformatics analysis of miRNA putative target genes.Although the exact function of these putative target genes remainsto be confirmed, the present study provides novel insights for thecomprehension of the molecular processes involved in the con-served miRNA function.
Acknowledgements
This work was sponsored by PDJ (509828/2010-8) and Pro-ductivity and Research Grant (307868/2011-7) from the National
Council for Scientific and Technological Development (CNPq,Brazil).
Appendix A. Supplementary data
Supplementary data associated with this article can befound, in the online version, at http://dx.doi.org/10.1016/j.plantsci.2013.05.013.
References
[1] A.C. Mallory, H. Vaucheret, Functions of microRNAs and related small RNAs inplants, Nat. Genet. 38 (Suppl.) (2006) S31–S36.
[2] A.M. Denli, B.B.J. Tops, R.H.A. Plasterk, R.F. Ketting, G.J. Hannon, Processingof primary microRNAs by the Microprocessor complex, Nature 432 (2004)231–235.
[3] M.W. Jones-Rhoades, D.P. Bartel, B. Bartel, MicroRNAS and their regulatory rolesin plants, Annu. Rev. Plant Biol. 57 (2006) 19–53.
[4] R. Sunkar, Y.-F. Li, G. Jagadeeswaran, Functions of microRNAs in plant stressresponses, Trends Plant Sci. 17 (2012) 196–203.
[5] T. Dezulian, J. Palatnik, D. Huson, D. Weigel, Conservation and divergence ofmicroRNA families in plants, Genome Biol. 6 (2005) P13.
[6] E. Allen, Z. Xie, A.M. Gustafson, G.-H. Sung, J.W. Spatafora, J.C. Carrington, Evo-lution of microRNA genes by inverted duplication of target gene sequences inArabidopsis thaliana, Nat. Genet. 36 (2004) 1282–1290.
Author's personal copy
222 F. Guzman et al. / Plant Science 210 (2013) 214– 223
[7] H.E. Luther, An alphabetical list of bromeliad binomials, in: Sarasota BromeliadSociety and Marie Selby Botanical Gardens, 2010.
[8] T.A. Ranker, D.E. Soltis, P.S. Soltis, A.J. Gilmartin, Subfamilial relationships of theBromeliaceae: evidence from chloroplast DNA restriction site variation, Syst.Bot. 15 (1990) 425–434.
[9] D.H. Benzing, Bromeliaceae, in: Profile of an Adaptive Radiation, CambridgeUniversity Press, Cambridge, 2000.
[10] T.J. Givnish, M.H.J. Barfuss, B. Van Ee, R. Riina, K. Schulte, R. Horres, P.A. Gonsiska,R.S. Jabaily, D.M. Crayn, J.A.C. Smith, K. Winter, G.K. Brown, T.M. Evans, B.K.Holst, H. Luther, W. Till, G. Zizka, P.E. Berry, K.J. Sytsma, Phylogeny, adaptiveradiation, and historical biogeography in Bromeliaceae: insights from an eight-locus plastid phylogeny, Am. J. Bot. 98 (2011) 872–895.
[11] S. Porembski, G. Martinelli, R. Ohlemuller, W. Barthlott, Diversity and ecologyof saxicolous vegetation mats on inselbergs in the Brazilian Atlantic rainforest,Divers. Distrib. 4 (1998) 107–119.
[12] L.B. Smith, R.J. Downs, Flora Neotropica: Monography 14, Part 2 – Tilland-sioideae, The New York Botanical Garden Hafner Press, New York, 1977.
[13] R. Schmieder, R. Edwards, Quality control and preprocessing of metagenomicdatasets, Bioinformatics (Oxford, Engl.) 27 (2011) 863–864.
[14] F. Jühling, M. Mörl, R.K. Hartmann, M. Sprinzl, P.F. Stadler, J. Pütz, tRNAdb 2009:compilation of tRNA sequences and tRNA genes, Nucleic Acids Res. 37 (2009)D159–D162.
[15] E. Pruesse, C. Quast, K. Knittel, B.M. Fuchs, W. Ludwig, J. Peplies, F.O. Glöckner,SILVA: a comprehensive online resource for quality checked and aligned ribo-somal RNA sequence data compatible with ARB, Nucleic Acids Res. 35 (2007)7188–7196.
[16] S. He, C. Liu, G. Skogerbø, H. Zhao, J. Wang, T. Liu, B. Bai, Y. Zhao, R. Chen, NON-CODE v2.0: decoding the non-coding, Nucleic Acids Res. 36 (2008) D170–D172.
[17] B. Langmead, C. Trapnell, M. Pop, S.L. Salzberg, Ultrafast and memory-efficientalignment of short DNA sequences to the human genome, Genome Biol. 10(2009) R25.
[18] R. Li, C. Yu, Y. Li, T.-W. Lam, S.-M. Yiu, K. Kristiansen, J. Wang, SOAP2: animproved ultrafast tool for short read alignment, Bioinformatics (Oxford, Engl.)25 (2009) 1966–1967.
[19] I. Milne, M. Bayer, L. Cardle, P. Shaw, G. Stephen, F. Wright, D. Marshall, Tablet– next generation sequence assembly visualization, Bioinformatics (Oxford,Engl.) 26 (2010) 401–402.
[20] S. Moxon, F. Schwach, T. Dalmay, D. Maclean, D.J. Studholme, V. Moulton,A toolkit for analysing large-scale plant small RNA datasets, Bioinformatics(Oxford, Engl.) 24 (2008) 2252–2253.
[21] B.H. Zhang, X.P. Pan, S.B. Cox, G.P. Cobb, T.A. Anderson, Evidence that miRNAsare different from other RNAs, CMLS 63 (2006) 246–254.
[22] G. Pertea, X. Huang, F. Liang, V. Antonescu, R. Sultana, S. Karamycheva, Y. Lee,J. White, F. Cheung, B. Parvizi, J. Tsai, J. Quackenbush, TIGR Gene Indices clus-tering tools (TGICL): a software system for fast clustering of large EST datasets,Bioinformatics (Oxford, Engl.) 19 (2003) 651–652.
[23] X. Huang, A. Madan, CAP3: A DNA sequence assembly program, Genome Res. 9(1999) 868–877.
[24] X. Dai, P.X. Zhao, psRNATarget: a plant small RNA target analysis server, NucleicAcids Res. 39 (2011) W155–W159.
[25] A. Conesa, S. Götz, Blast2GO: a comprehensive suite for functional analysis inplant genomics, Int. J. Plant Genom. 2008 (2008) 619832.
[26] F.R. Kulcheski, F.C. Marcelino-Guimaraes, A.L. Nepomuceno, R.V. Abdelnoor, R.Margis, The use of microRNAs as reference genes for quantitative polymerasechain reaction in soybean, Anal. Biochem. 406 (2010) 185–192.
[27] F.R. Kulcheski, L.F. de Oliveira, L.G. Molina, M.P. Almerao, F.A. Rodrigues, J.Marcolino, J.F. Barbosa, R. Stolf-Moreira, A.L. Nepomuceno, F.C. Marcelino-Guimaraes, R.V. Abdelnoor, L.C. Nascimento, M.F. Carazzolle, G.A. Pereira, R.Margis, Identification of novel soybean microRNAs involved in abiotic andbiotic stresses, BMC Genom. 12 (2011) 307.
[28] C. Chen, D.A. Ridzon, A.J. Broomer, Z. Zhou, D.H. Lee, J.T. Nguyen, M. Barbisin,N.L. Xu, V.R. Mahuvakar, M.R. Andersen, K.Q. Lao, K.J. Livak, K.J. Guegler, Real-time quantification of microRNAs by stem-loop RT-PCR, Nucleic Acids Res. 33(2005) e179.
[29] J. Vandesompele, K. De Preter, F. Pattyn, B. Poppe, N. Van Roy, A. De Paepe,F. Speleman, Accurate normalization of real-time quantitative RT-PCR data bygeometric averaging of multiple internal control genes, Genome Biol. 3 (2002),research0034.
[30] K.J. Livak, T.D. Schmittgen, Analysis of relative gene expression data using real-time quantitative PCR and the 2(−Delta Delta C(T)) Method, Methods 25 (2001)402–408.
[31] R. Sunkar, G. Jagadeeswaran, In silico identification of conserved microRNAs inlarge number of diverse plant species, BMC Plant Biol. 8 (2008) 37.
[32] T.P. Frazier, F. Xie, A. Freistaedter, C.E. Burklew, B. Zhang, Identification andcharacterization of microRNAs and their target genes in tobacco (Nicotianatabacum), Planta 232 (2010) 1289–1308.
[33] T. Unver, I. Parmaksiz, E. Dündar, Identification of conserved micro-RNAs andtheir target transcripts in opium poppy (Papaver somniferum L.), Plant Cell Rep.29 (2010) 757–769.
[34] F. Guzman, M.P. Almerao, A.P. Korbes, G. Loss-Morais, R. Margis, Identifica-tion of microRNAs from Eugenia uniflora by high-throughput sequencing andbioinformatics analysis, PLoS ONE 7 (2012) e49811.
[35] A.P. Korbes, R.D. Machado, F. Guzman, M.P. Almerao, L.F. de Oliveira, G. Loss-Morais, A.C. Turchetto-Zolet, A. Cagliari, F. dos Santos Maraschin, M. Margis-Pinheiro, R. Margis, Identifying conserved and novel microRNAs in developingseeds of Brassica napus using deep sequencing, PLoS ONE 7 (2012) e50663.
[36] Q.-X. Song, Y.-F. Liu, X.-Y. Hu, W.-K. Zhang, B. Ma, S.-Y. Chen, J.-S. Zhang, Identi-fication of miRNAs and their target genes in developing soybean seeds by deepsequencing, BMC Plant Biol. 11 (2011) 5.
[37] R.D. Morin, G. Aksay, E. Dolgosheina, H.A. Ebhardt, V. Magrini, E.R. Mardis, S.C.Sahinalp, P.J. Unrau, Comparative analysis of the small RNA transcriptomes ofPinus contorta and Oryza sativa, Genome Res. 18 (2008) 571–584.
[38] L. Guo, Z. Lu, Global expression analysis of miRNA gene cluster and familybased on isomiRs from deep sequencing data, Comput. Biol. Chem. 34 (2010)165–171.
[39] L. Guo, H. Li, T. Liang, J. Lu, Q. Yang, Q. Ge, Z. Lu, Consistent isomiR expressionpatterns and 3′ addition events in miRNA gene clusters and families implicatefunctional and evolutionary relationships, Mol. Biol. Rep. 39 (2012) 6699–6706.
[40] R. Schwab, J.F. Palatnik, M. Riester, C. Schommer, M. Schmid, D. Weigel, Specificeffects of microRNAs on the plant transcriptome, Dev. Cell 8 (2005) 517–527.
[41] B. Zhang, X. Pan, E.J. Stellwag, Identification of soybean microRNAs and theirtargets, Planta 229 (2008) 161–182.
[42] G. Szittya, S. Moxon, D.M. Santos, R. Jing, M.P.S. Fevereiro, V. Moulton, T. Dalmay,High-throughput sequencing of Medicago truncatula short RNAs identifies eightnew miRNA families, BMC Genom. 9 (2008) 593.
[43] C.-Z. Zhao, H. Xia, T.P. Frazier, Y.-Y. Yao, Y.-P. Bi, A.-Q. Li, M.-J. Li, C.-S. Li, B.-H.Zhang, X.-J. Wang, Deep sequencing identifies novel and conserved microRNAsin peanuts (Arachis hypogaea L.), BMC Plant Biol. 10 (2010) 3.
[44] Q.-H. Zhu, A. Spriggs, L. Matthew, L. Fan, G. Kennedy, F. Gubler, C. Helliwell,A diverse set of microRNAs and microRNA-like small RNAs in developing ricegrains, Genome Res. 18 (2008) 1456–1465.
[45] H. Vaucheret, Post-transcriptional small RNA pathways in plants: mechanismsand regulations, Genes Develop. 20 (2006) 759–771.
[46] R. Rajagopalan, H. Vaucheret, J. Trejo, D.P. Bartel, A diverse and evolutionar-ily fluid set of microRNAs in Arabidopsis thaliana, Genes Develop. 20 (2006)3407–3425.
[47] A.J. Herr, Pathways through the small RNA world of plants, FEBS Lett. 579 (2005)5879–5888.
[48] F. Vazquez, Arabidopsis endogenous small RNAs: highways and byways, TrendsPlant Sci. 11 (2006) 460–468.
[49] F.C. Favoreto, C.R. Carvalho, A.B.P. Lima, A. Ferreira, W.R. Clarindo, Genome sizeand base composition of Bromeliaceae species assessed by flow cytometry,Plant Syst. Evol. 298 (2012) 1185–1193.
[50] D. Gonzalez-Ibeas, J. Blanca, L. Donaire, M. Saladié, A. Mascarell-Creus, A. Cano-Delgado, J. Garcia-Mas, C. Llave, M.A. Aranda, Analysis of the melon (Cucumismelo) small RNAome by high-throughput pyrosequencing, BMC Genom. 12(2011) 393.
[51] J.R. Puzey, A. Karger, M. Axtell, E.M. Kramer, Deep annotation of Popu-lus trichocarpa microRNAs from diverse tissue sets, PLoS ONE 7 (2012)e33034.
[52] S. Lv, X. Nie, L. Wang, X. Du, S.S. Biradar, X. Jia, S. Weining, Identificationand Characterization of MicroRNAs from Barley (Hordeum vulgare L.) by High-Throughput Sequencing, Int. J. Mol. Sci. 13 (2012) 2973–2984.
[53] N. Fahlgren, M.D. Howell, K.D. Kasschau, E.J. Chapman, C.M. Sullivan, J.S.Cumbie, S.A. Givan, T.F. Law, S.R. Grant, J.L. Dangl, J.C. Carrington, High-throughput sequencing of Arabidopsis microRNAs: evidence for frequent birthand death of MIRNA genes, PLoS ONE 2 (2007) e219.
[54] G. Martínez, J. Forment, C. Llave, V. Pallás, G. Gómez, High-throughput sequenc-ing, characterization and detection of new and conserved cucumber miRNAs,PLoS ONE 6 (2011) e19523.
[55] C. Wang, X. Wang, N.K. Kibet, C. Song, C. Zhang, X. Li, J. Han, J. Fang, Deepsequencing of grapevine flower and berry short RNA library for discovery ofnovel microRNAs and validation of precise sequences of grapevine microRNAsdeposited in miRBase, Physiol. Plant. 143 (2011) 64–81.
[56] V. Pantaleo, G. Szittya, S. Moxon, L. Miozzi, V. Moulton, T. Dalmay, J. Burgyan,Identification of grapevine microRNAs and their targets using high-throughputsequencing and degradome analysis, Plant J. Cell. Mol. Biol. 62 (2010)960–976.
[57] D. De Paola, F. Cattonaro, D. Pignone, G. Sonnante, The miRNAome of globeartichoke: conserved and novel micro RNAs and target analysis, BMC Genom.13 (2012) 41.
[58] M. Colaiacovo, A. Subacchi, P. Bagnaresi, A. Lamontanara, L. Cattivelli, P. Fac-cioli, A computational-based update on microRNAs and their targets in barley(Hordeum vulgare L.), BMC Genom. 11 (2010) 595.
[59] G. Wu, M.Y. Park, S.R. Conway, J.-W. Wang, D. Weigel, R.S. Poethig, Thesequential action of miR156 and miR172 regulates developmental timing inArabidopsis, Cell 138 (2009) 750–759.
[60] M. Shikata, T. Koyama, N. Mitsuda, M. Ohme-Takagi, Arabidopsis SBP-box genesSPL10, SPL11 and SPL2 control morphological change in association with shootmaturation in the reproductive phase, Plant Cell Physiol. 50 (2009) 2133–2145.
[61] M.-F. Wu, Q. Tian, J.W. Reed, Arabidopsis microRNA167 controls patterns ofARF6 and ARF8 expression, and regulates both female and male reproduction,Development (Cambr., Engl.) 133 (2006) 4211–4218.
[62] J.H. Yang, S.J. Han, E.K. Yoon, W.S. Lee, Evidence of an auxin signal pathway,microRNA167-ARF8-GH3, and its response to exogenous auxin in cultured ricecells, Nucleic Acids Res. 34 (2006) 1892–1899.
[63] S. Fujii, I. Small, The evolution of RNA editing and pentatricopeptide repeatgenes, New Phytol. 191 (2011) 37–47.
[64] T. Eulgem, P.J. Rushton, S. Robatzek, I.E. Somssich, The WRKY superfamily ofplant transcription factors, Trends Plant Sci. 5 (2000) 199–206.
[65] H. Banno, Y. Ikeda, Q.W. Niu, N.H. Chua, Overexpression of Arabidopsis ESR1induces initiation of shoot regeneration, Plant Cell 13 (2001) 2609–2619.
Author's personal copy
F. Guzman et al. / Plant Science 210 (2013) 214– 223 223
[66] T. Kirch, R. Simon, M. Grunewald, W. Werr, The Dornroschen/Enhancer of shootregeneration1 gene of Arabidopsis acts in the control of meristem cell fate andlateral organ development, Plant Cell 15 (2003) 694–705.
[67] C.A. Airoldi, F.D. Rovere, G. Falasca, G. Marino, M. Kooiker, M.M. Altamura, S. Cit-terio, M.M. Kater, The Arabidopsis BET bromodomain factor GTE4 is involved in
maintenance of the mitotic cell cycle during plant development, Plant Physiol.152 (2010) 1320–1334.
[68] S. Zhang, Y. Yue, L. Sheng, Y. Wu, G. Fan, A. Li, X. Hu, M. ShangGuan, C. Wei,PASmiR: a literature-curated database for miRNA molecular regulation in plantresponse to abiotic stress, BMC Plant Biol. 13 (2013) 1–8.