The conflicting themes of nonviolence and violence in ancient ...
Integrating Incomplete Fossils by Isolating Conflicting Signal in Saturated and Non-Independent...
Transcript of Integrating Incomplete Fossils by Isolating Conflicting Signal in Saturated and Non-Independent...
[15:53 20/4/2014 Sysbio-syu022.tex] Page: 1 1–19
Syst. Biol. 0(0):1–19, 2014© The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved.For Permissions, please email: [email protected]:10.1093/sysbio/syu022
Integrating Incomplete Fossils by Isolating Conflicting Signal in Saturated andNon-Independent Morphological Characters
LILIANA M. DÁVALOS1,2,∗, PAÚL M. VELAZCO3, OMAR M. WARSI1, PETER D. SMITS4, and NANCY B. SIMMONS3
1Department of Ecology and Evolution, Stony Brook University, 650 Life Sciences Building Stony Brook, NY 11794, USA, 2Consortium forInter-Disciplinary Environmental Research, School of Marine and Atmospheric Sciences, Stony Brook University, 129 Dana Hall, Stony Brook, NY 11794,USA, 3Division of Vertebrate Zoology (Mammalogy), American Museum of Natural History, NY 10024, USA, and 4Committee on Evolutionary Biology,
University of Chicago, 1025 E. 57th Street, Culver Hall 402, Chicago, IL 60637, USA∗Correspondence to be sent to: Department of Ecology and Evolution, Stony Brook University, 650 Life Sciences Building, Stony Brook, NY 11794, USA;
E-mail: [email protected].
Received 20 March 2013; reviews returned 28 May 2013; accepted 18 March 2014Associate Editor: Mark Fishbein
Abstract.—Morphological characters are indispensable in phylogenetic analyses for understanding the pattern, process, andtempo of evolution. If characters are independent and free of systematic errors, then combining as many different kinds ofcharacters as are available will result in the best-supported phylogenetic hypotheses. But since morphological charactersare subject to natural selection for function and arise from the expression of developmental pathways, they may not beindependent, a situation that may amplify any underlying homoplasy. Here, we use new dental and multi-locus geneticdata from bats (Mammalia: Chiroptera) to quantify saturation and similarity in morphological characters and introduce twolikelihood-based approaches to identify strongly conflicting characters and integrate morphological and molecular data.We implement these methods to analyze the phylogeny of incomplete Miocene fossils in the radiation of Phyllostomidae(New World Leaf-nosed Bats), perhaps the most ecologically diverse family of living mammals. Morphological charactersproduced trees incongruent with molecular phylogenies, were saturated, and showed rates of change higher than mostmolecular substitution rates. Dental characters encoded variation similar to that in other morphological characters, whilemolecular characters encoded highly dissimilar variation in comparison. Saturation and high rates of change indicaterandomization of phylogenetic signal in the morphological data, and extensive similarity suggests characters are non-independent and errors are amplified. To integrate the morphological data into tree building while accounting forhomoplasy, we used statistical molecular scaffolds and combined phylogenetic analyses excluding a small subset of stronglyconflicting dental characters. The phylogenies revealed the Miocene nectar-feeding †Palynephyllum nests within the crownnectar-feeding South American subfamily Lonchophyllinae, while the Miocene genus †Notonycteris is sister to the extantcarnivorous Vampyrum. These relationships imply new calibration points for timing of radiation of the ecologically diversePhyllostomidae. [Chiroptera; conflict; dentition; morphology; Phyllostomidae; saturation; scaffold; systematic error.]
Fossil remains provide crucial data for understandingthe pattern, process, and tempo of evolution (Wienset al. 2010; Ronquist et al. 2012a), and for this reasonphylogenetic analyses of morphological characters areindispensable. Combining data from as many charactersystems as are available can also provide the mostpowerful inference of phylogeny, and the strongest testof evolutionary hypotheses (de Queiroz and Gatesy2007; Hermsen and Hendricks 2008). This rationale forintegrating morphological data through combinationcrucially depends on both the absence of systematic errorand independence among characters (de Queiroz andGatesy 2007; Springer et al. 2007). However, differentcharacter systems may have different properties ofphylogenetic signal (e.g., arising from very differentrates of evolution or the influence of natural selection).Integrating these systems in phylogenetic analyses thenrequires accounting for these differences (Wake 1991;Kangas et al. 2004; Eick et al. 2005; Springer et al. 2007).
When analyses fail to accurately model evolutionaryprocesses (e.g., by assuming neutral evolution whena character is under selection), additional data maymislead through bias. This is called systematic error(Phillips et al. 2004). Systematic errors have been foundwhen constraints on character states in nucleotides andmultiple substitutions at the same site produce endstates that are random with respect to shared history
(Simmons and Freudenstein 2003), when mutationalbiases are incorrectly modeled (Gruber et al. 2007), ifnatural selection for a specific function has resultedin adaptive convergence (Li et al. 2008; Liu et al.2010), and when correlated evolution at nearby sitesviolates the assumption of independence implicit inmost phylogenetic analyses (Davis et al. 2009). Ceilingson the number of character states, misleading adaptiveconvergence, and character dependency have all beenobserved in morphological characters (Wagner 2000;Wiens et al. 2003; Kangas et al. 2004; Dávalos et al. 2012).Further, evolutionary adaptation is expected to shapemorphological characters to a greater extent than manynucleotide substitutions because all morphologicalcharacters are phenotypic (Wake 1991; Springer et al.2007; 2008). Nonetheless, the potential for systematicerror to mislead morphological phylogenies andundermine data combination has not been as thoroughlyexplored as it has with molecular data. Crucially,comparisons among loci and models that account forbiased data make systematic error more tractable withmolecular data (Muse and Gaut 1994; Gruber et al. 2007;Blanquart and Lartillot 2008; Davies et al. 2012).
Molecular scaffolds in which trees derived frommolecular sequence analyses are used to constrainsearches for optimal morphological trees weredeveloped as a way to integrate data under the
1
Systematic Biology Advance Access published May 9, 2014
[15:53 20/4/2014 Sysbio-syu022.tex] Page: 2 1–19
2 SYSTEMATIC BIOLOGY
assumption that morphological characters may betoo homoplasious to be directly analyzed (Springeret al. 2001; 2007). Morphologists and paleontologistshave criticized scaffolds because they fail to conveyphylogenetic uncertainty arising from both themolecular and morphological data, and assign muchgreater weight to molecular sequences a priori (Manoset al. 2007; Hermsen and Hendricks 2008). By notanalyzing all characters on equal footing, scaffoldsoverlook potential support in molecular data sets forclades strongly supported by morphology (de Queirozand Gatesy 2007).
As models to analyze morphological charactershave been proposed, developed, and extended toinclude relaxed-clock analyses (Lewis 2001; Pyron 2011;Ronquist et al. 2012a), the question of how to integratemorphological characters in phylogenetic analyses hasbecome more pressing. Beyond providing constraintsfor dating divergence and understanding the tempo andmode of trait evolution, morphological characters canalso contribute to understanding the evolution of extanttaxa (Wiens 2004). In some cases, including fossil taxain combined analyses may actually change perceivedrelationships among extant lineages (Manos et al. 2007;Hermsen and Hendricks 2008). Uncovering errors inmorphological data and improving methods to integrateand analyze morphological characters are therefore ofgreat importance to fully inferring and analyzing theTree of Life (Wiens 2009).
In this study, we introduce and analyze multi-locusDNA sequences and a set of >270 dental characters forthe mammalian family Phyllostomidae with the goalof investigating the phylogenetic signal and featuresof morphological data. Phyllostomid bats are perhapsthe single most ecologically diverse mammalian family,including specialized lineages that feed exclusivelyon nectar, blood, hard or soft fruits, in addition totaxa that are carnivores and/or gleaning insectivores(Rojas et al. 2011; Santana et al. 2011; Dumont et al.2012). Fossil phyllostomids are known from the middleMiocene of northern South America (Czaplewski et al.2003b) and extensive Pleistocene deposits in Centraland South America, and the Caribbean (Czaplewskiet al. 2003a; Fracasso and Salles 2005; Dávalos andRussell 2012; Dávalos and Turvey 2012). The outstandingdiversity of phyllostomids makes the evolution oftheir taxonomic diversity, feeding specializations, andbiogeography of interest to systematists, evolutionaryecologists, and functional morphologists (Rojas et al.2011, 2012; Dumont et al. 2012; Santana et al. 2012). Thephylogenetic placement of Miocene fossils, in particular,is critical to understanding the tempo of phyllostomiddiversification. To date, only one analysis includingone (of three) Miocene fossil lineages has ever beenconducted, and it encompassed only 7 of 58 extant genera(Czaplewski et al. 2003b). Successfully integratingmorphological characters into our analyses is thereforecrucial to understanding the evolution of phyllostomids.
Given their outstanding ecological diversity in a singleunarguably monophyletic group, phyllostomids are
uniquely suited for testing for signals of systematic errorin morphological characters driven by functional anddevelopmental constraints (Springer et al. 2007). Sincethese constraints affect multiple traits simultaneously(Wiens et al. 2003; Kangas et al. 2004; Carrier et al.2005), combinations of character states will end upbeing similar to one another and the observed signalwill amplify any underlying errors. The occurrenceof identical character states that do not arise bycommon descent but by other processes is homoplasy.One form of homoplasy involves rates of change sohigh that the observed distribution of states cannotbe distinguished from a random assignment, andthis can be diagnosed by examining saturation incharacter state changes. Specifically, in this studywe aimed to: (1) measure saturation and similarityin morphological characters, as well as phylogeneticconflict with molecular trees; (2) find ways tolocate phylogenetic conflicts among morphologicalcharacters (if substantial homoplasy is detected); and (3)extend methods to integrate morphological data withmolecular sequence data to account for systematic errorsuncovered.
MATERIALS AND METHODS
Taxonomic SamplingPhyllostomidae (Mammalia: Chiroptera:
Noctilionoidea) is a group of Neotropical batswhose monophyly is strongly supported by bothmorphological and molecular synapomorphies(Simmons 1998; Baker et al. 2000, 2003; Wettereret al. 2000). Phyllostomid bat species were traditionallyclassified into subfamilies based on dietary habitsand associated morphologies (Wetterer et al. 2000).We sampled densely among lineages characterized byinsectivory and nectarivory because Miocene fossils(†Palynephyllum, †Notonycteris) have been assigned togroups sharing these dietary specializations based onsimilarity to extant taxa and, for †Notonycteris, analysesof 48 dental characters in seven genera (Czaplewski et al.2003b). Two subfamilies formerly recognized based onmorphological characters and later shown by moleculardata to be paraphyletic were sampled. The subfamilyGlossophaginae sensu Wetterer et al. (2000) comprisedprimarily nectarivorous lineages whose morphologicalsynapomorphies are associated with feeding ecology,implying ecological convergence (Dávalos et al. 2012).This group has subsequently been divided into twomonophyletic groups, Glossophaginae sensu stricto andLonchophyllinae (Baker et al. 2003). No similar patternsof support from ecologically driven character changeshave been found for the subfamily Phyllostominaesensu Wetterer et al. (2000), another group shown tobe paraphyletic with molecular data (Baker et al. 2003;Datzmann et al. 2010). Lineages of phyllostominessensu Wetterer et al. (2000) retain the primitive feedingecology of insectivory (Rojas et al. 2011).
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 3 1–19
2014 DÁVALOS—SATURATED AND NON-INDEPENDENT MORPHOLOGICAL DATA 3
Sampling densely among primarily nectarivorousand insectivorous lineages enabled inclusion of allthe potential relatives of four fossil species. Three ofthese taxa are extremely incomplete Miocene fossiltaxa—†Notonycteris magdalenensis, †N. sucharadeus, and†Palynephyllum antimaster—known only from isolatedteeth, partial mandibles, and partial postcranial bones.We also included the better-preserved Quaternary†Phyllonycteris major (Choate and Birney 1968).Six additional noctilionoid species in the familiesMormoopidae, Noctilionidae, and Thyropteridae wereincluded as outgroups. Thyroptera tricolor (Chiroptera:Thyropteridae) was used to root trees. The total numberof genera sampled was 50, with both morphologicaland molecular data for 85 extant species. An additional28 species were represented only by morphologicaldata including fossils and extant species and subspeciesknown only from the type of material and lacking tissuevouchers at this time.
Morphological DataThe new morphological data generated to place
the fossils comprised representatives of all currentlyrecognized phyllostomid subfamilies, including 107species and subspecies in 46 phyllostomid genera.We identified 278 dental characters of potentialphylogenetic significance based on comparisonsamong ingroup taxa. Characters were generatedusing reductive character coding sensu Wilkinson(1995). We used combinations of two character statesto indicate taxonomic polymorphism in compositeterminal taxa (species or subspecies). Of the 278characters scored, 112 were treated as ordered inphylogenetic analyses. These multistate charactersdescribe apparently progressive gradations in size,shape, or relative position, and were coded as orderedto preserve hypotheses of homology of states andtransformations. We assembled the morphologicalmatrix using MorphoBank (www.morphobank.org), apublic online database for assembling and managingmorphological matrices (O’Leary and Kaufman 2012).Each cell in the matrix is documented with a labeledimage that can be zoomed into or downloaded to betterobserve the structure in question. The morphologicalmatrix is available at MorphoBank as project P891.
OverviewOur analyses aimed to find the best way to investigate
the data structure of morphological data and integratemorphological characters with DNA sequences. Figure 1summarizes these analyses of morphological characterstructure and data integration and serves as a generalguide to the methods implemented here.
Measuring saturation in morphological character statechanges.—A maximum parsimony (MP) phylogenetic
approach and curve fitting were used to assess theextent of saturation in changes in morphologicalcharacter states using the state: step relationship (Fig. 1,upper right; Wagner 2000). Saturation indicates onekind of homoplasy, as character states end up beingidentical without common descent (Dávalos et al.2012). To quantify saturation, we first inferred amorphological phylogeny using the Unix version ofPAUP* v. 4b10 (Swofford 2002) in MP heuristic searchesincluding ordered characters and 1000 replicates ofrandom sequence addition, followed by tree–bisection–reconnection (TBR). All maximally parsimonious treeswere retained.
The goal of the state: step plot is to test thehypothesis that new character states accumulatecontinuously throughout evolutionary history.Character optimization and ordering could affectthis relationship: accelerated transformation (acctran)would place changes earlier in the evolutionary historyof the clade, and ordered steps would lengthen thetree compared with unordered steps. These choiceswould skew results in the direction of rejecting the nullhypothesis of continuous accumulation of characterstates. For this reason, the resulting trees were used tooptimize morphological character state changes usingthe delayed transformation option (deltran) and allunordered characters. Each step of the MP phylogeniesbeginning with the first transformation starting fromthe root was then matched to its beginning and endcharacter state. We could then determine whether theend character state had been previously observed or not.The age of each node was determined by the relativebranching of the MP morphological phylogenies, andwhen identical relative ages were found, using the datesof divergence estimated by Dumont et al. (2012).
The resulting table includes the step in the phylogeny,the cumulative number of character states observed,and whether or not that particular step resultedin a new state or not (Supplementary Table S1,http://dx.doi.org/10.5061/dryad.pg4c0). The steps ofthe phylogeny were divided into two sets: thosethat result in new states and those that do not. Ifnew character states were distributed throughout thesequence of steps in the MP tree, then there shouldbe no significant difference in the distributions of stepnumbers from both groups. That is, if new characterstates are equally probable throughout evolutionaryhistory, then steps that result in new states shouldnot be concentrated at low or high step numbers, andboth kinds of steps would have similar step ranks. Wetested this null hypothesis using a Wilcoxon signed-rank test (Wilcoxon 1945) as implemented in R v.2.15.1(R Development Core Team 2012). The inflection pointbeyond which the accumulation of new character statesslows down relative to the steps of the phylogenywas identified by fitting a segmented linear modelusing the segmented R package v.0.2-9.4 (Muggeo 2008).We compared predictions from two hypotheses ofnon-linear accumulation of character states: a finite-state models in which there is an upper boundary
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 4 1–19
4 SYSTEMATIC BIOLOGY
Estimatingphylogenies
End points
Statisticalscaffold
Dissimilaritymatrix
Molecularmatrix
Morphologicalmatrix
Dissimilaritymatrix
Relative dissimilaritydistribution
(non-independence)
Phylogenies Phylogenies
Combinedphylogenies
Relativeevolutionary
rates
Simulationscomparing
trees with and withoutconflicting nodes
Significantlyconflictingcharacters
Alternativecombined
phylogenies
Conflictingnodes
Combined datamatrix
Characterexhaustion
Morphologicalcharacter
diagnostics
State-stepplot
Molecular posterioras constraint
Constrainedphylogenetic analyses
Data matrices
Analyses
Legend
FIGURE 1. Graphic summary of analyses performed to estimate character state exhaustion in morphological characters, quantify rates ofevolution in molecular and morphological partitions, and integrate molecular and morphological data. The workflow begins with molecularand morphological data matrices, and subsequent analyses: (1) examine saturation and independence in the morphological data (upper right),(2) examine character independence (upper center), (3) build a phylogenetic constraint from molecular sequences and produce a constrainedintegrated phylogeny or statistical scaffold (left half), (4) generate unconstrained combined phylogenies (lower left), (5) identify nodes thatconflict between constrained and unconstrained phylogenies (middle center), and (6) use conflicting nodes and simulations to identify outliermorphological characters supporting conflicting nodes and exclude these from an alternative set of combined phylogenies (middle and lowerright).
on the total number of character states, and theordered-state model in which each state has a finitenumber of descendent states (Wagner 2000). Non-linearrelationships describing the finite-state and ordered-state hypotheses were fitted to the data using theequations of Dávalos et al. (2012) and the nls commandin R (Fig. 1, upper right). The fit of the non-linear modelsto the data was compared using the Akaike InformationCriterion (AIC), with a lower AIC indicating better fit ofthe curve to the data (Burnham and Anderson 2002).
Quantifying similarity between characters.—Selectionfor function under similar ecological pressureshas been proposed as the mechanism producingadaptive convergence in character states (Wienset al. 2003; Springer et al. 2007). Correlationsbetween morphological character states resultingfrom development are also expected to generatehomoplasy by limiting both character states andcombinations of character states (Wake 1991). Both ofthese mechanisms are known to operate on mammaliandentition (Kangas et al. 2004; Evans et al. 2007) andwill result in non-independence of character states.To explore the independence of character states, wecalculated the Gower pairwise dissimilarity among
all morphological and molecular characters (Gower1966). The Gower dissimilarity has been adopted incommunity ecology and genomics, transcriptomics,and proteomics to explore correlations in variablescomprising continuous measures, categories (e.g.,different base pairs), binaries, and ordinals (Legendreand Gallagher 2001; Marengo et al. 2003; Peterson 2003;Alheit et al. 2011). It ranges from 0 to 1, with 0 indicatingcomplete identity and 1 complete dissimilarity inpatterns found in pairs of variables.
The coding of states using similar symbols fromone morphological character to the next does notimply an equivalence of states between characters.To overcome this challenge posed by morphologicaldata, we recoded each character as a binary matrix ofsimilarity and difference between species. For example,if a morphological character had states 0012 for thefirst four species, the comparison between the first twospecies would yield a 0; between species 2 and 3, a 1;and between species 3 and 4, a 1. This transformationmade all recoded characters directly comparable to oneanother, and Gower dissimilarities were calculated fromthese recoded characters.
To generate dissimilarity distributions comparablebetween morphological and molecular data, invariable
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 5 1–19
2014 DÁVALOS—SATURATED AND NON-INDEPENDENT MORPHOLOGICAL DATA 5
base pairs were excluded from calculations. Allambiguous characters of both kinds of data wereconverted to empty cells, and nucleotides were coded ascategorical variables. The daisy command in the clusterR package v.1.14.3 (Maechler et al. 2012) was then usedto calculate the Gower dissimilarities (Fig. 1, uppercenter). This implementation ignored missing valuesin calculating the dissimilarities for a given row ofcharacters. The frequency distributions of dissimilaritieswere then compared by plotting histograms andby estimating and visualizing the relative frequencydistribution using the R package reldist v.1.6-1 (Handcockand Morris 1999; Handcock 2013).
Molecular DataSmall pieces of tissue (∼2–10 mg) were used for DNA
extraction using the QIAmp DNA Micro Kit (QiagenN.V. Catalogue no. 56304). To maximize DNA recovery,we followed the manufacturer’s tissue protocol with anextra step of using carrier RNA after lysis. All DNA waseluted in molecular-grade water. DNA yields from tissueextractions ranged from 20–100 ng/�l.
We collected new partial sequences from seven nuclearloci: two introns, thy or thyrotropin beta chain, stat5aor signal transducer and activator of 5A; one 3′-untranslated region (UTR), plcb4 or phospholipase Cbeta 4; three autosomal exons, bdnf or brain-derivedneurotrophic factor, ttn6 or titin 6, rag2 or recombination-activating protein 2; and the X-chromosome exonatp7a or ATPase-7A. New complete sequences weregenerated for the mitochondrial locus cytb or cytochromeb. To maximize sampling of nectarivorous lineages,published sequences from the mitochondrial genes cox1or cytochrome oxidase I, cytb, and the ribosomal RNAs12S, tRNAval and 16S were also included (Baker et al.2003; Clare et al. 2007; 2011). All sequences analyzed inthis study are shown in Table 1.
We used primers designed by Eick et al. (2005) toamplify and sequence introns, those by Teeling et al.(2005) for the 3′-UTR, those by Murphy and O’Brien(2007) for the nuclear exons; and the combination ofprimers by Jansa et al. (1999) to amplify the cytb gene.To increase efficiency in amplification with noctilionoidtaxa, we redesigned the bdnf and ttn6 primers, andused newly designed internal primers for sequencingthe ttn6 locus. All primers used in this study areshown in Supplementary Table S2. Touchdown PCRamplifications were used for all genes, except thy,for which a single annealing temperature was used(Supplementary Tables S3 and S4). All amplificationswere carried out using the Taq PCR Core Kit (QiagenN.V. Catalogue no. 201223) with the reagent mixshown in Supplementary Table S5. Fragments werecycle- sequenced in both directions using external andsometimes internal primers (Supplementary Table S2).Automated Applied Biosystems 3730 sequencers wereused to analyze the sequenced fragments. Contigs werealigned using the assembly tool in Geneious v.5.3.6
(Drummond et al. 2011). Quality scores and visualinspection of chromatograms were used to identifyheterozygous positions.
Alignment.—We used the Perl script transAlign v1.2(Bininda-Emonds 2005) to align all nuclear exons andprotein-coding mitochondrial genes. The multiplesequence alignment method multiple alignmentthrough fast Fourier transform (MAFFT) v6.710bapplying the einsi protocol was used to align non-codingnuclear sequences (Katoh et al. 2005; Katoh and Toh2008). An alignment of mitochondrial ribosomal RNA(mtrDNA) sequences based on the secondary structureof the genes was obtained directly from Dávalos et al.(2012). After trimming ends to reduce missing data, theconcatenated alignment comprised 9584 base pairs.
Selection of partition scheme and optimal models ofDNA evolution.—The molecular data were dividedinto genomic and functional partitions. The maximumnumber of partitions (nine data blocks) comprisedthree codon positions each for mitochondrial andnuclear protein-coding genes, mtrDNA loops and stems,and introns together with one 3′ UTR. The optimalpartitioning scheme and model of DNA evolution forindividual partitions were selected using the BayesianInformation Criterion (BIC) in the PartitionFinderpython script v.1.0.0 (Lanfear et al. 2012). Briefly,PartitionFinder takes the predefined partitions in thealignment and compares combinations of data blocksand models of evolution for those data blocks usingML phylogenies as implemented in PhyML (Guindonet al. 2010). The approximate gamma distributionaccommodates sites with very low rates of change (Yang1994). To avoid possible dependencies between gammaand the proportion of invariable sites (Gu et al. 1995), therange of models of sequence evolution was restricted toeither parameter, but not both. The greedy algorithm,which uses the BIC to guide the search for optimalpartitioning schemes, was used to reduce computationaltime. Results from PartitionFinder comprised both theoptimal partitioning scheme and the best-fit models ofevolution for its individual partitions, and these wereapplied in subsequent ML and Bayesian phylogeneticanalyses. Downstream analyses included tree topologyand branch length searches. Partitioned alignments andresulting trees were deposited in TreeBASE under ID13970.
Phylogenetic AnalysesThe central goal of our phylogenetic analyses was to
integrate morphological and molecular data (Fig. 1). Twoapproaches were used: (1) statistical molecular scaffoldsand (2) combined analyses. Previous analyses have useda phylogeny derived from molecular data as a backboneconstraint in MP searches using morphological data—a molecular scaffold (Springer et al. 2001; Manoset al. 2007). Besides a priori assigning greater weight
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 6 1–19
6 SYSTEMATIC BIOLOGY
to molecular sequences (Hermsen and Hendricks2008), this approach has two disadvantages. First, itdoes not account for the phylogenetic uncertaintyassociated with the summary molecular phylogeny, asa single majority-rule consensus tree resulting from MPbootstrap analyses is generally used (Springer et al.2001; Manos et al. 2007). Second, measures of branchsupport for the morphological data are superimposedon the molecular scaffold phylogeny through bootstrapsor matrix representation parsimony, again losing thephylogenetic uncertainty associated with both types ofdata. We extended the scaffolding approach to overcomeboth of these issues by generating statistical scaffolds.With our approach morphological characters mayinfluence the phylogeny of taxa represented in molecularposterior phylogenies if these are in poorly supportednodes, but not if nodes are strongly supported.
Statistical molecular scaffolds.—A combination of Bayesianand ML approaches were used to obtain statisticalmolecular scaffolds (Fig. 1, left half). First, a posteriordistribution of molecular phylogenies was obtainedthrough Metropolis-coupled Markov chain MonteCarlo Bayesian analyses applying the best partitioningscheme and optimal models of sequence evolution foreach partition using MrBayes v.3.2.0 (Ronquist et al.2012b). Most MrBayes runs were completed in theCIPRES Science Gateway (Miller et al. 2010). Nucleotidefrequencies, shape of the discrete approximation ofthe gamma distribution of rates across the alignment,transition/transversion rate ratios, and rate matrixparameters were all unlinked between partitions; and therate of evolution was allowed to vary between partitions(prset ratepr = variable). Four independent searchesran for 10 million generations with a starting treederived from maximum likelihood (ML) analyses. Thisstarting tree was obtained through a ML search usingthe Randomized Axelerated ML (RAxML) algorithmv.7.2.7 (Stamatakis 2006). The preliminary analysiswas conducted on a maximally partitioned alignment(nine data blocks) and implemented the general timereversible (GTR) model of nucleotide evolution witha discrete approximation of a gamma distribution toaccount for variation in rates between sites for all datablocks (Tavaré 1986; Yang 1994).
To improve chain mixing, the temperature of theheated chains was lowered from the MrBayes default of0.02 to 0.01. Burn-in of Bayesian searches was assessedby examining log-likelihood and tree length estimatesusing Tracer v.1.5 (Rambaut and Drummond 2007).Convergence within searches was assessed through thestandard deviations of split frequencies, and betweensearches by comparing posterior estimates of log-likelihood, tree lengths, and relative rate parameters.Standard deviations of split frequencies <0.01 wereinterpreted as indicative of within-search convergence.Parameters were assessed to have converged betweenruns if their median posterior distributions differed byno more than 1 standard deviation from one another.
Estimates of autocorrelation time between posteriorsamples were used to subsample posterior phylogeniesto obtain an uncorrelated posterior sample of 1000phylogenies.
Second, each phylogeny in the uncorrelated posteriorsample of Bayesian molecular phylogenies was usedas a backbone constraint in ML searches in RAxMLusing the Markov k-state variable model with a gammadistribution of rate variation among characters toanalyze the morphological data (Mkv + gamma) (Fig. 1;Lewis 2001). Branch lengths and model parameters forthe combined data were optimized on each resultingtree using a partition file specifying different ratematrices and approximate gamma distribution foreach molecular partition, and the Mkv+gamma modelfor the morphological model (i.e., f e, -t each_tree,-q file_with_partitions, -s combined_data, and -K MKcommands). The resulting trees were then summarizedinto a maximum clade credibility tree with medianbranch lengths using TreeAnnotator v.1.6.1 (Rambautand Drummond 2010).
Unconstrained combined data analyses.—Theunconstrained combined morphological and moleculardata set were analyzed using ML and Bayesianapproaches. To obtain a ML estimate of the combinedphylogeny, we used RAxML with a partition file, and theMkv+gamma model to analyze the morphological data.The rapid bootstrap algorithm of RAxML was followedby thorough ML searches (i.e., f a, -q file_with_partitions,-s combined_data, and -K MK commands) (Stamatakiset al. 2008). One hundred bootstrap pseudoreplicateswere used to estimate branch support.
Bayesian estimates of phylogeny were obtained byrunning four independent analyses for 10 milliongenerations with the optimal molecular partitionscheme, the best-fit models of sequence evolution, andunlinked parameters for partitions (Fig. 1, lower leftand center). The Mkv+gamma model of evolution withsymmetric rates of change between character states(default, or Prset symdirihyperpr = fixed(infinity)) was usedto analyze the morphological partition. Burn-in andconvergence were assessed as with the molecular dataset (see the “Statistical molecular scaffolds” section). Thetemperature of the heated chains was lowered to 0.001to improve chain mixing.
Analyses of morphological data.—To make our analysescomparable to the majority of published morphologicalphylogenies (e.g., Wetterer et al. 2000; Dávalos et al. 2012),we analyzed the morphological data using MP, ML,and Bayesian methods. MP heuristic searches used 20replicates of random sequence addition followed by TBRin PAUP*. ML optimizations applied the Mkv+gammamodel in 100 searches using the rapid bootstrapalgorithm followed by a thorough search for the optimaltree. Three independent Bayesian analyses ran for 40million generations with the Mkv+gamma model ofcharacter evolution and a prior on the distribution of
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 7 1–19
2014 DÁVALOS—SATURATED AND NON-INDEPENDENT MORPHOLOGICAL DATA 7T
ABL
E1.
Spec
ies
and
Gen
Bank
acce
ssio
nnu
mbe
rsfo
rea
chlo
cus
anal
yzed
Spec
ies
atp7
abd
nfpl
cb4
rag2
stat
5ath
ytt
n612
S,tR
NA
val ,1
6Sco
x1cy
tb
Ano
ura
caud
ifer
KC
7830
05K
C78
2952
KC
7830
58K
C78
3112
—K
C78
3214
KC
7831
23A
Y39
5835
EF07
9981
L195
06A
nour
age
offr
oyi
AY
8344
95A
Y83
4519
AY
8359
51A
F316
431
——
AY
8347
18—
EF07
9996
FJ15
5495
Ano
ura
lati
dens
——
——
——
——
EF08
0020
—A
rtib
eus
gnom
usK
C78
3006
KC
7829
53K
C78
3059
—K
C78
3173
—K
C78
3124
—EF
0800
67EU
1609
57A
rtib
eus
jam
aice
nsis
KC
7830
07K
C78
2954
KC
7830
60FN
6416
74K
C78
3174
KC
7832
15K
C78
3125
AF0
6134
0A
F061
340
GQ
8616
67A
rtib
eus
litur
atus
KC
7830
08K
C78
2955
KC
7830
61—
—K
C78
3216
KC
7831
26—
EF08
0083
AY
6847
40A
rtib
eus
obsc
ures
—K
C78
2956
KC
7830
62—
—K
C78
3217
KC
7831
27—
EF08
0106
—B
rach
yphy
llaK
C78
3010
KC
7829
57K
C78
3063
AF3
1643
6K
C78
3175
—K
C78
3128
AY
3958
06—
AY
6204
67ca
vern
arum
Bra
chyp
hylla
nana
——
FN64
3251
——
——
——
AY
6204
44C
arol
liabr
evic
auda
KC
7830
11K
C78
2958
KC
7830
64K
C78
3113
KC
7831
76—
KC
7831
29—
JF45
3684
AF5
1195
1C
arol
liape
rspi
cilla
taK
C78
3012
KC
7829
59K
C78
3065
KC
7831
14K
C78
3177
KC
7832
18K
C78
3130
AY
3958
36EF
0802
11FJ
5897
15C
hoer
onis
cus
godm
ani
——
—A
F316
440
——
——
EU09
6698
—C
hoer
onis
cus
min
orK
C78
3013
KC
7829
60K
C78
3066
——
KC
7832
19K
C78
3131
—EF
0802
94K
C78
3055
Cho
eron
ycte
ris
——
——
——
—A
Y39
5808
——
Mex
ican
aC
hrot
opte
rus
auri
tus
KC
7830
14K
C78
2961
KC
7830
67A
F316
442
KC
7831
78K
C78
3220
KC
7831
32A
F411
538
EF08
0303
KC
7830
57D
esm
odus
rotu
ndus
KC
7830
15K
C78
2962
KC
7830
68A
F316
444
KC
7831
79K
C78
3221
KC
7831
33A
F263
228
JF43
5307
FJ84
7517
Dia
emus
youn
gi—
——
AF3
1644
5—
——
AF4
1153
4EF
0803
28FJ
1554
75D
iphy
llaec
auda
ta—
KC
7829
63K
C78
3069
KC
7831
15K
C78
3180
—K
C78
3134
AF4
1153
3—
DQ
0773
99E
nchi
sthe
nes
hart
iiK
C78
3016
KC
7829
64K
C78
3070
AF3
1644
9—
KC
7832
22—
AY
3958
38EU
1610
64U
6651
7E
roph
ylla
bom
bifr
ons
——
——
——
——
—G
U93
7269
Ero
phyl
lase
zeko
rni
KC
7830
17K
C78
2965
KC
7830
71A
F316
450
KC
7831
81K
C78
3223
KC
7831
35A
Y39
5839
—G
U93
7254
Glo
ssop
haga
——
——
——
——
—A
F382
886
com
mis
sari
siG
loss
opha
ga—
——
——
——
—JF
4591
62A
F382
875
long
iros
tris
Glo
ssop
haga
mor
enoi
——
——
——
——
—A
F382
882
Glo
ssop
haga
sori
cina
KC
7830
18—
KC
7830
72A
F316
452
KC
7831
82A
J865
666
KC
7831
36A
Y39
5840
EF08
0360
AF4
2308
1G
lyph
onyc
teri
sda
vies
iK
C78
3019
KC
7829
66K
C78
3073
AF3
1646
4K
C78
3183
KC
7832
24K
C78
3137
AY
3958
12EF
0803
64A
Y38
0747
Gly
phon
ycte
ris
sylv
estr
is—
—FN
6432
58A
F316
471
——
—A
Y39
5841
EF08
0366
AY
3807
46H
ylon
ycte
ris
unde
rwoo
di—
—FN
6432
59A
F316
453
——
—A
Y39
5813
——
Lam
pron
ycte
ris
brac
hyot
isK
C78
3020
KC
7829
67K
C78
3074
AF3
1646
3K
C78
3184
KC
7832
25K
C78
3138
AF4
1153
6EF
0803
70A
Y38
0748
Lept
onyc
teri
scu
raso
ae—
—FN
6432
60—
——
——
—A
F382
889
Lich
onyc
teri
sob
scur
a—
——
——
——
—JF
4488
53—
Lion
ycte
ris
spur
relli
—K
C78
2968
KC
7830
75—
—K
C78
3226
—A
Y39
5815
EF08
0374
AF4
2310
0Lo
ncho
phyl
laha
ndle
yi—
——
——
——
——
AF4
2309
4Lo
ncho
phyl
lam
orda
x—
——
KC
7831
16K
C78
3185
—K
C78
3139
——
AF4
2309
5Lo
ncho
phyl
laro
bust
a—
—FN
6432
61FN
6416
77—
——
——
AF4
2309
1Lo
ncho
phyl
lath
omas
iK
C78
3021
KC
7829
69K
C78
3076
AF3
1645
6K
C78
3186
KC
7832
27K
C78
3140
AY
3958
42EF
0803
77A
F423
086
Lonc
horh
ina
auri
taK
C78
3022
KC
7829
70K
C78
3077
KC
7831
17—
KC
7832
28K
C78
3141
AY
3958
43—
FJ15
5494
Lonc
horh
ina
orin
ocen
sis
—K
C78
2971
KC
7830
78—
——
KC
7831
42—
——
Loph
osto
ma
bras
ilien
seK
C78
3023
KC
7829
72K
C78
3079
AF3
1648
9K
C78
3187
KC
7832
29K
C78
3143
AF4
1154
4H
Q54
5592
FJ15
5486
Loph
osto
ma
carr
iker
iK
C78
3024
KC
7829
73K
C78
3080
KC
7831
18K
C78
3188
KC
7832
30K
C78
3144
—EF
0804
29—
Loph
osto
ma
evot
is—
——
AF4
4208
0—
——
——
FJ15
5491
Loph
osto
ma
schu
lzi
KC
7830
25K
C78
2974
KC
7830
81A
F442
079
KC
7831
89K
C78
3231
KC
7831
45—
EF08
0432
FJ15
5485
Loph
osto
ma
silv
icol
umK
C78
3026
KC
7829
75A
Y83
5949
——
KC
7832
32K
C78
3146
AF2
6323
0—
—
(Con
tinu
ed)
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 8 1–19
8 SYSTEMATIC BIOLOGY
TA
BLE
1.C
onti
nued
.
Spec
ies
atp7
abd
nfpl
cb4
rag2
stat
5ath
ytt
n612
S,tR
NA
val ,1
6Sco
x1cy
tb
Mac
roph
yllu
mK
C78
3027
KC
7829
76K
C78
3082
—K
C78
3190
KC
7832
33K
C78
3147
AF4
1154
0EU
0967
73FJ
1554
84m
acro
phyl
lum
Mac
rotu
sca
lifor
nicu
s—
—K
F471
662
AF3
1645
9K
F471
663
——
——
AY
3807
44M
acro
tus
wat
erho
usii
KC
7830
28K
C78
2977
KC
7830
83K
C78
3119
KC
7831
91K
C78
3234
KC
7831
48A
F263
229
——
Mic
rony
cter
isbr
osse
ti—
——
——
——
——
AY
3807
71M
icro
nyct
eris
hirs
uta
KC
7830
29K
C78
2978
KC
7830
84A
F316
465
KC
7831
92K
C78
3235
KC
7831
49A
Y39
5819
EF08
0447
AY
3807
69M
icro
nyct
eris
hom
ezi
——
——
——
——
—A
Y38
0754
Mic
rony
cter
ism
atse
sK
C78
3030
KC
7829
79K
C78
3085
KC
7831
20—
KC
7832
36K
C78
3150
——
DQ
0774
19M
icro
nyct
eris
meg
alot
isK
C78
3031
KC
7829
80K
C78
3086
AF3
1646
7K
C78
3193
—K
C78
3151
AY
3958
21EU
0967
80D
Q07
7426
Mic
rony
cter
ism
icro
tis
KC
7830
32K
C78
2981
KC
7830
87—
KC
7831
94K
C78
3237
KC
7831
52—
—A
Y38
0756
Mic
rony
cter
ism
inut
aK
C78
3033
KC
7829
82K
C78
3088
—K
C78
3195
——
AY
3958
23—
AY
3807
52M
icro
nyct
eris
KC
7830
34K
C78
2983
KC
7830
89A
F316
470
KC
7831
96K
C78
3238
KC
7831
53—
—D
Q07
7406
schm
idto
rum
Mim
onco
zum
elae
—K
C78
2984
KC
7830
90—
KC
7831
97—
KC
7831
54—
——
Mim
oncr
enul
atum
KC
7830
35—
KC
7830
91—
EU65
2033
—K
C78
3155
AF4
1154
3EU
0967
81FJ
1554
78M
onop
hyllu
spl
etho
don
——
——
——
——
—A
F382
887
Mon
ophy
llus
redm
ani
KC
7830
36K
C78
2985
KC
7830
92—
KC
7831
98—
KC
7831
56—
—A
F382
888
Mor
moo
psbl
ainv
illei
KC
7830
37K
C78
2986
KC
7830
93A
Y02
8169
KC
7831
99K
C78
3239
KC
7831
57—
—A
F338
686
Mus
onyc
teri
sha
rris
oni
——
—A
F316
475
——
—A
Y39
5844
——
Noc
tilio
albi
vent
ris
KC
7830
38A
Y83
4520
KC
7830
94A
F330
811
KC
7832
00gA
J865
658
KC
7831
58—
EF08
0524
AF3
3080
6N
octi
liole
pori
nus
KC
7830
39K
C78
3004
KC
7830
95A
F330
816
KC
7832
01K
C78
3240
KC
7831
59A
F263
224
EF08
0534
AF3
3079
6P
hyllo
derm
ast
enop
s—
KC
7829
87K
C78
3096
AF3
1648
0—
KC
7832
41—
AF4
1154
2EU
0968
30FJ
1554
80P
hyllo
nyct
eris
aphy
lla—
——
AF3
1647
8—
——
——
AF1
8703
3P
hyllo
nyct
eris
poey
iK
C78
3040
KC
7829
88K
C78
3097
KC
7831
21K
C78
3202
KC
7832
42K
C78
3160
——
GU
9372
40P
hyllo
stom
usdi
scol
or—
——
——
——
—EF
0805
46—
Phy
llost
omus
elon
gatu
sK
C78
3041
—K
C78
3098
—K
C78
3203
—K
C78
3161
—EF
0805
51K
C78
3056
Phy
llost
omus
hast
atus
KC
7830
42—
KC
7830
99—
EU65
2033
KC
7832
43K
C78
3162
AF4
1154
1EF
0805
56FJ
1554
79P
lata
lina
geno
vens
ium
——
——
——
——
—A
F423
101
Pla
tyrr
hinu
she
lleri
KC
7830
43K
C78
2991
KC
7831
00A
F316
481
KC
7832
04K
C78
3244
KC
7831
63—
EF08
0579
FJ15
4140
Pte
rono
tus
davy
iK
C78
3044
KC
7829
92K
C78
3101
AF3
3869
2K
C78
3205
KC
7832
45K
C78
3164
AF4
0717
6—
AF3
3867
1P
tero
notu
spa
rnel
liiK
C78
3045
—K
C78
3102
KC
7831
22K
C78
3206
KC
7832
46—
——
AF3
3866
1P
ygod
erm
abi
labi
atum
KC
7830
46K
C78
2994
KC
7831
03A
F316
483
—K
C78
3247
KC
7831
66A
Y39
5826
—A
Y60
4438
Rhi
noph
ylla
alet
hina
——
——
——
——
—A
F187
028
Rhi
noph
ylla
fisch
erae
——
——
——
——
—A
F187
032
Rhi
noph
ylla
pum
ilio
KC
7830
47K
C78
2995
KC
7831
04A
F316
484
KC
7832
07EU
3719
60K
C78
3167
AY
3958
27EF
0805
98A
F187
031
Stur
nira
liliu
mK
C78
3048
KC
7829
96K
C78
3105
—K
C78
3208
KC
7832
48—
—EF
0806
84D
Q31
2398
Thy
ropt
era
tric
olor
AY
8345
02K
C78
2997
KC
7831
06—
KC
7832
09K
C78
3249
AY
8347
25A
F263
233
——
Tona
tia
saur
ophi
laK
C78
3049
KC
7829
98K
C78
3107
AF4
4208
6K
C78
3210
KC
7832
50K
C78
3169
AF4
1153
0EF
0807
34—
Trac
hops
cirr
hosu
sK
C78
3050
KC
7829
99FN
6432
73—
AJ8
6542
2K
C78
3251
—A
F411
539
EF08
0747
FJ15
5483
Trin
ycte
ris
nice
fori
KC
7830
51K
C78
3000
KC
7831
08—
KC
7832
11K
C78
3252
KC
7831
70A
Y39
5830
——
Uro
derm
abi
loba
tum
KC
7830
52K
C78
3001
KC
7831
09A
F316
491
—EU
3719
73K
C78
3171
AY
3958
31EF
0807
88A
Y16
9955
Vam
pyro
des
cara
ccio
liK
C78
3053
KC
7830
02K
C78
3110
AF3
1649
4K
C78
3212
EU37
1991
—A
Y39
5846
EF08
0804
AY
1570
34V
ampy
rode
sm
ajor
——
——
——
——
—H
Q63
7422
Vam
pyru
msp
ectr
umK
C78
3054
KC
7830
03K
C78
3111
AF3
1649
5K
C78
3213
KC
7832
53—
AF4
1153
7EF
0808
09|
FJ15
5482
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 9 1–19
2014 DÁVALOS—SATURATED AND NON-INDEPENDENT MORPHOLOGICAL DATA 9
branch lengths that was shorter than the MrBayes default(prset Brlenspr = Unconstrained:Exp(100)).
Comparisons between PhylogeniesWe carried out simulations and implemented a
likelihood approach to determine which morphologicalcharacters significantly supported incorrect nodes(Fig. 1, middle and lower right). Three nodes recoveredin combined data analyses, deemed incorrect basedon the molecular statistical scaffold and previouslypublished molecular and morphological phylogenies(Simmons 1996; Baker et al. 2003; Datzmann et al. 2010;Dávalos et al. 2012), were used to develop this method.First, we calculated the per-character likelihoods of themorphological data given the scaffold tree, and giventhe same tree modified only to contain the incorrectnode (Fig. 1, middle right). This was accomplished byusing the computation of per-site log likelihoods andthe Mkv+gamma model in RAxML (i.e., using the -f g, -z file_with_trees, -s morphological_data, and -K MKcommands) (Stamatakis 2006). Then, null distributionsof differences between the fit of characters to the treewith and without the incorrect node were generated foreach of the 278 morphological characters. Morphologicalcharacters were simulated using the rTraitDisc functionin the ape R package v.3.0-5 (Paradis et al. 2004).Simulating discrete characters requires a phylogenywith branch lengths and a transition matrix of ratesof change between character states (Pagel 1994). Thestructure of each transition matrix assigned both thenumber of character states and whether or not thecharacter was ordered. To facilitate simulation, transitionmatrices consisted only of 0/1 values. This meantthat the branch lengths of each phylogeny used assimulation input had to be multiplied by the rate ofcharacter evolution so that the rate of change in thecorresponding transition matrix equaled 1. The posteriordistributions of Bayesian morphological phylogeniesand evolutionary rate parameter (m in MrBayes) wereused to generate these scaled phylogenies. Finally, theper-character log-likelihoods of simulated data sets wereobtained using RAxML and the resulting distributionsof differences in site likelihoods were used to determinewhich morphological characters were outliers in theirsupport for particular nodes (Fig. 1, lower right).
RESULTS
Characteristics of the Morphological DataThe state: step plot showed the accumulation of
character states slowed down as steps accumulated inthe phylogeny (Fig. 2). The inflection point of the curvewas estimated at 878 steps (standard error = 2.625).Steps in which a new character state was first observedoccurred earlier in the phylogeny and were ranked lowerthan other steps (W =203,034, P-value = 6.587E-05). The
0
200
400
600
0 1000 2000
No. of steps in MP phylogeny
No.
of s
tate
s ob
serv
ed
Character change
Finite states model
Observed
Ordered states model
FIGURE 2. State: step plot for morphological data optimized usingMP. Lines fitted using the finite-states (or rarefaction) and ordered-states (or hypervolume) models are shown for all points beyond thelinear accumulation of character states.
ordered-character relationship provided a substantiallybetter fit to the data (AIC = 11,081) than the finite-staterelationship (AIC = 18,662). Curves predicted using thefitted models are shown in Figure 2.
The frequency distributions of pairwise dissimilaritiesfrom molecular and morphological data were markedlydifferent (Fig. 3A). Relative distribution analysesrevealed the greatest difference between morphologicaland molecular distributions consisted of an excess ofvery low dissimilarities, and lack of dissimilaritiesabove 0.2 among morphological characters (Fig. 3B).Dental characters encoded variation similar to thatin other morphological characters, whereas molecularcharacters encoded highly dissimilar variation incomparison. Phylogenetic analyses of the dental dataresulted in weakly supported trees that were markedlydifferent from previous morphological and molecularphylogenies (Supplementary Fig. S1). Optimal treesrecovered using each approach were used in subsequentsite-likelihood comparisons.
Phylogenetic AnalysesThe optimal partition found using PartitionFinder had
seven partitions: each of the three codon positions for themitochondrial data, mt rRNA loops, mt rRNA stems,introns and the 3′ UTR, and protein-coding nuclear
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 10 1–19
10 SYSTEMATIC BIOLOGY
0
Fre
quen
cy (
perc
ent)
Morphological
0
5.2
10.4
15.6
Molecular
2.1
4.2
6.3
8.5
Dissimilarity between characters
Rel
ativ
e de
nsity
bet
wee
n m
orph
olog
ical
cha
ract
ers
0.0 0.2 0.4 0.6 0.8 1.0
0
2
4
6
8
10
A
B
FIGURE 3. Frequency distributions and relative frequencydistribution of Gower dissimilarities from molecular andmorphological characters. (A) Frequency distributions of dissimilarityof morphological (upper) and molecular (lower) characters. (B)Frequency distribution of dissimilarities between morphologicalcharacters relative to dissimilarities between molecular characters.Solid bars show the observed relative density, and the solid black lineshows the smoothed density of morphological character dissimilaritiesrelative to the dissimilarity among variable molecular characters.The dashed black line indicates the expected relative density if thefrequency distributions of dissimilarities from morphological andmolecular data were not significantly different. Significant differencesbetween frequency distributions occur in intervals above and belowthe dashed line.
sequences. The optimal model of sequence evolutionfor nuclear protein-coding genes, mitochondrial secondand third codon positions, and mt rRNA loops wasthe GTR+gamma (Tavaré 1986; Yang 1994). For intronsand the 3′ UTR, mitochondrial first positions and mtrRNA stems, the optimal model was the symmetrical(SYM)+gamma (Zharkikh 1994).
After discarding a burn-in of 1 million generations,Bayesian analyses of the molecular data using optimalpartitions and models yielded a resolved and generallywell-supported phylogeny (Supplementary Fig. S2).Examination of the posterior distribution of log-likelihoods and tree lengths showed that theseparameters were autocorrelated up to every 7000generations. The posterior distribution of trees wassampled every seven trees to obtain an uncorrelatedposterior for subsequent analyses. A random sample of1000 trees was used to constrain ML searches to generatethe statistical scaffold. The resulting phylogeny reflectedthe phylogenetic uncertainty and support from both theBayesian molecular posterior and the morphological MLsearches (Fig. 4A). The placement of incomplete Miocenefossils received high support, corresponding to 99–100%of sampled trees.
Convergence in combined Bayesian analyses wasachieved after 1 million generations, and the summaryof multiple independent runs resulted in a well-resolvedand weakly supported phylogeny (Fig. 4B). The relativelevels of support were similar to those of the combinedML phylogeny (cf. Fig. 5A), with two importantdifferences in resolution. The combined ML phylogeny,statistical scaffold, and morphological phylogeniesplaced the monotypic extant genus Neonycteris (knownonly from morphology) in a clade with Trinycteris (and,often, Glyphonycteris Figs. 4A and 5A and SupplementaryFig. S1). The combined ML phylogeny and statisticalscaffold placed †Palynephyllum in a clade with Platalinaand Xeronycteris (Figs. 4A and 5A). In contrast,the combined Bayesian phylogeny placed Neonycteriswithin Micronycteris with a posterior probability of0.91, and †Palynephyllum in a clade of all nectar-feeding phyllostomids (Fig. 4B; Bayesian posteriorprobability = 0.91). The phylogenetic position of thetwo †Notonycteris species was unchanged in combinedphylogenies relative to the statistical scaffold andwas also supported in morphological phylogenies. MLanalyses of the combined data resulted in a phylogenysimilar to Bayesian analyses, crucially differing inplacing Lonchorhina as sister to Phyllostominae (Fig. 5A).
Comparisons between PhylogeniesThree nodes highlighted in Figures 4B and 5A
were identified as incorrect based on the statisticalscaffold: (1) the node uniting all nectar-feedingphyllostomids (MRCA of Lonchophylla and Glossophagain Fig. 4B), (2) Lonchorhina as sister to the subfamilyPhyllostominae, and (3) Neonycteris nested withinMicronycteris. The first two nodes have also been rejectedin previous phyllostomid phylogenies generated usingdata included in our analyses (Baker et al. 2003; Dávaloset al. 2012), and a combination of overlapping and non-overlapping loci (Datzmann et al. 2010). No previouscombined analysis has included Neonycteris, which isknown only from the type series (Simmons 2005).Molecular data are not available for this monotypic
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 11 1–19
2014 DÁVALOS—SATURATED AND NON-INDEPENDENT MORPHOLOGICAL DATA 11
BA
Mic
rony
cter
is
Mim
on
†Not
onyc
teris
Trac
hops
Tona
tia
Mim
on c
renu
latu
m
Phyl
lode
rma
Phyl
lost
omus
Lonc
horh
ina Lo
ncho
phyl
la
Car
ollia
Gly
phon
ycte
ris Rhi
noph
ylla
Vam
pyro
des
Artib
eus
Anou
ra
Cho
eron
iscu
s
Mac
rotu
s
Pygo
derm
a
†Pal
ynep
hyllu
m
Plat
yrrh
inus
Stur
nira
Cho
eron
ycte
ris
Neo
nyct
eris
Lich
onyc
teris
Thyr
opte
ra
Mus
onyc
teris
Mac
roph
yllu
m
Mor
moo
ps
Hyl
onyc
teris
Des
mod
us
Ench
isth
enes
Chr
otop
teru
s
Noc
tilio
Plat
alin
a
Lam
pron
ycte
ris
Dip
hylla
Uro
derm
a
Xero
nyct
eris
Lion
ycte
ris
Pter
onot
us
Trin
ycte
ris
Vam
pyru
mDia
emus
Brac
hyph
ylla
Glo
ssop
haga
Lept
onyc
teris
Mon
ophy
llus Ph
yllo
nyct
eris
Erop
hylla
Loph
osto
ma
Trin
ycte
ris
Pygo
derm
aDes
mod
us
Mac
roph
yllu
m
Lich
onyc
teris
Hyl
onyc
teris
Uro
derm
a
Lion
ycte
risPl
atal
ina
Mor
moo
ps
Dia
emus
Chr
otop
teru
s
Stur
nira
Xero
nyct
eris
Vam
pyru
m
Ench
isth
enes
Pter
onot
us
Thyr
opte
ra
Plat
yrrh
inus
Dip
hylla
Cho
eron
ycte
ris
Lam
pron
ycte
ris
†Pal
ynep
hyllu
m
Noc
tilio
Neo
nyct
eris
Mus
onyc
teris
Mac
rotu
s
Mic
rony
cter
is
Mim
on c
renu
latu
m
†Not
onyc
teris
Trac
hops
Tona
tia
Loph
osto
ma
Phyl
lode
rma
Phyl
lost
omus
Lonc
horh
ina
Car
ollia
Gly
phon
ycte
ris
Rhi
noph
ylla
Vam
pyro
des
Artib
eus
Lonc
hoph
ylla
Anou
ra
Cho
eron
iscu
s
Brac
hyph
ylla
Er
ophy
lla
Phyl
lony
cter
is
Mon
ophy
llus
Lept
onyc
teris
Glo
ssop
haga
Mic
rony
cter
is
Mim
on
Pla
nt-v
isiti
ng
Su
pp
ort
10
0
47
% P
os
teri
or
pro
ba
bil
ity
1.0
0
0.1
3
Phy
llost
omin
ae
95
59
89
95
97
98
99
90
73
6782
58
71
91
99
96
82
95
91
95
98
91
79
77
91
9986
98
99
47
67
7295
74
87
79
9386
99
99
92
95
99
75
99
98
59
99
99
56
0.1
chan
ges/
char
acte
r0.
1 ch
ange
s/ch
arac
ter
0.65
0.620.5
7
0.48
0.91
0.61
0.91
0.89
0.54
0.25
0.99
0.87
0.82
0.99
0.98
0.88
0.13
0.46
0.64
0.90
0.82
0.93
0.800.7
70.56
0.34
0.98
0.69
0.49
0.63
0.71
0.99
0.38
0.89
0.80 0.6
2
0.42
0.97
0.52
0.93
0.98
0.91
0.70
0.79
0.96
0.76
0.77
0.99
0.99
0.77
0.44
0.58
0.99
0.99
0.99
0.78
0.48
0.98
0.97
0.980.9
5
0.99
0.98
0.51
0.96
0.63
0.45
0.99
0.85
0.96
0.57
0.98
0.98
0.86
0.74
0.57
0.51
0.99
0.98
FIG
UR
E4.
Sum
mar
ies
ofst
atis
tica
lm
olec
ular
scaf
fold
san
dco
mbi
ned
Baye
sian
anal
yses
.Bra
nche
sco
llaps
edin
totr
iang
les
had
max
imum
supp
ort,
†id
enti
fies
foss
illin
eage
s,no
n-m
onop
hyle
tic
gene
raar
ein
bold
and
outg
roup
sar
ein
gray
font
.Sub
fam
ilyPh
yllo
stom
inae
sens
uBa
ker
etal
.(20
03)i
sin
dica
ted.
(A)S
tati
stic
alsc
affo
ldph
ylog
eny.
Max
imum
clad
ecr
edib
ility
tree
deri
ved
from
ML
anal
yses
ofm
orph
olog
ical
data
cons
trai
ned
toea
chof
1000
unco
rrel
ated
Baye
sian
post
erio
rm
olec
ular
phyl
ogen
ies.
The
mea
nof
the
log-
likel
ihoo
dof
the
data
give
nth
em
odel
was
−119
,913
(sta
ndar
dde
viat
ion
=10
7).D
ashe
dlin
ein
dica
tes
bran
chis
not
show
nto
scal
e.Pe
rcen
tsu
ppor
tva
lues
<10
0ar
esh
own
abov
eor
belo
win
tern
albr
anch
es,b
ranc
han
dnu
mbe
rco
lors
are
scal
edto
supp
ort.
(B)
Com
bine
dBa
yesi
anph
ylog
eny.
Con
sens
usph
ylog
ram
ofal
lcom
pati
ble
bipa
rtit
ions
inBa
yesi
anpo
ster
ior
tree
sfr
omfo
urin
depe
nden
tru
ns(e
ight
chai
nsto
tal)
ofBa
yesi
anan
alys
esof
com
bine
dD
NA
sequ
ence
and
mor
phol
ogic
alda
ta.T
heha
rmon
icm
ean
ofth
epo
ster
ior
log-
likel
ihoo
dof
the
mod
elgi
ven
the
data
was
−115
,118
.Pe
rcen
tBay
esia
npo
ster
ior
prob
abili
ties
<1.
00ar
esh
own
belo
wor
abov
ein
tern
albr
anch
es,b
ranc
han
dnu
mbe
rco
lors
are
scal
edto
post
erio
rpr
obab
ility
.Hig
hlig
hted
node
sco
nflic
ted
wit
hth
em
olec
ular
phyl
ogen
y(a
ndth
est
atis
tica
lsca
ffol
d)an
dw
ere
furt
her
anal
yzed
toid
enti
fym
orph
olog
ical
char
acte
rsth
atst
rong
lysu
ppor
ted
them
(see
Fig.
6).
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 12 1–19
12 SYSTEMATIC BIOLOGY
Bo
ots
tra
ps
up
po
rt1
00
9
Phy
llost
omin
ae
Vam
pyro
des
Lept
onyc
teris
Erop
hylla
Mon
ophy
llus
Mim
on G
lyph
onyc
teris
Mim
on c
renu
latu
m
Pygo
derm
a
Mic
rony
cter
is Uro
derm
a
Mac
roph
yllu
m
Noc
tilio
Chr
otop
teru
s
Rhi
noph
ylla
Anou
ra†Pal
ynep
hyllu
m
Loph
osto
ma
Vam
pyru
m
Tona
tiaDes
mod
us
Ench
isth
enes
†Not
onyc
teris
Plat
alin
a
Neo
nyct
eris
Car
ollia
Mus
onyc
teris
Mac
rotu
s
Lonc
hoph
ylla
Cho
eron
ycte
ris
Artib
eus
Lonc
horh
ina Br
achy
phyl
la
Thyr
opte
ra
Hyl
onyc
teris
Mor
moo
ps
Lion
ycte
ris
Trac
hops
Xero
nyct
eris
Cho
eron
iscu
s
Dia
emus
Plat
yrrh
inus
Stur
nira
Lich
onyc
teris
Phyl
lost
omus
Lonc
hoph
ylla G
loss
opha
ga
Pter
onot
us
Phyl
lode
rma
Trin
ycte
ris
Dip
hylla
Phyl
lony
cter
is
Lam
pron
ycte
ris
Bo
ots
tra
ps
up
po
rt1
00
9
0.1
chan
ges/
char
acte
r
BA
Phyl
lode
rma
Plat
yrrh
inus
Vam
pyru
m
Mic
rony
cter
is
Lept
onyc
teris
Lion
ycte
ris
Stur
nira
Tona
tia
Hyl
onyc
teris
Xero
nyct
eris
Trin
ycte
ris
Vam
pyro
des
Cho
eron
ycte
ris
Mim
on
Mac
rotu
s
Anou
ra
Glo
ssop
haga
Gly
phon
ycte
risLonc
horh
ina
Loph
osto
ma
Rhi
noph
ylla
Neo
nyct
eris
Plat
alin
a
Erop
hylla
Pygo
derm
a
Mim
on c
renu
latu
m
Car
ollia
Uro
derm
a
Dip
hylla
Pter
onot
us
Thyr
opte
ra
Mor
moo
ps
Chr
otop
teru
s
†Not
onyc
teris
Dia
emus
Phyl
lost
omus
Mus
onyc
teris
†Pal
ynep
hyllu
m
Mon
ophy
llus
Trac
hops
Mac
roph
yllu
m
Noc
tilio
Lonc
hoph
ylla
Artib
eus
Lich
onyc
teris
Brac
hyph
ylla
Ench
isth
enes
Phyl
lony
cter
is
Cho
eron
iscu
s
Des
mod
us Lam
pron
ycte
ris
0.1
chan
ges/
char
acte
r
98
46
92
98
99
45
80
69
93
99
99
37
24
73
55
2888
74
99
41
77
25
95
99
78
47
9998
30
32
97
30
5281
29
6974
31
8757
4098
10
44
97
45
68
60
42
12
97
33
74
76
37
49
55
40
91
90
34
32
46
65
88
24
36
66
85
65
9
90
90
19
78
4376
55
52
48
52
93
66
72
23
78
12
95
97
40
86
98
70
28
44
68
30
27
98 41
31
15
6055
66
43
22
61
67
83
59
43
96
14
47
95 23
71
979652
39
20
48
2545 29
99
14
3033
9
91
53
21
94
56
95
2015
30
96
62
8172
85
82
51
82
84
86
89
92
77
87
93
93
95
5881
93
56
93
77
50
97
37 36
61
83
96
67
31
85
48
96
38
99
6176
26
49
40
69
92
11
21
73
FIG
UR
E5.
ML
phyl
ogen
ies
resu
ltin
gfr
oman
alys
esof
com
bine
dD
NA
sequ
ence
and
mor
phol
ogic
alda
taus
ing
RA
xML.
Bran
ches
colla
psed
into
tria
ngle
sha
dm
axim
umsu
ppor
t,†
iden
tifie
sfo
ssil
gene
raor
spec
ies,
non-
mon
ophy
leti
cge
nera
are
inbo
ldan
dou
tgro
ups
are
show
nin
gray
font
.Das
hed
line
indi
cate
sbr
anch
isno
tsho
wn
tosc
ale.
Perc
entb
oots
trap
valu
es<
100
are
show
nab
ove
orbe
low
inte
rnal
bran
ches
,bra
nch
and
num
ber
colo
rsar
esc
aled
tobo
otst
rap
valu
e.Su
bfam
ilyPh
yllo
stom
inae
sens
uBa
ker
etal
.(20
03)i
sin
dica
ted.
(A)M
Lph
ylog
eny
ofco
mbi
ned
data
incl
udin
gal
lmor
phol
ogic
alch
arac
ters
(log
-lik
elih
ood
=−1
19,7
68).
The
high
light
edno
deco
nflic
ted
wit
hth
em
olec
ular
phyl
ogen
y(a
ndth
est
atis
tica
lsca
ffol
d)an
dw
asfu
rthe
ran
alyz
edto
iden
tify
mor
phol
ogic
alch
arac
ters
that
stro
ngly
supp
orte
dit
(see
Fig.
6).(
B)
ML
phyl
ogen
yof
part
itio
ned
DN
Ase
quen
ceda
ta,a
ndm
orph
olog
ical
data
excl
udin
g32
mor
phol
ogic
alch
arac
ters
(Sup
plem
enta
ryTa
ble
S6;l
og-l
ikel
ihoo
d=
−117
,270
).
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 13 1–19
2014 DÁVALOS—SATURATED AND NON-INDEPENDENT MORPHOLOGICAL DATA 13
genus. Both previous morphological analyses and MLanalyses of combined data (Fig. 5A) suggest thatNeonycteris is more closely related to Trinycteris andGlyphonycteris than to Micronycteris (Simmons 1996).
The log-likelihood of character node support wascompared with the null distribution of per-characterdifference in log-likelihood. The distribution of supportor rejection for incorrect nodes from individualdental characters is shown in Figure 6. The threenodes received different levels of support from themorphological data, with overwhelmingly more outlierssupporting than rejecting the monophyly of nectar-feeding phyllostomids (10:1), fewer outliers supportingthan rejecting Lonchorhina as sister to the Phyllostominae,and 10 outliers supporting and five rejecting theinclusion of Neonycteris in Micronycteris (13:14; Fig. 5).
Thirty-two significant positive outliers (onecharacter overlapped in support for two nodes)were then excluded from the combined data matrix(Supplementary Table S6). The resulting phylogenywas generally weakly supported, especially within thesubfamily Phyllostominae and between Lonchophyllaand its relatives (Fig. 5B). However, support valuesacross nodes from analyses of the trimmed data setwere often higher than those from analyses including allthe data (Fig. 5B). The position of †Palynephyllum wasequivalent to that found in the statistical scaffold (i.e.,within the crown Lonchophyllinae sensu Baker et al.2003) and †Notonycteris remained sister to Vampyrum(Fig. 5B).
DISCUSSION
Integrating Morphological and Molecular DataWe propose new methods to integrate morphological
and molecular characters that more rigorously addressconflict caused by morphological homoplasy. Ina previous contribution (Dávalos et al. 2012), weexamined saturation in both morphological andmolecular character states and applied likelihoodapproaches to identify strongly conflicting molecularcharacters. Here, we focus on diagnosing homoplasy inmorphological data, and implementing new methodsto isolate significantly conflicting signal and integratemorphological and molecular characters. These newmethods were applied to an empirical data set toplace the Miocene fossils in the phylogeny of NewWorld leaf-nosed bats and understand the evolutionaryhistory of the family. Instead of either automaticallyconstraining the signal from morphological data toconform to a single molecular scaffold (Springer et al.2001), or unconditionally combining the data (Hermsenand Hendricks 2008), we aimed to uncover evidenceof homoplasy in morphological data, and identifyand exclude characters involved in conflicts arisingfrom homoplasy (Wiens et al. 2003; Dávalos et al.2012). To accomplish these objectives, we exploredthe patterns of variation in morphological data,extended molecular scaffolds to account for variation
Dental characters ●
●
MandibularMaxillary
● CanineIncisors
MolarsPremolars
● Significant
B
A
−4
−2
0
2
Sup
port
for
Neo
nyct
eris
in M
icro
nyct
eris
C
−2
0
2
Sup
port
for
nect
ar−
feed
ing
clad
e
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
Sup
port
for
Lonc
horh
ina
sits
ter
to P
hyllo
stom
inae
−2
0
2
●●●
●
●
●●●●●
●
●
●●●●●●●
●●
●
●
●●
●
●
●
●
●
0 100 200
Character
4
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
FIGURE 6. Observed differences in morphological per-characterlog-likelihood between trees containing the clades and the statisticalscaffold tree. Values above the 0 line indicate support, below indicaterejection. Large symbols indicate statistically significant outliers. (A)Morphological support or rejection for a clade of nectar-feedingphyllostomids. (B) Morphological support or rejection for a cladeof Lonchorhina and Phyllostominae. (C) Morphological support orrejection for Neonycteris nesting within Micronycteris.
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 14 1–19
14 SYSTEMATIC BIOLOGY
in phylogenies estimated from DNA sequences, andintroduced a site-likelihood approach to identifycharacters supporting particular nodes. These methodsrespond to a major challenge in phylogenetics: howto integrate morphological and molecular data andestimate relationships between fossil and extant taxa(Manos et al. 2007; Wiens 2009).
Molecular scaffolds and combined data analysesreflect two divergent views on integratingmorphological data in phylogenetic analyses. Scaffoldswere originally introduced to overcome two analyticalchallenges in optimizing morphological and moleculardata: the use of MP and the lack of molecular data forextinct taxa (Springer et al. 2001). Since then, models ofmorphological evolution have been proposed, ML andBayesian methods to analyze combined data have beenimplemented (Lewis 2001; Ronquist and Huelsenbeck2003), and the impact of missing data on phylogenies hasbeen found to be small (Wiens and Moen 2008; Smithet al. 2009; Wiens 2009; Wiens and Tiu 2012). Instead,biological drivers of homoplasy, including ecologicalconvergence and correlations among character statesarising through development (Kangas et al. 2004; Evanset al. 2007; Dávalos et al. 2012), have emerged as themain reasons to use scaffolds (Springer et al. 2007; 2008).
The alternative to scaffolds is combining andsimultaneously analyzing morphological and moleculardata on equal footing (Hermsen and Hendricks 2008).The most compelling argument for using all availabledata (or the total evidence) is that all phylogeneticallyinformative characters should be used in estimatingphylogenies. In the absence of systematic error andif each character provides independent evidence forrelationships, combining all the data will then providethe most powerful inference of phylogeny (de Queirozand Gatesy 2007). The profusion of genomic data hasrevealed that systematic error from mutational bias canbe common (Phillips et al. 2004; Baurain et al. 2007;Dávalos and Perkins 2008), and natural selection cangenerate significant support for incorrect relationships(Castoe et al. 2009; Li et al. 2010; Liu et al. 2010; Davieset al. 2012). Constraints imposed by the genetic code,molecular interactions, selection, and developmentalprocesses imply that many molecular and morphologicalcharacters are not independent (Emerson and Hastings1998; O’Keefe and Wagner 2001; Carrier et al. 2005; Skellyet al. 2013). Critically, models that account for varioussources of systematic error with molecular sequencesare available and can be deployed when saturation ormutational biases are suspected (Muse and Gaut 1994;Schöniger and Von Haeseler 1994; Gruber et al. 2007;Blanquart and Lartillot 2008). Comparisons between locimake the detection of convergence shaped by adaptationpossible with genetic data (Castoe et al. 2009; Li et al.2010; Liu et al. 2010; Dávalos et al. 2012; Davies et al.2012).
Models of morphological evolution have been slowerto develop, as states across different morphologicalcharacters are fundamentally incomparable, an obstaclethat is largely absent for molecular characters. The
most widely applied model (Mkv), for example,is an extension of the Jukes-Cantor model firstpublished in 1969 (Jukes and Cantor 1969). Under thesecircumstances, methods that analyze the accumulationof homoplasy and correlated evolution in morphologicalcharacters can be helpful in deciding whether systematicerror is present, or if the assumption of independence iswarranted (Emerson and Hastings 1998; Wagner 2000;O’Keefe and Wagner 2001). Detecting and correctingfor the consequences of ecological convergence drivenby morphological adaptation is more difficult becauseit requires comparisons to independent phylogenies(Wiens et al. 2003; Holland et al. 2010). Either way,investigating the data structure and phylogenetic signalof morphological characters can help illuminate thebiological processes that shape phenotypic variation(Dávalos et al. 2012).
As in previous analyses of the phylogeny of extantmammals (Springer et al. 2007; 2008; Lee and Camens2009), morphological characters rejected relationshipsestimated with multiple loci (Supplementary Fig. S1).Based on their results, Springer et al. (2007) concludedthat homology and homoplasy were confounded inmorphological characters, with homoplasy prevailing,and cautioned the need to reexamine both charactercoding and models of morphological evolution.Hermsen and Hendricks (2008) and Asher et al.(2008) have nonetheless argued for combining databecause the evidence of morphological homoplasy arosethrough incongruence with molecular phylogenies, andincongruence may also be observed among differentgenes. The data structure of our new morphologicalcharacters, however, revealed the randomization ofphylogenetic signal and the potential for amplificationof homoplasy before any comparisons with molecularphylogenies were undertaken.
Just as the randomization of phylogenetic signal can beassessed in molecular data by examining the saturationof changes through time, the plateauing of new characterstates relative to changes in the phylogeny or characterexhaustion indicates homoplasy in morphological data(Wagner 2000). The pattern of character state changesfound in the dental character data in phyllostomidsindicates saturation (Fig. 2). Another indication ofsaturation in molecular data is a high rate of changeinferred using model-based methods. We comparedthe relative rates of change estimated for each of thepartitions in Bayesian combined analyses (mi parametersin MrBayes output) to compare the morphologicalrate of change to molecular substitution rates. With amedian rate of 2.9 substitutions/character, the relativerate of change for the morphological partition wasthe second highest observed. Only mitochondrial thirdcodon positions had higher rates of change (median = 7.2substitutions/site), and morphological rates of changewere consistently at least one order of magnitude higherthan all nuclear partitions and most mitochondrial data(Table 2). Both saturation and high rates of changeindicate homoplasy is present in the morphologicaldata.
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 15 1–19
2014 DÁVALOS—SATURATED AND NON-INDEPENDENT MORPHOLOGICAL DATA 15
TABLE 2. Posterior estimates of relative rates of change fromnon-clock combined Bayesian analysis of combined data, ranked fromfastest to slowest
Partition Median 95% HPD lower 95% HPD upper
Mt pos. 3 7.244 6.710 7.801Morphology 2.889 2.553 3.240Mt rRNA loops 1.875 1.718 2.033Mt pos. 1 0.478 0.429 0.533Mt rRNA stems 0.364 0.327 0.403Introns 0.263 0.241 0.285Mt pos. 2 0.128 0.101 0.160Nuclear 0.118 0.108 0.128
Notes: Rates are in substitutions/site for molecular data andchanges/character for morphology. HPD = high probability density,mt = mitochondrial, pos. = position, rRNA = ribosomal RNA.
High rates of change alone cannot explain howsaturation in morphological changes affected thecombined phylogeny. If changes from one character stateto another were random relative to common descent,and independent from one another, then resulting treesshould be unresolved or weakly supported. Stronglysupported conflicts must arise through additionalprocesses besides high rates of change. One of thenodes uncovered in combined analyses and identifiedas conflicting with prior hypotheses was stronglysupported (Bayesian posterior probability >0.90 inFig. 4B and ML bootstrap >75% in Fig. 5A). Evolutionaryprocesses that superimposed on saturated changes resultin strong homoplasious signal have been extensivelystudied in molecular data (Phillips et al. 2004). Insome lineages, for example, mutational biases towardAT at 4-fold degenerate positions produce significantlyconflicting signal at saturated sites (Gruber et al. 2007;Dávalos and Perkins 2008). This process generatescharacter states that are both homoplastic and correlated,as all sites throughout the genome experience similarmutational bias and end up having similar characterstates.
One consequence of proposed functional anddevelopmental constraints on the phenotype is thenon-independence of morphological character states.As expected if functional and developmental processesoperated simultaneously on multiple aspects ofmorphology, dental characters were much more similarto one another than molecular characters (Fig. 3).These analyses were not meant to infer particularassociations between character states. There arehypothesis-testing approaches to infer such associations(Huelsenbeck et al. 2003; Pagel and Meade 2006),but these were not designed to conduct all possiblepairwise comparisons (Bollback 2006). The dissimilarityanalyses do suggest pervasive non-independencein morphological characters and, together with thesaturation of character state changes, have implicationsfor integrating morphology in phylogenetic analyses.
The morphological characters employed in this studywere saturated and recurrently encoded similar patternsof variation, indicating direct unweighted analyses of
these data would be inappropriate. The frequencydistribution of morphological dissimilarities suggeststhat new models of evolution will need to account fornon-independence between morphological characters.For molecular data, such models (e.g., doublet and codonmodels) are computationally intensive and knownbiochemical properties of nucleotides and proteins helpdefine and estimate parameters of correlated evolution(Muse and Gaut 1994; Schöniger and Von Haeseler1994). The processes that generate correlations amongmorphological traits are understood best for mammalianteeth and angiosperm leaves, and these might be thebest targets for developing and testing more complexmodels (Kangas et al. 2004; Geeta et al. 2012). Finally,the assumption of independence among charactersthat underlies the key argument for total evidence isviolated to a greater extent with morphological thanwith molecular data. These findings warrant closeexamination of the morphological characters to betteraccount for their combination of phylogenetic andhomoplasious saturated signal.
We introduced a new method to identify stronglyconflicting morphological characters and reduce theimpact of homoplasy on combined phylogeneticanalyses. Unfortunately, the characters supporting thosenodes could not have been identified a priori. Forexample, nectar feeding in phyllostomids is associatedwith modifications to the incisors (Carstens et al. 2002),but none of the characters supporting the nectar-feeding clade involved the incisors (Fig. 6A). In aprecedent for these results, Carstens et al. (2002) foundno significant conflict between the phylogenetic signalsfrom incisor, hyoid, and tongue characters involvedin nectarivory and signal from other morphologicalcharacters. Upper and lower dentition characters, mostlyfrom the premolars and molars, supported the other twonodes. Despite the focus on only three nodes, excludinga subset of characters a posteriori reduced the number ofstrongly supported conflicts across the entire phylogeny(Supplementary Table S6, cf. Fig. 4A and 5).
Instead of providing the most powerful phylogenetichypothesis, our results suggest that combined analysesmay reflect homoplasy in the morphological data.Given the saturation, high rates of evolution, non-independence, conflicting signal, and inadequacy ofcurrent models of evolution for inferring phylogeny fromdental characters, the statistical scaffold summarizes thephylogenies that best capture phylogenetic signal in thedata (Fig. 4A).
Phylogeny of Miocene PhyllostomidaeWe undertook the first family-wide analyses of
Miocene fossil phylogeny. The integrative phylogeniesanalyzed here confirmed the close relationship betweenthe two †Notonycteris species and the carnivorousgenera Vampyrum and Chrotopterus (Czaplewski et al.2003b) (Figs. 4 and 5). Since its original description,†Palynephyllum has been hypothesized to be a nectar-feeding lineage based on its morphology (Czaplewski
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 16 1–19
16 SYSTEMATIC BIOLOGY
et al. 2003b). That hypothesis was supported in allphylogenetic analyses (Figs. 4 and 5). The resolutionvaried between †Palynephyllum being nested within aspurious nectar-feeding clade in combined Bayesiananalyses (Fig. 4B), or nesting among the lonchophyllinesPlatalina, Xeronycteris, and Lionycteris (Figs. 4A and5). The first resolution is equivalent to the datingconstraint imposed in several previous dating analysesfor phyllostomids (Baker et al. 2010; Dávalos 2010; Rojaset al. 2011; Dumont et al. 2012), while the second isunprecedented and may change ingroup divergencedates. The difficulties with integrating morphologicalcharacters in phylogenetic analyses and fragmentarynature of the †Palynephyllum remains suggest cautionin concluding that this fossil is a crown lonchophylline.An independent line of evidence—biogeography—supports the latter conclusion.
Biogeography of †Palynephyllum.—In glossophagines,known diversification events among genera areexclusively restricted to Mesoamerica and the WestIndies (Dávalos 2010; Rojas et al. 2012). In contrast, bothgeneric and species-level diversity of lonchophyllinesis concentrated in South America (Gregorin andDitchfield 2005; Mantilla-Meluk 2007). In the Miocene,the La Venta site was isolated from Mesoamerica andinterconnected to South America (Hoorn et al. 1995).A phylogenetic placement of the fossil †Palynephyllumas a stem glossophagine or a stem plant-visitingphyllostomid would imply that, outside the genusAnoura, glossophagines either failed to diversify on theSouth Americas continent or did diversify and thenwent extinct. These hypotheses are both plausible, butrequire additional explanations. †Palynephyllum tracesthe evolutionary history of the lonchophyllines to SouthAmerica, where all known diversification events in thatclade have unfolded and is thus consistent with boththe paleogeography of the region and the biogeographyof diversification in nectar-feeding phyllostomids.
CONCLUSIONS
By investigating the rates of evolution and quantifyingthe similarity between dental characters scored inphyllostomid bats and their relatives, we demonstratethat these morphological data show saturation andnon-independence. We introduce two approaches tocompensate for these model violations, one extendingmolecular scaffolds to posterior series of trees in alikelihood framework, and the other identifying stronglyconflicting characters for subsequent character analyses.Applying these new methods to analyses integratingmolecular and morphological data, we confirm closerelationships among the extant carnivorous generaChrotopterous and Vampyrum, and †Notonycteris; andsupport the novel conclusion that †Palynephyllum is amember of the nectarivorus crown Lonchophyllinae.Both fossils trace the history of their clades to the middleMiocene of northern South America. By quantifyinghomoplasious signal and similarities, and identifying
strongly conflicting characters, the approaches wedeveloped here are one step toward future methodsto model the evolution of saturated and correlatedmorphological characters.
SUPPLEMENTARY MATERIAL
Data available from the Dryad Digital Repository:http://dx.doi.org/10.5061/dryad.pg4c0.
FUNDING
The National Science Foundation (DEB-0949759 toL.M.D. and DEB-0949859 to N.B.S.).
ACKNOWLEDGMENTS
We thank Noa Krawczyk, James Proffitt, Dani Saks,and Susan Tsang for collection of morphological data,and the NSF Research Experience for Undergraduates(REU) program at the AMNH for supporting Saks,Krawczyk, and Proffitt. This work would have beenimpossible without the work of collectors and theinfrastructure provided by natural history collections.For access to morphological specimens we thank thestaff at the Mammalogy Department of the AmericanMuseum of Natural History (AMNH); the Field Museumof Natural History (FMNH); the Florida Museum ofNatural History (FLMNH); the Instituto de CienciasNaturales (ICN. Bogotá, Colombia); the InstitutoNacional de Investigaciones Geológico-Mineras (IGM,Bogota, Colombia); the Museo de Historia Natural,Universidad Nacional Mayor de San Marcos (MUSM,Lima, Peru); the Museum of Vertebrate Zoology at theUniversity of California, Berkeley (MVZ); the RoyalOntario Museum (ROM); the Natural Science ResearchLaboratory at Texas Tech University (TTU); Bill Clemensat the University of California Museum of Paleontology(UCMP); and Kris Helgen at the Smithsonian Institution(USNM). For tissue loans, we thank the staff at theAmbrose Monell Cryo-Collection at AMNH, and BrucePatterson and John D. Phelps at the FMNH. For greatideas and editing for Figure 1, we thank Romain Dahan,Samantha DelSerra, Jacob Roday, Danny Rojas, Laurel R.Yohe, and Xiaoyu Zhang. We thank Nicole M. Gerardofor finding a way to render morphological characterstates comparable. Part of this work was performedon the Abel Cluster, owned by the University of Osloand the Norwegian metacenter for High PerformanceComputing (NOTUR), and operated by the ResearchComputing Services group at USIT, the University ofOslo IT department.
REFERENCES
Alheit K., Reif J., Maurer H., Hahn V., Weissmann E., MiedanerT., Wurschum T. 2011. Detection of segregation distortion loci intriticale (x Triticosecale Wittmack) based on a high-density DArTmarker consensus genetic linkage map. BMC Genomics 12:380.
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 17 1–19
2014 DÁVALOS—SATURATED AND NON-INDEPENDENT MORPHOLOGICAL DATA 17
Asher R.J., Geisler J.H., Sánchez-Villagra M.R. 2008. Morphology,paleontology, and placental mammal phylogeny. Syst. Biol.57:311–317.
Baker R.J., Bininda-Emonds O.R.P., Mantilla-Meluk H., Porter C.A.,Van Den Bussche R. 2010. Molecular timescale of diversificationof feeding strategy and morphology in New World leaf-nosedbats (Phyllostomidae): a phylogenetic perspective. In: GunnellG.F., Simmons N.B., editors. Evolutionary History of Bats:Fossils, Molecules and Morphology. Cambridge (MA): CambridgeUniversity Press.
Baker R.J. Porter C.A., Hoofer S.R., Van Den Bussche R.A. 2003.Diversification among New World Leaf-Nosed bats: an evolutionaryhypothesis and classification inferred from digenomic congruenceof DNA sequence. Occasional Papers, Museum of Texas TechUniversity 230:1–32.
Baker R.J., Porter C.A., Patton J.C., Van Den Bussche R.A. 2000.Systematics of bats of the family Phyllostomidae based on RAG2DNA sequences. Occasional Papers, Museum of Texas TechUniversity 202:1–16.
Baurain D., Brinkmann H., Philippe H. 2007. Lack of resolution inthe animal phylogeny: closely spaced cladogeneses or undetectedsystematic errors? Mol. Biol. Evol. 24:6–9.
Bininda-Emonds O.R.P. 2005. transAlign: using amino acids tofacilitate the multiple alignment of protein-coding DNA sequences.BMC Bioinform. 6:156.
Blanquart S., Lartillot N. 2008. A site- and time-heterogeneous modelof amino-acid replacement. Mol. Biol. Evol. 25:842–858.
Bollback J.P. 2006. SIMMAP: stochastic character mapping of discretetraits on phylogenies. BMC Bioinform. 7:88.
Burnham K.P., Anderson D.R. 2002. Model selection and multimodelinference. New York: Springer-Verlag.
Carrier D.R., Chase K., Lark K.G. 2005. Genetics of canid skeletalvariation: size and shape of the pelvis. Genome Res. 15:1825–1830.
Carstens B.C., Lundrigan B.L., Myers P. 2002. A phylogenyof the neotropical nectar-feeding bats (Chiroptera: Phyllostomidae)based on morphological and molecular data. J. Mamm. Evol.9:23–53.
Castoe T.A., de Koning A.P.J., Kim H.-M., Gu W., Noonan B.P., NaylorG., Jiang Z.J., Parkinson C.L., Pollock D.D. 2009. Evidence for anancient adaptive episode of convergent molecular evolution. Proc.Natl Acad. Sci. USA 106:8986–8991.
Choate J.R., Birney E.C. 1968. Sub-recent Insectivora and Chiropterafrom Puerto Rico, with the description of a new bat of the genusStenoderma. J. Mammal. 49:400–412.
Clare E.L., Lim B.K., Engstrom M.D., Eger J.L., Hebert P.D.N. 2007. DNAbarcoding of Neotropical bats: species identification and discoverywithin Guyana. Mol. Ecol. Notes 7:184–90.
Clare E.L., Lim B.K., Fenton M.B., Hebert P.D.N. 2011. Neotropicalbats: estimating species diversity with DNA barcodes. PLoS One6:e22648.
Czaplewski N.J., Krejca J., Miller T.E. 2003a. Late quaternary batsfrom Cebada Cave, Chiquibul Cave System, Belize. Caribb. J. Sci.39:23–33.
Czaplewski N.J., Takai M., Naeher T.M., Shigehara N., Setoguchi T.2003b. Additional bats from the middle Miocene La Venta fauna ofColombia. Revista de la Academia Colombiana de Ciencias Físicas,Exactas y Naturales 27:263–282.
Datzmann T., von Helversen O., Mayer F. 2010. Evolution of nectarivoryin phyllostomid bats (Phyllostomidae Gray, 1825, Chiroptera:Mammalia). BMC Evol. Biol. 10:165.
Dávalos L.M. 2010. Earth history and the evolution of Caribbean bats.In: Fleming T.H., Racey P.A., editors. Island bats: ecology, evolution,and conservation. Chicago (IL): University of Chicago Press.p. 96–115.
Dávalos L.M., Cirranello A.L., Geisler J.H., Simmons N.B. 2012.Understanding phylogenetic incongruence: lessons fromPhyllostomid bats. Biol. Rev. 87:991–1023.
Dávalos L.M., Perkins S.L. 2008. Saturation and base compositionbias explain phylogenomic conflict in Plasmodium. Genomics91:433–442.
Dávalos L.M., Russell A.L. 2012. Deglaciation explains bat extinctionin the Caribbean. Ecol. Evol. 2:1–7.
Dávalos L.M., Turvey S. 2012. West Indian mammals: the old,the new, and the recently extinct. In: Patterson B.D., Acosta L.P.,editors. Bones, clones, and biomes: an extended history of recentneotropical mammals. Chicago (IL): University of Chicago Press,p. 157–202.
Davies K.T.J., Cotton J.A., Kirwan J.D., Teeling E.C., Rossiter S.J. 2012.Parallel signatures of sequence evolution among hearing genes inecholocating mammals: an emerging model of genetic convergence.Heredity 108:480–489.
Davis B.H., Poon A.F.Y., Whitlock M.C. 2009. Compensatory mutationsare repeatable and clustered within proteins. Proc. R. Soc. B276:1823–1827.
de Queiroz A., Gatesy J. 2007. The supermatrix approach to systematics.Trends Ecol. Evol. 22:34–41.
Drummond A.J., Ashton B., Buxton S., Cheung M., Cooper A., DuranC., Field M., Heled J., Kearse M., Markowitz S., Moir R., Stones-Havas S., Sturrock S., Thierer T., Wilson A. 2011. Geneious. Geneiousversion 5.3.6. Available from http://www.geneious.com/.
Dumont E.R., Dávalos L.M., Goldberg A., Voigt C.C., Rex K.,Santana S.E. 2012. Morphological innovation, diversification andthe invasion of a new adaptive zone. Proc. R. Soc. B 279:1797–1805.
Eick G.N., Jacobs D.S., Matthee C.A. 2005. A nuclear DNA phylogeneticperspective on the evolution of echolocation and historicalbiogeography of extant bats (Chiroptera). Mol. Biol. Evol. 22:1869–1886.
Emerson S.B., Hastings P.A. 1998. Morphological correlations inevolution: consequences for phylogenetic analysis. Q Rev Biol73:141–162.
Evans A.R., Wilson G.P., Fortelius M., Jernvall J. 2007. High-levelsimilarity of dentitions in carnivorans and rodents. Nature 445:78–81.
Fracasso M.P.D.A., Salles L.D.O. 2005. Diversity of quaternary batsfrom Serra da Mesa (State of Goiás, Brazil). Zootaxa 817:1–19.
Geeta R., Dávalos L.M., Levy A., Bohs L., Lavin M., Mummenhoff K.,Sinha N., Wojciechowski M. F. 2012. Keeping it simple: floweringplants tend to retain, and revert to, simple leaves. New Phytol.193:481–493.
Gower J.C. 1966. Some distance properties of latent root and vectormethods used in multivariate analysis. Biometrika 53:325–338.
Gregorin R., Ditchfield D.A. 2005. New genus and species ofnectar-feeding bat in the tribe Lonchophyllini (Phyllostomidae:Glossophaginae) from northeastern Brazil. J. Mammal. 86:403–414.
Gruber K., Voss R., Jansa S. 2007. Base-compositional heterogeneityin the RAG1 locus among didelphid marsupials: implications forphylogenetic inference and the evolution of GC content. Syst. Biol.56:83–96.
Gu X., Fu Y.X., Li W.H. 1995. Maximum likelihood estimation of theheterogeneity of substitution rate among nucleotide sites. Mol. Biol.Evol. 12:546–57.
Guindon S., Dufayard J.-F., Lefort V., Anisimova M., Hordijk W.,Gascuel O. 2010. New Algorithms and methods to estimatemaximum-likelihood phylogenies: assessing the performance ofPhyML 3.0. Syst. Biol. 59:307–21.
Handcock M.S. 2013. reldist: relative distribution methods. R package.Available from http://www.stat.ucla.edu/∼handcock/RelDist/.
Handcock M.S., Morris M. 1999. Relative distribution methods in thesocial sciences. New York: Springer.
Hermsen E.J., Hendricks J.R. 2008. W(h)ither fossils? Studyingmorphological character evolution in the age of molecularsequences. Ann. Mo. Bot. Gard. 95:72–100.
Holland B.R., Spencer H.G., Worthy T.H., Kennedy M. 2010. Identifyingcliques of convergent characters: concerted evolution in thecormorants and shags. Syst. Biol. 59:433–445.
Hoorn C., Guerrero J., Sarmiento G.A., Lorente M.A. 1995. Andeantectonics as a cause for changing drainage patterns in Miocenenorthern South America. Geology 23:237–240.
Huelsenbeck J.P., Nielsen R., Bollback J.P. 2003. Stochastic mapping ofmorphological characters. Syst. Biol. 52:131–158.
Jansa S.A., Goodman S.M., Tucker P.K. 1999. Molecular phylogenyand biogeography of the native rodents of Madagascar (Muridae:Nesomyinae): a test of the single-origin hypothesis. Cladistics15:253–270.
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 18 1–19
18 SYSTEMATIC BIOLOGY
Jukes T.H., Cantor C.R. 1969. Evolution of protein molecules. In: MunroH.N., editor. Mammalian protein metabolism. New York: AcademicPress. p. 21–132.
Kangas A.T., Evans A.R., Thesleff I., Jernvall J. 2004. Nonindependenceof mammalian dental characters. Nature 432:211–214.
Katoh K., Kuma K.-I., Toh H., Miyata T. 2005. MAFFT version 5:improvement in accuracy of multiple sequence alignment. NucleicAcids Res. 33:511–518.
Katoh K., Toh H. 2008. Recent developments in the MAFFT multiplesequence alignment program. Brief. Bioinform. 9:286–298.
Lanfear R., Calcott B., Ho S.Y.W., Guindon S. 2012.PartitionFinder: combined selection of partitioning schemesand substitution models for phylogenetic analyses. Mol. Biol. Evol.29:1695–1701.
Lee M.S.Y., Camens A.B. 2009. Strong morphological support for themolecular evolutionary tree of placental mammals. J. Evol. Biol.22:2243–2257.
Legendre P., Gallagher E. 2001. Ecologically meaningfultransformations for ordination of species data. Oecologia129:271–280.
Lewis P.O. 2001. A likelihood approach to estimating phylogeny fromdiscrete morphological character data. Syst. Biol. 50:913–925.
Li G., Wang J., Rossiter S.J., Jones G., Cotton J.A., Zhang S. 2008. Thehearing gene Prestin reunites echolocating bats. Proc. Natl Acad. Sci.USA 105:13959–13964.
Li Y., Liu Z., Shi P., Zhang J. 2010. The hearing gene Prestin unitesecholocating bats and whales. Curr. Biol. 20:R55–R56.
Liu Y., Cotton J.A., Shen B., Han X., Rossiter S.J., Zhang S. 2010.Convergent sequence evolution between echolocating bats anddolphins. Curr. Biol. 20:R53–R54.
Maechler M., Rousseeuw P., Struyf A., Hubert M., Hornik K. 2012.cluster: cluster analysis basics and extensions. R package version1.15.2.
Manos P.S., Soltis P.S., Soltis D.E., Manchester S.R., Oh S.-H., BellC.D., Dilcher D.L., Stone D.E. 2007. Phylogeny of extant andfossil juglandaceae inferred from the integration of molecular andmorphological data sets. Syst. Biol. 56:412–430.
Mantilla-Meluk H. 2007. Lonchophyllini, the Chocoan bats. RevistaInstitucional Universidad Tecnológica del Chocó 26:49–57.
Marengo E., Robotti E., Gianotti V., Righetti P.G., Cecconi D., DomeniciE. 2003. A new integrated statistical approach to the diagnostic useof two-dimensional maps. Electrophoresis 24:225–236.
Miller M.A., Pfeiffer W., Schwartz T. 2010. Creating the CIPRESscience gateway for inference of large phylogenetic trees. GatewayComputing Environments Workshop (GCE), 2010. p. 1–8.
Muggeo V.M.R. 2008. Segmented: an R package to fit regression modelswith broken-line relationships. R News 8:20–25.
Murphy W.J., O’Brien S.J. 2007. Designing and optimizing comparativeanchor primers for comparative gene mapping and phylogeneticinference. Nat. Protoc. 2:3022–3030.
Muse S.V., Gaut B.S. 1994. A likelihood approach for comparingsynonymous and nonsynonymous nucleotide substitution rates,with application to the chloroplast genome. Mol. Biol. Evol. 11:715–24.
O’Keefe FR, Wagner P.J. 2001. Inferring and testing hypotheses ofcladistic character dependence by using character compatibility.Syst. Biol. 50:657–675.
O’Leary M.A., Kaufman S.G. 2012. MorphoBank 3.0: Web applicationfor morphological phylogenetics and taxonomy. Available fromhttp://morphobank.org.
Pagel M. 1994. Detecting correlated evolution on phylogenies: a generalmethod for the comparative analysis of discrete characters. Proc. R.Soc. B 255:37–45.
Pagel M., Meade A. 2006. Bayesian analysis of correlated evolution ofdiscrete characters by reversible-jump Markov Chain Monte Carlo.Am. Nat. 167:808–825.
Paradis E., Claude J., Strimmer K. 2004. APE: analyses of phylogeneticsand evolution in R language. Bioinformatics 20:289–290.
Peterson L.E. 2003. Partitioning large-sample microarray-based geneexpression profiles using principal components analysis. Comput.Methods Programs Biomed. 70:107–119.
Phillips M.J., Delsuc F., Penny D. 2004. Genome-scale phylogeny andthe detection of systematic biases. Mol. Biol. Evol. 21:1455–1458.
Pyron R.A. 2011. Divergence time estimation using fossils as terminaltaxa and the origins of Lissamphibia. Syst. Biol. 60:466–480.
R Development Core Team. 2012. R: a language and environment forstatistical computing. Vienna, Austria: R Foundation for StatisticalComputing.
Rambaut A., Drummond A.J. 2007. Tracer. Available fromhttp://beast.bio.ed.ac.uk/Tracer.
Rambaut A., Drummond A.J. 2010. TreeAnnotator: MCMC Outputanalysis. Available from http://beast.bio.ed.ac.uk.
Rojas D., Vale Á., Ferrero V., Navarro L. 2011. When did plants becomeimportant to leaf-nosed bats? Diversification of feeding habits in thefamily Phyllostomidae. Mol. Ecol. 20:2217–2228.
Rojas D., Vale Á., Ferrero V., Navarro L. 2012. The role of frugivoryin the diversification of bats in the Neotropics. J. Biogeogr. 39:1948–1960.
Ronquist F., Huelsenbeck J.P. 2003. MrBayes 3: Bayesian phylogeneticinference under mixed models. Bioinformatics 19:1572–1574.
Ronquist F., Klopfstein S., Vilhelmsen L., Schulmeister S., Murray D.L.,Rasnitsyn A.P. 2012a. A total-evidence approach to dating withfossils, applied to the early radiation of the Hymenoptera. Syst. Biol.61:973–999.
Ronquist F., Teslenko M., van der Mark P., Ayres D.L., Darling A.,Höhna S., Larget B., Liu L., Suchard M.A., Huelsenbeck J.P. 2012b.MrBayes 3.2: Efficient Bayesian phylogenetic inference and modelchoice across a large model space. Syst. Biol. 61:539–542.
Santana S.E., Geipel I., Dumont E.R., Kalka M.B., Kalko E.K.V. 2011. Allyou can eat: high performance capacity and plasticity in the commonbig-eared bat, Micronycteris microtis (Chiroptera: Phyllostomidae).PLoS One 6:e28584.
Santana S.E., Grosse I.R., Dumont E.R.. 2012. Dietary hardness,loading behavior, and the evolution of skull form in bats. Evolution66:2587–2598.
Schöniger M., Von Haeseler A. 1994. A stochastic model for theevolution of autocorrelated DNA sequences. Mol. Phylogenet. Evol.3:240–247.
Simmons M.P., Freudenstein J.V. 2003. The effects of increasing geneticdistance on alignment of, and tree construction from, rDNAinternal transcribed spacer sequences. Mol. Phylogenet. Evol. 26:444–451.
Simmons N.B. 1996. A new species of Micronycteris (Chiroptera:Phyllostomidae) from northeastern Brazil, with comments onphylogenetic relationships. Am. Mus. Novit. 3158:1–35.
Simmons N.B. 1998. A reappraisal of interfamilial relationships of bats.In: Kunz T.H., Racey P.A., editors. Bats: phylogeny, morphology,echolocation and conservation. Washington (DC): SmithsonianInstitution Press. p. 3–26.
Simmons N.B. 2005. Order Chiroptera. In: Wilson D.E., Reeder D.M.,editors. Mammal species of the World: a taxonomic and geographicreference. Baltimore: Johns Hopkins University Press. p. 313–529.
Skelly D.A., Merrihew G.E., Riffle M., Connelly C.F., Kerr E.O.,Johansson M., Jaschob D., Graczyk B., Shulman N.J., WakefieldJ., Cooper S.J., Fields S., Noble W.S., Muller E.G.D., Davis T.N.,Dunham M.J., MacCoss M.J., Akey J.M. 2013. Integrative phenomicsreveals insight into the structure of phenotypic diversity in buddingyeast. Genome Res. 23:1496–1504.
Smith S., Beaulieu J., Donoghue M. 2009. Mega-phylogeny approach forcomparative biology: an alternative to supertree and supermatrixapproaches. BMC Evol. Biol. 9:37.
Springer M.S., Burk-Herrick A., Meredith R., Eizirik E., Teeling E.,O’Brien S.J., Murphy W.J. 2007. The adequacy of morphology forreconstructing the early history of placental mammals. Syst. Biol.56:673–684.
Springer M.S., Meredith R.W., Eizirik E., Teeling E., Murphy W.J.2008. Morphology and placental mammal phylogeny. Syst. Biol. 57:499–503.
Springer M.S., Teeling E.C., Madsen O., Stanhope M.J., de JongW.W. 2001. Integrated fossil and molecular data reconstruct batecholocation. Proc. Natl Acad. Sci. USA 98:6241–6246.
Stamatakis A. 2006. RAxML-VI-HPC: maximum likelihood-basedphylogenetic analyses with thousands of taxa and mixed models.Bioinformatics 22:2688–2690.
Stamatakis A., Hoover P., Rougemont J. 2008. A rapid bootstrapalgorithm for the RAxML Web servers. Syst. Biol. 57:758–771.
[15:53 20/4/2014 Sysbio-wwwmorphobank.org] Page: 19 1–19
2014 DÁVALOS—SATURATED AND NON-INDEPENDENT MORPHOLOGICAL DATA 19
Swofford D.L. 2002. PAUP*. Phylogenetic analysis usingparsimony (*and other methods). Sunderland (MA): SinauerAssociates.
Tavaré S. 1986. Some probabilistic and statistical problemson the analysis of DNA sequences. Lect. Math. Life Sci. 17:57–86.
Teeling E.C., Springer M.S., Madsen O., Bates P., O’Brien S.J., MurphyW.J. 2005. A molecular phylogeny for bats illuminates biogeographyand the fossil record. Science 307:580–584.
Wagner P.J. 2000. Exhaustion of morphologic character states amongfossil taxa. Evolution 54:365–386.
Wake D.B. 1991. Homoplasy: the result of natural selection, or evidenceof design limitations? Am. Nat. 138:543–567.
Wetterer A.L., Rockman M.V., Simmons N.B. 2000. Phylogeny ofphyllostomid bats (Mammalia: Chiroptera): data from diversemorphological systems, sex chromosomes, and restriction sites.Bull. Am. Mus. Nat. Hist. 248:1–200.
Wiens J.J. 2004. The role of morphological data in phylogenyreconstruction. Syst. Biol. 53:653–661.
Wiens J.J. 2009. Paleontology, genomics, and combined-dataphylogenetics: can molecular data improve phylogeny estimationfor fossil taxa? Syst. Biol. 58:87–99.
Wiens J.J., Chippindale P.T., Hillis D.M. 2003. When are phylogeneticanalyses misled by convergence? A case study in Texas cavesalamanders. Syst. Biol. 52:501–514.
Wiens J.J., Kuczynski C.A., Townsend T., Reeder T.W., Mulcahy D.G.,Sites J.W. 2010. Combining phylogenomics and fossils in higher-levelsquamate reptile phylogeny: molecular data change the placementof fossil taxa. Syst. Biol. 59:674–688.
Wiens J.J., Moen D.S. 2008. Missing data and the accuracy of Bayesianphylogenetics. J. Syst. Evol. 46:307–314.
Wiens J.J., Tiu J. 2012. Highly incomplete taxa can rescue phylogeneticanalyses from the negative impacts of limited taxon sampling. PLoSOne 7:e42925.
Wilcoxon F. 1945. Individual comparisons by ranking methods.Biometr. Bull. 1:80–83.
Wilkinson M. 1995. A comparison of two methods of characterconstruction. Cladistics 11:297–308.
Yang Z. 1994. Maximum likelihood phylogenetic estimation from DNAsequences with variable rates over sites: approximate methods.J. Mol. Evol. 39:306–314.
Zharkikh A. 1994. Estimation of evolutionary distances betweennucleotide sequences. J. Mol. Evol. 39:315–329.