Elements of regional beetle faunas: faunal variation and compositional breakpoints along climate,...

15
Elements of regional beetle faunas: faunal variation and compositional breakpoints along climate, land cover and geographical gradients Jani Heino 1 * and Janne Alahuhta 2 1 Finnish Environment Institute, Natural Environment Centre, Biodiversity, PO Box 413, FI-90014 Oulu, Finland; and 2 Department of Geography, University of Oulu, PO Box 3000, FI-90014 Oulu, Finland Summary 1. Regional faunas are structured by historical, spatial and environmental factors. We studied large-scale variation in four ecologically different beetle groups (Coleoptera: Dytiscidae, Cara- bidae, Hydrophiloidea, Cerambycidae) along climate, land cover and geographical gradients, examined faunal breakpoints in relation to environmental variables, and investigated the best fit pattern of assemblage variation (i.e. randomness, checkerboards, nestedness, evenly spaced, Gleasonian, Clementsian). We applied statistical methods typically used in the analysis of local ecological communities to provide novel insights into faunal compositional patterns at large spatial grain and geographical extent. 2. We found that spatially structured variation in climate and land cover accounted for most variation in each beetle group in partial redundancy analyses, whereas the individual effect of each explanatory variable group was generally much less important in accounting for varia- tion in provincial species composition. 3. We also found that climate variables were most strongly associated with faunal break- points, with temperature-related variables alone accounting for about 20% of variation at the first node of multivariate regression tree for each beetle group. The existence of faunal break- points was also shown by the ‘elements of faunal structure’ analyses, which suggested Clem- entsian gradients across the provinces, that is, that there were two or more clear groups of species responding similarly to the underlying ecological gradients. 4. The four beetle groups showed highly similar biogeographical patterns across our study area. The fact that temperature was related to faunal breakpoints in the species composition of each beetle group suggests that climate sets a strong filter to the distributions of species at this combination of spatial grain and spatial extent. This finding held true despite the ecologi- cal differences among the four beetle groups, ranging from fully aquatic to fully terrestrial and from herbivorous to predaceous species. 5. The existence of Clementsian gradients may be a common phenomenon at large scales, and it is likely to be caused by crossing multiple species pools determined by climatic and his- torical factors on the distributions of species. Key-words: assemblage composition, compositional gradients, environmental gradients, idealized models, multivariate regression trees, partial redundancy analysis, regionalisation Introduction Regional biotas develop under the influences of evolution- ary, historical and environmental factors. These factors can be understood as a spatiotemporal continuum, with evolutionary factors largely driving differences in species composition among broad geographic regions over long time periods, historical factors such as the most recent ice age causing variation among smaller regions, and environ- mental factors being important not only at large spatial scales but also within smaller regions (Brown & Lomolino 1998; Mittelbach 2012). Examining the relative impor- tance of these factors for faunal composition in different *Correspondence author. E-mail: jani.heino@environment.fi © 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society Journal of Animal Ecology 2014 doi: 10.1111/1365-2656.12287

Transcript of Elements of regional beetle faunas: faunal variation and compositional breakpoints along climate,...

Elements of regional beetle faunas: faunal variation

and compositional breakpoints along climate, land

cover and geographical gradients

Jani Heino1* and Janne Alahuhta2

1Finnish Environment Institute, Natural Environment Centre, Biodiversity, PO Box 413, FI-90014 Oulu, Finland; and2Department of Geography, University of Oulu, PO Box 3000, FI-90014 Oulu, Finland

Summary

1. Regional faunas are structured by historical, spatial and environmental factors. We studied

large-scale variation in four ecologically different beetle groups (Coleoptera: Dytiscidae, Cara-

bidae, Hydrophiloidea, Cerambycidae) along climate, land cover and geographical gradients,

examined faunal breakpoints in relation to environmental variables, and investigated the best

fit pattern of assemblage variation (i.e. randomness, checkerboards, nestedness, evenly spaced,

Gleasonian, Clementsian). We applied statistical methods typically used in the analysis of

local ecological communities to provide novel insights into faunal compositional patterns at

large spatial grain and geographical extent.

2. We found that spatially structured variation in climate and land cover accounted for most

variation in each beetle group in partial redundancy analyses, whereas the individual effect of

each explanatory variable group was generally much less important in accounting for varia-

tion in provincial species composition.

3. We also found that climate variables were most strongly associated with faunal break-

points, with temperature-related variables alone accounting for about 20% of variation at the

first node of multivariate regression tree for each beetle group. The existence of faunal break-

points was also shown by the ‘elements of faunal structure’ analyses, which suggested Clem-

entsian gradients across the provinces, that is, that there were two or more clear groups of

species responding similarly to the underlying ecological gradients.

4. The four beetle groups showed highly similar biogeographical patterns across our study

area. The fact that temperature was related to faunal breakpoints in the species composition

of each beetle group suggests that climate sets a strong filter to the distributions of species at

this combination of spatial grain and spatial extent. This finding held true despite the ecologi-

cal differences among the four beetle groups, ranging from fully aquatic to fully terrestrial

and from herbivorous to predaceous species.

5. The existence of Clementsian gradients may be a common phenomenon at large scales,

and it is likely to be caused by crossing multiple species pools determined by climatic and his-

torical factors on the distributions of species.

Key-words: assemblage composition, compositional gradients, environmental gradients,

idealized models, multivariate regression trees, partial redundancy analysis, regionalisation

Introduction

Regional biotas develop under the influences of evolution-

ary, historical and environmental factors. These factors

can be understood as a spatiotemporal continuum, with

evolutionary factors largely driving differences in species

composition among broad geographic regions over long

time periods, historical factors such as the most recent ice

age causing variation among smaller regions, and environ-

mental factors being important not only at large spatial

scales but also within smaller regions (Brown & Lomolino

1998; Mittelbach 2012). Examining the relative impor-

tance of these factors for faunal composition in different*Correspondence author. E-mail: [email protected]

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society

Journal of Animal Ecology 2014 doi: 10.1111/1365-2656.12287

regions is a prerequisite for understanding the interactions

between regional species pools and local communities

(e.g. Cornell 2013), delineating regions for environmental

assessment and conservation (e.g. Bailey 2010), and pre-

dicting the effects of global change on species distribu-

tions at large spatial scales (e.g. Elith & Leathwick 2009).

New insights into those biogeographical patterns can be

attained by applying analytical methods that have been

more typically applied in the analysis of local ecological

communities.

Regional faunas show multiple patterns of interest to

ecologists. These include variations in species richness,

endemism, rarity and composition, all of which may show

various patterns in space and time (Brown & Lomolino

1998; Gaston 2000). Elements of faunal composition can

also be evaluated with regard to which idealized assem-

blage pattern they fit best. Leibold & Mikkelson (2002)

devised an approach to examine such assemblage patterns,

whereby random distributions of species are contrasted

with five idealized models. These idealized models include

checkerboards (e.g. Gotelli & McCabe 2002), nestedness

(e.g. Patterson & Atmar 1986), evenly spaced gradients

(e.g. Tilman 1982), Gleasonian gradients (e.g. Gleason

1926) and Clementsian gradients (e.g. Clements 1916).

Although this approach of ‘elements of metacommunity

structure’ (EMS) was originally aimed at separating dif-

ferent patterns across a set of local communities (Leibold

& Mikkelson 2002), the same approach can be adapted to

examine distribution patterns and faunal structure at large

biogeographical scales, across regions or across sets of

islands (Presley & Willig 2010; Meynard et al. 2013).

The five idealized spatial patterns and randomness

should be tested simultaneously with the same data set

(Leibold & Mikkelson 2002; Dallas & Presley 2014),

although most previous studies have tested different pat-

terns in isolation. For example, checkerboards and nested-

ness are examples of such patterns typically examined in

isolation. Checkerboards refer to a situation where species

pairs have mutually exclusive distributions, whereby two

species replace each other across a set of sites (Diamond

1975; Gotelli & Graves 1996). While most studies have

examined checkerboards across a set of islands or patchy

mainland habitats (Gotelli & McCabe 2002; G€otzenberger

et al. 2012), there is no reason not to examine checker-

boards across large spatial scales on continents. For

example, Gotelli, Graves & Rahbek (2010) examined

checkerboard distributions of birds based on large grain

sizes at a regional extent and found that some species

pairs occurred together less often than expected by chance

expectations. Hence, checkerboards may also occur at

large grain sizes and large geographical extents, although

the mechanisms underlying such patterns should differ

between large-scale biogeographical studies and smaller-

scale studies encompassing local communities.

The nested subset pattern has also been studied exten-

sively in various systems, ranging from islands through

habitat patches to continental habitats (Wright et al.

1998). A perfectly nested pattern occurs when species

poor faunas comprise subsets of those in progressively

richer faunas, and when the species with the broadest

range include those of species with progressively smaller

ranges (Patterson & Atmar 1986). Nestedness has been

examined at large biogeographic scales, too, and these

studies have often found a significantly nested pattern

(Baselga 2008). However, very few studies have tried to

compare such biogeographical variation with other idea-

lised patterns, which have been studied more extensively

across local communities along habitat gradients.

The remaining spatial patterns can be divided between

three main idealized models. Evenly spaced gradients

describe a situation, where species range boundaries are

hyperdispersed along the underlying environmental gradi-

ent (Tilman 1982), indicating maximal differences among

species in environmental tolerances (Presley & Willig

2010). Based on the idea of Clementsian gradients, com-

munities are tight collections of two or more species

groups, responding similarly to environmental gradients

and hence showing discrete community boundaries (Cle-

ments 1916). In contrast, based on the idea of Gleasonian

gradients, each species responds to the environmental gra-

dients individualistically and communities are mere collec-

tions of species, whose ranges happen to overlap in space

and time (Gleason 1926). While the debate over commu-

nity gradients has been ongoing for almost a century,

most attention has been devoted to terrestrial plants and

examinations of local communities (Allen & Hoekstra

1992; McIntosh 1995; Hoagland & Collins 1997). It is

thus important to test the fit of these idealised models in

various other organismal groups, ecological systems and

spatial scales (Henriques-Silva, Lindo & Peres-Neto 2013;

Meynard et al. 2013).

The EMS approach is mainly a descriptive, pattern-based

approach and it does not tell us much about the underlying

mechanisms responsible for the patterns detected. There

has hence been recent interest in accounting for variation in

community composition using spatial and environmental

predictors (Cottenie 2005; Meynard et al. 2013). While

most studies have taken a continuum perspective, whereby

community composition has been examined using direct

gradient analysis methods, such as redundancy analysis

(RDA; Dray et al. 2012; Meynard et al. 2013), recently

developed constrained clustering methods, such as multi-

variate regression trees, allow not only description and pre-

diction of variation in community composition, but also

provide a means to find breakpoints of variation in assem-

blage composition (De’ath 2002; Borcard, Gillet & Legen-

dre 2011). Such breakpoints may occur when there is a

clear change in assemblage composition that is typically

associated with a clear change in environmental conditions.

Surprisingly, very few studies have assessed both continu-

ous variation and breakpoints of community composition

in the same study, and these studies have mainly considered

local habitat gradients within a small region (Davidson

et al. 2010; Sahuquillo & Miracle 2013; Heino, Ilmonen &

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

2 J. Heino & J. Alahuhta

Paasivirta 2014). However, it is at large spatial grain sizes

and geographical extents where the occurrence of break-

points in faunal compositions should be most easily dis-

cernible, because studies spanning large spatial scales are

likely to cross multiple regional species pools determined

by the effects of historical factors and climatic constraints

on the distributions of species (Brown & Lomolino 1998;

Heino 2011). Hence, those breakpoints should also be

examined using large-scale biotic, climatic and land cover

data. Such examinations are also valuable owing to the fact

that they provide important information about biogeo-

graphic regionalisation for environmental assessment and

biodiversity conservation (Bailey 2010; Rueda, Rodriguez

& Hawkins 2010).

We examined geographical variation and breakpoints in

the regional faunal composition of four beetle groups

(Coleoptera: Carabidae, Dytiscidae, Hydrophiloidea,

Cerambycidae), showing a wide variety of habitats (both

aquatic and terrestrial groups) and trophic modes (rang-

ing from predators through omnivores to herbivores).

Our study region comprised the whole Fennoscandia and

Denmark, where large gradients in climatic conditions

(i.e. from temperate to arctic regions), land cover (i.e.

from deciduous forest to tundra landscapes) and land use

(i.e. heavily anthropogenically modified to near-pristine

regions) occur in a relatively small geographical area. We

specifically examined the following questions: (i) What are

the contributions of climatic, land cover and spatial vari-

ables to variation in faunal composition? (ii) What envi-

ronmental factors best determine the breakpoints in

faunal composition if such breakpoints exist? (iii) Which

idealized model best fits the empirical data of the four

beetle families? (iv) Are there differences in these patterns

among aquatic (Dytiscidae, Hydrophiloidea) and terres-

trial (Carabidae, Cerambycidae) beetles? Based on evi-

dence from previous studies, we expected that the aquatic

and terrestrial groups would respond relatively similarly

to climatic gradients (V€ais€anen, Heli€ovaara & Immonen

1992; Heino 2001), but show different responses to land

cover gradients due to different habitat requirements.

Materials and methods

study area

We analysed biological and environmental data from the 101

biogeographic provinces belonging to Denmark, Sweden, Norway

and Finland (V€ais€anen, Heli€ovaara & Immonen 1992; V€ais€anen

& Heli€ovaara 1994). Prior to the analyses, we merged small

coastal provinces in Norway to provide a better and more accu-

rate representation of species ranges. We also omitted a single

small Swedish island province (Gotska Sand€o), as it was a clear

outlier in preliminary ordination analyses. After these modifica-

tions, the number of provinces to be analysed decreased from 101

to 79. Each province has typical characteristics of climate and

land cover, thereby ‘province’ being a relatively homogeneous

study unit. The insect faunas in the study area are relatively well

known, offering good data for insect groups used infrequently in

biogeographical analyses (V€ais€anen, Heli€ovaara & Immonen

1992). Thus, although most insect biodiversity studies are affected

by variation in detection of species, we are confident that these

data adequately characterize provincial assemblages and are rep-

resentative of true ecological patterns. Furthermore, there are

actually certain benefits of using biogeographical province as

sampling units rather than using equal area map grids. First, any

change in the spatial grain of the grid system has strong impacts

on the final outcomes. Secondly, in case of fragmented data, most

of such grid cells tend to appear empty, although they are very

likely to be occupied by the target species. Use of more natural

geographical units may hence be superior to grid systems, espe-

cially when distributional data are fragmented, as is the case of

most insect groups. We hence used the 79 provinces as sampling

units (i.e. grain size) and, hence, the species recorded from each

province were pooled to represent a single assemblage.

the study organisms and species data

We analysed the literature data for four beetle groups, including

ground beetles (Carabidae; Lindroth 1985, 1986), diving beetles

(Dytiscidae; Nilsson & Holmen 1995), longhorn beetles (Ceram-

bycidae; Bily & Mehl 1989) and water scavenger beetles (Hydro-

philoidea; Holmen 1987). Ground beetles comprise a large family

of mainly terrestrial beetles, which are predatory, omnivorous,

granivorous or herbivorous species as adults and mostly preda-

ceous as larvae (Lindroth 1985). Diving beetles occur in fresh

waters and, in some cases, brackish water environments, and they

are mostly predaceous as larvae and predators or scavengers as

adults (Nilsson & Holmen 1995). Longhorn beetles are primarily

forest insects that live in timber as larvae and feed on various

types of plant material as adults (Bily & Mehl 1989). Water scav-

enger beetles are mainly predaceous, scavenging or herbivorous

as adults and predaceous as larvae. They occur chiefly in fresh-

water or semi-aquatic environments (Holmen 1987). These four

groups thus comprise main life styles and habitats amongst bee-

tles, allowing us to analyse variation in ecologically highly differ-

ent, yet phylogenetically relatively closely related organisms.

The diversity of these four beetle groups is considerable across

the 79 provinces considered in this study. The most diverse group is

Carabidae (total number of species = 388; mean = 159, SD =

56�9), followed by Dytiscidae (total number of species = 155;

mean = 78�9, SD = 19�3), Cerambycidae (total number of species =

118; mean = 46�5, SD = 17�6) and Hydrophiloidea (total number

of species = 116; mean = 52�4, SD = 22�6).

climate variables

Climate variables included average annual temperature (°C),

maximum temperature of the warmest month (°C), minimum

temperature of the coldest month (°C), precipitation of the wet-

test month (mm) and precipitation of the driest month (mm).

The climate variables were average values for each biogeographi-

cal province and were derived from WORLDCLIM with

0�93 km 9 0�93 km resolution (Hijmans et al. 2005).

land cover variables

Land cover and land use variables consisted of percentages of

fresh water, forests, open areas, wetlands, agricultural areas and

urban development. Land cover and land use variables were

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

Elements of regional beetle faunas 3

obtained from European CORINE with 100 m resolution.

Although European CORINE data are temporally overlapping,

yet of recent origin, in comparison to the insect data that have

accumulated within a longer time period, we considered our data

to represent well provincial differences in land use. This is

because the southernmost provinces (i.e. which are most heavily

burdened by human land use nowadays) have differed from the

northernmost provinces (i.e. which are very little modified even

nowadays) in land use for a long time. For example, major

deforestation had already began in Denmark 3000 years ago

(Bradshaw 2005), whereas widespread human presence was not

evidenced in the nature of northern Scandinavia until 500 years

ago (van der Linden et al. 2008). We also used distance to the

sea as a variable representing marine influence on the insect

fauna. The starting point for the distance measurement was the

midpoint of a province, which is located at the centre of gravity

(i.e. centroid) of the polygon (i.e. true centroid of the province).

Finally, average elevation and elevation range within the province

were also included as land cover variables, as these variables are

related to the range of habitat types along elevation gradients.

Elevation variables were obtained from 3D Digital Elevation

Model over Europe with 25 m resolution.

province area

We considered province area (km2) as a separate group of predic-

tor variables, because at this scale, it is not clearly mechanisti-

cally associated with the environmental variables we considered.

Province area may also be considered as a proxy for sampling

effects (i.e. a larger province is likely to sample more species than

a smaller province), although sampling effects should be less

important for overall faunal structure (i.e. our focus in this study)

than for species richness.

spatial variables

We used an eigenfunction spatial analysis by means of the family

of Moran’s eigenvector maps to model spatial structures among

the provinces (Dray, Legendre & Peres-Neto 2006; Griffith &

Peres-Neto 2006). To do this, we ran principal coordinates of

neighbour matrix analyses (PCNM) based on Euclidean distance

among the provinces as input (Borcard & Legendre 2002; Bor-

card, Gillet & Legendre 2011). We retained the PCNM eigenvec-

tors with positive eigenvalues as explanatory variables in further

analyses (see below). Basically, the first PCNMs with large eigen-

values describe broadscale spatial structures, whereas those

PCNMs with small eigenvalues describe fine-scale spatial varia-

tion (Peres-Neto & Legendre 2010). The PCNMs are mutually

orthogonal and linearly unrelated spatial variables and can be

used to model spatial variation in provincial species composition

in this study. Significant spatial variation as related to PCNMs

may be a consequence of environmental autocorrelation, dispersal

limitation or historical effects on provincial species composition

(Dray et al. 2012; Peres-Neto, Leibold & Dray 2012). PCNM

analysis was conducted using the R package PCNM (Legendre

et al. 2013).

statist ical methods

We first examined variation in species composition as opposed to

turnover in species composition (for definitions, see Anderson

et al. 2011) across the provinces, and therefore, we chose to use

constrained ordination and constrained clustering methods as our

main statistical tools (Legendre & Legendre 2012). Our main sta-

tistical method was canonical RDA that analyses variation in

species composition in relation to explanatory variables (Rao

1964). Prior to the RDAs, species data of each beetle group were

Hellinger-transformed to account for numerous zero values and

making the data analysable using linear methods (Legendre &

Gallagher 2001). We first selected variables in the final RDA

models of each set of variables (i.e. land cover, climate, spatial)

based on the forward selection method proposed by Blanchet,

Legendre & Borcard (2008). We next used partial redundancy

analysis (pRDA) to decompose variation in the provincial species

composition of each beetle group among climate, land cover, spa-

tial variables and province area following the protocols explained

in detail elsewhere (Borcard, Legendre & Drapeau 1992; Ander-

son & Gribble 1998; Legendre & Legendre 2012). Variation parti-

tioning among four sets of predictor variables results in

individual climate, individual land cover, individual spatial and

province area fractions, as well as their shared effects and unex-

plained variance. Adjusted coefficient of determination (adj. R2)

was reported in all variation partitioning analyses, because they

are unbiased measures of explained variation (Peres-Neto et al.

2006). RDAs and variation partitioning were conducted using the

R package vegan (Oksanen et al. 2013).

We constructed Mantel correlograms (Sokal & Oden 1978) for

each beetle group to examine spatial autocorrelation in faunal

composition across the provinces. Mantel correlograms are a

multivariate version of univariate Moran I correlograms and

have been shown to be effective in portraying spatial patterns

(Borcard & Legendre 2012). We used Hellinger-transformed spe-

cies data as the basis of Mantel correlograms. In these analyses,

the number of distance classes was based on Sturge’s rule (Bor-

card, Gillet & Legendre 2011), and the significance of Mantel

correlations at each distance class was based on Holm correction

(Holm 1979). Mantel correlograms were done using the R pack-

age vegan (Oksanen et al. 2013).

We examined breakpoints of faunal variation along climatic and

land cover gradients using multivariate regression tree analysis

(MRT; De’ath 2002). We used MRT based on Euclidean distance

of Hellinger-transformed multivariate species data (response vari-

ables) and province area, climate and land cover variables as

explanatory variables. We also ran trial analyses with latitude and

longitude included among the explanatory variables, but their

inclusion did not lead to clearly better MRT models. MRT forms

clusters of sites by repeatedly splitting the data, each split is defined

by a simple rule based on the values of explanatory variables, and

the splits are chosen to minimize the dissimilarity of sites within

clusters (De’ath 2002; Borcard, Gillet & Legendre 2011). MRT

results in a tree whose terminal site groups or ‘leaves’ are composed

of subsets of sites that are chosen to minimize the within-group

sums of squares. Each successive partitioning of data is defined by

a threshold value of one of the explanatory variables. This method

retains a solution with the greatest predictive power and can handle

a wide variety of situations, including a situation where assem-

blage–environment relationships are nonlinear (Borcard, Gillet &

Legendre 2011; Legendre & Legendre 2012). In brief, the computa-

tion of MRTs consists of (i) constrained partitioning of the data

and (ii) cross-validation of the results (De’ath 2002). In this study,

we chose the ‘best’ tree with the minimum cross-validated error

(CVRE). MRTs were constructed using the R package MVPART-

wrap (Ouellette & Legendre 2013).

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

4 J. Heino & J. Alahuhta

We also examined elements of provincial faunal structure based

on the original EMS approach and the ‘range perspective’ in our

analyses (Leibold & Mikkelson 2002). The analysis of the ele-

ments of faunal structure was based on three metrics: (i) coher-

ence, (ii) turnover and (iii) boundary clumping. Prior to

calculating these three metrics, provinces-by-species presence–

absence matrix is ordinated through reciprocal averaging (i.e. cor-

respondence analysis; Gauch 1982). Correspondence analysis

defines a latent gradient, ordering sites and species along that

gradient (Leibold & Mikkelson 2002; Presley & Willig 2010). The

first metric that we evaluated is coherence. It is based on calculat-

ing the number of embedded absences in the ordinated matrix

and, subsequently, comparing the observed value to a null distri-

bution of embedded absences from randomizations. An embed-

ded absence means that there is a gap in a species range (Leibold

& Mikkelson 2002; Presley & Willig 2010). A small number of

embedded absences lead to positive coherence, while a large num-

ber of embedded absences mean negative coherence. Significantly

negative coherence thus provides support to a checkerboard dis-

tribution of species, non-significant coherence to randomness,

and significantly positive coherence to nestedness, evenly spaced

gradients, Gleasonian gradients or Clementsian gradients (Lei-

bold & Mikkelson 2002). A second metric is called turnover. This

metric is evaluated if coherence is positive, and it is used for

deciding which gradient model best fits the data. Species turnover

is measured as the number of times one species replaces another

between two sites in an ordinated matrix (Presley & Willig 2010).

Significantly negative turnover is related to nestedness, whereas

non-significant (i.e. moderately positive) or significantly positive

turnover points to evenly spaced, Gleasonian or Clementsian gra-

dients (Leibold & Mikkelson 2002; Henriques-Silva, Lindo &

Peres-Neto 2013). These last three types of gradients can be sepa-

rated based on evaluation of boundary clumping (Leibold &

Mikkelson 2002; Presley, Higgins & Willig 2010). Boundary

clumping is analysed using Morisita’s index and a chi-square test

comparing observed and expected distributions of range bound-

ary locations. Values of this index that are not different from 1

indicate randomly distributed range boundaries (i.e. Gleasonian

gradients), values significantly larger than 1 indicate clumped

range boundaries (i.e. Clementsian gradients) and values signifi-

cantly < 1 indicate hyperdispersed range boundaries (i.e. evenly

spaced gradients). The significance of the index values for coher-

ence and turnover was tested using the fixed-equiprobable null

model (Gotelli 2000), where the species richness of each site was

maintained (i.e. row sums are fixed), but species occurrences were

assumed to be equiprobable (i.e. column sums are not fixed).

Because species richness typically varies along environmental gra-

dients, this null model makes sense ecologically (Presley et al.

2009). We also used a strict and conservative fixed–fixed null

model to find out if the null model used affected the results. In

the fixed–fixed null model, the species richness of each site was

maintained (i.e. row sums are fixed) and species occurrences were

the same as species frequencies of occupancy (i.e. column sums

are fixed). We used 999 simulations to provide random matrices

for testing coherence and turnover. Elements of faunal structure

were evaluated for each beetle group along primary (axis 1) cor-

respondence analysis axis based on each null model. Elements of

faunal structure were analysed using the R package metacom

(Dallas 2013) in the R environment (version 3.0.1, R Core Team

2013).

Although the original EMS approach (Leibold & Mikkelson

2002) has been criticized by some authors on the basis that a

data set may simultaneously show different significant patterns

(Ulrich & Gotelli 2013), other researchers have recently shown

that it is a good means to indicate ‘the best fit’ of an empirical

data set with one of the idealized patterns (Meynard et al. 2013;

Dallas & Presley 2014; de la Sancha et al. 2014). Furthermore,

one major difference between the EMS approach and the tradi-

tional nestedness and co-occurrence analyses is that the former

approach examines different patterns along a single major gradi-

ent in the data, while the latter two analyses examine a given pat-

tern in the whole sites-by-species matrix simultaneously (Presley,

Higgins & Willig 2010; Presley & Willig 2010).

We used a combination of k-means partitioning, chi-square

tests and correspondence analysis to assess the robustness of the

MRT results and to associate the findings from the MRT and

EMS analyses regarding faunal breakpoints. We hence used

k-means partitioning to cluster the provinces based on the number

of groups corresponding those from the MRTs at the first node

and based on the number of final MRT leaves. K-means parti-

tioning works by forming groups through identifying high-density

regions in the data and, contrary to other clustering methods, it

is based on a pre-determined number of final groups (Borcard,

Gillet & Legendre 2011). We used either Hellinger-transformed

or chi-square-transformed data as the input matrix in k-means

partitioning (for the transformations, see Legendre & Gallagher

2001). The Hellinger transformation connected the Hellinger-

based RDAs and MRTs and the chi-square transformation

connected the findings with correspondence analysis that is based

on chi-square distance (Gauch 1982). We compared the matches

between the MRT and k-means clusters using chi-square tests

(Borcard, Gillet & Legendre 2011). We also compared the

matches between the four beetle groups using chi-square tests.

K-means clustering was done using the R package Rcmdr (Fox

2005) and chi-square tests with permutation using the R package

coin (Hothorn et al. 2008). Finally, we plotted the province and

species scores of each beetle group on correspondence analysis

ordination plots to show that the MRT breakpoints at the first

node were associated with breaks in species distributions and pro-

vincial species composition detected in the EMS analysis.

Results

Eigenfunction spatial analysis produced 28 eigenvectors

with positive eigenvalues. These ranged from those

describing large-scale spatial trends (i.e. those with large

eigenvalues) to those related to small-scale spatial rela-

tionships among sites (i.e. those with small eigenvalues).

These spatial variables were subsequently used to describe

variation in the provincial species composition of each

beetle group. The spatial models in the RDAs included

the following numbers of significant spatial variables: div-

ing beetles 16 spatial variables, ground beetles 20 spatial

variables, water scavenger beetles 17 spatial variables and

longhorn beetles 13 spatial variables (Appendix S1, Sup-

porting information). In general, the spatial eigenvector

variables (SEVs) with the largest eigenvalues (SEV1 and

SEV2) were selected first for each beetle group, followed

by a variable combination of intermediate and small-scale

spatial variables depending on the beetle group.

Highly similar sets of significant climate and land cover

variables were selected in the final RDA models of each

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

Elements of regional beetle faunas 5

beetle group (Appendix S1, Supporting information). Var-

iation in the provincial species composition of diving bee-

tles was significantly accounted for by (in order of

importance) mean annual temperature, maximum temper-

ature, precipitation of the wettest month and minimum

temperature, as well as by agriculture, forests, wetlands,

altitude range and open area. For ground beetles, the

selected climate variables were the same as for diving bee-

tles, but the set of land cover variables included agricul-

ture, forests, wetland, altitude range, open area and water

systems area. For longhorn beetles, minimum tempera-

ture, maximum temperature, mean annual temperature

and precipitation of the wettest month were significant cli-

mate variables, whereas agriculture, forests, wetland, alti-

tude range, open area, urban area and water systems area

were significant land cover variables. The provincial spe-

cies composition of water scavenger beetles was closely

associated with mean annual temperature, maximum tem-

perature, precipitation of the wettest month and minimum

temperature, as well as significantly related to agriculture,

forests, wetlands, altitude range, open area and water sys-

tems area.

Variation partitioning based on pRDAs showed that

the three main groups of predictor variables and province

area accounted for variation in the species composition of

each beetle group highly similarly (Table 1). Thus, the

contributions of climate, land cover and spatial variables

were very high, accounting for 31–51% of variation in

provincial species composition. Of the individual effects

of the explanatory variable groups (Table 1), spatial vari-

ables were clearly the most important ones (variation

accounted for: 7�5–11�6%), followed by the minor individ-

ual effects of climate (0�7–2�7%) and land cover variables

(0�9–1�8%). There was also joint variation among two of

the variable groups, and even these joint effects typically

slightly overcame the individual effects of climate and

land cover variables (Table 1). Province area had clearly

weaker effects on provincial species composition than any

Table 1. Results of variation partitioning of species composition in each beetle group among four predictor variable groups (land cover,

climate, spatial and province area). Shown are total, individual and joint fractions of variation. Province area was not considered an

environmental variable, but was included to account for the species–area relationship. Significant individual fractions are in bold

Diving beetles Ground beetles Scavenger beetles Longhorn beetles

d.f. Adj. R2 d.f. Adj. R2 d.f. Adj. R2 d.f. Adj. R2

Fractions

Land cover = L 5 0�376 6 0�364 6 0�332 7 0�314Climate = C 4 0�424 4 0�398 4 0�343 4 0�342Spatial = S 16 0�462 20 0�513 17 0�419 13 0�398Area = A 1 0�188 1 0�192 1 0�139 1 0�157L+C 9 0�459 10 0�443 10 0�400 11 0�397L+S 21 0�509 26 0�543 23 0�466 20 0�441L+A 6 0�403 7 0�388 7 0�351 8 0�344C+S 20 0�526 24 0�546 21 0�456 17 0�450C+A 5 0�426 5 0�403 5 0�347 5 0�348S+A 17 0�465 21 0�52 18 0�425 14 0�407L+C+S 25 0�534 30 0�559 27 0�475 24 0�473L+C+A 10 0�462 11 0�449 11 0�404 12 0�402L+S+A 22 0�518 27 0�549 24 0�475 21 0�451C+S+A 21 0�528 25 0�554 22 0�466 18 0�459L+C+S+A 26 0�537 31 0�564 28 0�482 25 0�477

Individual fractions

L| C+S+A 5 0�009 6 0�010 6 0�015 7 0�018C| L+S+A 4 0�019 4 0�016 4 0�007 4 0�027S| L+C+A 16 0�075 20 0�116 17 0�077 13 0�076A| L+C+S 1 0�003 1 0�006 1 0�007 1 0�005

Joint fractions

L∩C 0 0�045 0 0�018 0 0�034 0 0�025C∩S 0 0�040 0 0�044 0 0�047 0 0�031L∩S 0 0�027 0 0�035 0 0�042 0 0�035L∩A 0 0�000 0 0�002 0 0�004 0 0�004C∩A 0 0�006 0 0�000 0 0�002 0 0�005S∩A 0 0�000 0 0�000 0 �0�002 0 0�000L∩C∩A 0 �0�006 0 0�000 0 �0�007 0 �0�005L∩C∩S 0 0�134 0 0�133 0 0�120 0 0�109C∩S∩A 0 0�017 0 0�019 0 0�013 0 0�020L∩S∩A 0 �0�001 0 0�000 0 �0�003 0 �0�002C∩L∩S∩A 0 0�169 0 0�168 0 0�127 0 0�131

Residuals 0 0�463 0 0�436 0 0�518 0 0�522

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

6 J. Heino & J. Alahuhta

of the three main predictor variable groups (Table 1). The

total amounts of variation attributable to the four explan-

atory variable groups were relatively high, ranging from

47�7% for longhorn beetles to 56�4% for ground beetles.

The Mantel correlograms were highly similar among

the beetle groups (Appendix S2, Supporting information).

There was significant positive autocorrelation at the small-

est distance classes, no significant autocorrelation at the

middle classes and significant negative autocorrelation at

the largest distance classes.

The MRT analyses showed that, for each beetle group,

temperature variables were most clearly associated with

faunal breakpoints. For diving beetles, mean annual

temperature alone accounted for 24% of variation in

species composition, followed by percentage open areas,

precipitation of the driest month, minimum temperature,

percentage agriculture cover and altitude range (Fig. 1).

These other variables were much weaker contributors to

the total explained variation of 48% than mean annual

temperature. For ground beetles, mean annual tempera-

ture was again associated with the strongest faunal break-

point, accounting for 23% of variation in species

composition, followed by clearly minor contributions

(< 8%) to the total explained variance of 51% by percent-

age urban areas, mean annual temperature, percentage

agriculture cover, distance to the sea, altitude range and

minimum temperature (Fig. 2). For water scavenger bee-

tles, maximum temperature accounted for 19% of varia-

tion, followed by minimum temperature, maximum

temperature, distance to the sea and percentage agricultural

Coe

ffici

ent o

f det

erm

inat

ion

(R2 )

48·12

45·78

41·43

36·44

31·07

24·33

0(1)

24·33%

Mean annual temp (oC) < 3·035

Mean annual temp (oC) >3·035

(2)6·74%

Open area (%) >0·185

Open area(%) < 0·185

n = 11

(#3)

(5)4·35%

Agriculture(%) < 0·015

Agriculture(%) >0·015

n = 11

(#5)

(11)2·35%

Alt range(m)

>1783

Alt range(m)

< 1783

n = 3

(#7)

n = 13

(#8)

(3)5·37%

Prec. of driest month (mm) >54·09

Prec. of driest month(mm) < 54·09

n = 3

(#10)

(7)4·99%

Min temp(oC)< –5·755

Min temp(oC) >–5·755

n = 19

(#12)

n = 19

(#13)

R2 × 100 = 48·1 % Error = 0·519 CV Error = 0·702 SE = 0·0371

Fig. 1. Multivariate regression tree for diving beetles. This is the ‘best’ tree with minimum cross-validated error.

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

Elements of regional beetle faunas 7

cover (Fig. 3). These variables explained 43% of variation

in species composition. For longhorn beetles, minimum

temperature alone accounted for 21% of variation in spe-

cies composition, followed by percentage forest cover, per-

centage urban land cover, maximum temperature and

distance to the sea (Fig. 4). These variables accounted for a

total of 41% of variation in species composition. The

results of the MRT analyses remained largely the same

when latitude and longitude were included in the set of the

explanatory variables. Their inclusion did not significantly

increase the explanatory power of the MRTs, and they

appeared only in the lower nodes of the trees (results not

shown).

The EMS analysis indicated that each beetle group

showed (i) positive coherence (i.e. the number of embed-

ded absences was lower than expected by chance), (ii)

more turnover than expected by chance (i.e. the number

of replacements was higher than expected by chance) and

(iii) significantly higher boundary clumping than 1 (i.e.

based on Morisita’s I) (Table 2). The same result held

true for both fixed-equiprobable and fixed–fixed null mod-

els. These findings indicated that the beetle faunas fitted

best with Clementsian gradients along the primary axis of

variation in provincial species composition.

The groupings based on the MRTs and k-means parti-

tioning were very similar, and significant chi-square tests

strengthened this conclusion (Table 3). This result held

with both distance transformations of species data that

were used as the basis of k-means partitioning. Chi-square

tests also indicated that the breakpoints at the first MRT

node were highly similar between the beetle groups

(Table 4; see also Appendices S3 and S4, Supporting

information), and the same was true for k-means clusters

(Table 4). Finally, plotting the two groups of the first

Coe

ffici

ent o

f det

erm

inat

ion

(R2 )

51·61

49·71

47·29

44·39

41·25

37·71

31·4

23·61

0(1)

23·61%

Mean annual temp(oC)< 3·395

Mean annual temp(oC) >3·395

(2)

7·79%

Urban(%) < 0·005

Urban(%) >0·005

(4)3·54%

Agriculture(%) < 0·035

Agriculture(%) >0·035

n = 3

(#7)

(8)3·14%

Sea(km)

< 334·5

Sea(km)

>334·5

n = 8

(#5)

n = 10

(#6)

(5)2·90%

Altrange(m) < 938·2

Altrange(m) >938·2

n = 12

(#9)

n = 7

(#10)

(3)

6·31%

Mean annual temp(oC) < 7·015

Mean annual temp(oC) >7·015

n = 16

(#17)

(6)2·42%

Altrange(m) >1099

Altrange(m) < 1099

n = 3

(#13)

(13)1·90%

Min temp (C)< –8·668

Mintemp (C) >–8·668

n = 8

(#15)

n = 12

(#16)

R 2 × 100 = 51·6 % Error = 0·484 CV Error = 0·716 SE = 0·0338

Fig. 2. Multivariate regression tree for ground beetles. This is the ‘best’ tree with minimum cross-validated error.

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

8 J. Heino & J. Alahuhta

MRT node on CA ordination plots clearly indicated that

the breaks were strongly associated between those two

methods (Fig. 5), also indicating that MRT and EMS

found the same breaks in the beetle data sets.

Discussion

While a large number of studies have examined variation

in species richness across large spatial grains and extents

(see review by Field et al. 2009), fewer studies have exam-

ined species compositional variation at such broad scales

(V€ais€anen, Heli€ovaara & Immonen 1992; Baselga 2008;

Rueda, Rodriguez & Hawkins 2010). We found that

approximately half of variation in the provincial species

composition of the four beetle groups was accounted for

by the four groups of explanatory variables. In particular,

the shared effects of the variable groups were considerable

in partial RDAs, implying that both climate and land

cover vary geographically across the study area and affect

provincial species composition simultaneously. In fact,

there are clearly parallel gradients in land cover and cli-

mate across the study area. Hence, it is difficult to show

decisively which explanatory variable set is mostly respon-

sible for variation in species composition. However, given

that both climate and history often are related to varia-

tion in species richness (Gaston 2000; Fattorini & Ulrich

2012a; Cornell 2013) and species composition (V€ais€anen,

Heli€ovaara & Immonen 1992; Heino 2001; Fattorini &

Ulrich 2012b) along large geographical gradients, we

assume that they are also responsible for variation in bee-

tle species composition across the provinces in our study

area. These findings concur with previous studies in the

Coe

ffici

ent o

f det

erm

inat

ion

(R2 )

43·50

40·75

37·94

35·11

27·86

19·42

0(1)

19·42%

Maxtemp (oC) < 18·8

Max temp(oC) >18·8

(2)

7·25%

Min temp (oC) < –13·67

Min temp (oC) >–13·67

(4)

2·81%

Sea (km) < 360

Sea (km)>360

n = 7

(#4)

n = 6

(#5)

(5)

2·83%

Max temp(oC) < 14·57

Max temp(oC) >14·57

n = 5

(#7)

n = 11

(#8)

(3)

8·44%

Min temp (oC) < –9·762

Min temp (oC) >–9·762

n = 20

(#10)

(7)

2·75%

Agriculture(%) < 0·43

Agriculture(%) >0·43

n = 18

(#12)

n = 12

(#13)

R2 × 100 = 43·5 % Error : 0·565 CV Error : 0·78 SE : 0·0465

Fig. 3. Multivariate regression tree for water scavenger beetles. This is the ‘best’ tree with minimum cross-validated error.

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

Elements of regional beetle faunas 9

same area, which have suggested the influences of climate

on the regionalisation of various animal and plant groups

(Lahti, Kurtto & V€ais€anen 1988; V€ais€anen, Heli€ovaara &

Immonen 1992; Heino 2001). Our findings are also similar

to those at a broader spatial scale across Europe, where

climatic and historical factors were important in determin-

ing species richness, endemism and composition of regio-

nal faunas of longhorn beetles (Baselga 2008) and

darkling beetles (Fattorini & Baselga 2012). In contrast,

the degree to which land cover affected species composi-

tion is more difficult to infer. We could expect that if land

cover was important, we would have seen differences in

the particular land cover variables between aquatic and

terrestrial beetles. This was not the case, however, and

generally the same land cover variables appeared in the

final constrained ordination models.

There was significant positive autocorrelation at the

smallest distance classes and significant negative autocor-

relation at the largest distance classes in all beetle groups.

This finding suggests (i) that contiguous provinces have

similar species composition, as a consequence of the fact

that that neighbouring provinces share similar environ-

mental conditions and (ii) that provinces far from each

other have increasingly dissimilar faunas, suggesting that

climatic differences between provinces increase faunal dif-

ferences. Thus, it can be assumed that due to their cli-

matic similarities or differences, significant spatial

autocorrelation results from climate forcing on species

41·49

38·38

34·12

27·73

21·06

0(1)

21·06%

Min temp(oC) < –8·59

Min temp(oC) >–8·59

(2)

6·39%

Urban(%) < 0·005

Urban(%) >0·005

n = 26

(#6)

(4)

3·11%

Sea (km)

< 334·5

Sea(km)

>334·5

n = 9

(#4)

n = 10

(#5)

(3)

6·66%

Forests(%) < 0·12

Forests(%) >0·12

n = 11

(#8)

(7)

4·25%

Max temp(oC) < 15·64

Max temp(oC) >15·64

n = 5

(#10)

n = 18

(#11)

R2 × 100 = 41·5 % Error : 0·585 CV Error : 0·762 SE : 0·0379

Coe

ffici

ent o

f det

erm

inat

ion

(R2 )

Fig. 4. Multivariate regression tree for longhorn beetles. This is the ‘best’ tree with minimum cross-validated error.

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

10 J. Heino & J. Alahuhta

ranges. However, significant negative autocorrelation may

also suggest that provinces that are geographically far

from each other are differently affected by historical fac-

tors (Brown & Lomolino 1998; Baselga 2008). For exam-

ple, most of our study region was covered by ice sheets

until about 10 000 years ago, and biotas have hence colo-

nized this region relatively recently (V€ais€anen, Heli€ovaara

& Immonen 1992; Heino 2001). Thus, it is possible that

(i) different species have colonized the region using differ-

ent dispersal routes, (ii) this region is still under coloniza-

tion, and (iii) climate change or land use alteration leads

to increasing ranges of some species, reduces the ranges of

some others and perhaps will lead to new faunal associa-

tions. Historical and climatic factors may not only lead to

continuous variation in faunal composition, but also con-

tribute to more abrupt changes in faunal characteristics.

Breakpoints in faunal composition were most strongly

related to climate. This finding was evidenced by the fact

that the first division of the faunal composition of each bee-

tle group was always related to temperature variables even

when latitude and longitude were included in the set of con-

straining variables. The temperature variables accounted

for about half of the total variation explained by the multi-

variate regression trees for each beetle group, whereas the

remaining land cover and climate variables alone accounted

for considerably less variation in faunal composition. These

findings suggest that if faunal breakpoints exist at large spa-

tial scales, they are first and foremost determined by cli-

matic conditions. Whether or not climatic factors vary

continuously or show clear breakpoints themselves across

Table 3. Comparisons between the multivariate regression tree

(MRT) and k-means classifications for each beetle group. Shown

are the results of v2-tests of the match between two MRT groups at

the first node and two k-means groups. Also shown are the results

of v2-tests of the match between MRT final leaves and k-means

partitioning based on the corresponding number of groups. K-

means partitioning was based on data transformed using either

Hellinger transformation or chi-square transformation

No.

groups

K-means with transformed species

data

Hellinger Chi-square

v2 P v2 P

MRT first node

Diving beetles 2 13�70 < 0�001 60�26 < 0�001Ground beetles 2 48�57 < 0�001 48�57 < 0�001Scavenger beetles 2 53�52 < 0�001 53�52 < 0�001Longhorn beetles 2 71�05 < 0�001 71�05 < 0�001

MRT final leaves

Diving beetles 7 246�46 < 0�001 289�57 < 0�001Ground beetles 9 363�04 < 0�001 358�93 < 0�001Scavenger beetles 7 283�37 < 0�001 205�40 < 0�001Longhorn beetles 6 229�09 < 0�001 196�74 < 0�001

Table 2. Results of the ‘elements of metacommunity structure’ analysis of patterns in provincial species composition. All analyses on the

first correspondence analysis axis based on each null model point strongly towards Clementsian gradients

Null model

Coherence Turnover Boundary clumping

Abs Z-value P-value

Sim

mean

Sim

SD Rep Z-value P-value

Sim

mean

Sim

SD Index P-value d.f.

Fixed-Equiprobable

Diving beetles 3234 �26�71 0�001 5475 83 746 882 69�67 0�001 26 020 10 346 2�45 <0�001 152

Ground beetles 8438 �31�21 0�001 16 406 255 4 282 211 65�50 0�001 168 758 62 794 2�85 <0�001 385

Scavenger beetles 2224 �21�85 0�001 4405 99 343 100 24�87 0�001 30 648 12 560 8�28 <0�001 113

Longhorn beetles 2875 �15�55 0�001 4833 125 420 386 19�34 0�001 49 674 19 159 4�43 <0�001 115

Fixed–FixedDiving beetles 3234 �18�43 0�001 5102 101 746 882 55�24 0�001 12 671 13 290 2�45 <0�001 152

Ground beetles 8438 �22�31 0�001 14 113 254 4 282 211 50�38 0�001 92 286 83 159 2�85 <0�001 385

Scavenger beetles 2224 �8�89 0�001 3262 116 343 100 69�20 0�001 1451 4936 8�28 <0�001 113

Longhorn beetles 2875 �8�20 0�001 3989 135 420 386 20�73 0�001 13 346 19 633 4�43 <0�001 115

Z-value, (Observed index value-Simulated mean)/Simulated SD; Abs, number of embedded absences; Rep, number of replacements; Sim

mean, average of simulated index values; Sim SD, standard deviation of simulated index values.

Table 4. Comparisons of the match of classifications between the

beetle groups at the first node of multivariate regression tree (a)

and between the beetle groups for the two k-means clusters based

on Hellinger (b) or chi-square (c) transformation of species data.

The v2 values are shown below the diagonal, and the P-values

are above the diagonal

Diving

beetles

Ground

beetles

Scavenger

beetles

Longhorn

beetles

(a) v2 or PDiving beetles – <0�001 <0�001 <0�001Ground beetles 67�49 – <0�001 <0�001Scavenger beetles 31�69 24�88 – 0�005Longhorn beetles 36�89 39�25 9�33 –

(b) v2 or PDiving beetles – <0�001 <0�001 0�012Ground beetles 13�54 – <0�001 <0�001Scavenger beetles 13�52 56�42 – <0�001Longhorn beetles 7�27 24�86 12�27 –

(c) v2 Or P

Diving beetles – <0�001 <0�001 <0�001Ground beetles 61�06 – <0�001 <0�001Scavenger beetles 60�47 56�42 – <0�001Longhorn beetles 20�42 24�86 12�27 –

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

Elements of regional beetle faunas 11

large spatial scales is open to debate, but maps show clear

temperature variation across the study area (Appendix S3,

Supporting information). It has to be highlighted, however,

that multivariate regression trees indicate the existence of a

breakpoint even in continuous data (De’ath 2002; Borcard,

Gillet & Legendre 2011), emphasizing the need to supple-

ment ecological inferences by means of other data analyses.

The breakpoints in faunal variation are also supported

by the distributions of single species of beetles across the

study area. There are some northern species, occurring

north of approximately 65°N in Finland, Sweden and Nor-

way, and larger numbers of southern species, occurring

south of approximately 58°N in Denmark and southern-

most Sweden (Lindroth 1985; Holmen 1987; Bily & Mehl

1989; Nilsson & Holmen 1995). Thus, despite the fact that

the ranges of most species in the study area are inclined

towards south and that there is a latitudinal gradient in

species richness (Lindroth 1985; V€ais€anen & Heli€ovaara

1994), the compositional gradients of beetles are more com-

plex than a clear latitudinal cline often detected in species

richness (Baselga 2008; Fattorini & Ulrich 2012b). Such

compositional gradients may again be caused by certain

sets of species responding to climatic conditions in a similar

manner or that there are groups of species that have colo-

nized the study area using the same dispersal routes.

The finding that patterns of faunal composition best fit-

ted Clementsian gradients also suggested clear breakpoints

in species composition. This finding is intuitive, given that

there are these southern and northern faunal elements in

the study area. The idea of Clementsian gradients suggests

that there are two or more groups of species showing simi-

lar responses to the environment and that the responses dif-

fer among the groups (Clements 1916; Allen & Hoekstra

1992). Most studies testing this idea have considered local

CA1 (EV = 0·284)

CA

2 (E

V =

0·1

20)

+

++

+

+++

+

++

+

++++

+

+

+++

+

+

+

++

+

+

++ ++++++

+

+

+

+

+++

+

+

+ +++

+

+

+++

+

+

++

+

+

+

+

++ +

+

+

+

+

++

+

+

+

+

+++

+

+

++

++

+

+

+

+

+++

+

+

+

++

++

+

+

+

+

+

+

++

+

+

+

+

+

+

+ +

++

+

++

CA1 (EV = 0·248)

CA

2 (E

V =

0·1

01)

++

+

+

+

+

+

+

+

++

+++

++

++

+

++

+

+

++

+

+

+++ +

+

+++

+

+

+

+

++

+

+

+ +

++

+

++

+

+

+

++ ++++ +++

+++ ++

++ + ++ +++

+

++

+++

+ ++

+

+ +

+

+

++

+

++++

+

+

+

++

++

+

++ +++

++

++++

+

+

CA

2 (E

V =

0·1

24) + +

++

+ ++

+

+

+

++

+

+++

+

+

+++

+

+

++

+

+ ++

+

++ ++

+

+

++

++

++

+

+ ++

+

+

+++++

+

+

+

+

++

+

+

+

+

++

+

+

+

+

++

+

++

++

+

+++

++ +

+

+++

+

+

+

+

+

+

+ ++

+ +

+

+

+

+

++

+

+

+ +

++

+

+

+

+

+

+

+

+

+

+

+

+

+

+++

+

+

++

+

++

+

+

++

+

+

+

+

+

+

+++

+

+

+

+

+

+

+

+

+

+

+

+

+

+

+ +

+

+

+

+

+

+

+

++

+

+

+

+

+

+++

+

+++

++

+++

+

+

++

++

+

+

+

++

+

++ ++++

+

+

+

+

++

+

+

++++

+

+

++ ++++

+

+

+

++

+ ++

+

+

+ +

+

+

++ +

+

+

++

+

+

+

+

+

+

+

++

+

+ +++

+

++ +

++

+

+

+

+

+

+

++

+

+

+

+

++

+

+

+

+

++

+

+

++

++

+

+

++ +

+++

+

+

++++

+

++

++

+

+

+

++

+

++ +

+

++

+

+

+

+

+

+

+

+

+

+

+

+

+

+

+

+

++

+

+++

+

+

+

+

+

+

+

+

+

+

+

+++

+

+

+

+

+

++

+

+

++

++

+

+

+ ++

+

+

+

++

+

++ +

–3 –2 –1 0 1 2 3

–2–1

01

23

–2 0 2 4 6

–5–4

–3–2

–10

1

–3 –2 –1 0 1 2

–3–2

–10

1

–2 –1 0 1 2

–2–1

01

CA1 (EV = 0·263) CA1 (EV = 0·302)

CA

2 (E

V =

0·0

79)

+

+

+

+

+

+

+

+ + ++

+

+

+

++

++

+

+

+++ +

+++

+

+

+

+

+

++

++

++

+

++

+++

+

+++

+

+

++

+

+

++

+

+

+++

+

+

+

+

+

++

+

+

+

+

+

+

+

+

+

+++ +

+

+

+

+

+

+

+

+

+

++

+

+

+

+

+

+

++

+

+

+

+

+

++

+

+

++

+

++++

+ +

+++

+

+

++

+++

+

+

+

+

+++

+

+++

+

+

+

+

++ ++

++

++

+

+

++

(a) (b)

(c) (d)

Mean oC < 3·03

Mean oC > 3·03

Mean oC < 3·39

Mean oC > 3·39

Max oC < 18·80

Max oC > 18·80

Min oC < –8·59

Min oC > –8·59

Fig. 5. Ordination plots from correspondence analysis (CA) of each beetle group, showing the scores of the provinces (open or filled cir-

cles) and species (crosses) along the first two CA axes. Particular attention should be given to CA axis 1. The values of environmental

variables associated with the breakpoints at the first node of the multivariate regression trees are shown in inserts. Shown are CA plots

for (a) diving beetles, (b) ground beetles, (c) water scavenger beetles and (d) longhorn beetles.

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

12 J. Heino & J. Alahuhta

communities across landscapes or relatively small regions,

where environmental control typically accounts for such

species compositional gradients (Presley et al. 2009; Presley

& Willig 2010). However, at large grain sizes and spatial

extents, local environmental gradients are unlikely to

account for Clementsian gradients, but rather climate or

historical factors are more likely candidates for explaining

abrupt changes in faunal composition (L�opez-Gonz�alez

et al. 2012). Such abrupt changes may result from the fact

that large-scale studies cross multiple species pools that are

formed by either historical forces or climatic constraints or,

more likely, their joint effects on species ranges. Such

abrupt changes in faunal composition are even more likely

at broader spatial scales than that of our study, and the

very foundations of biogeographical regionalisation indeed

rest on the assumption of clearly definable geographical

areas harbouring species typical to each region in various

taxonomic groups (Brown & Lomolino 1998; Heikinheimo

et al. 2007; Rueda, Rodriguez & Hawkins 2010).

Geographical patterns in faunal composition were very

similar among ecologically disparate beetle taxa, as evi-

denced by RDA, spatial correlograms and multivariate

regression trees. Climate gradients, particularly those

related to temperature, also accounted for major faunal

breakpoints of beetles in the study area, suggesting that cli-

mate determines species broadscale distributions and leads

to clear groups of species responding similarly to environ-

mental gradients. The finding that the best idealized model

describing faunal variation across the provinces was ‘Clem-

entsian gradients’ also lent support for the existence of

groups of species having similar environmental relation-

ships. Given such close associations between faunal varia-

tion, faunal breakpoints and environmental conditions,

broadscale environmental factors and environmental diver-

sity (Faith 2003) may be used for preliminary regionalisa-

tion in the context of environmental assessment and

biodiversity studies in the present study area (Lahti, Kurtto

& V€ais€anen 1988; V€ais€anen, Heli€ovaara & Immonen 1992)

and possibly in other regions (Heikinheimo et al. 2007;

Rueda, Rodriguez & Hawkins 2010). The methodological

approaches we used in this study could hence be widely

applied in attempts to find abiotic proxy variables for

faunal and floral patterns. Such proxies for biogeographic

regionalisation may also be highly promising and cost-

effective means in regions where no appropriate data on

species distributions exist.

Acknowledgements

We thank the associate editor, two anonymous reviewers and Simone

Fattorini for excellent comments that greatly improved this paper. We also

thank Maija Lantto for keyboard work when compiling the biological

data. This study was supported by grants from the Academy of Finland.

Data accessibility

The biological data we used can be found entirely in the literature (Lind-

roth 1985, 1986; Holmen 1987; Bily & Mehl 1989; Nilsson & Holmen

1995), and the climate and land cover data are downloadable in the Inter-

net (Climate data: http://www.worldclim.org/; Land cover data: http://

www.eea.europa.eu/publications/COR0-landcover).

References

Allen, T.F.H. & Hoekstra, T.W. (1992) Toward a Unified Ecology. Colum-

bia University Press, New York.

Anderson, M.J. & Gribble, N.A. (1998) Partitioning the variation among

spatial, temporal and environmental components in a multivariate data

set. Australian Journal of Ecology, 23, 158–167.Anderson, M.J., Crist, T.O., Chase, J.M., Vellend, M., Inouye, B.D., Free-

stone, A.L. et al. (2011) Navigating the multiple meanings of b diver-

sity: a roadmap for the practicing ecologist. Ecology Letters, 14, 19–28.Bailey, R.G. (2010) Ecosystem Geography. From Ecoregions to Sites.

Springer, New York.

Baselga, A. (2008) Determinants of species richness, endemism and turn-

over in European longhorn beetles. Ecography, 31, 263–271.Bily, S. & Mehl, O. (1989) Longhorn beetles (Coleoptera, Cerambycidae) of

Fennoscandia and Denmark. Fauna Entomologica Scandinavica, 22, 1–203.Blanchet, F.G., Legendre, P. & Borcard, D. (2008) Forward selection of

explanatory variables. Ecology, 89, 2623–2632.Borcard, D., Gillet, F. & Legendre, P. (2011) Numerical Ecology with R.

Springer, New York.

Borcard, D. & Legendre, P. (2002) All-scale spatial analysis of ecological

data by means of principal coordinates of neighbour matrices. Ecologi-

cal Modelling, 153, 51–68.Borcard, D. & Legendre, P. (2012) Is the Mantel correlogram powerful

enough to be useful in ecological analysis? A simulation study. Ecology,

93, 1473–1481.Borcard, D., Legendre, P. & Drapeau, P. (1992) Partialling out the spatial

component of ecological variation. Ecology, 73, 1045–1055.Bradshaw, E.M. (2005) Mid-to late-Holocene land-use change and lake

development at Dallund S0, Denmark: synthesis of multiproxy data,

linking land and lake. The Holocene, 15, 1152–1162.Brown, J.H. & Lomolino, M.V. (1998) Biogeography, 2nd edn. Sinauer,

Sunderland, Massachusetts, USA.

Clements, F.E. (1916) Plant Succession. An Analysis of the Development of

Vegetation. Carnegie Institution, Washington, DC.

Cornell, H.V. (2013) Is regional species diversity bounded or unbounded?

Biological Reviews, 88, 140–165.Cottenie, K. (2005) Integrating environmental and spatial processes in

ecological community dynamics. Ecology Letters, 8, 1175–1182.Dallas, T. (2013) metacom: Analysis of the “elements of metacommunity

structure”. R package version 1.2. http://CRAN.R-project.org/pack-

age=metacom

Dallas, T. & Presley, S.J. (2014) Relative importance of host environment,

transmission potential and host phylogeny to the structure of parasite

metacommunities. Oikos, 123, 866–874.Davidson, T.A., Sayer, C.D., Perrow, M., Bramm, M. & Jeppesen, E.

(2010) The simultaneous inference of zooplanktivorous fish and macro-

phyte density from sub-fossil cladoceran assemblages: a multivariate

regression tree approach. Freshwater Biology, 55, 546–564.De’ath, G. (2002) Multivariate regression trees: a new technique for

modelling species-environment relationships. Ecology, 83, 1105–1111.Diamond, J.M. (1975) Assembly of species communities. Ecology and Evo-

lution of Communities (eds M.L. Cody & J.M. Diamond), pp. 342–444.Harvard University Press, Cambridge.

Dray, S., Legendre, P. & Peres-Neto, P.R. (2006) Spatial modelling: a

comprehensive framework for principal coordinate analysis of neigh-

bour matrices (PCNM). Ecological Modelling, 196, 483–493.Dray, S., P�elissier, R., Couteron, P., Fortin, M.J., Legendre, P., Peres--

Neto, P.R. et al. (2012) Community ecology in the age of multivariate

multiscale spatial analysis. Ecological Monographs, 82, 257–275.Elith, J. & Leathwick, J.R. (2009) Species Distribution Models: ecological

explanation and prediction across space and time. Annual Review of

Ecology, Evolution, and Systematics, 40, 677–697.Faith, D.P. (2003) Environmental diversity (ED) as surrogate information

for species-level biodiversity. Ecography, 26, 374–379.Fattorini, S. & Baselga, A. (2012) Species richness and turnover patterns

in European tenebrionid beetles. Insect Conservation and Diversity, 5,

331–345.Fattorini, S. & Ulrich, W. (2012a) Drivers of species richness in European

Tenebrionidae (Coleoptera). Acta Oecologica, 43, 22–28.

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

Elements of regional beetle faunas 13

Fattorini, S. & Ulrich, W. (2012b) Spatial distributions of European Tene-

brionidae point to multiple postglacial colonization trajectories. Biologi-

cal Journal of the Linnean Society, 105, 318–329.Field, R., Hawkins, B.A., Cornell, H.V., Currie, D.J., Diniz-Filho, J.A.F.,

Guegan, J.-F. et al. (2009) Spatial species-richness gradients across

scales: a meta-analysis. Journal of Biogeography, 36, 132–147.Fox, J. (2005) The r commander: a basic statistics graphical user interface

to R. Journal of Statistical Software, 14, 1–42.Gaston, K.J. (2000) Global patterns in biodiversity. Nature, 405, 220–227.Gauch, H.G. (1982) Multivariate Analysis in Community Ecology. Cam-

bridge University Press, Cambridge.

Gleason, H.A. (1926) The individualistic concept of the plant association.

Bulletin of the Torrey Botanical Club, 53, 7–26.Gotelli, N.J. (2000) Null model analysis of species co-occurrence patterns.

Ecology, 81, 2606–2621.Gotelli, N.J. & Graves, G.R. (1996) Null Models in Ecology. Smithsonian

Institute Press, Washington.

Gotelli, N.J., Graves, G.R. & Rahbek, C. (2010) Macroecological signals

of species interactions in the Danish avifauna. Proceedings of the

National Academy of Sciences, 107, 530–535.Gotelli, N.J. & McCabe, D.J. (2002) Species coexistence: a meta-analysis

of J. M. Diamond’s assembly rules model. Ecology, 83, 2091–2096.G€otzenberger, L., de Bello, F., Br�athen, K.A., Davison, J., Dubuis, A.,

Guisan, A. et al. (2012) Ecological assembly rules in plant communities:

approaches, patterns and prospects. Biological Reviews, 87, 111–127.Griffith, D.A. & Peres-Neto, P.R. (2006) Spatial modeling in ecology: the

flexibility of eigenfunction spatial analyses. Ecology, 87, 2603–2613.Heikinheimo, H., Fortelius, M., Eronen, J. & Mannila, H. (2007) Biogeog-

raphy of European land mammals shows environmentally distinct and

spatially coherent clusters. Journal of Biogeography, 34, 1053–1064.Heino, J. (2001) Regional gradient analysis of freshwater biota: do similar

biogeographic patterns exist among multiple taxonomic groups? Journal

of Biogeography, 28, 69–77.Heino, J. (2011) A macroecological perspective of diversity patterns in the

freshwater realm. Freshwater Biology, 56, 1703–1722.Heino, J., Ilmonen, J. & Paasivirta, L. (2014) Continuous variation of

macroinvertebrate communities along environmental gradients in north-

ern streams. Boreal Environment Research, 19, 21–38.Henriques-Silva, R., Lindo, Z. & Peres-Neto, P.R. (2013) A community of

metacommunities: exploring pattern in species distributions across large

geographical scales. Ecology, 94, 627–639.Hijmans, R.J., Cameron, S.E., Parra, J.L., Jones, P.G. & Jarvis, A. (2005)

Very high resolution interpolated climate surfaces for global land areas.

International Journal of Climatology, 25, 1965–1978.Hoagland, B.W. & Collins, S.L. (1997) Gradient models, gradient analysis,

and hierarchical structure in plant communities. Oikos, 78, 23–30.Holm, S. (1979) A simple sequentially rejective multiple test procedure.

Scandinavian Journal of Statistics, 6, 65–70.Holmen, M. (1987) The Hydrophiloidea (Coleoptera) of Fennoscandia

and Denmark. Fauna Entomologica Scandinavica, 18, 1–254.Hothorn, T., Hornik, K., van de Wiel, M.A. & Zeileis, A. (2008) Imple-

menting a class of permutation tests: the coin package. Journal of

Statistical Software, 28, 1–23.Lahti, T., Kurtto, A. & V€ais€anen, R.A. (1988) Floristic composition and

regional species richness of vascular plants in Finland. Annales Botanici

Fennici, 25, 281–291.Legendre, P. & Gallagher, E.D. (2001) Ecologically meaningful transfor-

mations for ordination of species data. Oecologia, 129, 271–280.Legendre, P. & Legendre, L. (2012) Numerical Ecology, 3rd edn. Elsevier,

Amsterdam.

Legendre, P., Borcard, D., Blanchet, F.G. & Dray, S. (2013) PCNM:

MEM spatial eigenfunction and principal coordinate analyses. R pack-

age version 2.1-2/r109. http://R-Forge.R-project.org/projects/sedar/

Leibold, M.A. & Mikkelson, G.M. (2002) Coherence, species turnover,

and boundary clumping: elements of metacommunity structure. Oikos,

97, 237–250.van der Linden, M., Barke, J., Vickery, E., Charman, D.J. & van Geel, B.

(2008) Late Holocene human impact and climate change recorded in a

North Swedish peat deposit. Palaeogeography, Palaeoclimatology, Palae-

oecology, 258, 1–27.Lindroth, C.H. (1985) The Carabidae (Coleoptera) of Fennoscandia and

Denmark. Fauna Entomologica Scandinavica, 15, 1–225.Lindroth, C.H. (1986) The Carabidae (Coleoptera) of Fennoscandia and

Denmark. Fauna Entomologica Scandinavica, 15, 226–497.

McIntosh, R.P. (1995) H. A. Gleason’s “individualistic community

concept” and theory of animal communities: a continuing controversy.

Biological Reviews, 70, 317–357.Meynard, C.N., Lavergen, S., Boulangeat, I., Garraud, L., Van Es, J.,

Mouquet, N. et al. (2013) Disentangling the drivers of metacommuni-

ty structure across spatial scales. Journal of Biogeography, 40, 1560–1571.

Mittelbach, G.G. (2012) Community Ecology. Sinauer, Sunderland, Massa-

chusetts, USA.

Nilsson, A.N. & Holmen, M. (1995) The aquatic adephaga of Fennoscan-

dia and Denmark 2, Dytiscidae. Fauna Entomologica Scandinavica, 32,

1–188.Oksanen, J., Blanchet, F.G., Kindt, R., Legendre, P., Minchin, P.R.,

O’Hara, R.B. et al. (2013) vegan: Community Ecology Package. R

package version 2.0-9. http://CRAN.R-project.org/package=vegan.

Ouellette, M.-H. & Legendre, P. (2013) MVPARTwrap: Additional func-

tionalities for package mvpart. R package version 0.1-9.2. http://

CRAN.Rproject.org/package=MVPARTwrap.

Patterson, B.D. & Atmar, W. (1986) Nested subsets and the structure of

insular mammal faunas and archipelagos. Biological Journal of the Lin-

nean Society, 28, 65–82.Peres-Neto, P.R. & Legendre, P. (2010) Estimating and controlling for

spatial structure in the study of ecological communities. Global Ecology

and Biogeography, 19, 174–184.Peres-Neto, P.R., Leibold, M.A. & Dray, S. (2012) Assessing the effects of

spatial contingency and environmental filtering on metacommunity phy-

logenetics. Ecology, 93, 14–30.Peres-Neto, P.R., Legendre, P., Dray, S. & Borcard, D. (2006) Variation

partitioning of species data matrices: estimation and comparison of frac-

tions. Ecology, 87, 2614–2625.Presley, S.J., Higgins, C.L. & Willig, M.R. (2010) A comprehensive frame-

work for the evaluation for metacommunity structure. Oikos, 119,

908–917.Presley, S.J. & Willig, M.R. (2010) Bat metacommunity structure on

Caribbean islands and the role of endemics. Global Ecology and Bioge-

ography, 19, 185–199.Presley, S.J., Higgins, C.L., L�opez-Gonz�alex, C. & Stevens, R.D. (2009)

Elements of metacommunity structure of Paraguayan bats: multiple

gradients require analysis of multiple ordination axes. Oecologia, 160,

781–793.R Core Team (2013) R: A Language and Environment for Statistical Com-

puting. R Foundation for Statistical Computing, Vienna. http://www.

R-project.org/.

Rao, C.R. (1964) The use and interpretation of principal component

analysis in applied research. Sankhy�a: The Indian Journal of Statistics,

Series A, 26, 329–358.Rueda, M., Rodriguez, M.A. & Hawkins, B.A. (2010) Towards a biogeo-

graphic regionalization of the European biota. Journal of Biogeography,

37, 2067–2076.Sahuquillo, M. & Miracle, M.R. (2013) The role of historic and climatic

factors in the distribution of crustacean communities in Iberian Medi-

terranean ponds. Freshwater Biology, 58, 1251–1266.de la Sancha, N., Higgins, C.L., Presley, S.J. & Strauss, R.E. (2014) Meta-

community structure in a highly fragmented forest: has deforestation in

the Atlantic Forest altered historic biogeographic patterns? Diversity

and Distributions, 20, 1058–1070.Sokal, R.R. & Oden, N.L. (1978) Spatial autocorrelation in biology. 1.

Methodology. Biological Journal of the Linnean Society, 10, 199–228.Tilman, D. (1982) Resource Competition and Community Structure. Prince-

ton University Press, Princeton, New Jersey, USA.

Ulrich, W. & Gotelli, N.J. (2013) Pattern detection in null model analysis.

Oikos, 122, 2–18.V€ais€anen, R. & Heli€ovaara, K. (1994) Hot-spots of insect diversity in

northern Europe. Annales Zoologici Fennici, 31, 71–81.V€ais€anen, R., Heli€ovaara, K. & Immonen, A. (1992) Biogeography of

northern European insects: province records in multivariate analysis

(Saltatoria, Lepidoptera: Sesiidae; Coleoptera: Bubrestidae, Cerambyci-

dae). Annales Zoologici Fennici, 28, 57–81.Wright, D.H., Patterson, B.D., Mikkelson, G.M., Cutler, A. & Atmar, W.

(1998) A comparative analysis of nested subset patterns of species com-

position. Oecologia, 113, 1–20.

Received 2 May 2014; accepted 30 August 2014

Handling Editor: Frank Johansson

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

14 J. Heino & J. Alahuhta

Supporting Information

Additional Supporting Information may be found in the online version

of this article.

Appendix S1. The predictor variables selected in the forward

selection of each variable group for each beetle group.

Appendix S2. A Mantel correlogram for each beetle group.

Appendix S3. Temperature maps of northern Europe.

Appendix S4. Province groupings for each beetle group based on

the multivariate regression trees (MRT) analysis.

© 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society, Journal of Animal Ecology

Elements of regional beetle faunas 15