-
Notifications
You must be signed in to change notification settings - Fork 10
/
Copy pathabellan_11_geographic_799788.pdf.txt
2666 lines (1916 loc) · 73.8 KB
/
abellan_11_geographic_799788.pdf.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"><html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title>Geographic location and phylogeny are the main determinants of the size of the geographical range in aquatic beetles</title>
<meta name="Subject" content="BMC Evolutionary Biology 2011, 11:344. doi:10.1186/1471-2148-11-344"/>
<meta name="Keywords" content=" "/>
<meta name="Author" content="Pedro Abellán"/>
<meta name="Creator" content="Arbortext Advanced Print Publisher 10.0.1082/W Unicode"/>
<meta name="Producer" content="Acrobat Distiller 9.4.2 (Windows)"/>
<meta name="CreationDate" content=""/>
</head>
<body>
<pre>
Abellán and Ribera BMC Evolutionary Biology 2011, 11:344
http://www.biomedcentral.com/1471-2148/11/344
RESEARCH ARTICLE
Open Access
Geographic location and phylogeny are the main
determinants of the size of the geographical
range in aquatic beetles
Pedro Abellán1,2* and Ignacio Ribera1
Abstract
Background: Why some species are widespread while others are very restricted geographically is one of the most
basic questions in biology, although it remains largely unanswered. This is particularly the case for groups of
closely related species, which often display large differences in the size of the geographical range despite sharing
many other factors due to their common phylogenetic inheritance. We used ten lineages of aquatic Coleoptera
from the western Palearctic to test in a comparative framework a broad set of possible determinants of range size:
species’ age, differences in ecological tolerance, dispersal ability and geographic location.
Results: When all factors were combined in multiple regression models between 60-98% of the variance was
explained by geographic location and phylogenetic signal. Maximum latitudinal and longitudinal limits were
positively correlated with range size, with species at the most northern latitudes and eastern longitudes displaying
the largest ranges. In lineages with lotic and lentic species, the lentic (better dispersers) display larger distributional
ranges than the lotic species (worse dispersers). The size of the geographical range was also positively correlated
with the extent of the biomes in which the species is found, but we did not find evidence of a clear relationship
between range size and age of the species.
Conclusions: Our findings show that range size of a species is shaped by an interplay of geographic and
ecological factors, with a phylogenetic component affecting both of them. The understanding of the factors that
determine the size and geographical location of the distributional range of species is fundamental to the study of
the origin and assemblage of the current biota. Our results show that for this purpose the most relevant data may
be the phylogenetic history of the species and its geographical location.
Background
Why some species are widespread while others are very
restricted geographically is one of the most basic questions in biology, although it remains basically unanswered, despite the sustained interest from ecologists,
biogeographers and evolutionary biologists (e.g., [1-5]).
A range of ecological and evolutionary explanations
have been suggested for the observed range size variation, based on differences in niche breadth or environmental tolerance, body size, population abundance,
latitude, environmental variability, colonization and
extinction dynamics, and dispersal ability [3,6-8].
* Correspondence: [email protected]
1
Institute of Evolutionary Biology (CSIC-UPF), Passeig Maritim de la
Barceloneta 37, 08003 Barcelona, Spain
Full list of author information is available at the end of the article
However, there are still fundamental questions unresolved, best exemplified by the fact that closely related
species often display dramatic differences in range size
for largely unknown reasons. Tests of these differences
remain relatively scarce, have been performed for examples of very few taxa (usually vertebrates), and generally
fail to address the potentially confounding effects of the
phylogenetic relatedness of species.
In this work we aim to test some likely determinants
of the size of the geographical range in a phylogenetic
comparative framework. Closely related species are
expected to show more similarity than those that are
distantly related because they share more common evolutionary history [9,10]. How range size evolves and the
extent of heritability of the geographical range sizes of
species has received much attention in the last years
© 2011 Abellán and Ribera; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative
Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and
reproduction in any medium, provided the original work is properly cited.
Abellán and Ribera BMC Evolutionary Biology 2011, 11:344
http://www.biomedcentral.com/1471-2148/11/344
[11-14], as evidence for range size heritability would
have important implications for ecology, evolution, and
biogeography [15]. Although a number of studies have
investigated the existence of phylogenetic signal in range
size in a variety of clades and from a wide range of analytical approaches, the patterns found have been mixed
and the “heritability” of range size remains a contentious
issue [13,14]. Phylogenetic comparative methods applied
to whole lineages and not only species-pairs (e.g.,
[16-18]) may provide a more robust and powerful
approach to estimate the phylogenetic signal in range
size.
Among the potential determinants of range size we
include ecological tolerance, dispersal ability, geographic
location, and age of the species. For all of them we test
their phylogenetic signal, as well as the phylogenetic signal of the size and location of the range itself.
1) Ecological tolerance. A broad ecological niche
allows a species to persist in a wide range of different
environments, while a narrow niche restricts a species
to the few places where its niche requirements are met
[19,20]. Hence, species with broad niches should be distributed over a wider range of different biomes than
species with narrower ecological requirements, leading
to larger geographical ranges [21]. When these biomes
are distributed equally along environmental gradients
this poses the problem that species with lager ranges
will inevitably overlap more different biomes. However,
in the western Palaearctic (the centre of distribution of
most of our studied lineages, see below) this problem is
partly alleviated by the very heterogeneous distribution
of environmental gradients. In this case, those species
occurring in large biomes (e.g., [20,21]) would have
large range sizes (as found e.g. by [14]), but they should
not necessarily occupy more biomes than species with
smaller ranges.
2) Dispersal ability. As a surrogate measure of dispersal ability we use water flow, as previous studies in
freshwater invertebrates have established the relationship between main habitat type (lotic or lentic) and the
size of the geographical range [22,23]. Lentic species
should have better dispersal abilities due to the shorter
geological duration of their habitats, and display on
average larger geographical ranges than the species inhabiting the more persistent lotic habitats [24]. However,
the role of habitat constraints in aquatic organisms has
yet not been assessed from a phylogenetic comparative
framework in lineages in which there are species inhabiting both habitat types.
3) Geographic location. There are multiple cases of
closely related species with a similar biology and ecology
with extreme differences in the size of the geographical
range. In these cases, the biogeographic settings in
which species arise and evolve could determine their
Page 2 of 15
range sizes [14,25], with species with a “privileged” geographical position displaying higher range-sizes. For
example, latitudinal gradients in geographic range size
(Rapoport’s rule) have been extensively studied and
documented [6,26,27]. Evidence supporting that range
sizes increase with latitude in the Palearctic and Nearctic above 40°-50°N has been found in a number of terrestrial groups [26], but the extent to which this is a
general pattern remains contentious and has rarely been
tested in a phylogenetic framework.
4) Age and area. We also consider the possible relationship between age and area, which requires to be
tested in the context of a phylogeny even if not in the
same comparative framework as the previous factors.
Originally proposed to explain the distribution of the
endemic flora of some islands [28], in its basic form the
“age and area” hypothesis states that the older a species
is the more likely it is to have occupied a wider geographical area. More precisely, it could be expected that
species have a “life cycle” from origin to extinction that
could be described through a variety of simple models
(see [26] for a review). Although a number of studies
have examined the evidence for geographic range size
changes over evolutionary time across a wide range of
clades (but never in insects), no consistent evidence has
emerged to support any particular model [29]. An inherent limitation of all these studies is that a direct test of
the age and area model requires the geographic range
size of a species or a clade to be known throughout its
evolutionary history [3]. This is usually not possible
without extensive palaeontological data. Hence, a different approach has to be taken in neontological studies,
considering interspecific variation in range sizes of contemporary species as a reflection of the intraspecific
relationship [30]. The examination of the interspecies
relationship between geographic range size and age
could thus be used as a surrogate of the transformation
of range size with age in individual species [29,31].
We use a set of lineages of closely related species of
different families of aquatic Coleoptera to investigate the
relative role of different factors on determining the size
of the geographical range of species. The Western
Palearctic water beetle fauna is a suitable model to
study range size issues, as water beetles are a rich and
well-known insect group in both Europe and the Mediterranean Basin, exhibiting a high level of endemism but
also with species widely distributed across the Palearctic
and Holartic regions [32-34]. Spatial determinants of
range size and temporal patterns of range evolution in
invertebrates may differ substantially from that found in
previous studies using vertebrate clades. Hence, the use
of phylogenies at the species level for different groups of
beetles, one of the most diverse and understudied
lineages of animals, in what is in fact a set of
Abellán and Ribera BMC Evolutionary Biology 2011, 11:344
http://www.biomedcentral.com/1471-2148/11/344
Page 3 of 15
independent evolutionary replicates, provides a unique
opportunity to assess these issues from a phylogenetic
comparative framework.
4) The Hydroporus planus group (genus Hydroporus)
includes 51 species with a Palearctic distribution [36,40],
also with both lotic and lentic species. We sampled 30
species, including most of the western Palearctic fauna.
Methods
Family Hydraenidae
Background on the studied groups
5) The subgenus Enicocerus (genus Ochthebius) includes
15 recognized species exclusive of running waters [41],
distributed in Europe and the middle East. We studied 9
species, including all member of the O. exculptus group
[41].
6) The genus Limnebius Leach, with an almost worldwide distribution [42], is one of the most diverse genera
of the family Hydraenidae. In his revision of the Palearctic species, Jäch [42] recognized several species groups,
based on both external morphology and the structure of
the male genitalia. Among them, the Limnebius nitidus
subgroup includes 11 western Palearctic species with a
rather uniform external morphology [43]. Several species
of this lineage have very restricted allopatric distributions, often limited to a single valley or mountain system, but there are also some species with wider
geographical ranges. All inhabit running waters. We
included all the species within this subgroup with a sole
exception (L. nitifarus).
7-8) The “Haenydra“ lineage (genus Hydraena) currently includes 86 recognized species [44,45] usually
found in clean, fast flowing waters, often in mountain
streams. They are distributed in the north Mediterranean region from Iberia to Iran. Many species of this
lineage have very restricted distributions, often limited
to a single valley or mountain system, but there are also
some species with very wide geographical ranges, such
as e.g., H. gracilis, present in the whole Europe from
north Iberia to the Urals [45]. Here, we included two
different monophyletic lineages within “Haenydra“: the
H. gracilis and the H. dentipes clades [46], with 27 and
28 species respectively, of which we include 14 and 20.
9) The “Phothydraena” lineage (genus Hydraena [47])
currently include 9 recognized species, usually found in
clean, fast flowing waters, often in mountain streams.
Molecular data were available for seven species.
We have used a phylogenetically heterogeneous set of
ten monophyletic lineages of water beetles (Table 1)
occurring in the western Palearctic, some with both
lotic and lentic species, and others encompassing exclusively either lotic or lentic species. The lineages used
here belong to three different families of two suborders
of Coleoptera (Adephaga and Polyphaga), representing
several independent invasions of the aquatic medium
[35]. The full list of species and data used in this study
are provided in Additional file 1.
Family Dytiscidae
1) The Ilybius subaeneus group (genus Ilybius [36])
includes 33 recognized species, occurring almost exclusively in stagnant water and with generally wide geographical ranges throughout large parts of the Palearctic or
Nearctic, with some Holarctic species [36,37]. Together
with the genus Rhantus, they are the most species-rich
clade of the Palearctic fauna confined to stagnant water.
Our dataset included 27 species.
2) The genus Deronectes is the largest clade of
Palearctic Dytiscidae entirely confined to running
waters, with a predominantly Mediterranean distribution
reaching central Asia in the east [36]. Species are usually
restricted to relatively small geographical ranges, frequently in mountain regions. Here, we focused on the
western Mediterranean clade, encompassing 26 recognized species or subspecies [38] of which our final dataset included 24.
3) The genus Graptodytes includes 21 recognized species distributed in the western Palearctic region [39],
with both lotic and lentic species. Our final dataset
included 18 taxa.
Table 1 Lineages of water beetles studied
Lineage
Taxa
Habitat
Ilybius subaeneus group (Dytiscidae)
27 (33)
Lentic
Western Mediterranean Deronectes (Dytiscidae)
24 (29)
Lotic
Subgenus Enicocerus (Hydraenidae)
Limnebius nitidus subgroup (Hydraenidae)
9 (14)
10 (10)
Lotic
Lotic
Hydraena gracilis lineage (Hydraenidae)
14 (27)
Lotic
Hydraena dentipes lineage (Hydraenidae)
20 (28)
Lotic
9 (9)
Lotic
Mixed
“Phothydraena” lineage (Hydraenidae)
Palaearctic Graptodytes (Dytiscidae)
18 (23)
Hydroporus planus group s.l. (Dytiscidae)
30 (52)
Mixed
West Palaearctic Hydrochus (Hydrochidae)
13 (14)
Mixed
Lineages of water beetles included in this study. The number of species
included (in parenthesis the total number of species in the lineage) and the
habitat preference are indicated.
Family Hydrochidae
10) The genus Hydrochus includes about 180 described
species [48]. In the west Mediterranean (Iberian Peninsula, Morocco and south France) the genus is represented by 12 recognized species, 7 of them endemic to
the area, which form a monophyletic group that also
includes H. roberti, so far recorded from the Caucasus
and Turkey [49]. We include 12 of the 13 species of this
clade.
Phylogenetic data
We reconstructed the phylogenetic relationships within
each lineage of water beetles from different
Abellán and Ribera BMC Evolutionary Biology 2011, 11:344
http://www.biomedcentral.com/1471-2148/11/344
combinations of mitochondrial and nuclear genes,
depending on data availability. Phylogenies of Graptodytes, Enicocerus, “Haenydra“ and Hydrochus were taken
from recent works ([39,41,46,49]) respectively), pruning
the trees to keep one specimen per species. Phylogenies
of Deronectes, Ilybius and Hydroporus were updated
with additional species and new analyses from [38] and
[40] respectively; finally, phylogenies of Limnebius and
Phothydraena were newly built for this work, with the
same genes and methodology used in [46] for the two
lineages of “Haenydra“ (see Additional file 1 for details
of the sequence data used for each lineage, Additional
file 2, Table S1 for the primers used for amplification
and sequencing, and Additional file 3 for the final trees
used).
New phylogenies were built using a fast maximum
likelihood algorithm as implemented in RAxML v7.0
[50], after aligning length-variable regions with MAFFT
v5.8 [51]. For the RAxML searches we used a partition
by gene fragment, with a GTR+G evolutionary model
independently estimated for each partition, following the
methodology used in [46]. To estimate the relative age
of divergence of the lineages we used the Bayesian
relaxed phylogenetic approach implemented in BEAST
v1.4.7 [52], which allows variation in substitution rates
among branches. We implemented a GTR+I+G model
of DNA substitution with four rate categories using the
mitochondrial data set, as the a priori rate used was
estimated for mitochondrial genes only (see below). We
used an uncorrelated lognormal relaxed molecular clock
model to estimate substitution rates and the Yule process of speciation as the tree prior. Well supported
nodes in the analyses of the combined sequence (when
nuclear genes were used) were constrained to ensure
that the Beast analyses obtained the same topology. We
ran two independent analyses for each group sampling
each 1000 generations, and used TRACER version 1.4 to
determine convergence, measure the effective sample
size of each parameter and calculate the mean and 95%
highest posterior density interval for divergence times.
Results of the two runs were combined with LogCombiner v1.4.7 and the consensus tree compiled with
TreeAnnotator v1.4.7 [52]. As each lineage was analysed
separately, to establish the relationship between age and
size of the geographical range we only require a relative
dating of species within the lineage, not an absolute dating. Notwithstanding this, we used an approximate dating using as prior evolutionary rate for the combined
mitochondrial sequence (including protein coding and
ribosomal genes) a normal distribution with average rate
of 0.01 substitutions/site/MY, with a standard deviation
of 0.001. This rate is close to recent estimations of different groups of Coleoptera [46,53] and to the standard
arthropod mitochondrial clock of 2.3% [54,55].
Page 4 of 15
The evolutionary age of each species was calculated as
the estimated age (in millions of years) of the most
recent node that connects it to any other taxon or
clade. The age estimates of Beast have usually large 95%
confidence intervals, which has to be considered in the
interpretation of the Results.
Geographical data and biogeographic factors
We created shaded maps of the distribution of the different species in a Geographic Information System
based on the information compiled from published and
unpublished sources [36,37,44,48,56,57]; checklist of the
species of the Italian fauna, v. 2.0, http://www.faunaitalia.it). This resulted in individual species maps containing one or more polygons of distribution (species maps
are available from the authors upon request). We then
calculated different descriptors of the species’ ranges:
total range-size, maximum latitudinal and longitudinal
limits, and latitudinal and longitudinal centroids.
Total range-size was calculated as the total area of the
polygon or polygons, after reprojecting species maps to
equal-area projections. For those species only known
from their locality type (four species), range size was
arbitrarily set to 100 km2. Latitudinal and longitudinal
centroid positions (centre of mass of the polygon or
polygons) and maximum latitudinal and longitudinal
limits were computed as geographical coordinates.
All spatial data were processed using ArcGIS 9.2 software (Environmental Systems Research Institute Inc.,
Redlands, CA). Area variables (total range-size and average size of biomes) were log10 transformed for the
analyses.
Ecological factors
The main habitat type of the studied species was defined
according to the general water flow regime, and three
categories were distinguished: (1) lotic (strictly running
water); (2) both running and standing water; and (3)
lentic (strictly standing water) (see [22] for details on
habitat choice criteria) (Additional file 1). For some analyses we pooled species in categories (2) and (3), thus
dividing species limited to running water from the rest.
Water flow is the most important habitat characteristic
determining the composition of the assemblages of
aquatic Coleoptera, and species tend to be restricted to
either standing water bodies or to running water, both
in the larval and in the more dispersive adult stage (see
[22,24] and references therein).
To determine the role of niche breadth on range-size
we used the number of different biomes that partly or
completely overlap with the species ranges based on the
biomes delineated by the World Wildlife Fund (http://
www.worldwildlife.org/science/ecoregions/item1847.
html). We discarded biomes that overlapped less than
Abellán and Ribera BMC Evolutionary Biology 2011, 11:344
http://www.biomedcentral.com/1471-2148/11/344
1% of the species’ range, to minimize the effects of
uncertainty in range size calculations. To try to disentangle the effect of the range size per se from that of an
increased ecological tolerance, we tested whether species
that are found in larger biomes have larger ranges than
species in smaller biogeographic units. If so, this would
suggest that range size is a function of the available biogeographic space, rather than the number of biomes
being a function of the size of the range. For that purpose we calculated the mean size of the biomes that
partly or completely overlap with the species ranges
[25].
Data analyses
Phylogenetic signal
We used a randomization procedure to test whether range
attributes exhibit a significant tendency for related species
to resemble each other according to the methodology proposed by Blomberg et al. [17]. The basic idea is to ask
whether a given tree (topology and branch lengths) better
fits a set of tip data as compared with the fit obtained
when the data have been randomly permuted across the
tips of the tree, thus destroying any phylogenetic signal
that may have existed [17]. Thus, the degree of resemblance among relatives can be distinguished from random
by comparing observed patterns of the variance of independent contrasts of the trait to a null model of shuffling
taxa labels across the tips of the phylogeny.
To quantify the amount of phylogenetic signal we calculated the metric K, which compares the observed signal in a trait to the signal under a Brownian motion
model of trait evolution on a phylogeny [17]. The higher
the K statistic, the more phylogenetic signal in a trait. K
values of 1 correspond to a Brownian motion process,
which implies some degree of phylogenetic signal. K
values closer to zero correspond to a random or convergent pattern of evolution, while K values greater than 1
indicate strong phylogenetic signal. We used the R package ‘Picante’ [58] to compute K and the significance test.
We also used phylogenetic eigenvector regression
(PVR; [59]) as an additional assessment of the phylogenetic signal in range properties and to correct for this
signal in analysing the relationship between range-size
and biogeographical variables (see below). The basic
idea of PVR is to carry out a principal coordinate analysis of the matrix of pairwise phylogenetic distances
between species and use the eigenvectors as predictors
in a multiple regression against species traits (in this
case, geographic range properties). The subset of eigenvectors to use as PVR components for each range attribute was obtained using a stepwise multiple regression
[60]. The R 2 of the multiple regression model of the
trait against the eigenvectors provides an estimate of the
amount of phylogenetic signal in the data [59].
Page 5 of 15
To assess if habitat occupation exhibited phylogenetic
signal, habitat type was used as a qualitative (discretelycoded) character, and species were assigned to either of
the three habitat type classes: lotic, lentic or both. In
this case, phylogenetic signal was computed with Pagel’s
l [16,18], a more appropriate approach for discrete
traits. The value of l varies from 0 to 1, where 0 corresponds with the complete absence of phylogenetic structure and 1 means that variation in the trait is perfectly
correlated with phylogeny. We used the fitDiscrete function of the R package ‘Geiger’ [61] to obtain the maximum likelihood estimate of l. In order to address
whether significant phylogenetic signal existed in our
datasets, we compared the negative log likelihood when
there was no signal (i.e. using the tree transformed
lambda = 0) to that when lambda was estimated using
the original tree topology by using a likelihood ratio test.
PGLS correlations
We explored the association between range size and different biogeographical and ecological factors in a phylogenetic framework. We used the Phylogenetic
Generalized Least Squares approach (PGLS; [62]) as
implemented in Compare 4.6 b, which allows tests for
correlations between two continuous traits and between
a discrete independent variable and a continuous dependent variable [63]. PGLS can be viewed as an extension
of Felsenstein’s independent contrasts method [64] that
allows for flexibility in the underlying evolutionary
assumptions. This flexibility is obtained through the use
of a single parameter (alpha), which can be interpreted
as a measure of evolutionary constraint acting on the
phenotypes. When alpha is small, generalized least
squares approximates Felsenstein’s independent contrasts analysis, and when alpha is large, comparative
data are less dependent on phylogeny and approximate
a raw, nonphylogenetic correlation analysis. The Compare software computes the maximum likelihood estimate of alpha (from a range of different alphas), and
provides parameter estimates given that maximum likelihood. To assess the significance of the relationship
between traits we tested if the regression slope differed
from zero. Since the correlation coefficient is directly
related to the regression slope, if this differs significantly
from zero, the correlation coefficient will too. For this,
we used the corMartins function of the R package ‘Ape’
[65] with the estimated value of alpha to create the correlation structure, and then fitted the linear model with
the gls function.
Range-size vs. Age
In order to examine the relationship between geographic
range size and species age, plots of range size (log10
transformed) against species age (i.e. the estimated age
of divergence between species) were produced for each
group [29,31]. Statistical significance was determined
Abellán and Ribera BMC Evolutionary Biology 2011, 11:344
http://www.biomedcentral.com/1471-2148/11/344
Page 6 of 15
using linear or quadratic regressions as appropriate
according to an extra sum-of-squares F test.
Global determinants of range size
A multiple regression analysis was used to determine
the relative importance of the different variables, regressing the log-transformed range-size (response variable)
upon the explanatory variables (range properties, species’ age, niche breadth and habitat preference) and the
phylogenetic PVR components. Habitat type was coded
as a dummy variable (0, strictly lotic species; 1, species
inhabiting lentic waters or both lentic and lotic waters).
Preliminary analyses showed that some subsets of the
geographic properties of the range were often correlated.
This was also corroborated by visual examination
through Principal Component Analysis. Similarly, the
average size of biomes and the number of biomes were
usually highly correlated with maximum latitude or
longitude. As a consequence, high levels of multicollinearity were detected, as indicated by high values of the
Variance Inflation Factor [66]. To avoid this multicollinearity, only maximum latitude and longitude (the main
determinants of range-size as assessed by PGLS, and
usually not correlated between them) were finally used
as biogeographic factors. In those lineages in which both
variables (maxLat and maxLon) were significantly correlated, only the main determinant of range size was used.
We used a stepwise model selection procedure to
select the multiple linear regression models with the
smallest Akaike’s Information Criterion (AIC). To
account for multiple comparisons we applied the Bonferroni correction.
Results
Phylogenetic signal
Range size showed significant phylogenetic signal (as measured with a randomization test) in four of the ten lineages,
displaying relatively low values of the K statistic (Table 2).
However, no phylogenetic signal remained statistically significant for range size after Bonferroni correction. The
maximum longitude and the longitude of centroid showed
significant phylogenetic signal for most of the studied
lineages (the exceptions were Ilybius, Enicocerus and
Hydrochus, the former two encompassing few taxa) with
values of K generally high, while minimum longitude, maximum latitude, and the latitude of centroid showed significant phylogenetic signal in four of the lineages. After
Bonferroni correction, only some lineages showed significant phylogenetic signal, with maximum longitude displaying the higher number of significant cases.
The average size of biomes did not show phylogenetic
signal in any lineage whereas niche breadth, as estimated by number of biomes overlapping with geographic range, displayed phylogenetic signal only for the
H. dentipes and Graptodytes lineages. Three lineages did
not exhibit phylogenetic signal for any of the range attributes: Ilybius, Enicocerus and Hydrochus.
Table 2 Phylogenetic signal (K statistic)
Lineage
Size
maxLon
minLon
maxLat
minLat
LonC
LatC
avB
nB
Ilybius
0.06
(0.787)
0.24
(0.076)
0.11
(0.456)
0.13
(0.418)
0.16
(0.201)
0.16
(0.189)
0.23
(0.097)
0.07
(0.729)
0.11
(0.565)
Deronectes
0.22
(0.386)
0.91
(0.001*)
0.63
(0.038*)
0.19
(0.581)
0.41
(0.037*)
0.91
(0.003**)
0.26
(0.262)
0.25
(0.228)
0.31
(0.215)
Enicocerus
0.48
(0.777)
0.60
(0.572)
0.65
(0.525)
0.30
(0.977)
0.74
(0.547)
0.59
(0.613)
0.32
(0.972)
0.42
(0.861)
0.42
(0.923)
Limnebius
0.76
(0.034*)
1.06
(0.042*)
0.73
(0.041*)
1.23
(0.003**)
0.43
(0.315)
0.90
(0.046*)
1.00
(0.009*)
0.47
(0.186)
0.72
(0.071)
H. gracilis
0.59
(0.052)
0.829
(0.011*)
0.329
(0.515)
0.28
(0.655)
0.69
(0.017*)
0.69
(0.018*)
0.26
(0.730)
0.22
(0.836)
0.54
(0.111)
H. dentipes
0.60
(0.046*)
1.30
(0.000**)
0.73
(0.008*)
1.11
(0.000**)
0.21
(0.690)
1.22
(0.000**)
0.78
(0.008*)
0.50
(0.086)
0.82
(0.013*)
Phothydraena
0.16
(0.841)
0.70
(0.039*)
0.85
(0.046*)
0.33
(0.542)
0.49
(0.243)
1.08
(0.005**)
0.74
(0.105)
0.63
(0.059)
0.20
(0.668)
Graptodytes
0.30
(0.019*)
0.62
(0.001**)
0.27
(0.069)
0.39
(0.008*)
0.22
(0.067)
0.55
(0.000**)
0.65
(0.000**)
0.19
(0.099)
0.33
(0.009*)
Hydroporus
0.57
(0.008*)
0.81
(0.001**)
0.31
(0.318)
0.62
(0.018*)
0.64
(0.022*)
0.69
(0.003**)
0.92
(0.001**)
0.36
(0.289)
0.32
(0.119)
Hydrochus
0.73
(0.394)
0.88
(0.173)
0.80
(0.45)
0.81
(0.258)
0.62
(0.617)
0.87
(0.192)
0.82
(0.226)
0.40
(0.898)
0.72
(0.375)
Phylogenetic signal as estimated with K statistic for different range attributes. The P-value, based on the variance of phylogenetically independent contrasts
relative to tip shuffling randomization, is provided in parenthesis. Asterisks indicate significant P-values: * P < 0.05; ** P < 0.005 (Bonferroni critical value for ten
tests). Codes: Size, range size (log-transformed); maxLong and minLong, maximum and minimum longitude of ranges, respectively; maxLat and minLat, maximum
and minimum latitude, respectively; LonC and LatC, longitude and latitude of centroids, respectively; avB, average size of the biogeographic provinces that partly
or completely overlap with the species ranges (log-transformed); nB, number of biogeographic provinces that partly or completely overlap with the species
ranges.
Abellán and Ribera BMC Evolutionary Biology 2011, 11:344
http://www.biomedcentral.com/1471-2148/11/344
Page 7 of 15
The phylogenetic eigenvector regression (PVR) gave in
general a stronger phylogenetic signal than the randomization tests. This was especially evident in Ilybius, for
which PVR showed high levels of phylogenetic signal in
contrast with non significant K values. Range-size
showed moderate levels of phylogenetic signal, but some
descriptors of the spatial position of ranges had higher
levels (Table 3), which remained mostly significant after
Bonferroni correction. Notably, a high fraction of the
variance in the northern longitudinal limits and longitudinal centroids of the ranges in most lineages was
explained by phylogenetic relationships among species.
As happened with the randomization tests, the average
size of biomes and the number of biomes were rarely
correlated with phylogeny (Table 3).
Among the lineages with species with both habitat types
(lotic and lentic), habitat type exhibited significant phylogenetic signal for Hydroporus, as computed with Pagel’s l
(l = 0.89, P < 0.01), but not for “Phothydraena“ (l < 0.00,
P = 1.0), Graptodytes (l < 0.00, P = 0.4) or Hydrochus (l =
1, P = 0.3). Estimates of the phylogenetic signal with the K
statistic gave similar results, with Hydroporus exhibiting
significant signal (K = 0.63, P < 0.01) but not the remaining lineages (Phothydraena, K = 0.17, P = 0.9; Graptodytes,
K = 0.15, P = 0.3; Hydrochus, K = 0.98, P = 0.1).
Ecological tolerance
Range size was significantly and positively correlated
with both the spatial extent and the number of biomes
in which species are found in most of the tested
lineages, as measured after phylogenetic correction with
PGLS correlations (Table 4). Correlations remained significant after Bonferroni correction only in the case of
the number of occupied biomes.
Dispersal ability
Habitat preference, taken as a surrogate of dispersal
ability, was significantly and positively correlated with
range-size after phylogenetic correction with PGLS in
those lineages with both lotic and lentic species, with
the only exception of Hydrochus (Table 4). Correlation
values were particularly high for Phothydraena and
Graptodytes, which remained significant after Bonferroni
correction.
Geographic location
After phylogenetic correction, range size was significantly and positively correlated with maximum latitude
and longitude for most of the lineages, with the only
exceptions of Limnebius (marginally significant) and
Phothydraena (Table 4; see also Figure 1). These positive correlations remained significant for maximum latitude after Bonferroni correction for multiple tests.
Maximum latitude and longitude had also the highest
correlation values for most of the lineages. The latitude
and longitude of the centroid were correlated with the
size of geographic range in four and five of the ten
lineages respectively: Enicocerus, Hydraena dentipes,
Graptodytes and Hydroporus, plus Ilybius in the case of
latitude of the centroid (Table 4). Minimum longitude
and latitude were only significantly correlated with
range size in Ilybius and Hydrochus respectively, in both
cases with a negative correlation (Table 4). Range size in
Phothydraena was not significantly correlated with any
of the range proprieties studied, although minimum latitude was marginally significant (Table 4).
Age and area
The preferred model for the plot of global geographic
range size against species age was in all cases a straight
line, i.e. quadratic regression did not provide a significantly better fit to the data than did linear regression
(Figure 2; see also Additional file 2, Table S2). The general tendency was to increase range size with evolutionary age, but with the sole exception of the H. dentipes
lineage (with a significant positive relationship) the
Table 3 PVR coefficients
Lineage
Ilybius
Size
maxLon
minLon
maxLat
minLat
LonC
LatC
avB
nB
0.843**
0.983**
1.000**
0.719**
0.651**
0.862**
0.222
0.103