Limited sampling hampers "big data" estimation of species richness in a tropical biodiversity hotspot

Kristine Engemann, Brian Enquist, Brody Sandel, Brad Boyle, Peter M. Jørgensen, Naia Morueta-Holme, Robert K. Peet, Cyrille Violle, Jens Christian Svenning

Research output: Contribution to journalArticle

44 Citations (Scopus)

Abstract

Macro-scale species richness studies often use museum specimens as their main source of information. However, such datasets are often strongly biased due to variation in sampling effort in space and time. These biases may strongly affect diversity estimates and may, thereby, obstruct solid inference on the underlying diversity drivers, as well as mislead conservation prioritization. In recent years, this has resulted in an increased focus on developing methods to correct for sampling bias. In this study, we use sample-size-correcting methods to examine patterns of tropical plant diversity in Ecuador, one of the most species-rich and climatically heterogeneous biodiversity hotspots. Species richness estimates were calculated based on 205,735 georeferenced specimens of 15,788 species using the Margalef diversity index, the Chao estimator, the second-order Jackknife and Bootstrapping resampling methods, and Hill numbers and rarefaction. Species richness was heavily correlated with sampling effort, and only rarefaction was able to remove this effect, and we recommend this method for estimation of species richness with "big data" collections.

Original languageEnglish (US)
Pages (from-to)807-820
Number of pages14
JournalEcology and Evolution
Volume5
Issue number3
DOIs
StatePublished - Feb 1 2015

Fingerprint

species richness
biodiversity
species diversity
sampling
sampling bias
bootstrapping
prioritization
information sources
Ecuador
methodology
diversity index
space and time
museum
method

Keywords

  • Ecuador
  • Rarefaction
  • Resampling
  • Richness estimation
  • Sampling effort

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Ecology
  • Nature and Landscape Conservation

Cite this

Engemann, K., Enquist, B., Sandel, B., Boyle, B., Jørgensen, P. M., Morueta-Holme, N., ... Svenning, J. C. (2015). Limited sampling hampers "big data" estimation of species richness in a tropical biodiversity hotspot. Ecology and Evolution, 5(3), 807-820. https://doi.org/10.1002/ece3.1405

Limited sampling hampers "big data" estimation of species richness in a tropical biodiversity hotspot. / Engemann, Kristine; Enquist, Brian; Sandel, Brody; Boyle, Brad; Jørgensen, Peter M.; Morueta-Holme, Naia; Peet, Robert K.; Violle, Cyrille; Svenning, Jens Christian.

In: Ecology and Evolution, Vol. 5, No. 3, 01.02.2015, p. 807-820.

Research output: Contribution to journalArticle

Engemann, K, Enquist, B, Sandel, B, Boyle, B, Jørgensen, PM, Morueta-Holme, N, Peet, RK, Violle, C & Svenning, JC 2015, 'Limited sampling hampers "big data" estimation of species richness in a tropical biodiversity hotspot', Ecology and Evolution, vol. 5, no. 3, pp. 807-820. https://doi.org/10.1002/ece3.1405
Engemann, Kristine ; Enquist, Brian ; Sandel, Brody ; Boyle, Brad ; Jørgensen, Peter M. ; Morueta-Holme, Naia ; Peet, Robert K. ; Violle, Cyrille ; Svenning, Jens Christian. / Limited sampling hampers "big data" estimation of species richness in a tropical biodiversity hotspot. In: Ecology and Evolution. 2015 ; Vol. 5, No. 3. pp. 807-820.
@article{ace0acc028ab4647886fab8f129fa246,
title = "Limited sampling hampers {"}big data{"} estimation of species richness in a tropical biodiversity hotspot",
abstract = "Macro-scale species richness studies often use museum specimens as their main source of information. However, such datasets are often strongly biased due to variation in sampling effort in space and time. These biases may strongly affect diversity estimates and may, thereby, obstruct solid inference on the underlying diversity drivers, as well as mislead conservation prioritization. In recent years, this has resulted in an increased focus on developing methods to correct for sampling bias. In this study, we use sample-size-correcting methods to examine patterns of tropical plant diversity in Ecuador, one of the most species-rich and climatically heterogeneous biodiversity hotspots. Species richness estimates were calculated based on 205,735 georeferenced specimens of 15,788 species using the Margalef diversity index, the Chao estimator, the second-order Jackknife and Bootstrapping resampling methods, and Hill numbers and rarefaction. Species richness was heavily correlated with sampling effort, and only rarefaction was able to remove this effect, and we recommend this method for estimation of species richness with {"}big data{"} collections.",
keywords = "Ecuador, Rarefaction, Resampling, Richness estimation, Sampling effort",
author = "Kristine Engemann and Brian Enquist and Brody Sandel and Brad Boyle and J{\o}rgensen, {Peter M.} and Naia Morueta-Holme and Peet, {Robert K.} and Cyrille Violle and Svenning, {Jens Christian}",
year = "2015",
month = "2",
day = "1",
doi = "10.1002/ece3.1405",
language = "English (US)",
volume = "5",
pages = "807--820",
journal = "Ecology and Evolution",
issn = "2045-7758",
publisher = "John Wiley and Sons Ltd",
number = "3",

}

TY - JOUR

T1 - Limited sampling hampers "big data" estimation of species richness in a tropical biodiversity hotspot

AU - Engemann, Kristine

AU - Enquist, Brian

AU - Sandel, Brody

AU - Boyle, Brad

AU - Jørgensen, Peter M.

AU - Morueta-Holme, Naia

AU - Peet, Robert K.

AU - Violle, Cyrille

AU - Svenning, Jens Christian

PY - 2015/2/1

Y1 - 2015/2/1

N2 - Macro-scale species richness studies often use museum specimens as their main source of information. However, such datasets are often strongly biased due to variation in sampling effort in space and time. These biases may strongly affect diversity estimates and may, thereby, obstruct solid inference on the underlying diversity drivers, as well as mislead conservation prioritization. In recent years, this has resulted in an increased focus on developing methods to correct for sampling bias. In this study, we use sample-size-correcting methods to examine patterns of tropical plant diversity in Ecuador, one of the most species-rich and climatically heterogeneous biodiversity hotspots. Species richness estimates were calculated based on 205,735 georeferenced specimens of 15,788 species using the Margalef diversity index, the Chao estimator, the second-order Jackknife and Bootstrapping resampling methods, and Hill numbers and rarefaction. Species richness was heavily correlated with sampling effort, and only rarefaction was able to remove this effect, and we recommend this method for estimation of species richness with "big data" collections.

AB - Macro-scale species richness studies often use museum specimens as their main source of information. However, such datasets are often strongly biased due to variation in sampling effort in space and time. These biases may strongly affect diversity estimates and may, thereby, obstruct solid inference on the underlying diversity drivers, as well as mislead conservation prioritization. In recent years, this has resulted in an increased focus on developing methods to correct for sampling bias. In this study, we use sample-size-correcting methods to examine patterns of tropical plant diversity in Ecuador, one of the most species-rich and climatically heterogeneous biodiversity hotspots. Species richness estimates were calculated based on 205,735 georeferenced specimens of 15,788 species using the Margalef diversity index, the Chao estimator, the second-order Jackknife and Bootstrapping resampling methods, and Hill numbers and rarefaction. Species richness was heavily correlated with sampling effort, and only rarefaction was able to remove this effect, and we recommend this method for estimation of species richness with "big data" collections.

KW - Ecuador

KW - Rarefaction

KW - Resampling

KW - Richness estimation

KW - Sampling effort

UR - http://www.scopus.com/inward/record.url?scp=84922756988&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84922756988&partnerID=8YFLogxK

U2 - 10.1002/ece3.1405

DO - 10.1002/ece3.1405

M3 - Article

VL - 5

SP - 807

EP - 820

JO - Ecology and Evolution

JF - Ecology and Evolution

SN - 2045-7758

IS - 3

ER -