Species Distribution Model: Classification and Regression Trees_TAKE2Model Category: Classification and Regression Tree (CART) Model Description: Classification and regression trees work by successively splitting data points at nodes that create more homogeneous groups with respect to explanatory variables. This splitting can be done by a variety of criteria including minimizing sum of squares about the mean for regression trees, and minimizing Shannan-Weiner Diversity index for categorical variables. These nodes are often summarized as the proportion sum of squares explained by the split, and for classification trees misclassification rate is used. When “growing” the tree, nodes continue to be made until improvement from additional nodes is less than some specified threshold. This leaves final tree structure sensitive to this arbitrarily specified cutoff and can results in over-grown or overly simplified trees. Setting a liberal threshold and using techniques for pruning trees, such as finding the best explanatory tree of a given size, and then using cross-validation to find the tree size with the best predictive power. Model Assumptions: In regression trees nodes are often based on Least Squares regression, so the using these trees requires all the assumptions of least squares regression to be met, including normally distributed data and predictors, homogeneity of variances between data points, Model Response Data: These algorithms can work with a variety of response variables including presence-absence, abundance and survival data, but Presence-only data will not work. Model Explanatory Data: These models are well suited for using both categotrical and continuous explanatory variables, in fact this is the difference between classification trees and regression trees. Classifiction trees work with categorical covariates and regression trees work with continuous covariates. However, as the methods used to create nodes are specific to categorical or continuous variables, a single tree cannot handle both categorical and continuous variables. Model Links and Use with R: R Package Tree: Classification and regression treesR Package RPart: Recursive partitioning for classification, regression and survival treeshttps://cran.r-project.org/web/packages/rpart/index.html Example Papers: De’ath & Fabricus (2000) give a great introduction to CARTs for ecolgists and model coral distributions using the method. Glenn De’ath and Katharina E. Fabricius 2000. CLASSIFICATION AND REGRESSION TREES: A POWERFUL YET SIMPLE TECHNIQUE FOR ECOLOGICAL DATA ANALYSIS. Ecology 81:3178–3192 Lawler et al.(2006) review several methods for predicting distribution shifts including regression trees. LAWLER, J. J., WHITE, D., NEILSON, R. P. and BLAUSTEIN, A. R. (2006), Predicting climate-induced range shifts: model differences and model reliability. Global Change Biology, 12. Example with R:
2 Comments
Species Distribution Model: BioclimSteven GrayNovember 16, 2015Model Category: Climate-envelope-model Model Description: Bioclim compares values of environmental variables to the percentile distribution of values at occurence locations to determine similarity.Values around the median are deemed suitable locations with the tails of the percentile being less suitable. Model Assumptions: Species distribution is influenced by climatic variables and at the scale of climate variables, climate variables are normally distributed, and variables influence study species. Model Response Data: Presence-only. Model Explanatory Data: Mainly used for continuous climate variables. GARP was created as a model that has less restrictive assumptions than Bioclim. Model Links and Use with R: linked phrase linked phrase Example Papers: Thorne, J. H., Seo, C., Basabose, A., Gray, M., Belfiore, N. M., & Hijmans, R. J. (2013). Alternative biological assumptions strongly influence models of climate change effects on mountain gorillas. Ecosphere, 4(9), art108. Gomez, J. J., & Cassini, M. H. (2014). Analysis of environmental correlates of sexual segregation in northern elephant seals using species distribution models. Marine biology, 161(2), 481-487. Example with R:
Species Distribution Model: Generalized Additive ModelModel Category: Regression Model Description: Used for fitting non-linear functions. Builds upon the Generalized Linear Model by replacing the linear predictor with an additive predictor (Hastie and Tibshirani 1986) which is useful for modelling more complex ecological responses (e.g., Elith et al. 2006). Model Assumptions: Absence of interaction effects (see link) Effects of the covariates on the response are additive (e.g., Swartzman et al. 1992) Data fits a specific distribution (e.g., Swartzman et al. 1992) Model Response Data: Presence and absence Model Explanatory Data: Categorical or continuous (see Lehmann 1998) Model Links and Use with R: CRAN package ‘mcgv’ see page 36 Generalized additive models: an introduction with R (Wood 2006) also see: Hastie, T., and R. Tibshirani. 1986. Generalized additive models. Statistical Science 1(3):297-310. Example Papers: Example with R: GAM EXAMPLE
|
Spatial Ecology @ MSUClick on "Category" below to search for R code compiled by the Zarnetske Spatial & Community Ecology Lab and students in MSU's Spatial Ecology graduate course (FOR870/FW870) Category
All
Archive
October 2016
|