114
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      The coefficient of determination R 2 and intra-class correlation coefficient from generalized linear mixed-effects models revisited and expanded

      research-article

      Read this article at

          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The coefficient of determination R 2 quantifies the proportion of variance explained by a statistical model and is an important summary statistic of biological interest. However, estimating R 2 for generalized linear mixed models (GLMMs) remains challenging. We have previously introduced a version of R 2 that we called for Poisson and binomial GLMMs, but not for other distributional families. Similarly, we earlier discussed how to estimate intra-class correlation coefficients (ICCs) using Poisson and binomial GLMMs. In this paper, we generalize our methods to all other non-Gaussian distributions, in particular to negative binomial and gamma distributions that are commonly used for modelling biological data. While expanding our approach, we highlight two useful concepts for biologists, Jensen's inequality and the delta method, both of which help us in understanding the properties of GLMMs. Jensen's inequality has important implications for biologically meaningful interpretation of GLMMs, whereas the delta method allows a general derivation of variance associated with non-Gaussian distributions. We also discuss some special considerations for binomial GLMMs with binary or proportion data. We illustrate the implementation of our extension by worked examples from the field of ecology and evolution in the R environment. However, our method can be used across disciplines and regardless of statistical environments.

          Related collections

          Most cited references29

          • Record: found
          • Abstract: not found
          • Article: not found

          Unrepeatable Repeatabilities: A Common Mistake

            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Using observation-level random effects to model overdispersion in count data in ecology and evolution

            Overdispersion is common in models of count data in ecology and evolutionary biology, and can occur due to missing covariates, non-independent (aggregated) data, or an excess frequency of zeroes (zero-inflation). Accounting for overdispersion in such models is vital, as failing to do so can lead to biased parameter estimates, and false conclusions regarding hypotheses of interest. Observation-level random effects (OLRE), where each data point receives a unique level of a random effect that models the extra-Poisson variation present in the data, are commonly employed to cope with overdispersion in count data. However studies investigating the efficacy of observation-level random effects as a means to deal with overdispersion are scarce. Here I use simulations to show that in cases where overdispersion is caused by random extra-Poisson noise, or aggregation in the count data, observation-level random effects yield more accurate parameter estimates compared to when overdispersion is simply ignored. Conversely, OLRE fail to reduce bias in zero-inflated data, and in some cases increase bias at high levels of overdispersion. There was a positive relationship between the magnitude of overdispersion and the degree of bias in parameter estimates. Critically, the simulations reveal that failing to account for overdispersion in mixed models can erroneously inflate measures of explained variance (r 2), which may lead to researchers overestimating the predictive power of variables of interest. This work suggests use of observation-level random effects provides a simple and robust means to account for overdispersion in count data, but also that their ability to minimise bias is not uniform across all types of overdispersion and must be applied judiciously.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              A comparison of observation-level random effect and Beta-Binomial models for modelling overdispersion in Binomial data in ecology & evolution

              Overdispersion is a common feature of models of biological data, but researchers often fail to model the excess variation driving the overdispersion, resulting in biased parameter estimates and standard errors. Quantifying and modeling overdispersion when it is present is therefore critical for robust biological inference. One means to account for overdispersion is to add an observation-level random effect (OLRE) to a model, where each data point receives a unique level of a random effect that can absorb the extra-parametric variation in the data. Although some studies have investigated the utility of OLRE to model overdispersion in Poisson count data, studies doing so for Binomial proportion data are scarce. Here I use a simulation approach to investigate the ability of both OLRE models and Beta-Binomial models to recover unbiased parameter estimates in mixed effects models of Binomial data under various degrees of overdispersion. In addition, as ecologists often fit random intercept terms to models when the random effect sample size is low (<5 levels), I investigate the performance of both model types under a range of random effect sample sizes when overdispersion is present. Simulation results revealed that the efficacy of OLRE depends on the process that generated the overdispersion; OLRE failed to cope with overdispersion generated from a Beta-Binomial mixture model, leading to biased slope and intercept estimates, but performed well for overdispersion generated by adding random noise to the linear predictor. Comparison of parameter estimates from an OLRE model with those from its corresponding Beta-Binomial model readily identified when OLRE were performing poorly due to disagreement between effect sizes, and this strategy should be employed whenever OLRE are used for Binomial data to assess their reliability. Beta-Binomial models performed well across all contexts, but showed a tendency to underestimate effect sizes when modelling non-Beta-Binomial data. Finally, both OLRE and Beta-Binomial models performed poorly when models contained <5 levels of the random intercept term, especially for estimating variance components, and this effect appeared independent of total sample size. These results suggest that OLRE are a useful tool for modelling overdispersion in Binomial data, but that they do not perform well in all circumstances and researchers should take care to verify the robustness of parameter estimates of OLRE models.
                Bookmark

                Author and article information

                Journal
                J R Soc Interface
                J R Soc Interface
                RSIF
                royinterface
                Journal of the Royal Society Interface
                The Royal Society
                1742-5689
                1742-5662
                September 2017
                13 September 2017
                13 September 2017
                : 14
                : 134
                : 20170213
                Affiliations
                [1 ]Evolution and Ecology Research Centre, and School of Biological, Earth and Environmental Sciences, University of New South Wales , Sydney, New South Wales 2052, Australia
                [2 ]Diabetes and Metabolism Division, Garvan Institute of Medical Research , Sydney, New South Wales 2010, Australia
                [3 ]Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow , Graham Kerr Building, Glasgow G12 8QQ, UK
                [4 ]Population Ecology Group, Institute of Ecology, Friedrich Schiller University Jena , Dornburger Strasse 159, 07743 Jena, Germany
                Author notes
                Author information
                http://orcid.org/0000-0002-7765-5182
                http://orcid.org/0000-0001-6663-7520
                http://orcid.org/0000-0002-9124-2261
                Article
                rsif20170213
                10.1098/rsif.2017.0213
                5636267
                28904005
                d04fd88d-91e4-4c86-86e7-92932228dca1
                © 2017 The Authors.

                Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.

                History
                : 20 March 2017
                : 2 August 2017
                Funding
                Funded by: Australian Research Council, http://dx.doi.org/10.13039/501100000923;
                Award ID: FT130100268
                Funded by: Deutsche Forschungsgemeinschaft, http://dx.doi.org/10.13039/501100001659;
                Award ID: SCHI 1188/1-2
                Categories
                1004
                28
                24
                70
                Life Sciences–Mathematics interface
                Research Article
                Custom metadata
                September, 2017

                Life sciences
                repeatability,heritability,goodness of fit,model fit,variance decomposition,reliability analysis

                Comments

                Comment on this article