4
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Machine learning of language use on Twitter reveals weak and non-specific predictions

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Depressed individuals use language differently than healthy controls and it has been proposed that social media posts can be used to identify depression. Much of the evidence behind this claim relies on indirect measures of mental health and few studies have tested if these language features are specific to depression versus other aspects of mental health. We analysed the Tweets of 1006 participants who completed questionnaires assessing symptoms of depression and 8 other mental health conditions. Daily Tweets were subjected to textual analysis and the resulting linguistic features were used to train an Elastic Net model on depression severity, using nested cross-validation. We then tested performance in a held-out test set (30%), comparing predictions of depression versus 8 other aspects of mental health. The depression trained model had modest out-of-sample predictive performance, explaining 2.5% of variance in depression symptoms ( R 2 = 0.025, r = 0.16). The performance of this model was as-good or superior when used to identify other aspects of mental health: schizotypy, social anxiety, eating disorders, generalised anxiety, above chance for obsessive-compulsive disorder, apathy, but not significant for alcohol abuse or impulsivity. Machine learning analysis of social media data, when trained on well-validated clinical instruments, could not make meaningful individualised predictions regarding users’ mental health. Furthermore, language use associated with depression was non-specific, having similar performance in predicting other mental health problems.

          Related collections

          Most cited references56

          • Record: found
          • Abstract: not found
          • Article: not found

          Regularization and variable selection via the elastic net

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Research domain criteria (RDoC): toward a new classification framework for research on mental disorders.

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Prevalence, severity, and comorbidity of 12-month DSM-IV disorders in the National Comorbidity Survey Replication.

              Little is known about the general population prevalence or severity of DSM-IV mental disorders. To estimate 12-month prevalence, severity, and comorbidity of DSM-IV anxiety, mood, impulse control, and substance disorders in the recently completed US National Comorbidity Survey Replication. Nationally representative face-to-face household survey conducted between February 2001 and April 2003 using a fully structured diagnostic interview, the World Health Organization World Mental Health Survey Initiative version of the Composite International Diagnostic Interview. Nine thousand two hundred eighty-two English-speaking respondents 18 years and older. Twelve-month DSM-IV disorders. Twelve-month prevalence estimates were anxiety, 18.1%; mood, 9.5%; impulse control, 8.9%; substance, 3.8%; and any disorder, 26.2%. Of 12-month cases, 22.3% were classified as serious; 37.3%, moderate; and 40.4%, mild. Fifty-five percent carried only a single diagnosis; 22%, 2 diagnoses; and 23%, 3 or more diagnoses. Latent class analysis detected 7 multivariate disorder classes, including 3 highly comorbid classes representing 7% of the population. Although mental disorders are widespread, serious cases are concentrated among a relatively small proportion of cases with high comorbidity.
                Bookmark

                Author and article information

                Contributors
                sekelley@tcd.ie
                Journal
                NPJ Digit Med
                NPJ Digit Med
                NPJ Digital Medicine
                Nature Publishing Group UK (London )
                2398-6352
                25 March 2022
                25 March 2022
                2022
                : 5
                : 35
                Affiliations
                [1 ]GRID grid.8217.c, ISNI 0000 0004 1936 9705, School of Psychology, , Trinity College Dublin, ; Dublin, Ireland
                [2 ]GRID grid.8217.c, ISNI 0000 0004 1936 9705, Trinity College Institute of Neuroscience, , Trinity College Dublin, ; Dublin, Ireland
                [3 ]GRID grid.8217.c, ISNI 0000 0004 1936 9705, Global Brain Health Institute, , Trinity College Dublin, ; Dublin, Ireland
                Author information
                http://orcid.org/0000-0003-4387-881X
                http://orcid.org/0000-0002-2790-7281
                http://orcid.org/0000-0001-9065-403X
                Article
                576
                10.1038/s41746-022-00576-y
                8956571
                35338248
                9cc2ee44-15a0-4842-8a48-3162fa41dabe
                © The Author(s) 2022

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 23 March 2021
                : 11 February 2022
                Funding
                Funded by: SFI-HRB-Wellcome Trust (204814/Z/16/A)
                Categories
                Article
                Custom metadata
                © The Author(s) 2022

                human behaviour
                human behaviour

                Comments

                Comment on this article