36
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Computer-Assisted Text Analysis for Comparative Politics

      Read this article at

      ScienceOpenPublisher
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Recent advances in research tools for the systematic analysis of textual data are enabling exciting new research throughout the social sciences. For comparative politics, scholars who are often interested in non-English and possibly multilingual textual datasets, these advances may be difficult to access. This article discusses practical issues that arise in the processing, management, translation, and analysis of textual data with a particular focus on how procedures differ across languages. These procedures are combined in two applied examples of automated text analysis using the recently introduced Structural Topic Model. We also show how the model can be used to analyze data that have been translated into a single language via machine translation tools. All the methods we describe here are implemented in open-source software packages available from the authors.

          Related collections

          Most cited references27

          • Record: found
          • Abstract: not found
          • Book: not found

          The operated Markov´s chains in economy (discrete chains of Markov with the income)

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Finding scientific topics.

            A first step in identifying the content of a document is determining which topics that document addresses. We describe a generative model for documents, introduced by Blei, Ng, and Jordan [Blei, D. M., Ng, A. Y. & Jordan, M. I. (2003) J. Machine Learn. Res. 3, 993-1022], in which each document is generated by choosing a distribution over topics and then choosing each word in the document from a topic selected according to this distribution. We then present a Markov chain Monte Carlo algorithm for inference in this model. We use this algorithm to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics. We show that the extracted topics capture meaningful structure in the data, consistent with the class designations provided by the authors of the articles, and outline further applications of this analysis, including identifying "hot topics" by examining temporal dynamics and tagging abstracts to illustrate semantic content.
              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              BLEU

                Bookmark

                Author and article information

                Journal
                Political Analysis
                Polit. anal.
                Oxford University Press (OUP)
                1047-1987
                1476-4989
                2015
                January 04 2017
                2015
                : 23
                : 2
                : 254-277
                Article
                10.1093/pan/mpu019
                a8d183e2-8b62-483d-b310-fc397aeab669
                © 2015

                https://www.cambridge.org/core/terms

                History

                Comments

                Comment on this article