3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Location Extraction from Social Media : Geoparsing, Location Disambiguation, and Geotagging

      Read this article at

      ScienceOpenPublisher
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Location extraction, also called “toponym extraction,” is a field covering geoparsing, extracting spatial representations from location mentions in text, and geotagging, assigning spatial coordinates to content items. This article evaluates five “best-of-class” location extraction algorithms. We develop a geoparsing algorithm using an OpenStreetMap database, and a geotagging algorithm using a language model constructed from social media tags and multiple gazetteers. Third-party work evaluated includes a DBpedia-based entity recognition and disambiguation approach, a named entity recognition and Geonames gazetteer approach, and a Google Geocoder API approach. We perform two quantitative benchmark evaluations, one geoparsing tweets and one geotagging Flickr posts, to compare all approaches. We also perform a qualitative evaluation recalling top N location mentions from tweets during major news events. The OpenStreetMap approach was best (F1 0.90+) for geoparsing English, and the language model approach was best (F1 0.66) for Turkish. The language model was best (F1@1km 0.49) for the geotagging evaluation. The map database was best (R@20 0.60+) in the qualitative evaluation. We report on strengths, weaknesses, and a detailed failure analysis for the approaches and suggest concrete areas for further research.

          Related collections

          Most cited references40

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          You are where you tweet

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            YFCC100M

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Real-Time Crisis Mapping of Natural Disasters Using Social Media

                Bookmark

                Author and article information

                Contributors
                (View ORCID Profile)
                Journal
                ACM Transactions on Information Systems
                ACM Trans. Inf. Syst.
                Association for Computing Machinery (ACM)
                1046-8188
                1558-2868
                October 31 2018
                October 31 2018
                : 36
                : 4
                : 1-27
                Affiliations
                [1 ]University of Southampton IT Innovation Centre, Southampton, UK
                [2 ]Information Technologies Institute, CERTH, Thermi-Thessaloniki, Greece
                Article
                10.1145/3202662
                de9b92f2-3909-4bbe-8647-7187cf1a20ae
                © 2018

                http://www.acm.org/publications/policies/copyright_policy#Background

                History

                Comments

                Comment on this article