      Data sets for author name disambiguation: an empirical analysis and a new resource

      research-article


          Abstract

          Data sets of publication metadata with manually disambiguated author names play an important role in current author name disambiguation (AND) research. We review the most important data sets used so far and compare their respective advantages and shortcomings. From the results of this review, we derive a set of general requirements for future AND data sets. These include both trivial requirements, like absence of errors and preservation of author order, and more substantial ones, like full disambiguation and adequate representation of publications with a small number of authors and highly variable author names. On the basis of these requirements, we create and make publicly available a new AND data set, SCAD-zbMATH. Both the quantitative analysis of this data set and the results of our initial AND experiments with a naive baseline algorithm show the SCAD-zbMATH data set to be considerably different from existing ones. We consider it a useful new resource that will challenge the state of the art in AND and benefit the AND research community.


                Author and article information

                Contributors
                mark-christoph.mueller@h-its.org
                Journal
                Scientometrics
                Springer Netherlands (Dordrecht)
                0138-9130
                27 March 2017
                2017
                Volume 111, Issue 3, pages 1467-1500
                Affiliations
                [1] Heidelberg Institute for Theoretical Studies, Heidelberg, Germany (ISNI 0000 0001 2275 2842, GRID grid.424699.4)
                [2] DBLP, Trier, Germany
                [3] Mathematics Department, FIZ Karlsruhe, Berlin, Germany (ISNI 0000 0001 1519 1565, GRID grid.434104.6)
                Author information
                http://orcid.org/0000-0001-5639-7682
                Article
                2363
                10.1007/s11192-017-2363-5
                5438420
                28596627
                c1971199-d294-4405-a4a6-cbad48c4dba2
                © The Author(s) 2017

                Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

                History
                : 5 July 2016
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/501100007316, Klaus Tschira Stiftung;
                Funded by: FundRef http://dx.doi.org/10.13039/501100001664, Leibniz-Gemeinschaft;
                Award ID: SAW-2015-LZI-2
                Categories
                Article
                Custom metadata
                © Akadémiai Kiadó, Budapest, Hungary 2017

                Computer science
                author name disambiguation, author name homography, author name variability, data sets, digital libraries
