43
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Sources of bias in artificial intelligence that perpetuate healthcare disparities—A global review

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          While artificial intelligence (AI) offers possibilities of advanced clinical prediction and decision-making in healthcare, models trained on relatively homogeneous datasets, and populations poorly-representative of underlying diversity, limits generalisability and risks biased AI-based decisions. Here, we describe the landscape of AI in clinical medicine to delineate population and data-source disparities.

          Methods

          We performed a scoping review of clinical papers published in PubMed in 2019 using AI techniques. We assessed differences in dataset country source, clinical specialty, and author nationality, sex, and expertise. A manually tagged subsample of PubMed articles was used to train a model, leveraging transfer-learning techniques (building upon an existing BioBERT model) to predict eligibility for inclusion (original, human, clinical AI literature). Of all eligible articles, database country source and clinical specialty were manually labelled. A BioBERT-based model predicted first/last author expertise. Author nationality was determined using corresponding affiliated institution information using Entrez Direct. And first/last author sex was evaluated using the Gendarize.io API.

          Results

          Our search yielded 30,576 articles, of which 7,314 (23.9%) were eligible for further analysis. Most databases came from the US (40.8%) and China (13.7%). Radiology was the most represented clinical specialty (40.4%), followed by pathology (9.1%). Authors were primarily from either China (24.0%) or the US (18.4%). First and last authors were predominately data experts (i.e., statisticians) (59.6% and 53.9% respectively) rather than clinicians. And the majority of first/last authors were male (74.1%).

          Interpretation

          U.S. and Chinese datasets and authors were disproportionately overrepresented in clinical AI, and almost all of the top 10 databases and author nationalities were from high income countries (HICs). AI techniques were most commonly employed for image-rich specialties, and authors were predominantly male, with non-clinical backgrounds. Development of technological infrastructure in data-poor regions, and diligence in external validation and model re-calibration prior to clinical implementation in the short-term, are crucial in ensuring clinical AI is meaningful for broader populations, and to avoid perpetuating global health inequity.

          Author summary

          Artificial Intelligence (AI) creates opportunities for accurate, objective and immediate decision support in healthcare with little expert input–especially valuable in resource-poor settings where there is shortage of specialist care. Given that AI poorly generalises to cohorts outside those whose data was used to train and validate the algorithms, populations in data-rich regions stand to benefit substantially more vs data-poor regions, entrenching existing healthcare disparities. Here, we show that more than half of the datasets used for clinical AI originate from either the US or China. In addition, the U.S. and China contribute over 40% of the authors of the publications. While the models may perform on-par/better than clinician decision-making in the well-represented regions, benefits elsewhere are not guaranteed. Further, we show discrepancies in gender and specialty representation–notably that almost three-quarters of the coveted first/senior authorship positions were held by men, and radiology accounted for 40% of all clinical AI manuscripts. We emphasize that building equitable sociodemographic representation in data repositories, in author nationality, gender and expertise, and in clinical specialties is crucial in ameliorating health inequities.

          Related collections

          Most cited references60

          • Record: found
          • Abstract: not found
          • Article: not found

          Scoping studies: towards a methodological framework

            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Scoping studies: advancing the methodology

            Background Scoping studies are an increasingly popular approach to reviewing health research evidence. In 2005, Arksey and O'Malley published the first methodological framework for conducting scoping studies. While this framework provides an excellent foundation for scoping study methodology, further clarifying and enhancing this framework will help support the consistency with which authors undertake and report scoping studies and may encourage researchers and clinicians to engage in this process. Discussion We build upon our experiences conducting three scoping studies using the Arksey and O'Malley methodology to propose recommendations that clarify and enhance each stage of the framework. Recommendations include: clarifying and linking the purpose and research question (stage one); balancing feasibility with breadth and comprehensiveness of the scoping process (stage two); using an iterative team approach to selecting studies (stage three) and extracting data (stage four); incorporating a numerical summary and qualitative thematic analysis, reporting results, and considering the implications of study findings to policy, practice, or research (stage five); and incorporating consultation with stakeholders as a required knowledge translation component of scoping study methodology (stage six). Lastly, we propose additional considerations for scoping study methodology in order to support the advancement, application and relevance of scoping studies in health research. Summary Specific recommendations to clarify and enhance this methodology are outlined for each stage of the Arksey and O'Malley framework. Continued debate and development about scoping study methodology will help to maximize the usefulness and rigor of scoping study findings within healthcare research and practice.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Systematic review or scoping review? Guidance for authors when choosing between a systematic or scoping review approach

              Background Scoping reviews are a relatively new approach to evidence synthesis and currently there exists little guidance regarding the decision to choose between a systematic review or scoping review approach when synthesising evidence. The purpose of this article is to clearly describe the differences in indications between scoping reviews and systematic reviews and to provide guidance for when a scoping review is (and is not) appropriate. Results Researchers may conduct scoping reviews instead of systematic reviews where the purpose of the review is to identify knowledge gaps, scope a body of literature, clarify concepts or to investigate research conduct. While useful in their own right, scoping reviews may also be helpful precursors to systematic reviews and can be used to confirm the relevance of inclusion criteria and potential questions. Conclusions Scoping reviews are a useful tool in the ever increasing arsenal of evidence synthesis approaches. Although conducted for different purposes compared to systematic reviews, scoping reviews still require rigorous and transparent methods in their conduct to ensure that the results are trustworthy. Our hope is that with clear guidance available regarding whether to conduct a scoping review or a systematic review, there will be less scoping reviews being performed for inappropriate indications better served by a systematic review, and vice-versa.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: InvestigationRole: MethodologyRole: Project administrationRole: SupervisionRole: VisualizationRole: Writing – review & editing
                Role: ConceptualizationRole: VisualizationRole: Writing – review & editing
                Role: ResourcesRole: SoftwareRole: SupervisionRole: VisualizationRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: MethodologyRole: ResourcesRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Formal analysisRole: InvestigationRole: MethodologyRole: Project administration
                Role: Formal analysisRole: InvestigationRole: MethodologyRole: ResourcesRole: SoftwareRole: SupervisionRole: VisualizationRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: Project administrationRole: ResourcesRole: SupervisionRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Data curationRole: Writing – review & editing
                Role: Data curationRole: Formal analysis
                Role: Data curationRole: Formal analysisRole: Visualization
                Role: ConceptualizationRole: Data curationRole: Writing – review & editing
                Role: ConceptualizationRole: Formal analysisRole: InvestigationRole: MethodologyRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Writing – review & editing
                Role: Data curationRole: VisualizationRole: Writing – review & editing
                Role: Editor
                Journal
                PLOS Digit Health
                PLOS Digit Health
                plos
                PLOS Digital Health
                Public Library of Science (San Francisco, CA USA )
                2767-3170
                31 March 2022
                March 2022
                : 1
                : 3
                : e0000022
                Affiliations
                [1 ] Massachusetts Institute of Technology, Institute for Medical Engineering and Science, Cambridge, MA, United States of America
                [2 ] Harvard TH Chan School of Public Health, Department of Biostatistics, Boston, MA, United States of America
                [3 ] Beth Israel Deaconess Medical Center, Department of Medicine, Boston, MA, United States of America
                [4 ] Harvard Medical School, Department of Library Services, Boston, MA, United States of America
                [5 ] Massachusetts Institute of Technology, Institute for Data, Systems and Society, Cambridge, MA, United States of America
                [6 ] Harvard Medical School, Boston, MA, United States of America
                [7 ] Adobe Inc, Adobe Research, San Jose, CA, United States of America
                [8 ] Montpellier University, Montpellier Research in Management, Montpellier, France
                [9 ] Harvard TH Chan School of Public Health, Boston, MA, United States of America
                [10 ] Massachusetts General Hospital, Harvard Medical School, Boston, MA, United States of America
                [11 ] Massachusetts Institute of Technology, Department of Computer Science and Molecular Biology, Cambridge, MA, United States of America
                [12 ] Einstein Medical Center Philadelphia, Department of Medicine, Philadelphia, PA, United States of America
                [13 ] BeiGene, Applied Innovation, Cambridge, MA, United States of America
                [14 ] Emory University, Department of Radiology and Biomedical Informatics, Atlanta, GA, United States of America
                Brown University, UNITED STATES
                Author notes

                Leo Anthony Celi is the Editor-in Chief of PLOS Digital Health and Judy Gichoya Wawira is a Section Editor for PLOS Digital Health.

                Author information
                https://orcid.org/0000-0002-5786-2627
                https://orcid.org/0000-0001-6119-0889
                https://orcid.org/0000-0002-1119-1346
                https://orcid.org/0000-0002-4347-0198
                https://orcid.org/0000-0002-2122-6741
                https://orcid.org/0000-0003-4054-5112
                https://orcid.org/0000-0003-1319-6893
                https://orcid.org/0000-0003-0851-6223
                Article
                PDIG-D-21-00034
                10.1371/journal.pdig.0000022
                9931338
                36812532
                1236309e-1882-4715-b2ed-8bb4e8c7a67a
                © 2022 Celi et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 10 July 2021
                : 7 February 2022
                Page count
                Figures: 6, Tables: 2, Pages: 19
                Funding
                LAC is funded by NIBIB grant R01 EB017205. The funders of the grant had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Grant detail can be found at: https://grantome.com/grant/NIH/R01-EB017205-01A1 No other authors received any specific funding for this work.
                Categories
                Research Article
                Computer and Information Sciences
                Artificial Intelligence
                Medicine and Health Sciences
                Radiology and Imaging
                Medicine and Health Sciences
                Epidemiology
                Medical Risk Factors
                Computer and Information Sciences
                Artificial Intelligence
                Machine Learning
                Biology and Life Sciences
                Bioengineering
                Biotechnology
                Medical Devices and Equipment
                Engineering and Technology
                Bioengineering
                Biotechnology
                Medical Devices and Equipment
                Medicine and Health Sciences
                Medical Devices and Equipment
                Biology and Life Sciences
                Ecology
                Ecological Metrics
                Species Diversity
                Ecology and Environmental Sciences
                Ecology
                Ecological Metrics
                Species Diversity
                Medicine and Health Sciences
                Pathology and Laboratory Medicine
                Clinical Pathology
                Medicine and Health Sciences
                Ophthalmology
                Custom metadata
                All the code used to build the machine learnings models is available at GitHub under https://github.com/Rebero/ml-disparities-mit.

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content63

                Cited by86

                Most referenced authors1,032