1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Comparing different versions of computer-aided detection products when reading chest X-rays for tuberculosis

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Computer-aided detection (CAD) was recently recommended by the WHO for TB screening and triage based on several evaluations, but unlike traditional diagnostic tests, software versions are updated frequently and require constant evaluation. Since then, newer versions of two of the evaluated products have already been released. We used a case control sample of 12,890 chest X-rays to compare performance and model the programmatic effect of upgrading to newer versions of CAD4TB and qXR. We compared the area under the receiver operating characteristic curve (AUC), overall, and with data stratified by age, TB history, gender, and patient source. All versions were compared against radiologist readings and WHO’s Target Product Profile (TPP) for a TB triage test. Both newer versions significantly outperformed their predecessors in terms of AUC: CAD4TB version 6 (0.823 [0.816–0.830]), version 7 (0.903 [0.897–0.908]) and qXR version 2 (0.872 [0.866–0.878]), version 3 (0.906 [0.901–0.911]). Newer versions met WHO TPP values, older versions did not. All products equalled or surpassed the human radiologist performance with improvements in triage ability in newer versions. Humans and CAD performed worse in older age groups and among those with TB history. New versions of CAD outperform their predecessors. Prior to implementation CAD should be evaluated using local data because underlying neural networks can differ significantly. An independent rapid evaluation centre is necessitated to provide implementers with performance data on new versions of CAD products as they are developed.

          Author summary

          The World Health Organization recommended the use of artificial intelligence (AI)-powered computer-aided detection (CAD) for TB screening and triage in 2021. One year on, we comprehensively compare the performance of the newest versions of two CAD (CAD4TB and qXR) to their WHO-evaluated predecessors. We found that both newer versions significantly improved upon their predecessor’s ability to detect TB, performing better than the human readers. We also showed that the AI underlying new software versions can differ remarkably from the old and resemble an entirely new product altogether. We further demonstrate that, unlike laboratory diagnostic tools, CAD software updates could significantly impact the selection of appropriate threshold scores, the number of people with TB detected and cost-effectiveness. With newer CAD versions being rolled out almost annually, our results therefore underscore the need for rapid evidence generation to evaluate newer CAD versions in the fast-growing medical AI industry.

          Related collections

          Most cited references7

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems

          Deep learning (DL) neural networks have only recently been employed to interpret chest radiography (CXR) to screen and triage people for pulmonary tuberculosis (TB). No published studies have compared multiple DL systems and populations. We conducted a retrospective evaluation of three DL systems (CAD4TB, Lunit INSIGHT, and qXR) for detecting TB-associated abnormalities in chest radiographs from outpatients in Nepal and Cameroon. All 1196 individuals received a Xpert MTB/RIF assay and a CXR read by two groups of radiologists and the DL systems. Xpert was used as the reference standard. The area under the curve of the three systems was similar: Lunit (0.94, 95% CI: 0.93–0.96), qXR (0.94, 95% CI: 0.92–0.97) and CAD4TB (0.92, 95% CI: 0.90–0.95). When matching the sensitivity of the radiologists, the specificities of the DL systems were significantly higher except for one. Using DL systems to read CXRs could reduce the number of Xpert MTB/RIF tests needed by 66% while maintaining sensitivity at 95% or better. Using a universal cutoff score resulted different performance in each site, highlighting the need to select scores based on the population screened. These DL systems should be considered by TB programs where human resources are constrained, and automated technology is available.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found
            Is Open Access

            Tuberculosis detection from chest x-rays for triaging in a high tuberculosis-burden setting: an evaluation of five artificial intelligence algorithms

              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Computer aided detection of tuberculosis on chest radiographs: An evaluation of the CAD4TB v6 system

              There is a growing interest in the automated analysis of chest X-Ray (CXR) as a sensitive and inexpensive means of screening susceptible populations for pulmonary tuberculosis. In this work we evaluate the latest version of CAD4TB, a commercial software platform designed for this purpose. Version 6 of CAD4TB was released in 2018 and is here tested on a fully independent dataset of 5565 CXR images with GeneXpert (Xpert) sputum test results available (854 Xpert positive subjects). A subset of 500 subjects (50% Xpert positive) was reviewed and annotated by 5 expert observers independently to obtain a radiological reference standard. The latest version of CAD4TB is found to outperform all previous versions in terms of area under receiver operating curve (ROC) with respect to both Xpert and radiological reference standards. Improvements with respect to Xpert are most apparent at high sensitivity levels with a specificity of 76% obtained at a fixed 90% sensitivity. When compared with the radiological reference standard, CAD4TB v6 also outperformed previous versions by a considerable margin and achieved 98% specificity at the 90% sensitivity setting. No substantial difference was found between the performance of CAD4TB v6 and any of the various expert observers against the Xpert reference standard. A cost and efficiency analysis on this dataset demonstrates that in a standard clinical situation, operating at 90% sensitivity, users of CAD4TB v6 can process 132 subjects per day at an average cost per screen of $5.95 per subject, while users of version 3 process only 85 subjects per day at a cost of $8.38 per subject. At all tested operating points version 6 is shown to be more efficient and cost effective than any other version.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: Writing – original draftRole: Writing – review & editing
                Role: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: Funding acquisitionRole: Writing – review & editing
                Role: ConceptualizationRole: ValidationRole: Writing – review & editing
                Role: Data curationRole: Funding acquisitionRole: Writing – review & editing
                Role: Data curationRole: Formal analysisRole: MethodologyRole: Writing – review & editing
                Role: ConceptualizationRole: Funding acquisitionRole: InvestigationRole: Project administrationRole: SupervisionRole: Writing – review & editing
                Role: ConceptualizationRole: InvestigationRole: MethodologyRole: Project administrationRole: SupervisionRole: Writing – review & editing
                Role: Editor
                Journal
                PLOS Digit Health
                PLOS Digit Health
                plos
                PLOS Digital Health
                Public Library of Science (San Francisco, CA USA )
                2767-3170
                14 June 2022
                June 2022
                : 1
                : 6
                : e0000067
                Affiliations
                [1 ] Stop TB Partnership, Le Grand-Saconnex, Geneva, Switzerland
                [2 ] International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka, Bangladesh
                WHO: Organisation mondiale de la Sante, SWITZERLAND
                Author notes

                The authors have declared that no competing interests exist.

                ‡ These authors are joint senior authors on this work.

                Author information
                https://orcid.org/0000-0002-3956-5223
                https://orcid.org/0000-0001-5004-2573
                https://orcid.org/0000-0002-8833-4130
                https://orcid.org/0000-0002-6054-3571
                https://orcid.org/0000-0002-3765-5153
                Article
                PDIG-D-22-00062
                10.1371/journal.pdig.0000067
                9931298
                36812562
                74742fa8-731f-4157-8f4c-4f96cca1aeb1
                © 2022 Qin et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 6 March 2022
                : 15 May 2022
                Page count
                Figures: 0, Tables: 4, Pages: 11
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/501100008627, Global Affairs Canada;
                Award ID: STBP/TBREACH/GSA/W5-24
                This project was funded by Global Affairs Canada through the Stop TB Partnership’s TB REACH Initiative (grant number STBP/TBREACH/GSA/W5-24). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Computer and Information Sciences
                Software Engineering
                Computer Software
                Engineering and Technology
                Software Engineering
                Computer Software
                People and Places
                Population Groupings
                Professions
                Medical Personnel
                Radiologists
                Medicine and Health Sciences
                Medical Conditions
                Infectious Diseases
                Bacterial Diseases
                Tuberculosis
                Medicine and Health Sciences
                Medical Conditions
                Tropical Diseases
                Tuberculosis
                People and Places
                Population Groupings
                Age Groups
                Computer and Information Sciences
                Artificial Intelligence
                Medicine and Health Sciences
                Critical Care and Emergency Medicine
                Triage
                Medicine and Health Sciences
                Health Care
                Health Care Policy
                Screening Guidelines
                Computer and Information Sciences
                Software Engineering
                Software Tools
                Engineering and Technology
                Software Engineering
                Software Tools
                Custom metadata
                All numeric data and codes used in this manuscript are available here: https://github.com/ZZQin/MachineBGD/tree/master/2.0%20Version%20Comparison.

                Comments

                Comment on this article