5
views
0
recommends
+1 Recommend
1 collections
    0
    shares

      Submit your digital health research with an established publisher
      - celebrating 25 years of open access

      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Using a Natural Language Processing Approach to Support Rapid Knowledge Acquisition

      research-article

      Read this article at

      ScienceOpenPublisherPMC
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Implementing artificial intelligence to extract insights from large, real-world clinical data sets can supplement and enhance knowledge management efforts for health sciences research and clinical care. At Vanderbilt University Medical Center (VUMC), the in-house developed Word Cloud natural language processing system extracts coded concepts from patient records in VUMC’s electronic health record repository using the Unified Medical Language System terminology. Through this process, the Word Cloud extracts the most prominent concepts found in the clinical documentation of a specific patient or population. The Word Cloud provides added value for clinical care decision-making and research. This viewpoint paper describes a use case for how the VUMC Center for Knowledge Management leverages the condition-disease associations represented by the Word Cloud to aid in the knowledge generation needed to inform the interpretation of phenome-wide association studies.

          Related collections

          Most cited references29

          • Record: found
          • Abstract: found
          • Article: not found

          The Unified Medical Language System (UMLS): integrating biomedical terminology.

          The Unified Medical Language System (http://umlsks.nlm.nih.gov) is a repository of biomedical vocabularies developed by the US National Library of Medicine. The UMLS integrates over 2 million names for some 900,000 concepts from more than 60 families of biomedical vocabularies, as well as 12 million relations among these concepts. Vocabularies integrated in the UMLS Metathesaurus include the NCBI taxonomy, Gene Ontology, the Medical Subject Headings (MeSH), OMIM and the Digital Anatomist Symbolic Knowledge Base. UMLS concepts are not only inter-related, but may also be linked to external resources such as GenBank. In addition to data, the UMLS includes tools for customizing the Metathesaurus (MetamorphoSys), for generating lexical variants of concept names (lvg) and for extracting UMLS concepts from text (MetaMap). The UMLS knowledge sources are updated quarterly. All vocabularies are available at no fee for research purposes within an institution, but UMLS users are required to sign a license agreement. The UMLS knowledge sources are distributed on CD-ROM and by FTP.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations

            Motivation: Emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association scans (PheWAS) for disease–gene associations. We propose a novel method to scan phenomic data for genetic associations using International Classification of Disease (ICD9) billing codes, which are available in most EMR systems. We have developed a code translation table to automatically define 776 different disease populations and their controls using prevalent ICD9 codes derived from EMR data. As a proof of concept of this algorithm, we genotyped the first 6005 European–Americans accrued into BioVU, Vanderbilt's DNA biobank, at five single nucleotide polymorphisms (SNPs) with previously reported disease associations: atrial fibrillation, Crohn's disease, carotid artery stenosis, coronary artery disease, multiple sclerosis, systemic lupus erythematosus and rheumatoid arthritis. The PheWAS software generated cases and control populations across all ICD9 code groups for each of these five SNPs, and disease-SNP associations were analyzed. The primary outcome of this study was replication of seven previously known SNP–disease associations for these SNPs. Results: Four of seven known SNP–disease associations using the PheWAS algorithm were replicated with P-values between 2.8 × 10−6 and 0.011. The PheWAS algorithm also identified 19 previously unknown statistical associations between these SNPs and diseases at P < 0.01. This study indicates that PheWAS analysis is a feasible method to investigate SNP–disease associations. Further evaluation is needed to determine the validity of these associations and the appropriate statistical thresholds for clinical significance. Availability:The PheWAS software and code translation table are freely available at http://knowledgemap.mc.vanderbilt.edu/research. Contact: josh.denny@vanderbilt.edu
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Phenome-Wide Association Studies

                Bookmark

                Author and article information

                Contributors
                Journal
                JMIR Med Inform
                JMIR Med Inform
                JMI
                JMIR Medical Informatics
                JMIR Publications (Toronto, Canada )
                2291-9694
                2024
                30 January 2024
                : 12
                : e53516
                Affiliations
                [1 ] Center for Knowledge Management Vanderbilt University Medical Center Nashville, TN United States
                [2 ] Department of Biomedical Informatics Vanderbilt University School of Medicine Vanderbilt University Medical Center Nashville, TN United States
                Author notes
                Corresponding Author: Taneya Y Koonce taneya.koonce@ 123456vumc.org
                Author information
                https://orcid.org/0000-0002-4014-467X
                https://orcid.org/0000-0002-2677-6734
                https://orcid.org/0000-0002-2526-3857
                https://orcid.org/0000-0003-0356-9481
                https://orcid.org/0000-0002-3081-6487
                https://orcid.org/0000-0001-6699-6806
                https://orcid.org/0000-0002-7644-9803
                Article
                v12i1e53516
                10.2196/53516
                10865202
                38289670
                d0996c48-75ff-4701-905a-c1e915509272
                ©Taneya Y Koonce, Dario A Giuse, Annette M Williams, Mallory N Blasingame, Poppy A Krump, Jing Su, Nunzia B Giuse. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 30.01.2024.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.

                History
                : 10 October 2023
                : 8 December 2023
                : 15 December 2023
                : 4 January 2024
                Categories
                Viewpoint
                Viewpoint

                natural language processing,electronic health records,machine learning,data mining,knowledge management,nlp

                Comments

                Comment on this article