An integrative machine learning framework for classifying SEER breast cancer

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Breast cancer is the commonest type of cancer in women worldwide and the leading cause of mortality for females. The aim of this research is to classify the alive and death status of breast cancer patients using the Surveillance, Epidemiology, and End Results dataset. Due to its capacity to handle enormous data sets systematically, machine learning and deep learning has been widely employed in biomedical research to answer diverse classification difficulties. Pre-processing the data enables its visualization and analysis for use in making important decisions. This research presents a feasible machine learning-based approach for categorizing SEER breast cancer dataset. Moreover, a two-step feature selection method based on Variance Threshold and Principal Component Analysis was employed to select the features from the SEER breast cancer dataset. After selecting the features, the classification of the breast cancer dataset is carried out using Supervised and Ensemble learning techniques such as Ada Boosting, XG Boosting, Gradient Boosting, Naive Bayes and Decision Tree. Utilizing the train-test split and k-fold cross-validation approaches, the performance of various machine learning algorithms is examined. The accuracy of Decision Tree for both train-test split and cross validation achieved as 98%. In this study, it is observed that the Decision Tree algorithm outperforms other supervised and ensemble learning approaches for the SEER Breast Cancer dataset.

Related collections

Most cited references 26

Record: found
Abstract: found
Article: found

Is Open Access

Current and future burden of breast cancer: Global statistics for 2020 and 2040

Melina Arnold, Eileen Morgan, Harriet Rumgay … (2022)

Background Breast cancer is the most commonly diagnosed cancer worldwide, and its burden has been rising over the past decades. In this article, we examine and describe the global burden of breast cancer in 2020 and predictions for the year 2040. Methods Estimates of new female breast cancer cases and deaths in 2020 were abstracted from the GLOBOCAN database. Age-standardized incidence and mortality rates were calculated per 100,000 females by country, world region, and level of human development. Predicted cases and deaths were computed based on global demographic projections for the year 2040. Results Over 2.3 million new cases and 685,000 deaths from breast cancer occurred in 2020. Large geographic variation across countries and world regions exists, with incidence rates ranging from <40 per 100,000 females in some Asian and African countries, to over 80 per 100,000 in Australia/New Zealand, Northern America, and parts of Europe. Smaller geographical variation was observed for mortality; however, transitioning countries continue to carry a disproportionate share of breast cancer deaths relative to transitioned countries. By 2040, the burden from breast cancer is predicted to increase to over 3 million new cases and 1 million deaths every year because of population growth and ageing alone. Conclusion Breast cancer is the most common cancer worldwide and continues to have a large impact on the global number of cancer deaths. Global efforts are needed to counteract its growing burden, especially in transitioning countries where incidence is rising rapidly, and mortality rates remain high. • With over 2.3 million new cases and 685,000 deaths in 2020, breast cancer is the most commonly diagnosed cancer worldwide. • Most cases occur in transitioned countries yet transitioning countries have disproportionate share of breast cancer deaths. • The future burden of breast cancer is predicted to increase to over 3 million new cases and 1 million deaths in 2040.

0 comments Cited 602 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

Artificial intelligence in cancer imaging: Clinical challenges and applications

Wenya Linda Bi, Ahmed Hosny, Matthew Schabath … (2019)

Abstract Judgement, as one of the core tenets of medicine, relies upon the integration of multilayered data with nuanced decision making. Cancer offers a unique context for medical decisions given not only its variegated forms with evolution of disease but also the need to take into account the individual condition of patients, their ability to receive treatment, and their responses to treatment. Challenges remain in the accurate detection, characterization, and monitoring of cancers despite improved technologies. Radiographic assessment of disease most commonly relies upon visual evaluations, the interpretations of which may be augmented by advanced computational analyses. In particular, artificial intelligence (AI) promises to make great strides in the qualitative interpretation of cancer imaging by expert clinicians, including volumetric delineation of tumors over time, extrapolation of the tumor genotype and biological course from its radiographic phenotype, prediction of clinical outcome, and assessment of the impact of disease and treatment on adjacent organs. AI may automate processes in the initial interpretation of images and shift the clinical workflow of radiographic detection, management decisions on whether or not to administer an intervention, and subsequent observation to a yet to be envisioned paradigm. Here, the authors review the current state of AI as applied to medical imaging of cancer and describe advances in 4 tumor types (lung, brain, breast, and prostate) to illustrate how common clinical problems are being addressed. Although most studies evaluating AI applications in oncology to date have not been vigorously validated for reproducibility and generalizability, the results do highlight increasingly concerted efforts in pushing AI technology to clinical use and to impact future directions in cancer care.

0 comments Cited 525 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Automated diagnosis of breast ultrasonography images using deep neural networks

Xiaofeng Qi, Lei Zhang, Yao Chen … (2019)

0 comments Cited 71 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

P. Manikandan: manimkn89@gmail.com

C. Ponnuraja: cponnuraja@gmail.com

Journal

Journal ID (nlm-ta): Sci Rep

Journal ID (iso-abbrev): Sci Rep

Title: Scientific Reports

Publisher: Nature Publishing Group UK (London )

ISSN (Electronic): 2045-2322

Publication date (Electronic): 1 April 2023

Publication date PMC-release: 1 April 2023

Publication date Collection: 2023

Volume: 13

Electronic Location Identifier: 5362

Affiliations

[1 ]GRID grid.413015.2, ISNI 0000 0004 0505 215X, Department of Data Science, , Loyola College, ; Chennai, 600 034 India

[2 ]GRID grid.417330.2, ISNI 0000 0004 1767 6138, ICMR-National Institute for Research in Tuberculosis, ; Chennai, 600 031 India

Article

Publisher ID: 32029

DOI: 10.1038/s41598-023-32029-1

PMC ID: 10067827

SO-VID: a8c8ee95-ba51-4e9f-a246-8cc83c0d5250

License:

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

History

Date received : 23 January 2023

Date accepted : 21 March 2023

Custom metadata

ScienceOpen disciplines: Uncategorized

Keywords: breast cancer,machine learning

Data availability:

ScienceOpen disciplines: Uncategorized

Keywords: breast cancer, machine learning

Comments

Comment on this article

scite_

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

Cited by 2

See all cited by

Most referenced authors 235

See all reference authors

An integrative machine learning framework for classifying SEER breast cancer

Read this article at

Abstract

Related collections

Annual Reviews AI, Machine Learning, and Society

Most cited references 26

Current and future burden of breast cancer: Global statistics for 2020 and 2040

Artificial intelligence in cancer imaging: Clinical challenges and applications

Automated diagnosis of breast ultrasonography images using deep neural networks

Author and article information

Contributors

Journal

Affiliations

Article

History

Categories

Custom metadata

Comments

Comment on this article

Similar content 300

Cited by 2

Most referenced authors 235