Big data in basic and translational cancer research

Jiang, Peng; Sinha, Sanju; Aldape, Kenneth; Hannenhalli, Sridhar; Sahinalp, Cenk S.; Ruppin, Eytan

doi:10.1038/s41568-022-00502-0

ScienceOpen: research and publishing network

For Publishers

For Researchers

Blog
About

Search
Advanced search

views

recommends

Record: found
Abstract: found
Article: not found

Big data in basic and translational cancer research

review-article

Author(s): Peng Jiang ¹ ^, , Sanju Sinha ¹ , Kenneth Aldape ² , Sridhar Hannenhalli ¹ , Cenk Sahinalp ¹ , Eytan Ruppin ¹ ^,

Publication date (Electronic): 5 September 2022

Journal: Nature Reviews. Cancer

Publisher: Nature Publishing Group UK

Keywords: Computational biology and bioinformatics, Cancer genomics, Cancer epigenetics, Cancer therapy

Read this article at

ScienceOpenPublisher PMC

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Historically, the primary focus of cancer research has been molecular and clinical studies of a few essential pathways and genes. Recent years have seen the rapid accumulation of large-scale cancer omics data catalysed by breakthroughs in high-throughput technologies. This fast data growth has given rise to an evolving concept of ‘big data’ in cancer, whose analysis demands large computational resources and can potentially bring novel insights into essential questions. Indeed, the combination of big data, bioinformatics and artificial intelligence has led to notable advances in our basic understanding of cancer biology and to translational advancements. Further advances will require a concerted effort among data scientists, clinicians, biologists and policymakers. Here, we review the current state of the art and future challenges for harnessing big data to advance cancer research and treatment.

Abstract

The increasing size of cancer datasets requires new ways of thinking for analysing and integrating these data. In this Review, Jiang et al. discuss considerations and strategies for wielding ‘big data’ ― large, information-rich datasets ― in basic research and for translational applications such as identifying biomarkers, informing clinical trials and developing new assays and treatments.

Related collections

Most cited references 181

Record: found
Abstract: found
Article: found

Hallmarks of Cancer: The Next Generation

Douglas Hanahan, Robert A. Weinberg (2011)

The hallmarks of cancer comprise six biological capabilities acquired during the multistep development of human tumors. The hallmarks constitute an organizing principle for rationalizing the complexities of neoplastic disease. They include sustaining proliferative signaling, evading growth suppressors, resisting cell death, enabling replicative immortality, inducing angiogenesis, and activating invasion and metastasis. Underlying these hallmarks are genome instability, which generates the genetic diversity that expedites their acquisition, and inflammation, which fosters multiple hallmark functions. Conceptual progress in the last decade has added two emerging hallmarks of potential generality to this list-reprogramming of energy metabolism and evading immune destruction. In addition to cancer cells, tumors exhibit another dimension of complexity: they contain a repertoire of recruited, ostensibly normal cells that contribute to the acquisition of hallmark traits by creating the "tumor microenvironment." Recognition of the widespread applicability of these concepts will increasingly affect the development of new means to treat human cancer. Copyright © 2011 Elsevier Inc. All rights reserved.

0 comments Cited 12345 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Comprehensive Integration of Single-Cell Data

Tim Stuart, Andrew Butler, Paul Hoffman … (2019)

Single-cell transcriptomics has transformed our ability to characterize cell states, but deep biological understanding requires more than a taxonomic listing of clusters. As new methods arise to measure distinct cellular modalities, a key analytical challenge is to integrate these datasets to better understand cellular identity and function. Here, we develop a strategy to "anchor" diverse datasets together, enabling us to integrate single-cell measurements not only across scRNA-seq technologies, but also across different modalities. After demonstrating improvement over existing methods for integrating scRNA-seq data, we anchor scRNA-seq experiments with scATAC-seq to explore chromatin differences in closely related interneuron subsets and project protein expression measurements onto a bone marrow atlas to characterize lymphocyte populations. Lastly, we harmonize in situ gene expression and scRNA-seq datasets, allowing transcriptome-wide imputation of spatial gene expression patterns. Our work presents a strategy for the assembly of harmonized references and transfer of information across datasets.

0 comments Cited 5787 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Robust enumeration of cell subsets from tissue expression profiles

Aaron Newman, Chih Long Liu, Michael Green … (2015)

We introduce CIBERSORT, a method for characterizing cell composition of complex tissues from their gene expression profiles. When applied to enumeration of hematopoietic subsets in RNA mixtures from fresh, frozen, and fixed tissues, including solid tumors, CIBERSORT outperformed other methods with respect to noise, unknown mixture content, and closely related cell types. CIBERSORT should enable large-scale analysis of RNA mixtures for cellular biomarkers and therapeutic targets (http://cibersort.stanford.edu).

0 comments Cited 5327 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Peng Jiang:

ORCID: http://orcid.org/0000-0002-7828-5486

peng.jiang@nih.gov

Eytan Ruppin:

ORCID: http://orcid.org/0000-0002-7862-3940

eytan.ruppin@nih.gov

Journal

Journal ID (nlm-ta): Nat Rev Cancer

Journal ID (iso-abbrev): Nat Rev Cancer

Title: Nature Reviews. Cancer

Publisher: Nature Publishing Group UK (London )

ISSN (Print): 1474-175X

ISSN (Electronic): 1474-1768

Publication date (Electronic): 5 September 2022

Pages: 1-15

Affiliations

[1 ]GRID grid.94365.3d, ISNI 0000 0001 2297 5165, Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, , National Institutes of Health, ; Bethesda, MD USA

[2 ]GRID grid.94365.3d, ISNI 0000 0001 2297 5165, Laboratory of Pathology, Center for Cancer Research, National Cancer Institute, , National Institutes of Health, ; Bethesda, MD USA

Author information

Peng Jiang http://orcid.org/0000-0002-7828-5486

Cenk Sahinalp http://orcid.org/0000-0002-2170-2808

Eytan Ruppin http://orcid.org/0000-0002-7862-3940

Article

Publisher ID: 502

DOI: 10.1038/s41568-022-00502-0

PMC ID: 9443637

PubMed ID: 36064595

SO-VID: 74a45863-66db-4e9b-930d-988f2867c2f8

Copyright © © This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

License:

This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.

Big data in basic and translational cancer research

Read this article at

Abstract

Abstract

Related collections

Cancer Metabolism

Most cited references 181

Hallmarks of Cancer: The Next Generation

Comprehensive Integration of Single-Cell Data

Robust enumeration of cell subsets from tissue expression profiles

Author and article information

Contributors

Journal

Affiliations

Author information

Article

History

Categories

Comments

Comment on this article

Similar content 793

Cited by 49

Most referenced authors 9,986