7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A novel scan statistics approach for clustering identification and comparison in binary genomic data

      research-article
      1 , 2 , , 1
      BMC Bioinformatics
      BioMed Central
      11th International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB 2014) (CIBB 2014)
      26-28 June 2014
      Scan statistics, Viral integration sites, Cluster identification, Binary genomic data

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          In biomedical research a relevant issue is to identify time intervals or portions of a n-dimensional support where a particular event of interest is more likely to occur than expected. Algorithms that require to specify a-priori number/dimension/length of clusters assumed for the data suffer from a high degree of arbitrariness whenever no precise information are available, and this may strongly affect final estimation on parameters. Within this framework, spatial scan-statistics have been proposed in the literature, representing a valid non-parametric alternative.

          Results

          We adapt the so called Bernoulli-model scan statistic to the genomic field and we propose a multivariate extension, named Relative Scan Statistics, for the comparison of two series of Bernoulli r.v. defined over a common support, with the final goal of highlighting unshared event rate variations. Using a probabilistic approach based on success probability estimates and comparison (likelihood based), we can exploit an hypothesis testing procedure to identify clusters and relative clusters. Both the univariate and the novel multivariate extension of the scan statistic confirm previously published findings.

          Conclusion

          The method described in the paper represents a challenging application of scan statistics framework to problem related to genomic data. From a biological perspective, these tools offer the possibility to clinicians and researcher to improve their knowledge on viral vectors integrations process, allowing to focus their attention to restricted over-targeted portion of the genome.

          Electronic supplementary material

          The online version of this article (doi:10.1186/s12859-016-1173-8) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references17

          • Record: found
          • Abstract: found
          • Article: not found

          Evaluating cluster alarms: a space-time scan statistic and brain cancer in Los Alamos, New Mexico.

          This article presents a space-time scan statistic, useful for evaluating space-time cluster alarms, and illustrates the method on a recent brain cancer cluster alarms in Los Alamos, NM. The space-time scan statistic accounts for the preselection bias and multiple testing inherent in a cluster alarm. Confounders and time trends can be adjusted for. The observed excess of brain cancer in Los Alamos was not statistically significant. The space-time scan statistic is useful as a screening tool for evaluating which cluster alarms merit further investigation and which clusters are probably chance occurrences.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Genome-wide analysis of retroviral DNA integration.

            Retroviral vectors are often used to introduce therapeutic sequences into patients' cells. In recent years, gene therapy with retroviral vectors has had impressive therapeutic successes, but has also resulted in three cases of leukaemia caused by insertional mutagenesis, which has focused attention on the molecular determinants of retroviral-integration target-site selection. Here, we review retroviral DNA integration, with emphasis on recent genome-wide studies of targeting and on the status of efforts to modulate target-site selection.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Hot spots of retroviral integration in human CD34+ hematopoietic cells.

              Insertional oncogenesis is a possible consequence of the integration of gamma-retroviral (RV) or lentiviral (LV) vectors into the human genome. RV common insertion sites (CISs) have been identified in hematopoietic malignancies and in the nonmalignant progeny of transduced hematopoietic stem/progenitor cells (HSCs), possibly as a consequence of clonal selection in vivo. We have mapped a large number of RV and LV integrations in human CD34(+) HSCs, transduced in vitro and analyzed without selection. Recurrent insertion sites (hot spots) account for more than 21% of the RV integration events, while they are significantly less frequent in the case of LV vectors. RV but not LV hot spots are highly enriched in proto-oncogenes, cancer-associated CISs, and growth-controlling genes, indicating that at least part of the biases observed in the HSC progeny in vivo are characteristics of RV integration, already present in nontransplanted cells. Genes involved in hematopoietic and immune system development are targeted at high frequency and enriched in hot spots, suggesting that the CD34(+) gene expression program is instrumental in directing RV integration. The lower propensity of LV vectors for integrating in potentially dangerous regions of the human genome may be a factor determining a better safety profile for gene therapy applications.
                Bookmark

                Author and article information

                Contributors
                pellin.danilo@hsr.it
                diserio.clelia@hsr.it
                Conference
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central (London )
                1471-2105
                22 September 2016
                22 September 2016
                2016
                : 17
                Issue : Suppl 11 Issue sponsor : Publication of this supplement has not been supported by sponsorship. Information about the source of funding for publication charges can be found in the individual articles. The articles have undergone the journal's standard peer review process for supplements. The Supplement Editors declare that they were not involved in the peer review process for any articles that they are an author of and that they have no other competing interests.
                : 320
                Affiliations
                [1 ]University Center of Statistics for the Biomedical Sciences, Vita-Salute San Raffaele University, Via Olgettina 58, Milan, 20132 Italy
                [2 ]Johann Bernoulli Institute, University of Groningen, Nijenborgh 9, Groningen, 9747 AG Netherlands
                Article
                1173
                10.1186/s12859-016-1173-8
                5046198
                135dd065-2391-48db-9d1b-47942a2e263f
                © The Author(s) 2016

                Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                11th International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB 2014)
                CIBB 2014
                Cambridge, UK
                26-28 June 2014
                History
                Categories
                Research
                Custom metadata
                © The Author(s) 2016

                Bioinformatics & Computational biology
                scan statistics,viral integration sites,cluster identification,binary genomic data

                Comments

                Comment on this article