86
views
0
recommends
+1 Recommend
0 collections
    8
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Identification of candidate regulatory SNPs by combination of transcription-factor-binding site prediction, SNP genotyping and haploChIP

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Disease-associated SNPs detected in large-scale association studies are frequently located in non-coding genomic regions, suggesting that they may be involved in transcriptional regulation. Here we describe a new strategy for detecting regulatory SNPs (rSNPs), by combining computational and experimental approaches. Whole genome ChIP-chip data for USF1 was analyzed using a novel motif finding algorithm called BCRANK. 1754 binding sites were identified and 140 candidate rSNPs were found in the predicted sites. For validating their regulatory function, seven SNPs found to be heterozygous in at least one of four human cell samples were investigated by ChIP and sequence analysis (haploChIP). In four of five cases where the SNP was predicted to affect binding, USF1 was preferentially bound to the allele containing the consensus motif. Allelic differences in binding for other proteins and histone marks further reinforced the SNPs regulatory potential. Moreover, for one of these SNPs, H3K36me3 and POLR2A levels at neighboring heterozygous SNPs indicated effects on transcription. Our strategy, which is entirely based on in vivo data for both the prediction and validation steps, can identify individual binding sites at base pair resolution and predict rSNPs. Overall, this approach can help to pinpoint the causative SNPs in complex disorders where the associated haplotypes are located in regulatory regions. Availability: BCRANK is available from Bioconductor ( http://www.bioconductor.org/).

          Related collections

          Most cited references32

          • Record: found
          • Abstract: found
          • Article: not found

          TRANSFAC: transcriptional regulation, from patterns to profiles.

          The TRANSFAC database on eukaryotic transcriptional regulation, comprising data on transcription factors, their target genes and regulatory binding sites, has been extended and further developed, both in number of entries and in the scope and structure of the collected data. Structured fields for expression patterns have been introduced for transcription factors from human and mouse, using the CYTOMER database on anatomical structures and developmental stages. The functionality of Match, a tool for matrix-based search of transcription factor binding sites, has been enhanced. For instance, the program now comes along with a number of tissue-(or state-)specific profiles and new profiles can be created and modified with Match Profiler. The GENE table was extended and gained in importance, containing amongst others links to LocusLink, RefSeq and OMIM now. Further, (direct) links between factor and target gene on one hand and between gene and encoded factor on the other hand were introduced. The TRANSFAC public release is available at http://www.gene-regulation.com. For yeast an additional release including the latest data was made available separately as TRANSFAC Saccharomyces Module (TSM) at http://transfac.gbf.de. For CYTOMER free download versions are available at http://www.biobase.de:8080/index.html.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            ChIP-seq accurately predicts tissue-specific activity of enhancers.

            A major yet unresolved quest in decoding the human genome is the identification of the regulatory sequences that control the spatial and temporal expression of genes. Distant-acting transcriptional enhancers are particularly challenging to uncover because they are scattered among the vast non-coding portion of the genome. Evolutionary sequence constraint can facilitate the discovery of enhancers, but fails to predict when and where they are active in vivo. Here we present the results of chromatin immunoprecipitation with the enhancer-associated protein p300 followed by massively parallel sequencing, and map several thousand in vivo binding sites of p300 in mouse embryonic forebrain, midbrain and limb tissue. We tested 86 of these sequences in a transgenic mouse assay, which in nearly all cases demonstrated reproducible enhancer activity in the tissues that were predicted by p300 binding. Our results indicate that in vivo mapping of p300 binding is a highly accurate means for identifying enhancers and their associated activities, and suggest that such data sets will be useful to study the role of tissue-specific enhancers in human biology and disease on a genome-wide scale.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              JASPAR: an open-access database for eukaryotic transcription factor binding profiles.

              The analysis of regulatory regions in genome sequences is strongly based on the detection of potential transcription factor binding sites. The preferred models for representation of transcription factor binding specificity have been termed position-specific scoring matrices. JASPAR is an open-access database of annotated, high-quality, matrix-based transcription factor binding site profiles for multicellular eukaryotes. The profiles were derived exclusively from sets of nucleotide sequences experimentally demonstrated to bind transcription factors. The database is complemented by a web interface for browsing, searching and subset selection, an online sequence analysis utility and a suite of programming tools for genome-wide and comparative genomic analysis of regulatory regions. JASPAR is available at http://jaspar. cgb.ki.se.
                Bookmark

                Author and article information

                Journal
                Nucleic Acids Res
                Nucleic Acids Res
                nar
                nar
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                July 2009
                18 May 2009
                18 May 2009
                : 37
                : 12
                : e85
                Affiliations
                1The Linnaeus Centre for Bioinformatics, 2Department of Genetics and Pathology, Rudbeck Laboratory, Uppsala University, Sweden and 3Interdisciplinary Centre for Mathematical and Computer Modelling, Warsaw University, Poland
                Author notes
                *To whom correspondence should be addressed. Tel: +0046739246433; Fax: +0046184716698; Email: alvaro.rada@ 123456lcb.uu.se
                Correspondence may also be addressed to Claes Wadelius. Tel: +0046184714076; Fax: +0046184714808; Email: claes.wadelius@ 123456genpat.uu.se

                Present address: Alvaro Rada-Iglesias, The Linnaeus Centre for Bioinformatics, Uppsala University, Sweden.

                The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.

                Article
                gkp381
                10.1093/nar/gkp381
                2709586
                19451166
                86c9cbc5-1636-4946-8a50-48cb8e118cc6
                © 2009 The Author(s)

                This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 4 December 2008
                : 24 April 2009
                : 27 April 2009
                Categories
                Methods Online

                Genetics
                Genetics

                Comments

                Comment on this article