      HTSeq--a Python framework to work with high-throughput sequencing data

      Bioinformatics
      Oxford University Press (OUP)

          Abstract

          Motivation: A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from standard workflows, custom scripts are needed.

          Results: We present HTSeq, a Python library to facilitate the rapid development of such scripts. HTSeq offers parsers for many common data formats in HTS projects, as well as classes to represent data, such as genomic coordinates, sequences, sequencing reads, alignments, gene model information and variant calls, and provides data structures that allow for querying via genomic coordinates. We also present htseq-count, a tool developed with HTSeq that preprocesses RNA-Seq data for differential expression analysis by counting the overlap of reads with genes.

          Availability and implementation: HTSeq is released as an open-source software under the GNU General Public Licence and available from http://www-huber.embl.de/HTSeq or from the Python Package Index at https://pypi.python.org/pypi/HTSeq.

          Contact: sanders@fs.tum.de
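
          To make the idea of "counting the overlap of reads with genes" concrete, here is a minimal standard-library sketch. It does not use HTSeq and is not the htseq-count implementation: the `Interval` class, the `count_reads_per_gene` function, the `no_feature` and `ambiguous` labels, and the rule that a read is counted only when it overlaps exactly one gene are all assumptions made for illustration, as are the toy coordinates.

```python
from collections import Counter
from typing import NamedTuple


class Interval(NamedTuple):
    """Hypothetical half-open genomic interval [start, end) on one chromosome."""
    chrom: str
    start: int
    end: int


def overlaps(a: Interval, b: Interval) -> bool:
    """True if the two intervals share at least one position."""
    return a.chrom == b.chrom and a.start < b.end and b.start < a.end


def count_reads_per_gene(genes, reads):
    """Count reads per gene under an assumed 'exactly one gene' rule.

    genes: dict mapping gene name -> Interval
    reads: iterable of Interval (aligned read positions)
    """
    counts = Counter({name: 0 for name in genes})
    for read in reads:
        # Linear scan over genes; a real tool would use an indexed structure.
        hits = {name for name, iv in genes.items() if overlaps(read, iv)}
        if len(hits) == 1:
            counts[hits.pop()] += 1
        elif not hits:
            counts["no_feature"] += 1   # illustrative label
        else:
            counts["ambiguous"] += 1    # illustrative label
    return counts


# Toy data, invented for this example.
genes = {
    "geneA": Interval("chr1", 100, 200),
    "geneB": Interval("chr1", 180, 300),
}
reads = [
    Interval("chr1", 110, 160),  # overlaps geneA only
    Interval("chr1", 190, 240),  # overlaps geneA and geneB
    Interval("chr1", 250, 290),  # overlaps geneB only
    Interval("chr2", 10, 60),    # overlaps no gene
]
counts = count_reads_per_gene(genes, reads)
for name in sorted(counts):
    print(name, counts[name])
```

          The linear scan keeps the sketch short. The abstract's "data structures that allow for querying via genomic coordinates" address exactly this lookup step, which becomes the bottleneck with real annotation and read volumes.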

                Author and article information

                Journal
                Bioinformatics
                Oxford University Press (OUP)
                ISSN: 1367-4803, 1460-2059
                January 08 2015
                January 15 2015
                September 25 2014
                January 15 2015
                Volume: 31
                Issue: 2
                Pages: 166-169
                Article
                DOI: 10.1093/bioinformatics/btu638
                © 2015