4
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Dilated Convolution with Dilated GRU for Music Source Separation

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Stacked dilated convolutions used in Wavenet have been shown effective for generating high-quality audios. By replacing pooling/striding with dilation in convolution layers, they can preserve high-resolution information and still reach distant locations. Producing high-resolution predictions is also crucial in music source separation, whose goal is to separate different sound sources while maintaining the quality of the separated sounds. Therefore, this paper investigates using stacked dilated convolutions as the backbone for music source separation. However, while stacked dilated convolutions can reach wider context than standard convolutions, their effective receptive fields are still fixed and may not be wide enough for complex music audio signals. To reach information at remote locations, we propose to combine dilated convolution with a modified version of gated recurrent units (GRU) called the `Dilated GRU' to form a block. A Dilated GRU unit receives information from k steps before instead of the previous step for a fixed k. This modification allows a GRU unit to reach a location with fewer recurrent steps and run faster because it can execute partially in parallel. We show that the proposed model with a stack of such blocks performs equally well or better than the state-of-the-art models for separating vocals and accompaniments.

          Related collections

          Most cited references5

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          On the Properties of Neural Machine Translation: Encoder–Decoder Approaches

            Bookmark
            • Record: found
            • Abstract: not found
            • Book Chapter: not found

            The 2016 Signal Separation Evaluation Campaign

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Improving music source separation based on deep neural networks through data augmentation and network blending

                Bookmark

                Author and article information

                Journal
                04 June 2019
                Article
                1906.01203
                1c144702-b664-432e-abbe-fda9cb76009d

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                cs.SD cs.LG eess.AS

                Artificial intelligence,Electrical engineering,Graphics & Multimedia design

                Comments

                Comment on this article