16
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Lyrics-to-Audio Alignment by Unsupervised Discovery of Repetitive Patterns in Vowel Acoustics

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Most of the previous approaches to lyrics-to-audio alignment used a pre-developed automatic speech recognition (ASR) system that innately suffered from several difficulties to adapt the speech model to individual singers. A significant aspect missing in previous works is the self-learnability of repetitive vowel patterns in the singing voice, where the vowel part used is more consistent than the consonant part. Based on this, our system first learns a discriminative subspace of vowel sequences, based on weighted symmetric non-negative matrix factorization (WS-NMF), by taking the self-similarity of a standard acoustic feature as an input. Then, we make use of canonical time warping (CTW), derived from a recent computer vision technique, to find an optimal spatiotemporal transformation between the text and the acoustic sequences. Experiments with Korean and English data sets showed that deploying this method after a pre-developed, unsupervised, singing source separation achieved more promising results than other state-of-the-art unsupervised approaches and an existing ASR-based system.

          Related collections

          Most cited references11

          • Record: found
          • Abstract: not found
          • Article: not found

          Image Alignment and Stitching: A Tutorial

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Kernel k-means

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Scaling and time warping in time series querying

                Bookmark

                Author and article information

                Journal
                2017-01-21
                Article
                1701.06078
                5cbe23d5-3ec9-4926-9b34-1791adf6a525

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                13 pages, Under review as a journal paper at IEEE/ACM Transactions on Audio, Speech, and Language Processing
                cs.SD cs.LG

                Artificial intelligence,Graphics & Multimedia design
                Artificial intelligence, Graphics & Multimedia design

                Comments

                Comment on this article