Annotation-Free Automatic Music Transcription with Scalable Synthetic Data and Adversarial Domain Confusion

Abstract

Automatic Music Transcription (AMT) is a crucial technology in music information processing. Despite recent performance gains from machine learning approaches, existing methods achieve high accuracy mainly in domains with abundant annotated data, because annotations are difficult to create. A practical transcription model therefore needs an architecture that does not require annotated data. In this paper, we propose an annotation-free transcription model that combines pre-training on scalable synthetic audio with adversarial domain confusion on unannotated real audio. Through evaluation experiments, we confirm that the proposed method achieves higher accuracy under annotation-free conditions than training on a mixture of annotated real audio data. Additionally, through ablation studies, we gain insights into the scalability of this approach and the challenges that lie ahead in AMT research.
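
As an illustration of the general idea behind adversarial domain confusion, the following minimal sketch shows the two opposing objectives in their simplest form. It is not the paper's implementation: the function names, the binary real-versus-synthetic setup, and the choice of a uniform-target cross-entropy are all assumptions made for this example. A domain discriminator is trained to tell synthetic audio features from real ones, while the feature encoder is trained so that the discriminator's output approaches 0.5, meaning the two domains can no longer be told apart.

```python
import math

EPS = 1e-12  # guards against log(0); an implementation detail of this sketch


def discriminator_loss(p_real: float, is_real: bool) -> float:
    """Binary cross-entropy for a hypothetical domain discriminator.

    p_real is the discriminator's predicted probability that a feature
    came from real (unannotated) audio rather than synthetic audio.
    """
    target = 1.0 if is_real else 0.0
    return -(target * math.log(p_real + EPS)
             + (1.0 - target) * math.log(1.0 - p_real + EPS))


def confusion_loss(p_real: float) -> float:
    """Cross-entropy against a uniform 50/50 domain target.

    Used here as the encoder's objective: it is smallest when the
    discriminator is maximally unsure (p_real = 0.5).
    """
    return -0.5 * (math.log(p_real + EPS) + math.log(1.0 - p_real + EPS))


if __name__ == "__main__":
    # Print the encoder-side loss for a few discriminator outputs.
    for p in (0.1, 0.5, 0.9):
        print(p, confusion_loss(p))
```

In a full system these two losses would be minimized in alternation, alongside a supervised transcription loss computed on the synthetic audio only, since that is the only data with labels.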

Author and article information

Publication date (created): 16 December 2023

Article

arXiv ID: 2312.10402

SO-VID: 3620ce47-5c7c-4dca-b4dd-696507e88ed3

License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Custom metadata

Comments 6 pages, 1 figure

Categories cs.SD cs.AI eess.AS

ScienceOpen disciplines: Artificial intelligence, Electrical engineering, Graphics & Multimedia design
