Juan Carlos Martínez Sevilla presented his paper “A Holistic Approach for Aligned Music and Lyrics Transcription” at the International Conference on Document Analysis and Recognition (ICDAR) in San José, California, USA. Their paper introduces an approach to the Aligned Music Notation and Lyrics Transcription (AMNLT) challenge, which is to retrieve content from images of vocal music by developing data-driven approaches that transcribe music and text while providing proper alignment. Their holistic neural approaches achieve significant improvements over existing methods in transcription and alignment metrics.
Martinez Sevilla, Juan C. & Ríos Vila, Antonio & Castellanos, Francisco & Calvo-Zaragoza, Jorge. (2023). A Holistic Approach for Aligned Music and Lyrics Transcription.