TY - CONF T1 - Melody extraction by means of a source-filter model and pitch contour characterization (MIREX 2015) T2 - Music Information Retrieval Evaluation eXchange (MIREX) Y1 - 2015 A1 - Bosch, J. A1 - Gómez, E. JF - Music Information Retrieval Evaluation eXchange (MIREX) ER - TY - CONF T1 - Melovizz: A Web-based tool for Score-Informed Melody Extraction Visualization T2 - ISMIR (Late Breaking Demo) Y1 - 2015 A1 - Bosch, J. A1 - Mayor, O. A1 - Gómez, E. JF - ISMIR (Late Breaking Demo) ER - TY - CONF T1 - Bridging the Audio-Symbolic Gap: The Discovery of Repeated Note Content Directly from Polyphonic Music Audio T2 - 53rd AES Conference on Semantic Audio Y1 - 2014 A1 - Collins, Tom A1 - Sebastian Böck A1 - Krebs, Florian A1 - Widmer, Gerhard JF - 53rd AES Conference on Semantic Audio CY - London, UK ER - TY - CONF T1 - The Complete Classical Music Companion V0.9 T2 - 53rd AES Conference on Semantic Audio Y1 - 2014 A1 - Andreas Arzt A1 - Sebastian Böck A1 - Flossmann, Sebastian A1 - Frostel, Harald A1 - Gasser, Martin A1 - Widmer, Gerhard JF - 53rd AES Conference on Semantic Audio CY - London, UK ER - TY - CONF T1 - Exploiting Instrument-wise Playing/Non-Playing Labels for Score Synchronization of Symphonic Music T2 - Proceedings of the 15th International Society for Music Information Retrieval Conference Y1 - 2014 A1 - Alessio Bazzica A1 - Cynthia C. S. Liem A1 - Hanjalic, Alan AB - Score synchronization with an audio-visual recording of a symphonic music performance is usually done by solving an audio-to-MIDI alignment problem. In this paper we investigate what role visual channel can have in this process. In particular, we focus on the possibility to represent both the score and the performance by the information about what instrument is active at a given time stamp. More specifically, we investigate to what extent instrument-level 'play' (P) and 'non-play' (NP) labels are informative in the synchronization process. After introducing the P/NP-based representation of the music piece, both at the score and performance level, we define an efficient way of computing the distance between the two representations, which serves as input for the synchronization step based on dynamic time warping. In parallel with assessing the effectiveness of the proposed representation, we also study its robustness when missing and/or erroneous labels occur. Our experimental results show that P/NP-based music piece representation is informative for performance-to-score synchronization and may benefit the existing audio-only approaches. JF - Proceedings of the 15th International Society for Music Information Retrieval Conference CY - Taipei, Taiwan ER - TY - Generic T1 - Improved musical onset detection with convolutional neural networks T2 - Proceedings of the 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014) Y1 - 2014 A1 - Jan Schlüter A1 - Sebastian Böck JF - Proceedings of the 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014) ER - TY - Generic T1 - Melody extraction in symphonic classical music: a comparative study of mutual agreement between humans and algorithms T2 - 9th Conference on Interdisciplinary Musicology – CIM14 Y1 - 2014 A1 - Bosch, J. A1 - Gómez, E. JF - 9th Conference on Interdisciplinary Musicology – CIM14 CY - Berlin ER - TY - CONF T1 - Mobile Music Genius: Reggae at the Beach, Metal on a Friday Night? T2 - Proceedings of the 2014 ACM International Conference on Multimedia Retrieval (ICMR) Y1 - 2014 A1 - Markus Schedl A1 - Georg Breitschopf A1 - Bogdan Ionescu JF - Proceedings of the 2014 ACM International Conference on Multimedia Retrieval (ICMR) CY - Glasgow, UK ER - TY - CONF T1 - The Piano Music Companion T2 - Proceedings of the Conference on Prestigious Applications of Intelligent Systems (PAIS) Y1 - 2014 A1 - Andreas Arzt A1 - Sebastian Böck A1 - Flossmann, S. A1 - Frostel, H. A1 - Gasser, M. A1 - Cynthia C. S. Liem A1 - Widmer, G. JF - Proceedings of the Conference on Prestigious Applications of Intelligent Systems (PAIS) ER - TY - Generic T1 - Automatic Melodic and Structural Analysis of Music Material for Enriched Concert Related Experiences T2 - In Proc. of ACM Multimedia Y1 - 2013 A1 - J. Bosch JF - In Proc. of ACM Multimedia CY - Barcelona ER - TY - CONF T1 - Enhanced peak picking for onset detection with recurrent neural networks T2 - Proceedings of the 6th International Workshop on Machine Learning and Music Y1 - 2013 A1 - Sebastian Böck A1 - Schlüter, Jan A1 - Widmer, Gerhard KW - onset detection KW - peak-picking AB -

We present a new neural network based peak-picking algorithm for common onset detection functions. Compared to existing hand-crafted methods it yields a better performance and leads to a much lower number of false negative detections. The performance is evaluated on basis of a huge dataset with over 25k annotated onsets and shows a significant improvement over existing methods in cases of signals with previously unknown levels.

JF - Proceedings of the 6th International Workshop on Machine Learning and Music CY - Prague, Czech Republic ER - TY - CONF T1 - Innovating the Classical Music Experience in the PHENICX Project: Use Cases and Initial User Feedback T2 - 1st International Workshop on Interactive Content Consumption (WSICC) at EuroITV 2013 Y1 - 2013 A1 - Cynthia C. S. Liem A1 - Ron van der Sterren A1 - Marcel van Tilburg A1 - Álvaro Sarasúa A1 - Juan J. Bosch A1 - Jordi Janer A1 - Mark S. Melenhorst A1 - Emilia Gómez A1 - Alan Hanjalic KW - interactivity KW - multimedia information systems KW - multimodality KW - music information retrieval KW - performing arts KW - social networks KW - user studies AB -

The FP7 PHENICX project focuses on creating a new digital classical concert experience, improving the accessibility of classical music concert performances by enhancing and enriching them in novel digital ways, In this paper, we present the project’s foreseen use cases. Subsequently, we summarize initial use case feedback from two different user groups. Despite the early stage of the project, the feedback already gives important insight into real-world considerations to make for interactive music content consumption solutions.

JF - 1st International Workshop on Interactive Content Consumption (WSICC) at EuroITV 2013 CY - Como, Italy ER - TY - CONF T1 - Local Group Delay based Vibrato and Tremolo Suppression for Onset Detection T2 - Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR 2013) Y1 - 2013 A1 - Sebastian Böck A1 - Widmer, Gerhard KW - local group delay KW - onset detection KW - tremolo suppression KW - vibrato suppression AB -

We present SuperFlux - a new onset detection algorithm with vibrato suppression. It is an enhanced version of the universal spectral flux onset detection algorithm, and reduces the number of false positive detections considerably by tracking spectral trajectories with a maximum filter. Especially for music with heavy use of vibrato (e.g., sung operas or string performances), the number of false positive detections can be reduced by up to 60% without missing any additional events. Algorithm performance was evaluated and compared to state-of-the-art methods on the basis of three different datasets comprising mixed audio material (25,927 onsets), violin recordings (7,677 onsets) and operatic solo voice recordings (1,448 onsets). Due to its causal nature, the algorithm is applicable in both offline and online real-time scenarios.

JF - Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR 2013) CY - Curitiba, Brazil ER - TY - Generic T1 - Looking Beyond Sound: Unsupervised Analysis of Musician Videos T2 - 14th International Workshop on Image and Audio Analysis for Multimedia Interactive services (WIA2MIS) Y1 - 2013 A1 - Cynthia C. S. Liem A1 - Alessio Bazzica A1 - Alan Hanjalic AB -
In this work, we focus on visual information conveyed by performing musicians. While musicians are playing, their movement relates to their musical performance. As such, analysis of this information can support structural characterization and timeline indexing of a recorded performance, especially in cases when such analyses are not trivially computed from the musical audio. We propose an unsupervised visual analysis method, in which visual novelty is inferred from motion orientation histograms of regions of interest. Considering our method in a case study on audiovisually recorded jam sessions, we show that our analysis of the visual channel yields promising and meaningful performance-related information, including information complementary to the audio channel.
JF - 14th International Workshop on Image and Audio Analysis for Multimedia Interactive services (WIA2MIS) PB - IEEE CY - Paris, France ER - TY - CONF T1 - Maximum Filter Vibrato Suppression for Onset Detection T2 - Proceedings of the 16th International Conference on Digital Audio Effects (DAFx-13) Y1 - 2013 A1 - Sebastian Böck A1 - Widmer, Gerhard KW - maximum filter KW - onset detection KW - vibrato suppression AB -

 

We present SuperFlux - a new onset detection algorithm with vibrato suppression. It is an enhanced version of the universal spectral flux onset detection algorithm, and reduces the number of false positive detections considerably by tracking spectral trajectories with a maximum filter. Especially for music with heavy use of vibrato (e.g., sung operas or string performances), the number of false positive detections can be reduced by up to 60% without missing any additional events. Algorithm performance was evaluated and compared to state-of-the-art methods on the basis of three different datasets comprising mixed audio material (25,927 onsets), violin recordings (7,677 onsets) and operatic solo voice recordings (1,448 onsets). Due to its causal nature, the algorithm is applicable in both offline and online real-time scenarios.

JF - Proceedings of the 16th International Conference on Digital Audio Effects (DAFx-13) CY - Maynooth, Ireland ER - TY - CONF T1 - Musical Onset Detection with Convolutional Neural Networks T2 - Proceedings of the 6th International Workshop on Machine Learning and Music Y1 - 2013 A1 - Schlüter, Jan A1 - Sebastian Böck KW - convolutional neural networks KW - onset detection JF - Proceedings of the 6th International Workshop on Machine Learning and Music CY - Prague, Czech Republic ER - TY - Generic T1 - Rhytmic Pattern Modeling for Beat and Downbeat Tracking in Musical Audio T2 - Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR 2013) Y1 - 2013 A1 - Krebs, Florian A1 - Sebastian Böck A1 - Widmer, Gerhard AB -

Rhythmic patterns are an important structural element in music. This paper investigates the use of rhythmic pattern modeling to infer metrical structure in musical audio recordings. We present a Hidden Markov Model (HMM) based system that simultaneously extracts beats, downbeats, tempo, meter, and rhythmic patterns. Our model builds upon the basic structure proposed by Whiteley et. al, which we further modified by introducing a new observation model: rhythmic patterns are learned directly from data, which makes the model adaptable to the rhythmical structure of any kind of music. For learning rhythmic patterns and evaluating beat and downbeat tracking, 697 ballroom dance pieces were annotated with beat and measure information. The results showed that explicitly modeling rhythmic patterns of dance styles drastically reduces octave errors (detection of half or double tempo) and substantially improves downbeat tracking.

JF - Proceedings of the 14th International Society for Music Information Retrieval Conference (ISMIR 2013) ER -