High-Resolution Sustain Pedal Depth Estimation from Piano Audio across Room Acoustics
Fang, Kun ; Zhang, Hanwen ; Wang, Ziyu ; Fujinaga, Ichiro
Department
Machine Learning
Type
Conference proceeding
Date
2025
Language
English
Abstract
Piano sustain pedal detection has previously been treated as a binary on/off classification task, which limits its use in real-world performance scenarios where pedal depth significantly influences musical expression. This paper presents a high-resolution approach that predicts continuous pedal depth values. We introduce a Transformer-based architecture that matches state-of-the-art performance on the traditional binary classification task and also achieves high accuracy in continuous pedal depth estimation. Because it estimates continuous values, our model provides musically meaningful predictions of sustain pedal usage, whereas the baseline models, restricted to binary detection, struggle to capture such nuanced expression. This paper also investigates the influence of room acoustics on sustain pedal estimation, using a synthetic dataset that covers varied acoustic conditions. We train our model on different combinations of room settings and test it in an unseen environment using a “leave-one-out” approach. Our findings show that neither the two baseline models nor ours is robust to unseen room conditions. Statistical analysis further confirms that reverberation influences model predictions and introduces an over-estimation bias.
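The “leave-one-out” protocol and the relation between continuous depth and the binary task can be illustrated with a minimal sketch. This is not the authors' code: the room labels, the function names, and the 0.5 threshold are all hypothetical and chosen only for illustration, and the abstract does not specify how depth values are mapped to on/off labels.

```python
from typing import Iterator, List, Tuple


def leave_one_room_out(rooms: List[str]) -> Iterator[Tuple[List[str], str]]:
    """Yield (training_rooms, held_out_room) pairs, one pair per room."""
    for held_out in rooms:
        # Train on every room except the one reserved for testing.
        train = [room for room in rooms if room != held_out]
        yield train, held_out


def depth_to_binary(depths: List[float], threshold: float = 0.5) -> List[int]:
    """Collapse continuous pedal depths in [0, 1] to on/off labels.

    The threshold is an assumption for illustration, not a value from the paper.
    """
    return [1 if depth >= threshold else 0 for depth in depths]


# Hypothetical room labels; the actual dataset conditions are not listed here.
rooms = ["dry", "small_room", "hall", "church"]
splits = list(leave_one_room_out(rooms))

for train_rooms, test_room in splits:
    print(f"train on {train_rooms}, test on unseen room '{test_room}'")
```

Each split holds out exactly one acoustic condition, so any drop in accuracy on the held-out room reflects how well a model generalizes to acoustics it never saw during training.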
Citation
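K. Fang, H. Zhang, Z. Wang, and I. Fujinaga, “High-Resolution Sustain Pedal Depth Estimation from Piano Audio across Room Acoustics,” in Proceedings of the 26th International Society for Music Information Retrieval Conference (ISMIR 2025), 2025, doi: 10.5281/ZENODO.17811439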
Source
Proceedings of the International Society for Music Information Retrieval Conference
Conference
26th International Society for Music Information Retrieval Conference (ISMIR 2025)
Publisher
International Society for Music Information Retrieval
