Loading...
RelUNet: Relative Channel Fusion U-Net for Multichannel Speech Enhancement
Aldarmaki, Ibrahim ; Solorio, Thamar ; Raj, Bhiksha ; Aldarmaki, Hanan
Aldarmaki, Ibrahim
Solorio, Thamar
Raj, Bhiksha
Aldarmaki, Hanan
Supervisor
Department
Natural Language Processing
Embargo End Date
Type
Conference proceeding
Date
License
http://creativecommons.org/licenses/by/4.0/
Language
Collections
Research Projects
Organizational Units
Journal Issue
Abstract
Neural multi-channel speech enhancement models, specifically U-Net-based ones, show promising performance and generalization potential. These models encode input channels independently, and integrate them during later stages, or rely on extensive feature extraction methods. We propose a novel modification by incorporating relative information from the outset, where each channel is processed in conjunction with a reference channel through stacking. This exploits comparative differences to adaptively fuse information between channels, enhancing the representation of spatial features and improving the overall performance. Theoretical analysis shows that differential encoding leads to more compact manifolds, contributing to better generalization by capturing invariant features. The experiments conducted on the CHiME-3 dataset demonstrate improvements in speech enhancement metrics across various architectures. We show that our method enhances model capabilities in spatial feature processing, demonstrating strong performance in TDOA estimation on simulated datasets.
Citation
I. Aldarmaki, T. Solorio, B. Raj, H. Aldarmaki, "RelUNet: Relative Channel Fusion U-Net for Multichannel Speech Enhancement," 2026, pp. 21031-21035.
Source
ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Conference
ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Keywords
46 Information and Computing Sciences, 4603 Computer Vision and Multimedia Computation, 4611 Machine Learning
Subjects
Source
ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Publisher
IEEE
