Research Article
M. Kolbæk, Z.-H. Tan, and J. Jensen, “Speech intelligibility potential of general and specialized deep neural network based speech enhancement systems,” IEEE/ACM Trans. Audio Speech Lang. Process. 25, 153-167 (2017).
10.1109/TASLP.2016.2628641Z. Huang, S. Watanabe, S.-W. Yang, P. García, and S. Cohen, “Investigating self-supervised learning for speech enhancement and separation,” Proc. ICASSP, 6837-6841 (2022).
10.1109/ICASSP43922.2022.9746303O. Tal, M. Mandel, F. Kreuk, and Y. Adi, “A systematic comparison of phonetic aware techniques for speech enhancement,” Proc. Interspeech, 1193-1197 (2022).
10.21437/Interspeech.2022-695R. Shankar, K. Tan, B. Xu, and A. Kumar, “A closer look at wav2vec2 embeddings for on-device single-channel speech enhancement,” Proc. ICASSP, 751-755 (2024).
10.1109/ICASSP48485.2024.10447539S. Hwang, S. W. Park, and Y. Park, “Causal speech enhancement based on a two-branch nested U-Net architecture using self-supervised speech embeddings,” Proc. ICASSP, 11466-11470 (2025).
10.1109/ICASSP49660.2025.10888248S.-W. Yang, H.-J. Chang, Z. Huang, A. T. Liu, C.-I. Lai, H. Wu, J. Shi, X. Chang, H.-S. Tsai, W.-C. Huang, T.-H. Feng, P.-H. Chi, Y. Y. Lin, Y.-S. Chuang, T.-H. Huang, W.-C. Tseng, K. Lakhotia, S.-W. Li, S. Watanabe, and H.-Y. Lee, “A large-scale evaluation of speech foundation models,” IEEE/ACM Trans. Audio Speech Lang. Process. 32, 2884-2899 (2024).
10.1109/TASLP.2024.3389631A. Défossez, G. Synnaeve, and Y. Adi, “Real time speech enhancement in the waveform domain,” Proc. Interspeech, 3291-3295 (2020).
10.21437/Interspeech.2020-2409- Publisher :The Acoustical Society of Korea
- Publisher(Ko) :한국음향학회
- Journal Title :The Journal of the Acoustical Society of Korea
- Journal Title(Ko) :한국음향학회지
- Volume : 45
- No :1
- Pages :55-61
- Received Date : 2025-12-17
- Accepted Date : 2026-01-16
- DOI :https://doi.org/10.7776/ASK.2026.45.1.055



The Journal of the Acoustical Society of Korea









